CN116312786B - Single cell expression pattern difference evaluation method based on multi-group comparison - Google Patents
Single cell expression pattern difference evaluation method based on multi-group comparison Download PDFInfo
- Publication number
- CN116312786B CN116312786B CN202310083585.7A CN202310083585A CN116312786B CN 116312786 B CN116312786 B CN 116312786B CN 202310083585 A CN202310083585 A CN 202310083585A CN 116312786 B CN116312786 B CN 116312786B
- Authority
- CN
- China
- Prior art keywords
- cell
- expression
- cell type
- expression profile
- single cell
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000014509 gene expression Effects 0.000 title claims abstract description 103
- 238000011156 evaluation Methods 0.000 title claims abstract description 13
- 238000000034 method Methods 0.000 claims abstract description 19
- 238000012216 screening Methods 0.000 claims abstract description 7
- 108090000623 proteins and genes Proteins 0.000 claims description 22
- 238000004590 computer program Methods 0.000 claims description 9
- 239000003550 marker Substances 0.000 claims description 8
- 238000004458 analytical method Methods 0.000 abstract description 8
- 238000011160 research Methods 0.000 abstract description 6
- 238000001228 spectrum Methods 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 121
- 238000009826 distribution Methods 0.000 description 5
- 210000003483 chromatin Anatomy 0.000 description 4
- 108010077544 Chromatin Proteins 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 238000000585 Mann–Whitney U test Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000011835 investigation Methods 0.000 description 2
- 230000011987 methylation Effects 0.000 description 2
- 238000007069 methylation reaction Methods 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 101100202389 Haloferax volcanii (strain ATCC 29605 / DSM 3757 / JCM 8879 / NBRC 14742 / NCIMB 2012 / VKM B-1768 / DS2) samp1 gene Proteins 0.000 description 1
- 101000804764 Homo sapiens Lymphotactin Proteins 0.000 description 1
- 102100035304 Lymphotactin Human genes 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 230000010632 Transcription Factor Activity Effects 0.000 description 1
- 102000008579 Transposases Human genes 0.000 description 1
- 108010020764 Transposases Proteins 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000010267 cellular communication Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000010201 enrichment analysis Methods 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 238000010921 in-depth analysis Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000009827 uniform distribution Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
- G16B25/10—Gene or protein expression profiling; Expression-ratio estimation or normalisation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/10—Pre-processing; Data cleansing
- G06F18/15—Statistical pre-processing, e.g. techniques for normalisation or restoring missing data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
- G06F18/2135—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on approximation criteria, e.g. principal component analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/243—Classification techniques relating to the number of classes
- G06F18/2431—Multiple classes
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Abstract
The invention discloses a single cell expression pattern difference evaluation method based on multi-group comparison, and relates to the technical field of biological information analysis. The method comprises the following steps: s101, combining single cell transcriptome expression profiles of a plurality of groups; s102, grouping all cells in the combined single cell transcriptome expression profile; s103, identifying cell types of each cell population; s104, screening out cell types which exist in two or more groups simultaneously, and extracting coexisting expression profiles of the cell types; s105, calculating a difference enrichment score of the corresponding cell type according to the coexisting expression profile; and S106, ranking the cell types from large to small according to the difference enrichment scores. The method can evaluate the exact index of the difference size and the difference enrichment degree of the single cell expression spectrum under the comparison of multiple groups, and provides a basis for scientifically selecting the subsequent research direction.
Description
Technical Field
The invention relates to the technical field of biological information analysis, in particular to a single cell expression pattern difference evaluation method based on multi-group comparison.
Background
Single cell level sequencing techniques such as single cell transcriptome, single cell ATAC histology (high throughput analysis of single cell chromatin transposase accessibility), single cell epigenetic histology and the like can obtain RNA and chromatin information of thousands or tens of thousands of genes in a single cell, and comprehensively display the gene expression difference between each cell. The high-flux single-cell sequencing platform (such as a related platform of 10X Genomics) can realize high-flux cell sorting and capturing by utilizing technologies such as micro-flow control, oil drop wrapping, barcode labeling and the like, can separate and label hundreds or even tens of thousands of cells at one time, can obtain information such as transcriptome or chromatin, site methylation and the like of each cell after processing such as amplification, sequencing and the like, and has the advantages of high cell flux, low library construction cost, high efficiency and the like. The technology can be used for analyzing the expression, chromatin or methylation characteristics of different cell types by combining with the characteristic of a marker signal (marker gene) or a cell type marking algorithm (such as SingleR) of different cell types, and further can be used for researching the aspects of biological development, disease development, immune change and the like.
Analysis of single cell histology data typically involves the following steps: filtering low quality cells, identifying cell types, initially analyzing the overall characteristics or expression patterns of each cell type, selecting cell types for in-depth analysis, and performing personalized analysis (e.g., pathway enrichment analysis, predicting differentiation trajectories, transcription factor activity, cellular communication, etc.) for the target cell type. The last step of personalized analysis, theoretically, can be performed using a virtually unlimited number of existing software, and therefore typically takes the most manpower, effort and time in this step. The former step of selecting a cell type is the basis of personalized analysis, namely, selecting a key and proper cell type, then the subsequent personalized analysis can mine important and meaningful information, and if the cell type is selected improperly, a great deal of time and effort can be spent, but only valuable or low-value information is mined. Particularly in single cell data where there are multiple sets of comparisons, intensive investigation of each cell type increases its cost of investigation by a factor of two compared to either no comparison or two sets of comparisons.
Currently, researchers select directions for subsequent studies on single cell data with multiple treatment groups (three or more) at the same time, usually based on existing biological knowledge, rather than on the characteristics and differences of the data itself. This may lead to many potentially valuable research directions being missed. Whether there are differences between groups of different cell types, and the magnitude of the differences, the significance of the differences, are predictive of the extent to which the treatment or experimental conditions affect the expression patterns of the different cells. However, due to the huge data volume and cell volume of single-cell histology and complex data conditions, the prior art is difficult to directly and accurately evaluate the difference of cell expression patterns.
Disclosure of Invention
In order to solve at least one technical problem in the background art, the invention aims to provide a single cell expression pattern difference evaluation method based on multiple groups of comparison, which can evaluate the exact indexes of the difference size and the difference enrichment degree of single cell expression patterns under multiple groups of comparison and provide basis for scientifically selecting the subsequent research direction.
In order to achieve the above purpose, the present invention provides the following technical solutions:
a single cell expression pattern difference evaluation method based on multi-group comparison comprises the following steps:
s101, combining single cell transcriptome expression profiles of a plurality of groups;
s102, grouping all cells in the combined single cell transcriptome expression profile;
s103, identifying cell types of each cell population;
s104, screening out cell types which exist in two or more groups simultaneously, and extracting coexisting expression profiles of the cell types;
s105, calculating a difference enrichment score of the corresponding cell type according to the coexisting expression profile;
and S106, ranking the cell types from large to small according to the difference enrichment scores.
Further, the method for calculating the differential enrichment score is as follows:
s1051, obtaining the total characteristic expression profile of the cell type based on the coexisting expression profile;
s1052, dividing the total characteristic expression profile into a plurality of sub-characteristic expression profiles according to groups;
s1053, calculating the distance between each sub-feature expression profile and the corresponding total feature expression profile;
s1054, repeating S1051 to S1053 on the rest of the coexisting expression profiles to obtain corresponding distances;
s1055, ordering all distances from big to small, and giving ranking;
s1056, obtaining a differential enrichment score of the cell type based on the ranking:
wherein,representing differential enrichment score,/->Representing ranking, ->Representing the cognate cell type->Is a ranking set corresponding to the distance of (a) subscript +.>Representing cell type, subscript->Representative group; />Representing a total number of ranks; />Representing the same cell typeThe number of groups; />To enrich the weight, ++>。
Further, the method comprises the steps of,。
further, the distance is a euclidean distance or a manhattan distance.
Further, the method for solving the total characteristic expression profile is as follows: and taking an average value of the gene expression of each gene as a characteristic value of the gene aiming at the cell group corresponding to the screened cell type, wherein the characteristic values of a plurality of genes form the total characteristic expression profile.
Further, in S106, reliability screening is performed on the cell type:
s1061, comparing the ranking of all distances in the cell type with the ranking of all distances of other cell types, and calculating a p value by using Wilcoxon rank sum test;
s1062, obtaining corresponding p values for other cell types;
s1063, deleting cell types with p values higher than a preset threshold.
Further, in S102, the cells are normalized and/or PCA reduced in dimension prior to being clustered.
Further, in S103, identifying the cell type by using a marker gene having cell type specificity, and identifying the cell population with high expression of the marker gene as the corresponding cell type; alternatively, identification of each cell type was performed using SingleR software.
A computer storage medium having stored thereon a computer program, characterized in that the program when executed by a processor implements a single cell expression pattern difference assessment method based on a plurality of sets of comparisons as described above.
A terminal device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements a single cell expression pattern difference evaluation method based on a plurality of sets of comparisons as described above when executing the computer program.
Compared with the prior art, the invention has the beneficial effects that:
according to the invention, the distances between the characteristic expression profile of each group and the expression profile of the cell type are calculated under each cell type, all the distances are ranked, the difference enrichment score of each cell type is calculated based on the distance ranking, the difference enrichment score differences of each group in each cell type can be compared, and a corresponding expression mode difference amplitude evaluation result is obtained, so that a basis is provided for scientifically selecting the subsequent research direction.
Drawings
FIG. 1 is an overall flow chart of an embodiment of the present invention.
FIG. 2 is a flow chart of differential enrichment scoring according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely, and it is apparent that the described embodiments are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Embodiment one:
referring to fig. 1, the embodiment provides a single cell expression pattern difference evaluation method based on multiple sets of comparison, which includes the following steps:
s101, combining single cell transcriptome expression profiles of a plurality of groups; as shown in table 1 below:
TABLE 1 pooled expression profiles
In table 1 above, groups 1 to 3 represent groups; samp1 to Samp6 represent sample names; c1 to C22001 represent cells; gene1 to Gene3 represent genes; the numbers of the body part represent the gene expression level.
S102, grouping all cells in the combined single cell transcriptome expression profile by using the SEurat software; and meanwhile, normalization and dimension reduction can be performed, wherein the normalization method selects 'lognormal', and the screening method of the hypervariable genes uses 'vst'.
S103, identifying the cell types of the cell populations. There are two general methods of identification, one is to identify using marker genes with cell type specificity (such as CD4 and CD8 genes of human T cells), and identify cell populations highly expressing marker genes as corresponding cell types; the other is the use of cell identification function software (e.g., singleR, scCATCH) for each cell population. The results of the assay are shown in table 2 below:
TABLE 2 cell identification results
In Table 2 above, types 1 through 4 are identified cell types.
S104, counting the number of groups existing in each cell type, screening out the cell types existing in two or more groups simultaneously, and extracting the coexisting expression profile of the cell types.
If there are three groups with cell Type1, the number of groups with cell Type1 is 3.
Removing cell types with the number of the existing groups being less than 2, and respectively marking the names of the remaining n cell types asTo the point ofIts corresponding group number is marked +.>To->And extracting the corresponding expression profile +.>. Table 3 below shows the coexisting expression profile corresponding to cell Type 1->:
TABLE 3 expression profile corresponding to cell Type1
In Table 3 above, T1 represents the primitive cell Type1, which is present in three groups simultaneously, and therefore=3。
S105, calculating a differential enrichment score of the corresponding cell type according to the coexisting expression profile so as to measure the differential enrichment degree of the cell type.
Specifically, as shown in fig. 2, the method for calculating the differential enrichment score is as follows:
s1051, based on coexistence expression profileObtaining the total characteristic expression profile of the cell type>(0 in subscript represents the characteristic expression profile of the cell type as a whole).
The solving method of the total characteristic expression profile is as follows: for the cell group corresponding to the screened cell type, taking average value of gene expression of each gene, and taking the average value as the characteristic value of the gene, wherein the characteristic values of a plurality of genes form the total characteristic expression profileThe cell Type1 (i.e. corresponding coexisting expression profile +.>) Is->:
TABLE 4 Total characteristic expression profile of cell Type1
In Table 4, the coexisting expression profile was calculated using a value of 17.6 as an exampleThe average value of the Gene expression level of the medium Gene Gene 1.
In other embodiments, the method of calculating the characteristic expression profile may be adjusted according to the data distribution, which may be obtained by calculating the average value of the gene expression level of each gene when the distribution of the gene expression level is a normal distribution or a uniform distribution, or by calculating the average value of the square (or square root or logarithm) of the gene expression level if the distribution is other, which may eliminate the bias on the distribution.
S1052, dividing the total characteristic expression profile into a plurality of sub-characteristic expression profiles according to groups. In particular, there areThe individual groups comprise cell type T1, calculated group1 to +.>Sub-characteristic expression profile of group T1 cells and is marked +.>To->(To->The first 1 of the subscripts represents the cell type T1, the second 1 and +.>Representative group). The following table 5 shows several sub-characteristic expression profiles into which the total characteristic expression profile of the cell type T1 is partitioned:
TABLE 5 several subfraction profiles into which the total profile of cell type T1 is partitioned
S1053, calculating the distance between each sub-feature expression profile and the corresponding total feature expression profile;
calculating the sub-characteristic expression profilesTo->) Respective and cell type characteristic expression profile->Is a distance of (3). As shown in Table 5 above, the expression profile of cell type T1 was first +.>Split into->The expression profile is calculated using the same calculation method as the total characteristic expression profile in step S1051 (e.g., calculating the average value of the gene expression amount of each gene, or the median of the gene expression amounts, or by calculating the square or square root or logarithm of the gene expression amount)>Sub-characteristic expression profile of the individual expression profile +.>To->. Respectively calculating sub-characteristic expression profile +.>To->And total characteristic expression profile->Is of the European distance of (2)Separation ofTo->(Manhattan distance may also be used instead), will +.>To->Merging into a set->。
S1054 for the rest cell typesTo->) Repeating S1051-S1053 to obtain corresponding distances to obtain the set +.>To->. Will collect->To->Combining to form a set B of all Euclidean distances, comprising all distances in each cell type, i.e.>To->、/>To->、…、/>To->。
S1055, ranking all distances according to the order from big to small. Specifically, all distances in set B are ordered from big to small, and corresponding ranks are assignedWill->To->The corresponding rank is defined as the set +.>To the point of。
S1056, obtaining a differential enrichment score of the cell type based on the ranking:
wherein,representing differential enrichment score,/->Representing ranking, ->Representing the cognate cell type->Is a ranking set corresponding to the distance of (a) subscript +.>Representing cell type, subscript->Representative group; />Representing a total number of ranks; />Representing the same cell typeThe number of groups; />To enrich the weight, ++>Can be adjusted according to the situation, and is usually set to 0.75.
S106, enriching the cell types according to the differenceRanking from large to small, as a result of evaluation of differential enrichment degree of expression patterns of each cell type under multiple groups of comparison and ranking of potential research value, differential enrichment score +.>The larger represents the higher the corresponding potential research value.
Embodiment two:
based on the first embodiment, in the second embodiment, in S106, the reliability screening is further performed on the cell type, which specifically includes the following steps:
s1061, comparing the ranking of all distances in the cell type with the ranking of all distances of other cell types, and calculating a p value by using Wilcoxon rank sum test;
s1062, obtaining corresponding p values for other cell types;
s1063, delete cell types with p-value higher than the preset threshold G, which is typically set to 0.05.
Embodiment III:
the present embodiment provides a computer storage medium having a computer program stored thereon, wherein the program when executed by a processor implements the single cell expression pattern difference evaluation method based on the multi-group comparison as described in the first or second embodiment.
Embodiment four:
the embodiment provides a terminal device, including a memory, a processor, and a computer program stored in the memory and capable of running on the processor, wherein the processor implements the single cell expression pattern difference evaluation method based on the multiple group comparison as described in the first embodiment or the second embodiment when executing the computer program.
It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.
Claims (8)
1. The single cell expression pattern difference evaluation method based on the multi-group comparison is characterized by comprising the following steps of:
s101, combining single cell transcriptome expression profiles of a plurality of groups;
s102, grouping all cells in the combined single cell transcriptome expression profile;
s103, identifying cell types of each cell population;
s104, screening out cell types which exist in two or more groups simultaneously, and extracting coexisting expression profiles of the cell types;
s105, calculating a difference enrichment score of the corresponding cell type according to the coexisting expression profile;
s106, ranking the cell types according to the difference enrichment scores from large to small;
the method for calculating the difference enrichment score comprises the following steps:
s1051, obtaining the total characteristic expression profile of the cell type based on the coexisting expression profile;
s1052, dividing the total characteristic expression profile into a plurality of sub-characteristic expression profiles according to groups;
s1053, calculating the distance between each sub-feature expression profile and the corresponding total feature expression profile;
s1054, repeating S1051 to S1053 on the rest of the coexisting expression profiles to obtain corresponding distances;
s1055, ordering all distances from big to small, and giving ranking;
s1056, obtaining a differential enrichment score of the cell type based on the ranking:
wherein,representing differential enrichment score,/->Representing ranking, ->Representing the cognate cell type->Is a ranking set corresponding to the distance of (a) subscript +.>Representing cell type, subscript->Representative group; />Representing a total number of ranks; />Representing the same cell type->The number of groups; />To enrich the weight, ++>。
2. The method for evaluating the differences of single-cell expression patterns based on the multiple sets of comparison according to claim 1, wherein,。
3. the method of claim 1, wherein the distance is a euclidean distance or a manhattan distance.
4. The method for evaluating the differences of single cell expression patterns based on multiple sets of comparison according to claim 1, wherein the method for solving the total characteristic expression profile is as follows: and taking an average value of the gene expression of each gene as a characteristic value of the gene aiming at the cell group corresponding to the screened cell type, wherein the characteristic values of a plurality of genes form the total characteristic expression profile.
5. The method according to claim 1, wherein in S102, the cells are normalized and/or PCA reduced in dimension before being grouped.
6. The method according to claim 1, wherein in S103, the cell type is identified by using a marker gene having cell type specificity, and the cell population with high expression of the marker gene is identified as the corresponding cell type; alternatively, identification of each cell type was performed using SingleR software.
7. A computer storage medium having stored thereon a computer program, which when executed by a processor implements the single cell expression pattern difference assessment method based on a multi-group comparison as claimed in any one of claims 1 to 6.
8. A terminal device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the single cell expression pattern difference assessment method based on a plurality of sets of comparisons according to any one of claims 1 to 6 when executing the computer program.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310083585.7A CN116312786B (en) | 2023-02-08 | 2023-02-08 | Single cell expression pattern difference evaluation method based on multi-group comparison |
CN202311487781.7A CN117457072A (en) | 2023-02-08 | 2023-02-08 | Single cell expression pattern difference reliability screening method based on multi-group comparison |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310083585.7A CN116312786B (en) | 2023-02-08 | 2023-02-08 | Single cell expression pattern difference evaluation method based on multi-group comparison |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311487781.7A Division CN117457072A (en) | 2023-02-08 | 2023-02-08 | Single cell expression pattern difference reliability screening method based on multi-group comparison |
Publications (2)
Publication Number | Publication Date |
---|---|
CN116312786A CN116312786A (en) | 2023-06-23 |
CN116312786B true CN116312786B (en) | 2023-11-28 |
Family
ID=86793200
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311487781.7A Pending CN117457072A (en) | 2023-02-08 | 2023-02-08 | Single cell expression pattern difference reliability screening method based on multi-group comparison |
CN202310083585.7A Active CN116312786B (en) | 2023-02-08 | 2023-02-08 | Single cell expression pattern difference evaluation method based on multi-group comparison |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311487781.7A Pending CN117457072A (en) | 2023-02-08 | 2023-02-08 | Single cell expression pattern difference reliability screening method based on multi-group comparison |
Country Status (1)
Country | Link |
---|---|
CN (2) | CN117457072A (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107873054A (en) * | 2014-09-09 | 2018-04-03 | 博德研究所 | The method and apparatus based on droplet for compound single-cell nucleic acid analysis |
CN112820353A (en) * | 2021-01-22 | 2021-05-18 | 中山大学 | Method and system for analyzing cell fate conversion key transcription factor |
CN114364814A (en) * | 2020-03-19 | 2022-04-15 | 新格元(南京)生物科技有限公司 | Single cell RNA sequencing |
CN114708910A (en) * | 2022-02-24 | 2022-07-05 | 上海市第一人民医院 | Method for calculating cell subset enrichment fraction in cell sequencing by using single cell sequencing data |
CN115440303A (en) * | 2022-11-03 | 2022-12-06 | 杭州联川生物技术股份有限公司 | Filtering method, medium and device for low-quality cells of single-cell transcriptome |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20180057859A1 (en) * | 2016-05-06 | 2018-03-01 | Craig E. Nelson | Method for identifying rare cell types by single cell assisted deconvolution of population gene expression data |
-
2023
- 2023-02-08 CN CN202311487781.7A patent/CN117457072A/en active Pending
- 2023-02-08 CN CN202310083585.7A patent/CN116312786B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107873054A (en) * | 2014-09-09 | 2018-04-03 | 博德研究所 | The method and apparatus based on droplet for compound single-cell nucleic acid analysis |
CN114364814A (en) * | 2020-03-19 | 2022-04-15 | 新格元(南京)生物科技有限公司 | Single cell RNA sequencing |
CN112820353A (en) * | 2021-01-22 | 2021-05-18 | 中山大学 | Method and system for analyzing cell fate conversion key transcription factor |
CN114708910A (en) * | 2022-02-24 | 2022-07-05 | 上海市第一人民医院 | Method for calculating cell subset enrichment fraction in cell sequencing by using single cell sequencing data |
CN115440303A (en) * | 2022-11-03 | 2022-12-06 | 杭州联川生物技术股份有限公司 | Filtering method, medium and device for low-quality cells of single-cell transcriptome |
Also Published As
Publication number | Publication date |
---|---|
CN117457072A (en) | 2024-01-26 |
CN116312786A (en) | 2023-06-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10347365B2 (en) | Systems and methods for visualizing a pattern in a dataset | |
KR102349921B1 (en) | taxonomy profiling method for microorganism in sample | |
US11954614B2 (en) | Systems and methods for visualizing a pattern in a dataset | |
Torang et al. | An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets | |
CN111081317B (en) | Gene spectrum-based breast cancer lymph node metastasis prediction method and prediction system | |
CN107391963A (en) | Eucaryon based on calculating cloud platform is without ginseng transcript profile interaction analysis system and method | |
US20210381056A1 (en) | Systems and methods for joint interactive visualization of gene expression and dna chromatin accessibility | |
CN112133368A (en) | Automated analysis method of metagenome sequencing data based on third-generation sequencing technology | |
CN111312334B (en) | Receptor-ligand system analysis method for influencing intercellular communication | |
Marsh et al. | Bioinformatic analysis of bacteria and host cell dual RNA-sequencing experiments | |
CN112599199A (en) | Analysis method suitable for 10x single cell transcriptome sequencing data | |
CN107463801A (en) | A kind of Drop seq data quality controls and analysis method | |
Chen et al. | Tree2GD: a phylogenomic method to detect large-scale gene duplication events | |
CN116312786B (en) | Single cell expression pattern difference evaluation method based on multi-group comparison | |
Trapnell et al. | Monocle: Cell counting, differential expression, and trajectory analysis for single-cell RNA-Seq experiments | |
CN116525004B (en) | Single cell expression pattern difference evaluation method, medium and device based on two groups of comparison | |
CN110970093B (en) | Method and device for screening primer design template and application | |
CN115948521A (en) | Method for detecting aneuploid missing chromosome information | |
CN113362895A (en) | Comprehensive analysis method for predicting anti-cancer drug response related gene | |
US6994965B2 (en) | Method for displaying results of hybridization experiment | |
Khan et al. | MSuPDA: A memory efficient algorithm for sequence alignment | |
Meyer et al. | ReadZS detects developmentally regulated RNA processing programs in single cell RNA-seq and defines subpopulations independent of gene expression | |
Jimenez | Practical metagenomics: microbiome tutorial with QIIME 2 | |
CN113793641B (en) | Method for rapidly judging sample gender from FASTQ file | |
CN115527610B (en) | Cluster analysis method for single-cell histology data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |