CN116312786B - Single cell expression pattern difference evaluation method based on multi-group comparison - Google Patents

Single cell expression pattern difference evaluation method based on multi-group comparison Download PDF

Info

Publication number
CN116312786B
CN116312786B CN202310083585.7A CN202310083585A CN116312786B CN 116312786 B CN116312786 B CN 116312786B CN 202310083585 A CN202310083585 A CN 202310083585A CN 116312786 B CN116312786 B CN 116312786B
Authority
CN
China
Prior art keywords
cell
expression
cell type
expression profile
single cell
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310083585.7A
Other languages
Chinese (zh)
Other versions
CN116312786A (en
Inventor
陈哲名
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Lianchuan Biotechnology Co ltd
Original Assignee
Hangzhou Lianchuan Biotechnology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Lianchuan Biotechnology Co ltd filed Critical Hangzhou Lianchuan Biotechnology Co ltd
Priority to CN202310083585.7A priority Critical patent/CN116312786B/en
Priority to CN202311487781.7A priority patent/CN117457072A/en
Publication of CN116312786A publication Critical patent/CN116312786A/en
Application granted granted Critical
Publication of CN116312786B publication Critical patent/CN116312786B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G16B25/10Gene or protein expression profiling; Expression-ratio estimation or normalisation
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/10Pre-processing; Data cleansing
    • G06F18/15Statistical pre-processing, e.g. techniques for normalisation or restoring missing data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • G06F18/2135Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on approximation criteria, e.g. principal component analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/243Classification techniques relating to the number of classes
    • G06F18/2431Multiple classes
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Abstract

The invention discloses a single cell expression pattern difference evaluation method based on multi-group comparison, and relates to the technical field of biological information analysis. The method comprises the following steps: s101, combining single cell transcriptome expression profiles of a plurality of groups; s102, grouping all cells in the combined single cell transcriptome expression profile; s103, identifying cell types of each cell population; s104, screening out cell types which exist in two or more groups simultaneously, and extracting coexisting expression profiles of the cell types; s105, calculating a difference enrichment score of the corresponding cell type according to the coexisting expression profile; and S106, ranking the cell types from large to small according to the difference enrichment scores. The method can evaluate the exact index of the difference size and the difference enrichment degree of the single cell expression spectrum under the comparison of multiple groups, and provides a basis for scientifically selecting the subsequent research direction.

Description

Single cell expression pattern difference evaluation method based on multi-group comparison
Technical Field
The invention relates to the technical field of biological information analysis, in particular to a single cell expression pattern difference evaluation method based on multi-group comparison.
Background
Single cell level sequencing techniques such as single cell transcriptome, single cell ATAC histology (high throughput analysis of single cell chromatin transposase accessibility), single cell epigenetic histology and the like can obtain RNA and chromatin information of thousands or tens of thousands of genes in a single cell, and comprehensively display the gene expression difference between each cell. The high-flux single-cell sequencing platform (such as a related platform of 10X Genomics) can realize high-flux cell sorting and capturing by utilizing technologies such as micro-flow control, oil drop wrapping, barcode labeling and the like, can separate and label hundreds or even tens of thousands of cells at one time, can obtain information such as transcriptome or chromatin, site methylation and the like of each cell after processing such as amplification, sequencing and the like, and has the advantages of high cell flux, low library construction cost, high efficiency and the like. The technology can be used for analyzing the expression, chromatin or methylation characteristics of different cell types by combining with the characteristic of a marker signal (marker gene) or a cell type marking algorithm (such as SingleR) of different cell types, and further can be used for researching the aspects of biological development, disease development, immune change and the like.
Analysis of single cell histology data typically involves the following steps: filtering low quality cells, identifying cell types, initially analyzing the overall characteristics or expression patterns of each cell type, selecting cell types for in-depth analysis, and performing personalized analysis (e.g., pathway enrichment analysis, predicting differentiation trajectories, transcription factor activity, cellular communication, etc.) for the target cell type. The last step of personalized analysis, theoretically, can be performed using a virtually unlimited number of existing software, and therefore typically takes the most manpower, effort and time in this step. The former step of selecting a cell type is the basis of personalized analysis, namely, selecting a key and proper cell type, then the subsequent personalized analysis can mine important and meaningful information, and if the cell type is selected improperly, a great deal of time and effort can be spent, but only valuable or low-value information is mined. Particularly in single cell data where there are multiple sets of comparisons, intensive investigation of each cell type increases its cost of investigation by a factor of two compared to either no comparison or two sets of comparisons.
Currently, researchers select directions for subsequent studies on single cell data with multiple treatment groups (three or more) at the same time, usually based on existing biological knowledge, rather than on the characteristics and differences of the data itself. This may lead to many potentially valuable research directions being missed. Whether there are differences between groups of different cell types, and the magnitude of the differences, the significance of the differences, are predictive of the extent to which the treatment or experimental conditions affect the expression patterns of the different cells. However, due to the huge data volume and cell volume of single-cell histology and complex data conditions, the prior art is difficult to directly and accurately evaluate the difference of cell expression patterns.
Disclosure of Invention
In order to solve at least one technical problem in the background art, the invention aims to provide a single cell expression pattern difference evaluation method based on multiple groups of comparison, which can evaluate the exact indexes of the difference size and the difference enrichment degree of single cell expression patterns under multiple groups of comparison and provide basis for scientifically selecting the subsequent research direction.
In order to achieve the above purpose, the present invention provides the following technical solutions:
a single cell expression pattern difference evaluation method based on multi-group comparison comprises the following steps:
s101, combining single cell transcriptome expression profiles of a plurality of groups;
s102, grouping all cells in the combined single cell transcriptome expression profile;
s103, identifying cell types of each cell population;
s104, screening out cell types which exist in two or more groups simultaneously, and extracting coexisting expression profiles of the cell types;
s105, calculating a difference enrichment score of the corresponding cell type according to the coexisting expression profile;
and S106, ranking the cell types from large to small according to the difference enrichment scores.
Further, the method for calculating the differential enrichment score is as follows:
s1051, obtaining the total characteristic expression profile of the cell type based on the coexisting expression profile;
s1052, dividing the total characteristic expression profile into a plurality of sub-characteristic expression profiles according to groups;
s1053, calculating the distance between each sub-feature expression profile and the corresponding total feature expression profile;
s1054, repeating S1051 to S1053 on the rest of the coexisting expression profiles to obtain corresponding distances;
s1055, ordering all distances from big to small, and giving ranking;
s1056, obtaining a differential enrichment score of the cell type based on the ranking:
wherein,representing differential enrichment score,/->Representing ranking, ->Representing the cognate cell type->Is a ranking set corresponding to the distance of (a) subscript +.>Representing cell type, subscript->Representative group; />Representing a total number of ranks; />Representing the same cell typeThe number of groups; />To enrich the weight, ++>
Further, the method comprises the steps of,
further, the distance is a euclidean distance or a manhattan distance.
Further, the method for solving the total characteristic expression profile is as follows: and taking an average value of the gene expression of each gene as a characteristic value of the gene aiming at the cell group corresponding to the screened cell type, wherein the characteristic values of a plurality of genes form the total characteristic expression profile.
Further, in S106, reliability screening is performed on the cell type:
s1061, comparing the ranking of all distances in the cell type with the ranking of all distances of other cell types, and calculating a p value by using Wilcoxon rank sum test;
s1062, obtaining corresponding p values for other cell types;
s1063, deleting cell types with p values higher than a preset threshold.
Further, in S102, the cells are normalized and/or PCA reduced in dimension prior to being clustered.
Further, in S103, identifying the cell type by using a marker gene having cell type specificity, and identifying the cell population with high expression of the marker gene as the corresponding cell type; alternatively, identification of each cell type was performed using SingleR software.
A computer storage medium having stored thereon a computer program, characterized in that the program when executed by a processor implements a single cell expression pattern difference assessment method based on a plurality of sets of comparisons as described above.
A terminal device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements a single cell expression pattern difference evaluation method based on a plurality of sets of comparisons as described above when executing the computer program.
Compared with the prior art, the invention has the beneficial effects that:
according to the invention, the distances between the characteristic expression profile of each group and the expression profile of the cell type are calculated under each cell type, all the distances are ranked, the difference enrichment score of each cell type is calculated based on the distance ranking, the difference enrichment score differences of each group in each cell type can be compared, and a corresponding expression mode difference amplitude evaluation result is obtained, so that a basis is provided for scientifically selecting the subsequent research direction.
Drawings
FIG. 1 is an overall flow chart of an embodiment of the present invention.
FIG. 2 is a flow chart of differential enrichment scoring according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely, and it is apparent that the described embodiments are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Embodiment one:
referring to fig. 1, the embodiment provides a single cell expression pattern difference evaluation method based on multiple sets of comparison, which includes the following steps:
s101, combining single cell transcriptome expression profiles of a plurality of groups; as shown in table 1 below:
TABLE 1 pooled expression profiles
In table 1 above, groups 1 to 3 represent groups; samp1 to Samp6 represent sample names; c1 to C22001 represent cells; gene1 to Gene3 represent genes; the numbers of the body part represent the gene expression level.
S102, grouping all cells in the combined single cell transcriptome expression profile by using the SEurat software; and meanwhile, normalization and dimension reduction can be performed, wherein the normalization method selects 'lognormal', and the screening method of the hypervariable genes uses 'vst'.
S103, identifying the cell types of the cell populations. There are two general methods of identification, one is to identify using marker genes with cell type specificity (such as CD4 and CD8 genes of human T cells), and identify cell populations highly expressing marker genes as corresponding cell types; the other is the use of cell identification function software (e.g., singleR, scCATCH) for each cell population. The results of the assay are shown in table 2 below:
TABLE 2 cell identification results
In Table 2 above, types 1 through 4 are identified cell types.
S104, counting the number of groups existing in each cell type, screening out the cell types existing in two or more groups simultaneously, and extracting the coexisting expression profile of the cell types.
If there are three groups with cell Type1, the number of groups with cell Type1 is 3.
Removing cell types with the number of the existing groups being less than 2, and respectively marking the names of the remaining n cell types asTo the point ofIts corresponding group number is marked +.>To->And extracting the corresponding expression profile +.>. Table 3 below shows the coexisting expression profile corresponding to cell Type 1->
TABLE 3 expression profile corresponding to cell Type1
In Table 3 above, T1 represents the primitive cell Type1, which is present in three groups simultaneously, and therefore=3。
S105, calculating a differential enrichment score of the corresponding cell type according to the coexisting expression profile so as to measure the differential enrichment degree of the cell type.
Specifically, as shown in fig. 2, the method for calculating the differential enrichment score is as follows:
s1051, based on coexistence expression profileObtaining the total characteristic expression profile of the cell type>(0 in subscript represents the characteristic expression profile of the cell type as a whole).
The solving method of the total characteristic expression profile is as follows: for the cell group corresponding to the screened cell type, taking average value of gene expression of each gene, and taking the average value as the characteristic value of the gene, wherein the characteristic values of a plurality of genes form the total characteristic expression profileThe cell Type1 (i.e. corresponding coexisting expression profile +.>) Is->
TABLE 4 Total characteristic expression profile of cell Type1
In Table 4, the coexisting expression profile was calculated using a value of 17.6 as an exampleThe average value of the Gene expression level of the medium Gene Gene 1.
In other embodiments, the method of calculating the characteristic expression profile may be adjusted according to the data distribution, which may be obtained by calculating the average value of the gene expression level of each gene when the distribution of the gene expression level is a normal distribution or a uniform distribution, or by calculating the average value of the square (or square root or logarithm) of the gene expression level if the distribution is other, which may eliminate the bias on the distribution.
S1052, dividing the total characteristic expression profile into a plurality of sub-characteristic expression profiles according to groups. In particular, there areThe individual groups comprise cell type T1, calculated group1 to +.>Sub-characteristic expression profile of group T1 cells and is marked +.>To->To->The first 1 of the subscripts represents the cell type T1, the second 1 and +.>Representative group). The following table 5 shows several sub-characteristic expression profiles into which the total characteristic expression profile of the cell type T1 is partitioned:
TABLE 5 several subfraction profiles into which the total profile of cell type T1 is partitioned
S1053, calculating the distance between each sub-feature expression profile and the corresponding total feature expression profile;
calculating the sub-characteristic expression profilesTo->) Respective and cell type characteristic expression profile->Is a distance of (3). As shown in Table 5 above, the expression profile of cell type T1 was first +.>Split into->The expression profile is calculated using the same calculation method as the total characteristic expression profile in step S1051 (e.g., calculating the average value of the gene expression amount of each gene, or the median of the gene expression amounts, or by calculating the square or square root or logarithm of the gene expression amount)>Sub-characteristic expression profile of the individual expression profile +.>To->. Respectively calculating sub-characteristic expression profile +.>To->And total characteristic expression profile->Is of the European distance of (2)Separation ofTo->(Manhattan distance may also be used instead), will +.>To->Merging into a set->
S1054 for the rest cell typesTo->) Repeating S1051-S1053 to obtain corresponding distances to obtain the set +.>To->. Will collect->To->Combining to form a set B of all Euclidean distances, comprising all distances in each cell type, i.e.>To->、/>To->、…、/>To->
S1055, ranking all distances according to the order from big to small. Specifically, all distances in set B are ordered from big to small, and corresponding ranks are assignedWill->To->The corresponding rank is defined as the set +.>To the point of
S1056, obtaining a differential enrichment score of the cell type based on the ranking:
wherein,representing differential enrichment score,/->Representing ranking, ->Representing the cognate cell type->Is a ranking set corresponding to the distance of (a) subscript +.>Representing cell type, subscript->Representative group; />Representing a total number of ranks; />Representing the same cell typeThe number of groups; />To enrich the weight, ++>Can be adjusted according to the situation, and is usually set to 0.75.
S106, enriching the cell types according to the differenceRanking from large to small, as a result of evaluation of differential enrichment degree of expression patterns of each cell type under multiple groups of comparison and ranking of potential research value, differential enrichment score +.>The larger represents the higher the corresponding potential research value.
Embodiment two:
based on the first embodiment, in the second embodiment, in S106, the reliability screening is further performed on the cell type, which specifically includes the following steps:
s1061, comparing the ranking of all distances in the cell type with the ranking of all distances of other cell types, and calculating a p value by using Wilcoxon rank sum test;
s1062, obtaining corresponding p values for other cell types;
s1063, delete cell types with p-value higher than the preset threshold G, which is typically set to 0.05.
Embodiment III:
the present embodiment provides a computer storage medium having a computer program stored thereon, wherein the program when executed by a processor implements the single cell expression pattern difference evaluation method based on the multi-group comparison as described in the first or second embodiment.
Embodiment four:
the embodiment provides a terminal device, including a memory, a processor, and a computer program stored in the memory and capable of running on the processor, wherein the processor implements the single cell expression pattern difference evaluation method based on the multiple group comparison as described in the first embodiment or the second embodiment when executing the computer program.
It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.

Claims (8)

1. The single cell expression pattern difference evaluation method based on the multi-group comparison is characterized by comprising the following steps of:
s101, combining single cell transcriptome expression profiles of a plurality of groups;
s102, grouping all cells in the combined single cell transcriptome expression profile;
s103, identifying cell types of each cell population;
s104, screening out cell types which exist in two or more groups simultaneously, and extracting coexisting expression profiles of the cell types;
s105, calculating a difference enrichment score of the corresponding cell type according to the coexisting expression profile;
s106, ranking the cell types according to the difference enrichment scores from large to small;
the method for calculating the difference enrichment score comprises the following steps:
s1051, obtaining the total characteristic expression profile of the cell type based on the coexisting expression profile;
s1052, dividing the total characteristic expression profile into a plurality of sub-characteristic expression profiles according to groups;
s1053, calculating the distance between each sub-feature expression profile and the corresponding total feature expression profile;
s1054, repeating S1051 to S1053 on the rest of the coexisting expression profiles to obtain corresponding distances;
s1055, ordering all distances from big to small, and giving ranking;
s1056, obtaining a differential enrichment score of the cell type based on the ranking:
wherein,representing differential enrichment score,/->Representing ranking, ->Representing the cognate cell type->Is a ranking set corresponding to the distance of (a) subscript +.>Representing cell type, subscript->Representative group; />Representing a total number of ranks; />Representing the same cell type->The number of groups; />To enrich the weight, ++>
2. The method for evaluating the differences of single-cell expression patterns based on the multiple sets of comparison according to claim 1, wherein,
3. the method of claim 1, wherein the distance is a euclidean distance or a manhattan distance.
4. The method for evaluating the differences of single cell expression patterns based on multiple sets of comparison according to claim 1, wherein the method for solving the total characteristic expression profile is as follows: and taking an average value of the gene expression of each gene as a characteristic value of the gene aiming at the cell group corresponding to the screened cell type, wherein the characteristic values of a plurality of genes form the total characteristic expression profile.
5. The method according to claim 1, wherein in S102, the cells are normalized and/or PCA reduced in dimension before being grouped.
6. The method according to claim 1, wherein in S103, the cell type is identified by using a marker gene having cell type specificity, and the cell population with high expression of the marker gene is identified as the corresponding cell type; alternatively, identification of each cell type was performed using SingleR software.
7. A computer storage medium having stored thereon a computer program, which when executed by a processor implements the single cell expression pattern difference assessment method based on a multi-group comparison as claimed in any one of claims 1 to 6.
8. A terminal device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the single cell expression pattern difference assessment method based on a plurality of sets of comparisons according to any one of claims 1 to 6 when executing the computer program.
CN202310083585.7A 2023-02-08 2023-02-08 Single cell expression pattern difference evaluation method based on multi-group comparison Active CN116312786B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202310083585.7A CN116312786B (en) 2023-02-08 2023-02-08 Single cell expression pattern difference evaluation method based on multi-group comparison
CN202311487781.7A CN117457072A (en) 2023-02-08 2023-02-08 Single cell expression pattern difference reliability screening method based on multi-group comparison

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310083585.7A CN116312786B (en) 2023-02-08 2023-02-08 Single cell expression pattern difference evaluation method based on multi-group comparison

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN202311487781.7A Division CN117457072A (en) 2023-02-08 2023-02-08 Single cell expression pattern difference reliability screening method based on multi-group comparison

Publications (2)

Publication Number Publication Date
CN116312786A CN116312786A (en) 2023-06-23
CN116312786B true CN116312786B (en) 2023-11-28

Family

ID=86793200

Family Applications (2)

Application Number Title Priority Date Filing Date
CN202311487781.7A Pending CN117457072A (en) 2023-02-08 2023-02-08 Single cell expression pattern difference reliability screening method based on multi-group comparison
CN202310083585.7A Active CN116312786B (en) 2023-02-08 2023-02-08 Single cell expression pattern difference evaluation method based on multi-group comparison

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN202311487781.7A Pending CN117457072A (en) 2023-02-08 2023-02-08 Single cell expression pattern difference reliability screening method based on multi-group comparison

Country Status (1)

Country Link
CN (2) CN117457072A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107873054A (en) * 2014-09-09 2018-04-03 博德研究所 The method and apparatus based on droplet for compound single-cell nucleic acid analysis
CN112820353A (en) * 2021-01-22 2021-05-18 中山大学 Method and system for analyzing cell fate conversion key transcription factor
CN114364814A (en) * 2020-03-19 2022-04-15 新格元(南京)生物科技有限公司 Single cell RNA sequencing
CN114708910A (en) * 2022-02-24 2022-07-05 上海市第一人民医院 Method for calculating cell subset enrichment fraction in cell sequencing by using single cell sequencing data
CN115440303A (en) * 2022-11-03 2022-12-06 杭州联川生物技术股份有限公司 Filtering method, medium and device for low-quality cells of single-cell transcriptome

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180057859A1 (en) * 2016-05-06 2018-03-01 Craig E. Nelson Method for identifying rare cell types by single cell assisted deconvolution of population gene expression data

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107873054A (en) * 2014-09-09 2018-04-03 博德研究所 The method and apparatus based on droplet for compound single-cell nucleic acid analysis
CN114364814A (en) * 2020-03-19 2022-04-15 新格元(南京)生物科技有限公司 Single cell RNA sequencing
CN112820353A (en) * 2021-01-22 2021-05-18 中山大学 Method and system for analyzing cell fate conversion key transcription factor
CN114708910A (en) * 2022-02-24 2022-07-05 上海市第一人民医院 Method for calculating cell subset enrichment fraction in cell sequencing by using single cell sequencing data
CN115440303A (en) * 2022-11-03 2022-12-06 杭州联川生物技术股份有限公司 Filtering method, medium and device for low-quality cells of single-cell transcriptome

Also Published As

Publication number Publication date
CN117457072A (en) 2024-01-26
CN116312786A (en) 2023-06-23

Similar Documents

Publication Publication Date Title
US10347365B2 (en) Systems and methods for visualizing a pattern in a dataset
KR102349921B1 (en) taxonomy profiling method for microorganism in sample
US11954614B2 (en) Systems and methods for visualizing a pattern in a dataset
Torang et al. An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets
CN111081317B (en) Gene spectrum-based breast cancer lymph node metastasis prediction method and prediction system
CN107391963A (en) Eucaryon based on calculating cloud platform is without ginseng transcript profile interaction analysis system and method
US20210381056A1 (en) Systems and methods for joint interactive visualization of gene expression and dna chromatin accessibility
CN112133368A (en) Automated analysis method of metagenome sequencing data based on third-generation sequencing technology
CN111312334B (en) Receptor-ligand system analysis method for influencing intercellular communication
Marsh et al. Bioinformatic analysis of bacteria and host cell dual RNA-sequencing experiments
CN112599199A (en) Analysis method suitable for 10x single cell transcriptome sequencing data
CN107463801A (en) A kind of Drop seq data quality controls and analysis method
Chen et al. Tree2GD: a phylogenomic method to detect large-scale gene duplication events
CN116312786B (en) Single cell expression pattern difference evaluation method based on multi-group comparison
Trapnell et al. Monocle: Cell counting, differential expression, and trajectory analysis for single-cell RNA-Seq experiments
CN116525004B (en) Single cell expression pattern difference evaluation method, medium and device based on two groups of comparison
CN110970093B (en) Method and device for screening primer design template and application
CN115948521A (en) Method for detecting aneuploid missing chromosome information
CN113362895A (en) Comprehensive analysis method for predicting anti-cancer drug response related gene
US6994965B2 (en) Method for displaying results of hybridization experiment
Khan et al. MSuPDA: A memory efficient algorithm for sequence alignment
Meyer et al. ReadZS detects developmentally regulated RNA processing programs in single cell RNA-seq and defines subpopulations independent of gene expression
Jimenez Practical metagenomics: microbiome tutorial with QIIME 2
CN113793641B (en) Method for rapidly judging sample gender from FASTQ file
CN115527610B (en) Cluster analysis method for single-cell histology data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant