CN116486916A - Single cell transcriptome dying cell and multicellular filtration method, medium and equipment - Google Patents

Single cell transcriptome dying cell and multicellular filtration method, medium and equipment Download PDF

Info

Publication number
CN116486916A
CN116486916A CN202310175918.9A CN202310175918A CN116486916A CN 116486916 A CN116486916 A CN 116486916A CN 202310175918 A CN202310175918 A CN 202310175918A CN 116486916 A CN116486916 A CN 116486916A
Authority
CN
China
Prior art keywords
cell
multicellular
cells
neighborhood
expression profile
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310175918.9A
Other languages
Chinese (zh)
Inventor
陈哲名
郎秋蕾
韩斐然
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Lianchuan Biotechnology Co ltd
Original Assignee
Hangzhou Lianchuan Biotechnology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Lianchuan Biotechnology Co ltd filed Critical Hangzhou Lianchuan Biotechnology Co ltd
Priority to CN202310175918.9A priority Critical patent/CN116486916A/en
Publication of CN116486916A publication Critical patent/CN116486916A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B35/00ICT specially adapted for in silico combinatorial libraries of nucleic acids, proteins or peptides
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Medical Informatics (AREA)
  • Evolutionary Biology (AREA)
  • Theoretical Computer Science (AREA)
  • Biophysics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Biochemistry (AREA)
  • Molecular Biology (AREA)
  • Library & Information Science (AREA)
  • Artificial Intelligence (AREA)
  • Bioethics (AREA)
  • Chemical & Material Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Epidemiology (AREA)
  • Databases & Information Systems (AREA)
  • Public Health (AREA)
  • Software Systems (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention discloses a single-cell transcriptome low-quality cell filtering method, and relates to a biological data processing method. The method comprises the following steps: grouping cells; taking the average value of the expression quantity according to the genes to generate a characteristic expression profile of the cell population; randomly combining the characteristic expression profiles of the cell population two by two to generate artificial multicellular; combining the artificial multicellular expression profile and the real cell expression profile, and calculating the distance between each cell; setting a plurality of equidistant neighborhoods in a specified range, and calculating the artificial multicellular proportion of each real cell in the neighborhood under each neighborhood; counting the proportion distribution of the artificial multicellular under each neighborhood, solving the bimodal coefficient of the artificial multicellular, and taking the neighborhood with the largest bimodal coefficient as the optimal neighborhood; and in the optimal neighborhood, the specified number of real cells with the maximum artificial multicellular proportion are regarded as multicellular, and the real cells are deleted from the expression profile of the real cells. The filtering standard and the accuracy of single cell transcriptome data are improved, and the reliability of the data is enhanced.

Description

Single cell transcriptome dying cell and multicellular filtration method, medium and equipment
Cross Reference to Related Applications
The application is based on application number 2022113673004, and the application date is: 2022, 11, 03, under the name: a single cell transcriptome low quality cell filtration method, medium and apparatus are filed.
Technical Field
The invention relates to a biological data processing method, in particular to a single-cell transcriptome dying cell and multicellular filtration method, medium and equipment.
Background
Single cell transcriptome sequencing based on microfluidic technology enables quantification of gene expression of tens of thousands of cells in a single experiment. It is mainly based on sequence tags to identify single cells, and its core technology is to add a unique sequence tag to each cell, and to treat nucleic acid sequences carrying the same tag as being from the same cell when sequencing. The 10X Genomics single-cell transcriptome sequencing platform is a technology widely applied at present, and the platform utilizes technologies such as micro-flow control, oil drop package, barcode label and the like to realize high-flux cell sorting and capturing, can separate and mark 500 to tens of thousands of single cells at one time, can obtain transcriptome information of each cell after sequencing, and has the advantages of high cell flux, low library construction cost, short capturing period and the like.
A typical single cell transcriptome sequencing protocol is as follows, first a cell suspension is prepared, mixed with magnetic beads using microfluidic chips on a corresponding platform instrument, and encapsulated with oil droplets. Each microbead carries a unique nucleotide sequence, namely, a barcode tag, which can label individual cells. Each barcode tag is further linked to a molecular identifier (unique molecular identifier, UMI) consisting of a nucleotide sequence, each UMI being capable of labeling an mRNA transcript. Through reverse transcription, PCR amplification, library generation and sequencing, in sequencing data, whether each sequence in the result is from the same cell and the same mRNA can be determined according to the barcode tag and UMI mark, and the method can reduce the preference influence of PCR on different molecules. By matching and counting the barcode and UMI, the gene expression information is summarized in a counting matrix, thereby obtaining a transcriptome expression profile of individual cells.
Single cell experiments often obtain single cells in bulk based on dissociation and disruption of biological tissue, which often results in many cell fragments or apoptosis. Drop-based single cell transcriptome techniques also exist where two or more cells (or whole cells + cell debris) form a drop. Information of hundreds of thousands or millions of droplets is contained in single-cell transcriptome data, but the barcode in the droplets does not automatically identify whether the droplets contain cells or whether the cells are cell fragments or dead/dying cells or multicellular, that is, whether the quality of the cells cannot be automatically determined. The level of cell mass greatly influences the results in subsequent analyses, so that the type of droplet represented by barcode needs to be determined before data analysis. The software cellrange of the 10X Genomics authorities can only determine whether the barcode is empty droplets and cannot recognize the cell quality, which may lead to a large deviation of the single cell transcriptional analysis results from the actual situation and even the opposite results in a biological sense. There is currently no systematic way to identify low quality cells and filter cells.
Disclosure of Invention
In order to solve at least one technical problem mentioned in the background art, the invention aims to provide a single cell transcriptome low quality cell filtering method, medium and equipment, which are used for identifying and filtering low quality cells in single cell transcriptome data, so that the filtering standard and accuracy of the single cell transcriptome data are improved, and the reliability of the data is enhanced.
In order to achieve the above purpose, the present invention provides the following technical solutions:
a single cell transcriptome low quality cell filtration method comprising the steps of:
s101, grouping cells based on a true cell expression profile;
s1041, taking the average value of the expression quantity of the cell expression profile of each cell group according to genes, and generating the characteristic expression profile of each cell group;
s1042, combining the characteristic expression profiles of the cell population in a random mode to generate a certain number of artificial multicellular;
s1043, combining the artificial multicellular expression profile and the real cell expression profile, and calculating the distance between each cell;
s1044, setting a plurality of equidistant neighborhoods in a specified range, and calculating the artificial multicellular proportion of each real cell in the neighborhood under each neighborhood;
s1045, counting the proportion distribution of the artificial multicellular under each neighborhood, solving the bimodal coefficient of the artificial multicellular, and taking the neighborhood with the largest bimodal coefficient as the optimal neighborhood;
s1046, under the optimal neighborhood, identifying the actual cells with the maximum artificial multicellular proportion and the specified quantity as multicellular, and deleting the actual cells from the expression profile of the actual cells.
Further, the method for combining the characteristic expression profiles in pairs comprises the following steps:
Y=a1*X1+a2*X2
wherein Y is the generated artificial multicellular, X1 and X2 are the characteristic expression profiles of the cell population; a1 and a2 are proportionality coefficients, one of a1 and a2 is set to 1, and the other is set to a random value larger than 0 and smaller than 1.
Further, the distance between the cells is Euclidean distance or Manhattan distance.
Further, the artificial multicellular ratio is: the ratio of the number of artificial multicellular to the total number of cells in the neighborhood for which the combined expression profile is located.
Further, the bimodal coefficient is:
wherein BC is a bimodal coefficient; s and k are the skewness and kurtosis of the artificial multicellular proportional distribution respectively; n (N) real Is the true cell number.
Further, the method for determining the specified number is as follows: the multicellular rate is set so that the product of the actual cell number and the multicellular rate is defined as the actual number of cells identified as multicellular.
Further, in S1044, the setting of the neighborhood is as follows: 100 equidistant neighborhoods are arranged in the range of 0.0001-0.01.
A computer storage medium having stored thereon a computer program which when executed by a processor implements a single cell transcriptome low quality cell filtration method as described above.
A terminal device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing a single cell transcriptome low quality cell filtration method as described above when the computer program is executed.
Compared with the prior art, the invention has the beneficial effects that: according to the invention, a certain amount of artificial multicellular is generated, the artificial multicellular proportion distribution of each real cell in the neighborhood is counted for each set neighborhood, the optimal neighborhood is determined, then a plurality of real cells with the maximum artificial multicellular proportion are identified as multicellular under the optimal neighborhood, and the multicellular (low-quality cells) is deleted from the real cell expression profile, so that the filtering standard and the filtering accuracy of single-cell transcriptome data are improved, and the reliability of the data is enhanced.
Drawings
FIG. 1 is an overall flow chart of an embodiment of the present invention.
FIG. 2 is a flow chart of multi-cell filtration according to an embodiment of the invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely, and it is apparent that the described embodiments are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Embodiment one:
referring to fig. 1, the embodiment provides a single cell transcriptome low quality cell filtration method, the implementation process mainly includes steps S100 to S104, and the detailed description is as follows:
in step S100, the original expression profile (also called the true cell expression profile) may contain non-cellular data, so that the original expression profile is initially filtered by using cellrange software to remove non-cellular data in the data, and a primary filtered cell expression profile is generated. If the original expression profile does not contain non-cellular data, the original expression profile can be directly adopted for the next operation without filtering.
The original expression profile of this example is shown in table 1 below.
Table 1: original expression profile
In Table 1, C301, C15001 and the like represent cell numbers, G1, G2, G53 and the like represent gene numbers, and the data in the text of the tables represent gene expression levels.
The primary filtered cell expression profile of this example is shown in table 2 below.
Table 2: once-through filtration of cell expression profiles
In comparison to table 1 above, once-filtered cell expression profiles filtered cell C10001 and cell C15001.
Step S101, based on the primary filtered cell expression profile or the original expression profile, the cells are normalized, dimension reduced and clustered by using the SEURat software. The normalization method selects 'Log normal', and the scale.factor parameter is set to 10000; the screening method of the hypervariable gene uses 'vst'; PCA reduces the dimension to 50 PC dimensions; grouping was performed using the FindNeighbors function, with the resolution parameter set to 0.8. To this end, all cells are divided into different cell populations. The results of the clustering of the primary filtered cell expression profiles of this example are shown in Table 3 below:
table 3: grouping results of primary filtration of cell expression profiles
In table 3, the cells C1 and C601 are divided into the cell group a, and the other cells are divided into the cell group B, the cell group C and the cell group D, respectively.
Filtering cell debris and dead/dying cells is accomplished by steps S102 to S103. It is worth mentioning that cell debris and dead/dying cells can be filtered together or removed separately.
Step S102, setting four gene sets A respectively gene 、A mt 、A active 、A antioxi And calculating the expressed gene fraction S of each cell population gene Mitochondrial fraction S mt Score S of Activity active Oxidation resistance fraction S antioxi
Wherein A is gene For one filtration of the total gene set in the cell expression profile (including G1 to G53 in table 3 above); s is S gene Is the average value of the number of expressed genes (i.e., the number of genes whose expression level is greater than 0) in all cells in the cell population.
A mt Mitochondrial gene sets for the corresponding species (including G1 and G2 in table 3 above); s is S mt A for all cells in the cell population mt Average value of Gene expression proportion, A mt Ratio of gene expression = a in single cells mt Sum of Gene expression amount/A in Individual cells gene The sum of the gene expression amounts is 100%.
A active Housekeeping gene sets for the corresponding species (including G6 and G7 in Table 3 above) The method comprises the steps of carrying out a first treatment on the surface of the Such as human ACTB and GAPDH genes; s is S active A for all cells in the cell population active Average value of average expression level of Gene A active Average expression level of gene = a in single cell active Gene expression level sum/A active Number of genes.
A antioxi Sets of antioxidant genes for the corresponding species (including G20 and G21 in table 3 above); such as human SOD1, SOD2, SOD3, CAT, GPX1, GPX2, GPX3, GPX4, GPX5, GPX6, GPX7, GPX8, NQO1, NFE2L2 genes. S is S antioxi A for all cells in the cell population antioxi Average value of average expression level of Gene A antioxi Average expression level of gene = a in single cell antioxi Gene expression level sum/A antioxi Number of genes.
In summary, for each cell population, a corresponding expressed gene fraction S is obtained gene Mitochondrial fraction S mt Score S of Activity active Oxidation resistance fraction S antioxi Table 4 below is a table showing the scores obtained.
Table 4: four score display table
Fraction \cell population A B C D
Sgene 3499 2169 1469 320
Smt 1.37% 3.33% 5.33% 49.55%
Sactive 16.9 10.5 21.5 1.7
Santioxi 4 5 7 0.9
Step S103, set S gene 、S mt 、S active 、S antioxi Corresponding threshold G gene 、G mt 、G active 、G antioxi And determining the cell population type.
When S is gene Score less than G gene When the cell population is judged to be cell debris.
When S is mt Greater than G mt And S is antioxi Less than G antioxi At this time, the cell population was judged as dead/moribund cells.
When S is active Less than G active At this time, the cell population was judged as dead/moribund cells.
G gene 、G mt 、G active 、G antioxi Typically 500, 25%, 2.
As shown in table 4 above, cell population D (corresponding cell C8001) satisfies the above three simultaneously, and thus cell population D is judged as both cell debris and dead/dying cells.
In the primary filtered cell expression profile, cells judged as cell debris and dead/dying cells (either of which can be deleted) are deleted, and a secondary filtered cell expression profile is generated. The secondary filtered cell expression profile of this example is shown in Table 5.
Table 5: secondary filtration cell expression profile
The cells in the above primary and secondary filtered cell expression profiles are both truly present and therefore are also called true cell expression profiles.
Step S104, based on the simulated multicellular characteristic expression profile, the multicellular in the real cells is screened out based on knn algorithm, and the multicellular is filtered from the secondary filtered cell expression profile to generate a final filtered expression profile. The filtration of the multicellular may be performed independently or may be carried out after filtration of cell debris or dead/dying cells.
Referring to fig. 2, the steps S1041 and S1046 are specifically implemented (the grouping step is completed by the aforementioned step S101):
s1041, taking the average value (rounding) of the expression quantity of each cell group according to the genes, and generating the characteristic expression profile of the cell group.
Taking cell population a as an example, if it contains only 2 cells in table 5, the characteristic expression profile is as shown in table 5 below:
cell population\Gene G1 G2 G6 G7 G20 G21 G51 G52 G53
A 2 4 13 18 4 6 740 9 5
S1042, combining the characteristic expression profiles of all cell populations in pairs randomly to generate a certain duty ratio P N Is an artificial multicellular in the cell line, the duty ratio P N Typically set to 25%; duty cycle P of artificial multicellular N =artificial multicellular number/(artificial multicellular number+true cell number) ×100%;
the feature expression profile combining method comprises the following steps:
Y=a1*X1+a2*X2
wherein Y is the generated artificial multicellular, X1 and X2 are the characteristic expression profiles of the cell population; a1 and a2 are proportionality coefficients, one of a1 and a2 is set to be 1, and the other is set to be a random value larger than 0 and smaller than 1, namely, when a1 is 1, a2 is a random value between 0 and 1, and when a2 is 1, a1 is a random value 1< a1+a2<2 between 0 and 1. Each artificial multicellular thus produced contained one intact cell and one defective cell, i.e., simulated the production process of the multicellular in the actual experiment.
S1043, combining the artificial multicellular expression profile and the secondary filtration cell expression profile, and re-normalizing and PCA dimension reduction by using the SEURat software. Based on the PCA dimension reduction result, the distance between each cell is calculated, and is commonly known as Euclidean distance or Manhattan distance, and other distance measurement methods can play a similar role.
S1044, setting 100 equidistant neighborhoods P in the range of 0.0001-0.01 kn Calculate each domain P kn Under each real cell in the neighborhood P kn Artificial multicellular proportion P in ANN I.e. counting the nearest real cell (total number of cells N in the combined expression profile merge *P kn ) Number of artificial multicellular N in individual cells ANN Calculate P ANN =N ANN /(N merge *P kn )。
S1045, counting each neighborhood P kn P under ANN Distribution, the skewness s and kurtosis k of the distribution are calculated. The calculation formula of the skewness s isThe calculation formula of kurtosis k is +.>Wherein n is the same neighborhood P kn Lower P ANN Number, p of i For the same neighborhood P kn The ith P ANN M is the same neighborhood P kn All P below ANN Is the average value of SD and is the same neighborhood P kn All P below ANN Standard deviation of (2).
Recalculating each P ANN The bi-modal coefficient BC of the distribution,wherein N is real Is the true cell number. Selecting the neighborhood size when the bimodal coefficient BC is maximum as the last used optimal neighborhood P K
In the optimal neighborhood P K Under the neighborhood, boundaries of single cells and multiple cells can be maximally clarified, namely all cells can be divided into single cells and multiple cells as far as possible, and the number of intermediate state cells with fuzzy classification is reduced, so that a classification result as accurate as possible is obtained.
S1046, setting multiple cell rate R doub Calculating the expected value E of the multicellular number doub =N real *R doub . Calculating the size of the neighborhood as the optimal neighborhood P K At the time, P of each real cell ANN Will P ANN Maximum E doub Individual cells were identified as multicellular, which were deleted from the secondary filtered cell expression profile, resulting in a final filtered cell expression profile.
Embodiment two:
a computer storage medium having stored thereon a computer program which when executed by a processor implements the single cell transcriptome RNA contamination identification method of embodiment one and/or the single cell transcriptome low quality cell filtration method of embodiment two.
Embodiment III:
a terminal device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, the processor implementing the single cell transcriptome low quality cell filtration method of embodiment one when the computer program is executed.
It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.

Claims (8)

1. A single cell transcriptome dying cell and multicellular filtration method comprising the steps of:
s101, grouping cells based on a true cell expression profile;
gene set A was set for each gene mt 、A active 、A antioxi And calculating mitochondrial fraction S for each cell population mt Score S of Activity active Oxidation resistance fraction S antioxi ;A mt For mitochondrial gene sets of the corresponding species, S mt A for all cells in the cell population mt Average value of gene expression ratio; a is that active S for housekeeping Gene sets of the corresponding species active A for all cells in the cell population active Average value of average expression amount of gene; a is that antioxi Is the antioxidant gene set of the corresponding species, S antioxi A for all cells in the cell population antioxi Average value of average expression amount of gene; set S mt 、S active 、S antioxi Corresponding threshold G mt 、G active 、G antioxi And determining the cell population type; in the true cell expression profile, cells judged as dead/dying cells are deleted;
s1041, taking the average value of the expression quantity of the cell expression profile of each cell group according to genes, and generating the characteristic expression profile of each cell group;
s1042, combining the characteristic expression profiles of the cell population in a random mode to generate a certain number of artificial multicellular;
s1043, combining the artificial multicellular expression profile and the real cell expression profile, and calculating the distance between each cell;
s1044, setting a plurality of equidistant neighborhoods in a specified range, and calculating the artificial multicellular proportion of each real cell in the neighborhood under each neighborhood;
s1045, counting the proportion distribution of the artificial multicellular under each neighborhood, solving the bimodal coefficient of the artificial multicellular, and taking the neighborhood with the largest bimodal coefficient as the optimal neighborhood;
s1046, under the optimal neighborhood, identifying the actual cells with the maximum artificial multicellular proportion and the specified quantity as multicellular, and deleting the actual cells from the expression profile of the actual cells;
the bimodal coefficient is:
wherein BC is a bimodal coefficient; s and k are the skewness and kurtosis of the artificial multicellular proportional distribution respectively; n (N) real Is the true cell number.
2. The method for filtering dying cells and multiple cells from a single cell transcriptome according to claim 1, wherein said characteristic expression profile is obtained by a two-by-two combination method comprising:
Y=a1*X1+a2*X2
wherein Y is the generated artificial multicellular, X1 and X2 are the characteristic expression profiles of the cell population; one of a1 and a2 is set to 1, and the other is set to a random value greater than 0 and less than 1.
3. The method of claim 1, wherein the distance between cells is euclidean distance or manhattan distance.
4. The single cell transcriptome dying cell and multicellular filtration method of claim 1, wherein said artificial multicellular ratio is: the ratio of the number of artificial multicellular to the total number of cells in the neighborhood for which the combined expression profile is located.
5. The method for filtering dying cells and multiple cells from a single cell transcriptome according to claim 1, wherein said predetermined number is determined as follows: the multicellular rate is set so that the product of the actual cell number and the multicellular rate is defined as the actual number of cells identified as multicellular.
6. The method for filtering dying cells and multiple cells from a single cell transcriptome according to claim 1, wherein in said S1044, the neighborhood is set as follows: 100 equidistant neighborhoods are arranged in the range of 0.0001-0.01.
7. A computer storage medium having stored thereon a computer program which when executed by a processor implements the single cell transcriptome dying cell and multicellular filtration method of any one of claims 1 to 6.
8. A terminal device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor, when executing the computer program, implements the single cell transcriptome dying cell and multicellular filtration method of any one of claims 1 to 6.
CN202310175918.9A 2022-11-03 2022-11-03 Single cell transcriptome dying cell and multicellular filtration method, medium and equipment Pending CN116486916A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310175918.9A CN116486916A (en) 2022-11-03 2022-11-03 Single cell transcriptome dying cell and multicellular filtration method, medium and equipment

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211367300.4A CN115440303B (en) 2022-11-03 2022-11-03 Method, medium and equipment for filtering low-quality cells of unicellular transcriptome
CN202310175918.9A CN116486916A (en) 2022-11-03 2022-11-03 Single cell transcriptome dying cell and multicellular filtration method, medium and equipment

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN202211367300.4A Division CN115440303B (en) 2022-11-03 2022-11-03 Method, medium and equipment for filtering low-quality cells of unicellular transcriptome

Publications (1)

Publication Number Publication Date
CN116486916A true CN116486916A (en) 2023-07-25

Family

ID=84252019

Family Applications (3)

Application Number Title Priority Date Filing Date
CN202310175918.9A Pending CN116486916A (en) 2022-11-03 2022-11-03 Single cell transcriptome dying cell and multicellular filtration method, medium and equipment
CN202310181167.1A Pending CN116805511A (en) 2022-11-03 2022-11-03 Single cell transcriptome cell debris and multicellular filtration method, medium and equipment
CN202211367300.4A Active CN115440303B (en) 2022-11-03 2022-11-03 Method, medium and equipment for filtering low-quality cells of unicellular transcriptome

Family Applications After (2)

Application Number Title Priority Date Filing Date
CN202310181167.1A Pending CN116805511A (en) 2022-11-03 2022-11-03 Single cell transcriptome cell debris and multicellular filtration method, medium and equipment
CN202211367300.4A Active CN115440303B (en) 2022-11-03 2022-11-03 Method, medium and equipment for filtering low-quality cells of unicellular transcriptome

Country Status (1)

Country Link
CN (3) CN116486916A (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116312786B (en) * 2023-02-08 2023-11-28 杭州联川生物技术股份有限公司 Single cell expression pattern difference evaluation method based on multi-group comparison
CN117995275A (en) * 2023-03-02 2024-05-07 杭州联川生物技术股份有限公司 Single cell expression mode difference evaluation method, medium and equipment based on reliability screening

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101149327A (en) * 2007-11-06 2008-03-26 浙江大学 Antineoplastic drug evaluation and screening method based on cell microscopic image information
CN102483419A (en) * 2009-05-11 2012-05-30 博格生物系统有限责任公司 Methods for the diagnosis of metabolic disorders using epimetabolic shifters, multidimensional intracellular molecules, or environmental influencers
CA3008289A1 (en) * 2015-12-16 2017-06-22 The Walter And Eliza Hall Institute Of Medical Research Inhibition of cytokine-induced sh2 protein in nk cells
CN107368701A (en) * 2017-07-31 2017-11-21 浙江绍兴千寻生物科技有限公司 In high volume unicellular ATAC seq data quality controls and analysis method
CN110675914A (en) * 2019-09-17 2020-01-10 佛山市第一人民医院(中山大学附属佛山医院) Method for screening tumor specific T cells and TCR
CN112700820A (en) * 2021-01-07 2021-04-23 广州华银健康医疗集团股份有限公司 Cell subset annotation method based on single cell transcriptome sequencing
WO2021094421A1 (en) * 2019-11-15 2021-05-20 Miltenyi Biotec B.V. & Co. KG Color and bardcoded beads for single cell indexing
CN114438230A (en) * 2022-03-08 2022-05-06 南通大学附属医院 Cell development map and marker gene of human three-month-old embryo mandible tissue
CN116525010A (en) * 2023-04-26 2023-08-01 杭州联川生物技术股份有限公司 Single-cell transcriptome double-source multi-cell filtering method, medium and equipment

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200176080A1 (en) * 2017-07-21 2020-06-04 The Board Of Trustees Of The Leland Stanford Junior University Systems and Methods for Analyzing Mixed Cell Populations
CN111292807B (en) * 2018-12-06 2021-10-08 新格元(南京)生物科技有限公司 Method for analyzing double cells in single-cell transcriptome data
CN111951892A (en) * 2020-08-04 2020-11-17 荣联科技集团股份有限公司 Method for analyzing cell trajectory based on single cell sequencing data and electronic equipment
CN113674800B (en) * 2021-08-25 2022-02-08 中国农业科学院蔬菜花卉研究所 Cell clustering method based on single cell transcriptome sequencing data
CN114708910B (en) * 2022-02-24 2024-05-14 上海市第一人民医院 Method for calculating enrichment score of cell subpopulations in cell sequencing by using single cell sequencing data

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101149327A (en) * 2007-11-06 2008-03-26 浙江大学 Antineoplastic drug evaluation and screening method based on cell microscopic image information
CN102483419A (en) * 2009-05-11 2012-05-30 博格生物系统有限责任公司 Methods for the diagnosis of metabolic disorders using epimetabolic shifters, multidimensional intracellular molecules, or environmental influencers
CA3008289A1 (en) * 2015-12-16 2017-06-22 The Walter And Eliza Hall Institute Of Medical Research Inhibition of cytokine-induced sh2 protein in nk cells
CN107368701A (en) * 2017-07-31 2017-11-21 浙江绍兴千寻生物科技有限公司 In high volume unicellular ATAC seq data quality controls and analysis method
CN110675914A (en) * 2019-09-17 2020-01-10 佛山市第一人民医院(中山大学附属佛山医院) Method for screening tumor specific T cells and TCR
WO2021094421A1 (en) * 2019-11-15 2021-05-20 Miltenyi Biotec B.V. & Co. KG Color and bardcoded beads for single cell indexing
CN112700820A (en) * 2021-01-07 2021-04-23 广州华银健康医疗集团股份有限公司 Cell subset annotation method based on single cell transcriptome sequencing
CN114438230A (en) * 2022-03-08 2022-05-06 南通大学附属医院 Cell development map and marker gene of human three-month-old embryo mandible tissue
CN116525010A (en) * 2023-04-26 2023-08-01 杭州联川生物技术股份有限公司 Single-cell transcriptome double-source multi-cell filtering method, medium and equipment

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
GRANDI, F等: "popsicleR: A R Package for Pre-processing and Quality Control Analysis of Single Cell RNA-seq Data", JOURNAL OF MOLECULAR BIOLOGY, vol. 434, no. 11, 15 June 2022 (2022-06-15) *
JEANETTE BARAN-GALE等: "Experimental design for single-cell RNA sequencing", BRIEFINGS IN FUNCTIONAL GENOMICS, vol. 17, no. 4, 31 July 2018 (2018-07-31), pages 233 - 239 *
师航等: "基于单细胞转录组测序分析人肋来源软骨细胞的异质性", 中国组织工程研究, vol. 26, no. 19, 31 July 2022 (2022-07-31), pages 3011 - 3017 *
程玖玲: "基于单细胞转录组测序技术对支气管哮喘急性发作期患者肺内重要免疫细胞表型的分析", 中国优秀硕士学位论文全文数据库(电子期刊)医药卫生科技辑, 15 May 2022 (2022-05-15), pages 063 - 4 *
红外线BIUBIU: "2019-Molecular Systems Biology-单细胞最佳流程指导", Retrieved from the Internet <URL:https://www.jianshu.com/p/93c83fe1ee16> *

Also Published As

Publication number Publication date
CN116805511A (en) 2023-09-26
CN115440303A (en) 2022-12-06
CN115440303B (en) 2023-02-10

Similar Documents

Publication Publication Date Title
CN116486916A (en) Single cell transcriptome dying cell and multicellular filtration method, medium and equipment
Fleming et al. CellBender remove-background: a deep generative model for unsupervised removal of background noise from scRNA-seq datasets
Richardson et al. Evaluating and optimizing the performance of software commonly used for the taxonomic classification of DNA metabarcoding sequence data
Boedigheimer et al. Sources of variation in baseline gene expression levels from toxicogenomics study control animals across multiple laboratories
Christie et al. Bayesian parentage analysis with systematic accountability of genotyping error, missing data and false matching
CN112349346A (en) Method for detecting structural variations in genomic regions
Sekula et al. Detection of differentially expressed genes in discrete single-cell RNA sequencing data using a hurdle model with correlated random effects
CN107832584B (en) Gene analysis method, device, equipment and storage medium of metagenome
CN116525010A (en) Single-cell transcriptome double-source multi-cell filtering method, medium and equipment
Schwender et al. Identifying interesting genes with siggenes
Boggy et al. BFF and cellhashR: analysis tools for accurate demultiplexing of cell hashing data
Kim et al. Prioritizing hypothesis tests for high throughput data
Gorin et al. Distinguishing biophysical stochasticity from technical noise in single-cell RNA sequencing using Monod
Song et al. Microarray blob-defect removal improves array analysis
CN113628682B (en) T790M and C797S cis-trans mutation type identification and calculation method
Kuo et al. Quantification and statistical modeling of Chromium-based single-nucleus RNA-sequencing data
Pan et al. The Poisson distribution model fits UMI-based single-cell RNA-sequencing data
Yang et al. Background modeling, quality control and normalization for GeoMx RNA data with GeoDiff
CN107609349A (en) A kind of project implementation quality control system in bioanalysis platform
CN115472222B (en) Single cell transcriptome RNA pollution identification method, medium and equipment
CN116525004B (en) Single cell expression pattern difference evaluation method, medium and device based on two groups of comparison
CN116072217B (en) Single cell transcriptome data availability processing method, medium and equipment
CN110706745B (en) Single nucleotide polymorphism site integration method and device
CN103377317A (en) Computer-implemented method and computer system for rank normalization for differential expression analysis of transcriptome sequencing data
US11468970B2 (en) Allelotyping methods for massively parallel sequencing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination