CN110846411B - Method for distinguishing gene mutation types of single tumor sample based on next generation sequencing - Google Patents
Method for distinguishing gene mutation types of single tumor sample based on next generation sequencing Download PDFInfo
- Publication number
- CN110846411B CN110846411B CN201911147268.7A CN201911147268A CN110846411B CN 110846411 B CN110846411 B CN 110846411B CN 201911147268 A CN201911147268 A CN 201911147268A CN 110846411 B CN110846411 B CN 110846411B
- Authority
- CN
- China
- Prior art keywords
- mutation
- tissue sample
- tumor
- tumor tissue
- normal tissue
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/50—Mutagenesis
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/10—Sequence alignment; Homology search
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Analytical Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Evolutionary Biology (AREA)
- Theoretical Computer Science (AREA)
- Zoology (AREA)
- Immunology (AREA)
- Wood Science & Technology (AREA)
- Pathology (AREA)
- Medical Informatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Oncology (AREA)
- Hospice & Palliative Care (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention relates to a method for distinguishing gene mutation types by using single tumor samples based on next generation sequencing, which respectively uses the tumor tissue samples and normal tissue samples to build a library and carry out NGS sequencing, takes the chain preference, different types of base frequencies, base comparison quality and noise frequency of mutation sites stored in an intermediate file BAM for analyzing the biological information of the tumor tissue samples as training characteristics of machine learning, simultaneously pairs the type information of the corresponding mutation sites of the normal tissue samples as a prediction mutation type, constructs a classification prediction model for distinguishing somatic mutation and germ system mutation, distinguishes the somatic mutation and the germ system mutation by using the model, has high detection efficiency and high specificity, can use the single tumor samples to carry out NGS sequencing and mutation detection after the model is built, can well save the detection cost of the normal or cancer samples, meanwhile, the problem that normal tissues of specific types of tumor patients are not easy to obtain can be solved.
Description
Technical Field
The invention belongs to the technical field of gene detection, and particularly relates to a method for distinguishing gene mutation types of single tumor samples based on next generation sequencing.
Background
High-throughput sequencing (NGS) is a large-scale parallel sequencing technology, sequences covering all genes in a sample can be detected by using a high-throughput sequencing mode, and geneticists and oncologists can obtain gene mutation information of a plurality of target regions on a genome at one time by combining related variation detection software. Different types of genetic mutations may occur in each cell of an individual, and the genetic mutations can be divided into genetic mutations and somatic mutations according to sources, the genetic mutations are inherited by parents, the genetic mutations are also called germ line mutations or germ line mutations because the mutations exist in germ cells of parents, and if the germ line mutations occur, all cells in the individual carry the germ line mutations; somatic mutations occur at any stage of development of an individual from a fertilized egg and are present only in certain specific cells, not every cell of the individual, and are not inherited from parents but are altered by the influence of the environment and other factors, and gene mutations are caused by errors in self-replication of DNA during cell division.
The genetic mutation analysis and judgment of tumor tissues of tumor patients usually needs to be combined with normal tissue samples, namely, the samples are matched for detection, and meanwhile, the germ line mutation and the somatic mutation are distinguished through bioinformatics analysis processes such as GATK and the like. Theoretically, the mutation ratio of homozygous germline mutation is 100%, and the mutation ratio of heterozygous germline mutation is 50%, based on which the germline mutation and the somatic mutation can be distinguished, but actually, because of the different preference of DNA double-strand amplification in the experimental detection process, the proportion of DNA fragments obtained by second-generation sequencing to DNA fragments of germline mutation is 100% or less or about 50%.
In the prior art, the single tumor sample region somatic mutation and embryonic line mutation of the used whole exon sequencing and targeted sequencing are mainly compared with the existing mutation database to filter out the embryonic line mutation of the detected mutant, the method adopts the self-built VariantDx to detect all the mutations, filters out the mutation with the frequency of more than 1% of the human population in the thousand human genome project and the mutation in dbSNP (version number 138), and selects the mutation in COSMIC and kinase structural domain to determine as somatic mutation. However, the above method has the disadvantages that the judgment criteria of the germline mutation and the somatic mutation mainly depend on the existing database, and the positive and negative chain preference of the sequenced DNA fragment generated in the experimental detection process and the mutation frequency characteristics of the germline mutation per se are not considered, so that some germline mutations which are not recorded in the database are finally found and are not filtered and are judged as the somatic mutation, and meanwhile some somatic mutations are filtered out due to the consistency of the locus and the variant amino acid with dbSNP and are judged as the germline mutation, so that the specificity of the differentiation judgment is low and is only 67% effective.
Disclosure of Invention
The invention aims to provide a method for distinguishing gene mutation types of single tumor samples based on next generation sequencing, which can really and quickly distinguish somatic mutation and germ line mutation by establishing a classification model as a classifier for predicting the mutation types of the single tumor samples.
In order to achieve the purpose, the invention is realized by the following technical scheme:
a method for differentiating gene mutation types based on single tumor samples of next generation sequencing, comprising the following steps:
s1, extracting DNA from the tumor tissue sample and the normal tissue sample, and performing library building and sequencing on the DNA extracted from the tumor tissue sample and the normal tissue sample by adopting a probe capture method;
s2, performing sequence alignment on the DNA sequencing data from the tumor tissue sample and the normal tissue sample respectively obtained in step S1 by using BWA MEM algorithm, and generating an alignment file tumor.dup.bam belonging to the tumor tissue sample and an alignment file normal.dup.bam belonging to the normal tissue sample;
s3, analyzing the germ line mutation in the normal tissue sample and the mixed mutation in the tumor tissue sample respectively by utilizing the GATK standard process, and analyzing the somatic mutation in the tumor tissue sample by combining the normal tissue sample and the tumor tissue sample;
s4, using the mutation site information of the mixed mutation in the tumor tissue sample obtained in the step S3 as the input characteristic of a data set required for creating machine learning, and simultaneously combining the germline mutation site in the normal tissue sample obtained in the step S3 and the somatic mutation site in the tumor tissue sample as the mutation type result to be predicted;
s5, randomly dividing the data set obtained in the step S4 into a training data set and a testing data set, performing model training on the training data set by adopting an SVM or KNN, constructing a training model, testing by using the testing data set, evaluating the prediction effect of the training model, and selecting an optimal training model to further obtain a machine learning model capable of distinguishing the gene mutation type in the tumor tissue sample;
s6, the machine learning model obtained in the step S5 is used for carrying out splitting verification on the whole data set obtained in the step S4, and finally the obtained classification model can be used as a classifier for prediction of mutation types of brand-new single tumor samples and distinguishing somatic mutation and germ line mutation.
Further, in step S3, the HaploCaller tool in the GATK standard process is used to separately process the normal tissue sample alignment file, so as to analyze the germline mutation in the normal tissue sample; the tumor tissue sample comparison file is independently processed by utilizing a Mutect2 tool in the GATK standard flow, and then mixed mutation in the tumor tissue sample is analyzed; and (3) processing the comparison file of the normal tissue sample and the comparison file of the tumor tissue sample by utilizing a Mutect2 tool in the GATK standard flow, and further analyzing the somatic mutation in the tumor tissue sample.
Further, in step S4, the mutation site information includes information of the mutation site corresponding to the tumor tissue sample alignment file, and bam-recount software is used to calculate the base numbers of the positive strand and the negative strand of the reference base, the base numbers of the positive strand and the negative strand of the mutant allele, the base numbers of the positive strand and the negative strand of other noise bases, and the corresponding base average quality values of the mutation site, and the aligned average quality values.
Further, the training set tested in step S4 includes ATCG base frequencies, base average quality values, and aligned average quality values based on the mutated sites of somatic mutations and the mutated sites of germline mutations known through step S3.
Further, the whole data set is split 10-20 times in the step S6.
The invention has the following beneficial effects:
1. the invention respectively uses the tumor tissue sample and the normal tissue sample to build a library and carry out NGS sequencing, takes the chain preference, different types of base frequencies, base comparison quality and noise frequency of mutation sites stored in an intermediate file BAM for analyzing biological information of the tumor tissue sample as training characteristics of machine learning, simultaneously pairs the mutation information of the normal tissue sample as a prediction mutation type, constructs a classification prediction model for distinguishing somatic mutation and germ system mutation, and uses the model to distinguish the somatic mutation from the germ system mutation, so that the detection efficiency and the specificity are high, and after the model is built, the NGS sequencing and mutation detection can be carried out by using the single tumor sample, thereby well saving the detection cost of the normal or cancer sample, and simultaneously solving the problem that the normal tissue of a tumor patient of a specific type is difficult to obtain.
2. The invention overcomes the defect that the standard for distinguishing the germline mutation and the somatic mutation depends on the existing database in the prior art, avoids the judgment error that the germline mutation which is not recorded by some databases is judged as the somatic mutation or some somatic mutations are misjudged as the germline mutation because the sites and the variant amino acids are consistent with dbSNP because the sequencing DNA segment positive and negative chain preference generated by an experimental method and the mutation frequency characteristics of the germline mutation per se are not considered in the prior art, utilizes the prediction classification model as a classifier for the prediction of the mutation types of individual tumor samples, and can truly and rapidly distinguish the somatic mutation and the germline mutation.
3. According to the invention, through the analysis of the results of the target NGC sequencing of 189 tumor tissues, the germline mutation type and the somatic mutation type in a mutation site can be better identified, the target gene of the target sequencing is the gene related to tumor target chemotherapy drugs, the sensitivity of classification of the germline mutation after model training by adopting an SVM algorithm can reach 91.5%, and the specificity reaches 85.32%;
drawings
FIG. 1 is a flow chart of the present invention.
Detailed Description
The present invention will be described in detail below with reference to specific embodiments and the accompanying drawings.
Referring to fig. 1, a method for distinguishing gene mutation types of individual tumor samples based on next generation sequencing specifically comprises the following steps:
s1, 189 target areas are selected, DNA extraction is carried out on the 189 target areas in a tumor tissue sample and a normal tissue sample, the DNA extracted from the tumor tissue sample and the normal tissue sample is subjected to library building and sequencing by a probe capture method, and original data of 189 target area sequencing is obtained;
s2, performing sequence alignment on the DNA sequencing data from the tumor tissue sample and the normal tissue sample respectively obtained in step S1 by using BWA MEM algorithm, and generating an alignment file tumor.dup.bam belonging to the tumor tissue sample and an alignment file normal.dup.bam belonging to the normal tissue sample;
s3, independently processing the normal tissue sample comparison file by utilizing a HaploCaller tool in the GATK standard flow, and further analyzing the germ line mutation in the normal tissue sample; processing a normal tissue sample comparison file and a tumor tissue sample comparison file by using a Mutect2 tool in a GATK standard flow, and further analyzing somatic mutation in the tumor tissue sample; processing the tumor tissue sample comparison file (without adding a normal tissue sample comparison file) by using a Mutect2 tool in a GATK standard process to obtain a VCF file mixed with somatic mutation and potential germline mutation, wherein all mutations contained in the VCF file are mixed mutations;
s4, extracting mutation site information from the mixed mutation in the tumor tissue sample obtained in the step S3 to be used as a training data set required by machine learning; calculating the base numbers of the positive strand and the negative strand of the reference base, the base numbers of the positive strand and the negative strand of the mutant allele, the base numbers of the positive strand and the negative strand of other noise bases, the corresponding mutant site base average quality values and the aligned average quality values by adopting bam-ready count software according to the information of the mutant site corresponding to the tumor tissue sample alignment file; the bam-readcount operation method comprises the following steps:
bam-readcount-f hg19.fasta${sample_bam}-l${snpf}-i>${outdir}/${sample}_snps.txt 2>/dev/null
wherein, $ { sample _ bam } is a duplication removal comparison file after comparison of the tumor tissue samples; $ snpf is the file name, including the chromosome of the detected mutation site and the corresponding base coordinates;
extracting mutation sites in the mixed mutation, marking corresponding homozygous germline mutation, heterozygous germline mutation and somatic mutation existing in the paired sample, namely the germline mutation sites in the normal tissue sample and the somatic mutation in the tumor tissue sample, to obtain 13130 mutation sites, wherein the ATCG base frequency, the average base quality value and the compared average quality value of the mutation sites form the input features of the data set;
s5, characteristic values and corresponding mutation types of 13130 mutation sites are extracted to form a whole data set for machine learning, 3/4 of total data are randomly extracted to serve as a training data set, the rest of total data serve as a testing data set, training is carried out by using a kNN algorithm and an SVM algorithm of an open source machine learning kit skleran of python, and then a testing model capable of distinguishing gene mutation types in a tumor tissue sample is obtained, wherein code statement blocks for training are as follows:
using the KNN algorithm:
using the SVM algorithm:
s6, carrying out multiple random splitting (preferably 10-20 times) verification on the whole data set obtained in S5 by the test model obtained in the step S5, and finally obtaining a classification model which can be used as a classifier for prediction of mutation types of brand-new single tumor samples and distinguishing somatic mutation and germ line mutation, wherein the number of predicted sites of the prediction classification model obtained by respectively training two algorithms is as follows:
test method | TP | TN | FP | FN(het) | FN(hom) | Sensitivity of the probe | Specificity of |
SVM | 8614 | 3174 | 546 | 706 | 90 | 92.5% | 85.32% |
kNN | 8385 | 3180 | 540 | 858 | 167 | 89.1% | 85.48% |
From the table above, it can be seen that the prediction accuracy using SVM is high, and accurate 8614 true germline mutations can be predicted from 9410 germline mutations, 3174 of 3720 individual cell mutations can be predicted, the sensitivity reaches 92.5%, and the specificity reaches 86.32%.
The above description is only an embodiment of the present invention, and not intended to limit the scope of the present invention, and all modifications of equivalent structures and equivalent processes, which are made by the present specification, or directly or indirectly applied to other related technical fields, are included in the scope of the present invention.
Claims (4)
1. A method for distinguishing gene mutation types of single tumor samples based on next generation sequencing is characterized by comprising the following steps:
s1, extracting DNA from the tumor tissue sample and the normal tissue sample, and performing library building and sequencing on the DNA extracted from the tumor tissue sample and the normal tissue sample by adopting a probe capture method;
s2, performing sequence comparison on the DNA sequencing data which are respectively obtained from the tumor tissue sample and the normal tissue sample in the step S1 by using a BWAMEM algorithm, and simultaneously generating a comparison file Tumor.dup.bam belonging to the tumor tissue sample and a comparison file Normal.dup.bam belonging to the normal tissue sample;
s3, analyzing the germ line mutation in the normal tissue sample and the mixed mutation in the tumor tissue sample respectively by utilizing the GATK standard process, and analyzing the somatic mutation in the tumor tissue sample by combining the normal tissue sample and the tumor tissue sample;
s4, using mutation site information of mixed mutation in the tumor tissue sample obtained in the step S3 as a data set input characteristic required by machine learning creation; calculating the base numbers of the positive strand and the negative strand of the reference base, the base numbers of the positive strand and the negative strand of the mutant allele, the base numbers of the positive strand and the negative strand of other noise bases, the corresponding base average quality values of the mutant sites and the aligned average quality values by adopting bam-ready count software according to the information of the mutant sites in the tumor tissue sample alignment file; meanwhile, combining the germ line mutation site in the normal tissue sample obtained in the step S3 and the somatic mutation site in the tumor tissue sample, inputting characteristics as a mutation type result to be predicted;
s5, randomly dividing the data set obtained in the step S4 into a training data set and a testing data set, performing model training on the training data set by adopting an SVM or KNN, constructing a training model, testing by using the testing data set, evaluating the prediction effect of the training model, and selecting an optimal training model to further obtain a machine learning model capable of distinguishing the gene mutation type in the tumor tissue sample;
s6, the machine learning model obtained in the step S5 is used for carrying out splitting verification on the whole data set obtained in the step S4, and finally the obtained classification model can be used as a classifier for prediction of mutation types of brand-new single tumor samples and distinguishing somatic mutation and germ line mutation;
the method for distinguishing gene mutation types based on single tumor samples of next generation sequencing is used for the purpose of diagnosis of non-diseases.
2. The method of claim 1, wherein the single tumor sample is subjected to next generation sequencing to distinguish the types of gene mutations, and wherein: in the step S3, a HaploCaller tool in the GATK standard flow is used for independently processing the comparison file of the normal tissue sample, and further the germ line mutation in the normal tissue sample is analyzed; the tumor tissue sample comparison file is independently processed by utilizing a Mutect2 tool in the GATK standard flow, and then mixed mutation in the tumor tissue sample is analyzed; and (3) processing the comparison file of the normal tissue sample and the comparison file of the tumor tissue sample by utilizing a Mutect2 tool in the GATK standard flow, and further analyzing the somatic mutation in the tumor tissue sample.
3. The method of claim 1, wherein the single tumor sample is subjected to next generation sequencing to distinguish the types of gene mutations, and wherein: the test training set of step S5 includes the ATCG base frequency, the average quality value of bases, and the average quality value of alignment based on the mutated site of somatic mutation and the mutated site of germline mutation known through step S3.
4. The method of claim 3, wherein the single tumor sample is subjected to next generation sequencing to distinguish the types of gene mutations, wherein: in the step S6, the whole data set is split 10-20 times.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911147268.7A CN110846411B (en) | 2019-11-21 | 2019-11-21 | Method for distinguishing gene mutation types of single tumor sample based on next generation sequencing |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911147268.7A CN110846411B (en) | 2019-11-21 | 2019-11-21 | Method for distinguishing gene mutation types of single tumor sample based on next generation sequencing |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110846411A CN110846411A (en) | 2020-02-28 |
CN110846411B true CN110846411B (en) | 2020-09-18 |
Family
ID=69602955
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911147268.7A Active CN110846411B (en) | 2019-11-21 | 2019-11-21 | Method for distinguishing gene mutation types of single tumor sample based on next generation sequencing |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110846411B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111304308A (en) * | 2020-03-02 | 2020-06-19 | 北京泛生子基因科技有限公司 | Method for auditing detection result of high-throughput sequencing gene variation |
CN111429968B (en) * | 2020-03-11 | 2021-06-22 | 至本医疗科技(上海)有限公司 | Method, electronic device, and computer storage medium for predicting tumor type |
CN111793678A (en) * | 2020-07-30 | 2020-10-20 | 臻悦生物科技江苏有限公司 | Method and kit for detecting homologous recombination pathway gene mutation based on next-generation sequencing technology |
CN112289376B (en) * | 2020-10-26 | 2021-07-06 | 北京吉因加医学检验实验室有限公司 | Method and device for detecting somatic cell mutation |
CN113764044B (en) * | 2021-08-31 | 2023-07-21 | 华南理工大学 | Method for constructing myelodysplastic syndrome progress gene prediction model |
CN118006770A (en) * | 2022-11-09 | 2024-05-10 | 杭州诺辉健康科技有限公司 | Detection of cancer using positive and negative strand differences of free DNA molecules |
CN116646006B (en) * | 2023-07-27 | 2023-11-14 | 华测检测认证集团股份有限公司 | Tumor related gene system mutation detection method and device based on high-throughput sequencing and Gaussian mixture model |
CN117238365A (en) * | 2023-08-24 | 2023-12-15 | 深圳爱湾医学检验实验室 | Neonatal genetic disease early screening method and device based on high-throughput sequencing technology |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104946765A (en) * | 2015-06-25 | 2015-09-30 | 华中农业大学 | Somatic mutation site excavation method based on genomic sequencing |
CN108021789A (en) * | 2017-12-16 | 2018-05-11 | 普瑞基准科技(北京)有限公司 | A kind of comprehensive strategic for identifying somatic mutation |
CN109337957A (en) * | 2018-12-25 | 2019-02-15 | 江苏医联生物科技有限公司 | The method for detecting genome multimutation type |
CN110093417A (en) * | 2018-01-31 | 2019-08-06 | 北京大学 | A method of the detection unicellular somatic mutation of tumour |
CN110468189A (en) * | 2019-08-29 | 2019-11-19 | 北京优迅医学检验实验室有限公司 | The method and device of detection sample somatic variation is sequenced based on single two generation of sample |
-
2019
- 2019-11-21 CN CN201911147268.7A patent/CN110846411B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104946765A (en) * | 2015-06-25 | 2015-09-30 | 华中农业大学 | Somatic mutation site excavation method based on genomic sequencing |
CN108021789A (en) * | 2017-12-16 | 2018-05-11 | 普瑞基准科技(北京)有限公司 | A kind of comprehensive strategic for identifying somatic mutation |
CN110093417A (en) * | 2018-01-31 | 2019-08-06 | 北京大学 | A method of the detection unicellular somatic mutation of tumour |
CN109337957A (en) * | 2018-12-25 | 2019-02-15 | 江苏医联生物科技有限公司 | The method for detecting genome multimutation type |
CN110468189A (en) * | 2019-08-29 | 2019-11-19 | 北京优迅医学检验实验室有限公司 | The method and device of detection sample somatic variation is sequenced based on single two generation of sample |
Non-Patent Citations (2)
Title |
---|
Detection of Somatic Mutations in Exome Sequencing of Tumor-only Samples;Hsu et al;《Scientific REPORTS》;20171121;全文 * |
Optimized targeted sequencing of cell-free plasma DNA from bladder cancer patients;Christensen et al;《SCIENTIFIC REPORTS》;20181231;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN110846411A (en) | 2020-02-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110846411B (en) | Method for distinguishing gene mutation types of single tumor sample based on next generation sequencing | |
CN109767810B (en) | High-throughput sequencing data analysis method and device | |
US11193175B2 (en) | Normalizing tumor mutation burden | |
CN107229841B (en) | A kind of genetic mutation appraisal procedure and system | |
CN111304303B (en) | Method for predicting microsatellite instability and application thereof | |
CN108256292B (en) | Copy number variation detection device | |
CN113096728B (en) | Method, device, storage medium and equipment for detecting tiny residual focus | |
CN112289376B (en) | Method and device for detecting somatic cell mutation | |
CN112746097A (en) | Method for detecting sample cross contamination and method for predicting cross contamination source | |
CN113674803A (en) | Detection method of copy number variation and application thereof | |
US10294518B2 (en) | Methods and systems for ultra-sensitive detection of genomic alterations | |
CN109461473B (en) | Method and device for acquiring concentration of free DNA of fetus | |
CN113789371B (en) | Batch correction-based copy number variation detection method | |
WO2024140368A1 (en) | Sample cross contamination detection method and device | |
CN112837748A (en) | System and method for distinguishing tumors of different anatomical origins | |
WO2016112539A1 (en) | Method and device for determining fetal nucleic acid content | |
CN104598775A (en) | RNA (Ribonucleic Acid) edit event recognition mechanism and high-efficiency implementation scheme of mechanism | |
Mayrink et al. | Bayesian factor models for the detection of coherent patterns in gene expression data | |
Subramanian et al. | Novel multisample scheme for inferring phylogenetic markers from whole genome tumor profiles | |
Anderson et al. | Simultaneous brain cell type and lineage determined by scRNA-seq reveals stereotyped cortical development | |
Wang et al. | Single-cell copy number lineage tracing enabling gene discovery | |
WO2024175089A1 (en) | Single-molecule strand-specific end modalities | |
EP4297037A1 (en) | Device for determining an indicator of presence of hrd in a genome of a subject | |
CN107447021A (en) | A kind of method and its application of the accurate identification genotype based on high-flux sequence | |
US20170226588A1 (en) | Systems and methods for dna amplification with post-sequencing data filtering and cell isolation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20210928 Address after: 201318 1-3 / F, 1 / F, No. 11, Lane 100, Banxia Road, Pudong New Area, Shanghai Patentee after: SHANGHAI RENDONG MEDICAL LABORATORY Co.,Ltd. Patentee after: Suzhou Rendong Bioengineering Co.,Ltd. Address before: 200120 floor 1-3, basement 1, No. 11, Lane 100, Banxia Road, Pudong New Area, Shanghai Patentee before: SHANGHAI RENDONG MEDICAL LABORATORY Co.,Ltd. |
|
TR01 | Transfer of patent right |