CN109767812A - Method for detecting sequence variation in tumor peripheral blood samples - Google Patents

Publication number
CN109767812A
Authority
CN
China
Prior art keywords
blood sample
algorithm
variation
detecting
genereader
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811588416.4A
Other languages
Chinese (zh)
Inventor
杨文婷
陈亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Medical Union Biotechnology Co Ltd
Original Assignee
Jiangsu Medical Union Biotechnology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Medical Union Biotechnology Co Ltd
Priority to CN201811588416.4A
Publication of CN109767812A
Legal status: Pending

Abstract

The present invention relates to a method for detecting sequence variation in tumor peripheral blood samples. A tumor peripheral blood sample is analyzed with the GeneReader algorithm and with the SiNVICT algorithm, the results obtained by the two algorithms are compared, and the results on which they agree are retained as the final detection result. The GeneReader algorithm is obtained by combining a supervised method with an unsupervised method. The method is highly sensitive and suited to tumor peripheral blood samples: it detects variant information with high sensitivity and high accuracy, essentially does not miss true mutations, and judges whether a candidate is a credible variant using multiple parameters, thereby effectively filtering out false-positive mutation sites and meeting the needs of practical application.

Description

Method for detecting sequence variation in tumor peripheral blood samples
Technical field
The invention belongs to the technical field of gene detection, and in particular relates to a method for detecting sequence variation in tumor peripheral blood samples.
Background
As next-generation sequencing has matured and its price has fallen, gene sequencing has found wide application in medicine. Taking tumor drug use and clinical trials as an example, researchers can take cancer tissue or blood samples to study how effective a drug is against cancer types carrying different biomarkers, to study the mechanisms of cancer progression and metastasis and the markers they produce, and to screen for markers of early-stage tumors or of recurrence.
Because tumor tissue is often very difficult to obtain, detecting sequence variation in tumor peripheral blood (ctDNA) has become an important technique in recent years. To some extent it can also be used to monitor a patient's variant information and post-operative recovery. However, sequencing of tumor peripheral blood (ctDNA) differs from tissue sequencing in many ways: for example, the fragments are short (generally shorter than 150 bp) and the variant frequency is low (about one in a thousand). Variant detection in tumor peripheral blood is therefore not suited to conventional analysis pipelines such as the official GATK workflow, and requires a purpose-built algorithm pipeline and parameter tuning.
The cells in a tissue sample are relatively pure, so detecting variants in tissue is comparatively easy. The DNA in a blood sample comes from more mixed sources, and the fraction of DNA fragments carrying tumor mutations is quite low, so a more sensitive algorithm is needed to identify them. GATK has the following shortcomings. First, it is not sensitive enough to detect sites with an extremely low mutation rate in blood. Second, the model parameters GATK uses are trained on tissue data and are not suited to blood samples. Third, for efficiency GATK reduces the data volume by random downsampling, which can cause true mutations to be missed. Fourth, GATK lacks flexible and diverse filtering parameters, so it cannot combine multiple parameters to filter out false-positive mutation sites well.
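The effect of downsampling on a low-frequency variant can be illustrated with a short calculation. The sketch below is not part of the patent: it assumes each read independently carries the variant with probability equal to the variant frequency (a binomial model), and the depths and the requirement of 5 supporting reads are hypothetical values chosen only for illustration.

```python
from math import comb


def prob_at_least(k: int, depth: int, vaf: float) -> float:
    """Return P(X >= k) for X ~ Binomial(depth, vaf).

    X models the number of reads carrying the variant when each read
    independently carries it with probability `vaf`.
    """
    p_fewer = sum(
        comb(depth, i) * vaf**i * (1.0 - vaf) ** (depth - i) for i in range(k)
    )
    return 1.0 - p_fewer


# Hypothetical numbers: a 0.1% variant and 5 supporting reads required.
full_depth = prob_at_least(k=5, depth=10_000, vaf=0.001)
downsampled = prob_at_least(k=5, depth=1_000, vaf=0.001)
print(f"depth 10000: {full_depth:.3f}")
print(f"depth  1000: {downsampled:.4f}")
```

Under these assumptions, a 0.1% variant reaches 5 supporting reads about 97% of the time at a depth of 10,000, but less than 1% of the time once the data are downsampled to a depth of 1,000.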
Summary of the invention
In view of the above problems in the prior art, the object of the present invention is to provide a method for detecting sequence variation in tumor peripheral blood samples that avoids the technical defects described above.
To achieve this object, the present invention provides the following technical solution:
A method for detecting sequence variation in tumor peripheral blood samples, in which the GeneReader algorithm and the SiNVICT algorithm are each used to analyze the tumor peripheral blood sample, the results obtained by the GeneReader algorithm are compared with the results obtained by the SiNVICT algorithm, and the results on which the two agree are retained as the final detection result.
Further, the GeneReader algorithm is an algorithm obtained by combining a supervised method with an unsupervised method.
Further, when the tumor peripheral blood sample is analyzed with the GeneReader algorithm and an insertion or deletion mutation is found, the supervised method reads the mismatched sequences and adds them to the gene pool of insertion and deletion mutations, thereby increasing the allele frequency, and the unsupervised method scans the local sequence near soft clips to find further insertion and deletion mutations.
Further, the step of scanning the local sequence near soft clips with the unsupervised method to find further insertion and deletion mutations comprises: searching for a consensus sequence among the soft-clipped sequences clipped at the allele locus; if a consensus sequence is found, using it to search for a matching sequence within a user-defined distance; when a matching sequence is found at a position away from the clipped sequence, a deletion is considered to have been detected; when the match to the end of the consensus sequence is adjacent to the soft-clipped sequence, an insertion is determined to have been detected.
Further, the user-defined distance is 125 bp.
Further, the candidate mutations obtained are screened with a heuristic model that combines a Bayesian model and a Poisson distribution model, yielding the most reliable mutation sites.
Further, when a candidate variant is detected, the GeneReader algorithm judges whether it is a credible variant using multiple parameters, which include minimum depth, minimum number of mutation-supporting reads, and strand bias.
Further, when the tumor peripheral blood sample is analyzed with the SiNVICT algorithm, SiNVICT first detects candidate mutations with a Poisson distribution model and then applies hard filtering using a number of screening parameters.
The method provided by the present invention for detecting sequence variation in tumor peripheral blood samples is highly sensitive and suited to tumor peripheral blood samples. It detects variant information with high sensitivity and high accuracy and essentially does not miss true mutations, and it judges whether a candidate is a credible variant using multiple parameters, thereby effectively filtering out false-positive mutation sites and meeting the needs of practical application.
Specific embodiments
To make the objects, technical solutions and advantages of the present invention clearer, the invention is further described below with reference to specific embodiments. It should be understood that the specific embodiments described here serve only to explain the invention and do not limit it. All other embodiments obtained by a person of ordinary skill in the art on the basis of the embodiments of the invention, without creative effort, fall within the scope of protection of the invention.
A method for detecting sequence variation in tumor peripheral blood samples, in which the GeneReader algorithm and the SiNVICT algorithm are each used to analyze the tumor peripheral blood sample, the results obtained by the GeneReader algorithm are compared with the results obtained by the SiNVICT algorithm, and the results on which the two agree are retained as the final detection result.
In the present invention, the GeneReader algorithm is used to estimate allele frequency more accurately. It improves the precision of local alignment, and its running time grows linearly with sequencing depth. The GeneReader algorithm combines a supervised method with an unsupervised method. Allele frequency is a measure of how abundant an allele is in the gene pool of a population. Insertions and deletions that are much shorter than the read length and lie near the center of a read are usually aligned with a gap by most alignment tools. Such mutations often force mismatches, and when the aligned sequence contains too many errors a soft clip is formed. Soft clips are usually ignored by other mutation-calling algorithms and tools, so the underlying variants go undetected, yet they provide important evidence of insertion and deletion mutations.
In the method of the invention, when the tumor peripheral blood sample is analyzed with the GeneReader algorithm and an insertion or deletion mutation is found, the supervised method reads the mismatched sequences and adds them to the gene pool of insertion and deletion mutations, thereby increasing the allele frequency.
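The patent does not spell out how mismatched reads are assigned to a known insertion or deletion. One possible reading, sketched below purely as an assumption, is to re-score each ungapped, mismatched read against the reference with the known deletion applied, and to count it as supporting the deletion when it fits that allele strictly better. The function names, the toy reference and the read counts are all hypothetical.

```python
from typing import List, Tuple


def apply_deletion(ref: str, pos: int, length: int) -> str:
    """Return the reference with `length` bases removed at 0-based `pos`."""
    return ref[:pos] + ref[pos + length:]


def mismatches(read: str, ref: str, start: int) -> int:
    """Count mismatches of `read` placed without gaps at 0-based `start`."""
    window = ref[start:start + len(read)]
    overhang = len(read) - len(window)  # read bases past the end of ref
    return sum(a != b for a, b in zip(read, window)) + overhang


def count_rescued(
    ref: str, reads: List[Tuple[int, str]], del_pos: int, del_len: int
) -> int:
    """Count reads that fit the deletion allele strictly better than the reference.

    Assumes every read starts upstream of the deletion, so its start
    coordinate is the same on both alleles.
    """
    alt = apply_deletion(ref, del_pos, del_len)
    return sum(
        1
        for start, seq in reads
        if mismatches(seq, alt, start) < mismatches(seq, ref, start)
    )


ref = "ACGTACGTTTGGCCAATCGA"  # toy reference; deletion removes "TTGG" at 8
reads = [(4, "ACGTCCAA"), (2, "GTACGTCC"), (0, "ACGTACGT"), (4, "ACGTTTGG")]
rescued = count_rescued(ref, reads, del_pos=8, del_len=4)

gapped_support, depth = 1, 10  # hypothetical counts reported by the aligner
af_before = gapped_support / depth
af_after = (gapped_support + rescued) / depth
print(rescued, af_before, af_after)
```

In this toy case two of the four reads are reassigned to the deletion, so the estimated allele frequency rises because evidence that would otherwise be discarded as mismatches is counted.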
When the tumor peripheral blood sample is analyzed with the GeneReader algorithm, the unsupervised method scans the local sequence near soft clips to find further insertion and deletion mutations. The specific steps are: search for a consensus sequence among the soft-clipped sequences clipped at the allele locus; if a consensus sequence can be found, use it to search for a matching sequence within a user-defined distance (125 bp by default), allowing a small number of mismatches; when a matching sequence is found at a position away from the clipped sequence, a deletion is considered to have been detected; when the match to the end of the consensus sequence is adjacent to the soft-clipped sequence, an insertion is determined to have been detected.
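The steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the GeneReader implementation: it handles only soft clips on the right-hand side of reads, builds the consensus by column-wise majority vote, and uses hypothetical parameters (`min_agree`, `max_mismatch`, `min_anchor`) that the patent does not define. Only the 125 bp default distance comes from the text.

```python
from collections import Counter
from typing import List, Optional, Tuple


def consensus(clips: List[str], min_agree: float = 0.7) -> Optional[str]:
    """Column-wise majority consensus of soft-clipped tails.

    All tails are assumed to start at the same clip position. The result is
    truncated to the shortest tail; None is returned if any column lacks a
    base shared by at least `min_agree` of the tails.
    """
    if not clips:
        return None
    length = min(len(c) for c in clips)
    bases = []
    for i in range(length):
        base, count = Counter(c[i] for c in clips).most_common(1)[0]
        if count / len(clips) < min_agree:
            return None
        bases.append(base)
    return "".join(bases) or None


def hamming(a: str, b: str) -> int:
    """Number of differing positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def classify(
    ref: str,
    clip_pos: int,
    cons: str,
    max_dist: int = 125,
    max_mismatch: int = 1,
    min_anchor: int = 4,
) -> Optional[Tuple[str, object]]:
    """Classify a right-side soft clip starting at 0-based `clip_pos`."""
    # Deletion: the whole consensus matches downstream, away from the clip.
    last_start = min(len(ref) - len(cons), clip_pos + max_dist)
    for start in range(clip_pos + 1, last_start + 1):
        if hamming(cons, ref[start:start + len(cons)]) <= max_mismatch:
            return ("deletion", start - clip_pos)
    # Insertion: only the end of the consensus matches, right at the clip.
    for k in range(1, len(cons) - min_anchor + 1):
        suffix = cons[k:]
        window = ref[clip_pos:clip_pos + len(suffix)]
        if len(window) == len(suffix) and hamming(suffix, window) <= max_mismatch:
            return ("insertion", cons[:k])
    return None


ref = "ACGTACGTTTGGCCAATCGA"  # toy reference
del_cons = consensus(["CCAATC", "CCAATCG", "CCAAT"])
ins_cons = consensus(["GGGTTGGCC", "GGGTTGGC"])
print(classify(ref, 8, del_cons))
print(classify(ref, 8, ins_cons))
```

The deletion branch is tried first because a full-length match of the consensus is stronger evidence than a match of its suffix alone. A production implementation would also need to handle left-side clips, repeats and base qualities.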
The candidate mutations obtained are screened with a heuristic model that combines a Bayesian model and a Poisson distribution model, yielding the most reliable mutation sites.
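The patent does not give the form of this heuristic model. The sketch below shows one plausible combination, offered only as an assumption: a Poisson tail test asks whether the number of variant-supporting reads exceeds what sequencing error alone would produce, and a two-hypothesis Bayesian posterior compares "real variant" against "error only". The error rate, the assumed variant frequency, the prior and both thresholds are hypothetical.

```python
from math import exp, log, log1p


def poisson_tail(k: int, lam: float) -> float:
    """Return P(X >= k) for X ~ Poisson(lam)."""
    term = exp(-lam)  # P(X = 0)
    below = 0.0
    for i in range(k):
        below += term
        term *= lam / (i + 1)  # P(X = i + 1) from P(X = i)
    return max(0.0, 1.0 - below)


def posterior_variant(
    alt: int, depth: int, error_rate: float, vaf: float, prior: float
) -> float:
    """Posterior probability of 'variant at frequency vaf' versus 'error only'.

    Both hypotheses use a binomial likelihood; the binomial coefficient
    cancels in the ratio, so only the log-likelihood ratio is needed.
    """
    llr = alt * log(vaf / error_rate) + (depth - alt) * (
        log1p(-vaf) - log1p(-error_rate)
    )
    log_odds = log(prior / (1.0 - prior)) + llr
    if log_odds < -700.0:  # avoid overflow in exp()
        return 0.0
    return 1.0 / (1.0 + exp(-log_odds))


def is_reliable(
    alt: int,
    depth: int,
    error_rate: float = 0.001,  # assumed per-base error rate
    vaf: float = 0.01,          # assumed frequency of a real variant
    prior: float = 1e-3,        # assumed prior probability of a variant
    alpha: float = 1e-6,        # assumed Poisson significance threshold
    min_posterior: float = 0.9,
) -> bool:
    """Keep a site only if it passes both the Poisson and the Bayesian check."""
    p_value = poisson_tail(alt, depth * error_rate)
    post = posterior_variant(alt, depth, error_rate, vaf, prior)
    return p_value < alpha and post >= min_posterior


print(is_reliable(alt=25, depth=5000))
print(is_reliable(alt=7, depth=5000))
```

Requiring both checks to pass is a design choice of this sketch. A real caller would more likely estimate the error rate per site or per sample rather than fix it globally.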
The mutation-detection model is a heuristic model that combines a Bayesian model and a Poisson distribution model, and it detects variant information by intelligently tuning the parameters of these models. When a candidate variant is detected, the GeneReader algorithm judges whether it is a credible variant using multiple parameters, such as minimum depth, minimum number of mutation-supporting reads, and strand bias, thereby effectively filtering out false-positive mutation sites.
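A multi-parameter filter of this kind might look like the sketch below. The three criteria come from the text, but the record layout, the strand-bias definition and every threshold are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    chrom: str
    pos: int
    ref: str
    alt: str
    depth: int    # total reads covering the site
    alt_fwd: int  # variant-supporting reads on the forward strand
    alt_rev: int  # variant-supporting reads on the reverse strand


def strand_bias(c: Candidate) -> float:
    """Return 0.0 for perfectly balanced strands and 1.0 for a single strand."""
    total = c.alt_fwd + c.alt_rev
    if total == 0:
        return 1.0
    return abs(c.alt_fwd - c.alt_rev) / total


def passes_filters(
    c: Candidate,
    min_depth: int = 500,          # hypothetical threshold
    min_alt_reads: int = 5,        # hypothetical threshold
    max_strand_bias: float = 0.8,  # hypothetical threshold
) -> bool:
    """Apply the minimum-depth, minimum-support and strand-bias criteria."""
    return (
        c.depth >= min_depth
        and (c.alt_fwd + c.alt_rev) >= min_alt_reads
        and strand_bias(c) <= max_strand_bias
    )


balanced = Candidate("chr1", 100, "A", "T", depth=2000, alt_fwd=6, alt_rev=5)
one_strand = Candidate("chr1", 200, "C", "G", depth=2000, alt_fwd=12, alt_rev=0)
shallow = Candidate("chr1", 300, "G", "A", depth=100, alt_fwd=4, alt_rev=4)
print([passes_filters(c) for c in (balanced, one_strand, shallow)])
```

Variants supported by reads from only one strand are a common signature of sequencing or alignment artifacts, which is why strand bias is useful alongside the depth and support thresholds.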
The tumor peripheral blood sample is analyzed with the SiNVICT algorithm at the same time. SiNVICT first detects candidate mutations with a Poisson distribution model and then applies hard filtering using a number of screening parameters. SiNVICT can also perform time-series analysis for the same patient, in order to monitor the post-operative recovery of a tumor patient.
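The idea behind time-series monitoring can be shown with a small sketch that follows the frequency of one variant across samples from the same patient. This does not use or reproduce SiNVICT itself: the data layout, the `tolerance` value and the rule of treating an absent call as frequency 0 are assumptions.

```python
from typing import Dict, List, Tuple

Variant = Tuple[str, int, str, str]  # (chrom, pos, ref, alt)
Counts = Tuple[int, int]             # (variant-supporting reads, depth)


def vaf_series(
    timepoints: List[Dict[Variant, Counts]], variant: Variant
) -> List[float]:
    """Return the variant allele frequency at each time point, in order.

    A variant missing from a time point is treated as frequency 0.0, which
    assumes the site was covered but the variant was not called.
    """
    series = []
    for calls in timepoints:
        alt, depth = calls.get(variant, (0, 0))
        series.append(alt / depth if depth else 0.0)
    return series


def trend(series: List[float], tolerance: float = 0.0005) -> str:
    """Label the overall change between the first and last time point."""
    if len(series) < 2:
        return "unknown"
    change = series[-1] - series[0]
    if change > tolerance:
        return "rising"
    if change < -tolerance:
        return "falling"
    return "stable"


v: Variant = ("chr1", 100, "A", "T")  # toy variant
samples = [
    {v: (30, 10_000)},  # before surgery
    {v: (8, 10_000)},   # first follow-up
    {},                 # second follow-up: variant not called
]
series = vaf_series(samples, v)
print(series, trend(series))
```

A falling frequency across follow-up samples would be consistent with a reduced tumor burden, while a rising one could prompt closer follow-up. Interpreting such trends clinically is outside the scope of this sketch.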
The results obtained by the GeneReader algorithm are compared with the results obtained by the SiNVICT algorithm. Consistent results are retained as the final detection result and inconsistent data are deleted, which ensures a high degree of accuracy in the detection results and avoids missing variant information as far as possible.
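Retaining only the consistent results amounts to intersecting the two call sets. A minimal sketch follows, assuming each call can be reduced to a (chromosome, position, reference allele, alternate allele) key. The normalization rules and the example calls are hypothetical and do not reflect the output format of either tool.

```python
from typing import Iterable, List, Tuple

Variant = Tuple[str, int, str, str]  # (chrom, pos, ref, alt)


def normalize(v: Variant) -> Variant:
    """Make keys comparable: drop a 'chr' prefix and upper-case the alleles."""
    chrom, pos, ref, alt = v
    if chrom.lower().startswith("chr"):
        chrom = chrom[3:]
    return (chrom, int(pos), ref.upper(), alt.upper())


def concordant(a: Iterable[Variant], b: Iterable[Variant]) -> List[Variant]:
    """Return the calls present in both sets, sorted for stable output."""
    return sorted({normalize(v) for v in a} & {normalize(v) for v in b})


# Toy call sets standing in for the two algorithms' outputs.
genereader_calls = [("chr1", 100, "A", "T"), ("chr2", 250, "C", "G"),
                    ("chr3", 42, "G", "A")]
sinvict_calls = [("1", 100, "a", "t"), ("chr3", 42, "G", "A"),
                 ("chr5", 7, "T", "C")]

final_calls = concordant(genereader_calls, sinvict_calls)
print(final_calls)
```

For insertions and deletions, two callers may report the same event at slightly different positions, so a real comparison would typically left-align and normalize the variants before intersecting.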
The method provided by the present invention for detecting sequence variation in tumor peripheral blood samples is highly sensitive and suited to tumor peripheral blood samples. It detects variant information with high sensitivity and high accuracy and essentially does not miss true mutations, and it judges whether a candidate is a credible variant using multiple parameters, thereby effectively filtering out false-positive mutation sites and meeting the needs of practical application.
The embodiments described above express only some implementations of the invention. Although they are described specifically and in detail, they should not be construed as limiting the scope of the patent. It should be noted that a person of ordinary skill in the art may make various modifications and improvements without departing from the inventive concept, and these all fall within the scope of protection of the invention. The scope of protection of this patent is therefore defined by the appended claims.

Claims (8)

1. A method for detecting sequence variation in tumor peripheral blood samples, characterized in that the GeneReader algorithm and the SiNVICT algorithm are each used to analyze the tumor peripheral blood sample, the results obtained by the GeneReader algorithm are compared with the results obtained by the SiNVICT algorithm, and the consistent results are retained as the final detection result.
2. The method for detecting sequence variation in tumor peripheral blood samples according to claim 1, characterized in that the GeneReader algorithm is an algorithm obtained by combining a supervised method with an unsupervised method.
3. The method for detecting sequence variation in tumor peripheral blood samples according to claim 1, characterized in that, when the tumor peripheral blood sample is analyzed with the GeneReader algorithm and an insertion or deletion mutation is found, the supervised method reads the mismatched sequences and adds them to the gene pool of insertion and deletion mutations, thereby increasing the allele frequency, and the unsupervised method scans the local sequence near soft clips to find further insertion and deletion mutations.
4. The method for detecting sequence variation in tumor peripheral blood samples according to claim 1, characterized in that the step of scanning the local sequence near soft clips with the unsupervised method to find further insertion and deletion mutations comprises: searching for a consensus sequence among the soft-clipped sequences clipped at the allele locus; if a consensus sequence is found, using it to search for a matching sequence within a user-defined distance; when a matching sequence is found at a position away from the clipped sequence, a deletion is considered to have been detected; when the match to the end of the consensus sequence is adjacent to the soft-clipped sequence, an insertion is determined to have been detected.
5. The method for detecting sequence variation in tumor peripheral blood samples according to claim 1, characterized in that the user-defined distance is 125 bp.
6. The method for detecting sequence variation in tumor peripheral blood samples according to any one of claims 1 to 5, characterized in that the candidate mutations obtained are screened with a heuristic model that combines a Bayesian model and a Poisson distribution model, yielding the most reliable mutation sites.
7. The method for detecting sequence variation in tumor peripheral blood samples according to any one of claims 1 to 6, characterized in that, when a candidate variant is detected, the GeneReader algorithm judges whether it is a credible variant using multiple parameters, the multiple parameters including minimum depth, minimum number of mutation-supporting reads, and strand bias.
8. The method for detecting sequence variation in tumor peripheral blood samples according to any one of claims 1 to 7, characterized in that, when the tumor peripheral blood sample is analyzed with the SiNVICT algorithm, SiNVICT first detects candidate mutations with a Poisson distribution model and then applies hard filtering using a number of screening parameters.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811588416.4A CN109767812A (en) 2018-12-25 2018-12-25 Method for detecting sequence variation in tumor peripheral blood samples

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811588416.4A CN109767812A (en) 2018-12-25 2018-12-25 Method for detecting sequence variation in tumor peripheral blood samples

Publications (1)

Publication Number Publication Date
CN109767812A (en) 2019-05-17

Family

ID=66451644

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811588416.4A Pending CN109767812A (en) 2018-12-25 2018-12-25 Method for detecting sequence variation in tumor peripheral blood samples

Country Status (1)

Country Link
CN (1) CN109767812A (en)

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103540658A (en) * 2013-09-30 2014-01-29 杭州艾迪康医学检验中心有限公司 Method, primer and kit for detecting hot mutation site of human XPD (Xeroderma Pigmentosum group D) gene
US20180230530A1 (en) * 2013-12-28 2018-08-16 Guardant Health, Inc. Methods and systems for detecting genetic variants
CN107451422A (en) * 2017-07-24 2017-12-08 杨文婷 A kind of gene sequence data analysis and online interaction visualization method
CN107893116A (en) * 2017-12-12 2018-04-10 北京雅康博生物科技有限公司 For detecting primer pair combination, kit and the method for building library of gene mutation

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
C KOCKAN: "SiNVICT", 《搜狗搜索:HTTPS://GITHUB.COM/SFU-COMPBIO/SINVICT//BLOB/MASTER/README.MD#SINVICT》 *
ZHONGWU LAI等: "VarDict:a novel and versatile variant caller for next-generation sequencing in cancer research", 《NUCLEIC ACIDS RESEARCH》 *
生物通: "QIAGEN推出新一代测序仪GeneReader[新品推荐]", 《搜狗搜索:HTTP://WWW.EBIOTRADE.COM/NEWSF/2015-11/20151111114703582.HTM?FROM=TUIJIAN2》 *
高迎心 等: "基于混合泊松分布的新生突变识别算法", 《中国生物化学与分子生物学报》 *
鼎晶生物: "干货:基因测序技术和原理介绍", 《搜狗搜索:HTTPS://MP.WEIXIN.QQ.COM/S?SRC=11&TIMESTAMP=1590137579&VER=2353&SIGNATURE=QFVOSTRVDQZMWMHVO*FOCD8KU4D5CV0NH9GL1N0WOFLSRDH3E3NNRH5GKRMNDUUMKKKHQCEEXEOYIKAL4FA744G3REJ**O1*NZDSPXJKNGZJDGPI0EKJCKRLYPUXJ3NA&NEW=1》 *

Similar Documents

Publication Publication Date Title
CN109337957A (en) The method for detecting genome multimutation type
Nair et al. Radiogenomic models using machine learning techniques to predict EGFR mutations in non-small cell lung cancer
CN106047998B (en) A kind of detection method and application of lung cancer gene
Yuan et al. Analysis of gene expression profiles of lung cancer subtypes with machine learning algorithms
CN107391965A (en) A kind of lung cancer somatic mutation determination method based on high throughput sequencing technologies
CN106599616B (en) Ultralow frequency mutational site determination method based on duplex-seq
CN112086129B (en) Method and system for predicting cfDNA of tumor tissue
Ostrovnaya et al. A metastasis or a second independent cancer? Evaluating the clonal origin of tumors using array copy number data
CN107423578A (en) Detect the device of somatic mutation
CN1484806A (en) A process for discriminating between biological states based on hidden patterns from
CN110060733A (en) Tumour somatic variation detection device is sequenced in two generations based on single sample
US20140330162A1 (en) Biological cell assessment using whole genome sequence and oncological therapy planning using same
CN114743594A (en) Method, device and storage medium for detecting structural variation
CN108154010A (en) A kind of ctDNA low frequencies mutation sequencing data analysis method and device
CN109949862A (en) A kind of microsatellite instability detection method of blood ctDNA
CN113724785A (en) Tumor typing method, device, storage medium and equipment based on second-generation sequencing
CN115424666A (en) Method and system for screening pan-cancer early-screening molecular marker based on whole genome bisulfite sequencing data
Martinez-Ledesma et al. Computational methods for detecting cancer hotspots
Zhao et al. GFusion: an effective algorithm to identify fusion genes from cancer RNA-Seq data
CN114694750A (en) Single-sample tumor somatic mutation discrimination and TMB (tumor mutational burden) detection method based on an NGS (next-generation sequencing) platform
CN109767812A (en) Method for detecting sequence variation in tumor peripheral blood samples
KR101223270B1 (en) Method for determining low―mass ions to screen colorectal cancer, method for providing information to screen colorectal cancer by using low―mass ions, and operational unit therefor
CN108350507A (en) The method that histodiagnosis and treatment are carried out to disease
CN114093421B (en) Method, device and storage medium for distinguishing lymphoma molecular subtype
CN109762881A (en) It is a kind of for detecting the Bioinformatic methods in the ultralow frequency mutational site in tumor patient blood ctDNA

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190517

RJ01 Rejection of invention patent application after publication