CN109887546B - 基于二代测序的单基因或多基因拷贝数检测系统及方法 - Google Patents
基于二代测序的单基因或多基因拷贝数检测系统及方法 Download PDFInfo
- Publication number
- CN109887546B CN109887546B CN201910077588.3A CN201910077588A CN109887546B CN 109887546 B CN109887546 B CN 109887546B CN 201910077588 A CN201910077588 A CN 201910077588A CN 109887546 B CN109887546 B CN 109887546B
- Authority
- CN
- China
- Prior art keywords
- module
- copy number
- depth
- sample
- coverage
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 55
- 238000000034 method Methods 0.000 title claims abstract description 48
- 238000007481 next generation sequencing Methods 0.000 title claims abstract description 11
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 76
- 238000012549 training Methods 0.000 claims abstract description 43
- 238000012163 sequencing technique Methods 0.000 claims abstract description 38
- 238000012417 linear regression Methods 0.000 claims abstract description 37
- 238000012546 transfer Methods 0.000 claims abstract description 17
- 238000012937 correction Methods 0.000 claims abstract description 13
- 108700024394 Exon Proteins 0.000 claims abstract description 11
- 238000010801 machine learning Methods 0.000 claims abstract description 11
- 108091081062 Repeated sequence (DNA) Proteins 0.000 claims abstract description 10
- 239000012634 fragment Substances 0.000 claims description 30
- 239000011159 matrix material Substances 0.000 claims description 25
- 230000008569 process Effects 0.000 claims description 16
- 238000012408 PCR amplification Methods 0.000 claims description 6
- 230000003252 repetitive effect Effects 0.000 claims description 6
- 238000002864 sequence alignment Methods 0.000 claims description 5
- 230000006870 function Effects 0.000 claims description 3
- 238000002790 cross-validation Methods 0.000 claims description 2
- 238000010606 normalization Methods 0.000 claims description 2
- 108700026220 vif Genes Proteins 0.000 claims description 2
- 230000000694 effects Effects 0.000 abstract description 13
- 238000005516 engineering process Methods 0.000 abstract description 12
- 238000004364 calculation method Methods 0.000 abstract description 2
- 239000000523 sample Substances 0.000 description 39
- 238000012217 deletion Methods 0.000 description 6
- 230000037430 deletion Effects 0.000 description 6
- 101150015424 dmd gene Proteins 0.000 description 5
- 101150030803 SLC26A4 gene Proteins 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 230000035945 sensitivity Effects 0.000 description 3
- 208000026350 Inborn Genetic disease Diseases 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 101150025954 Ptprq gene Proteins 0.000 description 2
- 208000035977 Rare disease Diseases 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 230000004907 flux Effects 0.000 description 2
- 208000016361 genetic disease Diseases 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 208000032170 Congenital Abnormalities Diseases 0.000 description 1
- 206010010356 Congenital anomaly Diseases 0.000 description 1
- 206010013801 Duchenne Muscular Dystrophy Diseases 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 208000037340 Rare genetic disease Diseases 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 201000006288 alpha thalassemia Diseases 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000007698 birth defect Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000013499 data model Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011946 reduction process Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 208000002320 spinal muscular atrophy Diseases 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 238000012070 whole genome sequencing analysis Methods 0.000 description 1
Landscapes
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
Claims (10)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910034767 | 2019-01-15 | ||
CN2019100347679 | 2019-01-15 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109887546A CN109887546A (zh) | 2019-06-14 |
CN109887546B true CN109887546B (zh) | 2019-12-27 |
Family
ID=66927109
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910077588.3A Active CN109887546B (zh) | 2019-01-15 | 2019-01-28 | 基于二代测序的单基因或多基因拷贝数检测系统及方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109887546B (zh) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110895959B (zh) * | 2019-11-08 | 2022-05-20 | 至本医疗科技(上海)有限公司 | 基因拷贝数评估方法、装置、系统以及计算机可读介质 |
CN111192629A (zh) * | 2019-12-23 | 2020-05-22 | 苏州金唯智生物科技有限公司 | 一种基因序列难度分析模型的构建方法及其应用 |
CN111462816B (zh) * | 2020-03-31 | 2022-05-20 | 至本医疗科技(上海)有限公司 | 用于检测胚系基因微缺失微重复的方法、电子设备和计算机存储介质 |
CN112634987B (zh) * | 2020-12-25 | 2021-07-27 | 北京吉因加医学检验实验室有限公司 | 一种单样本肿瘤dna拷贝数变异检测的方法和装置 |
CN113823353B (zh) * | 2021-08-12 | 2024-02-09 | 上海厦维医学检验实验室有限公司 | 基因拷贝数扩增检测方法、装置及可读介质 |
CN116386718B (zh) * | 2023-05-30 | 2023-08-01 | 北京华宇亿康生物工程技术有限公司 | 检测拷贝数变异的方法、设备和介质 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108256292A (zh) * | 2016-12-29 | 2018-07-06 | 安诺优达基因科技(北京)有限公司 | 一种拷贝数变异检测装置 |
CN108256289A (zh) * | 2018-01-17 | 2018-07-06 | 湖南大地同年生物科技有限公司 | 一种基于目标区域捕获测序基因组拷贝数变异的方法 |
CN108875311A (zh) * | 2018-06-22 | 2018-11-23 | 安徽医科大学第附属医院 | 基于高通量测序和高斯混合模型的拷贝数变异检测方法 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102682224B (zh) * | 2011-03-18 | 2015-01-21 | 深圳华大基因科技服务有限公司 | 检测拷贝数变异的方法和装置 |
EP4227947A1 (en) * | 2013-10-21 | 2023-08-16 | Verinata Health, Inc. | Method for improving the sensitivity of detection in determining copy number variations |
CN105760712B (zh) * | 2016-03-01 | 2019-03-26 | 西安电子科技大学 | 一种基于新一代测序的拷贝数变异检测方法 |
CA3030890A1 (en) * | 2016-07-27 | 2018-02-01 | Sequenom, Inc. | Genetic copy number alteration classifications |
-
2019
- 2019-01-28 CN CN201910077588.3A patent/CN109887546B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108256292A (zh) * | 2016-12-29 | 2018-07-06 | 安诺优达基因科技(北京)有限公司 | 一种拷贝数变异检测装置 |
CN108256289A (zh) * | 2018-01-17 | 2018-07-06 | 湖南大地同年生物科技有限公司 | 一种基于目标区域捕获测序基因组拷贝数变异的方法 |
CN108875311A (zh) * | 2018-06-22 | 2018-11-23 | 安徽医科大学第附属医院 | 基于高通量测序和高斯混合模型的拷贝数变异检测方法 |
Also Published As
Publication number | Publication date |
---|---|
CN109887546A (zh) | 2019-06-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109887546B (zh) | 基于二代测序的单基因或多基因拷贝数检测系统及方法 | |
US10354747B1 (en) | Deep learning analysis pipeline for next generation sequencing | |
CN105980578B (zh) | 用于使用机器学习进行dna测序的碱基判定器 | |
WO2020014280A1 (en) | DEEP LEARNING-BASED FRAMEWORK FOR IDENTIFYING SEQUENCE PATTERNS THAT CAUSE SEQUENCE-SPECIFIC ERRORS (SSEs) | |
JP2009529723A (ja) | スペクトル分析を介したdnaパターンの同定方法及びシステム | |
JP2005531853A (ja) | Snp遺伝子型クラスタリングのためのシステムおよび方法 | |
IL258999A (en) | Methods for detecting copy-number variations in next-generation sequencing | |
US20220277811A1 (en) | Detecting False Positive Variant Calls In Next-Generation Sequencing | |
EP3619712A1 (en) | DEEP LEARNING-BASED FRAMEWORK FOR IDENTIFYING SEQUENCE PATTERNS THAT CAUSE SEQUENCE-SPECIFIC ERRORS (SSEs) | |
CN114999573A (zh) | 一种基因组变异检测方法及检测系统 | |
Wang et al. | MSB: a mean-shift-based approach for the analysis of structural variation in the genome | |
CN116189763A (zh) | 一种基于二代测序的单样本拷贝数变异检测方法 | |
CN109461473B (zh) | 胎儿游离dna浓度获取方法和装置 | |
CN111210873B (zh) | 基于外显子测序数据的拷贝数变异检测方法及系统、终端和存储介质 | |
CN116959585A (zh) | 基于深度学习的全基因组预测方法 | |
AU2022218581B2 (en) | Sequencing data-based itd mutation ratio detecting apparatus and method | |
Guha et al. | Bayesian hidden Markov modeling of array CGH data | |
EP1630709B1 (en) | Mathematical analysis for the estimation of changes in the level of gene expression | |
CN110910955A (zh) | 一种易感基因罕见变异位点纵向分析模型的建立方法 | |
CN116646010B (zh) | 人源性病毒检测方法及装置、设备、存储介质 | |
CN117095744A (zh) | 一种基于单样本高通量转录组测序数据的拷贝数变异检测方法 | |
Yang et al. | Improved detection algorithm for copy number variations based on hidden Markov model | |
Hedges | Bioinformatics of Human Genetic Disease Studies | |
Alshawaqfeh et al. | Robust Fussed Lasso Model for Recurrent Copy Number Variation Detection | |
Piccolboni | Multivariate segmentation in the analysis of transcription tiling array data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20221017 Address after: Room 607, Building 1, No. 55, Aona Road, China (Shanghai) Pilot Free Trade Zone, Pudong New Area, Shanghai, 200137 Patentee after: Shanghai xuzhenda Biotechnology Co.,Ltd. Address before: 200131 Room D04, Floor 3, No. 207, Fute North Road, Free Trade Pilot Zone, Pudong New Area, Shanghai Patentee before: WUXI NEXTCODE GENOMICS (SHANGHAI) CO.,LTD. |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Single gene or multi gene copy number detection system and method based on second-generation sequencing Effective date of registration: 20231130 Granted publication date: 20191227 Pledgee: Industrial Bank Co.,Ltd. Shanghai Zhangyang Sub branch Pledgor: Shanghai xuzhenda Biotechnology Co.,Ltd. Registration number: Y2023310000791 |