CN110462056A - Sample source detection method, device and storage medium based on DNA sequencing data - Google Patents

Sample source detection method, device and storage medium based on DNA sequencing data

Info

Publication number
CN110462056A
Authority
CN
China
Prior art keywords
curve
depth
sample
segments
complementary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201780089043.3A
Other languages
English (en)
Other versions
CN110462056B (zh)
Inventor
梁瀚
李甫强
吴逵
赵鑫
乔斯坦
史旭莲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BGI Shenzhen Co Ltd
Original Assignee
BGI Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BGI Shenzhen Co Ltd
Publication of CN110462056A
Application granted
Publication of CN110462056B
Active
Anticipated expiration

Classifications

    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00 ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02A TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00 Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10 Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biophysics (AREA)
  • Wood Science & Technology (AREA)
  • Analytical Chemistry (AREA)
  • Biotechnology (AREA)
  • Zoology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Biochemistry (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Genetics & Genomics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Medical Informatics (AREA)
  • Microbiology (AREA)
  • Theoretical Computer Science (AREA)
  • Immunology (AREA)

Abstract

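A sample source detection method, device and storage medium based on DNA sequencing data. The method comprises: aligning the DNA sequencing data of multiple samples from the same source to a reference genome, counting the sequencing depth in each window of the reference genome for each sample, and generating a depth curve; extracting a set of similar curve segments from the depth curves of the multiple samples from the same source; for each pair of similar curve segment sets from different sources, calculating the weight with which each curve segment belongs to each of the two sets, and filtering out curve segments whose weight is below a set threshold to obtain pairwise complementary valid curve segment sets; and comparing the depth curve of a sample to be tested with the pairwise complementary valid curve segment sets, and determining the source of the sample according to how well its depth curve matches each of these sets. The method searches for the features of different tissues directly in the alignment results of the DNA sequencing data and thereby predicts the source of the sample.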
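The steps named in the abstract can be illustrated with a minimal Python sketch. The abstract does not define how segments are judged similar, how the weight is computed, or how matching is scored, so everything below is an assumption made for illustration: read start positions stand in for alignments, a segment counts as "similar" when every same-source curve stays within `tol` of the group mean, the weight is the mean absolute difference between the two sources' mean profiles, and matching is a nearest-profile vote. All function and parameter names are hypothetical and are not taken from the patent.

```python
def _mean(values):
    values = list(values)
    return sum(values) / len(values)


def depth_curve(read_starts, genome_length, window):
    """Count reads per fixed-size window (assumed proxy for per-window depth)."""
    n_windows = (genome_length + window - 1) // window
    curve = [0] * n_windows
    for pos in read_starts:
        if 0 <= pos < genome_length:
            curve[pos // window] += 1
    return curve


def similar_segments(curves, seg_len, tol):
    """Return {start_window: mean_profile} for segments on which all
    same-source curves stay within `tol` of their mean (assumed criterion)."""
    result = {}
    n = len(curves[0])
    for start in range(0, n - seg_len + 1, seg_len):
        segs = [c[start:start + seg_len] for c in curves]
        avg = [_mean(col) for col in zip(*segs)]
        if all(abs(v - a) <= tol for seg in segs for v, a in zip(seg, avg)):
            result[start] = avg
    return result


def distance(a, b):
    """Mean absolute difference between two equally long profiles."""
    return _mean(abs(x - y) for x, y in zip(a, b))


def complementary_segments(set_a, set_b, threshold):
    """Keep segment starts present in both sets whose weight (assumed here
    to be the distance between the two mean profiles) reaches `threshold`."""
    kept = []
    for start in sorted(set(set_a) & set(set_b)):
        if distance(set_a[start], set_b[start]) >= threshold:
            kept.append(start)
    return kept


def classify(test_curve, set_a, set_b, kept, labels=("A", "B")):
    """Vote per retained segment for the closer source profile.
    Returns None when no segment was retained or the vote is tied."""
    votes = [0, 0]
    for start in kept:
        seg = test_curve[start:start + len(set_a[start])]
        d_a = distance(seg, set_a[start])
        d_b = distance(seg, set_b[start])
        if d_a != d_b:
            votes[0 if d_a < d_b else 1] += 1
    if votes[0] == votes[1]:
        return None
    return labels[0] if votes[0] > votes[1] else labels[1]
```

With more than two candidate sources, the same pairwise comparison would be repeated for every pair of sources, in line with the "pairwise" wording of the abstract; how the pairwise outcomes are combined is not stated there and is left open here.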

Description

PCT application in the national phase; the description has been published.

Claims (10)

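  1. PCT application in the national phase; the claims have been published.
CN201780089043.3A 2017-05-19 2017-05-19 Sample source detection method, device and storage medium based on DNA sequencing data Active CN110462056B (zh)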

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/085150 WO2018209704A1 (zh) 2017-05-19 2017-05-19 Sample source detection method, device and storage medium based on DNA sequencing data

Publications (2)

Publication Number Publication Date
CN110462056A (zh) 2019-11-15
CN110462056B CN110462056B (zh) 2023-08-29

Family

ID=64273080

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201780089043.3A Active CN110462056B (zh) 2017-05-19 2017-05-19 Sample source detection method, device and storage medium based on DNA sequencing data

Country Status (2)

Country Link
CN (1) CN110462056B (zh)
WO (1) WO2018209704A1 (zh)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
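CN110504014B (zh) * 2019-08-20 2022-03-11 福州大学 Cervical spine rehabilitation training information management method and system with real-time feedback function
WO2023236058A1 (zh) * 2022-06-07 2023-12-14 深圳华大生命科学研究院 Method and device for constructing a pulmonary nodule screening model, and pulmonary nodule screening method and device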

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120003634A1 (en) * 2010-02-19 2012-01-05 Nucleix Identification of source of DNA samples
CN103955630A (zh) * 2014-03-26 2014-07-30 田埂 Method for preparing a reference database and performing target-region sequence alignment on a cell-free nucleic acid sample to be tested
JP2014530629A (ja) * 2011-10-28 2014-11-20 BGI Diagnosis Co., Ltd. Method for detecting chromosomal microdeletions and microduplications
US20160017419A1 (en) * 2014-07-18 2016-01-21 The Chinese University Of Hong Kong Methylation pattern analysis of tissues in a DNA mixture
CN105349678A (zh) * 2015-12-03 2016-02-24 上海美吉生物医药科技有限公司 Method for detecting chromosomal copy number variation
WO2016090583A1 (zh) * 2014-12-10 2016-06-16 深圳华大基因研究院 Sequencing data processing device and method
CN105765076A (zh) * 2013-12-17 2016-07-13 深圳华大基因股份有限公司 Method and device for detecting chromosomal aneuploidy
WO2016183106A1 (en) * 2015-05-11 2016-11-17 Natera, Inc. Methods and compositions for determining ploidy
US20170009287A1 (en) * 2015-07-08 2017-01-12 Quest Diagnostics Investments Incorporated Detecting genetic copy number variation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
MATTHEW W. SNYDER et al.: "Cell-free DNA comprises an in vivo nucleosome footprint that informs its tissues-of-origin", Cell *

Also Published As

Publication number Publication date
WO2018209704A1 (zh) 2018-11-22
CN110462056B (zh) 2023-08-29

Similar Documents

Publication Publication Date Title
US11961589B2 (en) Models for targeted sequencing
US20220101944A1 (en) Methods for detecting copy-number variations in next-generation sequencing
US11869661B2 (en) Systems and methods for determining whether a subject has a cancer condition using transfer learning
WO2019023517A2 (en) GENOMIC SEQUENCING CLASSIFIER
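CN112218957A (zh) Systems and methods for determining tumor fraction in cell-free nucleic acids
US20210358626A1 (en) Systems and methods for cancer condition determination using autoencoders
KR20200107774A (ko) Method for aligning targeted nucleic acid sequencing data
WO2020028989A1 (en) Systems and methods for determining effects of therapies and genetic variation on polyadenylation site selection
JP6141310B2 (ja) Robust variant identification and validation
CN112951327A (zh) Drug sensitivity prediction method, electronic device and computer-readable storage medium
US20200082910A1 (en) Systems and Methods for Determining Effects of Genetic Variation of Splice Site Selection
CN110462056A (zh) Sample source detection method, device and storage medium based on DNA sequencing data
CN113862351B (zh) Kit and method for identifying extracellular RNA biomarkers in body fluid samples
CN116312800A (zh) Lung cancer feature identification method, device and storage medium based on whole-transcriptome sequencing of circulating RNA in plasma
Povoa et al. A Multi-Learning Training Approach for distinguishing low and high risk cancer patients
CN113160895A (zh) Colorectal cancer risk assessment model and system
CN113159529A (zh) Risk assessment model for intestinal polyps and related system
CN112746108A (zh) Gene markers for stratified assessment of tumor prognosis, assessment method and application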
US20140288847A1 (en) Systems and techniques for segmentation of sequential data
SS SVM based lung cancer prediction using microRNA expression profiling from NGS data
Hua et al. Combining protein-protein interactions information with support vector machine to identify chronic obstructive pulmonary disease related genes
EP4318493A1 (en) Artificial-intelligence-based method for detecting tumor-derived mutation of cell-free dna, and method for early diagnosis of cancer, using same
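WO2022262569A1 (zh) Method for distinguishing somatic mutations from germline mutations
Chieruzzi Identification of RAS co-occurrent mutations in colorectal cancer patients: workflow assessment and enhancement
KR20230064172A (ko) Cancer diagnosis method using position-specific sequence frequency and size of cell-free nucleic acid fragments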

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant