CA2963868A1 - Methods, systems and processes of de novo assembly of sequencing reads - Google Patents

Methods, systems and processes of de novo assembly of sequencing reads Download PDF

Info

Publication number
CA2963868A1
CA2963868A1 CA2963868A CA2963868A CA2963868A1 CA 2963868 A1 CA2963868 A1 CA 2963868A1 CA 2963868 A CA2963868 A CA 2963868A CA 2963868 A CA2963868 A CA 2963868A CA 2963868 A1 CA2963868 A1 CA 2963868A1
Authority
CA
Canada
Prior art keywords
read
overlaps
contig
reads
contigs
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA2963868A
Other languages
English (en)
French (fr)
Inventor
Karel Konvicka
Kevin Jacobs
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Invitae Corp
Original Assignee
Invitae Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Invitae Corp filed Critical Invitae Corp
Publication of CA2963868A1 publication Critical patent/CA2963868A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/20Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/20Sequence assembly

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Medical Informatics (AREA)
  • Biophysics (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Chemical & Material Sciences (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Molecular Biology (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
CA2963868A 2014-10-10 2015-10-09 Methods, systems and processes of de novo assembly of sequencing reads Abandoned CA2963868A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201462062636P 2014-10-10 2014-10-10
US62/062,636 2014-10-10
PCT/IB2015/057716 WO2016055971A2 (en) 2014-10-10 2015-10-09 Methods, systems and processes of de novo assembly of sequencing reads

Publications (1)

Publication Number Publication Date
CA2963868A1 true CA2963868A1 (en) 2016-04-14

Family

ID=55653914

Family Applications (1)

Application Number Title Priority Date Filing Date
CA2963868A Abandoned CA2963868A1 (en) 2014-10-10 2015-10-09 Methods, systems and processes of de novo assembly of sequencing reads

Country Status (8)

Country Link
US (1) US20190244678A1 (https=)
EP (1) EP3204522A4 (https=)
JP (1) JP6762932B2 (https=)
CN (1) CN106795568A (https=)
BR (1) BR112017007282A2 (https=)
CA (1) CA2963868A1 (https=)
IL (1) IL251277B (https=)
WO (1) WO2016055971A2 (https=)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10395759B2 (en) 2015-05-18 2019-08-27 Regeneron Pharmaceuticals, Inc. Methods and systems for copy number variant detection
US12071669B2 (en) 2016-02-12 2024-08-27 Regeneron Pharmaceuticals, Inc. Methods and systems for detection of abnormal karyotypes
WO2018057775A1 (en) * 2016-09-22 2018-03-29 Invitae Corporation Methods, systems and processes of identifying genetic variations
WO2019028189A2 (en) * 2017-08-01 2019-02-07 Human Longevity, Inc. DETERMINING THE STR LENGTH BY SHORT READ SEQUENCING
US11728007B2 (en) 2017-11-30 2023-08-15 Grail, Llc Methods and systems for analyzing nucleic acid sequences using mappability analysis and de novo sequence assembly
JP7361774B2 (ja) * 2018-07-27 2023-10-16 ミリアド・ウィメンズ・ヘルス・インコーポレーテッド シーケンスリードの独立したアラインメントおよびペアリングによって高度に相同なシーケンスにおける遺伝的変異を検出するための方法
EP3853763A1 (en) * 2018-09-20 2021-07-28 Aivf Ltd Image feature detection
BR112020026259A2 (pt) * 2018-11-01 2021-07-27 Illumina, Inc. métodos e composições para detecção de variante de linhagem germinativa
CN113557572B (zh) * 2019-01-25 2025-02-07 加利福尼亚太平洋生物科学股份有限公司 基于图的映射核酸片段的系统和方法
CN110060734B (zh) * 2019-03-29 2021-08-13 天津大学 一种高鲁棒性dna测序用条形码生成和读取方法
SG11202109079YA (en) * 2019-12-05 2021-09-29 Illumina Inc Rapid detection of gene fusions
US12093803B2 (en) * 2020-07-01 2024-09-17 International Business Machines Corporation Downsampling genomic sequence data
CA3184609A1 (en) * 2020-12-11 2022-06-16 Illumina Inc. Methods and systems for visualizing short reads in repetitive regions of the genome
US20240117445A1 (en) * 2021-03-16 2024-04-11 University Of North Texas Health Science Center At Fort Worth Macrohaplotypes for Forensic DNA Mixture Deconvolution
CN115938480B (zh) * 2021-09-23 2026-03-13 武汉华大基因技术服务有限公司 长读长测序对基因组组装结果纠错方法优化装置和系统
CN118380052B (zh) * 2024-06-24 2024-09-17 安诺优达基因科技(北京)有限公司 基因组结构预测的方法及电子装置

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8383345B2 (en) * 2008-09-12 2013-02-26 University Of Washington Sequence tag directed subassembly of short sequencing reads into long sequencing reads
EP2511843B1 (en) * 2009-04-29 2016-12-21 Complete Genomics, Inc. Method and system for calling variations in a sample polynucleotide sequence with respect to a reference polynucleotide sequence
US20110257889A1 (en) * 2010-02-24 2011-10-20 Pacific Biosciences Of California, Inc. Sequence assembly and consensus sequence determination
WO2012177774A2 (en) * 2011-06-21 2012-12-27 Life Technologies Corporation Systems and methods for hybrid assembly of nucleic acid sequences
WO2013103759A2 (en) * 2012-01-04 2013-07-11 Dow Agrosciences Llc Haplotype based pipeline for snp discovery and/or classification
US9916416B2 (en) * 2012-10-18 2018-03-13 Virginia Tech Intellectual Properties, Inc. System and method for genotyping using informed error profiles
CN103258145B (zh) * 2012-12-22 2016-06-29 中国科学院深圳先进技术研究院 一种基于De Bruijn图的并行基因拼接方法
CN103761453B (zh) * 2013-12-09 2017-10-27 天津工业大学 一种基于簇图结构的并行基因拼接方法

Also Published As

Publication number Publication date
JP2018500625A (ja) 2018-01-11
IL251277A0 (en) 2017-05-29
JP6762932B2 (ja) 2020-09-30
CN106795568A (zh) 2017-05-31
WO2016055971A2 (en) 2016-04-14
IL251277B (en) 2020-08-31
EP3204522A2 (en) 2017-08-16
EP3204522A4 (en) 2018-06-20
US20190244678A1 (en) 2019-08-08
BR112017007282A2 (pt) 2018-06-19
WO2016055971A3 (en) 2016-06-02

Similar Documents

Publication Publication Date Title
US20190244678A1 (en) Methods, systems and processes of de novo assembly of sequencing reads
JP7284849B2 (ja) 不均一分子長を有するユニーク分子インデックスセットの生成およびエラー補正のための方法およびシステム
US10777301B2 (en) Hierarchical genome assembly method using single long insert library
JP2020058393A (ja) 母体血漿の無侵襲的出生前分子核型分析
US20160117444A1 (en) Methods for determining absolute genome-wide copy number variations of complex tumors
US11761036B2 (en) Methods, systems and processes of identifying genetic variations
US11862299B2 (en) Algorithms for sequence determinations
US20160154930A1 (en) Methods for identification of individuals
EP4721076A1 (en) Improving structural variant alignment and variant calling by utilizing a structural-variant reference genome
Au Bioinformatic methods in mammalian genomics: applications towards conservation and health
WO2025006565A1 (en) Variant calling with methylation-level estimation
KR20250092241A (ko) 핵산 오류 억제
HK40051826A (en) Methods and systems for generation and error-correction of unique molecular index sets with heterogeneous molecular lengths
Hosseinkhan Ali Masoudi-Nejad Zahra Narimani
Heinrich Aspects of Quality Control for Next Generation Sequencing Data in Medical Genetics
HK40014695A (en) Method,machine readable medium and computer system for sequencing nucleic acid molecules

Legal Events

Date Code Title Description
EEER Examination request

Effective date: 20200908

FZDE Discontinued

Effective date: 20230214