CN104217134A - 用于snp分析和基因组测序的系统和方法 - Google Patents
用于snp分析和基因组测序的系统和方法 Download PDFInfo
- Publication number
- CN104217134A CN104217134A CN201410228956.7A CN201410228956A CN104217134A CN 104217134 A CN104217134 A CN 104217134A CN 201410228956 A CN201410228956 A CN 201410228956A CN 104217134 A CN104217134 A CN 104217134A
- Authority
- CN
- China
- Prior art keywords
- nucleic acid
- acid sequence
- subsequence
- index
- computer
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2228—Indexing structures
- G06F16/2255—Hash tables
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/10—Sequence alignment; Homology search
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medical Informatics (AREA)
- Health & Medical Sciences (AREA)
- Analytical Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Chemical & Material Sciences (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Apparatus Associated With Microorganisms And Enzymes (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/904,738 US10191929B2 (en) | 2013-05-29 | 2013-05-29 | Systems and methods for SNP analysis and genome sequencing |
| US13/904,738 | 2013-05-29 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN104217134A true CN104217134A (zh) | 2014-12-17 |
Family
ID=50884687
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201410228956.7A Pending CN104217134A (zh) | 2013-05-29 | 2014-05-27 | 用于snp分析和基因组测序的系统和方法 |
Country Status (5)
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN106021992A (zh) * | 2015-03-27 | 2016-10-12 | 知源生信公司(美国硅谷) | 位置相关变体识别计算流水线 |
| CN107111690A (zh) * | 2014-12-23 | 2017-08-29 | 皇家飞利浦有限公司 | 用于序列对齐的系统、方法、和装置 |
| CN110168647A (zh) * | 2016-11-16 | 2019-08-23 | 宜曼达股份有限公司 | 测序数据读段重新比对的方法 |
| CN110476215A (zh) * | 2017-03-29 | 2019-11-19 | 南托米克斯有限责任公司 | 用于多序列文件的签名-散列 |
| CN112309501A (zh) * | 2019-08-02 | 2021-02-02 | 华为技术有限公司 | 基因比对技术 |
| CN112534507A (zh) * | 2018-10-31 | 2021-03-19 | Illumina公司 | 用于测序读值的分组和折叠的系统和方法 |
Families Citing this family (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10262107B1 (en) * | 2013-03-15 | 2019-04-16 | Bao Tran | Pharmacogenetic drug interaction management system |
| US10191929B2 (en) | 2013-05-29 | 2019-01-29 | Noblis, Inc. | Systems and methods for SNP analysis and genome sequencing |
| US10560552B2 (en) | 2015-05-21 | 2020-02-11 | Noblis, Inc. | Compression and transmission of genomic information |
| MX2018015412A (es) | 2016-10-07 | 2019-05-27 | Illumina Inc | Sistema y metodo para analisis secundario de datos de secuenciamiento de nucleotido. |
| US11222712B2 (en) | 2017-05-12 | 2022-01-11 | Noblis, Inc. | Primer design using indexed genomic information |
| WO2020190891A2 (en) * | 2019-03-15 | 2020-09-24 | The Trustees Of Columbia University In The City Of New York | Systems and methods for analyzing sequencing data |
| US12195729B2 (en) | 2020-04-16 | 2025-01-14 | Noblis, Inc. | Portable field-deployable nucleic acid sequencing kit |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6141657A (en) * | 1995-10-24 | 2000-10-31 | Curagen Corporation | Method and apparatus for identifying classifying or quantifying DNA sequences in a sample without sequencing |
| WO2007137225A2 (en) * | 2006-05-19 | 2007-11-29 | The University Of Chicago | Method for indexing nucleic acid sequences for computer based searching |
| WO2010104608A2 (en) * | 2009-03-13 | 2010-09-16 | Life Technologies Corporation | Computer implemented method for indexing reference genome |
| CN102682226A (zh) * | 2012-04-18 | 2012-09-19 | 盛司潼 | 一种核酸测序信息处理系统及方法 |
| WO2012168815A2 (en) * | 2011-06-06 | 2012-12-13 | Koninklijke Philips Electronics N.V. | Method for assembly of nucleic acid sequence data |
Family Cites Families (54)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2036946C (en) * | 1990-04-06 | 2001-10-16 | Kenneth V. Deugau | Indexing linkers |
| GB9214873D0 (en) * | 1992-07-13 | 1992-08-26 | Medical Res Council | Process for categorising nucleotide sequence populations |
| EP0880598A4 (en) * | 1996-01-23 | 2005-02-23 | Affymetrix Inc | RAPID EVALUATION OF NUCLEIC ACID ABUNDANCE DIFFERENCE, WITH A HIGH-DENSITY OLIGONUCLEOTIDE SYSTEM |
| US5994068A (en) * | 1997-03-11 | 1999-11-30 | Wisconsin Alumni Research Foundation | Nucleic acid indexing |
| US6057779A (en) * | 1997-08-14 | 2000-05-02 | Micron Technology, Inc. | Method of controlling access to a movable container and to a compartment of a vehicle, and a secure cargo transportation system |
| EP1044417B1 (en) * | 1998-10-30 | 2002-12-11 | International Business Machines Corporation | Methods and apparatus for performing sequence homology detection |
| AU3078400A (en) * | 1999-01-25 | 2000-08-07 | Institute Of Medicinal Molecular Design. Inc. | Describing and storing method of alignment information |
| AU2001271255A1 (en) * | 2000-05-19 | 2002-01-02 | Curagen Corporation | Method for analyzing a nucleic acid |
| WO2002002740A2 (en) * | 2000-07-05 | 2002-01-10 | Rosetta Inpharmatics, Inc. | Methods and compositions for determining gene function |
| US6963807B2 (en) * | 2000-09-08 | 2005-11-08 | Oxford Glycosciences (Uk) Ltd. | Automated identification of peptides |
| WO2003017138A1 (fr) * | 2001-08-21 | 2003-02-27 | Institute Of Medicinal Molecular Design. Inc. | Procede de lecture d'informations d'une sequence biologique et procede de stockage |
| US7406385B2 (en) * | 2001-10-25 | 2008-07-29 | Applera Corporation | System and method for consensus-calling with per-base quality values for sample assemblies |
| US7809510B2 (en) * | 2002-02-27 | 2010-10-05 | Ip Genesis, Inc. | Positional hashing method for performing DNA sequence similarity search |
| US20080168572A1 (en) * | 2002-05-01 | 2008-07-10 | Diversa Corporation | Nanoarchaeum genome, Nanoarchaeum polypeptides and nucleic acids encoding them and methods for making and using them |
| US20040224330A1 (en) * | 2003-01-15 | 2004-11-11 | Liyan He | Nucleic acid indexing |
| US20050142584A1 (en) * | 2003-10-01 | 2005-06-30 | Willson Richard C. | Microbial identification based on the overall composition of characteristic oligonucleotides |
| US20050239102A1 (en) * | 2003-10-31 | 2005-10-27 | Verdine Gregory L | Nucleic acid binding oligonucleotides |
| US20080263002A1 (en) | 2004-03-31 | 2008-10-23 | Shinichi Morishita | Base Sequence Retrieval Apparatus |
| US7487169B2 (en) * | 2004-11-24 | 2009-02-03 | International Business Machines Corporation | Method for finding the longest common subsequences between files with applications to differential compression |
| US7424371B2 (en) * | 2004-12-21 | 2008-09-09 | Helicos Biosciences Corporation | Nucleic acid analysis |
| US20060286566A1 (en) * | 2005-02-03 | 2006-12-21 | Helicos Biosciences Corporation | Detecting apparent mutations in nucleic acid sequences |
| WO2007008987A1 (en) * | 2005-07-11 | 2007-01-18 | Emolecules, Inc. | Molecular keyword indexing for chemical structure database storage, searching and retrieval |
| WO2008000090A1 (en) * | 2006-06-30 | 2008-01-03 | University Of Guelph | Dna barcode sequence classification |
| US8700334B2 (en) * | 2006-07-31 | 2014-04-15 | International Business Machines Corporation | Methods and systems for reconstructing genomic common ancestors |
| CA2681568C (en) * | 2006-11-23 | 2019-01-08 | Querdenker Aps | Oligonucleotides for modulating target rna activity |
| WO2008093098A2 (en) * | 2007-02-02 | 2008-08-07 | Illumina Cambridge Limited | Methods for indexing samples and sequencing multiple nucleotide templates |
| GB0703793D0 (en) * | 2007-02-27 | 2007-04-04 | Biosystems Informatics Inst | Signature peptide identification |
| US20100114918A1 (en) * | 2007-05-31 | 2010-05-06 | Isentio As | Generation of degenerate sequences and identification of individual sequences from a degenerate sequence |
| US7809765B2 (en) * | 2007-08-24 | 2010-10-05 | General Electric Company | Sequence identification and analysis |
| WO2009143212A1 (en) * | 2008-05-21 | 2009-11-26 | Mito Tech Llc | Computer system and computer-facilitated method for nucleic acid sequence alignment and analysis |
| WO2010056131A1 (en) * | 2008-11-14 | 2010-05-20 | Real Time Genomics, Inc. | A method and system for analysing data sequences |
| WO2010091023A2 (en) * | 2009-02-03 | 2010-08-12 | Complete Genomics, Inc. | Indexing a reference sequence for oligomer sequence mapping |
| WO2010105428A1 (en) * | 2009-03-19 | 2010-09-23 | Google Inc. | Input method editor |
| US8594248B2 (en) * | 2009-11-05 | 2013-11-26 | Nec Laboratories America, Inc. | Reverse indexing methods and systems |
| US9165109B2 (en) * | 2010-02-24 | 2015-10-20 | Pacific Biosciences Of California, Inc. | Sequence assembly and consensus sequence determination |
| US20110257889A1 (en) * | 2010-02-24 | 2011-10-20 | Pacific Biosciences Of California, Inc. | Sequence assembly and consensus sequence determination |
| WO2011137368A2 (en) * | 2010-04-30 | 2011-11-03 | Life Technologies Corporation | Systems and methods for analyzing nucleic acid sequences |
| KR101638594B1 (ko) * | 2010-05-26 | 2016-07-20 | 삼성전자주식회사 | Dna 서열 검색 방법 및 장치 |
| EP2591433A4 (en) * | 2010-07-06 | 2017-05-17 | Life Technologies Corporation | Systems and methods to detect copy number variation |
| EP3138919B1 (en) * | 2010-07-23 | 2018-12-19 | Board of Trustees of Michigan State University | Feruloyl-coa:monolignol-transferase |
| US20130053541A1 (en) * | 2011-03-11 | 2013-02-28 | Lynntech, Inc. | Methods for discovering molecules that bind to proteins |
| US9276911B2 (en) | 2011-05-13 | 2016-03-01 | Indiana University Research & Technology Corporation | Secure and scalable mapping of human sequencing reads on hybrid clouds |
| US20130090266A1 (en) * | 2011-10-11 | 2013-04-11 | Biolauncher Ltd. | Methods and systems for optimization of peptide screening |
| US8209130B1 (en) * | 2012-04-04 | 2012-06-26 | Good Start Genetics, Inc. | Sequence assembly |
| US9600625B2 (en) | 2012-04-23 | 2017-03-21 | Bina Technologies, Inc. | Systems and methods for processing nucleic acid sequence data |
| US8812243B2 (en) | 2012-05-09 | 2014-08-19 | International Business Machines Corporation | Transmission and compression of genetic data |
| WO2014081456A1 (en) * | 2012-11-26 | 2014-05-30 | Illumina, Inc. | Efficient comparison of polynucleotide sequences |
| WO2014099979A2 (en) * | 2012-12-17 | 2014-06-26 | Virginia Tech Intellectual Properties, Inc. | Methods and compositions for identifying global microsatellite instability and for characterizing informative microsatellite loci |
| US20130309666A1 (en) * | 2013-01-25 | 2013-11-21 | Sequenom, Inc. | Methods and processes for non-invasive assessment of genetic variations |
| EP2970951B1 (en) * | 2013-03-13 | 2019-02-20 | Illumina, Inc. | Methods for nucleic acid sequencing |
| US10191929B2 (en) | 2013-05-29 | 2019-01-29 | Noblis, Inc. | Systems and methods for SNP analysis and genome sequencing |
| CN104699998A (zh) | 2013-12-06 | 2015-06-10 | 国际商业机器公司 | 用于对基因组进行压缩和解压缩的方法和装置 |
| US10560552B2 (en) | 2015-05-21 | 2020-02-11 | Noblis, Inc. | Compression and transmission of genomic information |
| US11222712B2 (en) | 2017-05-12 | 2022-01-11 | Noblis, Inc. | Primer design using indexed genomic information |
-
2013
- 2013-05-29 US US13/904,738 patent/US10191929B2/en active Active
-
2014
- 2014-05-20 IN IN1682MU2014 patent/IN2014MU01682A/en unknown
- 2014-05-27 CN CN201410228956.7A patent/CN104217134A/zh active Pending
- 2014-05-28 EP EP14170198.7A patent/EP2808814A3/en not_active Withdrawn
- 2014-05-28 SG SG10201402714UA patent/SG10201402714UA/en unknown
-
2019
- 2019-01-25 US US16/257,552 patent/US11308056B2/en active Active
-
2022
- 2022-04-18 US US17/723,053 patent/US12141116B2/en active Active
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6141657A (en) * | 1995-10-24 | 2000-10-31 | Curagen Corporation | Method and apparatus for identifying classifying or quantifying DNA sequences in a sample without sequencing |
| WO2007137225A2 (en) * | 2006-05-19 | 2007-11-29 | The University Of Chicago | Method for indexing nucleic acid sequences for computer based searching |
| US20090270277A1 (en) * | 2006-05-19 | 2009-10-29 | The University Of Chicago | Method for indexing nucleic acid sequences for computer based searching |
| WO2010104608A2 (en) * | 2009-03-13 | 2010-09-16 | Life Technologies Corporation | Computer implemented method for indexing reference genome |
| WO2012168815A2 (en) * | 2011-06-06 | 2012-12-13 | Koninklijke Philips Electronics N.V. | Method for assembly of nucleic acid sequence data |
| CN102682226A (zh) * | 2012-04-18 | 2012-09-19 | 盛司潼 | 一种核酸测序信息处理系统及方法 |
Non-Patent Citations (1)
| Title |
|---|
| ZEMIN NING ET AL.: "SSAHA: A Fast Search Method for Large DNA Databases", 《GENOME RESEARCH》 * |
Cited By (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107111690A (zh) * | 2014-12-23 | 2017-08-29 | 皇家飞利浦有限公司 | 用于序列对齐的系统、方法、和装置 |
| CN106021992A (zh) * | 2015-03-27 | 2016-10-12 | 知源生信公司(美国硅谷) | 位置相关变体识别计算流水线 |
| CN110168647A (zh) * | 2016-11-16 | 2019-08-23 | 宜曼达股份有限公司 | 测序数据读段重新比对的方法 |
| CN110168647B (zh) * | 2016-11-16 | 2023-10-31 | 宜曼达股份有限公司 | 测序数据读段重新比对的方法 |
| CN110476215A (zh) * | 2017-03-29 | 2019-11-19 | 南托米克斯有限责任公司 | 用于多序列文件的签名-散列 |
| CN112534507A (zh) * | 2018-10-31 | 2021-03-19 | Illumina公司 | 用于测序读值的分组和折叠的系统和方法 |
| CN112534507B (zh) * | 2018-10-31 | 2024-03-15 | 因美纳有限公司 | 用于测序读值的分组和折叠的系统和方法 |
| CN112309501A (zh) * | 2019-08-02 | 2021-02-02 | 华为技术有限公司 | 基因比对技术 |
Also Published As
| Publication number | Publication date |
|---|---|
| EP2808814A3 (en) | 2015-06-03 |
| US20140358937A1 (en) | 2014-12-04 |
| EP2808814A2 (en) | 2014-12-03 |
| US11308056B2 (en) | 2022-04-19 |
| IN2014MU01682A (GUID-C5D7CC26-194C-43D0-91A1-9AE8C70A9BFF.html) | 2015-09-04 |
| SG10201402714UA (en) | 2014-12-30 |
| US10191929B2 (en) | 2019-01-29 |
| US20190146962A1 (en) | 2019-05-16 |
| US12141116B2 (en) | 2024-11-12 |
| US20220253420A1 (en) | 2022-08-11 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12141116B2 (en) | Systems and methods for SNP analysis and genome sequencing | |
| US12249402B2 (en) | Sequencing methods | |
| Sahlin et al. | De novo clustering of long-read transcriptome data using a greedy, quality value-based algorithm | |
| Lin et al. | GSAlign: an efficient sequence alignment tool for intra-species genomes | |
| Bao et al. | Review of current methods, applications, and data management for the bioinformatics analysis of whole exome sequencing | |
| Mielczarek et al. | Review of alignment and SNP calling algorithms for next-generation sequencing data | |
| Parkinson et al. | Making sense of EST sequences by CLOBBing them | |
| US9165109B2 (en) | Sequence assembly and consensus sequence determination | |
| Shajii et al. | Fast genotyping of known SNPs through approximate k-mer matching | |
| Gong et al. | Analysis and performance assessment of the whole genome bisulfite sequencing data workflow: currently available tools and a practical guide to advance DNA methylation studies | |
| Wang et al. | Vertebrate gene predictions and the problem of large genes | |
| Johnston et al. | PEMapper and PECaller provide a simplified approach to whole-genome sequencing | |
| CA2942816A1 (en) | Systems and methods for genomic variant annotation | |
| US20140244639A1 (en) | Surprisal data reduction of genetic data for transmission, storage, and analysis | |
| US8855938B2 (en) | Minimization of surprisal data through application of hierarchy of reference genomes | |
| Kallenborn et al. | CARE: context-aware sequencing read error correction | |
| Homer et al. | Local alignment of two-base encoded DNA sequence | |
| Liu et al. | Prioritization of cancer-related genomic variants by SNP association network | |
| CA2871563C (en) | Minimization of surprisal data through application of hierarchy of reference genomes | |
| McGeachie et al. | Joint GWAS analysis: comparing similar GWAS at different genomic resolutions identifies novel pathway associations with six complex diseases | |
| Min et al. | Survey of programs used to detect alternative splicing isoforms from deep sequencing data in silico | |
| Kim et al. | Reprever: resolving low-copy duplicated sequences using template driven assembly | |
| Nakato et al. | Cgaln: fast and space-efficient whole-genome alignment | |
| Sun et al. | Genome-scale NCRNA homology search using a Hamming distance-based filtration strategy | |
| Wu et al. | User-friendly genome assembly and gene annotation pipelines for vertebrates |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20141217 |
|
| WD01 | Invention patent application deemed withdrawn after publication |