SG10201402714UA - Systems and methods for snp analysis and genome sequencing - Google Patents

Systems and methods for snp analysis and genome sequencing

Info

Publication number
SG10201402714UA
SG10201402714UA SG10201402714UA SG10201402714UA SG10201402714UA SG 10201402714U A SG10201402714U A SG 10201402714UA SG 10201402714U A SG10201402714U A SG 10201402714UA SG 10201402714U A SG10201402714U A SG 10201402714UA SG 10201402714U A SG10201402714U A SG 10201402714UA
Authority
SG
Singapore
Prior art keywords
acid sequence
nucleic acid
index
systems
methods
Prior art date
Application number
SG10201402714UA
Inventor
Thomas Sterling
Dellinger Nathan
Original Assignee
Noblis Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Noblis Inc filed Critical Noblis Inc
Publication of SG10201402714UA publication Critical patent/SG10201402714UA/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • G06F16/2255Hash tables
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Medical Informatics (AREA)
  • Health & Medical Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Chemical & Material Sciences (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)

Abstract

SYSTEMS AND METHODS FOR SNP ANALYSIS AND GENOME In one embodiment, system comprising a a processor and a memory storing instructions executable by the processor creates an index for a nucleic acid sequence. The index comprises plurality of elements. a Each element corresponds to a permutation of nucleic a acid sequence. Data representing a nucleic acid sequence is received. A subsequence of the nucleic acid sequence is identified in the data at a first position of the nucleic acid sequence. A hash of the subsequence is computed to determine a corresponding element of the index. Position data reflecting the first position is stored in the corresponding element of the index. 41
SG10201402714UA 2013-05-29 2014-05-28 Systems and methods for snp analysis and genome sequencing SG10201402714UA (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US13/904,738 US10191929B2 (en) 2013-05-29 2013-05-29 Systems and methods for SNP analysis and genome sequencing

Publications (1)

Publication Number Publication Date
SG10201402714UA true SG10201402714UA (en) 2014-12-30

Family

ID=50884687

Family Applications (1)

Application Number Title Priority Date Filing Date
SG10201402714UA SG10201402714UA (en) 2013-05-29 2014-05-28 Systems and methods for snp analysis and genome sequencing

Country Status (5)

Country Link
US (3) US10191929B2 (en)
EP (1) EP2808814A3 (en)
CN (1) CN104217134A (en)
IN (1) IN2014MU01682A (en)
SG (1) SG10201402714UA (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10262107B1 (en) * 2013-03-15 2019-04-16 Bao Tran Pharmacogenetic drug interaction management system
US10191929B2 (en) 2013-05-29 2019-01-29 Noblis, Inc. Systems and methods for SNP analysis and genome sequencing
WO2016103148A1 (en) * 2014-12-23 2016-06-30 Koninklijke Philips N.V. Systems, methods, and apparatuses for sequence alignment
US10424396B2 (en) * 2015-03-27 2019-09-24 Sentieon Inc. Computation pipeline of location-dependent variant calls
US10560552B2 (en) 2015-05-21 2020-02-11 Noblis, Inc. Compression and transmission of genomic information
CA3027179C (en) * 2016-10-07 2023-06-27 Illumina, Inc. System and method for secondary analysis of nucleotide sequencing data
EP3542293B1 (en) * 2016-11-16 2023-12-27 Illumina, Inc. Methods of sequencing data read realignment
SG11201908893UA (en) * 2017-03-29 2019-10-30 Nantomics Llc Signature-hash for multi-sequence files
US11222712B2 (en) 2017-05-12 2022-01-11 Noblis, Inc. Primer design using indexed genomic information
WO2020092309A1 (en) * 2018-10-31 2020-05-07 Illumina, Inc. Systems and methods for grouping and collapsing sequencing reads
WO2020190891A2 (en) * 2019-03-15 2020-09-24 The Trustees Of Columbia University In The City Of New York Systems and methods for analyzing sequencing data
CN112309501A (en) * 2019-08-02 2021-02-02 华为技术有限公司 Gene comparison technology

Family Cites Families (58)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2036946C (en) * 1990-04-06 2001-10-16 Kenneth V. Deugau Indexing linkers
GB9214873D0 (en) * 1992-07-13 1992-08-26 Medical Res Council Process for categorising nucleotide sequence populations
US5871697A (en) * 1995-10-24 1999-02-16 Curagen Corporation Method and apparatus for identifying, classifying, or quantifying DNA sequences in a sample without sequencing
EP0880598A4 (en) * 1996-01-23 2005-02-23 Affymetrix Inc Nucleic acid analysis techniques
US5994068A (en) * 1997-03-11 1999-11-30 Wisconsin Alumni Research Foundation Nucleic acid indexing
US6057779A (en) * 1997-08-14 2000-05-02 Micron Technology, Inc. Method of controlling access to a movable container and to a compartment of a vehicle, and a secure cargo transportation system
CA2315147C (en) * 1998-10-30 2004-12-28 International Business Machines Corporation Methods and apparatus for performing sequence homology detection
EP1149356A1 (en) * 1999-01-25 2001-10-31 Institute of Medicinal Molecular Design, Inc. Method for describing and storing alignment information
WO2001098535A2 (en) * 2000-05-19 2001-12-27 Curagen Corporation Method for analyzing a nucleic acid
WO2002002740A2 (en) * 2000-07-05 2002-01-10 Rosetta Inpharmatics, Inc. Methods and compositions for determining gene function
US6963807B2 (en) * 2000-09-08 2005-11-08 Oxford Glycosciences (Uk) Ltd. Automated identification of peptides
EP1429259A4 (en) * 2001-08-21 2005-08-31 Inst Med Molecular Design Inc Biological sequence information reading method and storing method
US7406385B2 (en) * 2001-10-25 2008-07-29 Applera Corporation System and method for consensus-calling with per-base quality values for sample assemblies
US7809510B2 (en) * 2002-02-27 2010-10-05 Ip Genesis, Inc. Positional hashing method for performing DNA sequence similarity search
US20080168572A1 (en) * 2002-05-01 2008-07-10 Diversa Corporation Nanoarchaeum genome, Nanoarchaeum polypeptides and nucleic acids encoding them and methods for making and using them
US20040224330A1 (en) * 2003-01-15 2004-11-11 Liyan He Nucleic acid indexing
US20050239102A1 (en) * 2003-10-31 2005-10-27 Verdine Gregory L Nucleic acid binding oligonucleotides
EP1732022A4 (en) 2004-03-31 2008-09-24 Bio Think Tank Co Ltd Base sequence retrieval apparatus
US7487169B2 (en) * 2004-11-24 2009-02-03 International Business Machines Corporation Method for finding the longest common subsequences between files with applications to differential compression
US7424371B2 (en) * 2004-12-21 2008-09-09 Helicos Biosciences Corporation Nucleic acid analysis
US20060286566A1 (en) * 2005-02-03 2006-12-21 Helicos Biosciences Corporation Detecting apparent mutations in nucleic acid sequences
WO2007008987A1 (en) * 2005-07-11 2007-01-18 Emolecules, Inc. Molecular keyword indexing for chemical structure database storage, searching and retrieval
US8116988B2 (en) * 2006-05-19 2012-02-14 The University Of Chicago Method for indexing nucleic acid sequences for computer based searching
WO2008000090A1 (en) * 2006-06-30 2008-01-03 University Of Guelph Dna barcode sequence classification
US8700334B2 (en) * 2006-07-31 2014-04-15 International Business Machines Corporation Methods and systems for reconstructing genomic common ancestors
CA2681568C (en) * 2006-11-23 2019-01-08 Querdenker Aps Oligonucleotides for modulating target rna activity
WO2008093098A2 (en) * 2007-02-02 2008-08-07 Illumina Cambridge Limited Methods for indexing samples and sequencing multiple nucleotide templates
GB0703793D0 (en) * 2007-02-27 2007-04-04 Biosystems Informatics Inst Signature peptide identification
US20100114918A1 (en) * 2007-05-31 2010-05-06 Isentio As Generation of degenerate sequences and identification of individual sequences from a degenerate sequence
US7809765B2 (en) * 2007-08-24 2010-10-05 General Electric Company Sequence identification and analysis
WO2009143212A1 (en) * 2008-05-21 2009-11-26 Mito Tech Llc Computer system and computer-facilitated method for nucleic acid sequence alignment and analysis
WO2010056131A1 (en) * 2008-11-14 2010-05-20 Real Time Genomics, Inc. A method and system for analysing data sequences
WO2010091023A2 (en) * 2009-02-03 2010-08-12 Complete Genomics, Inc. Indexing a reference sequence for oligomer sequence mapping
WO2010104608A2 (en) * 2009-03-13 2010-09-16 Life Technologies Corporation Computer implemented method for indexing reference genome
JP5362095B2 (en) * 2009-03-19 2013-12-11 グーグル・インコーポレーテッド Input method editor
US8718156B2 (en) * 2009-11-05 2014-05-06 Nec Laboratories America, Inc. Indexing methods and systems
US9165109B2 (en) * 2010-02-24 2015-10-20 Pacific Biosciences Of California, Inc. Sequence assembly and consensus sequence determination
US20110257889A1 (en) * 2010-02-24 2011-10-20 Pacific Biosciences Of California, Inc. Sequence assembly and consensus sequence determination
WO2011137368A2 (en) * 2010-04-30 2011-11-03 Life Technologies Corporation Systems and methods for analyzing nucleic acid sequences
KR101638594B1 (en) * 2010-05-26 2016-07-20 삼성전자주식회사 Method and apparatus for searching DNA sequence
EP2591433A4 (en) * 2010-07-06 2017-05-17 Life Technologies Corporation Systems and methods to detect copy number variation
BR112013001671A8 (en) * 2010-07-23 2019-09-03 Univ Michigan State feruloyl-coa: monolignol transferase.
US20130053541A1 (en) * 2011-03-11 2013-02-28 Lynntech, Inc. Methods for discovering molecules that bind to proteins
US9276911B2 (en) 2011-05-13 2016-03-01 Indiana University Research & Technology Corporation Secure and scalable mapping of human sequencing reads on hybrid clouds
EP2718862B1 (en) * 2011-06-06 2018-10-31 Koninklijke Philips N.V. Method for assembly of nucleic acid sequence data
US20130090266A1 (en) * 2011-10-11 2013-04-11 Biolauncher Ltd. Methods and systems for optimization of peptide screening
US8209130B1 (en) * 2012-04-04 2012-06-26 Good Start Genetics, Inc. Sequence assembly
CN102682226B (en) * 2012-04-18 2015-09-30 盛司潼 A kind of nucleic acid sequencing information handling system and method
US9600625B2 (en) 2012-04-23 2017-03-21 Bina Technologies, Inc. Systems and methods for processing nucleic acid sequence data
US8812243B2 (en) 2012-05-09 2014-08-19 International Business Machines Corporation Transmission and compression of genetic data
US20150310165A1 (en) * 2012-11-26 2015-10-29 Illumina, Inc. Efficient comparison of polynucleotide sequences
US20140235456A1 (en) * 2012-12-17 2014-08-21 Virginia Tech Intellectual Properties, Inc. Methods and Compositions for Identifying Global Microsatellite Instability and for Characterizing Informative Microsatellite Loci
US20130309666A1 (en) * 2013-01-25 2013-11-21 Sequenom, Inc. Methods and processes for non-invasive assessment of genetic variations
CA2898456C (en) * 2013-03-13 2020-11-10 Illumina, Inc. Methods and compositions for nucleic acid sequencing
US10191929B2 (en) 2013-05-29 2019-01-29 Noblis, Inc. Systems and methods for SNP analysis and genome sequencing
CN104699998A (en) 2013-12-06 2015-06-10 国际商业机器公司 Method and device for compressing and decompressing genome
US10560552B2 (en) 2015-05-21 2020-02-11 Noblis, Inc. Compression and transmission of genomic information
US11222712B2 (en) 2017-05-12 2022-01-11 Noblis, Inc. Primer design using indexed genomic information

Also Published As

Publication number Publication date
IN2014MU01682A (en) 2015-09-04
CN104217134A (en) 2014-12-17
EP2808814A3 (en) 2015-06-03
US20140358937A1 (en) 2014-12-04
US20190146962A1 (en) 2019-05-16
US20220253420A1 (en) 2022-08-11
US10191929B2 (en) 2019-01-29
US11308056B2 (en) 2022-04-19
EP2808814A2 (en) 2014-12-03

Similar Documents

Publication Publication Date Title
SG10201402714UA (en) Systems and methods for snp analysis and genome sequencing
ES2547321T3 (en) System and method to optimize a tracking system
WO2013019869A3 (en) Data fingerpringting for copy accuracy assurance
WO2014178050A3 (en) 3d registration of a plurality of 3d models
MX2015011901A (en) Systems and methods for disease associated human genomic variant analysis and reporting.
EP4261828A3 (en) Methods and processes for non-invasive assessment of genetic variations
GB2550798A (en) Order pairing system and method
JP2014096164A5 (en)
BR112015006948A2 (en) system for recording a coordinate system of a format detection system, method for recording a coordinate system of a format detection system and computer program product
GB2510506A (en) Generating compiled code that indicates register liveness
EA201690256A1 (en) SYSTEM AND METHOD FOR PLANNING AREA FOR VEHICLES
TW201612743A (en) Bit group interleave processors, methods, systems, and instructions
EP2889760A3 (en) SMS4 acceleration processors, methods, systems, and instructions
GB2535364A (en) System and method for indicating queue characteristics of electronic terminals
MX361184B (en) Systems and methods for quantitative evaluation of a property for renovation.
BR112015012452A2 (en) user action-based feature query in online systems
BR112015018922A8 (en) device, method and one or more computer readable non-transient storage media for routine estimation
BR112012028406A2 (en) method, at least one memory and at least one computer program code configured with at least one data and apparatus
WO2014120851A3 (en) Method and system for visualizing documents
GB2531678A (en) Moving objects in primary computer based on memory errors in secondary computer
JP2016509714A5 (en)
WO2014140848A3 (en) Systems and methods for providing retail process analytics information based on physiological indicator data
RU2015138548A (en) SYSTEM FOR SECURING THE FLOW OF OPERATIONS OF BUSINESS PROCESS
BR112014013562A2 (en) method, storage instructions through non-transient storage, and apparatus
GB201205560D0 (en) Location text