WO2001063543A8 - Procede et systeme d'assemblage d'un genome entier au moyen d'un ensemble de donnees prises au hasard - Google Patents

Procede et systeme d'assemblage d'un genome entier au moyen d'un ensemble de donnees prises au hasard

Info

Publication number
WO2001063543A8
WO2001063543A8 PCT/US2001/002704 US0102704W WO0163543A8 WO 2001063543 A8 WO2001063543 A8 WO 2001063543A8 US 0102704 W US0102704 W US 0102704W WO 0163543 A8 WO0163543 A8 WO 0163543A8
Authority
WO
WIPO (PCT)
Prior art keywords
shot
genome
assembly
dna
data set
Prior art date
Application number
PCT/US2001/002704
Other languages
English (en)
Other versions
WO2001063543A3 (fr
WO2001063543A2 (fr
Inventor
Gene W Myers
Arthur L Delcher
Ian M Dew
Michael J Flanigan
Saul A Kravitz
Clark M Mobarry
Knut Reinert
Karin A Remington
Granger G Sutton
Original Assignee
Pe Corp Ny
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/526,131 external-priority patent/US6714874B1/en
Application filed by Pe Corp Ny filed Critical Pe Corp Ny
Priority to JP2001562433A priority Critical patent/JP2003530631A/ja
Priority to EP01908713A priority patent/EP1285390A2/fr
Priority to AU2001236555A priority patent/AU2001236555A1/en
Priority to CA002400890A priority patent/CA2400890A1/fr
Publication of WO2001063543A2 publication Critical patent/WO2001063543A2/fr
Publication of WO2001063543A8 publication Critical patent/WO2001063543A8/fr
Publication of WO2001063543A3 publication Critical patent/WO2001063543A3/fr

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/20Sequence assembly

Landscapes

  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Biophysics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

La présente invention concerne des procédés et des systèmes d'assemblage d'un génome à partir d'un ensemble pris au hasard de fragments d'ADN à extrémité séquentielle. Plus particulièrement, la présente invention concerne un procédé destiné à déterminer la séquence génomique (séquence de base et orientation) d'un génome complexe au moyen d'informations d'ADN en séquence générées à partir de plusieurs fragments d'ADN obtenus par le génome. Ce procédé est particulièrement utile dans l'assemblage de génomes d'au moins 10MB (jusqu'à 5GB) et qui sont constitués d'au moins 5 % de séquences d'ADN répétitif (jusqu'à 25%), mais il peut également être utilisé pour de plus petits génomes avec un pourcentage inférieur d'ADN répétitif.
PCT/US2001/002704 2000-02-22 2001-01-29 Procede et systeme d'assemblage d'un genome entier au moyen d'un ensemble de donnees prises au hasard WO2001063543A2 (fr)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2001562433A JP2003530631A (ja) 2000-02-22 2001-01-29 ショットガンデータ集合を用いた全ゲノムのアセンブリのための方法及びシステム
EP01908713A EP1285390A2 (fr) 2000-02-22 2001-01-29 Procede et systeme d'assemblage d'un genome entier au moyen d'un ensemble de donnees prises au hasard
AU2001236555A AU2001236555A1 (en) 2000-02-22 2001-01-29 Method and system for the assembly of a whole genome using a shot-gun data set
CA002400890A CA2400890A1 (fr) 2000-02-22 2001-01-29 Procede et systeme d'assemblage d'un genome entier au moyen d'un ensemble de donnees prises au hasard

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US18375800P 2000-02-22 2000-02-22
US60/183,758 2000-02-22
US09/526,131 2000-03-15
US09/526,131 US6714874B1 (en) 2000-03-15 2000-03-15 Method and system for the assembly of a whole genome using a shot-gun data set

Publications (3)

Publication Number Publication Date
WO2001063543A2 WO2001063543A2 (fr) 2001-08-30
WO2001063543A8 true WO2001063543A8 (fr) 2002-02-07
WO2001063543A3 WO2001063543A3 (fr) 2002-12-05

Family

ID=26879491

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/002704 WO2001063543A2 (fr) 2000-02-22 2001-01-29 Procede et systeme d'assemblage d'un genome entier au moyen d'un ensemble de donnees prises au hasard

Country Status (5)

Country Link
EP (1) EP1285390A2 (fr)
JP (1) JP2003530631A (fr)
AU (1) AU2001236555A1 (fr)
CA (1) CA2400890A1 (fr)
WO (1) WO2001063543A2 (fr)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003330934A (ja) * 2002-05-10 2003-11-21 Celestar Lexico-Sciences Inc 変異体配列解析装置、変異体配列解析方法、プログラム、および、記録媒体
US7575865B2 (en) * 2003-01-29 2009-08-18 454 Life Sciences Corporation Methods of amplifying and sequencing nucleic acids
US20110004616A1 (en) * 2007-10-31 2011-01-06 National Institute Of Agrobiological Sciences Base sequence determination program, base sequence determination device, and base sequence determination method
CN101504697B (zh) * 2008-12-12 2010-09-08 深圳华大基因研究院 一种片段连接支架的构建方法和系统
CN101457253B (zh) * 2008-12-12 2011-08-31 深圳华大基因研究院 一种测序序列纠错方法、系统及设备
US20120197533A1 (en) * 2010-10-11 2012-08-02 Complete Genomics, Inc. Identifying rearrangements in a sequenced genome
TWI420007B (zh) * 2011-03-04 2013-12-21 Hsueh Ting Chu 基因測序序列的組合系統及方法
WO2012171213A1 (fr) * 2011-06-17 2012-12-20 深圳华大基因科技有限公司 Procédé et système pour l'assemblage d'un génome
BR102012031096B1 (pt) * 2012-12-05 2019-10-22 Empresa Brasileira De Pesquisa Agropecuaria Embrapa método e uso para verificação de erros de montagem em genomas
AU2013382195B2 (en) 2013-03-13 2019-09-19 Illumina, Inc. Methods and systems for aligning repetitive DNA elements
CN104164479B (zh) * 2014-04-04 2017-09-19 深圳华大基因科技服务有限公司 杂合基因组处理方法
CN104298892B (zh) * 2014-09-18 2017-05-10 天津诺禾致源生物信息科技有限公司 基因融合的检测装置和方法

Also Published As

Publication number Publication date
AU2001236555A1 (en) 2001-09-03
JP2003530631A (ja) 2003-10-14
WO2001063543A3 (fr) 2002-12-05
CA2400890A1 (fr) 2001-08-30
EP1285390A2 (fr) 2003-02-26
WO2001063543A2 (fr) 2001-08-30

Similar Documents

Publication Publication Date Title
WO2001063543A3 (fr) Procede et systeme d'assemblage d'un genome entier au moyen d'un ensemble de donnees prises au hasard
US20110237444A1 (en) Methods of mapping genomic methylation patterns
WO2004061616A3 (fr) Systemes et procedes informatiques permettant d'associer des genes avec des caracteristiques au moyen de donnees heterospecifiques
WO2000040755A3 (fr) Acceleration de l'identification des polymorphismes d'un nucleotide unique et alignement de clones dans le sequençage genomique
WO2000024937A3 (fr) Techniques paralleles pour analyse genomique
NO994441L (no) Ekstraksjon og anvendelse av VNTR-alleler
WO2002002806A3 (fr) Procede et acides nucleiques pour analyse de methylation pharmacogenomique
WO1997027331A3 (fr) Procedes et compositions permettant de determiner la sequence de molecules d'acides nucleiques
WO2001071042A3 (fr) Necessaires de detection, tels que des jeux ordonnes d'echantillons d'acide nucleique, servant a detecter l'expression d'au moins 10.000 genes de drosophila et leur utilisation
WO2001016378A3 (fr) Analyse s'etendant aux chromosomes des interactions entre une proteine et l'adn
WO2002036831A3 (fr) Colza canola pv-bngt(rt73), compositions et procedes de detection correspondants
AU4438099A (en) Nucleotide analogues with 3'-pro-fluorescent fluorophores in nucleic acid sequence analysis
WO2000012726A3 (fr) Heparinases a conception rationnelle derivees de l'heparinase i et ii
EP1717312A4 (fr) Puce a adn pour l analyse de la methylation de l'adn et son procede de fabrication, et procede d analyse de la methylation de l adn
AU2002252297A1 (en) Methods and tools for nucleic acid sequence analysis selection and generation
WO2000022171A3 (fr) Systemes et procedes de sequençage par hybridation
WO2011063210A2 (fr) Methodes de mappage de profils de methylation genomique
AU2002352902A1 (en) Thermus thermophilus nucleic acid polymerases
CN1252103A (zh) 分析dna特征的方法
DE69917636D1 (en) Polymerasesignalversuch
EP1117779A4 (fr) PROTEINE $i(MORAXELLA CATARRHALIS), SEQUENCE D'ACIDE NUCLEIQUE ET UTILISATIONS DE CELLES-CI
EP1132483A3 (fr) Méthode de diagnostic de la schizophrénie utilisant des indices objectifs
AU2002346517A1 (en) Thermus oshimai nucleic acid polymerases
AU2001294653A1 (en) Automated method of identifying and archiving nucleic acid sequences
WO2001038351A3 (fr) Sequence nucleotidique primaire du virus bacilliforme des crevettes a points blancs (wsbv), systemes de decouverte contenant cette sequence ainsi que des materiels de detection et des cibles antivirales de detection et de lutte contre la poussee et la propagation du virus

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: C1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: C1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

CFP Corrected version of a pamphlet front page
CR1 Correction of entry in section i

Free format text: PAT. BUL. 35/2001 UNDER (30) REPLACE "90/526131, 15.03.00, US" BY "09/526131, 15.03.00, US"

WWE Wipo information: entry into national phase

Ref document number: 2400890

Country of ref document: CA

ENP Entry into the national phase

Ref country code: JP

Ref document number: 2001 562433

Kind code of ref document: A

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: 2001908713

Country of ref document: EP

AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWP Wipo information: published in national office

Ref document number: 2001908713

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 2001908713

Country of ref document: EP