CN101430742B - 一种组装基因组的方法 - Google Patents
一种组装基因组的方法 Download PDFInfo
- Publication number
- CN101430742B CN101430742B CN2008102183389A CN200810218338A CN101430742B CN 101430742 B CN101430742 B CN 101430742B CN 2008102183389 A CN2008102183389 A CN 2008102183389A CN 200810218338 A CN200810218338 A CN 200810218338A CN 101430742 B CN101430742 B CN 101430742B
- Authority
- CN
- China
- Prior art keywords
- node
- short
- bruijn
- sequential value
- short string
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/20—Sequence assembly
Landscapes
- Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (5)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2008102183389A CN101430742B (zh) | 2008-12-12 | 2008-12-12 | 一种组装基因组的方法 |
PCT/CN2009/001427 WO2010066115A1 (zh) | 2008-12-12 | 2009-12-11 | 一种降低短序列组装过程的时间复杂度的方法及系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2008102183389A CN101430742B (zh) | 2008-12-12 | 2008-12-12 | 一种组装基因组的方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101430742A CN101430742A (zh) | 2009-05-13 |
CN101430742B true CN101430742B (zh) | 2011-06-29 |
Family
ID=40646135
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2008102183389A Active CN101430742B (zh) | 2008-12-12 | 2008-12-12 | 一种组装基因组的方法 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN101430742B (zh) |
WO (1) | WO2010066115A1 (zh) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101430742B (zh) * | 2008-12-12 | 2011-06-29 | 深圳华大基因研究院 | 一种组装基因组的方法 |
US8223043B2 (en) | 2009-12-23 | 2012-07-17 | Industrial Technology Research Institute | Method and apparatus for compressing nucleotide sequence data |
WO2012171213A1 (zh) * | 2011-06-17 | 2012-12-20 | 深圳华大基因科技有限公司 | 一种基因组组装方法和系统 |
WO2013004005A1 (zh) * | 2011-07-05 | 2013-01-10 | 深圳华大基因科技有限公司 | 组装测序片段的方法 |
US8751166B2 (en) * | 2012-03-23 | 2014-06-10 | International Business Machines Corporation | Parallelization of surprisal data reduction and genome construction from genetic data for transmission, storage, and analysis |
US8812243B2 (en) | 2012-05-09 | 2014-08-19 | International Business Machines Corporation | Transmission and compression of genetic data |
US8855938B2 (en) | 2012-05-18 | 2014-10-07 | International Business Machines Corporation | Minimization of surprisal data through application of hierarchy of reference genomes |
US10353869B2 (en) | 2012-05-18 | 2019-07-16 | International Business Machines Corporation | Minimization of surprisal data through application of hierarchy filter pattern |
US8972406B2 (en) | 2012-06-29 | 2015-03-03 | International Business Machines Corporation | Generating epigenetic cohorts through clustering of epigenetic surprisal data based on parameters |
US9002888B2 (en) | 2012-06-29 | 2015-04-07 | International Business Machines Corporation | Minimization of epigenetic surprisal data of epigenetic data within a time series |
CN103258145B (zh) * | 2012-12-22 | 2016-06-29 | 中国科学院深圳先进技术研究院 | 一种基于De Bruijn图的并行基因拼接方法 |
CN103093121B (zh) * | 2012-12-28 | 2016-01-27 | 深圳先进技术研究院 | 双向多步deBruijn图的压缩存储和构造方法 |
CN103699819B (zh) * | 2013-12-10 | 2016-09-07 | 深圳先进技术研究院 | 基于多步双向De Bruijn图的变长kmer查询的顶点扩展方法 |
CN104751015B (zh) * | 2013-12-30 | 2017-08-29 | 中国科学院天津工业生物技术研究所 | 一种基因组测序数据序列组装方法 |
CN106067824B (zh) * | 2016-06-02 | 2019-11-05 | 洛阳晶云信息科技有限公司 | 一种基于二联密码子的测序数据压缩方法 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004055709A3 (en) * | 2002-12-13 | 2005-04-14 | Applera Corp | Methods for identifying, viewing, and analyzing syntenic and orthologous genomic regions between two or more species |
US6952651B2 (en) * | 2002-06-17 | 2005-10-04 | Intel Corporation | Methods and apparatus for nucleic acid sequencing by signal stretching and data integration |
CN101196921A (zh) * | 2007-12-24 | 2008-06-11 | 北京大学 | 用于近似查询的长序列数据降维方法 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101430742B (zh) * | 2008-12-12 | 2011-06-29 | 深圳华大基因研究院 | 一种组装基因组的方法 |
-
2008
- 2008-12-12 CN CN2008102183389A patent/CN101430742B/zh active Active
-
2009
- 2009-12-11 WO PCT/CN2009/001427 patent/WO2010066115A1/zh active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6952651B2 (en) * | 2002-06-17 | 2005-10-04 | Intel Corporation | Methods and apparatus for nucleic acid sequencing by signal stretching and data integration |
WO2004055709A3 (en) * | 2002-12-13 | 2005-04-14 | Applera Corp | Methods for identifying, viewing, and analyzing syntenic and orthologous genomic regions between two or more species |
CN101196921A (zh) * | 2007-12-24 | 2008-06-11 | 北京大学 | 用于近似查询的长序列数据降维方法 |
Also Published As
Publication number | Publication date |
---|---|
WO2010066115A1 (zh) | 2010-06-17 |
CN101430742A (zh) | 2009-05-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101430742B (zh) | 一种组装基因组的方法 | |
CN110457319B (zh) | 区块链状态数据存储方法及装置、电子设备 | |
CN103150394B (zh) | 面向高性能计算的分布式文件系统元数据管理方法 | |
Rahman et al. | Representation of k-mer sets using spectrum-preserving string sets | |
CN110347684B (zh) | 基于区块链的分级存储方法及装置、电子设备 | |
JP2022547956A (ja) | ブロックチェーンデータをインデックスする方法およびブロックチェーンデータを格納する方法 | |
CN105117355A (zh) | 存储器、存储器系统及其数据处理方法 | |
CN101577662B (zh) | 一种基于树形数据结构的最长前缀匹配方法和装置 | |
CN1983266B (zh) | 闪速类介质中存储事务记录的文件系统 | |
CN110275864B (zh) | 索引建立方法、数据查询方法及计算设备 | |
CN104794177B (zh) | 一种数据存储方法及装置 | |
CN1318960C (zh) | 用于进行寄存器重命名的处理器的方法 | |
US20120124216A1 (en) | Address generation and cluster extension in distrubted systems using tree method | |
CN103164490A (zh) | 一种不固定长度数据的高效存储实现方法和装置 | |
CN104424199A (zh) | 搜索方法和装置 | |
CN103051543A (zh) | 一种路由前缀的处理、查找、增加及删除方法 | |
US9065469B2 (en) | Compression match enumeration | |
CN104731886A (zh) | 一种海量小文件的处理方法及系统 | |
CN109033278A (zh) | 数据处理方法、装置、电子设备及计算机存储介质 | |
Goldwasser et al. | Linear-time algorithms for computing maximum-density sequence segments with bioinformatics applications | |
Dasari et al. | Multi-start heuristics for the profitable tour problem | |
CN107451070A (zh) | 一种数据的处理方法和服务器 | |
CN103207866A (zh) | 一种基于分块策略的文件存储方法及系统 | |
CN103077214A (zh) | 文件存储方法及装置 | |
WO2013054588A1 (ja) | 情報処理装置、データストア操作方法、データ構築装置、データ構築方法、データ結合装置、データ結合方法およびプログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: BGI TECHNOLOGY SOLUTIONS CO., LTD. Free format text: FORMER OWNER: BGI-SHENZHEN Effective date: 20130826 |
|
C41 | Transfer of patent application or patent right or utility model | ||
COR | Change of bibliographic data |
Free format text: CORRECT: ADDRESS; FROM: 518083 SHENZHEN, GUANGDONG PROVINCE TO: 518000 SHENZHEN, GUANGDONG PROVINCE |
|
TR01 | Transfer of patent right |
Effective date of registration: 20130826 Address after: 518000 science and Technology Pioneer Park, comprehensive building, Beishan Industrial Zone, Yantian District, Guangdong, Shenzhen 201 Patentee after: BGI Technology Solutions Co., Ltd. Address before: Beishan Industrial Zone Building in Yantian District of Shenzhen city of Guangdong Province in 518083 Patentee before: BGI-Shenzhen |