CN103049680A - gene sequencing data reading method and system - Google Patents
gene sequencing data reading method and system Download PDFInfo
- Publication number
- CN103049680A CN103049680A CN2012105920612A CN201210592061A CN103049680A CN 103049680 A CN103049680 A CN 103049680A CN 2012105920612 A CN2012105920612 A CN 2012105920612A CN 201210592061 A CN201210592061 A CN 201210592061A CN 103049680 A CN103049680 A CN 103049680A
- Authority
- CN
- China
- Prior art keywords
- gene sequencing
- blocks
- files
- sequencing data
- task
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012163 sequencing technique Methods 0.000 title claims abstract description 85
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 68
- 238000000034 method Methods 0.000 title claims abstract description 25
- 238000007405 data analysis Methods 0.000 claims abstract description 10
- 238000004458 analytical method Methods 0.000 claims description 7
- 230000011218 segmentation Effects 0.000 description 8
- 241000235342 Saccharomycetes Species 0.000 description 4
- 238000004891 communication Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 2
- 238000007480 sanger sequencing Methods 0.000 description 2
- 240000008067 Cucumis sativus Species 0.000 description 1
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- 208000025174 PANDAS Diseases 0.000 description 1
- 208000021155 Paediatric autoimmune neuropsychiatric disorders associated with streptococcal infection Diseases 0.000 description 1
- 240000000220 Panda oleosa Species 0.000 description 1
- 235000016496 Panda oleosa Nutrition 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 210000004027 cell Anatomy 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 210000002429 large intestine Anatomy 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 239000013535 sea water Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
Images
Landscapes
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
Claims (11)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210592061.2A CN103049680B (en) | 2012-12-29 | 2012-12-29 | gene sequencing data reading method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210592061.2A CN103049680B (en) | 2012-12-29 | 2012-12-29 | gene sequencing data reading method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103049680A true CN103049680A (en) | 2013-04-17 |
CN103049680B CN103049680B (en) | 2016-09-07 |
Family
ID=48062314
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210592061.2A Active CN103049680B (en) | 2012-12-29 | 2012-12-29 | gene sequencing data reading method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103049680B (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103559020A (en) * | 2013-11-07 | 2014-02-05 | 中国科学院软件研究所 | Method for realizing parallel compression and parallel decompression on FASTQ file containing DNA (deoxyribonucleic acid) sequence read data |
CN104657627A (en) * | 2013-11-18 | 2015-05-27 | 广州中国科学院软件应用技术研究所 | Searching and determining method and system started from FASTQ format read segment |
CN106096332A (en) * | 2016-06-28 | 2016-11-09 | 深圳大学 | Parallel fast matching method and system thereof towards the DNA sequence stored |
CN106407743A (en) * | 2016-08-31 | 2017-02-15 | 上海美吉生物医药科技有限公司 | Cluster-based high-throughput data analyzing method |
CN106603591A (en) * | 2015-10-14 | 2017-04-26 | 北京聚道科技有限公司 | Processing method and system facing transmission and preprocessing of genome detection data |
CN107145766A (en) * | 2017-03-27 | 2017-09-08 | 中国科学院深圳先进技术研究院 | Gene order read method and reading system |
CN107169313A (en) * | 2017-03-29 | 2017-09-15 | 中国科学院深圳先进技术研究院 | The read method and computer-readable recording medium of DNA data files |
CN109616156A (en) * | 2018-12-03 | 2019-04-12 | 郑州云海信息技术有限公司 | A kind of gene sequencing date storage method and device |
CN109997194A (en) * | 2016-11-03 | 2019-07-09 | 伊路米纳有限公司 | The system and method for exceptional value conspicuousness evaluation |
CN110506272A (en) * | 2016-10-11 | 2019-11-26 | 基因组系统公司 | For accessing with the method and apparatus of the biological data of access unit structuring |
CN110750362A (en) * | 2019-12-19 | 2020-02-04 | 深圳华大基因科技服务有限公司 | Method and apparatus for analyzing biological information, and storage medium |
CN111326216A (en) * | 2020-02-27 | 2020-06-23 | 中国科学院计算技术研究所 | Rapid partitioning method for big data gene sequencing file |
CN113192558A (en) * | 2021-05-26 | 2021-07-30 | 北京自由猫科技有限公司 | Reading and writing method for third-generation gene sequencing data and distributed file system |
-
2012
- 2012-12-29 CN CN201210592061.2A patent/CN103049680B/en active Active
Non-Patent Citations (2)
Title |
---|
曹宗雁 等: "超大规模序列比对计算的并行优化", 《计算机应用》 * |
郭新 等: "基于大规模序列比对软件的并行优化方案", 《计算机工程》 * |
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103559020B (en) * | 2013-11-07 | 2016-07-06 | 中国科学院软件研究所 | A kind of DNA reads ordinal number according to the compression of FASTQ file in parallel and decompression method |
CN103559020A (en) * | 2013-11-07 | 2014-02-05 | 中国科学院软件研究所 | Method for realizing parallel compression and parallel decompression on FASTQ file containing DNA (deoxyribonucleic acid) sequence read data |
CN104657627B (en) * | 2013-11-18 | 2017-12-05 | 广州中国科学院软件应用技术研究所 | The searching of FASTQ forms read beginning and determination methods and system |
CN104657627A (en) * | 2013-11-18 | 2015-05-27 | 广州中国科学院软件应用技术研究所 | Searching and determining method and system started from FASTQ format read segment |
CN106603591A (en) * | 2015-10-14 | 2017-04-26 | 北京聚道科技有限公司 | Processing method and system facing transmission and preprocessing of genome detection data |
CN106603591B (en) * | 2015-10-14 | 2020-02-07 | 北京聚道科技有限公司 | Processing method and system for genome detection data transmission and preprocessing |
CN106096332A (en) * | 2016-06-28 | 2016-11-09 | 深圳大学 | Parallel fast matching method and system thereof towards the DNA sequence stored |
CN106407743A (en) * | 2016-08-31 | 2017-02-15 | 上海美吉生物医药科技有限公司 | Cluster-based high-throughput data analyzing method |
CN106407743B (en) * | 2016-08-31 | 2019-03-05 | 上海美吉生物医药科技有限公司 | A kind of high-throughput data analysing method based on cluster |
CN110506272A (en) * | 2016-10-11 | 2019-11-26 | 基因组系统公司 | For accessing with the method and apparatus of the biological data of access unit structuring |
CN110506272B (en) * | 2016-10-11 | 2023-08-01 | 基因组系统公司 | Method and device for accessing bioinformatic data structured in access units |
CN109997194A (en) * | 2016-11-03 | 2019-07-09 | 伊路米纳有限公司 | The system and method for exceptional value conspicuousness evaluation |
CN107145766A (en) * | 2017-03-27 | 2017-09-08 | 中国科学院深圳先进技术研究院 | Gene order read method and reading system |
CN107169313A (en) * | 2017-03-29 | 2017-09-15 | 中国科学院深圳先进技术研究院 | The read method and computer-readable recording medium of DNA data files |
CN109616156A (en) * | 2018-12-03 | 2019-04-12 | 郑州云海信息技术有限公司 | A kind of gene sequencing date storage method and device |
CN110750362A (en) * | 2019-12-19 | 2020-02-04 | 深圳华大基因科技服务有限公司 | Method and apparatus for analyzing biological information, and storage medium |
CN111326216A (en) * | 2020-02-27 | 2020-06-23 | 中国科学院计算技术研究所 | Rapid partitioning method for big data gene sequencing file |
CN113192558A (en) * | 2021-05-26 | 2021-07-30 | 北京自由猫科技有限公司 | Reading and writing method for third-generation gene sequencing data and distributed file system |
Also Published As
Publication number | Publication date |
---|---|
CN103049680B (en) | 2016-09-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103049680A (en) | gene sequencing data reading method and system | |
Zhang et al. | Comprehensive profiling of circular RNAs with nanopore sequencing and CIRI-long | |
Wu et al. | Detection of differentially methylated regions from whole-genome bisulfite sequencing data without replicates | |
US20200232029A1 (en) | Systems and methods for mitochondrial analysis | |
Slatko et al. | Overview of next‐generation sequencing technologies | |
Laver et al. | Assessing the performance of the oxford nanopore technologies minion | |
Davis et al. | Kraken: a set of tools for quality control and analysis of high-throughput sequence data | |
Adiconis et al. | Comparative analysis of RNA sequencing methods for degraded or low-input samples | |
Tulin et al. | A quantitative reference transcriptome for Nematostella vectensis earlyembryonic development: a pipeline for de novo assembly in emergingmodel systems | |
Ozsolak et al. | RNA sequencing: advances, challenges and opportunities | |
Lake et al. | Deriving the genomic tree of life in the presence of horizontal gene transfer: conditioned reconstruction | |
Izuogu et al. | PTESFinder: a computational method to identify post-transcriptional exon shuffling (PTES) events | |
Ye et al. | Utilizing de Bruijn graph of metagenome assembly for metatranscriptome analysis | |
Kruse et al. | A complex network framework for unbiased statistical analyses of DNA–DNA contact maps | |
Deschamps et al. | Characterization, correction and de novo assembly of an Oxford Nanopore genomic dataset from Agrobacterium tumefaciens | |
CN103902852A (en) | Gene expression quantitative method and device | |
CN105760706A (en) | Compression method for next generation sequencing data | |
McDonald et al. | The evolutionary dynamics of tRNA-gene copy number and codon-use in E. coli. | |
Shiau et al. | High throughput single cell long-read sequencing analyses of same-cell genotypes and phenotypes in human tumors | |
Sauvage et al. | Promising prospects of nanopore sequencing for algal hologenomics and structural variation discovery | |
Theis et al. | RNA 3D modules in genome-wide predictions of RNA 2D structure | |
Florea | Bioinformatics of alternative splicing and its regulation | |
Wang et al. | UNI-RNA: universal pre-trained models revolutionize RNA research | |
Galata et al. | Functional meta-omics provide critical insights into long-and short-read assemblies | |
Wong et al. | SpliceWiz: interactive analysis and visualization of alternative splicing in R |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CB03 | Change of inventor or designer information |
Inventor after: Meng Jintao Inventor after: Wei Yanjie Inventor after: Cheng Jiefeng Inventor after: Feng Shengzhong Inventor before: Meng Jintao Inventor before: Wei Yanjie Inventor before: Cheng Jiefeng Inventor before: Feng Shengzhong |
|
CB03 | Change of inventor or designer information | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20211202 Address after: 518000 A-301, office building, Shenzhen Institute of advanced technology, No. 1068, Xue Yuan Avenue, Shenzhen University Town, Shenzhen, Guangdong, Nanshan District, China Patentee after: Shenzhen shen-tech advanced Cci Capital Ltd. Address before: 1068 No. 518055 Guangdong city in Shenzhen Province, Nanshan District City Xili University School Avenue Patentee before: SHENZHEN INSTITUTES OF ADVANCED TECHNOLOGY |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20220118 Address after: 518000 b402, blocks a and B, Nanshan medical device Industrial Park, No. 1019, Nanhai Avenue, Yanshan community, merchants street, Nanshan District, Shenzhen, Guangdong Patentee after: Shenzhen hongzhituoxin venture capital enterprise (L.P.) Address before: 518000 A-301, office building, Shenzhen Institute of advanced technology, No. 1068, Xue Yuan Avenue, Shenzhen University Town, Shenzhen, Guangdong, Nanshan District, China Patentee before: Shenzhen shen-tech advanced Cci Capital Ltd. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20220429 Address after: 518000 b402, blocks a and B, Nanshan medical device Industrial Park, No. 1019, Nanhai Avenue, Yanshan community, merchants street, Nanshan District, Shenzhen, Guangdong Patentee after: Senris Biotechnology (Shenzhen) Co.,Ltd. Address before: 518000 b402, blocks a and B, Nanshan medical device Industrial Park, No. 1019, Nanhai Avenue, Yanshan community, merchants street, Nanshan District, Shenzhen, Guangdong Patentee before: Shenzhen hongzhituoxin venture capital enterprise (L.P.) |