CN107590362B - Method for judging whether overlapping assembly is correct or incorrect based on long read sequence sequencing - Google Patents
Method for judging whether overlapping assembly is correct or incorrect based on long read sequence sequencing Download PDFInfo
- Publication number
- CN107590362B CN107590362B CN201710720048.3A CN201710720048A CN107590362B CN 107590362 B CN107590362 B CN 107590362B CN 201710720048 A CN201710720048 A CN 201710720048A CN 107590362 B CN107590362 B CN 107590362B
- Authority
- CN
- China
- Prior art keywords
- assembly
- read
- window
- result
- comparison
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 22
- 238000012163 sequencing technique Methods 0.000 title claims description 12
- 238000007671 third-generation sequencing Methods 0.000 claims abstract description 26
- 230000002159 abnormal effect Effects 0.000 claims description 30
- 238000001914 filtration Methods 0.000 claims description 9
- 238000012549 training Methods 0.000 claims description 6
- 238000012545 processing Methods 0.000 claims description 3
- 239000013598 vector Substances 0.000 claims description 3
- 238000002372 labelling Methods 0.000 claims 1
- 238000012937 correction Methods 0.000 description 8
- 241000244206 Nematoda Species 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 239000012634 fragment Substances 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 241000244203 Caenorhabditis elegans Species 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000013145 classification model Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 238000011331 genomic analysis Methods 0.000 description 1
- 238000012165 high-throughput sequencing Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710720048.3A CN107590362B (en) | 2017-08-21 | 2017-08-21 | Method for judging whether overlapping assembly is correct or incorrect based on long read sequence sequencing |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710720048.3A CN107590362B (en) | 2017-08-21 | 2017-08-21 | Method for judging whether overlapping assembly is correct or incorrect based on long read sequence sequencing |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107590362A CN107590362A (en) | 2018-01-16 |
CN107590362B true CN107590362B (en) | 2019-12-06 |
Family
ID=61041668
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710720048.3A Active CN107590362B (en) | 2017-08-21 | 2017-08-21 | Method for judging whether overlapping assembly is correct or incorrect based on long read sequence sequencing |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107590362B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113160893B (en) * | 2021-06-09 | 2022-08-19 | 中国科学院昆明植物研究所 | Mining plant ITSs sequence from second generation sequencing data and using the same for identifying variety families |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103617256A (en) * | 2013-11-29 | 2014-03-05 | 北京诺禾致源生物信息科技有限公司 | Method and device for processing file needing mutation detection |
CN104239750A (en) * | 2014-08-25 | 2014-12-24 | 北京百迈客生物科技有限公司 | High-throughput sequencing data-based genome de novo assembly method |
CN106156536A (en) * | 2015-04-15 | 2016-11-23 | 深圳华大基因科技有限公司 | The method and system that sample immune group storehouse sequencing data is processed |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2613248A1 (en) * | 2005-06-23 | 2006-12-28 | Keygene N.V. | Improved strategies for sequencing complex genomes using high throughput sequencing technologies |
EP2602734A1 (en) * | 2011-12-08 | 2013-06-12 | Koninklijke Philips Electronics N.V. | Robust variant identification and validation |
JP5938484B2 (en) * | 2012-01-20 | 2016-06-22 | 深▲せん▼華大基因医学有限公司Bgi Diagnosis Co., Ltd. | Method, system, and computer-readable storage medium for determining presence / absence of genome copy number variation |
CN104298892B (en) * | 2014-09-18 | 2017-05-10 | 天津诺禾致源生物信息科技有限公司 | Detection device and method for gene fusion |
SG11201705996PA (en) * | 2015-02-09 | 2017-09-28 | 10X Genomics Inc | Systems and methods for determining structural variation and phasing using variant call data |
-
2017
- 2017-08-21 CN CN201710720048.3A patent/CN107590362B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103617256A (en) * | 2013-11-29 | 2014-03-05 | 北京诺禾致源生物信息科技有限公司 | Method and device for processing file needing mutation detection |
CN104239750A (en) * | 2014-08-25 | 2014-12-24 | 北京百迈客生物科技有限公司 | High-throughput sequencing data-based genome de novo assembly method |
CN106156536A (en) * | 2015-04-15 | 2016-11-23 | 深圳华大基因科技有限公司 | The method and system that sample immune group storehouse sequencing data is processed |
Non-Patent Citations (2)
Title |
---|
Multiple Sequence Assembly from Reads Alignable to a Common Reference Genome;Qian Peng等;《 IEEE/ACM Transactions on Computational Biology and Bioinformatics 》;20101028;第1283-1295页 * |
下一代测序纠错方法综述;江育娥 等;《北京工业大学学报》;20160531;第42卷(第3期);第377-386页 * |
Also Published As
Publication number | Publication date |
---|---|
CN107590362A (en) | 2018-01-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10354747B1 (en) | Deep learning analysis pipeline for next generation sequencing | |
US20130166221A1 (en) | Method and system for sequence correlation | |
CN106650739B (en) | Novel license plate character cutting method | |
CN108197434B (en) | Method for removing human gene sequence in metagenome sequencing data | |
CN111584006B (en) | Circular RNA identification method based on machine learning strategy | |
CN1008022B (en) | Character recognition system | |
CN105389481A (en) | Method for detecting variable spliceosome in third generation full-length transcriptome | |
CN111081315A (en) | Method for detecting homologous pseudogene variation | |
CN110692101A (en) | Method for aligning targeted nucleic acid sequencing data | |
KR20140006846A (en) | Data analysis of dna sequences | |
CN112086131B (en) | Screening method for false positive variation sites in resequencing database | |
CN104794371A (en) | Method and device for detecting insertion polymorphism of retrotransposon | |
CN107590362B (en) | Method for judging whether overlapping assembly is correct or incorrect based on long read sequence sequencing | |
CN112733884A (en) | Welding defect recognition model training method and device and computer terminal | |
CN115101124A (en) | Whole genome allele identification method and device | |
CN111180013A (en) | Device for detecting blood disease fusion gene | |
US11335438B1 (en) | Detecting false positive variant calls in next-generation sequencing | |
CN114155914B (en) | Detection and correction system based on metagenome splicing errors | |
CN116564406A (en) | Automatic analysis method and equipment for genetic variation | |
CN113571132B (en) | Method for judging sample degradation based on CNV result | |
CN112397148A (en) | Sequence comparison method, sequence correction method and device thereof | |
CN111916147B (en) | Transcript classification method | |
CN114627967A (en) | Method for accurately annotating three-generation full-length transcript | |
CN116646006B (en) | Tumor related gene system mutation detection method and device based on high-throughput sequencing and Gaussian mixture model | |
CN113378244B (en) | Intelligent electronic signature calling system and method based on data analysis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: A method for judging the correctness and error of overlapping assembly based on long reading sequence sequencing Effective date of registration: 20210918 Granted publication date: 20191206 Pledgee: Wuhan area branch of Hubei pilot free trade zone of Bank of China Ltd. Pledgor: WUHAN FRASERGEN INFORMATION Co.,Ltd. Registration number: Y2021420000096 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Granted publication date: 20191206 Pledgee: Wuhan area branch of Hubei pilot free trade zone of Bank of China Ltd. Pledgor: WUHAN FRASERGEN INFORMATION CO.,LTD. Registration number: Y2021420000096 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: A method for determining the correctness of overlapping assembly based on long read sequencing Granted publication date: 20191206 Pledgee: Guanggu Branch of Wuhan Rural Commercial Bank Co.,Ltd. Pledgor: WUHAN FRASERGEN INFORMATION CO.,LTD. Registration number: Y2024980021037 |