CN107451428A - 下一代测序中末端短串联序列的优化处理方法 - Google Patents
下一代测序中末端短串联序列的优化处理方法 Download PDFInfo
- Publication number
- CN107451428A CN107451428A CN201710650049.5A CN201710650049A CN107451428A CN 107451428 A CN107451428 A CN 107451428A CN 201710650049 A CN201710650049 A CN 201710650049A CN 107451428 A CN107451428 A CN 107451428A
- Authority
- CN
- China
- Prior art keywords
- sequence
- sequencing
- noise
- short tandem
- treatment method
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012163 sequencing technique Methods 0.000 title claims abstract description 31
- 238000000034 method Methods 0.000 title claims abstract description 14
- 238000002864 sequence alignment Methods 0.000 claims abstract description 16
- 238000012545 processing Methods 0.000 claims abstract description 14
- 229920001519 homopolymer Polymers 0.000 claims abstract description 10
- 238000010801 machine learning Methods 0.000 claims abstract description 6
- 230000004069 differentiation Effects 0.000 claims description 5
- 238000002790 cross-validation Methods 0.000 claims description 3
- 230000007935 neutral effect Effects 0.000 claims description 3
- 230000009467 reduction Effects 0.000 claims description 3
- 238000012552 review Methods 0.000 claims description 3
- 238000012360 testing method Methods 0.000 claims description 3
- 238000001514 detection method Methods 0.000 abstract description 5
- 238000005516 engineering process Methods 0.000 abstract description 5
- 108090000623 proteins and genes Proteins 0.000 abstract description 2
- 208000003028 Stuttering Diseases 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000003672 processing method Methods 0.000 description 2
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- Theoretical Computer Science (AREA)
- Artificial Intelligence (AREA)
- Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Bioethics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Epidemiology (AREA)
- Evolutionary Computation (AREA)
- Public Health (AREA)
- Software Systems (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710650049.5A CN107451428B (zh) | 2017-08-02 | 2017-08-02 | 下一代测序中末端短串联序列的优化处理方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710650049.5A CN107451428B (zh) | 2017-08-02 | 2017-08-02 | 下一代测序中末端短串联序列的优化处理方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107451428A true CN107451428A (zh) | 2017-12-08 |
CN107451428B CN107451428B (zh) | 2020-05-22 |
Family
ID=60490716
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710650049.5A Active CN107451428B (zh) | 2017-08-02 | 2017-08-02 | 下一代测序中末端短串联序列的优化处理方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107451428B (zh) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110257889A1 (en) * | 2010-02-24 | 2011-10-20 | Pacific Biosciences Of California, Inc. | Sequence assembly and consensus sequence determination |
CN103975329A (zh) * | 2011-12-08 | 2014-08-06 | 皇家飞利浦有限公司 | 鲁棒的变异识别和验证 |
CN104615911A (zh) * | 2015-01-12 | 2015-05-13 | 上海交通大学 | 基于稀疏编码及链学习预测膜蛋白beta-barrel跨膜区域的方法 |
CN105980578A (zh) * | 2013-12-16 | 2016-09-28 | 考利达基因组股份有限公司 | 用于使用机器学习进行dna测序的碱基判定器 |
CN105989246A (zh) * | 2015-01-28 | 2016-10-05 | 深圳华大基因研究院 | 一种基于基因组组装的变异检测方法和装置 |
CN106599614A (zh) * | 2016-11-07 | 2017-04-26 | 为朔医学数据科技(北京)有限公司 | 一种高通量测序数据处理及分析流程控制方法及系统 |
CN106845155A (zh) * | 2016-12-29 | 2017-06-13 | 安诺优达基因科技(北京)有限公司 | 一种用于检测内部串联重复的装置 |
-
2017
- 2017-08-02 CN CN201710650049.5A patent/CN107451428B/zh active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110257889A1 (en) * | 2010-02-24 | 2011-10-20 | Pacific Biosciences Of California, Inc. | Sequence assembly and consensus sequence determination |
CN103975329A (zh) * | 2011-12-08 | 2014-08-06 | 皇家飞利浦有限公司 | 鲁棒的变异识别和验证 |
CN105980578A (zh) * | 2013-12-16 | 2016-09-28 | 考利达基因组股份有限公司 | 用于使用机器学习进行dna测序的碱基判定器 |
CN104615911A (zh) * | 2015-01-12 | 2015-05-13 | 上海交通大学 | 基于稀疏编码及链学习预测膜蛋白beta-barrel跨膜区域的方法 |
CN105989246A (zh) * | 2015-01-28 | 2016-10-05 | 深圳华大基因研究院 | 一种基于基因组组装的变异检测方法和装置 |
CN106599614A (zh) * | 2016-11-07 | 2017-04-26 | 为朔医学数据科技(北京)有限公司 | 一种高通量测序数据处理及分析流程控制方法及系统 |
CN106845155A (zh) * | 2016-12-29 | 2017-06-13 | 安诺优达基因科技(北京)有限公司 | 一种用于检测内部串联重复的装置 |
Non-Patent Citations (3)
Title |
---|
KEVIN VERVIER 等: "Large-scale machine learning for metagenomics sequence classification", 《BIOINFORMATICS》 * |
刘圣 等: "下一代测序数据的质量控制研究", 《军事医学》 * |
毛成光: "两核昔酸实时合成测序信息分析", 《中国优秀硕士学位论文全文数据库 基础科学辑》 * |
Also Published As
Publication number | Publication date |
---|---|
CN107451428B (zh) | 2020-05-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Dueholm et al. | Generation of comprehensive ecosystem-specific reference databases with species-level resolution by high-throughput full-length 16S rRNA gene sequencing and automated taxonomy assignment (AutoTax) | |
Sha et al. | Effect of low-expression gene filtering on detection of differentially expressed genes in RNA-seq data | |
CN107403074B (zh) | 一种突变蛋白的检测方法及装置 | |
US10127351B2 (en) | Accurate and fast mapping of reads to genome | |
CN104657628A (zh) | 基于Proton的转录组测序数据的比较分析方法和系统 | |
CN102682224B (zh) | 检测拷贝数变异的方法和装置 | |
CN103993074B (zh) | 水稻黄单胞杆菌的分子标记及其应用 | |
CN104630206A (zh) | 转录组文库的构建方法 | |
CN107267646A (zh) | 一种基于下一代测序的多基因融合检测方法 | |
CN114121160B (zh) | 一种检测样本中宏病毒组的方法和系统 | |
Sánchez‐Vallet et al. | Nature's genetic screens: using genome‐wide association studies for effector discovery | |
CN105950707A (zh) | 一种确定核酸序列的方法及系统 | |
CN109920480B (zh) | 一种校正高通量测序数据的方法和装置 | |
CN107451428A (zh) | 下一代测序中末端短串联序列的优化处理方法 | |
CN105063210A (zh) | 一种环状rna的鉴定方法 | |
CN103184275A (zh) | 一种水稻基因组基因标识的新方法 | |
Warwick-Dugdale et al. | Long-read powered viral metagenomics in the oligotrophic Sargasso Sea | |
CN101024851A (zh) | 基于梯状回收的基因拷贝数鉴定和各拷贝序列获得的方法 | |
CN113311168A (zh) | 金黄色葡萄球菌耐药表型蛋白质指纹图谱库的构建方法 | |
Gülay et al. | An improved method to set significance thresholds for β diversity testing in microbial community comparisons | |
CN113971986B (zh) | 一种通过序列相似性排查测序样本交叉污染的方法 | |
CN115410649B (zh) | 一种同时检测甲基化和突变信息的方法及装置 | |
CN113699222A (zh) | 一种基于dna甲基化位点基因型的全基因组分型方法 | |
CN117935927A (zh) | 一种黏连蛋白介导的细胞特异性染色质环的预测方法 | |
CN110660452B (zh) | 检测细菌基因水平转移dna片段及转移供体菌株的方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Optimization of terminal short tandem sequences in next generation sequencing Effective date of registration: 20211214 Granted publication date: 20200522 Pledgee: Bank of China Limited by Share Ltd. Guangzhou Panyu branch Pledgor: GUANGDONG ARDENT BIOMED TECHNOLOGY CO.,LTD. Registration number: Y2021980014989 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Granted publication date: 20200522 Pledgee: Bank of China Limited by Share Ltd. Guangzhou Panyu branch Pledgor: GUANGDONG ARDENT BIOMED TECHNOLOGY CO.,LTD. Registration number: Y2021980014989 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right |