CN109997193A - 一种对特定群中的亚群进行定量分析的方法 - Google Patents

一种对特定群中的亚群进行定量分析的方法 Download PDF

Info

Publication number
CN109997193A
CN109997193A CN201680090780.0A CN201680090780A CN109997193A CN 109997193 A CN109997193 A CN 109997193A CN 201680090780 A CN201680090780 A CN 201680090780A CN 109997193 A CN109997193 A CN 109997193A
Authority
CN
China
Prior art keywords
matrix
frequency
snp
base
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201680090780.0A
Other languages
English (en)
Other versions
CN109997193B (zh
Inventor
彭也
李俊桦
唐珊媚
张慧
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BGI Shenzhen Co Ltd
Original Assignee
BGI Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BGI Shenzhen Co Ltd filed Critical BGI Shenzhen Co Ltd
Publication of CN109997193A publication Critical patent/CN109997193A/zh
Application granted granted Critical
Publication of CN109997193B publication Critical patent/CN109997193B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Engineering & Computer Science (AREA)
  • Medical Informatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biophysics (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

一种对特定群中的亚群进行定量分析的方法。具体地,包括以下步骤:(1)提供对应于所述特定群的(a)参考基因组序列数据、(b)参考SNP矩阵和(c)宏基因组测序数据;(2)将所述宏基因组测序数据比对到对应于所述特定群的参考基因组数据上,以便获得比对结果;(3)根据所述参考SNP矩阵的位点信息,构建频率矩阵;(4)根据频率矩阵对参考SNP矩阵做二值化处理,得到二值化SNP矩阵;和(5)基于所述的频率矩阵、所述的二值化SNP矩阵、理论碱基频率向量f(x)和观测碱基频率向量y,通过有约束的线性模型,得出所述特定群中各亚群的相对丰度,从而获得对所述特定群中的亚群的定量分析结果。

Description

PCT国内申请,说明书已公开。

Claims (10)

  1. PCT国内申请,权利要求书已公开。
CN201680090780.0A 2016-11-10 2016-11-10 一种对特定群中的亚群进行定量分析的方法 Active CN109997193B (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/105372 WO2018086045A1 (zh) 2016-11-10 2016-11-10 一种对特定群中的亚群进行定量分析的方法

Publications (2)

Publication Number Publication Date
CN109997193A true CN109997193A (zh) 2019-07-09
CN109997193B CN109997193B (zh) 2023-03-14

Family

ID=62109084

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201680090780.0A Active CN109997193B (zh) 2016-11-10 2016-11-10 一种对特定群中的亚群进行定量分析的方法

Country Status (2)

Country Link
CN (1) CN109997193B (zh)
WO (1) WO2018086045A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112151120A (zh) * 2020-09-23 2020-12-29 易会广 用于快速转录组表达定量的数据处理方法、装置及存储介质

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112786102B (zh) * 2021-01-25 2022-10-21 北京大学 一种基于宏基因组学分析精准识别水体中未知微生物群落的方法
CN114300055B (zh) * 2021-12-28 2023-04-25 江苏先声医学诊断有限公司 优化的宏基因组纳米孔测序数据定量方法

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102007407A (zh) * 2007-11-21 2011-04-06 考斯摩斯德公司 基因组鉴定系统
US20120004111A1 (en) * 2007-11-21 2012-01-05 Cosmosid Inc. Direct identification and measurement of relative populations of microorganisms with direct dna sequencing and probabilistic methods
CN102952854A (zh) * 2011-08-25 2013-03-06 深圳华大基因科技有限公司 单细胞分类和筛选方法及其装置
US20130073217A1 (en) * 2011-04-13 2013-03-21 The Board Of Trustees Of The Leland Stanford Junior University Phased Whole Genome Genetic Risk In A Family Quartet
CN103955629A (zh) * 2014-02-18 2014-07-30 吉林大学 基于模糊k均值的宏基因组片段聚类方法
CN105095688A (zh) * 2014-08-28 2015-11-25 吉林大学 检测人体肠道宏基因组的细菌群落及丰度的方法
CN105121661A (zh) * 2013-02-01 2015-12-02 加利福尼亚大学董事会 用于基因组组装及单体型定相的方法
CN106055924A (zh) * 2016-05-19 2016-10-26 完美(中国)有限公司 微生物操作分类单元确定和序列辅助分离

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102007407A (zh) * 2007-11-21 2011-04-06 考斯摩斯德公司 基因组鉴定系统
US20120004111A1 (en) * 2007-11-21 2012-01-05 Cosmosid Inc. Direct identification and measurement of relative populations of microorganisms with direct dna sequencing and probabilistic methods
US20130073217A1 (en) * 2011-04-13 2013-03-21 The Board Of Trustees Of The Leland Stanford Junior University Phased Whole Genome Genetic Risk In A Family Quartet
CN102952854A (zh) * 2011-08-25 2013-03-06 深圳华大基因科技有限公司 单细胞分类和筛选方法及其装置
CN105121661A (zh) * 2013-02-01 2015-12-02 加利福尼亚大学董事会 用于基因组组装及单体型定相的方法
CN103955629A (zh) * 2014-02-18 2014-07-30 吉林大学 基于模糊k均值的宏基因组片段聚类方法
CN105095688A (zh) * 2014-08-28 2015-11-25 吉林大学 检测人体肠道宏基因组的细菌群落及丰度的方法
CN106055924A (zh) * 2016-05-19 2016-10-26 完美(中国)有限公司 微生物操作分类单元确定和序列辅助分离

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
ALEXEEV,D.等: "Bacterial rose garden for metagenomic SNP-based phylogeny visualization", 《BIODATA MINING》 *
LUO,C.W.等: "ConStrains identifies microbial strains in metagenomic datasets", 《NAT.BIOTECHNOL》 *
NAYFACH,S.等: "An integrated metagenomics pipeline for strain profiling reveals novel patterns of bacterial transmission and biogeography", 《GENOME RES》 *
SAHL,J.W.等: "Phylogenetically typing bacterial strains from partial SNP genotypes observed from direct sequencing of clinical specimen metagenomic data", 《GENOME MEDICINE》 *
许先国等: "基于免疫分选的T淋巴细胞亚群嵌合性定量分析", 《中国卫生检验杂志》 *
陈波等: "宏基因组分类问题中的特征提取及其降维研究", 《计算机系统应用》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112151120A (zh) * 2020-09-23 2020-12-29 易会广 用于快速转录组表达定量的数据处理方法、装置及存储介质
CN112151120B (zh) * 2020-09-23 2024-03-12 易会广 用于快速转录组表达定量的数据处理方法、装置及存储介质

Also Published As

Publication number Publication date
CN109997193B (zh) 2023-03-14
WO2018086045A1 (zh) 2018-05-17

Similar Documents

Publication Publication Date Title
Olm et al. Consistent metagenome-derived metrics verify and delineate bacterial species boundaries
Almeida et al. A new genomic blueprint of the human gut microbiota
Knight et al. Best practices for analysing microbiomes
Kermarrec et al. Next‐generation sequencing to inventory taxonomic diversity in eukaryotic communities: a test for freshwater diatoms
Papudeshi et al. Optimizing and evaluating the reconstruction of Metagenome-assembled microbial genomes
Nielsen et al. Statistical approaches for DNA barcoding
US9593382B2 (en) Compositions and methods for identifying and comparing members of microbial communities using amplicon sequences
Thines et al. Ten reasons why a sequence-based nomenclature is not useful for fungi anytime soon
Davies et al. Building consensus around the assessment and interpretation of Symbiodiniaceae diversity
Foster et al. Measuring the microbiome: perspectives on advances in DNA-based techniques for exploring microbial life
Snedecor et al. Fast and accurate kinship estimation using sparse SNPs in relatively large database searches
Baran et al. Joint analysis of multiple metagenomic samples
CN109997193B (zh) 一种对特定群中的亚群进行定量分析的方法
JP2016518822A (ja) アセンブルされていない配列情報、確率論的方法、及び形質固有(trait−specific)のデータベースカタログを用いた生物材料の特性解析
Iwaszkiewicz‐Eggebrecht et al. Optimizing insect metabarcoding using replicated mock communities
Lobanov et al. Ecosystem-specific microbiota and microbiome databases in the era of big data
Bonilla-Rosso et al. Understanding microbial community diversity metrics derived from metagenomes: performance evaluation using simulated data sets
Epstein Uncultivated microorganisms
Smith et al. Scalable microbial strain inference in metagenomic data using StrainFacts
Ji et al. HOTSPOT: hierarchical host prediction for assembled plasmid contigs with transformer
Hickl et al. binny: an automated binning algorithm to recover high-quality genomes from complex metagenomic datasets
JP2023517904A (ja) 細菌ゲノムにおいてゲノム配列を検出するための分子技術
Ma et al. MetaBMF: a scalable binning algorithm for large-scale reference-free metagenomic studies
CN113260710A (zh) 用于通过多个定制掺合混合物验证微生物组序列处理和差异丰度分析的组合物、系统、设备和方法
Owen Bacterial taxonomics: finding the wood through the phylogenetic trees

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40010212

Country of ref document: HK

GR01 Patent grant
GR01 Patent grant