CN107451425A - A kind of bacterial metabolism function prediction analysis method based on microorganism rRNA gene sequencing technologies - Google Patents

A kind of bacterial metabolism function prediction analysis method based on microorganism rRNA gene sequencing technologies Download PDF

Info

Publication number
CN107451425A
CN107451425A CN201710720330.1A CN201710720330A CN107451425A CN 107451425 A CN107451425 A CN 107451425A CN 201710720330 A CN201710720330 A CN 201710720330A CN 107451425 A CN107451425 A CN 107451425A
Authority
CN
China
Prior art keywords
gene
function
bacterium
rrna
microorganism
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710720330.1A
Other languages
Chinese (zh)
Inventor
薛正晟
寇文伯
郭桐舟
姜丽荣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHANGHAI PERSONAL BIOTECHNOLOGY CO Ltd
Original Assignee
SHANGHAI PERSONAL BIOTECHNOLOGY CO Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHANGHAI PERSONAL BIOTECHNOLOGY CO Ltd filed Critical SHANGHAI PERSONAL BIOTECHNOLOGY CO Ltd
Priority to CN201710720330.1A priority Critical patent/CN107451425A/en
Publication of CN107451425A publication Critical patent/CN107451425A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression

Landscapes

  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

A kind of bacterial metabolism function prediction analysis method based on microorganism rRNA gene sequencing technologies disclosed by the invention, it is by the way that existing 16S rRNA gene sequencing data and microorganism reference gene group database known to metabolic function are compared, so as to realize the prediction to bacterium and ancient bacterium metabolic function;It is comprised the following steps that:(1) first infer that the gene function of their common ancestor is composed according to the 16S rRNA full length gene sequences of micrometer biological genome;(2) to other gene function spectrums for not surveying species are inferred in Greengenes databases, the Gene correlation spectrum of the ancient full pedigree in bacterium and bacterium domain is built;(3) obtained flora composition " mapping " will be sequenced into database, bacterial metabolism function is predicted.The present invention is by the way that existing 16SrRNA gene sequencing data and microorganism reference gene group database known to metabolic function are compared, so as to realize the prediction to bacterium and ancient bacterium metabolic function.

Description

A kind of bacterial metabolism function prediction analysis based on microorganism rRNA gene sequencing technologies Method
Technical field
The present invention relates to technical field of biological, more particularly to a kind of bacterium based on microorganism rRNA gene sequencing technologies Group's metabolic function prediction analysis method.
Background technology
The focal point of microorganism rRNA gene sequencing technologies is the Nomenclature Composition and Structure of Complexes of flora.But for microbial ecology For research, that most pays close attention to is undoubtedly flora possessed metabolic function., now can root with the development of data analysis technique According to known microbial genome data, to the sequencing data (typical such as the sequencing result of 16S rRNA genes) of flora composition The prediction of bacterial metabolism function is carried out, so as to which " identity " and their " function " of species is mapped.According to bacterial metabolism Function prediction result, on the one hand can one peep flora function spectrum general picture, play bacterial diversity composition spectrum sequencing it is cost-effective Advantage;Another aspect can also help to instruct the experimental design of follow-up grand genome De novo shotgun sequencings, more reasonably screen Sample for follow-up study.
The content of the invention
The technical problems to be solved by the invention are to provide a kind of flora based on microorganism rRNA gene sequencing technologies Metabolic function prediction analysis method.
The technical problems to be solved by the invention can be achieved through the following technical solutions:
A kind of bacterial metabolism function prediction analysis method based on microorganism rRNA gene sequencing technologies, it is by by now Some 16S rRNA gene sequencing data compare with microorganism reference gene group database known to metabolic function, so as to realize Prediction to bacterium and ancient bacterium metabolic function;Comprise the following steps that:
(1) first infer their common ancestor's according to the 16S rRNA full length gene sequences of micrometer biological genome Gene function is composed;
(2) other gene function spectrums for not surveying species in Greengenes databases are inferred, builds ancient bacterium and thin The Gene correlation spectrum of the full pedigree in bacterium domain;
(3) obtained flora composition " mapping " will be sequenced into database, bacterial metabolism function is predicted.
In a preferred embodiment of the invention, the step (3) is realized especially by the following manner:
(3.1) the 16S rRNA gene orders obtained to sequencing, carry out " closed " with reference to OTU divisions (Closed- Reference OTU picking), by being compared with Greengenes databases, each sequencing sequence of searching " refers to sequence Row nearest-neighbors ", and be classified as referring to OTU;
(3.2) according to the rRNA gene copy numbers of " reference sequences nearest-neighbors ", school is carried out to the OTU abundance matrix of acquisition Just;
(3.3) the gene function modal data such as KEGG/EggNOG according to corresponding to " reference sequences nearest-neighbors ", conversion prediction The overall metabolic function of flora.
As a result of technical scheme as above, the present invention is by by existing 16S rRNA gene sequencing data and metabolism Microorganism reference gene group database compares known to function, so as to realize the prediction to bacterium and ancient bacterium metabolic function.This The difference of different plant species 16S rRNA gene copy numbers is also contemplated during the prediction of invention, and to the species in initial data Abundance data is corrected, and makes prediction result more accurately and reliably.
Embodiment
A kind of bacterial metabolism function prediction analysis method based on microorganism rRNA gene sequencing technologies, it is by by now Some 16S rRNA gene sequencing data compare with microorganism reference gene group database known to metabolic function, so as to realize Prediction to bacterium and ancient bacterium metabolic function;Comprise the following steps that:
(1) first infer their common ancestor's according to the 16S rRNA full length gene sequences of micrometer biological genome Gene function is composed;
(2) other gene function spectrums for not surveying species in Greengenes databases are inferred, builds ancient bacterium and thin The Gene correlation spectrum of the full pedigree in bacterium domain;
(3) obtained flora composition " mapping " will be sequenced into database, bacterial metabolism function is predicted.
Above-mentioned steps (3) are realized especially by the following manner:
(3.1) the 16S rRNA gene orders obtained to sequencing, carry out " closed " with reference to OTU divisions (Closed- Reference OTU picking), by being compared with Greengenes databases, each sequencing sequence of searching " refers to sequence Row nearest-neighbors ", and be classified as referring to OTU;
(3.2) according to the rRNA gene copy numbers of " reference sequences nearest-neighbors ", school is carried out to the OTU abundance matrix of acquisition Just;
(3.3) the gene function modal data such as KEGG/EggNOG according to corresponding to " reference sequences nearest-neighbors ", conversion prediction The overall metabolic function of flora.

Claims (2)

1. a kind of bacterial metabolism function prediction analysis method based on microorganism rRNA gene sequencing technologies, it is by will be existing 16S rRNA gene sequencing data compared with microorganism reference gene group database known to metabolic function, so as to realize pair Bacterium and the prediction of ancient bacterium metabolic function;Characterized in that, comprise the following steps that:
(1) the first 16S rRNA full length gene sequences according to micrometer biological genome, the gene of their common ancestor is inferred Function is composed;
(2) to other gene function spectrums for not surveying species are inferred in Greengenes databases, ancient bacterium and bacterium domain is built The Gene correlation spectrum of full pedigree;
(3) obtained flora composition " mapping " will be sequenced into database, bacterial metabolism function is predicted.
A kind of 2. bacterial metabolism function prediction analysis side based on microorganism rRNA gene sequencing technologies as claimed in claim 1 Method, it is characterised in that the step (3) is realized especially by the following manner:
(3.1) to sequencing obtain 16S rRNA gene orders, carry out it is " closed " with reference to OTU divide, by with Greengenes databases compare, and find " the reference sequences nearest-neighbors " of each sequencing sequence, and are classified as referring to OTU;
(3.2) according to the rRNA gene copy numbers of " reference sequences nearest-neighbors ", the OTU abundance matrix of acquisition is corrected;
(3.3) the gene function modal data such as KEGG/EggNOG according to corresponding to " reference sequences nearest-neighbors ", conversion prediction flora Overall metabolic function.
CN201710720330.1A 2017-08-21 2017-08-21 A kind of bacterial metabolism function prediction analysis method based on microorganism rRNA gene sequencing technologies Pending CN107451425A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710720330.1A CN107451425A (en) 2017-08-21 2017-08-21 A kind of bacterial metabolism function prediction analysis method based on microorganism rRNA gene sequencing technologies

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710720330.1A CN107451425A (en) 2017-08-21 2017-08-21 A kind of bacterial metabolism function prediction analysis method based on microorganism rRNA gene sequencing technologies

Publications (1)

Publication Number Publication Date
CN107451425A true CN107451425A (en) 2017-12-08

Family

ID=60492903

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710720330.1A Pending CN107451425A (en) 2017-08-21 2017-08-21 A kind of bacterial metabolism function prediction analysis method based on microorganism rRNA gene sequencing technologies

Country Status (1)

Country Link
CN (1) CN107451425A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112365932A (en) * 2020-10-19 2021-02-12 北京大学 Method for constructing bacterial community genome scale metabolic network
CN113403409A (en) * 2021-06-13 2021-09-17 中国疾病预防控制中心传染病预防控制所 Bacterial species level detection and analysis method based on bacterial 16S rRNA gene sequence

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LDJ215323: "派森诺回馈新老用户助力国家自然科学基金撰写", 《HTTPS://MAX.BOOK118.COM/HTML/2017/0723/123844737.SHTM》 *
测序猫_PLTH1: "菌群代谢功能预测:宏基因组的一小步,多样性组成谱分析的一大步!", 《HTTPS://WWW.SOHU.COM/A/118675824_465237》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112365932A (en) * 2020-10-19 2021-02-12 北京大学 Method for constructing bacterial community genome scale metabolic network
CN113403409A (en) * 2021-06-13 2021-09-17 中国疾病预防控制中心传染病预防控制所 Bacterial species level detection and analysis method based on bacterial 16S rRNA gene sequence

Similar Documents

Publication Publication Date Title
Grady et al. Assembly and seasonality of core phyllosphere microbiota on perennial biofuel crops
Gweon et al. PIPITS: an automated pipeline for analyses of fungal internal transcribed spacer sequences from the I llumina sequencing platform
Yuan et al. Specialized microbiome of a halophyte and its role in helping non-host plants to withstand salinity
Chen et al. Miscanthus cultivation shapes rhizosphere microbial community structure and function as assessed by Illumina MiSeq sequencing combined with PICRUSt and FUNGUIld analyses
Konstantinidis et al. Classifying the uncultivated microbial majority: a place for metagenomic data in the Candidatus proposal
Mareš Multilocus and SSU rRNA gene phylogenetic analyses of available cyanobacterial genomes, and their relation to the current taxonomic system
Ihrmark et al. New primers to amplify the fungal ITS2 region–evaluation by 454-sequencing of artificial and natural communities
Kis-Papo et al. Genomic adaptations of the halophilic Dead Sea filamentous fungus Eurotium rubrum
Martin Molecular mycorrhizal symbiosis
Pesaro et al. Identification of novel Crenarchaeota and Euryarchaeota clusters associated with different depth layers of a forest soil
Sorensen et al. Ecological selection for small microbial genomes along a temperate-to-thermal soil gradient
Dumack et al. Rhogostomidae (Cercozoa) from soils, roots and plant leaves (Arabidopsis thaliana): Description of Rhogostoma epiphylla sp. nov. and R. cylindrica sp. nov.
Nguyen et al. Bacterial diversity and community structure in Korean ginseng field soil are shifted by cultivation time
Imhoff et al. Phylogeny and taxonomy of Chlorobiaceae
Annapurna et al. Whole genome shotgun sequence of Bacillus paralicheniformis strain KMS 80, a rhizobacterial endophyte isolated from rice (Oryza sativa L.)
Dong et al. Structure and function of bacterial microbiota in Eucommia ulmoides bark
Hayer et al. Identification of growing bacteria during litter decomposition in freshwater through quantitative stable isotope probing
Zhao et al. High-throughput sequencing analysis of the endophytic bacterial diversity and dynamics in roots of the halophyte Salicornia europaea
Sharma et al. 'Species concept'in microbial taxonomy and systematics
Zervas et al. Genomics of aerobic photoheterotrophs in wheat phyllosphere reveals divergent evolutionary patterns of photosynthetic genes in Methylobacterium spp.
CN107451425A (en) A kind of bacterial metabolism function prediction analysis method based on microorganism rRNA gene sequencing technologies
Kyselková et al. Evaluation of rhizobacterial indicators of tobacco black root rot suppressiveness in farmers' fields
Medina-Cordoba et al. Genomic characterization and computational phenotyping of nitrogen-fixing bacteria isolated from Colombian sugarcane fields
Ontivero et al. Impact of land use history on the arbuscular mycorrhizal fungal diversity in arid soils of Argentinean farming fields
Liu et al. Response of soil microbiome structure and its network profiles to four soil amendments in monocropping strawberry greenhouse

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20171208

RJ01 Rejection of invention patent application after publication