CN116790779A - Reference composition, kit and method for quantifying absolute abundance of microbial population - Google Patents
Reference composition, kit and method for quantifying absolute abundance of microbial population Download PDFInfo
- Publication number
- CN116790779A CN116790779A CN202311018065.4A CN202311018065A CN116790779A CN 116790779 A CN116790779 A CN 116790779A CN 202311018065 A CN202311018065 A CN 202311018065A CN 116790779 A CN116790779 A CN 116790779A
- Authority
- CN
- China
- Prior art keywords
- internal reference
- sample
- seq
- dna
- copy number
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 35
- 239000000203 mixture Substances 0.000 title claims abstract description 23
- 230000000813 microbial effect Effects 0.000 title claims abstract description 21
- 238000012163 sequencing technique Methods 0.000 claims abstract description 33
- 238000004458 analytical method Methods 0.000 claims abstract description 14
- 241000894007 species Species 0.000 claims description 16
- 244000005700 microbiome Species 0.000 claims description 11
- 238000012408 PCR amplification Methods 0.000 claims description 8
- 230000001580 bacterial effect Effects 0.000 claims description 8
- 238000002156 mixing Methods 0.000 claims description 8
- 238000011002 quantification Methods 0.000 claims description 7
- 238000007400 DNA extraction Methods 0.000 claims description 6
- 238000012937 correction Methods 0.000 claims description 3
- 238000001914 filtration Methods 0.000 claims description 3
- 230000002194 synthesizing effect Effects 0.000 claims description 2
- 108020004414 DNA Proteins 0.000 abstract description 51
- 238000002474 experimental method Methods 0.000 abstract description 24
- 230000003321 amplification Effects 0.000 abstract description 16
- 238000003199 nucleic acid amplification method Methods 0.000 abstract description 16
- 108091093088 Amplicon Proteins 0.000 abstract description 10
- 230000007613 environmental effect Effects 0.000 abstract description 5
- 238000006243 chemical reaction Methods 0.000 abstract description 2
- 230000008569 process Effects 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 6
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 108091028043 Nucleic acid sequence Proteins 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 238000003908 quality control method Methods 0.000 description 5
- 239000002096 quantum dot Substances 0.000 description 5
- 108020004707 nucleic acids Proteins 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 150000007523 nucleic acids Chemical class 0.000 description 4
- 239000002689 soil Substances 0.000 description 4
- 238000011161 development Methods 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 108020004465 16S ribosomal RNA Proteins 0.000 description 2
- 229920000936 Agarose Polymers 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 238000007621 cluster analysis Methods 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000013400 design of experiment Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 238000011049 filling Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 108090000623 proteins and genes Proteins 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000007671 third-generation sequencing Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 108020004463 18S ribosomal RNA Proteins 0.000 description 1
- 108020000946 Bacterial DNA Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 102100030569 Nuclear receptor corepressor 2 Human genes 0.000 description 1
- 101710153660 Nuclear receptor corepressor 2 Proteins 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 239000003570 air Substances 0.000 description 1
- 238000003766 bioinformatics method Methods 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000019771 cognition Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 230000006806 disease prevention Effects 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000005530 etching Methods 0.000 description 1
- 238000000105 evaporative light scattering detection Methods 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 238000012165 high-throughput sequencing Methods 0.000 description 1
- 238000013427 histology analysis Methods 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000004630 mental health Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008121 plant development Effects 0.000 description 1
- 230000008635 plant growth Effects 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B50/00—Methods of creating libraries, e.g. combinatorial synthesis
- C40B50/06—Biochemical methods, e.g. using enzymes or whole viable microorganisms
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/30—Detection of binding sites or motifs
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
- G16B25/20—Polymerase chain reaction [PCR]; Primer or probe design; Probe optimisation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/10—Sequence alignment; Homology search
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/166—Oligonucleotides used as internal standards, controls or normalisation probes
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Analytical Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Medical Informatics (AREA)
- Genetics & Genomics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Computational Biology (AREA)
- Microbiology (AREA)
- Evolutionary Biology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Immunology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Bioethics (AREA)
- Data Mining & Analysis (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Software Systems (AREA)
- Public Health (AREA)
- Evolutionary Computation (AREA)
- Epidemiology (AREA)
- Databases & Information Systems (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The application relates to an internal reference composition, a kit and a method for quantifying absolute abundance of a microbial population, wherein the internal reference composition comprises 8 internal reference sequences shown in SEQ ID No.1-SEQ ID No.8, the SEQ ID No.1-SEQ ID No.8 has a concentration gradient from low to high, and the ratio of the internal reference composition to a sample is 10% -30%. The absolute content of different microbial communities is quantified by adding reference DNA with different concentration gradients into an environmental sample, amplicons with expected sizes are generated by the reference DNA molecules in the PCR reaction process, a standard curve between the copy number of the reference DNA and reads is established by combining the copy number of the reference DNA and the reads output by sequencing, the absolute abundance of the copy number of each flora in the sample can be further calculated, and the absolute abundance of the species in a unit sample can be accurately estimated. The method can obtain two analysis results of relative quantitative and absolute quantitative results at the same time by one amplification experiment, saves the consumption of samples and internal reference DNA, and saves sequencing cost to the greatest extent.
Description
Technical Field
The application belongs to the field of biotechnology and histology analysis, and particularly relates to an internal reference composition, a kit and a method for quantifying absolute abundance of a microbial population.
Background
Microorganisms are the most abundant biological resources on the earth, play an important role in soil, plant growth and development, food fermentation, human activities and the like, promote the development of global element circulation, ecological environment treatment, biopharmaceutical, disease diagnosis, disease prevention and the like, and influence the aspects of human immunity, digestion, mental health and the like.
With the development of second generation sequencing technology, research and cognition on microbial community diversity and evolution has been greatly improved, however, accurate characterization of microbial communities has been a challenging problem, and amplicon sequencing based on 16S rRNA, 18S rRNA and ITS genes is a common tool in microbial ecology to measure specific microbial community structure and feature distribution.
The community composition condition of the sample can be known by extracting DNA in the sample, carrying out PCR amplification on the 16S or ITS gene region, constructing a library, sequencing based on a sequencing platform such as Illumina, pacBio and the like, then carrying out sequence analysis and species annotation by combining a bioinformatics method, and further comparing the differences between the samples by alpha and beta diversity analysis. Compared with the second generation sequencing, which only selects 1-2 hypervariable regions for sequencing, the third generation sequencing can obtain a full-length 16S or ITS sequence, more variable region information can improve the resolution of species identification, can improve the identification precision of community composition, and can more truly reduce the microbial community structure in a sample. The amplicon sequencing method has important guiding function on researching the analysis of the microbial community composition in the environments such as water, soil, air, intestinal feces and the like, and is also widely used in phylogenetic and ecological researches.
Although the conventional amplicon sequencing technology has the great advantages of high throughput, low cost and objective reduction of the flora structure and the relative abundance ratio, most of the amplicon sequencing technologies at present only generate data of relative abundance due to the technical limitation, only the percentage of a certain species in the whole environmental sample can be described, the absolute content of the species in the environmental community can not be represented, the high false positive rate and related analysis deviation in OTU analysis can be caused, and the absolute abundance change of the microbial community can not be represented due to the real number of the species in the sample and the real difference of the samples among groups, which is an important problem faced by the amplicon sequencing technology. For example, species S accounts for 10% and 20% of sample a and sample B, respectively, and when the total amounts of microorganisms in sample A, B are consistent, the absolute abundance of species S in sample a is higher than that in sample B; if A sample (10) 9 The total amount of microorganisms in cells/g was less than that in sample B (10 10 cells/g), the absolute abundance of species S in the a samples is lower than that in the B samples.
Therefore, the method can correct amplicon sequencing bias, accurately determine microorganism population abundance, increase comparability among samples, further truly reflect the real number of species in the samples and the real difference of samples among groups, make sequencing results more accurate and reliable, make up for the defects of the conventional amplicon sequencing technology method, and have very important significance for comprehensively researching and knowing the environmental microorganism community structure and taxonomies thereof.
Disclosure of Invention
Aiming at the technical problems existing at present, the application provides an internal reference composition, a kit and a method for quantifying absolute abundance of a microbial population.
In order to achieve the above purpose, the application adopts the following technical means:
in a first aspect, the application provides an internal reference composition for quantifying the absolute abundance of a microbial population, comprising 8 internal reference sequences as shown in SEQ ID No.1-SEQ ID No. 8.
Further, the reference composition has a concentration gradient from low to high of SEQ ID No.1-SEQ ID No.8, and the ratio of the reference composition to the sample is 10% -30%, in some embodiments the ratio of the concentration gradient of SEQ ID No.1-SEQ ID No.8 in the reference composition is 1:3:6:15:30:60:150:300. the design of the internal reference DNA sequence needs to contain a prokaryotic universal primer binding site, an optimized synthetic filling sequence with the same length and GC content as the in-vivo target and a cloning vector which is easy to obtain and process, and the designed sequence has no matching relation with the sequence of a natural species known in nature through NCBI verification.
Further, the kit also comprises a forward primer SEQ ID No.9 and a reverse primer SEQ ID No.11; forward primer SEQ ID No.11 and reverse primer SEQ ID No.12; forward primer SEQ ID No.13 and reverse primer SEQ ID No.14; one of 4 pairs of forward primer SEQ ID No.15 and reverse primer SEQ ID No.16. In some embodiments, the primer pair is compatible with 16S and ITS amplification regions, and the correspondence between primer names and primer sequences is: ITS3-F corresponds to sequence SEQ ID No.9; ITS4-R corresponds to sequence SEQ ID No.10;27F corresponds to sequence SEQ ID No.11;1492R corresponds to sequence SEQ ID No.12; ITS1F corresponds to sequence SEQ ID No.13; ITS4R corresponds to sequence SEQ ID No.14;338F corresponds to sequence SEQ ID No.15;806R corresponds to the sequence SEQ ID No.16.
In some embodiments, 27F corresponding sequence SEQ ID No.11 is used as a forward primer; 1492R corresponding sequence SEQ ID No.12 is a reverse primer, and 16S full-length amplification is carried out.
In a second aspect the application provides a kit for quantifying absolute abundance of a population of microorganisms, the kit comprising a reference composition as hereinbefore described.
In a third aspect, the application provides a method for quantifying absolute abundance of a microorganism population, comprising the steps of:
s1, synthesizing internal reference DNA according to the internal reference sequence of claim 1;
s2, sample DNA extraction:
s3, adding the internal reference composition meeting the requirement of claim 2 into sample DNA, adding the primer pair of claim 3, uniformly mixing, and performing PCR amplification to obtain a PCR product; in some embodiments, S3 comprises amplification with primers with different barcode, several different bases added in front of the forward and reverse primers, as a barcode tag to label and distinguish between different PCR amplification products;
s4, constructing a library and sequencing on a machine: purifying the PCR product amplified in the step S3, constructing a library and sequencing; in some embodiments, S4 comprises concentration comparison of the different PCR amplification products obtained, and mixing and purifying the PCR products on an equal basis. Library construction requires that the total amount of each PCR product be the same and that the fragments be similar, and that the volumes of each PCR product be mixed based on the concentrations determined.
S5, controlling the quality of the data of the off-machine; in some embodiments, second generation sequencing is adopted, and the quality control of the data of the next generation comprises removing the joint sequence, low quality and repeated sequence and the like, so as to obtain high quality sequencing data; in some embodiments, three generations of sequencing are used, off-machine data quality control includes sequence correction, removal of low quality and short sequences, resulting in high quality sequencing data.
S6, constructing a standard curve: comparing the qualified off-machine data with an internal reference sequence database, outputting an OTU table of the internal reference DNA, and constructing a standard curve between the copy number of the internal reference DNA and the reads number based on the copy number of the internal reference DNA and the reads data of the internal reference sequence sequenced by the off-machine;
s7, outputting a quantitative result.
Further, S1 further comprises the following steps: and (3) measuring the concentration of the synthesized reference DNA, and accurately calculating the copy number of the reference DNA.
Further, S6 further includes a copy number correction step: and (3) further correcting the copy number of the bacterial groups of the sample according to the constructed standard curve, and calculating the absolute copy number of each bacterial group in the sample.
Further, the step S6 further comprises the following steps: and filtering the internal reference sequence of the machine-setting data with qualified quality control to obtain a high-quality non-internal reference effective sequence for species analysis.
In some embodiments, the sample type is soil, water, distillers grains, fermented grains, pit mud, stool, or the like; sample DNA extraction adopts extraction methods known in the industry, and a suitable DNA extraction method is selected according to different types of samples so as to improve the concentration and total amount of DNA.
In some embodiments, high quality non-internal reference effective sequences are obtained for OTU abundance analysis, OTU cluster analysis, species annotation analysis, and statistical analysis, among others.
Compared with the prior art, the application has the beneficial effects that:
1. the method has wide application range, can cover samples of different types, can select and combine the advantages of sequencing methods of second generation sequencing and third generation sequencing, and accurately quantitate the content of different bacterial groups in the samples.
2. According to the method, the absolute content of different microbial communities can be quantified by adding the internal reference DNA with different concentration gradients into the environmental sample; in the PCR reaction process, the reference DNA molecule can generate amplicons with expected sizes, and the absolute abundance of each flora copy number in a sample can be further calculated by combining the copy number of the reference DNA and the ready number of sequencing output and establishing a standard curve between the reference DNA copy number (log copy number) and reads (log reads), so that the absolute abundance of the species in a unit sample can be accurately estimated.
3. The method can make the processing conditions of the sample to be detected in the operation step consistent with those of the internal reference system by constructing the internal reference system, so that the amplification efficiency of the sample to be detected and the internal reference system are as close as possible, and absolute quantification is realized.
4. The method can avoid the speed difference in the amplification process caused by the possible difference between different microorganism concentrations in the sample, thereby ensuring more accurate calculation result.
5. The method can obtain two analysis results of relative quantitative and absolute quantitative results at the same time by one amplification experiment, saves the consumption of samples and internal reference DNA, and saves sequencing cost to the greatest extent.
Drawings
FIG. 1 is a standard graph of experimental group 1 in example 3;
FIG. 2 is a standard graph of experimental group 2 in example 3;
FIG. 3 is a standard graph of experimental group 3 in example 3;
FIG. 4 is a graph comparing the quantitative results of the sample flora in example 3;
FIG. 5 is a graph showing the relative concentration of uncorrected bacterial cells in a fermented grain sample according to example 4;
FIG. 6 is a standard graph of the fermented grain sample experimental group 1 in example 4;
FIG. 7 is a standard graph of the fermented grain sample experimental group 2 in example 4;
FIG. 8 is a standard graph of the fermented grain sample experimental group 3 in example 4;
FIG. 9 is a graph showing the absolute content of the bacterial strain after calibration of the fermented grain sample in example 4.
Detailed Description
Unless otherwise indicated, implied from the context, or common denominator in the art, all parts and percentages in the present application are based on weight and the test and characterization methods used are synchronized with the filing date of the present application. Where applicable, the disclosure of any patent, patent application, or publication referred to in this disclosure is incorporated herein by reference in its entirety, and the equivalent patents are incorporated herein by reference, especially with respect to the definitions of synthetic techniques, product and process designs, polymers, comonomers, initiators or catalysts, etc. in the art, as disclosed in these documents. If the definition of a particular term disclosed in the prior art is inconsistent with any definition provided in the present application, the definition of the term provided in the present application controls.
The numerical ranges in the present application are approximations, so that it may include the numerical values outside the range unless otherwise indicated. The numerical range includes all values from the lower value to the upper value that increase by 1 unit, provided that there is a spacing of at least 2 units between any lower value and any higher value. For example, if a component, physical or other property (e.g., molecular weight, melt index, etc.) is recited as being 100 to 1000, it is intended that all individual values, e.g., 100, 101, 102, etc., and all subranges, e.g., 100 to 166, 155 to 170, 198 to 200, etc., are explicitly recited. For ranges containing values less than 1 or containing fractions greater than 1 (e.g., 1.1,1.5, etc.), then 1 unit is suitably considered to be 0.0001,0.001,0.01, or 0.1. For a range containing units of less than 10 (e.g., 1 to 5), 1 unit is generally considered to be 0.1. These are merely specific examples of what is intended to be provided, and all possible combinations of numerical values between the lowest value and the highest value enumerated are to be considered to be expressly stated in this disclosure.
As used with respect to chemical compounds, the singular includes all isomeric forms and vice versa unless explicitly stated otherwise (e.g., "hexane" includes all isomers of hexane, either individually or collectively). In addition, unless explicitly stated otherwise, the use of the terms "a," "an," or "the" include plural referents.
The terms "comprises," "comprising," "including," and their derivatives do not exclude the presence of any other component, step or process, and are not related to whether or not such other component, step or process is disclosed in the present application. For the avoidance of any doubt, all use of the terms "comprising", "including" or "having" herein, unless expressly stated otherwise, may include any additional additive, adjuvant or compound. Rather, the term "consisting essentially of … …" excludes any other component, step or process from the scope of any of the terms recited below, as those out of necessity for operability. The term "consisting of … …" does not include any components, steps or processes not specifically described or listed. The term "or" refers to the listed individual members or any combination thereof unless explicitly stated otherwise.
In order to make the technical problems, technical schemes and beneficial effects solved by the application more clear, the application is further described in detail below with reference to the embodiments.
Examples
The following examples are presented herein to demonstrate preferred embodiments of the present application. It will be appreciated by those skilled in the art that the techniques disclosed in the examples which follow represent techniques discovered by the inventor to function in the practice of the application, and thus can be considered to constitute preferred modes for its practice. Those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit or scope of the application.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs, the disclosure of which is incorporated herein by reference as is commonly understood by reference. Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the application described herein. Such equivalents are intended to be encompassed by the claims.
EXAMPLE 1 design and Synthesis of reference DNA
The design of the internal reference DNA sequence needs to contain a prokaryotic universal primer binding site, an optimized synthetic filling sequence with the same length and GC content as the in-vivo target and a cloning vector which is easy to obtain and process, and the designed sequence has no matching relation with the sequence of a natural species known in nature through NCBI verification.
The internal reference DNA sequence designed by the application is shown in Table 1, the internal reference DNA sequence is inserted into plasmid, and plasmid extraction is carried out after amplification culture.
According to the sequences of the reference DNAs shown in Table 1, the DNA was sent to Shanghai Biotechnology Co., ltd and synthesized according to the standard synthetic procedure to obtain 8 reference sequences. The synthesized reference DNA was subjected to concentration measurement and mass detection using a Thermo NanoDrop instrument and agarose electrophoresis, and the theoretical copy number of each reference DNA was calculated as shown in Table 2.
For the internal reference DNA sequence designed by the application, the corresponding primer compatible with the 16S and ITS amplified regions is also designed, and the primer names, the primer sequences and the band size information are shown in Table 3.
EXAMPLE 2 reference DNA addition gradient exploration
1. Single reference addition experiment
The samples and information used are shown in table 4.
Design of experiment
The plasmids used: plasmid-1 in Table 2.
The experimental method comprises the following steps: the Qubit concentrations of sample DNA are shown in table 4, divided into two experimental groups, 1 concentration of internal reference DNA was added to each sample DNA, and 8 different concentration experiments were set for a single internal reference: the amount of reference DNA (pg) added per ug sample DNA is shown in Table 5; each of the experimental groups 1 and 2 was set with 8 experiments, each set with 3 groups in parallel, and 16S full-length amplification was performed and sequenced. The ratio of the internal reference DNA in each group of experiments was counted after the data was taken off the machine, and the average value was taken for statistics, and the results are shown in Table 5.
Experimental analysis: the addition of too much or too little internal DNA can cause errors in sequencing, so that the gradient range set by the experimental group is reasonable from the perspective of the whole experimental result in order to avoid the influence of the expansion of the errors, but fine adjustment is also needed, and in the experimental group 2, the addition concentration span is larger.
2. Multi-internal reference gradient addition pre-experiment
The samples and information used are shown in table 6.
Design of experiment
The plasmids used: plasmid 1-plasmid 8 in table 2.
Experiment design: according to the following table, the sample DNA was added with 8 internal reference DNA at different concentrations, and 3 groups of the samples were arranged in parallel, and 16S full-length amplification was performed and sequenced after mixing uniformly. And counting the proportion of each group of internal references according to the number of reads of the machine, and taking an average value for counting. The results are shown in Table 7.
Experimental analysis: from the data of the reference ratio in each group of experiments, the addition gradient of the experiment group 1 is suitable.
Example 3 accurate quantitative detection of full-Length amplification of Artificial mock community 16S
4 pure bacteria samples are selected, and the sample names are respectively as follows: 30-1-1, QXD-8, D39, Q41, the 4 pure bacterial sample genomic DNAs were extracted using a bacterial DNA extraction kit (arvensis).
After nucleic acid extraction, nucleic acid concentration and quality detection were performed using two instruments, thermo Nanodrop and Qubit, and agarose electrophoresis, respectively, according to volume 1:1:1:1 mixing 4 pure bacteria sample DNA uniformly, re-etching artificial simulated community, measuring concentration by using Qubit, measuring sample nucleic acid concentration, and the sample nucleic acid concentration is shown in Table 8.
Meanwhile, the extracted sample DNA is sent to Tianyi remote technology limited company for qPCR absolute quantification, and the 16S copy number of each sample is measured.
8 reference DNAs with known copy numbers and concentrations were added to the well-mixed artificial simulated community DNAs in a gradient manner, and the addition gradients of the reference DNAs are shown in Table 9.
After mixing uniformly, a specific primer pair with barcode is added to carry out 16S rRNA full-length PCR amplification. Wherein: forward primer 27F, sequence: AGRGTTYGATYMTGGCTCAG;
reverse primer 1492R, sequence: RGYTACCTTGTTACGACTT.
The PCR amplification system is as follows: 25 mu L of enzyme for amplification, 2.5 mu L of primer, the amount of sample DNA is shown in Table 8, and the amount of internal reference DNA is shown in Table 9,H 2 O was made up to a volume of 50. Mu.L.
The PCR amplification procedure was: 98 ℃ for 30s;98 ℃ for 10s;62 ℃,30s;72 ℃,1min,27 cycles; 72 ℃ for 5min; preserving at 4 ℃.
For the artificial simulated community samples, 3 sets of parallel experiments were set up: experimental group 1, experimental group 2, experimental group 3, and 3 control experiments without adding internal reference DNA, which were set corresponding to parallel experiments: control group 1, control group 2, control group 3. In 3 sets of parallel experiments, primers were tagged with different barcode. Wherein, in the experimental group 1, the barcode is B192; barcode in experimental group 2 was B193; the barcode in experimental group 3 was B194.
And (3) carrying out agarose gel electrophoresis on PCR products obtained by gradient addition of internal reference and primer amplification of the experiment group 1, the experiment group 2 and the experiment group 3, detecting the amplification effect and the band range of the products, comparing the concentrations of the PCR products by using GeneTools Analysis Software, calculating the required sample mixing volume of each sample according to an equivalent principle, and carrying out Qubit quantification after mixing each PCR product.
Library construction was performed according to the standard procedure of 16S Amplification SMRT bell. Library Preparation, and the constructed amplicon library was high throughput sequenced using the PacBio platform.
After the data which are subjected to machine-off and quality control pass through the comparison of the internal reference sequence database and the output of the OTU table of the internal reference DNA, the data are based onReference DNA copy number and next sequence reads data of next machine sequencing, standard curves between reference DNA copy number (log copy number) and reads (log reads) are constructed, next machine data of experimental group and control group are shown in Table 10, and the constructed standard curves, equations and R 2 The values are shown in Table 11 and FIGS. 1-3.
Filtering the internal reference sequence of the machine-down data with qualified quality control to obtain a high-quality non-internal reference effective sequence for subsequent OTU cluster analysis and species annotation analysis.
And further correcting the copy number of the sample flora according to the constructed standard curve, and calculating the absolute copy number of each flora in the sample, the ready numbers of the samples of the experimental group and the control group and the corrected absolute copy number results are shown in table 12.
The result shows that the absolute quantification of the 16S copy number of the microbial flora in the sample can be realized by adding an internal reference sequence with known copy number and concentration into the sample DNA in a gradient way and constructing an internal reference system, and the comparison result is shown in Table 13 and FIG. 4; the proportion of added internal reference sequences is shown in Table 14 for the number of total sequencing reads.
If the overall internal reference addition is lower, the standard curve fitting result is poorer; in contrast, the addition of an excessively high internal reference sequence occupies more sequencing data and leads to an increase in sequencing cost, so that the addition ratio of the internal reference has a certain influence on the result of an absolute quantification experiment, and the internal reference is recommended to be between 10% and 30%.
Example 4 accurate quantitative detection of full-Length amplification of fermented grain sample 16S
In order to evaluate the accuracy and applicability of the method of the application, 1 sample of fermented grains was collected, and a quantitative detection experiment was performed on this sample according to the method of example 3.
And (3) extracting the DNA of the fermented grain sample by using a small amount of soil DNA extraction kit (ark organism), accurately quantifying and marking the DNA concentration by using Qubit, adding internal reference DNA into the DNA in a gradient manner, carrying out three parallel experiments by adding specific primer pairs with different barcode, obtaining PCR amplification products corresponding to the experiment group 1, the experiment group 2 and the experiment group 3, and carrying out high-throughput sequencing on a PCR product constructed sequencing library. Analysis of the off-press data was performed as in example 3, with uncorrected relative population accumulation maps shown in FIG. 5, and standard curves constructed with reference systems shown in FIGS. 6-8; the corrected absolute copy numbers of the flora are shown in Table 15 and the pile-up of the flora structures in FIG. 9.
The result shows that the method can effectively detect the absolute copy number of each classification level 16S in the actual fermented grain sample, has extremely high consistency among repetition, and has higher repeatability of the detection result.
All documents mentioned in this disclosure are incorporated by reference in this disclosure as if each were individually incorporated by reference. Further, it will be appreciated that various changes and modifications may be made by those skilled in the art after reading the above teachings, and such equivalents are intended to fall within the scope of the application as defined in the appended claims.
Claims (8)
1. A reference composition for quantifying absolute abundance of a microbial population, characterized by: the reference composition comprises 8 reference sequences shown as SEQ ID No.1-SEQ ID No. 8.
2. The reference combination for absolute abundance quantification of a microbial population of claim 1, wherein: the reference composition has a concentration gradient from low to high of SEQ ID No.1-SEQ ID No.8, and the ratio of the reference composition to the sample is 10% -30%.
3. A reference composition for quantifying the absolute abundance of a microbial population according to claim 1, wherein: the primer is a forward primer SEQ ID No.9 and a reverse primer SEQ ID No.11; forward primer SEQ ID No.11 and reverse primer SEQ ID No.12; forward primer SEQ ID No.13 and reverse primer SEQ ID No.14; one of 4 pairs of forward primer SEQ ID No.15 and reverse primer SEQ ID No.16.
4. A microbial population absolute abundance quantitative kit is characterized in that: the kit comprising the internal reference composition of any one of claims 1-3.
5. A microbial population absolute abundance quantification method is characterized by comprising the following steps of: the method comprises the following steps:
s1, synthesizing internal reference DNA according to the internal reference sequence of claim 1;
s2, sample DNA extraction:
s3, adding the internal reference composition meeting the requirement of claim 2 into sample DNA, adding the primer pair of claim 3, uniformly mixing, and performing PCR amplification to obtain a PCR product;
s4, constructing a library and sequencing on a machine: purifying the PCR product amplified in the step S3, constructing a library and sequencing;
s5, controlling the quality of the data of the off-machine;
s6, constructing a standard curve: comparing the qualified off-machine data with an internal reference sequence database, outputting an OTU table of the internal reference DNA, and constructing a standard curve between the copy number of the internal reference DNA and the reads number based on the copy number of the internal reference DNA and the reads data of the internal reference sequence sequenced by the off-machine;
s7, outputting a quantitative result.
6. The method for quantifying the absolute abundance of a microorganism population of claim 5, wherein: s1 further comprises the following steps: and (3) measuring the concentration of the synthesized reference DNA, and accurately calculating the copy number of the reference DNA.
7. The method for quantifying the absolute abundance of a microorganism population of claim 5, wherein: s6, further comprising a copy number correction step: and (3) further correcting the copy number of the bacterial groups of the sample according to the constructed standard curve, and calculating the absolute copy number of each bacterial group in the sample.
8. The method for quantifying the absolute abundance of a microorganism population of claim 5, wherein: s6, further comprising the following steps: and filtering the internal reference sequence to obtain a high-quality non-internal reference effective sequence for species information analysis.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311018065.4A CN116790779A (en) | 2023-08-14 | 2023-08-14 | Reference composition, kit and method for quantifying absolute abundance of microbial population |
CN202410033610.5A CN117844939A (en) | 2023-08-14 | 2024-01-08 | Reference composition, kit and method for quantifying absolute abundance of microbial population |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311018065.4A CN116790779A (en) | 2023-08-14 | 2023-08-14 | Reference composition, kit and method for quantifying absolute abundance of microbial population |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116790779A true CN116790779A (en) | 2023-09-22 |
Family
ID=88034708
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311018065.4A Pending CN116790779A (en) | 2023-08-14 | 2023-08-14 | Reference composition, kit and method for quantifying absolute abundance of microbial population |
CN202410033610.5A Pending CN117844939A (en) | 2023-08-14 | 2024-01-08 | Reference composition, kit and method for quantifying absolute abundance of microbial population |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202410033610.5A Pending CN117844939A (en) | 2023-08-14 | 2024-01-08 | Reference composition, kit and method for quantifying absolute abundance of microbial population |
Country Status (1)
Country | Link |
---|---|
CN (2) | CN116790779A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117844939A (en) * | 2023-08-14 | 2024-04-09 | 广东美格基因科技有限公司 | Reference composition, kit and method for quantifying absolute abundance of microbial population |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1147276A (en) * | 1994-04-25 | 1997-04-09 | 希巴-盖吉股份公司 | Detection of fungal pathogens using polymerase chain reaction |
CN111554349A (en) * | 2020-02-18 | 2020-08-18 | 中国检验检疫科学研究院 | Species identification system and method based on high-throughput sequencing |
CN116064863A (en) * | 2022-09-16 | 2023-05-05 | 北京百迈客生物科技有限公司 | Method for detecting bacterial flora composition and absolute content in sample |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109943654B (en) * | 2019-04-12 | 2023-06-30 | 上海天昊生物科技有限公司 | Method for detecting bacterial flora composition and absolute content based on internal reference sequence |
CN116790779A (en) * | 2023-08-14 | 2023-09-22 | 广东美格基因科技有限公司 | Reference composition, kit and method for quantifying absolute abundance of microbial population |
-
2023
- 2023-08-14 CN CN202311018065.4A patent/CN116790779A/en active Pending
-
2024
- 2024-01-08 CN CN202410033610.5A patent/CN117844939A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1147276A (en) * | 1994-04-25 | 1997-04-09 | 希巴-盖吉股份公司 | Detection of fungal pathogens using polymerase chain reaction |
CN111554349A (en) * | 2020-02-18 | 2020-08-18 | 中国检验检疫科学研究院 | Species identification system and method based on high-throughput sequencing |
CN116064863A (en) * | 2022-09-16 | 2023-05-05 | 北京百迈客生物科技有限公司 | Method for detecting bacterial flora composition and absolute content in sample |
Non-Patent Citations (1)
Title |
---|
王行: "内参基因加标法定量土壤微生物目标基因绝对拷贝数", 微生物学通报, vol. 43, no. 2, pages 424 - 433 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117844939A (en) * | 2023-08-14 | 2024-04-09 | 广东美格基因科技有限公司 | Reference composition, kit and method for quantifying absolute abundance of microbial population |
Also Published As
Publication number | Publication date |
---|---|
CN117844939A (en) | 2024-04-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Pearson et al. | The molecular genetics of cyanobacterial toxicity as a basis for monitoring water quality and public health risk | |
Gentry et al. | Microarray applications in microbial ecology research | |
Boughner et al. | Microbial ecology: where are we now? | |
Rohrlack et al. | Oligopeptide chemotypes of the toxic freshwater cyanobacterium Planktothrix can form sub‐populations with dissimilar ecological traits | |
CN110305979B (en) | Microbial community absolute quantification method based on multi-internal standard system | |
CN117844939A (en) | Reference composition, kit and method for quantifying absolute abundance of microbial population | |
Thies | Molecular approaches to studying the soil biota | |
Smalla | Culture‐independent microbiology | |
CN112342279A (en) | Kit and method for simultaneously detecting specific genes of cyanobacteria bloom Ralstonia and prototheca toxin | |
Rueckert et al. | Development and field assessment of a quantitative PCR for the detection and enumeration of the noxious bloom‐former Anabaena planktonica | |
CN116622820A (en) | High-throughput quantitative sequencing and analysis method for microbial amplicon | |
Faniyan et al. | Analyzing bacterial species from different environments using direct 16S rRNA gene sequencing methods | |
US7947440B2 (en) | Method for determining the specific growth rate of a distinct cell population within a non-homogeneous system | |
CN108753997A (en) | A kind of fluorescent quantitation recombinase polymeric enzymatic amplification method for detecting Brucella sp | |
Lundberg et al. | Measuring both microbial load and diversity with a single amplicon sequencing library. | |
CN113528614A (en) | Plant phyllospheric surface microorganism metagenome detection method | |
CN109355412A (en) | It is a kind of artificial synthesized exogenous referring to molecule compared with fungal species and abundance for measuring bacterium | |
CN114277185B (en) | Adenovirus MNP (MNP) marking combination, primer pair combination, kit and application thereof | |
D’Elia et al. | Source tracking of Escherichia coli by 16S–23S intergenic spacer region denaturing gradient gel electrophoresis (DGGE) of the rrnB ribosomal operon | |
Rajeev et al. | Culture-Dependent and-Independent Strategies in Bacterial Diversity Appraisal | |
CN118186118B (en) | Multi-PCR rapid detection kit and rapid detection method for microorganisms of experimental animals | |
CN114277163B (en) | MNP (MNP) labeling combination of chlamydia pneumoniae, primer pair combination, kit and application of kit | |
CN114277164B (en) | MNP (MNP-associated protein) marking combination, primer pair combination, kit and application of MNP marking combination and primer pair combination | |
CN108359722B (en) | Integrated system and method for monitoring dynamic change of microorganism | |
Hallam et al. | Molecular tools for investigating ANME community structure and function |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20230922 |
|
WD01 | Invention patent application deemed withdrawn after publication |