CN116790779A

CN116790779A - Reference composition, kit and method for quantifying absolute abundance of microbial population

Info

Publication number: CN116790779A
Application number: CN202311018065.4A
Authority: CN
Inventors: 王茜; 刘飞飞; 金桃; 张智闵; 束文圣; 李绍春
Original assignee: Guangzhou Microchip Biotechnology Co ltd; Guangdong Magigene Technology Co ltd
Current assignee: Guangzhou Microchip Biotechnology Co ltd; Guangdong Magigene Technology Co ltd
Priority date: 2023-08-14
Filing date: 2023-08-14
Publication date: 2023-09-22
Also published as: CN117844939A

Abstract

The application relates to an internal reference composition, a kit and a method for quantifying absolute abundance of a microbial population, wherein the internal reference composition comprises 8 internal reference sequences shown in SEQ ID No.1-SEQ ID No.8, the SEQ ID No.1-SEQ ID No.8 has a concentration gradient from low to high, and the ratio of the internal reference composition to a sample is 10% -30%. The absolute content of different microbial communities is quantified by adding reference DNA with different concentration gradients into an environmental sample, amplicons with expected sizes are generated by the reference DNA molecules in the PCR reaction process, a standard curve between the copy number of the reference DNA and reads is established by combining the copy number of the reference DNA and the reads output by sequencing, the absolute abundance of the copy number of each flora in the sample can be further calculated, and the absolute abundance of the species in a unit sample can be accurately estimated. The method can obtain two analysis results of relative quantitative and absolute quantitative results at the same time by one amplification experiment, saves the consumption of samples and internal reference DNA, and saves sequencing cost to the greatest extent.

Description

Reference composition, kit and method for quantifying absolute abundance of microbial population

Technical Field

The application belongs to the field of biotechnology and histology analysis, and particularly relates to an internal reference composition, a kit and a method for quantifying absolute abundance of a microbial population.

Background

Microorganisms are the most abundant biological resources on the earth, play an important role in soil, plant growth and development, food fermentation, human activities and the like, promote the development of global element circulation, ecological environment treatment, biopharmaceutical, disease diagnosis, disease prevention and the like, and influence the aspects of human immunity, digestion, mental health and the like.

With the development of second generation sequencing technology, research and cognition on microbial community diversity and evolution has been greatly improved, however, accurate characterization of microbial communities has been a challenging problem, and amplicon sequencing based on 16S rRNA, 18S rRNA and ITS genes is a common tool in microbial ecology to measure specific microbial community structure and feature distribution.

The community composition condition of the sample can be known by extracting DNA in the sample, carrying out PCR amplification on the 16S or ITS gene region, constructing a library, sequencing based on a sequencing platform such as Illumina, pacBio and the like, then carrying out sequence analysis and species annotation by combining a bioinformatics method, and further comparing the differences between the samples by alpha and beta diversity analysis. Compared with the second generation sequencing, which only selects 1-2 hypervariable regions for sequencing, the third generation sequencing can obtain a full-length 16S or ITS sequence, more variable region information can improve the resolution of species identification, can improve the identification precision of community composition, and can more truly reduce the microbial community structure in a sample. The amplicon sequencing method has important guiding function on researching the analysis of the microbial community composition in the environments such as water, soil, air, intestinal feces and the like, and is also widely used in phylogenetic and ecological researches.

Although the conventional amplicon sequencing technology has the great advantages of high throughput, low cost and objective reduction of the flora structure and the relative abundance ratio, most of the amplicon sequencing technologies at present only generate data of relative abundance due to the technical limitation, only the percentage of a certain species in the whole environmental sample can be described, the absolute content of the species in the environmental community can not be represented, the high false positive rate and related analysis deviation in OTU analysis can be caused, and the absolute abundance change of the microbial community can not be represented due to the real number of the species in the sample and the real difference of the samples among groups, which is an important problem faced by the amplicon sequencing technology. For example, species S accounts for 10% and 20% of sample a and sample B, respectively, and when the total amounts of microorganisms in sample A, B are consistent, the absolute abundance of species S in sample a is higher than that in sample B; if A sample (10) ⁹ The total amount of microorganisms in cells/g was less than that in sample B (10 ¹⁰ cells/g), the absolute abundance of species S in the a samples is lower than that in the B samples.

Therefore, the method can correct amplicon sequencing bias, accurately determine microorganism population abundance, increase comparability among samples, further truly reflect the real number of species in the samples and the real difference of samples among groups, make sequencing results more accurate and reliable, make up for the defects of the conventional amplicon sequencing technology method, and have very important significance for comprehensively researching and knowing the environmental microorganism community structure and taxonomies thereof.

Disclosure of Invention

Aiming at the technical problems existing at present, the application provides an internal reference composition, a kit and a method for quantifying absolute abundance of a microbial population.

In order to achieve the above purpose, the application adopts the following technical means:

in a first aspect, the application provides an internal reference composition for quantifying the absolute abundance of a microbial population, comprising 8 internal reference sequences as shown in SEQ ID No.1-SEQ ID No. 8.

Further, the reference composition has a concentration gradient from low to high of SEQ ID No.1-SEQ ID No.8, and the ratio of the reference composition to the sample is 10% -30%, in some embodiments the ratio of the concentration gradient of SEQ ID No.1-SEQ ID No.8 in the reference composition is 1:3:6:15:30:60:150:300. the design of the internal reference DNA sequence needs to contain a prokaryotic universal primer binding site, an optimized synthetic filling sequence with the same length and GC content as the in-vivo target and a cloning vector which is easy to obtain and process, and the designed sequence has no matching relation with the sequence of a natural species known in nature through NCBI verification.

Further, the kit also comprises a forward primer SEQ ID No.9 and a reverse primer SEQ ID No.11; forward primer SEQ ID No.11 and reverse primer SEQ ID No.12; forward primer SEQ ID No.13 and reverse primer SEQ ID No.14; one of 4 pairs of forward primer SEQ ID No.15 and reverse primer SEQ ID No.16. In some embodiments, the primer pair is compatible with 16S and ITS amplification regions, and the correspondence between primer names and primer sequences is: ITS3-F corresponds to sequence SEQ ID No.9; ITS4-R corresponds to sequence SEQ ID No.10;27F corresponds to sequence SEQ ID No.11;1492R corresponds to sequence SEQ ID No.12; ITS1F corresponds to sequence SEQ ID No.13; ITS4R corresponds to sequence SEQ ID No.14;338F corresponds to sequence SEQ ID No.15;806R corresponds to the sequence SEQ ID No.16.

In some embodiments, 27F corresponding sequence SEQ ID No.11 is used as a forward primer; 1492R corresponding sequence SEQ ID No.12 is a reverse primer, and 16S full-length amplification is carried out.

In a second aspect the application provides a kit for quantifying absolute abundance of a population of microorganisms, the kit comprising a reference composition as hereinbefore described.

In a third aspect, the application provides a method for quantifying absolute abundance of a microorganism population, comprising the steps of:

s1, synthesizing internal reference DNA according to the internal reference sequence of claim 1;

s2, sample DNA extraction:

s3, adding the internal reference composition meeting the requirement of claim 2 into sample DNA, adding the primer pair of claim 3, uniformly mixing, and performing PCR amplification to obtain a PCR product; in some embodiments, S3 comprises amplification with primers with different barcode, several different bases added in front of the forward and reverse primers, as a barcode tag to label and distinguish between different PCR amplification products;

s4, constructing a library and sequencing on a machine: purifying the PCR product amplified in the step S3, constructing a library and sequencing; in some embodiments, S4 comprises concentration comparison of the different PCR amplification products obtained, and mixing and purifying the PCR products on an equal basis. Library construction requires that the total amount of each PCR product be the same and that the fragments be similar, and that the volumes of each PCR product be mixed based on the concentrations determined.

S5, controlling the quality of the data of the off-machine; in some embodiments, second generation sequencing is adopted, and the quality control of the data of the next generation comprises removing the joint sequence, low quality and repeated sequence and the like, so as to obtain high quality sequencing data; in some embodiments, three generations of sequencing are used, off-machine data quality control includes sequence correction, removal of low quality and short sequences, resulting in high quality sequencing data.

S6, constructing a standard curve: comparing the qualified off-machine data with an internal reference sequence database, outputting an OTU table of the internal reference DNA, and constructing a standard curve between the copy number of the internal reference DNA and the reads number based on the copy number of the internal reference DNA and the reads data of the internal reference sequence sequenced by the off-machine;

s7, outputting a quantitative result.

Further, S1 further comprises the following steps: and (3) measuring the concentration of the synthesized reference DNA, and accurately calculating the copy number of the reference DNA.

Further, S6 further includes a copy number correction step: and (3) further correcting the copy number of the bacterial groups of the sample according to the constructed standard curve, and calculating the absolute copy number of each bacterial group in the sample.

Further, the step S6 further comprises the following steps: and filtering the internal reference sequence of the machine-setting data with qualified quality control to obtain a high-quality non-internal reference effective sequence for species analysis.

In some embodiments, the sample type is soil, water, distillers grains, fermented grains, pit mud, stool, or the like; sample DNA extraction adopts extraction methods known in the industry, and a suitable DNA extraction method is selected according to different types of samples so as to improve the concentration and total amount of DNA.

In some embodiments, high quality non-internal reference effective sequences are obtained for OTU abundance analysis, OTU cluster analysis, species annotation analysis, and statistical analysis, among others.

Compared with the prior art, the application has the beneficial effects that:

1. the method has wide application range, can cover samples of different types, can select and combine the advantages of sequencing methods of second generation sequencing and third generation sequencing, and accurately quantitate the content of different bacterial groups in the samples.

2. According to the method, the absolute content of different microbial communities can be quantified by adding the internal reference DNA with different concentration gradients into the environmental sample; in the PCR reaction process, the reference DNA molecule can generate amplicons with expected sizes, and the absolute abundance of each flora copy number in a sample can be further calculated by combining the copy number of the reference DNA and the ready number of sequencing output and establishing a standard curve between the reference DNA copy number (log copy number) and reads (log reads), so that the absolute abundance of the species in a unit sample can be accurately estimated.

3. The method can make the processing conditions of the sample to be detected in the operation step consistent with those of the internal reference system by constructing the internal reference system, so that the amplification efficiency of the sample to be detected and the internal reference system are as close as possible, and absolute quantification is realized.

4. The method can avoid the speed difference in the amplification process caused by the possible difference between different microorganism concentrations in the sample, thereby ensuring more accurate calculation result.

5. The method can obtain two analysis results of relative quantitative and absolute quantitative results at the same time by one amplification experiment, saves the consumption of samples and internal reference DNA, and saves sequencing cost to the greatest extent.

Drawings

FIG. 1 is a standard graph of experimental group 1 in example 3;

FIG. 2 is a standard graph of experimental group 2 in example 3;

FIG. 3 is a standard graph of experimental group 3 in example 3;

FIG. 4 is a graph comparing the quantitative results of the sample flora in example 3;

FIG. 5 is a graph showing the relative concentration of uncorrected bacterial cells in a fermented grain sample according to example 4;

FIG. 6 is a standard graph of the fermented grain sample experimental group 1 in example 4;

FIG. 7 is a standard graph of the fermented grain sample experimental group 2 in example 4;

FIG. 8 is a standard graph of the fermented grain sample experimental group 3 in example 4;

FIG. 9 is a graph showing the absolute content of the bacterial strain after calibration of the fermented grain sample in example 4.

Detailed Description

Unless otherwise indicated, implied from the context, or common denominator in the art, all parts and percentages in the present application are based on weight and the test and characterization methods used are synchronized with the filing date of the present application. Where applicable, the disclosure of any patent, patent application, or publication referred to in this disclosure is incorporated herein by reference in its entirety, and the equivalent patents are incorporated herein by reference, especially with respect to the definitions of synthetic techniques, product and process designs, polymers, comonomers, initiators or catalysts, etc. in the art, as disclosed in these documents. If the definition of a particular term disclosed in the prior art is inconsistent with any definition provided in the present application, the definition of the term provided in the present application controls.

The numerical ranges in the present application are approximations, so that it may include the numerical values outside the range unless otherwise indicated. The numerical range includes all values from the lower value to the upper value that increase by 1 unit, provided that there is a spacing of at least 2 units between any lower value and any higher value. For example, if a component, physical or other property (e.g., molecular weight, melt index, etc.) is recited as being 100 to 1000, it is intended that all individual values, e.g., 100, 101, 102, etc., and all subranges, e.g., 100 to 166, 155 to 170, 198 to 200, etc., are explicitly recited. For ranges containing values less than 1 or containing fractions greater than 1 (e.g., 1.1,1.5, etc.), then 1 unit is suitably considered to be 0.0001,0.001,0.01, or 0.1. For a range containing units of less than 10 (e.g., 1 to 5), 1 unit is generally considered to be 0.1. These are merely specific examples of what is intended to be provided, and all possible combinations of numerical values between the lowest value and the highest value enumerated are to be considered to be expressly stated in this disclosure.

As used with respect to chemical compounds, the singular includes all isomeric forms and vice versa unless explicitly stated otherwise (e.g., "hexane" includes all isomers of hexane, either individually or collectively). In addition, unless explicitly stated otherwise, the use of the terms "a," "an," or "the" include plural referents.

The terms "comprises," "comprising," "including," and their derivatives do not exclude the presence of any other component, step or process, and are not related to whether or not such other component, step or process is disclosed in the present application. For the avoidance of any doubt, all use of the terms "comprising", "including" or "having" herein, unless expressly stated otherwise, may include any additional additive, adjuvant or compound. Rather, the term "consisting essentially of … …" excludes any other component, step or process from the scope of any of the terms recited below, as those out of necessity for operability. The term "consisting of … …" does not include any components, steps or processes not specifically described or listed. The term "or" refers to the listed individual members or any combination thereof unless explicitly stated otherwise.

In order to make the technical problems, technical schemes and beneficial effects solved by the application more clear, the application is further described in detail below with reference to the embodiments.

Examples

The following examples are presented herein to demonstrate preferred embodiments of the present application. It will be appreciated by those skilled in the art that the techniques disclosed in the examples which follow represent techniques discovered by the inventor to function in the practice of the application, and thus can be considered to constitute preferred modes for its practice. Those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit or scope of the application.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs, the disclosure of which is incorporated herein by reference as is commonly understood by reference. Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the application described herein. Such equivalents are intended to be encompassed by the claims.

EXAMPLE 1 design and Synthesis of reference DNA

The design of the internal reference DNA sequence needs to contain a prokaryotic universal primer binding site, an optimized synthetic filling sequence with the same length and GC content as the in-vivo target and a cloning vector which is easy to obtain and process, and the designed sequence has no matching relation with the sequence of a natural species known in nature through NCBI verification.

The internal reference DNA sequence designed by the application is shown in Table 1, the internal reference DNA sequence is inserted into plasmid, and plasmid extraction is carried out after amplification culture.

According to the sequences of the reference DNAs shown in Table 1, the DNA was sent to Shanghai Biotechnology Co., ltd and synthesized according to the standard synthetic procedure to obtain 8 reference sequences. The synthesized reference DNA was subjected to concentration measurement and mass detection using a Thermo NanoDrop instrument and agarose electrophoresis, and the theoretical copy number of each reference DNA was calculated as shown in Table 2.

For the internal reference DNA sequence designed by the application, the corresponding primer compatible with the 16S and ITS amplified regions is also designed, and the primer names, the primer sequences and the band size information are shown in Table 3.

EXAMPLE 2 reference DNA addition gradient exploration

1. Single reference addition experiment

The samples and information used are shown in table 4.

Design of experiment

The plasmids used: plasmid-1 in Table 2.

The experimental method comprises the following steps: the Qubit concentrations of sample DNA are shown in table 4, divided into two experimental groups, 1 concentration of internal reference DNA was added to each sample DNA, and 8 different concentration experiments were set for a single internal reference: the amount of reference DNA (pg) added per ug sample DNA is shown in Table 5; each of the experimental groups 1 and 2 was set with 8 experiments, each set with 3 groups in parallel, and 16S full-length amplification was performed and sequenced. The ratio of the internal reference DNA in each group of experiments was counted after the data was taken off the machine, and the average value was taken for statistics, and the results are shown in Table 5.

Experimental analysis: the addition of too much or too little internal DNA can cause errors in sequencing, so that the gradient range set by the experimental group is reasonable from the perspective of the whole experimental result in order to avoid the influence of the expansion of the errors, but fine adjustment is also needed, and in the experimental group 2, the addition concentration span is larger.

2. Multi-internal reference gradient addition pre-experiment

The samples and information used are shown in table 6.

Design of experiment

The plasmids used: plasmid 1-plasmid 8 in table 2.

Experiment design: according to the following table, the sample DNA was added with 8 internal reference DNA at different concentrations, and 3 groups of the samples were arranged in parallel, and 16S full-length amplification was performed and sequenced after mixing uniformly. And counting the proportion of each group of internal references according to the number of reads of the machine, and taking an average value for counting. The results are shown in Table 7.

Experimental analysis: from the data of the reference ratio in each group of experiments, the addition gradient of the experiment group 1 is suitable.

Example 3 accurate quantitative detection of full-Length amplification of Artificial mock community 16S

4 pure bacteria samples are selected, and the sample names are respectively as follows: 30-1-1, QXD-8, D39, Q41, the 4 pure bacterial sample genomic DNAs were extracted using a bacterial DNA extraction kit (arvensis).

After nucleic acid extraction, nucleic acid concentration and quality detection were performed using two instruments, thermo Nanodrop and Qubit, and agarose electrophoresis, respectively, according to volume 1:1:1:1 mixing 4 pure bacteria sample DNA uniformly, re-etching artificial simulated community, measuring concentration by using Qubit, measuring sample nucleic acid concentration, and the sample nucleic acid concentration is shown in Table 8.

Meanwhile, the extracted sample DNA is sent to Tianyi remote technology limited company for qPCR absolute quantification, and the 16S copy number of each sample is measured.

8 reference DNAs with known copy numbers and concentrations were added to the well-mixed artificial simulated community DNAs in a gradient manner, and the addition gradients of the reference DNAs are shown in Table 9.

After mixing uniformly, a specific primer pair with barcode is added to carry out 16S rRNA full-length PCR amplification. Wherein: forward primer 27F, sequence: AGRGTTYGATYMTGGCTCAG;

reverse primer 1492R, sequence: RGYTACCTTGTTACGACTT.

The PCR amplification system is as follows: 25 mu L of enzyme for amplification, 2.5 mu L of primer, the amount of sample DNA is shown in Table 8, and the amount of internal reference DNA is shown in Table 9,H ₂ O was made up to a volume of 50. Mu.L.

The PCR amplification procedure was: 98 ℃ for 30s;98 ℃ for 10s;62 ℃,30s;72 ℃,1min,27 cycles; 72 ℃ for 5min; preserving at 4 ℃.

For the artificial simulated community samples, 3 sets of parallel experiments were set up: experimental group 1, experimental group 2, experimental group 3, and 3 control experiments without adding internal reference DNA, which were set corresponding to parallel experiments: control group 1, control group 2, control group 3. In 3 sets of parallel experiments, primers were tagged with different barcode. Wherein, in the experimental group 1, the barcode is B192; barcode in experimental group 2 was B193; the barcode in experimental group 3 was B194.

And (3) carrying out agarose gel electrophoresis on PCR products obtained by gradient addition of internal reference and primer amplification of the experiment group 1, the experiment group 2 and the experiment group 3, detecting the amplification effect and the band range of the products, comparing the concentrations of the PCR products by using GeneTools Analysis Software, calculating the required sample mixing volume of each sample according to an equivalent principle, and carrying out Qubit quantification after mixing each PCR product.

Library construction was performed according to the standard procedure of 16S Amplification SMRT bell. Library Preparation, and the constructed amplicon library was high throughput sequenced using the PacBio platform.

After the data which are subjected to machine-off and quality control pass through the comparison of the internal reference sequence database and the output of the OTU table of the internal reference DNA, the data are based onReference DNA copy number and next sequence reads data of next machine sequencing, standard curves between reference DNA copy number (log copy number) and reads (log reads) are constructed, next machine data of experimental group and control group are shown in Table 10, and the constructed standard curves, equations and R ² The values are shown in Table 11 and FIGS. 1-3.

Filtering the internal reference sequence of the machine-down data with qualified quality control to obtain a high-quality non-internal reference effective sequence for subsequent OTU cluster analysis and species annotation analysis.

And further correcting the copy number of the sample flora according to the constructed standard curve, and calculating the absolute copy number of each flora in the sample, the ready numbers of the samples of the experimental group and the control group and the corrected absolute copy number results are shown in table 12.

The result shows that the absolute quantification of the 16S copy number of the microbial flora in the sample can be realized by adding an internal reference sequence with known copy number and concentration into the sample DNA in a gradient way and constructing an internal reference system, and the comparison result is shown in Table 13 and FIG. 4; the proportion of added internal reference sequences is shown in Table 14 for the number of total sequencing reads.

If the overall internal reference addition is lower, the standard curve fitting result is poorer; in contrast, the addition of an excessively high internal reference sequence occupies more sequencing data and leads to an increase in sequencing cost, so that the addition ratio of the internal reference has a certain influence on the result of an absolute quantification experiment, and the internal reference is recommended to be between 10% and 30%.

Example 4 accurate quantitative detection of full-Length amplification of fermented grain sample 16S

In order to evaluate the accuracy and applicability of the method of the application, 1 sample of fermented grains was collected, and a quantitative detection experiment was performed on this sample according to the method of example 3.

And (3) extracting the DNA of the fermented grain sample by using a small amount of soil DNA extraction kit (ark organism), accurately quantifying and marking the DNA concentration by using Qubit, adding internal reference DNA into the DNA in a gradient manner, carrying out three parallel experiments by adding specific primer pairs with different barcode, obtaining PCR amplification products corresponding to the experiment group 1, the experiment group 2 and the experiment group 3, and carrying out high-throughput sequencing on a PCR product constructed sequencing library. Analysis of the off-press data was performed as in example 3, with uncorrected relative population accumulation maps shown in FIG. 5, and standard curves constructed with reference systems shown in FIGS. 6-8; the corrected absolute copy numbers of the flora are shown in Table 15 and the pile-up of the flora structures in FIG. 9.

The result shows that the method can effectively detect the absolute copy number of each classification level 16S in the actual fermented grain sample, has extremely high consistency among repetition, and has higher repeatability of the detection result.

All documents mentioned in this disclosure are incorporated by reference in this disclosure as if each were individually incorporated by reference. Further, it will be appreciated that various changes and modifications may be made by those skilled in the art after reading the above teachings, and such equivalents are intended to fall within the scope of the application as defined in the appended claims.

Claims

1. A reference composition for quantifying absolute abundance of a microbial population, characterized by: the reference composition comprises 8 reference sequences shown as SEQ ID No.1-SEQ ID No. 8.

2. The reference combination for absolute abundance quantification of a microbial population of claim 1, wherein: the reference composition has a concentration gradient from low to high of SEQ ID No.1-SEQ ID No.8, and the ratio of the reference composition to the sample is 10% -30%.

3. A reference composition for quantifying the absolute abundance of a microbial population according to claim 1, wherein: the primer is a forward primer SEQ ID No.9 and a reverse primer SEQ ID No.11; forward primer SEQ ID No.11 and reverse primer SEQ ID No.12; forward primer SEQ ID No.13 and reverse primer SEQ ID No.14; one of 4 pairs of forward primer SEQ ID No.15 and reverse primer SEQ ID No.16.

4. A microbial population absolute abundance quantitative kit is characterized in that: the kit comprising the internal reference composition of any one of claims 1-3.

5. A microbial population absolute abundance quantification method is characterized by comprising the following steps of: the method comprises the following steps:

s2, sample DNA extraction:

s3, adding the internal reference composition meeting the requirement of claim 2 into sample DNA, adding the primer pair of claim 3, uniformly mixing, and performing PCR amplification to obtain a PCR product;

s4, constructing a library and sequencing on a machine: purifying the PCR product amplified in the step S3, constructing a library and sequencing;

s5, controlling the quality of the data of the off-machine;

s7, outputting a quantitative result.

6. The method for quantifying the absolute abundance of a microorganism population of claim 5, wherein: s1 further comprises the following steps: and (3) measuring the concentration of the synthesized reference DNA, and accurately calculating the copy number of the reference DNA.

7. The method for quantifying the absolute abundance of a microorganism population of claim 5, wherein: s6, further comprising a copy number correction step: and (3) further correcting the copy number of the bacterial groups of the sample according to the constructed standard curve, and calculating the absolute copy number of each bacterial group in the sample.

8. The method for quantifying the absolute abundance of a microorganism population of claim 5, wherein: s6, further comprising the following steps: and filtering the internal reference sequence to obtain a high-quality non-internal reference effective sequence for species information analysis.