WO2019149247A1 - 用于2型糖尿病的生物标志物及其用途 - Google Patents

用于2型糖尿病的生物标志物及其用途 Download PDF

Info

Publication number
WO2019149247A1
WO2019149247A1 PCT/CN2019/074162 CN2019074162W WO2019149247A1 WO 2019149247 A1 WO2019149247 A1 WO 2019149247A1 CN 2019074162 W CN2019074162 W CN 2019074162W WO 2019149247 A1 WO2019149247 A1 WO 2019149247A1
Authority
WO
WIPO (PCT)
Prior art keywords
cag
abundance
subject
sequence
index
Prior art date
Application number
PCT/CN2019/074162
Other languages
English (en)
French (fr)
Inventor
张晨虹
吴国军
张梦晖
赵立平
Original Assignee
完美(中国)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 完美(中国)有限公司 filed Critical 完美(中国)有限公司
Publication of WO2019149247A1 publication Critical patent/WO2019149247A1/zh

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6888Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
    • C12Q1/689Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H20/00ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance
    • G16H20/60ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance relating to nutrition control, e.g. diets
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/106Pharmacogenomics, i.e. genetic variability in individual responses to drugs and drug metabolism
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers

Definitions

  • the present invention relates to a method for assessing the presence or risk of developing type 2 diabetes in a subject based on abundance data of several CAGs.
  • the present invention also provides methods for assessing the efficacy of a dietary intervention or disease treatment in a subject with type 2 diabetes based on the abundance data of these CAGs.
  • the gut microbiota provides many beneficial functions for human hosts. Some of these features are necessary for us because we don't encode them in our own genome. From an ecological point of view, such a function can be considered as an “ecosystem service” (1). Functionally, the "healthy" intestinal microbiota is a gut microbiota capable of providing all the ecosystem services required. Short-chain fatty acid (SCFA) production is the most prominent example of such services provided by enteric bacteria. There is a large body of literature on how people can directly benefit from SCFA: for example, butyrate is the main energy substrate for colon cells, and many SCFAs function as signaling molecules that regulate inflammation and appetite regulation (2) . Thus, bacteria that supply SCFA to humans are key members of the ecosystem service provider (ESP) and the intestinal microbiota used to keep human hosts healthy.
  • ESP ecosystem service provider
  • T2DM type 2 diabetes mellitus
  • This application uses shotgun macrogenomic sequencing to reveal intestinal microbiome changes in T2D patients in response to high fiber intervention.
  • 15 CAGs (co-abundance group) expressed as CAG NO.: 1 to 15 were found to be up-regulated and identified as ESP, and 49 were expressed as CAG NO.: 16 to 64.
  • CAG is downregulated in T2D patients. These CAGs can be used as biomarkers for efficient, accurate and patient-friendly characterization of T2D.
  • the invention provides a method of assessing the risk or risk of developing type 2 diabetes in a subject, comprising the steps of:
  • CAG NO.: 1 to 15 comprise SEQ ID NO.: 1 to 191, 192 to 326, 327 to 593, 594 to 835, 836 to 885, 886 to 960, 961 to 1097, 1098 to 1264, 1265 to 1433, respectively.
  • the nucleic acid sequences shown in 1434 to 1684, 1685 to 1833, 1834 to 1979, 1980 to 2163, 2164 to 2447, and 2448 to 2783, and CAG NO.: 16 to 64 include SEQ ID NO.: 2784 to 2961, 2962, respectively.
  • the DNA analysis in step b) comprises the steps of obtaining a DNA sequence and aligning the obtained DNA sequence with the nucleic acid sequences set forth in SEQ ID No.: 1 to 14850.
  • obtaining the DNA sequence comprises the steps of obtaining an original sequence read in the sample and processing the original sequence read to obtain a qualified sequence read.
  • the original sequence reads are obtained by PCR-based high throughput sequencing techniques. In some embodiments, the original sequence reads are obtained by Illumina sequencing.
  • processing the original sequence readout comprises: removing the adaptor, trimming the sequence at the 3' end until reaching the first nucleotide with a quality threshold above 20, removing the short sequence, and removing the matching to the human genome the sequence of.
  • the short sequence is 59 bp or less in length.
  • the alignment of the DNA sequences uses a seed-and-extend strategy.
  • the abundance of each reference CAG is determined in step b) using sequences that are not mismatched in the seed sequence.
  • the seed sequence is 4 bp or greater in length, 5 bp or greater, 6 bp or greater, 7 bp or greater, 8 bp or greater, 9 bp or greater, 10 bp or greater, 11 bp or greater. 12 bp or more, 13 bp or more, 14 bp or more, 15 bp or more, 16 bp or more, 17 bp or more, 18 bp or more, or 19 bp or more.
  • the length of the seed sequence is 31 bp or less, 30 bp or less, 29 bp or less, 28 bp or less, 27 bp or less, 26 bp or less, 25 bp or less, 24 bp or less. , 23 bp or less, 22 bp or less, or 21 bp or less. In some embodiments, the seed sequence is 20 bp in length.
  • the predetermined level is about -1.028883.
  • the invention provides a method of assessing the efficacy of a dietary or disease treatment in a subject having type 2 diabetes comprising the steps of:
  • CAG NO.: 1 to 15 comprise SEQ ID NO.: 1 to 191, 192 to 326, 327 to 593, 594 to 835, 836 to 885, 886 to 960, 961 to 1097, 1098 to 1264, 1265 to 1433, respectively.
  • the nucleic acid sequences shown in 1434 to 1684, 1685 to 1833, 1834 to 1979, 1980 to 2163, 2164 to 2447, and 2448 to 2783, and CAG NO.: 16 to 64 include SEQ ID NO.: 2784 to 2961, 2962, respectively.
  • the DNA analysis in step b) comprises the steps of obtaining a DNA sequence and aligning the obtained DNA sequence with the nucleic acid sequences set forth in SEQ ID No.: 1 to 14850.
  • obtaining the DNA sequence comprises the steps of obtaining an original sequence read in the sample and processing the original sequence read to obtain a qualified sequence read.
  • the original sequence reads are obtained by PCR-based high throughput sequencing techniques. In some embodiments, the original sequence reads are obtained by Illumina sequencing.
  • processing the original sequence readout comprises: removing the adaptor, trimming the sequence at the 3' end until reaching the first nucleotide with a quality threshold above 20, removing the short sequence, and removing the matching to the human genome the sequence of.
  • the short sequence is 59 bp or less in length.
  • the alignment of the DNA sequences uses a seed extension strategy.
  • the abundance of each reference CAG is determined in step b) using sequences that are not mismatched in the seed sequence.
  • the seed sequence is 4 bp or greater in length, 5 bp or greater, 6 bp or greater, 7 bp or greater, 8 bp or greater, 9 bp or greater, 10 bp or greater, 11 bp or greater. 12 bp or more, 13 bp or more, 14 bp or more, 15 bp or more, 16 bp or more, 17 bp or more, 18 bp or more, or 19 bp or more.
  • the length of the seed sequence is 31 bp or less, 30 bp or less, 29 bp or less, 28 bp or less, 27 bp or less, 26 bp or less, 25 bp or less, 24 bp or less. , 23 bp or less, 22 bp or less, or 21 bp or less. In some embodiments, the seed sequence is 20 bp in length.
  • the stool sample is collected 1 week, 2 weeks, 3 weeks, and/or 4 weeks after the start of the dietary intervention or disease treatment during the dietary intervention or disease treatment.
  • the subject is determined to have a positive response to a dietary intervention or disease treatment when the GMM index becomes near or above a predetermined level during a dietary intervention or disease treatment.
  • the predetermined level is -1.028883.
  • the invention provides a method of assessing the risk or risk of developing type 2 diabetes in a subject, comprising the steps of:
  • CAG NO.: 1 to 15 comprise SEQ ID NO.: 1 to 191, 192 to 326, 327 to 593, 594 to 835, 836 to 885, 886 to 960, 961 to 1097, 1098 to 1264, 1265 to 1433, respectively. , nucleic acid sequences shown in 1434 to 1684, 1685 to 1833, 1834 to 1979, 1980 to 2163, 2164 to 2447, and 2448 to 2783.
  • the DNA analysis in step b) comprises the steps of obtaining a DNA sequence and aligning the obtained DNA sequence with the nucleic acid sequences set forth in SEQ ID No.: 1 to 2783.
  • obtaining the DNA sequence comprises the steps of obtaining an original sequence read in the sample and processing the original sequence read to obtain a qualified sequence read.
  • the original sequence reads are obtained by PCR-based high throughput sequencing techniques. In some embodiments, the original sequence reads are obtained by Illumina sequencing.
  • processing the original sequence readout comprises: removing the adaptor, trimming the sequence at the 3' end until reaching the first nucleotide with a quality threshold above 20, removing the short sequence, and removing the matching to the human genome the sequence of.
  • the short sequence is 59 bp or less in length.
  • the alignment of the DNA sequences uses a seed extension strategy.
  • the abundance of each reference CAG is determined in step b) using sequences that are not mismatched in the seed sequence.
  • the seed sequence is 4 bp or greater in length, 5 bp or greater, 6 bp or greater, 7 bp or greater, 8 bp or greater, 9 bp or greater, 10 bp or greater, 11 bp or greater. 12 bp or more, 13 bp or more, 14 bp or more, 15 bp or more, 16 bp or more, 17 bp or more, 18 bp or more, or 19 bp or more.
  • the length of the seed sequence is 31 bp or less, 30 bp or less, 29 bp or less, 28 bp or less, 27 bp or less, 26 bp or less, 25 bp or less, 24 bp or less. , 23 bp or less, 22 bp or less, or 21 bp or less. In some embodiments, the seed sequence is 20 bp in length.
  • the predetermined level is about 4.4.
  • the invention provides a method of assessing the efficacy of a dietary or disease treatment in a subject having type 2 diabetes, comprising the steps of:
  • CAG NO.: 1 to 15 comprise SEQ ID NO.: 1 to 191, 192 to 326, 327 to 593, 594 to 835, 836 to 885, 886 to 960, 961 to 1097, 1098 to 1264, 1265 to 1433, respectively. , nucleic acid sequences shown in 1434 to 1684, 1685 to 1833, 1834 to 1979, 1980 to 2163, 2164 to 2447, and 2448 to 2783.
  • the DNA analysis in step b) comprises the steps of obtaining a DNA sequence and aligning the obtained DNA sequence with the nucleic acid sequences set forth in SEQ ID No.: 1 to 2783.
  • obtaining the DNA sequence comprises the steps of obtaining an original sequence read in the sample and processing the original sequence read to obtain a qualified sequence read.
  • the original sequence reads are obtained by PCR-based high throughput sequencing techniques. In some embodiments, the original sequence reads are obtained by Illumina sequencing.
  • processing the original sequence readout comprises: removing the adaptor, trimming the sequence at the 3' end until reaching the first nucleotide with a quality threshold above 20, removing the short sequence, and removing the matching to the human genome the sequence of.
  • the short sequence is 59 bp or less in length.
  • the alignment of the DNA sequences uses a seed extension strategy.
  • the abundance of each reference CAG is determined in step b) using sequences that are not mismatched in the seed sequence.
  • the seed sequence is 4 bp or greater in length, 5 bp or greater, 6 bp or greater, 7 bp or greater, 8 bp or greater, 9 bp or greater, 10 bp or greater, 11 bp or greater. 12 bp or more, 13 bp or more, 14 bp or more, 15 bp or more, 16 bp or more, 17 bp or more, 18 bp or more, or 19 bp or more.
  • the length of the seed sequence is 31 bp or less, 30 bp or less, 29 bp or less, 28 bp or less, 27 bp or less, 26 bp or less, 25 bp or less, 24 bp or less. , 23 bp or less, 22 bp or less, or 21 bp or less. In some embodiments, the seed sequence is 20 bp in length.
  • the stool sample is collected 1 week, 2 weeks, 3 weeks, and/or 4 weeks after the start of the dietary intervention or disease treatment during the dietary intervention or disease treatment.
  • the subject is determined to have a positive response to a dietary intervention or disease treatment when the ESP index becomes near or above a predetermined level during a dietary intervention or disease treatment.
  • the predetermined level is 4.4.
  • Figure 1 shows an overview of clinical trials in one embodiment.
  • Figure 2 shows that a high dietary fiber diet alters the intestinal microbiota and improves glucose homeostasis in patients with type 2 diabetes.
  • HbA1c Cyclic parameters of glucose homeostasis
  • MTT mean tolerance test
  • AUC area-under-curve
  • the box shows the median and interquartile range, and the whisker indicates the lowest and highest values within 1.5 times the IQR with the first and third quartiles, and the outlier is expressed as Point it alone.
  • Each pairwise comparison was analyzed within each group using Wilcoxon matching versus signed rank test (two-tailed). The Mann-Whitney test was used to analyze the difference between the W group and the U group at the same time point. *P ⁇ 0.05, **P ⁇ 0.01 and ***P ⁇ 0.001 (adjusted according to Benjamini & Hochberg, 1995).
  • W acarbose plus WTP diet;
  • U acarbose plus conventional care (control).
  • Figure 3 shows the improvement of glucose tolerance in mice by transplantation of a dietary microbiota supplemented with dietary fiber.
  • A body weight
  • B fasting blood glucose
  • C oral glucose tolerance test (2 weeks after transplantation)
  • D fasting circulating insulin in sterile mice transplanted with fecal microbiota .
  • W acarbose plus WTP diet
  • U acarbose plus conventional care (control).
  • Figure 5 shows potential ecosystem service providers (ESPs) and co-excluded adverse bacteria.
  • Genomics in the W group showed a reduction in abundance (A) or (B) after intervention, or a decrease in abundance (C) or (D) in the U group, showing participation in 154 high-quality genome sketches Distribution network of genes produced by short chain fatty acids (SCFA), H 2 S and sputum.
  • SCFA short chain fatty acids
  • H 2 S H 2 S and sputum.
  • the histogram of each gray circle represents the average abundance on day 0 and day 28 (log conversion).
  • the changes in bacterial abundance were determined according to those in Figure 4.
  • a line connecting a gray circle to other shapes indicates a gene involved in a specific activity.
  • Brown triangles indicate genes involved in H 2 S production; purple parallelograms represent genes involved in sputum production; green and blue shapes indicate genes involved in SCFA production.
  • Acetic acid synthesis tetrahydrofolate formate ligase.
  • Butyric acid synthesis butyryl CoA: acetic acid CoA transferase (But); butyryl CoA: acetoacetate CoA transferase (Ato; composed of ⁇ (AtoA) and ⁇ (AtoD) subunits); butyric acid kinase (Buk); Butyryl CoA: 4-hydroxybutyrate CoA transferase (4Hbt).
  • Propionic acid synthesis propionate CoA transferase / propionyl CoA: succinic acid CoA transferase (PCoAt).
  • E Changes in the abundance of ecosystem service providers. The size and color of the circle represent the mean abundance and abundance coefficient of variation of the strain, respectively.
  • W acarbose plus WTP diet;
  • U acarbose plus conventional care (control).
  • Figure 6 shows that a high fiber diet reduces endotoxin load and inflammation.
  • A Lipopolysaccharide binding protein.
  • B White blood cell count.
  • C TNF- ⁇ . Intra- and inter-group comparisons were performed using two-way repeated measures analysis of variance with Tukey's post hoc test. Day 0 with respect to the same group * P ⁇ 0.05, ** P ⁇ 0.01, *** P ⁇ 0.001; the same point in time with respect to U group # P ⁇ 0.05, ## P ⁇ 0.01, ### P ⁇ 0.001.
  • W acarbose plus WTP diet;
  • U acarbose plus conventional care (control).
  • Figure 7 shows the relationship between the abundance of bacterial CAG and the phenotypic reduction of type 2 diabetes.
  • Figure 8 shows that the abundance and diversity of ecosystem service providers (ESPs) are associated with a reduction in disease phenotype in patients with type 2 diabetes.
  • A Heat map of the relationship between the abundance of a single ESP and clinical variables. *P ⁇ 0.05 and **P ⁇ 0.01.
  • B ESP index Where A i is the change in the abundance of ESP i ).
  • CAG co-abundance gene set
  • size of CAG No.:i refers to the length of CAG No.:i, that is, the number of nucleotides of CAG No.:i.
  • biomarker refers to a measurable indicator of a biological state or condition.
  • the biomarker used herein is CAG, and its abundance data can indicate T2D.
  • the term "Receiver operating characteristic curve” or "ROC curve” refers to a graphical curve that exhibits the diagnostic capabilities of a binary classifier system as its discriminant threshold changes.
  • the ROC curve is generated by plotting the true positive rate versus the false positive rate at different threshold settings.
  • the true positive rate is also called sensitivity, recall rate or probability of detection.
  • the false positive rate is also referred to as a fall-out or false alarm probability and can be calculated as (1-specificity). Therefore, the ROC curve is the sensitivity as a function of the false alarm rate.
  • the term "Youden Index” refers to the difference between the true positive rate and the false positive rate. Maximizing this index allows the discovery of the best cut-off point independent of the prevalence rate from the ROC curve. The index is shown as the height above the opportunity line.
  • AUC area under the ROC curve
  • CAGs have been found to be commonly distributed in samples from T2D patients responding to high fiber diet intervention by scanning the entire intestinal microbiota. Of these CAGs, 15 were raised and 49 were down.
  • the GMM index and ESP index calculated based on the abundance of some of these CAGs or some of these CAGs in fecal samples can be used to assess the presence of T2D or the risk of developing T2D in a subject.
  • abundance changes in some of these CAGs or some of these CAGs can be used to monitor responses to disease treatment or dietary intervention in patients with T2D. Both methods can be performed in an efficient, accurate and patient-friendly manner.
  • the present invention provides a method of assessing the presence or risk of developing type 2 diabetes in a subject, comprising the steps of:
  • the present invention provides a method of assessing the efficacy of a dietary intervention or disease treatment in a subject with type 2 diabetes comprising the following steps:
  • the subject is determined to have a positive response to dietary intervention or disease treatment.
  • the present invention provides a method of assessing the risk or risk of developing type 2 diabetes in a subject, comprising the steps of:
  • the invention also provides a method of assessing the efficacy of a dietary or disease treatment in a subject having type 2 diabetes comprising the steps of:
  • the subject is determined to have a positive response to dietary intervention or disease treatment.
  • CAG NO.: 1 to 15 respectively include SEQ ID NO.: 1 to 191, 192 to 326, 327 to 593, 594 to 835, 836 to 885, 886 to 960, 961 to 1097, 1098 to 1264.
  • nucleic acid sequences of 1265 to 1433, 1434 to 1684, 1685 to 1833, 1834 to 1979, 1980 to 2163, 2164 to 2447, and 2448 to 2783, and CAG NO.: 16 to 64 include SEQ ID NO.: 2784, respectively.
  • the DNA sequence is obtained from a stool sample and subsequently aligned to the CAG sequence.
  • a seed extension strategy is used in the alignment of the DNA sequences and the abundance of each reference CAG is determined using sequences that are not mismatched in the seed sequence.
  • the seed sequence is 20 bp in length.
  • Obtaining the DNA sequence involves obtaining the original sequence reads in the sample and processing the original sequence reads to obtain acceptable sequence reads.
  • the original sequence reads are obtained by PCR-based high throughput sequencing techniques.
  • the original sequence reads are obtained by Illumina sequencing. Processing of the original sequence reads can be performed as is known in the art. In some cases, processing involves removing the adaptor, trimming the sequence at the 3' end until the first nucleotide with a quality threshold above 20 is reached, removing the short sequence, and removing the sequence that matches the human genome. In some embodiments, the short sequence is 59 bp or less in length.
  • a method for assessing the risk or risk of developing T2D in a subject if the GMM index or ESP index is near or below a predetermined level, it is determined that the subject has or is at risk of developing T2D.
  • the predetermined level can be set based on laboratory data or clinical data. Even if the level is predetermined, the hospital or doctor can adjust it according to the age, sex, physical condition, etc. of the subject.
  • the predetermined level is about -1.028883 for the GMM index. In a preferred embodiment of the invention, the predetermined level is about 4.4 for the ESP index.
  • the receiver operating characteristic curve is a graphical plot of the diagnostic capabilities of the binary classifier system as it varies with the threshold of discrimination.
  • the Youden index refers to the difference between the true positive rate and the false positive rate. The Youden Index is often used in conjunction with Receiver Operating Characteristic (ROC) analysis.
  • ROC Receiver Operating Characteristic
  • the index defines all points of the ROC curve, and the maximum value of the index can be used as a criterion for selecting the best cutoff point when the diagnostic test provides a numerical result rather than a binary result.
  • the binary number is set to 1. Accordingly, when the Youden index reaches a maximum, the GMM index is -1.028883; and when the Youden index reaches a maximum, the ESP index is 4.4.
  • the subject may have a HbA1c level of less than 6.5% with an accuracy of 90.48%; if the subject is determined to have a GMM index lower than or equal to -1.028883, then It may have a HbA1c level above 6.5% with an accuracy of 44.75%.
  • the ESP index if the subject is determined to have an ESP index above 4.4, it may have a HbA1c level of less than 6.5% with an accuracy of 92.11%; if the subject is determined to have an ESP index of less than or equal to 4.4, then It may have a HbA1c level above 6.5% with an accuracy of 45.52%.
  • the predetermined level is preferably about -1.028883, or for an ESP index, the predetermined level is preferably about 4.4, which is determined based on the corresponding ROC curve and the Younden index.
  • Participants were recruited from Chinese Han T2DM patients aged 35 to 70 years (6.5% ⁇ HbA1c ⁇ 12.0%).
  • the main exclusion criteria included: type 1 diabetes; pregnancy; lactation; pregnancy intended during the course of the study; severe diabetic complications (diabetic retinopathy, diabetic neuropathy, diabetic nephropathy, and diabetic foot); severe liver disease (including chronic persistence) Co-occurrence of hepatitis, cirrhosis, or positive hepatitis B virus surface antigen and abnormal liver transaminase (a serum concentration of alanine aminotransferase or aspartate aminotransferase > 2.5 ⁇ normal limit); within 3 months before recruitment Continuous use of antibiotics > 3 days; continuous use of weight loss drugs > 1 month; gastrointestinal surgery (in addition to appendicitis or sputum surgery); severe mental illness in the past 6 months; receiving medication to treat cholecystitis , peptic ulcer, urinary tract infection, acute pyelonephritis
  • Routine care consists of standard dietary and exercise recommendations based on the Chinese T2DM Diabetes Control Guide (2013 edition).
  • the WTP diet consists of three ready-to-eat pre-cooked foods: Formulation No. 1 (2), Formulation No. 2 (2), and Formulation No. 8 (manufactured by Perfect (China) Co., Ltd. (China Zhongshan)).
  • the WTP diet was administered in combination with an appropriate amount of vegetables, fruits and nuts as recommended by the nutritionist.
  • the input of constant nutrients is balanced according to the age-based standard nutritional requirements provided by the Chinese Dietary Reference Intake (DRI) and recommended by the Chinese Nutrition Society (CNS, 2013).
  • TCM Chinese medicine
  • oats buckwheat
  • white A precooked mixture of white beans, yellow corn, red beans, soybeans, yam, peanuts, lotus seeds and alfalfa, prepared in the form of canned porridge (370 g wet weight per can).
  • TCM Chinese medicine
  • Each contained 100 g of ingredients 59 g carbohydrate, 15 g protein, 5 g fat and 6 g fiber) and 336 kcal (70% carbohydrate, 17% protein, 13% fat).
  • Formulation No. 8 is a powder preparation for infusion (20 g per bag) containing bitter gourd (Momordica charantia) and oligosaccharides (including fructose-oligosaccharide and oligo-isomaltose).
  • Table 1 The detailed composition of Formulation No. 8 is shown in Table 1 below.
  • Formula No. 1 of ⁇ 360 g was consumed as a staple food, and No. 2 and No. 8 were consumed in 10 g and 15 g, respectively.
  • the nutrient intake was calculated based on the Chinese Food Ingredients Table 2009 39 using the dietary records of each subject (Table 2).
  • Acarbose was administered in an oral dose of 100 mg three times a day. Participants recorded their treatment options for diet, weight, drug use, and adverse events.
  • a data is mean ⁇ sem. ***P ⁇ 0.001 vs. W day 0; vs. U 84th day###P ⁇ 0.001.
  • Intra- and inter-group comparisons were performed using a two-way repeated measures analysis of variance with Bonferroni post hoc test.
  • the a intervention begins after the 2-week washout period of the above regular medication. Day -14 indicates the beginning of the washout period.
  • Biological samples, anthropometric data, and clinical laboratory analysis were obtained every 28 days at baseline and during the intervention. Venous blood samples were collected 10 hours after an overnight fast, and participants were then subjected to an oral glucose tolerance test for 3 hours. All participants ingested 75 g of glucose and obtained blood samples at 30, 60, 120 and 180 minutes. The blood sample was allowed to stand at room temperature for 30 minutes, and then centrifuged at 3,000 ⁇ g for 20 minutes to obtain serum. Collect feces and morning urine on the same day. Serum, urine and stool samples were collected, immediately transferred to dry ice and stored at -80 °C for additional analysis within 5 hours.
  • Fecal samples were collected from day 2 and day 84 from two female participants (2W009 from group W and 2U004 from group U). The two donors were systematically selected - the intestinal microbiota changes after intervention were determined among all participants, and those with no significant changes were excluded, and then one participant from each group was randomly selected as a representative donor.
  • Each fecal sample (0.5 g) was placed in an anaerobic chamber (80% N 2 : 10% CO 2 : 10% H 2 ) in 25 mL of sterile Ringer working buffer (9 g/L sodium chloride, 0.4 g/ Dilute in L potassium chloride, 0.25 g/L calcium chloride dihydrate and 0.05% (w/v) L-cysteine hydrochloride.
  • the fecal material was suspended by thorough vortexing (5 minutes) and allowed to settle by gravity for 5 minutes.
  • the clarified supernatant was transferred to a clean tube and an equal volume of 20% (w/v) skim milk (LP0031, Oxoid, UK) was added.
  • the inoculum was freshly prepared on the day of the experiment and the remainder was stored at -80 °C until the second inoculation.
  • the mixture was cultured using a plate coating method under the following conditions: 1) For aerobic bacteria, cultured on LB agar, brain heart infusion agar and thioglycollate agar under aerobic conditions at 37 ° C; 2) Anaerobic bacteria cultured on gibber anaerobic medium (GAM) under anaerobic conditions at 37 ° C; and 3) improved for fungi under aerobic conditions at 25 ° C to 28 ° C Cultured on Martin's agar and soybean tryptone agar. All cultures were examined under light microscope after 1, 2, 4, 7 and 14 days.
  • mice were fed ad libitum with sterile conventional diet (SLAC, Shanghai, China). Bacterial contamination is monitored by regular bacteriological examination of feces, food and padding. At 6 weeks of age, sterile mice were housed in separate cages and randomly divided into 4 groups (each group maintained in a separate isolator).
  • OGTT oral glucose tolerance test
  • Metagenomic sequencing DNA was extracted from fecal samples as previously described (2) and sequenced using Illumina HiSeq 3000 from GENEWIZ Co. (Beijing, China). Cluster generation, template hybridization, isothermal amplification, linearization, and blocking denaturation and hybridization of sequencing primers were performed according to the workflow specified by the service provider. A library with an insert size of about 500 bp was constructed, followed by high throughput sequencing to obtain a double ended read with 150 bp in the forward and reverse directions.
  • CAG Common Abundance Gene Set
  • the mass of the assembly 1) 90% of the genomic assembly must be included in the contig (>500bp); 2) 90% of the assembled base must be in the >5x readout coverage; 3) the contig N50 must be >5kb; 4) The scaffold N50 must be >20 kb; 5) the average contig length must be >5 kb; and 6) >90% of the core genes must be present in the assembly.
  • a phylogenetic tree was constructed using the CVtree 3.0 web server (12) with 154 bacterial CAGs with high quality assemblies, 352 reference gastrointestinal genomes from the HMP DACC database, and a server built-in database.
  • CVtree 3.0 web server (12) with 154 bacterial CAGs with high quality assemblies, 352 reference gastrointestinal genomes from the HMP DACC database, and a server built-in database.
  • SpecI (13) to map bacterial CAGs, which are based on 40 universal single-copy phylogenetic marker genes that group organisms into species clusters.
  • Low quality CAG was aligned with 7,991 reference genomes from the NCBI database at both protein (BLASTP) and nucleotide (BLASTN) levels. The alignment results were filtered using query coverage (>70%) and E values (at nucleotide levels, ⁇ 1e-10; at protein levels, ⁇ 1e-5).
  • Assigning CAG to species or genus based on previously described taxonomic allocation thresholds (14) (species level: 90% of genes can be mapped to species genomes with >95% identity at the DNA level; genus: 80% The genes can be mapped to genus with >85% identity at both DNA and protein levels).
  • HbA1c Glycated hemoglobin
  • Patients in the W group also attenuated a significantly greater percentage of body weight compared to the U group and showed a significant improvement in lipid mass spectrometry and inflammation levels.
  • Example 2 High fiber intervention regulates the overall structure of the intestinal microbiota in patients with T2DM
  • Shotgun metagenomic sequencing was performed on 172 stool samples collected at 4 time points (days 0, 28, 56 and 84). From the non-redundant gene catalog of 4,893,833 microbial genes, 422 co-abundance gene sets (CAG; using Canopy-based algorithm (19) binning) were identified as different bacterial genomes. Based on the Bray-Curtis distance from 422 bacterial CAGs, the overall structure of the intestinal microbiota (as indicated by the principal coordinate analysis) showed significant changes from day 0 to day 28 in both groups, after which no further changes were made (Fig. 2B).
  • Example 3 Transplantation demonstrates the causal contribution of the intestinal microbiota to alleviating T2DM
  • mice receiving the post-intervention microbial population from Group W had significantly lower body weight (Fig. 3A). These mice also had the lowest fasting and postprandial blood glucose levels associated with fasting insulin levels when compared to those transplanted with pre-intervention microbial populations from group W or microbial populations at any time point from group U. Effect ( Figures 3B to 3D). Our intervention was determined by the metastatic effect of microbial transplantation, and changes in intestinal microbiota caused by high dietary fiber contribute to the improvement of glucose homeostasis in T2DM patients.
  • High quality genome sketches were assembled to identify bacterial species/strains that drive dietary fiber to reduce intestinal specific effects of the T2DM phenotype.
  • 154 high-quality genome sketches were assembled from CAG shared by >20% of samples. The total readout percentage for each sample mapped to these high quality genome sketches was 57% ( ⁇ 11%), which represents both the predile and dominant enteric bacteria in the entire cohort.
  • 141 of the 154 high-quality genome sketches have at least one key gene for SCFA production and can be considered a SCFA producer.
  • 64 bacteria were selected for further analysis because: 1) it was an intervention-responsive CAG identified by Wilcoxon matching for the signed rank test, such as intervention in the W or U group on day 28 significantly altered (FIG.
  • These 15 bacteria can be used in the W group to serve supplements.
  • the important purpose of acetic acid and butyric acid and thus can be the essential function of the ecosystem service provider (ESP).
  • ESP ecosystem service provider
  • Efficient energy production from carbohydrates and tolerance to low pH can explain why these bacteria have a competitive advantage over other SCFA producers.
  • Bifidobacterium which is capable of producing more ATP molecules and acetic acid using its "bifold" pathway (21) compared to other acetic acid producers.
  • the 49 bacteria which were significantly down-regulated in either of the two groups were those having genes for synthesizing lipopolysaccharide, purine and H 2 S. Moreover, according to the analysis of the gene center pathway, this suggests that a reduced ability to produce metabolically unfavorable compounds may contribute to the beneficial effects of a high dietary fiber diet. Endotoxin production has been shown to reduce inflammation and restore insulin sensitivity (22, 23). Lipopolysaccharide binding proteins (a surrogate marker for endotoxin load) and inflammatory markers were lower in the W group than in the U group, indicating that the reduction in inflammation may be due to a decrease in endotoxin production (Figure 6).
  • Indole and H 2 S-producing bacteria is reduced abundance improved suppression of generation of GLP-1 (24-26), which is consistent with a large meal observed in the group W in response to GLP-1.
  • reducing bacteria that produce adverse metabolites achieves a clinically significant improvement in the host.
  • CAG0023, CAG0033, CAG0037, CAG0045, CAG0046, CAG0064, CAG0079, CAG0106, CAG0133, CAG0153, CAG0155, CAG0207, CAG0224, CAG0236 and CAG0409 are represented in the present invention as CAG NO.: 1 to 15, respectively.
  • the gut microbiota (GMM) index for each sample was calculated based on 15 ESP and abundance data of 49 ESPs reduced after intervention.
  • the ESP index followed a similar trajectory in both W and U groups, from a sharp increase from baseline to day 28 and remained at a similar level for the remainder of the intervention, but the index was in group W at each post-intervention time point. Significantly higher (days 28, 56 and 84, Figure 8B).
  • ROC receiver operating characteristic curve
  • ROC receiver operating characteristic curve

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Analytical Chemistry (AREA)
  • Organic Chemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Genetics & Genomics (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Biotechnology (AREA)
  • Immunology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • Pathology (AREA)
  • Microbiology (AREA)
  • Medical Informatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Public Health (AREA)
  • Primary Health Care (AREA)
  • Evolutionary Biology (AREA)
  • Hematology (AREA)
  • Urology & Nephrology (AREA)
  • Theoretical Computer Science (AREA)
  • Epidemiology (AREA)
  • General Physics & Mathematics (AREA)
  • Medicinal Chemistry (AREA)
  • Food Science & Technology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Cell Biology (AREA)
  • Nutrition Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

基于数个CAG的丰度数据来在对象中评估2型糖尿病之存在或发生风险的方法,以及基于这些CAG的丰度数据来在患有2型糖尿病的对象中评价饮食干预或疾病治疗之效力的方法。

Description

用于2型糖尿病的生物标志物及其用途
本申请要求于2018年01月31日提交中国专利局、申请号为201810143729.2、发明名称为“用于2型糖尿病的生物标志物及其用途”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本发明涉及基于数个CAG的丰度数据来在对象中评估2型糖尿病之存在或发生风险的方法。本发明还提供了基于这些CAG的丰度数据来在患有2型糖尿病的对象中评价饮食干预或疾病治疗之效力的方法。
背景技术
肠微生物群(gut microbiota)为人宿主提供很多有益功能。这些功能中一些对于我们而言是必需的,因为在我们自身的基因组中我们不对其进行编码。从生态学角度考虑,这样的功能可以被视为“生态系统服务”(1)。在功能上,“健康的”肠微生物群是能够提供所需的所有生态系统服务的肠微生物群。短链脂肪酸(short-chain fatty acid,SCFA)产生是由肠细菌提供的此类服务的最显著实例。关于人如何可以直接受益于SCFA,已存在大量文献:例如,丁酸盐(butyrate)是结肠细胞的主要能量底物,并且很多种SCFA作为调节炎症和食欲调控的信号传导分子发挥功能(2)。因此,向人供应SCFA的细菌是生态系统服务提供者(ecosystem service provider,ESP)和用于使人宿主保持健康的肠微生物群的关键成员。
SCFA产生者的缺陷与生态失调相关性疾病(例如2型糖尿病(type 2 diabetes mellitus,T2DM))相关(3-6)。已经表明,使用高膳食纤维饮食的临床试验减轻T2DM的疾病表型,但是在个体之间的治疗响应差异很大(7-9),这可能是由于肠微生物群中SCFA产生者的个体特异性谱导致的(10)。
然而,鉴定用于SCFA产生以改善T2DM的ESP并不是容易。将有机化合物发酵成SCFA的能力是众多分类中的数百种肠细菌物种所共有的遗传性状(11)。由于对肠腔中酸度的耐受性不同,一些SCFA产生者可以在竞争中胜 过另一些SCFA产生者(12,13)。这需要区分具有产生SCFA的遗传能力的“产生者”与实际上在特殊肠环境中对碳水化合物进行发酵并供应SCFA的“提供者”。我们最近的研究进一步表明了丁酸和乙酸产生性物种针对高膳食纤维饮食的菌株特异性响应(14,15)。这需要菌株水平的全微生物组关联方法以鉴定作为响应于高膳食纤维摄入的针对人宿主的实际SCFA供应者的ESP。
发明内容
本申请使用鸟枪宏基因组测序来揭示T2D患者中响应于高纤维干预的肠微生物组变化。结果,发现15个表示为CAG NO.:1至15的CAG(共丰度基因集((co-abundance group)))上调并鉴定为ESP,而49个表示为CAG NO.:16至64的CAG在T2D患者中下调。这些CAG可以用作对T2D进行高效、准确且患者友好表征的生物标志物。
在一方面,本发明提供了在对象中评估2型糖尿病的存在或发生风险的方法,其包括以下步骤:
a)从对象收集粪便样品;
b)分析从粪便样品提取的DNA以确定选自CAG No.:1至64的每个参考CAG的丰度:
A i(CAG No:i的丰度)=与CAG No.:i匹配的读出(read)的数目/(CAG No.:i的大小×总读出的数目);
c)使用所计算的丰度数据来计算每个样品的GMM指数:
Figure PCTCN2019074162-appb-000001
以及
d)如果GMM指数接近或低于预定水平,则确定对象患有或有风险发生2型糖尿病,
其中CAG NO.:1至15分别包含SEQ ID NO.:1至191、192至326、327至593、594至835、836至885、886至960、961至1097、1098至1264、1265至1433、1434至1684、1685至1833、1834至1979、1980至2163、2164至2447和2448至2783所示的核酸序列,并且CAG NO.:16至64分别包含SEQ ID NO.:2784至2961、2962至3130、3131至3525、3526至3747、3748至3863、3864至4068、4069至4212、4213至4393、4394至4532、4533至4891、4892 至4979、4980至5116、5117至5320、5321至5464、5465至5781、5782至6279、6280至6646、6647至6954、6955至7178、7179至7613、7614至7758、7759至8046、8047至8491、8492至8546、8547至9971、9972至10099、10100至10392、10393至10502、10503至10694、10695至10986、10987至11089、11090至11262、11263至11466、11467至11704、11705至12034、12035至12113、12114至12341、12342至12454、12455至12664、12665至12825、12826至13042、13403至13500、13501至13726、13727至13949、13950至14014、14015至14290、14291至14403、14404至14686和14687至14850所示的核酸序列。
在一些实施方案中,步骤b)中的DNA分析包括以下步骤:获得DNA序列并将所获得的DNA序列与SEQ ID No.:1至14850所示的核酸序列进行比对。
在一些实施方案中,获得DNA序列包括以下步骤:在样品中获得原始序列读出并对原始序列读出进行处理以获得合格的序列读出。
在一些实施方案中,原始序列读出通过基于PCR的高通量测序技术来获得。在一些实施方案中,原始序列读出通过Illumina测序来获得。
在一些实施方案中,对原始序列读出进行处理包括:去除衔接子,在3’端修剪序列直至到达质量阈值高于20的第一个核苷酸,去除短序列,以及去除与人基因组匹配的序列。在一些实施方案中,短序列的长度为59bp或更小。
在一些实施方案中,DNA序列的比对使用种子延伸策略(seed-and-extend strategy)。在一些实施方案中,使用在种子序列中无错配的序列来在步骤b)中确定每个参考CAG的丰度。在一些实施方案中,种子序列的长度为4bp或更大、5bp或更大、6bp或更大、7bp或更大、8bp或更大、9bp或更大、10bp或更大、11bp或更大、12bp或更大、13bp或更大、14bp或更大、15bp或更大、16bp或更大、17bp或更大、18bp或更大、或者19bp或更大。在一些实施方案中,种子序列的长度为31bp或更小、30bp或更小、29bp或更小、28bp或更小、27bp或更小、26bp或更小、25bp或更小、24bp或更小、23bp或更小、22bp或更小、或者21bp或更小。在一些实施方案中,种子序列的长度为20bp。
在一些实施方案中,预定水平为约-1.028883。
在第二方面,本发明提供了在患有2型糖尿病的对象中评价饮食干预或疾病治疗的效力的方法,其包括以下步骤:
a)在饮食干预或疾病治疗之前和期间从对象收集粪便样品;
b)分析从粪便样品提取的DNA以确定选自CAG No.:1至64的每个参考CAG的丰度:
A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目);
c)使用所计算的丰度数据来计算每个样品的GMM指数:
Figure PCTCN2019074162-appb-000002
以及
e)如果在于饮食干预或疾病治疗期间收集的样品中GMM指数提高,则确定对象对饮食干预或疾病治疗作出积极响应,
其中CAG NO.:1至15分别包含SEQ ID NO.:1至191、192至326、327至593、594至835、836至885、886至960、961至1097、1098至1264、1265至1433、1434至1684、1685至1833、1834至1979、1980至2163、2164至2447和2448至2783所示的核酸序列,并且CAG NO.:16至64分别包含SEQ ID NO.:2784至2961、2962至3130、3131至3525、3526至3747、3748至3863、3864至4068、4069至4212、4213至4393、4394至4532、4533至4891、4892至4979、4980至5116、5117至5320、5321至5464、5465至5781、5782至6279、6280至6646、6647至6954、6955至7178、7179至7613、7614至7758、7759至8046、8047至8491、8492至8546、8547至9971、9972至10099、10100至10392、10393至10502、10503至10694、10695至10986、10987至11089、11090至11262、11263至11466、11467至11704、11705至12034、12035至12113、12114至12341、12342至12454、12455至12664、12665至12825、12826至13042、13403至13500、13501至13726、13727至13949、13950至14014、14015至14290、14291至14403、14404至14686和14687至14850所示的核酸序列。
在一些实施方案中,步骤b)中的DNA分析包括以下步骤:获得DNA序列并将获得的DNA序列与SEQ ID No.:1至14850所示的核酸序列进行比对。
在一些实施方案中,获得DNA序列包括以下步骤:在样品中获得原始序 列读出并对原始序列读出进行处理以获得合格的序列读出。
在一些实施方案中,原始序列读出通过基于PCR的高通量测序技术来获得。在一些实施方案中,原始序列读出通过Illumina测序来获得。
在一些实施方案中,对原始序列读出进行处理包括:去除衔接子,在3’端修剪序列直至到达质量阈值高于20的第一个核苷酸,去除短序列,以及去除与人基因组匹配的序列。在一些实施方案中,短序列的长度为59bp或更小。
在一些实施方案中,DNA序列的比对使用种子延伸策略。在一些实施方案中,使用在种子序列中无错配的序列来在步骤b)中确定每个参考CAG的丰度。在一些实施方案中,种子序列的长度为4bp或更大、5bp或更大、6bp或更大、7bp或更大、8bp或更大、9bp或更大、10bp或更大、11bp或更大、12bp或更大、13bp或更大、14bp或更大、15bp或更大、16bp或更大、17bp或更大、18bp或更大、或者19bp或更大。在一些实施方案中,种子序列的长度为31bp或更小、30bp或更小、29bp或更小、28bp或更小、27bp或更小、26bp或更小、25bp或更小、24bp或更小、23bp或更小、22bp或更小、或者21bp或更小。在一些实施方案中,种子序列的长度为20bp。
在一个实施方案中,在饮食干预或疾病治疗期间,在饮食干预或疾病治疗开始后1周、2周、3周和/或4周收集粪便样品。
在一些实施方案中,当在饮食干预或疾病治疗期间GMM指数变得接近或高于预定水平时,确定对象对饮食干预或疾病治疗产生积极响应。在一些实施方案中,预定水平为-1.028883。
在第三方面,本发明提供了在对象中评估2型糖尿病的存在或发生风险的方法,其包括以下步骤:
a)从对象收集粪便样品;
b)分析从粪便样品提取的DNA以确定选自CAG ID No.:1至15的每个参考CAG的丰度:
A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目);
c)使用所计算的丰度数据来计算每个样品的ESP指数:
Figure PCTCN2019074162-appb-000003
其中Heip=(e H-1)/14,
Figure PCTCN2019074162-appb-000004
以及
d)如果ESP指数接近或低于预定水平,则确定对象患有或有风险发生2型糖尿病,
其中CAG NO.:1至15分别包含SEQ ID NO.:1至191、192至326、327至593、594至835、836至885、886至960、961至1097、1098至1264、1265至1433、1434至1684、1685至1833、1834至1979、1980至2163、2164至2447和2448至2783所示的核酸序列。
在一些实施方案中,步骤b)中的DNA分析包括以下步骤:获得DNA序列并将所获得的DNA序列与SEQ ID No.:1至2783所示的核酸序列进行比对。
在一些实施方案中,获得DNA序列包括以下步骤:在样品中获得原始序列读出并对原始序列读出进行处理以获得合格的序列读出。
在一些实施方案中,原始序列读出通过基于PCR的高通量测序技术来获得。在一些实施方案中,原始序列读出通过Illumina测序来获得。
在一些实施方案中,对原始序列读出进行处理包括:去除衔接子,在3’端修剪序列直至到达质量阈值高于20的第一个核苷酸,去除短序列,并去除与人基因组匹配的序列。在一些实施方案中,短序列的长度为59bp或更小。
在一些实施方案中,DNA序列的比对使用种子延伸策略。在一些实施方案中,使用在种子序列中无错配的序列来在步骤b)中确定每个参考CAG的丰度。在一些实施方案中,种子序列的长度为4bp或更大、5bp或更大、6bp或更大、7bp或更大、8bp或更大、9bp或更大、10bp或更大、11bp或更大、12bp或更大、13bp或更大、14bp或更大、15bp或更大、16bp或更大、17bp或更大、18bp或更大、或者19bp或更大。在一些实施方案中,种子序列的长度为31bp或更小、30bp或更小、29bp或更小、28bp或更小、27bp或更小、26bp或更小、25bp或更小、24bp或更小、23bp或更小、22bp或更小、或者21bp或更小。在一些实施方案中,种子序列的长度为20bp。
在一些实施方案中,预定水平为约4.4。
在第四方面,本发明提供了在患有2型糖尿病的对象中评价饮食干预或疾病治疗之效力的方法,其包括以下步骤:
a)在饮食干预或疾病治疗之前和期间从对象收集粪便样品;
b)分析从粪便样品提取的DNA以确定选自CAG ID No.:1至15的每个参考CAG的丰度:
A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目);
c)使用所计算的丰度数据来计算每个样品的ESP指数:
Figure PCTCN2019074162-appb-000005
其中Heip=(e H-1)/14,
Figure PCTCN2019074162-appb-000006
以及
e)如果在于饮食干预或疾病治疗期间收集的样品中ESP指数提高,则确定对象对饮食干预或疾病治疗产生积极响应,
其中CAG NO.:1至15分别包含SEQ ID NO.:1至191、192至326、327至593、594至835、836至885、886至960、961至1097、1098至1264、1265至1433、1434至1684、1685至1833、1834至1979、1980至2163、2164至2447和2448至2783所示的核酸序列。
在一些实施方案中,步骤b)中的DNA分析包括以下步骤:获得DNA序列并将所获得的DNA序列与SEQ ID No.:1至2783所示的核酸序列进行比对。
在一些实施方案中,获得DNA序列包括以下步骤:在样品中获得原始序列读出并对原始序列读出进行处理以获得合格的序列读出。
在一些实施方案中,原始序列读出通过基于PCR的高通量测序技术来获得。在一些实施方案中,原始序列读出通过Illumina测序来获得。
在一些实施方案中,对原始序列读出进行处理包括:去除衔接子,在3’端修剪序列直至到达质量阈值高于20的第一个核苷酸,去除短序列,并去除与人基因组匹配的序列。在一些实施方案中,短序列的长度为59bp或更小。
在一些实施方案中,DNA序列的比对使用种子延伸策略。在一些实施方案中,使用在种子序列中无错配的序列来在步骤b)中确定每个参考CAG的丰度。在一些实施方案中,种子序列的长度为4bp或更大、5bp或更大、6bp或更大、7bp或更大、8bp或更大、9bp或更大、10bp或更大、11bp或更大、12bp或更大、13bp或更大、14bp或更大、15bp或更大、16bp或更大、17bp或更大、18bp或更大、或者19bp或更大。在一些实施方案中,种子序列的长度为31bp或更小、30bp或更小、29bp或更小、28bp或更小、27bp 或更小、26bp或更小、25bp或更小、24bp或更小、23bp或更小、22bp或更小、或者21bp或更小。在一些实施方案中,种子序列的长度为20bp。
在一个实施方案中,在饮食干预或疾病治疗期间,在饮食干预或疾病治疗开始后1周、2周、3周和/或4周收集粪便样品。
在一些实施方案中,当在饮食干预或疾病治疗期间ESP指数变得接近或高于预定水平时,确定对象对饮食干预或疾病治疗产生积极响应。在一些实施方案中,预定水平为4.4。
根据以下不应被解释为限制性的详细描述和实例,本公开内容的其他特征和优点将是明显的。在本申请通篇引用的所有参考文献、Genbank条目、专利和公开专利申请的内容均通过引用明确地并入本文。
附图说明
图1示出了一个实施例中临床试验的概况。
图2示出了高膳食纤维饮食在患有2型糖尿病的患者中改变肠微生物群并且改善葡萄糖体内平衡。(A)葡萄糖体内平衡的循环参数(HbA1c、空腹血糖、膳食耐量测试(meal tolerance test,MTT)中的葡萄糖和胰岛素曲线下面积(area-under-curve,AUC))的变化。数据表示为从第0天起的百分比变化(±标准误差)。使用具有Tukey事后检验的双向重复测量方差分析进行组内和组间比较。相对于同一组的第0天*P<0.05、**P<0.01和***P<0.001;相对于相同时间点的U组 #P<0.05、 ##P<0.01和 ###P<0.001。对于所有分析,对于W组,N=27;并且对于U组,n=16,除在针对MTT的U组中n=15之外。(B)整体肠微生物结构的变化。基于Bray-Curtis距离对422个细菌共丰度基因集进行主坐标分析。(C)肠微生物群多样性(基因丰富度)的变化。将基因计数的变化调整至每个样品3100万个映射读出。数据示为平均值±S.E.M。箱(box)示出了中位数和四分位数间距,须(whisker)表示在与第一和第三四分位数的1.5倍IQR内的最低值和最高值,并且异常值表示为单独点。使用Wilcoxon匹配对符号秩检验(双尾的)来在每个组内分析每个成对比较。使用Mann-Whitney检验来分析在相同时间点W组与U组之间的差异。*P<0.05、**P<0.01和***P<0.001(根据Benjamini&Hochberg,1995调整)。W=阿卡波糖加WTP饮食;U=阿卡波糖加常规护理(对照)。
图3示出了以膳食纤维补充性的肠微生物群的移植在小鼠中改善葡萄糖耐量。接受粪便微生物群移植的无菌小鼠的(A)体重、(B)空腹血糖(fasting blood glucose,FBG)、(C)口服葡萄糖耐量测试(在移植后2周)和(D)空腹循环胰岛素。移植物材料从代表性供体获得,一个来自W组,一个来自U组,干预前(“Pre”;第0天)和干预后(“Post”;第84天)二者皆有。接受移植物的小鼠:对于W-Pre、W-Post、U-Pre,n=5;对于U-Post,n=4。*P<0.05、**P<0.01和***P<0.001,使用具有Tukey事后检验的单向ANOVA进行组内和组间比较。W=阿卡波糖加WTP饮食;U=阿卡波糖加常规护理(对照)。
图4示出了表示(A)W组或(B)U组中干预响应性细菌的丰度(经log转换)的热图(使用Wilcoxon匹配对符号秩检验来比较第0天和第28天的数据。P<0.05,根据Benjamini&Hochberg,1995调整)。用Spearman相关系数和ward连接对细菌进行聚类。对于W,n=27;对于U,n=16。
图5示出了潜在的生态系统服务提供者(ESP)和共排除的不利细菌。针对W组中在干预之后丰度(A)降低或(B)提高,或者U组中在干预之后丰度(C)降低或(D)提高的基因组示出了154个高质量基因组草图中参与短链脂肪酸(SCFA)、H 2S和吲哚产生的基因的分布网络。紧邻每个灰色圆圈(鉴定为细菌菌株的高质量基因组草图)的直方图表示第0天和第28天的平均丰度(经log转换)。细菌丰度的变化根据图4中的那些来确定。将灰色圆圈与其他形状连接的线表示参与特定活性的基因。褐色三角形表示参与H 2S产生的基因;紫色平行四边形表示参与吲哚产生的基因;绿色和蓝色形状表示参与SCFA产生的基因。乙酸合成:甲酸四氢叶酸连接酶。丁酸合成:丁酰CoA:乙酸CoA转移酶(But);丁酰CoA:乙酰乙酸CoA转移酶(Ato;由α(AtoA)和β(AtoD)亚基组成);丁酸激酶(Buk);丁酰CoA:4-羟基丁酸CoA转移酶(4Hbt)。丙酸合成:丙酸CoA转移酶/丙酰CoA:琥珀酸CoA转移酶(PCoAt)。(E)生态系统服务提供者的丰度的变化。圆圈的大小和颜色分别表示菌株的平均丰度和丰度变异系数。W=阿卡波糖加WTP饮食;U=阿卡波糖加常规护理(对照)。
图6显示,高纤维饮食降低内毒素载量和炎症。(A)脂多糖结合蛋白。(B)白细胞计数。(C)TNF-α。使用具有Tukey事后检验的双向重复测量方差分析进行组内和组间比较。相对于同一组的第0天*P<0.05、**P<0.01、***P<0.001;相对于相同时间点的U组 #P<0.05、 ##P<0.01、 ###P<0.001。对于W组,N=27;对于U组,n=16。W=阿卡波糖加WTP饮食;U=阿卡波糖加常 规护理(对照)。
图7示出了细菌CAG的丰度与2型糖尿病的表型减轻之间的关系。(A-B)W组(A)和U组(B)中由细菌CAG的丰度与临床变量的水平之间的Spearman相关系数计算的热图:*=P<0.05、**=P<0.01(根据Benjamini&Hochberg,1995调整)。用Spearman相关系数和ward连接基于细菌的量对其进行聚类。(C)在GUT2DM项目中,HbA1c的干预后水平与肠微生物群调节(Gut Microbiota Modulation,GMM)指数负相关(Spearman相关系数(SCC)=-0.4901,P=1.0253e -11),所述指数为训练数据集(W组中27位患者和U组中16位患者)中增加的15个ESP除以降低的49个的丰度。(D)在测试性QIDONG临床试验中,HbA1c的干预后水平与74位患者的测试数据集中的15个ESP及其49种共排除细菌的肠微生物群调节(GMM)指数也负相关(SCC=-0.4006,P=4.53e -7),所述患者全部在无阿卡波糖下接受高纤维饮食3个月。
图8显示,生态系统服务提供者(ESP)的丰度和多样性与2型糖尿病患者中疾病表型的减轻相关。(A)单一ESP的丰度与临床变量之间的关系的热图。*P<0.05和**P<0.01。(B)ESP指数
Figure PCTCN2019074162-appb-000007
其中A i是ESP i的丰度)的变化。(C)GUT2D研究中ESP指数(第0天和第84天)与HbA1c(第0天和第84天)之间的关系。N=43。(D)GUT2D研究中ESP指数(第0天和第28天)与HbA1c(第0天和第84天)之间的关系。N=43。(E)QIDONG研究中ESP指数(第0天和第84天)与HbA1c(第0天和第84天)之间的关系。N=71。所有相关系数均使用Bland和Altman(16)所述的方法来计算。W=阿卡波糖加WTP饮食;U=阿卡波糖加常规护理(对照)。
具体实施方式
为了可以更容易地理解本公开内容,在此对某些术语进行限定。另外的限定在具体实施方式中阐述。
术语“共丰度基因集”或“CAG”指在丰度方面与随机挑选的种子基因相关的基因的集合。将宏基因组分离成具有类似丰度的基因集允许鉴定例如原核生物和噬菌体的生物实体,以及代表共遗传的克隆异质性的小遗传实体。
本文使用的术语“CAG No.:i的大小”指CAG No.:i的长度,即CAG No.:i的核苷酸的数目。
术语“生物标志物”指某种生物状态或状况的可测量指标。本文使用的生物标志物是CAG,其丰度数据可以指示T2D。
本文使用的术语“接受者操作特征曲线(Receiver operating characteristic curve)”或“ROC曲线”指对二进制分类器系统随着其鉴别阈值变化的诊断能力进行展示的图形曲线。ROC曲线通过在不同阈值设置下将真阳性率相对于假阳性率绘图来产生。真阳性率也称为灵敏度、召回率或检出概率。假阳性率也称为误警率(fall-out)或虚警(false alarm)概率,并且可以作为(1-特异度)计算。因此,ROC曲线是作为误警率的函数的灵敏度。
术语“Youden指数”指真阳性率与假阳性率之间的差异。使该指数最大化允许从ROC曲线发现独立于流行率的最佳截止点。该指数图示为在机会线之上的高度。
本文使用的术语“ROC曲线下面积”或“AUC”用于表示将受试组群分离成患有所讨论疾病和未患所讨论疾病的那些的检验的准确度。
在本发明中,通过扫描整个肠微生物组,已发现数个CAG在来自响应于高纤维饮食干预的T2D患者的样品中普遍分布。在这些CAG中,15个上调,而49个下调。基于这些CAG或这些CAG中一些在粪便样品中的丰度计算的GMM指数和ESP指数可用于在对象中评估T2D的存在或发生T2D的风险。或者,这些CAG或这些CAG中一些的丰度变化可用于在患有T2D的患者中监测针对疾病治疗或饮食干预的响应。两种方法都可以以高效、准确且患者友好的方式进行。
本发明提供了在对象中评估2型糖尿病之存在或发生风险的方法,其包括以下步骤:
a)从对象收集粪便样品;
b)分析从粪便样品提取的DNA以确定选自CAG ID No.:1至64的每个参考CAG的丰度:
A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目);
c)使用所计算的丰度数据来计算每个样品的GMM指数:
Figure PCTCN2019074162-appb-000008
以及
d)如果GMM指数接近或低于预定水平,则确定对象患有或有风险发生2型糖尿病。
本发明提供了在患有2型糖尿病的对象中评价饮食干预或疾病治疗的效力的方法,其包括以下步骤:
a)在饮食干预或疾病治疗之前和期间从对象收集粪便样品;
b)分析从粪便样品提取的DNA以确定选自CAG ID No.:1至64的每个参考CAG的丰度:
A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目);
c)使用所计算的丰度数据来计算每个样品的GMM指数:
Figure PCTCN2019074162-appb-000009
以及
e)如果在于饮食干预或疾病治疗期间收集的样品中GMM指数提高,则确定对象对饮食干预或疾病治疗产生积极响应。
对于ESP指数方面,本发明提供了在对象中评估2型糖尿病的存在或发生风险的方法,其包括以下步骤:
a)从对象收集粪便样品;
b)分析从粪便样品提取的DNA以确定选自CAG ID No.:1至15的每个参考CAG的丰度:
A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目);
c)使用所计算的丰度数据来计算每个样品的ESP指数:
Figure PCTCN2019074162-appb-000010
其中Heip=(e H-1)/14,
Figure PCTCN2019074162-appb-000011
以及
d)如果ESP指数接近或低于预定水平,则确定对象患有或有风险发生2型糖尿病。
本发明还提供了在患有2型糖尿病的对象中评价饮食干预或疾病治疗之效力的方法,其包括以下步骤:
a)在饮食干预或疾病治疗之前和期间从对象收集粪便样品;
b)分析从粪便样品提取的DNA以确定选自CAG ID No.:1至15的每个参考CAG的丰度:
A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目);
c)使用所计算的丰度数据来计算每个样品的ESP指数:
Figure PCTCN2019074162-appb-000012
其中Heip=(e H-1)/14,
Figure PCTCN2019074162-appb-000013
以及
e)如果在于饮食干预或疾病治疗期间收集的样品中ESP指数提高,则确定对象对饮食干预或疾病治疗产生积极响应。
在本发明中,CAG NO.:1至15分别包含SEQ ID NO.:1至191、192至326、327至593、594至835、836至885、886至960、961至1097、1098至1264、1265至1433、1434至1684、1685至1833、1834至1979、1980至2163、2164至2447和2448至2783所示的核酸序列,并且CAG NO.:16至64分别包含SEQ ID NO.:2784至2961、2962至3130、3131至3525、3526至3747、3748至3863、3864至4068、4069至4212、4213至4393、4394至4532、4533至4891、4892至4979、4980至5116、5117至5320、5321至5464、5465至5781、5782至6279、6280至6646、6647至6954、6955至7178、7179至7613、7614至7758、7759至8046、8047至8491、8492至8546、8547至9971、9972至10099、10100至10392、10393至10502、10503至10694、10695至10986、10987至11089、11090至11262、11263至11466、11467至11704、11705至12034、12035至12113、12114至12341、12342至12454、12455至12664、12665至12825、12826至13042、13403至13500、13501至13726、13727至13949、13950至14014、14015至14290、14291至14403、14404至14686和14687至14850所示的核酸序列。
为了确定本发明的每个参考CAG的丰度,可以使用本领域中公知的任何方法。在一些实施方案中,从粪便样品获得DNA序列并随后将其与CAG序列进行比对。在一些实施方案中,在DNA序列的比对中使用种子延伸策略,并使用在种子序列中无错配的序列来确定每个参考CAG的丰度。在一些实施方案中,种子序列的长度为20bp。
获得DNA序列包括在样品中获得原始序列读出并对原始序列读出进行处理以获得合格的序列读出。在一些实施方案中,原始序列读出通过基于PCR的高通量测序技术来获得。在一些实施方案中,原始序列读出通过Illumina测序来获得。对原始序列读出的处理可以如本领域中已知的进行。在一些情况下,处理包括去除衔接子,在3’端修剪序列直至到达质量阈值高于20的第一个核苷酸,去除短序列,并去除与人基因组匹配的序列。在一些实施方案中,短序列的长度为59bp或更小。
在用于在对象中评估T2D之存在或发生风险的方法中,如果GMM指数或ESP指数接近或低于预定水平,则确定对象患有或有风险发生T2D。
预定水平可以根据实验室数据或临床数据来设置。即使水平是预定的,医院或医生也可以根据对象的年龄、性别、身体状况等对其进行调整。
在本发明的一个优选实施方案中,对于GMM指数,预定水平为约-1.028883。在本发明的一个优选实施方案中,对于ESP指数,预定水平为约4.4。这些特定水平基于已使用在下文实施例中所述的数据产生的接受者操作特征曲线来确定。如上所述,接受者操作特征曲线是对二进制分类器系统随着其鉴别阈值变化的诊断能力进行展示的图形曲线。并且,Youden指数指真阳性率与假阳性率之间的差异。Youden指数通常与接受者操作特征(ROC)分析联合使用。该指数针对ROC曲线的所有点进行限定,并且该指数的最大值可用作用于在诊断测试提供数值结果而不是二分结果时选择最佳截止点的标准。在本发明中,当HbA1c>=6.5%时,二进制数设置为1。相应地,当Youden指数达到最大值时,GMM指数为-1.028883;并且当Youden指数达到最大值时,ESP指数为4.4。即,如果对象被确定具有高于-1.028883的GMM指数,则其可能具有低于6.5%的HbA1c水平,其中准确度为90.48%;如果对象被确定具有低于或等于-1.028883的GMM指数,则其可能具有高于6.5%的HbA1c水平,其中准确度为44.75%。对于ESP指数,如果对象被确定具有高于4.4的ESP指数,则其可能具有低于6.5%的HbA1c水平,其中准确度为92.11%;如果对象被确定具有低于或等于4.4的ESP指数,则其可能具有高于6.5%的HbA1c水平,其中准确度为45.52%。
对于在患有T2D的对象中监测针对疾病治疗或饮食干预的响应的方法,当在疾病治疗或饮食干预期间GMM指数或ESP指数提高或者在一些实施方案中变得接近或高于预定水平时,确定对象对疾病治疗或饮食干预产生积极响 应。如上所述,对于GMM指数,预定水平优选为约-1.028883,或者对于ESP指数,预定水平优选为约4.4,其基于相应ROC曲线和Younden指数来确定。
实施例
患者和方法
GUT2D研究
用于患有2型糖尿病(T2DM)的患者的随机化、开放标签、平行组临床试验由上海交通大学医学院上海总医院伦理委员会批准(No.2014KY086),并且该研究根据赫尔辛基宣言(Declaration of Helsinki)的原则进行。所有参与者均在试验开始时提供了书面知情同意书。该试验在中国临床试验注册中心登记(No.ChiCTR-TRC-14004959)。临床试验的设计和进程示于图1中。
招募的参与者为35至70岁的中国汉族T2DM患者(6.5%≤HbA1c≤12.0%)。主要排除标准包括:1型糖尿病;妊娠;哺乳期;打算在研究过程期间妊娠;严重糖尿病并发症(糖尿病性视网膜病、糖尿病性神经病、糖尿病性肾病和糖尿病足);严重肝病(包括慢性持续性肝炎、肝硬化、或阳性乙型肝炎病毒表面抗原和异常肝转氨酶(丙氨酸转氨酶或天冬氨酸转氨酶的血清浓度>2.5×正常值上限)的共现);在招募前3个月内连续使用抗生素>3天;连续使用减肥药物>1个月;胃肠外科手术(除阑尾炎或疝外科手术之外);在过去6个月内具有严重的精神疾病;接受药物治疗以治疗胆囊炎、消化性溃疡、尿路感染、急性肾盂肾炎、膀胱炎或甲状腺机能亢进;垂体功能障碍;严重的器质性疾病,包括癌症、冠心病、心肌梗死或脑卒中;感染性疾病,包括肺结核和AIDS;以及酒精中毒。
在2周的导入期期间,终止除胰岛素促分泌剂或甘精胰岛素之外的所有抗糖尿病药物以避免这些药物对肠微生物群的潜在作用。在干预之前(第0天),所有参与者都接受有关T2DM的健康教育和基线评价。使用基于饮食的食物频率问卷和24小时饮食记录,基于中国食物成分表2009(17)来计算基线营养物摄入量。将参与者随机分配为接受阿卡波糖加针对T2DM的常规护理(U组),或阿卡波糖加基于全谷物、中药食品或益生元(prebiotics)的饮食配方(WTP饮食)(W组),进行84天。
常规护理由根据中国T2DM糖尿病防治指南(Chinese diabetes guidelines for T2DM)(2013年版)的标准饮食和运动建议组成。WTP饮食包括三种即 食型预制食物:配方No.1(2)、配方No.2(2)和配方No.8(由完美(中国)有限公司(中国中山)制造)。对于W组,根据营养学家的建议,将WTP饮食与适量的蔬菜、水果和坚果联合施用。根据由中国居民膳食营养素参考摄入量(Dietary Reference Intake,DRI)提供并由中国营养协会(Chinese Nutrition Society)(CNS,2013)建议的根据年龄的标准营养要求平衡常量营养物的输入量。配方No.1是来自全谷物的12种组分材料和富含膳食纤维的中药(traditional Chinese medicine,TCM)食用植物(包括薏苡(薏米(Coix lachrymal-jobi L.))、燕麦、荞麦、白豆(white bean)、黄玉米(yellow corn)、红豆、大豆、薯蓣、花生、莲子和枸杞)的预烹制混合物,其以罐装粥状物的形式制备(每罐370g湿重)。各自包含100g成分(59g碳水化合物、15g蛋白质、5g脂肪和6g纤维)和336千卡(70%碳水化合物、17%蛋白质、13%脂肪)。配方No.2是包含苦瓜(苦瓜(Momordica charantia))和寡糖(包括果糖-寡糖和寡聚异麦芽糖)的输注用粉末制剂(每袋20g)。配方No.8的详细组成示于下表1中。对于每份膳食,作为主食食用≥360g的配方No.1,并分别以10g和15g食用配方No.2和No.8。使用每位对象的饮食记录基于中国食物成分表2009 39来计算营养物摄入量(表2)。阿卡波糖使用100mg的口服剂量施用,一天三次。参与者针对饮食、体重、药物使用和不良事件记录其治疗方案。此外,记录自监测每日空腹血糖(FBG)和餐后2小时血糖(2-hour postprandial blood glucose,2h PBG),并根据症状改善和每日两点血糖谱来调整背景治疗(胰岛素促分泌剂和甘精胰岛素)的剂量(表3)。
表1.WTP饮食中使用的即食型配方No.8的组分
Figure PCTCN2019074162-appb-000014
Figure PCTCN2019074162-appb-000015
a即食型干粉。
表2.在饮食干预之前和期间的每日能量和常量营养物摄入量 a
Figure PCTCN2019074162-appb-000016
Figure PCTCN2019074162-appb-000017
a数据为平均值±sem。相对于W第0天***P<0.001;相对于U第84天###P<0.001。使用具有Bonferroni事后检验的双向重复测量方差分析来进行组内和组间比较。
表3.抗糖尿病药物使用 a
Figure PCTCN2019074162-appb-000018
Figure PCTCN2019074162-appb-000019
a干预开始于上述定期用药的2周洗脱期之后。第-14天表示洗脱期的开始。
在基线和在干预期间每28天获得生物样品、人体测量数据和临床实验室分析。在过夜禁食10小时后收集静脉血样品,然后对参与者进行3小时的口服葡萄糖耐量测试。所有参与者摄入75g葡萄糖,并在30、60、120和180分钟时获得血液样品。将血液样品在室温下静置30分钟后以3,000×g离心20分钟以获得血清。在同一天收集粪便和晨尿。收集血清、尿和粪便样品,立即转移至干冰,并储存在-80℃,在5小时内用于另外的分析。
在上海交通大学医学院上海总医院(中国上海)确定生物临床参数。
QIDONG研究
在启东人民医院(Qidong People’s Hospital)(中国江苏)进行的这项临床试验检测高膳食纤维饮食在健康个体以及患有前驱糖尿病和临床上诊断为T2DM的那些的组群中在自由生活条件下的作用(QIDONG;中国临床试验注册中心:ChiCTR-IPC-14005346)。T2DM亚组的基线表型特征与GUT2D中的那些大致类似。将患有T2DM的参与者随机分为接受WTP饮食(无阿卡波糖;n=71)或常规护理(n=33),进行84天。在基线和在干预结束时收集血液和粪便样品,其中分别确定HbA1c和肠微生物谱。
统计学分析
使用SPSS Statistics 17.0软件包(SPSS Inc.,Chicago,USA)进行统计学分析。使用具有Tukey事后检验(双尾)的双向重复测量方差分析分别进行生物 临床参数和炎症相关标志物的组内和组间比较。使用Pearson卡方检验(双尾的)来分析两组中HbA1c为低于7.0%或6.5%的参与者的性别和比例的变化。使用Mann-Whitney U检验(双尾的)来分析两组在基线时其他特征的变化。
肠微生物群移植
在第0天和第84天从两位女性参与者(来自W组的2W009和来自U组的2U004)收集粪便样品。这两个供体是系统性选择的-在所有参与者中确定干预后肠微生物谱变化,排除无显著变化的那些,然后随机选择来自每个组的一位参与者作为代表性供体。将每个粪便样品(0.5g)在厌氧室(80%N 2:10%CO 2:10%H 2)中在25mL无菌林格工作缓冲剂(9g/L氯化钠、0.4g/L氯化钾、0.25g/L二水氯化钙和0.05%(w/v)L-半胱氨酸盐酸盐)中稀释。使粪便材料通过彻底涡旋(5分钟)悬浮,并通过重力沉降5分钟。将澄清的上清液转移到干净的管中,并添加等体积的20%(w/v)脱脂乳(LP0031,Oxoid,UK)。接种物在实验当天新鲜制备,将剩余部分储存在-80℃直至第二次接种。
所有的动物实验操作均由中国科学院动物研究所动物管理和使用委员会机构(Institute of Zoology Institutional Animal Care and Use Committee of the Chinese Academy of Sciences)批准,并根据委员会的指南进行。将断奶的无菌雌性C57BL/6J小鼠(n=30)在定期12小时光循环(在06:00开启光)下维持在柔性膜塑料隔离器中。在移植前收集粪便、食物、水和衬垫的样品。在充分混合下,将生理盐水添加到样品中。然后,使用平板涂布法在以下条件培养混合物:1)对于好氧细菌,在37℃下在好氧条件下在LB琼脂、脑心浸液琼脂和巯基乙酸盐琼脂上培养;2)对于厌氧细菌,在37℃下在厌氧条件下在岐阜厌氧培养基(Gifu anaerobic medium,GAM)上培养;以及3)对于真菌,在25℃至28℃下在好氧条件下在经改良的马丁氏琼脂和大豆胰蛋白胨琼脂上培养。在1、2、4、7和14天后在光学显微镜下检查所有培养物。
向小鼠随意喂食无菌常规饲料(SLAC,中国上海)。通过对粪便、食物和衬垫进行定期细菌学检查来对细菌污染进行监测。在6周龄时,将无菌小鼠圈养在单独的笼中,并随机分为4组(每组保持在单独的隔离器中)。在适应2周后,向4组小鼠经口管饲100μL的以下粪便悬浮接种物之一:第0天的2W009(W-Pre;n=10);第84天的2W009(W-Post;n=10);第0天的2U004(U-Pre;n=5);和第84天的2U004(U-Post;n=5)。次日重复接种以增强微生物群移植。在第14天,在过夜禁食8小时后,对所有小鼠进行2小时的口 服葡萄糖耐量测试(OGTT)。在口服管饲D-葡萄糖(2g/kg体重)后,在0、15、30、60、90和120分钟时从尾静脉收集血液样品,其中使用血糖仪(
Figure PCTCN2019074162-appb-000020
Performa)确定葡萄糖水平。
肠微生物群分析
1.宏基因组测序:如先前所述(2)从粪便样品提取DNA,并使用GENEWIZ Co.(中国北京)的Illumina HiSeq 3000进行测序。根据服务提供商指定的工作流程进行聚类生成、模板杂交、等温扩增、线性化、以及测序引物的封闭变性和杂交。构建插入物大小为约500bp的文库,之后进行高通量测序以获得在正向和反向方向具有150bp的双端读出。
2.数据质量控制:使用Prinseq(3)来进行:1)从3’端修剪读出直至到达质量阈值为20的第一个核苷酸;2)当读出为<60bp或含有“N”碱基时,去除读出对;和3)对读出去重复。去除可以与人基因组(智人(H.sapiens),UCSC hg 19)匹配的读出(使用--reorder--no-hd--no-contain-dovetail以Bowtie2(4)进行比对(种子序列的长度设置为20bp))。
3.从头非冗余宏基因组基因目录构建和基因丰度谱计算:将来自每个样品的高质量双端读出用于用IDBA_UD(5)从头组装成至少500bp的重叠群。使用MetaGeneMark(6)预测基因。使用参数“-c 0.95-aS 0.9”用CH-HIT构建4,893,833个微生物基因的非冗余基因目录。使用SOAPaligner(7)将高质量读出映射到基因目录上。将匹配的结果进行取样并缩减到每个样品3100万。使用soap.coverage.script来在每个缩减步骤中计算基因长度归一化碱基计数。重复取样操作30次,并将丰度的平均值用于进一步的分析。
4.共丰度基因集(CAG):使用基于Canopy的聚类算法(8)用默认参数在所有样品中基于所有基因的丰度来对其进行分箱。在后续分析中去除原始CAG:1)与canopy谱的Spearman相关性<0.7的基因;2)总canopy谱的90%分布在不超过3个样品中;3)具有少于三个基因的CAG。将具有>700个基因的大CAG当作用于进一步分析的细菌CAG。用QIIME(9)基于Bray-Curtis距离和Procrustes对细菌CAG进行主成分分析。
5.细菌CAG的组装和分类学分配:如先前所述(2)对180个流行细菌CAG中每一个进行从头组装。简言之,如下实现CAG和样品特异性读出:将所有高质量读出与CAG特异性重叠群进行比对,然后用Velvet(10)进行从 头组装。我们采用来自人类微生物组计划(Human Microbiome Project,HMP)(http://www.hmpdacc.org/reference_genomes/finishing.php)的用于高质量基因组草图组装的六个标准和checkM(11)来评估组装体的质量:1)基因组组装体的90%必须包括在重叠群(>500bp)中;2)90%的组装碱基必须处于>5x读出覆盖;3)重叠群N50必须为>5kb;4)支架N50必须为>20kb;5)平均重叠群长度必须为>5kb;以及6)>90%的核心基因必须存在于组装体中。我们使用两种方法来鉴定CAG的系统发生分类学,其高质量基因组草图满足至少5个HMP标准。首先,使用CVtree3.0网络服务器(12)用具有高质量组装体的154个细菌CAG、来自HMP DACC数据库的352个参考胃肠基因组和服务器内置数据库构建系统发生树,所述服务器应用组分矢量来进行系统发生分析。然后,我们还应用SpecI(13)来对细菌CAG进行描绘,SpecI是基于40个通用的单拷贝系统发生标志物基因将生物分组为物种聚类的方法。在蛋白质(BLASTP)和核苷酸(BLASTN)水平二者上,将低质量的CAG与来自NCBI数据库的7,991个参考基因组进行比对。用查询覆盖率(>70%)和E值(在核苷酸水平,<1e-10;在蛋白质水平,<1e-5)将该比对结果过滤。基于先前描述的分类学分配阈值(14),将CAG分配到物种或属级(物种级:90%的基因可以映射到在DNA水平上具有>95%同一性的物种基因组;属级:80%的基因可以映射到在DNA和蛋白质水平二者上具有>85%同一性的属)。
6.GMM指数和ESP指数计算
使用Bowtie2用参数--reorder--no-hd--no-contain-dovetail将来自GUT2D和/或QIDONG数据集的每个样品的高质量读出与64个高质量基因组草图进行比对(种子序列的长度设置为20bp)。过滤与YT:Z:DP(指示读出为一对的一部分并且该对不一致地比对)的比对结果。
Figure PCTCN2019074162-appb-000021
Figure PCTCN2019074162-appb-000022
其中A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目)。
Figure PCTCN2019074162-appb-000023
其中Heip=(e H-1)/14,
Figure PCTCN2019074162-appb-000024
A i(CAG No.:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目)。
7.统计学分析:根据Benjamini&Hochberg(18),使用Wilcoxon匹配对符号秩检验(双尾的)并进行调整来鉴定干预响应性细菌CAG。用“mafdr”指令在
Figure PCTCN2019074162-appb-000025
程序中进行P值调整。用R包“randomForest”进行随机森林分析,并用“rfcv”进行交叉验证。
8.数据可用性
所有样品的原始焦磷酸测序和Illumina读出数据都已经以登录号PRJEB1455(GUT2D研究)和PRJEB15179(QIDONG研究)归入欧洲核苷酸档案库(European Nucleotide Archive,ENA)中。
实施例1.高纤维干预在患有T2DM的患者中显著改善生物临床参数
在干预的第一个月期间,几乎所有的生物临床参数在W组和U组二者中都改善。糖化血红蛋白(HbA1c)水平(当前临床试验中的主要结果)在两个组中都随时间从基线水平显著降低(图2A)。截止第84天,HbA1c的降低在W组中大于U组中。在干预结束时(第84天),充分血糖控制率(组群中HbA1c<7%的比例)在W组中显著高于U组中(88.9%相对于50.5%,P=0.005)。更加严格的目标实现率(组群中HbA1c<6.5%的比例)显示类似(尽管不显著的)趋势(51.9%相对于25.0%,P=0.084)。与U组相比,W组中的患者还减轻显著更大百分比的体重,并且表明脂质谱和炎症水平显著改善。可以刺激胰岛素分泌并抑制胰高血糖素分泌的胰高血糖素样肽-1(GLP-1)和肽YY(PYY)的水平在W组中随时间显著提高,而在U组中则没有。
实施例2.高纤维干预在患有T2DM的患者中调节肠微生物群的整体结构
对在4个时间点(第0、28、56和84天)收集的172个粪便样品进行鸟枪宏基因组测序。从4,893,833个微生物基因的非冗余基因目录中,鉴定422个共丰度基因集(CAG;使用基于Canopy的算法(19)分箱)为不同的细菌基因组。基于来自422个细菌CAG的Bray-Curtis距离,肠微生物群的整体结构(如主坐标分析所示的)在两个组中显示第0天至第28天显著改变,之后不再进一步改变(图2B)。在干预结束时(第84天),W组与U组之间在肠微生物结构方面的显著差异(P=0.0056)反映了WTP饮食对肠微生物群的独特调节作用。在两组中基因丰富度(每个样品鉴定的基因的数目)明显降低,在这之后是如整体微生物结构中的类似趋势,即:在第28天显著降低,并且在干预的剩余时间保持稳定(图2C)。基因丰富度的这一整体下降对以下目前观点提出挑战:较高的多样性意味着较佳的健康(20)。然而,与U组相比,第28天的基因丰富度在W组中显著更高,并且在第56天和第84天观察到类似趋势(图2C),这与W组中的代谢结果更佳一致。此外,将所有生物临床 变量组合并且具有422个细菌CAG的Procrustes分析显示,肠微生物群的结构变化与干预期间临床结果的改善相关(P<0.0001,来自999Monte-Carlo模拟)。总之,表明,WTP饮食在T2DM患者中诱导肠微生物群的整体结构显著变化并且这些与整体临床结果改善相关。
实施例3.移植表明肠微生物群对减轻T2DM的因果性贡献
为了建立饮食改变的肠微生物群与葡萄糖代谢改善之间的因果关系,将来自W组和U组中参与者的干预前和干预后(分别为第0天和第84天)肠微生物群移植到无菌C57BL/6J小鼠中。在移植14天后,接受来自W组的干预后微生物群的小鼠具有显著更低的体重(图3A)。当与移植有来自W组的干预前微生物群或者来自U组的任一时间点的微生物群的那些相比时,这些小鼠还具有最低的空腹血糖和餐后血糖,显示与空腹胰岛素水平相关的效应(图3B至3D)。我们的干预通过微生物移植的转移效应确定,高膳食纤维引起的肠微生物群变化因果地有助于改善T2DM患者中的葡萄糖体内平衡。
实施例4.特定菌株对纤维摄取作出响应
组装高质量基因组草图以鉴定驱动膳食纤维对减轻T2DM表型的肠特异性作用的细菌物种/菌株。由>20%样品所共有的CAG组装154个高质量基因组草图。映射到这些高质量基因组草图的每个样品的总读出百分比为57%(±11%),这代表整个组群中的流行和主导肠细菌二者。154个高质量基因组草图中的141个具有至少一种用于SCFA产生的关键基因,并且可以视为SCFA产生者。在154个高质量基因组草图中,选择64种细菌用于进一步分析,因为:1)其是通过Wilcoxon匹配对符号秩检验鉴定的干预响应性CAG,如W或U组中在第28天通过干预显著改变的(图4);以及2)其具有用于SCFA、H 2S或吲哚生物合成的基因中的至少一种。W组中升高的所有15个基因组具有用于SCFA生物合成的基因和用于乙酸产生的基因中的至少一种(包括也在U组中富集的3个),并且其中5个还具有丁酸生物合成的能力(图5B和5D)。这与在两组中粪便乙酸在较大程度上类似的提高以及乙酸合成途径的富集一致,但是WTP饮食对诱导丁酸产生的作用不同。这15个基因组的富集大部分在第28天达到峰值(图5E),这也符合我们在整体肠微生物群中观察到的模式,这进一步支持这些细菌菌株为生态系统中结构变化的关键驱动者。
这15种细菌(包括双歧杆菌属(Biffdobacterium spp.)、乳杆菌属(Lactobacillus spp.)、真杆菌属(Eubacterium spp.)和普氏粪杆菌 (Faecalibacterium prausnitzii))在W组可以服务于补充乙酸和丁酸的重要目的,并且因此可以是该必需功能的生态系统服务提供者(ESP)。来自碳水化合物的高效能量产生和针对低pH的耐受性可以解释这些细菌为什么相对于其他SCFA产生者具有竞争优势。在此,一个良好实例是双歧杆菌属,与其他乙酸产生者相比,其利用其“双歧”途径(21)而能够产生更多的ATP分子和乙酸。有趣的是,尽管SCFA产生的总体遗传能力提高,但是我们的干预显著减少大多数SCFA产生者(图5A和5C),这清楚地表明,并不是所有具有功能基因的细菌都可以对底物补充产生响应并且变成宿主所需功能的提供者。我们设想,这至少部分地由肠腔pH变化驱动,因为已知一些SCFA产生者是高度pH敏感的,例如多形拟杆菌(Bacteroides thetaiotaomicron)和普通拟杆菌(B.vulgatus)(12)。因此,我们的数据对微生物组领域的以下共识提出挑战,所述共识主要根据基于基因的功能预测来假定肠细菌与宿主的生理相关性。
在两组中任一组中显著下调的49种细菌是具有用于合成脂多糖、吲哚和H 2S的基因的那些。而且,根据基因中心途径分析,这表明产生代谢不利化合物的能力降低可能有助于高膳食纤维饮食的有益作用。已显示内毒素产生降低减轻炎症并恢复胰岛素敏感性(22,23)。脂多糖结合蛋白(内毒素载量的替代标志物)和炎性标志物在W组中低于U组中,表明炎症的减轻可能是由于内毒素产生降低(图6)。吲哚和H 2S产生细菌的丰度减小改善了对GLP-1产生的抑制(24-26),这与在W组中观察到的较大餐后GLP-1响应一致。总之,显示减少产生不利代谢物的细菌实现宿主的临床显著改善。
上述15个ESP:CAG0023、CAG0033、CAG0037、CAG0045、CAG0046、CAG0064、CAG0079、CAG0106、CAG0133、CAG0153、CAG0155、CAG0207、CAG0224、CAG0236和CAG0409在本发明中分别表示为CAG NO.:1至15。显著下调的49种细菌CAG0010、CAG0012、CAG0015、CAG0017、CAG0018、CAG0021、CAG0022、CAG0028、CAG0031、CAG0032、CAG0034、CAG0035、CAG0048、CAG0051、CAG0057、CAG0058、CAG0063、CAG0067、CAG0075、CAG0076、CAG0080、CAG0082、CAG0086、CAG0090、CAG0093、CAG0100、CAG0111、CAG0116、CAG0122、CAG0128、CAG0131、CAG0134、CAG0138、CAG0173、CAG0178、CAG0185、CAG0202、CAG0221、CAG0246、CAG0248、CAG0255、CAG0264、CAG0281、CAG0292、CAG0312、CAG0331、CAG0341、CAG0365和CAG0390在本发明中分别表示为CAG NO.:16至64。
每个样品的肠微生物群调节(GMM)指数基于15个ESP以及在干预后降低的49个ESP的丰度数据来计算。
Figure PCTCN2019074162-appb-000026
其中A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目)。该GMM指数在所所有患者中与干预后HbA1c水平显著负相关(Spearman相关系数(SCC)=-0.4901,P=1.0253e -11),表明由MAC提高促成的微生物群中有贡献细菌的组成改变与主要临床结果相关(图7C)。
ESP(生态系统服务提供者)指数仅基于在干预后提高的15个ESP的丰度数据来计算。
Figure PCTCN2019074162-appb-000027
其中Heip=(e H-1)/14,
Figure PCTCN2019074162-appb-000028
A i(CAG No.:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目)。ESP指数在W组和U组二者中都遵循类似的轨迹,即从基线急剧增加至第28天并在干预的剩余时间保持在类似水平,但是该指数在每个干预后时间点在W组中显著更高(第28、56和84天,图8B)。在基线时和在干预结束时HbA1c与ESP指数之间的显著负相关(第0天和第84天;r=-0.6731;P=5.55e -07;图8C)确定这些ESP在调节宿主葡萄糖体内平衡中的作用。虽然临床结果(例如HbA1c)随着干预的持续时间持续降低(图2A),ESP指数从第28天开始达到平稳(图8B)。我们的数据清楚地表明膳食纤维诱导的ESP富集发生在临床结果的显著变化之前。当将第28天(代替第84天)的ESP指数与第84天的HbA1c一起用于绘制干预后数据点同时保持与图8C中完全相同的基线数据点组,在HbA1c与ESP指数之间观察到类似的负相关性(r=-7434;P=7.48e -08;图8D)。这表明,第28天的ESP指数(指示15个ESP在该早期时间点的富集)可对更晚发生的最终治疗结果具有信息性。
实施例5.生态系统服务提供者为不同的T2DM患者组群所共有
最后,为了发现其他T2DM患者组群是否共有在GUT2D试验中鉴定到的生态系统服务提供者,进行了另一项独立临床试验(QIDONG),在该试验中,使74位T2DM患者在无阿卡波糖下接受WTP饮食3个月。在干预后,HbA1c水平从基线显著改善。对于所有患者,在基线和每个月末时收集粪便样品。以14.1G的平均深度对148个样品进行宏基因组测序。超过一半的测序读出被映射到在GUT2DM项目中组装的154个高质量基因组草图,显示对应的流行肠细菌是中国T2DM患者的不同组群共有的。在QIDONG试验的患者中存在GUT2D中鉴定的15个ESP和由于这些ESP升高而共排除的49种 细菌。值得注意的是,使用第二项试验(不具有阿卡波糖)来提供测试数据集,基于15个ESP及其共排除细菌的GMM指数与主要结果(HbA1c水平)具有类似的显著负相关性(图7D)。
此外,使用在GUT2D中被鉴定为对膳食纤维具有积极响应性的15个SCFA提供者的相同组,在该QiDong干预组中在ESP指数与HbA1c之间存在类似的负相关性(图8E)。
根据来自在GUT2D研究中收集的172个粪便样品和在QIDONG研究中收集的148个样品的GMM指数建立接受者操作特征曲线(ROC),其留一交叉验证ROC下面积(AUC)达到0.7052,其中当HbA1c>=6.5%时,二进制数设置为1,并且特异度和灵敏度分别为90.48%和44.75%。当Youden指数达到最大值时,GMM指数为-1.02888。
根据来自在GUT2D研究中收集的172个粪便样品的ESP指数建立接受者操作特征曲线(ROC),其留一交叉验证ROC下面积(AUC)达到0.70,其中当HbA1c>=0.65%时,二进制数设置为1,并且特异度和灵敏度分别为92.11%和45.52%。当Youden指数达到最大值时,ESP指数为4.4。
参考文献
1.E.K. Costello, K. Stagaman, L. Dethlefsen, B.J. Bohannan, D.A. Relman, The application of ecological theory toward an understanding of the human microbiome. Science 336, 1255-1262 (2012).
2.A. Koh, F. De Vadder, P. Kovatcheva-Datchary, F. Backhed, From Dietary Fiber to Host Physiology: Short-Chain Fatty Acids as Key Bacterial Metabolites. Cell 165, 1332-1345 (2016).
3.J. Qin et al., A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature 490, 55-60 (2012).
4.F.H. Karlsson et al., Gut metagenome in European women with normal, impaired and diabetic glucose control. Nature 498, 99-103 (2013) .
5.K. Forslund et al., Disentangling type 2 diabetes and metformin treatment signatures in the human gut microbiota. Nature 528, 262-266 (2015) .
6.N. Larsen et al., Gut microbiota in human adults with type 2 diabetes differs from non-diabetic adults. PloS one 5, e9085 (2010) .
7.A. Soare et al., The effect of the macrobiotic Ma-Pi 2 diet vs. the recommended diet in the management of type 2 diabetes: the randomized controlled MADIAB trial. Nutrition&metabolism 11, 39 (2014) .
8.M. Chandalia et al., Beneficial effects of high dietary fiber intake in patients with type 2 diabetes mellitus. The New England journal of medicine 342, 1392-1398 (2000) .
9.F.M. Silva, C.K. Kramer, D. Crispim, M.J. Azevedo, A high-glycemic index, low-fiber breakfast affects the postprandial plasma glucose, insulin, and ghrelin responses of patients with type 2 diabetes in a randomized clinical trial. The Journal of nutrition 145, 736-741 (2015) .
10. T. Chen et al., Fiber-utilizing capacity varies in Prevotella-versus Bacteroides-dominated gut microbiota. Sci Rep 7, 2594 (2017) .
11. H.J. Flint, S.H. Duncan, K.P. Scott, P. Louis, Links between diet, gut microbiota composition and gut metabolism. Proc Nutr Soc 74, 13-22 (2015) .
12. S.H. Duncan, P. Louis, J.M. Thomson, H.J. Flint, The role of pH in determining the species composition of the human colonic microbiota. Environmental microbiology 11, 2112-2122 (2009) .
13. H.J. Flint, K.P. Scott, P. Louis, S.H. Duncan, The role of the gut microbiota in nutrition and health. Nature reviews. Gastroenterology&hepatology 9, 577-589 (2012) .
14. G. Wu et al., Genomic Microdiversity of Bifidobacterium pseudocatenulatum Underlying Differential Strain-Level Responses to Dietary Carbohydrate Intervention. mBio 8, (2017) .
15. C. Zhang et al., Dietary Modulation of Gut Microbiota Contributes to Alleviation of Both Genetic and Simple Obesity in Children. EBioMedicine 2, 966-982 (2015) .
16. J.M. Bland, D. G. Altman, Calculating correlation coefficients with repeated  observations: Part 1--Correlation within subjects. BMJ 310, 446 (1995) .
17 Yuexin Yang, G.W., Xingchang Pang. China Food Composition (Book 1.2nd Edition) . (Beijing Medical University Press, 2009) .
18. P.D. Cani et al., Gut microbiota fermentation of prebiotics increases satietogenic and incretin gut peptide production with consequences for appetite sensation and glucose response after a meal. The American journal of clinical nutrition 90, 1236-1243 (2009) .
19. H.B. Nielsen et al., Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes. Nature biotechnology 32, 822-828 (2014) .
20. E. Le Chatelier et al., Richness of human gut microbiome correlates with metabolic markers. Nature 500, 541-546 (2013) .
21. K. Pokusaeva, G.F. Fitzgerald, D. van Sinderen, Carbohydrate metabolism in Bifidobacteria. Genes Nutr 6, 285-306 (2011) .
22. L. Sun et al., A marker of endotoxemia is associated with obesity and related metabolic disorders in apparently healthy Chinese. Diabetes care 33, 1925-1932 (2010) .
23. P.D. Cani et al., Metabolic endotoxemia initiates obesity and insulin resistance. Diabetes 56, 1761-1772 (2007) .
24. M.T. Yokoyama, J.R. Carlson, Microbial metabolites of tryptophan in the intestinal tract with special reference to skatole. The American journal of clinical nutrition 32, 173-178 (1979) .
25. C. Chimerel et al., Bacterial metabolite indole modulates incretin secretion from intestinal enteroendocrine L cells. Cell reports 9, 1202-1208 (2014) .
26. V. Bala et al. , Release of GLP-1 and PYY in response to the activation of G protein-coupled bile acid receptor TGR5 is mediated by Epac/PLC-epsilon pathway and modulated by endogenous H2S. Frontiers in physiology 5, 420 (2014) .

Claims (21)

  1. 在患有2型糖尿病的对象中评价饮食干预或疾病治疗之效力的方法,其包括以下步骤:
    a)在所述饮食干预或疾病治疗之前和期间从所述对象收集粪便样品;
    b)分析从所述粪便样品提取的DNA以确定选自CAG ID No.:1至64的每个参考CAG的丰度:
    A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目);
    d)使用所计算的丰度数据来计算每个样品的GMM指数:
    Figure PCTCN2019074162-appb-100001
    以及
    e)如果在所述饮食干预或疾病治疗期间收集的样品中所述GMM指数提高,则确定所述对象对所述饮食干预或疾病治疗产生积极响应,
    其中CAG NO.:1至15分别包含SEQ ID NO.:1至191、192至326、327至593、594至835、836至885、886至960、961至1097、1098至1264、1265至1433、1434至1684、1685至1833、1834至1979、1980至2163、2164至2447和2448至2783所示的核酸序列,并且CAG NO.:16至64分别包含SEQ ID NO.:2784至2961、2962至3130、3131至3525、3526至3747、3748至3863、3864至4068、4069至4212、4213至4393、4394至4532、4533至4891、4892至4979、4980至5116、5117至5320、5321至5464、5465至5781、5782至6279、6280至6646、6647至6954、6955至7178、7179至7613、7614至7758、7759至8046、8047至8491、8492至8546、8547至9971、9972至10099、10100至10392、10393至10502、10503至10694、10695至10986、10987至11089、11090至11262、11263至11466、11467至11704、11705至12034、12035至12113、12114至12341、12342至12454、12455至12664、12665至12825、12826至13042、13403至13500、13501至13726、13727至13949、13950至14014、14015至14290、14291至14403、14404至14686和14687至14850所示的核酸序列。
  2. 在对象中评估2型糖尿病之存在或发生风险的方法,其包括以下 步骤:
    a)从所述对象收集粪便样品;
    b)分析从所述粪便样品提取的DNA以确定选自CAG ID No.:1至64的每个参考CAG的丰度:
    A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目);
    c)使用所计算的丰度数据来计算每个样品的GMM指数:
    Figure PCTCN2019074162-appb-100002
    以及
    d)如果GDI接近或低于预定水平,则确定所述对象患有或有风险发生2型糖尿病,
    其中CAG NO.:1至15分别包含SEQ ID NO.:1至191、192至326、327至593、594至835、836至885、886至960、961至1097、1098至1264、1265至1433、1434至1684、1685至1833、1834至1979、1980至2163、2164至2447和2448至2783所示的核酸序列,并且CAG NO.:16至64分别包含SEQ ID NO.:2784至2961、2962至3130、3131至3525、3526至3747、3748至3863、3864至4068、4069至4212、4213至4393、4394至4532、4533至4891、4892至4979、4980至5116、5117至5320、5321至5464、5465至5781、5782至6279、6280至6646、6647至6954、6955至7178、7179至7613、7614至7758、7759至8046、8047至8491、8492至8546、8547至9971、9972至10099、10100至10392、10393至10502、10503至10694、10695至10986、10987至11089、11090至11262、11263至11466、11467至11704、11705至12034、12035至12113、12114至12341、12342至12454、12455至12664、12665至12825、12826至13042、13403至13500、13501至13726、13727至13949、13950至14014、14015至14290、14291至14403、14404至14686和14687至14850所示的核酸序列。
  3. 在患有2型糖尿病的对象中评价饮食干预或疾病治疗之效力的方法,其包括以下步骤:
    a)在所述饮食干预或疾病治疗之前和期间从所述对象收集粪便样 品;
    b)分析从所述粪便样品提取的DNA以确定选自CAG ID No.:1至15的每个参考CAG的丰度:
    A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目);
    c)使用所计算的丰度数据来计算每个样品的ESP指数:
    Figure PCTCN2019074162-appb-100003
    其中Heip=(e H-1)/14,
    Figure PCTCN2019074162-appb-100004
    以及
    e)如果在所述饮食干预或疾病治疗期间收集的样品中所述ESP指数提高,则确定所述对象对所述饮食干预或疾病治疗产生积极响应,
    其中CAG NO.:1至15分别包含SEQ ID NO.:1至191、192至326、327至593、594至835、836至885、886至960、961至1097、1098至1264、1265至1433、1434至1684、1685至1833、1834至1979、1980至2163、2164至2447和2448至2783所示的核酸序列。
  4. 在对象中评估2型糖尿病之存在或发生风险的方法,其包括以下步骤:
    a)从所述对象收集粪便样品;
    b)分析从所述粪便样品提取的DNA以确定选自CAG ID No.:1至15的每个参考CAG的丰度:
    A i(CAG No:i的丰度)=与CAG No.:i匹配的读出的数目/(CAG No.:i的大小×总读出的数目);
    c)使用所计算的丰度数据来计算每个样品的ESP指数:
    Figure PCTCN2019074162-appb-100005
    其中Heip=(e H-1)/14,
    Figure PCTCN2019074162-appb-100006
    以及
    d)如果所述ESP指数接近或低于预定水平,则确定所述对象患有或有风险发生2型糖尿病,
    其中CAG NO.:1至15分别包含SEQ ID NO.:1至191、192至326、327至593、594至835、836至885、886至960、961至1097、1098至 1264、1265至1433、1434至1684、1685至1833、1834至1979、1980至2163、2164至2447和2448至2783所示的核酸序列。
  5. 根据权利要求1或2所述的方法,其中步骤b)中的DNA分析包括以下步骤:获得DNA序列并将所获得的DNA序列与SEQ ID No.:1至14850所示的核酸序列进行比对。
  6. 根据权利要求3或4所述的方法,其中步骤b)中的DNA分析包括以下步骤:获得DNA序列并将所获得的DNA序列与SEQ ID No.:1至2783所示的核酸序列进行比对。
  7. 根据权利要求5所述的方法,其中获得DNA序列包括以下步骤:在所述样品中获得原始序列读出并对所述原始序列读出进行处理以获得合格的序列读出。
  8. 根据权利要求6所述的方法,其中获得DNA序列包括以下步骤:在所述样品中获得原始序列读出并对所述原始序列读出进行处理以获得合格的序列读出。
  9. 根据权利要求7或8所述的方法,其中所述原始序列读出是通过基于PCR的高通量测序技术获得的。
  10. 根据权利要求7或8所述的方法,其中所述对所述原始序列读出进行处理包括:去除衔接子,在3’端修剪序列直至到达质量阈值高于20的第一个核苷酸,去除短序列,以及去除与人基因组匹配的序列。
  11. 根据权利要求5或6所述的方法,其中DNA序列的所述比对使用种子延伸策略。
  12. 根据权利要求11所述的方法,其中使用在种子序列中无错配的序列来在步骤b)中确定所述每个参考CAG的丰度。
  13. 根据权利要求11所述的方法,其中所述种子序列的长度为4至31bp。
  14. 根据权利要求13所述的方法,其中所述种子的长度为20bp。
  15. 根据权利要求1或3所述的方法,其中在所述饮食干预或疾病治疗期间,在所述饮食干预或疾病治疗开始后1周、2周、3周和/或4周收集所述粪便样品。
  16. 根据权利要求1所述的方法,其中当在所述饮食干预或疾病治疗期间所述GMM指数变得接近或高于预定水平时,确定所述对象对所述饮食干预或疾病治疗产生积极响应。
  17. 根据权利要求16所述的方法,所述预定水平为-1.028883。
  18. 根据权利要求2所述的方法,其中所述预定水平为约-1.028883。
  19. 根据权利要求3所述的方法,其中当在所述饮食干预和疾病治疗期间所述ESP指数变得接近或高于预定水平时,确定所述对象对所述饮食干预或疾病治疗产生积极响应。
  20. 根据权利要求19所述的方法,所述预定水平为4.4。
  21. 根据权利要求4所述的方法,其中所述预定水平为约4.4。
PCT/CN2019/074162 2018-01-31 2019-01-31 用于2型糖尿病的生物标志物及其用途 WO2019149247A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810143729.2A CN110527717B (zh) 2018-01-31 2018-01-31 用于2型糖尿病的生物标志物及其用途
CN201810143729.2 2018-01-31

Publications (1)

Publication Number Publication Date
WO2019149247A1 true WO2019149247A1 (zh) 2019-08-08

Family

ID=67478599

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/074162 WO2019149247A1 (zh) 2018-01-31 2019-01-31 用于2型糖尿病的生物标志物及其用途

Country Status (4)

Country Link
US (1) US20200377945A1 (zh)
CN (1) CN110527717B (zh)
TW (1) TWI707043B (zh)
WO (1) WO2019149247A1 (zh)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021148661A1 (en) * 2020-01-23 2021-07-29 Enterome Compositions for treating epithelial barrier function disorders
EP4013411A4 (en) * 2019-08-13 2023-08-16 Tata Consultancy Services Limited PREDIABETES RISK ASSESSMENT SYSTEM AND METHODOLOGY
WO2023223092A1 (en) * 2022-05-18 2023-11-23 Institut Pasteur Identification of a human circovirus

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023014598A2 (en) * 2021-08-03 2023-02-09 Inflammatix, Inc. Isothermal amplification-based diagnosis and treatment of acute infection
KR20240053585A (ko) * 2021-09-08 2024-04-24 메타지노미, 인크. 카고 뉴클레오티드 서열을 전이시키기 위한 시스템 및 방법
WO2023212563A1 (en) * 2022-04-25 2023-11-02 Rutgers, The State University Of New Jersey Two competing guilds as core microbiome signature for human diseases

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009064901A2 (en) * 2007-11-13 2009-05-22 Veridex, Llc Diagnostic biomarkers of diabetes
WO2014026991A1 (en) * 2012-08-13 2014-02-20 Helmholtz Zentrum München - Deutsches Forschungszentrum für Gesundheit und Umwelt (GmbH) Biomarkers for type 2 diabetes
CN104395479A (zh) * 2012-01-27 2015-03-04 香港中文大学 糖尿病的生物标志物
CN105393117A (zh) * 2013-01-31 2016-03-09 凯普里昂蛋白质组学公司 2型糖尿病生物标志物及其用途
CN106310006A (zh) * 2015-06-30 2017-01-11 完美(中国)有限公司 组合包在制备用于改善和治疗人类小胖威利综合症的食品、药品、保健品、营养品中的应用

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080199480A1 (en) * 2004-07-22 2008-08-21 Sequenom, Inc. Methods for Identifying Risk of Type II Diabetes and Treatments Thereof
CN104540962B (zh) * 2012-08-01 2017-09-19 深圳华大基因研究院 糖尿病生物标志物及其应用
EP2932424A2 (en) * 2012-12-13 2015-10-21 Metabogen AB Identification of a person having risk for developing type 2 diabetes
CN106237168B (zh) * 2015-10-21 2020-09-08 完美(广东)日用品有限公司 一种组合包在制备用于减少肠道内菌群耐药基因组的食品、药品等中的应用

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009064901A2 (en) * 2007-11-13 2009-05-22 Veridex, Llc Diagnostic biomarkers of diabetes
CN104395479A (zh) * 2012-01-27 2015-03-04 香港中文大学 糖尿病的生物标志物
WO2014026991A1 (en) * 2012-08-13 2014-02-20 Helmholtz Zentrum München - Deutsches Forschungszentrum für Gesundheit und Umwelt (GmbH) Biomarkers for type 2 diabetes
CN105393117A (zh) * 2013-01-31 2016-03-09 凯普里昂蛋白质组学公司 2型糖尿病生物标志物及其用途
CN106310006A (zh) * 2015-06-30 2017-01-11 完美(中国)有限公司 组合包在制备用于改善和治疗人类小胖威利综合症的食品、药品、保健品、营养品中的应用

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
CHEN, Y.W.: "Gut metagenomes of type 2 diabetic patients have characteri- stic singlenucleotide polymorphism distribution in Bacteroides coprocola", MICROBIOME, vol. 5, no. 15, 15 May 2017 (2017-05-15), pages 1 - 7, XP021241735, DOI: 10.1186/s40168-017-0232-3 *
JIANG LIWEN: "The effect of Dietary Intervention in the Treatment of Diabetes Mellitus", CHINESE JOURNAL OF TRAUMA AND DISABILITY MEDICINE, vol. 20, no. 12, 31 December 2012 (2012-12-31), pages 214 *
ZHAO, L.P. ET AL.: "Gut bacteria selectively promoted by dietary fibers alleviate type 2 diabetes", SCIENCE, vol. 359, no. 6380, 9 March 2018 (2018-03-09), XP055628389, ISSN: 0036-8075, DOI: 10.1126/science.aao5774 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4013411A4 (en) * 2019-08-13 2023-08-16 Tata Consultancy Services Limited PREDIABETES RISK ASSESSMENT SYSTEM AND METHODOLOGY
WO2021148661A1 (en) * 2020-01-23 2021-07-29 Enterome Compositions for treating epithelial barrier function disorders
WO2023223092A1 (en) * 2022-05-18 2023-11-23 Institut Pasteur Identification of a human circovirus

Also Published As

Publication number Publication date
TW201934760A (zh) 2019-09-01
US20200377945A1 (en) 2020-12-03
TWI707043B (zh) 2020-10-11
CN110527717B (zh) 2023-08-18
CN110527717A (zh) 2019-12-03

Similar Documents

Publication Publication Date Title
WO2019149247A1 (zh) 用于2型糖尿病的生物标志物及其用途
US11931387B2 (en) Methods and compositions relating to microbial treatment and diagnosis of disorders
Borgo et al. Body mass index and sex affect diverse microbial niches within the gut
WO2018214855A1 (zh) 改善营养代谢肠功能及微生态的抗炎组合物和食品及应用
WO2018152306A1 (en) Modulation of host immune cell populations using gut microbiota
Tchitchek et al. Low-dose IL-2 shapes a tolerogenic gut microbiota that improves autoimmunity and gut inflammation
Steinbach et al. Upper small intestine microbiome in obesity and related metabolic disorders: A new field of investigation
Anwesh et al. Elucidating the richness of bacterial groups in the gut of Nicobarese tribal community–perspective on their lifestyle transition
Li et al. Consistent alterations of human fecal microbes after transplantation into germ-free mice
Song et al. An investigation into the correlation of intestinal flora with obesity and gestational diabetes mellitus
Rasmussen et al. Starch-entrapped microsphere fibers improve bowel habit but do not exhibit prebiotic capacity in those with unsatisfactory bowel habits: A phase I, randomized, double-blind, controlled human trial
Liang et al. Oral probiotics increased the proportion of Treg, Tfr, and Breg cells to inhibit the inflammatory response and impede gestational diabetes mellitus
WO2021219001A1 (zh) 用于提高免疫力的组合物
Sun et al. Microbial biomarkers for colorectal cancer identified with random forest model
Shestopalov et al. Effect of the Infant Feeding Type on Gut Microbiome Taxonomy and Levels of Trefoil Factors in Children and Adolescents
Kushugulova et al. The clinical effect of probiotic-containing product on health indicators among a population of Central Asia.
Huang et al. Gut microbial genomes with paired isolates from China signify probiotic and cardiometabolic effects
Tang Key determinants of the intestinal microbiota in early childhood
Comerlato et al. The Influence of Protein Secretomes of Enterococcus durans on ex vivo Human Gut Microbiome
WO2024030081A1 (en) Microbiome markers
Kordy et al. Multi-Omic Predictors of Steatohepatitis and Advanced Fibrosis in Children

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19747619

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19747619

Country of ref document: EP

Kind code of ref document: A1