WO2020244017A1 - Intestinal flora-based schizophrenia biomarker combination, and applications thereof and motu screening method therefor - Google Patents

Intestinal flora-based schizophrenia biomarker combination, and applications thereof and motu screening method therefor Download PDF

Info

Publication number
WO2020244017A1
WO2020244017A1 PCT/CN2019/094923 CN2019094923W WO2020244017A1 WO 2020244017 A1 WO2020244017 A1 WO 2020244017A1 CN 2019094923 W CN2019094923 W CN 2019094923W WO 2020244017 A1 WO2020244017 A1 WO 2020244017A1
Authority
WO
WIPO (PCT)
Prior art keywords
schizophrenia
biomarker
relative abundance
combination
motu
Prior art date
Application number
PCT/CN2019/094923
Other languages
French (fr)
Chinese (zh)
Inventor
朱峰
王奇
马现仓
鞠艳梅
王崴
郭锐进
范雅娟
贾慧珏
马青艳
郭丽阳
高成阁
Original Assignee
西安交通大学医学院第一附属医院
深圳华大生命科学研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 西安交通大学医学院第一附属医院, 深圳华大生命科学研究院 filed Critical 西安交通大学医学院第一附属医院
Publication of WO2020244017A1 publication Critical patent/WO2020244017A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6888Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
    • C12Q1/689Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/136Screening for pharmacological compounds

Definitions

  • the invention belongs to the technical field of biomedicine, and relates to a biomarker for schizophrenia based on intestinal flora, a kit for diagnosing or predicting the risk of schizophrenia, and applications.
  • Schizophrenia (English: Schizophrenia) is a group of severe mental illnesses of unknown etiology, mostly slow or subacute onset in young adults, clinically often manifested as syndromes with different symptoms, involving perception, thinking, emotion and behavior Various obstacles and incoordination of mental activities. Patients generally have clear consciousness and normal intelligence, but some patients will suffer from cognitive impairment during the course of the disease. The course of the disease is generally protracted, showing repeated attacks, aggravation or deterioration. Most patients will eventually experience decline and mental disability, and only a few patients will be cured or basically cured after treatment.
  • the problem solved by the present invention is to provide a combination of biomarkers for schizophrenia based on intestinal flora and its application, which can overcome the inability of the existing diagnosis of schizophrenia to achieve early warning, fail to predict the incidence and development trend of schizophrenia, etc. Disadvantages, it can help the pathological classification of diseases and the study of drug targets, precise medication, and pathogenesis.
  • a biomarker combination for schizophrenia based on intestinal flora which is used to provide relative abundance information, and includes one or more selected from the following:
  • Biomarker 1 Lachnospiraceae bacterium 3_1_57FAA_CT1;
  • Biomarker 2 Cronobacter sakazakii
  • Biomarker 3 Lactobacillus acidophilus
  • Biomarker 4 Veillonella parvula
  • Biomarker 5 Lactococcus lactis
  • Biomarker 6 Alkaliphilus oremlandii
  • Biomarker 7 Pseudoflavonifractor capillosus
  • Biomarker 8 Streptococcus gallolyticus
  • Biomarker 9 Dialister invisus
  • Biomarker 10 Lactobacillus johnsonii
  • Biomarker 11 Methanobrevibacter smithii.
  • the relative abundance information provided by the biomarker combination is used for comparison with reference values.
  • the relative abundance information of the biomarkers 1-11 is provided based on the gene sequence for which abundance calculation can be performed.
  • biomarkers for schizophrenia based on intestinal flora is used as a detection target or a detection target in the preparation of a detection kit.
  • the method for screening a combination of biomarkers for schizophrenia based on intestinal flora includes the following steps:
  • Sample collection After collecting stool samples, they are frozen and transported and quickly transferred to -80°C for storage. DNA extraction is performed to obtain extracted DNA samples.
  • the sample subjects include schizophrenia patients and healthy people;
  • the sample subjects included 90 schizophrenia patients and 81 healthy people, and the verification collection, the sample subjects included 10 schizophrenia patients and 10 healthy people.
  • the method of using the above markers, that is, to diagnose whether the subject has schizophrenia or predict whether the subject has the risk of schizophrenia is:
  • step 2) Compare the relative abundance information described in step 2) with the reference data set or reference value.
  • the method can be used not only for disease diagnosis in the sense of patent law, but also for non-disease diagnosis such as scientific research or other personal genetic information enrichment and genetic information database enrichment.
  • the relative abundance information of each biomarker in the test subject is compared with a reference data set or reference value to determine whether the subject has schizophrenia, or predict the risk of schizophrenia.
  • the reference data set includes relative abundance information of biomarkers in samples from multiple schizophrenia patients and multiple healthy controls.
  • the reference data set refers to the relative abundance information of each biomarker obtained by operating the samples of the diagnosed diseased individuals and healthy individuals, which is used as a reference for the relative abundance of each biomarker .
  • the reference data set may refer to the training data set.
  • the training set refers to and the verification set has a well-known meaning in the art.
  • the training set refers to a data set containing the content of each biomarker in the test samples of a certain number of schizophrenic subjects and non-schizophrenic subjects.
  • the verification set is an independent data set used to test the performance of the training set.
  • the reference value in the present invention refers to the reference value or normal value of healthy controls.
  • Those skilled in the art know that when the sample volume is large enough, detection and calculation methods known in the art can be used to obtain the normal value (absolute value) range of each biomarker in the sample.
  • the absolute value of the biomarker level in the sample can be directly compared with a reference value to assess the risk of disease and diagnose or early diagnose schizophrenia.
  • the step of comparing the relative abundance information with the reference data set in step 2) further includes executing a multivariate statistical model to obtain the disease probability.
  • the use of multivariate statistical models can achieve fast and efficient detection.
  • the multivariate statistical model is a random forest model.
  • the probability of illness being greater than the threshold indicates that the subject has schizophrenia or related diseases or is at risk of suffering from schizophrenia or related diseases.
  • the threshold is 0.5.
  • the relative abundance information of the biomarker in step 2) is obtained by a sequencing method, and further includes: isolating a nucleic acid sample from the sample of the subject, and constructing a DNA library based on the obtained nucleic acid sample , Sequencing the DNA library to obtain a sequencing result; and comparing the sequencing result with a reference gene set based on the sequencing result to determine the relative abundance information of the biomarker.
  • At least one of SOAP2 and MAQ can be used to compare the sequencing result with the reference gene set, thereby improving the efficiency of the comparison, and thus the efficiency of detecting schizophrenia.
  • multiple (at least two) biomarkers can be detected at the same time, which can improve the efficiency of schizophrenia detection.
  • the reference gene set includes performing metagenomic sequencing from samples of multiple schizophrenia patients and multiple healthy controls to obtain a non-redundant gene set, and then combining the non-redundant gene set with the gut microbial gene set, Obtain the reference gene set.
  • the reference gene set in the present invention can be an existing gene set, such as an existing intestinal microbial reference gene set that has been published; it can also be metagenomic sequencing of samples from multiple schizophrenia patients and multiple healthy controls , Obtaining a non-redundant gene set, and then combining the non-redundant gene set with the gut microbial gene set to obtain the reference gene set, and the obtained reference gene set information is more comprehensive and the detection result is more reliable.
  • Redundant gene set is explained as a person skilled in the art generally understands, and in simple terms is the set of remaining genes after the redundant genes are removed. Redundant genes usually refer to multiple copies of a gene that appear on a chromosome.
  • the sample is a stool sample.
  • the sequencing method is performed by a second-generation sequencing method or a third-generation sequencing method.
  • the means for sequencing is not particularly limited. Fast and efficient sequencing can be achieved by second-generation or third-generation sequencing methods.
  • the sequencing method is performed by at least one selected from the group consisting of Hiseq2000, SOLiD, 454, and single molecule sequencing devices.
  • Hiseq2000 Hiseq2000
  • SOLiD single molecule sequencing devices
  • the present invention proposes the application of the schizophrenia biomarker combination based on the intestinal flora as a detection target or a detection target in the preparation of a detection kit for diagnosing whether a subject has schizophrenia Or related diseases or predict whether the subject has a risk of schizophrenia or related diseases.
  • the present invention proposes a kit that includes reagents for detecting biomarkers.
  • the kit With the kit, the relative abundance of these markers in the intestinal flora can be determined, and thus, the obtained The relative abundance value is used to determine whether the subject has or is susceptible to schizophrenia, and the efficiency of the treatment effect for monitoring schizophrenia patients.
  • the present invention proposes the application of a combination of biomarkers for schizophrenia based on intestinal flora as a target in the screening of drugs for the treatment and/or prevention of schizophrenia.
  • the biomarkers are the above-mentioned biomarkers of the present invention, and the influence of the candidate drug on these biomarkers before and after use can be used to determine whether the candidate drug can be used to treat or prevent schizophrenia.
  • the change in the relative abundance of the biomarker combination provides a basis for determining whether the candidate drug is effective.
  • the present invention has the following beneficial technical effects:
  • intestinal microbes are the microbial community that exists in the human intestinal tract and are the "second genome" of the human body.
  • the human intestinal flora and host constitute an interconnected whole.
  • Gut microbes can produce most of the neurotransmitters found in the human brain. More and more evidence supports the view that intestinal microbes affect central neurochemistry and behavior. Irritable bowel syndrome is considered a typical case of brain-gut microbial axis regulation disorders.
  • Translational studies have shown that certain specific flora may have an impact on stress response and cognitive function.
  • Using probiotics or antibiotics to change the intestinal microbiota provides a new method for improving brain function and treating depression and autism and other intestinal-brain axis diseases. Therefore, the present invention analyzes the intestinal flora and gene sequences of patients with schizophrenia and healthy people, thereby screening biomarkers with high correlation with schizophrenia, and using the markers can accurately diagnose schizophrenia Or predict the risk of disease, and can be used to monitor the effect of treatment.
  • Stool is a metabolite of the human body. It not only contains metabolites of the human body, but also includes intestinal microbes that are closely related to changes in our body's metabolism and immunity and brain function. Research on feces found that there are obvious differences in the composition of the intestinal flora between schizophrenia patients and healthy people, which can accurately assess the risk of schizophrenia and make early diagnosis.
  • the present invention is based on the comparison and analysis of the intestinal flora of schizophrenia patients and healthy people, and obtains a variety of related intestinal microbes. Combining high-quality mOTU of schizophrenia and healthy people as a training set, it can accurately treat mental Patients with schizophrenia undergo risk assessment and early diagnosis. Compared with the current commonly used diagnostic methods, this method has the characteristics of convenience and speed.
  • the schizophrenia-related biomarkers proposed by the present invention are valuable for early diagnosis.
  • the marker of the present invention has high specificity and sensitivity.
  • the analysis of stool ensures accuracy, safety, affordability and patient compliance. And the stool sample is transportable. Polymerase chain reaction (PCR)-based tests are comfortable and non-invasive, so it is easier for people to participate in a given screening procedure.
  • the marker of the present invention can also be used as a tool for treatment monitoring of patients with schizophrenia to detect the response to treatment. Due to the reasons of abundance measurement and population selection, the combination of 11 markers in the present invention is particularly suitable for the measurement of abundance based on the mOTU method of fecal mOTU of people in northwestern China.
  • Figure 1 shows the ⁇ diversity of patients with schizophrenia and healthy controls at the species level according to an embodiment of the present invention.
  • the graphic shows that there is a significant difference between schizophrenic patients and healthy controls.
  • Fig. 2 shows the error rate distribution of five 10-fold cross-validation in a random forest classifier according to an embodiment of the present invention.
  • Fig. 3 is a receiver operating characteristic (ROC) curve and curve of a training set composed of schizophrenia patients and healthy controls based on a random forest model (11 intestinal markers) according to an embodiment of the present invention Area under Curve (AUC).
  • ROC receiver operating characteristic
  • AUC Area under Curve
  • Figure 4 shows the ROC curve and AUC of a validation set composed of schizophrenia patients and healthy controls based on a random forest model (11 gut markers) according to an embodiment of the present invention.
  • Schiphrenia is a group of severe mental illnesses of unknown etiology, mostly in young adults with slow or subacute onset, clinically often manifested as syndromes with different symptoms, involving perception, thinking, emotion, and behavior. Obstacles and incoordination of mental activities.
  • Biomarker also known as “biological marker” refers to a measurable indicator of the biological state of an individual.
  • biomarkers can be any substance in the individual, as long as they are related to the specific biological state (e.g., disease) of the subject, for example, nucleic acid markers (also called genetic markers, such as DNA), Protein markers, cytokine markers, chemokine markers, carbohydrate markers, antigen markers, antibody markers, species markers (species/genus markers) and functional markers (KO/OG markers), etc.
  • nucleic acid markers is not limited to existing genes that can be expressed as biologically active proteins, but also includes any nucleic acid fragments, which can be DNA, RNA, modified DNA or RNA, or It is unmodified DNA or RNA, and a collection of them.
  • nucleic acid markers can sometimes be referred to as characteristic fragments.
  • biomarkers can also be expressed as "intestinal markers", because the found biomarkers related to schizophrenia are all present in the intestine of the subject. Biomarkers are measured and evaluated, often used to check normal biological processes, pathogenic processes, or pharmacological responses to therapeutic interventions, and are useful in many scientific fields.
  • the biomarkers can be used for high-throughput sequencing to batch analyze stool samples of healthy people and patients with schizophrenia. Based on the high-throughput sequencing data, the healthy population is compared with the schizophrenia patient group to determine the specific nucleic acid sequence related to the schizophrenia patient group. In short, the steps are as follows:
  • Sample collection and processing collect stool samples from healthy people and schizophrenia patients, and use the kit for DNA extraction to obtain nucleic acid samples;
  • DNA library construction and sequencing are performed by high-throughput sequencing to obtain the nucleic acid sequence of gut microbes contained in stool samples;
  • the nucleic acid sequences of specific gut microbes related to schizophrenia patients are determined.
  • the reads are combined with a reference gene set (also called a reference gene set, which can be a newly constructed gene set or any database of known sequences, for example, using known non-redundant genes of the human gut microbial community Set) for comparison.
  • a reference gene set also called a reference gene set, which can be a newly constructed gene set or any database of known sequences, for example, using known non-redundant genes of the human gut microbial community Set
  • the relative abundance of each gene in the nucleic acid samples from the stool samples of healthy people and schizophrenia patients are determined.
  • the corresponding relationship between the sequencing sequence and the genes in the reference gene set can be established, so that for a specific gene in a nucleic acid sample, the number of corresponding sequencing sequences can effectively reflect the gene Relative abundance. Therefore, the relative abundance of genes in the nucleic acid sample can be determined through the comparison results and conventional statistical analysis. Finally, after the relative abundance of each gene in the nucleic acid sample is determined, the relative abundance of each gene in the nucleic acid samples from the feces of healthy people and schizophrenic patients is statistically tested. From this, it can be judged in healthy people and mental health. Whether there is a gene with a significant difference in relative abundance in the population of schizophrenia patients, if the gene is significantly different, the gene is regarded as a biomarker of an abnormal state, that is, a nucleic acid marker.
  • the gene species information and functional annotations can be further classified.
  • the species markers and functional markers of the abnormal state can be further determined.
  • the method for determining species markers and functional markers further includes: comparing the sequencing sequences of healthy people and schizophrenia patients with a reference gene set; based on the comparison results, determining healthy people and schizophrenia respectively The relative abundance of species and relative abundance of each gene in nucleic acid samples of patients with diseases; the relative abundance of species and relative abundance of each gene in nucleic acid samples from healthy people and schizophrenia patients Test; and respectively determine species markers and functional markers that have significant differences in the relative abundance of nucleic acid samples between healthy people and schizophrenia patients.
  • the relative abundance of genes from the same species and the relative abundance of genes with the same functional annotation can be used to perform statistical tests, such as summation, average value, median value, etc., to determine the function Relative abundance and relative abundance of species.
  • the technical means used in the examples are conventional means well known to those skilled in the art, and can be carried out with reference to the third edition of the "Molecular Cloning Experiment Guide” or related products.
  • the reagents and products used are also available. Commercially acquired.
  • the various processes and methods that are not described in detail are conventional methods well known in the art.
  • the source of the reagents used, the trade name, and those whose components are necessary to be listed are all indicated when they first appear, and the same reagents used thereafter are not special The description is the same as the content indicated for the first time.
  • the present invention adopts the analysis method of Metagenome-Wide Association Study (MWAS) to analyze the bacterial composition and functional difference of stool samples by sequencing; the random forest discriminant model is used to discriminate the schizophrenic group and the non-schizophrenic group, Obtain the probability of illness, which can be used for risk assessment, diagnosis, early diagnosis of schizophrenia or to find potential drug targets.
  • MWAS Metagenome-Wide Association Study
  • the term "individual” refers to animals, especially mammals, such as primates, and preferably humans.
  • the sequencing (second-generation sequencing) and MWAS are well known in the art, and those skilled in the art can make adjustments according to specific conditions. According to the embodiment of the present invention, it can be performed according to the method described in the literature (Jun Wang, and Huijue Jia. Metagenome-wide association studies: fine-mining the microbiome. Nature Reviews Microbiology 14.8 (2016): 508-522.).
  • mOTU refers to the operational classification unit (metagenomics Operational Taxonomic Units) (Sunagawa S, Mende D R, Zeller G, et al. Metagenomic species profiling using universal phylogenetic marker genes [J]. Nature, methods, 2013 10(12):1196-1199.), in phylogenetic research or population genetics research, in order to facilitate analysis, artificially set the same mark for a certain taxa (line, species, genus, group, etc.). The sequence is usually divided into different mOTUs according to the similarity threshold, and each mOTU is usually regarded as a microbial species.
  • the method of using the random forest model and the ROC curve is well known in the art, and those skilled in the art can set and adjust the parameters according to specific conditions. According to the embodiment of the present invention, it can be based on the literature (Drogan D, Dunn WB, Lin W, Buijsse B, Schulze MB, Langenberg C, Brown M, Floegel a., Dietrich S, Rolandsson O, Wedge DC, Goodacre R, Forouhi NG ,Sharp SJ,Spranger J,Wareham NJ,Boeing H:Untargeted Metabolic Profiling Identifies Altered Serum Metabolites of Type 2-Diabetes Melitus in a Prospective, Nested Case Control Study.Clin 487-Study, Milik-Study.
  • a training set of biomarkers for schizophrenia subjects and non-schizophrenia subjects is constructed, and based on this, the biomarker content value of the sample to be tested is evaluated.
  • the normal content value range (absolute value) of each biomarker in the sample can be obtained by using sample detection and calculation methods known in the art.
  • the absolute value of the detected biomarker content can be compared with the normal content value.
  • statistical methods can be combined to obtain risk assessment and diagnosis of schizophrenia, and for monitoring patients with schizophrenia The efficiency of the treatment effect and so on.
  • these biomarkers are the intestinal flora present in the human body.
  • the intestinal flora of the subject is associated with the analysis, and it is obtained that the biomarker of the schizophrenia population shows a certain content range value in the detection of the flora.
  • the extracted DNA samples were used to construct a sequencing library, and the paired-end metagenomic sequencing (insert 350bp, read length 100bp) was performed on the Illumina HiSeq2000 sequencing platform.
  • the data generated by sequencing is filtered (quality-controlled) to remove adapter-contaminated sequences, low-quality sequences, and host genome-contaminated sequences to obtain high-quality reads. .
  • the abundance calculation steps are as follows: 1) Compare the high-quality sequencing fragments to the reference single-copy gene; 2) Count the number of inserts according to the comparison results; 3) Compare the number of inserts to the length of the single-copy gene Standardization (standardize according to the average gene length, and get the abundance of the corresponding mOTU by rounding down) to get the corresponding abundance.
  • this example constructs a training set of biomarkers for schizophrenia subjects and non-schizophrenia subjects, and uses this as a benchmark to determine the biomarkers of the sample to be tested.
  • the material content value is evaluated.
  • the training set and the verification set have meanings known in the art.
  • the training set refers to a data set containing the content of each biomarker in the sample to be tested for a certain number of schizophrenic subjects and non-schizophrenic subjects.
  • the validation set is an independent data set used to test the performance of the training set.
  • non-schizophrenic subjects are subjects with good mental states, and the subjects may be humans or model animals. In this embodiment, humans are used as subjects for experiments.
  • the present invention randomly selects 80 schizophrenia patients and 71 healthy people from 171 samples (90 schizophrenia patients and 81 healthy people) as the training set (Table 1-1, 1-2), and the rest The sample is used as a validation set (10 schizophrenic patients and 10 healthy people).
  • the RF classifier obtained in the present invention contains 11 metabolites (ie 11 biomarkers).
  • the relative abundances of these 11 biomarkers are shown in Tables 1-1 and 1-2, respectively.
  • the detailed information is shown in Table 2 shows.
  • Table 3 shows the combination of 11 biomarkers to predict the probability of illness in the training set, where the probability of illness ⁇ 0.5 confirms that the individual is at risk of schizophrenia or has schizophrenia.
  • Figure 2 shows the error rate distribution of the 5-fold 10-fold cross-validation in the random forest classifier.
  • the model is trained with the training set samples on the relative abundance of mOTU that meets the target obtained through the MWAS process.
  • the thick black solid curve represents the average value of 5 trials (the thin black curve represents 5 trials), and the black vertical line represents the selected The number of mOTUs in the best combination.
  • Figure 3 shows the ROC curve and AUC of a training set composed of schizophrenia patients and healthy controls based on the random forest model (11 biomarkers), where the specific characterization of the probability of correcting the disease is sensitive to Sex refers to the probability of correcting the disease.
  • the results show that the metabolite combination obtained from this model can be used as a potential biomarker to distinguish schizophrenia from non-schizophrenia.
  • the present invention uses an independent population to verify the model, and the disease probability is greater than or equal to 0.5 to predict that the individual is at risk of schizophrenia or suffers from schizophrenia.
  • Table 5 shows the probability of disease based on the 11 biomarker prediction validation set.
  • Figure 4 shows the ROC curve and AUC of an independent validation set composed of schizophrenia patients and healthy controls based on the random forest model (11 biomarkers).
  • the input includes training set data (that is, the relative abundance of the selected mOTU markers in the training sample, see Tables 1-1 and 1-2), sample disease status (the sample disease status of the training sample is a vector, and '1' represents the spirit Schizophrenia, '0' stands for healthy control), and a validation set (the relative abundance of the selected mOTU markers in the validation set, see Table 4-1, 4-2).
  • the inventor uses the random forest function of the random forest package in the R software to establish classification and prediction functions to predict the verification set data, and the output is the prediction result (probability of disease); the threshold is 0.5, if the probability of disease is ⁇ 0.5, then Think that you are at risk of schizophrenia or suffer from schizophrenia.
  • Table 1-1 Relative abundance data of intestinal markers (mOTU) in the training set of random forest model
  • #Validation set AUC indicates the degree of discrimination of the validation set data under the model obtained from the training set data.
  • the biomarker disclosed in the present invention has high accuracy and specificity, and has a good prospect of being developed as a diagnostic method, so as to find potential drugs for risk assessment, diagnosis, and early diagnosis of schizophrenia
  • the target provides the basis.
  • biomarkers for schizophrenia based on intestinal flora is used as a detection target or a detection target in the preparation of a detection kit.
  • the change in the relative abundance of the biomarker combination provides a basis for determining whether the candidate drug is effective.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Immunology (AREA)
  • Physics & Mathematics (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

Disclosed are an intestinal flora-based schizophrenia biomarker combination and applications. On the basis of the influence on biomarkers before and after use of candidate drugs, whether the candidate drugs can be used for treating or preventing schizophrenia is determined.

Description

一种基于肠道菌群的精神分裂症生物标志物组合、其应用及mOTU筛选方法A schizophrenia biomarker combination based on intestinal flora, its application and mOTU screening method 技术领域Technical field
本发明属于生物医药技术领域,涉及一种基于肠道菌群的精神分裂症的生物标志物、诊断或预测精神分裂症风险的试剂盒及应用。The invention belongs to the technical field of biomedicine, and relates to a biomarker for schizophrenia based on intestinal flora, a kit for diagnosing or predicting the risk of schizophrenia, and applications.
背景技术Background technique
精神分裂症(英语:Schizophrenia)是一组病因未明的重性精神病,多在青壮年缓慢或亚急性起病,临床上往往表现为症状各异的综合征,涉及感知觉、思维、情感和行为等多方面的障碍以及精神活动的不协调。患者一般意识清楚,智能基本正常,但部分患者在疾病过程中会伴有认知功能的损害。病程一般迁延,呈反复发作、加重或恶化,多数患者最终出现衰退和精神残疾,仅少数患者经过治疗后可达到痊愈或基本痊愈状态。Schizophrenia (English: Schizophrenia) is a group of severe mental illnesses of unknown etiology, mostly slow or subacute onset in young adults, clinically often manifested as syndromes with different symptoms, involving perception, thinking, emotion and behavior Various obstacles and incoordination of mental activities. Patients generally have clear consciousness and normal intelligence, but some patients will suffer from cognitive impairment during the course of the disease. The course of the disease is generally protracted, showing repeated attacks, aggravation or deterioration. Most patients will eventually experience decline and mental disability, and only a few patients will be cured or basically cured after treatment.
精神分裂症的全球患病率约为0.3-0.7%。截止2016年,全球估计有超过2100万名精神分裂症患者,其平均预期寿命较正常人缩短10年至25年。The global prevalence of schizophrenia is about 0.3-0.7%. As of 2016, there are estimated to be more than 21 million people with schizophrenia in the world, and their average life expectancy is 10 to 25 years shorter than that of normal people.
尽管既往研究表明精神分裂症发病是由遗传因素及环境因素共同作用所致,且患者的血清及脑组织都呈现部分异常改变,但目前对精神分裂症的诊断仍依赖于症状学评价,尚无可靠的生物学标记物标识。此外,现有的诊断标准不能早期预测精神分裂的发病,疗效及预后。Although previous studies have shown that the onset of schizophrenia is caused by a combination of genetic and environmental factors, and that the patient’s serum and brain tissue have some abnormal changes, the current diagnosis of schizophrenia still relies on symptomatic evaluation. Reliable biomarker identification. In addition, the existing diagnostic criteria cannot early predict the onset, efficacy and prognosis of schizophrenia.
发明内容Summary of the invention
本发明解决的问题在于提供一种基于肠道菌群的精神分裂症生物标志物组合及其应用,可以克服现有精神分裂症诊断不能做到早期预警、不能预测精神分裂症发病以及发展趋势等缺点,能够帮助疾病病理分型以及为药物作用靶点研究、精准用药、发病机理的研究等。The problem solved by the present invention is to provide a combination of biomarkers for schizophrenia based on intestinal flora and its application, which can overcome the inability of the existing diagnosis of schizophrenia to achieve early warning, fail to predict the incidence and development trend of schizophrenia, etc. Disadvantages, it can help the pathological classification of diseases and the study of drug targets, precise medication, and pathogenesis.
本发明是通过以下技术方案来实现:The present invention is realized through the following technical solutions:
一种基于肠道菌群的精神分裂症生物标志物组合,该生物标志物组合用于提供相对丰度信息,其包含选自以下的一种或多种:A biomarker combination for schizophrenia based on intestinal flora, which is used to provide relative abundance information, and includes one or more selected from the following:
生物标志物1:Lachnospiraceae bacterium 3_1_57FAA_CT1;Biomarker 1: Lachnospiraceae bacterium 3_1_57FAA_CT1;
生物标志物2:Cronobacter sakazakii;Biomarker 2: Cronobacter sakazakii;
生物标志物3:Lactobacillus acidophilus;Biomarker 3: Lactobacillus acidophilus;
生物标志物4:Veillonella parvula;Biomarker 4: Veillonella parvula;
生物标志物5:Lactococcus lactis;Biomarker 5: Lactococcus lactis;
生物标志物6:Alkaliphilus oremlandii;Biomarker 6: Alkaliphilus oremlandii;
生物标志物7:Pseudoflavonifractor capillosus;Biomarker 7: Pseudoflavonifractor capillosus;
生物标志物8:Streptococcus gallolyticus;Biomarker 8: Streptococcus gallolyticus;
生物标志物9:Dialister invisus;Biomarker 9: Dialister invisus;
生物标志物10:Lactobacillus johnsonii;Biomarker 10: Lactobacillus johnsonii;
生物标志物11:Methanobrevibacter smithii。Biomarker 11: Methanobrevibacter smithii.
所述的生物标志物组合提供的相对丰度信息用于和参考值进行比较。The relative abundance information provided by the biomarker combination is used for comparison with reference values.
所述的生物标志物1~11的相对丰度信息是基于能够对其进行丰度计算的基因序列来提供的。The relative abundance information of the biomarkers 1-11 is provided based on the gene sequence for which abundance calculation can be performed.
所述的基于肠道菌群的精神分裂症生物标志物组合作为检测靶点或检测目标在制备检测试剂盒中的应用。The combination of biomarkers for schizophrenia based on intestinal flora is used as a detection target or a detection target in the preparation of a detection kit.
所述的基于肠道菌群的精神分裂症生物标志物组合的筛选方法,步骤如下:The method for screening a combination of biomarkers for schizophrenia based on intestinal flora includes the following steps:
1)样本收集:采集粪便样品后冷冻运输并迅速转移到-80℃保存,进行DNA提取,得到提取的DNA样本,样本受试者包括精神分裂症病人和健康人;1) Sample collection: After collecting stool samples, they are frozen and transported and quickly transferred to -80°C for storage. DNA extraction is performed to obtain extracted DNA samples. The sample subjects include schizophrenia patients and healthy people;
2)宏基因组测序与组装2) Metagenomic sequencing and assembly
3)保守的单拷贝基因比对与丰度计算3) Conservative single-copy gene alignment and abundance calculation
3)将高质量的测序片段输入到软件mOTU计算出物种的相对丰度:3) Input high-quality sequencing fragments into the software mOTU to calculate the relative abundance of species:
3.1)将高质量测序片段比对到参考的单拷贝基因上;3.1) Align the high-quality sequencing fragments to the reference single-copy gene;
3.2)根据比对结果统计插入片段的数量;3.2) Count the number of inserts according to the comparison results;
3.3)将插入片段的数量对单拷贝基因的长度进行标准化得到对应的丰度。3.3) Normalize the number of inserts to the length of a single copy gene to obtain the corresponding abundance.
4)从样本集中随机地选取精神分裂症病人和健康人作为训练集,其余样品作为验证集,计算训练集中每个样本中mOTU的相对丰度,然后将训练集的mOTU输入随机森林分类器,对分类器进行5次10折交叉验证,利用RF模型筛选的mOTU相对丰度对每一个体计算其精神分裂症患病风险,绘制ROC曲线,并计算出AUC作为判别模型效能评价参数,选取标志物组合数<30,且判别效能最佳的组合,在模型中输出每个mOTU的重要性指数,重要性指数越高,代表该标志物用来判别精神分裂症和非精神分裂症的重要性越高。4) Randomly select schizophrenic patients and healthy people from the sample set as the training set, and the remaining samples as the validation set. Calculate the relative abundance of mOTU in each sample in the training set, and then input the mOTU of the training set into the random forest classifier. Perform 5 10-fold cross-validation on the classifier, use the relative abundance of mOTU screened by the RF model to calculate the risk of schizophrenia for each individual, draw the ROC curve, and calculate the AUC as the performance evaluation parameter of the discriminant model, and select the marker The number of combination of substances is less than 30, and the combination with the best discrimination performance, the importance index of each mOTU is output in the model. The higher the importance index, it represents the importance of the marker used to distinguish schizophrenia and non-schizophrenia Higher.
所述样本集中,样本受试者包括90个精神分裂症病人和81个健康人,验证集中,样本受试者包括10个精神分裂症病人和10个健康人。In the sample collection, the sample subjects included 90 schizophrenia patients and 81 healthy people, and the verification collection, the sample subjects included 10 schizophrenia patients and 10 healthy people.
上述标志物的使用方法即诊断对象是否患有精神分裂症或者预测对象是否患有精神分裂症的风险的方法为:The method of using the above markers, that is, to diagnose whether the subject has schizophrenia or predict whether the subject has the risk of schizophrenia is:
1)从对象中采集样本;1) Collect samples from the subject;
2)确定步骤1)中获得的所述样本中的生物标志物的相对丰度信息;2) Determine the relative abundance information of the biomarkers in the sample obtained in step 1);
3)将步骤2)中所述的相对丰度信息与参考数据集或参考值进行比较。所述方法不仅仅可以用于专利法意义上的疾病诊断,同时可以用作科学研究或者其他个人遗传信息的丰富以及遗传信息库的丰富等非疾病诊断。利用检测对象中的各生物标志物的相对丰度信息与参考数据集或参考值进行比较,来确定对象是否患有精神分裂症,或者预测其患有精神分裂症的风险。3) Compare the relative abundance information described in step 2) with the reference data set or reference value. The method can be used not only for disease diagnosis in the sense of patent law, but also for non-disease diagnosis such as scientific research or other personal genetic information enrichment and genetic information database enrichment. The relative abundance information of each biomarker in the test subject is compared with a reference data set or reference value to determine whether the subject has schizophrenia, or predict the risk of schizophrenia.
所述参考数据集包括来自多个精神分裂症患者和多个健康对照的样本中的生物标志物的相对丰度信息。The reference data set includes relative abundance information of biomarkers in samples from multiple schizophrenia patients and multiple healthy controls.
所述参考数据集指的是对已确诊为患病个体和健康个体的样本进行操作,所获得的各生 物标志物的相对丰度信息,用来作为每种生物标志物的相对丰度的参考。具体的,参考数据集可以是指训练数据集。根据本发明,所述训练集是指和验证集具有本领域公知的含义。在本发明的一个实施方案中,所述训练集是指包含一定样本数的精神分裂症受试者和非精神分裂症受试者待测样本中的各生物标志物的含量的数据集合。所述验证集是用来测试训练集性能的独立数据集合。The reference data set refers to the relative abundance information of each biomarker obtained by operating the samples of the diagnosed diseased individuals and healthy individuals, which is used as a reference for the relative abundance of each biomarker . Specifically, the reference data set may refer to the training data set. According to the present invention, the training set refers to and the verification set has a well-known meaning in the art. In an embodiment of the present invention, the training set refers to a data set containing the content of each biomarker in the test samples of a certain number of schizophrenic subjects and non-schizophrenic subjects. The verification set is an independent data set used to test the performance of the training set.
本发明中所述参考值指的是健康对照的参考值或正常值。本领域技术人员已知,当样本容量足够大时,可利用本领域公知的检测和计算方法获得样品中每个生物标志物的正常值(绝对值)的范围。当采用测定方法检测生物标志物的水平时,可将样品中的生物标志物水平的绝对值直接与参考值进行比较,以评估患病风险以及诊断或早期诊断精神分裂症,任选地,可以包括统计方法。The reference value in the present invention refers to the reference value or normal value of healthy controls. Those skilled in the art know that when the sample volume is large enough, detection and calculation methods known in the art can be used to obtain the normal value (absolute value) range of each biomarker in the sample. When an assay method is used to detect the level of a biomarker, the absolute value of the biomarker level in the sample can be directly compared with a reference value to assess the risk of disease and diagnose or early diagnose schizophrenia. Optionally, you can Including statistical methods.
步骤2)中所述的相对丰度信息与参考数据集进行比较的步骤中,还包括执行多元统计模型以获得患病概率。利用多元统计模型可以实现快速高效检测。具体的,所述多元统计模型为随机森林模型。The step of comparing the relative abundance information with the reference data set in step 2) further includes executing a multivariate statistical model to obtain the disease probability. The use of multivariate statistical models can achieve fast and efficient detection. Specifically, the multivariate statistical model is a random forest model.
所述患病概率大于阈值表明所述对象患有精神分裂症或相关疾病或者有患有精神分裂症或相关疾病的风险。具体的,所述阈值为0.5。The probability of illness being greater than the threshold indicates that the subject has schizophrenia or related diseases or is at risk of suffering from schizophrenia or related diseases. Specifically, the threshold is 0.5.
步骤2)中所述生物标志物的相对丰度信息是利用测序方法得到的,进一步包括:从所述对象的所述样本中分离得到核酸样本,基于所获得的所述核酸样本,构建DNA文库,对所述DNA文库进行测序,以便获得测序结果;以及基于所述测序结果,将测序结果与参考基因集进行比对,以确定所述生物标志物的相对丰度信息。The relative abundance information of the biomarker in step 2) is obtained by a sequencing method, and further includes: isolating a nucleic acid sample from the sample of the subject, and constructing a DNA library based on the obtained nucleic acid sample , Sequencing the DNA library to obtain a sequencing result; and comparing the sequencing result with a reference gene set based on the sequencing result to determine the relative abundance information of the biomarker.
根据本发明的一种实施例,可以利用SOAP2和MAQ的至少一种将测序结果与参考基因集进行比对,由此,可以提高比对的效率,进而可以提高精神分裂症检测的效率。根据本发明的实施例,可以同时对多种(至少两种)生物标志物进行检测,可以提高精神分裂症检测的效率。According to an embodiment of the present invention, at least one of SOAP2 and MAQ can be used to compare the sequencing result with the reference gene set, thereby improving the efficiency of the comparison, and thus the efficiency of detecting schizophrenia. According to the embodiment of the present invention, multiple (at least two) biomarkers can be detected at the same time, which can improve the efficiency of schizophrenia detection.
所述参考基因集包括从多个精神分裂症患者和多个健康对照的样本中进行宏基因组测序,获得非冗余基因集,然后将所述非冗余基因集与肠道微生物基因集合并,得到所述参考基因集。本发明中的参考基因集可以是已有的基因集,如现有的已经公开的肠道微生物参考基因集;也可以是将多个精神分裂症患者和多个健康对照的样品进行宏基因组测序,获得非冗余基因集,然后将所述非冗余基因集与肠道微生物基因集合并,得到所述参考基因集,由此获得的参考基因集信息更全面,检测结果更可靠。The reference gene set includes performing metagenomic sequencing from samples of multiple schizophrenia patients and multiple healthy controls to obtain a non-redundant gene set, and then combining the non-redundant gene set with the gut microbial gene set, Obtain the reference gene set. The reference gene set in the present invention can be an existing gene set, such as an existing intestinal microbial reference gene set that has been published; it can also be metagenomic sequencing of samples from multiple schizophrenia patients and multiple healthy controls , Obtaining a non-redundant gene set, and then combining the non-redundant gene set with the gut microbial gene set to obtain the reference gene set, and the obtained reference gene set information is more comprehensive and the detection result is more reliable.
所述非冗余基因集作本领域技术人员通常的理解来解释,简单来说是去除冗余基因后的剩余基因的集合。冗余基因通常指的是一条染色体上出现的一个基因的多个复份。The non-redundant gene set is explained as a person skilled in the art generally understands, and in simple terms is the set of remaining genes after the redundant genes are removed. Redundant genes usually refer to multiple copies of a gene that appear on a chromosome.
具体的,所述样本为粪便样本。所述测序方法是通过第二代测序方法或第三代测序方法进行的。进行测序的手段并不受特别限制,通过二代或者三代测序的方法进行测序,可以实现快速高效的测序。Specifically, the sample is a stool sample. The sequencing method is performed by a second-generation sequencing method or a third-generation sequencing method. The means for sequencing is not particularly limited. Fast and efficient sequencing can be achieved by second-generation or third-generation sequencing methods.
所述测序方法是通过选自Hiseq2000、SOLiD、454、和单分子测序装置的至少一种进行 的。由此,能够利用这些测序装置的高通量、深度测序的特点,从而有利于对后续测序数据进行分析,尤其是进行统计学检验时的精确性和准确度。The sequencing method is performed by at least one selected from the group consisting of Hiseq2000, SOLiD, 454, and single molecule sequencing devices. As a result, the high-throughput and deep sequencing characteristics of these sequencing devices can be utilized, thereby facilitating the analysis of subsequent sequencing data, especially the accuracy and accuracy of statistical testing.
本发明提出了所述的基于肠道菌群的精神分裂症生物标志物组合作为检测靶点或检测目标在制备检测试剂盒中的应用,所述试剂盒用于诊断对象是否患有精神分裂症或相关疾病或者预测对象是否患有精神分裂症或相关疾病的风险。The present invention proposes the application of the schizophrenia biomarker combination based on the intestinal flora as a detection target or a detection target in the preparation of a detection kit for diagnosing whether a subject has schizophrenia Or related diseases or predict whether the subject has a risk of schizophrenia or related diseases.
即,本发明提出了一种包括用于检测生物标志物的试剂的试剂盒,利用该试剂盒,可以确定这些标志物在肠道菌群中的相对丰度,由此,可以通过所得到的相对丰度值,从而确定对象是否患有或者易感精神分裂症,以及用于监控精神分裂症患者的治疗效果的效率。That is, the present invention proposes a kit that includes reagents for detecting biomarkers. With the kit, the relative abundance of these markers in the intestinal flora can be determined, and thus, the obtained The relative abundance value is used to determine whether the subject has or is susceptible to schizophrenia, and the efficiency of the treatment effect for monitoring schizophrenia patients.
本发明提出了基于肠道菌群的精神分裂症生物标志物组合作为靶点在筛选治疗和/或者预防精神分裂症的药物中的应用。所述生物标志物为上述本发明提出的生物标志物,可以利用候选药物使用前和使用后对这些生物标志物的影响,从而确定候选药物是否可以用于治疗或预防精神分裂症。The present invention proposes the application of a combination of biomarkers for schizophrenia based on intestinal flora as a target in the screening of drugs for the treatment and/or prevention of schizophrenia. The biomarkers are the above-mentioned biomarkers of the present invention, and the influence of the candidate drug on these biomarkers before and after use can be used to determine whether the candidate drug can be used to treat or prevent schizophrenia.
所述的生物标志物组合相对丰度的变化为确定候选药物是否有效提供依据。The change in the relative abundance of the biomarker combination provides a basis for determining whether the candidate drug is effective.
与现有技术相比,本发明具有以下有益的技术效果:Compared with the prior art, the present invention has the following beneficial technical effects:
本发明基于以下事实和问题的发现和认识作出的:肠道微生物是存在于人体肠道中的微生物群落,是人体的“第二基因组”。人体肠道菌群和宿主构成一个相互关联的整体。肠道微生物能够产生在人脑中发现的大多数神经递质。越来越多的证据支持肠道微生物影响中枢神经化学和行为的观点,肠易激综合征被认为是大脑-肠道微生物轴调节紊乱的典型案例。转化研究表明,某些特定的菌群可能会对压力反应和认知功能产生影响。用益生菌或抗生素改变肠道微生物群,为提高大脑功能和治疗抑郁及自闭症等肠-脑轴疾病提供了一种新的方法。因此,本发明通过对精神分裂症患者和健康人群的肠道菌群以及基因序列进行分析,从而筛选出与精神分裂症相关性高的生物标志物,并且利用该标志物能够准确地诊断精神分裂症或者预测患病风险,并且可以用于监测治疗效果。The present invention is based on the discovery and recognition of the following facts and problems: intestinal microbes are the microbial community that exists in the human intestinal tract and are the "second genome" of the human body. The human intestinal flora and host constitute an interconnected whole. Gut microbes can produce most of the neurotransmitters found in the human brain. More and more evidence supports the view that intestinal microbes affect central neurochemistry and behavior. Irritable bowel syndrome is considered a typical case of brain-gut microbial axis regulation disorders. Translational studies have shown that certain specific flora may have an impact on stress response and cognitive function. Using probiotics or antibiotics to change the intestinal microbiota provides a new method for improving brain function and treating depression and autism and other intestinal-brain axis diseases. Therefore, the present invention analyzes the intestinal flora and gene sequences of patients with schizophrenia and healthy people, thereby screening biomarkers with high correlation with schizophrenia, and using the markers can accurately diagnose schizophrenia Or predict the risk of disease, and can be used to monitor the effect of treatment.
粪便是人体的代谢产物,其内不仅包含人体的代谢产物,还包括对我们的机体代谢和免疫以及脑功能的变化密切相关的肠道微生物。对粪便进行研究,发现在精神分裂症患者和健康人群的肠道菌群的组成上存在明显的差异,可以准确地对精神分裂症患者进行患病风险评估、早期诊断。本发明基于对精神分裂症患者和健康人群肠道菌群的比较和分析,得到多种相关的肠道微生物,结合高质量的精神分裂症人群和健康人群mOTU作为训练集,能够准确地对精神分裂症患者进行患病风险评估、早期诊断。该方法与目前常用的诊断方法相比,具有方便、快捷的特点。Stool is a metabolite of the human body. It not only contains metabolites of the human body, but also includes intestinal microbes that are closely related to changes in our body's metabolism and immunity and brain function. Research on feces found that there are obvious differences in the composition of the intestinal flora between schizophrenia patients and healthy people, which can accurately assess the risk of schizophrenia and make early diagnosis. The present invention is based on the comparison and analysis of the intestinal flora of schizophrenia patients and healthy people, and obtains a variety of related intestinal microbes. Combining high-quality mOTU of schizophrenia and healthy people as a training set, it can accurately treat mental Patients with schizophrenia undergo risk assessment and early diagnosis. Compared with the current commonly used diagnostic methods, this method has the characteristics of convenience and speed.
本发明提出的精神分裂症相关的生物标记物对早期诊断是有价值的。第一,本发明的标记物具有较高的特异性和灵敏性。第二,粪便的分析保证准确性、安全性、可负担性和患者依从性。并且粪便的样本是可运输的。基于聚合酶链反应(PCR)的试验舒适且无创,所以人们会更容易参与给定的筛选程序。第三,本发明的标记物还可以用作于对精神分裂症患者进行治疗监测的工具以检测对治疗的响应。由于丰度度量和人群选择的原因,本发明中11种 标记物的组合特别适用于基于中国西北部人群粪便mOTU方法度量丰度的情况。The schizophrenia-related biomarkers proposed by the present invention are valuable for early diagnosis. First, the marker of the present invention has high specificity and sensitivity. Second, the analysis of stool ensures accuracy, safety, affordability and patient compliance. And the stool sample is transportable. Polymerase chain reaction (PCR)-based tests are comfortable and non-invasive, so it is easier for people to participate in a given screening procedure. Third, the marker of the present invention can also be used as a tool for treatment monitoring of patients with schizophrenia to detect the response to treatment. Due to the reasons of abundance measurement and population selection, the combination of 11 markers in the present invention is particularly suitable for the measurement of abundance based on the mOTU method of fecal mOTU of people in northwestern China.
附图说明Description of the drawings
图1为根据本发明一个实施例物种属水平上精神分裂症患者和健康对照β多样性的情况。图示表明,精神分裂症患者和健康对照存在显著差异。Figure 1 shows the β diversity of patients with schizophrenia and healthy controls at the species level according to an embodiment of the present invention. The graphic shows that there is a significant difference between schizophrenic patients and healthy controls.
图2为根据本发明的一个实施例随机森林分类器中5次10折交叉验证的错误率分布情况。Fig. 2 shows the error rate distribution of five 10-fold cross-validation in a random forest classifier according to an embodiment of the present invention.
图3为根据本发明的一个实施例基于随机森林模型(11个肠道标志物),由精神分裂症患者和健康对照组成的训练集的接收者操作特征(Receiver Operating Characteristic,ROC)曲线和曲线下面积(Area under Curve,AUC)。Fig. 3 is a receiver operating characteristic (ROC) curve and curve of a training set composed of schizophrenia patients and healthy controls based on a random forest model (11 intestinal markers) according to an embodiment of the present invention Area under Curve (AUC).
图4为根据本发明的一个实施例基于随机森林模型(11个肠道标志物),由精神分裂症患者和健康对照组成的验证集的ROC曲线和AUC。Figure 4 shows the ROC curve and AUC of a validation set composed of schizophrenia patients and healthy controls based on a random forest model (11 gut markers) according to an embodiment of the present invention.
具体实施方式Detailed ways
本发明所用术语具有相关领域普通技术人员通常理解的含义。然而,为了更好地理解本发明,对一些定义和相关术语的解释如下:The terms used in the present invention have meanings commonly understood by those of ordinary skill in the relevant art. However, in order to better understand the present invention, some definitions and related terms are explained as follows:
“精神分裂症”,是一组病因未明的重性精神病,多在青壮年缓慢或亚急性起病,临床上往往表现为症状各异的综合征,涉及感知觉、思维、情感和行为等多方面的障碍以及精神活动的不协调。"Schizophrenia" is a group of severe mental illnesses of unknown etiology, mostly in young adults with slow or subacute onset, clinically often manifested as syndromes with different symptoms, involving perception, thinking, emotion, and behavior. Obstacles and incoordination of mental activities.
“生物标志物”,也称为“生物学标志物”,是指个体的生物状态的可测量指标。这样的生物标记物可以是在个体中的任何物质,只要它们与被检个体的特定生物状态(例如,疾病)有关系,例如,核酸标志物(也可以称为基因标志物,例如DNA),蛋白质标志物,细胞因子标记物,趋化因子标记物,碳水化合物标志物,抗原标志物,抗体标志物,物种标志物(种/属的标记)和功能标志物(KO/OG标记)等。其中,核酸标志物的含义并不局限于现有可以表达为具有生物活性的蛋白质的基因,还包括任何核酸片段,可以为DNA,也可以为RNA,可以是经过修饰的DNA或者RNA,也可以是未经修饰的DNA或者RNA,以及由它们组成的集合。在本文中核酸标志物有时也可以称为特征片段。在本发明中,生物标志物也可以用“肠道标志物”来表示,因为所发现的与精神分裂症相关的生物标志物均存在于受试者的肠道内。生物标记物经过测量和评估,经常用以检查正常生物过程,致病过程,或治疗干预药理响应,而且在许多科学领域都是有用的。"Biomarker", also known as "biological marker", refers to a measurable indicator of the biological state of an individual. Such biomarkers can be any substance in the individual, as long as they are related to the specific biological state (e.g., disease) of the subject, for example, nucleic acid markers (also called genetic markers, such as DNA), Protein markers, cytokine markers, chemokine markers, carbohydrate markers, antigen markers, antibody markers, species markers (species/genus markers) and functional markers (KO/OG markers), etc. Among them, the meaning of nucleic acid markers is not limited to existing genes that can be expressed as biologically active proteins, but also includes any nucleic acid fragments, which can be DNA, RNA, modified DNA or RNA, or It is unmodified DNA or RNA, and a collection of them. In this context, nucleic acid markers can sometimes be referred to as characteristic fragments. In the present invention, biomarkers can also be expressed as "intestinal markers", because the found biomarkers related to schizophrenia are all present in the intestine of the subject. Biomarkers are measured and evaluated, often used to check normal biological processes, pathogenic processes, or pharmacological responses to therapeutic interventions, and are useful in many scientific fields.
所述的生物标志物,可以运用高通量测序,批量分析健康人群和精神分裂症患者的粪便样本。基于高通量测序数据,对健康人群与精神分裂症患者群进行比对,从而确定与精神分裂症患者群相关的特异性核酸序列。简言之,其步骤如下:The biomarkers can be used for high-throughput sequencing to batch analyze stool samples of healthy people and patients with schizophrenia. Based on the high-throughput sequencing data, the healthy population is compared with the schizophrenia patient group to determine the specific nucleic acid sequence related to the schizophrenia patient group. In short, the steps are as follows:
样品的收集与处理:收集健康人群与精神分裂症患者群的粪便样本,使用试剂盒进行DNA提取,得到核酸样本;Sample collection and processing: collect stool samples from healthy people and schizophrenia patients, and use the kit for DNA extraction to obtain nucleic acid samples;
文库构建和测序:DNA文库构建和测序是利用高通量测序进行,以便得到粪便样品中所包含肠道微生物的核酸序列;Library construction and sequencing: DNA library construction and sequencing are performed by high-throughput sequencing to obtain the nucleic acid sequence of gut microbes contained in stool samples;
通过生物信息学的分析方法,确定与精神分裂症患者相关的特异性肠道微生物核酸序列。 首先,将测序序列(reads)与参照基因集(也称为参考基因集,可以为新构建的基因集或任何已知序列的数据库,例如,采用已知的人肠道微生物群落非冗余基因集)进行比对。接下来,基于比对结果,分别确定来自健康人群和精神分裂症患者群粪便样品的核酸样本中各基因的相对丰度。通过将测序序列与参照基因集进行比对,可以将测序序列与参照基因集中的基因建立对应关系,从而针对核酸样本中的特定基因,与其相对应的测序序列的数目可以有效地反映该基因的相对丰度。由此,可以通过比对结果,按照常规的统计分析,确定在核酸样本中基因的相对丰度。最后,在确定核酸样本中各基因的相对丰度后,对来自健康人群和精神分裂症患者群粪便的核酸样本中各基因的相对丰度进行统计检验,由此,可以判断在健康人群和精神分裂症患者人群中是否存在相对丰度有显著差异的基因,如果存在基因是显著差异的,则该基因被当作是异常状态的生物标志物,即核酸标志物。Through bioinformatics analysis methods, the nucleic acid sequences of specific gut microbes related to schizophrenia patients are determined. First, the reads are combined with a reference gene set (also called a reference gene set, which can be a newly constructed gene set or any database of known sequences, for example, using known non-redundant genes of the human gut microbial community Set) for comparison. Next, based on the comparison results, the relative abundance of each gene in the nucleic acid samples from the stool samples of healthy people and schizophrenia patients are determined. By comparing the sequencing sequence with the reference gene set, the corresponding relationship between the sequencing sequence and the genes in the reference gene set can be established, so that for a specific gene in a nucleic acid sample, the number of corresponding sequencing sequences can effectively reflect the gene Relative abundance. Therefore, the relative abundance of genes in the nucleic acid sample can be determined through the comparison results and conventional statistical analysis. Finally, after the relative abundance of each gene in the nucleic acid sample is determined, the relative abundance of each gene in the nucleic acid samples from the feces of healthy people and schizophrenic patients is statistically tested. From this, it can be judged in healthy people and mental health. Whether there is a gene with a significant difference in relative abundance in the population of schizophrenia patients, if the gene is significantly different, the gene is regarded as a biomarker of an abnormal state, that is, a nucleic acid marker.
另外,对于已知或新构建的参照基因集,其通常包含基因物种信息和功能注释,由此,在确定基因相对丰度的基础上,可以进一步通过将基因的物种信息和功能注释进行分类,从而确定肠道菌群中各微生物的物种相对丰度和功能相对丰度,也就可以进一步确定异常状态的物种标志物和功能标志物。简言之,确定物种标志物和功能标志物的方法进一步包括:将健康人群和精神分裂症患者群的测序序列与参照基因集进行比对;基于比对结果,分别确定健康人群和精神分裂症病患者群的核酸样本中各基因的物种相对丰度和功能相对丰度;对来自健康人群和精神分裂症病患者群的核酸样本中各基因的物种相对丰度和功能相对丰度进行统计学检验;以及分别确定在健康人群和精神分裂症病患者群的核酸样本之间相对丰度存在显著差异的物种标志物和功能标志物。根据本发明的实施例,可以采用对来自相同物种的基因的相对丰度和具有相同功能注释的基因的相对丰度进行统计检验,例如加和、取平均值、中位数值等,来确定功能相对丰度和物种相对丰度。In addition, for a known or newly constructed reference gene set, it usually contains gene species information and functional annotations. Therefore, on the basis of determining the relative abundance of genes, the gene species information and functional annotations can be further classified, In order to determine the relative abundance of the species and the relative abundance of the functions of the microorganisms in the intestinal flora, the species markers and functional markers of the abnormal state can be further determined. In short, the method for determining species markers and functional markers further includes: comparing the sequencing sequences of healthy people and schizophrenia patients with a reference gene set; based on the comparison results, determining healthy people and schizophrenia respectively The relative abundance of species and relative abundance of each gene in nucleic acid samples of patients with diseases; the relative abundance of species and relative abundance of each gene in nucleic acid samples from healthy people and schizophrenia patients Test; and respectively determine species markers and functional markers that have significant differences in the relative abundance of nucleic acid samples between healthy people and schizophrenia patients. According to the embodiment of the present invention, the relative abundance of genes from the same species and the relative abundance of genes with the same functional annotation can be used to perform statistical tests, such as summation, average value, median value, etc., to determine the function Relative abundance and relative abundance of species.
若未特别指明,实施例中所采用的技术手段为本领域技术人员所熟知的常规手段,可以参照《分子克隆实验指南》第三版或者相关产品进行,所采用的试剂和产品也均为可商业获得的。未详细描述的各种过程和方法是本领域中公知的常规方法,所用试剂的来源、商品名以及有必要列出其组成成分者,均在首次出现时标明,其后所用相同试剂如无特殊说明,均与首次标明的内容相同。Unless otherwise specified, the technical means used in the examples are conventional means well known to those skilled in the art, and can be carried out with reference to the third edition of the "Molecular Cloning Experiment Guide" or related products. The reagents and products used are also available. Commercially acquired. The various processes and methods that are not described in detail are conventional methods well known in the art. The source of the reagents used, the trade name, and those whose components are necessary to be listed are all indicated when they first appear, and the same reagents used thereafter are not special The description is the same as the content indicated for the first time.
本发明采用宏基因组关联分析(Metagenome-Wide Association Study,MWAS)的分析方法,经测序分析粪便样本的菌群组成,功能差异;用随机森林判别模型判别精神分裂症群体和非精神分裂症群体,获得患病概率,用于精神分裂症的患病风险评估、诊断、早期诊断或者寻找潜在药物靶点。The present invention adopts the analysis method of Metagenome-Wide Association Study (MWAS) to analyze the bacterial composition and functional difference of stool samples by sequencing; the random forest discriminant model is used to discriminate the schizophrenic group and the non-schizophrenic group, Obtain the probability of illness, which can be used for risk assessment, diagnosis, early diagnosis of schizophrenia or to find potential drug targets.
根据本发明,术语“个体”指动物,特别是哺乳动物,如灵长类动物,最好是人。According to the present invention, the term "individual" refers to animals, especially mammals, such as primates, and preferably humans.
根据本发明,术语如“一”、“一个”和“这”不仅指单数的个体,而是包括可以用来说明特定实施方式的通常的一类。According to the present invention, terms such as "a", "an" and "this" not only refer to a singular individual, but include the general category that can be used to describe a particular embodiment.
在本发明中,所述的测序(二代测序)和MWAS具有本领域所公知,本领域技术人员可以根据具体情况进行调整。根据本发明的实施例,可以依据文献(Jun Wang,and Huijue Jia.Metagenome-wide association studies:fine-mining the microbiome.Nature Reviews Microbiology 14.8(2016):508-522.)中记载的方法进行。In the present invention, the sequencing (second-generation sequencing) and MWAS are well known in the art, and those skilled in the art can make adjustments according to specific conditions. According to the embodiment of the present invention, it can be performed according to the method described in the literature (Jun Wang, and Huijue Jia. Metagenome-wide association studies: fine-mining the microbiome. Nature Reviews Microbiology 14.8 (2016): 508-522.).
根据本发明,术语“mOTU”是指操作分类单元(metagenomics Operational Taxonomic Units)(Sunagawa S,Mende D R,Zeller G,et al.Metagenomic species profiling using universal phylogenetic marker genes[J].Nature methods,2013,10(12):1196-1199.),是在系统发生学研究或群体遗传学研究中,为了便于进行分析,人为给某一个分类单元(品系,种,属,分组等)设置的同一标志。通常按照相似性阈值将序列划分为不同的mOTU,每一个mOTU通常被视为一个微生物物种。According to the present invention, the term "mOTU" refers to the operational classification unit (metagenomics Operational Taxonomic Units) (Sunagawa S, Mende D R, Zeller G, et al. Metagenomic species profiling using universal phylogenetic marker genes [J]. Nature, methods, 2013 10(12):1196-1199.), in phylogenetic research or population genetics research, in order to facilitate analysis, artificially set the same mark for a certain taxa (line, species, genus, group, etc.). The sequence is usually divided into different mOTUs according to the similarity threshold, and each mOTU is usually regarded as a microbial species.
在本发明中,随机森林模型和ROC曲线的使用方法为本领域所公知,本领域技术人员可以根据具体情况进行参数设置和调整。根据本发明的实施例,可以根据文献(Drogan D,Dunn WB,Lin W,Buijsse B,Schulze MB,Langenberg C,Brown M,Floegel a.,Dietrich S,Rolandsson O,Wedge DC,Goodacre R,Forouhi NG,Sharp SJ,Spranger J,Wareham NJ,Boeing H:Untargeted Metabolic Profiling Identifies Altered Serum Metabolites of Type 2-Diabetes Mellitus in a Prospective,Nested Case Control Study.Clin Chem 2015,61:487-497.;Mihalik SJ,Michaliszyn SF,de las Heras J,Bacha F,Lee S,Chace DH,DeJesus VR,Vockley J,Arslanian SA:Metabolomic profiling of fatty acid and amino acid metabolism in youth with obesity and type 2 diabetes:evidence for enhanced mitochondrial oxidation.Diabetes Care 2012,35:605-611.,通过引用全文并入此处)中记载的方法进行。In the present invention, the method of using the random forest model and the ROC curve is well known in the art, and those skilled in the art can set and adjust the parameters according to specific conditions. According to the embodiment of the present invention, it can be based on the literature (Drogan D, Dunn WB, Lin W, Buijsse B, Schulze MB, Langenberg C, Brown M, Floegel a., Dietrich S, Rolandsson O, Wedge DC, Goodacre R, Forouhi NG ,Sharp SJ,Spranger J,Wareham NJ,Boeing H:Untargeted Metabolic Profiling Identifies Altered Serum Metabolites of Type 2-Diabetes Melitus in a Prospective, Nested Case Control Study.Clin 487-Study, Milik-Study. SF, de las Heras J, Bacha F, Lee S, Chace DH, DeJesus VR, Vockley J, Arslanian SA: Metabolomic profiling of fatty acid and amino acid metabolism in youth with diabetes for disease and type 2 diabetes: Care 2012, 35:605-611., the full text is incorporated herein by reference).
在本发明中,构建了精神分裂症受试者和非精神分裂症受试者的生物标志物的训练集,并以此为基准,对待测样本的生物标志物含量值进行评估。In the present invention, a training set of biomarkers for schizophrenia subjects and non-schizophrenia subjects is constructed, and based on this, the biomarker content value of the sample to be tested is evaluated.
本领域技术人员知晓,当进一步扩大样本量时,利用本领域公知的样本检测和计算方法,可以得出每种生物标志物在样本中的正常含量值区间(绝对数值)。可以将检测得到的生物标志物含量的绝对值与正常含量值进行比较,任选地,还可以结合统计学方法,以得出精神分裂症患病风险评价、诊断以及用于监控精神分裂症患者的治疗效果的效率等。Those skilled in the art know that when the sample size is further expanded, the normal content value range (absolute value) of each biomarker in the sample can be obtained by using sample detection and calculation methods known in the art. The absolute value of the detected biomarker content can be compared with the normal content value. Optionally, statistical methods can be combined to obtain risk assessment and diagnosis of schizophrenia, and for monitoring patients with schizophrenia The efficiency of the treatment effect and so on.
不希望受任何理论的限制,发明人指出这些生物标志物是存在于人体中的肠道菌群。通过本发明所述的方法对受试者肠道菌群进行关联分析,得到精神分裂症群体的所述生物标志物在菌群检测中表现出一定的含量范围值。Without wishing to be bound by any theory, the inventor pointed out that these biomarkers are the intestinal flora present in the human body. Through the method of the present invention, the intestinal flora of the subject is associated with the analysis, and it is obtained that the biomarker of the schizophrenia population shows a certain content range value in the detection of the flora.
下面结合具体的实施例对本发明做进一步的详细说明,所述是对本发明的解释而不是限定。The present invention will be further described in detail below in conjunction with specific embodiments, which are to explain rather than limit the present invention.
实施例1Example 1
1.1样本收集1.1 Sample collection
参照文献A metagenome-wide association study of gut microbiota in type 2 diabetes(Qin J et al.Nature.2012,490,55-60)记载的方法,采集粪便样品后冷冻运输并迅速转移到-80℃保存,进行DNA提取,得到提取的DNA样本。本发明的精神分裂症和非精神分裂症受试者的粪便样本来自中国。共计171例,其中健康样本81例和精神分裂症样本90例。Refer to the method described in the literature A metagenome-wide association study of gut microbiota in type 2 diabetes (Qin J et al. Nature.2012,490,55-60), the stool samples are collected, frozen, transported and quickly transferred to -80°C for storage, Perform DNA extraction to obtain extracted DNA samples. The stool samples of the schizophrenic and non-schizophrenic subjects of the present invention are from China. A total of 171 cases, including 81 healthy samples and 90 schizophrenia samples.
1.2宏基因组测序与组装1.2 Metagenomic sequencing and assembly
利用所提取的DNA样本构建测序文库,在Illumina HiSeq2000测序平台上进行双向(Paired-end)宏基因组测序(插入片段350bp,读长100bp)。对测序产生的数据进行过滤(quality-controlled),去除adapter污染序列、低质量序列和宿主基因组污染序列,得到高质量的测序片段(reads)。。The extracted DNA samples were used to construct a sequencing library, and the paired-end metagenomic sequencing (insert 350bp, read length 100bp) was performed on the Illumina HiSeq2000 sequencing platform. The data generated by sequencing is filtered (quality-controlled) to remove adapter-contaminated sequences, low-quality sequences, and host genome-contaminated sequences to obtain high-quality reads. .
1.3保守的单拷贝基因比对与丰度计算1.3 Conservative single-copy gene alignment and abundance calculation
将上述“1.2宏基因组测序与筛选”的高质量测序片段(reads)输入到软件mOTU(http://www.bork.embl.de/software/mOTU/download.html)即可计算出物种的相对丰度。参照文献Metagenomic species profiling using universal phylogenetic marker genes(Sunagawa S et al.Nature methods.2013,10(12),1196-9)记载的方法。其丰度计算步骤如下:1)将高质量测序片段比对到参考的单拷贝基因上;2)根据比对结果统计插入片段的数量;3)将插入片段的数量对单拷贝基因的长度进行标准化(按平均基因长度进行标准化,并按照向下取整的方式得到对应mOTU的丰度)得到对应的丰度。Input the high-quality reads of the above-mentioned "1.2 Metagenomic Sequencing and Screening" into the software mOTU (http://www.bork.embl.de/software/mOTU/download.html) to calculate the relative species Abundance. Refer to the method described in the literature Metagenomic species profiling using universal phylogenetic marker genes (Sunagawa S et al. Nature methods. 2013, 10(12), 1196-9). The abundance calculation steps are as follows: 1) Compare the high-quality sequencing fragments to the reference single-copy gene; 2) Count the number of inserts according to the comparison results; 3) Compare the number of inserts to the length of the single-copy gene Standardization (standardize according to the average gene length, and get the abundance of the corresponding mOTU by rounding down) to get the corresponding abundance.
1.4利用随机森林(ROC/AUC)筛选精神分裂症发生发展的潜在生物标志物1.4 Use random forest (ROC/AUC) to screen potential biomarkers for the development of schizophrenia
为进一步筛选潜在疾病肠道生物标志物,本实施例构建了精神分裂症受试者和非精神分裂症受试者的生物标志物的训练集,并以此为基准,对待测样本的生物标志物含量值进行评估。其中,在本发明中,所述训练集和所述验证集具有本领域所公知的含义。在本发明的实施方案中,训练集是指包含一定样本数的精神分裂症受试者和非精神分裂症受试者待测样本中的各生物标志物的含量的数据集合。验证集是用来测试训练集性能的独立数据集合。其中,非精神分裂症受试者为精神状态良好的受试者,受试者可以为人或者模型动物,在本实施例中是以人为受试者进行实验的。In order to further screen the intestinal biomarkers of potential diseases, this example constructs a training set of biomarkers for schizophrenia subjects and non-schizophrenia subjects, and uses this as a benchmark to determine the biomarkers of the sample to be tested. The material content value is evaluated. Wherein, in the present invention, the training set and the verification set have meanings known in the art. In the embodiment of the present invention, the training set refers to a data set containing the content of each biomarker in the sample to be tested for a certain number of schizophrenic subjects and non-schizophrenic subjects. The validation set is an independent data set used to test the performance of the training set. Among them, non-schizophrenic subjects are subjects with good mental states, and the subjects may be humans or model animals. In this embodiment, humans are used as subjects for experiments.
具体包括如下步骤:Specifically include the following steps:
本发明从171个样品(90个精神分裂症病人和81个健康人)中,随机地选取80个精神分裂症病人和71个健康人作为训练集(表1-1、1-2),其余样品作为验证集(10个精神分裂症病人和10个健康人)。The present invention randomly selects 80 schizophrenia patients and 71 healthy people from 171 samples (90 schizophrenia patients and 81 healthy people) as the training set (Table 1-1, 1-2), and the rest The sample is used as a validation set (10 schizophrenic patients and 10 healthy people).
1.4.1利用训练集数据筛选得到的生物标志物1.4.1 Biomarkers obtained by screening using training set data
首先,按照1.3描述的方法计算训练集中每个样本中mOTU的相对丰度。然后将训练集 的mOTU输入随机森林(randomForest 4.6-12 in R 3.2.5,RF)分类器。对分类器进行5次10折交叉验证,利用RF模型筛选的mOTU相对丰度对每一个体计算其精神分裂症患病风险,绘制ROC曲线,并计算出AUC作为判别模型效能评价参数。选取标志物组合数<30,且判别效能最佳的组合为本发明组合。在模型中输出每个mOTU的重要性指数,重要性指数越高,代表该标志物用来判别精神分裂症和非精神分裂症的重要性越高。First, calculate the relative abundance of mOTU in each sample in the training set according to the method described in 1.3. Then input the mOTU of the training set into the random forest (RandomForest 4.6-12 in R 3.2.5, RF) classifier. The classifier was cross-validated 5 times, and the relative abundance of mOTU screened by the RF model was used to calculate the risk of schizophrenia for each individual, draw the ROC curve, and calculate the AUC as the performance evaluation parameter of the discriminant model. Select the combination of markers <30 and the combination with the best discrimination performance is the combination of the invention. Output the importance index of each mOTU in the model. The higher the importance index, the higher the importance of the marker used to distinguish schizophrenia and non-schizophrenia.
最后,本发明所得RF分类器包含了11个代谢物(即11个生物标记物),这11个生物标记物的相对丰度分别如表1-1、1-2所示,其详细信息如表2所示。表3示出了11种生物标记物结合来预测训练集的患病概率,其中患病概率≥0.5确认个体具有患精神分裂的风险或者患有精神分裂症。Finally, the RF classifier obtained in the present invention contains 11 metabolites (ie 11 biomarkers). The relative abundances of these 11 biomarkers are shown in Tables 1-1 and 1-2, respectively. The detailed information is shown in Table 2 shows. Table 3 shows the combination of 11 biomarkers to predict the probability of illness in the training set, where the probability of illness ≥ 0.5 confirms that the individual is at risk of schizophrenia or has schizophrenia.
图2示出了随机森林分类器中5次10折交叉验证的错误率分布情况。该模型用训练集样品在经MWAS流程处理得到的满足目标的mOTU相对丰度进行训练,粗黑色实曲线代表5次试验(细黑色曲线代表5次试验)的平均值,黑色竖线代表所选最佳组合中mOTU数目。Figure 2 shows the error rate distribution of the 5-fold 10-fold cross-validation in the random forest classifier. The model is trained with the training set samples on the relative abundance of mOTU that meets the target obtained through the MWAS process. The thick black solid curve represents the average value of 5 trials (the thin black curve represents 5 trials), and the black vertical line represents the selected The number of mOTUs in the best combination.
图3示出了基于随机森林模型(11个生物标志物)由精神分裂症患者和健康对照组成的训练集的ROC曲线和AUC,其中特异性表征的是对于不患病判对的概率,敏感性指的是对于患病判对的概率,对训练集样本的判别效能为:AUC=82.1%,95%置信区间CI=75.3-88.89%。结果表明该模型所得代谢物组合可作为区分精神分裂症与非精神分裂症的潜在生物标志物。Figure 3 shows the ROC curve and AUC of a training set composed of schizophrenia patients and healthy controls based on the random forest model (11 biomarkers), where the specific characterization of the probability of correcting the disease is sensitive to Sex refers to the probability of correcting the disease. The discriminating power of the training set samples is: AUC=82.1%, 95% confidence interval CI=75.3-88.89%. The results show that the metabolite combination obtained from this model can be used as a potential biomarker to distinguish schizophrenia from non-schizophrenia.
1.4.2利用验证集数据验证筛选得到的生物标志物1.4.2 Use validation set data to verify the biomarkers obtained by screening
本发明随即使用独立人群对该模型进行验证,患病概率≥0.5预测个体具有精神分裂症患疾病风险或者患有精神分裂症。首先,按照1.3描述的方法计算训练集中每个样本中各生物标志物的相对丰度。然后按照1.4.1的方法利用随机森林模型对验证集数据进行验证。The present invention then uses an independent population to verify the model, and the disease probability is greater than or equal to 0.5 to predict that the individual is at risk of schizophrenia or suffers from schizophrenia. First, calculate the relative abundance of each biomarker in each sample in the training set according to the method described in 1.3. Then use the random forest model to verify the validation set data according to the method of 1.4.1.
基于该模型:Based on the model:
11个生物标记物在验证集中的相对丰度如表4-1、4-2所示。表5示出了基于11个生物标志物预测验证集的患病概率。The relative abundance of the 11 biomarkers in the validation set is shown in Table 4-1 and 4-2. Table 5 shows the probability of disease based on the 11 biomarker prediction validation set.
图4示出了基于随机森林模型(11个生物标志物)由精神分裂症患者和健康对照组成的独立验证集的ROC曲线和AUC,模型的判别AUC=77%(95%CI=55.71%-98.29%)。Figure 4 shows the ROC curve and AUC of an independent validation set composed of schizophrenia patients and healthy controls based on the random forest model (11 biomarkers). The model's discrimination AUC=77% (95%CI=55.71%- 98.29%).
在3.2.5版本R中使用“randomForest 4.6-12 package”进行随机森林模型分类和回归。输入包括训练集数据(即训练样本中选定的mOTU标记物的相对丰度,见表1-1、1-2),样本疾病状态(训练样本的样本疾病状态为矢量,‘1’代表精神分裂症,‘0’代表健康对照),以及一个验证集(验证集中所选mOTU标记物的相对丰度,见表4-1、4-2)。然后,发明人利用R软件中随机森林包的随机森林函数建立分类和预测函数对验证集数据进行预测,输出即为预测结果(患病概率);阈值为0.5,如果疾病的概率≥0.5,则认为有患精神分裂症的风险或者患有精神分裂症。In version R 3.2.5, use "randomForest 4.6-12 package" for random forest model classification and regression. The input includes training set data (that is, the relative abundance of the selected mOTU markers in the training sample, see Tables 1-1 and 1-2), sample disease status (the sample disease status of the training sample is a vector, and '1' represents the spirit Schizophrenia, '0' stands for healthy control), and a validation set (the relative abundance of the selected mOTU markers in the validation set, see Table 4-1, 4-2). Then, the inventor uses the random forest function of the random forest package in the R software to establish classification and prediction functions to predict the verification set data, and the output is the prediction result (probability of disease); the threshold is 0.5, if the probability of disease is ≥0.5, then Think that you are at risk of schizophrenia or suffer from schizophrenia.
表1-1随机森林模型训练集肠道标志物(mOTU)相对丰度数据Table 1-1 Relative abundance data of intestinal markers (mOTU) in the training set of random forest model
Figure PCTCN2019094923-appb-000001
Figure PCTCN2019094923-appb-000001
Figure PCTCN2019094923-appb-000002
Figure PCTCN2019094923-appb-000002
Figure PCTCN2019094923-appb-000003
Figure PCTCN2019094923-appb-000003
Figure PCTCN2019094923-appb-000004
Figure PCTCN2019094923-appb-000004
*SZ:精神分裂患者;H:健康对照*SZ: patients with schizophrenia; H: healthy controls
表1-2随机森林模型训练集肠道标志物(mOTU)相对丰度数据Table 1-2 Relative abundance data of intestinal markers (mOTU) in the training set of random forest model
Figure PCTCN2019094923-appb-000005
Figure PCTCN2019094923-appb-000005
Figure PCTCN2019094923-appb-000006
Figure PCTCN2019094923-appb-000006
Figure PCTCN2019094923-appb-000007
Figure PCTCN2019094923-appb-000007
Figure PCTCN2019094923-appb-000008
Figure PCTCN2019094923-appb-000008
表2 11种生物标志物详细信息Table 2 Detailed information of 11 biomarkers
Figure PCTCN2019094923-appb-000009
Figure PCTCN2019094923-appb-000009
Figure PCTCN2019094923-appb-000010
Figure PCTCN2019094923-appb-000010
#验证集AUC,表示在训练集数据得到模型下,对验证集数据的判别程度。#Validation set AUC indicates the degree of discrimination of the validation set data under the model obtained from the training set data.
表3基于11种标志物组合训练集患病概率Table 3 Prevalence probability based on 11 kinds of marker combination training set
样本编号Sample number 患病概率Probability of illness  To 样本编号Sample number 患病概率Probability of illness
H_1H_1 0.9189189190.918918919  To SZ_1SZ_1 0.8700564970.870056497
H_10H_10 0.0423280420.042328042  To SZ_10SZ_10 0.7103825140.710382514
H_11H_11 0.2596685080.259668508  To SZ_11SZ_11 0.6157894740.615789474
H_12H_12 0.0491803280.049180328  To SZ_12SZ_12 0.560439560.56043956
H_13H_13 0.1666666670.166666667  To SZ_13SZ_13 0.7837837840.783783784
H_14H_14 0.0994475140.099447514  To SZ_14SZ_14 0.9447852760.944785276
H_15H_15 0.129729730.12972973  To SZ_15SZ_15 0.9574468090.957446809
H_16H_16 0.0685714290.068571429  To SZ_16SZ_16 0.8882681560.888268156
H_17H_17 0.310.31  To SZ_17SZ_17 0.4285714290.428571429
H_18H_18 0.0691489360.069148936  To SZ_18SZ_18 0.8235294120.823529412
H_19H_19 0.3743016760.374301676  To SZ_19SZ_19 0.6141304350.614130435
H_2H_2 0.8913043480.891304348  To SZ_2SZ_2 0.6257668710.625766871
H_20H_20 0.2240437160.224043716  To SZ_20SZ_20 0.5082872930.508287293
H_21H_21 0.1675392670.167539267  To SZ_21SZ_21 0.6321243520.632124352
H_22H_22 0.3298969070.329896907  To SZ_22SZ_22 0.7350.735
H_23H_23 0.3877551020.387755102  To SZ_23SZ_23 0.7964071860.796407186
H_24H_24 0.5291005290.529100529  To SZ_24SZ_24 0.8988764040.898876404
H_25H_25 0.2473118280.247311828  To SZ_25SZ_25 0.4945652170.494565217
H_26H_26 0.1564245810.156424581  To SZ_26SZ_26 0.8241206030.824120603
H_27H_27 0.0769230770.076923077  To SZ_27SZ_27 0.4947368420.494736842
H_28H_28 0.1674641150.167464115  To SZ_28SZ_28 0.3142857140.314285714
H_29H_29 0.3609467460.360946746  To SZ_29SZ_29 0.4658385090.465838509
H_3H_3 0.7286432160.728643216  To SZ_3SZ_3 0.9030612240.903061224
H_30H_30 0.2786885250.278688525  To SZ_30SZ_30 0.9202127660.920212766
H_31H_31 0.2538071070.253807107  To SZ_31SZ_31 0.8272251310.827225131
H_32H_32 0.3756613760.375661376  To SZ_32SZ_32 0.7235294120.723529412
H_33H_33 0.3950.395  To SZ_33SZ_33 0.7011494250.701149425
H_34H_34 0.3446327680.344632768  To SZ_34SZ_34 0.6132596690.613259669
H_35H_35 0.158192090.15819209  To SZ_35SZ_35 0.732558140.73255814
H_36H_36 0.2486187850.248618785  To SZ_36SZ_36 0.1771428570.177142857
H_37H_37 0.2485875710.248587571  To SZ_37SZ_37 0.3241758240.324175824
H_38H_38 0.3913043480.391304348  To SZ_38SZ_38 0.1945945950.194594595
H_39H_39 0.3556701030.355670103  To SZ_39SZ_39 0.7330097090.733009709
H_4H_4 0.4444444440.444444444  To SZ_4SZ_4 0.632530120.63253012
H_40H_40 0.0862944160.086294416  To SZ_40SZ_40 0.9161676650.916167665
H_41H_41 0.4973262030.497326203  To SZ_41SZ_41 0.2774869110.277486911
H_42H_42 0.7472527470.747252747  To SZ_42SZ_42 0.7684210530.768421053
H_43H_43 0.2048780490.204878049  To SZ_43SZ_43 0.9839572190.983957219
H_44H_44 0.3173652690.317365269  To SZ_44SZ_44 0.2203389830.220338983
H_45H_45 0.0597014930.059701493  To SZ_45SZ_45 0.7159763310.715976331
H_46H_46 0.2254335260.225433526  To SZ_46SZ_46 0.8763440860.876344086
H_47H_47 0.5510204080.551020408  To SZ_47SZ_47 0.4810810810.481081081
H_48H_48 0.1428571430.142857143  To SZ_48SZ_48 0.8089171970.808917197
H_49H_49 0.2078651690.207865169  To SZ_49SZ_49 0.254437870.25443787
H_5H_5 0.6943005180.694300518  To SZ_5SZ_5 0.6119402990.611940299
H_50H_50 0.2022471910.202247191  To SZ_50SZ_50 0.4444444440.444444444
H_51H_51 0.1030927840.103092784  To SZ_51SZ_51 0.7944444440.794444444
H_52H_52 0.4917127070.491712707  To SZ_52SZ_52 0.5135135140.513513514
H_53H_53 0.4812834220.481283422  To SZ_53SZ_53 0.802395210.80239521
H_54H_54 0.6666666670.666666667  To SZ_54SZ_54 0.9505494510.950549451
H_55H_55 0.3846153850.384615385  To SZ_55SZ_55 0.910.91
H_56H_56 0.5635359120.563535912  To SZ_56SZ_56 0.9052631580.905263158
H_57H_57 0.7624309390.762430939  To SZ_57SZ_57 0.9792746110.979274611
H_58H_58 0.0454545450.045454545  To SZ_58SZ_58 0.7732558140.773255814
H_59H_59 0.7176470590.717647059  To SZ_59SZ_59 0.7052023120.705202312
H_6H_6 0.9016393440.901639344  To SZ_6SZ_6 0.5947368420.594736842
H_60H_60 0.2512820510.251282051  To SZ_60SZ_60 0.7102272730.710227273
H_61H_61 0.133689840.13368984  To SZ_61SZ_61 0.969696970.96969697
H_62H_62 0.2312138730.231213873  To SZ_62SZ_62 0.7419354840.741935484
H_63H_63 0.1746031750.174603175  To SZ_63SZ_63 0.0561797750.056179775
H_64H_64 0.4787878790.478787879  To SZ_64SZ_64 0.0558659220.055865922
H_65H_65 0.3714285710.371428571  To SZ_65SZ_65 0.3453608250.345360825
H_66H_66 0.4530386740.453038674  To SZ_66SZ_66 0.1695906430.169590643
H_67H_67 0.477611940.47761194  To SZ_67SZ_67 0.9414893620.941489362
H_68H_68 0.5803108810.580310881  To SZ_68SZ_68 0.5574712640.557471264
H_69H_69 0.5961538460.596153846  To SZ_69SZ_69 0.9299065420.929906542
H_7H_7 0.0150753770.015075377  To SZ_7SZ_7 0.4624277460.462427746
H_70H_70 0.0441988950.044198895  To SZ_70SZ_70 0.8029556650.802955665
H_71H_71 0.0410256410.041025641  To SZ_71SZ_71 0.6939890710.693989071
H_8H_8 0.5510204080.551020408  To SZ_72SZ_72 0.5919540230.591954023
H_9H_9 0.2445652170.244565217  To SZ_73SZ_73 0.6666666670.666666667
 To  To  To SZ_74SZ_74 0.9726775960.972677596
 To  To  To SZ_75SZ_75 0.7664974620.766497462
 To  To  To SZ_76SZ_76 0.1692307690.169230769
 To  To  To SZ_77SZ_77 0.6035502960.603550296
 To  To  To SZ_78SZ_78 0.1969696970.196969697
 To  To  To SZ_79SZ_79 0.8097826090.809782609
 To  To  To SZ_8SZ_8 0.8152173910.815217391
 To  To  To SZ_80SZ_80 0.7627118640.762711864
 To  To  To SZ_9SZ_9 0.7206703910.720670391
表4-1随机森林模型验证集肠道标志物(mOTU)相对丰度数据Table 4-1 Random Forest Model Validation Set Intestinal Marker (mOTU) Relative Abundance Data
Figure PCTCN2019094923-appb-000011
Figure PCTCN2019094923-appb-000011
表4-2随机森林模型验证集肠道标志物(mOTU)相对丰度数据Table 4-2 Relative abundance data of intestinal markers (mOTU) in the validation set of random forest model
Figure PCTCN2019094923-appb-000012
Figure PCTCN2019094923-appb-000012
Figure PCTCN2019094923-appb-000013
Figure PCTCN2019094923-appb-000013
表5基于11种标志物组合验证集患病概率Table 5 Prevalence probability based on the validation set of 11 marker combinations
样本编号Sample number 患病概率Probability of illness  To 样本编号Sample number 患病概率Probability of illness
H_72H_72 0.7460.746  To SZ_81SZ_81 0.5620.562
H_73H_73 0.4540.454  To SZ_82SZ_82 0.3780.378
H_74H_74 0.3060.306  To SZ_83SZ_83 0.6340.634
H_75H_75 0.2720.272  To SZ_84SZ_84 0.960.96
H_76H_76 0.3220.322  To SZ_85SZ_85 0.60.6
H_77H_77 0.1140.114  To SZ_86SZ_86 0.30.3
H_78H_78 0.1240.124  To SZ_87SZ_87 0.360.36
H_79H_79 0.1040.104  To SZ_88SZ_88 0.880.88
H_80H_80 0.5060.506  To SZ_89SZ_89 0.8980.898
H_81H_81 0.3940.394  To SZ_90SZ_90 0.2920.292
上结果表明,本发明公开的生物标志物具有较高的准确度和特异性,具有良好的开发为诊断方法的前景,从而为精神分裂症的患病风险评估、诊断、早期诊断,寻找潜在药物靶点提供依据。The above results show that the biomarker disclosed in the present invention has high accuracy and specificity, and has a good prospect of being developed as a diagnostic method, so as to find potential drugs for risk assessment, diagnosis, and early diagnosis of schizophrenia The target provides the basis.
因此,本发明提出以下应用:Therefore, the present invention proposes the following applications:
所述的基于肠道菌群的精神分裂症生物标志物组合作为检测靶点或检测目标在制备检测试剂盒中的应用。The combination of biomarkers for schizophrenia based on intestinal flora is used as a detection target or a detection target in the preparation of a detection kit.
所述的基于肠道菌群的精神分裂症生物标志物组合作为靶点在筛选治疗和/或者预防精神分裂症的药物中的应用。The application of the combination of biomarkers for schizophrenia based on intestinal flora as a target in screening drugs for the treatment and/or prevention of schizophrenia.
所述的生物标志物组合相对丰度的变化为确定候选药物是否有效提供依据。The change in the relative abundance of the biomarker combination provides a basis for determining whether the candidate drug is effective.
尽管上面已经示出和描述了本发明的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本发明的限制,本领域的普通技术人员在本发明的范围内可以对上述实施例进行变化、修改、替换和变型。Although the embodiments of the present invention have been shown and described above, it can be understood that the above-mentioned embodiments are exemplary and should not be construed as limiting the present invention. Those of ordinary skill in the art can comment on the foregoing The embodiment undergoes changes, modifications, substitutions and modifications.

Claims (8)

  1. 一种基于肠道菌群的精神分裂症生物标志物组合,其特征在于,该生物标志物组合用于提供相对丰度信息,其包含选自以下的一种或多种:A biomarker combination for schizophrenia based on intestinal flora, characterized in that the biomarker combination is used to provide relative abundance information, and it contains one or more selected from the following:
    生物标志物1:Lachnospiraceae bacterium 3_1_57FAA_CT1;Biomarker 1: Lachnospiraceae bacterium 3_1_57FAA_CT1;
    生物标志物2:Cronobacter sakazakii;Biomarker 2: Cronobacter sakazakii;
    生物标志物3:Lactobacillus acidophilus;Biomarker 3: Lactobacillus acidophilus;
    生物标志物4:Veillonella parvula;Biomarker 4: Veillonella parvula;
    生物标志物5:Lactococcus lactis;Biomarker 5: Lactococcus lactis;
    生物标志物6:Alkaliphilus oremlandii;Biomarker 6: Alkaliphilus oremlandii;
    生物标志物7:Pseudoflavonifractor capillosus;Biomarker 7: Pseudoflavonifractor capillosus;
    生物标志物8:Streptococcus gallolyticus;Biomarker 8: Streptococcus gallolyticus;
    生物标志物9:Dialister invisus;Biomarker 9: Dialister invisus;
    生物标志物10:Lactobacillus johnsonii;Biomarker 10: Lactobacillus johnsonii;
    生物标志物11:Methanobrevibacter smithii。Biomarker 11: Methanobrevibacter smithii.
  2. 如权利要求1所述的基于肠道菌群的精神分裂症生物标志物组合,其特征在于,所述的生物标志物组合提供的相对丰度信息用于和参考值进行比较。The schizophrenia biomarker combination based on intestinal flora of claim 1, wherein the relative abundance information provided by the biomarker combination is used for comparison with a reference value.
  3. 如权利要求1所述的基于肠道菌群的精神分裂症生物标志物组合,其特征在于,所述的生物标志物1~11的相对丰度信息是基于能够对其进行丰度计算的基因序列来提供的。The biomarker combination for schizophrenia based on the intestinal flora of claim 1, wherein the relative abundance information of the biomarkers 1-11 is based on genes whose abundance can be calculated Sequence to provide.
  4. 权利要求1所述的基于肠道菌群的精神分裂症生物标志物组合作为检测靶点或检测目标在制备检测试剂盒中的应用。Application of the schizophrenia biomarker combination based on intestinal flora of claim 1 as a detection target or a detection target in the preparation of a detection kit.
  5. 权利要求1所述的基于肠道菌群的精神分裂症生物标志物组合作为靶点在筛选治疗和/或者预防精神分裂症的药物中的应用。The application of the schizophrenia biomarker combination based on the intestinal flora of claim 1 as a target in screening drugs for the treatment and/or prevention of schizophrenia.
  6. 如权利要求5所述的应用,其特征在于,所述的生物标志物组合相对丰度的变化为确定候选药物是否有效提供依据。The application according to claim 5, wherein the change in the relative abundance of the biomarker combination provides a basis for determining whether the candidate drug is effective.
  7. 权利要求1所述的基于肠道菌群的精神分裂症生物标志物组合的筛选方法,其特征在于,步骤如下:The method for screening a combination of biomarkers for schizophrenia based on intestinal flora of claim 1, wherein the steps are as follows:
    1)样本收集:采集粪便样品后冷冻运输并迅速转移到-80℃保存,进行DNA提取,得到提取的DNA样本,样本受试者包括精神分裂症病人和健康人;1) Sample collection: After collecting stool samples, they are frozen and transported and quickly transferred to -80°C for storage. DNA extraction is performed to obtain extracted DNA samples. The sample subjects include schizophrenia patients and healthy people;
    2)宏基因组测序与组装2) Metagenomic sequencing and assembly
    3)保守的单拷贝基因比对与丰度计算3) Conservative single-copy gene alignment and abundance calculation
    3)将高质量的测序片段输入到软件mOTU计算出物种的相对丰度:3) Input high-quality sequencing fragments into the software mOTU to calculate the relative abundance of species:
    3.1)将高质量测序片段比对到参考的单拷贝基因上;3.1) Align the high-quality sequencing fragments to the reference single-copy gene;
    3.2)根据比对结果统计插入片段的数量;3.2) Count the number of inserts according to the comparison results;
    3.3)将插入片段的数量对单拷贝基因的长度进行标准化得到对应的丰度。3.3) Normalize the number of inserts to the length of a single copy gene to obtain the corresponding abundance.
    4)从样本集中随机地选取精神分裂症病人和健康人作为训练集,其余样品作为验证集,计算训练集中每个样本中mOTU的相对丰度,然后将训练集的mOTU输入随机森林分类器,对分类器进行5次10折交叉验证,利用RF模型筛选的mOTU相对丰度对每一个体计算其精神分裂症患病风险,绘制ROC曲线,并计算出AUC作为判别模型效能评价参数,选取标志物组合数<30,且判别效能最佳的组合,在模型中输出每个mOTU的重要性指数,重要性指数越高,代表该标志物用来判别精神分裂症和非精神分裂症的重要性越高。4) Randomly select schizophrenic patients and healthy people from the sample set as the training set, and the remaining samples as the validation set. Calculate the relative abundance of mOTU in each sample in the training set, and then input the mOTU of the training set into the random forest classifier. Perform 5 10-fold cross-validation on the classifier, use the relative abundance of mOTU screened by the RF model to calculate the risk of schizophrenia for each individual, draw the ROC curve, and calculate the AUC as the performance evaluation parameter of the discriminant model, and select the marker The number of combination of substances is less than 30, and the combination with the best discrimination performance, the importance index of each mOTU is output in the model. The higher the importance index, it represents the importance of the marker used to distinguish schizophrenia and non-schizophrenia Higher.
  8. 权利要求7所述筛选方法,其特征在于,所述样本集中,样本受试者包括90个精神分裂症病人和81个健康人,验证集中,样本受试者包括10个精神分裂症病人和10个健康人。The screening method according to claim 7, characterized in that the sample is concentrated, the sample subjects include 90 schizophrenia patients and 81 healthy people, and the verification concentration, the sample subjects include 10 schizophrenia patients and 10 A healthy person.
PCT/CN2019/094923 2019-06-06 2019-07-05 Intestinal flora-based schizophrenia biomarker combination, and applications thereof and motu screening method therefor WO2020244017A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910491545.X 2019-06-06
CN201910491545 2019-06-06

Publications (1)

Publication Number Publication Date
WO2020244017A1 true WO2020244017A1 (en) 2020-12-10

Family

ID=67891204

Family Applications (2)

Application Number Title Priority Date Filing Date
PCT/CN2019/094924 WO2020244018A1 (en) 2019-06-06 2019-07-05 Small-scale schizophrenia biomarker combination, application thereof and metaphlan2 screening method therefor
PCT/CN2019/094923 WO2020244017A1 (en) 2019-06-06 2019-07-05 Intestinal flora-based schizophrenia biomarker combination, and applications thereof and motu screening method therefor

Family Applications Before (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/094924 WO2020244018A1 (en) 2019-06-06 2019-07-05 Small-scale schizophrenia biomarker combination, application thereof and metaphlan2 screening method therefor

Country Status (2)

Country Link
CN (1) CN110241205A (en)
WO (2) WO2020244018A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113493849A (en) * 2021-09-09 2021-10-12 北京泱深生物信息技术有限公司 A product for diagnosing growth retardation

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111430027B (en) * 2020-03-18 2023-04-28 浙江大学 Duplex affective disorder biomarker based on intestinal microorganisms and screening application thereof
CN111370116A (en) * 2020-03-18 2020-07-03 浙江大学 Intestinal microbial marker for predicting curative effect of bipolar affective disorder and screening application thereof
CN112011602A (en) * 2020-07-23 2020-12-01 翊康生物科技发展(辽宁)有限公司 Intestinal flora related to immune recovery and application thereof
CN112011605B (en) * 2020-09-15 2023-04-28 石家庄市人民医院(石家庄市第一医院、石家庄市肿瘤医院、河北省重症肌无力医院、石家庄市心血管病医院) Use of microbial flora in disease diagnosis
CN113817853A (en) * 2021-09-24 2021-12-21 西安交通大学 Application of intestinal microorganisms as marker of mild traumatic brain trauma
CN117854720A (en) * 2023-12-06 2024-04-09 广州达安临床检验中心有限公司 Autism risk prediction device and computer equipment based on fungus genus characteristic

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003042654A2 (en) * 2001-11-14 2003-05-22 Psychiatric Genomics, Inc. Multi-parameter high throughput screening assays (mphts)
WO2019049960A1 (en) * 2017-09-07 2019-03-14 国立大学法人千葉大学 Test method for depression symptoms and screening method of therapeutic agent therefor

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016167365A1 (en) * 2015-04-16 2016-10-20 国立研究開発法人国立精神・神経医療研究センター Biomarker for mental disease
CN107480474B (en) * 2017-08-01 2019-03-26 山东师范大学 Classifier modeling evaluation method of calibration and system based on intestinal flora abundance
CN107746874A (en) * 2017-11-06 2018-03-02 张猛 Schizophrenia mark

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003042654A2 (en) * 2001-11-14 2003-05-22 Psychiatric Genomics, Inc. Multi-parameter high throughput screening assays (mphts)
WO2019049960A1 (en) * 2017-09-07 2019-03-14 国立大学法人千葉大学 Test method for depression symptoms and screening method of therapeutic agent therefor

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
MISAKO KANAYAMA, HAYASHIDA MAIKO, HASHIOKA SADAYUKI, MIYAOKA TSUYOSHI, INAGAKI MASATOSHI: "Decreased Clostridium Abundance after Electroconvulsive Therapy in the Gut Microbiota of a Patient with Schizophrenia", CASE REPORTS IN PSYCHIATRY, 4576842, 25 February 2019 (2019-02-25), pages 1 - 5, XP055764595, ISSN: 2090-682X, DOI: 10.1155/2019/4576842 *
PENG ZHENG, BENHUA ZENG, MEILING LIU, JIANJUN CHEN, JUNXI PAN, YU HAN, YIYUN LIU, KE CHENG, CHANJUAN ZHOU, HAIYANG WANG, XINYU ZHO: "The Gut Microbiome from Patients with Schizophrenia Modulates the Glutamate-Glutamine-GABA Cycle and Schizophrenia-Relevant Behaviors in Mice", SCIENCE ADVANCES, vol. 5, no. 2, 6 February 2019 (2019-02-06), pages 1 - 12, XP055764593, ISSN: 2375-2548, DOI: 10.1126/sciadv.aau8317 *
YANG SHEN, XU JINTIAN, LI ZHIYONG, HUANG YICHEN, YUAN YE, WANG JIXIANG, ZHANG MENG, HU SONGNIAN, LIANG YING: "Analysis of Gut Microbiota Diversity and Auxiliary Diagnosis as a Biomarker in Patients with Schizophrenia: A Cross-Sectional Study", SCHIZOPHRENIA RESEARCH, vol. 197, 17 January 2018 (2018-01-17), pages 470 - 477, XP055764597, ISSN: 0920-9964, DOI: 10.1016/j.schres.2018.01.002 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113493849A (en) * 2021-09-09 2021-10-12 北京泱深生物信息技术有限公司 A product for diagnosing growth retardation

Also Published As

Publication number Publication date
CN110241205A (en) 2019-09-17
WO2020244018A1 (en) 2020-12-10

Similar Documents

Publication Publication Date Title
WO2020244017A1 (en) Intestinal flora-based schizophrenia biomarker combination, and applications thereof and motu screening method therefor
CN111430027B (en) Duplex affective disorder biomarker based on intestinal microorganisms and screening application thereof
CN105603066B (en) Intestinal microbial marker of mental disorder and application thereof
CN110904213B (en) Ulcerative colitis biomarker based on intestinal flora and application thereof
CN112119167B (en) Biomarker for depression and application thereof
US20150211053A1 (en) Biomarkers for diabetes and usages thereof
WO2016049932A1 (en) Biomarkers for obesity related diseases
CN107075453B (en) Biomarkers for coronary artery disease
CN111020020A (en) Biomarker combination for schizophrenia, application thereof and metaplan 2 screening method
WO2016049936A1 (en) Biomarkers for rheumatoid arthritis and usage therof
CN110838365A (en) Irritable bowel syndrome related flora marker and kit thereof
CN110396538B (en) Migraine biomarkers and uses thereof
CN113913490A (en) Non-alcoholic fatty liver marker microorganism and application thereof
CN109072306A (en) Isolated nucleic acid and application
CN112384634B (en) Osteoporosis biomarker and application thereof
WO2016049937A1 (en) Biomarkers for rheumatoid arthritis and usage therof
WO2016049930A1 (en) Biomarkers for rheumatoid arthritis and usage therof
CN111020021A (en) Intestinal flora-based small-scale schizophrenia biomarker combination, application thereof and mOTU screening method
WO2021184413A1 (en) Gut microbe-based biomarkers for predicting curative effect on bipolar disorder, and screening and applications thereof
CN112011605B (en) Use of microbial flora in disease diagnosis
CN114657270A (en) Alzheimer disease biomarker based on intestinal flora and application thereof
CN110396537B (en) Asthma biomarker and application thereof
WO2016049927A1 (en) Biomarkers for obesity related diseases
CN112877417A (en) Screening and application of polycystic ovarian syndrome intestinal flora biomarker
CN117385066B (en) Hypertension diagnosis marker and application thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19931637

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19931637

Country of ref document: EP

Kind code of ref document: A1