WO2018049947A1 - Biomarker composition for detection of endometriosis and application thereof - Google Patents

Biomarker composition for detection of endometriosis and application thereof Download PDF

Info

Publication number
WO2018049947A1
WO2018049947A1 PCT/CN2017/096249 CN2017096249W WO2018049947A1 WO 2018049947 A1 WO2018049947 A1 WO 2018049947A1 CN 2017096249 W CN2017096249 W CN 2017096249W WO 2018049947 A1 WO2018049947 A1 WO 2018049947A1
Authority
WO
WIPO (PCT)
Prior art keywords
seq
endometriosis
sample
nucleic acid
biomarker combination
Prior art date
Application number
PCT/CN2017/096249
Other languages
French (fr)
Chinese (zh)
Inventor
贾慧珏
钟焕姿
宋晓蕾
王子榕
陈晨
Original Assignee
深圳华大基因研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳华大基因研究院 filed Critical 深圳华大基因研究院
Priority to CN201780047955.4A priority Critical patent/CN109715828B/en
Publication of WO2018049947A1 publication Critical patent/WO2018049947A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/136Screening for pharmacological compounds

Definitions

  • the present application relates to the field of biomarkers, and in particular to a biomarker combination for endometriosis detection or risk assessment of disease and its use.
  • Endometriosis is a frequently-occurring disease in women of childbearing age, causing a variety of clinical symptoms. In recent years, the incidence rate has increased significantly, reaching 10%-15%, accounting for more than 30% of gynecological surgery. Endometriosis is closely related to infertility, and about 40%-50% of infertile patients have infertility caused by endometriosis. Current research suggests that microbial infection plays a very important role in infertility caused by endometriosis. In addition, many studies have shown that inflammatory factors in the peritoneal fluid of endometriosis, such as tumor necrosis factor (TNF- ⁇ ) and interleukin-6 (IL-6), can interfere with certain aspects of the reproductive process.
  • TNF- ⁇ tumor necrosis factor
  • IL-6 interleukin-6
  • endometriosis is a benign disease, its lesions are extensive, morphologically diverse, extremely aggressive and recurrent, and the destruction of organ structure and function is comparable to malignant tumors, so it is known as "benign cancer.”
  • benign cancer the destruction of organ structure and function is comparable to malignant tumors, so it is known as "benign cancer.”
  • the clinical manifestations of endometriosis are not directly proportional to the extent of the disease, so there is a lack of effective clinical diagnostic methods.
  • CA125 carcinoembryonic antigen 125
  • PP14 placental protein 14
  • IL-6 interleukin-6
  • the object of the present application is to provide a biomarker for the detection of endometriosis and a preparation method and application thereof.
  • One aspect of the present application discloses a biomarker for the detection of endometriosis, the biomarker combination comprising at least one of twenty-four nucleic acids, each of which is Seq ID No. 1
  • the twenty-four nucleic acids of the present application are researched and derived from a nucleic acid sequence associated with endometriosis, wherein each nucleic acid sequence is associated with endometriosis.
  • the endometriosis test or the risk assessment of the disease can be used alone or in combination without considering the accuracy of the judgment or when the requirement is low.
  • not only twenty-four nucleic acids are used together, but also twenty-four nucleic acids are classified according to a specific rule, and are divided into a plurality of marker groups, and each marker group is used together. Endometriosis detection or risk assessment of disease, which will be described in detail in the following preferred technical solutions.
  • the twenty-four nucleic acids of the present application are clustered according to 97% or more similarity, and then the most representative sequence is selected from each taxon (abbreviation OTU) as a seed sequence, wherein Endometriosis has twenty-four seed sequences that are related, ie, constitutes a biomarker combination of the present application; therefore, in the biomarker combination of the present application, twenty-four nucleic acids are not limited to Seq ID No. 1
  • the sequence shown in Seq ID No. 24 may also be a sequence having 97% or more similarity to the sequence shown by Seq ID No. 1 to Seq ID No. 24.
  • the biomarker combination for the detection of endometriosis or the risk assessment of the present application is not directly based on the detection of the combination of biomarkers with or without endometriosis detection.
  • the risk assessment but after detecting the biomarker combination, by analyzing its relative abundance, and taking the relative abundance into the random forest model to judge, the object to be tested is judged according to the probability of the random forest model output. Whether or not there is endometriosis or assessing the risk of endometriosis in the subject to be tested will be described in detail in the following technical solutions.
  • a biomarker combination for endometriosis detection or risk assessment including a first marker panel, a second marker panel, and a third marker panel At least one of the groups; the first marker group consists of fourteen nucleic acids, respectively, Seq ID No. 1, Seq ID No. 2, Seq ID No. 6, Seq ID No. 7, Seq ID No. 12, Seq ID No. 13, Seq ID No. 15, Seq ID No. 17 to Seq ID No. 22, Seq ID No. 24, or Seq ID No. 1, Seq ID No, respectively. .2, Seq ID No. 6, Seq ID No. 7, Seq ID No. 12, Seq ID No. 13, Seq ID No. 15, Seq ID No. 17 to Seq ID No.
  • Seq ID No. 24 The sequence shown has a sequence of 97% or more similarity; the second marker group consists of two nucleic acids, each of which is a sequence of Seq ID No. 1, Seq ID No. 7, or Seq ID No, respectively. 1.
  • the sequence shown by Seq ID No. 7 has a sequence of 97% or more similarity; the third marker group consists of eleven nucleic acids, and the eleven nucleic acids are Seq ID No. 3 to Seq ID No. 5, respectively.
  • the sequence shown by ID No. 8 to Seq ID No. 12, Seq ID No. 14, Seq ID No. 16, and Seq ID No. 23 has a sequence of 97% or more similarity.
  • nucleic acids are reproducibly divided into three marker groups, namely, a first marker group, a second marker group, and a third marker group;
  • three marker groups namely, a first marker group, a second marker group, and a third marker group;
  • the comprehensive judgment of the volunteer group can greatly improve the accuracy of detecting the endometriosis of the biomarker combination of the present application or assessing the risk of the disease.
  • the first marker group is a CL marker group for performing endometriosis detection or risk assessment of a sample from the lower third of the vagina.
  • the second marker group is a CU marker group for performing endometriosis detection or risk assessment of a sample from the vaginal posterior iliac crest.
  • the third marker group is a CV marker group for performing endometriosis detection or risk assessment of a sample from the cervical canal.
  • the twenty-four nucleic acids in the biomarker combination of the present application actually represent 14 microorganisms in the lower third of the vagina, the posterior vaginal canal and the cervical canal; the present application passes under the vagina Twenty-four nucleic acids of 14 microorganisms in 1/3, vaginal posterior fornix and cervical canal were detected, and the relationship between their relative abundance and endometriosis was statistically analyzed to establish a random forest model. In this way, it is judged whether the subject to be tested has endometriosis or is at risk of suffering from endometriosis. Therefore, the three marker groups actually correspond to three sampling sites respectively; the samples from the three sites correspond to the respective marker groups, and are independently analyzed and judged. However, comprehensive judgment based on the results of the three can improve the accuracy of detecting the endometriosis of the biomarker combination of the present application or assessing the risk of the disease.
  • the number of microorganisms is far more than 14 species, and the nucleic acids of 14 microorganisms are far more than the 24 records described in this application;
  • this application screens out 24 nucleic acids of 14 microorganisms according to the random forest model as a biomarker for the detection of endometriosis, providing a new measure for the detection and evaluation of endometriosis. Way.
  • the CL marker group is the marker group of the lower third of the vagina, the lower third of the vagina is abbreviated as CL; the marker of the CU marker group is the marker of the posterior vaginal sample.
  • CU vaginal posterior hernia
  • CV marker group is the marker group of cervical canal sample, and cervical canal is abbreviated as CV.
  • kits for endometriosis detection or risk assessment of a disease comprising a primer pair for detecting a biomarker combination of the present application, the positive of the primer pair
  • the primer was the sequence shown in SEQ ID No. 25, and the reverse primer was the sequence shown in SEQ ID No. 26.
  • biomarker combination of the present application can be present in the kit as a standard reference, and the primer pair is directly used for PCR amplification of the biomarker combination in the sample to be tested.
  • biomarker combination of the present application should be used in the screening of endometriosis drugs or in the preparation of kits or detection tools for endometriosis detection or risk assessment of disease. use.
  • biomarker combination of the present application is itself studied for endometriosis, and can of course be used for the detection or risk assessment of endometriosis; and the biomarker combination of the present application can also be integrated.
  • biomarker combination of the present application can detect endometriosis or assess the risk of endometriosis; of course, it can compare the endometriosis before and after treatment.
  • the disease condition or the risk of the disease changes to determine whether the drug used is effective for the purpose of drug screening.
  • a further aspect of the present application discloses a method for detecting endometriosis, comprising the following steps,
  • the level of each nucleic acid is the relative abundance of each nucleic acid; the reference data set or reference value is the nucleic acid of each of the biomarker combinations derived from the endometriosis population and the non-endometriosis population control Level.
  • the reference data set or reference value in step (2) is at least one of Table 5, Table 6, or Table 7; comparing the level of each nucleic acid with a reference data set or a reference value to obtain a detection result, specifically Including, using multivariate statistical models to calculate the probability of disease, preferably, the multivariate statistical model is a random forest model.
  • the sample to be tested is subjected to sample collection in step (1), including collecting the lower third of the vagina sample, the posterior vaginal sputum sample and the cervical canal sample.
  • each nucleic acid fragment in the biomarker combination of the present application is a nucleic acid fragment associated with endometriosis, and therefore, by analyzing the level of the biomarker combination in the test subject, that is, relatively abundant In degree, it is possible to detect whether the subject to be tested is sick or to determine the risk of the disease.
  • a further aspect of the present application discloses a method for preparing endometriosis detection or disease risk assessment kit or tool by detecting a biomarker for determining endometriosis; wherein the biomarker a biomarker combination of the present application;
  • a method for determining endometriosis by detecting a biomarker includes the following steps,
  • the level of each nucleic acid is the relative abundance of each nucleic acid; the reference data set or reference value is the level of each nucleic acid in the biomarker combination derived from endometriosis patients and non-endometriosis controls. .
  • the reference data set or reference value in step (2) is at least one of Table 5, Table 6, or Table 7; comparing the level of each nucleic acid with a reference data set or a reference value to obtain a detection result, specifically Including, using a multivariate statistical model to calculate the probability of disease, preferably, the multivariate statistical model is a random forest model.
  • a further aspect of the present application discloses a method of screening for a drug candidate for treating endometriosis, comprising the steps of
  • step 2) comparing the levels of each nucleic acid in the sample before and after administration, specifically including calculating the probability of disease using a multivariate statistical model, preferably, the multivariate statistical model is a random forest model.
  • a further aspect of the present application discloses a method for detecting a microbiota in a female reproductive tract, comprising the following steps:
  • the level of each nucleic acid is the relative abundance of each nucleic acid; the reference data set or reference value is the nucleic acid of each of the biomarker combinations derived from the endometriosis population and the non-endometriosis population control Level.
  • the reference data set or reference value in step (2) is at least one of Table 5, Table 6, or Table 7; comparing the level of each nucleic acid with a reference data set or a reference value to obtain a detection result, specifically including
  • the multivariate statistical model is used to calculate the probability of disease. More preferably, the multivariate statistical model is a random forest model.
  • the microbial sample in the genital tract of the test subject is collected, specifically comprising collecting the lower third of the vaginal sample, the posterior vaginal sputum sample and the cervical canal sample of the test subject.
  • the collection of the microbial samples in the reproductive tract can be carried out by using a conventional nylon fluff swab, which is not specifically limited herein.
  • the biomarker combination of the present application is actually based on the relationship between the microbial DNA in the female reproductive tract and endometriosis, that is, the biomarker of the present application is actually a female.
  • the microbe OTU in the reproductive tract can reflect the state of endometriosis; therefore, the present application proposes a method for detecting microbiota in the female reproductive tract, which provides a judgment for the endometriosis or its risk by detecting the microbial population. And the basis for the assessment.
  • a further aspect of the present application discloses a method of preparing a biomarker combination of endometriosis comprising the following steps,
  • the microbial sample is collected in the genital tract, specifically comprising collecting the lower third of the vagina sample, the posterior vaginal sputum sample and the cervical canal sample.
  • the method for preparing the biomarker combination of endometriosis in the present application is to use a random forest model to fit and verify the association between the microbial DNA in the reproductive tract and endometriosis.
  • a biomarker combination is obtained that is capable of assessing the risk or risk of endometriosis.
  • the preparation method of the present application or its basic idea is not limited to the preparation of a biomarker combination for endometriosis; it can also be used to prepare a biomarker combination of similar conditions associated with the presence of microbial DNA in the reproductive tract. For example, a biomarker combination of adenomyosis.
  • the biomarker for detecting endometriosis of the present application has good specificity and can detect endometriosis well, and provides for detection or risk assessment of endometriosis. A new approach and can be used for early diagnosis of endometriosis.
  • the biomarker combination of the present application is used for the detection of endometriosis or the risk assessment of the disease, and has the advantages of high sensitivity and high specificity, and has important application value.
  • the genital tract sample as a biomarker combination test sample has the advantages of convenient material selection, simple operation steps and continuous in vitro detection.
  • biomarker combination of the present application is useful for the detection of endometriosis or for assessing the risk of disease with reproducible characteristics.
  • FIG. 1 is a graph showing the results of infertility caused by endometriosis based on a marker group of CL at the lower third of the vagina in the embodiment of the present application, wherein a is a random forest identification as the number of OTUs increases. Endometriosis (infertility) 5 times 10 fold cross-validation error rate distribution, b is crossed
  • ROC curve verified combined receiver operating curve
  • AUC area under the curve
  • the diagonal represents the curve with an AUC of 0.5;
  • FIG. 2 is a diagram showing the results of identifying endometriosis (infertility) based on the marker group of vaginal posterior sacral CU in the embodiment of the present application, in which a is an identification of random forests in the uterus as the number of OTUs increases.
  • Membrane ectopic (infertility) 5 times 10 fold cross-validation error rate distribution
  • b is the cross-validated combination receiver operating curve (abbreviated ROC curve)
  • the area under the curve (abbreviated AUC) is 0.5919
  • the shaded area represents a 95% confidence interval and the diagonal represents a curve with an AUC of 0.5;
  • FIG. 3 is a diagram showing the results of identifying endometriosis (infertility) based on the CV marker group of the cervical canal in the embodiment of the present application, in which a is an identification of endometrial abnormalities in random forests as the number of OTUs increases.
  • a is an identification of endometrial abnormalities in random forests as the number of OTUs increases.
  • ROC curve 4 is a ROC curve for identifying endometriosis (infertility) in a second population of the CL marker group at the lower third of the vagina in the embodiment of the present application;
  • FIG. 5 is a ROC curve for identifying endometriosis (infertility) in a second population of the vaginal posterior sputum CU marker group in the examples of the present application;
  • FIG. 6 is a ROC curve for identifying endometriosis (infertility) in a second population of the cervical canal CV marker group in the examples of the present application;
  • the biomarker of the present application is obtained based on the relationship between the microbial DNA of the three parts of the subject and the endometriosis.
  • the biomarkers of the present application are actually the three parts.
  • Microorganism OTU in the state of endometriosis Specifically, in a preparation method of the present application, the correspondence or biomarker is obtained by using the relative abundance of the OTU seed sequence as a subject, endometriosis state (sick or non-diseased) For the second object, the two are fitted through a random forest model, and finally obtained through five 10-fold cross-validation. Through rigorous calculations and experimental studies, the present application finally obtained twenty-four nucleic acids of 14 microorganisms in three sites as biomarkers of the present application.
  • the marker group of the three sites can independently evaluate the risk or risk of endometriosis, but the probability of combining the three sites is used to determine whether the subject has a uterus. Endometriosis or the risk of endometriosis, the accuracy will be higher.
  • Endometriosis of the present application is a common gynecological disease, defined as a condition caused by the growth of endometrial tissue outside the uterine cavity.
  • the most common endometriosis occurs in the ovaries and fallopian tubes, and may also occur in the myometrium, pelvic peritoneum, and even the bladder and large intestine.
  • endometriosis of the ovary can form an endometrial tumor with a brown liquid inside, so there is also a "chocolate cyst” or "juliek tumor", which will affect pregnancy.
  • Endometriosis and endometriosis (infertility) of the present application specifically refer to infertility caused by endometriosis which is common in the art.
  • Non-endometriosis of the present application specifically refers to the absence of endometriosis, which is common in the art, and can be pregnant.
  • the levels of the biomarker materials of the present application are indicated by relative abundance.
  • the reference value refers to a reference value or a normal value of a healthy control. It will be apparent to those skilled in the art that in the case where the number of samples is sufficient, the range of normal values, i.e., absolute values, of each biomarker can be obtained by inspection and calculation.
  • a biomarker may be any substance in an individual as long as they are related to a specific biological state of the individual to be examined, such as a disease.
  • Such biomarkers can be, for example, nucleic acid markers (eg, DNA), protein markers, cytokine markers, chemokine markers, carbohydrate markers, antigenic markers, antibody markers, species markers ( Species/genus markers) and functional markers (KO/OG markers).
  • the biomarkers of the present application are specifically DNA nucleic acid markers.
  • OTU refers to the operation taxonomic units (OTU), which is in the phylogenetic study or population genetics research.
  • OTU operation taxonomic units
  • a certain classification unit such as strain, species, and genus
  • grouping, etc. set the same flag.
  • the sequence is divided into an OTU according to a 97% similarity threshold, whereby a plurality of OTUs can be obtained for each of the three sites, and each OTU is regarded as a microbial species.
  • the microbial diversity in the samples and the abundance of different microorganisms are based on an analysis of the OTU.
  • refers to an animal, particularly a mammal, such as a primate, referred to as a human in the examples of the present application.
  • sample collection in this case was assisted by a gynaecologist at Shenzhen Peking University Hospital. Excluding the cases of inflammation, the subjects were non-menstrual, non-pregnancy, non-lactation women, no endocrine and autoimmune diseases, normal liver and kidney function. No hormones or antibiotics were used for some time before sampling, no vaginal medication, vaginal lavage and cervical treatment, and no sexual life was performed within 48 hours before sampling. According to the above criteria, this example screened 49 women of childbearing age as the first group. All individuals who meet the above criteria are registered with detailed phenotypic information to understand their medical history, family history, medication history and lifestyle, and have signed informed consent.
  • the lower genital tract sampling is performed after the individual is admitted to the hospital, without disinfection, after emptying the urine, the lower third of the vagina (abbreviation CL), the posterior vagina (abbreviated CU), and the cervical canal (abbreviated CV) are collected in the gynecological examination bed.
  • the sample numbers and sampling information of the 49 collection objects are number C026, C028, C033, C035, C038, C041, C045, C048, C050, C055, C056, C058, C059, C062, C063, C064, C065.
  • Nylon fluff swabs were purchased from Chenyang Global Group CY-93050 and CY-98000. After sampling, the swab head was quickly frozen with liquid nitrogen, stored at -80 ° C, and transported to Shenzhen Huada Gene Research Institute with dry ice for subsequent experiments.
  • DNA extraction was performed using the QIAamp DNA Mini Kit (purchased from QIAGEN). The specific extraction steps are carried out in accordance with the instructions provided by the manufacturer.
  • the 16S rRNA gene V4-V5 hypervariable region-specific primers were used for amplification. The two primers were V4-515F and V5-907R, V4-515F was the sequence shown by Seq ID No. 25, and V5-907R was Seq ID No. The sequence shown in 26.
  • the PCR procedure was as follows, denaturation at 94 ° C for 3 min; then into 25 cycles: denaturation at 94 ° C for 45 s, annealing at 50 ° C for 60 s, extension at 72 ° C for 90 s; after the end of the cycle, extension at 72 ° C for 10 min.
  • the obtained PCR product was purified by AMPure Beads (Axygen), and sequencing was carried out by chip lane sequencing, and a plurality of samples were mixed and sequenced. Therefore, the library construction requires the addition of a linker sequence after ligation of a 10 bp barcode sequence at the outer end of the primer sequence of each sample.
  • V5-V4 reverse sequencing was performed by Ion torrent PGM sequencing platform. The above library construction and sequencing were carried out by Shenzhen Huada Gene.
  • the raw data was extracted and pre-processed from the PGM system using Mothur software (V1.33.3).
  • the standards for high-quality sequences include: 1) length greater than 200 bp; 2) less than 2 mismatched bases with degenerate PCR; 3) The average quality score is greater than 25.
  • the OTU was clustered using the QIIME uclust method, and the similarity threshold was set to 97%.
  • a seed sequence of each OTU was selected and annotated using the reference gene information gg_13_8_otus in the Greengene database.
  • the relative abundance of each OTU in each sample is calculated, where the relative abundance of an OTU is the ratio of the abundance of the OTU in a sample to the sum of all OTU abundances in the sample.
  • this example uses the Sorenson index ( –Dice index) to measure the similarity of microbial populations at different sites in the same individual, calculated as follows:
  • a and B represent the number of OTUs in samples A and B, respectively, and C represents the number of OTUs shared in the two samples.
  • QS is a similarity index and ranges from 0 to 1.
  • the similarity index of CL and CU, the similarity index of CL and CV, and the similarity index of CU and CV were calculated.
  • the similarity index is close to 1, indicating that the similarity of the microbiota at the two sampling sites is higher.
  • the relative abundance of each sample and the endometriosis state were fitted using the randomForest toolkit in R software (3.1.2RC).
  • the default parameters are used; wherein the OTU of each sample is an OTU present in at least 10% of the sample, that is, only less than 10% of all samples to be tested in each part can be inspected. Out of the OTU.
  • five 10-fold cross-validation is performed, and the error curves of the five 10-fold cross-validation are averaged, and the lowest error of the average post-curve is added to the standard error of the point as the domain value of the acceptable error.
  • the least number of OTUs is the optimal OTU combination as a biomarker combination for identifying endometriosis.
  • this example additionally used an independent test population, that is, the second population for verification.
  • the second group there were 11 endometriosis patients and 11 non-endometriosis individuals for CL and CU; for CV, there were 12 patients with endometriosis and 10 A non-endometriosis individual.
  • this example calculates the distance between samples of the same individual.
  • the weighted UniFrac distance from the posterior vagina (CU), cervical (CV) mucus to the uterus and peritoneal fluid increased sequentially relative to the lower vaginal 1/3 (CL) sample. This again indicates that the community structure of the female reproductive tract is continuously changing as the anatomical structure is from bottom to top.
  • the cervical mucus was sampled through the vagina and the uterine cavity, respectively. It was found that the bacterial distribution of the samples taken by the two routes showed a high degree of similarity, further indicating that the uterine cavity microorganisms can be evaluated by analyzing the easily available cervical tube samples. Case.
  • this example establishes a random forest model.
  • the specific steps are as follows: (1) Using the relative abundance of OTU as an input feature, design a random forest model based on the first population; (2) For the random forest model, a 10-fold cross-validation algorithm was designed. The first group was divided into two types: endometriosis individuals and non-endometriosis individuals, and the ROC curves of random forest models were obtained. The area AUC value under each ROC curve was used as an evaluation index.
  • a random forest model was used, combined with a 10-fold cross-validation, to obtain the optimal biomarkers for each part, as shown in Table 1, for the identification of endometriosis.
  • Tables 2 to 4 show the enrichment information of the marker group of the three sites in the sample, and Tables 5 to 7 respectively show the relative abundance information of the marker group of the three sites in the first population sample.
  • the biomarkers at three sites identify the results of endometriosis, as shown in Figures 1 to 3.
  • Figure 1 shows the marker in the lower third of the vagina (CL).
  • Figure 2 is the marker group of vaginal posterior iliac crest (CU) to identify endometriosis
  • Figure 3 is the marker group of cervical canal (CV) to identify endometriosis.
  • Dysgonomonas sp. -- -- ⁇ 6 33 Aerococcus sp. ⁇ -- -- 7 35 Prevotella sp. ⁇ ⁇ -- 8 38 Lactobacillus sp. -- -- -- ⁇ 9 42 Tissierellaceae -- -- ⁇ 10 48 Comamonadaceae -- -- ⁇ 11 54 Erysipelotrichaceae -- -- ⁇ 12 61 Lactobacillus sp. ⁇ -- ⁇ 13 64 Dialister sp. ⁇ -- -- 14 70 Erysipelothrix sp. -- -- ⁇ 15 86 Anaerococcus sp. ⁇ -- -- 16 108 Dysgonomonas sp.
  • Lactobacillus sp. ⁇ -- -- 18 233 Lactobacillus sp. ⁇ -- -- 19 344 Lactobacillus sp. ⁇ -- -- 20 424 Lactobacillus iners ⁇ -- -- twenty one 464 Prevotella sp. ⁇ -- -- twenty two 520 Lactobacillus iners ⁇ -- -- twenty three 628 Lactobacillus iners -- -- ⁇ twenty four 663 Lactobacillus sp. ⁇ -- -- --
  • the relative abundance of the OTU of each part is calculated, and the relative abundance is input into the random forest model to obtain the result, and it is judged whether it is endometriosis.
  • the endometriosis group refers to the sample of endometriosis among the 49 subjects in the first group
  • the control group refers to the 49 subjects in the first group.
  • a sample of endometriosis is
  • Figure 1 shows the endometriosis identified by the marker group at the lower third of the vagina (CL).
  • a is a randomized forest identification of endometriosis 5 times as the number of OTUs increases. 10% cross-validation error rate distribution, the model was trained with the relative abundance of OTU in the sample, using a total of 17 non-endometriosis individuals and 32 endometriosis CL samples, The black line represents the average of 5 trials, the gray line represents 5 trials, the black vertical line represents the number of OTUs in the best combination, and the b plot is the cross-validated combination receiver operating curve with the area under the curve AUC At 0.8272, the shaded area represents a 95% confidence interval and the diagonal represents a curve with an AUC of 0.5.
  • Figure 2 shows the identification of endometriosis in the marker group of posterior vaginal vault (CU).
  • a is a 10-fold crossover of random forest identification of endometriosis with increasing number of OTUs.
  • Test The distribution of error rates, the model was trained with the relative abundance of OTU in the sample, using a total of 17 non-endometriosis individuals and 32 uterine endometriosis CU samples, black lines represent The average of 5 trials, the gray line is 5 trials respectively, the black vertical line represents the number of OTUs in the best combination;
  • the b diagram is the receiver operation curve of the cross-validated combination, the area under the curve AUC is 0.5919, the shaded area Represents a 95% confidence interval and the diagonal represents a curve with an AUC of 0.5.
  • Figure 3 shows the identification of endometriosis in the cervical canal (CV) marker group.
  • a is a 10-fold cross-validation of random forest identification of endometriosis with increasing number of OTUs. The distribution of error rates, the model was trained with the relative abundance of OTU in the sample, using a total of 17 CV samples from individuals with non-endometriosis and 32 individuals with endometriosis, with black lines representing 5 The average of the subtests, the gray line is 5 trials respectively, the black vertical line represents the number of OTUs in the best combination; the b diagram is the receiver operation curve of the cross-validated combination, the area under the curve AUC is 0.8493, and the shaded area represents 95% confidence interval, diagonal represents a curve with an AUC of 0.5.
  • the OTU biomarker group at three different sites can identify individuals with endometriosis and non-endometriosis individuals;
  • the area under the curve of the ROC is AUC It is 0.8272 (CL), 0.5919 (CU) and 0.8493 (CV).
  • AUC is the area under the curve, and the larger the value, that is, the closer to 1, indicating that the judgment ability is stronger, that is, the more accurate the judgment.
  • the OTU biomarkers obtained from the random forest were verified in the second population samples, and the results are shown in Table 8, Table 9, and Table 10.
  • sample numbers such as C003CL, C003CU, and C003CV respectively indicate samples of three parts of CL, CU, and CV collected from the same C003 sampling object.
  • Tables 8 to 10 show the probability that the three marker groups predict individuals with endometriosis, and the ROC curves thus obtained are sequentially shown in Figs. 4 to 6 .
  • the probability > 0.5 is considered to be that the individual has a risk of suffering from endometriosis or endometriosis through the marker group at the site.
  • Table 9 CU marker group at CU site predicts the probability of second population sample suffering from endometriosis
  • Table 10 CV marker group at CV site predicts the probability of second population sample suffering from endometriosis
  • the results in Figure 4 show that the CL site is based on the CL marker group to determine the probability of endometriosis with an AUC value of 0.8750; the results in Figure 5 show that the CU site is based on the CU marker group to determine the probability of endometriosis, its AUC The value of 0.840; the results of Figure 6 show that the CV site based on the CV marker group to determine the probability of endometriosis, its AUC value is 0.9189; it can be seen that these three marker groups have a higher discriminating ability, can be used in the uterus Detection of endometriosis, the results are consistent with the results of Tables 8 to 10.

Abstract

A biomarker composition for detection or risk assessment of endometriosis, and application thereof. The biomarker composition comprises at least one of 24 nucleic acids. The 24 nucleic acids have sequences represented by Seq ID No. 1 to Seq ID No. 24 respectively, or sequences having similarity of 97% or more to those represented by Seq ID No. 1 to Seq ID No. 24. The biomarker composition has excellent specificity, and can be used for early diagnosis of endometriosis, thereby providing a new approach for detection or risk assessment of endometriosis. The biomarker composition has high sensitivity and specificity, good reproducibility, and important application value. By using a genital tract specimen as a biomarker detection specimen, the material is easily available, the operation steps are simple, and continuous in-vitro detection can be implemented.

Description

用于子宫内膜异位症检测的生物标志物组合及应用Biomarker combination and application for detection of endometriosis 技术领域Technical field
本申请涉及生物标志物领域,特别是涉及一种用于子宫内膜异位症检测或患病风险评估的生物标志物组合及其应用。The present application relates to the field of biomarkers, and in particular to a biomarker combination for endometriosis detection or risk assessment of disease and its use.
背景技术Background technique
子宫内膜异位症是育龄妇女的多发病,引发多种临床症状。近年发病率呈明显上升趋势,已达10%-15%,占妇科手术的30%以上。子宫内膜异位症与不孕症的关系非常密切,约40%-50%的不孕患者合并子宫内膜异位症导致的不孕。目前研究认为,微生物感染在子宫内膜异位症导致的不孕中起到非常重要的作用。此外,诸多研究显示,子宫内膜异位症腹腔液中的炎症因子,如肿瘤坏死因子(TNF-α)、白细胞介素6(IL-6)等,可通过干扰生殖过程中某些环节导致不孕。尽管子宫内膜异位症是一种良性疾病,但其病变广泛、形态多样,极具侵袭和复发性,对脏器结构和功能的破坏与恶性肿瘤相当,故有“良性癌”之称。子宫内膜异位症的临床表现与疾病程度不成正比,因此缺乏有效的临床诊断方法。Endometriosis is a frequently-occurring disease in women of childbearing age, causing a variety of clinical symptoms. In recent years, the incidence rate has increased significantly, reaching 10%-15%, accounting for more than 30% of gynecological surgery. Endometriosis is closely related to infertility, and about 40%-50% of infertile patients have infertility caused by endometriosis. Current research suggests that microbial infection plays a very important role in infertility caused by endometriosis. In addition, many studies have shown that inflammatory factors in the peritoneal fluid of endometriosis, such as tumor necrosis factor (TNF-α) and interleukin-6 (IL-6), can interfere with certain aspects of the reproductive process. Infertility. Although endometriosis is a benign disease, its lesions are extensive, morphologically diverse, extremely aggressive and recurrent, and the destruction of organ structure and function is comparable to malignant tumors, so it is known as "benign cancer." The clinical manifestations of endometriosis are not directly proportional to the extent of the disease, so there is a lack of effective clinical diagnostic methods.
对于子宫内膜异位症,尽管目前有部分应用于临床的标志物,如癌胚抗原125(CA125)、胎盘蛋白14(PP14)、子宫内膜抗体和白细胞介素6(IL-6)等,它们对于临床诊断和跟踪复发等方面具有一定的参考价值,但仍缺乏特异性。For endometriosis, although some of the currently used clinical markers, such as carcinoembryonic antigen 125 (CA125), placental protein 14 (PP14), endometrial antibodies and interleukin-6 (IL-6), etc. They have certain reference value for clinical diagnosis and tracking of recurrence, but they still lack specificity.
因此,寻找敏感、特异的子宫内膜异位症的生物标志物是目前急需解决的问题。Therefore, the search for sensitive and specific biomarkers of endometriosis is an urgent problem to be solved.
发明内容Summary of the invention
本申请的目的是提供一种用于子宫内膜异位症检测的生物标志物及其制备方法和应用。The object of the present application is to provide a biomarker for the detection of endometriosis and a preparation method and application thereof.
为了实现上述目的,本申请采用了以下技术方案:In order to achieve the above objectives, the present application adopts the following technical solutions:
本申请的一方面公开了一种用于子宫内膜异位症检测的生物标志物,生物标志物组合包括二十四条核酸中的至少一条,二十四条核酸分别为Seq ID No.1至Seq ID No.24所示序列,或者分别为与Seq ID No.1至Seq ID No.24所示序列具有97%以上相似性的序列。One aspect of the present application discloses a biomarker for the detection of endometriosis, the biomarker combination comprising at least one of twenty-four nucleic acids, each of which is Seq ID No. 1 The sequence shown in Seq ID No. 24, or a sequence having 97% or more similarity to the sequence shown in Seq ID No. 1 to Seq ID No. 24, respectively.
需要说明的是,本申请的二十四条核酸是经过研究得出的,和子宫内膜异位症有关联的核酸序列,其中每条核酸序列都与子宫内膜异位症有关联性,因 此,在不考虑判断准确性的情况下或者对此要求较低的情况下,可以单独或者组合用于子宫内膜异位症检测或者患病风险评估。但是,本申请的一种优选方案中,不仅二十四条核酸一起使用,而且,还将二十四条核酸按照特定的规律进行分类,分成多个标志物组,各个标志物组一起用于子宫内膜异位症检测或者患病风险评估,这将在后面的优选技术方案中详细描述。It should be noted that the twenty-four nucleic acids of the present application are researched and derived from a nucleic acid sequence associated with endometriosis, wherein each nucleic acid sequence is associated with endometriosis. Cause Therefore, the endometriosis test or the risk assessment of the disease can be used alone or in combination without considering the accuracy of the judgment or when the requirement is low. However, in a preferred embodiment of the present application, not only twenty-four nucleic acids are used together, but also twenty-four nucleic acids are classified according to a specific rule, and are divided into a plurality of marker groups, and each marker group is used together. Endometriosis detection or risk assessment of disease, which will be described in detail in the following preferred technical solutions.
还需要说明的是,本申请的二十四条核酸是根据97%以上相似性进行聚类分析,然后从每个分类单元(缩写OTU)中选取最具代表性的序列作为种子序列,其中与子宫内膜异位症具有关联性的二十四个种子序列,即组成本申请的生物标志物组合;因此,本申请的生物标志物组合中,二十四条核酸不仅限于Seq ID No.1至Seq ID No.24所示序列,还可以是与Seq ID No.1至Seq ID No.24所示序列具有97%以上相似性的序列。It should also be noted that the twenty-four nucleic acids of the present application are clustered according to 97% or more similarity, and then the most representative sequence is selected from each taxon (abbreviation OTU) as a seed sequence, wherein Endometriosis has twenty-four seed sequences that are related, ie, constitutes a biomarker combination of the present application; therefore, in the biomarker combination of the present application, twenty-four nucleic acids are not limited to Seq ID No. 1 The sequence shown in Seq ID No. 24 may also be a sequence having 97% or more similarity to the sequence shown by Seq ID No. 1 to Seq ID No. 24.
需要补充说明的是,本申请的用于子宫内膜异位症检测或患病风险评估的生物标志物组合,并不是直接根据检测生物标志物组合的有或者无进行子宫内膜异位症检测或患病风险评估的,而是,在检测到生物标志物组合后,通过分析其相对丰度,并将相对丰度带入随机森林模型进行判断,根据随机森林模型输出的概率判断待测对象是否患有子宫内膜异位症或评估待测对象患子宫内膜异位症的风险,这将在后面的技术方案中详细说明。It should be added that the biomarker combination for the detection of endometriosis or the risk assessment of the present application is not directly based on the detection of the combination of biomarkers with or without endometriosis detection. Or the risk assessment, but after detecting the biomarker combination, by analyzing its relative abundance, and taking the relative abundance into the random forest model to judge, the object to be tested is judged according to the probability of the random forest model output. Whether or not there is endometriosis or assessing the risk of endometriosis in the subject to be tested will be described in detail in the following technical solutions.
优选的,本申请的另一面公开了一种用于子宫内膜异位症检测或患病风险评估的生物标志物组合,包括第一标志物组、第二标志物组和第三标志物组中的至少一组;第一标志物组由十四条核酸组成,十四条核酸分别为Seq ID No.1、Seq ID No.2、Seq ID No.6、Seq ID No.7、Seq ID No.12、Seq ID No.13、Seq ID No.15、Seq ID No.17至Seq ID No.22、Seq ID No.24所示序列,或者分别为与Seq ID No.1、Seq ID No.2、Seq ID No.6、Seq ID No.7、Seq ID No.12、Seq ID No.13、Seq ID No.15、Seq ID No.17至Seq ID No.22、Seq ID No.24所示序列具有97%以上相似性的序列;第二标志物组由两条核酸组成,两条核酸分别为Seq ID No.1、Seq ID No.7所示序列,或者分别为与Seq ID No.1、Seq ID No.7所示序列具有97%以上相似性的序列;第三标志物组由十一条核酸组成,十一条核酸分别为Seq ID No.3至Seq ID No.5、Seq ID No.8至Seq ID No.12、Seq ID No.14、Seq ID No.16、Seq ID No.23所示序列,或者分别为与Seq ID No.3至Seq ID No.5、Seq ID No.8至Seq ID No.12、Seq ID No.14、Seq ID No.16、Seq ID No.23所示序列具有97%以上相似性的序列。Preferably, another aspect of the present application discloses a biomarker combination for endometriosis detection or risk assessment, including a first marker panel, a second marker panel, and a third marker panel At least one of the groups; the first marker group consists of fourteen nucleic acids, respectively, Seq ID No. 1, Seq ID No. 2, Seq ID No. 6, Seq ID No. 7, Seq ID No. 12, Seq ID No. 13, Seq ID No. 15, Seq ID No. 17 to Seq ID No. 22, Seq ID No. 24, or Seq ID No. 1, Seq ID No, respectively. .2, Seq ID No. 6, Seq ID No. 7, Seq ID No. 12, Seq ID No. 13, Seq ID No. 15, Seq ID No. 17 to Seq ID No. 22, Seq ID No. 24 The sequence shown has a sequence of 97% or more similarity; the second marker group consists of two nucleic acids, each of which is a sequence of Seq ID No. 1, Seq ID No. 7, or Seq ID No, respectively. 1. The sequence shown by Seq ID No. 7 has a sequence of 97% or more similarity; the third marker group consists of eleven nucleic acids, and the eleven nucleic acids are Seq ID No. 3 to Seq ID No. 5, respectively. S Eq ID No. 8 to Seq ID No. 12, Seq ID No. 14, Seq ID No. 16, Seq ID No. 23, or Seq ID No. 3 to Seq ID No. 5, Seq, respectively. The sequence shown by ID No. 8 to Seq ID No. 12, Seq ID No. 14, Seq ID No. 16, and Seq ID No. 23 has a sequence of 97% or more similarity.
需要说明的是,本申请的优选方案中,将二十四条核酸可重复选择的分为三个标志物组,即第一标志物组、第二标志物组和第三标志物组;通过三个标 志物组的综合判断,可以大大提高本申请的生物标志物组合检测子宫内膜异位症或者评估患病风险的准确性。It should be noted that, in a preferred embodiment of the present application, twenty-four nucleic acids are reproducibly divided into three marker groups, namely, a first marker group, a second marker group, and a third marker group; Three standard The comprehensive judgment of the volunteer group can greatly improve the accuracy of detecting the endometriosis of the biomarker combination of the present application or assessing the risk of the disease.
优选的,第一标志物组为CL标志物组,用于对来自阴道下1/3的样品进行子宫内膜异位症检测或患病风险评估。Preferably, the first marker group is a CL marker group for performing endometriosis detection or risk assessment of a sample from the lower third of the vagina.
优选的,第二标志物组为CU标志物组,用于对来自阴道后穹窿的样品进行子宫内膜异位症检测或患病风险评估。Preferably, the second marker group is a CU marker group for performing endometriosis detection or risk assessment of a sample from the vaginal posterior iliac crest.
优选的,第三标志物组为CV标志物组,用于对来自宫颈管的样品进行子宫内膜异位症检测或患病风险评估。Preferably, the third marker group is a CV marker group for performing endometriosis detection or risk assessment of a sample from the cervical canal.
需要说明的是,本申请的生物标志物组合中的二十四条核酸实际上代表的是阴道下1/3、阴道后穹窿和宫颈管三个部位的14种微生物;本申请通过对阴道下1/3、阴道后穹窿和宫颈管三个部位的14种微生物的二十四条核酸进行检测,并对其相对丰度与子宫内膜异位症的关系进行统计分析,建立随机森林模型,以此判断待测对象是否患有子宫内膜异位症或是否具有患子宫内膜异位症的风险。因此,三个标志物组,实际上就是分别对应三个采样部位;来自于三个部位的样品,分别对应各自的标志物组,独立进行分析判断。只是,根据三者的结果进行综合判断,能够提高本申请的生物标志物组合检测子宫内膜异位症或者评估患病风险的准确性。It should be noted that the twenty-four nucleic acids in the biomarker combination of the present application actually represent 14 microorganisms in the lower third of the vagina, the posterior vaginal canal and the cervical canal; the present application passes under the vagina Twenty-four nucleic acids of 14 microorganisms in 1/3, vaginal posterior fornix and cervical canal were detected, and the relationship between their relative abundance and endometriosis was statistically analyzed to establish a random forest model. In this way, it is judged whether the subject to be tested has endometriosis or is at risk of suffering from endometriosis. Therefore, the three marker groups actually correspond to three sampling sites respectively; the samples from the three sites correspond to the respective marker groups, and are independently analyzed and judged. However, comprehensive judgment based on the results of the three can improve the accuracy of detecting the endometriosis of the biomarker combination of the present application or assessing the risk of the disease.
还需要说明的是,在阴道下1/3、阴道后穹窿和宫颈管这三个部位中,其微生物数量远不止14个种,14种微生物的核酸也远不止本申请所记载的24个;但是,本申请根据随机森林模型从中筛选出14种微生物的二十四条核酸,以作为子宫内膜异位症检测的生物标志物,为子宫内膜异位症的检测和评估提供了一条新的途径。It should also be noted that in the lower third of the vagina, the posterior vaginal canal and the cervical canal, the number of microorganisms is far more than 14 species, and the nucleic acids of 14 microorganisms are far more than the 24 records described in this application; However, this application screens out 24 nucleic acids of 14 microorganisms according to the random forest model as a biomarker for the detection of endometriosis, providing a new measure for the detection and evaluation of endometriosis. Way.
需要补充说明的是,三个标志物组中,CL标志物组即阴道下1/3样品的标志物组,阴道下1/3缩写为CL;CU标志物组即阴道后穹窿样品的标志物组,阴道后穹窿缩写为CU;CV标志物组即宫颈管样品的标志物组,宫颈管缩写为CV。It should be added that in the three marker groups, the CL marker group is the marker group of the lower third of the vagina, the lower third of the vagina is abbreviated as CL; the marker of the CU marker group is the marker of the posterior vaginal sample. Group, vaginal posterior hernia is abbreviated as CU; CV marker group is the marker group of cervical canal sample, and cervical canal is abbreviated as CV.
本申请的另一面公开了一种用于子宫内膜异位症检测或患病风险评估的试剂盒,该剂盒中包含用于检测本申请的生物标志物组合的引物对,引物对的正向引物为SEQ ID No.25所示序列,反向引物为SEQ ID No.26所示序列。Another aspect of the present application discloses a kit for endometriosis detection or risk assessment of a disease, comprising a primer pair for detecting a biomarker combination of the present application, the positive of the primer pair The primer was the sequence shown in SEQ ID No. 25, and the reverse primer was the sequence shown in SEQ ID No. 26.
需要说明的是,本申请的生物标志物组合,可以作为一个标准参考存在于试剂盒中,而引物对则是直接用于PCR扩增待测样品中的生物标志物组合的。It should be noted that the biomarker combination of the present application can be present in the kit as a standard reference, and the primer pair is directly used for PCR amplification of the biomarker combination in the sample to be tested.
本申请的另一面公开了本申请的生物标志物组合在子宫内膜异位症药物筛选或者在制备子宫内膜异位症检测或患病风险评估的试剂盒或检测工具中的应 用。The other side of the present application discloses that the biomarker combination of the present application should be used in the screening of endometriosis drugs or in the preparation of kits or detection tools for endometriosis detection or risk assessment of disease. use.
可以理解,本申请的生物标志物组合本身就是针对子宫内膜异位症而研究的,当然可以用于子宫内膜异位症的检测或风险评估;而本申请的生物标志物组合也可以整合到一些专门用于子宫内膜异位症检测的试剂盒或工具中,以方便子宫内膜异位症的检测和评估,只要采用了本申请的生物标志物组合,都在本申请的保护范围内。与此同时,由于本申请的生物标志物组合可以检测子宫内膜异位症或者对子宫内膜异位症进行患病风险评估;当然,可以对比检测用药前和用药后的子宫内膜异位症患病情况或者患病风险变化,从而判断所用药物是否有效,以达到药物筛选的目的。It can be understood that the biomarker combination of the present application is itself studied for endometriosis, and can of course be used for the detection or risk assessment of endometriosis; and the biomarker combination of the present application can also be integrated. To some kits or tools specially designed for the detection of endometriosis to facilitate the detection and evaluation of endometriosis, as long as the biomarker combination of the present application is used, it is within the scope of protection of the present application. Inside. At the same time, because the biomarker combination of the present application can detect endometriosis or assess the risk of endometriosis; of course, it can compare the endometriosis before and after treatment. The disease condition or the risk of the disease changes to determine whether the drug used is effective for the purpose of drug screening.
本申请的再一面公开了一种子宫内膜异位症的检测方法,包括以下步骤,A further aspect of the present application discloses a method for detecting endometriosis, comprising the following steps,
(1)对待测对象进行样品采集,检测所采集的样品中本申请的生物标志物组合,并分析生物标志物组合中各核酸的水平;(1) performing sample collection on the object to be tested, detecting the biomarker combination of the present application in the collected sample, and analyzing the level of each nucleic acid in the biomarker combination;
(2)将步骤(1)测得的各核酸的水平与参考数据集或参考值进行比较,获得检测结果;(2) comparing the level of each nucleic acid measured in the step (1) with a reference data set or a reference value to obtain a detection result;
优选的,各核酸的水平为各核酸的相对丰度;参考数据集或参考值为来源于子宫内膜异位症人群和非子宫内膜异位症人群对照的生物标志物组合中各核酸的水平。Preferably, the level of each nucleic acid is the relative abundance of each nucleic acid; the reference data set or reference value is the nucleic acid of each of the biomarker combinations derived from the endometriosis population and the non-endometriosis population control Level.
更优选的,步骤(2)中的参考数据集或参考值为表5、表6或表7中的至少一组;将各核酸的水平与参考数据集或参考值进行比较获得检测结果,具体包括,利用多元统计模型计算得出患病概率,优选的,多元统计模型为随机森林模型。More preferably, the reference data set or reference value in step (2) is at least one of Table 5, Table 6, or Table 7; comparing the level of each nucleic acid with a reference data set or a reference value to obtain a detection result, specifically Including, using multivariate statistical models to calculate the probability of disease, preferably, the multivariate statistical model is a random forest model.
更优选的,步骤(1)中对待测对象进行样品采集,包括采集待测对象阴道下1/3样品、阴道后穹窿样品和宫颈管样品。More preferably, the sample to be tested is subjected to sample collection in step (1), including collecting the lower third of the vagina sample, the posterior vaginal sputum sample and the cervical canal sample.
需要说明的是,本申请的生物标志物组合中的各核酸片段都是与子宫内膜异位症相关联的核酸片段,因此,通过分析待测对象中生物标志物组合的水平,即相对丰度,就可以检测待测对象是否患病或判断其患病风险。It should be noted that each nucleic acid fragment in the biomarker combination of the present application is a nucleic acid fragment associated with endometriosis, and therefore, by analyzing the level of the biomarker combination in the test subject, that is, relatively abundant In degree, it is possible to detect whether the subject to be tested is sick or to determine the risk of the disease.
本申请的再一面公开了一种通过检测生物标志物判断子宫内膜异位症的方法在制备子宫内膜异位症检测或患病风险评估试剂盒或工具中的应用;其中,生物标志物为本申请的生物标志物组合;A further aspect of the present application discloses a method for preparing endometriosis detection or disease risk assessment kit or tool by detecting a biomarker for determining endometriosis; wherein the biomarker a biomarker combination of the present application;
通过检测生物标志物判断子宫内膜异位症的方法包括以下步骤,A method for determining endometriosis by detecting a biomarker includes the following steps,
(1)对待测对象进行样品采集,检测所采集的样品中本申请的生物标志物组合,并分析生物标志物组合中各核酸的水平;(1) performing sample collection on the object to be tested, detecting the biomarker combination of the present application in the collected sample, and analyzing the level of each nucleic acid in the biomarker combination;
(2)将步骤(1)测得的各核酸的水平与参考数据集或参考值进行比较,获得检测结果; (2) comparing the level of each nucleic acid measured in the step (1) with a reference data set or a reference value to obtain a detection result;
优选的,各核酸的水平为各核酸的相对丰度;参考数据集或参考值为来源于子宫内膜异位症患者和非子宫内膜异位症对照的生物标志物组合中各核酸的水平。Preferably, the level of each nucleic acid is the relative abundance of each nucleic acid; the reference data set or reference value is the level of each nucleic acid in the biomarker combination derived from endometriosis patients and non-endometriosis controls. .
更优选的,步骤(2)中的参考数据集或参考值为表5、表6或表7中的至少一组;将各核酸的水平与参考数据集或参考值进行比较获得检测结果,具体包括,利用多元统计模型计算得出患病概率,优选地,多元统计模型为随机森林模型。More preferably, the reference data set or reference value in step (2) is at least one of Table 5, Table 6, or Table 7; comparing the level of each nucleic acid with a reference data set or a reference value to obtain a detection result, specifically Including, using a multivariate statistical model to calculate the probability of disease, preferably, the multivariate statistical model is a random forest model.
本申请的再一面公开了一种筛选治疗子宫内膜异位症的候选药物的方法,包括以下步骤,A further aspect of the present application discloses a method of screening for a drug candidate for treating endometriosis, comprising the steps of
1)分别测定用药前和用药后的样品中本申请的生物标志物组合,并分析生物标志物组合中各核酸的水平;1) separately determining the biomarker combination of the present application in the sample before and after administration, and analyzing the level of each nucleic acid in the biomarker combination;
2)根据比较用药前和用药后的样品中各核酸的水平,判断候选药物;2) judging the candidate drug according to the level of each nucleic acid in the sample before and after the drug is compared;
步骤2)中,比较用药前和用药后的样品中各核酸的水平,具体包括,利用多元统计模型计算得出患病概率,优选地,多元统计模型为随机森林模型。In step 2), comparing the levels of each nucleic acid in the sample before and after administration, specifically including calculating the probability of disease using a multivariate statistical model, preferably, the multivariate statistical model is a random forest model.
本申请的再一面公开了一种女性生殖道内微生物群的检测方法,包括以下步骤,A further aspect of the present application discloses a method for detecting a microbiota in a female reproductive tract, comprising the following steps:
(1)采集待测对象生殖道内微生物样品,检测所采集的样品中本申请的生物标志物组合,并分析生物标志物组合中各核酸的水平;(1) collecting a microbial sample in the reproductive tract of the test subject, detecting the biomarker combination of the present application in the collected sample, and analyzing the level of each nucleic acid in the biomarker combination;
(2)将步骤(1)测得的各核酸的水平与参考数据集或参考值进行比较,获得检测结果;(2) comparing the level of each nucleic acid measured in the step (1) with a reference data set or a reference value to obtain a detection result;
优选的,各核酸的水平为各核酸的相对丰度;参考数据集或参考值为来源于子宫内膜异位症人群和非子宫内膜异位症人群对照的生物标志物组合中各核酸的水平。Preferably, the level of each nucleic acid is the relative abundance of each nucleic acid; the reference data set or reference value is the nucleic acid of each of the biomarker combinations derived from the endometriosis population and the non-endometriosis population control Level.
优选的,步骤(2)中的参考数据集或参考值为表5、表6或表7中的至少一组;将各核酸的水平与参考数据集或参考值进行比较获得检测结果,具体包括,利用多元统计模型计算得出患病概率,更优选地,多元统计模型为随机森林模型。Preferably, the reference data set or reference value in step (2) is at least one of Table 5, Table 6, or Table 7; comparing the level of each nucleic acid with a reference data set or a reference value to obtain a detection result, specifically including The multivariate statistical model is used to calculate the probability of disease. More preferably, the multivariate statistical model is a random forest model.
优选的,步骤(1)中采集待测对象生殖道内微生物样品,具体包括采集待测对象阴道下1/3样品、阴道后穹窿样品和宫颈管样品。其中,生殖道内微生物样品的采集可以采用常规的尼龙绒屑拭子,在此不做具体限定。Preferably, in step (1), the microbial sample in the genital tract of the test subject is collected, specifically comprising collecting the lower third of the vaginal sample, the posterior vaginal sputum sample and the cervical canal sample of the test subject. Among them, the collection of the microbial samples in the reproductive tract can be carried out by using a conventional nylon fluff swab, which is not specifically limited herein.
需要说明的是,本申请的生物标志物组合,实际上就是根据女性生殖道内微生物群DNA与子宫内膜异位症之间的关系而得出的,即本申请的生物标志物实际上就是女性生殖道内能够体现子宫内膜异位症状态的微生物OTU;因此,本申请提出了一种女性生殖道内微生物群检测方法,通过微生物群的检测为子宫内膜异位症或其患病风险提供判断和评估依据。 It should be noted that the biomarker combination of the present application is actually based on the relationship between the microbial DNA in the female reproductive tract and endometriosis, that is, the biomarker of the present application is actually a female. The microbe OTU in the reproductive tract can reflect the state of endometriosis; therefore, the present application proposes a method for detecting microbiota in the female reproductive tract, which provides a judgment for the endometriosis or its risk by detecting the microbial population. And the basis for the assessment.
本申请的再一面公开了一种制备子宫内膜异位症生物标志物组合的方法,包括以下步骤,A further aspect of the present application discloses a method of preparing a biomarker combination of endometriosis comprising the following steps,
(1)分别对子宫内膜异位症病患和非病患进行生殖道内微生物样品采集,对所有采集的样品分别进行16S测序;(1) Collecting microbial samples in the reproductive tract of endometriosis patients and non-patients, respectively, and performing 16S sequencing on all collected samples;
(2)将16S测序结果进行聚类分析,获得OTU单元以及每个OTU的种子序列,并计算每个OTU单元的相对丰度;(2) Clustering the 16S sequencing results to obtain the OTU unit and the seed sequence of each OTU, and calculate the relative abundance of each OTU unit;
(3)利用随机森林模型对每个OTU单元的相对丰度与子宫内膜异位症状态进行拟合,并进行5次十折交叉验证,得到最优的OTU组合,最优OTU组合中各OTU的种子序列,即组成子宫内膜异位症的生物标志物组合。(3) Using the random forest model to fit the relative abundance of each OTU unit with the endometriosis state, and perform five 10-fold cross-validation to obtain the optimal OTU combination, and the optimal OTU combination. The seed sequence of OTU, a biomarker combination that constitutes endometriosis.
优选的,步骤(1)中,生殖道内微生物样品采集,具体包括采集待测对象阴道下1/3样品、阴道后穹窿样品和宫颈管样品。Preferably, in step (1), the microbial sample is collected in the genital tract, specifically comprising collecting the lower third of the vagina sample, the posterior vaginal sputum sample and the cervical canal sample.
需要说明的是,本申请子宫内膜异位症生物标志物组合的制备方法,其关键在于利用随机森林模型对生殖道内微生物群DNA与子宫内膜异位症的关联进行拟合、验证,最终得到能够对子宫内膜异位症患病或风险进行评估的生物标志物组合。可以理解,本申请的制备方法或其基本思路,不只限于制备子宫内膜异位症的生物标志物组合;还可以用于制备类似的与生殖道内微生物群DNA存在关联的病症的生物标志物组合,例如子宫腺肌症的生物标志物组合。It should be noted that the method for preparing the biomarker combination of endometriosis in the present application is to use a random forest model to fit and verify the association between the microbial DNA in the reproductive tract and endometriosis. A biomarker combination is obtained that is capable of assessing the risk or risk of endometriosis. It will be understood that the preparation method of the present application or its basic idea is not limited to the preparation of a biomarker combination for endometriosis; it can also be used to prepare a biomarker combination of similar conditions associated with the presence of microbial DNA in the reproductive tract. For example, a biomarker combination of adenomyosis.
由于采用以上技术方案,本申请的有益效果在于:Due to the adoption of the above technical solutions, the beneficial effects of the present application are:
本申请的用于子宫内膜异位症检测的生物标志物,具有很好的特异性,能够很好的检测出子宫内膜异位症,为子宫内膜异位症的检测或风险评估提供了一条新的途径,并且,能够用于子宫内膜异位症的早期诊断。The biomarker for detecting endometriosis of the present application has good specificity and can detect endometriosis well, and provides for detection or risk assessment of endometriosis. A new approach and can be used for early diagnosis of endometriosis.
本申请的其它主要优点包括:Other key advantages of this application include:
(a)本申请的生物标志物组合用于子宫内膜异位症的检测或患病风险评估,具有高灵敏性、高特异性的优点,具有重要的应用价值。(a) The biomarker combination of the present application is used for the detection of endometriosis or the risk assessment of the disease, and has the advantages of high sensitivity and high specificity, and has important application value.
(b)生殖道样品作为生物标志物组合检测样本具有取材方便、操作步骤简单和可连续体外检测等优点。(b) The genital tract sample as a biomarker combination test sample has the advantages of convenient material selection, simple operation steps and continuous in vitro detection.
(c)本申请的生物标志物组合用于子宫内膜异位症的检测或患病风险评估具有重复性好的特点。(c) The biomarker combination of the present application is useful for the detection of endometriosis or for assessing the risk of disease with reproducible characteristics.
附图说明DRAWINGS
图1是本申请实施例中基于阴道下1/3处CL的标志物组鉴别子宫内膜异位症引发不孕的结果图,图中,a为随着OTU数量的增加,对随机森林鉴别子宫内膜异位症(不孕)进行5次10折交叉验证的错误率分布情况,b为经过交叉 验证过的组合的接收者操作曲线(缩写ROC曲线),曲线下面积(缩写AUC)为0.8272,阴影面积代表95%置信区间,对角线代表AUC为0.5的曲线;1 is a graph showing the results of infertility caused by endometriosis based on a marker group of CL at the lower third of the vagina in the embodiment of the present application, wherein a is a random forest identification as the number of OTUs increases. Endometriosis (infertility) 5 times 10 fold cross-validation error rate distribution, b is crossed The verified combined receiver operating curve (abbreviated ROC curve), the area under the curve (abbreviated AUC) is 0.8272, the shaded area represents the 95% confidence interval, and the diagonal represents the curve with an AUC of 0.5;
图2是本申请实施例中基于阴道后穹窿CU的标志物组鉴别子宫内膜异位症(不孕)的结果图,图中,a为随着OTU数量的增加,对随机森林鉴别子宫内膜异位症(不孕)进行5次10折交叉验证的错误率分布情况,b为经过交叉验证过的组合的接收者操作曲线(缩写ROC曲线),曲线下面积(缩写AUC)为0.5919,阴影面积代表95%置信区间,对角线代表AUC为0.5的曲线;2 is a diagram showing the results of identifying endometriosis (infertility) based on the marker group of vaginal posterior sacral CU in the embodiment of the present application, in which a is an identification of random forests in the uterus as the number of OTUs increases. Membrane ectopic (infertility) 5 times 10 fold cross-validation error rate distribution, b is the cross-validated combination receiver operating curve (abbreviated ROC curve), the area under the curve (abbreviated AUC) is 0.5919, The shaded area represents a 95% confidence interval and the diagonal represents a curve with an AUC of 0.5;
图3是本申请实施例中基于宫颈管CV标志物组鉴别子宫内膜异位症(不孕)的结果图,图中,a为随着OTU数量的增加,对随机森林鉴别子宫内膜异位症(不孕)进行5次10折交叉验证的错误率分布情况,b为经过交叉验证过的组合的接收者操作曲线,曲线下面积为0.8493,阴影面积代表95%置信区间,对角线代表AUC为0.5的曲线;3 is a diagram showing the results of identifying endometriosis (infertility) based on the CV marker group of the cervical canal in the embodiment of the present application, in which a is an identification of endometrial abnormalities in random forests as the number of OTUs increases. The error rate distribution of 5 cases of 10-fold cross-validation in the case of infertility (b infertility), b is the receiver's operation curve of the cross-validated combination, the area under the curve is 0.8493, the shaded area represents 95% confidence interval, diagonal a curve representing an AUC of 0.5;
图4是本申请实施例中阴道下1/3处CL标志物组在第二群体中对子宫内膜异位症(不孕)进行鉴别的ROC曲线;4 is a ROC curve for identifying endometriosis (infertility) in a second population of the CL marker group at the lower third of the vagina in the embodiment of the present application;
图5是本申请实施例中阴道后穹窿CU标志物组在第二群体中对子宫内膜异位症(不孕)进行鉴别的ROC曲线;5 is a ROC curve for identifying endometriosis (infertility) in a second population of the vaginal posterior sputum CU marker group in the examples of the present application;
图6是本申请实施例中宫颈管CV标志物组在第二群体中对子宫内膜异位症(不孕)进行鉴别的ROC曲线;6 is a ROC curve for identifying endometriosis (infertility) in a second population of the cervical canal CV marker group in the examples of the present application;
图中,变量数量是指OTU数量,其中,灵敏性=真阳性/(真阳性+假阴性);特异度=真阴性/(真阴性+假阳性)。In the figure, the number of variables refers to the number of OTUs, where sensitivity = true positive / (true positive + false negative); specificity = true negative / (true negative + false positive).
具体实施方式detailed description
本申请的生物标志物,是根据采集对象三个部位的微生物群DNA与子宫内膜异位症之间的关系而得出的,本申请的生物标志物实际上就是这三个部位的能够体现子宫内膜异位症状态的微生物OTU。具体的,本申请的一种制备方法中,这种对应关系或者生物标志物的获得,是以OTU种子序列的相对丰度为一个对象,子宫内膜异位症状态(患病或非患病)为第二个对象,通过随机森林模型对两者进行拟合,最终通过5次十折交叉验证而得出的。本申请通过严格的计算和试验研究,最终获得了三个部位的14种微生物的二十四条核酸作为本申请的生物标志物。The biomarker of the present application is obtained based on the relationship between the microbial DNA of the three parts of the subject and the endometriosis. The biomarkers of the present application are actually the three parts. Microorganism OTU in the state of endometriosis. Specifically, in a preparation method of the present application, the correspondence or biomarker is obtained by using the relative abundance of the OTU seed sequence as a subject, endometriosis state (sick or non-diseased) For the second object, the two are fitted through a random forest model, and finally obtained through five 10-fold cross-validation. Through rigorous calculations and experimental studies, the present application finally obtained twenty-four nucleic acids of 14 microorganisms in three sites as biomarkers of the present application.
本申请的一种实现方式中,三个部位的标志物组可以独立的对子宫内膜异位症患病或风险进行评估,但是,结合三个部位的概率,判断待测对象是否患有子宫内膜异位症或是否具有患子宫内膜异位症的风险,这样准确性会更高。In one implementation of the present application, the marker group of the three sites can independently evaluate the risk or risk of endometriosis, but the probability of combining the three sites is used to determine whether the subject has a uterus. Endometriosis or the risk of endometriosis, the accuracy will be higher.
本申请所用术语是本领域普通技术人员通常理解的含义。为了更好地理解本申请,对一些定义和相关术语的解释如下: The terms used in this application are those that are generally understood by those of ordinary skill in the art. For a better understanding of this application, some definitions and related terms are explained as follows:
本申请的“子宫内膜异位症”,是一种常见的妇科疾病,定义为子宫内膜组织生长在子宫腔以外引起的病症。最常见的子宫内膜异位症是出现于卵巢及输卵管,也有可能出现于子宫肌层、盆腔腹膜,甚至是膀胱及大肠。其中因卵巢的子宫内膜异位,可形成内有棕色液体的子宫内膜瘤,所以又有“巧克力囊肿”或“朱古力瘤”之称,会影响怀孕。本申请的“子宫内膜异位症”和“子宫内膜异位症(不孕)”特指本领域常见的子宫内膜异位症引发不孕。本申请的“非子宫内膜异位症”特指本领域常见的没有子宫内膜异位症,且可怀孕。"Endometriosis" of the present application is a common gynecological disease, defined as a condition caused by the growth of endometrial tissue outside the uterine cavity. The most common endometriosis occurs in the ovaries and fallopian tubes, and may also occur in the myometrium, pelvic peritoneum, and even the bladder and large intestine. Among them, endometriosis of the ovary can form an endometrial tumor with a brown liquid inside, so there is also a "chocolate cyst" or "juliek tumor", which will affect pregnancy. "Endometriosis" and "endometriosis (infertility)" of the present application specifically refer to infertility caused by endometriosis which is common in the art. "Non-endometriosis" of the present application specifically refers to the absence of endometriosis, which is common in the art, and can be pregnant.
本申请的生物标志物质的水平通过相对丰度指示。The levels of the biomarker materials of the present application are indicated by relative abundance.
在本申请的一个实施方式中,参考值是指健康对照的参考值或正常值。本领域的技术人员清楚,在样品数量足够多情况下,每个生物标志物的正常值,即绝对值,的范围可以通过检验和计算方法得到。In one embodiment of the present application, the reference value refers to a reference value or a normal value of a healthy control. It will be apparent to those skilled in the art that in the case where the number of samples is sufficient, the range of normal values, i.e., absolute values, of each biomarker can be obtained by inspection and calculation.
本申请的“生物标志物”,也称为“生物学标志物”,是指个体的生物状态的可测量指标。这样的生物标记物可以是在个体中的任何物质,只要它们与被检个体的特定生物状态,例如疾病,有关系即可。这样的生物标记物可以是,例如,核酸标志物(例如DNA)、蛋白质标志物、细胞因子标记物、趋化因子标记物、碳水化合物标志物、抗原标志物、抗体标志物、物种标志物(种/属的标记)和功能标志物(KO/OG标记)等。本申请的生物标志物具体的为DNA核酸标志物。A "biomarker", also referred to as a "biological marker," as used herein, refers to a measurable indicator of the biological state of an individual. Such a biomarker may be any substance in an individual as long as they are related to a specific biological state of the individual to be examined, such as a disease. Such biomarkers can be, for example, nucleic acid markers (eg, DNA), protein markers, cytokine markers, chemokine markers, carbohydrate markers, antigenic markers, antibody markers, species markers ( Species/genus markers) and functional markers (KO/OG markers). The biomarkers of the present application are specifically DNA nucleic acid markers.
本申请的“OTU”是指操作分类单元(operational taxonomic units缩写OTU),是在系统发生学研究或群体遗传学研究中,为了便于进行分析,人为给某一个分类单元,如品系、种、属、分组等,设置的同一标志。本申请中按照97%的相似性阈值将序列划分为一个OTU,由此使得三个部位的样品分别可以获得多个OTU,每一个OTU被视为一个微生物物种。样品中的微生物多样性和不同微生物的丰度都是基于对OTU的分析。The term "OTU" in this application refers to the operation taxonomic units (OTU), which is in the phylogenetic study or population genetics research. In order to facilitate the analysis, a certain classification unit, such as strain, species, and genus, is artificially given. , grouping, etc., set the same flag. In the present application, the sequence is divided into an OTU according to a 97% similarity threshold, whereby a plurality of OTUs can be obtained for each of the three sites, and each OTU is regarded as a microbial species. The microbial diversity in the samples and the abundance of different microorganisms are based on an analysis of the OTU.
本申请中提到的“个体”指动物,特别是哺乳动物,如灵长类动物,本申请的实施例中所指为人。As used herein, "individual" refers to an animal, particularly a mammal, such as a primate, referred to as a human in the examples of the present application.
下面通过具体实施例和附图对本申请作进一步详细说明。以下实施例仅对本申请进行进一步说明,不应理解为对本申请的限制。The present application will be further described in detail below by way of specific embodiments and the accompanying drawings. The following examples are only intended to further illustrate the present application and are not to be construed as limiting the invention.
实施例Example
1.材料与方法1. Materials and methods
1.1样品收集 1.1 Sample Collection
本例的样品采集由深圳北大医院妇产科医生协助进行。排除炎症病例,研究对象均为非经期、非妊娠期、非哺乳期女性,无内分泌和自身免疫性疾病,肝肾功能正常。取样前一段时间没有使用激素及抗生素,没有进行阴道用药、阴道灌洗及宫颈治疗,取样前48小时内没有进行性生活。根据以上标准,本例筛选出49例育龄女性,作为第一群体。所有符合以上标准的个体都进行详细的表型信息登记,以了解其病史、家族史、用药史及生活习惯等,并且均签署了知情同意书。The sample collection in this case was assisted by a gynaecologist at Shenzhen Peking University Hospital. Excluding the cases of inflammation, the subjects were non-menstrual, non-pregnancy, non-lactation women, no endocrine and autoimmune diseases, normal liver and kidney function. No hormones or antibiotics were used for some time before sampling, no vaginal medication, vaginal lavage and cervical treatment, and no sexual life was performed within 48 hours before sampling. According to the above criteria, this example screened 49 women of childbearing age as the first group. All individuals who meet the above criteria are registered with detailed phenotypic information to understand their medical history, family history, medication history and lifestyle, and have signed informed consent.
下生殖道采样是在个体入院后,不经过消毒处理,排空小便后,在妇科检查床采集阴道下1/3(缩写CL)、阴道后穹窿(缩写CU)、宫颈管(缩写CV)三个部位的分泌物样品。具体的,49个采集对象的样品编号及采样信息为,编号C026、C028、C033、C035、C038、C041、C045、C048、C050、C055、C056、C058、C059、C062、C063、C064、C065的十七个采集对象为非子宫内膜异位症患者,十七个采集对象都采集了CL、CU和CV三个部位的样品;编号T022、T024、T027、T028、T032、T033、T036、T039、T041、T042、T045、T053、T056、T058、T059、T061、T062、T063、T067、T069、T070、T076、T078、T084、T085、T086、T087、T088、T090、T092、T094、T095的三十二个采集对象为子宫内膜异位症患者,三十二个采集对象都采集了CL、CU和CV三个部位的样品。The lower genital tract sampling is performed after the individual is admitted to the hospital, without disinfection, after emptying the urine, the lower third of the vagina (abbreviation CL), the posterior vagina (abbreviated CU), and the cervical canal (abbreviated CV) are collected in the gynecological examination bed. A sample of secretions at each site. Specifically, the sample numbers and sampling information of the 49 collection objects are number C026, C028, C033, C035, C038, C041, C045, C048, C050, C055, C056, C058, C059, C062, C063, C064, C065. Seventeen subjects were non-endometriosis patients, and seventeen subjects collected samples of CL, CU and CV; number T022, T024, T027, T028, T032, T033, T036, T039 , T041, T042, T045, T053, T056, T058, T059, T061, T062, T063, T067, T069, T070, T076, T078, T084, T085, T086, T087, T088, T090, T092, T094, T095 Twelve subjects were patients with endometriosis, and thirty-two subjects collected samples of CL, CU and CV.
样品采集是利用尼龙绒屑拭子进行样品收集,尼龙绒屑拭子购自晨阳全球集团CY-93050和CY-98000两种型号。取样后将拭子头用液氮进行速冻,并保存于-80℃,用干冰运送至深圳华大基因研究院进行后续的试验。Sample collection was performed using a nylon fluff swab for sample collection. Nylon fluff swabs were purchased from Chenyang Global Group CY-93050 and CY-98000. After sampling, the swab head was quickly frozen with liquid nitrogen, stored at -80 ° C, and transported to Shenzhen Huada Gene Research Institute with dry ice for subsequent experiments.
1.2DNA提取与16S测序1.2 DNA extraction and 16S sequencing
本例利用QIAamp DNA Mini Kit试剂盒(购自QIAGEN)进行DNA提取。具体提取步骤参照生产厂商提供的说明书进行。采用16S rRNA基因V4-V5高变区特异引物进行扩增,两条引物分别为V4-515F和V5-907R,V4-515F为Seq ID No.25所示序列,V5-907R为Seq ID No.26所示序列。In this example, DNA extraction was performed using the QIAamp DNA Mini Kit (purchased from QIAGEN). The specific extraction steps are carried out in accordance with the instructions provided by the manufacturer. The 16S rRNA gene V4-V5 hypervariable region-specific primers were used for amplification. The two primers were V4-515F and V5-907R, V4-515F was the sequence shown by Seq ID No. 25, and V5-907R was Seq ID No. The sequence shown in 26.
Seq ID No.25:5’-GTGCCAGCMGCCGCGGTAA-3’Seq ID No. 25: 5'-GTGCCAGCMGCCGCGGTAA-3'
Seq ID No.26:5’-CCGTCAATTCMTTTRAGT-3’Seq ID No. 26: 5'-CCGTCAATTCMTTTRAGT-3’
PCR程序如下,94℃变性3min;然后进入25个循环:94℃变性45s,50℃退火60s,72℃延伸90s;循环结束后,72℃延伸10min。得到的PCR产物利用AMPure Beads(Axygen)进行纯化,测序采用芯片泳道测序的方法,将多个样品混合后测序。所以文库构建需要在各样品的引物序列外端连接10bp的barcode序列后,添加接头序列。通过对每个样品添加不同的barcode序列,即样本识别 序列,区分不同样本。文库构建完成后,通过Ion torrent PGM测序平台,进行V5-V4反向测序,以上文库构建和测序等由深圳华大基因进行。The PCR procedure was as follows, denaturation at 94 ° C for 3 min; then into 25 cycles: denaturation at 94 ° C for 45 s, annealing at 50 ° C for 60 s, extension at 72 ° C for 90 s; after the end of the cycle, extension at 72 ° C for 10 min. The obtained PCR product was purified by AMPure Beads (Axygen), and sequencing was carried out by chip lane sequencing, and a plurality of samples were mixed and sequenced. Therefore, the library construction requires the addition of a linker sequence after ligation of a 10 bp barcode sequence at the outer end of the primer sequence of each sample. By adding different barcode sequences to each sample, ie sample identification Sequence, distinguishing between different samples. After the library was constructed, V5-V4 reverse sequencing was performed by Ion torrent PGM sequencing platform. The above library construction and sequencing were carried out by Shenzhen Huada Gene.
1.3 16S测序数据处理1.3 16S sequencing data processing
利用Mothur软件(V1.33.3)从PGM系统中提取原始数据并进行预处理,高质量序列的标准包括:1)长度大于200bp;2)与简并PCR错配碱基少于2个;3)平均质量分数大于25。基于16S rRNA基因序列,利用QIIME的uclust方法对OTU进行聚类,相似阈值设置为97%。选取每个OTU的种子序列(Seed sequence),利用Greengene数据库中的参照基因信息gg_13_8_otus进行注释。计算每个样本中每一个OTU的相对丰度,其中某一OTU的相对丰度为某个样本中该OTU的丰度与该样本中所有OTU丰度之和的比值。The raw data was extracted and pre-processed from the PGM system using Mothur software (V1.33.3). The standards for high-quality sequences include: 1) length greater than 200 bp; 2) less than 2 mismatched bases with degenerate PCR; 3) The average quality score is greater than 25. Based on the 16S rRNA gene sequence, the OTU was clustered using the QIIME uclust method, and the similarity threshold was set to 97%. A seed sequence of each OTU was selected and annotated using the reference gene information gg_13_8_otus in the Greengene database. The relative abundance of each OTU in each sample is calculated, where the relative abundance of an OTU is the ratio of the abundance of the OTU in a sample to the sum of all OTU abundances in the sample.
1.4不同位点样品间微生物群一致性分析1.4 Analysis of microbial consistency between samples at different sites
基于OTU的存在或缺失,本例利用Sorenson指数(
Figure PCTCN2017096249-appb-000001
–Dice指数)来测量同一个体不同位点样品微生物群的相似性,计算方法如下:
Based on the presence or absence of OTU, this example uses the Sorenson index (
Figure PCTCN2017096249-appb-000001
–Dice index) to measure the similarity of microbial populations at different sites in the same individual, calculated as follows:
Figure PCTCN2017096249-appb-000002
Figure PCTCN2017096249-appb-000002
其中A和B分别代表样品A和B中OTU的数目,C代表两个样品中共有的OTU数目。QS是相似性指数,取值范围为0~1。本例中分别计算了CL和CU的相似性指数,CL和CV的相似性指数,以及CU和CV的相似性指数。相似性指数约接近1,表示两个采样部位的微生物群的相似性越高。Where A and B represent the number of OTUs in samples A and B, respectively, and C represents the number of OTUs shared in the two samples. QS is a similarity index and ranges from 0 to 1. In this example, the similarity index of CL and CU, the similarity index of CL and CV, and the similarity index of CU and CV were calculated. The similarity index is close to 1, indicating that the similarity of the microbiota at the two sampling sites is higher.
1.5随机森林分类器1.5 random forest classifier
为了建立一个能够鉴别异常状态样品的模型,对于每个采样部位,利用R软件(3.1.2RC)中randomForest工具包对每个样品的OTU的相对丰度与子宫内膜异位症状态进行拟合,采用默认参数;其中,每个样品的OTU是至少存在于10%的样品中的OTU,也就是说,剔除在各个部位的所有待测样品中只在有不到10%的样品中才能检出的OTU。之后进行5次10折交叉验证,将5次10折交叉验证的误差曲线进行平均,将平均后曲线的最低误差加上该点的标准误差作为可接受误差的域值。在分类误差小于域值的各组OTU中,其中,OTU数目最少的为最优OTU组合,作为鉴别子宫内膜异位症的生物标志物组合。In order to establish a model that can identify abnormal state samples, for each sampling site, the relative abundance of each sample and the endometriosis state were fitted using the randomForest toolkit in R software (3.1.2RC). The default parameters are used; wherein the OTU of each sample is an OTU present in at least 10% of the sample, that is, only less than 10% of all samples to be tested in each part can be inspected. Out of the OTU. Then, five 10-fold cross-validation is performed, and the error curves of the five 10-fold cross-validation are averaged, and the lowest error of the average post-curve is added to the standard error of the point as the domain value of the acceptable error. Among the groups of OTUs whose classification error is smaller than the domain value, the least number of OTUs is the optimal OTU combination as a biomarker combination for identifying endometriosis.
1.6生物标志物验证1.6 Biomarker verification
为了验证本例得到的生物标志物,本例另外采用了独立的受试群体,即第二群体进行验证。第二群体中,对于CL和CU,各有11位子宫内膜异位症患者和11位非子宫内膜异位症个体;对于CV,有12位子宫内膜异位症患者和10 位非子宫内膜异位症个体。In order to verify the biomarkers obtained in this example, this example additionally used an independent test population, that is, the second population for verification. In the second group, there were 11 endometriosis patients and 11 non-endometriosis individuals for CL and CU; for CV, there were 12 patients with endometriosis and 10 A non-endometriosis individual.
2.实验结果2. Experimental results
2.1同一个体内上下生殖道微生物结构特征及变化趋势2.1 Microbial structural characteristics and trends of the same in vivo genital tract
为了探索生殖道不同区域微生物群之间的关系,本例计算了同一个体的样品之间的距离。相对于下阴道1/3(CL)样品而言,从阴道后穹窿(CU)、宫颈管(CV)粘液到子宫和腹腔液的加权UniFrac距离依次增加。这也再一次指明随着解剖学结构由下至上,女性生殖道的群落结构呈现连续变化性。In order to explore the relationship between microbial populations in different regions of the reproductive tract, this example calculates the distance between samples of the same individual. The weighted UniFrac distance from the posterior vagina (CU), cervical (CV) mucus to the uterus and peritoneal fluid increased sequentially relative to the lower vaginal 1/3 (CL) sample. This again indicates that the community structure of the female reproductive tract is continuously changing as the anatomical structure is from bottom to top.
同一个体中不同部位的样品呈现高度相关性,不同部位样品之间的Sorenson指数与它们的解剖学结构相一致。宫颈(CV)粘液与腹腔液样本具有显著的相关性,平均Sorenson指数为0.255,表明在普通人群中可以通过分析易取得的宫颈粘液样本来评价宫腔和腹腔的健康状况。Samples in different parts of the same individual are highly correlated, and the Sorenson indices between samples in different parts are consistent with their anatomical structures. There was a significant correlation between cervical (CV) mucus and peritoneal fluid samples, with an average Sorenson index of 0.255, indicating that the uterine cavity and abdominal cavity health can be evaluated by analyzing readily available cervical mucus samples in the general population.
此外,本例还分别通过阴道、宫腔底部对宫颈粘液取样,发现两种途径取样取得样品的细菌分布显示出高度的相似性,进一步表明可以通过分析易获得的宫颈管样本来评价宫腔微生物的情况。In addition, in this case, the cervical mucus was sampled through the vagina and the uterine cavity, respectively. It was found that the bacterial distribution of the samples taken by the two routes showed a high degree of similarity, further indicating that the uterine cavity microorganisms can be evaluated by analyzing the easily available cervical tube samples. Case.
2.2与疾病相关的微生物2.2 Disease-related microorganisms
为了得到用来鉴别子宫内膜异位症的OTU生物标志物,本例建立随机森林模型,具体步骤为:(1)以OTU相对丰度作为输入特征,设计基于第一群体的随机森林模型;(2)对于随机森林模型,设计10折交叉验证算法,把第一群体分为子宫内膜异位症个体与非子宫内膜异位症个体两类,并分别得到随机森林模型的ROC曲线,以各ROC曲线下面积AUC值作为评价指标。In order to obtain the OTU biomarkers used to identify endometriosis, this example establishes a random forest model. The specific steps are as follows: (1) Using the relative abundance of OTU as an input feature, design a random forest model based on the first population; (2) For the random forest model, a 10-fold cross-validation algorithm was designed. The first group was divided into two types: endometriosis individuals and non-endometriosis individuals, and the ROC curves of random forest models were obtained. The area AUC value under each ROC curve was used as an evaluation index.
本例利用随机森林模型,并结合10折交叉验证,得到了各个部位最优的生物标志物,如表1所示,用于鉴别子宫内膜异位症。表2至表4分别为三个部位的标志物组在样品中的富集信息,表5至表7分别为三个部位的标志物组在第一群体样品的相对丰度信息。本例中,三个部位的生物标志物,鉴别子宫内膜异位症的结果,如图1至图3所示,图1为阴道下1/3处(CL)的标志物组鉴别子宫内膜异位症,图2为阴道后穹窿(CU)的标志物组鉴别子宫内膜异位症,图3为宫颈管(CV)的标志物组鉴别子宫内膜异位症。In this example, a random forest model was used, combined with a 10-fold cross-validation, to obtain the optimal biomarkers for each part, as shown in Table 1, for the identification of endometriosis. Tables 2 to 4 show the enrichment information of the marker group of the three sites in the sample, and Tables 5 to 7 respectively show the relative abundance information of the marker group of the three sites in the first population sample. In this example, the biomarkers at three sites identify the results of endometriosis, as shown in Figures 1 to 3. Figure 1 shows the marker in the lower third of the vagina (CL). Membrane ectopic disease, Figure 2 is the marker group of vaginal posterior iliac crest (CU) to identify endometriosis, and Figure 3 is the marker group of cervical canal (CV) to identify endometriosis.
表1生物标志物及其所属的各个部位Table 1 biomarkers and their respective parts
Seq ID No.Seq ID No. OTU编号OTU number OTU分类OTU classification CL CL CUCU CVCV
11 11 Lactobacillus sp.Lactobacillus sp. ----
22 55 LeptotrichiaceaeLeptotrichiaceae ---- ----
33 1111 Vagococcus sp.Vagococcus sp. ---- ----
44 1212 Delftia sp.Delftia sp. ---- ----
55 2727 Dysgonomonas sp.Dysgonomonas sp. ---- ----
66 3333 Aerococcus sp.Aerococcus sp. ---- ----
77 3535 Prevotella sp.Prevotella sp. ----
88 3838 Lactobacillus sp.Lactobacillus sp. ---- ----
99 4242 TissierellaceaeTissierellaceae ---- ----
1010 4848 ComamonadaceaeComamonadaceae ---- ----
1111 5454 ErysipelotrichaceaeErysipelotrichaceae ---- ----
1212 6161 Lactobacillus sp.Lactobacillus sp. ----
1313 6464 Dialister sp.Dialister sp. ---- ----
1414 7070 Erysipelothrix sp.Erysipelothrix sp. ---- ----
1515 8686 Anaerococcus sp.Anaerococcus sp. ---- ----
1616 108108 Dysgonomonas sp.Dysgonomonas sp. ---- ----
1717 221221 Lactobacillus sp.Lactobacillus sp. ---- ----
1818 233233 Lactobacillus sp.Lactobacillus sp. ---- ----
1919 344344 Lactobacillus sp.Lactobacillus sp. ---- ----
2020 424424 Lactobacillus inersLactobacillus iners ---- ----
21twenty one 464464 Prevotella sp.Prevotella sp. ---- ----
22twenty two 520520 Lactobacillus inersLactobacillus iners ---- ----
23twenty three 628628 Lactobacillus inersLactobacillus iners ---- ----
24twenty four 663663 Lactobacillus sp.Lactobacillus sp. ---- ----
表1中,CL、CU、CV三个部位的标记物可以单独分别做判断,“√”是表示针对该部位进行判断时所需用到的生物标志物,“--”表示不需要用到的。In Table 1, the markers of the three parts of CL, CU, and CV can be judged separately, and "√" is a biomarker used for judging the part. "--" means that it is not needed. of.
在进行样品检测时,要计算各部位的“√”的OTU的相对丰度,将相对丰度输入随机森林模型,得到结果,判断是否为子宫内膜异位症。In the sample test, the relative abundance of the OTU of each part is calculated, and the relative abundance is input into the random forest model to obtain the result, and it is judged whether it is endometriosis.
表2 CL中标志物组各OTU丰度信息Table 2 OTU abundance information of the marker group in CL
Figure PCTCN2017096249-appb-000003
Figure PCTCN2017096249-appb-000003
表3 CU中标志物组各OTU丰度信息Table 3 OTU abundance information of marker groups in CU
Figure PCTCN2017096249-appb-000004
Figure PCTCN2017096249-appb-000004
表4 CV中标志物组各OTU丰度信息Table 4 OTU abundance information of the marker group in CV
Figure PCTCN2017096249-appb-000005
Figure PCTCN2017096249-appb-000005
表2至表4中,子宫内膜异位症组是指第一群体的49个采集对象中患有子宫内膜异位症的样品,对照组是指第一群体的49个采集对象中没有患子宫内膜异位症的样品。In Tables 2 to 4, the endometriosis group refers to the sample of endometriosis among the 49 subjects in the first group, and the control group refers to the 49 subjects in the first group. A sample of endometriosis.
表5 CL中标志物组各OTU在第一群体中的丰度信息Table 5 Abundance information of each OTU of the marker group in the first group in CL
Figure PCTCN2017096249-appb-000006
Figure PCTCN2017096249-appb-000006
Figure PCTCN2017096249-appb-000007
Figure PCTCN2017096249-appb-000007
表6 CU中标志物组各OTU在第一群体中的丰度信息 Table 6 Abundance information of each OTU of the marker group in the CU in the first population
Figure PCTCN2017096249-appb-000008
Figure PCTCN2017096249-appb-000008
表7 CV中标志物组各OTU在第一群体中的丰度信息Table 7 Abundance information of each OTU of the marker group in the CV in the first population
Figure PCTCN2017096249-appb-000009
Figure PCTCN2017096249-appb-000009
Figure PCTCN2017096249-appb-000010
Figure PCTCN2017096249-appb-000010
图1为阴道下1/3处(CL)的标志物组鉴别子宫内膜异位症,图中,a图为随着OTU数量的增加,对随机森林鉴别子宫内膜异位症进行5次10折交叉验证的错误率分布情况,该模型用样品中OTU的相对丰度进行训练,总计采用了17位非子宫内膜异位症个体和32位子宫内膜异位症个体的CL样品,黑色线代表5次试验的平均值,灰色线则分别代表5次试验,黑色竖线代表最佳组合中OTU数目;b图为经过交叉验证过的组合的接收者操作曲线,曲线下面积AUC为0.8272,阴影面积代表95%置信区间,对角线代表AUC为0.5的曲线。Figure 1 shows the endometriosis identified by the marker group at the lower third of the vagina (CL). In the figure, a is a randomized forest identification of endometriosis 5 times as the number of OTUs increases. 10% cross-validation error rate distribution, the model was trained with the relative abundance of OTU in the sample, using a total of 17 non-endometriosis individuals and 32 endometriosis CL samples, The black line represents the average of 5 trials, the gray line represents 5 trials, the black vertical line represents the number of OTUs in the best combination, and the b plot is the cross-validated combination receiver operating curve with the area under the curve AUC At 0.8272, the shaded area represents a 95% confidence interval and the diagonal represents a curve with an AUC of 0.5.
图2为阴道后穹窿(CU)的标志物组鉴别子宫内膜异位症,图中,a图为随着OTU数量的增加,对随机森林鉴别子宫内膜异位症进行5次10折交叉验 证的错误率分布情况,该模型用样品中OTU的相对丰度进行训练,总计采用了17位非子宫内膜异位症个体和32位子宫内膜异位症个体的CU样品,黑色线代表5次试验的平均值,灰色线分别为5次试验,黑色竖线代表最佳组合中OTU数目;b图为经过交叉验证过的组合的接收者操作曲线,曲线下面积AUC为0.5919,阴影面积代表95%置信区间,对角线代表AUC为0.5的曲线。Figure 2 shows the identification of endometriosis in the marker group of posterior vaginal vault (CU). In the figure, a is a 10-fold crossover of random forest identification of endometriosis with increasing number of OTUs. Test The distribution of error rates, the model was trained with the relative abundance of OTU in the sample, using a total of 17 non-endometriosis individuals and 32 uterine endometriosis CU samples, black lines represent The average of 5 trials, the gray line is 5 trials respectively, the black vertical line represents the number of OTUs in the best combination; the b diagram is the receiver operation curve of the cross-validated combination, the area under the curve AUC is 0.5919, the shaded area Represents a 95% confidence interval and the diagonal represents a curve with an AUC of 0.5.
图3为宫颈管(CV)的标志物组鉴别子宫内膜异位症,图中,a图为随着OTU数量的增加,对随机森林鉴别子宫内膜异位症进行5次10折交叉验证的错误率分布情况,该模型用样品中OTU的相对丰度进行训练,总计采用了17位非子宫内膜异位症个体和32位子宫内膜异位症个体的CV样品,黑色线代表5次试验的平均值,灰色线分别为5次试验,黑色竖线代表最佳组合中OTU数目;b图为经过交叉验证过的组合的接收者操作曲线,曲线下面积AUC为0.8493,阴影面积代表95%置信区间,对角线代表AUC为0.5的曲线。Figure 3 shows the identification of endometriosis in the cervical canal (CV) marker group. In the figure, a is a 10-fold cross-validation of random forest identification of endometriosis with increasing number of OTUs. The distribution of error rates, the model was trained with the relative abundance of OTU in the sample, using a total of 17 CV samples from individuals with non-endometriosis and 32 individuals with endometriosis, with black lines representing 5 The average of the subtests, the gray line is 5 trials respectively, the black vertical line represents the number of OTUs in the best combination; the b diagram is the receiver operation curve of the cross-validated combination, the area under the curve AUC is 0.8493, and the shaded area represents 95% confidence interval, diagonal represents a curve with an AUC of 0.5.
由图1至图3的结果可以看出,三个不同位点的OTU生物标志物组能够鉴别子宫内膜异位症个体和非子宫内膜异位症个体;ROC的曲线下面积AUC值分别为0.8272(CL),0.5919(CU)和0.8493(CV)。其中,AUC是曲线下面积,该值越大,即越接近1,表示判断能力越强,即判断越准确。It can be seen from the results of Fig. 1 to Fig. 3 that the OTU biomarker group at three different sites can identify individuals with endometriosis and non-endometriosis individuals; the area under the curve of the ROC is AUC It is 0.8272 (CL), 0.5919 (CU) and 0.8493 (CV). Among them, AUC is the area under the curve, and the larger the value, that is, the closer to 1, indicating that the judgment ability is stronger, that is, the more accurate the judgment.
2.3生物标志物验证2.3 Biomarker verification
将随机森林得到的OTU生物标志物在第二群体样品中进行验证,结果如表8、表9和表10所示。表8至表10中,样品编号例如C003CL、C003CU、C003CV,分别表示采集自同样一个C003采样对象的CL、CU、CV三个部位的样品。表8至表10为三个标志物组预测个体患有子宫内膜异位症的概率,由此得到的ROC曲线依序为图4至图6。表8至表10中,概率>0.5认为通过该部位的标志物组判断个体具有患子宫内膜异位症的风险或者患有子宫内膜异位症。The OTU biomarkers obtained from the random forest were verified in the second population samples, and the results are shown in Table 8, Table 9, and Table 10. In Tables 8 to 10, sample numbers such as C003CL, C003CU, and C003CV respectively indicate samples of three parts of CL, CU, and CV collected from the same C003 sampling object. Tables 8 to 10 show the probability that the three marker groups predict individuals with endometriosis, and the ROC curves thus obtained are sequentially shown in Figs. 4 to 6 . In Tables 8 to 10, the probability > 0.5 is considered to be that the individual has a risk of suffering from endometriosis or endometriosis through the marker group at the site.
表8 CL部位的CL标志物组预测第二群体样品患有子宫内膜异位症的概率Table 8 CL marker group at the CL site predicts the probability of a second population sample suffering from endometriosis
样品编号Sample serial number 是否子宫内膜异位(不孕)(N:否;Y是)Whether endometriosis (infertility) (N: No; Y is) 概率Probability
C001CLC001CL NN 0.4270.427
C003CLC003CL NN 0.4280.428
C004CLC004CL NN 0.2610.261
C005CLC005CL NN 0.2950.295
C007CLC007CL NN 0.4270.427
C008CLC008CL NN 0.5410.541
C009CLC009CL NN 0.5770.577
C012CLC012CL NN 0.3370.337
C014CLC014CL NN 0.4470.447
C019CLC019CL NN 0.5630.563
C021CLC021CL NN 0.6490.649
T000CLT000CL YY 0.5750.575
T001CLT001CL YY 0.6910.691
T003CLT003CL YY 0.8510.851
T005CLT005CL YY 0.5260.526
T006CLT006CL YY 0.7310.731
T008CLT008CL YY 0.3660.366
T015CLT015CL YY 0.8080.808
T017CLT017CL YY 0.7460.746
T081CLT081CL YY 0.3300.330
T082CLT082CL YY 0.7840.784
T083CLT083CL YY 0.7610.761
表9 CU部位的CU标志物组预测第二群体样品患有子宫内膜异位症的概率Table 9 CU marker group at CU site predicts the probability of second population sample suffering from endometriosis
样品编号Sample serial number 是否子宫内膜异位(不孕)(N:否;Y是)Whether endometriosis (infertility) (N: No; Y is) 概率Probability
C001CUC001CU NN 0.2850.285
C003CUC003CU NN 0.1060.106
C004CUC004CU NN 0.3690.369
C005CUC005CU NN 0.4430.443
C007CUC007CU NN 0.2980.298
C008CUC008CU NN 0.5880.588
C009CUC009CU NN 0.6750.675
C012CUC012CU NN 0.9880.988
C014CUC014CU NN 0.1210.121
C019CUC019CU NN 0.1880.188
C021CUC021CU NN 0.5840.584
T001CUT001CU YY 0.9430.943
T003CUT003CU YY 0.5140.514
T005CUT005CU YY 0.9690.969
T006CUT006CU YY 0.8850.885
T000CUT000CU YY 0.4750.475
T008CUT008CU YY 0.1350.135
T015CUT015CU YY 0.8380.838
T017CUT017CU YY 0.9430.943
T081CUT081CU YY 0.7220.722
T082CUT082CU YY 0.9430.943
T083CUT083CU YY 0.9920.992
表10 CV部位的CV标志物组预测第二群体样品患有子宫内膜异位症的概率Table 10 CV marker group at CV site predicts the probability of second population sample suffering from endometriosis
样品编号Sample serial number 是否子宫内膜异位(不孕)(N:否;Y是)Whether endometriosis (infertility) (N: No; Y is) 概率Probability
C003CVC003CV NN 0.4150.415
C004CVC004CV NN 0.4150.415
C005CVC005CV NN 0.3510.351
C007CVC007CV NN 0.3880.388
C008CVC008CV NN 0.3470.347
C009CVC009CV NN 0.5150.515
C012CVC012CV NN 0.4180.418
C014CVC014CV NN 0.4170.417
C019CVC019CV NN 0.1520.152
C021CVC021CV NN 0.1520.152
T000CVT000CV YY 0.5460.546
T001CVT001CV YY 0.2860.286
T003CVT003CV YY 0.6320.632
T004CVT004CV YY 0.3950.395
T005CVT005CV YY 0.6330.633
T006CVT006CV YY 0.7230.723
T008CVT008CV YY 0.6420.642
T015CVT015CV YY 0.5540.554
T017CVT017CV YY 0.7170.717
T081CVT081CV YY 0.4480.448
T082CVT082CV YY 0.5730.573
T083CVT083CV YY 0.4080.408
图4的结果显示CL部位基于CL标志物组判断子宫内膜异位症概率,其AUC值为0.8750;图5的结果显示CU部位基于CU标志物组判断子宫内膜异位症概率,其AUC值为0.840;图6的结果显示CV部位基于CV标志物组判断子宫内膜异位症概率,其AUC值为0.9189;可见,这三个标志物组具有较高的鉴别能力,能够用于子宫内膜异位症的检测,该结果与表8至表10的结果相符。表8至表10的结果中,三个标志物组预测的概率,其中至少一个大于0.5,则判断个体具有患子宫内膜异位症的风险或者患有子宫内膜异位症,由此获得的判断结果,与实际情况相符。The results in Figure 4 show that the CL site is based on the CL marker group to determine the probability of endometriosis with an AUC value of 0.8750; the results in Figure 5 show that the CU site is based on the CU marker group to determine the probability of endometriosis, its AUC The value of 0.840; the results of Figure 6 show that the CV site based on the CV marker group to determine the probability of endometriosis, its AUC value is 0.9189; it can be seen that these three marker groups have a higher discriminating ability, can be used in the uterus Detection of endometriosis, the results are consistent with the results of Tables 8 to 10. In the results of Tables 8 to 10, the probability of the three marker groups predicted, at least one of which is greater than 0.5, determines that the individual is at risk of suffering from endometriosis or has endometriosis, thereby obtaining The judgment result is consistent with the actual situation.
以上内容是结合具体的实施方式对本申请所作的进一步详细说明,不能认定本申请的具体实施只局限于这些说明。对于本申请所属技术领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干简单推演或替换。 The above content is a further detailed description of the present application in conjunction with the specific embodiments, and the specific implementation of the present application is not limited to the description. For the ordinary person skilled in the art to which the present invention pertains, a number of simple deductions or substitutions may be made without departing from the spirit of the present application.

Claims (18)

  1. 一种用于子宫内膜异位症检测或患病风险评估的生物标志物组合,其特征在于:所述生物标志物组合包括二十四条核酸中的至少一条,所述二十四条核酸分别为Seq ID No.1至Seq ID No.24所示序列,或者分别为与Seq ID No.1至Seq ID No.24所示序列具有97%以上相似性的序列。A biomarker combination for endometriosis detection or risk assessment of disease, characterized in that said biomarker combination comprises at least one of twenty-four nucleic acids, said twenty-four nucleic acids The sequences shown in Seq ID No. 1 to Seq ID No. 24, respectively, or sequences having 97% or more similarity to the sequences shown in Seq ID No. 1 to Seq ID No. 24, respectively.
  2. 一种用于子宫内膜异位症检测或患病风险评估的生物标志物组合,其特征在于:所述生物标志物组合包括第一标志物组、第二标志物组和第三标志物组中的至少一组;A biomarker combination for endometriosis detection or risk assessment of disease, characterized in that the biomarker combination comprises a first marker group, a second marker group and a third marker group At least one of the groups;
    所述第一标志物组由十四条核酸组成,所述十四条核酸分别为Seq ID No.1、Seq ID No.2、Seq ID No.6、Seq ID No.7、Seq ID No.12、Seq ID No.13、Seq ID No.15、Seq ID No.17至Seq ID No.22、Seq ID No.24所示序列,或者分别为与Seq ID No.1、Seq ID No.2、Seq ID No.6、Seq ID No.7、Seq ID No.12、Seq ID No.13、Seq ID No.15、Seq ID No.17至Seq ID No.22、Seq ID No.24所示序列具有97%以上相似性的序列;The first marker group is composed of fourteen nucleic acids, which are Seq ID No. 1, Seq ID No. 2, Seq ID No. 6, Seq ID No. 7, and Seq ID No., respectively. 12. The sequence shown by Seq ID No. 13, Seq ID No. 15, Seq ID No. 17 to Seq ID No. 22, Seq ID No. 24, or Seq ID No. 1, Seq ID No. 2, respectively. , Seq ID No. 6, Seq ID No. 7, Seq ID No. 12, Seq ID No. 13, Seq ID No. 15, Seq ID No. 17 to Seq ID No. 22, and Seq ID No. 24. a sequence having a sequence similarity of 97% or more;
    所述第二标志物组由两条核酸组成,所述两条核酸分别为Seq ID No.1、Seq ID No.7所示序列,或者分别为与Seq ID No.1、Seq ID No.7所示序列具有97%以上相似性的序列;The second marker group is composed of two nucleic acids, respectively, which are sequences shown by Seq ID No. 1, Seq ID No. 7, or Seq ID No. 1, Seq ID No. 7, respectively. The sequence shown has a sequence of 97% or more similarity;
    所述第三标志物组由十一条核酸组成,所述十一条核酸分别为Seq ID No.3至Seq ID No.5、Seq ID No.8至Seq ID No.12、Seq ID No.14、Seq ID No.16、Seq ID No.23所示序列,或者分别为与Seq ID No.3至Seq ID No.5、Seq ID No.8至Seq ID No.12、Seq ID No.14、Seq ID No.16、Seq ID No.23所示序列具有97%以上相似性的序列。The third marker group is composed of eleven nucleic acids, which are Seq ID No. 3 to Seq ID No. 5, Seq ID No. 8 to Seq ID No. 12, and Seq ID No., respectively. 14. The sequence shown by Seq ID No. 16, Seq ID No. 23, or Seq ID No. 3 to Seq ID No. 5, Seq ID No. 8 to Seq ID No. 12, Seq ID No. 14, respectively. The sequence shown by Seq ID No. 16, Seq ID No. 23 has a sequence similarity of 97% or more.
  3. 根据权利要求2所述的生物标志物组合,其特征在于:所述第一标志物组为CL标志物组,用于对来自阴道下1/3的样品进行子宫内膜异位症检测或患病风险评估。The biomarker combination according to claim 2, wherein the first marker group is a CL marker group for detecting endometriosis or a sample from a lower third of the vagina. Disease risk assessment.
  4. 根据权利要求2所述的生物标志物组合,其特征在于:所述第二标志物组为CU标志物组,用于对来自阴道后穹窿的样品进行子宫内膜异位症检测或患病风险评估。The biomarker combination according to claim 2, wherein the second marker group is a CU marker group for detecting endometriosis or risk of a disease from a vaginal posterior iliac crest. Evaluation.
  5. 根据权利要求2所述的生物标志物组合,其特征在于:所述第三标志物组为CV标志物组,用于对来自宫颈管的样品进行子宫内膜异位症检测或患病风险评估。The biomarker combination according to claim 2, wherein the third marker group is a CV marker group for performing endometriosis detection or risk assessment of a sample from the cervical canal. .
  6. 一种子宫内膜异位症检测或患病风险评估的试剂盒,其特征在于:所述试剂盒包含用于检测权利要求1-5任一项所述的生物标志物组合的引物对,所述引 物对的正向引物为SEQ ID No.25所示序列,反向引物为SEQ ID No.26所示序列。A kit for detecting endometriosis or risk assessment of a disease, characterized in that the kit comprises a primer pair for detecting the biomarker combination according to any one of claims 1 to 5, Reference The forward primer of the pair is the sequence shown in SEQ ID No. 25, and the reverse primer is the sequence shown in SEQ ID No. 26.
  7. 根据权利要求1-5任一项所述的生物标志物组合在子宫内膜异位症药物筛选或者在制备子宫内膜异位症检测或患病风险评估的试剂盒或检测工具中的应用。Use of the biomarker combination according to any one of claims 1 to 5 for the screening of endometriosis drugs or for the preparation of a kit or detection tool for endometriosis detection or risk assessment of disease.
  8. 一种子宫内膜异位症的检测方法,其特征在于:包括以下步骤,A method for detecting endometriosis characterized by comprising the following steps,
    (1)对待测对象进行样品采集,检测所采集的样品中权利要求1-5任一项所述的生物标志物组合,并分析生物标志物组合中各核酸的水平;(1) performing sample collection on the object to be tested, detecting the biomarker combination according to any one of claims 1 to 5 in the collected sample, and analyzing the level of each nucleic acid in the biomarker combination;
    (2)将步骤(1)测得的各核酸的水平与参考数据集或参考值进行比较,获得检测结果;(2) comparing the level of each nucleic acid measured in the step (1) with a reference data set or a reference value to obtain a detection result;
    优选的,所述各核酸的水平为各核酸的相对丰度;所述参考数据集或参考值为来源于子宫内膜异位症患者和非子宫内膜异位症对照的生物标志物组合中各核酸的水平。Preferably, the level of each nucleic acid is the relative abundance of each nucleic acid; the reference data set or reference value is in a biomarker combination derived from a patient with endometriosis and a non-endometriosis control The level of each nucleic acid.
  9. 根据权利要求8所述的检测方法,其特征在于:所述步骤(2)中的参考数据集或参考值为表5、表6或表7中的至少一组;将各核酸的水平与参考数据集或参考值进行比较获得检测结果,具体包括,利用多元统计模型计算得出患病概率,优选地,所述多元统计模型为随机森林模型。The detecting method according to claim 8, wherein the reference data set or reference value in the step (2) is at least one of the table 5, the table 6 or the table 7; the level and reference of each nucleic acid are referred to The data set or the reference value is compared to obtain the detection result, which specifically includes calculating the disease probability by using the multivariate statistical model. Preferably, the multivariate statistical model is a random forest model.
  10. 根据权利要求8或9所述的检测方法,其特征在于:所述步骤(1)中对待测对象进行样品采集,包括采集待测对象阴道下1/3样品、阴道后穹窿样品和宫颈管样品。The detecting method according to claim 8 or 9, wherein in the step (1), the sample to be measured is subjected to sample collection, including collecting the lower third of the vagina sample, the posterior vaginal sample and the cervical canal sample of the object to be tested. .
  11. 一种通过检测生物标志物判断子宫内膜异位症的方法在制备子宫内膜异位症检测或患病风险评估试剂盒或工具中的应用;所述生物标志物为权利要求1-5任一项所述的生物标志物组合;A method for determining endometriosis by detecting biomarkers for use in preparing a kit or tool for assessing endometriosis detection or disease risk; the biomarker is claim 1-5 a combination of said biomarkers;
    所述通过检测生物标志物判断子宫内膜异位症的方法包括以下步骤,The method for determining endometriosis by detecting a biomarker includes the following steps,
    (1)对待测对象进行样品采集,检测所采集的样品中权利要求1-5任一项所述的生物标志物组合,并分析生物标志物组合中各核酸的水平;(1) performing sample collection on the object to be tested, detecting the biomarker combination according to any one of claims 1 to 5 in the collected sample, and analyzing the level of each nucleic acid in the biomarker combination;
    (2)将步骤(1)测得的各核酸的水平与参考数据集或参考值进行比较,获得检测结果;(2) comparing the level of each nucleic acid measured in the step (1) with a reference data set or a reference value to obtain a detection result;
    优选的,所述各核酸的水平为各核酸的相对丰度;所述参考数据集或参考值为来源于子宫内膜异位症患者和非子宫内膜异位症对照的生物标志物组合中各核酸的水平。Preferably, the level of each nucleic acid is the relative abundance of each nucleic acid; the reference data set or reference value is in a biomarker combination derived from a patient with endometriosis and a non-endometriosis control The level of each nucleic acid.
  12. 根据权利要求11所述的应用,其特征在于:所述步骤(2)中的参考数据集或参考值为表5、表6或表7中的至少一组;将各核酸的水平与参考数据集或参考值进行比较获得检测结果,具体包括,利用多元统计模型计算得出患病概率, 优选地,所述多元统计模型为随机森林模型。The use according to claim 11, wherein the reference data set or reference value in the step (2) is at least one of the table 5, the table 6 or the table 7; the level of each nucleic acid and the reference data The set or reference value is compared to obtain the test result, which specifically includes calculating the disease probability by using the multivariate statistical model. Preferably, the multivariate statistical model is a random forest model.
  13. 一种筛选治疗子宫内膜异位症的候选药物的方法,其特征在于:包括以下步骤,A method for screening a drug candidate for treating endometriosis, comprising the steps of:
    1)分别测定用药前和用药后的样品中权利要求1-5任一项所述的生物标志物组合,并分析生物标志物组合中各核酸的水平;1) separately determining the biomarker combination according to any one of claims 1 to 5 in the sample before and after administration, and analyzing the level of each nucleic acid in the biomarker combination;
    2)根据比较用药前和用药后的样品中各核酸的水平,判断候选药物;2) judging the candidate drug according to the level of each nucleic acid in the sample before and after the drug is compared;
    所述步骤2)中,比较用药前和用药后的样品中各核酸的水平,具体包括,利用多元统计模型计算得出患病概率,优选地,所述多元统计模型为随机森林模型。In the step 2), comparing the levels of each nucleic acid in the sample before and after administration, specifically including calculating the probability of disease by using a multivariate statistical model, preferably, the multivariate statistical model is a random forest model.
  14. 一种女性生殖道内微生物群的检测方法,其特征在于:包括以下步骤,A method for detecting a microbiota in a female reproductive tract, comprising: the following steps,
    (1)采集待测对象生殖道内微生物样品,检测所采集的样品中权利要求1-5任一项所述的生物标志物组合,并分析生物标志物组合中各核酸的水平;(1) collecting a microbial sample in the reproductive tract of the test subject, detecting the biomarker combination according to any one of claims 1 to 5 in the collected sample, and analyzing the level of each nucleic acid in the biomarker combination;
    (2)将步骤(1)测得的各核酸的水平与参考数据集或参考值进行比较,获得检测结果;(2) comparing the level of each nucleic acid measured in the step (1) with a reference data set or a reference value to obtain a detection result;
    优选的,所述各核酸的水平为各核酸的相对丰度;所述参考数据集或参考值为来源于子宫内膜异位症人群和非子宫内膜异位症人群对照的生物标志物组合中各核酸的水平。Preferably, the level of each nucleic acid is the relative abundance of each nucleic acid; the reference data set or reference value is a biomarker combination derived from a population of endometriosis and a population of non-endometriosis controls The level of each nucleic acid.
  15. 根据权利要求14所述的检测方法,其特征在于:所述步骤(2)中的参考数据集或参考值为表5、表6或表7中的至少一组;将各核酸的水平与参考数据集或参考值进行比较获得检测结果,具体包括,利用多元统计模型计算得出患病概率,优选地,所述多元统计模型为随机森林模型。The detecting method according to claim 14, wherein the reference data set or reference value in the step (2) is at least one of the table 5, the table 6 or the table 7; the level and reference of each nucleic acid The data set or the reference value is compared to obtain the detection result, which specifically includes calculating the disease probability by using the multivariate statistical model. Preferably, the multivariate statistical model is a random forest model.
  16. 根据权利要求14或15所述的检测方法,其特征在于:所述步骤(1)中采集待测对象生殖道内微生物样品,具体包括采集待测对象阴道下1/3样品、阴道后穹窿样品和宫颈管样品。The detecting method according to claim 14 or 15, wherein in the step (1), the microbial sample in the genital tract of the object to be tested is collected, which comprises collecting the lower third of the vagina sample, the vaginal posterior iliac sample and the sample to be tested. Cervical tube samples.
  17. 一种制备子宫内膜异位症生物标志物组合的方法,其特征在于:包括以下步骤,A method for preparing a biomarker combination of endometriosis, comprising the steps of:
    (1)分别对子宫内膜异位症病患和非病患进行生殖道内微生物样品采集,对所有采集的样品分别进行16S测序;(1) Collecting microbial samples in the reproductive tract of endometriosis patients and non-patients, respectively, and performing 16S sequencing on all collected samples;
    (2)将16S测序结果进行聚类分析,获得OTU单元以及每个OTU的种子序列,并计算每个OTU单元的相对丰度;(2) Clustering the 16S sequencing results to obtain the OTU unit and the seed sequence of each OTU, and calculate the relative abundance of each OTU unit;
    (3)利用随机森林模型对每个OTU单元的相对丰度与子宫内膜异位症状态进行拟合,并进行5次十折交叉验证,得到最优的OTU组合,最优OTU组合中各OTU的种子序列,即组成子宫内膜异位症的生物标志物组合。 (3) Using the random forest model to fit the relative abundance of each OTU unit with the endometriosis state, and perform five 10-fold cross-validation to obtain the optimal OTU combination, and the optimal OTU combination. The seed sequence of OTU, a biomarker combination that constitutes endometriosis.
  18. 根据权利要求17所述的方法,其特征在于:所述步骤(1)中,生殖道内微生物样品采集,具体包括采集待测对象阴道下1/3样品、阴道后穹窿样品和宫颈管样品。 The method according to claim 17, wherein in the step (1), the microbial sample is collected in the genital tract, and specifically comprises collecting the lower third of the vagina sample, the posterior vaginal sputum sample and the cervical canal sample of the test subject.
PCT/CN2017/096249 2016-09-19 2017-08-07 Biomarker composition for detection of endometriosis and application thereof WO2018049947A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201780047955.4A CN109715828B (en) 2016-09-19 2017-08-07 Biomarker combination for detecting endometriosis and application thereof

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610831390.6A CN107858416B (en) 2016-09-19 2016-09-19 Biomarker combination for detecting endometriosis and application thereof
CN201610831390.6 2016-09-19

Publications (1)

Publication Number Publication Date
WO2018049947A1 true WO2018049947A1 (en) 2018-03-22

Family

ID=61618620

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/096249 WO2018049947A1 (en) 2016-09-19 2017-08-07 Biomarker composition for detection of endometriosis and application thereof

Country Status (2)

Country Link
CN (2) CN107858416B (en)
WO (1) WO2018049947A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112941169B (en) * 2021-03-24 2022-05-31 浙江大学医学院附属邵逸夫医院 Method for detecting internal abnormal disease related IVF fate based on granular cell gene expression
KR20230105080A (en) * 2022-01-03 2023-07-11 연세대학교 산학협력단 Method for providing information for diagnosis of endometriosis

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060105405A1 (en) * 2004-10-20 2006-05-18 Onco Detectors International, Llc Migration inhibitory factor in serum as a tumor marker for prostate, bladder, breast, ovarian, kidney, pancreatic and lung cancer and for diagnosis of endometriosis
US20070287676A1 (en) * 2006-05-16 2007-12-13 Sun-Wei Guo Diagnosis and treatment of endometriosis
US20080318237A1 (en) * 2007-04-25 2008-12-25 Regents Of The University Of California Methods of diagnosing endometriosis
US20090220589A1 (en) * 2008-03-01 2009-09-03 Abraxis Bioscience, Llc Treatment, diagnostic, and method for discovering antagonist using sparc specific mirnas
CN103026227A (en) * 2010-04-22 2013-04-03 不列颠哥伦比亚省癌症分社 Novel biomarkers and targets for ovarian carcinoma
CN103249421A (en) * 2010-11-29 2013-08-14 株式会社明治 Endometriosis prevention and/or improving agent, and food or drink composition containing same
WO2013148151A1 (en) * 2012-03-29 2013-10-03 University Of Pittsburgh - Of The Commonwealth System Of Higher Education Plasma microribonucleic acids as biomarkers for endometriosis and endometriosis-associated ovarian cancer
CN104067124A (en) * 2011-09-01 2014-09-24 新加坡意志私人有限公司 Method for detecting nucleosomes containing nucleotides
WO2015128671A1 (en) * 2014-02-27 2015-09-03 Queen Mary University Of London Biomarkers for endometriosis
US20150291959A1 (en) * 2014-04-15 2015-10-15 Kaohsiung Medical University Diagnosis and Treatment of Endometriosis
US20150368714A1 (en) * 2013-03-07 2015-12-24 Juneau Biosciences, Llc Method of Testing for Endometriosis and Treatment Therefor

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101210929A (en) * 2006-12-29 2008-07-02 中国医学科学院北京协和医院 Method for detecting endometriosis blood plasma marker protein
CN101334410B (en) * 2007-06-26 2013-03-27 沙桂华 Method and reagent kit for detecting endometriosis
CN105543385A (en) * 2016-01-29 2016-05-04 山东大学齐鲁医院 Irritable bowel syndrome microbial marker and application thereof
CN105907879B (en) * 2016-06-29 2019-08-13 北京泱深生物信息技术有限公司 Carcinoma of endometrium biomarker

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060105405A1 (en) * 2004-10-20 2006-05-18 Onco Detectors International, Llc Migration inhibitory factor in serum as a tumor marker for prostate, bladder, breast, ovarian, kidney, pancreatic and lung cancer and for diagnosis of endometriosis
US20070287676A1 (en) * 2006-05-16 2007-12-13 Sun-Wei Guo Diagnosis and treatment of endometriosis
US20080318237A1 (en) * 2007-04-25 2008-12-25 Regents Of The University Of California Methods of diagnosing endometriosis
US20090220589A1 (en) * 2008-03-01 2009-09-03 Abraxis Bioscience, Llc Treatment, diagnostic, and method for discovering antagonist using sparc specific mirnas
CN103026227A (en) * 2010-04-22 2013-04-03 不列颠哥伦比亚省癌症分社 Novel biomarkers and targets for ovarian carcinoma
CN103249421A (en) * 2010-11-29 2013-08-14 株式会社明治 Endometriosis prevention and/or improving agent, and food or drink composition containing same
CN104067124A (en) * 2011-09-01 2014-09-24 新加坡意志私人有限公司 Method for detecting nucleosomes containing nucleotides
WO2013148151A1 (en) * 2012-03-29 2013-10-03 University Of Pittsburgh - Of The Commonwealth System Of Higher Education Plasma microribonucleic acids as biomarkers for endometriosis and endometriosis-associated ovarian cancer
US20150368714A1 (en) * 2013-03-07 2015-12-24 Juneau Biosciences, Llc Method of Testing for Endometriosis and Treatment Therefor
WO2015128671A1 (en) * 2014-02-27 2015-09-03 Queen Mary University Of London Biomarkers for endometriosis
US20150291959A1 (en) * 2014-04-15 2015-10-15 Kaohsiung Medical University Diagnosis and Treatment of Endometriosis

Also Published As

Publication number Publication date
CN109715828A (en) 2019-05-03
CN107858416B (en) 2021-05-28
CN109715828B (en) 2022-07-19
CN107858416A (en) 2018-03-30

Similar Documents

Publication Publication Date Title
CN111172279B (en) Model for diagnosing lung cancer by combined detection of peripheral blood methylation gene and IDH1
CN105256014B (en) Breast cancer combined diagnosis marker and detection kit
CN104004840B (en) Test kit for early screening Yu diagnosis of prostate cancer
WO2022012293A1 (en) Peripheral blood tcr marker for endometrial cancer, test kit using same, and application thereof
WO2017202185A1 (en) Peripheral blood gene marker for screening benign and malignant small pulmonary nodules and use thereof
CN114277143B (en) Application of exosomes ARPC5, CDA and the like in lung cancer diagnosis
WO2018049947A1 (en) Biomarker composition for detection of endometriosis and application thereof
CN114438214A (en) Colorectal cancer tumor marker and detection method and device thereof
WO2018049946A1 (en) Biomarker composition for detection of adenomyosis and application thereof
TWI571514B (en) Method for accessing the risk of having colorectal cancer
CN113724862A (en) Colorectal cancer biomarker and screening method and application thereof
US20210310078A1 (en) Method for early diagnosis of breast cancer and monitoring after treatment using liquid biopsy multi-cancer gene biomarkers
WO2019204985A1 (en) Osteoporosis biomarker and use thereof
RU2503960C1 (en) Method for assessing conditions of proliferative cervical diseases
CN109762900B (en) Colorectal cancer marker and application thereof
CN109609639B (en) Colorectal cancer detection method and system
WO2016049927A1 (en) Biomarkers for obesity related diseases
CN109988834A (en) Blood plasma excretion body molecular marker has-miR-219a-5p and application
CN108064273A (en) The biomarker of colorectal cancer relevant disease
Gao et al. Imbalance of the gut microbiota may be associated with missed abortions: a perspective study from a general hospital of hunan province
WO2024067848A1 (en) Use of blood extracellular vesicle mirna in ovarian cancer diagnosis
Zhang et al. Association between vaginal Gardnerella and tubal pregnancy in women with symptomatic early pregnancies in China: a nested case-control study
RU2763707C1 (en) Method for predicting premature rupture of membranes in the period from 22 to 28 weeks of gestation
TWI690597B (en) Detection kit and detection method for urothelial carcinoma
JP2019176754A (en) Inspection method for ovulation disorders

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17850149

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17850149

Country of ref document: EP

Kind code of ref document: A1