CN113637746B - Methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof - Google Patents
Methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof Download PDFInfo
- Publication number
- CN113637746B CN113637746B CN202010346298.7A CN202010346298A CN113637746B CN 113637746 B CN113637746 B CN 113637746B CN 202010346298 A CN202010346298 A CN 202010346298A CN 113637746 B CN113637746 B CN 113637746B
- Authority
- CN
- China
- Prior art keywords
- seq
- artificial sequence
- primers
- dna
- probes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000003211 malignant effect Effects 0.000 title claims abstract description 77
- 206010056342 Pulmonary mass Diseases 0.000 title claims abstract description 48
- 239000000523 sample Substances 0.000 claims abstract description 294
- 239000003147 molecular marker Substances 0.000 claims abstract description 112
- 238000001514 detection method Methods 0.000 claims abstract description 101
- 230000007067 DNA methylation Effects 0.000 claims abstract description 73
- 230000000295 complement effect Effects 0.000 claims abstract description 30
- 230000011987 methylation Effects 0.000 claims description 36
- 238000007069 methylation reaction Methods 0.000 claims description 36
- 238000003753 real-time PCR Methods 0.000 claims description 29
- 230000002685 pulmonary effect Effects 0.000 claims description 19
- 239000003153 chemical reaction reagent Substances 0.000 claims description 10
- 238000002360 preparation method Methods 0.000 claims description 6
- 230000035945 sensitivity Effects 0.000 abstract description 67
- 239000012634 fragment Substances 0.000 abstract description 66
- 206010058467 Lung neoplasm malignant Diseases 0.000 abstract description 42
- 201000005202 lung cancer Diseases 0.000 abstract description 42
- 208000020816 lung neoplasm Diseases 0.000 abstract description 42
- 238000007403 mPCR Methods 0.000 abstract description 17
- 230000003321 amplification Effects 0.000 abstract description 14
- 238000003199 nucleic acid amplification method Methods 0.000 abstract description 14
- 230000007547 defect Effects 0.000 abstract description 2
- 108020004414 DNA Proteins 0.000 description 243
- 230000000875 corresponding effect Effects 0.000 description 42
- 238000000034 method Methods 0.000 description 29
- 238000006243 chemical reaction Methods 0.000 description 28
- 210000002345 respiratory system Anatomy 0.000 description 25
- 206010054107 Nodule Diseases 0.000 description 23
- 238000012360 testing method Methods 0.000 description 19
- 238000004458 analytical method Methods 0.000 description 16
- 239000012530 fluid Substances 0.000 description 16
- 206010028980 Neoplasm Diseases 0.000 description 15
- 239000007788 liquid Substances 0.000 description 14
- 108090000623 proteins and genes Proteins 0.000 description 13
- 206010036790 Productive cough Diseases 0.000 description 11
- 239000003550 marker Substances 0.000 description 11
- 239000000203 mixture Substances 0.000 description 11
- 238000001574 biopsy Methods 0.000 description 10
- 201000011510 cancer Diseases 0.000 description 10
- 239000000047 product Substances 0.000 description 10
- 208000024794 sputum Diseases 0.000 description 10
- 210000003802 sputum Anatomy 0.000 description 10
- 210000004072 lung Anatomy 0.000 description 9
- 238000012408 PCR amplification Methods 0.000 description 8
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 7
- 238000003745 diagnosis Methods 0.000 description 6
- 239000013642 negative control Substances 0.000 description 6
- 238000003556 assay Methods 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 239000011259 mixed solution Substances 0.000 description 5
- 238000011002 quantification Methods 0.000 description 5
- 230000000241 respiratory effect Effects 0.000 description 5
- 238000012549 training Methods 0.000 description 5
- 239000008280 blood Substances 0.000 description 4
- 238000007405 data analysis Methods 0.000 description 4
- 238000007847 digital PCR Methods 0.000 description 4
- 230000036210 malignancy Effects 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 230000004083 survival effect Effects 0.000 description 4
- -1 HOAX1 Proteins 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 210000004027 cell Anatomy 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000003384 imaging method Methods 0.000 description 3
- 238000011528 liquid biopsy Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 239000012188 paraffin wax Substances 0.000 description 3
- 241000023308 Acca Species 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 102100022374 Homeobox protein DLX-4 Human genes 0.000 description 2
- 102100021090 Homeobox protein Hox-A9 Human genes 0.000 description 2
- 102100028404 Homeobox protein Hox-B4 Human genes 0.000 description 2
- 102100030634 Homeobox protein OTX2 Human genes 0.000 description 2
- 101000901614 Homo sapiens Homeobox protein DLX-4 Proteins 0.000 description 2
- 101000839788 Homo sapiens Homeobox protein Hox-B4 Proteins 0.000 description 2
- 101000584400 Homo sapiens Homeobox protein OTX2 Proteins 0.000 description 2
- 101000741974 Homo sapiens Phosphatidylinositol 3,4,5-trisphosphate-dependent Rac exchanger 1 protein Proteins 0.000 description 2
- 101001117509 Homo sapiens Prostaglandin E2 receptor EP4 subtype Proteins 0.000 description 2
- 101000712958 Homo sapiens Ras association domain-containing protein 1 Proteins 0.000 description 2
- 101000703741 Homo sapiens Short stature homeobox protein 2 Proteins 0.000 description 2
- 101000798377 Homo sapiens UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase-like protein 1 Proteins 0.000 description 2
- 101000915606 Homo sapiens Zinc finger protein 781 Proteins 0.000 description 2
- 102100038634 Phosphatidylinositol 3,4,5-trisphosphate-dependent Rac exchanger 1 protein Human genes 0.000 description 2
- 102100024450 Prostaglandin E2 receptor EP4 subtype Human genes 0.000 description 2
- 238000011529 RT qPCR Methods 0.000 description 2
- 102100033243 Ras association domain-containing protein 1 Human genes 0.000 description 2
- 102100031976 Short stature homeobox protein 2 Human genes 0.000 description 2
- 208000000017 Solitary Pulmonary Nodule Diseases 0.000 description 2
- LSNNMFCWUKXFEE-UHFFFAOYSA-N Sulfurous acid Chemical compound OS(O)=O LSNNMFCWUKXFEE-UHFFFAOYSA-N 0.000 description 2
- 102000006280 Twist-Related Protein 1 Human genes 0.000 description 2
- 108010083162 Twist-Related Protein 1 Proteins 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 102100028582 Zinc finger protein 781 Human genes 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical class NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 238000013399 early diagnosis Methods 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 108010027263 homeobox protein HOXA9 Proteins 0.000 description 2
- 230000003902 lesion Effects 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 230000001575 pathological effect Effects 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- 238000012257 pre-denaturation Methods 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 206010003598 Atelectasis Diseases 0.000 description 1
- 208000023275 Autoimmune disease Diseases 0.000 description 1
- 208000005623 Carcinogenesis Diseases 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 102100030309 Homeobox protein Hox-A1 Human genes 0.000 description 1
- 101001083156 Homo sapiens Homeobox protein Hox-A1 Proteins 0.000 description 1
- 206010062717 Increased upper airway secretion Diseases 0.000 description 1
- JLVVSXFLKOJNIY-UHFFFAOYSA-N Magnesium ion Chemical compound [Mg+2] JLVVSXFLKOJNIY-UHFFFAOYSA-N 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- 208000002151 Pleural effusion Diseases 0.000 description 1
- 208000007123 Pulmonary Atelectasis Diseases 0.000 description 1
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 1
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 1
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 1
- 238000000889 atomisation Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000001369 bisulfite sequencing Methods 0.000 description 1
- 230000036952 cancer formation Effects 0.000 description 1
- 231100000504 carcinogenesis Toxicity 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000002380 cytological effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 210000004704 glottis Anatomy 0.000 description 1
- 230000014200 hypermethylation of CpG island Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 238000007477 logistic regression Methods 0.000 description 1
- 238000000464 low-speed centrifugation Methods 0.000 description 1
- 229910001425 magnesium ion Inorganic materials 0.000 description 1
- 230000009401 metastasis Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000013188 needle biopsy Methods 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 208000026435 phlegm Diseases 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 238000012175 pyrosequencing Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000011895 specific detection Methods 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6851—Quantitative amplification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/112—Disease subtyping, staging or classification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/154—Methylation markers
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/16—Primer sets for multiplex assays
Abstract
The invention provides a DNA methylation molecular marker for detecting benign and malignant lung nodules, wherein the DNA methylation molecular marker is any one or a combination of more than two of sequences shown in SEQ ID NO. 1-SEQ ID NO.21 or continuous fragments of at least 55% of the full length of the sequences; or a complete complementary sequence selected from the sequences shown in SEQ ID NO. 1-SEQ ID NO.21 or any one or combination of more than two of continuous fragments with at least 55 percent of the full length of the sequence. The DNA methylation molecular marker is highly related to lung cancer, and particularly can be used in combination to further improve the sensitivity and specificity of detection on benign and malignant lung nodules, improve the detection rate of malignant lung nodules and reduce the false positive rate of detection. The primers and the probes in the kit overcome the defect that a plurality of primers and probes interfere with each other during multiplex PCR amplification and detection, and the quantitative performance is equivalent to that of a single region.
Description
Technical Field
The invention belongs to the technical field of biology, and particularly relates to a methylated molecular marker for detecting benign and malignant lung nodules as well as a combination and an application thereof.
Background
The pulmonary nodule, namely solitary pulmonary nodule (solitary pulmonary nodule), refers to a high-low density solid or sub-solid lesion which is imaged in a circular-like shadow, single, clear-boundary, has a diameter less than or equal to 3cm and is surrounded by lung tissues containing air, and is not accompanied by atelectasis, glottis or pleural effusion.
The pulmonary nodules are classified into benign and malignant types, and have no obvious symptoms, the benign nodules need to be treated aiming at the causes of diseases, and the malignant nodules need to be subjected to early operation. Benign causes are often associated with autoimmune diseases or various infections, and malignant causes are often associated with lung cancer.
Lung cancer is one of the most rapidly growing malignancies that threaten human health and life. However, treatment of patients in the early stages of lung cancer development is effective in improving their five-year survival rate. Statistical research data shows that the five-year survival rate of the patients with stage I lung cancer can reach more than 60-90% after receiving effective treatment, while the five-year survival rate of the patients with advanced lung cancer is less than 20%. Therefore, it is very important to diagnose and treat early cancer patients. Currently, clinically used lung cancer detection methods include: imaging detection, cytological detection and detection of molecular markers.
LDCT (low dose spiral CT) image detection is a commonly used and effective early lung cancer screening method. The method reduces the exposure dose to 10% -25% of the conventional CT on the premise of not reducing the discovery rate of nodule foci, and has high sensitivity to the nodule discovery rate. While LDCT can not determine the properties of the nodules, so that 4% -55% of benign lesions are over-treated, and the false positive rate is increased. 96.4% of the positive results of LDCT in the NLST assay were false positives.
Clinically, another main means for early diagnosis of lung cancer is routine cytology examination of exfoliated sputum. The cancer cells falling off from the surface of the lung cancer can be expectorated along with the phlegm, the cancer cells found by the inspection of the sputum cytology can be clearly diagnosed, the accuracy rate can reach more than 80 percent, and the sensitivity of the early lung cancer of the sputum cytology is only 20 to 30 percent. The sensitivity is influenced by the position and tissue type of the tumor and the correctness of the sputum specimen, and the technical level of doctors in the pathology department, so that the overall positive rate of detection is low.
The detection samples for detecting the lung cancer molecular markers mainly come from tissue biopsy and liquid biopsy. Although tissue samples are obtained directly from tumor sites through surgical operations or fiberbronchoscopy mainly, the detection accuracy is higher, but tissue biopsies have certain invasiveness, the existence of tumor heterogeneity and the failure to obtain tissue samples due to various reasons or the quantity of the tissue samples is insufficient to complete molecular detection, so that the effects of the tissue biopsies in early diagnosis, prediction of metastasis, prognosis and the like of lung cancer have certain limitations. Compared with tissue biopsy, the liquid biopsy has the advantages of simple and convenient operation, non-invasiveness, strong repeatability, contribution to dynamic monitoring of diseases and the like. The lung cancer liquid biopsy takes blood, sputum, alveolar lavage fluid and the like of a patient as samples, and detects and analyzes the DNA of tumor cells and the modification level thereof, such as DNA methylation and the like. For cancer patients, because the ctDNA content in blood is very low and the individual difference is large, improving the sensitivity is a great challenge for the detection method. Sputum and alveolar lavage fluid are collected, sputum can be collected through atomization induction expectoration in clinic, and alveolar lavage fluid can be obtained through a fiber bronchoscope, and the sample is directly from the lung, so that the sputum and alveolar lavage fluid has more advantages than a blood sample in the sensitivity of signal detection. In the sample collection mode, the sputum collection is a noninvasive operation, so that the sputum collection is safer; bronchofiberscope collected alveolar lavage (BAL) is a method of collecting effective fluid on the surface of an alveolar by injecting normal saline into the alveolar using a bronchoscope and then sucking out the fluid, and examining cellular components and soluble substances. Compared with percutaneous lung puncture and surgical biopsy, the method is an invasive biopsy method with higher safety.
With the popularization of multi-layer spiral CT and high-resolution CT and the improvement of health consciousness of people, more and more lung nodules are detected, which is good for patients and can find early lung cancer, but causes much trouble for patients, even many patients can be difficult to sleep after finding lung nodules, so that the method is important for distinguishing the benign or malignant lung nodules in time.
At present, pathological biopsy is mainly used for determining the benign and malignant lung nodules, but pathological biopsy is an invasive operation, and small nodules are not easy to take specimens through needle biopsy, so most lung nodules are determined by the size, shape, density and the like of the nodules. Benign and malignant nodules often have different imaging manifestations, and physicians judge whether the lung nodules are benign or malignant according to the imaging characteristics, so as to determine whether the nodules need surgical excision, further examination or regular follow-up.
DNA methylation is closely related to cancer occurrence, especially promoter hypermethylation of CpG island regions can cause transcriptional silencing of cancer suppressor genes, thereby influencing the progress of tumorigenesis, and is an ideal marker for cancer diagnosis because DNA methylation is found in almost all cancers and occurs in precancerous or early stages of cancer.
The detection rate of lung cancer can be enhanced by searching specific DNA methylation biomolecular markers based on lung cancer respiratory tract samples and jointly detecting a plurality of molecular markers related to lung cancer, which plays a key role in detecting the benign and malignant lung nodules.
Disclosure of Invention
Based on the above, the invention provides the DNA methylation molecular marker, the detection kit and the method for detecting the benign and malignant lung nodules, wherein the DNA methylation molecular marker is highly related to lung cancer, particularly the DNA methylation molecular marker is used in combination with the lung cancer, has good sensitivity and specificity for detecting the benign and malignant lung nodules, and can effectively improve the detection rate of the malignant lung nodules.
Technical solutions to achieve the above objects include the following.
A DNA methylation molecular marker or a combination thereof for detecting benign and malignant lung nodules, wherein the DNA methylation molecular marker is any one or a combination of more than two of sequences shown in SEQ ID NO. 1-SEQ ID NO. 21; or any one or the combination of more than two of complete complementary sequences selected from the sequences shown in SEQ ID NO. 1-SEQ ID NO. 21; or any one or the combination of more than two of continuous fragments selected from at least 55% of the full length of the sequences shown in SEQ ID NO.1 to SEQ ID NO.21, or any one or the combination of more than two of completely complementary sequences selected from at least 55% of the full length of the sequences shown in SEQ ID NO.1 to SEQ ID NO. 21.
In some of these embodiments, the DNA methylation molecular marker comprises the sequence shown in SEQ ID No.9, or comprises the complete complement of SEQ ID No.9, or comprises a contiguous fragment of at least 55% of the full length of the sequence shown in SEQ ID No.9, or comprises the complete complement of at least 55% of the full length of the sequence shown in SEQ ID No. 9.
In some of these embodiments, the DNA methylation molecular marker or combination thereof comprises the sequence shown in SEQ ID No.9 or the complete complement thereof, or a contiguous fragment of at least 55% of the full length of the sequence shown in SEQ ID No.9, or the complete complement of a contiguous fragment of at least 55% of the full length of the sequence shown in SEQ ID No. 9; and
at least one of the sequences shown in SEQ ID NO. 1-SEQ ID NO.8 and SEQ ID NO. 10-SEQ ID NO.21 or complementary sequences thereof, or at least one of the continuous fragments of at least 55% of the full length of the sequences shown in SEQ ID NO. 1-SEQ ID NO.8 and SEQ ID NO. 10-SEQ ID NO.21, or at least one of the complete complementary sequences of at least 55% of the continuous fragments of the full length of the sequences shown in SEQ ID NO. 1-SEQ ID NO.8 and SEQ ID NO. 10-SEQ ID NO. 21.
In some of these embodiments, the combination of DNA methylation molecular markers is a sequence comprising SEQ ID No.6 and SEQ ID No. 9; or a fully complementary sequence comprising SEQ ID No.6 and SEQ ID No. 9; or is a continuous fragment comprising at least 55% of the full length of the sequences shown in SEQ ID No.6 and SEQ ID No.9, or is a fully complementary sequence comprising at least 55% of the full length of the sequences shown in SEQ ID No.6 and SEQ ID No. 9.
In some of the embodiments, the combination of DNA methylation molecular markers further comprises any one or a combination of two or more of the sequences shown in SEQ ID NO.8, SEQ ID NO.7, SEQ ID NO.15 and SEQ ID NO. 16; or any one or the combination of more than two of complete complementary sequences of the sequences shown in SEQ ID NO.8, SEQ ID NO.7, SEQ ID NO.15 and SEQ ID NO. 16; or comprises any one or more than two of continuous fragments selected from at least 55% of the full length of the sequence shown in SEQ ID NO.8, SEQ ID NO.7, SEQ ID NO.15 and SEQ ID NO.16, or comprises any one or more than two of complete complementary sequences selected from at least 55% of the continuous fragments selected from the full length of the sequence shown in SEQ ID NO.8, SEQ ID NO.7, SEQ ID NO.15 and SEQ ID NO. 16.
In some of the embodiments, the combination of DNA methylation molecular markers further comprises any one or a combination of two or more of the sequences shown in SEQ ID NO.1, SEQ ID NO.10, SEQ ID NO.14 and SEQ ID NO. 21; or also comprises any one or the combination of more than two of complete complementary sequences of the sequences shown in SEQ ID NO.1, SEQ ID NO.10, SEQ ID NO.14 and SEQ ID NO. 21; or comprises any one or more than two of continuous fragments selected from at least 55% of the full length of the sequences shown in SEQ ID NO.1, SEQ ID NO.10, SEQ ID NO.14 and SEQ ID NO.21, or comprises any one or more than two of complete complementary sequences selected from at least 55% of the continuous fragments selected from the full length of the sequences shown in SEQ ID NO.1, SEQ ID NO.10, SEQ ID NO.14 and SEQ ID NO. 21.
In some of these embodiments, the combination of DNA methylation molecular markers is a sequence comprising SEQ ID No.6, SEQ ID No.7, SEQ ID No.8, SEQ ID No.9, SEQ ID No.15, and SEQ ID No. 16; or a complete complementary sequence comprising SEQ ID NO.6, SEQ ID NO.7, SEQ ID NO.8, SEQ ID NO.9, SEQ ID NO.15 and SEQ ID NO. 16; or is a continuous fragment comprising at least 55% of the full length of the sequence shown in SEQ ID NO.6, SEQ ID NO.7, SEQ ID NO.8, SEQ ID NO.9, SEQ ID NO.15 and SEQ ID NO.16, or is a fully complementary sequence comprising at least 55% of the continuous fragment selected from the group consisting of the full length of the sequence shown in SEQ ID NO.6, SEQ ID NO.7, SEQ ID NO.8, SEQ ID NO.9, SEQ ID NO.15 and SEQ ID NO. 16.
In some embodiments, the combination of DNA methylation molecular markers comprises any one or more than two of the sequences shown in SEQ ID NO.15 and SEQ ID NO. 16; or the DNA methylation molecular marker is any one or the combination of more than two of sequences shown in SEQ ID NO.15 and SEQ ID NO. 16; or any one or the combination of more than two of continuous fragments selected from at least 55% of the full length of the sequences shown in SEQ ID NO.15 and SEQ ID NO.16, or any one or the combination of more than two of completely complementary sequences comprising at least 55% of the full length of the sequences shown in SEQ ID NO.15 and SEQ ID NO. 16.
In some of these embodiments, the combination of DNA methylation molecular markers is a sequence comprising SEQ ID No.6, SEQ ID No.9, SEQ ID No.15, and SEQ ID No. 16; or a complete complementary sequence comprising the sequences shown in SEQ ID NO.6, SEQ ID NO.9, SEQ ID NO.15 and SEQ ID NO. 16; or is a continuous fragment comprising at least 55% of the full length of the sequence shown in SEQ ID No.6, SEQ ID No.9, SEQ ID No.15 and SEQ ID No.16, or is a fully complementary sequence comprising at least 55% of the full length of the sequence shown in SEQ ID No.6, SEQ ID No.9, SEQ ID No.15 and SEQ ID No. 16.
In some of these embodiments, the combination of DNA methylation molecular markers further comprises any one or a combination of two or more selected from the group consisting of the sequences shown in SEQ ID NO.2, SEQ ID NO.3, SEQ ID NO.4, SEQ ID NO.5, SEQ ID NO.11, SEQ ID NO.12, SEQ ID NO.13, SEQ ID NO.17, SEQ ID NO.18, SEQ ID NO.19, SEQ ID NO. 20; or further comprises any one or the combination of more than two of complete complementary sequences of the sequences shown in SEQ ID NO.2, SEQ ID NO.3, SEQ ID NO.4, SEQ ID NO.5, SEQ ID NO.11, SEQ ID NO.12, SEQ ID NO.13, SEQ ID NO.17, SEQ ID NO.18, SEQ ID NO.19 and SEQ ID NO. 20; or is a continuous fragment which also comprises at least 55% of the full length of the sequence shown by SEQ ID NO.2, SEQ ID NO.3, SEQ ID NO.4, SEQ ID NO.5, SEQ ID NO.11, SEQ ID NO.12, SEQ ID NO.13, SEQ ID NO.17, SEQ ID NO.18, SEQ ID NO.19, SEQ ID NO.20, or is a fully complementary sequence which also comprises at least 55% of the full length of the sequence shown by SEQ ID NO.2, SEQ ID NO.3, SEQ ID NO.4, SEQ ID NO.5, SEQ ID NO.11, SEQ ID NO.12, SEQ ID NO.13, SEQ ID NO.17, SEQ ID NO.18, SEQ ID NO.19, SEQ ID NO. 20.
In some embodiments, the combination of DNA methylation molecular markers is a sequence shown in SEQ ID NO. 1-SEQ ID NO. 21; or a complete complementary sequence of the sequences shown in SEQ ID NO. 1-SEQ ID NO. 21; or the DNA methylation molecular marker is a continuous fragment which is at least 55% of the full length of the sequence shown by SEQ ID NO. 1-SEQ ID NO. 21; or complete complementary sequences of continuous fragments of at least 55 percent of the full length of the sequences shown in SEQ ID NO. 1-SEQ ID NO. 21.
A contiguous segment of at least 55% of the full length of the sequence set forth above may be at least 55% of the full length of the sequence, or at least 58%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, etc. of the sequence.
In some embodiments, the DNA methylation molecular marker is a continuous fragment of at least 55% of the full length of the sequence shown in SEQ ID NO. 1-SEQ ID NO. 21:
the primers are the sequences corresponding to the fragments amplified by any one group of SEQ ID NO.22 and SEQ ID NO.23, SEQ ID NO.25 and SEQ ID NO.26, SEQ ID NO.28 and SEQ ID NO.29 in SEQ ID NO. 1;
and/or the primer is the sequence of any group of amplified fragments in SEQ ID NO.31 and SEQ ID NO.32, SEQ ID NO.34 and SEQ ID NO.35, SEQ ID NO.37 and SEQ ID NO.38 corresponding to the sequence of SEQ ID NO. 2;
and/or, the sequence of the fragment corresponding to the SEQ ID NO.3 of the SEQ ID NO.3 amplified by the primers of any one of the groups SEQ ID NO.40 and SEQ ID NO.41, SEQ ID NO.43 and SEQ ID NO.44, SEQ ID NO.46 and SEQ ID NO. 47;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.49 and SEQ ID NO.50, SEQ ID NO.52 and SEQ ID NO.53, SEQ ID NO.55 and SEQ ID NO.56 corresponds to the sequence in SEQ ID NO. 4;
and/or the corresponding sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.58 and SEQ ID NO.59, EQ ID NO.61 and SEQ ID NO.62, SEQ ID NO.64 and SEQ ID NO.65 in SEQ ID NO. 5;
and/or the corresponding sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.67 and SEQ ID NO.68, SEQ ID NO.70 and SEQ ID NO.71, SEQ ID NO.73 and SEQ ID NO.74 in SEQ ID NO. 6;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.76 and SEQ ID NO.77, SEQ ID NO.79 and SEQ ID NO.80, SEQ ID NO.82 and SEQ ID NO.83 corresponding to SEQ ID NO. 7;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.85 and SEQ ID NO.86, SEQ ID NO.88 and SEQ ID NO.89, SEQ ID NO.91 and SEQ ID NO.92 corresponds to the sequence in SEQ ID NO. 8;
and/or the corresponding sequence of the amplified fragment in SEQ ID NO.9 by taking the primer as any one group of SEQ ID NO.94 and SEQ ID NO.95, SEQ ID NO.97 and SEQ ID NO.98, SEQ ID NO.100 and SEQ ID NO. 101;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.103 and SEQ ID NO.104, SEQ ID NO.106 and SEQ ID NO.107, SEQ ID NO.109 and SEQ ID NO.110 corresponds to the sequence in SEQ ID NO. 10;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.112 and SEQ ID NO.113, SEQ ID NO.115 and SEQ ID NO.116, SEQ ID NO.118 and SEQ ID NO.119 corresponding to the sequence in SEQ ID NO. 11;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.121 and SEQ ID NO.122, SEQ ID NO.124 and SEQ ID NO.125, SEQ ID NO.127 and SEQ ID NO.128 corresponds to the sequence in SEQ ID NO. 12;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.130 and SEQ ID NO.131, SEQ ID NO.133 and SEQ ID NO.134, SEQ ID NO.136 and SEQ ID NO.137 in SEQ ID NO. 13;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.139 and SEQ ID NO.140, SEQ ID NO.142 and SEQ ID NO.143, SEQ ID NO.145 and SEQ ID NO.146 corresponds to the sequence in SEQ ID NO. 14;
and/or the sequence of the fragment amplified by any one group of the primers SEQ ID NO.148 and SEQ ID NO.149, SEQ ID NO.151 and SEQ ID NO.152, SEQ ID NO.154 and SEQ ID NO.155 in SEQ ID NO. 15;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.157 and SEQ ID NO.158, SEQ ID NO.160 and SEQ ID NO.161, SEQ ID NO.163 and SEQ ID NO.164 corresponds to the sequence in SEQ ID NO. 16;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.166 and SEQ ID NO.167, SEQ ID NO.169 and SEQ ID NO.170, SEQ ID NO.172 and SEQ ID NO.173 corresponding to the sequence in SEQ ID NO. 17;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.175 and 176, SEQ ID NO.178 and 179, SEQ ID NO.181 and 182 in SEQ ID NO. 18;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.184 and SEQ ID NO.185, SEQ ID NO.187 and SEQ ID NO.188, SEQ ID NO.190 and SEQ ID NO.191 in SEQ ID NO.19 corresponds to the sequence in SEQ ID NO. 19;
and/or the sequence of the fragment amplified by taking the primer as any one group of SEQ ID NO.193 and 194, 196 and 197, 199 and 200 in SEQ ID NO. 20;
and/or the corresponding sequence of the fragment amplified by using the primer as any one group of SEQ ID NO.202 and SEQ ID NO.203, SEQ ID NO.205 and SEQ ID NO.206, SEQ ID NO.208 and SEQ ID NO.209 in SEQ ID NO. 21.
In some embodiments, the DNA methylation molecular marker is a continuous fragment of at least 55% of the full length of the sequence shown in SEQ ID NO. 1-SEQ ID NO. 21:
the primers are the sequences of the amplified fragments of SEQ ID NO.22 and SEQ ID NO.23 corresponding to SEQ ID NO. 1;
and/or, the primers are the sequences of the amplified fragments of SEQ ID NO.34 and SEQ ID NO.35 corresponding to SEQ ID NO. 2;
and/or, the primers are the sequences of the fragments amplified by the SEQ ID NO.40 and the SEQ ID NO.41 corresponding to the SEQ ID NO. 3;
and/or, the primers are the sequences of the amplified fragments of SEQ ID NO.55 and SEQ ID NO.56 corresponding to SEQ ID NO. 4;
and/or, the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.58 and SEQ ID NO.59 in SEQ ID NO. 5;
and/or, the primers are the sequences of the amplified fragments of SEQ ID NO.67 and SEQ ID NO.68 corresponding to SEQ ID NO. 6;
and/or, the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.76 and SEQ ID NO.77 in SEQ ID NO. 7;
and/or, the primers are the sequences of the amplified fragments of SEQ ID NO.88 and SEQ ID NO.89 corresponding to the SEQ ID NO. 8;
and/or, the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.100 and SEQ ID NO.101 in SEQ ID NO. 9;
and/or, the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.109 and SEQ ID NO.110 in SEQ ID NO. 10;
and/or, the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.112 and SEQ ID NO.113 in SEQ ID NO. 11;
and/or, the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.121 and SEQ ID NO.122 in SEQ ID NO. 12;
and/or, the primers are the sequences of the amplified fragments of SEQ ID NO.130 and SEQ ID NO.131 corresponding to SEQ ID NO. 13;
and/or the primers are the sequences corresponding to the fragments amplified by the SEQ ID NO.142 and the SEQ ID NO.143 in the SEQ ID NO. 14;
and/or, the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.151 and SEQ ID NO.152 in SEQ ID NO. 15;
and/or, the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.160 and SEQ ID NO.161 in SEQ ID NO. 16;
and/or, the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.166 and SEQ ID NO.167 in SEQ ID NO. 17;
and/or, the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.175 and SEQ ID NO.176 in SEQ ID NO. 18;
and/or the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.184 and SEQ ID NO.185 in SEQ ID NO. 19;
and/or, the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.199 and SEQ ID NO.200 in SEQ ID NO. 20;
and/or the primers are the sequences corresponding to the fragments amplified by SEQ ID NO.208 and SEQ ID NO.209 in SEQ ID NO. 21.
In some of these embodiments, the DNA methylation molecular marker is a molecular marker for a respiratory tract sample.
In some of these embodiments, the respiratory tract sample is a lung tissue sample or a respiratory tract liquid sample.
In another aspect of the invention, the invention also provides the application of any one or the combination of more than two of the sequences shown in SEQ ID NO. 1-21 as the lung cancer related methylation molecular marker in the detection of benign and malignant lung nodules and/or early lung cancer.
The invention also provides a kit for detecting benign and malignant lung nodules, which comprises a reagent for detecting the methylation level of the DNA methylation molecular marker.
In some of these embodiments, the kit can be used in the following assay platforms: comprises reagents used by a PCR amplification method, a fluorescent quantitative PCR method, a digital PCR method, a liquid phase chip method, a sequencing-by-generation method, a pyrosequencing method, a bisulfite conversion sequencing method, a methylation chip method, a simplified bisulfite sequencing technology or a combination thereof. In some preferred embodiments, the detection method is PCR amplification detection, fluorescent quantitative PCR detection, digital PCR detection, chip detection.
In some embodiments, the reagent for detecting the methylation level of the DNA methylation molecular marker in the kit comprises primers and probes for fluorescent quantitative PCR detection of the DNA methylation molecular marker, wherein the primers and probes are:
the primers and probes for SEQ ID No.1 are selected from at least one of the following: primers shown as SEQ ID NO.22 and SEQ ID NO.23, and a probe shown as SEQ ID NO. 24; primers shown as SEQ ID NO.25 and SEQ ID NO.26, and a probe shown as SEQ ID NO. 27; primers shown as SEQ ID NO.28 and SEQ ID NO.29, and a probe shown as SEQ ID NO. 30;
and/or, the primers and probes for SEQ ID No.2 are selected from at least one of the following: primers shown as SEQ ID NO.31 and SEQ ID NO.32, and a probe shown as SEQ ID NO. 33; primers shown as SEQ ID NO.34 and SEQ ID NO.35, and a probe shown as SEQ ID NO. 36; primers shown as SEQ ID NO.37 and SEQ ID NO.38, and a probe shown as SEQ ID NO. 39;
and/or, the primers and probes for SEQ ID No.3 are selected from at least one of: primers shown as SEQ ID NO.40 and SEQ ID NO.41, and a probe shown as SEQ ID NO. 42; primers shown as SEQ ID NO.43 and SEQ ID NO.44, and a probe shown as SEQ ID NO. 45; primers shown as SEQ ID NO.46 and SEQ ID NO.47, and a probe shown as SEQ ID NO. 48;
and/or, the primers and probes for SEQ ID No.4 are selected from at least one of: primers shown as SEQ ID NO.49 and SEQ ID NO.50, and a probe shown as SEQ ID NO. 51; primers shown as SEQ ID NO.52 and SEQ ID NO.53, and a probe shown as SEQ ID NO. 54; primers shown as SEQ ID NO.55 and SEQ ID NO.56, and a probe shown as SEQ ID NO. 57;
and/or, the primers and probes for SEQ ID No.5 are selected from at least one of: primers shown as SEQ ID NO.58 and SEQ ID NO.59, and a probe shown as SEQ ID NO. 60; primers shown as SEQ ID NO.61 and SEQ ID NO.62, and a probe shown as SEQ ID NO. 63; primers shown as SEQ ID NO.64 and SEQ ID NO.65, and a probe shown as SEQ ID NO. 66;
and/or, the primers and probes for SEQ ID No.6 are selected from at least one of: primers shown as SEQ ID NO.67 and SEQ ID NO.68, and a probe shown as SEQ ID NO. 69; primers shown as SEQ ID NO.70 and SEQ ID NO.71, and a probe shown as SEQ ID NO. 72; primers shown as SEQ ID NO.73 and SEQ ID NO.74, and a probe shown as SEQ ID NO. 75;
and/or, the primers and probes for SEQ ID No.7 are selected from at least one of: primers shown as SEQ ID NO.76 and SEQ ID NO.77, and a probe shown as SEQ ID NO. 78; primers shown as SEQ ID NO.79 and SEQ ID NO.80, and a probe shown as SEQ ID NO. 81; primers shown as SEQ ID NO.82 and SEQ ID NO.83, and a probe shown as SEQ ID NO. 84;
and/or, the primers and probes for SEQ ID No.8 are selected from at least one of: primers shown as SEQ ID NO.85 and SEQ ID NO.86, and a probe shown as SEQ ID NO. 87; primers shown as SEQ ID NO.88 and SEQ ID NO.89, and a probe shown as SEQ ID NO. 90; primers shown as SEQ ID NO.91 and SEQ ID NO.92, and a probe shown as SEQ ID NO. 93;
and/or, the primers and probes for SEQ ID No.9 are selected from at least one of: primers shown as SEQ ID NO.94 and SEQ ID NO.95, and a probe shown as SEQ ID NO. 96; primers shown as SEQ ID NO.97 and SEQ ID NO.98, and a probe shown as SEQ ID NO. 99; primers shown as SEQ ID NO.100 and SEQ ID NO.101, and a probe shown as SEQ ID NO. 102;
and/or, the primers and probes for SEQ ID No.10 are selected from at least one of: primers shown as SEQ ID NO.103 and SEQ ID NO.104, and a probe shown as SEQ ID NO. 105; primers shown as SEQ ID NO.106 and SEQ ID NO.107, and a probe shown as SEQ ID NO. 108; primers shown as SEQ ID NO.109 and SEQ ID NO.110, and a probe shown as SEQ ID NO. 111;
and/or, the primers and probes for SEQ ID No.11 are selected from at least one of: primers shown as SEQ ID NO.112 and SEQ ID NO.113, and a probe shown as SEQ ID NO. 114; primers shown as SEQ ID NO.115 and SEQ ID NO.116, and a probe shown as SEQ ID NO. 117; primers shown as SEQ ID NO.118 and SEQ ID NO.119, and a probe shown as SEQ ID NO. 120;
and/or, the primers and probes to SEQ ID No.12 are selected from at least one of the following: primers shown as SEQ ID NO.121 and SEQ ID NO.122, and a probe shown as SEQ ID NO. 123; primers shown as SEQ ID NO.124 and SEQ ID NO.125, and a probe shown as SEQ ID NO. 126; primers shown as SEQ ID NO.127 and SEQ ID NO.128, and a probe shown as SEQ ID NO. 129;
and/or, the primers and probes for SEQ ID No.13 are selected from at least one of: primers shown as SEQ ID NO.130 and SEQ ID NO.131, and a probe shown as SEQ ID NO. 132; primers shown as SEQ ID NO.133 and SEQ ID NO.134, and a probe shown as SEQ ID NO. 135; primers shown as SEQ ID NO.136 and SEQ ID NO.137, and a probe shown as SEQ ID NO. 138;
and/or, the primers and probes for SEQ ID No.14 are selected from at least one of: primers shown as SEQ ID NO.139 and SEQ ID NO.140, and a probe shown as SEQ ID NO. 141; primers shown as SEQ ID NO.142 and SEQ ID NO.143, and a probe shown as SEQ ID NO. 144; primers shown as SEQ ID NO.145 and SEQ ID NO.146, and a probe shown as SEQ ID NO. 147;
and/or, the primers and probes to SEQ ID No.15 are selected from at least one of: primers shown as SEQ ID NO.148 and SEQ ID NO.149, and a probe shown as SEQ ID NO. 150; primers shown as SEQ ID NO.151 and SEQ ID NO.152, and a probe shown as SEQ ID NO. 153; primers shown as SEQ ID NO.154 and SEQ ID NO.155, and a probe shown as SEQ ID NO. 156;
and/or, the primers and probes for SEQ ID No.16 are selected from at least one of: primers shown as SEQ ID NO.157 and SEQ ID NO.158, and a probe shown as SEQ ID NO. 159; primers shown as SEQ ID NO.160 and SEQ ID NO.161, and a probe shown as SEQ ID NO. 162; primers shown as SEQ ID NO.163 and SEQ ID NO.164, and a probe shown as SEQ ID NO. 165;
and/or, the primers and probes for SEQ ID No.17 are selected from at least one of: primers shown as SEQ ID NO.166 and SEQ ID NO.167, and a probe shown as SEQ ID NO. 168; primers shown as SEQ ID NO.169 and SEQ ID NO.170, and a probe shown as SEQ ID NO. 171; primers shown as SEQ ID NO.172 and SEQ ID NO.173, and a probe shown as SEQ ID NO. 174;
and/or, the primers and probes for SEQ ID No.18 are selected from at least one of: primers shown as SEQ ID NO.175 and SEQ ID NO.176, and a probe shown as SEQ ID NO. 177; primers shown as SEQ ID NO.178 and SEQ ID NO.179, and a probe shown as SEQ ID NO. 180; primers shown as SEQ ID NO.181 and SEQ ID NO.182, and a probe shown as SEQ ID NO. 183;
and/or, the primers and probes for SEQ ID No.19 are selected from at least one of: primers shown as SEQ ID NO.184 and SEQ ID NO.185, and a probe shown as SEQ ID NO. 186; primers shown as SEQ ID NO.187 and SEQ ID NO.188, and a probe shown as SEQ ID NO. 189; primers shown as SEQ ID NO.190 and SEQ ID NO.191 and a probe shown as SEQ ID NO. 192;
and/or, the primers and probes to SEQ ID No.20 are selected from at least one of the following: primers shown as SEQ ID NO.193 and SEQ ID NO.194, and a probe shown as SEQ ID NO. 195; primers shown as SEQ ID NO.196 and SEQ ID NO.197, and a probe shown as SEQ ID NO. 198; primers shown as SEQ ID NO.199 and SEQ ID NO.200, and a probe shown as SEQ ID NO. 201;
and/or, the primers and probes for SEQ ID No.21 are selected from at least one of: primers shown as SEQ ID NO.202 and SEQ ID NO.203, and a probe shown as SEQ ID NO. 204; primers shown in SEQ ID NO.205 and SEQ ID NO.206, and a probe shown in SEQ ID NO. 207; primers shown as SEQ ID NO.208 and SEQ ID NO.209, and a probe shown as SEQ ID NO. 210; or selected from primers and probes having at least 70%, 80%, 90%, 95% or 99% sequence identity over a plurality of contiguous nucleotides to the above sequences.
In some embodiments, the primers and probes are:
primers and probes for SEQ ID No.1 were: primers shown as SEQ ID NO.22 and SEQ ID NO.23, and a probe shown as SEQ ID NO. 24;
and/or, the primers and probes for SEQ ID No.2 are: primers shown as SEQ ID NO.34 and SEQ ID NO.35, and a probe shown as SEQ ID NO. 36;
and/or, the primers and probes for SEQ ID No.3 are: primers shown as SEQ ID NO.40 and SEQ ID NO.41, and a probe shown as SEQ ID NO. 42;
and/or, the primers and probes for SEQ ID No.4 are: primers shown as SEQ ID NO.55 and SEQ ID NO.56, and a probe shown as SEQ ID NO. 57;
and/or, the primers and probes for SEQ ID No.5 are: primers shown as SEQ ID NO.58 and SEQ ID NO.59, and a probe shown as SEQ ID NO. 60;
and/or, the primers and probes for SEQ ID No.6 are: primers shown as SEQ ID NO.67 and SEQ ID NO.68, and a probe shown as SEQ ID NO. 69;
and/or, the primers and probes for SEQ ID No.7 are: primers shown as SEQ ID NO.76 and SEQ ID NO.77, and a probe shown as SEQ ID NO. 78;
and/or, the primers and probes for SEQ ID No.8 are: primers shown as SEQ ID NO.88 and SEQ ID NO.89, and a probe shown as SEQ ID NO. 90;
and/or, the primers and probes for SEQ ID No.9 are: primers shown as SEQ ID NO.100 and SEQ ID NO.101, and a probe shown as SEQ ID NO. 102;
and/or, the primers and probes for SEQ ID No.10 are: primers shown as SEQ ID NO.109 and SEQ ID NO.110, and a probe shown as SEQ ID NO. 111;
and/or, the primers and probes for SEQ ID NO.11 are: primers shown as SEQ ID NO.112 and SEQ ID NO.113, and a probe shown as SEQ ID NO. 114;
and/or, the primers and probes for SEQ ID No.12 are: primers shown as SEQ ID NO.121 and SEQ ID NO.122, and a probe shown as SEQ ID NO. 123;
and/or, the primers and probes for SEQ ID No.13 are: primers shown as SEQ ID NO.130 and SEQ ID NO.131, and a probe shown as SEQ ID NO. 132;
and/or, the primers and probes for SEQ ID No.14 are: primers shown as SEQ ID NO.142 and SEQ ID NO.143, and a probe shown as SEQ ID NO. 144;
and/or, the primers and probes for SEQ ID No.15 are: primers shown as SEQ ID NO.151 and SEQ ID NO.152, and a probe shown as SEQ ID NO. 153;
and/or, the primers and probes for SEQ ID No.16 are: primers shown as SEQ ID NO.160 and SEQ ID NO.161, and a probe shown as SEQ ID NO. 162;
and/or, the primers and probes for SEQ ID No.17 are: primers shown as SEQ ID NO.166 and SEQ ID NO.167, and a probe shown as SEQ ID NO. 168;
and/or, the primers and probes for SEQ ID No.18 are: primers shown as SEQ ID NO.175 and SEQ ID NO.176, and a probe shown as SEQ ID NO. 177;
and/or, the primers and probes for SEQ ID No.19 are: primers shown as SEQ ID NO.184 and SEQ ID NO.185, and a probe shown as SEQ ID NO. 186;
and/or, the primers and probes for SEQ ID No.20 are: primers shown as SEQ ID NO.199 and SEQ ID NO.200, and a probe shown as SEQ ID NO. 201;
and/or, the primers and probes for SEQ ID No.21 are: primers shown as SEQ ID NO.208 and SEQ ID NO.209, and a probe shown as SEQ ID NO. 210;
or selected from primers and probes having at least 70%, 80%, 90%, 95% or 99% sequence identity over a plurality of contiguous nucleotides to the above sequences.
In some embodiments, the kit further comprises primers and probes for fluorescent quantitative PCR detection of the reference gene ACTB, preferably the primers and probes are: primers shown as SEQ ID NO.211 and SEQ ID NO.212, and a probe shown as SEQ ID NO. 213.
In some of these embodiments, the test sample of the kit is a respiratory sample. The airway sample comprises a lung tissue sample or an airway fluid sample.
The invention also provides a methylation level detection method of the DNA methylation molecular marker or the combination thereof, which comprises the following steps:
(1) extracting genome DNA from a sample to be detected;
(2) carrying out bisulfite treatment on the extracted genome DNA to obtain converted DNA;
(3) carrying out multiple PCR amplification on the converted DNA by using an amplification primer aiming at the DNA methylation molecular marker to obtain multiple PCR amplification products;
(4) and (4) carrying out multiplex fluorescence quantitative PCR detection on the multiplex PCR product obtained in the step (3) by using a probe aiming at the DNA methylation molecular marker.
In some of these embodiments, the multiplex PCR reaction conditions are as follows: 30s at 98 ℃; 15-35 cycles: 15s at 98 ℃, and 15-30s at 58-66 ℃; 15-30s at 72 ℃; 5min at 72 ℃; and/or the multiple fluorescent quantitative PCR reaction conditions are as follows: 30s at 95 ℃; 35-50 cycles: 10s at 95 ℃; 60-64 ℃ for 30 s.
In some embodiments, the detection is performed using the primers and probes of the kit described above.
The invention also provides a method for detecting benign and malignant lung nodules, which comprises the following steps:
(1) detecting the methylation level of the DNA methylation molecular marker by using the detection method;
(2) through reference gene C T Judging whether the sample is effective, and then using the reference gene C T Value pair C for each molecular marker detected in valid sample T Correcting the value;
(3) and performing model analysis on the corrected data, and finally judging whether the lung nodules are benign or malignant.
In some embodiments, the reference gene in step (2) is C T If the value is between 8 and 18, judging the sample as a valid sample; otherwise, the sample is an invalid sample; then using the reference gene C T Value for C of each DNA methylation molecular marker in valid samples T Correcting the value; if the target DNA is a methylated molecular marker C T Value of<35 determining that the DNA methylated molecular marker is detected, obtaining the relative cycle number deltaC of the target DNA methylated molecular marker T :ΔC T Methylated molecular marker of target DNA C T Value-reference Gene C T A value; if the target DNA is a methylated molecular marker C T If the value is not less than 35, it is judged that the DNA methylated molecular marker is not detected, and if it is not detected, Δ C is given thereto T =27。
In some embodiments, the corrected Δ C is used in step (3) T And (5) carrying out data analysis on the values, and establishing a lung nodule benign and malignant prediction model by adopting a naive Bayes algorithm. According to corrected Delta C T And (3) carrying out data analysis, randomly dividing a data set into a training set and a testing set according to a ratio of 6:4, establishing a benign and malignant prediction model by adopting a naive Bayes algorithm according to combinations containing different DNA methylation molecular markers in the training set, and finally evaluating the classification capability of the model in the corresponding testing set containing a specific DNA methylation molecular marker combination.
Compared with the prior art, the invention has the following beneficial effects:
the invention finds out the DNA methylation specific molecular markers highly related to the lung cancer, can detect the benign and malignant lung nodules by detecting the methylation level of the DNA methylation molecular markers, and has good sensitivity and specificity. Wherein, the single molecular marker SEQ ID NO.9 has good sensitivity, and the single molecular marker has higher sensitivity and specificity with the combination of SEQ ID NO.1, SEQ ID NO.6, SEQ ID NO.7, SEQ ID NO.8, SEQ ID NO.9, SEQ ID NO.10, SEQ ID NO.14, SEQ ID NO.15, SEQ ID NO.16 and the corresponding molecular markers thereof. Particularly, the combination of the markers corresponding to SEQ ID NO.6 and SEQ ID NO.9, or the combination of the markers corresponding to SEQ ID NO.15 and SEQ ID NO.16, or the further combination of the two markers can well improve the sensitivity and specificity of the detection of benign and malignant pulmonary nodules, effectively improve the detection rate of early malignant pulmonary nodules, perform treatment and intervention as soon as possible, and improve the survival rate of patients; meanwhile, the false positive rate detected is reduced, and excessive diagnosis and treatment of benign pulmonary nodules are avoided.
The DNA methylation molecular marker provided by the invention is particularly suitable for respiratory tract samples, including lung tissue samples obtained through surgical operations, respiratory tract liquid samples obtained through minimally invasive or non-invasive means and the like.
The fluorescent quantitative PCR detection primers and the probes for the DNA methylation molecular markers, which are provided by the kit, overcome the defect that a plurality of primers and probes interfere with each other during multiplex PCR amplification and detection, and each DNA methylation molecular marker can be effectively amplified and enriched when the primers are used for carrying out multiplex PCR on DNA treated by bisulfite; when multiple fluorescence quantitative PCR detection is subsequently carried out on multiple PCR products, C obtained by the corresponding DNA methylation molecular marker in the multiple fluorescence quantitative PCR detection T C value of fluorescent quantitative PCR reaction carried out separately from the DNA methylated molecular marker T The values were not significantly different and the quantification performance was equivalent to single-zone quantification. The kit is optimized, and the primers and the probes aiming at different DNA methylation molecular markers do not interfere with each other, so that the multiplex PCR amplification and the multiplex fluorescence quantitative PCR detection can be successfully realized, and the detection efficiency is effectively improved.
According to the detection method for the DNA methylation molecular marker, multiple PCR amplification is introduced, target molecules can be effectively enriched, the limit of low acquisition quantity of detection samples is overcome, joint detection of a plurality of molecular markers can be performed while detection signals are amplified, the detection sensitivity and detection efficiency are improved, and the detection rate of lung cancer and the accuracy of benign and malignant detection of lung nodules can be enhanced.
Drawings
FIG. 1 is the amplification curve detected by the fluorescent quantitative PCR reaction of one molecular marker (Marke9) in example 3.
FIG. 2 is a ROC plot for each molecular marker combination of example 5.
FIG. 3 is a ROC plot for each molecular marker combination of example 6.
Detailed Description
The experimental procedures of the present invention, without specifying the specific conditions in the following examples, are generally carried out according to conventional conditions, such as Sambrook et al, molecular cloning: the conditions described in the Laboratory Manual (New York: Cold Spring Harbor Laboratory Press,1989), or according to the manufacturer's recommendations. The various chemicals used in the examples are commercially available.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention.
The terms "comprising" and "having," and any variations thereof, are intended to cover non-exclusive inclusions. For example, a process, method, apparatus, article, or device that comprises a list of steps is not limited to only those steps or modules listed, but may alternatively include other steps not listed or inherent to such process, method, article, or device.
The "plurality" referred to in the present invention means two or more. "and/or" describes the association relationship of the associated objects, meaning that there may be three relationships, e.g., a and/or B, which may mean: a exists alone, A and B exist simultaneously, and B exists alone. The character "/" generally indicates that the former and latter associated objects are in an "or" relationship.
The DNA methylation molecular marker for detecting the benign and malignant lung nodules provided by the invention is used for detecting 21 methylation regions in 13 detection genes B3GNTL1, DLX4, HOAX1, HOXA9, HOXB4, MSC-AS1, OTX2, PREX1, PTGER4, RASSF1, SHOX2, TWIST1 and ZNF781, and is any one or a combination of more than two of sequences shown in SEQ ID No. 1-SEQ ID No. 21; or any one or the combination of more than two of complete complementary sequences selected from the sequences shown in SEQ ID NO. 1-SEQ ID NO. 21; or any one or the combination of more than two of continuous fragments selected from at least 55% of the full length of the sequences shown in SEQ ID NO.1 to SEQ ID NO.21, or any one or the combination of more than two of completely complementary sequences selected from at least 55% of the full length of the sequences shown in SEQ ID NO.1 to SEQ ID NO. 21.
In the following examples, the DNA methylation molecular markers described in SEQ ID NO.1 to SEQ ID NO.21 correspond to the markers described therein, and the corresponding markers amplified with the corresponding primers.
In one embodiment, the invention relates to a DNA methylation molecular marker suitable for detecting benign and malignant lung nodules in respiratory tract samples, wherein the DNA methylation molecular marker comprises any one or a combination of more than two of 21 methylation detection regions. In some embodiments, the DNA methylation molecular markers can be combined arbitrarily to detect benign and malignant lung nodules in the test sample.
In one embodiment, the invention relates to the use of the DNA methylation molecular marker in the preparation of a kit for detecting benign and malignant lung nodules.
In one embodiment, the invention relates to a kit for detecting benign and malignant pulmonary nodules, which comprises a reagent for detecting the methylation level of the DNA methylation molecular marker. The kit can be suitable for detection platforms such as PCR amplification, fluorescent quantitative PCR diagnosis, digital PCR (digital PCR) or detection chips and the like, and is preferably a platform capable of realizing high-throughput detection.
The invention carries out primer and probe design aiming at the DNA methylation molecular marker of the specific methylation region, and utilizes the amplification primer combined with the DNA methylation molecular marker to carry out multiplex PCR amplification on the genomic DNA (gDNA) which is extracted from a respiratory tract sample and is treated by bisulfite; and then, carrying out fluorescent quantitative PCR detection on the methylation signal of the detection area by using the specific probe of the DNA methylation molecular marker, then establishing a benign and malignant prediction model by adopting a naive Bayes algorithm, and finally diagnosing the benign and malignant of the lung nodule through the established model.
Example 1
A kit for detecting benign and malignant lung nodules in respiratory tract samples comprises detection primers and probes for DNA methylation molecular markers Marker1 to Marker21 of 21 methylation regions in 13 detection genes B3GNTL1, DLX4, HOXA1, HOXA9, HOXB4, MSC-AS1, OTX2, PREX1, PTGER4, RASSF1, SHOX2, TWIST1 and ZNF781, wherein the sequence and sequence number of the detection region of each DNA methylation molecular Marker are shown in Table 1 (wherein the underlined part of each region is Marker of the corresponding sequence of the fragment amplified by the preferred primer in the example):
TABLE 1 sequence of detection region of molecular marker
The kit respectively designs three pairs of primers and three probes (fluorescent markers of the probes can be marked by fluorescent groups such as FAM, VIC, NED and the like) aiming at specific methylation sites in 21 molecular markers Marker 1-Marker 21 for detecting benign and malignant lung nodules of respiratory tract samples, and the probes are respectively marked as combinations 1, 2 and 3. The selected combination of the primers and the probes in each molecular marker can be optionally selected to be combined with the combinations 1, 2 and 3 of the primers and the probes in other molecular markers and detected on the same platform. The specific primer and probe sequences corresponding to each molecular marker are shown in table 2:
TABLE 2 primer and Probe sequences for related molecular markers
In this and the following examples, the combination of primers and probes preferably used are:
primer and probe combination 1 of Marker1, primer and probe combination 2 of Marker2, primer and probe combination 1 of Marker3, primer and probe combination 3 of Marker4, primer and probe combination 1 of Marker5, primer and probe combination 1 of Marker6, primer and probe combination 1 of Marker7, primer and probe combination 2 of Marker8, primer and probe combination 3 of Marker9, primer and probe combination 3 of Marker10, primer and probe combination 1 of Marker11, primer and probe combination 1 of Marker12, primer and probe combination 1 of Marker13, primer and probe combination 2 of Marker14, primer and probe combination 2 of Marker15, primer and probe combination 2 of Marker16, primer and probe combination 1 of Marker17, primer and probe combination 1 of Marker18, primer and probe combination 1 of Marker19, primer and probe combination 2 of Marker20, primer and probe combination 3 of Marker 463, and probe combination 1 of Marker 21. In practical applications, the corresponding primers and probes will be selected based on different combinations of methylated molecular markers.
The kit also comprises primers and probes of an internal reference gene ACTB, and the sequence of the primers and probes is specifically shown in the table 3:
TABLE 3 reference Gene ACTB primers and probes
Example 2
This example uses the kit described in example 1 to detect the methylation levels of Marker1 to Marker21 in respiratory samples.
A method for detecting the methylation level of a DNA methylation molecular marker comprises the following steps:
1. extraction of gDNA from respiratory tract samples:
1) extracting gDNA of a lung tissue paraffin section sample: the specific operation steps of extracting the gDNA of the paraffin tissue are carried out according to the instructions of ALLPrep DNA/RNA FFPE Kit of Qiagen company;
2) extracting gDNA of a respiratory tract liquid sample: firstly, carrying out low-speed centrifugation treatment on a respiratory tract liquid sample at 4 ℃ for 5min at 5000 g; removing supernatant, and collecting precipitate; then according to QiagenBlood&The Tissue Kit instructions were manipulated to extract gDNA.
2. Sulfite conversion of extracted gDNA
The extracted gDNA is bisulfite converted and 20-50ng gDNA, preferably 50ng in this example, is added according to Zymo DNA Methylation-Direct MagPrep instructions to deaminate unmethylated cytosine to uracil while the methylated cytosine remains unchanged. The bisulfite converted DNA products were all used for multiplex PCR amplification.
3. Performing multiple PCR amplification on the transformed DNA
And performing multiple PCR amplification on all the transformed DNA products, wherein the reaction components are as follows: a primer mixture of a specific combination of molecular markers and 1 reference gene, wherein the concentration of each primer is 200nM-300nM, preferably 300 nM; the concentration of magnesium ions is 1-3mM, preferably 1.5mM in this embodiment; the concentration of the dNTP mixed solution is 200-600uM, preferably 400uM in the embodiment; the enzyme was Phusion U (Thermo Fisher, Cat # F555L), and the number of units for one reaction was 1-3U, preferably 1.5U in this example. The multiplex PCR reaction system was prepared as shown in table 4:
TABLE 4 multiplex PCR reaction System
Components | Volume (ul) | Final concentration |
|
10 | 1X |
dNTP mix(10mM/dNTP) | 2 | 400uM/dNTP |
Primer mixture (75 uM/primer) | 8.4 | 300 nM/primer |
MgCl 2 (25mM) | 3 | 1.5mM |
Phusion U(2U/ul) | 0.75 | 1.5U |
DNA template | 18-22 | |
DEPC H 2 O | up to 50 |
The specific reaction conditions are as follows: pre-denaturation at 98 ℃ for 30 s; 15-35 cycles of reaction, preferably 22 cycles in this example: denaturation, 98 ℃, 15 s; annealing at 58-66 deg.C for 15-30s, preferably 63 deg.C for 15s in this example; the elongation is 72 ℃ and 15-30s, preferably 15s in the embodiment.
4. Performing multiplex fluorescent quantitative PCR determination on multiplex PCR amplification products
Multiplex PCR products were diluted 1-5 fold, preferably 2 fold in this example. The multiple fluorescent quantitative PCR reaction components are as follows: primer probe mixed solution, wherein the concentration of each primer is 200-900nM, preferably 900nM in this embodiment; the probe concentration is 100-200nM, with 200nM being preferred in this example. The reaction enzyme mixture was 1 time of ChamQ Geno-SNP probe Master Mix (Vazyme, Cat # Q811-02), and one reaction was a 10ul system.
The primer probe mixed solution in the system comprises 2-3 molecular marker primers and probes marked by different fluorescent groups, and a mixed solution scheme for mixing some of the 2-3 molecular marker primers and probes is listed in table 5:
TABLE 5 primer Probe mixture solution combination protocol in multiplex fluorescent quantitative PCR reaction System
The specific reaction conditions are as follows: pre-denaturation at 95 ℃ for 5 min; 40-50 cycles, preferably 45 cycles in this example: denaturation, 15s at 95 ℃; annealing at 60-64 deg.C, 62 deg.C for 1min, and collecting signal. The qPCR fluorescent quantitation reaction system was prepared as in table 6:
TABLE 6 qPCR fluorescent quantitative reaction system
C of multiplex fluorescent quantitative PCR assay of 21 molecular markers according to the mixture combination (mixtures A to O) shown in Table 5 using the fully methylated (positive control) and non-methylated (negative control) standards T Value, C with which a single fluorescent quantitative PCR reaction is carried out T Value comparison, in which negative control was not detected in all combinations and individual quantification, C of positive control T The values are shown in Table 7 (including Table 7.1 and Table 7.2), and the results show that the molecular markers are C in the quantitative PCR reaction performed according to the mixture combination protocol shown in Table 5 T C value of fluorescent quantitative PCR reaction carried out independently of C T The values are similar and have no obvious difference, so that the amplification efficiency of each molecular marker in the primer probe mixed solution combination scheme for judging multiple fluorescence quantification has no mutual interference, the quantification performance is equivalent to that of a single region, and the simultaneous quantitative detection of 2-3 molecular markers can be realized.
TABLE 7.1C for each molecular marker in multiplex and Individual fluorescent quantitative PCR reactions T Value of
TABLE 7.2C for each molecular marker in multiplex and Individual Fluorogenic quantitative PCR reactions T Value of
The present embodiment further provides a method for detecting benign and malignant lung nodules, further comprising the following steps:
5. c of reference gene in sample determined from fluorescent quantitative PCR reaction T Judging whether the detected sample is an effective sample, and if so, detecting C of the reference gene in the sample T If the value is between 8 and 18, judging the sample as a valid sample; if the C of the reference gene in the sample is detected T Value of<8, the initial input amount of the sample is excessive; if the C of the reference gene in the sample is detected T Value of>18, the initial sample input amount is insufficient. Samples with excessive or insufficient initial input amount are judged as invalid samples, and are not included in the detection and analysis.
6. If the target DNA methylated molecular marker C is on the premise that the sample is judged to be a valid sample T Value of<35 determining that the DNA methylation molecular marker is detected, and obtaining the relative cycle number deltaC of the target DNA methylation molecular marker T :ΔC T Methylation of target DNA molecular marker C T Value-reference Gene C T A value; if the target DNA is a methylated molecular marker C T If the value is not less than 35, it is judged that the DNA methylated molecular marker is not detected, and if it is not detected, Δ C is given thereto T =27。
7. Corrected Δ C according to the target molecular marker in each sample T And (3) carrying out data analysis, dividing the data set into a training set and a test set according to the ratio of 6:4, carrying out 100 times of random division, establishing a benign and malignant prediction model by adopting a naive Bayes algorithm according to the combination containing different molecular markers for the training set, and finally evaluating the classification capability of the model in the corresponding test set containing a specific molecular marker combination.
Example 3
This example provides the detection of molecular markers in a standard by the following steps:
1. preparation of standards
1) Preparation of 0% methylated standards:
by usingSingle Cell Kit (Qiagen, Cat #150343) and Mung Bean Nuclear (NEB, Cat # M0250L) treated NA12878 DNA to make 0% methylation standards;
2) preparation of 100% methylated standard:
the prepared 0% methylated standards were treated with CpG methylransferase (m.sssi) to give 100% methylated standards.
2. Preparation of standards with different methylation ratios:
mix the 0% and 100% methylation standards according to the desired methylation ratio gradient to give 0.2%, 0.4%, 1% methylation standards.
3. Bisulfite conversion of standard DNA at different methylation ratios: the procedure is as in example 2, the conversion input being 10-50ng, preferably 50 ng.
4. The transformed standard DNA was amplified by multiplex PCR, as in example 2, using 21 molecular markers and primers for the reference gene as the primer mixture for multiplex PCR.
5. The multiplex PCR amplification product was subjected to the fluorescent quantitative PCR assay in the same manner as in example 2.
6. C of reference gene ACTB in sample determined according to fluorescent quantitative PCR reaction T Judging whether the detected sample is an effective sample, and if so, detecting C of the reference gene in the sample T If the value is between 8 and 18, judging the sample as a valid sample;
7. if the target DNA methylation molecular marker C is judged to be a valid sample T Value of<35 judging that the DNA methylation molecular marker is detected, and if the target DNA methylation molecular marker C is detected T If the value is not less than 35, it is judged that the DNA methylated molecular marker is not detected.
In this example and the following examples, the primer probe combinations of the respective molecular markers were as preferred in example 1.
In this and the following examples, a negative control was set for each experiment, and multiple PCR was performed using water as a template to obtain a negative control multiple PCR product, which was then subjected to quantitative fluorescence PCR measurement of each specific molecular marker. If the negative control has no detection signal, judging that the whole experiment operation has no exogenous pollution.
In this example, 3 completely independent replicates were performed, and 21 molecular markers all showed detection signals in 100% methylated standard, while none of the negative control and unmethylated detection standards showed detection signals, and the amplification curve of one of the molecular markers (Marker10) is shown in FIG. 1. In each standard substance with the methylation ratio of more than or equal to 0.2%, three tests of Marker1, Marker2, Marker4, Marker5, Marker10, Marker11 and Marker21 all have detection signals, which indicates that the detection rate of the molecular markers to the sample with the methylation ratio of more than or equal to 0.2% reaches 100%; in each standard substance with the methylation ratio of more than or equal to 0.4%, all three tests of Marker3, Marker12, Marker14, Marker15, Marker18 and Marker20 have detection signals, which indicates that the detection rate of the molecular markers to the sample with the methylation ratio of more than or equal to 0.4% reaches 100%; all three tests performed on 21 molecular markers in the standard with a methylation ratio of 1% showed that the molecular markers could detect a signal with a methylation ratio of 1%, as shown in table 8:
TABLE 8 test results of molecular markers in various methylation ratio standards
Example 4 correlation of molecular markers with Lung cancer
In this example, the correlation between the molecular marker provided by the present invention and lung cancer is studied, and the specific method is as follows:
1. DNA extraction was performed on paraffin section samples of lung tissue as described in example 2.
2. The extracted DNA was transformed with sulfite in an amount of 10ng to 50ng, preferably 50ng in this example, as described in example 2.
3. The transformed DNA was amplified by multiplex PCR using amplification primers containing specific molecular markers as described in example 2.
4. The multiplex PCR amplification products were subjected to fluorescent quantitative PCR assay as described in example 2.
5. Based on the results of the fluorescence measurements, it was determined whether the sample was a valid sample, as described in example 2.
6. For the target molecular marker C in the detected sample T The values were corrected as described in example 2.
7. According to the corrected Delta C of the target molecular marker in the sample T Data analysis was performed as described in example 2.
Single molecule Marker detection was performed on 109 lung nodule tissue samples (46 samples identified as benign by surgical biopsy, 63 malignant samples including 4 AIS samples, 11 MIA samples, 38 IA samples and 10 other subtype malignant samples, 84% of which are early lung cancer samples) for Marker1, Marker6, Marker7, Marker8, Marker9, Marker10, Marker14, Marker15, and Marker 16. The average AUC of each molecular marker is 0.80-0.89, the lowest AUC is 0.71-0.82 and the highest AUC is 0.88-0.95 at 95% CI. The Sensitivity (Sensitivity) after segmentation under the premise of the maximum Youden Index (Youden Index) is 65-79%, and the Specificity (Specificity) is 80-98%, wherein the Marker9 has high Specificity and Sensitivity in tissue samples. This suggests that the methylation levels of these molecular markers are highly correlated with the canceration of lung cancer. Specifically, as shown in table 9:
TABLE 9 correlation of molecular markers with Lung cancer
Example 5 presentation of different molecular marker combinations for detection of benign and malignant pulmonary nodules in Lung tissue samples
The present example utilizes 21 molecular markers including any combination of Marker 1-Marker 21, and the experimental method of example 2 to perform detection analysis on 109 lung nodule tissue samples in example 4, wherein the specific detection kit, the experimental method and the data judgment process are as described in example 2, and the combination of primers and probes is as preferred in example 1.
In this and the following embodiments, combinations including different specific molecular markers are selected, specific combinations of molecular markers are shown in table 10, a logistic regression model is used for modeling analysis, a data set is randomly segmented into a training set and a test set according to a ratio of 6:4, and a naive Bayes algorithm is used for establishing a benign/malignant prediction model.
TABLE 10 different molecular marker combinations
The molecular marker combination A in Table 10 was selected for modeling analysis, and its AUC in the test set was 0.95 (specificity: 94%; sensitivity: 86%), and ROC is shown in FIG. 2. It can be seen that the combined sensitivity and specificity of the selected molecular markers is very good. As 84% of malignant samples in the samples are early lung cancer samples, the molecular marker combination also has higher sensitivity and specificity for early lung cancer detection and benign and malignant lung nodule detection.
The molecular marker combination D in Table 10 was selected for modeling analysis, and its AUC in the test set was 0.91 (specificity: 90%; sensitivity: 85%), and ROC is shown in FIG. 2. The combination of the selected molecular markers can be seen to have better sensitivity and specificity. As 84% of malignant samples in the samples are early lung cancer samples, the molecular marker combination also has higher sensitivity and specificity for early lung cancer detection and benign and malignant lung nodule detection.
The molecular marker combination K in Table 10 was selected for modeling analysis, and its AUC in the test set was 0.91 (specificity: 90%; sensitivity: 85%), and ROC is shown in FIG. 2. The combination of selected molecular markers was seen to have better sensitivity and specificity, but slightly less sensitivity than the D combination. As 84% of malignant samples in the sample are lung cancer early samples, the molecular marker combination has higher sensitivity and specificity for early lung cancer detection and pulmonary nodule benign and malignant detection.
The molecular marker combination B in Table 10 was selected for modeling analysis, and its AUC in the test set was 0.89 (specificity: 84%; sensitivity: 88%), and ROC is shown in FIG. 2. It can be seen that the combination of selected molecular markers has better sensitivity and slightly weaker specificity than the other combinations. As 84% of malignant samples in the samples are early lung cancer samples, the molecular marker combination also has higher sensitivity and specificity for early lung cancer detection and benign and malignant lung nodule detection.
The molecular marker combination J in Table 10 was selected for modeling analysis, and its AUC in the test set was 0.87 (specificity: 90%; sensitivity: 85%), and ROC is shown in FIG. 2. The combination of the selected molecular markers can be seen to have better sensitivity and specificity. As 84% of malignant samples in the samples are lung cancer early samples, the molecular marker combination also has higher sensitivity and specificity for early lung cancer detection and lung nodule benign and malignant detection. The specificity and sensitivity of these three molecular marker combinations are similar to those of combinations D and K, but the AUC of combination J is slightly lower, so that the effect of this combination in diagnosing benign and malignant pulmonary nodules is slightly weaker than that of combinations D and K.
The molecular marker combinations C, I and F in Table 10 were selected for modeling analysis, and their performance in the test set was: combination C, AUC 0.88 (specificity: 95%; sensitivity: 76%); combination I, AUC 0.87 (specificity: 90%; sensitivity: 73%); combination F, AUC 0.90 (specificity: 100%; sensitivity: 77%). The ROC is shown in fig. 2. It can be seen that the combinations of selected molecular markers, although slightly less sensitive than the other combinations (combinations A, B, D, J and K), all have a very high specificity. Therefore, the detection method of the combination of the three molecular markers can reduce the false positive rate of early lung cancer detection and lung nodule benign and malignant detection through higher detection specificity.
The molecular marker combination G in Table 10 was selected for modeling analysis, and its AUC in the test set was 0.87 (specificity: 68%; sensitivity: 92%), and ROC is shown in FIG. 2. It can be seen that the combination of the selected molecular markers has very good sensitivity for early lung cancer detection and benign and malignant detection of pulmonary nodules, but relatively low specificity; and the specificity is obviously improved by adding the combination K of the Marker15 and the Marker16 on the basis.
Example 6 Performance of lung nodule benign and malignant detection on airway fluid samples with different molecular markers and combinations
This example detected 21 molecular markers including Marker1 through Marker21 mentioned in example 1 in 86 samples of respiratory fluid. The surgical biopsy was identified as 42 benign samples and 44 malignant samples, wherein the malignant samples included 25 stage I samples, 1 stage II samples, 6 stage III samples, and 12 stage IV samples. The specific test kit, test method and data judgment process were as described in example 2, and the primer and probe combinations were as preferred in example 1.
The molecular marker combination A in Table 10 of example 5 was selected for modeling analysis of airway fluid samples, and the mean AUC was 0.84 (specificity: 88%; sensitivity: 83%), the sensitivity to stage I malignancy was 77%, and the sensitivity to both stage III and stage IV samples reached 100%, and ROC is shown in FIG. 3. Therefore, the method has higher sensitivity and specificity for detecting benign and malignant pulmonary nodules in the respiratory tract liquid sample.
The molecular marker combination D in Table 10 of example 5 was selected for modeling analysis of airway fluid samples, and the mean AUC was 0.89 (specificity: 82%; sensitivity: 89%), the sensitivity to stage I malignancy was 78%, the sensitivity to both stage III and stage IV samples was 100%, and ROC is shown in FIG. 3. Therefore, the method has higher sensitivity and specificity for detecting the benign and malignant pulmonary nodules of the respiratory tract liquid sample.
The molecular marker combination E in Table 10 of example 5 was selected for modeling analysis of airway fluid samples, and the mean AUC was 0.87 (specificity: 81%; sensitivity: 85%), the sensitivity to stage I malignancy was 77%, the sensitivity to both stage III and stage IV samples reached 100%, and ROC is shown in FIG. 3. Therefore, the method has higher sensitivity and specificity for detecting the benign and malignant pulmonary nodules of the respiratory tract liquid sample.
The molecular marker combination J in Table 10 of example 5 was selected for modeling analysis of respiratory tract fluid samples, and the average AUC was 0.83 (specificity: 82%; sensitivity: 83%), the sensitivity to stage I malignant samples was 75%, the sensitivity to both stage III and stage IV samples reached 100%, and ROC is shown in FIG. 3. Therefore, the method has higher sensitivity and specificity for detecting the benign and malignant pulmonary nodules of the respiratory tract liquid sample.
The molecular marker combination F in Table 10 of example 5 was selected for modeling analysis of airway fluid samples, and the mean AUC was 0.82 (specificity: 81%; sensitivity: 79%), the sensitivity to stage I malignant samples was 73%, the sensitivity to both stage III and stage IV samples was 100%, and ROC is shown in FIG. 3. Therefore, the method has higher sensitivity and specificity for detecting the benign and malignant pulmonary nodules of the respiratory tract liquid sample.
The respiratory fluid samples were modeled using the molecular marker combinations G, I and K in table 10 of example 5 and were shown to be: combination G, mean AUC 0.83 (specificity: 76%; sensitivity: 83%); combination I, mean AUC 0.86 (specificity: 71%; sensitivity: 94%); in combination K, the mean AUC was 0.87 (specificity: 77%; sensitivity: 94%). The ROC is shown in fig. 3. The three molecular marker combinations have better sensitivity in respiratory tract liquid samples, but the specificity is obviously weaker than the detection effect of other combinations (A, D, E, F, J).
The combination of the molecular markers (A, D, E, F, J) can achieve higher detection specificity and sensitivity by having more suitable markers, and the combination D, E, F and J can also maintain higher detection specificity (> 80%) after a certain number of markers is reduced; the kit has high detection specificity, and can better reduce the misdiagnosis rate of benign and malignant diagnosis of the pulmonary nodules. Wherein, the combination D, E and J can simultaneously ensure high detection sensitivity, and the combination D and E even have higher sensitivity than the combination A; the detection sensitivity is higher, and the missed diagnosis rate of the benign and malignant diagnosis of the pulmonary nodules can be better reduced. Therefore, the molecular marker combination A, D, E, F and J is more suitable for detecting lung nodules in respiratory tract liquid samples than other combinations.
In addition to the combination A which is formed by combining more markers, the combination J which is formed by two molecular markers of the marker6 and the marker9 and the combination D which is formed by adding four molecular markers of the marker15 and the marker16 on the basis of the marker6 and the marker9 show higher detection capability in two different types of respiratory tract samples, so that the two marker combinations can have higher sensitivity and specificity for detecting the benign and malignant pulmonary nodules of the respiratory tract tissues and the liquid samples.
The present example also carried out the detection of single molecular markers for the molecular markers Marker6, Marker7, Marker8, Marker9, Marker15 and Marker16 in these respiratory fluid samples. As shown in example 4, the 6 molecular markers have high correlation with lung cancer, and the single detection performance in the respiratory tract liquid sample is shown in Table 11, wherein the sensitivity of the Marker9 in the respiratory tract liquid sample is higher, but the specificity is lower than that in the case of the multiple markers in combination. The specificity of any combination of molecular markers as shown in this example was higher than that of marker9 alone. Comprehensive comparison shows that the detection capability of combining specific and suitable molecular markers is superior to the performance of independent detection of single molecular markers.
TABLE 11
The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is specific and detailed, but not to be understood as limiting the scope of the present invention. It should be noted that various changes and modifications can be made by those skilled in the art without departing from the spirit of the invention, and these changes and modifications are all within the scope of the invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.
Sequence listing
<110> Guangzhou City benchmark medical Limited liability company
<120> methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof
<160> 213
<170> SIPOSequenceListing 1.0
<210> 1
<211> 148
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 1
gcagcccgac cccgtggcca cagagcagcc tctgttcctt ggtaatcgtc aacatctggc 60
aggcacctcc acgcccctcc gcccccacca tacgctgctg gctggaggct cccgtgcaat 120
gccaggtgtg ttgggagagg acaggcta 148
<210> 2
<211> 145
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 2
agggggtctg ctgggagagt tggggcgcgg agggctatgc gaattcagca accgggtcgg 60
cctcagagac tctgcgtccc cgctcgaatt gcgcagtcca cgcagccaga gaggcgcggg 120
tgggctctgg gctgccagcc cagag 145
<210> 3
<211> 181
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 3
atggctgagc ctcctgcaaa agtttgccgg cttccgcagt gatggatcac cgttttagtg 60
gcatttaaat ccccggcgct ccgccgtcta ggtgacgcgc agtcgccccc ccaggcagcc 120
taggcggcgg cagctgctgc ggcgactgca aaggccgatt tggagtgctg gagcgaagaa 180
g 181
<210> 4
<211> 189
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 4
gagagcgcac cgggcgtggg ctccagccag gagcgcatgt acctgccgtc cggcgccgcc 60
gccgccacgg gcgcctgggg gtgcacgtag gggtggtggt gatggtggtg gtacaccgca 120
gcgggtacag cgttggcgcc cgccgcgtgc actgggttcc acgaggcgcc aaacaccgtc 180
gccttggac 189
<210> 5
<211> 153
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 5
gctcggccag cgtcgccgcc tgccggggag gctggcccag ggtccccggc gcatagcggc 60
caacgctcag ctcatccgcg gcgtcggcgc ccagcaggaa cgagtccacg tagtagttgc 120
ccagggcccc agtggtggcc atcaccgtgc cca 153
<210> 6
<211> 145
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 6
gttctgggcg cagggaggcg gcggggggct gctgctgacc gcctcgcagc gctggccggg 60
ctccgggagg agggccccgg cgggtggcgg cgcaggagcc cgaggggaca gaccgggcgg 120
tggcgggggc ggcgggggtg gtggc 145
<210> 7
<211> 182
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 7
gcgggggccc agggtcccgg caggccgcgt agcgctgcac ggtgcacgcc gcgcgccgcc 60
cgaagcccgc ctccggctgg aagctgctct ctcgcctctg gccgccggcg tagtacccgg 120
gcgagtggtc gctgggtagg taatcgctct gtgaatattc ctcgcatgga gggaacttgg 180
gg 182
<210> 8
<211> 151
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 8
cctgtttccg aaagccctcc tacttactgt caagtgaaca aagttaggcg cccacgtgat 60
cctccgagcc aatggccgcc ccgcctgcga ttcccggata aggaaatctg ctcacccgga 120
ccccactcca gccaaagagg tttatttccc c 151
<210> 9
<211> 178
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 9
cctccccacc cacccaccca acaccaggat ttacataggg ctcctgcggg gcgaccccct 60
ccttgcctcg ctctctccgg gatcagagag agagcgagag agagagcgcg cgcaggttgc 120
gactggaggg cctgttgggg cgctaggcag agcgcaaacc ctagatccct taagaagt 178
<210> 10
<211> 139
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 10
ccagggcagg ctggtcttga gcctggagaa ggctttgctc agcacgcgca tccgggcacg 60
ctcacgggcg ttggccgcgt tccgctgcga ctgcttgcac tctgcggctg agcccttggc 120
cgggaggggc ttcttgcca 139
<210> 11
<211> 151
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 11
ccttgcccac acgcgtcctc tttcctcccc cctggccagt ctcgctgtct ccgccttccg 60
ctccctggcg gaggcggagg ccagagagcg ctccaaggaa gactaaaaac ccaggccggg 120
aagcgcgggg tgagaaagcg aggtgggtgg c 151
<210> 12
<211> 166
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 12
tcctgacccc gaagctctag gatggagggg agaatttcta ggagcgactt ccccgtcccc 60
tccccaagca atccaccacc gcagggtcgg cgccgctcgg ccttcgctcc ccgcgcacca 120
gttctatctg tgaaggaagc aaaagccaac tcggtggaat cctgac 166
<210> 13
<211> 162
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 13
ggcccctggg gacgcggcgc ggcgctcggc gcctccgtcg gaagctcggg ggtcgggcgg 60
gagcctccgg aagggccccg cggagccggg agtccgaggc cgcgcgcacg ccgaaccgag 120
cgtaccaact ccgcgcccgg acgcgtggcg cccccccaac cc 162
<210> 14
<211> 148
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 14
agtcaccgcg ggctacgcca ctcccacccg gcacacgcga cacccgccgc gcgcaggctc 60
ctgcttgcag gtccggccgc tgctcgggcc aagtaaacac cgggctggga aagctcgctg 120
cggagccgct cgggggcaag ccagcccc 148
<210> 15
<211> 174
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 15
gctcgcggct gcagtaccca gacacctggt gcttcatcga ctggaccacc aacgtgacgg 60
cgcacgccgc ctactcctac atgtacgcgg gcttcagctc cttcctcatt ctcgccaccg 120
tcctctgcaa cgtgcttgtg tgcggcgcgc tgctccgcat gcaccgccag ttca 174
<210> 16
<211> 143
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 16
ggttgcctcc cggggccacc ccgctgcctc cccagccttg ccgcgcctca gcgactttcg 60
gcgccgccgg agcttccgcc gcatcgcggg cgccgagatc cagatggtca tcttactcat 120
tgccacctcc ctggtggtgc tca 143
<210> 17
<211> 147
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 17
ctcgtaggcg cgcggggcca ctactcacgc gcgcactgca ggcctttgcg cacgacgccc 60
cagatgaagt cgccacagag gtcgcaccac gtgtgcgtgg cgggccccgc gggctggaag 120
cggtggccac ggccagggac cagctgc 147
<210> 18
<211> 154
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 18
ctccagccgg gtgcggccct tcccagcgcg cccagcgggt gccagctccc gcagctcaat 60
gagctcaggc tcccccgaca tggcccggtt gggcccgtgc ttcgctggct ttgggcgcta 120
gcaagcgcgg gccgggcggg gccacagggc gggc 154
<210> 19
<211> 177
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 19
gcctgcccga ccggggtcgc acgagcacag gcgcccacgc catgttggct gcccaaaggg 60
ctcgccgccc aagccgggcc agaaggcagg aggcggaaaa ccagcctccg gtggcgggcg 120
aaagcaaccg ctctttctgt tctctcttcg ccctccctcg tggaaacgca gactcga 177
<210> 20
<211> 157
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 20
tggatggccc cgaggtccaa aaagaaagcg cccaacggct ggacgcacac cccgccaggc 60
ctcctggaaa cggtgccggt gctgcagagc ccgcgaggtg tctgggagtt gggcgagagc 120
tgcagacttg gaggctctta tacctccgtg caggcgg 157
<210> 21
<211> 159
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 21
cagcttgtgc cagcgactac taggtcaggg ggcgcagctg agagaattca agcaacctcc 60
ccgcaaccgg ttccgccgcg tttgtgggct ggtagcccgg aatacatttc ccagaggcct 120
tcgcggccga cgtgcttcgc gcaggaacgc agccgcctc 159
<210> 22
<211> 31
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 22
ctctattcct taataatcgt caacatctaa c 31
<210> 23
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 23
tgtacgggag tttttagtta gtagcg 26
<210> 24
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 24
aacacctcca cgcccctccg c 21
<210> 25
<211> 28
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 25
caaaacaacc tctattcctt aataatcg 28
<210> 26
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 26
ttttagttag tagcgtatgg tggg 24
<210> 27
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 27
caacatctaa caaacacctc cacgccc 27
<210> 28
<211> 29
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 28
acaaaacaac ctctattcct taataatcg 29
<210> 29
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 29
tttaatatat ttggtattgt acgggagttt 30
<210> 30
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 30
caacatctaa caaacacctc cacgccc 27
<210> 31
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 31
cggagggtta tgcgaattta gtaa 24
<210> 32
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 32
gcctctctaa ctacgtaaac tacg 24
<210> 33
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 33
cgaaaacgca aaatctctaa aaccgacccg 30
<210> 34
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 34
gagggttatg cgaatttagt aatcg 25
<210> 35
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 35
gcctctctaa ctacgtaaac tacgc 25
<210> 36
<211> 31
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 36
cgaacgaaaa cgcaaaatct ctaaaaccga c 31
<210> 37
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 37
tcaacaaccg aatcgacctc aaaa 24
<210> 38
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 38
tgggttggta gtttagagtt tattcg 26
<210> 39
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 39
cccgctcgaa ttacgcaatc cacgc 25
<210> 40
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 40
acttccgcaa taataaatca ccgtt 25
<210> 41
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 41
ttgtagtcgt cgtagtagtt gtcgtc 26
<210> 42
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 42
acatttaaat ccccgacgct ccgcc 25
<210> 43
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 43
agtgatggat tatcgtttta gtggtat 27
<210> 44
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 44
actccaaatc gacctttaca atcg 24
<210> 45
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 45
cgcaacaact accgccgcct aaact 25
<210> 46
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 46
cgcgcataac taaacctcct ac 22
<210> 47
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 47
tgtagtcgtc gtagtagttg tcg 23
<210> 48
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 48
accgacttcc gcaataataa atcaccgttt 30
<210> 49
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 49
ttagttagga gcgtatgtat ttgtcg 26
<210> 50
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 50
<210> 51
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 51
acgaacgcca acgctatacc cgcta 25
<210> 52
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 52
ccgaacgtaa actccaacca aa 22
<210> 53
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 53
cgttgtattc gttgcggtgt a 21
<210> 54
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 54
catataccta ccgtccgacg ccgcc 25
<210> 55
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 55
ggagcgtatg tatttgtcgt tcg 23
<210> 56
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 56
aaacccaata cacgcgacga a 21
<210> 57
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 57
cgctataccc gctacgatat accaccacca 30
<210> 58
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 58
<210> 59
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 59
tgggtaatta ttacgtggat tcgtttt 27
<210> 60
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 60
ataacgacca acgctcaact catccgc 27
<210> 61
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 61
<210> 62
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 62
acgccgcgaa taaactaaac g 21
<210> 63
<211> 28
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 63
aaccgctata cgccgaaaac cctaaacc 28
<210> 64
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 64
actaacccaa aatccccgac g 21
<210> 65
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 65
gggtacggtg atggttatta ttgg 24
<210> 66
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 66
ataacgacca acgctcaact catccgc 27
<210> 67
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 67
<210> 68
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 68
cgatctatcc cctcgaactc cta 23
<210> 69
<211> 19
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 69
ccgccacccg ccgaaaccc 19
<210> 70
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 70
gttgttgttg atcgtttcgt agcg 24
<210> 71
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 71
gatctatccc ctcgaactcc tacg 24
<210> 72
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 72
ccgaaaccct cctcccgaaa cccg 24
<210> 73
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 73
gggttgttgt tgatcgtttc gtag 24
<210> 74
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 74
gatctatccc ctcgaactcc tacg 24
<210> 75
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 75
cgccgaaacc ctcctcccga aaccc 25
<210> 76
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 76
<210> 77
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 77
acaaaacgat tacctaccca acg 23
<210> 78
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 78
cccgaatact acgccgacga ccaaa 25
<210> 79
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 79
aagttcgttt tcggttggaa gttg 24
<210> 80
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 80
aaacgattac ctacccaacg acca 24
<210> 81
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 81
cgcccgaata ctacgccgac gacca 25
<210> 82
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 82
aagttcgttt tcggttggaa gttg 24
<210> 83
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 83
aaacgattac ctacccaacg acca 24
<210> 84
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 84
cgcccgaata ctacgccgac gacca 25
<210> 85
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 85
aagtgaataa agttaggcgt ttacg 25
<210> 86
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 86
atccgaataa acaaatttcc ttatccg 27
<210> 87
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 87
aatcgcaaac gaaacgacca ttaactcgaa 30
<210> 88
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 88
ttaagtgaat aaagttaggc gtttacg 27
<210> 89
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 89
atccgaataa acaaatttcc ttatccg 27
<210> 90
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 90
aatcgcaaac gaaacgacca ttaactcgaa 30
<210> 91
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 91
gaataaagtt aggcgtttac gtgatt 26
<210> 92
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 92
atccgaataa acaaatttcc ttatccg 27
<210> 93
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 93
aatcgcaaac gaaacgacca ttaactcgaa 30
<210> 94
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 94
aatttacata aaactcctac gaaacga 27
<210> 95
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 95
cgttttgttt agcgttttaa taggttt 27
<210> 96
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 96
ccctccttac ctcgctctct ccgaaat 27
<210> 97
<211> 28
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 97
aaaatttaca taaaactcct acgaaacg 28
<210> 98
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 98
gcgttttgtt tagcgtttta atagg 25
<210> 99
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 99
ccctccttac ctcgctctct ccgaaat 27
<210> 100
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 100
tttacataaa actcctacga aacgacc 27
<210> 101
<211> 29
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 101
gtttagcgtt ttaataggtt ttttagtcg 29
<210> 102
<211> 29
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 102
cctccttacc tcgctctctc cgaaatcaa 29
<210> 103
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 103
<210> 104
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 104
gcaaaataca aacaatcgca acg 23
<210> 105
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 105
cgaccaacgc ccgtaaacgt accc 24
<210> 106
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 106
gtttggagaa ggttttgttt agtacg 26
<210> 107
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 107
tcaaccgcaa aatacaaaca atcg 24
<210> 108
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 108
cgaccaacgc ccgtaaacgt ac 22
<210> 109
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 109
aaactttact caacacgcgc atc 23
<210> 110
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 110
agtcgtagag tgtaagtagt cgtagcg 27
<210> 111
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 111
aacacgctca cgaacgttaa ccgcg 25
<210> 112
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 112
ttttggttag tttcgttgtt ttcgtt 26
<210> 113
<211> 28
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 113
tcccgaccta aatttttaat cttcctta 28
<210> 114
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 114
ctctctaacc tccgcctccg ccaaa 25
<210> 115
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 115
ttttggttag tttcgttgtt ttcgt 25
<210> 116
<211> 18
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 116
<210> 117
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 117
atcttcctta aaacgctctc taacctccgc 30
<210> 118
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 118
ggttagtttc gttgttttcg tttt 24
<210> 119
<211> 18
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 119
<210> 120
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 120
atcttcctta aaacgctctc taacctccgc 30
<210> 121
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 121
gagaattttt aggagcgatt ttttcg 26
<210> 122
<211> 28
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 122
ccttcacaaa taaaactaat acgcgaaa 28
<210> 123
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 123
aaaccgaacg acgccgaccc tacg 24
<210> 124
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 124
tcgaagtttt aggatggagg ggag 24
<210> 125
<211> 31
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 125
tacttccttc acaaataaaa ctaatacgcg a 31
<210> 126
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 126
accgaacgac gccgacccta cgata 25
<210> 127
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 127
ttgagtagga gaaattttga tttcg 25
<210> 128
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 128
aactaatacg cgaaaaacga aaaccg 26
<210> 129
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 129
aacgcacgcc gaccctacga taataa 26
<210> 130
<211> 19
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 130
cgttttcgtc ggaagttcg 19
<210> 131
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 131
gaaattaata cgctcgattc gacgt 25
<210> 132
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 132
cctcgaactc ccgactccgc gaaac 25
<210> 133
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 133
<210> 134
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 134
attaatacgc tcgattcgac gtacg 25
<210> 135
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 135
tacgcgcgac ctcgaactcc cgactcc 27
<210> 136
<211> 19
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 136
cgttttcgtc ggaagttcg 19
<210> 137
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 137
tacgctcgat tcgacgtacg c 21
<210> 138
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 138
ctcgaactcc cgactccgcg aa 22
<210> 139
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 139
cggtatacgc gatattcgtc g 21
<210> 140
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 140
actttcccaa cccgatattt actta 25
<210> 141
<211> 28
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 141
cccgaacaac gaccgaacct acaaacaa 28
<210> 142
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 142
ggtatacgcg atattcgtcg c 21
<210> 143
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 143
acgaactttc ccaacccgat 20
<210> 144
<211> 28
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 144
cccgaacaac gaccgaacct acaaacaa 28
<210> 145
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 145
cgcgcaaact cctacttaca aa 22
<210> 146
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 146
gggttggttt gttttcgagc g 21
<210> 147
<211> 29
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 147
ccgaccgcta ctcgaaccaa ataaacacc 29
<210> 148
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 148
aatacttcat cgactaaacc accaa 25
<210> 149
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 149
gaatgaggaa ggagttgaag ttcg 24
<210> 150
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 150
taacgacgca cgccgcctac tccta 25
<210> 151
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 151
tacttcatcg actaaaccac caac 24
<210> 152
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 152
cgtatataag tacgttgtag aggacgg 27
<210> 153
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 153
taacgacgca cgccgcctac tcc 23
<210> 154
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 154
ctaatacttc atcgactaaa ccaccaa 27
<210> 155
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 155
gaatgaggaa ggagttgaag ttcg 24
<210> 156
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 156
taacgacgca cgccgcctac tccta 25
<210> 157
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 157
<210> 158
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 158
ataaataaaa taaccatcta aatctcgacg 30
<210> 159
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 159
cgatacgacg aaaactccga cgacgc 26
<210> 160
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 160
ttttagtttt gtcgcgtttt agc 23
<210> 161
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 161
aaataaccat ctaaatctcg acgc 24
<210> 162
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 162
acgaaaactc cgacgacgcc gaa 23
<210> 163
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 163
<210> 164
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 164
gtaatgagta agatgattat ttggatttcg 30
<210> 165
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 165
ccaaccttac cgcgcctcaa cgact 25
<210> 166
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 166
cgcgcactac aaacctttac g 21
<210> 167
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 167
<210> 168
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 168
acgacgcccc aaataaaatc gccaca 26
<210> 169
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 169
ttagatgaag tcgttataga ggtcgta 27
<210> 170
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 170
aactaatccc taaccgtaac cacc 24
<210> 171
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 171
cttccaaccc gcgaaacccg ccac 24
<210> 172
<211> 27
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 172
cgttttagat gaagtcgtta tagaggt 27
<210> 173
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 173
ccgacaacta atccctaacc gtaa 24
<210> 174
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 174
cttccaaccc gcgaaacccg ccac 24
<210> 175
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 175
gtttagcggg tgttagtttt cgta 24
<210> 176
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 176
ttactaacgc ccaaaaccaa cg 22
<210> 177
<211> 29
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 177
aacacgaacc caaccgaacc atatcgaaa 29
<210> 178
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 178
<210> 179
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 179
<210> 180
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 180
aacgaaacac gaacccaacc gaacc 25
<210> 181
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 181
ttagcgggtg ttagttttcg t 21
<210> 182
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 182
<210> 183
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 183
aacgaaacac gaacccaacc gaacc 25
<210> 184
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 184
cgcccacgcc atattaacta c 21
<210> 185
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 185
ggttgttttc gttcgttatc gga 23
<210> 186
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 186
cgccgcccaa accgaaccaa aa 22
<210> 187
<211> 19
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 187
gtttgttcga tcggggtcg 19
<210> 188
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 188
<210> 189
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 189
cgtaaacgcc tatactcgta c 21
<210> 190
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 190
ttagtcgggt tagaaggtag gaggc 25
<210> 191
<211> 18
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 191
<210> 192
<211> 19
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 192
gattactttc gcccgccac 19
<210> 193
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 193
aaagaaagcg tttaacggtt gga 23
<210> 194
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 194
atctacaact ctcgcccaac tc 22
<210> 195
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 195
cgcgaactct acaacaccga cacc 24
<210> 196
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 196
cgctaaataa ccccgaaatc caa 23
<210> 197
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 197
cgggttttgt agtatcggta tcg 23
<210> 198
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 198
aaacgcccaa cgactaaacg cacac 25
<210> 199
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 199
gtttaacggt tggacgtata tttcg 25
<210> 200
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 200
atctacaact ctcgcccaac tcc 23
<210> 201
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 201
cgcgaactct acaacaccga caccg 25
<210> 202
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 202
ttagcgatta ttaggttagg gggc 24
<210> 203
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 203
tcgaccgcga aaacctctaa a 21
<210> 204
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 204
ttccgaacta ccaacccaca aacgcg 26
<210> 205
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 205
aatttaagta attttttcgt aatcgg 26
<210> 206
<211> 29
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 206
aaacgactac gttcctacgc gaaacagtc 29
<210> 207
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 207
ttccgaacta ccaacccaca aacgc 25
<210> 208
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 208
gggcgtagtt gagagaattt aagtaa 26
<210> 209
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 209
<210> 210
<211> 28
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 210
atattccgaa ctaccaaccc acaaacgc 28
<210> 211
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 211
gtgatggagg aggtttagta agtt 24
<210> 212
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 212
ccaataaaac ctactcctcc cttaa 25
<210> 213
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 213
accaccaccc aacacacaat aacaaacaca 30
Claims (7)
1. The application of the reagent for detecting the methylation level of the DNA methylation molecular marker combination in preparing the kit for detecting the benign and malignant lung nodules is characterized in that the DNA methylation molecular marker combination is as follows: the sequences of SEQ ID NO.6 and SEQ ID NO. 9; or the complete complement of SEQ ID NO.6 and SEQ ID NO. 9.
2. The application of the reagent for detecting the methylation level of the DNA methylation molecular marker combination in preparing the kit for detecting the benign and malignant lung nodules is characterized in that the DNA methylation molecular marker combination is as follows: the sequences shown in SEQ ID NO.6, SEQ ID NO.9, SEQ ID NO.15 and SEQ ID NO. 16; or a complete complementary sequence of the sequences shown in SEQ ID NO.6, SEQ ID NO.9, SEQ ID NO.15 and SEQ ID NO. 16.
3. The application of the reagent for detecting the methylation level of the DNA methylation molecular marker combination in preparing the kit for detecting the benign and malignant lung nodules is characterized in that the DNA methylation molecular marker combination is as follows: the sequences shown in SEQ ID NO.6 and SEQ ID NO.7, SEQ ID NO.8, and SEQ ID NO. 9; or a complete complement of the sequences shown in SEQ ID NO.6 and SEQ ID NO.7, SEQ ID NO.8, and SEQ ID NO. 9.
4. The application of the reagent for detecting the methylation level of the DNA methylation molecular marker combination in the preparation of the kit for detecting benign and malignant lung nodules is characterized in that the DNA methylation molecular marker combination is a sequence shown in SEQ ID NO. 1-SEQ ID NO. 21; or a complete complementary sequence of the sequences shown in SEQ ID NO. 1-SEQ ID NO. 21.
5. A kit for detecting benign and malignant pulmonary nodules, which comprises a reagent for detecting the methylation level of the DNA methylation molecular marker combination according to any one of claims 1 to 4;
the reagent comprises a primer and a probe for fluorescent quantitative PCR detection of a DNA methylation molecular marker, wherein the primer and the probe are respectively as follows:
primers and probes for SEQ ID No.1 were: primers shown as SEQ ID NO.22 and SEQ ID NO.23, and a probe shown as SEQ ID NO. 24;
primers and probes for SEQ ID No.2 were: primers shown as SEQ ID NO.34 and SEQ ID NO.35, and a probe shown as SEQ ID NO. 36;
primers and probes for SEQ ID No.3 were: primers shown as SEQ ID NO.40 and SEQ ID NO.41, and a probe shown as SEQ ID NO. 42;
primers and probes for SEQ ID No.4 were: primers shown as SEQ ID NO.55 and SEQ ID NO.56, and a probe shown as SEQ ID NO. 57;
primers and probes for SEQ ID No.5 were: primers shown as SEQ ID NO.58 and SEQ ID NO.59, and a probe shown as SEQ ID NO. 60;
primers and probes for SEQ ID No.6 were: primers shown as SEQ ID NO.67 and SEQ ID NO.68, and a probe shown as SEQ ID NO. 69;
primers and probes for SEQ ID No.7 were: primers shown as SEQ ID NO.76 and SEQ ID NO.77, and a probe shown as SEQ ID NO. 78;
primers and probes for SEQ ID No.8 were: primers shown as SEQ ID NO.88 and SEQ ID NO.89, and a probe shown as SEQ ID NO. 90;
primers and probes for SEQ ID No.9 were: primers shown as SEQ ID NO.100 and SEQ ID NO.101, and a probe shown as SEQ ID NO. 102;
primers and probes for SEQ ID No.10 were: primers shown as SEQ ID NO.109 and SEQ ID NO.110, and a probe shown as SEQ ID NO. 111;
primers and probes for SEQ ID NO.11 were: primers shown as SEQ ID NO.112 and SEQ ID NO.113, and a probe shown as SEQ ID NO. 114;
primers and probes for SEQ ID No.12 were: primers shown as SEQ ID NO.121 and SEQ ID NO.122, and a probe shown as SEQ ID NO. 123;
primers and probes for SEQ ID No.13 were: primers shown as SEQ ID NO.130 and SEQ ID NO.131, and a probe shown as SEQ ID NO. 132;
primers and probes for SEQ ID No.14 were: primers shown as SEQ ID NO.142 and SEQ ID NO.143, and a probe shown as SEQ ID NO. 144;
primers and probes for SEQ ID No.15 were: primers shown as SEQ ID NO.151 and SEQ ID NO.152, and a probe shown as SEQ ID NO. 153;
primers and probes for SEQ ID No.16 were: primers shown as SEQ ID NO.160 and SEQ ID NO.161, and a probe shown as SEQ ID NO. 162;
primers and probes for SEQ ID No.17 were: primers shown as SEQ ID NO.166 and SEQ ID NO.167, and a probe shown as SEQ ID NO. 168;
primers and probes for SEQ ID No.18 were: primers shown as SEQ ID NO.175 and SEQ ID NO.176 and a probe shown as SEQ ID NO. 177;
primers and probes for SEQ ID No.19 were: primers shown as SEQ ID NO.184 and SEQ ID NO.185, and a probe shown as SEQ ID NO. 186;
primers and probes for SEQ ID No.20 were: primers shown as SEQ ID NO.199 and SEQ ID NO.200, and a probe shown as SEQ ID NO. 201;
primers and probes for SEQ ID No.21 were: primers shown as SEQ ID NO.208 and SEQ ID NO.209, and a probe shown as SEQ ID NO. 210.
6. The kit for detecting benign and malignant lung nodules according to claim 5, wherein said kit further comprises primers and probes for fluorescent quantitative PCR detection of reference gene ACTB.
7. The kit for detecting benign and malignant lung nodules according to claim 6, wherein said primers and probes for reference gene ACTB are: primers shown as SEQ ID NO.211 and SEQ ID NO.212, and a probe shown as SEQ ID NO. 213.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010346298.7A CN113637746B (en) | 2020-04-27 | 2020-04-27 | Methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof |
PCT/CN2021/086646 WO2021218616A1 (en) | 2020-04-27 | 2021-04-12 | Methylation molecular marker or combination thereof for detecting benign and malignant lung nodules, and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010346298.7A CN113637746B (en) | 2020-04-27 | 2020-04-27 | Methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113637746A CN113637746A (en) | 2021-11-12 |
CN113637746B true CN113637746B (en) | 2022-09-20 |
Family
ID=78415228
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010346298.7A Active CN113637746B (en) | 2020-04-27 | 2020-04-27 | Methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113637746B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2022127717A1 (en) * | 2020-12-17 | 2022-06-23 | 广州市基准医疗有限责任公司 | Methylation molecular marker or combination thereof for detecting benign and malignant pulmonary nodules and use thereof |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP6543569B2 (en) * | 2012-05-22 | 2019-07-10 | ザ・ジョンズ・ホプキンス・ユニバーシティ | Quantitative multiplex methylation specific PCR method-cMeth DNA, reagent, and use thereof |
WO2017192221A1 (en) * | 2016-05-05 | 2017-11-09 | Exact Sciences Corporation | Detection of lung neoplasia by analysis of methylated dna |
WO2018069450A1 (en) * | 2016-10-14 | 2018-04-19 | Aarhus Universitet | Methylation biomarkers for lung cancer |
CN110317875A (en) * | 2019-07-30 | 2019-10-11 | 苏州呼呼健康科技有限公司 | One kind methylated genes relevant to lung cancer and its detection kit |
CN110387421A (en) * | 2019-08-28 | 2019-10-29 | 深圳市新合生物医疗科技有限公司 | DNA methylation qPCR kit and application method for lung cancer detection |
-
2020
- 2020-04-27 CN CN202010346298.7A patent/CN113637746B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN113637746A (en) | 2021-11-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108977543B (en) | Colorectal cancer early diagnosis reagent based on joint detection of SDC2 and SFRP2 gene methylation level | |
WO2021128519A1 (en) | Combination of dna methylation biomarkers, and detection method therefor and kit thereof | |
CN114277135B (en) | Gastric cancer lymph node metastasis related methylation biomarker and combination and detection kit thereof | |
CN114317738B (en) | Methylation biomarker related to detection of gastric cancer lymph node metastasis or combination and application thereof | |
WO2022143396A1 (en) | Reagent kit for methylation detection of breast tumor specificity | |
CN112538529B (en) | Molecular marker for colorectal cancer in blood and detection kit and method thereof | |
CN113637746B (en) | Methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof | |
CN113637745B (en) | Methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof | |
WO2021185274A1 (en) | Probe composition for detecting 6 cancers with high incidence in china | |
CN114645087B (en) | Methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof | |
CN111961723A (en) | Tumor marker for noninvasive detection of early ovarian cancer diagnosis and kit | |
CN113430272B (en) | Reagent and kit for diagnosis or auxiliary diagnosis of esophageal cancer or precancerous lesion and application of reagent and kit | |
CN112852934B (en) | Primer, reagent and kit for detecting gene methylation | |
CN114807350A (en) | Biomarker for judging benign and malignant thyroid nodules, polygene joint inspection kit and application | |
CN114645043B (en) | Methylated molecular marker combination for detecting benign and malignant lung nodules and application | |
WO2021218616A1 (en) | Methylation molecular marker or combination thereof for detecting benign and malignant lung nodules, and application thereof | |
WO2024001668A1 (en) | Methylation molecular marker for detecting benign and malignant pulmonary nodules and use thereof | |
CN112226514B (en) | Marker combination for early gastric cancer detection, kit and application thereof | |
CN116676384A (en) | Biomarker for gastric cancer, nucleic acid product and kit | |
CN116083586A (en) | Nucleic acid product, kit and application for diagnosing esophageal cancer | |
CN116287229A (en) | Reagent and kit for detecting esophagus cancer | |
CN117431315A (en) | Methylation biomarker for colorectal cancer lymph node metastasis detection and detection kit | |
CN116411073A (en) | Nucleic acid product for detecting esophageal cancer or precancerous lesions, kit and application | |
CN117587125A (en) | Application of reagent for detecting methylation in preparation of pancreatic cancer diagnosis product | |
CN116814779A (en) | Biomarker for gastric cancer, nucleic acid product and kit |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |