CN117568473A - Methylation molecular marker for auxiliary diagnosis of cancer - Google Patents
Methylation molecular marker for auxiliary diagnosis of cancer Download PDFInfo
- Publication number
- CN117568473A CN117568473A CN202210372719.2A CN202210372719A CN117568473A CN 117568473 A CN117568473 A CN 117568473A CN 202210372719 A CN202210372719 A CN 202210372719A CN 117568473 A CN117568473 A CN 117568473A
- Authority
- CN
- China
- Prior art keywords
- cancer
- seq
- ccdc12
- dna fragment
- lung
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000011987 methylation Effects 0.000 title claims abstract description 167
- 238000007069 methylation reaction Methods 0.000 title claims abstract description 167
- 206010028980 Neoplasm Diseases 0.000 title claims abstract description 149
- 201000011510 cancer Diseases 0.000 title claims abstract description 124
- 238000003745 diagnosis Methods 0.000 title claims description 32
- 239000003147 molecular marker Substances 0.000 title abstract description 8
- 208000020816 lung neoplasm Diseases 0.000 claims abstract description 236
- 206010058467 Lung neoplasm malignant Diseases 0.000 claims abstract description 235
- 201000005202 lung cancer Diseases 0.000 claims abstract description 235
- 206010006187 Breast cancer Diseases 0.000 claims abstract description 220
- 208000026310 Breast neoplasm Diseases 0.000 claims abstract description 217
- 101150079177 CCDC12 gene Proteins 0.000 claims abstract description 121
- 101100219874 Homo sapiens CCDC12 gene Proteins 0.000 claims abstract description 121
- 206010054107 Nodule Diseases 0.000 claims abstract description 65
- 238000002360 preparation method Methods 0.000 claims abstract description 8
- 230000002401 inhibitory effect Effects 0.000 claims abstract description 7
- 230000001737 promoting effect Effects 0.000 claims abstract description 7
- 239000003550 marker Substances 0.000 claims abstract description 6
- 239000012634 fragment Substances 0.000 claims description 184
- 108020004414 DNA Proteins 0.000 claims description 178
- 108091029430 CpG site Proteins 0.000 claims description 160
- 210000000481 breast Anatomy 0.000 claims description 76
- 210000004072 lung Anatomy 0.000 claims description 71
- 238000013178 mathematical model Methods 0.000 claims description 65
- 238000000034 method Methods 0.000 claims description 43
- 238000001514 detection method Methods 0.000 claims description 41
- 102000053602 DNA Human genes 0.000 claims description 20
- 108020004682 Single-Stranded DNA Proteins 0.000 claims description 20
- 239000002773 nucleotide Substances 0.000 claims description 20
- 125000003729 nucleotide group Chemical group 0.000 claims description 20
- 230000004069 differentiation Effects 0.000 claims description 14
- 238000007477 logistic regression Methods 0.000 claims description 11
- 238000012360 testing method Methods 0.000 claims description 11
- 239000003153 chemical reaction reagent Substances 0.000 claims description 9
- 238000007405 data analysis Methods 0.000 claims description 8
- 238000012545 processing Methods 0.000 claims description 8
- 239000000126 substance Substances 0.000 claims description 7
- 239000003795 chemical substances by application Substances 0.000 claims description 6
- 210000004369 blood Anatomy 0.000 abstract description 32
- 239000008280 blood Substances 0.000 abstract description 32
- 238000013399 early diagnosis Methods 0.000 abstract description 7
- 230000000694 effects Effects 0.000 abstract description 6
- 230000006607 hypermethylation Effects 0.000 abstract description 3
- 239000012491 analyte Substances 0.000 abstract 1
- 239000000523 sample Substances 0.000 description 74
- 101000777336 Homo sapiens Coiled-coil domain-containing protein 12 Proteins 0.000 description 20
- 210000001165 lymph node Anatomy 0.000 description 20
- 230000008595 infiltration Effects 0.000 description 19
- 238000001764 infiltration Methods 0.000 description 19
- 208000010507 Adenocarcinoma of Lung Diseases 0.000 description 17
- 201000005249 lung adenocarcinoma Diseases 0.000 description 17
- 208000000587 small cell lung carcinoma Diseases 0.000 description 17
- 102100031046 Coiled-coil domain-containing protein 12 Human genes 0.000 description 16
- IGRCWJPBLWGNPX-UHFFFAOYSA-N 3-(2-chlorophenyl)-n-(4-chlorophenyl)-n,5-dimethyl-1,2-oxazole-4-carboxamide Chemical compound C=1C=C(Cl)C=CC=1N(C)C(=O)C1=C(C)ON=C1C1=CC=CC=C1Cl IGRCWJPBLWGNPX-UHFFFAOYSA-N 0.000 description 15
- 230000007067 DNA methylation Effects 0.000 description 15
- 206010041823 squamous cell carcinoma Diseases 0.000 description 14
- 206010041067 Small cell lung cancer Diseases 0.000 description 13
- 239000000047 product Substances 0.000 description 13
- 206010056342 Pulmonary mass Diseases 0.000 description 11
- 238000001574 biopsy Methods 0.000 description 9
- 201000010983 breast ductal carcinoma Diseases 0.000 description 9
- 206010073096 invasive lobular breast carcinoma Diseases 0.000 description 9
- 238000003384 imaging method Methods 0.000 description 8
- 238000011065 in-situ storage Methods 0.000 description 8
- 210000000265 leukocyte Anatomy 0.000 description 8
- 230000004083 survival effect Effects 0.000 description 8
- 238000012549 training Methods 0.000 description 8
- 208000037396 Intraductal Noninfiltrating Carcinoma Diseases 0.000 description 7
- 206010073094 Intraductal proliferative breast lesion Diseases 0.000 description 7
- 208000028715 ductal breast carcinoma in situ Diseases 0.000 description 7
- 201000007273 ductal carcinoma in situ Diseases 0.000 description 7
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 6
- 238000004820 blood count Methods 0.000 description 6
- 208000014581 breast ductal adenocarcinoma Diseases 0.000 description 6
- 238000001269 time-of-flight mass spectrometry Methods 0.000 description 6
- 208000006402 Ductal Carcinoma Diseases 0.000 description 5
- 101100495925 Schizosaccharomyces pombe (strain 972 / ATCC 24843) chr3 gene Proteins 0.000 description 5
- 108091081021 Sense strand Proteins 0.000 description 5
- 239000000427 antigen Substances 0.000 description 5
- 102000036639 antigens Human genes 0.000 description 5
- 108091007433 antigens Proteins 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 238000010835 comparative analysis Methods 0.000 description 5
- 208000030776 invasive breast carcinoma Diseases 0.000 description 5
- 206010073095 invasive ductal breast carcinoma Diseases 0.000 description 5
- 108090000623 proteins and genes Proteins 0.000 description 5
- 230000035945 sensitivity Effects 0.000 description 5
- 208000037162 Ductal Breast Carcinoma Diseases 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 206010006272 Breast mass Diseases 0.000 description 3
- 102000012406 Carcinoembryonic Antigen Human genes 0.000 description 3
- 108010022366 Carcinoembryonic Antigen Proteins 0.000 description 3
- ZOKXTWBITQBERF-UHFFFAOYSA-N Molybdenum Chemical compound [Mo] ZOKXTWBITQBERF-UHFFFAOYSA-N 0.000 description 3
- 238000013276 bronchoscopy Methods 0.000 description 3
- 238000011976 chest X-ray Methods 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 230000003211 malignant effect Effects 0.000 description 3
- 229910052750 molybdenum Inorganic materials 0.000 description 3
- 239000011733 molybdenum Substances 0.000 description 3
- 230000001575 pathological effect Effects 0.000 description 3
- 238000004393 prognosis Methods 0.000 description 3
- 238000007619 statistical method Methods 0.000 description 3
- 238000001356 surgical procedure Methods 0.000 description 3
- 208000009458 Carcinoma in Situ Diseases 0.000 description 2
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 2
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 2
- 206010036790 Productive cough Diseases 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 201000008275 breast carcinoma Diseases 0.000 description 2
- 201000003714 breast lobular carcinoma Diseases 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 210000004027 cell Anatomy 0.000 description 2
- 210000000038 chest Anatomy 0.000 description 2
- 238000003759 clinical diagnosis Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000002380 cytological effect Effects 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical class NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 201000004933 in situ carcinoma Diseases 0.000 description 2
- 201000005243 lung squamous cell carcinoma Diseases 0.000 description 2
- 238000010827 pathological analysis Methods 0.000 description 2
- 102000004169 proteins and genes Human genes 0.000 description 2
- 230000002685 pulmonary effect Effects 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 210000003802 sputum Anatomy 0.000 description 2
- 208000024794 sputum Diseases 0.000 description 2
- 241000143060 Americamysis bahia Species 0.000 description 1
- 206010055113 Breast cancer metastatic Diseases 0.000 description 1
- 208000005623 Carcinogenesis Diseases 0.000 description 1
- 201000009030 Carcinoma Diseases 0.000 description 1
- 208000005443 Circulating Neoplastic Cells Diseases 0.000 description 1
- 102100023755 Coiled-coil domain-containing protein 192 Human genes 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 208000001490 Dengue Diseases 0.000 description 1
- 206010012310 Dengue fever Diseases 0.000 description 1
- 208000032843 Hemorrhage Diseases 0.000 description 1
- 101000978235 Homo sapiens Coiled-coil domain-containing protein 192 Proteins 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 102100033420 Keratin, type I cytoskeletal 19 Human genes 0.000 description 1
- 108010066302 Keratin-19 Proteins 0.000 description 1
- 208000000265 Lobular Carcinoma Diseases 0.000 description 1
- 206010062049 Lymphocytic infiltration Diseases 0.000 description 1
- 102000043276 Oncogene Human genes 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 102000052575 Proto-Oncogene Human genes 0.000 description 1
- 108700020978 Proto-Oncogene Proteins 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 108010036226 antigen CYFRA21.1 Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000001680 brushing effect Effects 0.000 description 1
- 230000036952 cancer formation Effects 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 231100000504 carcinogenesis Toxicity 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000010219 correlation analysis Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 208000025729 dengue disease Diseases 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 208000016356 hereditary diffuse gastric adenocarcinoma Diseases 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 208000014674 injury Diseases 0.000 description 1
- 208000024312 invasive carcinoma Diseases 0.000 description 1
- 230000005865 ionizing radiation Effects 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 230000004199 lung function Effects 0.000 description 1
- 230000036210 malignancy Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012821 model calculation Methods 0.000 description 1
- 210000004877 mucosa Anatomy 0.000 description 1
- 238000013188 needle biopsy Methods 0.000 description 1
- 208000015122 neurodegenerative disease Diseases 0.000 description 1
- 238000013421 nuclear magnetic resonance imaging Methods 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 201000003144 pneumothorax Diseases 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 238000012207 quantitative assay Methods 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000011470 radical surgery Methods 0.000 description 1
- 230000007363 regulatory process Effects 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000000528 statistical test Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- 210000000779 thoracic wall Anatomy 0.000 description 1
- 238000001196 time-of-flight mass spectrum Methods 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000008733 trauma Effects 0.000 description 1
- 238000002604 ultrasonography Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/30—Detection of binding sites or motifs
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/10—Signal processing, e.g. from mass spectrometry [MS] or from PCR
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/112—Disease subtyping, staging or classification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/154—Methylation markers
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Chemical & Material Sciences (AREA)
- Medical Informatics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biophysics (AREA)
- Pathology (AREA)
- Public Health (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Organic Chemistry (AREA)
- Theoretical Computer Science (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Analytical Chemistry (AREA)
- Databases & Information Systems (AREA)
- Wood Science & Technology (AREA)
- Epidemiology (AREA)
- Immunology (AREA)
- Data Mining & Analysis (AREA)
- Zoology (AREA)
- Biomedical Technology (AREA)
- Oncology (AREA)
- Signal Processing (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- General Engineering & Computer Science (AREA)
- Crystallography & Structural Chemistry (AREA)
- Primary Health Care (AREA)
- Hospice & Palliative Care (AREA)
- Artificial Intelligence (AREA)
- Bioethics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
Abstract
The invention discloses a methylation molecular marker which can be used for assisting in diagnosing cancers. The invention provides an application of a methylated CCDC12 gene serving as a marker in the preparation of products; the use of the product is at least one of the following: aiding in diagnosing cancer or predicting the risk of developing cancer; aiding in distinguishing benign nodules from cancers; aiding in distinguishing between different subtypes of cancer; aiding in differentiating different stages of cancer; aiding in differentiating between different cancers; determining whether the analyte has an inhibitory or promoting effect on the occurrence of cancer; the cancer may be lung cancer or breast cancer. The research of the invention discovers the hypermethylation phenomenon of the CCDC12 gene in the blood of patients with lung cancer and breast cancer, and the invention has important scientific significance and clinical application value for improving the early diagnosis and treatment effects of lung cancer and breast cancer and reducing the death rate.
Description
Technical Field
The invention relates to the field of medicine, in particular to a methylation molecular marker for assisting in diagnosing cancers
Background
Lung cancer is a malignant tumor that occurs in the epithelium of the bronchial mucosa, and in recent decades, the morbidity and mortality rate have been on the rise, being the cancer with the highest morbidity and mortality rate worldwide. Although new progress has been made in diagnostic methods, surgical techniques, and chemotherapeutics in recent years, the overall 5-year survival rate of lung cancer patients is only 16%, mainly because most lung cancer patients have been shifted at the time of diagnosis and have lost the opportunity for radical surgery. The study shows that the prognosis of lung cancer is directly related to stage, the survival rate of lung cancer in stage I for 5 years is 83%, the survival rate in stage II is 53%, the survival rate in stage III is 26%, and the survival rate in stage IV is 6%. Thus, the key to reducing mortality in lung cancer patients is early diagnosis and early treatment.
The main lung cancer diagnosis methods at present are as follows: (1) imaging method: such as chest X-rays and low dose helical CT. However, early lung cancer is difficult to detect by chest X-ray. Although low-dose spiral CT can find nodules in the lung, the false positive rate is as high as 96.4%, and unnecessary psychological burden is brought to a person to be checked. At the same time, chest X-rays and low dose helical CT are not suitable for frequent use due to radiation. In addition, imaging methods are also often affected by equipment and physician experience, as well as effective film reading time. (2) cytological methods: such as sputum cytology, bronchoscopy brush or biopsy, bronchoalveolar lavage cytology, etc. Sputum cytology and bronchoscopy have less sensitivity to peripheral lung cancer. Meanwhile, the operation of brushing a piece under a bronchoscope or taking a biopsy and performing cytological examination on bronchoalveolar lavage fluid is complicated, and the comfort level of a physical examination person is poor. (3) serum tumor markers commonly used at present: carcinoembryonic antigen (CEA), carbohydrate antigen (CA 125/153/199), cytokeratin 19 fragment antigen (CYFRA 21-1), and Neuron Specific Enolase (NSE), etc. These serum tumor markers have limited sensitivity to lung cancer, typically 30% -40%, and even lower for stage I tumors. Furthermore, the tumor specificity is limited, and is affected by many benign diseases such as benign tumor, inflammation, degenerative diseases and the like. At present, the tumor markers are mainly used for screening malignant tumors and rechecking tumor treatment effects. Therefore, further development of a highly efficient and specific early diagnosis technique for lung cancer is required.
The most effective method of pulmonary nodule diagnosis currently internationally accepted is chest low dose helical CT screening. However, the low-dose helical CT has high sensitivity, and a large number of nodules can be found, but it is difficult to determine whether or not the subject is benign or malignant. In the found nodules, the proportion of malignancy was still less than 4%. Currently, clinical identification of benign and malignant lung nodules requires long-term follow-up, repeated CT examination, or invasive examination methods relying on biopsy sampling of lung nodules (including chest wall fine needle biopsy, bronchoscopy biopsy, thoracoscopy or open chest lung biopsy), and the like. CT guided or ultrasound guided transthoracic biopsy has higher sensitivity, but has lower diagnosis rate for nodules <2cm, 30-70% missed diagnosis rate, and higher occurrence rate of pneumothorax and hemorrhage. The incidence rate of the aspiration biopsy complications of the bronchoscope needle is relatively low, but the diagnosis rate of the surrounding nodules is limited, the diagnosis rate of the nodules less than or equal to 2cm is only 34%, and the diagnosis rate of the nodules greater than 2cm is 63%. Surgical excision has a high diagnostic rate and can directly treat the node, but can cause a transient decline in patient lung function, and if the node is benign, the patient performs unnecessary surgery, resulting in excessive medical treatment. Therefore, there is a strong need for new in vitro diagnostic molecular markers to aid in the identification of pulmonary nodules, while reducing the rate of missed diagnosis and minimizing unnecessary punctures or surgeries.
Breast cancer is a malignant tumor caused by uncontrolled proliferation of mammary epithelial cells. On the one hand, breast cancer is one of the most common malignant tumors of females worldwide, and the incidence rate is the first malignant tumor of females. On the other hand, the survival rate of breast cancer is related to the class and stage of the tumor. The 5-year survival prognosis for early stage breast cancer is generally higher than 60%, but for advanced breast cancer this number falls to 40-60%. For metastatic breast cancer, the prognosis for 5 years survival is typically about 15%. Therefore, the improvement of the early detection rate of the breast cancer is necessary for effective diagnosis and treatment of the later stage of the breast cancer. At present, clinical medicine mainly has two modes of imaging and pathology for early screening diagnosis of breast cancer. The B-type ultrasonic imaging in the imaging diagnosis is non-radiative, but is limited by the mechanism of ultrasonic imaging, and the method has poor resolution for lesions with smaller volume and insignificant echo change and is easy to miss. The breast molybdenum target examination technology is a technology for shooting breasts by using low-dose breast X-rays, can clearly display the structural condition of each tissue of the breasts, but has higher false positive rate in breast molybdenum target examination, and needs to puncture the breasts of a patient to perform more accurate judgment, and in addition, the breast molybdenum target has damages such as ionizing radiation and the like to the patient. The breast nuclear magnetic resonance imaging technology for checking breast tissues and generating internal images by using magnetic energy and radio waves is mainly suitable for screening high-risk groups of breast cancer. The pathological diagnosis mainly comprises breast biopsy, which is a method for taking pathological tissues for pathological diagnosis, however, the biopsy operation is very resistant to patients due to human trauma. In addition, some common tumor markers such as tumor antigen 15-3, tumor antigen 27.29, carcinoembryonic antigen, tumor antigen 125 and circulating tumor cells are used for diagnosing breast cancer, but the specificity and sensitivity thereof are required to be improved, and are generally used in combination with imaging studies. Therefore, more sensitive and specific early breast cancer molecular markers are urgently discovered.
DNA methylation is a chemical modification important on genes that affects the regulatory process of gene transcription and nuclear structure. Alterations in DNA methylation are early events and concomitant events in cancer progression, and are mainly manifested by hypermethylation of oncogenes and hypomethylation of protooncogenes on tumor tissues, etc. However, there is less reported correlation between DNA methylation in blood and tumorigenesis development. In addition, blood is easy to collect, DNA methylation is stable, and if a tumor-specific blood DNA methylation molecular marker can be found, the DNA methylation molecular marker has great clinical application value. Therefore, the research and development of blood DNA methylation diagnosis technology suitable for clinical detection has important clinical application value and social significance for improving early diagnosis and treatment effect of lung cancer and reducing death rate.
Disclosure of Invention
The invention aims to provide a Coiled-coil domain protein 12 (coded-Coil Domain Containing, CCDC12) methylation marker and a kit for assisting in diagnosing cancers.
In a first aspect, the invention claims the use of a methylated CCDC12 gene as a marker in the preparation of a product. The use of the product may be at least one of the following:
(1) Aiding in diagnosing cancer or predicting the risk of developing cancer;
(2) Aiding in distinguishing benign nodules from cancers;
(3) Aiding in distinguishing between different subtypes of cancer;
(4) Aiding in differentiating different stages of cancer;
(5) Auxiliary diagnosis of lung cancer or prediction of lung cancer risk;
(6) Assisting in distinguishing benign nodules of the lung from lung cancer;
(7) Assisting in distinguishing different subtypes of lung cancer;
(8) Auxiliary differentiation of different stages of lung cancer;
(9) Auxiliary diagnosis of breast cancer or prediction of breast cancer risk;
(10) Aiding in distinguishing benign nodules of breast from breast cancer;
(11) Assisting in distinguishing different subtypes of breast cancer;
(12) Assisting in distinguishing different stages of breast cancer;
(13) Auxiliary differentiation between lung and breast cancer;
(14) Determining whether the test agent has an inhibitory or promoting effect on the occurrence of cancer.
Further, the auxiliary diagnosis of cancer described in (1) may be embodied as at least one of the following: aiding in distinguishing cancer patients from non-cancerous controls (it is understood that no cancer is present and ever and no benign nodules of the lung and breast are reported and blood normative indicators are within reference); helping to distinguish between different cancers.
Further, the benign nodules in (2) are benign nodules corresponding to the cancer in (2), such as benign nodules of the lung and lung cancer; or benign nodules of the breast and breast cancer.
Further, the different subtypes of cancer described in (3) may be pathological, such as histological, types.
Further, the different stage of the cancer in (4) may be a clinical stage or a TNM stage.
In a specific embodiment of the present invention, the auxiliary diagnosis of lung cancer described in (5) is embodied as at least one of the following: can be used for assisting in distinguishing lung cancer patients from non-cancer controls, assisting in distinguishing lung adenocarcinoma patients from non-cancer controls, assisting in distinguishing lung squamous cancer patients from non-cancer controls, assisting in distinguishing small cell lung cancer patients from non-cancer controls, assisting in distinguishing stage I lung cancer patients from non-cancer controls, assisting in distinguishing stage II-III lung cancer patients from non-cancer controls, assisting in distinguishing lung cancer patients without lymph node infiltration from non-cancer controls, and assisting in distinguishing lung cancer patients with lymph node infiltration from non-cancer controls. Wherein, the cancer-free control is understood to be that no cancer is present and no benign nodules of the lung are reported and the blood routine index is within the reference range.
In a specific embodiment of the present invention, the assisting in distinguishing benign nodules of the lung from lung cancer in (6) is embodied as at least one of: can help to distinguish lung cancer from benign lung nodules, can help to distinguish lung adenocarcinoma from benign lung nodules, can help to distinguish lung squamous cell carcinoma from benign lung nodules, can help to distinguish small cell lung cancer from benign lung nodules, can help to distinguish stage I lung cancer from benign lung nodules, can help to distinguish stage II-III lung cancer from benign lung nodules, can help to distinguish lung cancer without node infiltration from benign lung nodules, can help to distinguish lung cancer with node infiltration from benign lung nodules.
In a specific embodiment of the present invention, the assisting in differentiating between different subtypes of lung cancer described in (7) is embodied as: can help to distinguish any two of lung adenocarcinoma, lung squamous carcinoma and small cell lung carcinoma.
In a specific embodiment of the present invention, the assisting in differentiating different stages of lung cancer described in (8) is embodied as at least one of: any two of the lung cancer of the T1 stage, the lung cancer of the T2 stage and the lung cancer of the T3 stage can be assisted to be distinguished; can help to distinguish lung cancer without lymph node infiltration from lung cancer with lymph node infiltration; can help to distinguish any two of clinical lung cancer in stage I, clinical lung cancer in stage II and clinical lung cancer in stage III.
In a specific embodiment of the present invention, the auxiliary diagnosis of breast cancer described in (9) is embodied as at least one of the following: can help distinguish breast cancer patients from non-cancerous female controls. Can assist in distinguishing breast duct carcinoma in situ from non-cancerous controls, can assist in distinguishing breast invasive duct carcinoma from non-cancerous female controls, can assist in distinguishing breast invasive lobular carcinoma from non-cancerous female controls, can assist in distinguishing stage I breast carcinoma from non-cancerous female controls, can assist in distinguishing stage II-III breast carcinoma from non-cancerous female controls, can assist in distinguishing non-lymph node invasive breast carcinoma from non-cancerous female controls, and can assist in distinguishing lymph node invasive breast carcinoma from non-cancerous female controls. Wherein, the cancer-free female control can be understood as a female who has no cancer at present and once and has not reported benign nodules of the breast and whose blood routine index is within the reference range.
In a specific embodiment of the present invention, the assisting in distinguishing benign nodules of breast from breast cancer in (10) is embodied as at least one of: can help distinguish breast cancer and benign nodules of breast, can help distinguish ductal carcinoma in situ and benign nodules of breast, can help distinguish ductal carcinoma of breast from benign nodules of breast, can help distinguish lobular carcinoma of breast from benign nodules of breast, can help distinguish stage I breast cancer from benign nodules of breast, can help distinguish stage II-III breast cancer from benign nodules of breast, can help distinguish breast cancer without lymph node infiltration from benign nodules of breast, can help distinguish breast cancer with lymph node infiltration from benign nodules of breast.
In a specific embodiment of the present invention, the assisting in distinguishing between different subtypes of breast cancer described in (11) is embodied as: can help to distinguish any two of breast duct carcinoma in situ, breast invasive duct carcinoma and breast invasive lobular carcinoma.
In a specific embodiment of the present invention, the assisting in distinguishing between different stages of breast cancer in (12) is embodied as at least one of: any two of the T1-stage breast cancer, the T2-stage breast cancer and the T3-stage breast cancer can be assisted to be distinguished; can help to distinguish breast cancer without lymph node infiltration from breast cancer with lymph node infiltration; can help to distinguish any two of clinical stage I breast cancer, clinical stage II breast cancer and clinical stage III breast cancer.
In the above (1) - (14), the cancer may be a cancer capable of causing an increase in the methylation level of CCDC12 gene in the body, such as lung cancer, breast cancer, etc.
In a second aspect, the invention claims the use of a substance for detecting the methylation level of the CCDC12 gene in the preparation of a product. The use of the product may be at least one of the foregoing (1) to (14).
In a third aspect, the invention claims the use of a substance for detecting the methylation level of the CCDC12 gene and a medium storing mathematical modeling methods and/or usage methods for the preparation of a product. The use of the product may be at least one of the foregoing (1) to (14).
The mathematical model may be obtained by a method comprising the steps of:
(A1) Detecting CCDC12 gene methylation levels (training set) of n1 type a samples and n2 type B samples, respectively;
(A2) Taking the CCDC12 gene methylation level data of all samples obtained in the step (A1), establishing a mathematical model according to the classification modes of the A type and the B type by a two-classification logistic regression method, and determining the threshold value of classification judgment.
Wherein n1 and n2 in (A1) are positive integers of 50 or more.
The using method of the mathematical model comprises the following steps:
(B1) Detecting the methylation level of the CCDC12 gene of a sample to be detected;
(B2) Substituting the CCDC12 gene methylation level data of the sample to be detected obtained in the step (B1) into the mathematical model to obtain a detection index; and then comparing the detection index with a threshold value, and determining whether the type of the sample to be detected is A type or B type according to the comparison result.
In a specific embodiment of the present invention, the threshold is set to 0.5. More than 0.5 is classified as one type, less than 0.5 is classified as another type, and 0.5 is equal as an undefined gray zone. Wherein the A type and the B type are two corresponding classifications, the two classifications are grouped, which group is the A type and which group is the B type, and the A type and the B type are determined according to a specific mathematical model without convention.
In practical applications, the threshold may also be determined according to the maximum approximate sign-up index (specifically, may be a value corresponding to the maximum approximate sign-up index). Greater than the threshold is classified as one class, less than the threshold is classified as another class, and equal to the threshold as an indeterminate gray zone. Wherein the A type and the B type are two corresponding classifications, the two classifications are grouped, which group A type and which group B type are determined according to a specific mathematical model, and no convention is needed.
The type a sample and the type B sample may be any one of the following:
(C1) Lung cancer samples and no cancer controls;
(C2) Lung cancer samples and lung benign nodule samples;
(C3) A sample of different subtypes of lung cancer;
(C4) Samples of lung cancer at different stages;
(C5) Breast cancer samples and cancer-free female controls;
(C6) Breast cancer samples and breast benign nodule samples;
(C7) A sample of different subtypes of breast cancer;
(C8) Breast cancer samples of different stages;
(C9) Lung cancer samples and breast cancer samples.
In a fourth aspect, the invention claims the use of a medium storing a mathematical model building method and/or a use method as described in the third aspect above for the manufacture of a product. The use of the product may be at least one of the foregoing (1) to (14).
In a fifth aspect, the invention claims a kit.
The kit claimed in the present invention comprises a substance for detecting the methylation level of the CCDC12 gene. The use of the kit may be at least one of the foregoing (1) to (14).
Further, the kit may further comprise a medium storing the mathematical model creation method and/or the use method described in the third or fourth aspect.
In a sixth aspect, the invention claims a system.
The claimed system includes:
(D1) Reagents and/or instrumentation for detecting the methylation level of the CCDC12 gene;
(D2) A device comprising a unit X and a unit Y;
the unit X is used for establishing a mathematical model and comprises a data acquisition module, a data analysis processing module and a model output module;
the data acquisition module is configured to acquire (D1) CCDC12 gene methylation level data of n 1A type samples and n 2B type samples detected;
the data analysis processing module is configured to receive the CCDC12 gene methylation level data of the n 1A type samples and the n 2B type samples from the data acquisition module, establish a mathematical model according to a classification mode of the A type and the B type through a two-classification logistic regression method, and determine a threshold value of classification judgment;
the model output module is configured to receive the mathematical model established by the data analysis processing module and output the mathematical model;
the unit Y is used for determining the type of the sample to be detected and comprises a data input module, a data operation module, a data comparison module and a conclusion output module;
the data input module is configured to input (D1) the detected CCDC12 gene methylation level data of the to-be-detected person;
The data operation module is configured to receive the CCDC12 gene methylation level data of the person to be tested from the data input module, and substitutes the CCDC12 gene methylation level data of the person to be tested into the mathematical model to calculate a detection index;
the data comparison module is configured to receive the detection index calculated by the data operation module and compare the detection index with the threshold value determined by the data analysis processing module in the unit X;
the conclusion output module is configured to receive the comparison result from the data comparison module and output a conclusion of whether the type of the sample to be tested is A type or B type according to the comparison result of the data comparison module;
the type a sample and the type B sample may be any one of the following:
(C1) Lung cancer samples and no cancer controls;
(C2) Lung cancer samples and lung benign nodule samples;
(C3) A sample of different subtypes of lung cancer;
(C4) Samples of lung cancer at different stages;
(C5) Breast cancer samples and cancer-free female controls;
(C6) Breast cancer samples and breast benign nodule samples;
(C7) A sample of different subtypes of breast cancer;
(C8) Breast cancer samples of different stages;
(C9) Lung cancer samples and breast cancer samples.
Wherein, n1 and n2 can be positive integers more than 50.
In a specific embodiment of the present invention, the threshold is set to 0.5. More than 0.5 is classified as one type, less than 0.5 is classified as another type, and 0.5 is equal as an undefined gray zone. Wherein the A type and the B type are two corresponding classifications, the two classifications are grouped, which group is the A type and which group is the B type, and the A type and the B type are determined according to a specific mathematical model without convention.
In practical applications, the threshold may also be determined according to the maximum approximate sign-up index (specifically, may be a value corresponding to the maximum approximate sign-up index). Greater than the threshold is classified as one class, less than the threshold is classified as another class, and equal to the threshold as an indeterminate gray zone. Wherein the A type and the B type are two corresponding classifications, the two classifications are grouped, which group is the A type and which group is the B type, and the A type and the B type are determined according to a specific mathematical model without convention.
In the foregoing aspects, the methylation level of the CCDC12 gene may be the methylation level of all or part of CpG sites in the fragments of the CCDC12 gene as shown in (e 1) - (e 5) below. The methylated CCDC12 gene can be all or part of the CpG site methylation in the fragments shown in (e 1) - (e 5) below in the CCDC12 gene.
(e1) A DNA fragment shown in SEQ ID No.1 or a DNA fragment having 80% or more identity thereto;
(e2) A DNA fragment shown in SEQ ID No.2 or a DNA fragment having 80% or more identity thereto;
(e3) A DNA fragment shown in SEQ ID No.3 or a DNA fragment having 80% or more identity thereto;
(e4) A DNA fragment shown in SEQ ID No.4 or a DNA fragment having 80% or more identity thereto;
(e5) The DNA fragment shown in SEQ ID No.5 or a DNA fragment having 80% or more identity thereto.
Further, the "all or part of CpG sites" may be any one or more CpG sites of 5 DNA fragments shown in SEQ ID No.1 to SEQ ID No.5 in the CCDC12 gene. The upper limit of "multiple CpG sites" as used herein is all CpG sites in the 5 DNA fragments shown in SEQ ID No.1 to SEQ ID No.5 in the CCDC12 gene. All CpG sites in the DNA fragment shown in SEQ ID No.1 are shown in Table 1, all CpG sites in the DNA fragment shown in SEQ ID No.2 are shown in Table 2, all CpG sites in the DNA fragment shown in SEQ ID No.3 are shown in Table 3, all CpG sites in the DNA fragment shown in SEQ ID No.4 are shown in Table 4, and all CpG sites in the DNA fragment shown in SEQ ID No.5 are shown in Table 5.
Alternatively, the "all or part of the CpG sites" may be all CpG sites in the DNA fragment shown in SEQ ID No.4 (see Table 4) and all CpG sites in the DNA fragment shown in SEQ ID No.1 (see Table 1) of the CCDC12 gene.
Alternatively, the "all or part of the CpG sites" may be all CpG sites in the DNA fragment shown in SEQ ID No.4 (see Table 4) and all CpG sites in the DNA fragment shown in SEQ ID No.2 (see Table 2) of the CCDC12 gene.
Alternatively, the "all or part of the CpG sites" may be all CpG sites in the DNA fragment shown in SEQ ID No.1 (see Table 1) and all CpG sites in the DNA fragment shown in SEQ ID No.2 (see Table 2) of the CCDC12 gene.
Alternatively, the "all or part of the CpG sites" may be all CpG sites in the DNA fragment shown in SEQ ID No.4 (see Table 4) and all CpG sites in the DNA fragment shown in SEQ ID No.1 (see Table 1) and all CpG sites in the DNA fragment shown in SEQ ID No.2 (see Table 2) of the CCDC12 gene.
Alternatively, the "all or part of the CpG sites" may be all or any 16 or any 15 or any 14 or any 13 or any 12 or any 11 or any 10 or any 9 or any 8 or any 7 or any 6 or any 5 or any 4 or any 3 or any 2 or any 1 of the DNA fragments shown in SEQ ID No.4 in the CCDC12 gene.
Or, the "all or part of the CpG sites" may be all or any 11 items or any 10 items or any 9 items or any 8 items or any 7 items or any 6 items or any 5 items or any 4 items or any 3 items or any 2 items or any 1 item of the CpG sites as shown in following 12 items in the DNA fragment shown in SEQ ID No.4 in the CCDC12 gene:
(f1) The CpG site (CCDC 12_D_5) shown in 104 th-105 th positions of the 5' end of the DNA fragment shown in SEQ ID No. 4;
(f2) The CpG sites (CCDC 12_D_6) shown in the 133 th to 134 th positions of the DNA fragment shown in SEQ ID No.4 from the 5' end;
(f3) The CpG site (CCDC 12_D_7) shown in 202-203 of the DNA fragment shown in SEQ ID No.4 from the 5' end;
(f4) The CpG site (CCDC 12_D_8) shown in 273-274 of the DNA fragment shown in SEQ ID No.4 from the 5' end;
(f5) The CpG site (CCDC 12_D_9) shown in 300-301 th position of the 5' end of the DNA fragment shown in SEQ ID No. 4;
(f6) The DNA fragment shown in SEQ ID No.4 shows the CpG sites (CCDC 12_D_10) from 344 to 345 positions of the 5' end;
(f7) The CpG site (CCDC 12_D_11) shown in 361-362 th position of the 5' end of the DNA fragment shown in SEQ ID No. 4;
(f8) The DNA fragment shown in SEQ ID No.4 shows the CpG sites (CCDC 12_D_12) from 383 to 384 bits of the 5' end;
(f9) The DNA fragment shown in SEQ ID No.4 shows the CpG sites (CCDC 12_D_13) from 426 to 427 positions of the 5' end;
(f10) The CpG site (CCDC 12_D_14) shown in 448-449 of the 5' end of the DNA fragment shown in SEQ ID No. 4;
(f11) The DNA fragment shown in SEQ ID No.4 shows the CpG sites (CCDC 12_D_15) from 457 to 458 positions of the 5' end;
(f12) The DNA fragment shown in SEQ ID No.4 shows the CpG sites from 464 to 465 of the 5' end (CCDC 12_D_16).
In particular embodiments of the invention, some adjacent methylation sites are treated as one methylation site when analyzed for DNA methylation using time-of-flight mass spectrometry, because several CpG sites are located on one methylation fragment, the peak pattern is indistinguishable (indistinguishable sites are set forth in Table 7), and thus the methylation level analysis is performed, and related mathematical models are constructed and used.
In the above aspects, the means for detecting the methylation level of the CCDC12 gene may comprise (or be) a primer combination for amplifying a full or partial fragment of the CCDC12 gene. The reagent for detecting the methylation level of the CCDC12 gene may comprise (or be) a primer combination for amplifying a full or partial fragment of the CCDC12 gene; the instrument for detecting the methylation level of the CCDC12 gene may be a time-of-flight mass spectrometry detector. Of course, other conventional reagents for performing time-of-flight mass spectrometry may also be included in the reagents for detecting the methylation level of the CCDC12 gene.
Further, the partial fragment may be at least one fragment of:
(g1) A DNA fragment shown in SEQ ID No.1 or a DNA fragment comprising the same;
(g2) A DNA fragment shown in SEQ ID No.2 or a DNA fragment comprising the same;
(g3) A DNA fragment shown in SEQ ID No.3 or a DNA fragment comprising the same;
(g4) A DNA fragment shown in SEQ ID No.4 or a DNA fragment comprising the same;
(g5) A DNA fragment shown in SEQ ID No.5 or a DNA fragment comprising the same;
(g6) A DNA fragment having an identity of 80% or more to the DNA fragment shown in SEQ ID No.1 or a DNA fragment comprising the same;
(g7) A DNA fragment having an identity of 80% or more to the DNA fragment shown in SEQ ID No.2 or a DNA fragment comprising the same;
(g8) A DNA fragment having an identity of 80% or more to the DNA fragment shown in SEQ ID No.3 or a DNA fragment comprising the same.
(g9) A DNA fragment having an identity of 80% or more to the DNA fragment shown in SEQ ID No.4 or a DNA fragment comprising the same;
(g10) A DNA fragment having an identity of 80% or more to the DNA fragment shown in SEQ ID No.5 or a DNA fragment comprising the same.
In the present invention, the primer combination may specifically be primer pair a and/or primer pair B and/or primer pair C and/or primer pair D and/or primer pair E;
the primer pair A is a primer pair consisting of a primer A1 and a primer A2; the primer A1 can be specifically a single-stranded DNA shown in SEQ ID No.6 or 11-35 nucleotides of SEQ ID No. 6; the primer A2 can be specifically a single-stranded DNA shown in SEQ ID No.7 or 32-56 nucleotides of SEQ ID No. 7;
The primer pair B is a primer pair consisting of a primer B1 and a primer B2; the primer B1 can be specifically single-stranded DNA shown in SEQ ID No.8 or 11-35 nucleotides of SEQ ID No. 8; the primer B2 can be specifically a single-stranded DNA shown in SEQ ID No.9 or 32-56 nucleotides of SEQ ID No. 9;
the primer pair C is a primer pair consisting of a primer C1 and a primer C2; the primer C1 can be specifically a single-stranded DNA shown in SEQ ID No.10 or 11-35 nucleotides of SEQ ID No. 10; the primer C2 can be specifically a single-stranded DNA shown in SEQ ID No.11 or 32-56 nucleotides of SEQ ID No. 11;
the primer pair D is a primer pair consisting of a primer D1 and a primer D2; the primer D1 can be specifically a single-stranded DNA shown in SEQ ID No.12 or 11-35 nucleotides of SEQ ID No. 12; the primer D2 can be specifically a single-stranded DNA shown in SEQ ID No.13 or 32-56 nucleotides of SEQ ID No. 13;
the primer pair E is a primer pair consisting of a primer E1 and a primer E2; the primer E1 can be specifically a single-stranded DNA shown in SEQ ID No.14 or 11-35 nucleotides of SEQ ID No. 14; the primer E2 can be specifically a single-stranded DNA shown in SEQ ID No.15 or 32-56 nucleotides of SEQ ID No. 15.
In addition, the invention also discloses a method for distinguishing whether the sample to be detected is an A type sample or a B type sample. The method may comprise the steps of:
(A) The mathematical model may be built as a method comprising the steps of:
(A1) Detecting CCDC12 gene methylation levels (training set) of n1 type a samples and n2 type B samples, respectively;
(A2) Taking the CCDC12 gene methylation level data of all samples obtained in the step (A1), establishing a mathematical model according to the classification modes of the A type and the B type by a two-classification logistic regression method, and determining the threshold value of classification judgment.
Wherein n1 and n2 in (A1) are positive integers of 50 or more.
(B) The sample to be tested may be determined as a type a sample or a type B sample according to a method comprising the steps of:
(B1) Detecting the CCDC12 gene methylation level of the test sample;
(B2) Substituting the CCDC12 gene methylation level data of the sample to be detected obtained in the step (B1) into the mathematical model to obtain a detection index; and then comparing the detection index with a threshold value, and determining whether the type of the sample to be detected is A type or B type according to the comparison result.
In a specific embodiment of the present invention, the threshold is set to 0.5. More than 0.5 is classified as one type, less than 0.5 is classified as another type, and 0.5 is equal as an undefined gray zone. Wherein the A type and the B type are two corresponding classifications, the two classifications are grouped, which group is the A type and which group is the B type, and the A type and the B type are determined according to a specific mathematical model without convention.
In practical applications, the threshold may also be determined according to the maximum approximate sign-up index (specifically, may be a value corresponding to the maximum approximate sign-up index). Greater than the threshold is classified as one class, less than the threshold is classified as another class, and equal to the threshold as an indeterminate gray zone. Wherein the A type and the B type are two corresponding classifications, the two classifications are grouped, which group is the A type and which group is the B type, and the A type and the B type are determined according to a specific mathematical model without convention.
The type a sample and the type B sample may be any one of the following:
(C1) Lung cancer samples and no cancer controls;
(C2) Lung cancer samples and lung benign nodule samples;
(C3) A sample of different subtypes of lung cancer;
(C4) Samples of lung cancer at different stages;
(C5) Breast cancer samples and cancer-free female controls;
(C6) Breast cancer samples and breast benign nodule samples;
(C7) A sample of different subtypes of breast cancer;
(C8) Breast cancer samples of different stages;
(C9) Lung cancer samples and breast cancer samples.
Any of the above mathematical models may be changed in practical application according to the detection method and the fitting mode of DNA methylation, and the mathematical model is determined according to a specific mathematical model without any convention.
In the embodiment of the invention, the model is specifically log (y/(1-y))=b0+b1x1+b2x2+b3x3+ … +bnxn, where y is a detection index obtained after substituting a methylation value of one or more methylation sites of a sample to be tested into the model by a dependent variable, b0 is a constant, x1 to xn are independent variables which are methylation values of one or more methylation sites of the sample to be tested (each value is a value between 0 and 1), and b1 to bn are weights given by the model to the methylation values of each site.
In the embodiment of the invention, the model can be established by adding known parameters such as age, sex, white blood cell count and the like as appropriate to improve the discrimination efficiency. One specific model established in embodiments of the present invention is a model for assisting in distinguishing benign nodules of the lung from lung cancer, the model being specifically: log (y/(1-y)) = -0.001+0.014 x ccdc12_d_5+0.007 x ccdc12_d_6+0.049 x ccdc12_d_7+0.023 x ccdc12_d_8-0.011 x ccdc12_d_9+0.385 x ccdc12_d_10-1.78 x ccdc12_d_11+0.186 x ccdc12_d_12-0.003 x ccdc12_d_13-0.401 x ccdc12_d_14+0.707 x ccdc12_d_15+1.099 x ccdc12_d_16+0.001 x age (integer) +0.110 x sex (male assigned as 1 and female assigned as 0) +0.021 x white blood cell number (unit 10 x 9/L). The CCDC12_D_5 is the methylation level of CpG sites shown in positions 104-105 of a DNA fragment shown in SEQ ID No.4 from the 5' end; the CCDC12_D_6 is the methylation level of CpG sites shown in the 133 th-134 th positions of the 5' end of the DNA fragment shown in SEQ ID No. 4; the CCDC12_D_7 is the methylation level of CpG sites shown in the 202 st-203 th position of the 5' end of the DNA fragment shown in SEQ ID No. 4; the CCDC12_D_8 is the methylation level of CpG sites shown in the 273-274 th position of the 5' end of the DNA fragment shown in SEQ ID No. 4; the CCDC12_D_9 is the methylation level of the CpG site shown in 300-301 th position of the 5' end of the DNA fragment shown in SEQ ID No. 4. The CCDC12_D_10 is the methylation level of CpG sites shown in 344-345 th positions of a DNA fragment shown in SEQ ID No.4 from the 5' end; the CCDC12_D_11 is the methylation level of CpG sites shown in the 361 th-362 th positions of the 5' end of the DNA fragment shown in SEQ ID No. 4; the CCDC12_D_12 is the methylation level of the CpG site shown in 383-384 bits of the DNA fragment shown in SEQ ID No.4 from the 5' end; the CCDC12_D_13 is the methylation level of the CpG site shown in 426-427 from the 5' end of the DNA fragment shown in SEQ ID No. 4. The CCDC12_D_14 is the methylation level of CpG sites shown in 448-449 bits of a DNA fragment shown in SEQ ID No.4 from the 5' end; the CCDC12_D_15 is the methylation level of CpG sites shown in 457-458 bits of a DNA fragment shown in SEQ ID No.4 from a 5' end; the CCDC12_D_16 is the methylation level of CpG sites shown in 464-465 of the 5' end of the DNA fragment shown in SEQ ID No. 4. The threshold of the model was 0.5. Patient candidates with a detection index greater than 0.5 calculated by the model are lung cancer patients, and patient candidates with less than 0.5 are lung benign nodule patients.
In the above aspects, the detecting CCDC12 gene methylation level is detecting CCDC12 gene methylation level in blood.
In the above aspects, when the type a sample and the type B sample are different subtype samples of lung cancer in (C3), the type a sample and the type B sample may specifically be any two of a lung adenocarcinoma sample, a lung squamous carcinoma sample, and a small cell lung cancer sample.
In the above aspects, when the type a sample and the type B sample are different stage samples of lung cancer in (C4), the type a sample and the type B sample may specifically be any two of a clinical stage I lung cancer sample, a clinical stage II lung cancer sample, and a clinical stage III lung cancer sample.
In the above aspects, when the type a sample and the type B sample are different subtype samples of breast cancer in (C7), the type a sample and the type B sample may specifically be: any two of breast ductal carcinoma in situ, breast invasive ductal carcinoma, and breast invasive lobular carcinoma.
In the above aspects, when the type a sample and the type B sample are different stage samples of breast cancer in (C8), the type a sample and the type B sample may specifically be: any two of T1 stage breast cancer, T2 stage breast cancer and T3 breast cancer; or breast cancer without lymph node infiltration and breast cancer with lymph node infiltration; or any two of clinical stage I breast cancer, clinical stage II breast cancer and clinical stage III breast cancer.
The CCDC12 gene of any of the above specifically may include Genbank accession No.: NM_001277074.2 (day 13 of 8 months in 2020), NM_144716.6 (day 13 of 8 months in 2020)
The present invention provides hypermethylation of CCDC12 gene in blood of lung cancer patient and breast cancer. Experiments prove that the blood can be used as a sample to distinguish cancer (lung cancer and breast cancer) patients from cancer-free controls, lung benign nodules and lung cancer, different subtypes of lung cancer, different stages of lung cancer, benign nodules and breast cancer of breast, different subtypes of breast cancer and different stages of breast cancer, and can distinguish lung cancer and breast cancer. The invention has important scientific significance and clinical application value for improving the early diagnosis and treatment effect of lung cancer and breast cancer and reducing the death rate.
Drawings
FIG. 1 is a schematic diagram of a mathematical model.
Fig. 2 is an illustration of a mathematical model.
Detailed Description
The experimental methods used in the following examples are conventional methods unless otherwise specified.
Materials, reagents and the like used in the examples described below are commercially available unless otherwise specified.
The quantitative assay of the Coiled-coil domain protein 12 (Coiled-Coil Domain Containing, CCDC12) gene in the examples below was performed in triplicate and the results averaged.
Example 1 primer design for detection of methylation site of CCDC12 Gene
Five fragments in the CCDC12 gene (ccdc12_a fragment, ccdc12_b fragment, ccdc12_c fragment, ccdc12_d fragment and ccdc12_e fragment) were selected for methylation level and cancer correlation analysis through a number of sequence and functional analyses.
The CCDC12_A fragment (SEQ ID No. 1) is located on the hg19 reference genome chr3:46967532-46968000, sense strand;
the CCDC12_B fragment (SEQ ID No. 2) is located on the sense strand of the hg19 reference genome chr3: 46974665-46975200;
the CCDC12_C fragment (SEQ ID No. 3) is located in the hg19 reference genome chr3:46989130-46989700, sense strand;
the CCDC12_D fragment (SEQ ID No. 4) is located in the hg19 reference genome chr3:46989800-46990430, sense strand;
the CCDC12_E fragment (SEQ ID No. 5) is located on the sense strand of the hg19 reference genome chr3: 46990450-46991000.
CpG site information in CCDC12_A fragment is shown in Table 1;
CpG site information in CCDC12_B fragment is shown in Table 2;
CpG site information in CCDC12_C fragment is shown in Table 3;
CpG site information in CCDC12_D fragment is shown in Table 4;
the CpG site information in the CCDC12_E fragment is shown in Table 5.
Table 1, cpG site information in CCDC12_A fragment
Table 2, cpG site information in CCDC12_B fragment
CpG sites | Position of CpG sites in the sequence |
CCDC12_B_1 | SEQ ID No.2 from positions 26-27 of the 5' end |
CCDC12_B_2 | 170 th to 171 th positions from 5' end of SEQ ID No.2 |
CCDC12_B_3 | SEQ ID No.2 from position 244-245 of the 5' end |
CCDC12_B_4 | SEQ ID No.2 from positions 301 to 302 of the 5' end |
CCDC12_B_5 | SEQ ID No.2 shows the 371-372 th position from the 5' end |
CCDC12_B_6 | SEQ ID No.2 shows positions 386-387 from the 5' end |
CCDC12_B_7 | SEQ ID No.2 from positions 412-413 of the 5' end |
CCDC12_B_8 | SEQ ID No.2 from positions 504-505 of the 5' end |
Table 3, cpG site information in CCDC12_C fragment
CpG sites | Position of CpG sites in the sequence |
CCDC12_C_1 | SEQ ID No.3 from position 29 to 30 of the 5' end |
CCDC12_C_2 | 79 th to 80 th positions of SEQ ID No.3 from 5' end |
CCDC12_C_3 | SEQ ID No.3 from position 160-161 of the 5' end |
CCDC12_C_4 | SEQ ID No.3 from 382-383 bits at the 5' end |
CCDC12_C_5 | SEQ ID No.3 from position 400-401 of the 5' end |
CCDC12_C_6 | SEQ ID No.3 from position 526-527 of the 5' end |
Table 4, cpG site information in CCDC12_D fragment
CpG sites | Position of CpG sites in the sequence |
CCDC12_D_1 | SEQ ID No.4 from position 29 to 30 of the 5' end |
CCDC12_D_2 | SEQ ID No.4 from position 36-37 of the 5' end |
CCDC12_D_3 | SEQ ID No.4 from positions 55-56 of the 5' end |
CCDC12_D_4 | 86 th to 87 th positions of SEQ ID No.4 from 5' end |
CCDC12_D_5 | SEQ ID No.4 from positions 104 to 105 of the 5' end |
CCDC12_D_6 | SEQ ID No.4 from positions 133 to 134 of the 5' end |
CCDC12_D_7 | SEQ ID No.4 from positions 202-203 of the 5' end |
CCDC12_D_8 | SEQ ID No.4 from position 273-274 of the 5' end |
CCDC12_D_9 | SEQ ID No.4 from position 300-301 at the 5' end |
CCDC12_D_10 | SEQ ID No.4 shows positions 344-345 from the 5' end |
CCDC12_D_11 | 361 st to 362 th position from 5' end of SEQ ID No.4 |
CCDC12_D_12 | 383-384 bits of SEQ ID No.4 from 5' end |
CCDC12_D_13 | SEQ ID No.4 from positions 426-427 of the 5' end |
CCDC12_D_14 | SEQ ID No.4 from position 448-449 of the 5' end |
CCDC12_D_15 | SEQ ID No.4 shows the positions 457-458 from the 5' end |
CCDC12_D_16 | SEQ ID No.4 from position 464-465 of the 5' end |
CCDC12_D_17 | SEQ ID No.4 from the 5' end at positions 595-596 |
Table 5, cpG site information in CCDC12_E fragment
CpG sites | Position of CpG sites in the sequence |
CCDC12_E_1 | SEQ ID No.5 from positions 26-27 of the 5' end |
CCDC12_E_2 | SEQ ID No.5 from position 73-74 of the 5' end |
CCDC12_E_3 | SEQ ID No.5 from positions 117-118 of the 5' end |
CCDC12_E_4 | 319-320 th bit of SEQ ID No.5 from 5' end |
CCDC12_E_5 | SEQ ID No.5 from 5' end at positions 496-497 |
Specific PCR primers were designed for five fragments (ccdc12_a fragment, ccdc12_b fragment, ccdc12_c fragment, ccdc12_d fragment, and ccdc12_e fragment) as shown in table 6. Wherein SEQ ID No.6, SEQ ID No.8, SEQ ID No.10, SEQ ID No.12 and SEQ ID No.14 are forward primers, SEQ ID No.7, SEQ ID No.9, SEQ ID No.11, SEQ ID No.13 and SEQ ID No.15 are reverse primers; positions 1 to 10 from the 5' end in SEQ ID No.6, SEQ ID No.8, SEQ ID No.10, SEQ ID No.12 and SEQ ID No.14 are nonspecific tags, and positions 11 to 35 are specific primer sequences; SEQ ID No.7, SEQ ID No.9, SEQ ID No.11, SEQ ID No.13 and SEQ ID No.15 show non-specific tags at positions 1 to 31 and specific primer sequences at positions 32 to 56 from 5'. The primer sequences do not contain SNPs and CpG sites.
TABLE 6 CCDC12 methylation primer sequences
Example 2, CCDC12 Gene methylation detection and analysis of results
1. Study sample
After informed consent of the patients, in total, ex-vivo blood samples of 426 lung cancer patients, 286 lung benign nodule patients, 292 breast cancer patients, 82 breast benign nodule patients and 816 cancer-free controls (in which the sexes of men and women are half, and 408 are all) were collected.
All patient samples were collected preoperatively and were subjected to imaging and pathological confirmation.
Lung cancer and breast cancer subtypes are judged according to histopathology.
Cancer-free controls, i.e., patients who had not previously and now had no cancer and had not reported benign nodules of the lung or breast and blood normative indicators were within the reference range
The stage of lung cancer and breast cancer takes an AJCC 8 th edition stage system as a judgment standard.
426 lung cancer patients were classified according to type: 319 cases of lung adenocarcinoma, 47 cases of lung squamous carcinoma, 52 cases of small cell lung carcinoma and 8 other cases.
426 lung cancer patients were divided according to stage: 338 cases in stage I, 49 cases in stage II, 39 cases in stage III.
426 lung cancer patients were classified according to lung cancer tumor size (T): 306 cases in T1, 72 cases in T2 and 48 cases in T3.
426 cases of lung cancer patients were classified according to the presence or absence of lung cancer lymph node infiltration (N): there were 394 cases of lung cancer lymph node infiltration, and 32 cases of lung cancer lymph node infiltration.
292 breast cancer patients were classified according to typing: 93 cases of breast ductal carcinoma in situ, 183 cases of breast invasive ductal carcinoma, and 16 cases of breast invasive lobular carcinoma.
292 breast cancer patients were divided by stage: 225 cases in phase I, 49 cases in phase II, and 18 cases in phase III.
292 breast cancer patients were classified according to lung cancer tumor size (T): 238 cases in T1, 41 cases in T2 and 13 cases in T3.
292 breast cancer patients were classified according to the presence or absence of breast cancer lymph node infiltration (N): there were no breast cancer lymph node infiltrates 266 cases and there were breast cancer lymph node infiltrates 26 cases.
The median ages of the cancer-free population, lung cancer, benign lung nodules, breast cancer and benign breast nodules patients were 54, 55, 58, 56 and 56 years old, respectively, and the ratio of men and women in each of these 5 populations was about 1:1.
The median age of the cancer-free female control was 55 years.
2. Methylation detection
1. Total DNA of the blood sample is extracted.
2. The total DNA of the blood samples prepared in step 1 was subjected to bisulfite treatment (see DNA methylation kit instructions for Qiagen). After bisulfite treatment, unmethylated cytosine (C) is converted to uracil (U), while methylated cytosine remains unchanged, i.e., the C base of the original CpG site is converted to C or U after bisulfite treatment.
3. And (3) carrying out PCR amplification by using the DNA treated by the bisulfite in the step (2) as a template and adopting 5 pairs of specific primer pairs in the table (6) through DNA polymerase according to a reaction system required by a conventional PCR reaction, wherein all primers adopt a conventional standard PCR reaction system and are amplified according to the following procedure.
The PCR reaction procedure was: 95 ℃,4 min- & gt (95 ℃,20 s- & gt 56 ℃,30 s- & gt 72 ℃ 2 min) 45 cycles- & gt 72 ℃,5 min- & gt 4 ℃ for 1h.
4. Taking the amplified product of the step 3, and carrying out DNA methylation analysis by a time-of-flight mass spectrum, wherein the specific method is as follows:
(1) Mu.l of Shrimp Alkaline Phosphate (SAP) solution (0.3 ml SAP [ 0.5U) was added to 5. Mu.l of PCR product]+1.7ml H 2 O) then incubated in a PCR apparatus (37 ℃,20 min. Fwdarw. 85 ℃,5 min. Fwdarw. 4 ℃,5 min) according to the following procedure;
(2) Taking out 2 mu.l of the SAP treated product obtained in the step (1), adding the product into a 5 mu l T-clear reaction system according to the instruction, and then incubating for 3 hours at 37 ℃;
(3) Taking the product of the step (2), adding 19 mu l of deionized water, and then carrying out deionized incubation on a rotary shaking table for 1h by using 6 mu g of Resin;
(4) Centrifuging at 2000rpm at room temperature for 5min, and loading 384SpectroCHIP with the micro supernatant by a Nanodispenser mechanical arm;
(5) Time-of-flight mass spectrometry; the data obtained were collected with the spectroacquisition v3.3.1.3 software and visualized by MassArray EpiTyper v 1.2.1.2 software.
Reagents used for the time-of-flight mass spectrometry were all from the kit (T-Cleavage MassCLEAVE Reagent Auto Kit, cat# 10129A); the detection instrument used for the time-of-flight mass spectrometry detection isAnalyzer Chip Prep Module 384, model: 41243; the data analysis software is self-contained software of the detection instrument.
5. And (5) analyzing the data obtained in the step (4).
Statistical analysis of the data was performed by SPSS Statistics 23.0.
Non-parametric tests were used for comparative analysis between the two groups.
The identification effect of a combination of multiple CpG sites on different sample groupings is achieved by logistic regression and statistical methods of the subject curves.
All statistical tests were double-sided, with p-values <0.05 considered statistically significant.
By mass spectrometry experiments, a total of 43 distinguishable peak patterns of methylated fragments were obtained. The methylation level at each CpG site of each sample can be automatically obtained by calculating the peak area according to the "methylation level=peak area of methylated fragments/(peak area of unmethylated fragments+peak area of methylated fragments)" formula using SpectroAcquin v3.3.1.3 software.
3. Analysis of results
1. Cancer-free control, lung benign nodules and CCDC12 Gene methylation level in the blood of Lung cancer
Methylation levels of all CpG sites in the CCDC12 gene were analyzed using blood of 426 lung cancer patients, 286 lung benign nodule patients, and 816 cancer-free controls as study materials (Table 7). The results show that all CpG sites in CCDC12 gene have a median methylation level of 0.39 (iqr=0.18-0.56) in the cancer-free control group, 0.45 (iqr=0.27-0.70) in benign nodules in the lung, and 0.44 (iqr=0.26-0.69) in lung cancer patients.
2. Blood CCDC12 gene methylation level can distinguish cancer-free control and lung cancer patients
As a result of comparative analysis of methylation levels of CCDC12 genes in 426 lung cancer patients and 816 cancer-free controls, it was found that methylation levels of all CpG sites in CCDC12 genes in lung cancer patients were significantly higher than those in cancer-free controls (p <0.05, table 8). Furthermore, methylation levels of all CpG sites of CCDC12 gene in different subtypes of lung cancer (lung adenocarcinoma, lung squamous carcinoma and small cell lung carcinoma) were significantly different from that of the non-cancerous control (p < 0.05), respectively. Methylation levels of all CpG sites of the CCDC12 gene in different stages (clinical stage I and stage II-III) of lung cancer were significantly different from that of the cancer-free control (p < 0.05), respectively. Furthermore, there was a significant difference in methylation levels between non-lymphoblastic lung cancer patients and lymphoblastic lung cancer patients, respectively, and non-cancerous controls (p < 0.05). Therefore, the methylation level of the CCDC12 gene can be used for clinical diagnosis of lung cancer, and especially can be used for early diagnosis of lung cancer.
3. The methylation level of CCDC12 gene in blood can distinguish benign nodule of lung and lung cancer patient
As a result of comparative analysis of methylation levels of the CCDC12 gene in 426 lung cancer patients and 286 lung benign nodules, it was found that methylation levels of all CpG sites of the CCDC12 gene were significantly higher in lung benign nodules patients than in lung cancer patients (p <0.05, table 9). Furthermore, it was found that methylation levels of all CpG in CCDC12 genes of lung cancer patients of different subtypes (lung adenocarcinoma, lung squamous carcinoma and small cell lung carcinoma), different clinical stages (stage I and stage II-III) and the presence or absence of lymphocytic infiltration were significantly different from benign nodules of the lung (p < 0.05), respectively. Thus, the methylation level of the CCDC12 gene can be used to distinguish lung cancer patients from lung benign nodule patients, and is a very valuable marker.
4. The methylation level of CCDC12 gene in blood can be used for distinguishing different subtypes of lung cancer or different stages of lung cancer
By comparing and analyzing the methylation level of the CCDC12 gene in different subtype lung cancer patients and different stage lung cancer patients, the methylation level of all CpG sites in the CCDC12 gene is found to have significant differences under the conditions of different lung cancer subtypes (lung adenocarcinoma, lung squamous carcinoma and small cell lung cancer), different tumor sizes of lung cancer (T1, T2 and T3), different stages of lung cancer (clinical stage I, stage II and stage III) and the presence or absence of lymph node infiltration of the lung cancer (p <0.05, table 10). Thus, the methylation level of the CCDC12 gene can be used to distinguish between different subtypes of lung cancer or different stages of lung cancer.
5. Cancer-free female control, benign breast nodules and CCDC12 Gene methylation level in blood of breast cancer
The level of CpG site methylation in CCDC12 gene between breast cancer patients, breast benign nodule patients and cancer-free female controls was analyzed using blood from 292 breast cancer patients, 82 breast benign nodule patients and 408 cancer-free female controls as study material (table 11). The results showed that the cancer-free female control group had a median methylation level of 0.39 (iqr=0.17-0.55), the benign breast nodules had a median methylation level of 0.42 (iqr=0.24-0.64), and the breast cancer patients had a median methylation level of 0.41 (iqr=0.22-0.64).
6. Blood CCDC12 gene methylation level distinguishes between cancer-free female control and breast cancer patients
As a result of comparative analysis of methylation levels of the CCDC12 gene in 292 breast cancer patients and 408 cancer-free female controls, it was found that methylation levels of all CpG sites in the CCDC12 gene were significantly higher in breast cancer patients than in cancer-free female controls (p <0.05, table 12). Furthermore, methylation levels of all CpG sites of CCDC12 gene in different subtypes of breast cancer (ductal carcinoma in situ, ductal invasive carcinoma of the breast and lobular invasive carcinoma of the breast) were significantly different from that of non-cancerous female controls (p < 0.05), respectively. Methylation levels of all CpG sites of the CCDC12 gene in different stages (clinical stage I and stage II-III) of breast cancer were significantly different from that of a cancer-free female control (p < 0.05), respectively. Furthermore, there was a significant difference (p < 0.05) in methylation levels between non-lymphoblastic breast cancer patients and lymphoblastic breast cancer patients, respectively, and non-cancerous female controls. Therefore, the methylation level of CCDC12 gene can be used for clinical diagnosis of breast cancer, and especially for early diagnosis of breast cancer.
7. The methylation level of CCDC12 gene in blood can distinguish benign nodule of breast from breast cancer patient
By comparative analysis of the methylation level of CCDC12 gene in 292 breast cancer patients and 82 breast benign nodules, it was found that the methylation level of CCDC12 gene at all CpG sites in breast benign nodules patients was significantly higher than in breast cancer patients (p <0.05, table 13). Furthermore, it was found that methylation levels of all CpG sites in CCDC12 genes of breast cancer patients of different subtypes (ductal carcinoma in situ, ductal carcinoma invasive and lobular carcinoma invasive), different clinical stages (stages I and II-III) and the presence or absence of lymphoid infiltration were significantly different from benign nodules of the breast (p < 0.05), respectively. Thus, the methylation level of the CCDC12 gene can be used as a very valuable marker for distinguishing breast cancer patients from breast benign nodule patients.
8. The methylation level of CCDC12 gene in blood can distinguish different subtypes of breast cancer or different stages of breast cancer
By comparing and analyzing the methylation level of the CCDC12 gene in breast cancer patients with different subtypes and breast cancer patients with different stages, the methylation level of all CpG sites in the CCDC12 gene is found to have significant differences under the conditions of different subtypes of breast cancer (breast ductal carcinoma in situ, breast invasive ductal carcinoma and breast invasive lobular carcinoma), breast cancer tumor sizes (T1, T2 and T3), different stages of breast cancer (clinical stage I, stage II and stage III) and the presence or absence of lymph node infiltration (p <0.05, table 14). Thus, the methylation level of the CCDC12 gene can be used to distinguish between different subtypes of breast cancer or different stages of breast cancer.
9. The methylation level of CCDC12 in blood can distinguish breast cancer patients from lung cancer patients
Blood from 292 breast cancer patients and 426 lung cancer patients was used as a study material to analyze the difference in methylation level in CCDC12 gene in blood from breast cancer patients and lung cancer patients (table 15). The results show that the methylation level of all target CpG sites in breast cancer patients is median 0.41 (iqr=0.22-0.64), the methylation level of lung cancer patients is median 0.44 (iqr=0.26-0.69), and the methylation level of all CpG sites in breast cancer patients is significantly lower than that in lung cancer patients (p < 0.05). Thus, the methylation level of the CCDC12 gene can be used to distinguish breast and lung cancer patients.
10. Modeling of mathematical models for aiding in cancer diagnosis
The mathematical model established by the invention can be used for achieving the following purposes:
(1) Distinguishing lung cancer patients from non-cancerous controls;
(2) Distinguishing lung cancer patients from lung benign nodule patients;
(3) Distinguishing breast cancer patients from cancer-free female controls;
(4) Distinguishing breast cancer patients from breast benign nodule patients;
(5) Distinguishing breast cancer patients from lung cancer patients
(6) Distinguishing lung cancer subtypes;
(7) Differentiating lung cancer stage;
(8) Distinguishing breast cancer subtypes;
(9) Differentiation of breast cancer stage.
The mathematical model is established as follows:
(A) Data sources: methylation levels of target CpG sites (combinations of one or more of tables 1-5) of the isolated blood samples of 426 lung cancer patients, 286 lung-developed benign nodule patients, 292 breast cancer patients, 82 breast benign nodule patients, and 816 cancer-free controls (including 408 cancer-free female controls) listed in step one (test method is the same as step two).
The data can be added with known parameters such as age, sex, white blood cell count and the like according to actual needs to improve the discrimination efficiency.
(B) Model building
Any two different types of patient data, namely training sets (for example, cancer-free control and lung cancer patients, cancer-free female control and breast cancer patients, lung benign nodule patients and lung cancer patients, breast benign nodule and breast cancer patients, lung cancer patients and breast cancer patients, lung adenocarcinoma and lung squamous carcinoma patients, lung adenocarcinoma and small cell lung cancer patients, lung squamous carcinoma and small cell lung cancer patients, lung cancer patients I and lung cancer patients II, lung cancer patients I and lung cancer patients III, lung cancer patients II and lung cancer patients III, breast catheter carcinoma in situ and breast invasive catheter cancer patients, breast catheter carcinoma in situ and breast invasive lobular carcinoma patients, breast invasive catheter carcinoma in I and breast cancer patients II, breast cancer patients I and breast cancer patients III, breast cancer patients II and breast cancer patients III) are selected as required to serve as data for establishing a model, and statistical software such as SAS, R, SPSS and the like is used for establishing a mathematical model by using a statistical method of differential logic regression through formulas. The numerical value corresponding to the maximum approximate dengue index calculated by the mathematical model formula is a threshold value or is directly set to be 0.5 as the threshold value, the detection index obtained by the sample to be tested after the sample is tested and substituted into the model calculation is more than the threshold value and is classified into one type (B type), less than the threshold value and is classified into the other type (A type), and the detection index is equal to the threshold value and is used as an uncertain gray area. When a new sample to be detected is predicted to judge which type belongs to, firstly, detecting methylation levels of one or more CpG sites on a CCDC12 gene of the sample to be detected by a DNA methylation determination method, then substituting data of the methylation levels into the mathematical model (if known parameters such as age, sex, white cell count and the like are included in the model construction, the step simultaneously substitutes specific numerical values of corresponding parameters of the sample to be detected into a model formula), calculating to obtain a detection index corresponding to the sample to be detected, and then comparing the detection index corresponding to the sample to be detected with a threshold value, and determining which type of sample the sample to be detected belongs to according to a comparison result.
Examples: as shown in fig. 1, the methylation level of single CpG sites or the methylation level of a plurality of CpG sites combined in the training set CCDC12 gene is used to establish a mathematical model for distinguishing between class a and class B by using a formula of two classification logistic regression through statistical software such as SAS, R, SPSS. The mathematical model is herein a two-class logistic regression model, specifically: log (y/1-y) =b0+b1x1+b2x2+b3x3+ … +bnxn, where y is a detection index obtained by substituting a dependent variable, i.e., a methylation value of one or more methylation sites of a sample to be tested, into a model, b0 is a constant, x 1-xn are independent variables, i.e., methylation values (each value is a value between 0 and 1) of one or more methylation sites of the sample to be tested, and b 1-bn are weights given to each methylation site by the model. In specific application, a mathematical model is established according to methylation degrees (x 1-xn) of one or more DNA methylation sites of a sample detected in a training set and known classification conditions (class A or class B, respectively assigning 0 and 1 to y), so that a constant B0 of the mathematical model and weights B1-bn of each methylation site are determined, and a threshold value divided by a detection index (0.5 in the example) corresponding to the maximum sign index is calculated by the mathematical model. And the detection index, namely y value, obtained by testing the sample to be tested and substituting the sample into the model for calculation is classified into B class, less than 0.5 is classified into A class, and the y value is equal to 0.5 as an uncertain gray area. Wherein the class a and the class B are corresponding two classifications (two classification groups, which group a and which group B are to be determined according to a specific mathematical model, no convention is made herein), such as cancer-free control and lung cancer patients, cancer-free female control and breast cancer patients, lung benign nodule and lung cancer patients, breast benign nodule and breast cancer patients, lung cancer and breast cancer patients, lung adenocarcinoma and lung squamous cell lung cancer patients, lung squamous cell and small cell lung cancer patients, lung cancer and lung cancer patients in stage I, lung cancer and stage II, lung cancer and lung cancer in stage III, lung cancer in stage II, lung cancer in stage III, duct in situ and duct in breast invasive lobular cancer patients, duct in breast invasive duct cancer and duct in breast invasive lobular cancer patients, stage I and stage II breast cancer patients, stage I and III breast cancer patients, stage II and stage III breast cancer patients. When predicting a sample of a subject to determine which category the sample belongs to, blood of the subject is collected first, and then DNA is extracted therefrom. After the extracted DNA is converted by bisulfite, the methylation level of single CpG sites or the methylation level of a plurality of CpG sites of the CCDC12 gene of a subject is detected by using a DNA methylation determination method, and methylation data obtained by detection are substituted into the mathematical model. If the methylation level of one or more CpG sites of the CCDC12 gene of the subject is substituted into the mathematical model and then the calculated detection index is greater than a threshold value, the subject judges that the subject belongs to a class (B class) with the detection index in the training set greater than 0.5; if the methylation level data of one or more CpG sites of the CCDC12 gene of the subject is substituted into the mathematical model and then the calculated value, namely the detection index, is smaller than the threshold value, the subject belongs to a class (A class) with the detection index in the training set smaller than 0.5; if the methylation level data of one or more CpG sites of the CCDC12 gene of the subject is substituted into the mathematical model, and the calculated value, i.e. the detection index, is equal to the threshold value, the subject cannot be judged to be A class or B class.
Examples: the schematic diagram of fig. 2 illustrates the methylation level of preferred CpG sites of ccdc12_d (ccdc12_d_5, ccdc12_d_6, ccdc12_d_7, ccdc12_d_8, ccdc12_d_9, ccdc12_d_10, ccdc12_d_11, ccdc12_d_12, ccdc12_d_13, ccdc12_d_14, ccdc12_d_15 and ccdc12_d_16) and the application of mathematical modeling for lung benign and malignant nodule discrimination: the methylation level data of the combination of 12 distinguishable preferred CpG sites already detected in the training set of lung cancer patients and lung benign nodule patients (426 lung cancer patients and 286 lung benign nodule patients herein) and the age, sex (male assigned 1 and female assigned 0) and white blood cell count of the patients were used to build a mathematical model for distinguishing lung cancer patients from lung benign nodule patients by using a formula of a two-class logistic regression by R software. The mathematical model is here a two-class logistic regression model, whereby the constant b0 of the mathematical model and the weights b1 to bn of the individual methylation sites are determined, in this example in particular: log (y/(1-y)) = -0.001+0.014 x ccdc12_d_5+0.007 x ccdc12_d_6+0.049 x ccdc12_d_7+0.023 x ccdc12_d_8-0.011 x ccdc12_d_9+0.385 x ccdc12_d_10-1.78 x ccdc12_d_11+0.186 x ccdc12_d_12-0.003 x ccdc12_d_13-0.401 x ccdc12_d_14+0.707 x ccdc12_d_15+1.099 x ccdc12_d_16+0.001 x age (integer) +0.110 x sex (male assigned as 1 and female assigned as 0) +0.021 x white blood cell number (unit 10 x 9/L). Where y is the detection index obtained by substituting the methylation values of the 12 distinguishable methylation sites of the sample to be tested into the model according to the dependent variables such as age, sex and white blood cell count. Under the condition that 0.5 is set as a threshold value, the methylation level of 12 distinguishable CpG sites, namely CCDC12_D_5, CCDC12_D_6, CCDC12_D_7, CCDC12_D_8, CCDC12_D_9, CCDC12_D_10, CCDC12_D_11, CCDC12_D_12, CCDC12_D_13, CCDC12_D_14, CCDC12_D_15 and CCDC12_D_16, of the sample to be tested is calculated through substitution of the sample to the model together with information of age, sex and white blood cell count, and the obtained detection index, namely y value, is more than 0.5 and is classified as lung cancer patients, less than 0.5 is classified as lung benign nodule patients, and the methylation level of the 12 distinguishable CpG sites, namely lung cancer patients or lung benign nodule patients is not determined as lung cancer patients if the y value is equal to 0.5. The area under the curve (AUC) calculation for this model was 0.80 (table 19). Specific subject judgment methods are shown in FIG. 2, for example, blood is collected from two subjects (A, B), DNA is extracted from the blood, the extracted DNA is converted by bisulfite, and methylation levels of 12 distinguishable CpG sites, CCDC12_D_5, CCDC12_D_6, CCDC12_D_7, CCDC12_D_8, CCDC12_D_9, CCDC12_D_10, CCDC12_D_11, CCDC12_D_12, CCDC12_D_13, CCDC12_D_14, CCDC12_D_15 and CCDC12_D_16, of the subjects are detected by a DNA methylation assay method. The methylation level data obtained from the detection together with the information on age, sex and white blood cell count of the subject are then substituted into the mathematical model described above. The value calculated by the first test subject after the mathematical model is 0.77 to be more than 0.5, and the first test subject is judged to be a lung cancer patient (which accords with the clinical judgment result); and substituting methylation level data of one or more CpG sites of the CCDC12 gene of the subject B into the mathematical model to calculate a value of 0.25 to be less than 0.5, and judging the benign nodule patient of the lung (conforming to the clinical judgment result) by the subject B.
(C) Model Effect evaluation
According to the above method, mathematical models for distinguishing between a cancer-free control and a lung cancer patient, a cancer-free female control and a breast cancer patient, a lung benign nodule patient and a lung cancer patient, a breast benign nodule and breast cancer patient, a lung cancer patient and a breast cancer patient, a lung adenocarcinoma and lung squamous carcinoma patient, a lung adenocarcinoma and small cell lung cancer patient, a lung squamous carcinoma and small cell lung cancer patient, a lung cancer patient of stage I and stage II, a lung cancer patient of stage I and stage III, a lung cancer patient of stage II and stage III, a breast ductal carcinoma in situ and a breast invasive ductal carcinoma patient, a breast ductal carcinoma in situ and a breast invasive lobular carcinoma patient, a breast cancer patient of stage I and stage II, a breast cancer patient of stage I and stage III, a breast cancer patient of stage II and a breast cancer patient of stage III are respectively established, and the effectiveness thereof is evaluated by a subject curve (ROC curve). The larger the area under the curve (AUC) from the ROC curve, the better the differentiation of the model, the more efficient the molecular marker. The evaluation results after construction of mathematical models using different CpG sites are shown in tables 16, 17 and 18. In tables 16, 17 and 18, 1 CpG site represents the site of any one CpG site in the amplified CCDC12_D fragment, 2 CpG sites represent the combination of any 2 CpG sites in CCDC12_D, 3 CpG sites represent the combination … … of any 3 CpG sites in CCDC12_D, and so on. The values in the table are the range of values for the combined evaluation of the different sites (i.e., the results for any combination of CpG sites are within this range).
The above results show that the discrimination ability of CCDC12 gene for each group (no cancer control and lung cancer patient, no cancer female control and breast cancer patient, lung benign nodule and lung cancer patient, breast benign nodule and breast cancer patient, lung cancer and breast cancer patient, lung adenocarcinoma and lung squamous carcinoma patient, lung adenocarcinoma and small cell lung cancer patient, lung squamous carcinoma and small cell lung cancer patient, lung cancer and lung cancer patient in stage I, lung cancer and lung cancer patient in stage III, lung cancer and lung cancer patient in stage II, ductal carcinoma in situ and ductal invasive, ductal carcinoma in situ and lobular invasive, ductal invasive and lobular invasive breast cancer patient in stage I and breast cancer patient in stage II, breast cancer in stage I and breast cancer patient in stage III, breast cancer in stage II and breast cancer patient in stage III) increases with increasing number of loci.
In addition, among the CpG sites shown in tables 1 to 5, there are cases where combinations of a few preferred sites are better in discrimination than combinations of a plurality of non-preferred sites. The combination of 12 distinguishable optimal sites, e.g., ccdc12_d_5, ccdc12_d_6, ccdc12_d_7, ccdc12_d_8, ccdc12_d_9, ccdc12_d_10, ccdc12_d_11, ccdc12_d_12, ccdc12_d_13, ccdc12_d_14, ccdc12_d_15, and ccdc12_d_16 shown in table 19, table 20, table 21, is a preferred site combination of any 12 distinguishable sites in ccdc12_d.
In summary, the CpG sites on the CCDC12 gene and various combinations thereof, the CpG sites on the CCDC 12A fragment and various combinations thereof, the CpG sites on the CCDC 12B fragment and various combinations thereof, the CpG sites on the CCDC 12C fragment and various combinations thereof, the CpG sites on the CCDC 12D fragment and various combinations thereof, the CCDC12 D_5, the CCDC12 D_6, the CCDC12 D_7, the CCDC12 D_8, the CCDC12 D_9, the CCDC12 D_10, the CCDC12 D_11, the CCDC12 D_12, the CCDC12 D_13, the CCDC12 D_14, the CCDC12 D_15 and the CCDC12 D_16 sites and various combinations thereof, the CpG sites on the CCDC 12E fragment and various combinations thereof, methylation levels at CpG sites and various combinations thereof on CCDC12_ A, CCDC12_ B, CCDC _ C, CCDC12_d and CCDC12_e are capable of discriminating between cancer-free control and lung cancer patients, cancer-free female control and breast cancer patients, lung benign nodule and lung cancer patients, breast benign nodule and breast cancer patients, lung cancer and breast cancer patients, lung adenocarcinoma and lung squamous carcinoma patients, lung adenocarcinoma and small cell lung cancer patients, lung squamous carcinoma and small cell lung cancer patients, lung cancer and lung cancer patients in stage I and stage II, lung cancer and lung cancer patients in stage I, lung cancer and stage III, lung cancer patients in stage II, ductal carcinoma in situ and ductal breast invasive, ductal carcinoma in situ and invasive lobular breast cancer patients, ductal carcinoma in stage I and stage II breast cancer patients, ductal carcinoma in stage I and III breast cancer patients, ductal carcinoma in stage II and ductal carcinoma in stage III.
TABLE 7 methylation levels comparing benign nodules in lung and lung cancer for cancer-free controls
Table 8 methylation level differences for comparison of cancer-free controls and lung cancer
/>
Table 9, comparison of methylation level differences between benign nodules in the lung and lung cancer
TABLE 10 comparison of methylation level differences for different subtypes of lung cancer or different stages of lung cancer
Table 11, comparison methylation levels for non-cancerous female controls, benign nodules of breast and breast cancer
/>
Table 12, methylation level differences for comparison of cancer-free female controls and breast cancer
TABLE 13 comparison of methylation level differences between benign nodules of breast and breast cancer
TABLE 14 comparison of methylation level differences for different subtypes of breast cancer or different stages of breast cancer
TABLE 15 comparison of methylation level differences between lung and breast cancers
/>
CpG sites of Table 16, CCDC12_D and combinations thereof for distinguishing lung cancer and non-cancerous controls, lung cancer and benign nodules, breast cancer and non-cancerous female controls, breast cancer and benign nodules of breast, lung cancer and breast cancer
Table 17, cpG sites of CCDC12_D and free combinations thereof for differentiating between different subtypes of lung cancer patients and different stages of lung cancer
Table 18, cpG sites of CCDC12_D and free combinations thereof for distinguishing between different subtypes and different stages of breast cancer
Table 19, optimal CpG sites for CCDC12_D and combinations thereof for distinguishing lung cancer and non-cancerous controls, lung cancer and benign nodules of the lung, breast cancer and benign nodules of the breast, breast cancer and non-cancerous female controls, breast cancer and benign nodules of the breast, lung cancer and breast cancer
/>
Table 20, optimal CpG sites of CCDC12_D and combinations thereof for differentiating between different subtypes and different stages of lung cancer patients
Table 21, optimal CpG sites of CCDC12_D and combinations thereof for distinguishing between different subtypes and different stages of breast cancer patients
The present invention is described in detail above. It will be apparent to those skilled in the art that the present invention can be practiced in a wide range of equivalent parameters, concentrations, and conditions without departing from the spirit and scope of the invention and without undue experimentation. While the invention has been described with respect to specific embodiments, it will be appreciated that the invention may be further modified. In general, this application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains. The application of some of the basic features may be done in accordance with the scope of the claims that follow.
<110> Nanjing Techno Biotechnology Co., ltd
<120> a methylated molecular marker useful for aiding in the diagnosis of cancer
<130> GNCLN220239
<160> 15
<170> PatentIn version 3.5
<210> 1
<211> 469
<212> DNA
<213> Artificial sequence
<400> 1
ctgccctggg gctcatgcca tggaatacgc ctcaactact tagagaaaag catctgggtg 60
tgctgggcca ggggcatgtg gcagtgagga gtgacatgtg gacagagaag caggtgctgg 120
agggctccag aggggcgtgg ctgcctacag ctgtgggaat gtggagggga gacactccag 180
gaggaggtgg cgcctgggcc tgcaatgacg cggacagggt caaggcacaa agcacacgag 240
aggctgactc agctccaaga ctgttgagag tggccagtga gggctgtgaa aataggcatg 300
accatgtgca aggctgggaa ggccacagtc caccaggaca gcagtgagaa ctgaacagcc 360
ccaaaggcct ccatgaagag agaggagcca caaggggcca gaatctcccg gcaggagctg 420
cggctagcta ctatgttttt atgttcttcc tgcctcagtc ttttcaggc 469
<210> 2
<211> 536
<212> DNA
<213> Artificial sequence
<400> 2
ttatcctgga aactccctcc ctcaacggga gcccagcctc tcagctctcc ccactgccag 60
gagaagcctt cctcagccct aaaaattaca attttaggtc acaggtctta gaggcagaag 120
gagcttcaga agtagtacag tcaagccctt gccttaaaaa caaggaggcc ggagaagcca 180
gagctggctg gggatgagat ctgtatggga gcccccacca agacagatca acagctatgg 240
ccacgtgttg gcaccaaggc ctacccaaag gcccctgagg gatggcttat aagcagccaa 300
cggctgttcc aggcaatgac cagggctgag tgttagttaa atgaggcccc tgccctgctg 360
gagctgccat cgcatctgca tcatgcggtc tgcacccact ctcatcccag acgctgcaga 420
acctaaccaa ggtgctctac actgacaagt ttgaactttg cacccatccc acctgaaagc 480
ccatagctag tgggtactta aggcgttaga aaaaactggt ctgccaaaag caaaac 536
<210> 3
<211> 571
<212> DNA
<213> Artificial sequence
<400> 3
ctcaatgtgc tgctaggatg acttggtccg agtatccact ctatctgctg accagaccaa 60
gagttcctag agggcagacg tttgttgtac cagtttgcac ccacaggcct gacaccaact 120
acagcaggca cagcactgtg gtagaggagt gcacaggagc ggggaaatcc cagctccatt 180
tcatcttggc tgtgtgacct tgggcaagtt atatgacctg tctggcctca gtttcctcat 240
ctacaaaaat gaggataatc atactgacat catatcacca ttacaaggaa atgatgaagg 300
agaaaagcct gcagcttagg gagcctgaag aacagggaga atcaggccag ggcaggaagg 360
aaagggctgc agaaaagcca gcggggcaca ggcccagaac ggtgtgcctc tgcagaggtc 420
atgggaagct cccatgccca tcatccttat cagatgttct cagcaatcct gtaagcagac 480
agggcagtgt tgtaatacct gcttcaccaa gacattagga gatgacgtca gatgatctgt 540
gcagtgtaga cagatctgat tccccattca g 571
<210> 4
<211> 631
<212> DNA
<213> Artificial sequence
<400> 4
aaggcttctg agcagagcaa gaacaaagcg ctgagcggga ggctgggaga acagcgaggc 60
tagagataag gactggagat aaggacggga actaggtcaa cagcggtgga ctagaacaag 120
gaggtgcctc aacgctgtgg ggagacaggg aggagcagaa acaactgagg ttctgctggt 180
gactcactgg gggaaggaag gcgcctttga caaaaaggtg gtggcactta ggggacaaag 240
agaaagggtg tggcagcctt caggcattct aacggtgagg cttgacaccc aaggcctgac 300
gaaggttggt gccaagggat agcacctgtc aaggtgccct cagcggtgga aaggagggcc 360
cgctccactc agaagagagg cacgctgggc cccaccatcc atatggcacc tcaagggcca 420
ggccacggca tgcctagtgc agctcctcgc caggcccgtc acacggccct gcccagatat 480
cctccatgtg tctgtcagga gtgtggggag ccagcccaag tcaactctga tcctattctt 540
gctgcaagtg ctcaagggag gcactacaga gctcctccag cagactttat cagtcgacaa 600
atactgactg gctcccctgt ggtactccat g 631
<210> 5
<211> 551
<212> DNA
<213> Artificial sequence
<400> 5
gctgggtact gaggaggaag aggaacgagt aggcacctag tgcccacaga gcagggcagg 60
ggttcttatc ctcggtgcac agattccaga ggtcaatgac cagacagaaa cacaggcgag 120
tgtgagaagg tgaatcttag tggatgagaa atctggagct cttcagattc tcaagggggt 180
ggtgacccac aggggaaaag ggaaggtgtc aacagccagg gaaggcagtg tttttttggg 240
tggcattaag aggtgtctac agaggctttc caaaggaagt gtcaccttaa aggatgagtg 300
acagtgggga cagagggacg gagggggaag agagcagagg gaacaaccag aaccattagt 360
atgcaaaagg cccctcccac tgccctgaga cttcacccac actggcccca gcaggaaata 420
acctcttgaa aggtcacaca gagggcaggg caaagaccag tctcttcccc ttccttcata 480
tgcagccaga ctggacgagt tcctctggga agtctggcct ggctccaaac tccctccagg 540
gtcctccctg t 551
<210> 6
<211> 35
<212> DNA
<213> Artificial sequence
<400> 6
aggaagagag ttgttttggg gtttatgtta tggaa 35
<210> 7
<211> 56
<212> DNA
<213> Artificial sequence
<400> 7
cagtaatacg actcactata gggagaaggc tacctaaaaa aactaaaaca aaaaaa 56
<210> 8
<211> 35
<212> DNA
<213> Artificial sequence
<400> 8
aggaagagag ttattttgga aatttttttt tttaa 35
<210> 9
<211> 56
<212> DNA
<213> Artificial sequence
<400> 9
cagtaatacg actcactata gggagaaggc tattttactt ttaacaaacc aatttt 56
<210> 10
<211> 35
<212> DNA
<213> Artificial sequence
<400> 10
aggaagagag tttaatgtgt tgttaggatg atttg 35
<210> 11
<211> 56
<212> DNA
<213> Artificial sequence
<400> 11
cagtaatacg actcactata gggagaaggc tctaaataaa aaatcaaatc tatcta 56
<210> 12
<211> 35
<212> DNA
<213> Artificial sequence
<400> 12
aggaagagag aaggtttttg agtagagtaa gaata 35
<210> 13
<211> 56
<212> DNA
<213> Artificial sequence
<400> 13
cagtaatacg actcactata gggagaaggc tcataaaata ccacaaaaaa accaat 56
<210> 14
<211> 35
<212> DNA
<213> Artificial sequence
<400> 14
aggaagagag gttgggtatt gaggaggaag aggaa 35
<210> 15
<211> 56
<212> DNA
<213> Artificial sequence
<400> 15
cagtaatacg actcactata gggagaaggc tacaaaaaaa accctaaaaa aaattt 56
Claims (10)
1. Application of methylation CCDC12 gene as a marker in preparation of products; the application of the product is at least one of the following:
(1) Aiding in diagnosing cancer or predicting the risk of developing cancer;
(2) Aiding in distinguishing benign nodules from cancers;
(3) Aiding in distinguishing between different subtypes of cancer;
(4) Aiding in differentiating different stages of cancer;
(5) Auxiliary diagnosis of lung cancer or prediction of lung cancer risk;
(6) Assisting in distinguishing benign nodules of the lung from lung cancer;
(7) Assisting in distinguishing different subtypes of lung cancer;
(8) Auxiliary differentiation of different stages of lung cancer;
(9) Auxiliary diagnosis of breast cancer or prediction of breast cancer risk;
(10) Aiding in distinguishing benign nodules of breast from breast cancer;
(11) Assisting in distinguishing different subtypes of breast cancer;
(12) Assisting in distinguishing different stages of breast cancer;
(13) Auxiliary differentiation between lung and breast cancer;
(14) Determining whether the test agent has an inhibitory or promoting effect on the occurrence of cancer.
2. Use of a substance for detecting the methylation level of a CCDC12 gene in the preparation of a product; the application of the product is at least one of the following:
(1) Aiding in diagnosing cancer or predicting the risk of developing cancer;
(2) Aiding in distinguishing benign nodules from cancers;
(3) Aiding in distinguishing between different subtypes of cancer;
(4) Aiding in differentiating different stages of cancer;
(5) Auxiliary diagnosis of lung cancer or prediction of lung cancer risk;
(6) Assisting in distinguishing benign nodules of the lung from lung cancer;
(7) Assisting in distinguishing different subtypes of lung cancer;
(8) Auxiliary differentiation of different stages of lung cancer;
(9) Auxiliary diagnosis of breast cancer or prediction of breast cancer risk;
(10) Aiding in distinguishing benign nodules of breast from breast cancer;
(11) Assisting in distinguishing different subtypes of breast cancer;
(12) Assisting in distinguishing different stages of breast cancer;
(13) Auxiliary differentiation between lung and breast cancer;
(14) Determining whether the test agent has an inhibitory or promoting effect on the occurrence of cancer.
3. Use of a substance for detecting the methylation level of the CCDC12 gene and a medium storing mathematical modeling methods and/or usage methods for the preparation of a product; the application of the product is at least one of the following:
(1) Aiding in diagnosing cancer or predicting the risk of developing cancer;
(2) Aiding in distinguishing benign nodules from cancers;
(3) Aiding in distinguishing between different subtypes of cancer;
(4) Aiding in differentiating different stages of cancer;
(5) Auxiliary diagnosis of lung cancer or prediction of lung cancer risk;
(6) Assisting in distinguishing benign nodules of the lung from lung cancer;
(7) Assisting in distinguishing different subtypes of lung cancer;
(8) Auxiliary differentiation of different stages of lung cancer;
(9) Auxiliary diagnosis of breast cancer or prediction of breast cancer risk;
(10) Aiding in distinguishing benign nodules of breast from breast cancer;
(11) Assisting in distinguishing different subtypes of breast cancer;
(12) Assisting in distinguishing different stages of breast cancer;
(13) Auxiliary differentiation between lung and breast cancer;
(14) Determining whether the test agent has an inhibitory or promoting effect on the occurrence of cancer.
The mathematical model is obtained according to a method comprising the following steps:
(A1) Detecting CCDC12 gene methylation levels of n1 type a samples and n2 type B samples, respectively;
(A2) Taking CCDC12 gene methylation level data of all samples obtained in the step (A1), establishing a mathematical model by a two-classification logistic regression method according to classification modes of A type and B type, and determining a classification judgment threshold;
the using method of the mathematical model comprises the following steps:
(B1) Detecting the methylation level of the CCDC12 gene of a sample to be detected;
(B2) Substituting the CCDC12 gene methylation level data of the sample to be detected obtained in the step (B1) into the mathematical model to obtain a detection index; then comparing the detection index with a threshold value, and determining whether the type of the sample to be detected is A type or B type according to a comparison result;
The type a sample and the type B sample are any one of the following:
(C1) Lung cancer samples and no cancer controls;
(C2) Lung cancer samples and lung benign nodule samples;
(C3) A sample of different subtypes of lung cancer;
(C4) Samples of lung cancer at different stages;
(C5) Breast cancer samples and cancer-free female controls;
(C6) Breast cancer samples and breast benign nodule samples;
(C7) A sample of different subtypes of breast cancer;
(C8) Breast cancer samples of different stages;
(C9) Lung cancer samples and breast cancer samples.
4. Use of a medium storing a mathematical model building method and/or a use method for the preparation of a product; the application of the product is at least one of the following:
(1) Aiding in diagnosing cancer or predicting the risk of developing cancer;
(2) Aiding in distinguishing benign nodules from cancers;
(3) Aiding in distinguishing between different subtypes of cancer;
(4) Aiding in differentiating different stages of cancer;
(5) Auxiliary diagnosis of lung cancer or prediction of lung cancer risk;
(6) Assisting in distinguishing benign nodules of the lung from lung cancer;
(7) Assisting in distinguishing different subtypes of lung cancer;
(8) Auxiliary differentiation of different stages of lung cancer;
(9) Auxiliary diagnosis of breast cancer or prediction of breast cancer risk;
(10) Aiding in distinguishing benign nodules of breast from breast cancer;
(11) Assisting in distinguishing different subtypes of breast cancer;
(12) Assisting in distinguishing different stages of breast cancer;
(13) Auxiliary differentiation between lung and breast cancer;
(14) Determining whether the test agent has an inhibitory or promoting effect on the occurrence of cancer.
The mathematical model is obtained according to a method comprising the following steps:
(A1) Detecting CCDC12 gene methylation levels of n1 type a samples and n2 type B samples, respectively;
(A2) Taking CCDC12 gene methylation level data of all samples obtained in the step (A1), establishing a mathematical model by a two-classification logistic regression method according to classification modes of A type and B type, and determining a classification judgment threshold;
the using method of the mathematical model comprises the following steps:
(B1) Detecting the methylation level of the CCDC12 gene of a sample to be detected;
(B2) Substituting the CCDC12 gene methylation level data of the sample to be detected obtained in the step (B1) into the mathematical model to obtain a detection index; then comparing the detection index with a threshold value, and determining whether the type of the sample to be detected is A type or B type according to a comparison result;
the type a sample and the type B sample are any one of the following:
(C1) Lung cancer samples and no cancer controls;
(C2) Lung cancer samples and lung benign nodule samples;
(C3) A sample of different subtypes of lung cancer;
(C4) Samples of lung cancer at different stages;
(C5) Breast cancer samples and cancer-free female controls;
(C6) Breast cancer samples and breast benign nodule samples;
(C7) A sample of different subtypes of breast cancer;
(C8) Breast cancer samples of different stages;
(C9) Lung cancer samples and breast cancer samples.
5. A kit comprising a substance for detecting the methylation level of the CCDC12 gene; the application of the kit is at least one of the following:
(1) Aiding in diagnosing cancer or predicting the risk of developing cancer;
(2) Aiding in distinguishing benign nodules from cancers;
(3) Aiding in distinguishing between different subtypes of cancer;
(4) Aiding in differentiating different stages of cancer;
(5) Auxiliary diagnosis of lung cancer or prediction of lung cancer risk;
(6) Assisting in distinguishing benign nodules of the lung from lung cancer;
(7) Assisting in distinguishing different subtypes of lung cancer;
(8) Auxiliary differentiation of different stages of lung cancer;
(9) Auxiliary diagnosis of breast cancer or prediction of breast cancer risk;
(10) Aiding in distinguishing benign nodules of breast from breast cancer;
(11) Assisting in distinguishing different subtypes of breast cancer;
(12) Assisting in distinguishing different stages of breast cancer;
(13) Auxiliary differentiation between lung and breast cancer;
(14) Determining whether the test agent has an inhibitory or promoting effect on the occurrence of cancer.
6. The kit of claim 5, wherein: the kit further comprises a medium storing a mathematical model establishing method and/or a using method as set forth in claim 3 or 4.
7. A system, comprising:
(D1) Reagents and/or instrumentation for detecting the methylation level of the CCDC12 gene;
(D2) A device comprising a unit X and a unit Y;
the unit X is used for establishing a mathematical model and comprises a data acquisition module, a data analysis processing module and a model output module;
the data acquisition module is configured to acquire (D1) CCDC12 gene methylation level data of n 1A type samples and n 2B type samples detected;
the data analysis processing module is configured to receive the CCDC12 gene methylation level data of the n 1A type samples and the n 2B type samples from the data acquisition module, establish a mathematical model according to a classification mode of the A type and the B type through a two-classification logistic regression method, and determine a threshold value of classification judgment;
the model output module is configured to receive the mathematical model established by the data analysis processing module and output the mathematical model;
the unit Y is used for determining the type of the sample to be detected and comprises a data input module, a data operation module, a data comparison module and a conclusion output module;
the data input module is configured to input (D1) the detected CCDC12 gene methylation level data of the to-be-detected person;
The data operation module is configured to receive the CCDC12 gene methylation level data of the person to be tested from the data input module, and substitutes the CCDC12 gene methylation level data of the person to be tested into the mathematical model to calculate a detection index;
the data comparison module is configured to receive the detection index calculated by the data operation module and compare the detection index with the threshold value determined by the data analysis processing module in the unit X;
the conclusion output module is configured to receive the comparison result from the data comparison module and output a conclusion of whether the type of the sample to be tested is A type or B type according to the comparison result of the data comparison module;
the type a sample and the type B sample are any one of the following:
(C1) Lung cancer samples and no cancer controls;
(C2) Lung cancer samples and lung benign nodule samples;
(C3) A sample of different subtypes of lung cancer;
(C4) Samples of lung cancer at different stages;
(C5) Breast cancer samples and cancer-free female controls;
(C6) Breast cancer samples and breast benign nodule samples;
(C7) A sample of different subtypes of breast cancer;
(C8) Breast cancer samples of different stages;
(C9) Lung cancer samples and breast cancer samples.
8. The use or kit or system according to any one of claims 1-7, wherein: the methylation level of the CCDC12 gene is the methylation level of all or part of CpG sites in fragments shown in the following (e 1) - (e 5) in the CCDC12 gene;
the methylated CCDC12 gene is formed by methylation of all or part of CpG sites in fragments shown in the following (e 1) - (e 5) in the CCDC12 gene;
(e1) A DNA fragment shown in SEQ ID No.1 or a DNA fragment having 80% or more identity thereto;
(e2) A DNA fragment shown in SEQ ID No.2 or a DNA fragment having 80% or more identity thereto;
(e3) A DNA fragment shown in SEQ ID No.3 or a DNA fragment having 80% or more identity thereto;
(e4) A DNA fragment shown in SEQ ID No.4 or a DNA fragment having 80% or more identity thereto;
(e5) The DNA fragment shown in SEQ ID No.5 or a DNA fragment having 80% or more identity thereto.
9. The use or kit or system according to claim 8, wherein: the 'all or part of CpG sites' are any one or more CpG sites in 5 DNA fragments shown in SEQ ID No.1 to SEQ ID No.5 in the CCDC12 gene;
or (b)
The 'all or part of CpG sites' are all CpG sites in a DNA fragment shown in SEQ ID No.4 and all CpG sites in a DNA fragment shown in SEQ ID No.1 in the CCDC12 gene;
Or (b)
The 'all or part of CpG sites' are all CpG sites in a DNA fragment shown in SEQ ID No.4 and all CpG sites in a DNA fragment shown in SEQ ID No.2 in the CCDC12 gene;
or (b)
The 'all or part of CpG sites' are all CpG sites in a DNA fragment shown in SEQ ID No.1 and all CpG sites in a DNA fragment shown in SEQ ID No.2 in the CCDC12 gene;
or (b)
The 'all or part of CpG sites' are all CpG sites in a DNA fragment shown as SEQ ID No.4, all CpG sites in a DNA fragment shown as SEQ ID No.1 and all CpG sites in a DNA fragment shown as SEQ ID No.2 in the CCDC12 gene;
or (b)
The 'all or part of CpG sites' are all or any 16 or any 15 or any 14 or any 13 or any 12 or any 11 or any 10 or any 9 or any 8 or any 7 or any 6 or any 5 or any 4 or any 3 or any 2 or any 1 CpG sites in the DNA fragment shown in SEQ ID No.4 in the CCDC12 gene;
or (b)
The whole or part of CpG sites are all or any 11 or any 10 or any 9 or any 8 or any 7 or any 6 or any 5 or any 4 or any 3 or any 2 or any 1 of the following 12 CpG sites in the DNA fragment shown in SEQ ID No.4 in the CCDC12 gene:
(f1) The DNA fragment shown in SEQ ID No.4 shows CpG sites from 104 th to 105 th positions of the 5' end;
(f2) The DNA fragment shown in SEQ ID No.4 shows CpG sites from 133 th to 134 th positions of the 5' end;
(f3) The DNA fragment shown in SEQ ID No.4 shows CpG sites from 202-203 of the 5' end;
(f4) The CpG sites shown in 273-274 of the DNA fragment shown in SEQ ID No.4 from the 5' end;
(f5) The DNA fragment shown in SEQ ID No.4 shows CpG sites from 300 th to 301 th positions of the 5' end;
(f6) The DNA fragment shown in SEQ ID No.4 shows CpG sites from 344-345 positions of the 5' end;
(f7) The DNA fragment shown in SEQ ID No.4 shows CpG sites from 361 st to 362 th site of the 5' end;
(f8) The DNA fragment shown in SEQ ID No.4 shows CpG sites from 383 to 384 positions of the 5' end;
(f9) The DNA fragment shown in SEQ ID No.4 shows CpG sites from 426 th to 427 th positions of the 5' end;
(f10) The DNA fragment shown in SEQ ID No.4 shows CpG sites from 448 to 449 of the 5' end;
(f11) The DNA fragment shown in SEQ ID No.4 shows CpG sites from 457 to 458 positions of the 5' end;
(f12) The DNA fragment shown in SEQ ID No.4 shows CpG sites from 464-465 of the 5' end.
10. The use or kit or system according to any one of claims 1-9, wherein: the substance for detecting the methylation level of the CCDC12 gene comprises a primer combination for amplifying a full or partial fragment of the CCDC12 gene;
The reagent for detecting the methylation level of the CCDC12 gene comprises a primer combination for amplifying the full length or partial fragment of the CCDC12 gene;
further, the partial fragment is at least one fragment of:
(g1) A DNA fragment shown in SEQ ID No.1 or a DNA fragment comprising the same;
(g2) A DNA fragment shown in SEQ ID No.2 or a DNA fragment comprising the same;
(g3) A DNA fragment shown in SEQ ID No.3 or a DNA fragment comprising the same;
(g4) A DNA fragment shown in SEQ ID No.4 or a DNA fragment comprising the same;
(g5) A DNA fragment shown in SEQ ID No.5 or a DNA fragment comprising the same;
(g6) A DNA fragment having an identity of 80% or more to the DNA fragment shown in SEQ ID No.1 or a DNA fragment comprising the same;
(g7) A DNA fragment having an identity of 80% or more to the DNA fragment shown in SEQ ID No.2 or a DNA fragment comprising the same;
(g8) A DNA fragment having an identity of 80% or more to the DNA fragment shown in SEQ ID No.3 or a DNA fragment comprising the same.
(g9) A DNA fragment having an identity of 80% or more to the DNA fragment shown in SEQ ID No.4 or a DNA fragment comprising the same;
(g10) A DNA fragment having an identity of 80% or more to the DNA fragment shown in SEQ ID No.5 or a DNA fragment comprising the same;
still further, the primer combination is primer pair a and/or primer pair B and/or primer pair C or primer pair D and/or primer pair E;
The primer pair A is a primer pair consisting of a primer A1 and a primer A2; the primer A1 is SEQ ID No.6 or single-stranded DNA shown in 11 th-35 th nucleotides of SEQ ID No. 6; the primer A2 is SEQ ID No.7 or single-stranded DNA shown in 32 th-56 th nucleotides of SEQ ID No. 7;
the primer pair B is a primer pair consisting of a primer B1 and a primer B2; the primer B1 is single-stranded DNA shown in SEQ ID No.8 or 11 th-35 th nucleotide of SEQ ID No. 8; the primer B2 is SEQ ID No.9 or single-stranded DNA shown in 32 th-56 th nucleotides of SEQ ID No. 9;
the primer pair C is a primer pair consisting of a primer C1 and a primer C2; the primer C1 is SEQ ID No.10 or single-stranded DNA shown in 11 th-35 th nucleotides of SEQ ID No. 10; the primer C2 is SEQ ID No.11 or single-stranded DNA shown in 32-56 nucleotides of SEQ ID No. 11.
The primer pair D is a primer pair consisting of a primer D1 and a primer D2; the primer D1 is SEQ ID No.12 or single-stranded DNA shown in 11 th-35 th nucleotides of SEQ ID No. 12; the primer D2 is SEQ ID No.13 or single-stranded DNA shown in 32 th-56 th nucleotides of SEQ ID No. 13;
the primer pair E is a primer pair consisting of a primer E1 and a primer E2; the primer E1 is SEQ ID No.14 or single-stranded DNA shown in 11 th-35 th nucleotides of SEQ ID No. 14; the primer E2 is SEQ ID No.15 or single-stranded DNA shown in 32-56 nucleotides of SEQ ID No. 15.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210372719.2A CN117568473A (en) | 2022-04-11 | 2022-04-11 | Methylation molecular marker for auxiliary diagnosis of cancer |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210372719.2A CN117568473A (en) | 2022-04-11 | 2022-04-11 | Methylation molecular marker for auxiliary diagnosis of cancer |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117568473A true CN117568473A (en) | 2024-02-20 |
Family
ID=89890512
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210372719.2A Pending CN117568473A (en) | 2022-04-11 | 2022-04-11 | Methylation molecular marker for auxiliary diagnosis of cancer |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117568473A (en) |
-
2022
- 2022-04-11 CN CN202210372719.2A patent/CN117568473A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN114507731B (en) | Methylation marker and kit for assisting cancer diagnosis | |
CN111363811A (en) | Lung cancer diagnostic agent and kit based on FOXD3 gene | |
CN113215252B (en) | Methylation markers for aiding in the diagnosis of cancer | |
CN113136428B (en) | Application of methylation marker in auxiliary diagnosis of cancer | |
CN113355412B (en) | Methylation markers and kits for aiding in the diagnosis of cancer | |
CN114480630A (en) | Methylation marker for auxiliary diagnosis of cancer | |
CN113215251B (en) | Methylation marker for assisting diagnosis of cancer | |
CN117568473A (en) | Methylation molecular marker for auxiliary diagnosis of cancer | |
JP2018139537A (en) | Method of data acquisition of possibility of lymph node metastasis of esophageal cancer | |
CN113122630B (en) | Calbindin methylation markers for use in aiding diagnosis of cancer | |
CN113355413B (en) | Application of molecular marker and kit in auxiliary diagnosis of cancer | |
CN113186279B (en) | Hyaluronidase methylation marker and kit for auxiliary diagnosis of cancer | |
CN113215250B (en) | Use of methylation level of genes in aiding diagnosis of cancer | |
CN118028461A (en) | Application of protein gene in auxiliary diagnosis of cancer | |
CN117568471A (en) | Protein gene methylation as a molecular marker for aiding in the diagnosis of cancer | |
CN117604094A (en) | Methylation marker and application of kit in auxiliary diagnosis of cancer | |
CN115701454A (en) | Molecular marker and kit for auxiliary diagnosis of cancer | |
Noor et al. | Evaluation of roles of MicroRNA-21 and MicroRNA-18a in esophageal squamous cell carcinoma and comparison of their changes in expression post-chemoradiotherapy | |
CN117568472A (en) | Application of methylation marker in auxiliary diagnosis of cancer | |
CN117568470A (en) | Molecular marker and kit for auxiliary diagnosis of cancer | |
CN116790752A (en) | Molecular marker for early screening and early diagnosing lung cancer | |
CN115612732A (en) | Marker for auxiliary diagnosis of cancer and kit thereof | |
CN115612735A (en) | Potential molecular marker for auxiliary diagnosis of cancer | |
CN115701453A (en) | Molecular marker and kit for auxiliary diagnosis of cancer | |
CN115612731A (en) | Molecular marker for auxiliary diagnosis of cancer |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |