CN114214409A - Biomarker for esophageal cancer typing and application thereof - Google Patents
Biomarker for esophageal cancer typing and application thereof Download PDFInfo
- Publication number
- CN114214409A CN114214409A CN202111590459.8A CN202111590459A CN114214409A CN 114214409 A CN114214409 A CN 114214409A CN 202111590459 A CN202111590459 A CN 202111590459A CN 114214409 A CN114214409 A CN 114214409A
- Authority
- CN
- China
- Prior art keywords
- biomarker
- mctf
- tfs
- esophageal
- tfsc
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 208000000461 Esophageal Neoplasms Diseases 0.000 title claims abstract description 41
- 206010030155 Oesophageal carcinoma Diseases 0.000 title claims abstract description 40
- 201000004101 esophageal cancer Diseases 0.000 title claims abstract description 40
- 239000000090 biomarker Substances 0.000 title claims abstract description 39
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 29
- 101000984620 Homo sapiens Low-density lipoprotein receptor-related protein 1B Proteins 0.000 claims abstract description 16
- 102100027121 Low-density lipoprotein receptor-related protein 1B Human genes 0.000 claims abstract description 16
- 108010036639 WW Domain-Containing Oxidoreductase Proteins 0.000 claims abstract description 15
- 102000012163 WW Domain-Containing Oxidoreductase Human genes 0.000 claims abstract description 14
- 208000036764 Adenocarcinoma of the esophagus Diseases 0.000 claims abstract description 13
- 102100028122 Forkhead box protein P1 Human genes 0.000 claims abstract description 13
- 101001059893 Homo sapiens Forkhead box protein P1 Proteins 0.000 claims abstract description 13
- 206010030137 Oesophageal adenocarcinoma Diseases 0.000 claims abstract description 13
- 208000028653 esophageal adenocarcinoma Diseases 0.000 claims abstract description 13
- 238000000034 method Methods 0.000 claims abstract description 12
- 102100023274 Dual specificity mitogen-activated protein kinase kinase 4 Human genes 0.000 claims abstract description 11
- 102100031493 Growth arrest-specific protein 7 Human genes 0.000 claims abstract description 11
- 101001115395 Homo sapiens Dual specificity mitogen-activated protein kinase kinase 4 Proteins 0.000 claims abstract description 11
- 101000923044 Homo sapiens Growth arrest-specific protein 7 Proteins 0.000 claims abstract description 11
- 238000001514 detection method Methods 0.000 claims abstract description 10
- -1 RUNX1T Proteins 0.000 claims description 35
- 206010041823 squamous cell carcinoma Diseases 0.000 claims description 20
- 238000012163 sequencing technique Methods 0.000 claims description 15
- 102000052567 Anaphase-Promoting Complex-Cyclosome Apc1 Subunit Human genes 0.000 claims description 12
- 101000890325 Homo sapiens Anaphase-promoting complex subunit 1 Proteins 0.000 claims description 12
- 238000004458 analytical method Methods 0.000 claims description 11
- 239000012472 biological sample Substances 0.000 claims description 11
- 102100027909 Folliculin Human genes 0.000 claims description 10
- 101001060703 Homo sapiens Folliculin Proteins 0.000 claims description 10
- DIDGPCDGNMIUNX-UUOKFMHZSA-N 2-amino-9-[(2r,3r,4s,5r)-5-(dihydroxyphosphinothioyloxymethyl)-3,4-dihydroxyoxolan-2-yl]-3h-purin-6-one Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(O)=S)[C@@H](O)[C@H]1O DIDGPCDGNMIUNX-UUOKFMHZSA-N 0.000 claims description 9
- 101001018109 Homo sapiens Nucleotidyltransferase MB21D2 Proteins 0.000 claims description 9
- 101000650697 Homo sapiens Roundabout homolog 2 Proteins 0.000 claims description 9
- 101000687905 Homo sapiens Transcription factor SOX-2 Proteins 0.000 claims description 9
- 108010050345 Microphthalmia-Associated Transcription Factor Proteins 0.000 claims description 9
- 102100030157 Microphthalmia-associated transcription factor Human genes 0.000 claims description 9
- 102100033052 Nucleotidyltransferase MB21D2 Human genes 0.000 claims description 9
- 102100027739 Roundabout homolog 2 Human genes 0.000 claims description 9
- 102100024270 Transcription factor SOX-2 Human genes 0.000 claims description 9
- 102100025725 Mothers against decapentaplegic homolog 4 Human genes 0.000 claims description 7
- 101710143112 Mothers against decapentaplegic homolog 4 Proteins 0.000 claims description 7
- 102100029981 Receptor tyrosine-protein kinase erbB-4 Human genes 0.000 claims description 7
- 101710100963 Receptor tyrosine-protein kinase erbB-4 Proteins 0.000 claims description 7
- 102100027881 Tumor protein 63 Human genes 0.000 claims description 7
- 101710140697 Tumor protein 63 Proteins 0.000 claims description 7
- 102100034540 Adenomatous polyposis coli protein Human genes 0.000 claims description 6
- 101100455868 Arabidopsis thaliana MKK2 gene Proteins 0.000 claims description 6
- 108010009392 Cyclin-Dependent Kinase Inhibitor p16 Proteins 0.000 claims description 6
- 102000001301 EGF receptor Human genes 0.000 claims description 6
- 102100026353 F-box-like/WD repeat-containing protein TBL1XR1 Human genes 0.000 claims description 6
- 102100033452 GMP synthase [glutamine-hydrolyzing] Human genes 0.000 claims description 6
- 101710071060 GMPS Proteins 0.000 claims description 6
- 206010064571 Gene mutation Diseases 0.000 claims description 6
- 101000924577 Homo sapiens Adenomatous polyposis coli protein Proteins 0.000 claims description 6
- 101000851181 Homo sapiens Epidermal growth factor receptor Proteins 0.000 claims description 6
- 101000835675 Homo sapiens F-box-like/WD repeat-containing protein TBL1XR1 Proteins 0.000 claims description 6
- 101000651236 Homo sapiens NCK-interacting protein with SH3 domain Proteins 0.000 claims description 6
- 101000962041 Homo sapiens Neurobeachin Proteins 0.000 claims description 6
- 101000741790 Homo sapiens Peroxisome proliferator-activated receptor gamma Proteins 0.000 claims description 6
- 101000605639 Homo sapiens Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit alpha isoform Proteins 0.000 claims description 6
- 101000738771 Homo sapiens Receptor-type tyrosine-protein phosphatase C Proteins 0.000 claims description 6
- 101000606537 Homo sapiens Receptor-type tyrosine-protein phosphatase delta Proteins 0.000 claims description 6
- 101000835093 Homo sapiens Transferrin receptor protein 1 Proteins 0.000 claims description 6
- 102100027673 NCK-interacting protein with SH3 domain Human genes 0.000 claims description 6
- 102100039234 Neurobeachin Human genes 0.000 claims description 6
- 102100038825 Peroxisome proliferator-activated receptor gamma Human genes 0.000 claims description 6
- 102100038332 Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit alpha isoform Human genes 0.000 claims description 6
- 102100033479 RAF proto-oncogene serine/threonine-protein kinase Human genes 0.000 claims description 6
- 102100037422 Receptor-type tyrosine-protein phosphatase C Human genes 0.000 claims description 6
- 102100039666 Receptor-type tyrosine-protein phosphatase delta Human genes 0.000 claims description 6
- 102100026144 Transferrin receptor protein 1 Human genes 0.000 claims description 6
- 102100033254 Tumor suppressor ARF Human genes 0.000 claims description 6
- 238000007637 random forest analysis Methods 0.000 claims description 6
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 claims description 5
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 claims description 5
- 230000007614 genetic variation Effects 0.000 claims description 5
- QDUDQAJMMWFNEO-UHFFFAOYSA-N 2-(sulfoamino)propane-1,2,3-tricarboxylic acid Chemical compound OC(=O)CC(CC(O)=O)(NS(O)(=O)=O)C(O)=O QDUDQAJMMWFNEO-UHFFFAOYSA-N 0.000 claims description 4
- 102100039577 ETS translocation variant 5 Human genes 0.000 claims description 4
- 102100032610 Guanine nucleotide-binding protein G(s) subunit alpha isoforms XLas Human genes 0.000 claims description 4
- 101000813745 Homo sapiens ETS translocation variant 5 Proteins 0.000 claims description 4
- 101000967216 Homo sapiens Eosinophil cationic protein Proteins 0.000 claims description 4
- 101001014590 Homo sapiens Guanine nucleotide-binding protein G(s) subunit alpha isoforms XLas Proteins 0.000 claims description 4
- 101001014594 Homo sapiens Guanine nucleotide-binding protein G(s) subunit alpha isoforms short Proteins 0.000 claims description 4
- 101001014610 Homo sapiens Neuroendocrine secretory protein 55 Proteins 0.000 claims description 4
- 101000797903 Homo sapiens Protein ALEX Proteins 0.000 claims description 4
- 101000573199 Homo sapiens Protein PML Proteins 0.000 claims description 4
- 101000712530 Homo sapiens RAF proto-oncogene serine/threonine-protein kinase Proteins 0.000 claims description 4
- 101000694802 Homo sapiens Receptor-type tyrosine-protein phosphatase T Proteins 0.000 claims description 4
- 101000685990 Homo sapiens Specifically androgen-regulated gene protein Proteins 0.000 claims description 4
- 101000979190 Homo sapiens Transcription factor MafB Proteins 0.000 claims description 4
- 108010051742 Platelet-Derived Growth Factor beta Receptor Proteins 0.000 claims description 4
- 102100026547 Platelet-derived growth factor receptor beta Human genes 0.000 claims description 4
- 102100026375 Protein PML Human genes 0.000 claims description 4
- 102100028645 Receptor-type tyrosine-protein phosphatase T Human genes 0.000 claims description 4
- 102000001332 SRC Human genes 0.000 claims description 4
- 108060006706 SRC Proteins 0.000 claims description 4
- 102100023355 Specifically androgen-regulated gene protein Human genes 0.000 claims description 4
- 102100023234 Transcription factor MafB Human genes 0.000 claims description 4
- 102100037333 Tyrosine-protein kinase Fes/Fps Human genes 0.000 claims description 4
- 238000011156 evaluation Methods 0.000 claims description 4
- 238000013210 evaluation model Methods 0.000 claims description 4
- RQKYYWBMXHYBAP-UHFFFAOYSA-N 4-(chloromethyl)-2,2-dimethyl-1,3,2-dioxasilolane Chemical compound C[Si]1(C)OCC(CCl)O1 RQKYYWBMXHYBAP-UHFFFAOYSA-N 0.000 claims description 3
- SVYBEBLNQGDRHF-UHFFFAOYSA-N 4-amino-N-(5-ethyl-1,3,4-thiadiazol-2-yl)benzenesulfonamide Chemical compound S1C(CC)=NN=C1NS(=O)(=O)C1=CC=C(N)C=C1 SVYBEBLNQGDRHF-UHFFFAOYSA-N 0.000 claims description 3
- UUUUFICXTVCJQV-UHFFFAOYSA-N 6-[4-[(2-chloro-4-fluorophenoxy)methyl]piperidine-1-carbonyl]-4H-1,4-benzoxazin-3-one Chemical compound ClC1=C(OCC2CCN(CC2)C(=O)C=2C=CC3=C(NC(CO3)=O)C=2)C=CC(=C1)F UUUUFICXTVCJQV-UHFFFAOYSA-N 0.000 claims description 3
- 102100034580 AT-rich interactive domain-containing protein 1A Human genes 0.000 claims description 3
- 101100002343 Arabidopsis thaliana ARID1 gene Proteins 0.000 claims description 3
- 101100504181 Arabidopsis thaliana GCS1 gene Proteins 0.000 claims description 3
- 102100029968 Calreticulin Human genes 0.000 claims description 3
- 102100028914 Catenin beta-1 Human genes 0.000 claims description 3
- 102100025064 Cellular tumor antigen p53 Human genes 0.000 claims description 3
- 102100022477 DNA repair protein complementing XP-C cells Human genes 0.000 claims description 3
- 101100520033 Dictyostelium discoideum pikC gene Proteins 0.000 claims description 3
- 102000013601 Fanconi Anemia Complementation Group D2 protein Human genes 0.000 claims description 3
- 108010026653 Fanconi Anemia Complementation Group D2 protein Proteins 0.000 claims description 3
- 102100031812 Fibulin-1 Human genes 0.000 claims description 3
- 102100031813 Fibulin-2 Human genes 0.000 claims description 3
- 102100039121 Histone-lysine N-methyltransferase MECOM Human genes 0.000 claims description 3
- 102100032742 Histone-lysine N-methyltransferase SETD2 Human genes 0.000 claims description 3
- 101000924266 Homo sapiens AT-rich interactive domain-containing protein 1A Proteins 0.000 claims description 3
- 101100326671 Homo sapiens CALR gene Proteins 0.000 claims description 3
- 101000916173 Homo sapiens Catenin beta-1 Proteins 0.000 claims description 3
- 101000618535 Homo sapiens DNA repair protein complementing XP-C cells Proteins 0.000 claims description 3
- 101001065276 Homo sapiens Fibulin-1 Proteins 0.000 claims description 3
- 101001065274 Homo sapiens Fibulin-2 Proteins 0.000 claims description 3
- 101000654725 Homo sapiens Histone-lysine N-methyltransferase SETD2 Proteins 0.000 claims description 3
- 101000599779 Homo sapiens Insulin-like growth factor 2 mRNA-binding protein 2 Proteins 0.000 claims description 3
- 101001003687 Homo sapiens Lipoma-preferred partner Proteins 0.000 claims description 3
- 101100076418 Homo sapiens MECOM gene Proteins 0.000 claims description 3
- 101000623901 Homo sapiens Mucin-16 Proteins 0.000 claims description 3
- 101000896414 Homo sapiens Nuclear nucleic acid-binding protein C1D Proteins 0.000 claims description 3
- 101000741978 Homo sapiens Phosphatidylinositol 3,4,5-trisphosphate-dependent Rac exchanger 2 protein Proteins 0.000 claims description 3
- 101001043564 Homo sapiens Prolow-density lipoprotein receptor-related protein 1 Proteins 0.000 claims description 3
- 101000579954 Homo sapiens RanBP2-like and GRIP domain-containing protein 3 Proteins 0.000 claims description 3
- 102100037919 Insulin-like growth factor 2 mRNA-binding protein 2 Human genes 0.000 claims description 3
- 102100026358 Lipoma-preferred partner Human genes 0.000 claims description 3
- 108700024831 MDS1 and EVI1 Complex Locus Proteins 0.000 claims description 3
- 102100023123 Mucin-16 Human genes 0.000 claims description 3
- 108010071382 NF-E2-Related Factor 2 Proteins 0.000 claims description 3
- 102000048238 Neuregulin-1 Human genes 0.000 claims description 3
- 108090000556 Neuregulin-1 Proteins 0.000 claims description 3
- 101150067565 Nfatc1 gene Proteins 0.000 claims description 3
- 102000001759 Notch1 Receptor Human genes 0.000 claims description 3
- 108010029755 Notch1 Receptor Proteins 0.000 claims description 3
- 102100031701 Nuclear factor erythroid 2-related factor 2 Human genes 0.000 claims description 3
- 102100034404 Nuclear factor of activated T-cells, cytoplasmic 1 Human genes 0.000 claims description 3
- 102100038633 Phosphatidylinositol 3,4,5-trisphosphate-dependent Rac exchanger 2 protein Human genes 0.000 claims description 3
- 102100021923 Prolow-density lipoprotein receptor-related protein 1 Human genes 0.000 claims description 3
- 101150111584 RHOA gene Proteins 0.000 claims description 3
- 102100027510 RanBP2-like and GRIP domain-containing protein 3 Human genes 0.000 claims description 3
- 102100037375 SLIT-ROBO Rho GTPase-activating protein 3 Human genes 0.000 claims description 3
- 101150083405 SRGAP3 gene Proteins 0.000 claims description 3
- 102100033455 TGF-beta receptor type-2 Human genes 0.000 claims description 3
- 108010082684 Transforming Growth Factor-beta Type II Receptor Proteins 0.000 claims description 3
- 102100022387 Transforming protein RhoA Human genes 0.000 claims description 3
- 108010047933 Tumor Necrosis Factor alpha-Induced Protein 3 Proteins 0.000 claims description 3
- 108010078814 Tumor Suppressor Protein p53 Proteins 0.000 claims description 3
- 102100024596 Tumor necrosis factor alpha-induced protein 3 Human genes 0.000 claims description 3
- 239000003153 chemical reaction reagent Substances 0.000 claims description 3
- RMLPZKRPSQVRAB-UHFFFAOYSA-N tris(3-methylphenyl) phosphate Chemical compound CC1=CC=CC(OP(=O)(OC=2C=C(C)C=CC=2)OC=2C=C(C)C=CC=2)=C1 RMLPZKRPSQVRAB-UHFFFAOYSA-N 0.000 claims description 3
- YPSXFMHXRZAGTG-UHFFFAOYSA-N 4-methoxy-2-[2-(5-methoxy-2-nitrosophenyl)ethyl]-1-nitrosobenzene Chemical compound COC1=CC=C(N=O)C(CCC=2C(=CC=C(OC)C=2)N=O)=C1 YPSXFMHXRZAGTG-UHFFFAOYSA-N 0.000 claims description 2
- 102100021569 Apoptosis regulator Bcl-2 Human genes 0.000 claims description 2
- 102100033182 Creatine kinase S-type, mitochondrial Human genes 0.000 claims description 2
- 101100184204 Drosophila melanogaster Mlf gene Proteins 0.000 claims description 2
- 101000971171 Homo sapiens Apoptosis regulator Bcl-2 Proteins 0.000 claims description 2
- 101001135421 Homo sapiens Creatine kinase S-type, mitochondrial Proteins 0.000 claims description 2
- 101100078258 Homo sapiens RUNX1T1 gene Proteins 0.000 claims description 2
- 102100024952 Protein CBFA2T1 Human genes 0.000 claims description 2
- 108700040655 RUNX1 Translocation Partner 1 Proteins 0.000 claims description 2
- 101710199430 TATA-box-binding protein 2 Proteins 0.000 claims description 2
- 210000004369 blood Anatomy 0.000 claims description 2
- 239000008280 blood Substances 0.000 claims description 2
- 239000012502 diagnostic product Substances 0.000 claims description 2
- 238000003748 differential diagnosis Methods 0.000 claims 1
- 238000003205 genotyping method Methods 0.000 claims 1
- 238000004519 manufacturing process Methods 0.000 claims 1
- 238000003745 diagnosis Methods 0.000 abstract description 13
- 206010061534 Oesophageal squamous cell carcinoma Diseases 0.000 abstract description 7
- 208000036765 Squamous cell carcinoma of the esophagus Diseases 0.000 abstract description 7
- 208000007276 esophageal squamous cell carcinoma Diseases 0.000 abstract description 7
- 230000001575 pathological effect Effects 0.000 abstract description 7
- 238000012795 verification Methods 0.000 abstract description 6
- 238000011282 treatment Methods 0.000 abstract description 5
- 238000010827 pathological analysis Methods 0.000 abstract description 3
- 238000005516 engineering process Methods 0.000 description 21
- 206010028980 Neoplasm Diseases 0.000 description 11
- 208000009956 adenocarcinoma Diseases 0.000 description 11
- 210000001519 tissue Anatomy 0.000 description 9
- 201000011510 cancer Diseases 0.000 description 6
- 239000003550 marker Substances 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- 102100035631 Bloom syndrome protein Human genes 0.000 description 3
- 108091009167 Bloom syndrome protein Proteins 0.000 description 3
- 101001026790 Homo sapiens Tyrosine-protein kinase Fes/Fps Proteins 0.000 description 3
- 102100037845 Isocitrate dehydrogenase [NADP], mitochondrial Human genes 0.000 description 3
- 230000014509 gene expression Effects 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 238000002493 microarray Methods 0.000 description 3
- 208000023514 Barrett esophagus Diseases 0.000 description 2
- 208000023665 Barrett oesophagus Diseases 0.000 description 2
- 101710181272 Choline dehydrogenase, mitochondrial Proteins 0.000 description 2
- 101710177984 Isocitrate dehydrogenase [NADP] Proteins 0.000 description 2
- 101710102690 Isocitrate dehydrogenase [NADP] cytoplasmic Proteins 0.000 description 2
- 101710175291 Isocitrate dehydrogenase [NADP], mitochondrial Proteins 0.000 description 2
- 101710157228 Isoepoxydon dehydrogenase patN Proteins 0.000 description 2
- 101710105116 Oxygen-dependent choline dehydrogenase Proteins 0.000 description 2
- 238000013145 classification model Methods 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000012165 high-throughput sequencing Methods 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 230000007170 pathology Effects 0.000 description 2
- 238000012797 qualification Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- CDKIEBFIMCSCBB-UHFFFAOYSA-N 1-(6,7-dimethoxy-3,4-dihydro-1h-isoquinolin-2-yl)-3-(1-methyl-2-phenylpyrrolo[2,3-b]pyridin-3-yl)prop-2-en-1-one;hydrochloride Chemical compound Cl.C1C=2C=C(OC)C(OC)=CC=2CCN1C(=O)C=CC(C1=CC=CN=C1N1C)=C1C1=CC=CC=C1 CDKIEBFIMCSCBB-UHFFFAOYSA-N 0.000 description 1
- 102100021631 B-cell lymphoma 6 protein Human genes 0.000 description 1
- 102000014816 CACNA1D Human genes 0.000 description 1
- 102100040755 CREB-regulated transcription coactivator 3 Human genes 0.000 description 1
- 101100463133 Caenorhabditis elegans pdl-1 gene Proteins 0.000 description 1
- 102100038700 Calcium-responsive transactivator Human genes 0.000 description 1
- 102100037713 Down syndrome cell adhesion molecule Human genes 0.000 description 1
- 102100031480 Dual specificity mitogen-activated protein kinase kinase 1 Human genes 0.000 description 1
- 206010058314 Dysplasia Diseases 0.000 description 1
- 101150016325 EPHA3 gene Proteins 0.000 description 1
- 102100030324 Ephrin type-A receptor 3 Human genes 0.000 description 1
- 102100022462 Eukaryotic initiation factor 4A-II Human genes 0.000 description 1
- 229940125497 HER2 kinase inhibitor Drugs 0.000 description 1
- 101000971234 Homo sapiens B-cell lymphoma 6 protein Proteins 0.000 description 1
- 101000891906 Homo sapiens CREB-regulated transcription coactivator 3 Proteins 0.000 description 1
- 101000957728 Homo sapiens Calcium-responsive transactivator Proteins 0.000 description 1
- 101000777079 Homo sapiens Chromodomain-helicase-DNA-binding protein 2 Proteins 0.000 description 1
- 101000880945 Homo sapiens Down syndrome cell adhesion molecule Proteins 0.000 description 1
- 101001095815 Homo sapiens E3 ubiquitin-protein ligase RING2 Proteins 0.000 description 1
- 101001044475 Homo sapiens Eukaryotic initiation factor 4A-II Proteins 0.000 description 1
- 101000599886 Homo sapiens Isocitrate dehydrogenase [NADP], mitochondrial Proteins 0.000 description 1
- 101001057193 Homo sapiens Membrane-associated guanylate kinase, WW and PDZ domain-containing protein 1 Proteins 0.000 description 1
- 101001005609 Homo sapiens Mitogen-activated protein kinase kinase kinase 13 Proteins 0.000 description 1
- 101000972286 Homo sapiens Mucin-4 Proteins 0.000 description 1
- 101001013158 Homo sapiens Myeloid leukemia factor 1 Proteins 0.000 description 1
- 101000728236 Homo sapiens Polycomb group protein ASXL1 Proteins 0.000 description 1
- 101000606502 Homo sapiens Protein-tyrosine kinase 6 Proteins 0.000 description 1
- 101000654718 Homo sapiens SET-binding protein Proteins 0.000 description 1
- 101000740519 Homo sapiens Syndecan-4 Proteins 0.000 description 1
- 101001041525 Homo sapiens Transcription factor 12 Proteins 0.000 description 1
- 101000740048 Homo sapiens Ubiquitin carboxyl-terminal hydrolase BAP1 Proteins 0.000 description 1
- 101000867817 Homo sapiens Voltage-dependent L-type calcium channel subunit alpha-1D Proteins 0.000 description 1
- 101000650162 Homo sapiens WW domain-containing transcription regulator protein 1 Proteins 0.000 description 1
- 101000740049 Latilactobacillus curvatus Bioactive peptide 1 Proteins 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 108010068342 MAP Kinase Kinase 1 Proteins 0.000 description 1
- 101150053046 MYD88 gene Proteins 0.000 description 1
- 102100027240 Membrane-associated guanylate kinase, WW and PDZ domain-containing protein 1 Human genes 0.000 description 1
- 206010054949 Metaplasia Diseases 0.000 description 1
- 102100025184 Mitogen-activated protein kinase kinase kinase 13 Human genes 0.000 description 1
- 101710143111 Mothers against decapentaplegic homolog 3 Proteins 0.000 description 1
- 102100025748 Mothers against decapentaplegic homolog 3 Human genes 0.000 description 1
- 102100022693 Mucin-4 Human genes 0.000 description 1
- 102000013609 MutL Protein Homolog 1 Human genes 0.000 description 1
- 108010026664 MutL Protein Homolog 1 Proteins 0.000 description 1
- 102100024134 Myeloid differentiation primary response protein MyD88 Human genes 0.000 description 1
- 102100029691 Myeloid leukemia factor 1 Human genes 0.000 description 1
- 108010018525 NFATC Transcription Factors Proteins 0.000 description 1
- 108700019961 Neoplasm Genes Proteins 0.000 description 1
- 102000048850 Neoplasm Genes Human genes 0.000 description 1
- 102100034400 Nuclear factor of activated T-cells, cytoplasmic 2 Human genes 0.000 description 1
- 102100029799 Polycomb group protein ASXL1 Human genes 0.000 description 1
- 102100039810 Protein-tyrosine kinase 6 Human genes 0.000 description 1
- 102100032741 SET-binding protein Human genes 0.000 description 1
- 206010041067 Small cell lung cancer Diseases 0.000 description 1
- 102100037220 Syndecan-4 Human genes 0.000 description 1
- 238000012338 Therapeutic targeting Methods 0.000 description 1
- 102100021123 Transcription factor 12 Human genes 0.000 description 1
- 102100027548 WW domain-containing transcription regulator protein 1 Human genes 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 210000004027 cell Anatomy 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000008094 contradictory effect Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 230000035622 drinking Effects 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 229940121647 egfr inhibitor Drugs 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 210000003236 esophagogastric junction Anatomy 0.000 description 1
- 210000003238 esophagus Anatomy 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000007672 fourth generation sequencing Methods 0.000 description 1
- 230000002055 immunohistochemical effect Effects 0.000 description 1
- 238000009169 immunotherapy Methods 0.000 description 1
- 229940124302 mTOR inhibitor Drugs 0.000 description 1
- 230000003211 malignant effect Effects 0.000 description 1
- 239000003628 mammalian target of rapamycin inhibitor Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 229940028444 muse Drugs 0.000 description 1
- 230000036438 mutation frequency Effects 0.000 description 1
- 238000007481 next generation sequencing Methods 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- GMVPRGQOIOIIMI-DWKJAMRDSA-N prostaglandin E1 Chemical compound CCCCC[C@H](O)\C=C\[C@H]1[C@H](O)CC(=O)[C@@H]1CCCCCCC(O)=O GMVPRGQOIOIIMI-DWKJAMRDSA-N 0.000 description 1
- 238000012175 pyrosequencing Methods 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 238000011127 radiochemotherapy Methods 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 208000000587 small cell lung carcinoma Diseases 0.000 description 1
- 230000000391 smoking effect Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229940126585 therapeutic drug Drugs 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- 238000007482 whole exome sequencing Methods 0.000 description 1
- 238000012070 whole genome sequencing analysis Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/243—Classification techniques relating to the number of classes
- G06F18/24323—Tree-organised classifiers
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/50—Mutagenesis
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/30—Data warehousing; Computing architectures
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/112—Disease subtyping, staging or classification
Landscapes
- Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Theoretical Computer Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Computational Biology (AREA)
- General Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Evolutionary Biology (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Data Mining & Analysis (AREA)
- Molecular Biology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Medical Informatics (AREA)
- Immunology (AREA)
- Pathology (AREA)
- General Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Evolutionary Computation (AREA)
- General Physics & Mathematics (AREA)
- Microbiology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Biochemistry (AREA)
- Hospice & Palliative Care (AREA)
- Bioethics (AREA)
- Databases & Information Systems (AREA)
- Artificial Intelligence (AREA)
- Oncology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention relates to a biomarker for esophageal cancer typing and application thereof, and relates to the technical field of medical detection. The biomarker comprises more than 5 genes such as GAS7, LRP1B, MAP2K4, FOXP1 and WWOX. By utilizing the biological markers, the diagnosis of the esophageal squamous cell carcinoma and the esophageal adenocarcinoma is carried out, the diagnosis force AUC can reach 0.9447 when 5 markers are used, the diagnosis force AUC can reach 0.9789 when 10 markers are used, the diagnosis force AUC can reach 0.9714 when 20 markers are used, the diagnosis force AUC can reach 0.9489 when all the markers are used, the diagnosis force is excellent, a molecular level-based method for distinguishing two pathological subtypes is provided, mutual verification is provided for pathological diagnosis results, the diagnosis result of a case is ensured to be correct, and subsequent accurate treatment is facilitated.
Description
Technical Field
The invention relates to the technical field of medical detection, in particular to a biomarker for esophageal cancer typing and application thereof.
Background
Esophageal cancer is the 8 th most common cancer and 6 th leading cause of death all over the world, and in China, the incidence rate of esophageal cancer is reduced in recent years, but the mortality rate is always the fourth place. Esophageal cancer is a malignant disease with poorer prognosis, and the total 5-year survival rate of the esophageal cancer in each stage is about 14 percent. Histologically, esophageal cancer is classified as a tumor of epithelial and non-epithelial origin. More than 90% of esophageal cancers are Squamous Cell Carcinomas (SCC) or adenocarcinoma. In terms of histological types, esophageal cancer mainly comprises squamous cell carcinoma accounting for more than 90 percent of the total esophageal cancer in China, mainly comprises adenocarcinoma accounting for about 70 percent of the total esophageal cancer in America and Europe, but the pathological form of the esophageal cancer is changed later, the esophageal adenocarcinoma comprising gastroesophageal junction cancer (GEJ) is remarkably increased in Western countries and America, and currently, the esophageal cancer accounts for 50 percent of the pathological form of the esophageal cancer. Smoking and heavy drinking are important factors causing esophageal squamous carcinoma.
Treatment regimens for esophageal cancer depend on the judgment of the pathotype. Meanwhile, in addition to conventional chemoradiotherapy means, esophageal cancer treatment has now begun to advance into the era of precision medicine: patients who break through the progress of treatment at a later stage, usually need to be tested for genetic variation and, depending on the individual condition, use corresponding targeted drugs, such as Her2 inhibitors, EGFR inhibitors, PDL-1 immunotherapy, mTOR inhibitors, AXL inhibitors, C-MET inhibitors.
The limited tissue samples and the need for increasingly more therapeutic targeting markers greatly increases the current diagnostic needs, and studies of histological diagnostic reproducibility have shown intra-and inter-pathologist variability in the decision: the wrong result of pathological judgment, poorly differentiated tumors, contradictory immunohistochemical results and the like present challenges to the precise medical accuracy of current esophageal cancer.
Meanwhile, the latest 2021 year-old national solid tumor somatic cell high-throughput sequencing (big panel) detection interstitial assessment report shows that the national detection laboratory qualification rate is only 49.2%, the qualification rate of genome or exon sequencing is expected to be lower, and the data greatly shows that the quality control of the patient for formulating a therapeutic drug recommendation based on pathology and gene detection is urgently needed to be improved.
Disclosure of Invention
Aiming at the problems, the invention provides a biomarker for esophageal cancer typing, obtains the biomarker for the typing diagnosis of esophageal squamous cell carcinoma and esophageal adenocarcinoma through different expression maps of variant genes in esophageal squamous cell carcinoma vs adenocarcinoma of the esophageal cancer, provides a distinguishing method for two pathological subtypes based on molecular level, and provides mutual verification for pathological diagnosis results, thereby ensuring correct case diagnosis results and facilitating subsequent accurate treatment.
In order to achieve the above objects, the present invention provides a biomarker for esophageal cancer typing, comprising at least 5 of the following genes: TP, NFE2L, NOTCH, RUNX1T, LRP1, ERBB, MUC, PTPRD, PREX, ERBB, CDKN2, NBEA, APC, ARID1, SMAD, ROBO, PTPRC, NRG, RGPD, TNFAIP, PDGFRB, WWOX, SMAD, SRC, MAFB, PTPRT, SETBP, PML, IDH, BLM, FES, CHD, FLCN, MAP2K, SMAD, ASXL, GAS, MAP2K, TCF, SDC, CRTC, PTK, SS18L, GNAS, NFATC, SALL, 1, FOXP, SRGAP, VHL, PIK3, ROA, TBTBL 1XR, NCKIPSD, MINNND, FBLN, MLF, ROBO, SOX, MB21 XPD, SOC, GMPS, SETD, GMPLTP, MUTP 3, ATR, TBETP, NCK, CACCITP, MCBTR, EPTR, MGPR, TMB, TMBER, TMB, TMCP, MCPR, TMB, MCPR, TMB, MCPR, MCTF, MCPR, MCTF, TMB, MCTF, PBF, MCTF, PBF, MCTF, PBF, PBR, PBF, MCTF, PBF, MCTF, PBR, MCTF, PBR, MCTF, MCR, PBR, MCR, PBR, MCR.
The inventor analyzes through a TCGA database, and finds that the genetic map of the esophageal squamous carcinoma with higher gene mutation frequency than that of esophageal adenocarcinoma is derived from the following genes: TP53, NFE2L2, NOTCH1, RUNX1T 1; the genetic map of the esophageal adenocarcinoma with higher frequency of gene mutation than that of esophageal squamous cell carcinoma is derived from the following genes: LRP1B, ERBB4, MUC16, PTPRD, PREX2, ERBB2, CDKN2A, NBEA, APC, ARID1A, SMAD4, ROBO2, PTPRC, NRG1, RGPD3, TNFAIP3, PDGFRB; the genetic map of the esophageal squamous carcinoma with higher frequency of copy number variation than that of esophageal adenocarcinoma is derived from the following genes: LRP1B, FOXP1, SRGAP3, VHL, PIK3CA, RHOA, TBL1XR1, NCKIPSD, MITF, FANCD2, FBLN2, MLF1, ROBO2, SOX2, MB21D2, XPC, GMPS, SETD2, WWTR1, MAP3K13, TP63, TFRC, BAP1, MECOM, EPHA3, RAF1, MLH1, CTNNB1, IGF2BP2, ETV5, MUC4, CACNA1D, PPARG, MYD88, EIF4A2, BCL6, PBTGFBR 1, TGFBR2, ERBB4, LPP, ATR, CCR 4; the genetic map of the esophageal adenocarcinoma with higher frequency of copy number variation than that of esophageal squamous cell carcinoma is derived from the following genes: WWOX, SMAD4, SRC, MAFB, PTPRT, SETBP1, PML, IDH2, BLM, FES, CHD2, FLCN, MAP2K1, SMAD3, ASXL1, GAS7, MAP2K4, TCF12, SDC4, CRTC3, PTK6, SS18L1, GNAS, NFATC2, SALL 4; the gene is used as a biomarker, and the diagnostic power AUC can reach 0.9489.
In one embodiment, the biomarkers that are typed for frequency of occurrence of gene mutations include at least 5 of the following genes: TP53, NFE2L2, NOTCH1, RUNX1T1, LRP1B, ERBB4, MUC16, PTPRD, PREX2, ERBB2, CDKN2A, NBEA, APC, ARID1A, SMAD4, ROBO2, PTPRC, NRG1, RGPD3, TNFAIP3, frpdgb; the biomarkers based on the occurrence frequency of copy number variation comprise at least 5 of the following genes: LRP1B, FOXP1, SRGAP3, VHL, PIK3CA, RHOA, TBL1XR1, NCKIPSD, MITF, FANCD2, FBLN2, MLF 2, ROBO2, SOX2, MB21D2, XPC, GMPS, SETD2, WWTR 2, MAP3K 2, TP 2, TFRC, BAP 2, MECOM, EPHA 2, RAF 2, MLH 2, CTNNB 2, IGF2BP2, ETV 2, MUC 2, CACNA 12, PPARG, MYD 2, MEIF 4A2, BCL 2, PBFB3672, TGFBR2, ERBB2, LPP, ATR 2, WW 36OX, SMFB 2, PREI 2, SDCT 2, TBSC 2, SMTBAS 2, SMTCK 2, SATC 2, SMTCF 2, SATC 2, SARG 2, FLTC 2, SARG 2, SATCK 2, TFSC 2, TFS 2, TFSC 2, TFAS 2, TFSC 2, TFS 2, TFSC 2, TFS 2, TFSC 2, TFAS 2, TFS 2, TFSC 2, TFAS 2, TFSC 2, TFS 2, TFAS 2, TFS 2, TFAS 2, TFS 2, TFAS 2, TFSC 2, TFS 2, TFAS 2, TFS 2, TFAS 2, TFS 2, TF.
In one embodiment, the biomarkers include the following genes: GAS7, LRP1B, MAP2K4, FOXP1, and WWOX.
By adopting the 5 genes as the biomarkers, the diagnostic power AUC can reach 0.9447.
In one embodiment, the biomarkers include the following genes: TP63, FLCN, MB21D2, MITF, SOX2, GAS7, LRP1B, MAP2K4, FOXP1, and WWOX.
By adopting the 10 genes as the biomarkers, the diagnostic power AUC can reach 0.9789.
In one embodiment, the biomarkers include the following genes: ETV5, SMAD4, ROBO2, TFRC, TBL1XR1, ERBB4, GMPS, PIK3CA, PPARG, RAF1, TP63, FLCN, MB21D2, MITF, SOX2, GAS7, LRP1B, MAP2K4, FOXP1, and WWOX.
By adopting the 20 genes as the biomarkers, the diagnostic power AUC can reach 0.9714.
The invention also provides application of the biomarker in developing and/or preparing a diagnostic product for esophageal squamous carcinoma and esophageal adenocarcinoma typing.
In one embodiment, the biomarker is used as a biomarker in the detection of a biological sample taken from: at least one of blood or tissue.
In one embodiment, the biological sample is detected by a detection method selected from the group consisting of: sequencing technology, microarray hybridization technology, or PCR technology.
In one embodiment, the sequencing technique is selected from the group consisting of: sanger sequencing technology, high-throughput sequencing technology, pyrosequencing technology, sequencing-by-synthesis technology, single-molecule sequencing technology, nanopore sequencing technology, semiconductor sequencing technology, connection sequencing technology, hybridization sequencing technology, digital gene expression technology, next-generation sequencing technology, single-molecule sequencing-by-synthesis technology, massively parallel sequencing technology, clonal single-molecule array technology, shotgun sequencing technology, Maxim Gilbert sequencing technology, primer walking technology, or sequencing technology based on PacBio, SOLiD, ion Torrent or nanopore platforms.
In one embodiment, the microarray hybridization technique is a SNP microarray technique.
In one embodiment, the PCR technique is selected from: KASP typing method, ligase detection reaction typing method or Taqman probe method.
The invention also provides a kit for esophageal cancer typing diagnosis, which comprises a reagent for detecting the biomarkers in a biological sample.
The invention also provides a system for esophageal cancer typing diagnosis, which comprises:
an analysis device: the system is used for obtaining the genetic variation condition of the biomarker in a biological sample of a subject to be diagnosed and inputting the genetic variation condition into an evaluation model for typing evaluation;
an output device: for outputting the above evaluation result.
In one embodiment, the evaluation model is established by: and obtaining a plurality of biological samples of esophageal adenocarcinoma and esophageal squamous carcinoma, sequencing to obtain the gene mutation condition of the biomarker, and establishing a typing model by using a random forest model to obtain the biological sample.
Compared with the prior art, the invention has the following beneficial effects:
the biomarker for esophageal cancer typing and the application thereof are obtained through different expression maps of variant genes in esophageal squamous cell carcinoma vs adenocarcinoma of esophageal cancer, so that the typing diagnosis of esophageal squamous cell carcinoma and esophageal adenocarcinoma is realized, a distinguishing method for two pathological subtypes based on molecular level can be provided, mutual verification is provided for pathological diagnosis results, the diagnosis result of a case is ensured to be correct, and the follow-up accurate treatment is facilitated.
Drawings
FIG. 1 is a flow chart for modeling an esophageal cancer model in example 2;
FIG. 2 is a graph showing AUC in the esophageal cancer triple classification model using 88 markers in example 2;
FIG. 3 is a graph showing the AUC of the esophageal cancer triage model using 20 markers in example 2;
FIG. 4 is a graph showing the AUC of the esophageal cancer triage model using 10 markers in example 2;
fig. 5 is a graph showing AUC of the esophageal cancer triple classification model using 5 markers in example 2.
Detailed Description
To facilitate an understanding of the invention, the invention will now be described more fully with reference to the accompanying drawings. Preferred embodiments of the present invention are shown in the drawings. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.
Defining:
esophageal squamous carcinoma: refers to a subtype of esophageal cancer that develops primarily from squamous cell dysplasia.
Esophageal adenocarcinoma: refers to a subtype of esophageal cancer that has evolved primarily from normal squamous epithelium of the esophagus via Barrett's Esophagus (BE) or incomplete intestinal metaplasia.
TCGA: refers to a database collectively known as The Cancer Genome Atlas, which contains data for 30+ tumors, derived from The National Cancer Institute (NCI) and The National human Genome Institute (NHGRI) initiated Cancer Genome Atlas (The Cancer Genome Atlas, TCGA) program, with The website https:// www.cbioportal.org/.
The source is as follows:
reagents, materials and equipment used in the embodiment are all commercially available sources unless otherwise specified; unless otherwise specified, all the experimental methods are routine in the art.
Example 1
And (3) primarily screening the variant gene markers for the pathological subtype typing of the esophageal cancer based on a TCGA public database.
The screening method is as follows.
1. Tumor tissue whole exome sequencing data from esophageal cancer patients were obtained from the TCGA database.
In this example, full exon sequencing data of 184 esophageal cancer patients (88 adenocarcinoma and 96 squamous carcinoma) were downloaded, and seven different software were used: MuTect, VarScan, MuSE, Radia and somaticSniper, and respectively detecting SNP mutation sites; InDels is detected by VarScan, Pindel and Inelocator respectively, and finally, at least two mutation software are adopted to simultaneously call the mutation sites in the sample as candidate mutation sites.
2. Differential analysis was performed on the adenocarcinoma group dataset and the squamous carcinoma group dataset.
The mutant genes (SNV and CNV) with p less than or equal to 0.05 are selected as potential markers by fisher test analysis, and the potential markers are shown in the table.
TABLE 1 SNV Table
TABLE 2 CNV Table
3. Cancer-associated genes and WWOX genes were selected according to COMIC CGC (Cancer Gene Census, v94) Gene annotation, totaling 88 genes as model markers, 88 markers: TP, NFE2L, NOTCH, RUNX1T, LRP1, ERBB, MUC, PTPRD, PREX, ERBB, CDKN2, NBEA, APC, ARID1, SMAD, ROBO, PTPRC, NRG, RGPD, TNFAIP, PDGFRB, WWOX, SMAD, SRC, MAFB, PTPRT, SETBP, PML, IDH, BLM, FES, CHD, FLCN, MAP2K, SMAD, ASXL, GAS, MAP2K, TCF, SDC, CRTC, PTK, SS18L, GNAS, NFATC, SALL, 1, FOXP, SRGAP, VHL, PIK3, ROA, TBTBL 1XR, NCKIPSD, MINNND, FBLN, MLF, ROBO, SOX, MB21 XPD, SOC, GMPS, SETD, GMPLTP, MUTP 3, ATR, TBETP, NCK, CACCITP, MCBTR, EPTR, MGPR, TMB, TMBER, TMB, TMCP, MCPR, TMB, MCPR, TMB, MCPR, MCTF, MCPR, MCTF, TMB, MCTF, PBF, MCTF, PBF, MCTF, PBF, PBR, PBF, MCTF, PBF, MCTF, PBR, MCTF, PBR, MCTF, MCR, PBR, MCR, PBR, MCR.
Example 2
For the potential markers obtained in example 1, clinical samples were analyzed and validated.
The verification process is as follows.
1. Obtaining a tissue sample: 184 specimens of relevant FFPE sections of pathology identified by relevant experts as esophageal cancer (96 squamous carcinomas, 88 squamous carcinomas) were collected from river-south university.
2. Sample sequencing analysis: FFPE tissue samples were subjected to whole genome sequencing analysis by a third party (clear biotechnology).
3. Establishing a model: using the information of all target markers in example 1, performing detection and judgment on independent verification sets, namely tissue samples of 96 cases of squamous carcinoma and 88 cases of adenocarcinoma patients, performing modeling analysis by using a random forest model, wherein a modeling flow is shown in fig. 1, segmentation is performed according to a ratio of 7:3, 20 times of repetition are performed, the AUC of the model is up to 0.9489, and model parameters are as follows: n _ estimators is 300, max _ features is log2, criterion is entropy, min _ samples _ leaf is 3, and class _ weight is balanced. As shown in fig. 2.
4. Preference was given to 88 markers: by utilizing a random forest model for modeling analysis, tissue samples of 96 cases of squamous carcinoma patients and 88 cases of adenocarcinoma patients are detected and judged according to the proportion of 7:3, repeating for 20 times, establishing the characteristic importance of the model according to the step 3, and selecting the optimal combination of the first 20 MARKER, namely ETV5, SMAD4, ROBO2, TFRC, TBL1XR1, ERBB4, GMPS, PIK3CA, PPARG, RAF1, TP63, FLCN, MB21D2, MITF, SOX2, GAS7, LRP1B, MAP2K4, FOXP1 and WWOX, wherein the AUC of the model obtained by using the 20 MARKER is as high as 0.9714, as shown in FIG. 3.
5. Preference was given to 20 markers: by utilizing a random forest model for modeling analysis, tissue samples of 96 cases of squamous carcinoma patients and 88 cases of adenocarcinoma patients are detected and judged according to the proportion of 7:3, repeating for 20 times, selecting the optimal combination of the first 10 MARKER according to the characteristic importance of the model in the step 3, namely TP63, FLCN, MB21D2, MITF, SOX2, GAS7, LRP1B, MAP2K4, FOXP1 and WWOX, and obtaining the model AUC up to 0.9789 by using the 10 MARKER, as shown in FIG. 4.
6. Preference was given to 10 markers: by utilizing a random forest model for modeling analysis, tissue samples of 96 cases of squamous carcinoma patients and 88 cases of adenocarcinoma patients are detected and judged according to the proportion of 7:3, repeating for 20 times, selecting the optimal combination of the first 5 MARKER according to the characteristic importance of the model in the step 3, namely GAS7, LRP1B, MAP2K4, FOXP1 and WWOX, and obtaining the AUC of the model up to 0.9447 by using the above 5 MARKERs, as shown in FIG. 5.
Example 3
28 samples which are clinically judged to be esophageal cancer are selected, 10 MARKER combined models established in the embodiment 2 are adopted for analysis, the analysis results are compared with the judgment results of clinical experts, and the results are shown in the following table.
Table 3 clinical verification results
From the results, the biomarker of the invention is adopted to accurately judge the classification of squamous cell carcinoma or adenocarcinoma in small cell lung cancer by using the model, and the consistency with expert judgment is more than 91.3%.
The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.
Claims (10)
1. A biomarker for the typing of esophageal cancer, comprising at least 5 of the following genes: TP, NFE2L, NOTCH, RUNX1T, LRP1, ERBB, MUC, PTPRD, PREX, ERBB, CDKN2, NBEA, APC, ARID1, SMAD, ROBO, PTPRC, NRG, RGPD, TNFAIP, PDGFRB, WWOX, SMAD, SRC, MAFB, PTPRT, SETBP, PML, IDH, BLM, FES, CHD, FLCN, MAP2K, SMAD, ASXL, GAS, MAP2K, TCF, SDC, CRTC, PTK, SS18L, GNAS, NFATC, SALL, 1, FOXP, SRGAP, VHL, PIK3, ROA, TBTBL 1XR, NCKIPSD, MINNND, FBLN, MLF, ROBO, SOX, MB21 XPD, SOC, GMPS, SETD, GMPLTP, MUTP 3, ATR, TBETP, NCK, CACCITP, MCBTR, EPTR, MGPR, TMB, TMBER, TMB, TMCP, MCPR, TMB, MCPR, TMB, MCPR, MCTF, MCPR, MCTF, TMB, MCTF, PBF, MCTF, PBF, MCTF, PBF, PBR, PBF, MCTF, PBF, MCTF, PBR, MCTF, PBR, MCTF, MCR, PBR, MCR, PBR, MCR.
2. The biomarker according to claim 1, wherein the biomarker based on the typing of the occurrence frequency of gene mutation comprises at least 5 of the following genes: TP53, NFE2L2, NOTCH1, RUNX1T1, LRP1B, ERBB4, MUC16, PTPRD, PREX2, ERBB2, CDKN2A, NBEA, APC, ARID1A, SMAD4, ROBO2, PTPRC, NRG1, RGPD3, TNFAIP3, frpdgb; the biomarkers based on the occurrence frequency of copy number variation comprise at least 5 of the following genes: LRP1B, FOXP1, SRGAP3, VHL, PIK3CA, RHOA, TBL1XR1, NCKIPSD, MITF, FANCD2, FBLN2, MLF 2, ROBO2, SOX2, MB21D2, XPC, GMPS, SETD2, WWTR 2, MAP3K 2, TP 2, TFRC, BAP 2, MECOM, EPHA 2, RAF 2, MLH 2, CTNNB 2, IGF2BP2, ETV 2, MUC 2, CACNA 12, PPARG, MYD 2, MEIF 4A2, BCL 2, PBFB3672, TGFBR2, ERBB2, LPP, ATR 2, WW 36OX, SMFB 2, PREI 2, SDCT 2, TBSC 2, SMTBAS 2, SMTCK 2, SATC 2, SMTCF 2, SATC 2, SARG 2, FLTC 2, SARG 2, SATCK 2, TFSC 2, TFS 2, TFSC 2, TFAS 2, TFSC 2, TFS 2, TFSC 2, TFS 2, TFSC 2, TFAS 2, TFS 2, TFSC 2, TFAS 2, TFSC 2, TFS 2, TFAS 2, TFS 2, TFAS 2, TFS 2, TFAS 2, TFSC 2, TFS 2, TFAS 2, TFS 2, TFAS 2, TFS 2, TF.
3. The biomarker of claim 1, comprising the following genes: GAS7, LRP1B, MAP2K4, FOXP1, and WWOX.
4. The biomarker of claim 1, comprising the following genes: TP63, FLCN, MB21D2, MITF, SOX2, GAS7, LRP1B, MAP2K4, FOXP1, and WWOX.
5. The biomarker of claim 1, comprising the following genes: ETV5, SMAD4, ROBO2, TFRC, TBL1XR1, ERBB4, GMPS, PIK3CA, PPARG, RAF1, TP63, FLCN, MB21D2, MITF, SOX2, GAS7, LRP1B, MAP2K4, FOXP1, and WWOX.
6. Use of the biomarker according to any one of claims 1 to 5 in the development and/or manufacture of a diagnostic product for the typing of esophageal squamous carcinoma and esophageal adenocarcinoma.
7. The use according to claim 6, wherein the biomarker is used as a biomarker in the detection of a biological sample taken from the group consisting of: at least one of blood or tissue.
8. A kit for use in a differential diagnosis of esophageal cancer, comprising reagents for detecting the biomarkers of any one of claims 1-5 in a biological sample.
9. A system for genotyping and diagnosing esophageal cancer, the system comprising:
an analysis device: the method is used for obtaining the genetic variation condition of the biomarker of any one of claims 1 to 5 in a biological sample of a subject to be diagnosed, and inputting the genetic variation condition into an evaluation model for typing evaluation;
an output device: for outputting the above evaluation result.
10. The system of claim 9, wherein the evaluation model is established by: and obtaining a plurality of biological samples of esophageal adenocarcinoma and esophageal squamous carcinoma, sequencing to obtain the gene mutation condition of the biomarker, and establishing a typing model by using a random forest model to obtain the biological sample.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111590459.8A CN114214409B (en) | 2021-12-23 | 2021-12-23 | Biomarker for esophageal carcinoma typing and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111590459.8A CN114214409B (en) | 2021-12-23 | 2021-12-23 | Biomarker for esophageal carcinoma typing and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114214409A true CN114214409A (en) | 2022-03-22 |
CN114214409B CN114214409B (en) | 2024-03-12 |
Family
ID=80705436
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111590459.8A Active CN114214409B (en) | 2021-12-23 | 2021-12-23 | Biomarker for esophageal carcinoma typing and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114214409B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110499364A (en) * | 2019-07-30 | 2019-11-26 | 北京凯昂医学诊断技术有限公司 | A kind of probe groups and its kit and application for detecting the full exon of extended pattern hereditary disease |
CN112779338A (en) * | 2021-03-03 | 2021-05-11 | 南京世和基因生物技术股份有限公司 | Gene marker for esophageal cancer prognosis evaluation |
CN113186287A (en) * | 2021-05-10 | 2021-07-30 | 深圳康华君泰生物科技有限公司 | Biomarker for non-small cell lung cancer typing and application thereof |
US20210254056A1 (en) * | 2017-05-05 | 2021-08-19 | Camp4 Therapeutics Corporation | Identification and targeted modulation of gene signaling networks |
-
2021
- 2021-12-23 CN CN202111590459.8A patent/CN114214409B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20210254056A1 (en) * | 2017-05-05 | 2021-08-19 | Camp4 Therapeutics Corporation | Identification and targeted modulation of gene signaling networks |
CN110499364A (en) * | 2019-07-30 | 2019-11-26 | 北京凯昂医学诊断技术有限公司 | A kind of probe groups and its kit and application for detecting the full exon of extended pattern hereditary disease |
CN112779338A (en) * | 2021-03-03 | 2021-05-11 | 南京世和基因生物技术股份有限公司 | Gene marker for esophageal cancer prognosis evaluation |
CN113186287A (en) * | 2021-05-10 | 2021-07-30 | 深圳康华君泰生物科技有限公司 | Biomarker for non-small cell lung cancer typing and application thereof |
Non-Patent Citations (2)
Title |
---|
NAN ZHANG等: "Mutational Characterization and Potential Prognostic Biomarkers of Chinese Patients with Esophageal Squamous Cell Carcinoma", 《ONCOTARGETS AND THERAPY》, vol. 13, pages 12797 - 12809 * |
国家卫生健康委员会: "食管癌诊疗规范(2018年版)", 《中华消化病与影像杂志(电子版)》, vol. 9, no. 4, pages 158 - 192 * |
Also Published As
Publication number | Publication date |
---|---|
CN114214409B (en) | 2024-03-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Tong et al. | Tumor-derived DNA from pleural effusion supernatant as a promising alternative to tumor tissue in genomic profiling of advanced lung cancer | |
CN113186287B (en) | Biomarker for non-small cell lung cancer typing and application thereof | |
WO2019068082A1 (en) | Dna methylation biomarkers for cancer diagnosing | |
Lih et al. | Analytical validation and application of a targeted next-generation sequencing mutation-detection assay for use in treatment assignment in the NCI-MPACT trial | |
ES2959360T3 (en) | Improving cancer screening using acellular viral nucleic acids | |
WO2018166476A1 (en) | Method for detecting mutation site in sample | |
Rodríguez et al. | Molecular biomarkers in early stage lung cancer | |
CN114277148B (en) | Biomarker for breast cancer typing and application thereof | |
Santorsola et al. | A multi-parametric workflow for the prioritization of mitochondrial DNA variants of clinical interest | |
Pass et al. | Biomarkers and molecular testing for early detection, diagnosis, and therapeutic prediction of lung cancer | |
JP2021518107A (en) | Tissue-specific methylation marker | |
CN116631508B (en) | Detection method for tumor specific mutation state and application thereof | |
Zhu et al. | The genomic and epigenomic evolutionary history of papillary renal cell carcinomas | |
Gao et al. | Whole‐genome bisulfite sequencing analysis of circulating tumour DNA for the detection and molecular classification of cancer | |
CN113544288A (en) | DNA methylation marker for predicting liver cancer recurrence and application thereof | |
CN115820860A (en) | Method for screening non-small cell lung cancer marker based on methylation difference of enhancer, marker and application thereof | |
Jiang et al. | Circulating tumor cell methylation profiles reveal the classification and evolution of non-small cell lung cancer | |
Yang et al. | Genomic profiling with large-scale next-generation sequencing panels distinguishes separate primary lung adenocarcinomas from intrapulmonary metastases | |
CN113699242A (en) | Primer probe, kit and method for detecting KRAS gene mutation, ADAMTS1 and BNC1 methylation | |
US20230002831A1 (en) | Methods and compositions for analyses of cancer | |
Wood et al. | Comparing mutation calls in fixed tumour samples between the affymetrix OncoScan® array and PCR based next-generation sequencing | |
CN114214409A (en) | Biomarker for esophageal cancer typing and application thereof | |
CN111028888A (en) | Detection method of genome-wide copy number variation and application thereof | |
ES2960872T3 (en) | A method to diagnose cancers of the genitourinary tract | |
CN113999918B (en) | Colorectal cancer distal metastasis specific long-chain non-coding RNA marker LINC01871 and detection kit thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |