CN114317756B - Application of marker - Google Patents
Application of marker Download PDFInfo
- Publication number
- CN114317756B CN114317756B CN202210018328.0A CN202210018328A CN114317756B CN 114317756 B CN114317756 B CN 114317756B CN 202210018328 A CN202210018328 A CN 202210018328A CN 114317756 B CN114317756 B CN 114317756B
- Authority
- CN
- China
- Prior art keywords
- gene combination
- gene
- ppi
- mod
- tumor
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 239000003550 marker Substances 0.000 title claims abstract description 12
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 100
- 206010028980 Neoplasm Diseases 0.000 claims abstract description 82
- 230000014509 gene expression Effects 0.000 claims abstract description 42
- 206010021143 Hypoxia Diseases 0.000 claims abstract description 26
- 230000007954 hypoxia Effects 0.000 claims abstract description 24
- 230000004083 survival effect Effects 0.000 claims abstract description 12
- 238000004393 prognosis Methods 0.000 claims abstract description 11
- 230000002596 correlated effect Effects 0.000 claims description 12
- 201000007270 liver cancer Diseases 0.000 claims description 6
- 208000014018 liver neoplasm Diseases 0.000 claims description 6
- 230000037361 pathway Effects 0.000 claims description 6
- 108010028501 Hypoxia-Inducible Factor 1 Proteins 0.000 claims description 5
- 102000016878 Hypoxia-Inducible Factor 1 Human genes 0.000 claims description 5
- 210000000170 cell membrane Anatomy 0.000 claims description 5
- 230000000694 effects Effects 0.000 claims description 5
- 230000004069 differentiation Effects 0.000 claims description 4
- 210000001339 epidermal cell Anatomy 0.000 claims description 4
- 102100031636 Dynein axonemal heavy chain 9 Human genes 0.000 claims description 3
- 102100029688 Dynein axonemal intermediate chain 3 Human genes 0.000 claims description 3
- 102100024802 Fibroblast growth factor 23 Human genes 0.000 claims description 3
- 102100027817 Homeobox protein GBX-1 Human genes 0.000 claims description 3
- 102100020766 Homeobox protein Hox-C11 Human genes 0.000 claims description 3
- 102100020758 Homeobox protein Hox-C12 Human genes 0.000 claims description 3
- 102100020761 Homeobox protein Hox-C13 Human genes 0.000 claims description 3
- 102100020762 Homeobox protein Hox-C5 Human genes 0.000 claims description 3
- 101000866325 Homo sapiens Dynein axonemal heavy chain 9 Proteins 0.000 claims description 3
- 101000865953 Homo sapiens Dynein axonemal intermediate chain 3 Proteins 0.000 claims description 3
- 101001051973 Homo sapiens Fibroblast growth factor 23 Proteins 0.000 claims description 3
- 101000859749 Homo sapiens Homeobox protein GBX-1 Proteins 0.000 claims description 3
- 101001003015 Homo sapiens Homeobox protein Hox-C11 Proteins 0.000 claims description 3
- 101001002991 Homo sapiens Homeobox protein Hox-C12 Proteins 0.000 claims description 3
- 101001002988 Homo sapiens Homeobox protein Hox-C13 Proteins 0.000 claims description 3
- 101001002966 Homo sapiens Homeobox protein Hox-C5 Proteins 0.000 claims description 3
- 101001050275 Homo sapiens Keratin, type I cuticular Ha1 Proteins 0.000 claims description 3
- 101001007036 Homo sapiens Keratin, type I cuticular Ha4 Proteins 0.000 claims description 3
- 101001046960 Homo sapiens Keratin, type II cytoskeletal 1 Proteins 0.000 claims description 3
- 101001046936 Homo sapiens Keratin, type II cytoskeletal 2 epidermal Proteins 0.000 claims description 3
- 101001020548 Homo sapiens LIM/homeobox protein Lhx1 Proteins 0.000 claims description 3
- 101000613577 Homo sapiens Paired box protein Pax-2 Proteins 0.000 claims description 3
- 101000855004 Homo sapiens Protein Wnt-7a Proteins 0.000 claims description 3
- 101000711796 Homo sapiens Sclerostin Proteins 0.000 claims description 3
- 101000618139 Homo sapiens Sperm-associated antigen 6 Proteins 0.000 claims description 3
- 101000868887 Homo sapiens Transcription factor Sp7 Proteins 0.000 claims description 3
- 102100023131 Keratin, type I cuticular Ha1 Human genes 0.000 claims description 3
- 102100028355 Keratin, type I cuticular Ha4 Human genes 0.000 claims description 3
- 102100022854 Keratin, type II cytoskeletal 2 epidermal Human genes 0.000 claims description 3
- 102100036133 LIM/homeobox protein Lhx1 Human genes 0.000 claims description 3
- 102100020729 Protein Wnt-7a Human genes 0.000 claims description 3
- 102100034201 Sclerostin Human genes 0.000 claims description 3
- 102100032317 Transcription factor Sp7 Human genes 0.000 claims description 3
- 239000003153 chemical reaction reagent Substances 0.000 claims description 3
- 239000000203 mixture Substances 0.000 claims description 3
- 230000003321 amplification Effects 0.000 claims description 2
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 2
- 239000002671 adjuvant Substances 0.000 claims 1
- 210000001808 exosome Anatomy 0.000 abstract description 49
- 206010073071 hepatocellular carcinoma Diseases 0.000 abstract description 12
- 231100000844 hepatocellular carcinoma Toxicity 0.000 abstract description 8
- 239000000092 prognostic biomarker Substances 0.000 abstract description 7
- 101150078635 18 gene Proteins 0.000 abstract description 5
- 230000002601 intratumoral effect Effects 0.000 abstract description 4
- 238000002360 preparation method Methods 0.000 abstract description 2
- 230000001747 exhibiting effect Effects 0.000 abstract 1
- 230000004850 protein–protein interaction Effects 0.000 description 24
- 238000004458 analytical method Methods 0.000 description 18
- 238000000034 method Methods 0.000 description 15
- 238000007637 random forest analysis Methods 0.000 description 12
- 201000011510 cancer Diseases 0.000 description 10
- 238000004422 calculation algorithm Methods 0.000 description 8
- 210000004027 cell Anatomy 0.000 description 8
- 210000001519 tissue Anatomy 0.000 description 8
- 230000033115 angiogenesis Effects 0.000 description 6
- 210000002865 immune cell Anatomy 0.000 description 6
- 102000004169 proteins and genes Human genes 0.000 description 6
- 239000000523 sample Substances 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 101001046870 Homo sapiens Hypoxia-inducible factor 1-alpha Proteins 0.000 description 5
- 102100022875 Hypoxia-inducible factor 1-alpha Human genes 0.000 description 5
- 210000001124 body fluid Anatomy 0.000 description 5
- 239000010839 body fluid Substances 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 206010061218 Inflammation Diseases 0.000 description 4
- 206010027476 Metastases Diseases 0.000 description 4
- 239000000090 biomarker Substances 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 230000004547 gene signature Effects 0.000 description 4
- 230000004054 inflammatory process Effects 0.000 description 4
- 238000007619 statistical method Methods 0.000 description 4
- 238000012549 training Methods 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 230000009401 metastasis Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 230000007170 pathology Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- 210000004881 tumor cell Anatomy 0.000 description 3
- 238000010200 validation analysis Methods 0.000 description 3
- 101100450705 Caenorhabditis elegans hif-1 gene Proteins 0.000 description 2
- 101000635854 Homo sapiens Myoglobin Proteins 0.000 description 2
- 101000706121 Homo sapiens Parvalbumin alpha Proteins 0.000 description 2
- 108700011259 MicroRNAs Proteins 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- 238000003066 decision tree Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000010201 enrichment analysis Methods 0.000 description 2
- 238000010195 expression analysis Methods 0.000 description 2
- 230000035992 intercellular communication Effects 0.000 description 2
- 230000009545 invasion Effects 0.000 description 2
- 210000005229 liver cell Anatomy 0.000 description 2
- 238000001325 log-rank test Methods 0.000 description 2
- 208000002154 non-small cell lung carcinoma Diseases 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 210000002536 stromal cell Anatomy 0.000 description 2
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 102100039463 2-oxoglutarate receptor 1 Human genes 0.000 description 1
- 102100031020 5-aminolevulinate synthase, erythroid-specific, mitochondrial Human genes 0.000 description 1
- 102100036311 5-hydroxytryptamine receptor 1F Human genes 0.000 description 1
- 102100039675 Adenylate cyclase type 2 Human genes 0.000 description 1
- 102100036792 Adhesion G protein-coupled receptor L4 Human genes 0.000 description 1
- 102100025677 Alkaline phosphatase, germ cell type Human genes 0.000 description 1
- 102100032956 Alpha-actinin-3 Human genes 0.000 description 1
- 102100032381 Alpha-hemoglobin-stabilizing protein Human genes 0.000 description 1
- 102100020895 Ammonium transporter Rh type A Human genes 0.000 description 1
- 206010002660 Anoxia Diseases 0.000 description 1
- 241000976983 Anoxia Species 0.000 description 1
- 238000012935 Averaging Methods 0.000 description 1
- 102100030009 Azurocidin Human genes 0.000 description 1
- 102000008096 B7-H1 Antigen Human genes 0.000 description 1
- 108010074708 B7-H1 Antigen Proteins 0.000 description 1
- 102000004219 Brain-derived neurotrophic factor Human genes 0.000 description 1
- 108090000715 Brain-derived neurotrophic factor Proteins 0.000 description 1
- 102000017927 CHRM1 Human genes 0.000 description 1
- 102000017924 CHRM4 Human genes 0.000 description 1
- 102100021851 Calbindin Human genes 0.000 description 1
- 102100039532 Calcium-activated chloride channel regulator 2 Human genes 0.000 description 1
- 102100036214 Cannabinoid receptor 2 Human genes 0.000 description 1
- 102100025475 Carcinoembryonic antigen-related cell adhesion molecule 5 Human genes 0.000 description 1
- 102100028012 Cation channel sperm-associated auxiliary subunit delta Human genes 0.000 description 1
- 108091006146 Channels Proteins 0.000 description 1
- 238000001353 Chip-sequencing Methods 0.000 description 1
- 102100035932 Cocaine- and amphetamine-regulated transcript protein Human genes 0.000 description 1
- 102100033215 DNA nucleotidylexotransferase Human genes 0.000 description 1
- 102100031817 Delta-type opioid receptor Human genes 0.000 description 1
- 102100034577 Desmoglein-3 Human genes 0.000 description 1
- 102100040620 Dynein regulatory complex subunit 6 Human genes 0.000 description 1
- 102100039812 E3 ubiquitin-protein ligase RNF182 Human genes 0.000 description 1
- 102100031690 Erythroid transcription factor Human genes 0.000 description 1
- 102100026082 F-box only protein 40 Human genes 0.000 description 1
- 102000017703 GABRG2 Human genes 0.000 description 1
- 102100028447 Galanin receptor type 1 Human genes 0.000 description 1
- 108010016122 Ghrelin Receptors Proteins 0.000 description 1
- 102100030651 Glutamate receptor 2 Human genes 0.000 description 1
- 102100035716 Glycophorin-A Human genes 0.000 description 1
- 102100036430 Glycophorin-B Human genes 0.000 description 1
- 102100039256 Growth hormone secretagogue receptor type 1 Human genes 0.000 description 1
- 101150087110 HCRT gene Proteins 0.000 description 1
- -1 HOXD Proteins 0.000 description 1
- 101150017422 HTR1 gene Proteins 0.000 description 1
- 102100027685 Hemoglobin subunit alpha Human genes 0.000 description 1
- 102100039894 Hemoglobin subunit delta Human genes 0.000 description 1
- 102100030826 Hemoglobin subunit epsilon Human genes 0.000 description 1
- 102100030378 Hemoglobin subunit theta-1 Human genes 0.000 description 1
- 102100023830 Homeobox protein EMX2 Human genes 0.000 description 1
- 101000609562 Homo sapiens 2-oxoglutarate receptor 1 Proteins 0.000 description 1
- 101001083755 Homo sapiens 5-aminolevulinate synthase, erythroid-specific, mitochondrial Proteins 0.000 description 1
- 101000783605 Homo sapiens 5-hydroxytryptamine receptor 1F Proteins 0.000 description 1
- 101000959347 Homo sapiens Adenylate cyclase type 2 Proteins 0.000 description 1
- 101000928172 Homo sapiens Adhesion G protein-coupled receptor L4 Proteins 0.000 description 1
- 101000574440 Homo sapiens Alkaline phosphatase, germ cell type Proteins 0.000 description 1
- 101000797292 Homo sapiens Alpha-actinin-3 Proteins 0.000 description 1
- 101000797984 Homo sapiens Alpha-hemoglobin-stabilizing protein Proteins 0.000 description 1
- 101001075525 Homo sapiens Ammonium transporter Rh type A Proteins 0.000 description 1
- 101000793686 Homo sapiens Azurocidin Proteins 0.000 description 1
- 101000898082 Homo sapiens Calbindin Proteins 0.000 description 1
- 101000888580 Homo sapiens Calcium-activated chloride channel regulator 2 Proteins 0.000 description 1
- 101000875075 Homo sapiens Cannabinoid receptor 2 Proteins 0.000 description 1
- 101000914324 Homo sapiens Carcinoembryonic antigen-related cell adhesion molecule 5 Proteins 0.000 description 1
- 101000859043 Homo sapiens Cation channel sperm-associated auxiliary subunit delta Proteins 0.000 description 1
- 101000715592 Homo sapiens Cocaine- and amphetamine-regulated transcript protein Proteins 0.000 description 1
- 101000800646 Homo sapiens DNA nucleotidylexotransferase Proteins 0.000 description 1
- 101000992305 Homo sapiens Delta-type opioid receptor Proteins 0.000 description 1
- 101000924311 Homo sapiens Desmoglein-3 Proteins 0.000 description 1
- 101000816907 Homo sapiens Dynein regulatory complex subunit 6 Proteins 0.000 description 1
- 101000667703 Homo sapiens E3 ubiquitin-protein ligase RNF182 Proteins 0.000 description 1
- 101001066268 Homo sapiens Erythroid transcription factor Proteins 0.000 description 1
- 101000913308 Homo sapiens F-box only protein 40 Proteins 0.000 description 1
- 101001061554 Homo sapiens Galanin receptor type 1 Proteins 0.000 description 1
- 101000926813 Homo sapiens Gamma-aminobutyric acid receptor subunit gamma-2 Proteins 0.000 description 1
- 101001010449 Homo sapiens Glutamate receptor 2 Proteins 0.000 description 1
- 101001074244 Homo sapiens Glycophorin-A Proteins 0.000 description 1
- 101001071776 Homo sapiens Glycophorin-B Proteins 0.000 description 1
- 101001009007 Homo sapiens Hemoglobin subunit alpha Proteins 0.000 description 1
- 101001035503 Homo sapiens Hemoglobin subunit delta Proteins 0.000 description 1
- 101001083591 Homo sapiens Hemoglobin subunit epsilon Proteins 0.000 description 1
- 101000843063 Homo sapiens Hemoglobin subunit theta-1 Proteins 0.000 description 1
- 101001048970 Homo sapiens Homeobox protein EMX2 Proteins 0.000 description 1
- 101001053263 Homo sapiens Insulin gene enhancer protein ISL-1 Proteins 0.000 description 1
- 101000977692 Homo sapiens Iroquois-class homeodomain protein IRX-6 Proteins 0.000 description 1
- 101001091379 Homo sapiens Kallikrein-5 Proteins 0.000 description 1
- 101001063991 Homo sapiens Leptin Proteins 0.000 description 1
- 101000620451 Homo sapiens Leucine-rich glioma-inactivated protein 1 Proteins 0.000 description 1
- 101001038509 Homo sapiens Ly6/PLAUR domain-containing protein 2 Proteins 0.000 description 1
- 101001032848 Homo sapiens Metabotropic glutamate receptor 3 Proteins 0.000 description 1
- 101000782981 Homo sapiens Muscarinic acetylcholine receptor M1 Proteins 0.000 description 1
- 101000720512 Homo sapiens Muscarinic acetylcholine receptor M4 Proteins 0.000 description 1
- 101000629029 Homo sapiens Myosin regulatory light chain 2, ventricular/cardiac muscle isoform Proteins 0.000 description 1
- 101000958753 Homo sapiens Myosin-2 Proteins 0.000 description 1
- 101000958741 Homo sapiens Myosin-6 Proteins 0.000 description 1
- 101001030243 Homo sapiens Myosin-7 Proteins 0.000 description 1
- 101000982032 Homo sapiens Myosin-binding protein C, cardiac-type Proteins 0.000 description 1
- 101000635963 Homo sapiens Myosin-binding protein C, fast-type Proteins 0.000 description 1
- 101000604463 Homo sapiens Netrin-G1 Proteins 0.000 description 1
- 101000604469 Homo sapiens Netrin-G2 Proteins 0.000 description 1
- 101000581986 Homo sapiens Neurocan core protein Proteins 0.000 description 1
- 101000603239 Homo sapiens Neuroligin-1 Proteins 0.000 description 1
- 101000633388 Homo sapiens Neuropeptide Y receptor type 4 Proteins 0.000 description 1
- 101000591385 Homo sapiens Neurotensin receptor type 1 Proteins 0.000 description 1
- 101000830386 Homo sapiens Neutrophil defensin 3 Proteins 0.000 description 1
- 101000603107 Homo sapiens Noelin-3 Proteins 0.000 description 1
- 101000983116 Homo sapiens Pancreatic prohormone Proteins 0.000 description 1
- 101000947178 Homo sapiens Platelet basic protein Proteins 0.000 description 1
- 101000582950 Homo sapiens Platelet factor 4 Proteins 0.000 description 1
- 101001116931 Homo sapiens Protocadherin alpha-6 Proteins 0.000 description 1
- 101001072243 Homo sapiens Protocadherin-19 Proteins 0.000 description 1
- 101001060451 Homo sapiens Pyroglutamylated RF-amide peptide receptor Proteins 0.000 description 1
- 101000665449 Homo sapiens RNA binding protein fox-1 homolog 1 Proteins 0.000 description 1
- 101000684826 Homo sapiens Sodium channel protein type 2 subunit alpha Proteins 0.000 description 1
- 101000980827 Homo sapiens T-cell surface glycoprotein CD1a Proteins 0.000 description 1
- 101000764274 Homo sapiens Troponin T, fast skeletal muscle Proteins 0.000 description 1
- 101000607314 Homo sapiens UL16-binding protein 6 Proteins 0.000 description 1
- 101000910745 Homo sapiens Voltage-dependent calcium channel gamma-3 subunit Proteins 0.000 description 1
- 108090000144 Human Proteins Proteins 0.000 description 1
- 102000003839 Human Proteins Human genes 0.000 description 1
- 108091008036 Immune checkpoint proteins Proteins 0.000 description 1
- 102000037982 Immune checkpoint proteins Human genes 0.000 description 1
- 206010062016 Immunosuppression Diseases 0.000 description 1
- 102100023527 Iroquois-class homeodomain protein IRX-6 Human genes 0.000 description 1
- 102100034868 Kallikrein-5 Human genes 0.000 description 1
- 102100022905 Keratin, type II cytoskeletal 1 Human genes 0.000 description 1
- 102100034845 KiSS-1 receptor Human genes 0.000 description 1
- 108010076800 Kisspeptin-1 Receptors Proteins 0.000 description 1
- 102100030874 Leptin Human genes 0.000 description 1
- 102100022275 Leucine-rich glioma-inactivated protein 1 Human genes 0.000 description 1
- 102100040282 Ly6/PLAUR domain-containing protein 2 Human genes 0.000 description 1
- 208000007433 Lymphatic Metastasis Diseases 0.000 description 1
- 206010064912 Malignant transformation Diseases 0.000 description 1
- 101710151321 Melanostatin Proteins 0.000 description 1
- 102100038352 Metabotropic glutamate receptor 3 Human genes 0.000 description 1
- 102000015494 Mitochondrial Uncoupling Proteins Human genes 0.000 description 1
- 108010050258 Mitochondrial Uncoupling Proteins Proteins 0.000 description 1
- 102100030856 Myoglobin Human genes 0.000 description 1
- 102100026925 Myosin regulatory light chain 2, ventricular/cardiac muscle isoform Human genes 0.000 description 1
- 102100038303 Myosin-2 Human genes 0.000 description 1
- 102100038319 Myosin-6 Human genes 0.000 description 1
- 102100038934 Myosin-7 Human genes 0.000 description 1
- 102100026771 Myosin-binding protein C, cardiac-type Human genes 0.000 description 1
- 102100030736 Myosin-binding protein C, fast-type Human genes 0.000 description 1
- 102000017921 NTSR1 Human genes 0.000 description 1
- 206010061309 Neoplasm progression Diseases 0.000 description 1
- 102100038699 Netrin-G2 Human genes 0.000 description 1
- 102100030466 Neurocan core protein Human genes 0.000 description 1
- 102100038992 Neuroligin-1 Human genes 0.000 description 1
- 102100029551 Neuropeptide Y receptor type 4 Human genes 0.000 description 1
- 102100024761 Neutrophil defensin 3 Human genes 0.000 description 1
- 102100037046 Noelin-3 Human genes 0.000 description 1
- 102100037757 Orexin Human genes 0.000 description 1
- 108700005081 Overlapping Genes Proteins 0.000 description 1
- 102100040852 Paired box protein Pax-2 Human genes 0.000 description 1
- 102100026844 Pancreatic prohormone Human genes 0.000 description 1
- 108010088847 Peptide YY Proteins 0.000 description 1
- 102100036154 Platelet basic protein Human genes 0.000 description 1
- 102100028427 Pro-neuropeptide Y Human genes 0.000 description 1
- 102100036389 Protocadherin-19 Human genes 0.000 description 1
- 101001021643 Pseudozyma antarctica Lipase B Proteins 0.000 description 1
- 102100027888 Pyroglutamylated RF-amide peptide receptor Human genes 0.000 description 1
- 102100038188 RNA binding protein fox-1 homolog 1 Human genes 0.000 description 1
- 108091006283 SLC17A7 Proteins 0.000 description 1
- 108091006282 SLC17A8 Proteins 0.000 description 1
- 108091006774 SLC18A3 Proteins 0.000 description 1
- 102100023150 Sodium channel protein type 2 subunit alpha Human genes 0.000 description 1
- 102100021909 Sperm-associated antigen 6 Human genes 0.000 description 1
- 102100024219 T-cell surface glycoprotein CD1a Human genes 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 102100026896 Troponin T, fast skeletal muscle Human genes 0.000 description 1
- 206010064390 Tumour invasion Diseases 0.000 description 1
- 102100040013 UL16-binding protein 6 Human genes 0.000 description 1
- 102100039452 Vesicular acetylcholine transporter Human genes 0.000 description 1
- 102100038039 Vesicular glutamate transporter 1 Human genes 0.000 description 1
- 102100038033 Vesicular glutamate transporter 3 Human genes 0.000 description 1
- 102100024138 Voltage-dependent calcium channel gamma-3 subunit Human genes 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000007953 anoxia Effects 0.000 description 1
- 239000012752 auxiliary agent Substances 0.000 description 1
- KMGARVOVYXNAOF-UHFFFAOYSA-N benzpiperylone Chemical compound C1CN(C)CCC1N1C(=O)C(CC=2C=CC=CC=2)=C(C=2C=CC=CC=2)N1 KMGARVOVYXNAOF-UHFFFAOYSA-N 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000009400 cancer invasion Effects 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000010219 correlation analysis Methods 0.000 description 1
- 230000000875 corresponding effect Effects 0.000 description 1
- 238000002790 cross-validation Methods 0.000 description 1
- 108040004564 crotonyl-CoA reductase activity proteins Proteins 0.000 description 1
- UQHKFADEQIVWID-UHFFFAOYSA-N cytokinin Natural products C1=NC=2C(NCC=C(CO)C)=NC=NC=2N1C1CC(O)C(CO)O1 UQHKFADEQIVWID-UHFFFAOYSA-N 0.000 description 1
- 239000004062 cytokinin Substances 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 239000000104 diagnostic biomarker Substances 0.000 description 1
- CZWHMRTTWFJMBC-UHFFFAOYSA-N dinaphtho[2,3-b:2',3'-f]thieno[3,2-b]thiophene Chemical compound C1=CC=C2C=C(SC=3C4=CC5=CC=CC=C5C=C4SC=33)C3=CC2=C1 CZWHMRTTWFJMBC-UHFFFAOYSA-N 0.000 description 1
- 108091007232 endothelial orphan receptors Proteins 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 201000010536 head and neck cancer Diseases 0.000 description 1
- 208000014829 head and neck neoplasm Diseases 0.000 description 1
- 230000001146 hypoxic effect Effects 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 230000001506 immunosuppresive effect Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000008595 infiltration Effects 0.000 description 1
- 238000001764 infiltration Methods 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 230000036212 malign transformation Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 238000010197 meta-analysis Methods 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 230000001394 metastastic effect Effects 0.000 description 1
- 206010061289 metastatic neoplasm Diseases 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000006916 protein interaction Effects 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 210000003370 receptor cell Anatomy 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000528 statistical test Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 230000005747 tumor angiogenesis Effects 0.000 description 1
- 230000005748 tumor development Effects 0.000 description 1
- 230000004614 tumor growth Effects 0.000 description 1
- 230000005751 tumor progression Effects 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
Landscapes
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention relates to the field of biotechnology, in particular to application of a marker. The invention provides application of gene combination as a marker in preparation of products for predicting tumors, predicting tumor stages and/or predicting tumor prognosis. The gene combination-3-PPI-Mod (3 NM) obtained by the invention is verified in two independent data sets, and the Kaplan-Meier curve shows that the total survival rate time of patients classified as high risk patients is obviously shorter than that of patients classified as low risk patients; and 3-PPI-Mod has a better prognostic significance than the 18-gene, exhibiting better prognostic value than the features produced by gene expression alone. Furthermore, 3-PPI-Mod is significantly associated with intratumoral hypoxia scores. These results indicate that 3-PPI-Mod is highly expressed in tumor exosomes of high clinical grade and can be used as a prognostic and predictive biomarker for hepatocellular carcinoma.
Description
Technical Field
The invention relates to the field of biotechnology, in particular to application of a marker.
Background
Exosomes (exosomes) are membrane particles released by cells, modulating intercellular communication by delivering functional molecules (such as proteins, nucleic acids and lipids) to recipient cells. Cells can secrete exosomes of many different sizes and origins. The outer body is mainly between 50-150nm in diameter, resulting from the polycystic inner body. Other exosomes (100-1000 nm diameter), such as Microbubbles (MVS), exosomes or microparticles, bud directly from the cell membrane.
In recent years, there has been increasing evidence that exosomes play an important role in the development of tumors. Cancer-derived exosomes are involved in a variety of cancerous processes including malignant transformation, angiogenesis, immunosuppression, invasion and therapeutic resistance. Exosomes released by the tumor microenvironment may also affect the properties of cancer cells. One common feature of tumor microenvironments is the induction of tumor exosomes to the receptor cell metastasis invasion and metastasis phenomena. In addition, exosomes released by tumors can alter the distant microenvironment, forming a pre-metastatic niche, promoting the formation of metastases. Suggesting that cancer cell exosomes may play a role both locally and remotely.
Cancerous exosomes detected in various body fluids are considered to be a new non-invasive biomarker. For example, small exosomes (less than 200nm in diameter) in the circulation carry microRNAs (miRNAs) and proteins, which are promising diagnostic and prognostic biomarkers in tumors. However, exosomes identified in body fluids may represent mixed populations of tumor and other tissue release. To date, distinguishing tumor-specific exosomes from exosomes released by other tissues remains a significant challenge due to the lack of specific biomarkers. Protein levels in exosomes released into body fluids are reported to be consistent with protein expression in NSCLC (non-small cell lung cancer) tissues. Thus, comprehensive analysis of primary tumor and secretory exosomes may be a viable method of identifying tumor-specific exosomes-associated biomarkers.
Disclosure of Invention
In view of this, the present invention provides the use of markers that can be prognostic and predictive biomarkers for hepatocellular carcinoma.
In order to achieve the above object, the present invention provides the following technical solutions:
The invention provides application of gene combination as a marker in preparation of products for predicting tumors, predicting tumor stage and/or tumor prognosis;
The gene combinations include one or more of gene combination 1, gene combination 2, or gene combination 3;
The gene combination 1 comprises genes KRT1, HOXC13, KRT34, KRT2, KRT31, HOXC12, HOXC11 and HOXC5;
the gene combination 2 comprises genes PAX2, GBX1, LHX1, WNT7A, SOST, SP7 and FGF23;
The gene combination 3 comprises genes SPAG6, WDR63 and DNAH9.
In some embodiments of the invention, the above gene combination 1 is associated with developmental differentiation of epidermal cells; the gene combination 2 is related to tumor related channels; the above gene combination 3 is involved in the constitution and/or activity of cell membrane components.
In some embodiments of the invention, the above-described gene combination 1 forms an interactive relationship with the above-described gene combination 2 via the gene ISL 1; the gene combination 2 is in an interaction relationship with the gene combination 3 through the genes PVALB, NPY, PF, SPAG 17.
In some embodiments of the invention, expression of the above-described gene combinations is up-regulated in high clinical staging tumors.
In some embodiments of the invention, the above-described combination of genes predicts survival of cancer patients.
In some embodiments of the invention, the above gene combinations are significantly positively correlated with the ratio of immune cells.
In some embodiments of the invention, the ratio of partial genes to immune cells in the above described gene combinations is significantly positively correlated.
In some embodiments of the invention, the above-described gene combinations are associated with a tumor microenvironment.
In some embodiments of the invention, the above-described combination of genes is positively correlated with tumor hypoxia.
In some embodiments of the invention, hypoxia pathway-related genes are significantly enriched in the high-risk tumors predicted by the above-described gene combinations.
In some embodiments of the invention, the above-described score for tumor hypoxia is significantly positively correlated with the expression level of hypoxia inducible factor 1.
In some embodiments of the invention, the above gene combinations are significantly positively correlated with the expression level of hypoxia inducible factor 1 in a high risk subgroup.
The invention also provides a primer combination for amplifying the gene combination.
The invention also provides a reagent, a kit, a system or a device for predicting tumor, predicting tumor stage and/or tumor prognosis, which is characterized by comprising the amplification primer of the gene combination or a product aiming at obtaining the sequence of the gene combination, and acceptable auxiliary materials, auxiliary agents, carriers, modules or components.
In some embodiments of the invention, the steps of manipulating the above-described reagents, kits, systems or devices include extracting exosome genes, sequencing, and/or analyzing the sequencing results using the above-described gene combinations.
The invention also provides a detection method, which comprises the steps of designing and synthesizing primers by using the gene combination as a template and/or analyzing a sample gene by using the gene combination as a reference.
In some embodiments of the invention, the steps of the above detection method comprise extracting exosome genes, sequencing, and/or analyzing the sequencing results using the above gene combinations.
The marker of the invention has the following effects:
1. Expression of 3-PPI-Mod (3 NM) was up-regulated in tumors of the high clinical grade group. The risk of clinical features in patients was predicted by 3-PPI-Mod, with an area under ROC of 0.7368. 79.74% (126/158) of patients were correctly classified when compared to the true clinical grade. The Kaplan-Meier curve demonstrates that 3-PPI-Mod can predict overall survival of patients (p=0.0057), demonstrating that 3-PPI-Mod can be used to predict tumor progression as well as tumor prognosis.
2. In 3-PPI-Mod, mod_5 is involved in the developmental differentiation of epidermal cells; mod_8 is significantly associated with tumor-associated pathways; the gene in mod_15 is widely involved in the composition and activity of cell membrane components.
3. The same algorithm is used for establishing a prediction model-18 genes based on gene expression, and the result 18-genes have poorer prognosis significance than 3-PPI-Mod, which shows that the related characteristics of the exosome show better prognosis value compared with the characteristics of the pure gene expression.
4. All 3 modules in 3-PPI-Mod are positively correlated with hypoxia, and hypoxia pathway related genes are remarkably enriched in high-risk tumors predicted by 3-PPI-Mod. Intratumoral hypoxia scores have a significant correlation with hypoxia inducible factor 1 (HIF 1) expression levels, and up-regulation of HIF1 expression levels in a high risk subgroup identified by 3-PPI-Mod, suggesting that 3-PPI-Mod may be useful for predicting, assessing, and tumor hypoxia microenvironment.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below.
FIG. 1 illustrates an algorithm of a modular construction;
FIG. 2 shows a flow chart of the method of the present patent;
FIG. 3 illustrates PPI network construction; wherein blue ellipses represent exosome-related genes; black lines represent interactions between blue ellipses;
FIG. 4a illustrates cluster mapping according to the expression scores of the 16 module configuration file; the cluster map also shows the link between clusters and clinical pathology stage; T-N-M: tumor classification forms represent tumor-lymph node-metastasis;
FIG. 4b illustrates determining an optimal module using an RF algorithm; the abscissa represents the number of modules of the study and the ordinate represents the cross-validation of each predictive model;
FIG. 4c shows expression scoring based on 3 modules; the abscissa represents the modules 5, 8, 15; the ordinate represents the module score; red is a high clinical tumor stage; blue is low clinical tumor stage;
FIG. 4d shows the validation of prognosis prediction for 3NM by ROC curve; the abscissa indicates the specificity; the ordinate represents sensitivity;
FIG. 4e shows a survival analysis of TCGA-LIHC HCC patients based on 3NM features using KM analysis; the abscissa indicates time; the ordinate represents the survival probability;
FIG. 5a shows the interaction of exosome-related genes from 3 modules by overlapping genes;
FIG. 5b shows GO and KEGG analysis of 3 modules;
FIG. 6 shows survival analysis of HCC patients from different data sets using KM analysis based on 3NM exosome-associated genes; wherein a is survival analysis of GEO-GSE76427 dataset; b is ICGC-LIHC-survival analysis of the JP dataset;
FIG. 7 shows survival analysis of HCC patients on TCGA-LIHC dataset using KM analysis based on 3NM exosome-related genes;
FIG. 8a shows a correlation analysis of 3NM with tumor immune cells and tumor microenvironment status;
FIG. 8b shows tumor hypoxia status using GSEA analysis according to 3NM classification for each group;
FIG. 8c shows the analysis of HIF1A correlation with hypoxia using the Pearson correlation algorithm; the abscissa represents hypoxia score; the ordinate indicates HIF1A expression scores;
FIG. 8d shows a comparison of high and low risk groups using tumor microenvironment status scores (hypoxia); the abscissa represents the grouping; the ordinate represents the hypoxia status score;
FIG. 8e shows a comparison of high and low risk groups using tumor microenvironment status score (HIF 1A); the abscissa represents the grouping; the ordinate indicates HIF1A expression scores.
Detailed Description
The invention discloses application of a marker, and a person skilled in the art can properly improve the technological parameters by referring to the content of the marker. It is expressly noted that all such similar substitutions and modifications will be apparent to those skilled in the art, and are deemed to be included in the present invention. While the methods and applications of this invention have been described in terms of preferred embodiments, it will be apparent to those skilled in the relevant art that variations and modifications can be made in the methods and applications described herein, and in the practice and application of the techniques of this invention, without departing from the spirit or scope of the invention.
Exosomes (exosomes) mediate intercellular communication in the tumor microenvironment and are involved in tumor invasion phenomena. Although exosomes in body fluids are considered ideal biomarkers for tumor diagnosis and prognosis, it is difficult to distinguish between exosomes of tumor origin and exosomes released by other tissues. The invention assumes that analysis of exosomes-associated molecules in tumor tissue helps to estimate the prognostic value of tumor-specific exosomes, and analyzes the gene expression profile of exosomes protein-encoding genes in hepatocellular carcinoma within tumors. Based on a protein-protein interaction (PPI) network method, exosomes related gene characteristics are established, and 3 network modules (3-PPI-Mod) are determined as characteristics for predicting clinical grades of hepatocellular carcinoma. This feature is further validated against the predicted values in both retrospective datasets. In addition, the relationship of gene signature to tumor microenvironment, including hypoxia status and stromal cell abundance, was also studied. These results indicate that the 3-PPI-Mod signal is highly expressed in tumor exosomes of high clinical grade and can be used as a prognostic and predictive biomarker for hepatocellular carcinoma.
Hypoxia/anoxia as referred to herein refers to a condition in which the partial pressure of oxygen in a tissue drops below a critical value or oxygen reduction is effectively utilized.
The microenvironment referred to in the present invention refers to the cellular matrix and the body fluid components therein.
Tumor microenvironment refers to the fact that tumor development, growth and metastasis are closely related to the internal and external environment in which tumor cells are located, including not only the structure, function and metabolism of the tissue in which the tumor is located, but also to the internal environment of the tumor cells themselves (nuclear and cytoplasmic).
The invention is further illustrated by the following examples:
Example 1: establishing predictive features for clinical grade risk of cancer
1. Patient and clinical characteristics
The invention uses data from 3 different databases as the subject of investigation. They are LIHC (hepatocellular carcinoma) transcript sequencing data from TCGA (TCGA-LIHC), japanese case transcript sequencing queues from LIHC of ICGC (ICGC-LIHC-JP) and LIHC chip sequencing data from the Singapore bioinformatic center (GEO: GSE 76427), respectively. These 3 different independent queues cover two different transcript quantification methods, different populations in asia and europe. Wherein, the total number of samples of TCGA-LIHC data is 356, and the number of usable samples after OS (overall survival rate) is less than 30 days and clinical data insufficiency cases is 316. Similarly, ICGC-LIHC-JP data total number of samples 212, available number of samples 161, GSE76427 data total number of samples 115, available number of samples 94. The sample information is detailed in table 1.
Table 1 clinical characteristics of patients in different LIHC databases
TCGA data was downloaded from:
https://www.cancer.gov/about-nci/organization/ccg/research/structural-genomics/tcga/
ICGC data were downloaded from:
https://dcc.icgc.org/
GSE76427 data was downloaded from:
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgiacc=GSE76427/
2. exoRBase exosome database
The liver cell cancer-associated exosome genes are from exosome gene database exoRBase. The download address is http:// www.exorbase.org/exoRBase/browse/tomRNAIndex/. The database of liver cell cancer exosome characteristic genes comprises 17221 genes. Further enrichment by bioinformatics means is required to obtain exocrine expression genes closely related to hepatocellular carcinoma.
3. Gene expression data processing
TCGA-LIHC data is two sets of data of read counts and FPKM, ICGC-LIHC-JP data is read counts, and GSE76427 data extracts probe values (log 2 intensity) and probe notes from an original file downloaded from a gene expression integrated database (GEO). The data normalization was performed by Z-transforming the FPKM data of TCGA-LIHC, ICGC-LIHC-JP and GSE76427 data. Differential expression analysis of tumor and paranormal samples was performed using read counts data from TCGA-LIHC.
In the differential expression analysis, the invention uses EBSeq software to analyze the gene expression difference between the tumor and the paracancerous normal sample in the TCGA-LIHC queue.
4. PPI data processing
Construction of the human protein interaction network PPI was performed using STRING (https:// STRING-db. Org /). The build process uses default parameters.
And identifying the PPI module by using the MCODE. Degree cutoff value is 2,node score cutoff, K-score value is 2, max. Depth value is 100.
5. Identification of clinical grade related PPI modules
To further screen out of the potential PPI modules with significant clinical grading prediction functions. The PPI module obtained by the method is integrated with the expression data of TCGA-LIHC. The present invention first calculates the Expression Score (Expression Score; e) of each module. In a given module M with M genes, the expression score (e) of M in sample j is defined as:
Wherein Z ij is the Z-transgene expression value of gene i.
Then, the discriminant score S (M) of the M module is defined as the Mutual Information (MI) between class e' and clinical classification class (c):
Where e' represents a discrete form of e, the expression score e is discretized to 9 (log 2 (N) +1), N being the number of samples. The calculation process is shown in fig. 1. In clinical grading, stage I and II are classified as Low Stage group and III and IV are classified as HIGH STAGE group.
Thereafter, the present invention calculates MI values of randomly selected "modules" by randomly extracting the same genes as the number of M module genes from PPI network genes. Each module was run randomly 1000 times. And (3) carrying out statistical test by using the calculation result of random sampling and the calculation result of an actual module, and selecting a module with significant p value (p < 0.0001) for further analysis.
6. Recursive predictive PPI module
Gene expression scores of candidate modules as described previously were used to construct PPI network-based gene signatures. Feature selection and modeling is performed using R-packets randomForest based on a Random Forest (RF) algorithm, i.e., the predictive importance of each candidate module repetition is estimated using the initial RF of 5000 decision trees. And determining the optimal combination of the recursion prediction candidate modules by adopting a step-by-step backward selection method. In each iteration, 10% of the features were excluded and the remaining features were used to construct an RF containing 3000 decision trees. This procedure is stopped when only two functions remain. Among all the iteration results, the RF model with the smallest feature number is selected. In the final RF model, the clinical grade risk of the patient is determined by oob (out-of-bag data) probabilities. Finally, the invention selects 3 PPI modules meeting the requirements.
7. Other statistical methods
The present invention uses R package clusterProfiler to perform GO and KEGG path enrichment analysis on PPI modules. GSEA was used to compare the gene set of interest to a subgroup of patients classified by 3-PPI-Mod. Correlation of PPI modules with hypoxia, angiogenesis and inflammation scores and stromal cell abundance was calculated by pearson correlation coefficients. Multiple tests were adjusted by the error discovery rate (FDR) using the method of Benjamini-Hochberg. Patient survival rates were compared for the 3-PPI-Mod assigned low-risk and high-risk groups using the Kaplan-Meier curve and the log-rank test. The multivariate cox model was used to evaluate the prognostic and predictive value of 3-PPI-Mod signals. All statistical analyses were performed using R software (version 3.3.1). p <0.05 indicates significant differences.
8. Results and analysis
Fig. 2 depicts an overall flow chart of the present invention.
1134 Specific tumor and paracancerous differential expressed genes (PPDE > 0.99) were obtained by analysis of the present invention. On the basis, the invention uses the differential expression genes and the liver cancer specific exosome expression genes in exoRBase databases to carry out intersecting operation, thus obtaining 437 liver cancer exosome specific expression genes.
Based on the obtained 437 liver cancer exosomes specifically expressed genes, PPI network constructed by STRING was used. The network contains 321 nodes, 938 interactions (fig. 3).
For specific proteins in the exosomes of hepatocellular carcinoma, 321 proteins were mapped onto a reference PPI network. The exogenously related PPI network integrates with the gene expression profile of the training dataset.
And identifying the PPI modules by using the MCODE, and finally obtaining 16 potential PPI modules. The set of genes included in the module are as follows:
Module 1: PYY, PF4, NPY4R, CCR, ADCY2, OPRD1, NMU, GRM3, CNR2, NPY, PPBP, PPY, CHRM4, GALR1, OXGR1, HTR1F, HTR a;
Module 2: MYH7, MYL2, ACTN3, MYBPC3, MYH2, MYH6, MYBPC2, TNNT3
Module 3: GYPA, GYPB, RHAG, GATA1, AHSP, ALAS2, HBA1, HBQ1;
Module 4: HBD, HBE1, SCN2A, CALB, BDNF, CARTPT, SLC17A8, SLC18A3, NTSR1, GHSR, CHRM1, KISS1R, QRFPR;
Module 5: KRT1, HOXC13, KRT34, KRT2, KRT31, HOXC12, HOXC11, HOXC5;
and (6) module 6: ALPPL2, CEACAM5, RAET1L, LYPD2, NTNG1, NLGN1, SLC17A7, OLFM3, GABRG2, RBFOX1;
Module 7: UCP1, HCRT, LEP, NR A1;
Module 8: PAX2, GBX1, LHX1, WNT7A, SOST, SP7, FGF23;
Module 9: CATSPERD, CACNG3, CACCNG 6;
module 10: AZU1, DNTT, DEFA3, MPO, CD1A;
Module 11: chra 2, PCDH19, LGI1;
Module 12: PVALB, NCAN, GRIA2;
module 13: EMX2, HOXD, IRX6;
module 14: CLCA2, KLK5, DSG3;
Module 15: SPAG6, WDR63, DNAH9;
Module 16: FBXO40, RNF182, FBXL13.
A random sampling method is used to estimate the significance of the discrimination score for each module. Finally, by random forest modeling analysis, the results showed that the recurrence score was significantly higher for 16 modules than for contingency (p <0.001, table 2). The details of the 16 candidate blocks are shown in table 2.
Table 2 module MI values and significance
Module | MI value | P value |
Module 8 | 0.06861101 | 7.18×10-28 |
Module 15 | 0.06685537 | 1.55×10-38 |
Module 5 | 0.06665415 | 1.37×10-11 |
Module 16 | 0.06464605 | 5.05×10-10 |
Module 1 | 0.06455428 | 1.66×10-10 |
Module 6 | 0.06433018 | 5.85×10-10 |
Module 4 | 0.06269076 | 3.76×10-7 |
Module 10 | 0.06220704 | 4.32×10-7 |
Module 11 | 0.06148547 | 1.07×10-6 |
Module 7 | 0.0605967 | 1.01×10-18 |
Module 2 | 0.06023881 | 1.31×10-22 |
Module 12 | 0.06015081 | 1.00×10-14 |
Module 9 | 0.06011182 | 2.96×10-17 |
Module 14 | 0.05942734 | 7.16×10-25 |
Module 3 | 0.05891626 | 5.58×10-32 |
Module 13 | 0.05820099 | 2.46×10-30 |
The expression score distribution of the 16 PPI modules is shown in fig. 4a, where a cluster map shows the relationship between clusters and clinical pathology stage. Next, the greedy search procedure and random sampling found 13 modules to be able to significantly distinguish clinical classifications (P < 0.001). The cluster map of expression scores for the 16 modules divides the modules into two broad categories (clusters 1, 2). Cluster 1 contains 3 modules (including module 1, module 4 and module 6) genes up-regulated at tumor stage I and II, and cluster 2 contains 13 modules up-regulated at tumor stage iii+iv (p=0.033). Furthermore, clinical pathology classification of HCC, such as T (p=0.043) or grade stage (p=0.046), is associated with these 16 modules.
The present invention then contemplates the use of 13 PPI modules to build an optimal model for predicting patient clinical grade risk. By using a Random Forest (RF) algorithm, the present invention finds that the combination of the three modules (mod_5, mod_8, mod_15) achieves the optimal prediction accuracy by the RF algorithm (fig. 4 b). 3 modules were up-regulated in HIGH STAGE groups of tumors (fig. 4 c). A predictive model, called the 3-PPI-Mod signature, was then constructed using these 3 PPI modules. The risk of clinical profile in patients was predicted by 3-PPI-Mod profile with an Area Under ROC (AUROC) of 0.7368 (FIG. 4 d). Patients were classified into a high clinical grade risk group and a low clinical grade risk group with the median of the predicted risk factors as a threshold. Overall, 79.74% (126/158) patients were correctly classified when compared to the true clinical grade. The Kaplan-Meier curve demonstrates that the 3-PPI-Mod profile predicts overall survival in patients (p=0.0057, fig. 4 e). Multivariate cox regression showed that the 3-PPI-Mod signature was an independent prognostic factor for OS (corrected risk ratio [ HR ] =2.7, 95% ci,1-7.2, p=0.045, table 3).
TABLE 3 multivariable Cox regression of OS in training dataset
In the 3-PPI-Mod marker, 8, 7 and 3 genes are contained in module 5 (Mod_5), module 8 (Mod_8) and module 15 (Mod_15), respectively. Mod_5 is connected to Mod_8 and Mod_15 in the PPI network (FIG. 5 a). GO and route enrichment analysis showed that mod_5 is involved in developmental differentiation of epidermal cells, mod_8 is significantly involved in tumor-related pathways, and genes in mod_15 are widely involved in the composition and activity of cell membrane components (fig. 5 b).
Example 2: verification of 3-PPI-Mod features in independent queues
The statistical method described in example 1 was used to verify the 3-PPI-Mod prognosis using two independent sets of verification data. The prediction of patient relapse risk was performed on each validation dataset using the 3-PPI-Mod features established by the training dataset. The Kaplan-Meier curve shows that the OS time classified as high risk patients is significantly shorter than that classified as low risk patients (log-rank test: GSE76427, p=0.0039; icgc-LIHC-JP, P < 0.0001) (fig. 6a, fig. 6 b).
Example 3: comparison of prognostic value of exosome-related Signal with simple Gene expression Signal
The exosome-related 3-PPI-Mod markers were compared to 18-gene markers (constructed based on gene expression alone) using the statistical method described in example 1 (fig. 7). The 18 gene signature was established using the same algorithm as the 3-PPI-Mod signature. Whether in the training dataset or in the validation set, the 18 gene signature showed similar predictive performance as the 3-PPI-Mod signature in distinguishing clinical grade from OS (fig. 6, fig. 7). However, the prognostic significance of the 18-gene signature was worse than that of the 3-PPI-Mod signature to a significant extent compared to the two, indicating that the exosome-related signature showed better prognostic value than the signature of the pure gene expression.
Example 4:3-PPI-Mod reflects tumor interstitial interactions and hypoxic tumor microenvironment
1. Tumor microenvironment identification
Hypoxia metabolites of different cancer types were obtained from previous studies with core angiogenesis markers of (Buffa F.M.Harris A.L.West C.M.Miller C.J.Large meta-analysis of multiple cancers reveals a common,compact and highly prognostic hypoxia metagene.Br.J.Cancer.2010;102:428-435). primary tumors (Masiero M).F.C.Han H.D.Snell C.Peterkin T.Bridges E.Mangala L.S.Wu S.Y.Pradeep S.Li D.et al.A core human primary tumor angiogenesis signature identifies the endothelial orphan receptor ELTD1 as a key regulator of angiogenesis.Cancer Cell.2013;24:229-241). Inflammatory cytokinins were used to estimate intratumoral inflammation levels (Saloura V.Zuo Z.Koeppen H.Keck M.K.Khattri A.Boe M.Hegde P.S.Xiao Y.Nakamura Y.Vokes E.E.et al.Correlation of T-cell inflamed phenotype with mesenchymal subtype,expression of PD-L1,and other immune checkpoints in head and neck cancer.J.Clin.Oncol.2014;32(6009–6009)). tumor cells hypoxia, angiogenesis and inflammation scores were calculated by averaging the Z-normalized expression values of the corresponding marker genes. The present invention uses cibert to calculate the abundance of immune and non-immune cells in a tumor microenvironment through a gene expression profile for a tissue infiltrating cell population.
2. Results and analysis
Non-cancerous cells within a tumor play an important role in the construction of the tumor microenvironment, particularly the infiltration of immune cells. It is therefore hypothesized that exosome-specific 3-PPI-Mod may be associated with the tumor microenvironment. For this, identification of the proportion of cell subtypes associated with immunization was first performed in the TCGA dataset using CIRBERSORT. Interestingly, the proportion of partial gene expression in 3-PPI-Mod was in a significant positive correlation with the immune cell, but 3-PPI-Mod was not itself significantly correlated.
Further, the present invention analyzes the relationship between the 3-PPI-Mod signal and the tumor microenvironment status. The results showed that all 3 mods were positively correlated with hypoxia, and that all of mod_5 was correlated with angiogenesis and inflammation relative to mod_8 and mod_15 (fig. 8 a). Hypoxia pathway-related genes were significantly enriched in 3-PPI-Mod predicted high-risk tumors (fig. 8 b). Intratumoral hypoxia scores were significantly correlated with hypoxia inducible factor 1 (HIF 1) expression levels, with HIF1 expression levels upregulated in the high risk subgroups identified by the 3-PPI-Mod markers (fig. 8c, 8d, 8 e).
The foregoing is merely a preferred embodiment of the present invention and it should be noted that modifications and adaptations to those skilled in the art may be made without departing from the principles of the present invention, which are intended to be comprehended within the scope of the present invention.
Claims (8)
1. The application of the gene combination as a marker in preparing a liver cancer prognosis product;
the gene combination consists of a gene combination 1, a gene combination 2 and a gene combination 3;
the gene combination 1 is genes KRT1, HOXC13, KRT34, KRT2, KRT31, HOXC12, HOXC11 and HOXC5;
the gene combination 2 is genes PAX2, GBX1, LHX1, WNT7A, SOST, SP7 and FGF23;
the gene combination 3 is genes SPAG6, WDR63 and DNAH9.
2. The use according to claim 1, wherein the gene combination 1 is associated with developmental differentiation of epidermal cells; the gene combination 2 is related to a tumor-related pathway; the gene combination 3 is involved in the composition and/or activity of cell membrane components.
3. The use of claim 2, wherein said gene combination 1, said gene combination 2 and said gene combination 3 are predictive of survival in liver cancer patients.
4. The use of claim 3, wherein said gene combination 1, said gene combination 2 and said gene combination 3 are associated with a tumor microenvironment.
5. The use of claim 4, wherein said gene combination 1, said gene combination 2 and said gene combination 3 are positively correlated with tumor hypoxia.
6. The use of claim 5, wherein the score for tumor hypoxia is positively correlated with the level of hypoxia inducible factor 1 expression.
7. A primer combination for amplifying the gene combination 1, the gene combination 2 and the gene combination 3 according to claim 1.
8. A reagent, kit, system or device for prognosis of liver cancer comprising the amplification primers of gene combination 1, gene combination 2 and gene combination 3 of claim 1, and acceptable adjuvants, modules or components.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210018328.0A CN114317756B (en) | 2022-01-07 | 2022-01-07 | Application of marker |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210018328.0A CN114317756B (en) | 2022-01-07 | 2022-01-07 | Application of marker |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114317756A CN114317756A (en) | 2022-04-12 |
CN114317756B true CN114317756B (en) | 2024-05-07 |
Family
ID=81024791
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210018328.0A Active CN114317756B (en) | 2022-01-07 | 2022-01-07 | Application of marker |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114317756B (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014184334A1 (en) * | 2013-05-16 | 2014-11-20 | INSERM (Institut National de la Santé et de la Recherche Médicale) | Fgf23 as a biomarker for predicting the risk of mortality due to end stage liver disease |
WO2016115354A1 (en) * | 2015-01-14 | 2016-07-21 | Taipei Medical University | Methods for cancer diagnosis and prognosis |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1961825A1 (en) * | 2007-02-26 | 2008-08-27 | INSERM (Institut National de la Santé et de la Recherche Medicale) | Method for predicting the occurrence of metastasis in breast cancer patients |
-
2022
- 2022-01-07 CN CN202210018328.0A patent/CN114317756B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014184334A1 (en) * | 2013-05-16 | 2014-11-20 | INSERM (Institut National de la Santé et de la Recherche Médicale) | Fgf23 as a biomarker for predicting the risk of mortality due to end stage liver disease |
WO2016115354A1 (en) * | 2015-01-14 | 2016-07-21 | Taipei Medical University | Methods for cancer diagnosis and prognosis |
Non-Patent Citations (5)
Title |
---|
Exosomal MiR-744 Inhibits Proliferation and Sorafenib Chemoresistance in Hepatocellular Carcinoma by Targeting PAX2;Guanghui Wang等;Med Sci Monit;第25卷;7209-7217 * |
LHX1基因在肝癌患者中的潜在临床价值和功能初步研究;郭鹏等;中国临床医生杂志;第46卷(第07期);806-810 * |
Prospective Analysis of Proteins Carried in Extracellular Vesicles with Clinical Outcome in Hepatocellular Carcinoma;Wenbiao Chen等;Curr Genomics;第23卷(第2期);109-117 * |
The role and potential application of extracellular vesicles in liver cancer;Xuewei Qi等;Sci China Life Sci;第64卷(第8期);1281-1294 * |
Wnt7a对肝癌细胞凋亡、迁移及侵袭的影响;兰莉辉等;现代肿瘤医学;第27卷(第5期);746-749 * |
Also Published As
Publication number | Publication date |
---|---|
CN114317756A (en) | 2022-04-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Song et al. | Identification of serum microRNAs as novel non-invasive biomarkers for early detection of gastric cancer | |
Kwon et al. | Prognosis of stage III colorectal carcinomas with FOLFOX adjuvant chemotherapy can be predicted by molecular subtype | |
Staaf et al. | Relation between smoking history and gene expression profiles in lung adenocarcinomas | |
CN110423816B (en) | Breast cancer prognosis quantitative evaluation system and application | |
Brunner et al. | A shared transcriptional program in early breast neoplasias despite genetic and clinical distinctions | |
Jovanović et al. | Comparison of triple-negative breast cancer molecular subtyping using RNA from matched fresh-frozen versus formalin-fixed paraffin-embedded tissue | |
Le Page et al. | Gene expression profiling of primary cultures of ovarian epithelial cells identifies novel molecular classifiers of ovarian cancer | |
Chen et al. | Identification of differentially expressed genes in lung adenocarcinoma cells using single-cell RNA sequencing not detected using traditional RNA sequencing and microarray | |
Jiang et al. | Circulating tumor cell methylation profiles reveal the classification and evolution of non-small cell lung cancer | |
US10519505B2 (en) | Genomic signatures of metastasis in prostate cancer | |
Shi et al. | Genomic alterations and evolution of cell clusters in metastatic invasive micropapillary carcinoma of the breast | |
Zhang et al. | A Novel Immune‐Related lncRNA‐Based Model for Survival Prediction in Clear Cell Renal Cell Carcinoma | |
Ye et al. | miRNA-218/FANCI is associated with metastasis and poor prognosis in lung adenocarcinoma: a bioinformatics analysis | |
Dupont et al. | A gene expression signature associated with metastatic cells in effusions of breast carcinoma patients | |
Cai et al. | A plasma-derived extracellular vesicle mRNA classifier for the detection of breast cancer | |
Liu et al. | Immune cell infiltration and identifying genes of prognostic value in the papillary renal cell carcinoma microenvironment by bioinformatics analysis | |
Zhang et al. | Identification of five cytotoxicity-related genes involved in the progression of triple-negative breast cancer | |
Yan et al. | Analysis of the role of METTL5 as a hub gene in lung adenocarcinoma based on a weighted gene co-expression network | |
Zhao et al. | PGM1 and ENO1 promote the malignant progression of bladder cancer via comprehensive analysis of the m6A signature and tumor immune infiltration | |
CN114317756B (en) | Application of marker | |
Lotan et al. | Urine-Based Markers for Detection of Urothelial Cancer and for the Management of Non–muscle-Invasive Bladder Cancer | |
Wang et al. | Use of bioinformatic database analysis and specimen verification to identify novel biomarkers predicting gastric cancer metastasis | |
Tiwari | Microarrays and cancer diagnosis. | |
CN116121390A (en) | Marker for prognosis of cancer and suitability for immunotherapy and application thereof | |
Zawistowski et al. | Unifying genomics and transcriptomics in single cells with ResolveOME amplification chemistry to illuminate oncogenic and drug resistance mechanisms |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |