CN116665898A - Biomarker for predicting prognosis of gastric cancer based on histone modification regulator characteristics, scoring model and application - Google Patents
Biomarker for predicting prognosis of gastric cancer based on histone modification regulator characteristics, scoring model and application Download PDFInfo
- Publication number
- CN116665898A CN116665898A CN202310640320.2A CN202310640320A CN116665898A CN 116665898 A CN116665898 A CN 116665898A CN 202310640320 A CN202310640320 A CN 202310640320A CN 116665898 A CN116665898 A CN 116665898A
- Authority
- CN
- China
- Prior art keywords
- gastric cancer
- prognosis
- scoring
- histone modification
- predicting
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 208000005718 Stomach Neoplasms Diseases 0.000 title claims abstract description 73
- 206010017758 gastric cancer Diseases 0.000 title claims abstract description 73
- 201000011549 stomach cancer Diseases 0.000 title claims abstract description 73
- 108010033040 Histones Proteins 0.000 title claims abstract description 65
- 230000004048 modification Effects 0.000 title claims abstract description 65
- 238000012986 modification Methods 0.000 title claims abstract description 65
- 238000004393 prognosis Methods 0.000 title claims abstract description 48
- 239000000090 biomarker Substances 0.000 title claims abstract description 17
- 230000014509 gene expression Effects 0.000 claims abstract description 40
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 36
- 230000001105 regulatory effect Effects 0.000 claims abstract description 16
- 230000011987 methylation Effects 0.000 claims abstract description 12
- 238000007069 methylation reaction Methods 0.000 claims abstract description 12
- 230000006195 histone acetylation Effects 0.000 claims abstract description 9
- 238000000611 regression analysis Methods 0.000 claims abstract description 8
- 101150063057 CRYAB gene Proteins 0.000 claims abstract description 6
- 101150066681 Rae1 gene Proteins 0.000 claims abstract description 6
- 101100317196 Schizosaccharomyces pombe (strain 972 / ATCC 24843) swc2 gene Proteins 0.000 claims abstract description 6
- 238000000034 method Methods 0.000 claims description 10
- 230000002103 transcriptional effect Effects 0.000 claims description 6
- 102100040743 Alpha-crystallin B chain Human genes 0.000 claims description 5
- 102100033642 Bromodomain-containing protein 3 Human genes 0.000 claims description 5
- 101000799140 Homo sapiens Activin receptor type-1 Proteins 0.000 claims description 5
- 101000891982 Homo sapiens Alpha-crystallin B chain Proteins 0.000 claims description 5
- 101000871851 Homo sapiens Bromodomain-containing protein 3 Proteins 0.000 claims description 5
- 101000959794 Homo sapiens Interferon alpha-2 Proteins 0.000 claims description 5
- 101001104307 Homo sapiens Ribonuclease 7 Proteins 0.000 claims description 5
- 101000841520 Homo sapiens Uridine-cytidine kinase-like 1 Proteins 0.000 claims description 5
- 101000667209 Homo sapiens Vacuolar protein sorting-associated protein 72 homolog Proteins 0.000 claims description 5
- 101000744742 Homo sapiens YTH domain-containing family protein 1 Proteins 0.000 claims description 5
- 101000759186 Homo sapiens Zinc finger translocation-associated protein Proteins 0.000 claims description 5
- 101000620751 Homo sapiens mRNA export factor RAE1 Proteins 0.000 claims description 5
- 102100040018 Interferon alpha-2 Human genes 0.000 claims description 5
- 102100029155 Uridine-cytidine kinase-like 1 Human genes 0.000 claims description 5
- 102100039098 Vacuolar protein sorting-associated protein 72 homolog Human genes 0.000 claims description 5
- 102100039647 YTH domain-containing family protein 1 Human genes 0.000 claims description 5
- 102100023386 Zinc finger translocation-associated protein Human genes 0.000 claims description 5
- 238000004458 analytical method Methods 0.000 claims description 5
- 238000010276 construction Methods 0.000 claims description 5
- 102100022885 mRNA export factor RAE1 Human genes 0.000 claims description 5
- 238000012216 screening Methods 0.000 claims description 5
- -1 FERM 2 Proteins 0.000 claims description 4
- 239000003153 chemical reaction reagent Substances 0.000 claims description 4
- 102100034111 Activin receptor type-1 Human genes 0.000 claims description 3
- 102100040684 Fermitin family homolog 2 Human genes 0.000 claims description 2
- 101000892677 Homo sapiens Fermitin family homolog 2 Proteins 0.000 claims description 2
- 238000010837 poor prognosis Methods 0.000 abstract description 5
- 230000009286 beneficial effect Effects 0.000 abstract description 3
- 238000003766 bioinformatics method Methods 0.000 abstract description 3
- 238000011337 individualized treatment Methods 0.000 abstract description 2
- 230000004083 survival effect Effects 0.000 description 15
- 206010028980 Neoplasm Diseases 0.000 description 13
- 238000011226 adjuvant chemotherapy Methods 0.000 description 10
- 238000004422 calculation algorithm Methods 0.000 description 7
- 230000009467 reduction Effects 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 102100022618 COMM domain-containing protein 6 Human genes 0.000 description 5
- 108090000246 Histone acetyltransferases Proteins 0.000 description 5
- 102000003893 Histone acetyltransferases Human genes 0.000 description 5
- 102000003964 Histone deacetylase Human genes 0.000 description 5
- 108090000353 Histone deacetylase Proteins 0.000 description 5
- 101000899991 Homo sapiens COMM domain-containing protein 6 Proteins 0.000 description 5
- 230000021736 acetylation Effects 0.000 description 4
- 238000006640 acetylation reaction Methods 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 238000011282 treatment Methods 0.000 description 4
- 230000007067 DNA methylation Effects 0.000 description 3
- 108010074870 Histone Demethylases Proteins 0.000 description 3
- 102000008157 Histone Demethylases Human genes 0.000 description 3
- 108010036115 Histone Methyltransferases Proteins 0.000 description 3
- 102000011787 Histone Methyltransferases Human genes 0.000 description 3
- 238000011088 calibration curve Methods 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- 108010077544 Chromatin Proteins 0.000 description 2
- HEVGGTGPGPKZHF-UHFFFAOYSA-N Epilaurene Natural products CC1C(=C)CCC1(C)C1=CC=C(C)C=C1 HEVGGTGPGPKZHF-UHFFFAOYSA-N 0.000 description 2
- 102100038720 Histone deacetylase 9 Human genes 0.000 description 2
- 238000010824 Kaplan-Meier survival analysis Methods 0.000 description 2
- 210000004027 cell Anatomy 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 210000003483 chromatin Anatomy 0.000 description 2
- 238000003745 diagnosis Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000004049 epigenetic modification Effects 0.000 description 2
- 238000003205 genotyping method Methods 0.000 description 2
- 230000008595 infiltration Effects 0.000 description 2
- 238000001764 infiltration Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 102100032746 Actin-histidine N-methyltransferase Human genes 0.000 description 1
- 102100037468 Bifunctional peptidase and arginyl-hydroxylase JMJD5 Human genes 0.000 description 1
- 102100021975 CREB-binding protein Human genes 0.000 description 1
- 108020004414 DNA Proteins 0.000 description 1
- 102100035074 Elongator complex protein 3 Human genes 0.000 description 1
- 206010061968 Gastric neoplasm Diseases 0.000 description 1
- 102100036530 General transcription factor 3C polypeptide 4 Human genes 0.000 description 1
- 108091005772 HDAC11 Proteins 0.000 description 1
- 102100022901 Histone acetyltransferase KAT2A Human genes 0.000 description 1
- 102100022846 Histone acetyltransferase KAT2B Human genes 0.000 description 1
- 102100033071 Histone acetyltransferase KAT6A Human genes 0.000 description 1
- 102100033070 Histone acetyltransferase KAT6B Human genes 0.000 description 1
- 102100033068 Histone acetyltransferase KAT7 Human genes 0.000 description 1
- 102100033069 Histone acetyltransferase KAT8 Human genes 0.000 description 1
- 102100038885 Histone acetyltransferase p300 Human genes 0.000 description 1
- 102100021467 Histone acetyltransferase type B catalytic subunit Human genes 0.000 description 1
- 102100039996 Histone deacetylase 1 Human genes 0.000 description 1
- 102100039385 Histone deacetylase 11 Human genes 0.000 description 1
- 102100039999 Histone deacetylase 2 Human genes 0.000 description 1
- 102100021455 Histone deacetylase 3 Human genes 0.000 description 1
- 102100021454 Histone deacetylase 4 Human genes 0.000 description 1
- 102100021453 Histone deacetylase 5 Human genes 0.000 description 1
- 102100022537 Histone deacetylase 6 Human genes 0.000 description 1
- 102100038586 Histone demethylase UTY Human genes 0.000 description 1
- 102100022103 Histone-lysine N-methyltransferase 2A Human genes 0.000 description 1
- 102100022102 Histone-lysine N-methyltransferase 2B Human genes 0.000 description 1
- 102100027768 Histone-lysine N-methyltransferase 2D Human genes 0.000 description 1
- 102100026265 Histone-lysine N-methyltransferase ASH1L Human genes 0.000 description 1
- 102100035043 Histone-lysine N-methyltransferase EHMT1 Human genes 0.000 description 1
- 102100035042 Histone-lysine N-methyltransferase EHMT2 Human genes 0.000 description 1
- 102100038970 Histone-lysine N-methyltransferase EZH2 Human genes 0.000 description 1
- 102100029144 Histone-lysine N-methyltransferase PRDM9 Human genes 0.000 description 1
- 102100029768 Histone-lysine N-methyltransferase SETD1A Human genes 0.000 description 1
- 102100030095 Histone-lysine N-methyltransferase SETD1B Human genes 0.000 description 1
- 102100032742 Histone-lysine N-methyltransferase SETD2 Human genes 0.000 description 1
- 102100027711 Histone-lysine N-methyltransferase SETD5 Human genes 0.000 description 1
- 102100023696 Histone-lysine N-methyltransferase SETDB1 Human genes 0.000 description 1
- 102100032804 Histone-lysine N-methyltransferase SMYD3 Human genes 0.000 description 1
- 102100028998 Histone-lysine N-methyltransferase SUV39H1 Human genes 0.000 description 1
- 102100029239 Histone-lysine N-methyltransferase, H3 lysine-36 specific Human genes 0.000 description 1
- 102100039489 Histone-lysine N-methyltransferase, H3 lysine-79 specific Human genes 0.000 description 1
- 102100025663 Histone-lysine N-trimethyltransferase SMYD5 Human genes 0.000 description 1
- 102000006947 Histones Human genes 0.000 description 1
- 101000654703 Homo sapiens Actin-histidine N-methyltransferase Proteins 0.000 description 1
- 101001025948 Homo sapiens Bifunctional peptidase and arginyl-hydroxylase JMJD5 Proteins 0.000 description 1
- 101000896987 Homo sapiens CREB-binding protein Proteins 0.000 description 1
- 101000877382 Homo sapiens Elongator complex protein 3 Proteins 0.000 description 1
- 101000714252 Homo sapiens General transcription factor 3C polypeptide 4 Proteins 0.000 description 1
- 101001046967 Homo sapiens Histone acetyltransferase KAT2A Proteins 0.000 description 1
- 101001047006 Homo sapiens Histone acetyltransferase KAT2B Proteins 0.000 description 1
- 101000944179 Homo sapiens Histone acetyltransferase KAT6A Proteins 0.000 description 1
- 101000944174 Homo sapiens Histone acetyltransferase KAT6B Proteins 0.000 description 1
- 101000944166 Homo sapiens Histone acetyltransferase KAT7 Proteins 0.000 description 1
- 101000944170 Homo sapiens Histone acetyltransferase KAT8 Proteins 0.000 description 1
- 101000882390 Homo sapiens Histone acetyltransferase p300 Proteins 0.000 description 1
- 101000898976 Homo sapiens Histone acetyltransferase type B catalytic subunit Proteins 0.000 description 1
- 101001035024 Homo sapiens Histone deacetylase 1 Proteins 0.000 description 1
- 101001035011 Homo sapiens Histone deacetylase 2 Proteins 0.000 description 1
- 101000899282 Homo sapiens Histone deacetylase 3 Proteins 0.000 description 1
- 101000899259 Homo sapiens Histone deacetylase 4 Proteins 0.000 description 1
- 101000899255 Homo sapiens Histone deacetylase 5 Proteins 0.000 description 1
- 101000899330 Homo sapiens Histone deacetylase 6 Proteins 0.000 description 1
- 101001032113 Homo sapiens Histone deacetylase 7 Proteins 0.000 description 1
- 101001032092 Homo sapiens Histone deacetylase 9 Proteins 0.000 description 1
- 101000808558 Homo sapiens Histone demethylase UTY Proteins 0.000 description 1
- 101001045846 Homo sapiens Histone-lysine N-methyltransferase 2A Proteins 0.000 description 1
- 101001045848 Homo sapiens Histone-lysine N-methyltransferase 2B Proteins 0.000 description 1
- 101001008894 Homo sapiens Histone-lysine N-methyltransferase 2D Proteins 0.000 description 1
- 101000785963 Homo sapiens Histone-lysine N-methyltransferase ASH1L Proteins 0.000 description 1
- 101000877314 Homo sapiens Histone-lysine N-methyltransferase EHMT1 Proteins 0.000 description 1
- 101000877312 Homo sapiens Histone-lysine N-methyltransferase EHMT2 Proteins 0.000 description 1
- 101000882127 Homo sapiens Histone-lysine N-methyltransferase EZH2 Proteins 0.000 description 1
- 101001124887 Homo sapiens Histone-lysine N-methyltransferase PRDM9 Proteins 0.000 description 1
- 101000865038 Homo sapiens Histone-lysine N-methyltransferase SETD1A Proteins 0.000 description 1
- 101000864672 Homo sapiens Histone-lysine N-methyltransferase SETD1B Proteins 0.000 description 1
- 101000654725 Homo sapiens Histone-lysine N-methyltransferase SETD2 Proteins 0.000 description 1
- 101000650669 Homo sapiens Histone-lysine N-methyltransferase SETD5 Proteins 0.000 description 1
- 101000684609 Homo sapiens Histone-lysine N-methyltransferase SETDB1 Proteins 0.000 description 1
- 101000708574 Homo sapiens Histone-lysine N-methyltransferase SMYD3 Proteins 0.000 description 1
- 101000696705 Homo sapiens Histone-lysine N-methyltransferase SUV39H1 Proteins 0.000 description 1
- 101000634050 Homo sapiens Histone-lysine N-methyltransferase, H3 lysine-36 specific Proteins 0.000 description 1
- 101000963360 Homo sapiens Histone-lysine N-methyltransferase, H3 lysine-79 specific Proteins 0.000 description 1
- 101000835819 Homo sapiens Histone-lysine N-trimethyltransferase SMYD5 Proteins 0.000 description 1
- 101000613958 Homo sapiens Lysine-specific demethylase 2A Proteins 0.000 description 1
- 101000614017 Homo sapiens Lysine-specific demethylase 3A Proteins 0.000 description 1
- 101000614020 Homo sapiens Lysine-specific demethylase 3B Proteins 0.000 description 1
- 101000613625 Homo sapiens Lysine-specific demethylase 4A Proteins 0.000 description 1
- 101000613629 Homo sapiens Lysine-specific demethylase 4B Proteins 0.000 description 1
- 101001088893 Homo sapiens Lysine-specific demethylase 4C Proteins 0.000 description 1
- 101001088895 Homo sapiens Lysine-specific demethylase 4D Proteins 0.000 description 1
- 101001088892 Homo sapiens Lysine-specific demethylase 5A Proteins 0.000 description 1
- 101001088883 Homo sapiens Lysine-specific demethylase 5B Proteins 0.000 description 1
- 101001088887 Homo sapiens Lysine-specific demethylase 5C Proteins 0.000 description 1
- 101001088879 Homo sapiens Lysine-specific demethylase 5D Proteins 0.000 description 1
- 101001025967 Homo sapiens Lysine-specific demethylase 6A Proteins 0.000 description 1
- 101001025971 Homo sapiens Lysine-specific demethylase 6B Proteins 0.000 description 1
- 101001025945 Homo sapiens Lysine-specific demethylase 7A Proteins 0.000 description 1
- 101001050886 Homo sapiens Lysine-specific histone demethylase 1A Proteins 0.000 description 1
- 101000615488 Homo sapiens Methyl-CpG-binding domain protein 2 Proteins 0.000 description 1
- 101001000090 Homo sapiens Methyltransferase N6AMT1 Proteins 0.000 description 1
- 101000650674 Homo sapiens N-lysine methyltransferase SETD6 Proteins 0.000 description 1
- 101000708645 Homo sapiens N-lysine methyltransferase SMYD2 Proteins 0.000 description 1
- 101000616738 Homo sapiens NAD-dependent protein deacetylase sirtuin-6 Proteins 0.000 description 1
- 101000709248 Homo sapiens NAD-dependent protein deacetylase sirtuin-7 Proteins 0.000 description 1
- 101000616727 Homo sapiens NAD-dependent protein deacylase sirtuin-5, mitochondrial Proteins 0.000 description 1
- 101000863629 Homo sapiens NAD-dependent protein lipoamidase sirtuin-4, mitochondrial Proteins 0.000 description 1
- 101000602926 Homo sapiens Nuclear receptor coactivator 1 Proteins 0.000 description 1
- 101001028703 Homo sapiens Probable JmjC domain-containing histone demethylation protein 2C Proteins 0.000 description 1
- 101000650667 Homo sapiens SET domain-containing protein 4 Proteins 0.000 description 1
- 102100040598 Lysine-specific demethylase 2A Human genes 0.000 description 1
- 102100040581 Lysine-specific demethylase 3A Human genes 0.000 description 1
- 102100040582 Lysine-specific demethylase 3B Human genes 0.000 description 1
- 102100040863 Lysine-specific demethylase 4A Human genes 0.000 description 1
- 102100040860 Lysine-specific demethylase 4B Human genes 0.000 description 1
- 102100033230 Lysine-specific demethylase 4C Human genes 0.000 description 1
- 102100033231 Lysine-specific demethylase 4D Human genes 0.000 description 1
- 102100033246 Lysine-specific demethylase 5A Human genes 0.000 description 1
- 102100033247 Lysine-specific demethylase 5B Human genes 0.000 description 1
- 102100033249 Lysine-specific demethylase 5C Human genes 0.000 description 1
- 102100033143 Lysine-specific demethylase 5D Human genes 0.000 description 1
- 102100037462 Lysine-specific demethylase 6A Human genes 0.000 description 1
- 102100037461 Lysine-specific demethylase 6B Human genes 0.000 description 1
- 102100037465 Lysine-specific demethylase 7A Human genes 0.000 description 1
- 102100024985 Lysine-specific histone demethylase 1A Human genes 0.000 description 1
- 102100021299 Methyl-CpG-binding domain protein 2 Human genes 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- 102100036543 Methyltransferase N6AMT1 Human genes 0.000 description 1
- WWGBHDIHIVGYLZ-UHFFFAOYSA-N N-[4-[3-[[[7-(hydroxyamino)-7-oxoheptyl]amino]-oxomethyl]-5-isoxazolyl]phenyl]carbamic acid tert-butyl ester Chemical compound C1=CC(NC(=O)OC(C)(C)C)=CC=C1C1=CC(C(=O)NCCCCCCC(=O)NO)=NO1 WWGBHDIHIVGYLZ-UHFFFAOYSA-N 0.000 description 1
- 102100027709 N-lysine methyltransferase SETD6 Human genes 0.000 description 1
- 102100032806 N-lysine methyltransferase SMYD2 Human genes 0.000 description 1
- 102100031455 NAD-dependent protein deacetylase sirtuin-1 Human genes 0.000 description 1
- 102100022913 NAD-dependent protein deacetylase sirtuin-2 Human genes 0.000 description 1
- 102100030710 NAD-dependent protein deacetylase sirtuin-3, mitochondrial Human genes 0.000 description 1
- 102100021840 NAD-dependent protein deacetylase sirtuin-6 Human genes 0.000 description 1
- 102100034376 NAD-dependent protein deacetylase sirtuin-7 Human genes 0.000 description 1
- 102100021839 NAD-dependent protein deacylase sirtuin-5, mitochondrial Human genes 0.000 description 1
- 102100030709 NAD-dependent protein lipoamidase sirtuin-4, mitochondrial Human genes 0.000 description 1
- 206010061309 Neoplasm progression Diseases 0.000 description 1
- 102100037223 Nuclear receptor coactivator 1 Human genes 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 101710203703 Peptidyl-lysine N-acetyltransferase YjaB Proteins 0.000 description 1
- 102100037169 Probable JmjC domain-containing histone demethylation protein 2C Human genes 0.000 description 1
- 102100034607 Protein arginine N-methyltransferase 5 Human genes 0.000 description 1
- 101710084427 Protein arginine N-methyltransferase 5 Proteins 0.000 description 1
- 102100027707 SET domain-containing protein 4 Human genes 0.000 description 1
- 108091005770 SIRT3 Proteins 0.000 description 1
- 108010041191 Sirtuin 1 Proteins 0.000 description 1
- 108010041216 Sirtuin 2 Proteins 0.000 description 1
- 102100030140 Thiosulfate:glutathione sulfurtransferase Human genes 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 230000006023 anti-tumor response Effects 0.000 description 1
- 238000003149 assay kit Methods 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000005773 cancer-related death Effects 0.000 description 1
- 231100000504 carcinogenesis Toxicity 0.000 description 1
- 230000005591 charge neutralization Effects 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 229940044683 chemotherapy drug Drugs 0.000 description 1
- 238000007621 cluster analysis Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000002790 cross-validation Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000012502 diagnostic product Substances 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 230000008482 dysregulation Effects 0.000 description 1
- 230000004076 epigenetic alteration Effects 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 210000002865 immune cell Anatomy 0.000 description 1
- 238000009169 immunotherapy Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000036210 malignancy Effects 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 230000002980 postoperative effect Effects 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 238000000513 principal component analysis Methods 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 230000000153 supplemental effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000005851 tumor immunogenicity Effects 0.000 description 1
- 230000005751 tumor progression Effects 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/30—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/217—Validation; Performance evaluation; Active pattern learning techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
- G16B25/10—Gene or protein expression profiling; Expression-ratio estimation or normalisation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/118—Prognosis of disease development
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/154—Methylation markers
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- Chemical & Material Sciences (AREA)
- Theoretical Computer Science (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Medical Informatics (AREA)
- Pathology (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Computational Biology (AREA)
- General Engineering & Computer Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Evolutionary Biology (AREA)
- Public Health (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Wood Science & Technology (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Zoology (AREA)
- General Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Immunology (AREA)
- Primary Health Care (AREA)
- Oncology (AREA)
- Hospice & Palliative Care (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Microbiology (AREA)
- Epidemiology (AREA)
- Databases & Information Systems (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
The invention discloses a biomarker for predicting gastric cancer prognosis based on histone modification regulator characteristics, a scoring model and application thereof, and belongs to the field of biomedicine. The invention clusters the transcriptome expression of 76 histone acetylation and methylation regulating factors in an unsupervised mode, is used for comprehensively evaluating an expression model of the histone modification regulating factors, utilizes a bioinformatics method to identify gastric cancer prognosis risk characteristics by using LASSO regression and multivariable stepwise Cox regression analysis based on a gastric cancer public database, and establishes a scoring model consisting of 10 characteristic genes: hr_score=acvr1+brd3+c11orf 95+cryab+fermt2+rae1+uckl1+vps72+ythdf1-IFNA2. The scoring model is beneficial to identifying a specific histone modification regulatory factor mode gastric cancer patient with poor prognosis and guiding an individualized treatment scheme.
Description
Technical Field
The invention relates to the field of biomedicine, in particular to a biomarker for predicting gastric cancer prognosis based on histone modification regulating factor characteristics, a scoring model and application.
Background
Gastric cancer is the fifth most common malignancy, and is the third leading cause of cancer-related death in China. Currently, the AJCC (American Joint Committee on Cancer) -based TNM staging system is the most widely used gastric cancer staging system clinically. However, TNM staging systems miss molecular characterization information for patients, and even for patients in the same stage, their prognosis varies widely. Therefore, developing new indexes based on molecular characteristic typing of patients is helpful for promoting accurate diagnosis and treatment process of gastric cancer. In recent years, in order to better improve prognosis of gastric cancer patients, several molecular typing schemes have been proposed to characterize different biological characteristics of gastric cancer, such as TCGA genotyping and ACRG genotyping, but the high throughput technique used for typing is expensive and complex in process, and the current stage of typing also needs a lot of research and further optimization, so that we have urgent need to develop an alternative method for molecular typing that is economical, convenient and easy to clinically transform.
Studies have shown that epigenetic alterations may be an emerging clinical biomarker for tumor-related diagnosis, prognosis and treatment. As an important epigenetic modification, dysregulation of histone modification may lead to aberrant transcription processes, affecting tumor immunogenicity and the function of immune cells involved in anti-tumor responses. Among these, studies have shown that changes in histone acetylation or methylation patterns and their interactions are one of the most widely affected epigenetic pathways in tumors. Histone methylation is one of the most important post-transcriptional modifications, mainly by the addition of methyl groups at lysine residues of histones H3 and H4, and can be used as an active or inhibitory marker of gene expression. Whereas histone acetylation is mainly regulated by the balance between Histone Acetyltransferase (HAT) and Histone Deacetylase (HDAC). Based on the "charge neutralization model," acetylation of histone tail lysine residues reduces the amount of positive charge carried by the histone to reduce its electrostatic affinity with DNA, resulting in relaxation of chromatin structure, while the open chromatin conformation is more prone to binding to transcription factors and significantly increases gene expression. Thus, histone acetylation is generally considered as an active histone mark. Furthermore, acetylation and methylation in the same lysine residue can act as antagonists to inhibit each other, resulting in cross-talk between different histone marks. It has been found that abnormal histone modification patterns can become a driving factor for tumors, but that histone methylation and acetylation modification patterns and interactions thereof have been studied in gastric cancer only to a limited extent. Moreover, the current research is mainly focused on the importance of histone modification regulators on tumor progression and Tumor Microenvironment (TME), their potential roles and interactions in TME cell infiltration, drug sensitivity and immunotherapy have not been explored yet, and due to technical limitations, most research is limited to a few histone modification regulators, whereas the highly integrated interactions exist between them, exploring the influence of histone modification regulators' regulatory networks on tumor heterogeneity and biological functions of TME might contribute to the development of personalized therapeutic strategies. Currently, in gastric cancer, some scoring models based on epigenetic modifications such as DNA methylation profile, m6A methylation modification and the like have been proposed, for example Qi Meng et al construct DNA Methylation Scores (DMS) by identifying different DNA methylation modification patterns and different tumor microenvironment characteristics to enhance the recognition of gastric cancer immune microenvironment, bo Zhang et al construct m6Ascore to quantify m6A modification patterns of individual tumors using principal component analysis algorithms by recognizing m6A modification patterns and correlating with TME cell infiltration characterization systems, but the effectiveness and clinical utility of these scoring models remain to be explored. In view of the importance of histone modification, we hope to explore further more efficient molecular typing in gastric cancer from the point of view of histone modification pattern to complement existing typing systems.
Disclosure of Invention
The invention aims to provide a biomarker for predicting gastric cancer prognosis based on histone modification regulator characteristics, a scoring model and application thereof, so as to solve the problems of the prior art, and the scoring model constructed by using 10 characteristic genes is beneficial to identifying gastric cancer patients with a specific histone modification regulator mode with poor prognosis and guiding an individualized treatment scheme.
In order to achieve the above object, the present invention provides the following solutions:
the invention provides a biomarker for predicting gastric cancer prognosis based on histone modification regulator characteristics, which comprises genes ACVR1, BRD3, C11orf95, CRYAB, FERM 2, RAE1, UCKL1, VPS72, YTHDF1 and IFNA2.
The invention also provides a kit for predicting prognosis of gastric cancer based on the characteristics of the histone modification regulator, which comprises a reagent for detecting the transcriptional expression level of genes contained in the biomarker.
The invention also provides application of the reagent for detecting the biomarker in preparation of a kit for predicting gastric cancer prognosis.
The invention also provides a construction method of a scoring model for predicting gastric cancer prognosis based on histone modification regulator characteristics, which comprises the following steps:
s1: determining four different histone modification regulator expression patterns by consensus clustering and analysis of transcriptome data of 76 histone acetylation and methylation regulators;
s2: screening out 10 characteristic genes based on the expression mode of the histone modification regulator determined by S1 by using LASSO regression and multivariable stepwise Cox regression analysis, and constructing a quantitative scoring model for quantifying the histone modification state and distinguishing the HMR-C1 subtype;
wherein the characteristic genes comprise ACVR1, BRD3, C11orf95, CRYAB, FERM 2, RAE1, UCKL1, VPS72, YTHDF1 and IFNA2;
the formula of the scoring model is: hr_score=acvr1+brd3+c11orf 95+cryab+fermt2+rae1+uckl1+vps72+ythdf1-IFNA2.
The invention also provides a grading model for predicting gastric cancer prognosis based on histone modification regulator characteristics, which is formed by the construction method.
The invention also provides application of the scoring model in construction of a gastric cancer prognosis system or device, and gastric cancer patients are grouped according to scoring results calculated by the scoring model so as to predict prognosis of the gastric cancer patients.
Preferably, the criteria for predicting prognosis of gastric cancer patient based on the scoring result are: HR score scores were ranked from large to small, with the top 1/3 and bottom 1/3 samples being defined as high scoring and low scoring groups, respectively, where the high scoring group had poor patient prognosis.
The invention also provides a system or device for predicting gastric cancer prognosis, which comprises a scoring model for predicting gastric cancer prognosis based on the characteristics of the histone modification regulator, wherein the scoring model takes the expression level of the biomarker as an input variable for evaluating the predicted gastric cancer prognosis; wherein the scoring model calculates a score for gastric cancer prognosis using the following formula: hr_score=acvr1+brd3+c11orf 95+cryab+fermt2+rae1+uckl1+vps72+ythdf1-IFNA2.
Preferably, the scoring criteria are: HR score scores were ranked from large to small, with the top 1/3 and bottom 1/3 samples being defined as high scoring and low scoring groups, respectively, where the high scoring group had poor patient prognosis.
The invention discloses the following technical effects:
the invention clusters the transcriptome expression of 76 histone acetylation and methylation regulating factors in an unsupervised mode, is used for comprehensively evaluating an expression model of the histone modification regulating factors, utilizes a bioinformatics method to identify gastric cancer prognosis risk characteristics by using LASSO regression and multivariable stepwise Cox regression analysis based on a gastric cancer public database, establishes a scoring model consisting of 10 genes, guides the personalized treatment of gastric cancer patients by analyzing clinical characteristics and tumor microenvironment of gastric cancer, and names the scoring model as HR_score.
The scoring model constructed by the invention is beneficial to identifying a gastric cancer patient with a specific histone modification regulatory factor mode with poor prognosis and guiding an individual accurate treatment scheme. Further analysis showed that hr_score is an independent poor prognosis biomarker useful for predicting patient outcome, with a significant negative correlation to total survival time, high hr_score patients also being predictive of higher risk of mortality.
The developed scoring tool HR_score can be developed into diagnostic products such as test kits and the like based on the transcriptional expression level of 10 genes only; and the calculation of the HR_score only needs to subtract the average expression values of the 2 groups of genes, so that the method is convenient and easy for clinical transformation.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings that are needed in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a Circo diagram representing the expression and prognosis of histone methyltransferase (A), histone demethylase (B), histone acetyltransferase (C) and histone deacetylase (D) modulators in gastric cancer patients; the outer circles in the A-D figures are orange, green, blue, and dark grey represent histone methyltransferase (A), histone demethylase (B), histone acetyltransferase (C), and histone deacetylase (D), respectively;
FIG. 2 is a graph of an optimal histone modification typing matrix of gastric cancer patients;
FIG. 3 is a heat map of histone modulator expression patterns in gastric cancer patients;
FIG. 4 is a Kaplan-Meier plot of total survival (OS) plotted in meta-GEO queue according to histone modification regulator pattern;
FIG. 5 is a Kaplan-Meier plot of total survival (OS) for different HMR clusters of meta-GEO cohorts, based on whether adjuvant chemotherapy is performed;
FIG. 6 is a forest map of a multi-factor COX regression analysis based on clinical characteristics of patients in GSE62245 cohorts;
FIG. 7 is a forest graph based on the benefits of adjuvant chemotherapy for different HMR subtypes in the GSE62245 cohort;
FIG. 8 is a box plot of the distribution of HR_score over HMR subtypes;
FIG. 9 is a heat map of the gene expression pattern of histone modification regulator typing after dimension reduction by the Borata algorithm; yellow represents high expression, blue represents low expression;
FIG. 10 is a Kaplan-Meier survival curve based on meta-GEO (A) and TCGA-STAD (B) datasets;
FIG. 11 is a forest graph showing a multi-factor COX risk regression model based on GSE62254 (A) and TCGA-STAD cohorts (B) with patient gender, age, lanren typing, TNM typing, ACRG typing as covariates;
FIG. 12 is an alignment chart based on meta-GEO (A) and TCGA-STAD (B) queue multifactor Cox regression model results shown in a visual graph;
FIG. 13 is a graph showing three modes of clinical decision curves (DCA decision curves) of model1 (clinical features: TNM phase + Lauren type + molecular type), model2 (HR_score), and model3 (clinical features combined with HR_score), respectively; the benefit score is displayed on the vertical axis, and the probability threshold is displayed on the horizontal axis;
FIG. 14 is a graph of the consistency between actual and predicted occurrence compared by a calibration curve (Calibration curve) for 1,3,5 years.
Detailed Description
Various exemplary embodiments of the invention will now be described in detail, which should not be considered as limiting the invention, but rather as more detailed descriptions of certain aspects, features and embodiments of the invention.
It is to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. In addition, for numerical ranges in this disclosure, it is understood that each intermediate value between the upper and lower limits of the ranges is also specifically disclosed. Every smaller range between any stated value or stated range, and any other stated value or intermediate value within the stated range, is also encompassed within the invention. The upper and lower limits of these smaller ranges may independently be included or excluded in the range.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although only preferred methods and materials are described herein, any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention. All documents mentioned in this specification are incorporated by reference for the purpose of disclosing and describing the methods and/or materials associated with the documents. In case of conflict with any incorporated document, the present specification will control.
It will be apparent to those skilled in the art that various modifications and variations can be made in the specific embodiments of the invention described herein without departing from the scope or spirit of the invention. Other embodiments will be apparent to those skilled in the art from consideration of the specification of the present invention. The specification and examples of the present invention are exemplary only.
As used herein, the terms "comprising," "including," "having," "containing," and the like are intended to be inclusive and mean an inclusion, but not limited to.
In a comprehensive queue consisting of 838 gastric cancer patients in a gene expression comprehensive (GEO) database, 76 transcriptome expressions of histone acetylation and methylation regulating factors are clustered in an unsupervised mode to comprehensively evaluate an expression model of the histone modification regulating factors, and a multifactor Cox risk model capable of predicting gastric cancer prognosis is constructed by utilizing a bioinformatics method based on a gastric cancer public database. By analyzing the clinical characteristics of gastric cancer and tumor microenvironment, the personalized adjuvant chemotherapy of gastric cancer patients is guided, and the scoring model is named HR_ score (Histone modification regulator Riskscore). Further description will be made by way of specific examples.
EXAMPLE 1 determination of transcriptional expression of histone modification regulatory molecules from gastric cancer patients and correlation with clinical prognosis
(1) Collection and pretreatment of gastric cancer public data sets
Common transcriptome data and clinical data for GC samples were retrospectively collected from NCBI GEO database (https:// www.ncbi.nlm.nih.gov/GEO /) including GSE62254/ACRG queue (n=300), GSE57303 (n=70), GSE29272 (n=268), GSE15459 (n=200), and the above data integration was named meta-GEO queue. The latest clinical data and sample information for TCGA-STAD samples were obtained from a database (https:// portal. Gdc. Cancer. Gov /) using the TCGAbiolinks R package. The raw data of the Affymetrix dataset was background adjusted using the RMA algorithm in the Affy software package, quantile normalized, and the oligonucleotides for each transcript were finally summarized using a median optimization algorithm. Corresponding clinical data is collected from the supplemental files of the related articles.
(2) The expression and prognosis characteristics of the histone modification regulator of the gastric cancer patient are clear.
By reference to reviews on histone acetylation and methylation modifications, we summarize a total of 76 known putative histone modification modulators, including: the 14 acetyl transferases are specifically: NCOA1, KAT8, KAT7, KAT6B, KAT6A, KAT, KAT2B, KAT2A, CREBBP, EP300, ELP3, HAT1, CLOCK, GTF3C4; the 16 deacetylases are specifically: HDAC5, HDAC4, SIRT7, SIRT6, SIRT3, HDAC9, HDAC7, HDAC6, HDAC3, HDAC2, HDAC11, HDAC1, SIRT5, SIRT4, SIRT2, SIRT1; the 26 methyltransferases are specifically: KMT2B, SETD5, SETD3, SETD1B, SMYD3, SETD4, PRDM9, DOT1L, EZH2, KMT2D, KMT2A, SUV H2, SUV39H1, SETD6, SETD2, SETD1A, SETDB1, EHMT2, EHMT1, NSD1, ASH1L, N6AMT1, SMYD5, SMYD2, PRMT5, carmm 1; and 20 demethylases, specifically: KDM5B, KDM5A, KDM3A, PHF, KDM8, KDM7A, KDM4C, KDM4A, UTY, KDM6B, KDM6A, KDM5D, KDM5C, KDM4D, KDM4B, KDM3B, KDM2A, KDM1A, PHF, JMJD1C. Applicants extracted the transcriptional profiles of these 76 genes in the TCGA-STAD cohort. The expression differences of the above genes in tumor and normal tissues were analyzed in R software using a "limma" R package, and a univariate COX regression analysis was used to assess whether the histone modification genes were associated with prognosis.
As a result, as shown in FIG. 1, the outer circles (A-D) of FIG. 1 are orange, green, blue, and dark gray, respectively represent histone methyltransferase (A), histone demethylase (B), histone acetyltransferase (C), and histone deacetylase (D). Orange indicates that the gene expression in gastric cancer tissue is higher than in normal tissue. Yellow indicates that the gene expression in gastric cancer tissue is lower than in normal tissue. The blue color in the circle represents the favorable factor of total survival (HR > 1, P < 0.05), and the red color represents the dangerous factor of total survival (HR < 1, P < 0.05); gray represents a factor that has no statistical significance for the overall survival impact (P > 0.05). The inner circle represents the P value of HR value.
Example 2 clear the clinical characteristics of histone modification typing and histone modification typing of gastric cancer patients
(1) Determination of histone regulatory factor characteristic typing of gastric cancer patients
To assess whether transcriptional analysis of 76 acetylation and methylation modulators contributes to classification of gastric cancer patients. A total of 838 cases of stage I-IV gastric cancer patient samples from four independent gastric cancer cohorts (GSE 15459, GSE29272, GSE57303, GSE 62254) were included in the study, and unsupervised K-means cluster analysis was performed on meta-GEO cohorts (838 cases) to identify different gastric cancer histone modification expression models. The best cluster number was selected using the R package "consissusclusteriplus", 4 different histone modification regulator expression patterns were determined (see fig. 2), and these expression patterns were referred to as HMR-C1 (n=172), HMR-C2 (n=262), HMR-C3 (n=107), HMR-C4 (n=297). The expression of histone modification regulators in HMR subtype in meta-GEO queue is shown in FIG. 3, wherein the expression of most of histone modification regulators is enriched in HMR-C1, HMR-C2 and HMR-C3, indicating that the activities and changes of histone modification of these three types are stronger.
(2) Explore clinical significance of tumor tissue histone modification typing of gastric cancer patients
Based on the Meta-GEO queue, we generated a survival curve using R-packet "survivinal" and Kaplan-Meier analysis and tested Log-Rank against group-to-group differences using Log-Rank sums, which showed that the total time to live (OS) for HMR-C1 was significantly shorter than HMR-C2, C3 and C4. (HMR-C1 vs HMR-C2, 3, 4) Hazard Ratio (HR) 2.419, 95% confidence interval (1.684-3.475) (see FIG. 4). Multi-factor COX regression analysis of clinical features of age, sex, TNM cohort, lanren typing, ACRG typing of patients incorporating GSE62254 cohorts using IBM SPSS Statistics 26.0.0 software showed HMR-C1 to be an independent prognosis artifact (fig. 6,HR:2.743, 95%CI:1.820-4.135).
In terms of chemotherapy, we found that HMR-C1 was significantly increased compared to other types three with negative correlation to total survival (OS) in patients without adjuvant chemotherapy, whereas in patients with adjuvant chemotherapy, the differences in types four were not statistically significant (see fig. 5), suggesting that post-operative adjuvant chemotherapy could significantly improve prognosis in patients with type C1. We found that patients of type four benefited from adjuvant chemotherapy (see figure 7) but had different sensitivity to adjuvant chemotherapy drugs than the non-adjuvant chemotherapy group.
Example 3 establishment of a histone modification scoring model based on gastric cancer histone modification modifier typing
(1) Selecting different genes of different histone modification types
Because of the worst prognosis for patients in the HMR-C1 group, it is believed that developing a scoring model that can independently quantify the histone modulator modification status to identify patients in the HMR-C1 group may have potential clinical utility. Thus, HMR-C1 patients were differentially analyzed with all non-HMR-C1 patients in meta-GEO cohorts using the "limma" package to obtain genes specifically expressed in HMR-C1, wherein the screening criteria for differential genes was p <0.001. In combination with the screening criteria, 8020 differential genes were selected (3188 up-regulated in HMR-C1 and 4832 down-regulated in HMR-C1).
(2) Dimension reduction of differential genes using the Boruta algorithm
In the meta-GEO dataset (training set), the "Borata" package of R software is used for respectively reducing the dimension of the high-expression and low-expression differential genes in the HMR-C1, and the operation parameters are specifically set as follows: dotrace=2, maxrunos=100, ntree=500. Through dimension reduction, 126 genes are screened out from 4832 genes which are low in expression in HMR-C1 and are named as B genes; the remaining 100 genes after the dimensionality reduction of 3188 genes highly expressed in HMR-C1 were designated as class a genes.
(3) Construction of HR_score by further selection of differential genes with prognostic value by LASSO-COX regression analysis
Further, a gene with prognostic significance is selected from the genes after dimension reduction. First, 226 differential genes were analyzed for correlation with total survival by single factor Cox regression and 104 of these genes were selected for p < 0.05. These genes were subjected to LASSO-COX regression screening. Then, the LASSO-COX algorithm operation is carried out by ten times of cross validation by using the "glmnet" package, and the variable combination corresponding to the minimum value of the "Partifikelihood device" is the modeling gene. Finally, 10 genes were determined to construct a prognostic scoring model, respectively: ACVR1, BRD3, C11orf95, CRYAB, FERMT2, IFNA2, RAE1, UCKL1, VPS72, YTHDF1. The final score is obtained by subtracting the average expression level of the finally determined high expression genes from the average expression level of the finally determined low expression genes, and the specific prediction formula is as follows: hr_score=acvr1+brd3+c11orf 95+cryab+fermt2+rae1+uckl1+vps72+ythdf1-IFNA2.
The HR_score scores are arranged from large to small, and the first 1/3 sample and the lowest 1/3 sample are respectively defined as a high-scoring group and a low-scoring group. The box plot is shown in FIG. 8, which shows that the median HR_score value of HMR-C1 is highest in the meta-GEO queue. FIG. 9 is a heat map showing the pattern of gene expression after the histone modification regulator typing by the Borata algorithm for dimension reduction.
Example 4 prediction of prognosis, predictive value and adjuvant chemotherapy benefit of gastric cancer patients in meta-GEO dataset using histone modification model hr_score.
Based on the Meta-GEO and TCGA-STAD queues, we generated survival curves using R-packet "survivinal" and Kaplan-Meier analysis, and tested Log-Rank against group-to-group differences using Log-Rank sums, which indicated that the low hr_score group had significantly high overall survival time in the Meta-GEO queue (fig. 11,OS,HR:2.462, 95%CI:1.952-3.106, p < 0.0001). Similar results were observed with the TCGA-STAD queue as the validation set, with the OS of the low scoring group being longer than that of the high scoring group (FIGS. 10,OS,HR:1.708, 95%CI:1.170-2.494, P: 0.005); indicating a statistical difference between the overall survival of the high and low hr_score group samples. Using IBM SPSS Statistics 26.0.0 software, this hr_score score was confirmed to be an independent poor prognosis biomarker useful for assessing patient outcome, considering multivariate Cox regression model analysis of GSE62254 and TCGA-STAD cohorts for patient gender, age, TNM staging, lanren typing, ACRG typing (fig. 11, meta-GEO: HR 2.847, 95% ci 1.919-4.224, TCGA-STAD: HR:1.627, 95% ci: 1.160-2.284).
Regarding the predictive value of hr_score, we constructed an alignment graph using multivariate Cox regression based on the clinical features of GSE62254 and TCGA-STAD cohorts, presented Cox regression model results in a visual graph, and predicted 1 year, 5 year total survival (OS) probabilities for gastric cancer patients (fig. 12), to evaluate the clinical value of alignment graph we plotted model1 (clinical features: TNM staging + Lauren typing + molecular typing), model2 (hr_score), model3 (clinical features combined hr_score) clinical decision curves, showing that hr_score provides better net benefit than "full" or "none" and model3 (clinical features combined hr_score) shows greater value, respectively (fig. 13). To further verify the validity of nomogram, calibration curves were also drawn to compare the predicted 1 year, 3 years and 5 years total survival (OS) probabilities with the actual observations with good agreement (fig. 14).
The above embodiments are only illustrative of the preferred embodiments of the present invention and are not intended to limit the scope of the present invention, and various modifications and improvements made by those skilled in the art to the technical solutions of the present invention should fall within the protection scope defined by the claims of the present invention without departing from the design spirit of the present invention.
Claims (9)
1. A biomarker for predicting gastric cancer prognosis based on histone modification regulator characteristics, characterized in that the biomarker comprises the genes ACVR1, BRD3, C11orf95, CRYAB, FERMT2, RAE1, UCKL1, VPS72, YTHDF1 and IFNA2.
2. A kit for predicting prognosis of gastric cancer based on the characteristics of a histone modification regulator, characterized in that the kit comprises a reagent for detecting the transcriptional expression level of a gene contained in the biomarker of claim 1.
3. Use of a reagent for detecting the biomarker of claim 1 in the preparation of a kit for predicting prognosis of gastric cancer.
4. The method for constructing the scoring model for predicting the prognosis of the gastric cancer based on the characteristics of the histone modification regulating factor is characterized by comprising the following steps:
s1: determining four different histone modification regulator expression patterns by consensus clustering and analysis of transcriptome data of 76 histone acetylation and methylation regulators;
s2: screening out 10 characteristic genes based on the expression mode of the histone modification regulator determined by S1 by using LASSO regression and multivariable stepwise Cox regression analysis, and constructing a quantitative scoring model for quantifying the histone modification state and distinguishing the HMR-C1 subtype;
wherein the characteristic genes comprise ACVR1, BRD3, C11orf95, CRYAB, FERM 2, RAE1, UCKL1, VPS72, YTHDF1 and IFNA2;
the formula of the scoring model is: hr_score=acvr1+brd3+c11orf 95+cryab+fermt2+rae1+uckl1+vps72+ythdf1-IFNA2.
5. A scoring model for predicting gastric cancer prognosis based on histone modification regulator features, which is characterized by being constructed by the construction method according to claim 4.
6. Use of a scoring model according to claim 5 for constructing a gastric cancer prognosis system or device, wherein gastric cancer patients are grouped according to scoring results calculated by the scoring model, thereby predicting prognosis of gastric cancer patients.
7. The use according to claim 6, wherein the criteria for predicting prognosis of a gastric cancer patient based on the scoring result are: HR score scores were ranked from large to small, with the top 1/3 and bottom 1/3 samples being defined as high scoring and low scoring groups, respectively, where the high scoring group had poor patient prognosis.
8. A system or device for predicting gastric cancer prognosis, comprising a scoring model for predicting gastric cancer prognosis based on characteristics of histone modification regulator, the scoring model taking the expression level of the biomarker of claim 1 as an input variable for evaluating the predicted gastric cancer prognosis; wherein the scoring model calculates a score for gastric cancer prognosis using the following formula: hr_score=acvr1+brd3+c11orf 95+cryab+fermt2+rae1+uckl1+vps72+ythdf1-IFNA2.
9. The system or device for predicting prognosis of gastric cancer according to claim 8, wherein the scoring criteria is: HR score scores were ranked from large to small, with the top 1/3 and bottom 1/3 samples being defined as high scoring and low scoring groups, respectively, where the high scoring group had poor patient prognosis.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310640320.2A CN116665898B (en) | 2023-06-01 | 2023-06-01 | Biomarker for predicting prognosis of gastric cancer based on histone modification regulator characteristics, scoring model and application |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310640320.2A CN116665898B (en) | 2023-06-01 | 2023-06-01 | Biomarker for predicting prognosis of gastric cancer based on histone modification regulator characteristics, scoring model and application |
Publications (2)
Publication Number | Publication Date |
---|---|
CN116665898A true CN116665898A (en) | 2023-08-29 |
CN116665898B CN116665898B (en) | 2024-01-30 |
Family
ID=87718540
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310640320.2A Active CN116665898B (en) | 2023-06-01 | 2023-06-01 | Biomarker for predicting prognosis of gastric cancer based on histone modification regulator characteristics, scoring model and application |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116665898B (en) |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111254194A (en) * | 2020-01-13 | 2020-06-09 | 东南大学 | Cancer-related biomarkers based on sequencing and data analysis of cfDNA and application thereof in classification of cfDNA samples |
CN112048559A (en) * | 2020-09-10 | 2020-12-08 | 辽宁省肿瘤医院 | Model construction and clinical application of m 6A-related IncRNA network gastric cancer prognosis-based model |
CN112133365A (en) * | 2020-09-03 | 2020-12-25 | 南方医科大学南方医院 | Gene set for evaluating tumor microenvironment, scoring model and application of gene set |
CN113450873A (en) * | 2021-05-14 | 2021-09-28 | 山东大学 | Marker for predicting gastric cancer prognosis and immunotherapy applicability and application thereof |
CN114438201A (en) * | 2020-11-05 | 2022-05-06 | 复旦大学附属华山医院 | Tumor marker combination and application thereof |
US20220244263A1 (en) * | 2019-05-28 | 2022-08-04 | The Regents Of The University Of California | Methods for treating small cell neuroendocrine and related cancers |
CN115011689A (en) * | 2022-05-12 | 2022-09-06 | 南方医科大学南方医院 | Evaluation model for predicting colon cancer prognosis and assisting chemotherapy benefit and application |
CN115141887A (en) * | 2022-08-18 | 2022-10-04 | 南方医科大学南方医院 | Scoring model for prognosis of colon cancer and benefit of adjuvant chemotherapy based on secretory cell enrichment characteristics, construction method and application |
CN115497552A (en) * | 2022-09-27 | 2022-12-20 | 山西医科大学第一医院 | Gastric cancer prognosis risk model based on endoplasmic reticulum stress characteristic gene and application |
CN115807089A (en) * | 2022-11-14 | 2023-03-17 | 石河子大学 | Hepatocellular carcinoma prognosis biomarker and application thereof |
CN116092674A (en) * | 2022-09-08 | 2023-05-09 | 山西医科大学 | Exosome-mediated gastric cancer overall survival rate prognosis model, construction method and application |
-
2023
- 2023-06-01 CN CN202310640320.2A patent/CN116665898B/en active Active
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20220244263A1 (en) * | 2019-05-28 | 2022-08-04 | The Regents Of The University Of California | Methods for treating small cell neuroendocrine and related cancers |
CN111254194A (en) * | 2020-01-13 | 2020-06-09 | 东南大学 | Cancer-related biomarkers based on sequencing and data analysis of cfDNA and application thereof in classification of cfDNA samples |
CN112133365A (en) * | 2020-09-03 | 2020-12-25 | 南方医科大学南方医院 | Gene set for evaluating tumor microenvironment, scoring model and application of gene set |
CN112048559A (en) * | 2020-09-10 | 2020-12-08 | 辽宁省肿瘤医院 | Model construction and clinical application of m 6A-related IncRNA network gastric cancer prognosis-based model |
CN114438201A (en) * | 2020-11-05 | 2022-05-06 | 复旦大学附属华山医院 | Tumor marker combination and application thereof |
CN113450873A (en) * | 2021-05-14 | 2021-09-28 | 山东大学 | Marker for predicting gastric cancer prognosis and immunotherapy applicability and application thereof |
CN115011689A (en) * | 2022-05-12 | 2022-09-06 | 南方医科大学南方医院 | Evaluation model for predicting colon cancer prognosis and assisting chemotherapy benefit and application |
CN115141887A (en) * | 2022-08-18 | 2022-10-04 | 南方医科大学南方医院 | Scoring model for prognosis of colon cancer and benefit of adjuvant chemotherapy based on secretory cell enrichment characteristics, construction method and application |
CN116092674A (en) * | 2022-09-08 | 2023-05-09 | 山西医科大学 | Exosome-mediated gastric cancer overall survival rate prognosis model, construction method and application |
CN115497552A (en) * | 2022-09-27 | 2022-12-20 | 山西医科大学第一医院 | Gastric cancer prognosis risk model based on endoplasmic reticulum stress characteristic gene and application |
CN115807089A (en) * | 2022-11-14 | 2023-03-17 | 石河子大学 | Hepatocellular carcinoma prognosis biomarker and application thereof |
Also Published As
Publication number | Publication date |
---|---|
CN116665898B (en) | 2024-01-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Yousefi et al. | DNA methylation-based predictors of health: applications and statistical considerations | |
US20240029892A1 (en) | Disease monitoring from insurance claims data | |
Warrick et al. | Intratumoral heterogeneity of bladder cancer by molecular subtypes and histologic variants | |
Zhou et al. | Relapse-related long non-coding RNA signature to improve prognosis prediction of lung adenocarcinoma | |
Jayawardana et al. | Determination of prognosis in metastatic melanoma through integration of clinico‐pathologic, mutation, mRNA, microRNA, and protein information | |
CN112133365B (en) | Gene set for evaluating tumor microenvironment, scoring model and application of gene set | |
CN111128299B (en) | Construction method of ceRNA regulation and control network with significant correlation to colorectal cancer prognosis | |
Yang et al. | Similarities of ordered gene lists | |
Li et al. | Identification of novel long non-coding RNA biomarkers for prognosis prediction of papillary thyroid cancer | |
Simon | Analysis of DNA microarray expression data | |
Dinalankara et al. | Gene expression signatures based on variability can robustly predict tumor progression and prognosis | |
Lu et al. | Cell-free DNA methylation as a predictive biomarker of response to neoadjuvant chemotherapy for patients with muscle-invasive bladder cancer in SWOG S1314 | |
Liu et al. | 5-methylcytosine RNA methylation regulators affect prognosis and tumor microenvironment in lung adenocarcinoma | |
Zhao et al. | A robust gene expression prognostic signature for overall survival in high-grade serous ovarian cancer | |
CN116665898B (en) | Biomarker for predicting prognosis of gastric cancer based on histone modification regulator characteristics, scoring model and application | |
Li et al. | Bioinformatic analysis and in vitro validation of a five-microRNA signature as a prognostic biomarker of hepatocellular carcinoma | |
Ye et al. | Use of cell free DNA as a prognostic biomarker in non-small cell lung cancer patients with bone metastasis | |
Yang et al. | Systematic computation with functional gene-sets among leukemic and hematopoietic stem cells reveals a favorable prognostic signature for acute myeloid leukemia | |
US20190112729A1 (en) | Novel set of biomarkers useful for predicting lung cancer survival | |
LU504786B1 (en) | Biomarker, scoring model and application for predicting the prognosis of gastric cancer based on histone modified regulatory factor characteristics | |
CN113234823B (en) | Pancreatic cancer prognosis risk assessment model and application thereof | |
Yang et al. | A nomogram based on a gene signature for predicting the prognosis of patients with head and neck squamous cell carcinoma | |
CN115424728A (en) | Method for constructing tumor malignant cell gene prognosis risk model | |
Tournoud et al. | A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates | |
Chang et al. | LINC00963 May Be Associated with a Poor Prognosis in Patients with Cervical Cancer |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |