CN113969318A - Application of combined tar death related gene in esophageal adenocarcinoma prognosis model - Google Patents
Application of combined tar death related gene in esophageal adenocarcinoma prognosis model Download PDFInfo
- Publication number
- CN113969318A CN113969318A CN202111326421.XA CN202111326421A CN113969318A CN 113969318 A CN113969318 A CN 113969318A CN 202111326421 A CN202111326421 A CN 202111326421A CN 113969318 A CN113969318 A CN 113969318A
- Authority
- CN
- China
- Prior art keywords
- esophageal adenocarcinoma
- gene
- model
- related genes
- prognosis
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 59
- 208000036764 Adenocarcinoma of the esophagus Diseases 0.000 title claims abstract description 37
- 206010030137 Oesophageal adenocarcinoma Diseases 0.000 title claims abstract description 37
- 208000028653 esophageal adenocarcinoma Diseases 0.000 title claims abstract description 37
- 238000004393 prognosis Methods 0.000 title claims abstract description 32
- 230000034994 death Effects 0.000 title claims abstract description 22
- 230000014509 gene expression Effects 0.000 claims abstract description 23
- 102100029647 Apoptosis-associated speck-like protein containing a CARD Human genes 0.000 claims abstract description 12
- 102100037383 Gasdermin-B Human genes 0.000 claims abstract description 12
- 101000728679 Homo sapiens Apoptosis-associated speck-like protein containing a CARD Proteins 0.000 claims abstract description 12
- 101001026281 Homo sapiens Gasdermin-B Proteins 0.000 claims abstract description 12
- 101001033249 Homo sapiens Interleukin-1 beta Proteins 0.000 claims abstract description 12
- 102100039065 Interleukin-1 beta Human genes 0.000 claims abstract description 12
- 238000000034 method Methods 0.000 claims abstract description 12
- 102100035904 Caspase-1 Human genes 0.000 claims abstract description 10
- 101000715398 Homo sapiens Caspase-1 Proteins 0.000 claims abstract description 10
- 238000013210 evaluation model Methods 0.000 claims abstract description 7
- 238000012216 screening Methods 0.000 claims abstract description 6
- 230000006907 apoptotic process Effects 0.000 claims description 20
- 230000004083 survival effect Effects 0.000 claims description 18
- 238000000611 regression analysis Methods 0.000 claims description 7
- 108020004999 messenger RNA Proteins 0.000 claims description 6
- 102100038916 Caspase-5 Human genes 0.000 claims description 5
- 101000741072 Homo sapiens Caspase-5 Proteins 0.000 claims description 5
- 238000004458 analytical method Methods 0.000 claims description 5
- 206010028980 Neoplasm Diseases 0.000 claims description 4
- 238000004364 calculation method Methods 0.000 claims description 4
- 230000006870 function Effects 0.000 claims description 4
- 238000000513 principal component analysis Methods 0.000 claims description 4
- 102100030385 Granzyme B Human genes 0.000 claims description 3
- 101001009603 Homo sapiens Granzyme B Proteins 0.000 claims description 3
- 101001109455 Homo sapiens NACHT, LRR and PYD domains-containing protein 6 Proteins 0.000 claims description 3
- 101000795117 Homo sapiens Triggering receptor expressed on myeloid cells 2 Proteins 0.000 claims description 3
- 101000611183 Homo sapiens Tumor necrosis factor Proteins 0.000 claims description 3
- 238000010824 Kaplan-Meier survival analysis Methods 0.000 claims description 3
- 102100022696 NACHT, LRR and PYD domains-containing protein 6 Human genes 0.000 claims description 3
- 102100029678 Triggering receptor expressed on myeloid cells 2 Human genes 0.000 claims description 3
- 102100040247 Tumor necrosis factor Human genes 0.000 claims description 3
- 201000011510 cancer Diseases 0.000 claims description 3
- 230000002068 genetic effect Effects 0.000 claims description 3
- 238000004422 calculation algorithm Methods 0.000 claims description 2
- 238000010195 expression analysis Methods 0.000 claims description 2
- 238000012821 model calculation Methods 0.000 claims description 2
- 201000010099 disease Diseases 0.000 abstract description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 4
- 238000003745 diagnosis Methods 0.000 abstract description 3
- 238000010276 construction Methods 0.000 description 4
- 238000010200 validation analysis Methods 0.000 description 4
- 208000000461 Esophageal Neoplasms Diseases 0.000 description 3
- 101000964436 Homo sapiens Z-DNA-binding protein 1 Proteins 0.000 description 3
- 102100037924 Insulin-like growth factor 2 mRNA-binding protein 1 Human genes 0.000 description 3
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 3
- 206010061534 Oesophageal squamous cell carcinoma Diseases 0.000 description 3
- 238000003559 RNA-seq method Methods 0.000 description 3
- 101000599776 Rattus norvegicus Insulin-like growth factor 2 mRNA-binding protein 1 Proteins 0.000 description 3
- 208000036765 Squamous cell carcinoma of the esophagus Diseases 0.000 description 3
- 201000004101 esophageal cancer Diseases 0.000 description 3
- 208000007276 esophageal squamous cell carcinoma Diseases 0.000 description 3
- 108010076667 Caspases Proteins 0.000 description 2
- 102000011727 Caspases Human genes 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000004547 gene signature Effects 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 238000013103 analytical ultracentrifugation Methods 0.000 description 1
- 230000001363 autoimmune Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 239000000090 biomarker Substances 0.000 description 1
- 210000004027 cell Anatomy 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000002651 drug therapy Methods 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 230000028709 inflammatory response Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 230000001613 neoplastic effect Effects 0.000 description 1
- 230000000926 neurological effect Effects 0.000 description 1
- 230000000770 proinflammatory effect Effects 0.000 description 1
- 238000001959 radiotherapy Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
- G16B25/10—Gene or protein expression profiling; Expression-ratio estimation or normalisation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/20—Supervised data analysis
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/30—Unsupervised data analysis
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B45/00—ICT specially adapted for bioinformatics-related data visualisation, e.g. displaying of maps or networks
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/30—Data warehousing; Computing architectures
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/30—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/118—Prognosis of disease development
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
Abstract
The invention provides application of a combined focal death related gene in an esophageal adenocarcinoma prognosis model, wherein the combined focal death related gene is CASP1, GSDMB, IL1B, PYCARD and ZBP 1; the method for establishing the esophageal adenocarcinoma prognosis model comprises the steps of collecting and sorting data, screening differential expression scorch related genes, analyzing the differential expression scorch related genes, establishing a risk index evaluation model, and establishing a nomogram. The prognosis model of the invention has the advantage of high accuracy, and can provide a new method for disease diagnosis and prognosis for patients with esophageal adenocarcinoma clinically.
Description
Technical Field
The invention belongs to the technical field of tumor molecular biology, and particularly relates to application of a combined focal death related gene in an esophageal adenocarcinoma prognosis model.
Background
Esophageal cancer is one of the most common malignant tumors worldwide, with about 60 million cases of new onset and 54 million cases of death worldwide each year; esophageal Adenocarcinoma (EAC) and Esophageal Squamous Cell Carcinoma (ESCC) constitute the major histological subtypes of esophageal cancer, with the incidence of EAC in western countries increasing dramatically over the past decades; despite advances in surgery, radiation therapy, chemotherapy, and targeted drug therapy, the 5-year survival rate of esophageal cancer is still less than 20%. Therefore, biomarkers and effective models are urgently needed to predict the prognosis of EAC.
Apoptosis is a programmed pattern of cell death with an inflammatory response that depends on the activity of inflammatory proteases of the caspase (caspase) family. Cell apoptosis is characterized by rapid disruption of the cytoplasmic membrane followed by release of proinflammatory intracellular contents. In recent years, studies have been emerging on the role of apoptosis in neurological, infectious, autoimmune, cardiovascular, and neoplastic diseases.
However, although studies have shown the role and importance of cell apoptosis in ESCC, the role of cell apoptosis in EAC remains to be explored, and a convenient, accurate and efficient early warning model for EAC prognosis is lacking at present. Therefore, the present patent comprehensively evaluates the tar death-related genes in EAC and develops a model based on tar death genes to predict patient prognosis.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides the application of the combined tar death related gene in an esophageal adenocarcinoma prognosis model, has the advantage of high accuracy, and can provide a new method for disease diagnosis and prognosis for esophageal adenocarcinoma patients clinically.
In order to achieve the purpose, the specific technical scheme of the invention is as follows:
the application of the combined focal death related genes in an esophageal adenocarcinoma prognosis model, wherein the combined focal death related genes are CASP1, GSDMB, IL1B, PYCARD and ZBP 1.
As another specific embodiment of the invention, the establishment method of the esophageal adenocarcinoma prognosis model comprises the following steps:
step one, collecting and sorting data
Acquiring clinical data and normal tissue transcriptome data of esophageal adenocarcinoma patients from a cancer genome map (TCGA) database;
step two, screening the differential expression tar death related gene
Carrying out differential expression analysis on a plurality of known cell apoptosis related genes by using a Limma algorithm, and screening out a gene combination which is obviously expressed compared with a normal tissue;
step three, re-analysis of differentially expressed tar death related genes
Aiming at the screened gene combinations, performing LASSO regression analysis by adopting a glmnet package of an R language, and removing redundant genes to obtain different gene combinations;
step four, establishing a risk index evaluation model
Establishing a prognosis risk index evaluation model of the scorch related genes by using an LASSO Cox regression method based on the difference gene combination obtained in the third step, wherein the calculation method of the evaluation model comprises the following steps:
risk index score-regression coefficient x gene expression level;
calculating a regression coefficient corresponding to each gene in the differential gene combination by adopting a glmnet function;
step five, constructing a nomogram
Based on genetic and clinical characteristics, alignment plots were constructed using rms, foreign, survival packages, to assess patient survival and to assess the efficacy of the model.
As another embodiment of the present invention, the combinations of genes selected in step two to be significantly expressed compared to normal tissues are CASP1, CASP5, GSDMB, GZMB, IL1B, NLRP6, PYCARD, TNF, TREM2 and ZBP 1.
As another embodiment of the present invention, the set of difference genes obtained in step three are CASP1, GSDMB, IL1B, PYCARD and ZBP 1.
As another specific embodiment of the present invention, the model calculation method in step four is:
risk index score ═ (0.042 × CASP1mRNA expression level) + (-0.025 × GSDMB mRNA expression level) + (0.021 × IL1B mRNA expression level) + (-0.037 × PYCARD mRNA expression level) + (-0.243 × ZBP 1mRNA expression level).
As another embodiment of the present invention, in step four, the patients are divided into a high risk group and a low risk group based on median risk score, and the overall survival rate between the two groups is compared using Kaplan-Meier analysis.
As another embodiment of the invention, principal component analysis is performed on the high-risk group and the low-risk group respectively, separability of the two groups is evaluated through a "prcomp" function, and a working characteristic curve of the subject and an area calculation under the curve are constructed by using the packages of "survival", "survivminiser", "timeROC" and "riskReguration".
The invention has the following beneficial effects:
the invention utilizes the TCGA database to analyze the whole genome of the esophageal adenocarcinoma, establishes a prognosis model related to esophageal adenocarcinoma prognosis based on five pyro-death related genes, has the advantage of high accuracy, and can provide a new method for disease diagnosis and prognosis for esophageal adenocarcinoma patients clinically.
The present invention will be described in further detail with reference to the accompanying drawings.
Drawings
FIG. 1 is a schematic diagram of the primary screening of 10 differentially expressed genes associated with apoptosis in cells according to the present invention;
FIG. 2 is a schematic diagram showing the construction of risk features of apoptosis-related genes in a TCGA cohort according to the present invention;
FIG. 3 is a schematic representation of the prediction of prognosis using the five gene signatures associated with apoptosis in the TCGA cohort in accordance with the present invention;
FIG. 4 is a schematic representation of the validation of the five gene signatures associated with apoptosis in the GSE13898 cohort of the present invention;
FIG. 5 is a nomogram constructed based on the characteristics of five genes associated with pyro-death in accordance with the present invention.
Detailed Description
In order that the above objects, features and advantages of the present invention can be more clearly understood, a more particular description of the invention will be rendered by reference to the appended drawings. It should be noted that the embodiments and features of the embodiments of the present application may be combined with each other without conflict.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, however, the present invention may be practiced in other ways than those specifically described herein, and therefore the scope of the present invention is not limited to the specific embodiments disclosed below.
Method
1) Data set
On 20/5/2021, RNA sequencing (RNA-seq) data and corresponding clinical information were retrieved from the TCGA database for 87 patients (78 with EAC; 9 normal samples). Validation cohort RNA-seq data and clinical information were obtained from a gene expression integration (GEO) database. (accession number: GSE 13898).
2) Identification of Differentially Expressed Genes (DEGs) in a apoptosis-related genome
58 genes associated with tar death were obtained from the review published earlier and the GO term "tar death" (ID: GO 0070269). DEG with P <0.05 was verified using the "limma" packet.
3) Development and verification of prediction model of tar death related gene
Cox regression analysis was used to further assess the value of tar death-related DEG for prognosis. A prognostic model was constructed using R package "glmnet" and LASSO Cox regression analysis, and the risk score was calculated using the following formula:
risk score ═ (Coef i denotes coefficient, Xi denotes normalized gene expression level). EAC patients were divided into low risk and high risk groups according to median risk score and the Overall Survival (OS) between the two groups was compared using Kaplan-Meier analysis. Principal Component Analysis (PCA) was used to evaluate the separability of the two groups by the "prcomp" function. The R packages "survival", "survivmini", "timeROC" and "riskReguration" were used for 1, 2 and 5 years of Receiver Operating Characteristic (ROC) graph and Area Under Curve (AUC) calculationA histogram model with clinical features (including staging and risk scoring) was constructed from R-packs "rms", "foreign" and "survival". The EAC cohort from the GEO database (GSE13898) was used for validation and risk scores were calculated by the same method as described above, dividing the cohort into two subgroups (low risk group and high risk group).
4) Prognostic analysis of variables
Clinical data (gender and stage) of the patients were extracted from the TCGA cohort and GSE13898 cohort. Variables including gender, stage, and risk score were analyzed in the regression model by univariate and multivariate Cox regression analysis.
Results
1) Identification of DEG between EAC and Normal tissue
The expression levels of 58 apoptosis-related genes were verified in 78 EACs and 9 normal tissues of the TCGA cohort, and 10 DEGs were identified (| log2FC | ≧ 1 and P-value < 0.05; CASP1, CASP5, GSDMB, GZMB, IL1B, NLRP6, PYCARD, TNF, TREM2, ZBP1), which were all upregulated in the tumor group. The expression profile of DEG is shown in FIG. 1 (red for higher expression levels; blue for lower expression levels).
2) DEGs-based prognosis model construction
Univariate Cox regression analysis was used to assess the prognostic value of DEG (fig. 2A). Of these, 6 genes (CASP1, CASP5, GSDMB, IL1B, PYCARD and ZBP1) had P values <0.2, high expression of CASP1, CASP5, IL1B was associated with increased risk (HR >1), while GSDMB, PYCARD, ZBP1 were associated with lower risk (HR < 1). Subsequently, LASSO Cox regression analysis yielded 5 genes for prognostic model construction based on the best λ values (fig. 2B, 2C).
The risk score is calculated as follows: risk score ═ 0.042 x expCASP1) + (-0.025 x exppgsdmb) + (0.021 x exppil 1B) + (-0.037 x expPYCARD) + (-0.243 + expZBP 1). Based on the calculated median risk scores, 65 patients were divided into two groups (32 in the high risk group and 33 in the low risk group), and the clinical data are shown in fig. 3A. Patients were better differentiated into two subgroups (fig. 3B). The distribution of risk scores and survival times is shown in fig. 3C, 3D. The OS in the high risk group was significantly lower than in the low risk group (P0.0012, FIG. 3E). ROC analysis of the risk model showed that the AUCs for 1 year, 2 years and 5 years survival were 0.708, 0.815 and 0.952, respectively (fig. 3F, 3G, 3H). Both univariate and multivariate Cox regression analyses showed that characteristics of the apoptosis-associated genes independently predicted prognosis in EAC patients (fig. 3I, 3J).
3) Verification of focal death gene combinations by external data sets
Information from 60 EAC patients in the GEO GSE13898 dataset was used to validate the tar-death-related gene combinations described above. As described above, patients were subdivided into low risk and high risk groups, respectively. Patients were better differentiated into two subgroups (fig. 4A). The distribution of risk scores and survival times is shown in fig. 4B, 4C. Patients in the low risk group had significantly higher survival rates than those in the high risk group (P0.003; fig. 4D). According to the ROC curve, the AUC of the 1-year and 2-year survival prediction models were 0.678 and 0.663, respectively (fig. 4E, 4F). The risk score can also serve as an independent prognostic factor in the validation cohort (fig. 4G, 4H).
4) Nomogram construction based on genetic features and clinical data
To predict the prognosis of EAC patients more accurately, we constructed a histogram model in conjunction with TNM staging, as shown in fig. 5A. The AUC of the nomograms used to predict 1-year, 2-year, and 5-year survival were 0.722, 0.884, and 1.000, respectively (fig. 5B, 5C, and 5D), indicating that the nomograms are ideal for predicting patient survival.
Although the present invention has been described with reference to the preferred embodiments, it is not intended to limit the scope of the invention. It will be appreciated by those skilled in the art that changes may be made without departing from the scope of the invention, and it is intended that all matter contained in the above description or shown in the accompanying drawings shall be interpreted as illustrative and not in a limiting sense.
Claims (7)
1. The application of the combined focal death related gene in an esophageal adenocarcinoma prognosis model is characterized in that the combined focal death related gene is CASP1, GSDMB, IL1B, PYCARD and ZBP 1.
2. The use of the combination of apoptosis-related genes of claim 1 in esophageal adenocarcinoma prognosis models, wherein the esophageal adenocarcinoma prognosis models are established by a method comprising the steps of:
step one, collecting and sorting data
Acquiring clinical data and normal tissue transcriptome data of esophageal adenocarcinoma patients from a cancer genome map (TCGA) database;
step two, screening the differential expression tar death related gene
Carrying out differential expression analysis on a plurality of known cell apoptosis related genes by using a Limma algorithm, and screening out a gene combination which is obviously expressed compared with a normal tissue;
step three, re-analysis of differentially expressed tar death related genes
Aiming at the screened gene combinations, performing LASSO regression analysis by adopting a glmnet package of an R language, and removing redundant genes to obtain different gene combinations;
step four, establishing a risk index evaluation model
Establishing a prognosis risk index evaluation model of the scorch related genes by using an LASSO Cox regression method based on the difference gene combination obtained in the third step, wherein the calculation method of the evaluation model comprises the following steps:
risk index score-regression coefficient x gene expression level;
calculating a regression coefficient corresponding to each gene in the differential gene combination by adopting a glmnet function;
step five, constructing a nomogram
Based on genetic and clinical characteristics, alignment plots were constructed using rms, foreign, survival packages, to assess patient survival and to assess the efficacy of the model.
3. The use of the combined apoptosis-related genes in the esophageal adenocarcinoma prognosis model of claim 2, wherein the genes selected in step two that are significantly expressed compared to normal tissues are CASP1, CASP5, GSDMB, GZMB, IL1B, NLRP6, PYCARD, TNF, TREM2 and ZBP 1.
4. Use of the combined apoptosis-related genes of claim 3 in the esophageal adenocarcinoma prognosis model, wherein the set of difference genes obtained in step three is CASP1, GSDMB, IL1B, PYCARD and ZBP 1.
5. The use of the combined apoptosis-related genes of claim 4 in esophageal adenocarcinoma prognosis models, wherein the model calculation method in step four is as follows:
risk index score ═ (0.042 × CASP1mRNA expression level) + (-0.025 × GSDMB mRNA expression level) + (0.021 × IL1B mRNA expression level) + (-0.037 × PYCARD mRNA expression level) + (-0.243 × ZBP 1mRNA expression level).
6. Use of the combined apoptosis-related genes in esophageal adenocarcinoma prognosis model according to claim 2, wherein in step four, patients are divided into high risk group and low risk group according to median risk score, and the overall survival rate between the two groups is compared using Kaplan-Meier analysis.
7. Use of the combined apoptosis-related genes in esophageal adenocarcinoma prognosis model according to claim 6, wherein the principal component analysis is performed on the high risk group and the low risk group separately, and the subject working characteristic curve and the area under the curve are calculated using the "survivval", "survivminister", "timeROC" and "riskReguration" packages.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111326421.XA CN113969318A (en) | 2021-11-10 | 2021-11-10 | Application of combined tar death related gene in esophageal adenocarcinoma prognosis model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111326421.XA CN113969318A (en) | 2021-11-10 | 2021-11-10 | Application of combined tar death related gene in esophageal adenocarcinoma prognosis model |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113969318A true CN113969318A (en) | 2022-01-25 |
Family
ID=79589621
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111326421.XA Pending CN113969318A (en) | 2021-11-10 | 2021-11-10 | Application of combined tar death related gene in esophageal adenocarcinoma prognosis model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113969318A (en) |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009049129A1 (en) * | 2007-10-11 | 2009-04-16 | The Ohio State University Research Foundation | Methods and compositions for the diagnosis and treatment of esphageal adenocarcinomas |
US20160319372A1 (en) * | 2011-10-17 | 2016-11-03 | Kaohsiung Medical University | Method for predicting the survival status and prognosis of esophageal cancer and a kit thereof |
WO2017137427A1 (en) * | 2016-02-08 | 2017-08-17 | Ucl Business Plc | Detection of cancer of the esophagus |
WO2018204764A1 (en) * | 2017-05-05 | 2018-11-08 | Camp4 Therapeutics Corporation | Identification and targeted modulation of gene signaling networks |
CN109715802A (en) * | 2016-03-18 | 2019-05-03 | 卡里斯科学公司 | Oligonucleotide probe and application thereof |
CN109859801A (en) * | 2019-02-14 | 2019-06-07 | 辽宁省肿瘤医院 | A kind of model and method for building up containing seven genes as biomarker prediction lung squamous cancer prognosis |
CN110499364A (en) * | 2019-07-30 | 2019-11-26 | 北京凯昂医学诊断技术有限公司 | A kind of probe groups and its kit and application for detecting the full exon of extended pattern hereditary disease |
WO2020102043A1 (en) * | 2018-11-15 | 2020-05-22 | Ampel Biosolutions, Llc | Machine learning disease prediction and treatment prioritization |
CN112687342A (en) * | 2020-11-16 | 2021-04-20 | 徐同鹏 | Application of a group of immune-related molecular markers identified based on TCGA (TCGA) database in esophageal cancer prognosis prediction |
CN112750497A (en) * | 2021-01-11 | 2021-05-04 | 湖南大学 | Multisource data fusion framework for revealing breast cancer immune evasion regulation and control mechanism |
CN113355419A (en) * | 2021-06-28 | 2021-09-07 | 广州中医药大学(广州中医药研究院) | Breast cancer prognosis risk prediction marker composition and application |
-
2021
- 2021-11-10 CN CN202111326421.XA patent/CN113969318A/en active Pending
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009049129A1 (en) * | 2007-10-11 | 2009-04-16 | The Ohio State University Research Foundation | Methods and compositions for the diagnosis and treatment of esphageal adenocarcinomas |
US20160319372A1 (en) * | 2011-10-17 | 2016-11-03 | Kaohsiung Medical University | Method for predicting the survival status and prognosis of esophageal cancer and a kit thereof |
WO2017137427A1 (en) * | 2016-02-08 | 2017-08-17 | Ucl Business Plc | Detection of cancer of the esophagus |
CN109715802A (en) * | 2016-03-18 | 2019-05-03 | 卡里斯科学公司 | Oligonucleotide probe and application thereof |
WO2018204764A1 (en) * | 2017-05-05 | 2018-11-08 | Camp4 Therapeutics Corporation | Identification and targeted modulation of gene signaling networks |
WO2020102043A1 (en) * | 2018-11-15 | 2020-05-22 | Ampel Biosolutions, Llc | Machine learning disease prediction and treatment prioritization |
CN109859801A (en) * | 2019-02-14 | 2019-06-07 | 辽宁省肿瘤医院 | A kind of model and method for building up containing seven genes as biomarker prediction lung squamous cancer prognosis |
CN110499364A (en) * | 2019-07-30 | 2019-11-26 | 北京凯昂医学诊断技术有限公司 | A kind of probe groups and its kit and application for detecting the full exon of extended pattern hereditary disease |
CN112687342A (en) * | 2020-11-16 | 2021-04-20 | 徐同鹏 | Application of a group of immune-related molecular markers identified based on TCGA (TCGA) database in esophageal cancer prognosis prediction |
CN112750497A (en) * | 2021-01-11 | 2021-05-04 | 湖南大学 | Multisource data fusion framework for revealing breast cancer immune evasion regulation and control mechanism |
CN113355419A (en) * | 2021-06-28 | 2021-09-07 | 广州中医药大学(广州中医药研究院) | Breast cancer prognosis risk prediction marker composition and application |
Non-Patent Citations (4)
Title |
---|
RUIJIE ZENG等: "Identification of the Novel Pyroptosis-Related Gene Signature in Patients with Esophageal Adenocarcinoma", BIORXIV, pages 1 - 34 * |
SOO MI KIM等: "Prognostic Biomarkers for Esophageal Adenocarcinoma Identified by Analysis of Tumor Transcriptome", PLOS ONE, vol. 5, pages 15074 * |
房渝;任冬仁;陈浩;胡建国;陈晓欣;: "Barrett食管和食管腺癌的动物模型", 胃肠病学, no. 04, pages 193 - 197 * |
董娜;邵峰;: "细胞焦亡的机制和功能", 中国科学:生命科学, no. 12, pages 1606 - 1634 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109859801B (en) | Model for predicting lung squamous carcinoma prognosis by using seven genes as biomarkers and establishing method | |
Cheng et al. | Histologic grading of urothelial carcinoma: a reappraisal | |
CN109493969B (en) | Model for evaluating prognosis of patients with Paget's disease complicated with invasive ductal carcinoma and application of model | |
CN111128299B (en) | Construction method of ceRNA regulation and control network with significant correlation to colorectal cancer prognosis | |
CN107885973B (en) | Clinical research method of DNA repair gene in colorectal cancer | |
CN110305964A (en) | A kind of foundation of patients with prostate cancer prognosis recurrence risk profile mark tool and its risk evaluation model | |
Yuan et al. | Identification of a nine immune-related lncRNA signature as a novel diagnostic biomarker for hepatocellular carcinoma | |
CN114891887A (en) | Method for screening triple negative breast cancer prognosis gene marker | |
CN112626218A (en) | Gene expression classifier and in-vitro diagnosis kit for predicting pancreatic cancer metastasis risk | |
US20230383364A1 (en) | Prognostic model of hepatocellular carcinoma based on ddr and icd gene expression and construction method and application thereof | |
CN115410713A (en) | Hepatocellular carcinoma prognosis risk prediction model construction based on immune-related gene | |
Wan et al. | Discovery and validation of a metastasis-related prognostic and diagnostic biomarker for melanoma based on single cell and gene expression datasets | |
CN116013403A (en) | Construction method and application of cervical cancer methylation related long-chain non-coding RNA prognosis and immunotherapy curative effect prediction model | |
CN113658696B (en) | Prediction system for predicting prognosis of gastric cancer based on combination of patient age, nutrition index, tumor stage and tumor marker | |
Zhao et al. | A signature constructed with mitophagy-related genes to predict the prognosis and therapy response for breast cancer | |
CN107760783B (en) | Gastric cancer peritoneal metastasis prediction model based on 108 genes and application thereof | |
CN113969318A (en) | Application of combined tar death related gene in esophageal adenocarcinoma prognosis model | |
Tohme et al. | The use of machine learning to create a risk score to predict survival in patients with hepatocellular carcinoma: a TCGA cohort analysis | |
CN112037863A (en) | Early NSCLC prognosis prediction system | |
CN116153387A (en) | Overall survival rate prognosis model for lung squamous carcinoma patient and application | |
CN112746108B (en) | Gene marker for tumor prognosis hierarchical evaluation, evaluation method and application | |
Houri et al. | Prediction of endometrial cancer recurrence by using a novel machine learning algorithm: An Israeli gynecologic oncology group study | |
CN112382341B (en) | Method for identifying biomarkers related to prognosis of esophageal squamous carcinoma | |
CN116200490A (en) | Method for detecting tiny residual focus of solid tumor | |
CN114627970A (en) | Prognosis model of scorching-related lncRNA of colon adenocarcinoma and construction method and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |