CN108647493B - Individualized prognosis evaluation method for renal clear cell carcinoma - Google Patents

Individualized prognosis evaluation method for renal clear cell carcinoma Download PDF

Info

Publication number
CN108647493B
CN108647493B CN201810440933.0A CN201810440933A CN108647493B CN 108647493 B CN108647493 B CN 108647493B CN 201810440933 A CN201810440933 A CN 201810440933A CN 108647493 B CN108647493 B CN 108647493B
Authority
CN
China
Prior art keywords
gene
patient
cell carcinoma
survival
clear cell
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810440933.0A
Other languages
Chinese (zh)
Other versions
CN108647493A (en
Inventor
李文兴
李功华
黄京飞
赵旭东
代绍兴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kunming Institute of Zoology of CAS
Original Assignee
Kunming Institute of Zoology of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kunming Institute of Zoology of CAS filed Critical Kunming Institute of Zoology of CAS
Priority to CN201810440933.0A priority Critical patent/CN108647493B/en
Publication of CN108647493A publication Critical patent/CN108647493A/en
Application granted granted Critical
Publication of CN108647493B publication Critical patent/CN108647493B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression

Landscapes

  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Medical Informatics (AREA)
  • Biophysics (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Epidemiology (AREA)
  • Databases & Information Systems (AREA)
  • Public Health (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioethics (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

The invention discloses a kidney clear cell carcinoma individualized prognosis evaluation method based on polygene expression profile, which comprises the following steps: obtaining a renal clear cell carcinoma prognosis risk gene list and gene weight; constructing a prognosis evaluation model by using a tumor tissue transcriptome and survival data of a renal clear cell carcinoma patient; calculating a risk score for the patient based on the gene expression profile of the tumor tissue of the renal clear cell carcinoma patient; the annual probability of survival of the patient is calculated based on the risk score of the patient. The annual survival probability of renal clear cell carcinoma patients obtained by the method of the invention is highly consistent with the actual annual survival rate (linear correlation R)20.999 and P8.21E-74). The method is proved to have high prediction accuracy and to be highly consistent with the actual survival state. Meanwhile, for each tumor patient, the invention can provide a survival probability curve specific to the patient.

Description

Individualized prognosis evaluation method for renal clear cell carcinoma
Technical Field
The invention belongs to the field of biotechnology and medicine, and particularly relates to a renal clear cell carcinoma personalized prognosis evaluation method based on a polygene expression profile.
Background
Clear cell carcinoma of the kidney is a renal cell carcinoma, accounting for 60% -70% of renal cancers. The etiology of renal clear cell carcinoma is unknown, and mainly affects the male population 60-70 years old. Patients with renal clear cell carcinoma often have a better prognosis. Global Burden of Disease (GBD) data shows that the number of people with kidney cancer worldwide in 2016 reaches 130 ten thousand, wherein the number of people with kidney cancer in china is 16.6 ten thousand. The number of deaths of kidney cancer patients worldwide in 2016 is 13.2 ten thousand, accounting for 0.24% of the total deaths. The number of deaths in 2016 in China is 1.6 ten thousand, accounting for 0.17% of the total deaths. Statistics show that the prevalence and the mortality of renal clear cell carcinoma increase rapidly and the prevalence and the mortality of Chinese increase slowly from 1990 to 2016.
The currently internationally accepted method of staging tumors is the TNM staging system, which is a method of classifying malignant tumors proposed by the American Joint Committee on Cancer (AJCC). The National Cancer Institute (NCI) describes the staging of TNM as: t refers to the size and extent of the major tumor, which is often referred to as the primary tumor. N refers to the number of nearby lymph nodes with cancer. M refers to whether the cancer has metastasized, i.e., spread from the primary tumor to other parts of the body. Malignant tumors can be roughly classified into stage I, stage II, stage III and stage IV according to the above indexes, wherein higher stage indicates higher malignancy of the tumor. The TNM staging system is helpful for the treatment and prognosis evaluation of tumor patients. However, due to the occurrence mechanism of tumors and the difference of in vivo microenvironment among different individuals, the survival time of different patients varies greatly, and the TNM staging system does not well reflect the prognosis of the patients. Studies have found that patients diagnosed with stage I may have a shorter survival (1-2 years), whereas patients diagnosed with stage IV may have a longer survival (5 years and beyond). Thus, TNM staging systems may be more likely to describe the average level of a population of cancer patients, with less suitability for personalized diagnosis and treatment. On the other hand, for patients diagnosed in the late stage (stage III and IV), it is difficult to select a certain treatment scheme for the patients and medical staff, resulting in premature death of many tumor patients who can survive for a long time due to over-medical treatment or medical misappropriation; while other patients who should be treated appropriately may have prolonged survival, giving up treatment or improper treatment may also lead to premature death of the tumor patient.
At present, there are reports that can be used to evaluate prognosis of tumor patients by using gene expression profiles. However, most reports only use a single or a plurality of genes, only classify one population, and only qualitatively classify the survival time of individuals (such as two indexes of good prognosis and poor prognosis). Therefore, there is a need to build more elaborate personalized tumor prognosis evaluation models to evaluate the survival time of patients to select appropriate treatment regimens.
Disclosure of Invention
In view of the above, the invention provides a renal clear cell carcinoma personalized prognosis evaluation method based on a polygene expression profile, which can accurately predict the annual survival probability of a patient.
In order to solve the technical problems, the invention discloses a renal clear cell carcinoma personalized prognosis evaluation method based on a polygene expression profile, which comprises the following steps:
step 1, obtaining a renal clear cell carcinoma prognosis risk gene list and gene weight;
step 2, constructing a prognosis evaluation model by using the tumor tissue transcriptome and survival data of the renal clear cell carcinoma patient;
step 3, calculating the risk score of the patient according to the gene expression profile of the tumor tissue of the patient with renal clear cell carcinoma;
And 4, calculating the annual survival probability of the patient according to the risk score of the patient.
Optionally, the obtaining of the renal clear cell carcinoma prognosis risk gene list and the gene weights in step 1 specifically include:
step 1.1, downloading transcriptome Data and clinical Data of tumor tissues and para-carcinoma tissues of patients with renal clear cell carcinoma from a Genomic Data common Data Portal database to obtain a tumor tissue gene expression profile FPKM value of the patients with renal clear cell carcinoma, and carrying out logarithmic conversion;
step 1.2, setting the total number of samples as m, dividing all samples into three groups according to the three quantiles of the gene expression values of the samples, wherein the gene expression values refer to the FPKM values obtained in the step 1.1 and are represented by V, and the ith gene is marked as ViCalculating the survival risk of the third grouping compared with the first grouping by using a Cox proportion risk model to obtain a risk ratio HRi and a P value of the ith gene; defining P value less than 0.05, screening survival risk gene with significance, and marking as N1(ii) a In addition, calculating the correlation between each gene and the survival days of the patient to obtain the correlation coefficient r and the P value of each gene; definition P value <0.05 significance, screening significant survival related gene, and marking as N 2(ii) a Defining the intersection of the survival risk gene and the survival related gene as a prognosis risk gene, and marking as N, then:
N=N1∩N2 (1)
step 1.3, calculating the weight W of the ith gene according to the risk ratio HRi of the ith geneiThe calculation formula is as follows:
Figure GDA0003280409780000031
thus, the weight of each gene is obtained, and the list of the renal clear cell carcinoma prognosis risk genes and the gene weight are finally obtained.
Optionally, the list of renal clear cell carcinoma prognosis risk genes and the gene weights are shown in the following table:
Figure GDA0003280409780000032
Figure GDA0003280409780000041
Figure GDA0003280409780000051
Figure GDA0003280409780000061
Figure GDA0003280409780000071
Figure GDA0003280409780000081
optionally, the step 2 of constructing a prognosis evaluation model by using the tumor tissue transcriptome and survival data of the renal clear cell carcinoma patient includes:
step 2.1, defining the gene expression value as V, and calculating the risk score S of the ith patient according to the expression value and the weight of the ith gene in the jth samplej(ii) a The calculation formula is as follows:
Figure GDA0003280409780000091
wherein j represents a sample number, VijExpressing the expression value of the ith gene in the jth sample;
step 2.2, all renal clear cell carcinoma patient samples are ranked from low to high risk score, and an average risk score is calculated for each 50 samples using a sliding window model
Figure GDA0003280409780000092
The calculation formula is as follows:
Figure GDA0003280409780000093
where j +49 represents the last 50 samples counted from sample j;
Step 2.3, performing curve fitting on the survival data of 50 samples by using Weibull distribution, wherein the probability density function of the Weibull distribution is as follows:
Figure GDA0003280409780000094
where k > 0 is a shape parameter and λ > 0 is a distributed ratio parameter;
step 2.4, calculate for every 50 samples
Figure GDA0003280409780000095
Corresponding kjAnd λj(ii) a Empirically, kjIs a relatively fixed number with the mean being:
Figure GDA0003280409780000096
wherein k isjThe shape parameter of the Weibull distribution from the jth sample to the j +49 th sample;
proportional parameter lambdajIs widely varied, defining lambdajAnd
Figure GDA0003280409780000101
the functional relationship of (A) is as follows:
Figure GDA0003280409780000102
wherein λ isjA proportion parameter representing the Weibull distribution of the survival curves from the jth sample to the j +49 th sample;
where e is the base of the natural logarithm, α, β are parameters of the function, and the logarithm is taken to the above formula:
Figure GDA0003280409780000103
where log λjAnd
Figure GDA0003280409780000104
solving for linear relation through linear fitting;
according to average risk score
Figure GDA0003280409780000105
With Weibull distribution parameter lambdajThe obtained function relationship is as follows:
Figure GDA0003280409780000106
will be provided with
Figure GDA0003280409780000107
Substituting the function to obtain a predicted lambdaj′,λj' for the expected distribution parameter calculated with this function, λ is calculatediAnd λj' correlation to obtain a correlation coefficient R20.883, and 6.50E-100.
Optionally, the calculating the risk score of the patient according to the gene expression profile of the tumor tissue of the renal clear cell carcinoma patient in the step 3 specifically comprises:
Obtaining FPKM value of ith gene expression profile of tumor tissue of renal clear cell carcinoma patient, and recording as: vl(ii) a The weight corresponding to the ith gene is recorded as: wi(ii) a The patient risk score was scored as: s; the calculation formula is as follows:
Figure GDA0003280409780000108
wherein i is the gene number and n is the gene number.
Optionally, the step 4 of calculating the annual survival probability of the patient according to the risk score of the patient specifically includes: substituting the risk score S of a patient into the cumulative distribution function of the Weibull distribution yields a survival probability function for that patient as:
Figure GDA0003280409780000111
wherein t is time, alpha, beta, S,
Figure GDA0003280409780000112
Are all fixed parameters.
Compared with the prior art, the invention can obtain the following technical effects:
1) and (2) continuously: the invention can predict the survival probability of tumor patients in continuous time. For example, the probability of survival of the patient per month, the probability of survival of the patient per year, etc. may be given. The current clinical typing method can only give a qualitative judgment.
2) More accurate: compared with the traditional TNM staging, the individual prognosis evaluation method for the renal clear cell carcinoma based on the polygene expression profile can more accurately reflect the survival state of the patient.
3) Personalization: for each tumor patient, the invention can give a survival probability curve specific to the patient, which is not possessed by a general tumor prognosis evaluation model.
Of course, it is not necessary for any one product in which the invention is practiced to achieve all of the above-described technical effects simultaneously.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention and not to limit the invention. In the drawings:
FIG. 1 is a comparison of the predicted average annual survival probability and the actual annual survival probability of the present invention;
FIG. 2 is a correlation of TNM tumor stage and patient survival time according to the present invention;
FIG. 3 is a curve fitted to the Weibull distribution parameter scale according to the present invention for the average risk score;
FIG. 4 is a plot of the fitted residuals of the mean risk scores of the present invention with the Weibull distribution parameter scale;
FIG. 5 shows the result of the prognosis evaluation of the personalized renal clear cell carcinoma according to the present invention.
Detailed Description
The following embodiments are described in detail with reference to the accompanying drawings, so that how to implement the technical features of the present invention to solve the technical problems and achieve the technical effects can be fully understood and implemented.
The invention discloses a kidney clear cell carcinoma individualized prognosis evaluation method based on polygene expression profile, which comprises the following steps:
Step 1, obtaining a renal clear cell carcinoma prognosis risk gene list and gene weight, specifically:
step 1.1, downloading transcriptome Data and clinical Data of tumor tissues and para-carcinoma tissues of patients with renal clear cell carcinoma from a Genomic Data common Data Portal database to obtain a tumor tissue gene expression profile FPKM (gene expression profiling for renal clear cell carcinoma patients) (FPKM)Fragments Per Kilobase of transcript per Mill fragments mapped) values, log transformed (log 2).
Step 1.2, assuming the total number of samples as m, all samples were scored as V for the ith gene based on their gene expression values (FPKM value obtained in step 1.1, denoted by V)i) The third quantile is divided into three groups, the survival risk of the third group compared with the first group is calculated by using a Cox proportional risk model, and the risk ratio HRi and the P value of the ith gene are obtained. Defining P value less than 0.05, screening survival risk gene with significance, and marking as N1. In addition, the correlation between each gene and the survival days of the patient is calculated, and the correlation coefficient r and the P value of each gene are obtained. Defining P value less than 0.05, screening survival related gene with significance, and marking as N2. Defining the intersection of the survival risk gene and the survival related gene as a prognosis risk gene, and marking as N, then:
N=N1∩N2 (1)
Step 1.3, calculating the weight W of the ith gene according to the risk ratio HRi of the ith geneiThe calculation formula is as follows:
Figure GDA0003280409780000121
thus, the weight of each gene is calculated.
The list of the risk genes and the gene weights of the prognosis renal clear cell carcinoma are shown in table 1.
TABLE 1 Gene names and weights
Figure GDA0003280409780000122
Figure GDA0003280409780000131
Figure GDA0003280409780000141
Figure GDA0003280409780000151
Figure GDA0003280409780000161
Figure GDA0003280409780000171
Figure GDA0003280409780000181
Step 2, constructing a prognosis evaluation model by using the tumor tissue transcriptome and survival data of the renal clear cell carcinoma patient, which specifically comprises the following steps:
step 2.1, defining the gene expression value as V, and calculating the risk score S of the ith patient according to the expression value and the weight of the ith gene in the jth samplej(ii) a The calculation formula is as follows:
Figure GDA0003280409780000182
wherein j represents a sample number, VijExpressing the expression value of the ith gene in the jth sample;
step 2.2, all renal clear cell carcinoma patient samples were ranked from low to high risk score using a sliding window model (Kang HJ et alre.2011; 478(7370): 483-489.) calculate the average risk score for each 50 samples
Figure GDA0003280409780000183
The calculation formula is as follows:
Figure GDA0003280409780000184
where j +49 represents the last 50 samples counted from sample j.
Step 2.3, performing curve fitting on the survival data of 50 samples by using Weibull distribution, wherein the probability density function of the Weibull distribution is as follows:
Figure GDA0003280409780000185
Where k > 0 is the shape (shape) parameter and λ > 0 is the scale of distribution (scale) parameter.
Step 2.4, calculate for every 50 samples
Figure GDA0003280409780000191
Corresponding kjAnd λj. Empirically, kjIs a relatively fixed number with the mean being:
Figure GDA0003280409780000192
wherein k isjThe shape parameter of the Weibull distribution from the jth sample to the j +49 sample survival curve is the same as k in the above, wherein j is added to refer to a specific group of samples;
proportional parameter lambdajIs widely varied, defining lambdajAnd
Figure GDA0003280409780000193
the functional relationship of (A) is as follows:
Figure GDA0003280409780000194
wherein λ isjA proportion parameter representing the Weibull distribution of the survival curves from the jth sample to the j +49 th sample;
where e is the base of the natural logarithm, α, β are parameters of the function, and the logarithm is taken to obtain the formula:
Figure GDA0003280409780000195
where log λjAnd
Figure GDA0003280409780000196
the linear relationship can be solved by linear fitting.
FIG. 3 shows the average risk score
Figure GDA0003280409780000197
With Weibull distribution parameter lambdajThe obtained function relationship is as follows:
Figure GDA0003280409780000198
will be provided with
Figure GDA0003280409780000199
Substituting the function to obtain a predicted lambdaj′(λj' is an expected distribution parameter calculated using the function), lambda is calculatedjAnd λj' correlation can be given by a correlation coefficient R20.883, and 6.50E-100.
By analyzing the fitted residual map and the Q-Q map (FIG. 4), it was shown that the model achieved significance, i.e., the average risk score
Figure GDA00032804097800001910
With Weibull distribution parameter lambdajIs in a functional relationship ofThe letter is as follows.
Step 3, calculating the risk score of the patient according to the gene expression profile of the tumor tissue of the renal clear cell carcinoma patient, which comprises the following specific steps:
FPKM values (including all or most of the genes listed in table 1) were obtained for the ith gene expression profile of tumor tissue from renal clear cell carcinoma patients and are reported as: vi(i is the gene number); the weights corresponding to the ith gene in table 1 are given as: wi(i is the gene number); the patient risk score was scored as: s; the calculation formula is as follows:
Figure GDA0003280409780000201
wherein i is the gene number and n is the number of genes listed in Table 1.
Step 4, calculating the annual survival probability of the patient according to the risk score of the patient, specifically: substituting the risk score S of a patient into the cumulative distribution function of the Weibull distribution yields a probability of survival function for that patient as:
Figure GDA0003280409780000202
wherein t is time, alpha, beta, S,
Figure GDA0003280409780000203
Are all fixed parameters.
FIG. 5 shows the survival probability curve for a patient, with days on the abscissa and probability of survival on the ordinate. The annual survival probability of the patient is indicated below the curve. The black box in the upper right corner indicates the actual number of days the patient survived, and Status (Status)0 indicates that the patient was still alive. The red dots on the curve mark the number of days and probability of survival for the patient, and the probability of survival for the patient is around 0.64.
The invention utilizes TCGA-KIRC transcriptome and clinical data to carry out personalized survival prediction on all renal clear cell carcinoma patients, and utilizes a cross validation method to carry out survival prediction on all renal clear cell carcinoma patientsThe results obtained were verified. The results show that the annual survival probability of the renal clear cell carcinoma patients obtained by the renal clear cell carcinoma individualized prognosis evaluation method adopting the multi-gene expression profile is highly consistent with the actual annual survival rate (linear correlation R)20.999, P-value 8.21E-74, fig. 1). The method is proved to have high prediction accuracy and to be highly consistent with the actual survival state.
As shown in fig. 2, TNM staging has a lower correlation with survival time for renal clear cell carcinoma patients. Fig. 1 and fig. 2 can be compared to obtain the individual prognosis evaluation method of renal clear cell carcinoma based on multi-gene expression profile, which can reflect the survival status of the patient more accurately than the traditional TNM stage.
As shown in FIG. 5, for each tumor patient, the present invention can give a survival probability curve (FIG. 5) specific to the patient, which is not available in general tumor prognosis evaluation models.
While the foregoing description shows and describes several preferred embodiments of the invention, it is to be understood, as noted above, that the invention is not limited to the forms disclosed herein, but is not to be construed as excluding other embodiments and is capable of use in various other combinations, modifications, and environments and is capable of changes within the scope of the inventive concept as expressed herein, commensurate with the above teachings, or the skill or knowledge of the relevant art. And that modifications and variations may be effected by those skilled in the art without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (2)

1. A method for personalized prognosis of renal clear cell carcinoma, comprising the steps of:
step 1, obtaining a renal clear cell carcinoma prognosis risk gene list and gene weight;
step 2, constructing a prognosis evaluation model by using the tumor tissue transcriptome and survival data of the renal clear cell carcinoma patient;
step 3, calculating the risk score of the patient according to the gene expression profile of the tumor tissue of the patient with renal clear cell carcinoma;
step 4, calculating the annual survival probability of the patient according to the risk score of the patient;
the obtaining of the renal clear cell carcinoma prognosis risk gene list and the gene weight in the step 1 are specifically as follows:
step 1.1, downloading transcriptome Data and clinical Data of tumor tissues and para-carcinoma tissues of patients with renal clear cell carcinoma from a Genomic Data common Data Portal database to obtain a tumor tissue gene expression profile FPKM value of the patients with renal clear cell carcinoma, and carrying out logarithmic conversion;
step 1.2, setting the total number of samples as m, dividing all samples into three groups according to the three quantiles of the gene expression values of the samples, wherein the gene expression values refer to the FPKM values obtained in the step 1.1 and are represented by V, and the ith gene is marked as ViCalculating the survival risk of the third grouping compared with the first grouping by using a Cox proportion risk model to obtain a risk ratio HRi and a P value of the ith gene; defining P value <0.05 significant, screening significant survival risk gene, and marking as N1(ii) a In addition, calculating the correlation between each gene and the survival days of the patient to obtain the correlation coefficient r and the P value of each gene; defining P value<0.05 significance, screening significant survival related gene, and marking as N2(ii) a Defining the intersection of the survival risk gene and the survival related gene as a prognosis risk gene, and marking as N, then:
N=N1∩N2 (1)
step 1.3, calculating the weight W of the ith gene according to the risk ratio HRi of the ith geneiThe calculation formula is as follows:
Figure FDA0003280409770000021
thus obtaining the weight of each gene, and finally obtaining a renal clear cell carcinoma prognosis risk gene list and the gene weight;
the list of risk genes and the gene weights for prognosis of renal clear cell carcinoma are shown in the following table:
Figure FDA0003280409770000022
Figure FDA0003280409770000031
Figure FDA0003280409770000041
Figure FDA0003280409770000051
Figure FDA0003280409770000061
Figure FDA0003280409770000071
in the step 2, a prognosis evaluation model is constructed by using the tumor tissue transcriptome and survival data of the renal clear cell carcinoma patient, and the method specifically comprises the following steps:
step 2.1, defining the gene expression value as V, and calculating the risk score S of the ith patient according to the expression value and the weight of the ith gene in the jth samplej(ii) a The calculation formula is as follows:
Figure FDA0003280409770000072
wherein j represents a sample number, VijExpressing the expression value of the ith gene in the jth sample;
Step 2.2, all renal clear cell carcinoma patient samples are ranked from low to high according to risk score, and the average risk is calculated for each 50 samples using a sliding window modelScore of
Figure FDA0003280409770000073
The calculation formula is as follows:
Figure FDA0003280409770000074
where j +49 represents the last 50 samples counted from sample j;
step 2.3, performing curve fitting on the survival data of 50 samples by using Weibull distribution, wherein the probability density function of the Weibull distribution is as follows:
Figure FDA0003280409770000081
where k > 0 is a shape parameter and λ > 0 is a distributed ratio parameter;
step 2.4, calculate for every 50 samples
Figure FDA0003280409770000082
Corresponding kjAnd λj;kjIs a fixed number with the mean:
Figure FDA0003280409770000083
wherein k isjThe shape parameter of the Weibull distribution from the jth sample to the j +49 th sample;
definition of lambdajAnd
Figure FDA0003280409770000084
the functional relationship of (A) is as follows:
Figure FDA0003280409770000085
wherein λ isjA proportion parameter representing the Weibull distribution of the survival curves from the jth sample to the j +49 th sample;
where e is the base of the natural logarithm, α, β are parameters of the function, and the logarithm is taken to the above formula:
Figure FDA0003280409770000086
where log λjAnd
Figure FDA0003280409770000087
solving for linear relation through linear fitting;
according to average risk score
Figure FDA0003280409770000088
With Weibull distribution parameter lambdajThe obtained function relationship is as follows:
Figure FDA0003280409770000089
will be provided with
Figure FDA0003280409770000091
Substituting the function to obtain a predicted lambdaj′,λj' for the expected distribution parameter calculated with this function, λ is calculated jAnd λj' correlation to obtain a correlation coefficient R20.883, P value 6.50E-100;
the step 3 of calculating the risk score of the patient according to the gene expression profile of the tumor tissue of the renal clear cell carcinoma patient specifically comprises the following steps:
obtaining FPKM value of ith gene expression profile of tumor tissue of renal clear cell carcinoma patient, and recording as: vi(ii) a The weight corresponding to the ith gene is recorded as: wi(ii) a The patient risk score was scored as: s; the calculation formula is as follows:
Figure FDA0003280409770000092
wherein i is the gene number and n is the gene number.
2. The prognostic assessment method according to claim 1, wherein the step 4 of calculating the annual survival probability of the patient based on the risk score of the patient comprises: substituting the risk score S of a patient into the cumulative distribution function of the Weibull distribution yields a survival probability function for that patient as:
Figure FDA0003280409770000093
wherein t is time, alpha, beta, S,
Figure FDA0003280409770000094
Are all fixed parameters.
CN201810440933.0A 2018-05-09 2018-05-09 Individualized prognosis evaluation method for renal clear cell carcinoma Active CN108647493B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810440933.0A CN108647493B (en) 2018-05-09 2018-05-09 Individualized prognosis evaluation method for renal clear cell carcinoma

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810440933.0A CN108647493B (en) 2018-05-09 2018-05-09 Individualized prognosis evaluation method for renal clear cell carcinoma

Publications (2)

Publication Number Publication Date
CN108647493A CN108647493A (en) 2018-10-12
CN108647493B true CN108647493B (en) 2022-01-18

Family

ID=63753958

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810440933.0A Active CN108647493B (en) 2018-05-09 2018-05-09 Individualized prognosis evaluation method for renal clear cell carcinoma

Country Status (1)

Country Link
CN (1) CN108647493B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109887600A (en) * 2019-04-16 2019-06-14 上海理工大学 A kind of analysis method of pair of non-small cell lung cancer prognosis Survival
CN111944898A (en) * 2020-08-04 2020-11-17 佛山科学技术学院 Characteristic mRNA expression profile combination and renal clear cell carcinoma early prediction method
CN111944899A (en) * 2020-08-04 2020-11-17 佛山科学技术学院 Feature miRNA expression profile combination and renal papillary cell carcinoma early prediction method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101688240A (en) * 2007-04-10 2010-03-31 国立台湾大学 By survival expection after microRNA prediction cancer patients's the treatment
CN103547682A (en) * 2011-01-14 2014-01-29 新加坡科技研究局 Gene signatures for use with hepatocellular carcinoma
CN104024436A (en) * 2011-10-24 2014-09-03 纯德赛尔医药公司 Marker genes for prostate cancer classification
CN104854247A (en) * 2012-10-12 2015-08-19 新加坡科技研究局 Method of prognosis and stratification of ovarian cancer
CN106407689A (en) * 2016-09-27 2017-02-15 牟合(上海)生物科技有限公司 Stomach cancer prognostic marker screening and classifying method based on gene expression profile
CN107463798A (en) * 2017-08-02 2017-12-12 南京高新生物医药公共服务平台有限公司 Predict the 12 gene expressions classification device and its construction method of adenocarcinoma of colon prognosis

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7695915B2 (en) * 2007-01-31 2010-04-13 Celera Corporation Molecular prognostic signature for predicting breast cancer distant metastasis, and uses thereof

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101688240A (en) * 2007-04-10 2010-03-31 国立台湾大学 By survival expection after microRNA prediction cancer patients's the treatment
CN103547682A (en) * 2011-01-14 2014-01-29 新加坡科技研究局 Gene signatures for use with hepatocellular carcinoma
CN104024436A (en) * 2011-10-24 2014-09-03 纯德赛尔医药公司 Marker genes for prostate cancer classification
CN104854247A (en) * 2012-10-12 2015-08-19 新加坡科技研究局 Method of prognosis and stratification of ovarian cancer
CN106407689A (en) * 2016-09-27 2017-02-15 牟合(上海)生物科技有限公司 Stomach cancer prognostic marker screening and classifying method based on gene expression profile
CN107463798A (en) * 2017-08-02 2017-12-12 南京高新生物医药公共服务平台有限公司 Predict the 12 gene expressions classification device and its construction method of adenocarcinoma of colon prognosis

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"Identification of metabolism‑associated genes and pathways involved in different stages of clear cell renal cell carcinoma";HUI-JUAN LI et al.;《Oncology Letters》;20171208;2316-2322 *

Also Published As

Publication number Publication date
CN108647493A (en) 2018-10-12

Similar Documents

Publication Publication Date Title
CN108363907B (en) Lung adenocarcinoma personalized prognosis evaluation method based on polygene expression profile
CN108630317B (en) Liver cancer personalized prognosis evaluation method based on polygene expression profile
CN112048559B (en) Model construction and clinical application of m 6A-related IncRNA network gastric cancer prognosis
CN108470111B (en) Stomach cancer personalized prognosis evaluation method based on polygene expression profile
CN108648826B (en) Pancreatic cancer personalized prognosis evaluation method based on polygene expression profile
CN108647493B (en) Individualized prognosis evaluation method for renal clear cell carcinoma
Xie et al. Radiomics nomogram for prediction disease-free survival and adjuvant chemotherapy benefits in patients with resected stage I lung adenocarcinoma
CN111564214B (en) Method for establishing and verifying breast cancer prognosis evaluation model based on 7 special genes
Chen et al. Clinical use of a machine learning histopathological image signature in diagnosis and survival prediction of clear cell renal cell carcinoma
CN108611416B (en) Cervical cancer personalized prognosis evaluation method based on polygene expression profile
CN108733980B (en) Glioblastoma multiforme personalized prognosis evaluation method based on polygene expression profile
CN114891887A (en) Method for screening triple negative breast cancer prognosis gene marker
CN115019965A (en) Method for constructing liver cancer patient survival prediction model based on cell death related gene
CN113355419B (en) Breast cancer prognosis risk prediction marker composition and application
CN110305964A (en) A kind of foundation of patients with prostate cancer prognosis recurrence risk profile mark tool and its risk evaluation model
US20230383364A1 (en) Prognostic model of hepatocellular carcinoma based on ddr and icd gene expression and construction method and application thereof
CN109055562A (en) A kind of biomarker, predict clear-cell carcinoma recurrence and mortality risk method
CN108320806B (en) Colon cancer personalized prognosis evaluation method based on polygene expression profile
CN115482880A (en) Head and neck squamous carcinoma glycolysis related gene prognosis model, construction method and application
Chen et al. N6-methyladenosine-related long noncoding RNA is a potential biomarker for predicting pancreatic cancer prognosis
CN114203256B (en) MIBC typing and prognosis prediction model construction method based on microbial abundance
US20180100199A1 (en) Method for determining prognosis of breast cancer patient by using gene expression data
CN109979532B (en) Thyroid papillary carcinoma distant metastasis molecular mutation prediction model, method and system
CN114678062B (en) Hepatocellular carcinoma prognosis prediction system based on multiple sets of chemical characteristics and prediction method thereof
CN114171200A (en) PTC (Positive temperature coefficient) prognosis marker, application thereof and construction method of PTC prognosis evaluation model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant