CN108363907A - A kind of adenocarcinoma of lung personalization prognostic evaluation methods based on multi-gene expression characteristic spectrum - Google Patents

A kind of adenocarcinoma of lung personalization prognostic evaluation methods based on multi-gene expression characteristic spectrum Download PDF

Info

Publication number
CN108363907A
CN108363907A CN201810440855.4A CN201810440855A CN108363907A CN 108363907 A CN108363907 A CN 108363907A CN 201810440855 A CN201810440855 A CN 201810440855A CN 108363907 A CN108363907 A CN 108363907A
Authority
CN
China
Prior art keywords
gene
patient
survival
lung adenocarcinoma
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810440855.4A
Other languages
Chinese (zh)
Other versions
CN108363907B (en
Inventor
李文兴
李功华
黄京飞
赵旭东
代绍兴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Trace Expression Life Science Chongqing Co ltd
Original Assignee
Kunming Institute of Zoology of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kunming Institute of Zoology of CAS filed Critical Kunming Institute of Zoology of CAS
Priority to CN201810440855.4A priority Critical patent/CN108363907B/en
Publication of CN108363907A publication Critical patent/CN108363907A/en
Application granted granted Critical
Publication of CN108363907B publication Critical patent/CN108363907B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/30ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment

Landscapes

  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Medical Informatics (AREA)
  • Public Health (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Pathology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Databases & Information Systems (AREA)
  • Primary Health Care (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Epidemiology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

本发明公开了一种基于多基因表达特征谱的肺腺癌个性化预后评估方法,包括以下步骤:获取肺腺癌预后风险基因列表与基因权重;利用肺腺癌患者肿瘤组织转录组和生存数据构建预后评估模型;根据肺腺癌患者肿瘤组织的基因表达谱计算患者的风险得分;根据患者的风险得分计算患者每年的生存概率。本发明的方法得出的肺腺癌患者每年的生存概率与实际每年存活比率高度一致(线性相关R2=0.994,P值=2.86E‑43)。证实了该方法具有很高的预测准确性,与实际生存状态高度吻合。同时,对于每个肿瘤患者,本发明可以给出该患者特有的生存概率曲线。

The invention discloses a method for evaluating the individualized prognosis of lung adenocarcinoma based on a multi-gene expression profile, which comprises the following steps: obtaining a list of prognostic risk genes and gene weights for lung adenocarcinoma; using tumor tissue transcriptome and survival data of patients with lung adenocarcinoma Construct a prognosis assessment model; calculate the patient's risk score based on the gene expression profile of the tumor tissue of lung adenocarcinoma patients; calculate the patient's annual survival probability based on the patient's risk score. The annual survival probability of lung adenocarcinoma patients obtained by the method of the present invention is highly consistent with the actual annual survival rate (linear correlation R 2 =0.994, P value =2.86E-43). It is confirmed that the method has high prediction accuracy, which is highly consistent with the actual survival status. At the same time, for each tumor patient, the present invention can provide the specific survival probability curve of the patient.

Description

一种基于多基因表达特征谱的肺腺癌个性化预后评估方法A personalized prognostic assessment method for lung adenocarcinoma based on multiple gene expression profiles

技术领域technical field

本发明属于生物技术和医学领域,具体地说,涉及一种基于多基因表达特征谱的肺腺癌个性化预后评估方法。The invention belongs to the fields of biotechnology and medicine, and in particular relates to a personalized prognosis evaluation method for lung adenocarcinoma based on multi-gene expression profiles.

背景技术Background technique

肺腺癌占所有肺癌患者的约40%。肺癌是全球发病率最高的肿瘤,也是导致男性癌症死亡的首要原因。在女性人群中肺癌的发生率仅次于乳腺癌。全球疾病负担(GlobalBurden of Disease,GBD)数据显示,2016年全球患有气管、支气管或肺癌的人数超过280万,其中中国患病人数高达100万。2016年全球患有上述癌症的死亡人数为170万,占总死亡人数的3.12%。中国2016年死亡患者数为59万,占总死亡人数的6.11%。统计结果显示,从1990年到2016年全球气管、支气管和肺癌患病率和死亡率持续增长,中国患病率和死亡率也持续增长且增长趋势和全球增长趋势相对一致。Lung adenocarcinoma accounts for about 40% of all lung cancer patients. Lung cancer is the most common cancer worldwide and the leading cause of cancer death in men. The incidence of lung cancer in the female population is second only to breast cancer. Global Burden of Disease (GBD) data shows that in 2016, the number of people suffering from trachea, bronchus or lung cancer in the world exceeded 2.8 million, and the number of patients in China was as high as 1 million. In 2016, the number of deaths from the above-mentioned cancers worldwide was 1.7 million, accounting for 3.12% of the total deaths. In 2016, the number of patients who died in China was 590,000, accounting for 6.11% of the total number of deaths. Statistics show that from 1990 to 2016, the global prevalence and mortality of trachea, bronchus and lung cancer continued to increase, and the prevalence and mortality in China also continued to increase, and the growth trend was relatively consistent with the global growth trend.

目前国际上通用的的肿瘤分期方法是TNM分期系统,该系统是美国癌症联合委员会(American Joint Committee on Cancer,AJCC)提出的一种恶性肿瘤分类方法。美国国家癌症研究所(National Cancer Institute,NCI)对TNM分期的描述为:T指主要肿瘤的大小和范围,主要肿瘤通常被称为原发性肿瘤。N指患有癌症的附近淋巴结的数目。M指癌症是否已经转移,即从原发性肿瘤扩散到身体的其他部位。根据以上指标可将恶性肿瘤大致分为I期,II期,III期和IV期,其中分期越高表示肿瘤的恶性程度越高。TNM分期系统对肿瘤患者的治疗和预后评估有一定帮助。但是,由于不同个体中肿瘤的发生机制及体内微环境的不同,导致不同患者的生存时间差异巨大,TNM分期系统不能很好地反映出患者的预后状况。研究发现,对于某些诊断为I期的患者可能只有较短的生存期(1-2年),然而对于一些诊断为IV期的患者可能具有较长的生存期(5年及以上)。因此,TNM分期系统可能更倾向于描述一个癌症患者群体的平均水平,对个性化的诊断和治疗适用性较差。另一方面,对于诊断为晚期(III期、IV期)的患者,会给患者及医务人员造成一定的治疗方案选择困难,导致很多本来可以长时间生存的肿瘤患者由于过度医疗或医疗失当而提前死亡;而另一些本应进行适当治疗可以延长生存的患者由于放弃治疗或治疗不当同样导致肿瘤患者提前死亡。The current international tumor staging method is the TNM staging system, which is a malignant tumor classification method proposed by the American Joint Committee on Cancer (AJCC). The National Cancer Institute (NCI) describes the TNM staging as follows: T refers to the size and extent of the main tumor, which is usually called the primary tumor. N refers to the number of nearby lymph nodes with cancer. M refers to whether the cancer has metastasized, that is, spread from the primary tumor to other parts of the body. According to the above indicators, malignant tumors can be roughly divided into stage I, stage II, stage III and stage IV, and the higher the stage, the higher the degree of malignancy of the tumor. The TNM staging system is helpful to the treatment and prognosis evaluation of tumor patients. However, due to the differences in tumorigenesis mechanisms and in vivo microenvironments in different individuals, the survival time of different patients varies greatly, and the TNM staging system cannot well reflect the prognosis of patients. Studies have found that some patients diagnosed with stage I may have a short survival period (1-2 years), while some patients diagnosed with stage IV may have a longer survival period (5 years and above). Therefore, the TNM staging system may be more inclined to describe the average level of a cancer patient population, which is less applicable to individualized diagnosis and treatment. On the other hand, for patients diagnosed as advanced (stage III, stage IV), it will cause certain difficulties for patients and medical staff to choose a treatment plan, resulting in many tumor patients who could have survived for a long time due to excessive medical treatment or medical malpractice. and other patients who should have received appropriate treatment to prolong their survival also led to early death of tumor patients due to abandonment of treatment or improper treatment.

目前,有报道提出利用基因表达谱可以对肿瘤患者进行预后评估。但是,绝大多数报道只是使用单个或数个基因,只能对一个群体进行分类,对个体生存期只能进行定性的划分(如预后好、预后差两个指标)。因此,需要建立更精细的个性化肿瘤预后评估模型来评估患者的生存时间从而选择合适的治疗方案。At present, some reports suggest that the use of gene expression profiles can be used to assess the prognosis of cancer patients. However, most of the reports only use single or several genes, can only classify a group, and can only qualitatively divide the individual survival period (such as two indicators of good prognosis and poor prognosis). Therefore, it is necessary to establish a more refined personalized tumor prognosis assessment model to evaluate the patient's survival time and choose an appropriate treatment plan.

发明内容Contents of the invention

有鉴于此,本发明提供了一种基于多基因表达特征谱的肺腺癌个性化预后评估方法,能够准确预测患者每年的存活概率。In view of this, the present invention provides a personalized prognosis assessment method for lung adenocarcinoma based on multi-gene expression profiles, which can accurately predict the annual survival probability of patients.

为了解决上述技术问题,本发明公开了一种基于多基因表达特征谱的肺腺癌个性化预后评估方法,In order to solve the above technical problems, the present invention discloses a personalized prognosis evaluation method for lung adenocarcinoma based on multi-gene expression profiles,

包括以下步骤:Include the following steps:

步骤1、获取肺腺癌预后风险基因列表与基因权重;Step 1. Obtain the list of prognostic risk genes and gene weights for lung adenocarcinoma;

步骤2、利用肺腺癌患者肿瘤组织转录组和生存数据构建预后评估模型;Step 2, using the tumor tissue transcriptome and survival data of patients with lung adenocarcinoma to construct a prognosis assessment model;

步骤3、根据肺腺癌患者肿瘤组织的基因表达谱计算患者的风险得分;Step 3, calculating the risk score of the patient according to the gene expression profile of the tumor tissue of the lung adenocarcinoma patient;

步骤4、根据患者的风险得分计算患者每年的生存概率。Step 4. Calculate the patient's annual survival probability based on the patient's risk score.

可选地,所述步骤1中的获取肺腺癌预后风险基因列表与基因权重具体按照以下步骤实施:Optionally, the acquisition of the lung adenocarcinoma prognosis risk gene list and gene weights in the step 1 is specifically implemented according to the following steps:

步骤1.1、从Genomic Data Commons Data Portal数据库中下载肺腺癌患者肿瘤组织和癌旁组织转录组数据以及临床数据,获得肺腺癌患者肿瘤组织基因表达谱FPKM数值,进行对数转换;Step 1.1. Download the tumor tissue and paracancerous tissue transcriptome data and clinical data of lung adenocarcinoma patients from the Genomic Data Commons Data Portal database, obtain the FPKM value of the tumor tissue gene expression profile of lung adenocarcinoma patients, and perform logarithmic transformation;

步骤1.2、设总样本数为m,将所有样本根据其基因表达值的三分位数分为三组,其中,基因表达值为步骤1.1中获得的FPKM数值,用V表示,对第i个基因记为Vi,利用Cox比例风险模型计算第三分组相比第一分组的生存风险,得出每个基因的风险比HR和P值;定义P值<0.05具有显著性,筛选具有显著性的生存风险基因,记为n1;此外,计算每个基因与患者生存天数的相关性,得出每个基因的相关系数r和P值;定义P值<0.05具有显著性,筛选具有显著性的生存相关基因,记为n2;将生存风险基因和生存相关基因的交集定义为预后风险基因,记为n,则有:Step 1.2, set the total number of samples as m, divide all samples into three groups according to the tertiles of their gene expression values, where the gene expression value is the FPKM value obtained in step 1.1, denoted by V, for the ith The gene is denoted as V i , and the Cox proportional hazard model is used to calculate the survival risk of the third group compared with the first group, and the hazard ratio HR and P value of each gene are obtained; P value <0.05 is defined as significant, and the screening is significant The survival risk genes of , denoted as n 1 ; in addition, calculate the correlation between each gene and the patient's survival days, and obtain the correlation coefficient r and P value of each gene; define that P value <0.05 is significant, and the screening is significant The survival-related genes of , denoted as n 2 ; the intersection of survival risk genes and survival-related genes is defined as the prognosis risk gene, denoted as n, then:

n=n1∩n2 (1)n=n 1 ∩n 2 (1)

步骤1.3、根据第i个基因的风险比计算每个基因的权重Wi,计算公式为:Step 1.3. Calculate the weight W i of each gene according to the risk ratio of the i-th gene, and the calculation formula is:

其中i表示基因编号,HRi表示第i个基因的风险比;Where i represents the gene number, and HR i represents the hazard ratio of the i-th gene;

这样就计算得到每一个基因的权重;最终得到的肺腺癌预后风险基因列表与基因权重。In this way, the weight of each gene is calculated; the final lung adenocarcinoma prognosis risk gene list and gene weight are obtained.

可选地,所述的肺腺癌预后风险基因列表与基因权重具体见下表:Optionally, the list of prognostic risk genes and gene weights for lung adenocarcinoma are specifically shown in the following table:

可选地,所述步骤2中的利用肺腺癌患者肿瘤组织转录组和生存数据构建预后评估模型具体按照以下步骤实施:Optionally, in step 2, constructing a prognosis assessment model using tumor tissue transcriptome and survival data of patients with lung adenocarcinoma is specifically implemented according to the following steps:

步骤2.1、定义基因表达值为V,根据第i个基因在第j个样本中的表达值和权重计算第i个患者的风险得分Si;计算公式为:Step 2.1. Define the gene expression value V, and calculate the risk score S i of the i-th patient according to the expression value and weight of the i-th gene in the j-th sample; the calculation formula is:

其中i表示基因编号,Vij表示第i个基因在第j个样本中的表达值;Where i represents the gene number, V ij represents the expression value of the i-th gene in the j-th sample;

步骤2.2、将所有肺腺癌患者样本按照风险得分从低到高排序,使用滑动窗口模型对每50个样本计算平均风险得分计算公式为:Step 2.2, sort all lung adenocarcinoma patient samples according to the risk score from low to high, and use the sliding window model to calculate the average risk score for every 50 samples The calculation formula is:

其中j+49表示从样本j开始计数的后50个样本;Where j+49 represents the last 50 samples counted from sample j;

步骤2.3、使用Weibull分布对50个样本的生存数据进行曲线拟合,Weibull分布的概率密度函数为:Step 2.3. Use the Weibull distribution to perform curve fitting on the survival data of 50 samples. The probability density function of the Weibull distribution is:

其中k>0是形状(shape)参数,λ>0是分布的比例(scale)参数。Where k>0 is a shape parameter, and λ>0 is a scale parameter of the distribution.

步骤2.4、对每50个样本计算出所对应的kj和λj;根据经验,kj为一个相对固定的数值,均值为:Step 2.4, calculate for every 50 samples The corresponding k j and λ j ; according to experience, k j is a relatively fixed value, and the mean value is:

其中,kj为第j个样本到第j+49个样本生存曲线Weibull分布的形状参数;Among them, k j is the shape parameter of the Weibull distribution of the survival curve from the jth sample to the j+49th sample;

比例参数λj的变化范围较大,定义λj的函数关系为:The variation range of the proportional parameter λ j is relatively large, and the definition of λ j and The functional relationship is:

其中,λj表示第j个样本到第j+49个样本生存曲线Weibull分布的比例参数;Among them, λj represents the proportional parameter of the Weibull distribution of the survival curve from the jth sample to the j+49th sample;

其中e为自然对数的底,α、β为函数的参数,对上式取对数得:Where e is the base of the natural logarithm, α and β are the parameters of the function, and the logarithm of the above formula is obtained:

其中logλj为线性关系,通过线性拟合求解;where logλ j and is a linear relationship, solved by linear fitting;

根据平均风险得分与Weibull分布参数λj的拟合曲线,得出的函数关系为:According to the average risk score With the fitting curve of Weibull distribution parameter λ j , the function relationship obtained is:

代入该函数得出预测的λj′,λj′为用该函数计算出的预期分布参数,计算λj与λj′的相关性得相关系数R2=0.943,P值=2.96E-97。Will Substituting this function to get the predicted λ j ′, λ j ′ is the expected distribution parameter calculated by this function, and calculating the correlation between λ j and λ j ′, the correlation coefficient R 2 = 0.943, P value = 2.96E-97 .

可选地,所述步骤3中的根据肺腺癌患者肿瘤组织的基因表达谱计算患者的风险得分具体按照以下步骤实施:Optionally, the calculation of the risk score of the patient according to the gene expression profile of the tumor tissue of the lung adenocarcinoma patient in the step 3 is specifically implemented according to the following steps:

获取肺腺癌患者肿瘤组织的基因表达谱的FPKM数值,记为:Vi,其中,i为基因编号;第i基因对应的权重记为:Wi,其中,i为基因编号;患者风险得分记为:S;计算公式为:Obtain the FPKM value of the gene expression profile of the tumor tissue of patients with lung adenocarcinoma, recorded as: V i , where i is the gene number; the weight corresponding to the i-th gene is recorded as: W i , where i is the gene number; patient risk score Recorded as: S; the calculation formula is:

其中i为基因编号,n为基因个数。Where i is the gene number and n is the number of genes.

可选地,所述步骤4中的根据患者的风险得分计算患者每年的生存概率具体按照以下步骤实施:将患者的风险得分S带入Weibull分布的累积分布函数得出该患者的存活概率函数为:Optionally, the calculation of the patient's annual survival probability based on the patient's risk score in step 4 is specifically implemented in the following steps: the patient's risk score S is brought into the cumulative distribution function of the Weibull distribution to obtain the patient's survival probability function as :

其中t为时间,α、β、S、均为固定参数。Where t is time, α, β, S, are fixed parameters.

与现有技术相比,本发明可以获得包括以下技术效果:Compared with prior art, the present invention can obtain and comprise following technical effect:

1)连续:能预测肿瘤患者连续时间的生存概率。例如可以给出患者每个月的生存概率、患者每年的生存概率等。而目前临床采用的分型方法只能给出一个定性的判断。1) Continuous: It can predict the survival probability of tumor patients for continuous time. For example, the survival probability of the patient per month, the survival probability of the patient per year, and the like can be given. However, the current clinical classification method can only give a qualitative judgment.

2)更精确:基于多基因表达特征谱的肺腺癌个性化预后评估方法相比传统TNM分期能够更精确地反映患者的生存状态。2) More accurate: Compared with the traditional TNM staging, the personalized prognostic assessment method of lung adenocarcinoma based on multi-gene expression profiles can more accurately reflect the survival status of patients.

3)个性化:对于每个肿瘤患者,本发明可以给出该患者特有的生存概率曲线,这是一般肿瘤预后评估模型所不具备的。3) Personalization: For each tumor patient, the present invention can provide the patient's unique survival probability curve, which is not available in general tumor prognosis assessment models.

当然,实施本发明的任一产品并不一定需要同时达到以上所述的所有技术效果。Of course, implementing any product of the present invention does not necessarily need to achieve all the technical effects described above at the same time.

附图说明Description of drawings

此处所说明的附图用来提供对本发明的进一步理解,构成本发明的一部分,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:The accompanying drawings described here are used to provide a further understanding of the present invention, and constitute a part of the present invention. The schematic embodiments of the present invention and their descriptions are used to explain the present invention, and do not constitute improper limitations to the present invention. In the attached picture:

图1是本发明预测平均每年生存概率与实际每年存活概率比较;Fig. 1 is that the average annual survival probability predicted by the present invention compares with the actual annual survival probability;

图2是本发明TNM肿瘤分期与患者生存时间的相关性;Fig. 2 is the correlation between TNM tumor staging of the present invention and patient survival time;

图3是本发明平均风险得分与Weibull分布参数scale的拟合曲线;Fig. 3 is the fitting curve of average risk score of the present invention and Weibull distribution parameter scale;

图4是本发明平均风险得分与Weibull分布参数scale的拟合残差图;Fig. 4 is the fitting residual figure of average risk score of the present invention and Weibull distribution parameter scale;

图5是本发明个性化肺腺癌预后评估结果。Fig. 5 is the result of the personalized lung adenocarcinoma prognosis assessment of the present invention.

具体实施方式Detailed ways

以下将配合实施例来详细说明本发明的实施方式,藉此对本发明如何应用技术手段来解决技术问题并达成技术功效的实现过程能充分理解并据以实施。The implementation of the present invention will be described in detail below with examples, so as to fully understand and implement the implementation process of how the present invention uses technical means to solve technical problems and achieve technical effects.

本发明公开了一种基于多基因表达特征谱的肺腺癌个性化预后评估方法,包括以下步骤:The invention discloses a method for evaluating the individualized prognosis of lung adenocarcinoma based on a polygene expression profile, comprising the following steps:

步骤1、获取肺腺癌预后风险基因列表与基因权重;Step 1. Obtain the list of prognostic risk genes and gene weights for lung adenocarcinoma;

步骤1.1、从Genomic Data Commons Data Portal数据库中下载肺腺癌患者肿瘤组织和癌旁组织转录组数据以及临床数据,获得肺腺癌患者肿瘤组织基因表达谱FPKM(Fragments Per Kilobase of transcript per Million fragments mapped)数值,进行对数转换(log2)。Step 1.1, download the tumor tissue and paracancerous tissue transcriptome data and clinical data of lung adenocarcinoma patients from the Genomic Data Commons Data Portal database, and obtain the tumor tissue gene expression profile FPKM ( Fragments P er Kilobase of transcript per Million fragments mapped) values, logarithmic transformation (log2).

步骤1.2、设总样本数为m,将所有样本根据其基因表达值的三分位数分为三组,其中,基因表达值为步骤1.1中获得的FPKM数值,用V表示,对第i个基因记为Vi,利用Cox比例风险模型计算第三分组相比第一分组的生存风险,得出每个基因的风险比HR和P值。定义P值<0.05具有显著性,筛选具有显著性的生存风险基因,记为n1。此外,计算每个基因与患者生存天数的相关性,得出每个基因的相关系数r和P值。定义P值<0.05具有显著性,筛选具有显著性的生存相关基因,记为n2。将生存风险基因和生存相关基因的交集定义为预后风险基因,记为n,则有:Step 1.2, set the total number of samples as m, divide all samples into three groups according to the tertiles of their gene expression values, where the gene expression value is the FPKM value obtained in step 1.1, denoted by V, for the ith The gene was denoted as V i , and the survival risk of the third group compared with the first group was calculated using the Cox proportional hazards model, and the hazard ratio HR and P value of each gene were obtained. A P value < 0.05 was defined as significant, and significant survival risk genes were screened out, which were recorded as n 1 . In addition, the correlation between each gene and the patient's survival days was calculated, and the correlation coefficient r and P value of each gene were obtained. A P value < 0.05 was defined as significant, and significant survival-related genes were screened, which were recorded as n 2 . The intersection of survival risk genes and survival-related genes is defined as the prognosis risk gene, denoted as n, then:

n=n1∩n2 (1)n=n 1 ∩n 2 (1)

步骤1.3、根据第i个基因的风险比计算每个基因的权重Wi,计算公式为:Step 1.3. Calculate the weight W i of each gene according to the risk ratio of the i-th gene, and the calculation formula is:

其中i表示基因编号,HRi表示第i个基因的风险比。Where i represents the gene number, and HR i represents the hazard ratio of the i-th gene.

这样就计算得到每一个基因的权重;最终得到的肺腺癌预后风险基因列表与基因权重见表1。In this way, the weight of each gene was calculated; the final list of lung adenocarcinoma prognostic risk genes and gene weights are shown in Table 1.

表1基因名称和权重Table 1 Gene name and weight

步骤2、利用肺腺癌患者肿瘤组织转录组和生存数据构建预后评估模型;具体为:Step 2, using the tumor tissue transcriptome and survival data of patients with lung adenocarcinoma to construct a prognosis assessment model; specifically:

步骤2.1、定义基因表达值为V,根据第i个基因在第j个样本中的表达值和权重计算第i个患者的风险得分Sj;计算公式为:Step 2.1, define the gene expression value V, and calculate the risk score S j of the i-th patient according to the expression value and weight of the i-th gene in the j-th sample; the calculation formula is:

其中i表示基因编号,j表示患者编号,Vij表示第i个基因在第j个样本中的表达值;Where i represents the gene number, j represents the patient number, V ij represents the expression value of the i-th gene in the j-th sample;

步骤2.2、将所有肺腺癌患者样本按照风险得分从低到高排序,使用滑动窗口模型对每50个样本计算平均风险得分计算公式为:Step 2.2, sort all lung adenocarcinoma patient samples according to the risk score from low to high, and use the sliding window model to calculate the average risk score for every 50 samples The calculation formula is:

其中j+49表示从样本j开始计数的后50个样本。where j+49 represents the last 50 samples counted from sample j.

步骤2.3、使用Weibull分布对50个样本的生存数据进行曲线拟合,Weibull分布的概率密度函数为:Step 2.3. Use the Weibull distribution to perform curve fitting on the survival data of 50 samples. The probability density function of the Weibull distribution is:

其中k>0是形状(shape)参数,λ>0是分布的比例(scale)参数。Where k>0 is a shape parameter, and λ>0 is a scale parameter of the distribution.

步骤2.4、对每50个样本计算出所对应的kj和λj。根据经验,kj为一个相对固定的数值,均值为:Step 2.4, calculate for every 50 samples The corresponding k j and λ j . According to experience, k j is a relatively fixed value, and the mean value is:

其中,kj为第j个样本到第j+49个样本生存曲线Weibull分布的形状参数;Among them, k j is the shape parameter of the Weibull distribution of the survival curve from the jth sample to the j+49th sample;

比例参数λj的变化范围较大,定义λj的函数关系为:The variation range of the proportional parameter λ j is relatively large, and the definition of λ j and The functional relationship is:

其中,λj表示第j个样本到第j+49个样本生存曲线Weibull分布的比例参数;Among them, λj represents the proportional parameter of the Weibull distribution of the survival curve from the jth sample to the j+49th sample;

其中e为自然对数的底,α、β为函数的参数,对上式取对数可得:Where e is the base of the natural logarithm, α and β are the parameters of the function, and the logarithm of the above formula can be obtained:

其中logλj为线性关系,可通过线性拟合求解。where logλ j and It is a linear relationship, which can be solved by linear fitting.

如图3所示为平均风险得分与Weibull分布参数λj的拟合曲线,得出的函数关系为:The average risk score is shown in Figure 3 With the fitting curve of Weibull distribution parameter λ j , the function relationship obtained is:

代入该函数得出预测的λj′,λj′为用该函数计算出的预期分布参数,计算λj与λj′的相关性可得相关系数R2=0.943,P值=2.96E-97。Will Substituting this function to get the predicted λ j ′, λ j ′ is the expected distribution parameter calculated by this function, calculating the correlation between λ j and λ j ′, the correlation coefficient R 2 = 0.943, P value = 2.96E- 97.

通过分析拟合残差图和Q-Q图(图4),表明该模型达到显著性,即平均风险得分与Weibull分布参数λj的函数关系是可信的。Analysis of the fitted residual plots and QQ plots (Fig. 4) shows that the model reaches significance, i.e. the average risk score A functional relationship with the Weibull distribution parameter λj is credible.

步骤3、根据肺腺癌患者肿瘤组织的基因表达谱计算患者的风险得分;具体为:Step 3. Calculate the risk score of the patient according to the gene expression profile of the tumor tissue of the patient with lung adenocarcinoma; specifically:

获取肺腺癌患者肿瘤组织的基因表达谱的FPKM数值(应包含全部或大部分表1中所列基因),记为:Vi(i为基因编号);表1中第i个基因对应的权重记为:Wi(i为基因编号);患者风险得分记为:S;计算公式为:Obtain the FPKM value of the gene expression profile of the tumor tissue of patients with lung adenocarcinoma (should include all or most of the genes listed in Table 1), denoted as: V i (i is the gene number); the i-th gene in Table 1 corresponds to The weight is recorded as: W i (i is the gene number); the patient risk score is recorded as: S; the calculation formula is:

其中i为基因编号,n为表1中列出的基因个数。Where i is the gene number, and n is the number of genes listed in Table 1.

步骤4、根据患者的风险得分计算患者每年的生存概率,具体为:Step 4. Calculate the patient's annual survival probability according to the patient's risk score, specifically:

将患者的风险得分S带入Weibull分布的累积分布函数可以得出该患者的存活概率函数为:Substituting the patient's risk score S into the cumulative distribution function of the Weibull distribution gives the patient's survival probability function as:

其中t为时间,α、β、S、均为固定参数。Where t is time, α, β, S, are fixed parameters.

如图5所示为一个患者的存活概率曲线,图中横坐标为天数,纵坐标为存活概率。患者每年的存活概率在曲线下方标出。右上角黑色方框中标出患者存活的实际天数,状态(Status)1表示患者已经死亡。曲线上红色点(即Death)标出患者死亡时对应的天数和存活概率,图中患者死亡时对应的存活概率在0.22左右。Figure 5 shows a patient's survival probability curve, where the abscissa in the figure is the number of days, and the ordinate is the survival probability. The patient's annual probability of survival is plotted below the curve. The actual number of days the patient survived is marked in the black box in the upper right corner, and the status (Status) 1 indicates that the patient has died. The red point on the curve (Death) marks the corresponding days and survival probability of the patient when he died. In the figure, the survival probability corresponding to the death of the patient is about 0.22.

综上所述,本发明利用TCGA-LUAD转录组和临床数据,对所有肺腺癌患者进行了个性化的生存预测,并利用交叉验证的方法对得到的结果进行了验证。结果显示采用多基因表达特征谱的肺腺癌个性化预后评估方法得出的肺腺癌患者每年的生存概率与实际每年存活比率高度一致(线性相关R2=0.994,P值=2.86E-43,图1)。证实了该方法具有很高的预测准确性,与实际生存状态高度吻合。In summary, the present invention uses TCGA-LUAD transcriptome and clinical data to perform personalized survival prediction for all patients with lung adenocarcinoma, and uses the cross-validation method to verify the obtained results. The results showed that the annual survival probability of lung adenocarcinoma patients obtained by the individualized prognosis assessment method of lung adenocarcinoma using multi-gene expression profiles was highly consistent with the actual annual survival rate (linear correlation R 2 =0.994, P value =2.86E-43 ,figure 1). It is confirmed that the method has high prediction accuracy, which is highly consistent with the actual survival status.

如图2所示,TNM分期与肺腺癌患者的生存时间仅具有较低的负相关。图1与图2相比较可以得出基于多基因表达特征谱的肺腺癌个性化预后评估方法相比传统TNM分期能够更精确地反映患者的生存状态。As shown in Figure 2, TNM stage had only a low negative correlation with the survival time of patients with lung adenocarcinoma. Comparing Figure 1 with Figure 2, it can be concluded that the personalized prognosis assessment method for lung adenocarcinoma based on the multi-gene expression profile can more accurately reflect the survival status of patients than the traditional TNM staging.

如图5所示,本发明能预测肿瘤患者连续时间的生存概率。例如可以给出患者每个月的生存概率、患者每年的生存概率等。而目前临床采用的分型方法只能给出一个定性的判断。对于每个肿瘤患者,本发明可以给出该患者特有的生存概率曲线,这是一般肿瘤预后评估模型所不具备的。As shown in Fig. 5, the present invention can predict the survival probability of tumor patients in continuous time. For example, the survival probability of the patient per month, the survival probability of the patient per year, and the like can be given. However, the current clinical classification method can only give a qualitative judgment. For each tumor patient, the present invention can provide the specific survival probability curve of the patient, which is not available in general tumor prognosis assessment models.

上述说明示出并描述了发明的若干优选实施例,但如前所述,应当理解发明并非局限于本文所披露的形式,不应看作是对其他实施例的排除,而可用于各种其他组合、修改和环境,并能够在本文所述发明构想范围内,通过上述教导或相关领域的技术或知识进行改动。而本领域人员所进行的改动和变化不脱离发明的精神和范围,则都应在发明所附权利要求的保护范围内。The above description shows and describes several preferred embodiments of the invention, but as previously stated, it should be understood that the invention is not limited to the form disclosed herein, and should not be regarded as excluding other embodiments, but can be used in various other embodiments. Combinations, modifications and circumstances, and can be modified within the scope of the inventive concept described herein, by the above teachings or by skill or knowledge in the relevant field. However, changes and changes made by those skilled in the art do not depart from the spirit and scope of the invention, and should be within the protection scope of the appended claims of the invention.

Claims (6)

1.一种基于多基因表达特征谱的肺腺癌个性化预后评估方法,其特征在于,包括以下步骤:1. A lung adenocarcinoma personalized prognosis assessment method based on polygene expression profile, it is characterized in that, comprises the following steps: 步骤1、获取肺腺癌预后风险基因列表与基因权重;Step 1. Obtain the list of prognostic risk genes and gene weights for lung adenocarcinoma; 步骤2、利用肺腺癌患者肿瘤组织转录组和生存数据构建预后评估模型;Step 2, using the tumor tissue transcriptome and survival data of patients with lung adenocarcinoma to construct a prognosis assessment model; 步骤3、根据肺腺癌患者肿瘤组织的基因表达谱计算患者的风险得分;Step 3, calculating the risk score of the patient according to the gene expression profile of the tumor tissue of the lung adenocarcinoma patient; 步骤4、根据患者的风险得分计算患者每年的生存概率。Step 4. Calculate the patient's annual survival probability based on the patient's risk score. 2.根据权利要求1所述的预后评估方法,其特征在于,所述步骤1中的获取肺腺癌预后风险基因列表与基因权重具体按照以下步骤实施:2. The method for assessing prognosis according to claim 1, characterized in that the acquisition of lung adenocarcinoma prognostic risk gene list and gene weight in said step 1 is specifically implemented according to the following steps: 步骤1.1、从Genomic Data Commons Data Portal数据库中下载肺腺癌患者肿瘤组织和癌旁组织转录组数据以及临床数据,获得肺腺癌患者肿瘤组织基因表达谱FPKM数值,进行对数转换;Step 1.1. Download the tumor tissue and paracancerous tissue transcriptome data and clinical data of lung adenocarcinoma patients from the Genomic Data Commons Data Portal database, obtain the FPKM value of the tumor tissue gene expression profile of lung adenocarcinoma patients, and perform logarithmic transformation; 步骤1.2、设总样本数为m,将所有样本根据其基因表达值的三分位数分为三组,其中,基因表达值为步骤1.1中获得的FPKM数值,用V表示,对第i个基因记为Vi,利用Cox比例风险模型计算第三分组相比第一分组的生存风险,得出每个基因的风险比HR和P值;定义P值<0.05具有显著性,筛选具有显著性的生存风险基因,记为n1;此外,计算每个基因与患者生存天数的相关性,得出每个基因的相关系数r和P值;定义P值<0.05具有显著性,筛选具有显著性的生存相关基因,记为n2;将生存风险基因和生存相关基因的交集定义为预后风险基因,记为n,则有:Step 1.2, set the total number of samples as m, divide all samples into three groups according to the tertiles of their gene expression values, where the gene expression value is the FPKM value obtained in step 1.1, denoted by V, for the ith The gene is denoted as V i , and the Cox proportional hazard model is used to calculate the survival risk of the third group compared with the first group, and the hazard ratio HR and P value of each gene are obtained; P value <0.05 is defined as significant, and the screening is significant The survival risk genes of , denoted as n 1 ; in addition, calculate the correlation between each gene and the patient's survival days, and obtain the correlation coefficient r and P value of each gene; define that P value <0.05 is significant, and the screening is significant The survival-related genes of , denoted as n 2 ; the intersection of survival risk genes and survival-related genes is defined as the prognosis risk gene, denoted as n, then: n=n1∩n2 (1)n=n 1 ∩n 2 (1) 步骤1.3、根据第i个基因的风险比计算每个基因的权重Wi,计算公式为:Step 1.3. Calculate the weight W i of each gene according to the risk ratio of the i-th gene, and the calculation formula is: 其中i表示基因编号,HRi表示第i个基因的风险比;Where i represents the gene number, and HR i represents the hazard ratio of the i-th gene; 这样就计算得到每一个基因的权重;最终得到的肺腺癌预后风险基因列表与基因权重。In this way, the weight of each gene is calculated; the final lung adenocarcinoma prognosis risk gene list and gene weight are obtained. 3.根据权利要求1所述的预后评估方法,其特征在于,所述的肺腺癌预后风险基因列表与基因权重具体见下表:3. The prognosis assessment method according to claim 1, wherein the list of prognostic risk genes and gene weights of the lung adenocarcinoma are specifically shown in the following table: 4.根据权利要求1所述的预后评估方法,其特征在于,所述步骤2中的利用肺腺癌患者肿瘤组织转录组和生存数据构建预后评估模型具体按照以下步骤实施:4. The method for evaluating prognosis according to claim 1, characterized in that, in said step 2, the use of tumor tissue transcriptome and survival data of patients with lung adenocarcinoma to construct a prognosis evaluation model is specifically implemented according to the following steps: 步骤2.1、定义基因表达值为V,根据第i个基因在第j个样本中的表达值和权重计算第i个患者的风险得分Sj;计算公式为:Step 2.1, define the gene expression value V, and calculate the risk score S j of the i-th patient according to the expression value and weight of the i-th gene in the j-th sample; the calculation formula is: 其中i表示基因编号,Vij表示第i个基因在第j个样本中的表达值;Where i represents the gene number, V ij represents the expression value of the i-th gene in the j-th sample; 步骤2.2、将所有肺腺癌患者样本按照风险得分从低到高排序,使用滑动窗口模型对每50个样本计算平均风险得分计算公式为:Step 2.2, sort all lung adenocarcinoma patient samples according to the risk score from low to high, and use the sliding window model to calculate the average risk score for every 50 samples The calculation formula is: 其中j+49表示从样本j开始计数的后50个样本;Where j+49 represents the last 50 samples counted from sample j; 步骤2.3、使用Weibull分布对50个样本的生存数据进行曲线拟合,Weibull分布的概率密度函数为:Step 2.3. Use the Weibull distribution to perform curve fitting on the survival data of 50 samples. The probability density function of the Weibull distribution is: 其中k>0是形状(shape)参数,λ>0是分布的比例(scale)参数。Where k>0 is a shape parameter, and λ>0 is a scale parameter of the distribution. 步骤2.4、对每50个样本计算出所对应的kj和λj;根据经验,kj为一个相对固定的数值,均值为:Step 2.4, calculate for every 50 samples The corresponding k j and λ j ; according to experience, k j is a relatively fixed value, and the mean value is: 其中,kj为第j个样本到第j+49个样本生存曲线Weibull分布的形状参数;Among them, k j is the shape parameter of the Weibull distribution of the survival curve from the jth sample to the j+49th sample; 比例参数λj的变化范围较大,定义λj的函数关系为:The variation range of the proportional parameter λ j is relatively large, and the definition of λ j and The functional relationship is: 其中,λj表示第j个样本到第j+49个样本生存曲线Weibull分布的比例参数;Among them, λj represents the proportional parameter of the Weibull distribution of the survival curve from the jth sample to the j+49th sample; 其中e为自然对数的底,α、β为函数的参数,对上式取对数得:Where e is the base of the natural logarithm, α and β are the parameters of the function, and the logarithm of the above formula is obtained: 其中logλj为线性关系,通过线性拟合求解;where logλ j and is a linear relationship, solved by linear fitting; 根据平均风险得分与Weibull分布参数λj的拟合曲线,得出的函数关系为:According to the average risk score With the fitting curve of Weibull distribution parameter λ j , the function relationship obtained is: 代入该函数得出预测的λj′,λj′为用该函数计算出的预期分布参数,计算λj与λj′的相关性得相关系数R2=0.943,P值=2.96E-97。Will Substituting this function to get the predicted λ j ′, λ j ′ is the expected distribution parameter calculated by this function, and calculating the correlation between λ j and λ j ′, the correlation coefficient R 2 = 0.943, P value = 2.96E-97 . 5.根据权利要求1所述的预后评估方法,其特征在于,所述步骤3中的根据肺腺癌患者肿瘤组织的基因表达谱计算患者的风险得分具体按照以下步骤实施:5. The prognosis assessment method according to claim 1, characterized in that the calculation of the patient's risk score according to the gene expression profile of the lung adenocarcinoma patient's tumor tissue in said step 3 is specifically implemented according to the following steps: 获取肺腺癌患者肿瘤组织的基因表达谱的FPKM数值,记为:Vi,其中,i为基因编号;第i基因对应的权重记为:Wi,其中,i为基因编号;患者风险得分记为:S;计算公式为:Obtain the FPKM value of the gene expression profile of the tumor tissue of patients with lung adenocarcinoma, recorded as: V i , where i is the gene number; the weight corresponding to the i-th gene is recorded as: W i , where i is the gene number; patient risk score Recorded as: S; the calculation formula is: 其中i为基因编号,n为表1中列出的基因个数。Where i is the gene number, and n is the number of genes listed in Table 1. 6.根据权利要求1所述的预后评估方法,其特征在于,所述步骤4中的根据患者的风险得分计算患者每年的生存概率具体按照以下步骤实施:将患者的风险得分S带入Weibull分布的累积分布函数得出该患者的存活概率函数为:6. The prognosis assessment method according to claim 1, characterized in that the calculation of the patient's annual survival probability according to the patient's risk score in said step 4 is specifically implemented according to the following steps: bringing the patient's risk score S into Weibull distribution The cumulative distribution function of gives the patient's survival probability function as: 其中t为时间,α、β、S、均为固定参数。Where t is time, α, β, S, are fixed parameters.
CN201810440855.4A 2018-05-09 2018-05-09 Lung adenocarcinoma personalized prognosis evaluation method based on polygene expression profile Active CN108363907B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810440855.4A CN108363907B (en) 2018-05-09 2018-05-09 Lung adenocarcinoma personalized prognosis evaluation method based on polygene expression profile

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810440855.4A CN108363907B (en) 2018-05-09 2018-05-09 Lung adenocarcinoma personalized prognosis evaluation method based on polygene expression profile

Publications (2)

Publication Number Publication Date
CN108363907A true CN108363907A (en) 2018-08-03
CN108363907B CN108363907B (en) 2022-01-18

Family

ID=63012041

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810440855.4A Active CN108363907B (en) 2018-05-09 2018-05-09 Lung adenocarcinoma personalized prognosis evaluation method based on polygene expression profile

Country Status (1)

Country Link
CN (1) CN108363907B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111009292A (en) * 2019-11-20 2020-04-14 华南理工大学 Method for detecting critical point of phase transition in complex biological system based on single-sample sKLD index
CN111354462A (en) * 2020-04-14 2020-06-30 中山大学孙逸仙纪念医院 Advanced breast cancer survival probability prediction nomogram, survival probability prediction method and patient classification method
CN111676288A (en) * 2020-06-19 2020-09-18 中国医学科学院肿瘤医院 A system for predicting the prognosis of patients with lung adenocarcinoma and its application
CN111793692A (en) * 2020-08-04 2020-10-20 中国科学院昆明动物研究所 A combination of characteristic miRNA expression profiles and an early prediction method for lung squamous cell carcinoma
CN111850124A (en) * 2020-08-04 2020-10-30 中国科学院昆明动物研究所 A combination of characteristic lincRNA expression profiles and an early prediction method for lung squamous cell carcinoma
GR1010022B (en) * 2020-03-17 2021-06-01 Πανεπιστημιο Πατρων Methods for determining prognosis of a subject with lung adenocarcinoma and responsiveness to treatment
CN114341367A (en) * 2019-08-27 2022-04-12 上海善准生物科技有限公司 Lung adenocarcinoma molecular typing and survival risk gene group, diagnosis product and application

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101688240A (en) * 2007-04-10 2010-03-31 国立台湾大学 Prediction of post-treatment survival expectation in cancer patients by micrornas
CN102203619A (en) * 2008-10-31 2011-09-28 圣文森特医院悉尼有限公司 Method of prognosis in chronic kidney disease
CN106202988A (en) * 2016-10-11 2016-12-07 哈尔滨工业大学深圳研究生院 The Stepwise multiple-regression model of a kind of predictive disease life cycle and application
CN106407689A (en) * 2016-09-27 2017-02-15 牟合(上海)生物科技有限公司 Stomach cancer prognostic marker screening and classifying method based on gene expression profile
CN107742062A (en) * 2017-11-22 2018-02-27 复旦大学 A method for predicting the prognosis of FIGO stage IIIC serous ovarian cancer based on gene expression
US20180100199A1 (en) * 2016-10-06 2018-04-12 Gwangju Institute Of Science And Technology Method for determining prognosis of breast cancer patient by using gene expression data

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101688240A (en) * 2007-04-10 2010-03-31 国立台湾大学 Prediction of post-treatment survival expectation in cancer patients by micrornas
CN102203619A (en) * 2008-10-31 2011-09-28 圣文森特医院悉尼有限公司 Method of prognosis in chronic kidney disease
CN106407689A (en) * 2016-09-27 2017-02-15 牟合(上海)生物科技有限公司 Stomach cancer prognostic marker screening and classifying method based on gene expression profile
US20180100199A1 (en) * 2016-10-06 2018-04-12 Gwangju Institute Of Science And Technology Method for determining prognosis of breast cancer patient by using gene expression data
CN106202988A (en) * 2016-10-11 2016-12-07 哈尔滨工业大学深圳研究生院 The Stepwise multiple-regression model of a kind of predictive disease life cycle and application
CN107742062A (en) * 2017-11-22 2018-02-27 复旦大学 A method for predicting the prognosis of FIGO stage IIIC serous ovarian cancer based on gene expression

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
王先培: "《测控系统可靠性基础》", 31 July 2012 *
贾红英等: ""神经胶质瘤术后预后因素的研究"", 《中华流行病学杂质》 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114341367A (en) * 2019-08-27 2022-04-12 上海善准生物科技有限公司 Lung adenocarcinoma molecular typing and survival risk gene group, diagnosis product and application
CN114341367B (en) * 2019-08-27 2024-04-05 上海善准生物科技有限公司 Molecular typing and survival risk gene group for lung adenocarcinoma, diagnostic product and application
CN111009292A (en) * 2019-11-20 2020-04-14 华南理工大学 Method for detecting critical point of phase transition in complex biological system based on single-sample sKLD index
CN111009292B (en) * 2019-11-20 2023-04-21 华南理工大学 A Method for Detecting Phase Transition Critical Points of Complex Biological Systems Based on Single-Sample sKLD Index
GR1010022B (en) * 2020-03-17 2021-06-01 Πανεπιστημιο Πατρων Methods for determining prognosis of a subject with lung adenocarcinoma and responsiveness to treatment
CN111354462A (en) * 2020-04-14 2020-06-30 中山大学孙逸仙纪念医院 Advanced breast cancer survival probability prediction nomogram, survival probability prediction method and patient classification method
CN111354462B (en) * 2020-04-14 2024-01-09 中山大学孙逸仙纪念医院 Advanced breast cancer survival probability prediction nomogram, survival probability prediction method and patient classification method
CN111676288A (en) * 2020-06-19 2020-09-18 中国医学科学院肿瘤医院 A system for predicting the prognosis of patients with lung adenocarcinoma and its application
CN111676288B (en) * 2020-06-19 2022-09-06 中国医学科学院肿瘤医院 System for predicting lung adenocarcinoma patient prognosis and application thereof
CN111793692A (en) * 2020-08-04 2020-10-20 中国科学院昆明动物研究所 A combination of characteristic miRNA expression profiles and an early prediction method for lung squamous cell carcinoma
CN111850124A (en) * 2020-08-04 2020-10-30 中国科学院昆明动物研究所 A combination of characteristic lincRNA expression profiles and an early prediction method for lung squamous cell carcinoma

Also Published As

Publication number Publication date
CN108363907B (en) 2022-01-18

Similar Documents

Publication Publication Date Title
CN108363907A (en) A kind of adenocarcinoma of lung personalization prognostic evaluation methods based on multi-gene expression characteristic spectrum
CN108630317B (en) A method for individualized prognosis assessment of liver cancer based on multi-gene expression profile
CN108470111B (en) Stomach cancer personalized prognosis evaluation method based on polygene expression profile
CN108648826B (en) Pancreatic cancer personalized prognosis evaluation method based on polygene expression profile
Wang et al. Semiquantitative computed tomography characteristics for lung adenocarcinoma and their association with lung cancer survival
CN111564214B (en) Establishment and verification method of a breast cancer prognosis assessment model based on 7 special genes
CN114891887A (en) Method for screening triple negative breast cancer prognosis gene marker
CN114203256B (en) MIBC typing and prognosis prediction model construction method based on microbial abundance
CN108611416B (en) A personalized prognostic assessment method for cervical cancer based on multi-gene expression profiles
CN108647493B (en) Individualized prognosis evaluation method for renal clear cell carcinoma
Wu et al. Integrating tumor and nodal imaging characteristics at baseline and mid-treatment computed tomography scans to predict distant metastasis in oropharyngeal cancer treated with concurrent chemoradiotherapy
CN107561280B (en) A kit for predicting breast cancer recurrence
CN108733980B (en) Glioblastoma multiforme personalized prognosis evaluation method based on polygene expression profile
Luo et al. hsa‐mir‐3199‐2 and hsa‐mir‐1293 as novel prognostic biomarkers of papillary renal cell carcinoma by COX ratio risk regression model screening
Wang et al. Value of radiomics model based on multi-parametric magnetic resonance imaging in predicting epidermal growth factor receptor mutation status in patients with lung adenocarcinoma
CN108320806B (en) A method for individualized prognostic assessment of colon cancer based on multi-gene expression profiles
Ye et al. An online survival predictor in glioma patients using machine learning based on WHO CNS5 data
US20180100199A1 (en) Method for determining prognosis of breast cancer patient by using gene expression data
Yang et al. Development and validation of a DeepSurv nomogram to predict survival outcomes and guide personalized adjuvant chemotherapy in non-small cell lung cancer
CN114015773B (en) Application of systemic inflammatory response index in prognosis evaluation of gastrointestinal diffuse large B cell lymphoma
CN115937130A (en) Image processing method for predicting ovarian cancer Ki-67 expression based on dual-energy CT
CN113658696A (en) Prediction system for jointly predicting gastric cancer prognosis based on patient age, nutritional indexes, tumor stages and tumor markers
CN114171200A (en) PTC (Positive temperature coefficient) prognosis marker, application thereof and construction method of PTC prognosis evaluation model
Jiang et al. A competing risk nomogram for predicting cancer‐specific death of patients with buccal mucosa cancer
CN110055328A (en) A kind of adenocarcinoma of lung diagnosis marker based on metabolic gene spectrum

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20241231

Address after: Unit 8-1, Building 6, No. 153 Jinyu Avenue, Cuntan Street, Liangjiang New District, Jiangbei District, Chongqing City, 400000

Patentee after: Trace Expression Life Science (Chongqing) Co.,Ltd.

Country or region after: China

Address before: No.32, Jiaochang East Road, Wuhua District, Kunming, Yunnan 650000

Patentee before: KUNMING INSTITUTE OF ZOOLOGY CHINESE ACADEMY OF SCIENCES

Country or region before: China