KR101504818B1

KR101504818B1 - Novel system for predicting prognosis of gastric cancer

Info

Publication number: KR101504818B1
Application number: KR1020130037335A
Authority: KR
Inventors: 허용민; 노성훈; 서진석; 정재호; 박은성
Original assignee: 연세대학교 산학협력단
Priority date: 2013-04-05
Filing date: 2013-04-05
Publication date: 2015-03-24
Also published as: KR20140121523A

Abstract

본 발명은 위암의 예후 예측이 가능한 신규한 예후 예측 시스템에 관한 것으로, 보다 상세하게는, 유전자 집합 발현 비교 분석법을 통해 위암의 수술에 의한 절제 후 임상 결과를 예측할 수 있다.The present invention relates to a novel prognostic prediction system capable of predicting the prognosis of stomach cancer, and more particularly, it is possible to predict the clinical outcome after surgical resection of stomach cancer through comparative analysis of gene expression.

Description

[0001] The present invention relates to a novel system for predicting prognosis of gastric cancer,

본 발명은 유전자 발현 비교 분석법을 통해 위암의 예후 예측이 가능한 신규한 예후 예측 시스템에 관한 것이다.
The present invention relates to a novel prognostic prediction system capable of predicting the prognosis of gastric cancer through comparative gene expression analysis.

위선암(Gastric adeno-carcinoma)은 2000년 700,349명의 사망에서 두 번째 원인으로, 세계에서 가장 일반적으로 진단된 네 번째 암이다. 몇 가지 역학적 및 조직병리학적 특징들을 갖는 단일 이질적 질환으로 간주하고 있다. 위암 치료는 주로 환자를 수술 만으로 또는 수술과 화학요법으로 치료하여야 하는지를 결정하는 TNM(tumor, node, metastasis) 병기결정 같은 임상적 파라미터에 근거한다. 위암은 유방암과 대장암 등과 달리 TNM 병기 시스템에 따라서 1기에서 4기까지 명확하게 차이가 난다. 즉, 1기의 경우에는 5년 생존율이 90% 이상이며, 4기의 경우에는 20% 이하로 큰 차이를 보인다. 그러므로 TNM 병기 시스템의 예후 예측력이 매우 뛰어남을 알 수 있다 [참고문헌, 7th edition of the AJCC cancer staging Manual: stomach. Ann Surg Oncol 2010;17:3077-3079]. 상기 병기 시스템에 기반을 두어 위암은 흔히 조기 위암(Early Gastric Cancer), 국소진행형(Locally Advanced Gastric Cancer), 국소 침윤형(Locally Advanced Invasive Gastric Cancer) 및 전이 위암(Metastatic Gastric Cancer) 등으로 나눌 수 있다. Gastric adenocarcinoma is the fourth most common cancer in the world, the second most common cause of cancer death in 2000 (700,349 deaths). Is considered a single heterogeneous disorder with several epidemiological and histopathologic features. Gastric cancer treatment is based mainly on clinical parameters such as tumor, node, metastasis (TNM) staging, which determines whether patients should be treated surgically or by surgery and chemotherapy. Unlike breast cancer and colorectal cancer, stomach cancer differs markedly from stage 1 to stage 4 according to the TNM stage system. That is, the 5-year survival rate is more than 90% in case 1 and less than 20% in case 4. Thus, the prognosis of the TNM staging system is very good [7]. Ann Surg Oncol 2010; 17: 3077-3079]. Gastric cancer based on the above-described staging system can be divided into Early Gastric Cancer, Locally Advanced Gastric Cancer, Locally Advanced Invasive Gastric Cancer, and Metastatic Gastric Cancer .

비록 수술이 실시 가능한 위암의 주요 치료이긴 하나, 진행성인 경우 재발률이 높다. 재발을 예방하고, 위암 환자들의 예후를 개선하기 위해 화학요법과 화학-방사선요법을 포함한 집학적 치료가 도입되었다. 그러나, 이들 치료 방법이 환자들에서 일반적인 임상 결과를 개선하기는 하나 종양의 임상병리학적 이질성과, 같은 병기에 있는 환자들의 다른 결과는 어쥬번트 화학요법의 임무를 예측하는데 한계가 있어 개별 환자들에 대한 최적 접근이 부족한 상태이다. Although surgery is the main treatment for viable gastric cancer, recurrence rates are high in advanced cases. In order to prevent recurrence and to improve the prognosis of patients with gastric cancer, chemotherapy and chemotherapy including chemotherapy was introduced. However, although these treatment modalities improve general clinical outcomes in patients, the clinicopathologic heterogeneity of the tumors and the different outcomes of patients in the same stage have limitations in predicting the task of adjuvant chemotherapy, There is a lack of optimal access to.

최근의 진전에도 불구하고 발병적으로 별개의 종양 유형에 대해 특이적 치료 섭생을 표적화하고, 궁극적으로 성과를 최대화시키기 위하여 종양 치료를 개인화하기 위한 암 치료의 도전과제들이 남아있다. 따라서, 각종 치료 선택사항들에 대한 환자 반응에 관한 예측적 정보를 동시에 제공하는 시험을 필요로 하고 있다.
Despite recent advances, challenges remain for cancer treatment to personalize tumor therapy to target specific therapeutic regimens for distinctly different tumor types and ultimately maximize outcome. Therefore, tests that simultaneously provide predictive information about patient response to various treatment options are needed.

1. 대한민국 공개특허 제2007-0022694호, 2007.02.271. Korean Patent Publication No. 2007-0022694, February 27, 2007 2. 대한민국 공개특허 제2010-0072283호, 2010.06.302. Korean Patent Publication No. 2010-0072283, June 30, 2010

1. Journal of clinical oncology, vol. 23(29), pp.7286-7295, 2005.10.101. Journal of clinical oncology, vol. 23 (29), pp.7286-7295, 2005.10.10

본 발명의 목적은 TNM 병기와 상관없는 전체 위암 환자들의 임상 결과에 영향을 주는 유전자 발현 예후 지표 또는 위험도 점수에 기초한 새로운 예후 예측 시스템을 제공하는 것이다.
It is an object of the present invention to provide a new prognostic prediction system based on a gene expression prognostic index or risk score that affects the clinical outcome of all gastric cancer patients not related to the TNM stage.

상기 목적을 달성하기 위하여, 본 발명은 대상으로부터 얻은 암세포를 포함하는 생물학적 샘플에서 CSK, CD8A 및 CHEK2의 RNA 전사체의 발현도를 결정하는 단계; 및 상기 단계에서 결정된 RNA 전사체의 발현도에 기초하여 상기 생물학적 샘플의 위험도 점수(RS, Risk Score) 및 RS 백분율(RS(%))을 계산하고, 상기 RS(%)에 따라 예후를 판단하는 단계를 포함하는, 위암으로 진단된 대상에서 예후를 예측하는 방법을 제공한다.In order to achieve the above object, the present invention provides a method for determining the expression level of an RNA transcript of CSK, CD8A and CHEK2 in a biological sample comprising cancer cells obtained from a subject; And calculating a risk score (RS, Risk Score) and a RS percentage (RS (%)) of the biological sample based on the expression level of the RNA transcript determined in the step and determining a prognosis according to the RS (% A method for predicting a prognosis in a subject diagnosed with gastric cancer.

본 발명은 또한 대상으로부터 얻은 암세포를 포함하는 생물학적 샘플에서 CSK, CD8A 및 CHEK2의 RNA 전사체의 발현 수준을 측정하는 단계; 및 상기 전사체의 발현 증가는 긍정적인 임상 결과 가능성의 증가로 판단하는 단계를 포함하는, 위암으로 진단된 대상에서 예후를 예측하는 방법을 제공한다.The present invention also relates to a method for detecting the expression level of an RNA transcript of CSK, CD8A and CHEK2 in a biological sample comprising cancer cells obtained from a subject; And determining that the increased expression of the transcript is an increase in the likelihood of a positive clinical outcome.

상기 예후 예측 방법은 TNM 병기와 상관없는 전체 위암의 수술에 의한 절제 후 임상 결과를 예측하는 것일 수 있다.This prognostic prediction method may be to predict the clinical outcome after surgical resection of the entire gastric cancer that is not related to the TNM stage.

본 발명은 또한 위암의 예후 예측을 실행하는 프로그램을 기록한 컴퓨터로 판독가능한 기록 매체에 있어서, 환자로부터 얻은 핵산 시료에서 CSK, CD8A 및 CHEK2의 RNA 전사체의 발현도를 결정하는 단계; 및 상기 단계에서 결정된 RNA의 발현도에 기초하여 상기 생물학적 샘플의 위험도 점수(RS, Risk Score) 및 RS 백분율(RS(%))을 계산하고, 전체 생존율(Overall Survival, OS) 측면에서 상기 RS(%)의 설정치 범위가 75% 이상을 갖는 경우 고 위험군 환자, 25% 이상 내지 75% 미만인 경우 중간 위험군 환자, 25% 미만인 경우 저 위험군 환자로 분류하는 단계를 컴퓨터에 실행시키는 프로그램을 기록한 컴퓨터로 판독가능한 기록 매체를 제공한다.The present invention also relates to a computer-readable medium having recorded thereon a program for performing prognostic prediction of gastric cancer, comprising: determining the degree of expression of an RNA transcript of CSK, CD8A and CHEK2 in a nucleic acid sample obtained from a patient; (RS) and a RS percentage (RS) of the biological sample based on the expression level of the RNA determined in the step (a), and calculating the RS (%) in terms of overall survival (OS) ) Are recorded in a computer-readable recording medium that records a program for causing a computer to classify a high-risk group as having a set value of 75% or more, an intermediate-risk group as 25% to less than 75%, or a low- Thereby providing a recording medium.

상기 기록 매체는 TNM 병기와 상관없는 전체 위암의 수술에 의한 절제 후 임상 결과를 예측하는 것일 수 있다.
The recording medium may be to predict the clinical outcome after surgical resection of whole gastric cancer which is not related to the TNM stage.

본 발명은 TNM 병기와 상관없는 전체 위암 환자군에 대한 전체 생존율 측면에서 예측 모델을 만든 후 통계적으로 유의한 생존에 영향을 미치는 RNA 전사체의 발현도를 결정하여 이로부터 위험도 평점 시스템을 만들어 예후 지표 값을 산출하는 방식으로 위암 수술에 의한 절제 후 임상 결과를 예측할 수 있다.
The present invention establishes a prediction model in terms of overall survival rate for all gastric cancer patients without regard to TNM stage, and then determines the expression level of RNA transcript that influences statistically significant survival, The clinical outcome after gastrectomy with stomach cancer surgery can be predicted.

도 1은 연세대학교 세브란스 병원에서 2001년부터 2004년까지 총 520명을 대상으로 위암 병기에 따른 분포를 조사한 결과이다.
도 2는 레퍼런스 유전자에 대한 표준화 작업을 수행한 결과를 나타낸 것이다.
도 3은 RNA 전사체 발현도에 따른 예후가 좋은 군과 나쁜 군의 생존율 분석 결과를 나타낸 것이다.
도 4는 BRB-Array Tools를 이용한 PCA 분석에 따라 생존 위험 예측 결과를 나타낸 것이다.
도 5는 본 발명의 예후 모델의 정확도를 검증한 결과를 나타낸 것이다.
도 6은 위암 환자군을 대상으로 위험도 점수(RS(%))에 따른 생존 분석 결과를 나타낸 것이다.Figure 1 shows the results of 520 gastric cancer stage at Yonsei University Severance Hospital between 2001 and 2004.
FIG. 2 shows the result of performing a standardization work on a reference gene.
FIG. 3 shows the survival analysis results of good and bad prognostic groups according to RNA transcript expression.
FIG. 4 shows the survival risk prediction result according to the PCA analysis using the BRB-Array Tools.
5 shows the results of verifying the accuracy of the prognosis model of the present invention.
6 shows survival analysis results according to the risk score (RS (%)) of gastric cancer patients.

이하 본 발명의 구성을 구체적으로 설명한다. Hereinafter, the structure of the present invention will be described in detail.

본 발명은 대상으로부터 얻은 암세포를 포함하는 생물학적 샘플에서 CSK, CD8A 및 CHEK2의 RNA 전사체의 발현도를 결정하는 단계; 및 상기 단계에서 결정된 RNA 전사체의 발현도에 기초하여 상기 생물학적 샘플의 위험도 점수(RS, Risk Score) 및 RS 백분율(RS(%))을 계산하고, 상기 RS(%)에 따라 예후를 판단하는 단계를 포함하는, 위암으로 진단된 대상에서 예후를 예측하는 방법에 관한 것이다.The present invention relates to a method for determining the expression level of an RNA transcript of CSK, CD8A and CHEK2 in a biological sample comprising cancer cells obtained from a subject; And calculating a risk score (RS, Risk Score) and a RS percentage (RS (%)) of the biological sample based on the expression level of the RNA transcript determined in the step and determining a prognosis according to the RS (% To a method for predicting a prognosis in a subject diagnosed with gastric cancer.

본 발명에 따른 예후를 예측하는 방법은 위암 환자들의 예후와 관련을 보이는 유전자와 종양생물학적인 특성을 강하게 나타내는 유전자군을 선정하여 qPCR을 수행하여 콕스 회귀 분석에서 통계적 유의(p<0.001)를 갖는 유전자를 예후와 관련된 유전자 대상으로 선정하고, 상기 유전자들의 위험 계수(hazard ratio)를 유전자의 발현 값에 곱하여 하기 수학식 1 및 2에 따라 위험도 점수(RS, Risk Score) 및 RS 백분율(RS(%))을 구한 후 전체 생존율(Overall Survival, OS) 측면에서 상기 RS(%)가 75% 이상을 갖는 샘플을 고 위험군, 25% 이상 내지 75% 미만인 경우 중간 위험군, 25% 미만인 경우 저 위험군으로 분류함으로써 위암으로 진단된 대상에서 예후를 예측할 수 있는 것을 특징으로 한다.The method of predicting the prognosis according to the present invention was performed by selecting a gene group strongly associated with the prognosis of gastric cancer patients and a gene group strongly expressing the tumor biologic characteristics and performing qPCR to determine the gene having a statistical significance (p <0.001) in the Cox regression analysis (RS, Risk Score) and the RS percentage (RS (%)) according to the following equations (1) and (2) by multiplying the gene expression value by the hazard ratio of the genes, ) Were classified as high-risk, 25% to less than 75%, intermediate, and less than 25% of RS (%) in the overall survival (OS) It is characterized by being able to predict the prognosis in a subject diagnosed with gastric cancer.

상기 RS 및 RS(%)는 하기 수학식 1 및 2에 따라 계산할 수 있다:The RS and RS (%) can be calculated according to the following equations (1) and (2): < EMI ID =

[수학식 1][Equation 1]

RS=HR₁*normLogTransValue₁ + HR₂*normLogTransValue₂ + ... + HR_n* normLogTransValue_n
RS = HR ₁ * normLogTransValue ₁ + HR ₂ * normLogTransValue ₂ + ... + HR _n * normLogTransValue _n

[수학식 2]&Quot; (2) "

RS(%)=100×(생물학적 샘플의 RS - 모집단의 RS 최소값)/(모집단의 RS 최고값 - 모집단의 RS 최소값)
RS (%) = 100 × (RS minimum value of the biological sample) / (RS maximum value of the population - RS minimum value of the population)

상기 식에서, In this formula,

HR_n는 n번째 RNA 전사체의 위험 계수(hazard ratio)를 나타내고, 상기 HR_n이 1 미만인 경우 -1/HR_n로 변환하여 사용하며,HR _n represents the hazard ratio of the n-th RNA transcript and is converted to -1 / HR _n when HR _n is less than 1,

normLogTransValue_n는 RNA 전사체의 발현과 관련된 값이고, 이 값은 해당 유전자의 전체 값을 대상으로 하여 중간값을 중심으로 스케일을 변화시킨 값이며,normLogTransValue _n is a value associated with the expression of the RNA transcript and is a value obtained by varying the scale around the median based on the total value of the gene,

상기 모집단은 TNM 병기와 상관없는 전체 위암을 갖는 일정 수의 집단을 의미하고, 일정 수는 RS 최고값과 최소값을 산출할 수 있는 정도의 임의의 정수이다.The population means a certain number of groups having a whole gastric cancer unrelated to the TNM stage, and the constant number is an arbitrary integer such that the RS maximum value and the minimum value can be calculated.

상기 모집단의 수는 특별히 제한하지는 않으며, 일 구체예에서는 TNM 병기와 상관 없는 전체 위암 조직 520개를 모집단으로 사용하였다.The number of the population is not particularly limited. In one embodiment, 520 whole gastric cancer tissues irrespective of TNM stage were used as a population.

본 명세서에서, 용어, 위험 계수(Hazard Ratio: HR)란 암의 진행, 재발, 또는 요법 반응에 대한 기여도를 반영하는 계수를 의미한다. 위험 계수는 다양한 통계적 기법에 의하여 도출될 수 있다. 상기 위험 계수, HR 값은 다양한 통계적 모델에서 결정할 수 있으며, 예컨대, 다변량 콕스 비례 위험 회귀 분석에서 결정할 수 있다. 일 구체예에서, HR 값을 PI 수식에 사용함에 있어서, HR 값이 1보다 크거나 같을 경우 HR 값을 그대로 사용하고, HR 값이 1 보다 작을 경우 -1/HR 값을 사용할 수 있다. As used herein, the term Hazard Ratio (HR) refers to a coefficient that reflects the contribution of a cancer to progression, relapse, or therapy response. Risk factors can be derived by various statistical techniques. The risk factor, HR value, may be determined in various statistical models, for example, in a multivariate Cox proportional hazards regression analysis. In one embodiment, when the HR value is used in the PI equation, the HR value is used as it is when the HR value is equal to or greater than 1, and -1 / HR value is used when the HR value is less than 1.

또한, 상기 식에서, 용어, RNA 전사체의 발현 값이란 개별유전자, 즉 RNA 전사체의 발현과 관련된 값을 의미한다. 상기 값은 공지된 다양한 통계적 수단을 사용하여 결정할 수 있다. 예를 들어 발현 값은 콕스 회귀 분석에 의해 측정된 p 값을 log2 함수값으로 변형 후 사분위수 표준화(quantile normalization) 후의 값을 사용할 수 있다. 상기 수학식 1에서 사용된 발현 값은 해당 유전자의 전체 값을 대상으로 하여 중간값을 중심으로 스케일을 변화시킨 값을 사용하였다.In the above expression, the expression value of the term RNA transcript refers to a value related to the expression of an individual gene, that is, an RNA transcript. The value can be determined using various statistical means known in the art. For example, the expression value can be obtained by transforming p value measured by Cox regression analysis into log2 function value and then quantile normalization. The expression value used in Equation (1) is a value obtained by changing the scale around the intermediate value with respect to the total value of the corresponding gene.

일 구체예에 따르면, RS는 다음과 같이 결정할 수 있다:According to one embodiment, the RS may determine as follows:

RS = - CSK×1.317 - CD8A×1.18 - CHEK2×1.26RS = - CSK x 1.317 - CD8A x 1.18 - CHEK2 x 1.26

상기 수학식 1에 따라 계산된 RS는 상기 수학식 2에 따라 RS(%)로 나타낼 수 있다. 위의 식에서 결정된 값을 모집단에서 해당하는 순위로 변환하여 전체 생존율(Overall Survival, OS) 측면에서 상기 RS(%)가 75% 이상을 갖는 샘플을 고 위험군, 25% 이상 내지 75% 미만인 경우 중간 위험군, 25% 미만인 경우 저 위험군으로 분류한다. 고 위험군은 나쁜 예후를, 저 위험군은 좋은 예후인 것으로 판단할 수 있다. 즉, RS(%) 값이 75% 이상을 갖는 샘플을 고 위험군은 3년 이상, 6년 이상, 10년 이상의 기간 동안 전체 생존율이 낮음을 의미하고, 25% 미만인 저 위험군은 3년 이상, 6년 이상, 10년 이상 동안 전체 생존율이 높음을 의미한다. 상기 용어, 좋은 예후는 임상 결과의 긍정적 임상 결과 가능성의 증가로 표현될 수 있고, 나쁜 예후는 임상 결과의 긍정적 임상 결과 가능성의 감소로 표현될 수 있다.The RS calculated according to Equation (1) can be expressed as RS (%) according to Equation (2). When the value determined in the above equation is converted into a ranking in the population, a sample having the RS (%) of 75% or more in terms of overall survival (OS) is referred to as a high-risk group, and in the case of 25% , And less than 25%. The high-risk group can be judged as a bad prognosis and the low-risk group as a good prognosis. That is, a sample with a RS (%) value of 75% or higher means that the overall survival rate is low for more than 3 years, more than 6 years, more than 10 years for high risk group, more than 3 years for low risk group less than 25% This means that the overall survival rate is high for more than 10 years. The term, a good prognosis, can be expressed as an increase in the likelihood of a positive clinical outcome of a clinical outcome, and a bad prognosis can be expressed as a decrease in the likelihood of a positive clinical outcome of a clinical outcome.

상기 방법은 TNM 병기와 상관없는 전체 위암의 수술에 의한 절제 후 임상 결과를 예측하는데 유용할 수 있다.
This method may be useful for predicting clinical outcome after surgical resection of whole gastric cancer, regardless of TNM stage.

상기 방법은 qPCR 기반 방법일 수 있다.The method may be a qPCR-based method.

상기 발현 수준이 하나 이상의 RNA 전사체의 발현 수준에 대해 표준화되는 것일 수 있다. The expression level may be one that is normalized to the level of expression of one or more RNA transcripts.

상기 임상 결과가 전체 생존율(Overall Survival, OS) 측면에서 표현되는 것일 수 있다.The clinical outcome may be expressed in terms of Overall Survival (OS).

상기 방법은 RNA 전사체 전체의 발현 수준을 측정하고 발현 증가를 분석하여 긍정적인 임상 결과 가능성의 증가 또는 감소를 판단하여 예후를 예측할 수 있다.This method can predict the prognosis by measuring the expression level of the entire RNA transcript and analyzing the expression increase to judge the increase or decrease in the possibility of positive clinical outcome.

본 발명은 또한 위암의 예후 예측을 실행하는 프로그램을 기록한 컴퓨터로 판독가능한 기록 매체에 있어서, 환자로부터 얻은 핵산 시료에서 CSK, CD8A 및 CHEK2의 RNA 전사체의 발현도를 결정하는 단계; 및 상기 단계에서 결정된 RNA의 발현도에 기초하여 상기 시료의 위험도 점수(RS, Risk Score) 및 RS 백분율(RS(%))을 계산하고, 전체 생존율(Overall Survival, OS) 측면에서 상기 RS(%)의 설정치 범위가 75% 이상을 갖는 경우 고 위험군 환자, 25% 이상 내지 75% 미만인 경우 중간 위험군 환자, 25% 미만인 경우 저 위험군 환자로 분류하는 단계를 컴퓨터에 실행시키는 프로그램을 기록한 컴퓨터로 판독가능한 기록 매체를 제공한다.The present invention also relates to a computer-readable medium having recorded thereon a program for performing prognostic prediction of gastric cancer, comprising: determining the degree of expression of an RNA transcript of CSK, CD8A and CHEK2 in a nucleic acid sample obtained from a patient; (RS), a risk score (RS), and a RS percentage (RS) of the sample based on the degree of expression of the RNA determined in the step, and calculating the RS (%) in terms of overall survival (OS) A computer readable recording of a program that causes a computer to perform the steps of classifying a high-risk group as having a set-value range of 75% or more, an intermediate-risk group as 25% to less than 75%, or a low-risk group as 25% Media.

상기 기록 매체는 TNM 병기와 상관 없는 전체 위암의 수술에 의한 절제 후 임상 결과를 예측하는데 유용할 수 있다.The recording medium may be useful for predicting clinical outcome after surgical resection of whole stomach cancer independent of TNM stage.

상기 RS 및 RS(%)는 상기 수학식 1 및 2에 따라 계산할 수 있다. The RS and RS (%) can be calculated according to Equations (1) and (2).

상기 기록 매체는 전체 생존율(Overall Survival, OS) 측면에서 RS(%)의 설정치 범위가 75% 이상을 갖는 경우 고 위험군, 25% 이상 내지 75% 미만인 경우 중간 위험군, 25% 미만인 경우 저 위험군으로 판단할 수 있다. 즉, RS(%) 값이 75% 이상을 갖는 샘플을 고 위험군은 3년 이상, 6년 이상, 10년 이상의 기간 동안 전체 생존율이 낮다고 판단하고, 25% 미만인 저 위험군은 3년 이상, 6년 이상, 10년 이상 동안 전체 생존율이 높다고 판단한다. 상기 용어, 좋은 예후는 임상 결과의 긍정적 임상 결과 가능성의 증가로 표현될 수 있고, 나쁜 예후는 임상 결과의 긍정적 임상 결과 가능성의 감소로 표현될 수 있다.The recording medium is judged to be a high-risk group when the set value range of RS (%) is more than 75% in terms of overall survival (OS), an intermediate risk group when 25% or more to less than 75% can do. That is, a sample with a RS (%) value of 75% or more is considered to have a low overall survival rate for a period of 3 years, 6 years, 10 years or more in a high-risk group, Or more, and the overall survival rate is higher than 10 years. The term, a good prognosis, can be expressed as an increase in the likelihood of a positive clinical outcome of a clinical outcome, and a bad prognosis can be expressed as a decrease in the likelihood of a positive clinical outcome of a clinical outcome.

달리 정의되지 않는다면, 본원에 사용된 기술 및 과학 용어들은 당업자 수준에서 일반적으로 이해하는 바와 같은 의미가 있다. 본 발명은 어떤 방식으로든 설명된 방법 및 재료로 제한되지 않는다. 본 발명의 목적상, 하기 용어들이 아래에서 정의된다.Unless otherwise defined, the technical and scientific terms used herein have the same meaning as commonly understood by one of skill in the art. The present invention is not limited in any way to the methods and materials described. For purposes of the present invention, the following terms are defined below.

용어, "폴리뉴클레오티드"는 일반적으로 임의의 폴리리보뉴클레오티드 또는 폴리데옥시리보뉴클레오티드를 말하고, 예컨대 변형 또는 비-변형 RNA 또는 DNA일 수 있다. 본 명세서에서, "폴리뉴클레오티드"는 구체적으로 cDNA를 포함한다. The term "polynucleotide" generally refers to any polyribonucleotide or polydeoxyribonucleotide and can be, for example, a modified or non-modified RNA or DNA. As used herein, "polynucleotide" specifically includes cDNA.

용어, "올리고뉴클레오티드"는 비제한적으로 한-가닥 데옥시리보뉴클레오티드, 한- 또는 두-가닥 리보뉴클레오티드, RNA:DNA 하이브리드 및 두-가닥 DNA를 포함하는, 비교적 짧은 폴리뉴클레오티드를 말한다. 올리고뉴클레오티드, 예를 들면 한-가닥 DNA 프로브 올리고뉴클레오티드는 종종 예를 들면 상업적으로 입수가능한 자동화 올리고뉴클레오티드 합성기를 사용하는 화학적 방법에 의해 합성된다. 그러나, 올리고뉴클레오티드는 시험관내 재조합 DNA-매개 기술을 포함하는 각종 다른 방법들에 의해 및 세포 및 유기체 중에서의 DNA 발현에 의해 제조될 수 있다.The term "oligonucleotide" refers to a relatively short polynucleotide, including but not limited to one-strand deoxyribonucleotide, one- or two-strand ribonucleotides, RNA: DNA hybrids and two-stranded DNA. Oligonucleotides, such as one-stranded DNA probe oligonucleotides, are often synthesized, for example, by chemical methods using commercially available automated oligonucleotide synthesizers. However, oligonucleotides can be produced by various other methods, including in vitro recombinant DNA-mediated techniques, and by DNA expression in cells and organisms.

용어, "차등적으로 발현된 유전자" 또는 "차등적인 유전자 발현"은 정상 또는 대조용 대상체의 발현에 비하여 위암과 같은 암을 앓는 대상체 중에서 더 높거나 낮은 수준으로 활성화되는 유전자를 말한다. 또한, 동일한 질병의 다른 병기에서 더 높거나 낮은 수준으로 활성화되는 유전자를 포함한다. 차등적으로 발현되는 유전자는 핵산 수준 또는 단백질 수준에서 활성화 또는 억제되거나, 다른 스플라이싱을 받아 상이한 폴리펩타이드 산물을 야기하는 경우일 수 있다. 이러한 차이는 예를 들면 폴리펩타이드의 mRNA 수준, 표면 발현, 분비 또는 다른 분배에 있어서의 변화에 의해 입증될 수 있다. 본 발명의 목적상, "차등적인 유전자 발현"은 정상 및 질병에 걸린 대상체에서 또는 질병에 걸린 대상체의 다양한 병기에서 주어진 유전자의 발현 사이에 약 1.5배 이상, 약 4배 이상, 약 6배 이상, 약 10배 이상의 차이가 있을 때 존재하는 것으로 간주 된다.The term "differentially expressed gene" or "differential gene expression" refers to a gene that is activated at a higher or lower level than a cancerous subject, such as a stomach cancer, relative to the expression of a normal or control subject. It also includes genes that are activated at higher or lower levels in other stages of the same disease. The differentially expressed genes may be activated or inhibited at the nucleic acid or protein level, or they may undergo different splicing resulting in different polypeptide products. This difference can be demonstrated, for example, by changes in the mRNA level, surface expression, secretion or other distribution of the polypeptide. For the purposes of the present invention, "differential gene expression" means about 1.5 times, at least about 4 times, at least about 6 times, at least about 5 times, at least about 5 times, at least about 5 times, It is considered to be present when there is a difference of about 10 times or more.

유전자 전사체 또는 유전자 발현 생성물에 관한 용어 "표준화된"은 기준 유전자 세트의 전사체/생성물의 평균 수준에 대한 전사체 또는 유전자 발현 생성물의 수준을 말하는데, 여기서 레퍼런스 유전자들은 환자, 조직 또는 치료에 걸쳐 이들의 최소한의 변동에 기준하여 선택되거나 ("하우스키핑 유전자(housekeeping genes)"), 또는 레퍼런스 유전자는 시험된 유전자들 전체를 말한다. 후자의 경우, 일반적으로 "전체 표준화(global normalization)"로 언급되는데, 시험된 유전자들의 총 수가 비교적 큰, 바람직하게는 50 초과인 것이 중요하다. 구체적으로, RNA 전사체에 관한 용어 '표준화된'은 기준 유전자 세트의 전사 수준의 평균에 대한 전사 수준을 말한다.The term "normalized " with respect to a gene transcript or gene expression product refers to the level of a transcript or gene expression product relative to an average level of a transcript / product of a set of reference genes, ("Housekeeping genes"), or a reference gene refers to the whole of the genes tested. In the latter case, it is generally referred to as "global normalization ", where it is important that the total number of genes tested is relatively large, preferably over 50. Specifically, the term " standardized " for an RNA transcript refers to the transcription level relative to the average of transcription levels of a set of reference genes.

용어, "발현 역치" 및 "정의된 발현 역치"는 혼용하여 사용하며, 이 수준 이상에서는 유전자 또는 유전자 생성물이 환자 반응에 대한 예측 마커로서 사용되는 해당 유전자 또는 유전자 생성물의 수준을 말한다. 역치는 대표적으로 임상적 연구로부터 실험적으로 정의된다. 발현 역치는 최대 민감성, 또는 최대 선택성(예를 들면 한 약물에 대한 반응자들만을 선택하도록), 또는 최소 오차로 선택될 수 있다.The terms "expression threshold" and "defined expression threshold" are used interchangeably, and at this level above the level of the gene or gene product the gene or gene product is used as a predictive marker for the patient response. Thresholds are typically defined experimentally from clinical studies. Expression thresholds can be selected for maximum sensitivity, or for maximum selectivity (e.g., to select only responders for a drug), or minimum error.

용어, "유전자 증폭"은 특정 세포 또는 세포주에서 유전자 또는 유전자 단편의 다수개의 복사물이 형성되는 과정을 말한다. 복제된 영역 (증폭된 DNA의 신장)은 종종 "암플리콘"으로 언급된다. 종종, 생산된 mRNA의 양, 즉 유전자 발현도는 또한 특정 유전자의 만들어진 복제 수에 비례하여 증가한다.The term "gene amplification" refers to the process by which multiple copies of a gene or gene fragment are formed in a particular cell or cell line. The replicated region (the elongation of amplified DNA) is often referred to as "amplicon. &Quot; Often, the amount of mRNA produced, or gene expression, also increases in proportion to the number of copies made of a particular gene.

본 명세서에서, "예후"는 본원에서 암에 의한 사망 또는 위암과 같은 신생물성 질환의 진행(재발, 전이성 확산 및 내약물성 포함)의 가능성의 예측을 말하는데 사용된다. 용어 "예측"은 본원에서 환자가 주요 종양의 수술 제거 후에 암 재발 없이 특정 기간 동안 살아남게 될 가능성을 말하는데 사용된다. 이러한 예측은 임의의 특정 환자에 대하여 가장 적절한 치료 기법을 선택함으로써 치료를 결정하는데 임상적으로 사용될 수 있다. 이러한 예측은 환자가 치료 섭생, 예를 들면 수술 시술에 대하여 유리하게 반응하기 쉬운지, 또는 수술 종료 후에 환자의 장기간 생존이 가능한지를 예측하는데 있어서 귀중한 수단이 된다.As used herein, the term "prognosis" is used herein to refer to the prediction of the likelihood of death (including recurrence, metastatic spread, and drug resistance) of a neoplastic disease such as cancer or gastric cancer. The term "prediction" is used herein to refer to the likelihood that a patient will survive a specific period of time without cancer recurrence following surgical removal of a major tumor. Such prediction can be clinically used to determine treatment by selecting the most appropriate treatment technique for any particular patient. Such a prediction is a valuable tool in predicting whether a patient is likely to respond favorably to therapeutic regimens, such as surgical procedures, or whether long-term survival of a patient is possible after the end of surgery.

본 명세서에서, "장기간" 생존은 본원에서 수술 또는 다른 치료 후 3년 이상, 보다 바람직하게는 6년 이상, 가장 바람직하게는 10년 이상 생존하는 것을 말하는데 사용된다.As used herein, "prolonged" survival is used herein to refer to survival of 3 years or more, more preferably 6 years or more, and most preferably 10 years or more after surgery or other treatment.

달리 지시하지 않는 한, 분자 생물학(재조합 기술 포함), 미생물학, 세포 생물학 및 생화학의 종래 기술을 사용하여 본 발명을 수행할 수 있다.
Unless otherwise indicated, the present invention may be practiced using conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, and biochemistry.

1. 유전자 발현 프로파일 작성(Profiling)1. Profiling of gene expression profile

유전자 발현 프로파일 작성 방법은 폴리뉴클레오티드의 혼성화 분석에 기초한 방법, 폴리뉴클레오티드의 서열화에 기초한 방법, 및 프로테오믹스 기재 방법을 포함한다. 예를 들어 mRNA 발현의 정량화를 위한 방법은 노던 블랏팅(northern blotting) 및 인 시츄 혼성화(in situ hybridization); RNAse 보호 검정시험; 및 PCR-기재 방법, 예를 들면 역 전사 폴리머라제 연쇄 반응(RT-PCR) 등을 포함한다. 또는, DNA 두 가닥, RNA 두 가닥, 및 DNA-RNA 하이브리드 두 가닥 또는 DNA-단백질 두 가닥을 포함하는 특정 두 가닥을 인식할 수 있는 항체들이 사용될 수 있다. 서열화-기재 유전자 발현 분석에서 대표적인 방법은 유전자 발현의 연속 분석(Serial Analysis of Gene Expression, SAGE) 및 대량적으로 평행한 시그너쳐 서열화(massively parallel signature sequencing, MPSS)에 의한 유전자 발현 분석을 포함한다.
Methods for generating gene expression profiles include methods based on hybridization analysis of polynucleotides, methods based on sequencing of polynucleotides, and methods based on proteomics. Methods for quantifying mRNA expression, for example, include northern blotting and in situ hybridization; RNAse protection assays; And PCR-based methods, such as reverse transcription polymerase chain reaction (RT-PCR). Alternatively, antibodies capable of recognizing two specific strands including two DNA strands, two RNA strands, and two DNA-RNA hybrid strands or two DNA-protein strands can be used. Representative methods for sequencing-based gene expression analysis include gene expression analysis by Serial Analysis of Gene Expression (SAGE) and massively parallel signature sequencing (MPSS).

2. PCR-기반 유전자 발현 프로파일 작성 방법2. PCR-Based Gene Expression Profiling Methods

a. 역 전사효소 PCR(RT-PCR)a. Reverse transcriptase PCR (RT-PCR)

가장 민감하고 가장 유연한 정량적 PCR-기재 유전자 발현 프로파일작성 방법들 중 하나는 RT-PCR이고, 이것은 약물 치료와 함께 또는 약물 치료 없이 정상 조직 및 종양 조직에서의 상이한 샘플 집단에 있어서의 mRNA 수준을 비교하여 유전자 발현 패턴을 특성화하고, 밀접하게 관련된 mRNA들을 판별하고, RNA 구조를 분석하는데 사용될 수 있다.One of the most sensitive and most flexible quantitative PCR-based gene expression profile generation methods is RT-PCR, which compares mRNA levels in different sample populations in normal and tumor tissues with or without drug therapy Can be used to characterize gene expression patterns, identify closely related mRNAs, and analyze RNA structures.

제1단계는 표적 샘플로부터 mRNA의 단리이다. mRNA는 예를 들면 냉동되거나 보관된 파라핀-매립 및 고정된 (예를 들면, 포르말린-고정된) 조직 샘플로부터 추출될 수 있다.The first step is the isolation of mRNA from the target sample. mRNA can be extracted from, for example, frozen or stored paraffin-embedded and fixed (e.g., formalin-fixed) tissue samples.

mRNA 추출을 위한 일반적인 방법은 당업계에 공지되어 있으며, 파라핀에 매립된 조직으로부터의 RNA 추출 방법은 문헌 ([Rupp and Locker, Lab Invest. 56: A67 (1987)] 및 [De Andres et al ., BioTechniques 18: 42044 (1995)]) 등에 개시되어 있다. 특히, RNA 단리는 상업적 제조업체, 예를 들면 퀴아겐(Qiagen)으로부터의 정제 키트, 완충제 세트 및 프로테아제를 사용하여 제조업체의 설명서에 따라 수행할 수 있다. 다른 상업적으로 입수가능한 RNA 단리 키트는 마스터푸어(MasterPure)^TM 완전 DNA 및 RNA 정제 키트(에피센트레(EPICENTRE)^®, 위스콘신주 매디슨) 및 파라핀 블록(Paraffin Block) RNA 단리 키트(앰비온, 인크.(Ambion, Inc.))를 포함한다. 조직 샘플로부터의 전제 RNA는 RNA Stat-60 (Tel-Test)를 사용하여 단리할 수 있다. 종양으로부터 제조된 RNA는 예를 들면 염화세슘 밀도 구배 원심분리에 의해 단리될 수 있다.General methods for mRNA extraction are well known in the art, and methods for RNA extraction from paraffin-embedded tissue are described in Rupp and Locker, Lab Invest . 56: A67 (1987)] and [De Andres et al ., BioTechniques 18: 42044 (1995)). In particular, RNA isolation can be performed according to the manufacturer ' s instructions using commercial manufacturers, for example, purification kits from Qiagen, buffer sets and proteases. Commercially available RNA isolation kit is different from the master Poor (MasterPure) ^TM Complete DNA and RNA Purification Kit (epi cent LES (EPICENTRE) ^®, Madison, WI), and Paraffin Block (Paraffin Block) RNA isolation kit (aembi-one, Inc. (Ambion, Inc.). Total RNA from tissue samples can be isolated using RNA Stat-60 (Tel-Test). RNA prepared from tumors can be isolated, for example, by cesium chloride density gradient centrifugation.

RNA는 PCR을 위한 주형으로 사용될 수 없기 때문에, RT-PCR에 의한 유전자 발현 프로파일 작성의 제1단계는 RNA 주형의 cDNA로의 역 전사이고, 이후 그의 PCR 반응으로의 지수적 증폭이 이어진다. 역 전사효소는 주로 조류 골수아세포증 바이러스 역 전사효소(AMV-RT) 및 몰로니(Moloney) 쥐 백혈병 바이러스 역 전사효소(MMLV-RT)를 사용할 수 있다. 역 전사 단계는 대표적으로 발현 프로파일작성의 환경 및 목표에 따라, 특정 프라이머, 무작위 헥사머, 또는 올리고-dT 프라이머를 사용하여 초회항원자극된다. 예를 들면, 추출된 RNA는 진앰프(GeneAmp) RNA PCR 키트(퍼킨 엘머(Perkin Elmer), 미국 캘리포니아주)를 사용하여 제조업체의 설명서에 따라 역-전사될 수 있다. 유도된 cDNA는 이어서 후속되는 PCR 반응에서 주형으로서 사용될 수 있다.Since RNA can not be used as a template for PCR, the first step in creating a gene expression profile by RT-PCR is reverse transcription into the RNA template cDNA, followed by exponential amplification into its PCR reaction. The reverse transcriptase can be used mainly in avian myeloproliferative viral reverse transcriptase (AMV-RT) and Moloney murine leukemia virus reverse transcriptase (MMLV-RT). The reverse transcription step is typically an initial antigen stimulation using specific primers, random hexamers, or oligo-dT primers, depending on the environment and goals of expression profile creation. For example, the extracted RNA can be reverse-transcribed according to the manufacturer's instructions using a GeneAmp RNA PCR kit (Perkin Elmer, CA, USA). The derived cDNA can then be used as a template in subsequent PCR reactions.

비록 PCR 단계가 각종 열안정성 DNA-의존성 DNA 폴리머라제를 사용할 수 있지만, 이것은 전형적으로는 Taq DNA 폴리머라제를 사용하는데, Taq DNA 폴리머라제는 5'-3' 뉴클레아제 활성을 갖지만, 3'-5' 판독방지(proofreading) 엔도뉴클레아제 활성은 부족하다. 따라서, 타크맨^®PCR은 전형적으로는 Taq 또는 Tth 폴리머라제의 그의 표적 암플리콘에 결합된 혼성화 프로브를 혼성화시키는 5'-뉴클레아제 활성을 이용하지만, 5' 뉴클레아제 활성을 갖는 임의의 효소가 사용될 수 있다. 2개의 올리고뉴클레오티드 프라이머들을 사용하여 PCR 반응의 대표적인 암플리콘을 생성시킨다. 제3 올리고뉴클레오티드 또는 프로브는 2개의 PCR 프라이머들 사이에 위치한 뉴클레오티드 서열을 검출하도록 설계된다. 프로브는 Taq DNA 폴리머라제 효소에 의해 비-연신성이고, 리포터 형광 염료 및 소광제(quencher) 형광 염료로 표지된다. 리포터 염료로부터 임의의 레이저-유도 방출은 2개의 염료가 이들이 프로브 상에 있을 때와 같이 함께 가깝게 위치할 때 소광제 염료에 의해 소거된다. 증폭 반응 동안, Taq DNA 폴리머라제 효소는 주형-의존적 방식으로 프로브를 절단한다. 생성되는 프로브 단편들은 용액 중에서 해리되고, 방출된 리포터 염료로부터의 신호에는 제2 형광단의 소거 효과가 없다. 리포터 염료의 한 분자가 합성된 새로운 분자 각각으로부터 방출되고, 소거되지 않은 리포터 염료의 검출은 데이터의 정량적 해석에 대한 기준을 제공한다.Although the PCR step can use a variety of thermostable DNA-dependent DNA polymerases, it typically uses Taq DNA polymerase, Taq DNA polymerase has 5'-3 'nuclease activity, while 3'- 5 'proofreading endonuclease activity is deficient. Thus, Takman < ( ^R) > PCR typically utilizes 5 ' -nuclease activity to hybridize a hybridization probe bound to its target ampicillin of a Taq or Tth polymerase, Enzymes can be used. Two oligonucleotide primers are used to generate a representative amplicon for the PCR reaction. The third oligonucleotide or probe is designed to detect the nucleotide sequence located between the two PCR primers. The probe is non-extensible by the Taq DNA polymerase enzyme and is labeled with a reporter fluorescent dye and a quencher fluorescent dye. Any laser-induced emission from the reporter dye is cleared by the quencher dye when the two dyes are located close together, such as when they are on the probe. During the amplification reaction, the Taq DNA polymerase enzyme cleaves the probe in a template-dependent manner. The resulting probe fragments are dissociated in solution and the signal from the released reporter dye has no effect of eliminating the second fluorophore. One molecule of the reporter dye is released from each of the synthesized new molecules, and the detection of the non-erased reporter dye provides a basis for quantitative interpretation of the data.

타크맨^® RT-PCR은 상업적으로 입수 가능한 장비, 예를 들면 ABI 프리즘(PRISM) 7700TM 시퀀스 검출 시스템(Sequence Detection System)^TM(Perkin-Elmer-Applied Biosystems), 또는 라이트사이클러(Lightcycler)(Roche Molecular Biochemicals)를 사용하여 수행될 수 있다.Tacna Go ^® RT-PCR are commercially available equipment, for example, ABI Prism (PRISM) 7700TM Sequence Detection System ^{(Sequence Detection System) TM (Perkin} -Elmer-Applied Biosystems), or the light between the clutch (Lightcycler) (Roche Molecular Biochemicals).

5'-뉴클레아제 검정시험 데이터는 초기에 Ct, 또는 역치 사이클로서 표현된다. 형광 값은 매 사이클 동안 기록되고, 증폭 반응으로 그 지점으로까지 증폭된 생성물의 양을 나타낸다. 형광 신호가 처음 통계학적으로 유의한 것으로 기록될 때의 지점이 역치 사이클(Ct)이다.The 5'-nuclease assay test data is initially expressed as Ct, or as a threshold cycle. The fluorescence value is recorded for each cycle and represents the amount of product amplified to that point by an amplification reaction. The point at which the fluorescence signal is initially recorded as statistically significant is the threshold cycle (Ct).

샘플 간의 변동 효과 및 오차를 최소화시키기 위하여, RT-PCR은 일반적으로 기준 RNA(이것은 이상적으로는 상이한 조직들 사이에서 일정 수준으로 발현됨)를 사용하여 수행되고, 실험 치료에 의해 영향을 받지 않는다. 유전자 발현 패턴을 표준화하는데 가장 자주 사용되는 RNA는 하우스키핑 유전자 글리세르알데히드-3-포스페이트-데히드로게나제 (GAPD) 및 β-액틴(ACTB)의 mRNA이다.In order to minimize variation effects and errors between samples, RT-PCR is generally performed using reference RNA, which is ideally expressed at a constant level among different tissues, and is not affected by experimental treatment. The RNA most often used to standardize gene expression patterns is the mRNA of the housekeeping genes glyceraldehyde-3-phosphate-dehydrogenase (GAPD) and beta -actin (ACTB).

또한, 새로운 실시간 정량적 PCR은 이중-표지된 형광원성 프로브(즉, 타크맨^®프로브)를 통한 PCR 생성물 축적을 측정하는 기술로, 정량적 경쟁적 PCR(여기서는, 각 표적 서열에 대한 내부 경쟁자가 표준화에 사용됨) 및 샘플 내에 포함된 표준화 유전자 또는 RT-PCR에 대한 하우스키핑 유전자를 사용하는 정량적 비교용 PCR 모두와 상용 가능하다.
In addition, the new real-time quantitative PCR is a dual-labeled fluorescent immunogenic probe to the technology of measuring the PCR product accumulation through a (i.e., Tacna Go ^® probes), quantitative competitive PCR (in this case, the internal competitor for each target sequence is used to standardize ) And a standardized gene contained in a sample or a quantitative comparative PCR using a housekeeping gene for RT-PCR.

b. 매스어레이(MassARRAY) 시스템b. Massarray system

시커놈, 인크.(Sequenom, Inc.)가 개발한 매스어레이-기반 유전자 발현 프로파일 작성 방법에서는 RNA의 단리 및 역 전사 후에 얻은 cDNA를 합성 DNA 분자(경쟁자)(이것은 단일 염기를 제외한 모든 위치에서 표적화 cDNA 구역과 일치함)로 스파이크하고(spiked), 내부 표준으로 사용한다. cDNA/경쟁자 혼합물을 PCR 증폭시키고, 후-PCR 새우 알칼리성 포스파타제(SAP) 효소 처리를 가하여 남아있는 뉴클레오티드의 데포스포릴화를 야기한다. 알칼리성 포스파타제의 불활성화 후에, 경쟁자 및 cDNA로부터의 PCR 생성물을 프라이머 신장시키고, 이것은 경쟁자- 및 cDNA-유래 PCR 생성물에 대한 별도의 질량 신호들을 생성시킨다. 정제 후, 이들 생성물들을 매트릭스-보조 레이저 탈착 이온화 흐름 시간 질량 분광측정법(MALDI-TOF MS) 분석을 이용한 분석에 필요한 성분들이 이미-부하 되어 있는 칩 어레이 상에 계량분배한다. 반응에 존재하는 cDNA를 이어서 생성된 질량 스펙트럼 내의 피크 면적 비를 분석하여 정량화한다.
In a mass array-based gene expression profiling method developed by Sequenom, Inc., cDNAs obtained after isolation and reverse transcription of RNA are synthesized using a synthetic DNA molecule (competitor), which is targeted at all positions except for a single base lt; / RTI > cDNA region), and used as an internal standard. The cDNA / competitor mixture is PCR amplified and subjected to post-PCR shrimp alkaline phosphatase (SAP) enzyme treatment resulting in the dephosphorylation of the remaining nucleotides. After inactivation of the alkaline phosphatase, the PCR product from the competitor and cDNA is primer-extended, which produces separate mass signals for the competitor- and cDNA-derived PCR products. After purification, these products are dispensed onto a chip array where the components necessary for analysis using matrix-assisted laser desorption ionization flow time mass spectrometry (MALDI-TOF MS) analysis are pre-loaded. The cDNA present in the reaction is then quantified by analyzing the peak area ratio in the resulting mass spectrum.

c. 기타 PCR-기반 방법c. Other PCR-based methods

기타 시차 디스플레이; 증폭된 단편 길이 다형성 (iAFLP); 비드어레이(BeadArray) TM 기술 (일루미나(Illumina), 캘리포니아주 샌 디에고); 유전자 발현에 대한 신속 검정시험에 상업적으로 입수 가능한 루미넥스(Luminex)100 LabMAP 시스템 및 다색-코딩된 미소구(Luminex Corp.)를 사용하는, 유전자 발현 검출용 비즈어레이(BeadsArray for Detection of Gene Expression, BADGE); 및 고 피복 발현프로파일작성 (HiCEP) 분석이 있다.
Other parallax displays; Amplified fragment length polymorphism (iAFLP); BeadArray TM technology (Illumina, San Diego, Calif.); A BeadsArray for Detection of Gene Expression using a commercially available Luminex 100 LabMAP system and multicolor-coded microspheres (Luminex Corp.) for rapid assay testing for gene expression. BADGE); And high-coat expression profile creation (HiCEP) analysis.

3. mRNA 단리, 정제 및 증폭의 일반적인 설명3. General description of mRNA isolation, purification and amplification

파라핀에 매립된 조직을 사용하여 유전자 발현 프로파일을 작성하는 기술은 상술한 바와 같다. 최종적으로 얻은 데이터를 분석하여 관찰된 종양 샘플에서 확인된 특징적인 유전자 발현 패턴에 기초하여 환자에게 이용할 수 있는 최상의 치료 선택사항(들)을 판별해낸다.Techniques for creating gene expression profiles using tissue embedded in paraffin have been described above. The final data is analyzed to determine the best treatment option (s) available to the patient based on the characteristic gene expression pattern identified in the observed tumor sample.

본 발명의 중요한 면은 위암 조직에 의한 특정 유전자의 측정된 발현을 사용하여 예후 정보를 제공하는 것이다. 이러한 목적을 위해, 검정 시험된 RNA의 양, 사용된 RNA 품질에 있어서의 변동, 및 다른 인자, 예를 들면 기계 및 작업자 차이에 있어서의 차이에 대해 보정하는(표준화) 것이 필수적이다. 그러므로, 검정시험은 전형적으로 GAPD 및 ACTB와 같은 공지된 하우스키핑 유전자로부터 전사된 것들을 포함하는, 기준 RNA의 사용을 측정하여 혼입시킨다. 또는, 표준화는 검정시험된 유전자들 또는 이들의 많은 서브세트 전부의 평균 또는 중간 신호(Ct)를 기준으로 할 수 있다(전체 표준화 접근법). 하기 실시예에서는 중심 표준화 전략을 사용하였는데, 이것은 표준화를 위해 임상적 성과와의 상관성 부족에 기초하여 선택된 스크리닝된 유전자의 서브세트를 이용하였다.An important aspect of the present invention is to provide prognostic information using measured expression of a particular gene by gastric cancer tissue. For this purpose, it is essential to calibrate (standardize) the amount of assayed RNA, variations in RNA quality used, and differences in other factors, such as machine and operator differences. Therefore, assays typically measure and incorporate the use of reference RNA, including those transcribed from known housekeeping genes such as GAPD and ACTB. Alternatively, standardization can be based on the mean or median signal (Ct) of all of the tested genes or a large number of subset thereof (full standardization approach). In the following examples, a centralized standardization strategy was used, which used a subset of screened genes selected based on lack of correlation with clinical outcomes for standardization.

용어, "트레이닝 세트(Training set)"라 함은 예후에 통계적으로 유의한 RNA 전사체를 추출한 대상 샘플을 의미한다. The term "training set" means a subject sample from which a statistically significant RNA transcript is extracted for a prognosis.

용어, "검증 세트(Validation set)" 또는 "테스트 세트(Test set)"라 함은 상기 추출된 변수가 실제로 예후의 좋고 나쁨을 판단할 수 있는 정확도를 테스트하는 세트를 의미한다. 이러한 방법을 사용하는 이유는 특정 샘플 군에서만 효과적으로 예후 판단 능력이 있는 것이 아니라 독립적 샘플에서도 효능이 있는 것을 판단하기 위해서이다.
The term "validation set" or "test set" refers to a set that tests the accuracy with which the extracted variables can actually be judged to be good or bad. The reason for using this method is that it is not only effective in determining the prognosis in a specific sample group but also is effective in an independent sample.

4. 재발에 대한 예후 지표 및 이들의 응용4. Prognostic indicators for recurrence and their application

위암 재발의 가능성에 대한 암 예후 방법을 구분 짓는 연산의 특징은 1) 재발 가능성을 측정하는데 사용된 독특한 시험 mRNAs 세트 (또는 대응하는 유전자 발현 생성물), 2) 발현 데이터를 식으로 합치는데 사용된 특정 가중치, 및 3) 환자들을 상이한 수준의 위험을 갖는 군, 예를 들면 저, 중간 및 고 위험 군으로 나누는데 사용된 역치를 포함한다. 이 연산은 수치적인 위험도 점수(RS) 및 RS(%)를 산출해낸다.Characteristics of the cancer prognostic method for the likelihood of recurrence of gastric cancer are characterized by 1) a set of unique test mRNAs (or corresponding gene expression products) used to measure the likelihood of recurrence, 2) the specific Weight, and 3) thresholds used to divide patients into groups of different levels of risk, such as low, middle, and high risk groups. This computation yields numerical risk scores (RS) and RS (%).

시험은 명시된 mRNA 또는 이들의 발현 생성물의 수준을 측정하기 위한 실험실 검정시험을 필요로 하지만, 신선한 조직이거나 또는 냉동된 조직, 또는 이미 반드시 환자들로부터 수집되어 보관되어 있는 고정되어 파라핀에 매립된 종양 생검 시험편을 매우 소량으로 이용할 수 있다. 따라서, 시험은 비침입성일 수 있다. 예를 들면 코어 생검 또는 미세침 흡인을 통해 수확된 종양 조직의 몇 가지 상이한 방법들과 상용성이기도 하다. 이 방법에 따르면, 암 위험도 점수(RS)는The tests require a laboratory test to determine the level of the indicated mRNA or their expression products, but may be either fresh or frozen tissue, or a fixed, paraffin-embedded tumor biopsy The test piece can be used in very small amounts. Thus, the test may be non-invasive. It is also compatible with several different methods of tumor tissue harvested, for example, through a core biopsy or fine needle aspiration. According to this method, the cancer risk score (RS)

(a) 상기 대상체로부터 얻은 암 세포를 포함하는 생물학적 샘플로 유전자 또는 단백질 발현 프로파일을 작성하고;(a) creating a gene or protein expression profile with a biological sample comprising cancer cells obtained from said subject;

(b) 다수개의 개별 유전자의 발현도 [즉, mRNA 수준]를 정량화하여 각 유전자에 대한 발현 값을 정하고;(b) Quantifying the expression level of a plurality of individual genes (i.e., mRNA level) to determine the expression value for each gene;

(c) 각각 암-관련 생물학적 함수에 의해 및(또는) 동시발현에 의해 연결된 유전자들에 대한 발현 값을 포함하는, 유전자 발현 값들의 서브세트를 생성시키고;(c) generating a subset of gene expression values, each comprising an expression value for genes linked by a cancer-related biological function and / or by co-expression;

(d) 한 서브세트 내의 각 유전자의 발현도에 상기 서브세트 내에서의 그의 암 재발 반응에 대한 상대적 기여도를 반영하는 계수를 곱하고 곱한 값을 더하여 상기 서브세트에 대한 값을 산출하고;(d) multiplying the expression of each gene in a subset by a coefficient that reflects a relative contribution to its cancer recurrence in said subset, and multiplying that value to yield a value for said subset;

(e) 각 서브세트의 그 값에 그의 암 재발 반응에 대한 기여도를 반영하는 계수를 곱하고; (e) multiplying the value of each subset by a coefficient that reflects its contribution to cancer recurrence;

(f) 상기 계수를 곱한 각 서브세트에 대한 값들의 합을 구하여 위험도 점수(RS) 및 RS(%)를 얻음으로써 결정되는데, 여기서, 암 재발과 선형 상관관계를 보이지 않는 각 서브세트의 기여도는 단지 소정의 역치 값 이상에서만 포함되고,(f) obtaining the sum of the values for each subset multiplied by the coefficient to obtain a risk score (RS) and an RS (%), where the contribution of each subset not exhibiting a linear correlation with cancer recurrence Is included only at or above a predetermined threshold value,

명시된 유전자의 증가된 발현이 암 재발 위험을 감소시키는 서브세트에는 음의 값을 부여하고, 명시된 유전자의 발현이 암 재발 위험을 증가시키는 서브세트에는 양의 값을 부여한다.Negative values are assigned to subsets in which the increased expression of a given gene reduces the risk of cancer recurrence and a positive value is given to a subset in which the expression of a specified gene increases the risk of cancer recurrence.

구체적인 실시태양에서, RS 및 RS(%)는In a specific embodiment, RS and RS (%) are

(a) CSK, CD8A 및 CHEK2의 RNA 전사체의 발현도를 측정하고,(a) The expression levels of the RNA transcripts of CSK, CD8A and CHEK2 were measured,

(b) 하기 수학식 1 및 2에 의해 RS 및 RS(%)를 계산함으로써 결정된다:(b) RS and RS (%) by the following equations 1 and 2:

[수학식 1][Equation 1]

[수학식 2]&Quot; (2) "

상기 식에서, In this formula,

여기서, RS(%) 값이 75% 이상이면 나쁜 예후이고, 25% 미만이면 좋은 예후인 것으로 판단한다.
If the value of RS (%) is 75% or more, it is a bad prognosis. If it is less than 25%, it is judged that it is a good prognosis.

이하, 본 발명을 실시예에 의해 상세히 설명한다. 단, 하기 실시예는 본 발명을 예시하는 것일 뿐, 본 발명의 내용이 하기 실시예에 한정되는 것은 아니다.
Hereinafter, the present invention will be described in detail by way of examples. However, the following examples are illustrative of the present invention, and the present invention is not limited to the following examples.

<제조예> 예후 예측 대상 선정 및 실험설계&Lt; Production example >

예후 예측 대상 선정을 위해, 2001년 내지 2004년까지 연세대학교 세브란스 병원에서 일차 치료로 위 절제술을 받은 위암 환자들(YUSH, n=520)에서 종양 표본 및 임상 데이터를 얻었다. 1기에서 4기까지 대상으로 하였으나 주로 2기-3기에 집중되어 있었다(도 1). 모든 샘플들은 환자로부터 상세하게 기재된 동의서를 받은 후 수집하였고, 연구는 연세대학교 세브란스 병원의 조사윤리위원회의 승인을 받았다. 임상 데이터는 소급하여 얻었다. 전체생존기간은 수술부터 사망에 이르는 시간으로 정하였고, 데이터는 환자가 마지막 접촉 시 살아있을 때 검열을 받은 것으로 간주하였다. Tumor specimens and clinical data were obtained from gastric cancer patients (YUSH, n = 520) who underwent gastrectomy as primary treatment at Yonsei University Severance Hospital between 2001 and 2004. 1 to 4, but mainly in 2 and 3 (Fig. 1). All samples were collected after receiving detailed written consent from the patient, and the study was approved by the Research Ethics Committee at Severance Hospital, Yonsei University. Clinical data were obtained retrospectively. Overall survival was defined as the time from surgery to death, and the data were considered censored when the patient was alive at the last contact.

표 1은 위암 2-3기에서 RNA 전사체를 이용하여 예후 지표를 산출하기 위한 대상 유전자군(44개)과 보정을 위한 레퍼런스 유전자(10개)를 나열한 것이다.Table 1 lists the target genes (44) and the reference genes for correction (10) for the prognostic index using RNA transcripts in gastric cancer 2-3.

상기 유전자들의 mRNA 종들의 수준을 RT-PCR로 측정하였다. mRNA를 마스터퓨어TM RNA 정제 키트 (에피센터 테크놀로지스(Epicentre Technologies))를 사용하여 추출하고 리보그린(RiboGreen)^® 형광 방법(Molecular probes)에 의해 정량화하였다. 정량적 유전자 발현의 분자 검정시험을 ROCHE 480을 사용하여 수행하였다. ROCHE 480은 써모사이클러, 레이저, 전하-커플링된 장치(CCD), 카메라 및 컴퓨터로 이루어진다. 시스템은 샘플을 써모사이클러 상에서 384-웰 포맷으로 증폭시켰다. 증폭 동안, 레이저-유도 형광 신호가 모든 384-웰에 대하여 실시간으로 수집되고 CCD에서 검출되었다. 시스템은 기기를 실행하기 위한 및 데이터를 분석하기 위한 소프트웨어를 포함한다.The level of mRNA species of the genes was measured by RT-PCR. mRNA is extracted using the master Pure TM RNA purification kit (Epicenter Technologies (Epicentre Technologies)), which was quantified by a revolving green (RiboGreen) ^® Fluorescence method (Molecular probes). Molecular assay of quantitative gene expression was performed using ROCHE 480. The ROCHE 480 consists of a thermocycler, a laser, a charge-coupled device (CCD), a camera and a computer. The system amplified the samples in a 384-well format on a thermocycler. During amplification, a laser-induced fluorescence signal was collected in real time for all 384-wells and detected on the CCD. The system includes software for running the instrument and for analyzing the data.

각 환자에 대한 역치 사이클(CT) 값을 임상적 성과와의 상관성 부족에 기초하여 선택된, 특정 환자에 대한 스크리닝된 유전자의 한 서브세트의 평균에 기초하여 표준화하였다(중심 표준화 전략). The threshold value cycle (CT) values for each patient were standardized based on an average of a subset of screened genes for a particular patient selected based on a lack of correlation with clinical outcome (centralized standardization strategy).

유전자 발현을 분석하기 위한 보정 수단으로 레퍼런스 유전자를 활용하는 것이 상례이므로, "Normfiner" 프로그램을 사용하여 10개의 레퍼런스 유전자 중에서 6개의 유전자를 활용하여 표준화 작업을 수행하였다. 표준화에 사용된 ATP5E, GPX1, PGK1, UBB, VDAC2, ACTB이다(도 2).Since it is common practice to use reference genes as a means of calibrating gene expression, the "Normfiner" program was used to standardize six of the 10 reference genes. ATP5E, GPX1, PGK1, UBB, VDAC2, and ACTB used for standardization (Fig. 2).

qRT-PCR에 사용된 프라이머는 미국 특허 제7526387호의 표 1 및 2에 기재된 것을 사용하였다. The primers used in qRT-PCR were those listed in Tables 1 and 2 of U.S. Patent No. 7526387.

예후 예측 모형을 만들기 위해, 클러스터 분석은 Cluster and Treeview(http://rana.lbl.gov/EigenSoftware.htm)에 따라 수행하였다. 자율 클러스터링 분석은 연속 분산 여과 후 수행하였고, 2개의 주요 클러스터로 이루어진 2개의 분류군의 예후적 차이는 log rank test과 Kaplan Meier Plot에 따라 시험하였다.
To construct a prognostic prediction model, cluster analysis was performed according to Cluster and Treeview ( http://rana.lbl.gov/EigenSoftware.htm ). Autonomous clustering analysis was performed after continuous dispersion filtration and the prognostic differences of the two taxa of two major clusters were tested according to the log rank test and the Kaplan Meier plot.

<실시예 1> 위암 환자들의 유전자 발현 프로파일 조사<Example 1> Gene expression profile of gastric cancer patients

도 3은 520개 샘플을 qPCR한 결과의 히트맵을 나타낸 것으로, 붉은색은 저 발현, 녹색 계열은 고발현을 나타낸 것이며, 520개 샘플은 2개의 예후가 좋은 군(CL2)과 나쁜 군(CL1)으로 나눌 수 있는 것을 볼 수 있었다. Kaplan Meier Plot과 log rank test에서 p-값이 0.0233 이 나온 것을 알 수 있었다.
FIG. 3 shows a heat map of the results of qPCR of 520 samples, showing low expression in red color and high expression in green line, and 520 samples showed good prognosis (CL2) and bad prognosis (CL1 ). Kaplan Meier Plot and log rank test showed a p-value of 0.0233.

<실시예 2> BRB-Array Tools를 이용한 생존 위험 예측(PCA 분석)Example 2: Survival risk prediction using BRB-Array Tools (PCA analysis)

하기 표 2는 위험 군의 분류에 사용된 유전자이며, BRB ArrayTools Version 4.1(http://linus.nic.nih.gov./BRB-ArrayTools.html)를 사용하여 PCA(Principal Component Analysis)를 통하여 예후 지표(Prognostic Index)를 계산하였다. 즉 wi 값과 유전자의 중간값화 된 발현수치를 곱하여 아래의 산식으로 산출한 값을 두 계층으로 나눈 것이다. Table 2 below shows the genes used in the classification of the risk groups and is analyzed by PCA (Principal Component Analysis) using BRB ArrayTools Version 4.1 ( http://linus.nic.nih.gov./BRB-ArrayTools.html ) The Prognostic Index was calculated. That is, the wi value is multiplied by the intermediate expression value of the gene and the value calculated by the following formula is divided into two layers.

∑ _i w _i x _i - 0.007356Σ _i w _i x _i - 0.007356

도 4는 BRB-Array Tools를 이용한 PCA 분석에 따라 생존 위험 예측 결과를 나타낸 것이다.FIG. 4 shows the survival risk prediction result according to the PCA analysis using the BRB-Array Tools.

상기 산식에 따른 이 모델의 정확도 측정을 위해 CCP(Compound Covariate Predictor), LDA(Diagonal Linear, Discriminant Analysis), 1-NN(1-Nearest Neighbor), 3-NN(3-Nearest Neighbors), NC(Nearest Centroid), SVM(Support Vector Machines), BCCP(Bayesian Compound Covariate Predictor)로 실시한 클래스 예측 결과는 아래와 같았다(표 3. 도 5).In order to measure the accuracy of this model according to the above formula, a compound covariate predictor (CCP), a diagonal linear discriminant analysis (LDA), a 1-nearest neighbor (NN), a 3-nearest neighbors Centroid), SVM (Support Vector Machines), and BCCP (Bayesian Compound Covariate Predictor).

p-값p-value % CV Support% CV Support Weights (wi)Weights (wi) 명칭designation 심볼symbol 1One 0.001730.00173 100100 0.119466 0.119466 CSKCSK CSKCSK 22 0.00127820.0012782 100100 0.1126310.112631 CD8ACD8A CD8ACD8A 33 0.0143590.014359 9090 0.1060450.106045 CHEK2CHEK2 CHEK2CHEK2

상기 예후 지표를 사용하여 상위 50% 하위 50%의 두 계층으로 나눈 결과 도 5와 같이 각 예측 모델 별로 CCP, LDA, 1-NN, 3-NN, NC, SMV, BCCP에서 구분되는 것을 알 수 있다. Using the above prognostic index, the results of dividing into two groups of the upper 50% and lower 50% are shown as CCP, LDA, 1-NN, 3-NN, NC, SMV and BCCP for each prediction model as shown in FIG. 5 .

CCPCCP LDALDA 1-NN1-NN 3-NN3-NN NCNC SVMSVM BCCPBCCP 7676 8080 8383 8282 7878 9292 8989

<실시예 3> BRB-Array Tools를 이용한 생존 위험 예측(HR 분석)Example 3 Prediction of Survival Risk Using BRB-Array Tools (HR Analysis)

도 3의 히트맵과 같이 유전자 발현을 콕스 회귀 분석(Univariate Cox's proportional harzard test)을 실행하여 3개의 유전자를 선정하였다. 사용 툴은 BRB ArrayTools Version 4.1(http://linus.nic.nih.gov./BRB-ArrayTools.html)를 사용하였고, 이를 이용하여 위험 계수(Hazard ratio)와 관련된 알고리즘을 설정하였다.As shown in the heat map of FIG. 3, three genes were selected by performing a Cox's proportional harzard test on gene expression. The tool used was BRB ArrayTools Version 4.1 ( http://linus.nic.nih.gov./BRB-ArrayTools.html ), which was used to set the algorithm related to the hazard ratio.

qPCR에 의한 예후 위험도 평점 시스템은 표준화된 유전자 발현량을 RG(Relative Quantity)로 변환하고, RQ로 변화된 유전자 발현량(Cq) value를 콕스 회귀 분석에 따라 위험 계수(hazard ratio, HR)를 구하였다. 위험 계수를 -1/HR로 변환하고, 유전자 발현량을 곱하여 총합(RS, Risk Score)을 산출하였다(컷 오프: -23 이하는 -23으로 간주함). 위험도 점수의 백분율(percentile risk score)은 다음 식에 따라 계산하였다:The prognostic risk rating system by qPCR converts the standardized gene expression level to RG (Relative Quantity) and calculates the hazard ratio (HR) by Cox regression analysis of the gene expression amount (Cq) value changed by RQ . The risk coefficient was converted to -1 / HR and multiplied by the amount of gene expression to calculate the sum (RS, Risk Score) (cut-off: -23 or less is considered to be -23). The percentile risk score was calculated according to the following formula:

RS(%)=100×(샘플의 RS - 모집단의 RS 최소값)/(모집단의 RS 최고값 - 모집단의 RS 최소값)RS (%) = 100 × (RS minimum value of the sample) / (RS maximum value of the population - RS minimum value of the population)

75% 이상의 RS(%)을 갖는 샘플은 고 위험군으로 분류하고, 25% 이상 내지 75% 미만의 RS(%)을 갖는 샘플은 중간 위험군으로 분류하였다. 마지막으로 25% 미만의 RS(%)을 갖는 샘플은 저 위험군으로 분류하였다.Samples with greater than 75% RS (%) were classified as high risk and samples with RS (%) ranging from 25% to less than 75% were classified as intermediate risk. Finally, samples with less than 25% RS (%) were classified as low risk.

상기 표 4는 위험 군의 분류에 사용된 유전자 및 이의 위험 계수(Hazard Ratio)이다. Table 4 above shows the genes used in the classification of risk groups and their hazard coefficients.

Parametric p-value Parametric p-value Hazard Ratio Hazard Ratio UniqueID UniqueID 0.0011978 0.0011978 0.759 0.759 CSK CSK 0.0013091 0.0013091 0.847 0.847 CD8A CD8A 0.0153688 0.0153688 0.79 0.79 CHEK2 CHEK2

도 6에 나타난 바와 같이, 총 520개의 샘플은 저 위험군이 108명 중 24명이 사망하였고(전체 20% 차지함; 사망률 22.2%), 중간 위험군은 299명 중 100명이 사망하였으며(전체 57% 차지함; 사망률 33.5%), 고 위험군은 113명 중 48명이 사망(전체 21.6% 차지함; 사망률 42.5%)하였다.As shown in Figure 6, a total of 520 samples of low-risk patients (24% of total deaths) accounted for 20% of the total deaths (22.2% of deaths) 33.5%) and 48 of the 113 patients in the high-risk group died (total 21.6%, mortality rate 42.5%).

결과적으로, 위암은 분자적 특징을 이용하여 병기 결정의 가능성을 보여주었다. 위암의 FFPE 샘플에 대한 qPCR를 결과로부터 선정된 유전자 리스트에서 얻은 RS(%)를 통해 위 절제술 후 임상 결과의 예후 예측이 가능함을 확인하였다. As a result, stomach cancer has shown the possibility of staging using molecular features. From the results of qPCR for FFPE samples of stomach cancer, we confirmed that prognosis of clinical outcome is possible after gastrectomy through RS (%) obtained from selected gene list.

Claims

In a biological sample containing cancer cells from a subject
Determining the degree of expression of the RNA transcript of CSK, CD8A and CHEK2; And
(RS, Risk Score) and the RS percentage (RS (%)) of the biological sample are calculated according to the following equations (1) and (2) based on the expression level of the RNA transcript determined in the step
And determining the prognosis in terms of overall survival (OS) without recurrence of gastric cancer after surgical resection of the whole gastric cancer regardless of the TNM stage according to the RS (%). Method of providing:
[Equation 1]
RS = HR ₁ * normLogTransValue ₁ + HR ₂ * normLogTransValue ₂ + ... + HR _n * normLogTransValue _n
&Quot; (2) "
RS (%) = 100 × (RS minimum value of the biological sample) / (RS maximum value of the population - RS minimum value of the population)
In this formula,
HR _n represents the hazard ratio of the n-th RNA transcript and is converted to -1 / HR _n when HR _n is less than 1,
normLogTransValue _n is a value associated with the expression of the RNA transcript and is a value obtained by varying the scale around the median based on the total value of the gene,
The population means a certain number of groups having a whole gastric cancer unrelated to the TNM stage, and the constant number is an arbitrary integer such that the RS maximum value and the minimum value can be calculated.

delete

The method according to claim 1,
This method is considered to be a high-risk group when the RS (%) of the biological sample is more than 75% in terms of the overall survival rate (OS), an intermediate risk group in the range of 25% to less than 75%, and a low- How it is.

In a biological sample containing cancer cells from a subject
Measuring the expression levels of RNA transcripts of CSK, CD8A and CHEK2; And
The increase in the expression of the transcript is judged as an increase in the possibility of a positive clinical result in terms of overall survival (OS) without recurrence of gastric cancer after surgical resection of the entire gastric cancer regardless of the TNM stage. A method of providing information for predicting prognosis.

6. The method of claim 5,
Wherein the method is a qPCR-based method.

delete

A computer-readable recording medium recording a program for predicting the prognosis of stomach cancer, comprising: a nucleic acid sample
Determining the degree of expression of the RNA transcript of CSK, CD8A and CHEK2; And
(RS, Risk Score) and the RS percentage (RS (%)) of the sample are calculated according to the following equations (1) and (2) based on the degree of expression of RNA determined in the above step,
Patients with high-risk patients with> 75% of the RS (%) setting range in terms of overall survival (OS) without recurrence of gastric cancer after surgical resection of the entire gastric cancer, regardless of TNM stage, %, &Lt; / RTI > a low risk group if less than < RTI ID = 0.0 > 25% < / RTI >
[Equation 1]
RS = HR ₁ * normLogTransValue ₁ + HR ₂ * normLogTransValue ₂ + ... + HR _n * normLogTransValue _n
&Quot; (2) "
RS (%) = 100 × (RS minimum value of the RS in the sample) / (RS maximum value in the population - RS minimum value in the population)
In this formula,
HR _n represents the hazard ratio of the n-th RNA transcript and is converted to -1 / HR _n when HR _n is less than 1,
normLogTransValue _n is a value associated with the expression of the RNA transcript and is a value obtained by varying the scale around the median based on the total value of the gene,
The population means a certain number of groups having a whole gastric cancer unrelated to the TNM stage, and the constant number is an arbitrary integer such that the RS maximum value and the minimum value can be calculated.

delete