KR101825369B1 - Genetic Bio-marker for Predicting Prognosis in Cancer and Use thereof - Google Patents
Genetic Bio-marker for Predicting Prognosis in Cancer and Use thereof Download PDFInfo
- Publication number
- KR101825369B1 KR101825369B1 KR1020150084070A KR20150084070A KR101825369B1 KR 101825369 B1 KR101825369 B1 KR 101825369B1 KR 1020150084070 A KR1020150084070 A KR 1020150084070A KR 20150084070 A KR20150084070 A KR 20150084070A KR 101825369 B1 KR101825369 B1 KR 101825369B1
- Authority
- KR
- South Korea
- Prior art keywords
- cancer
- gene
- prognosis
- expression
- genes
- Prior art date
Links
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/574—Immunoassay; Biospecific binding assay; Materials therefor for cancer
- G01N33/57407—Specifically defined cancers
- G01N33/57423—Specifically defined cancers of lung
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/574—Immunoassay; Biospecific binding assay; Materials therefor for cancer
- G01N33/57407—Specifically defined cancers
- G01N33/57438—Specifically defined cancers of liver, pancreas or kidney
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/118—Prognosis of disease development
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2800/00—Detection or diagnosis of diseases
- G01N2800/52—Predicting or monitoring the response to treatment, e.g. for selection of therapy based on assay results in personalised medicine; Prognosis
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Immunology (AREA)
- Engineering & Computer Science (AREA)
- Urology & Nephrology (AREA)
- Molecular Biology (AREA)
- Hematology (AREA)
- Pathology (AREA)
- Analytical Chemistry (AREA)
- Biomedical Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Hospice & Palliative Care (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Oncology (AREA)
- Biochemistry (AREA)
- Medicinal Chemistry (AREA)
- General Physics & Mathematics (AREA)
- Food Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Cell Biology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Gastroenterology & Hepatology (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The present invention relates to a genetic marker for judging the classification or prognosis of cancer, and a method of classifying cancer and determining a prognosis using the same. The biomarker for determining cancer classification or prognosis according to the present invention is one or more genes selected from NCAPH, CDCA5, CDCA8, SPC25, DLGAP5, NUSAP1, NEK2, CCNB1, KIAA1524 and ZWINT. By measuring the expression level of the genes of the present invention, it is possible to classify the cancer patients and determine their prognosis. In addition, the genes of the present invention can be usefully used to select a therapeutic agent effective for cancer patients and establish a treatment plan.
Description
The present invention relates to gene biomarkers for cancer prognosis determination and uses thereof.
Tumors of different patients are not identical at the molecular or morphological and pathological levels. These characteristics are usually referred to as inter-tumor heterogeneity (1-3). The causes of different patient tumors having different characteristics are as follows: (1) the location of the origin or organ of different tumor cells; (2) a variety of biological factors that may affect the development or progression of cancer, such as the sex, age, or condition of the hormone; And (3) genetic diversity between hosts (4-6). Recent evidence suggests that some evidence for deeper levels of tumor heterogeneity, ie, intra-tumor heterogeneity, within a single patient has been reported (7-10). Thus, studies of heterogeneity in tumors have received attention as the latest research field in cancer genomics (11-14). Because tumors composed of distinct subpopulations can vary in sensitivity to cancer treatments, it is important to understand the degree and extent of tumor heterogeneity within tumors (15-17). The presence of a resistant subgroup containing different genetic mutations can determine whether specific treatment methods for subgroups of patients are effective. Therefore, even single patients require profound temporal or spatial examinations of tumor cells. Indeed, the increased diversity of sub-clones within tumors is associated with an increased risk of progression from Barrett's esophagus to malignant tumors (18-19).
The surprising and rapid development of mass-parallel sequencing, also referred to as next generation sequencing (NGS), has dramatically elucidated the extent of tumor heterogeneity and has resulted in hundreds of cancer-causing somatic mutations and structural variants (20-22). Several researchers have conducted studies on tumor heterogeneity using multi-region sequencing approaches based on NGS techniques (7, 23-24). However, recent research reports have shown that Hiley et al. [25] suggest that single cell sequencing is indispensable for the complete identification of the actual number of subclones present in tumors. This is because multi-region sequencing methods can not separate the constituents of subclones present in these tumors, because the regions of the tumor are a collection of numerous cells with different genetic conditions.
The patent documents and references cited herein are hereby incorporated by reference to the same extent as if each reference was individually and clearly identified by reference.
The present inventors have sought to identify biomarkers capable of predicting the therapeutic prognosis of cancer patients through the study of tumor heterogeneity in cancer. As a result, it was confirmed that correlated genes expressed in a single cell of 34 cancer-derived xenograft cancer can be identified and the therapeutic prognosis of cancer can be predicted by measuring the expression levels of these correlated genes. .
Accordingly, an object of the present invention is to provide a composition for judging the classification or prognosis of cancer.
Another object of the present invention is to provide a kit for judging the classification or prognosis of cancer.
It is still another object of the present invention to provide a biomarker for judging the classification or prognosis of cancer.
It is still another object of the present invention to provide a method for measuring the expression status of a biomarker gene for the classification or prognosis of cancer according to the present invention from a biological sample so as to provide information necessary for classification or prognosis of cancer.
Other objects and technical features of the present invention will be described in more detail with reference to the following detailed description, claims and drawings.
According to one aspect of the present invention there is provided a cancer agent comprising an agent capable of measuring the expression status of at least one gene selected from the group consisting of NCAPH, CDCA5, CDCA8, SPC25, DLGAP5, NUSAP1, NEK2, CCNB1, KIAA1524 and ZWINT For the classification or prognosis determination.
As used herein, the term " expression state of a gene " refers to a gene expression level (e.g., mRNA level or protein level), an activity level of a protein expressed from the gene, a mutation of the gene, Is meant to include the methylation state of the gene, preferably the expression level of the gene or the activity level of the protein expressed from the gene, and most preferably the expression level of the gene.
The term " classifying " as used herein is meant to include determining clinically relevant characteristics in cancer or determining a particular prognosis of a cancer patient. Classification of such cancers includes, but is not limited to: (i) assessing the likelihood of metastasis to another organ, or the risk of recurrence; (Ii) evaluation of stage of cancer; (Iii) assessment of the prognosis of cancer patients without treatment; (Iv) evaluation of the responsiveness of cancer patients to treatment methods (chemotherapy, radiation therapy, surgical procedures); (V) diagnosis of the actual cancer patient's response to current or past treatment methods; (Vi) selection of the preferred method for the treatment of cancer patients; And (iii) prognosis for the life expectancy of cancer patients (eg prognosis for overall survival).
Therefore, the term " cancer classification " in the present invention has some equivalents to " cancer prognosis determination " and can be used in combination within the overlapping meanings.
The genes used as cancer biomarkers in the present invention are one or more genes selected from the group consisting of the following genes: CCNB1 (cyclin B1), CDCA8 (cell division cycle associated 8), CDCA5 (cell division cycle associated 5), DLGAP5 (NSCA-related kinase 2), NUSAP1 (nucleolar and spindle-associated protein 1), SPC25 (NSC-1) SPC25, NDC80 kinetochore complex component), ZWINT (ZW10 interacting kinetochore protein).
According to one embodiment of the present invention, the cancer is lung cancer, kidney cancer, or liver cancer.
According to another embodiment of the present invention, the cancer is lung cancer, and the lung cancer includes small cell lung cancer and non-small cell lung cancer, preferably non-small cell lung cancer.
In the present invention, the non-small cell lung cancer includes, but is not limited to, lung adenocarcinoma (LADC), lung squamous cell carcinoma (LUSC), bronchopulmonary cancer.
In the present invention, the expression level of the genes can be measured to determine the classification of cancer or the prognosis of cancer.
According to another embodiment of the present invention, when the expression level of the genes is up-regulated, it is judged to be a bad prognosis in relation to the prognosis of cancer. In relation to the classification of cancer, the possibility of metastasis of cancer to other organs, It is evaluated as a high risk, highly advanced cancer stage and the life expectancy of the patient is considered to be short.
According to another embodiment of the present invention, when the level of expression of the genes is down-regulated, it is judged to be a good prognosis in relation to the prognosis of cancer. Regarding the classification of cancer, the possibility of metastasis or recurrence of cancer to other organs The risk is low, it is evaluated as early cancer stage, and the life expectancy of the patient is considered to be long.
As used herein, the term " upregulation " means a level of expression of the genes of the present invention above a normal expression level, and " downregulation " Which means a lower level.
In the present invention, the term " agent capable of measuring the expression state of a gene " means that (i) a nucleic acid sequence encoding the gene or a nucleic acid probe capable of hybridizing to the sequence complementary thereto, (ii) (Iii) an antibody capable of immunologically binding to a protein, polypeptide, or fragment thereof encoded by said gene, or a primer set or primer set capable of amplifying all of said antibody But are not limited to, fragments.
Methods for measuring the level of expression of a particular gene are known to those skilled in the art and include, for example, microarray analysis (e.g., mRNA or microRNA expression analysis or copy number analysis), quantitative real time PCR (e.g., "qRT- TaqMan (TM)), immunological assays (e.g., ELISA, immunohistochemistry), and the like.
In the present invention, the activity level of a protein or a polypeptide encoded by the genes is also used in a sense equivalent to the expression level of the gene. For example, a high activity level of a protein or polypeptide means a high expression level, and a low activity level means a low expression level.
According to another aspect of the present invention there is provided a cancer agent comprising an agent capable of measuring the expression status of one or more genes selected from the group consisting of NCAPH, CDCA5, CDCA8, SPC25, DLGAP5, NUSAP1, NEK2, CCNB1, KIAA1524 and ZWINT And a kit for judging the prognosis.
According to one embodiment of the present invention, in the kit of the present invention, the cancer is cancer, lung cancer, kidney cancer or liver cancer.
According to another embodiment of the present invention, the cancer is lung cancer, and the lung cancer includes small cell lung cancer and non-small cell lung cancer, preferably non-small cell lung cancer.
In the present invention, the non-small cell lung cancer includes, but is not limited to, lung adenocarcinoma, squamous cell carcinoma of the lung, bronchopulmonary cancer.
The kit of the present invention may comprise a carrier for various components, such as, for example, a container such as a bag, box, tube, rack, But are not limited to, a support.
As described above, the kit of the present invention is an agent capable of measuring the expression state of the gene, comprising (i) a nucleic acid probe capable of hybridizing to the nucleic acid sequence encoding the gene or a complementary sequence thereof, (ii) a primer or a primer set capable of amplifying a part or the whole of the gene, (iii) a protein, polypeptide, or fragment thereof immunologically bound Or fragments of such antibodies.
The nucleic acid probe or primer may be labeled with a suitable detection marker. The detection markers include, but are not limited to, radioactive isotopes, fluorophores, biotin, enzymes (e.g., alkaline phosphatase), enzyme substrates, ligands and antibodies. For further details, see Jablonski et al., Nucleic Acids Res., 14: 6115-6128 (1986); Nguyen et al., Biotechniques, 13: 116-123 (1992); Rigby et al., J. Mol. Biol., 113: 237-251 (1977), the disclosures of which are incorporated herein by reference.
The kit of the present invention may comprise various components. But are not limited to, for example, Taq polymerase, deoxyribonucleotides, dideoxyribonucleotides, other suitable primers capable of amplifying the target DNA sequence, RNase A, and the like.
The kit of the present invention may preferably include instructions for using the kit of the present invention to determine the classification of cancer or prognosis of cancer.
According to another embodiment of the present invention there is provided a method of screening for a cancer classifying or cancer comprising one or more genes selected from the group consisting of NCAPH, CDCA5, CDCA8, SPC25, DLGAP5, NUSAP1, NEK2, CCNB1, KIAA1524 and ZWINT Provide a biomarker for prognosis.
According to another aspect of the present invention there is provided a method of screening for cancer in a mammal comprising the steps of contacting a biological sample with NCAPH, CDCA5, CDCA8, SPC25, DLGAP5, NUSAP1, NEK2, CCNB1, KIAA1524 and ZWINT The expression level of the at least one gene selected from the group consisting of < RTI ID = 0.0 >
According to an embodiment of the present invention, the cancer is lung cancer, kidney cancer or liver cancer.
According to another embodiment of the present invention, the cancer is lung cancer, and the lung cancer includes small cell lung cancer and non-small cell lung cancer, preferably non-small cell lung cancer.
In the present invention, the non-small cell lung cancer includes, but is not limited to, lung adenocarcinoma, squamous cell carcinoma of the lung, bronchopulmonary cancer.
According to one embodiment of the present invention, in the kit of the present invention, the biological sample is a tissue, a whole blood, a serum, a plasma, a body fluid, a urine, a cell, a cell lysate or a supernatant of a cell culture.
The features and advantages of the present invention are summarized as follows.
(i) The present invention relates to a gene marker for judging the classification of cancer or prognosis of cancer, and a method of classifying cancer and determining prognosis using the same.
(Ii) The biomarker for the classification or prognosis of cancer in the present invention is at least one gene selected from NCAPH, CDCA5, CDCA8, SPC25, DLGAP5, NUSAP1, NEK2, CCNB1, KIAA1524 and ZWINT.
(Iii) The genes of the present invention are expressed in cancer patients in a jointly regulated expression level, and cancer patients can be grouped according to their expression levels.
(Vi) Cancer patients grouped according to the expression levels of the genes of the present invention are clearly distinguished in prognosis.
(v) Therefore, by measuring the expression levels of the genes of the present invention, it is possible to classify cancer patients and determine their prognosis, and are useful for selecting therapeutic agents effective for cancer patients.
(Vi) Furthermore, the genes of the present invention can be usefully used for selecting a therapeutic agent effective for cancer patients and establishing a treatment plan.
The present invention relates to a gene marker for judging the classification of cancer or prognosis of cancer, and a method of classifying cancer and judging prognosis using the same. The biomarker for the classification of cancer or prognosis of cancer of the present invention is one or more genes selected from NCAPH, CDCA5, CDCA8, SPC25, DLGAP5, NUSAP1, NEK2, CCNB1, KIAA1524 and ZWINT. By measuring the expression level of the genes of the present invention, it is possible to classify the cancer patients and determine their prognosis. In addition, the genes of the present invention can be usefully used to select a therapeutic agent effective for cancer patients and establish a treatment plan.
FIG. 1 schematically shows an entire analysis process of the experiment of the present invention. The upper part shows preparation of a single cell and the lower part shows the process of removing the transcriptome of PDX-LADC single cells.
Figure 2 shows the type and expression correlation of tumor tissue according to the experimental step. Panel A shows gene expression correlations with tumor tissue types and shows Pearson correlation coefficients between tumor tissues. Panel B shows a box plot analysis of Pearson correlation coefficients between groups.
Figure 3 shows the expression correlations between pT and bulkT improved through data cleaning. Panel A shows the results before data cleaning. Panel B shows the results after data cleaning.
Figure 4 shows two different genes with a simple expression pattern. Panel A shows the simple expression of genes expressed in 1-2 single cells. In this case, the FPKM values of the remaining cells are " 0 ". Panel B shows the housekeeping expression of a gene that is variably expressed between -3 and 3
Figure 5 shows the expression of G64 over 34 single cells. The
FIG. 6 shows hierarchical clustering analysis combined with heat map of 34 single cells using the G64 gene set and PCA analysis results thereof. Panel A shows the hierarchical clustering analysis combined with a heat map and Panel B shows the results of the PCA analysis.
Figure 7 shows the hierarchical clustering analysis combined with the heat map of 488 LADC tumor samples using the G64 gene set and the PCA analysis results thereof. Panel A shows the hierarchical clustering analysis combined with a heat map and Panel B shows the PCA analysis.
Figure 8 shows the hierarchical clustering analysis combined with the heat map of 488 LADC tumor samples using the G10 gene set and the results of PCA analysis thereof. Panel A shows the hierarchical clustering analysis combined with a heat map and Panel B shows the results of the PCA analysis.
FIG. 9 shows survival analysis results of patients providing 488 LADC samples according to the degree of G10 expression. Panel A shows the survival rates of the C1 and C3 groups. Panel B shows the survival rates of C1 and C2, C1 and C3, and C2 and C3.
FIG. 10 shows a hierarchical clustering analysis combined with a heat map of 289 kidney cancer (KIRP) tumor samples using the G10 gene set, a PCA analysis thereof, and a comparison of the survival rates of the C1 group and the C3 group. Panel A shows a hierarchical clustering analysis combined with a heat map. Panel B shows the results of the PCA analysis. Panel C shows the survival rates of groups C1 and C3.
FIG. 11 shows a hierarchical clustering analysis of a heat map of 351 liver cancer (LIHC) tumor samples using the G10 gene set, a PCA analysis thereof, and a comparison of the survival rates of the C1 group and the C3 group. Panel A shows a hierarchical clustering analysis combined with a heat map. Panel B shows the results of the PCA analysis. Panel C shows the survival rates of groups C1 and C3.
Example
Materials and Methods
1. Data Cleaning
Among the 23,284 genes as the first transcriptome of the assay, FPKM values were found to be " 0 " over 34 patient-derived xenograft lung adenocarcinoma single cells (PDX-LADC single cells) 9,455 genes were selected (34-36). In order to solve the infinity problem arising from the process of calculating the change in expression level for 9,455 genes, all 0 to 0.1 FPKM values were converted to 0.1.
2. Identification of correlated genes
After the data was purified as described above, cell-cell correlations related to expression of each gene were estimated through Pearson's correlation analysis (PCA) using R (http: //www.r-project. org /). Cell expression pattern with a Pearson correlation coefficient (r)> 0.9 after removal of genes with high correlation coefficients expressed in simple expression or house-keeping expression patterns in PDX-LADC single cells The gene group was selected as " seed gene " and analyzed. The seed genes were used to expand the number of highly expressed genes. The expansion of the gene was performed through a correlation test with the remaining genes using the threshold value r > 0.75. As a result, 64 genes with high correlation and expression were found in 34 PDX-LADC single cells and named 'G64' (Table 1). In addition, G64 was used to further narrow the classifier gene to find a minimum gene group that can clearly divide the patient group among the 64 genes constituting G64. To do this, gene-gene correlation tests were performed on the genes in the up-regulated group and the down-regulated group, and correlation coefficients were calculated. The top 10 genes were selected by combining all the correlation coefficients calculated based on each gene, and this was named 'G10' (Table 2).
3. Removal of Experimental DEG (Differently Expressed Gene)
Single cells used to produce a single cell transcriptome were produced using a patient-derived single cell approach, which may be a xenotransplantation process, a cell culture process, All of the processes induce additional expression changes independent of tumorigenesis. Therefore, it is necessary to remove differential expression genes (DEGs) that are generated during the experiment rather than by tumorigenesis. RNA-seq transcripts from three different tissues were used: primary tumor-derived transcript (pT); Xenotransplantation tumor-derived transcript (xeonT); And a tumor cell-derived transcript (bulkT) obtained after resection from xenograft tumor tissue and cultured. The reason for removing such experimental DEGs is to remove three different types of DEGs: (1) DEGs between pT and xeonT; (2) DEGs between xeonT and bulkT; And (3) DEGs between pT and bulkT. In each of the above comparisons, the estimation of DEG was performed through Log 2-fold change for each FPKM value. In any of these comparisons, it was excluded from the further analysis below if the DEG is considered to be DEG (i.e., the
4. Data for TCGA LADC
RNA-seq data for 488 LADC patients and post-treatment survival information for LADC patients were downloaded from the TCGA website (https://tcga-data.nci.nih.gov/tcga/tcgaHome2.jsp).
5. Statistical analysis
All statistical tests and analyzes were performed using R studio (R studio, Racine 2012). The 'cor.test' function was used to calculate the Pearson correlation coefficient (http://www.rproject.org/) to determine how FPKM values are related among different cells. The 'hclust' function was used for hierarchical clustering (https://stat.ethz.ch/R-manual/R-patched/library/stats/html/hclust.html). The packages 'factoMineR' (http://factominer.free.fr) and 'rgl' (https://r-forge.r-project.org/projects/rgl/) perform principal component analysis (PCA) and And visualization of results (30). The 'survival' package combined with the Kaplan-Meier survival curve was used to estimate the survival rate of patients with LADC (http://r-forge.r-project.org). Cox proportional hazards were calculated using the 'coxph' function of survival packages (27, 28). Other scripting tasks that are required for multiple batch jobs were performed using Perl scripts.
Experiment result
1. Removal of the transcriptome of PDX-LADC single cells
The entire analysis procedure of this experiment is shown in Fig. In Fig. 1, the upper part describes the preparation of a single cell, and the lower part is the part of the process of removing the transcriptome of PDX-LADC single cells. The lack of sequencing depth is a known technical limitation in the single-cell based sequencing approach (26, 27, 28, 29, 31-33). In addition, many genes differentially expressed in 34 normal LADC single cells compared with single cells are expressed not by tumor formation but rather by experimental procedures for preparing single cells (for example, xenotransplantation processes or cells Culture process). mN, pT, xenoT and bulkT are the masses obtained by collecting and matched normal ',' primary tumor ',' xenograft tumor 'and' Of tumors. &Quot; The correlation between pT and mN expression correlates with the expression relationship between pT and xeonT or between pT and bulkT when the correlation between the samples is compared before the removal of the gene showing the change by the experimental procedure (Panel B in Fig. 2). Therefore, in order to eliminate the above errors, the genes showing the expression changes by xenotransplantation or cell culture process were excluded from the later analysis. As a result of the extensive cleaning process as described above, only 9455 genes remained in 23284 genes, and the expression correlation between pT and bulkT was significantly improved from 0.44 to 0.92 (Pearson correlation coefficient (r) 3).
2. Identification of highly correlated gene sets in 34 single cells
The purpose of this experiment is to understand the degree of intra-tumor heterogeneity through profiling of transcripts. Hierarchical clustering analysis of all 9455 gene expressions is not useful to reach the conclusion that it is a single cell heterogeneity. Because the Pearson correlation coefficient of the genes expressed in each cell is 0.55-0.89, the genes expressed in each cell are very similar, and there is a very large amount of noise in grouping into the clusters Because. However, the degree of single cell heterogeneity based on gene expression is the reason why the degree of single cell heterogeneity based on gene expression must be determined by the gene group, not by a single gene, which shows noisy expression across 34 single cells. High expression correlations should be estimated based on genes with complex expression patterns among single cells. This is because high cell-cell correlations determined by the expression of grouped genes can only occur by chance when genes are expressed in one or two single cells or when genes are expressed anywhere (Figure 4) . The selected genes were further screened based on the reasons described in the experimental method, and a total of 5586 genes were selected for cell-cell correlation in 34 single cells. As a result, a total of 20 genes were identified as a seed gene, for example, a gene expressed in a highly correlated manner among 34 single cells in the case of a threshold value of Pearson correlation coefficient ≥ 0.9 (Fig. 5 ). Subsequently, genes with positive correlation with each of the 20 seed genes with a Pearson correlation coefficient of at least 0.75 were searched to further collect extended sets of correlation genes. As shown in FIG. 6, a total of 64 genes were identified as an extended set of correlated genes and named 'G64' (Table 1).
3. Identification of tumor heterogeneity by a set of correlation genes
Hierarchical clustering analysis was performed to identify intra-tumor heterogeneity in 34 single cells using the G64 gene set. The analysis was performed by analyzing the separation pattern of single cells by expression of G64 gene sets. The expression level of each gene in each single cell was calculated by dividing the FPKM value in each single cell by the average FPKM value of 34 single cells and then by Log2 conversion. Interestingly, 34 single cells derived from a single LADC tumor site were divided into two subpopulations according to the degree of G64 expression (Fig. 6). That is, the two subgroups are downregulated and upregulated. These results were also confirmed by Principal Component Analysis (PCA) (Fig. 6). The present inventors have confirmed that the results classified by the G64 subgroup can also be applied to the inter-tumoral level. For this purpose, transcriptome information of 488 LADC tumor samples prepared by the RNA-seq approach was retrieved from the TCGA database. The change in the expression level of each gene in each tumor sample was calculated in the same manner as in the analysis of the 34 single cells. Hierarchical clustering analysis was then performed on 488 tumor samples combined with a heat map illustration. As a result of the analysis, surprisingly, G64, which classified single cells into two subgroups, showed the same pattern of classification for LADC samples (FIG. 7). The pattern of G64 classification into two groups was identical in the analysis using 88 Korean lung cancer sample transcripts prepared using the RNA-seq approach. These results imply that the classification into subgroups by G64 is not limited to intra-tumor levels but may extend to inter-tumor levels.
4. Excavation of minimal gene set G10 with correlation
A set of minimum genes having a correlation with cancer classification was analyzed using the obtained G64 gene set. For this purpose, gene-gene correlation tests were performed on the G64 expression-upregulated group and the G64 expression-downregulated group, respectively, and the correlation coefficient of each gene was calculated. The correlation coefficients of each gene were summed and the top 10 genes were selected. As a result, 10 genes, NCAPH, CDCA5, CDCA8, SPC25, DLGAP5, NUSAP1, NEK2, CCNB1, KIAA1524 and ZWINT were identified as a set of genes maintaining lung cancer classification function and named as 'G10' (Table 2). Based on the above results, hierarchical clustering analysis of 488 LADC tumor samples was performed using G10 in the same manner as G64. As a result of the analysis, G10 classified 488 LADC samples into three subgroups (panel A of FIG. 8). The subpopulations are downregulated in the C1 population (no expression or very low levels of expression), respectively, depending on the degree of expression of the G10 gene; A slightly downregulated C2 population (low level of expression); And upregulated C3 populations (high levels of expression). The above results were also confirmed in the PCA analysis (panel B in Fig. 8).
5. Correlation of Correlation Gene Set and Prognosis of Lung Cancer in LADC
5.1. Relationship between G64 gene set and lung cancer prognosis
We have determined whether the down-regulation or up-regulation of G64 is related to the clinical characteristics of LADC patients, such as prognosis and survival rate. For this purpose, clinical data on 488 LADC samples retrieved from TCGA were analyzed by chi-squared test and univariate logistic regression analysis. Results showed that age, It is significantly associated with down-regulation or up-regulation of the G64 gene with several clinical factors including age of initial diagnosis, stage of cancer, smoking or vital. . Four variables related to smoking habits such as smoking status, tabacco year, tabacco reformed year, and tabacco pack per year were all associated with the expression pattern of G64. For example, smokers, smokers, organs who smoked a lot of tobacco tend to have G64 upwards. Multinomial logistic regression analysis was performed to analyze the independent effects of G64 expression changes. The results of the analysis showed that the upregulation of G64 was closely related to the poor prognosis of the patient compared to the downregulation of G64. The odds of vital of the vital in the single cell group with G64 downregulated was 2.286. For example, in the smoking cessation variable, the odds ratio was 3.508, which showed the highest tendency for the upward adjustment of G64 (Table 3). These results suggest that G64, identified as a classifier in 34 LADC single cells, may be an excellent tumor-classifying factor for lung adenocarcinoma in particular.
5.2. Association of G10 gene set with lung cancer prognosis
We also analyzed the level of expression of G10, a gene set that can be classified as a subset of lung cancer, according to gene expression level, in relation to the treatment outcome, survival rate, or clinical characteristics of patients. Kaplan-Meier survival analysis showed that the survival rate of the down-regulated group (C1, n = 105) was lower than that of the up-regulated group (C3, n = 191) (P = 0.002) (Fig. 9). The above results indicate that the survival rate of the G10-down-regulated C1 population is better than that of the G10 up-regulated C3 population. Univariate logistic regression and multivariate logistic regression analysis were performed to determine the relationship between the expression pattern of G10 and the clinical characteristics of LADC patients. Univariate logistic regression analysis showed that all clinical features except for sex (p = 0.054) of LADC patients were significantly different from those of G10 expression (Table 4). In addition, the polynomial logistic regression analysis showed that the smoking cessation rate was the most significant difference (p = 0) in the expression of G10 despite the intrinsic correlation between clinical characteristics of LADC patients (Table 5) .
6. Clinical relevance of correlated gene sets and cancer
Co-expression of the genes that make up G64 can provide an interesting viewpoint on cancer that has not yet been studied, such as lung cancer. Therefore, we examined how G64 was differentially expressed in 501 lung squamous cell carcinoma (LUSC) samples retrieved from TCGA. Interestingly, 501 LUSC samples were also classified by G64, although not significant compared to the LADC results. Of the total 501 LUSC samples, only 21% (n = 107) were classified in the G64 downregulation group, while in the LADC sample, approximately 50% were included in the G64 downregulation group. These results indicate that LADC patients (P <0.0001, chi-sqaure test). The difference between LADC and LUSC was confirmed in 88 Korean lung cancer samples. Interestingly, the co-regulated patterns of these G64 genes, as described above, may change the ratio of G64 up or down regulated patterns depending on the carcinoma, but other cancers found in TCGA, such as breast, kidney, and hepatocellular carcinoma It is confirmed that they appear in common. We analyzed other cancers that were searched in the TCGA to classify other carcinomas using the "G10" gene set or to be able to diagnose the prognosis. As a result, it was confirmed that kidney renal papillary cell carcinoma (KIRP) and liver hepatocellular carcinoma (LIHC) were classified according to the expression pattern of G10. In the case of renal cancer, the expression of G10 was classified as a down-regulated C1 group and an up-regulated C3 group, which was also confirmed in PCA analysis (panels A and B in FIG. 10). The survival rate of the C1 group was significantly higher than that of the C3 group (p = 0.001) in the prognosis of renal cancer patients (panel C in FIG. 10). In the case of liver cancer, the expression of G10 was down-regulated in the C1 group; C2 group with weakly regulated downward; And up-regulated C3 groups, which were also confirmed in the PCA analysis (panels A and B in Figure 11). The survival rate of the C1 group was significantly higher than that of the C3 group (p = 0.000) in the prognosis of patients with liver cancer (panel C in FIG. 11).
The specific embodiments described herein are representative of preferred embodiments or examples of the present invention, and thus the scope of the present invention is not limited thereto. It will be apparent to those skilled in the art that modifications and other uses of the invention do not depart from the scope of the invention described in the claims.
Claims (12)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020150084070A KR101825369B1 (en) | 2015-06-15 | 2015-06-15 | Genetic Bio-marker for Predicting Prognosis in Cancer and Use thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020150084070A KR101825369B1 (en) | 2015-06-15 | 2015-06-15 | Genetic Bio-marker for Predicting Prognosis in Cancer and Use thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20160147392A KR20160147392A (en) | 2016-12-23 |
KR101825369B1 true KR101825369B1 (en) | 2018-02-06 |
Family
ID=57736271
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020150084070A KR101825369B1 (en) | 2015-06-15 | 2015-06-15 | Genetic Bio-marker for Predicting Prognosis in Cancer and Use thereof |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR101825369B1 (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140315935A1 (en) * | 2013-02-21 | 2014-10-23 | Myriad Genetics, Inc. | Gene signatures for lung cancer prognosis and therapy selection |
-
2015
- 2015-06-15 KR KR1020150084070A patent/KR101825369B1/en active IP Right Grant
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140315935A1 (en) * | 2013-02-21 | 2014-10-23 | Myriad Genetics, Inc. | Gene signatures for lung cancer prognosis and therapy selection |
Non-Patent Citations (1)
Title |
---|
NCBI GenBank Accession No. BC008079.2 (2006.07.15.)* |
Also Published As
Publication number | Publication date |
---|---|
KR20160147392A (en) | 2016-12-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7506380B2 (en) | Residual disease detection system and method | |
US20210381062A1 (en) | Nasal epithelium gene expression signature and classifier for the prediction of lung cancer | |
JP6140202B2 (en) | Gene expression profiles to predict breast cancer prognosis | |
CN113785076B (en) | Methods of predicting prognosis of cancer and compositions thereof | |
Lee et al. | Prediction of recurrence-free survival in postoperative non–small cell lung cancer patients by using an integrated model of clinical information and gene expression | |
JP6144695B2 (en) | How to treat breast cancer with taxane therapy | |
US20170283885A1 (en) | Algorithms for gene signature-based predictor of sensitivity to mdm2 inhibitors | |
US20140357660A1 (en) | Gene expression signatures of neoplasm responsiveness to therapy | |
US20160333419A1 (en) | Gene signatures associated with sensitivity to mdm2 inhibitors | |
EP3149209B1 (en) | Methods for typing of lung cancer | |
CN104975063B (en) | Screening method and application of antitumor drug biomarker | |
US20220093251A1 (en) | Novel biomarkers and diagnostic profiles for prostate cancer | |
CN115410713A (en) | Hepatocellular carcinoma prognosis risk prediction model construction based on immune-related gene | |
WO2017136508A1 (en) | Dissociation of human tumor to single cell suspension followed by biological analysis | |
CN109715830A (en) | For using the composition and method of gene expression profile diagnosing | |
CN109072481A (en) | The gene expression characteristics of remaining risk after early-stage breast cancer endocrine therapy | |
Lin et al. | Evolutionary route of nasopharyngeal carcinoma metastasis and its clinical significance | |
CN110004229A (en) | Application of the polygenes as EGFR monoclonal antibody class Drug-resistant marker | |
US20240287612A1 (en) | Novel biomarkers and diagnostic profiles for prostate cancer integrating clinical variables and gene expression data | |
KR20160086145A (en) | Selection method of predicting genes for breast cancer prognosis | |
US20210079479A1 (en) | Compostions and methods for diagnosing lung cancers using gene expression profiles | |
KR101825369B1 (en) | Genetic Bio-marker for Predicting Prognosis in Cancer and Use thereof | |
Lin et al. | LncRNA DIRC1 is a novel prognostic biomarker and correlated with immune infiltrates in stomach adenocarcinoma | |
KR20180052985A (en) | Breast cancer prognosis prediction tool using next-generation sequencing of breast cancer prognostic genes | |
Zhang et al. | Identification of a novel RNA modifications-related model to improve bladder cancer outcomes in the framework of predictive, preventive, and personalized medicine |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right |