WO2007073220A1 - Prognosis prediction for colorectal cancer - Google Patents
Prognosis prediction for colorectal cancer Download PDFInfo
- Publication number
- WO2007073220A1 WO2007073220A1 PCT/NZ2006/000343 NZ2006000343W WO2007073220A1 WO 2007073220 A1 WO2007073220 A1 WO 2007073220A1 NZ 2006000343 W NZ2006000343 W NZ 2006000343W WO 2007073220 A1 WO2007073220 A1 WO 2007073220A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signature
- genes
- expression
- gene
- data
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/574—Immunoassay; Biospecific binding assay; Materials therefor for cancer
- G01N33/57407—Specifically defined cancers
- G01N33/57419—Specifically defined cancers of colon
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/20—Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
- G16B25/10—Gene or protein expression profiling; Expression-ratio estimation or normalisation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
- G16B25/20—Polymerase chain reaction [PCR]; Primer or probe design; Probe optimisation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
- G16B25/30—Microarray design
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/118—Prognosis of disease development
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/16—Primer sets for multiplex assays
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
Definitions
- This invention relates to methods and compositions for determining the prognosis of cancer, particularly colorectal cancer, in a patient. Specifically, this invention relates to the use of genetic markers for determining the prognosis of cancer, such as colorectal cancer, based on prognostic signatures.
- CRC Colorectal cancer
- This invention provides further methods, compositions, kits, and devices based on prognostic cancer markers, specifically colorectal cancer prognostic markers, to aid in the prognosis and treatment of cancer.
- a set of markers genes identified to be differentially expressed in recurrent and non-recurrent colorectal tumours This set of genes can be used to generate prognostics signatures, comprising two or more markers, capable of predicting the progression of colorectal tumour in a patient.
- the individual markers can differentially expressed depending on whether the tumour is recurrent or not.
- the accuracy of prediction can be enhanced by combining the markers together into a prognostic signature for, providing for much more effective individual tests than single-gene assays.
- techniques such as statistics, machine learning, artificial intelligence, and data mining to the prognostics signatures to generate prediction models.
- expression levels of the markers of a particular prognostic signature in the tumour of a patient can then be applied to the prediction model to determine the prognosis.
- the expression level of the markers can be established using microarray methods, quantitative polymerase chain reaction (qPCR), or immunoassays.
- Figure 1 depicts a flow chart showing the methodology for producing the prognostic signatures from 149 New Zealand (NZ) and 55 German (DE) colorectal cancer (CRC) samples.
- New Zealand RNA samples were hybridized to oligonucleotide spotted arrays, with a 22-gene signature produced via leave one out cross validation (LOOCV), and then independently validated by LOOCV using the 55 sample DE data set.
- German RNA samples were hybridized to Affymetrix arrays, with a 19-gene signature produced via LOOCV, and then independently validated by LOOCV using the NZ data set.
- Figure 2 depicts a Kaplan-Meier analysis of disease-free survival time with patients predicted as high versus low risk of tumour recurrence: a, using NZ 22-gene signature on 149 tumours from NZ patients; b, using DE 19-gene signature on 55 tumours from DE patients; c, NZ prognostic signature validated on 55 tumours from DE patients; d, DE prognostic signature validated on 149 tumours from NZ patients. P-values were calculated using the log-rank test.
- Figure 3 depicts a Kaplan-Meier analysis of disease free survival time with patients predicted as high versus low risk of tumour recurrence: a, using the 22-gene NZ signature on NZ patients with Stage II and Stage III disease; b, using the 19-gene DE signature on NZ patients with Stage II and Stage III disease.
- I refers to Stage I or Stage II colorectal cancer (with no progression)
- IV refers to eventual progression to Stage IV metastases.
- Figure 5 shows the decreased predictive value of signatures for the prognosis of colorectal cancer, in a repeat of the experiment of Figure 4, except with the two genes, FAS and ME2,removed from the data set.
- Figure 6 shows a pairs chart of "top counts” (number of times each gene appeared in the "top-n” gene lists, i.e., top 10, top 20, top 100, and top 325 as described in Example 17) using three different normalization methods produced using the R statistical computing package(10,39) , in accordance with Example 17, below.
- the "pairs" chart is described in by Becker et al, in their treatise on the S Language (upon which R is based; see reference 39)
- To compare methods use row and column as defined on the diagonal to obtain the scatter plot between those two methods, analogous to reading distances off a distance chart on a map
- Figure 7 shows the pairs chart (39) of top counts (number of times each gene appeared in the "top-n" gene lists, i.e., top 10, top 20, top 100, and top 325 as described in Example 17) using three different filtering statistics: (a) two-sample Wilcoxon test (41), (b) t-test (modified using an ad-hoc correction factor in the denominator to abrogate the effect of low-variance genes falsely appearing as significant) and (c) empirical Bayes as provided by the "limma"( 10,40,42) package of Bioconductor (12,40).
- markers refers to a molecule that is associated quantitatively or qualitatively with the presence of a biological phenomenon.
- markers include a polynucleotide, such as a gene or gene fragment, RNA or RNA fragment; or a gene product, including a polypeptide such as a peptide, oligopeptide, protein, or protein fragment; or any related metabolites, by products, or any other identifying molecules, such as antibodies or antibody fragments, whether related directly or indirectly to a mechanism underlying the phenomenon.
- the markers of the invention include the nucleotide sequences (e.g., GenBank sequences) as disclosed herein, in particular, the full-length sequences, any coding sequences, any fragments, or any complements thereof, and any measurable marker thereof as defined above.
- GenBank sequences e.g., GenBank sequences
- CCPM colonal cancer prognostic marker
- CCPM family member refers to a marker with altered expression that is associated with a particular prognosis, e.g., a higher or lower likelihood of recurrence of cancer, as described herein, but can exclude molecules that are known in the prior art to be associated with prognosis of colorectal cancer. It is to be understood that the term CCPM does not require that the marker be specific only for colorectal tumours. Rather, expression of CCPM can be altered in other types of tumours, including malignant tumours.
- prognostic signature refers to a set of two or more markers, for example CCPMs, that when analysed together as a set allow for the determination of or prediction of an event, for example the prognostic outcome of colorectal cancer.
- CCPMs are set forth in Tables 1, 2, 5, and 9, while non-limiting examples of prognostic signatures are set forth in Tables 3, 4, 8A, 8B, and 9, herein.
- reference to "at least one,” “at least two,” “at least five,” etc., of the markers listed in any particular set means any one or any and all combinations of the markers listed.
- prediction method is defined to cover the broader genus of methods from the fields of statistics, machine learning, artificial intelligence, and data mining, which can be used to specify a prediction model. These are discussed further in the Detailed Description section.
- prediction model refers to the specific mathematical model obtained by applying a prediction method to a collection of data. In the examples detailed herein, such data sets consist of measurements of gene activity in tissue samples taken from recurrent and non-recurrent colorectal cancer patients, for which the class (recurrent or non-recurrent) of each sample is known.
- Such models can be used to (1) classify a sample of unknown recurrence status as being one of recurrent or non-recurrent, or (2) make a probabilistic prediction (i.e., produce either a proportion or percentage to be interpreted as a probability) which represents the likelihood that the unknown sample is recurrent, based on the measurement of mRNA expression levels or expression products, of a specified collection of genes, in the unknown sample.
- a probabilistic prediction i.e., produce either a proportion or percentage to be interpreted as a probability
- “Sensitivity”, “specificity” (or “selectivity”), and “classification rate”, when applied to the describing the effectiveness of prediction models mean the following: “Sensitivity” means the proportion of truly positive samples that are also predicted (by the model) to be positive, hi a test for CRC recurrence, that would be the proportion of recurrent tumours predicted by the model to be recurrent. "Specificity” or “selectivity” means the proportion of truly negative samples that are also predicted (by the model) to be negative. In a test for CRC recurrence, this equates to the proportion of non-recurrent samples that are predicted to by non-recurrent by the model. “Classification Rate” is the proportion of all samples that are correctly classified by the prediction model (be that as positive or negative).
- antibodies and like terms refer to immunoglobulin molecules and immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that contain an antigen binding site that specifically binds (immunoreacts with) an antigen. These include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fc, Fab, Fab', and Fab 2 fragments, and a Fab expression library. Antibody molecules relate to any of the classes IgG, IgM, IgA, IgE, and IgD, which differ from one another by the nature of heavy chain present in the molecule. These include subclasses as well, such as IgGl, IgG2, and others.
- the light chain may be a kappa chain or a lambda chain.
- Reference herein to antibodies includes a reference to all classes, subclasses, and types. Also included are chimeric antibodies, for example, monoclonal antibodies or fragments thereof that are specific to more than one source, e.g., a mouse or human sequence. Further included are camerid antibodies, shark antibodies or nanobodies.
- cancer and “cancerous” refer to or describe the physiological condition in mammals that is typically characterized by abnormal or unregulated cell growth. Cancer and cancer pathology can be associated, for example, with metastasis, interference with the normal functioning of neighbouring cells, release of cytokines or other secretory products at abnormal levels, suppression or aggravation of inflammatory or immunological response, neoplasia, premalignancy, malignancy, invasion of surrounding or distant tissues or organs, such as lymph nodes, etc. Specifically included are colorectal cancers, such as, bowel (e.g., large bowel), anal, and rectal cancers.
- colonal cancer includes cancer of the colon, rectum, and/or anus, and especially, adenocarcinomas, and may also include carcinomas (e.g., squamous cloacogenic carcinomas), melanomas, lymphomas, and sarcomas. Epidermoid (nonkeratinizing squamous cell or basaloid) carcinomas are also included.
- the cancer may be associated with particular types of polyps or other lesions, for example, tubular adenomas, tubulovillous adenomas (e.g., villoglandular polyps), villous (e.g., papillary) adenomas (with or without adenocarcinoma), hyperplastic polyps, hamartomas, juvenile polyps, polypoid carcinomas, pseudopolyps, lipomas, or leiomyomas.
- the cancer may be associated with familial polyposis and related conditions such as Gardner's syndrome or Peutz-Jeghers syndrome.
- the cancer may be associated, for example, with chronic fistulas, irradiated anal skin, leukoplakia, lymphogranuloma venereum, Bowen's disease (intraepithelial carcinoma), condyloma acuminatum, or human papillomavirus.
- the cancer may be associated with basal cell carcinoma, extramammary Paget's disease, cloacogenic carcinoma, or malignant melanoma.
- differentially expressed refers to a gene marker whose expression is activated to a higher or lower level in a subject (e.g., test sample) having a condition, specifically cancer, such as colorectal cancer, relative to its expression in a control subject (e.g., reference sample).
- the terms also include markers whose expression is activated to a higher or lower level at different stages of the same condition; in recurrent or non-recurrent disease; or in cells with higher or lower levels of proliferation.
- a differentially expressed marker may be either activated or inhibited at the polynucleotide level or polypeptide level, or may be subject to alternative splicing to result in a different polypeptide product. Such differences may be evidenced by a change in mRNA levels, surface expression, secretion or other partitioning of a polypeptide, for example.
- Differential expression may include a comparison of expression between two or more markers (e.g., genes or their gene products); or a comparison of the ratios of the expression between two or more markers (e.g., genes or their gene products); or a comparison of two differently processed products (e.g., transcripts or polypeptides) of the same marker, which differ between normal subjects and diseased subjects; or between various stages of the same disease; or between recurring and non-recurring disease; or between cells with higher and lower levels of proliferation; or between normal tissue and diseased tissue, specifically cancer, or colorectal cancer.
- markers e.g., genes or their gene products
- ratios of the expression between two or more markers e.g., genes or their gene products
- two differently processed products e.g., transcripts or polypeptides
- Differential expression includes both quantitative, as well as qualitative, differences in the temporal or cellular expression pattern in a gene or its expression products among, for example, normal and diseased cells, or among cells which have undergone different disease events or disease stages, or cells with different levels of proliferation.
- expression includes production of polynucleotides and polypeptides, in particular, the production of RNA (e.g., mRNA) from a gene or portion of a gene, and includes the production of a polypeptide encoded by an RNA or gene or portion of a gene, and the appearance of a detectable material associated with expression.
- RNA e.g., mRNA
- the formation of a complex for example, from a polypeptide-polypeptide interaction, polypeptide-nucleotide interaction, or the like, is included within the scope of the term "expression”.
- binding of a binding ligand such as a hybridization probe or antibody
- a binding ligand such as a hybridization probe or antibody
- a gene or other polynucleotide or oligonucleotide, a polypeptide or a protein fragment and the visualization of the binding ligand.
- a binding ligand such as a hybridization probe or antibody
- the intensity of a spot on a microarray, on a hybridization blot such as a Northern blot, or on an immunoblot such as a Western blot, or on a bead array, or by PCR analysis is included within the term "expression" of the underlying biological molecule.
- expression threshold and “defined expression threshold” are used interchangeably and refer to the level of a marker in question outside which the polynucleotide or polypeptide serves as a predictive marker for patient survival without cancer recurrence.
- the threshold will be dependent on the predictive model established are derived experimentally from clinical studies such as those described in the Examples below.
- the expression threshold may be set to achieve maximum sensitivity, or for maximum specificity, or for minimum error (maximum classification rate). For example a higher threshold may be set to achieve minimum errors, but this may result in a lower sensitivity. Therefore, for any given predictive model, clinical studies will be used to set an expression threshold that generally achieves the highest sensitivity while having a minimal error rate. The determination of the expression threshold for any situation is well within the knowledge of those skilled in the art.
- long-term survival is used herein to refer to survival for at least 5 years, more preferably for at least 8 years, most preferably for at least 10 years following surgery or other treatment.
- microarray refers to an ordered or unordered arrangement of capture agents, preferably polynucleotides (e.g., probes) or polypeptides on a substrate. See, e.g., Microarray Analysis, M. Schena, John Wiley & Sons, 2002; Microarray Biochip Technology, M. Schena, ed., Eaton Publishing, 2000; Guide to Analysis of DNA Microarray Data, S. Knudsen, John Wiley & Sons, 2004; and Protein Microarray Technology, D. Kambhampati, ed., John Wiley & Sons, 2004.
- oligonucleotide refers to a polynucleotide, typically a probe or primer, including, without limitation, single-stranded deoxyribonucleotides, single- or double- stranded ribonucleotides, RNA: DNA hybrids, and double-stranded DNAs. Oligonucleotides, such as single-stranded DNA probe oligonucleotides, are often synthesized by chemical methods, for example using automated oligonucleotide synthesizers that are commercially available, or by a variety of other methods, including in vitro expression systems, recombinant techniques, and expression in cells and organisms.
- polynucleotide when used in the singular or plural, generally refers to any polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA.
- triple-stranded regions comprising RNA or DNA or both RNA and DNA.
- mRNAs RNAs, cDNAs, and genomic DNAs, and any fragments thereof.
- the term includes DNAs and RNAs that contain one or more modified bases, such as tritiated bases, or unusual bases, such as inosine.
- the polynucleotides of the invention can encompass coding or non-coding sequences, or sense or antisense sequences. It will be understood that each reference to a "polynucleotide" or like term, herein, will include the full-length sequences as well as any fragments, derivatives, or variants thereof.
- Polypeptide refers to an oligopeptide, peptide, or protein sequence, or fragment thereof, and to naturally occurring, recombinant, synthetic, or semisynthetic molecules. Where “polypeptide” is recited herein to refer to an amino acid sequence of a naturally occurring protein molecule, “polypeptide” and like terms, are not meant to limit the amino acid sequence to the complete, native amino acid sequence for the full-length molecule. It will be understood that each reference to a “polypeptide” or like term, herein, will include the full-length sequence, as well as any fragments, derivatives, or variants thereof.
- prognosis refers to a prediction of medical outcome, for example, a poor or good outcome (e.g., likelihood of long-term survival); a negative prognosis, or poor outcome, includes a prediction of relapse, disease progression (e.g., tumour growth or metastasis, or drug resistance), or mortality; a positive prognosis, or good outcome, includes a prediction of disease remission, (e.g., disease-free status), amelioration (e.g., tumour regression), or stabilization.
- proliferation refers to the processes leading to increased cell size or cell number, and can include one or more of: tumour or cell growth, angiogenesis, innervation, and metastasis.
- qPCR quantitative polymerase chain reaction
- J. W. Larrick ed., Eaton Publishing, 1997
- A-Z of Quantitative PCR S. Bustin, ed., IUL Press, 2004.
- tumor refers to all neoplastic cell growth and proliferation, whether malignant or benign, and all pre-cancerous and cancerous cells and tissues.
- “Stringency” of hybridization reactions is readily determinable by one of ordinary skill in the art, and generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration, hi general, longer probes require higher temperatures for proper annealing, while shorter probes need lower temperatures.
- Hybridization generally depends on the ability of denatured DNA to reanneal when complementary strands are present in an environment below their melting temperature. The higher the degree of desired homology between the probe and hybridisable sequence, the higher the relative temperature which can be used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions more stringent, while lower temperatures less so. Additional details and explanation of stringency of hybridization reactions, are found e.g., in Ausubel et al., Current Protocols in Molecular Biology, Wiley Interscience Publishers, (1995).
- “Stringent conditions” or “high stringency conditions”, as defined herein, typically: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ a denaturing agent during hybridization, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5X SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5X, Denhardt's solution, sonicated salmon sperm DNA (50 ⁇ g/ml), 0.1% SDS, and 10% dextran sulfate at 42°
- Modely stringent conditions may be identified as described by Sambrook et al., Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989, and include the use of washing solution and hybridization conditions (e. g., temperature, ionic strength, and % SDS) less stringent that those described above.
- washing solution and hybridization conditions e. g., temperature, ionic strength, and % SDS
- moderately stringent conditions is overnight incubation at 37°C in a solution comprising: 20% formamide, 5X SSC (150 mM NaCl 5 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5X Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA, followed by washing the filters in IX SSC at about 37-50 0 C.
- 5X SSC 150 mM NaCl 5 15 mM trisodium citrate
- 50 mM sodium phosphate pH 7.6
- 5X Denhardt's solution 10% dextran sulfate
- 20 mg/ml denatured sheared salmon sperm DNA followed by washing the filters in IX SSC at about 37-50 0 C.
- the skilled artisan will recognize how to adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like.
- Blackwell eds., Blackwell Science Inc., 1987; Gene Transfer Vectors for Mammalian Cells, J.M. Miller & M.P. Calos, eds., 1987; Current Protocols in Molecular Biology, F.M. Ausubel et al., eds., 1987; and PCR: The Polymerase Chain Reaction, Mullis et al., eds., 1994.
- Embodiments of the Invention In colorectal cancer, discordant results have been reported for prognostic markers.
- the present invention discloses the use of microarrays to reach a firmer conclusion, and to determine the prognostic role of specific prognostic signatures in colorectal cancer.
- the microarray-based studies shown herein indicate that particular prognostic signatures in colorectal cancer are associated with a prognosis.
- the invention can therefore be used to identify patients at high risk of recurrence of cancer, or patients with a high likelihood of recovery.
- the present invention provides for markers for the determination of disease prognosis, for example, the likelihood of recurrence of tumours, including colorectal tumours.
- the present invention therefore provides for a set of genes, outlined in Table 1 and Table 2, that are differentially expressed in recurrent and non-recurrent colorectal cancers.
- the genes outlined in Table 1 and Table 2 provide for a set of colorectal cancer prognostic makers (CCPMs).
- CCPMs colorectal cancer prognostic markers
- markers associated with immune responses are indicative of a particular prognosis.
- This can include increased likelihood of cancer recurrence after standard treatment, especially for colorectal cancer.
- an increase in other CCPMs is indicative of a particular prognosis.
- This can include disease progression or the increased likelihood of cancer recurrence, especially for colorectal cancer.
- a decrease or increase in expression can be determined, for example, by comparison of a test sample, e.g., patient's tumour sample, to a reference sample, e.g., a sample associated with a known prognosis, hi particular, one or more samples from patient(s) with non- recurrent cancer could be used as a reference sample.
- a test sample e.g., patient's tumour sample
- a reference sample e.g., a sample associated with a known prognosis, hi particular, one or more samples from patient(s) with non- recurrent cancer could be used as a reference sample.
- a patient's sample e.g., tumour sample
- expression levels in a patient's sample can be compared to samples from patients with a known outcome. If the patient's sample shows increased or decreased expression of one or more CCPMs that compares to samples with good outcome (no recurrence), then a positive prognosis, or recurrence is unlikely, is implicated. If the patient's sample shows expression of one or more CCPMs that is comparable to samples with poor outcome (recurrence), then a positive prognosis, or recurrence of the tumour is likely, is implicated.
- the expression levels of a prognostic signature comprising two or more CCPMS from a patient's sample can be compared to samples of recurrent/non-recurrent cancer. If the patient's sample shows increased or decreased expression of CCPMs by comparison to samples of non-recurrent cancer, and/or comparable expression to samples of recurrent cancer, then a negative prognosis is implicated. If the patient's sample shows expression of CCPMs that is comparable to samples of non-recurrent cancer, and/or lower or higher expression than samples of recurrent cancer, then a positive prognosis is implicated.
- a prediction method can be applied to a panel of markers, for example the panel of CCPMs outlined in Table 1 and Table 2, in order to generate a predictive model. This involves the generation of a prognostic signature, comprising two or more CCPMs.
- the disclosed CCPMs in Table 1 and Table 2 therefore provide a useful set of markers to generate prediction signatures for determining the prognosis of cancer, and establishing a treatment regime, or treatment modality, specific for that tumour.
- a positive prognosis can be used by a patient to decide to pursue standard or less invasive treatment options.
- a negative prognosis can be used by a patient to decide to terminate treatment or to pursue highly aggressive or experimental treatments.
- a patient can chose treatments based on their impact on the expression of prognostic markers (e.g., CCPMs).
- CCPMs can be detected in tumour tissue, tissue proximal to the tumour, lymph node samples, blood samples, serum samples, urine samples, or faecal samples, using any suitable technique, and can include, but is not limited to, oligonucleotide probes, quantitative PCR, or antibodies raised against the markers. It will be appreciated that by analyzing the presence and amounts of expression of a plurality of CCPMs in the form of prediction signatures, and constructing a prognostic signature (e.g., as set forth in Tables 3, 4, 8A, 8B, and 9), the sensitivity and accuracy of prognosis will be increased. Therefore, multiple markers according to the present invention can be used to determine the prognosis of a cancer.
- a prognostic signature e.g., as set forth in Tables 3, 4, 8A, 8B, and 9
- RNA is isolated from a fixed, wax-embedded cancer tissue specimen of the patient. Isolation may be performed by any technique known in the art, for example from core biopsy tissue or fine needle aspirate cells.
- the invention relates to a method of predicting a prognosis, e.g., the likelihood of long-term survival of a cancer patient without the recurrence of cancer, comprising determining the expression level of one or more prognostic markers or their expression products in a sample obtained from the patient, normalized against the expression level of other RNA transcripts or their products in the sample, or of a reference set of RNA transcripts or their expression products.
- the prognostic marker is one or more markers listed in Tables 1, 2, or 5, , or is included as one or more of the prognostic signatures derived from the markers listed in Tables 1,
- the expression levels of the prognostic markers or their expression products are determined, e.g., for the markers listed in Tables 1, 2, or 5, a prognostic signature derived from the markers listed in Tables 1, 2, and 5, e.g., for the prognostic signatures listed in Tables 3, 4, 8 A, 8B, or 9.
- the method comprises the determination of the expression levels of a full set of prognosis markers or their expression products, e.g., for the markers listed in Tables 1, 2, or 5, or, a prognostic signature derived from the markers listed in Tables 1, 2, and 5, e.g., for the prognostic signatures listed in Tables 3, 4, 8A, 8B, or 9.
- the invention relates to an array (e.g., microarray) comprising polynucleotides hybridizing to two or more markers, e.g., for the markers listed in Tables 1, 2, and 5, , or a prognostic signature derived from the markers listed in Tables 1, 2, and 5, e.g., the prognostic signatures listed in Tables 3, 4, 8A, 8B, and 9.
- the array comprises polynucleotides hybridizing to prognostic signature derived from the markers listed in Tables 1, 2, and 5, or e.g., for the prognostic signatures listed in Tables 3, 4, 8A, 8B, or 9.
- the array comprises polynucleotides hybridizing to the full set of markers, e.g., for the markers listed in Tables 1, 2, or 5, or, e.g., for the prognostic signatures listed hi Tables 3, 4, 8A, 8B, or 9.
- the polynucleotides can be cDNAs, or oligonucleotides, and the solid surface on which they are displayed can be glass, for example.
- the polynucleotides can hybridize to one or more of the markers as disclosed herein, for example, to the full-length sequences, any coding sequences, any fragments, or any complements thereof.
- CCPM indicates a decreased likelihood of long-term survival, e.g., due to cancer recurrence, while a lack of an increase or decrease in expression levels of one or more CCPM indicates an increased likelihood of long-term survival without cancer recurrence.
- Table 1 Colorectal Cancer Predictive Markers (corresponding to Affymetrix GeneChip probes that show statistically significant differential expression, P ⁇ 0.05, as ascertained by BRB Array Tools)
- Table 2 Markers with expression correlating to that of the 22 genes from NZ signature.
- CCPM family members microarray approaches using oligonucleotide probes selective for a CCPM; real-time qPCR on tumour samples using CCPM specific primers and probes; real-time qPCR on lymph node, blood, serum, faecal, or urine samples using CCPM specific primers and probes; enzyme-linked immunological assays (ELISA); immunohistochemistry using anti- marker antibodies; and analysis of array or qPCR data using computers.
- Primary data can be collected and fold change analysis can be performed, for example, by comparison of marker expression levels in tumour tissue and non-tumour
- tumour marker expression includes matching those markers exhibiting increased or decreased expression with expression profiles of known colorectal tumours to provide a prognosis.
- a threshold for concluding that expression is increased will be dependent on the particular marker and also the particular predictive model that is to be applied.
- the threshold is generally set to achieve the highest sensitivity and selectivity with the lowest error rate, although variations may be desirable for a particular clinical situation.
- the desired threshold is determined by analysing a population of sufficient size taking into account the statistical variability of any predictive model and is calculated from the size of the sample used to produce the predictive model. The same applies for the determination of a threshold for concluding that expression is decreased. It can be appreciated that other thresholds, or methods for establishing a threshold,for concluding that increased or decreased expression has occurred can be selected without departing from the scope of this invention.
- a prediction model may produce as it's output a numerical value, for example a score, likelihood value or probability.
- a numerical value for example a score, likelihood value or probability.
- the likelihood of the cancer recurring can then be determined.
- prognostic signatures comprising one or more CCPMs can be used to determine the prognosis of a cancer, by comparing the expression level of the one or more markers to the disclosed prognostic signature. By comparing the expression of one or more of the CCPMs in a tumour sample with the disclosed prognostic signature, the likelihood of the cancer recurring can be determined. The comparison of expression levels of the prognostic signature to establish a prognosis can be done by applying a predictive model as described previously.
- Determining the likelihood of the cancer recurring is of great value to the medical practitioner.
- a high likelihood of re-occurrence means that a longer or higher dose treatment should be given, and the patient should be more closely monitored for signs of recurrence of the cancer.
- An accurate prognosis is also of benefit to the patient. It allows the patient, along with their partners, family, and friends to also make decisions about treatment, as well as decisions about their future and lifestyle changes. Therefore, the invention also provides for a method establishing a treatment regime for a particular cancer based on the prognosis established by matching the expression of the markers in a tumour sample with the differential expression signature.
- the marker selection, or construction of a prognostic signature does not have to be restricted to the CCPMs disclosed in Tables 1, 2, or 5, herein, or the prognostic signatures disclosed in Tables 3, 4, 8A, 8B, and 9, but could involve the use of one or more CCPMs from the disclosed signatures, or a new signature may be established using CCPMs selected from the disclosed marker lists.
- the requirement of any signature is that it predicts the likelihood of recurrence with enough accuracy to assist a medical practitioner to establish a treatment regime.
- RT-PCR Reverse Transcription PCR
- RT-PCR which can be used to compare RNA levels in different sample populations, in normal and tumour tissues, with or without drug treatment, to characterize patterns of expression, to discriminate between closely related RNAs, and to analyze RNA structure.
- RNA is typically total RNA isolated from human tumours or tumour cell lines, and corresponding normal tissues or cell lines, respectively.
- the starting material is typically total RNA isolated from human tumours or tumour cell lines, and corresponding normal tissues or cell lines, respectively.
- RNA can be isolated from a variety of samples, such as tumour samples from breast, lung, colon (e.g., large bowel or small bowel), colorectal, gastric, esophageal, anal, rectal, prostate, brain, liver, kidney, pancreas, spleen, thymus, testis, ovary, uterus, etc., tissues, from primary tumours, or tumour cell lines, and from pooled samples from healthy donors.
- RNA can be extracted, for example, from frozen or archived paraffin-embedded and fixed (e.g., formalin-fixed) tissue samples.
- the first step in gene expression profiling by RT-PCR is the reverse transcription of the RNA template into cDNA, followed by its exponential amplification in a PCR reaction.
- the two most commonly used reverse transcriptases are avian myeloblastosis virus reverse transcriptase (AMV-RT) and Moloney murine leukaemia virus reverse transcriptase (MMLV-RT).
- AMV-RT avian myeloblastosis virus reverse transcriptase
- MMLV-RT Moloney murine leukaemia virus reverse transcriptase
- the reverse transcription step is typically primed using specific primers, random hexamers, or oligo-dT primers, depending on the circumstances and the goal of expression profiling.
- extracted RNA can be reverse-transcribed using a GeneAmp RNA PCR kit (Perkin Elmer, CA, USA), following the manufacturer's instructions.
- the derived cDNA can then be used as a template in the subsequent PCR reaction.
- the PCR step can use a variety of thermostable DNA-dependent DNA polymerases, it typically employs the Taq DNA polymerase, which has a 5'-3' nuclease activity but lacks a 3 '-5' proofreading endonuclease activity.
- TaqMan (q) PCR typically utilizes the 5' nuclease activity of Taq or Tth polymerase to hydrolyze a hybridization probe bound to its target amplicon, but any enzyme with equivalent 5' nuclease activity can be used.
- Two oligonucleotide primers are used to generate an amplicon typical of a
- a third oligonucleotide, or probe is designed to detect nucleotide sequence located between the two PCR primers.
- the probe is non-extendible by Taq DNA polymerase enzyme, and is labeled with a reporter fluorescent dye and a quencher fluorescent dye. Any laser-induced emission from the reporter dye is quenched by the quenching dye when the two dyes are located close together as they are on the probe.
- the Taq DNA polymerase enzyme cleaves the probe in a template-dependent manner.
- the resultant probe fragments disassociate in solution, and signal from the released reporter dye is free from the quenching effect of the second fluorophore.
- One molecule of reporter dye is liberated for each new molecule synthesized, and detection of the unquenched reporter dye provides the basis for quantitative interpretation of the data.
- TaqMan RT-PCR can be performed using commercially available equipment, such as, for example, ABI PRISM 7700 Sequence Detection System (Perkin-Elmer- Applied Biosystems, Foster City, CA 5 USA), or Lightcycler (Roche Molecular Biochemicals, Mannheim, Germany).
- the 5' nuclease procedure is run on a real-time quantitative PCR device such as the ABI PRISM 7700tam Sequence Detection System.
- the system consists of a the ⁇ nocycler, laser, charge-coupled device (CCD), camera, and computer.
- the system amplifies samples in a 96-well format on a thermocycler.
- laser-induced fluorescent signal is collected in real-time through fibre optics cables for all 96 wells, and detected at the CCD.
- the system includes software for running the instrument and for analyzing the data.
- 5' nuclease assay data are initially expressed as Ct, or the threshold cycle.
- fluorescence values are recorded during every cycle and represent the amount of product amplified to that point in the amplification reaction. The point when the fluorescent signal is first recorded as statistically significant is the threshold cycle.
- RT-PCR is usually performed using an internal standard.
- the ideal internal standard is expressed at a constant level among different tissues, and is unaffected by the experimental treatment.
- RNAs most frequently used to normalize patterns of gene expression are mRNAs for the housekeeping genes glyceraldehyde-3-phosphate-dehydrogenase (GAPDH) and-actin.
- PCR which measures PCR product accumulation through a dual-labeled fluorigenic probe (i.e., TaqMan probe).
- Real time PCR is compatible both with quantitative competitive PCR and with quantitative comparative PCR.
- the former uses an internal competitor for each target sequence for normalization, while the latter uses a normalization gene contained within the sample, or a housekeeping gene for RT-PCR. Further details are provided, e.g., by Held et al., Genome Research 6: 986-994 (1996).
- Expression levels can be determined using fixed, paraffin-embedded tissues as the RNA source.
- PCR primers and probes are designed based upon intron sequences present in the gene to be amplified.
- the first step in the primer/probe design is the delineation of intron sequences within the genes. This can be done by publicly available software, such as the DNA BLAT software developed by Kent, W. J., Genome Res. 12 (4): 656-64 (2002), or by the BLAST software including its variations. Subsequent steps follow well established methods of PCR primer and probe design.
- PCR primer design The most important factors considered in PCR primer design include primer length, melting temperature (T m ), and G/C content, specificity, complementary primer sequences, and 3' end sequence.
- optimal PCR primers are generally 17-30 bases in length, and contain about 20-80%, such as, for example, about 50-60% G+C bases. Melting temperatures between 50 and 80 0 C, e.g., about 50 to 70 0 C, are typically preferred.
- CCPMs Differential expression can also be identified, or confirmed using the microarray technique.
- the expression profile of CCPMs can be measured in either fresh or paraffin-embedded tumour tissue, using microarray technology.
- polynucleotide sequences of interest including cDNAs and oligonucleotides
- the arrayed sequences i.e., capture probes
- specific polynucleotides from cells or tissues of interest i.e., targets.
- the source of RNA typically is total RNA isolated from human tumours or tumour cell lines, and corresponding normal tissues or cell lines.
- RNA can be isolated from a variety of primary tumours or tumour cell lines. If the source of RNA is a primary tumour, RNA can be extracted, for example, from frozen or archived formalin fixed paraffin-embedded (FFPE) tissue samples and fixed (e.g., formalin-fixed) tissue samples, which are routinely prepared and preserved in everyday clinical practice.
- FFPE formalin fixed paraffin-embedded
- PCR amplified inserts of cDNA clones are applied to a substrate.
- the substrate can include up to 1, 2, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, or 75 nucleotide sequences. In other aspects, the substrate can include at least 10,000 nucleotide sequences.
- the microarrayed sequences, immobilized on the microchip, are suitable for hybridization under stringent conditions.
- the targets for the microarrays can be at least 50, 100, 200, 400, 500, 1000, or 2000 bases in length; or 50-100, 100-200, 100- 500, 100-1000, 100-2000, or 500-5000 bases in length.
- the capture probes for the microarrays can be at least 10, 15, 20, 25, 50, 75, 80, or 100 bases in length; or 10-15, 10-20, 10-25, 10-50, 10-75, 10-80, or 20-80 bases in length.
- Fluorescently labeled cDNA probes may be generated through incorporation of fluorescent nucleotides by reverse transcription of RNA extracted from tissues of interest. Labeled cDNA probes applied to the chip hybridize with specificity to each spot of DNA on the array. After stringent washing to remove non-specifically bound probes, the chip is scanned by confocal laser microscopy or by another detection method, such as a CCD camera. Quantitation of hybridization of each arrayed element allows for assessment of corresponding mRNA abundance. With dual colour fluorescence, separately labeled cDNA probes generated from two sources of RNA are hybridized pairwise to the array. The relative abundance of the transcripts from the two sources corresponding to each specified gene is thus determined simultaneously. An exemplary protocol for this is described in detail in Example 4.
- the miniaturized scale of the hybridization affords a convenient and rapid evaluation of the expression pattern for large numbers of genes.
- Such methods have been shown to have the sensitivity required to detect rare transcripts, which are expressed at a few copies per cell, and to reproducibly detect at least approximately two-fold differences in the expression levels (Schena et al., Proc. Natl. Acad. Sci. USA 93 (2): 106-149 (1996)).
- Microarray analysis can be performed by commercially available equipment, following manufacturer's protocols, such as by using the Affymetrix GenChip technology, IUumina microarray technology or Incyte's microarray technology.
- the development of microarray methods for large-scale analysis of gene expression makes it possible to search systematically for molecular markers of cancer classification and outcome prediction in a variety of tumour types.
- RNA isolation can be performed using purification kit, buffer set, and protease from commercial manufacturers, such as Qiagen, according to the manufacturer's instructions. For example, total RNA from cells in culture can be isolated using Qiagen RNeasy mini-columns.
- RNA isolation kits include MasterPure Complete DNA and RNA Purification Kit (EPICENTRE (D, Madison, WI), and Paraffin Block RNA Isolation Kit (Ambion, Inc.). Total RNA from tissue samples can be isolated using RNA Stat-60 (Tel-Test). RNA prepared from tumour can be isolated, for example, by cesium chloride density gradient centrifugation.
- RNA isolation, purification, primer extension and amplification are given in various published journal articles (for example: T. E. Godfrey et al. J. Molec. Diagnostics 2: 84-91 (2000); K. Specht et al., Am. J. Pathol. 158: 419-29 (2001)).
- a representative process starts with cutting about 10 ⁇ m thick sections of paraffin-embedded tumour tissue samples. The RNA is then extracted, and protein and DNA are removed.
- RNA repair and/or amplification steps may be included, if necessary, and RNA is reverse transcribed using gene specific promoters followed by RT-PCR. Finally, the data are analyzed to identify the best treatment option(s) available to the patient on the basis of the characteristic gene expression pattern identified in the tumour sample examined.
- Immunohistochemistry methods are also suitable for detecting the expression levels of the proliferation markers of the present invention.
- antibodies or antisera preferably polyclonal antisera, and most preferably monoclonal antibodies specific for each marker, are used to detect expression.
- the antibodies can be detected by direct labeling of the antibodies themselves, for example, with radioactive labels, fluorescent labels, hapten labels such as, biotin, or an enzyme such as horse radish peroxidase or alkaline phosphatase.
- unlabeled primary antibody is used in conjunction with a labeled secondary antibody, comprising antisera, polyclonal antisera or a monoclonal antibody specific for the primary antibody.
- Immunohistochemistry protocols and kits are well known in the art and are commercially available.
- Proteomics can be used to analyze the polypeptides present in a sample (e.g., tissue, organism, or cell culture) at a certain point of time, hi particular, proteomic techniques can be used to assess the global changes of polypeptide expression in a sample (also referred to as expression proteomics).
- Proteomic analysis typically includes: (1) separation of individual polypeptides in a sample by 2-D gel electrophoresis (2-D PAGE); (2) identification of the individual polypeptides recovered from the gel, e.g., by mass spectrometry or N-terminal sequencing, and (3) analysis of the data using bioinformatics.
- Proteomics methods are valuable supplements to other methods of gene expression profiling, and can be used, alone or in combination with other methods, to detect the products of the proliferation markers of the present invention.
- the likelihood of the cancer recurring can then be determined.
- the inventors have identified a number of markers that are differentially expressed in nonrecurring colorectal cancers compared to recurring colorectal cancers in patient data sets.
- the markers are set out in Tables 1, 2, and 9, in the examples below.
- Microarray experiments typically involve the simultaneous measurement of thousands of genes. If one is comparing the expression levels for a particular gene between two groups (for example recurrent and non- recurrent tumours), the typical tests for significance (such as the t-test) are not adequate. This is because, in an ensemble of thousands of experiments (in this context each gene constitutes an "experiment"), the probability of at least one experiment passing the usual criteria for significance by chance alone is essentially unity, hi a test for significance, one typically calculates the probability that the "null hypothesis" is correct. In the case of comparing two groups, the null hypothesis is that there is no difference between the two groups.
- Data Mining is the term used to describe the extraction of "knowledge”, in other words the “know-how”, or predictive ability from (usually) large volumes of data (the dataset). This is the approach used in this study to generate prognostic signatures.
- the "know-how” is the ability to accurately predict prognosis from a given set of gene expression measurements, or "signature” (as described generally in this section and in more detail in the examples section).
- Data mining (49), and the related topic machine learning (40) is a complex, repetitive mathematical task that involves the use of one or more appropriate computer software packages (see below).
- the use of software is advantageous on the one hand, in that one does not need to be completely familiar with the intricacies of the theory behind each technique in order to successfully use data mining techniques, provided that one adheres to the correct methodology.
- the disadvantage is that the application of data mining can often be viewed as a "black box": one inserts the data and receives the answer. How this is achieved is often masked from the end-user (this is the case for many of the techniques described, and can often influence the statistical method chosen for data mining.
- neural networks and support vector machines have a particularly complex implementation that makes it very difficult for the end user to extract out the "rules" used to produce the decision.
- k- nearest neighbours and linear discriminant analysis have a very transparent process for decision making that is not hidden from the user.
- supervised and unsupervised approaches There are two types of approach used in data mining: supervised and unsupervised approaches.
- the information that is being linked to the data is known, such as categorical data (e.g. recurrent vs. non recurrent tumours). What is required is the ability to link the observed response (e.g. recurrence vs. non- recurrence) to the input variables.
- the classes within the dataset are not known in advance, and data mining methodology is employed to attempt to find the classes or structure within the dataset.
- the overall protocol involves the following steps:
- Feature Selection typically involves finding those genes which are most "differentially expressed”. In a more general sense, it involves those groups which pass some statistical test for significance, i.e. is the level of a particular variable consistently higher or lower in one or other of the groups being investigated.
- the features are those variables (or dimensions) which exhibit the greatest variance.
- the application of feature selection is completely independent of the method used to create a prediction model, and involves a great deal of experimentation to achieve the desired results.
- the selection of significant genes, and those which correlated with the earlier successful model are those variables (or dimensions) which exhibit the greatest variance.
- the NZ classifier (the NZ classifier), entailed feature selection.
- methods of data reduction (such as principal component analysis) can be applied to the dataset.
- the reduced dataset (as described by the features) is applied to the prediction model of choice.
- the input for this model is usually in the form a multi-dimensional numerical input,(known as a vector), with associated output information (a class label or a response).
- selected data is input into the prediction model, either sequentially (in techniques such as neural networks) or as a whole (in techniques that apply some form of regression, such as linear models, linear discriminant analysis, support vector machines). In some instances (e.g.
- the dataset (or subset of the dataset obtained after feature selection) is itself the model.
- effective models can be established with minimal understanding of the detailed mathematics, through the use of various software packages where the parameters of the model have been pre-determined by expert analysts as most likely to lead to successful results.
- Linear models (49, 50): The data is treated as the input of a linear regression model, of which the class labels or responses variables are the output. Class labels, or other categorical data, must be transformed into numerical values (usually integer). In generalised linear models, the class labels or response variables are not themselves linearly related to the input data, but are transformed through the use of a "link function". Logistic regression is the most common form of generalized linear model.
- Linear Discriminant analysis (49, 51, 52). Provided the data is linearly separable (i.e. the groups or classes of data can be separated by a hyperplane, which is an n-dimensional extension of a threshold), this technique can be applied. A combination of variables is used to separate the classes, such that the between group variance is maximised, and the within-group variance is minimised. The byproduct of this is the formation of a classification rule.
- Support vector machines A collection of variables is used in conjunction with a collection of weights to determine a model that maximizes the separation between classes in terms of those weighted variables. Application of this model to a sample then produces a classification or prediction of class membership for that sample.
- Neural networks The data is treated as input into a network of nodes, which superficially resemble biological neurons, which apply the input from all the nodes to which they are connected, and transform the input into an output.
- nodes which superficially resemble biological neurons, which apply the input from all the nodes to which they are connected, and transform the input into an output.
- neural networks use the "multiply and sum" algorithm, to transform the inputs from multiple connected input nodes into a single output.
- a node may not necessarily produce an output unless the inputs to that node exceed a certain threshold.
- Each node has as its input the output from several other nodes, with the final output node usually being linked to a categorical variable.
- the number of nodes, and the topology of the nodes can be varied in almost infinite ways, providing for the ability to classify extremely noisy data that may not be possible to categorize in other ways.
- the most common implementation of neural networks is the multi-layer perceptron.
- Classification and regression trees In these, variables are used to define a hierarchy of rules that can be followed in a stepwise manner to determine the class of a sample. The typical process creates a set of rules which lead to a specific class output, or a specific statement of the inability to discriminate.
- Euclidean distance an extension of the Pythagorean distance, as in triangulation, to n-dimensions
- various forms of correlation including Pearson Correlation co-efficient
- transformation functions that convert data points that would not normally be interconnected by a meaningful distance metric into euclidean space, so that Euclidean distance can then be applied (e.g. Mahalanobis distance).
- Mahalanobis distance the distance metric can be quite complex, the basic premise of k-nearest neighbours is quite simple, essentially being a restatement of "find the k-data vectors that are most similar to the unknown input, find out which class they correspond to, and vote as to which class the unknown input is”.
- a directed acyclic graph is used to represent a collection of variables in conjunction with their joint probability distribution, which is then used to determine the probability of class membership for a sample.
- Independent components analysis in which independent signals (e.g., class membership) re isolated (into components) from a collection of variables. These components can then be used to produce a classification or prediction of class membership for a sample.
- Training involves taking a subset of the dataset of interest (in this case gene expression measurements from colorectal tumours), such that it is stratified across the classes that are being tested for (in this case recurrent and nonrecurrent tumours). This training set is used to generate a prediction model (defined above), which is tested on the remainder of the data (the testing set).
- dataset of interest in this case gene expression measurements from colorectal tumours
- This training set is used to generate a prediction model (defined above), which is tested on the remainder of the data (the testing set).
- K-fold cross-validation The dataset is divided into K subsamples, each subsample containing approximately the same proportions of the class groups as the original. In each round of validation, one of the K subsamples is set aside, and training is accomplished using the remainder of the dataset. The effectiveness of the training for that round is guaged by how correctly the classification of the left-out group is. This procedure is repeated K- times, and the overall effectiveness ascertained by comparison of the predicted class with the known class.
- Leave-one-out cross-validation A commonly used variation of K-fold cross validation, in which where n is the number of samples.
- Combinations of CCPMS such as those described above in Tables 1 and 2, can be used to construct predictive models for prognosis.
- Prognostic signatures comprising one or more of these markers, can be used to determine the outcome of a patient, through application of one or more predictive models derived from the signature.
- a clinician or researcher can determine the differential expression (e.g., increased or decreased expression) of the one or more markers in the signature, apply a predictive model, and thereby predict the negative prognosis, e.g., likelihood of disease relapse, of a patient, or alternatively the likelihood of a positive prognosis (continued remission).
- a set of prognostic signatures have been developed.
- the first instance there are two signatures developed by cross-comparison of predictive ability between two datasets: the set of microarray experiments encompassing the German colorectal cancer samples, and the set of microarray experiments encompassing the New Zealand samples (discussed in example 6).
- the second instance there has been an exhaustive statistical search for effective signatures based solely on the German dataset (discussed in example 17).
- a prognostic signature comprising 19 genes has been established from a set of colorectal samples from Germany (Table 4).
- Another prognostic signature, of 22 genes has also been established from samples of colorectal tumours from patients in New Zealand (Table 3).
- this invention provides methods for determining the prognosis of a cancer, comprising: (a) providing a sample of the cancer; (b) detecting the expression level of a CCPM family member in said sample; and (c) determining the prognosis of the cancer.
- the cancer is colorectal cancer.
- the invention includes a step of detecting the expression level of a CCPM mRNA. In other aspects, the invention includes a step of detecting the expression level of a CCPM polypeptide. In yet a further aspect, the invention includes a step of detecting the level of a CCPM peptide. In yet another aspect, the invention includes detecting the expression level of more than one CCPM family member in said sample. In a further aspect the CCPM is a gene associated with an immune response. In a further aspect the CCPM is selected from the markers set forth in Tables 3, 4, 8A, 8B, or 9. In a still further aspect, the CCPM is included in a signature selected from the signatures set forth in Tables 3, 4, 8 A, 8B, or 9.
- the invention comprises detecting the expression level of; WDR44, RBMSl, SACMlL 3 SOATl, PBK 5 G3BP2, ZBTB20, ZNF410, COMMD2, PSMCl,
- the invention comprises detecting the expression level of; CXCLlO 5 FAS 5 CXCLO, TLKl, CXCLIl, PBK 5 PSATl,
- the invention includes a method of determining a treatment regime for a cancer comprising: (a) providing a sample of the cancer; (b) detecting the expression level of a CCPM family member in said sample; (c) determining the prognosis of the cancer based on the expression level of a CCPM family member; and (d) determining the treatment regime according to the prognosis.
- the invention includes a device for detecting a CCPM 5 comprising: a substrate having a CCPM capture reagent thereon; and a detector associated with said substrate, said detector capable of detecting a CCPM associated with said capture reagent.
- kits for detecting cancer comprising: a substrate; a CCPM capture reagent; and instructions for use.
- method for detecting a CCPM using qPCR comprising: a forward primer specific for said CCPM; a reverse primer specific for said CCPM; PCR reagents; a reaction vial; and instructions for use.
- kits for detecting the presence of a CCPM polypeptide or peptide comprising: a substrate having a capture agent for said CCPM polypeptide or peptide; an antibody specific for said CCPM polypeptide or peptide; a reagent capable of labeling bound antibody for said CCPM polypeptide or peptide; and instructions for use.
- this invention includes a method for determining the prognosis of colorectal cancer, comprising the steps of: providing a tumour sample from a patient suspected of having colorectal cancer; measuring the presence of a CCPM polypeptide using an ELISA method.
- the CCPM of the invention is selected from the markers set forth in Tables 1, 2, 5, or 9.
- the CCPM is included in a prognostic signature selected from the signatures set forth in Tables 3, 4, 8A, 8B, or 10.
- Example 1 Patients and methods
- NZ New Zealand
- DE Germany
- Tumor content ranged between 70 and 100% with an average of 87%.
- NZ tumours Tumours were homogenized and RNA was extracted using Tri-Reagent (Progenz, Auckland, New Zealand). The RNA was then further purified using RNeasy mini column (Qiagen, Victoria, Australia). Ten micrograms of RNA was labelled with Cy5 dUTP using the indirect amino-allyl cDNA labelling protocol. A reference RNA from 12 different cell lines was labelled with Cy3 dUTP. The fluorescently labelled cDNA were purified using a QiaQuick PCR purification kit (Qiagen, Victoria, Australia) according to the manufacturer's protocol.
- NZ tumours Hybridisation of the labelled target cDNA was performed using MWG Human 30K Array oligonucleotides printed on epoxy coated slides. Slides were blocked with 1% BSA and the hybridisation was done in pre-hybridisation buffer at 42°C for at least 12 hours followed by a high stringency wash. Slides were scanned with a GenePix Microarray Scanner and data was analyzed using GenePix Pro 4.1 Microarray Acquisition and Analysis Software (Axon, CA).
- cRNA was mixed with B2-control oligonucleotide (Affymetrix, Santa Clara, CA), eukaryotic hybridization controls (Affymetrix, Santa Clara, CA), herring sperm (Promega, Madison, WT), buffer and BSA to a final volume of 300 ⁇ l and hybridized to one microarray chip (Affymetrix, Santa Clara, CA) for 16 hours at 45°C.
- B2-control oligonucleotide Affymetrix, Santa Clara, CA
- eukaryotic hybridization controls Affymetrix, Santa Clara, CA
- herring sperm Promega, Madison, WT
- buffer and BSA to a final volume of 300 ⁇ l and hybridized to one microarray chip (Affymetrix, Santa Clara, CA) for 16 hours at 45°C.
- NZ data Data pre-processing and normalization was performed in the R computing environment (10). A Iog2 transformation was applied to the foreground intensities from each channel of each array. Data from each spot was used on a per array basis to perform print-tip loss normalization via the limma package (11) from the Bioconductor suite of analysis tools (12). Scale normalization (13) was then used to standardize the distribution of log intensity ratios across arrays. Post-normalization cluster analysis revealed the presence of a gene-specific print-run effect present in the data. Analysis of variance (ANOVA) normalization was used to estimate and remove print run effects from the data for each gene. Replicate array data was available for 46 of the 149 samples.
- ANOVA Analysis of variance
- Example 7 Survival analysis Kaplan-Meier survival analysis for censored data was performed using the survival package within the R computing environment. Survival was defined to be "disease free survival" post surgery. For each analysis, survival curves were constructed, and the log-rank test (15) was used to assess the presence of significant differences between the curves for the two groups in question. Censoring was taken into account for both the NZ and DE data sets. For the disease-free survival data, right censoring prior to five years could only occur for non-recurrent patients as a result of either death, or the last clinical follow-up occurring at less than five years. Odds ratios and confidence intervals were produced using the epitools package for R.
- Example 8 Identification of markers co-expressed with chemokine Iigands Genes in the DE data which had a Pearson correlation coefficient greater than 0.75 with at least one of the four chemokines appearing in the predictor in the non-relapse group were selected for ontology analysis. Ontology was performed using DAVID (hypertext transfer protocol://apps 1.niaid.nih.gov/david/).
- Example 10 Exhaustive Identification of differentially expressed markers DE Data Set: The BRB Array Tools class comparison procedure was used to detect probes exhibiting statistically significant differences in average intensity between relapse and non-relapse samples. The RVM (random variance model) was again used to produce p- values for each probe in the data set. In this second round, a total of 325 probes were found to be significantly differentially expressed between the two sample classes using an arbitrary significance threshold of 0.05. Note this selection of genes did not apply any fold-change threshold, and used a significance cut off of 0.05, rather than the threshold of 0.001 that was used in Example 6.
- the test for significance comprises the following: generating a test statistic for each gene which was identical to that of a standard two sample t-test (45) except that the estimate of the pooled variance was obtained by representing the variance structure across all genes as an F-distribution, and then using the parameters, a and b, of this distribution (obtained via maximization of the empirical likelihood function) to form the following estimate of the pooled variance (see next page),
- S> is the new estimate of the pooled variance
- Sp ooled is the standard estimate of pooled variance (45)
- n is the number of samples
- a and b are the parameters of the F-distribution (46).
- a t-distribution with n-2+2a degrees of freedom was used to obtain a p- value for each gene.
- the False Discovery Rate controlling procedure of Benjamini and Hochberg (7) was used to produce adjusted p-values for each gene.
- a gene was considered to have undergone significant differential expression if its adjusted p-value was less than 0.05.
- the NZ data set was generated using oligonucleotide printed microarrays.
- SVM support vector machine
- Tables 8A and 8B the gene descriptions are shown in Tables 3 and 4, respectively.
- NZ data ( training; SVM) 0.77 (0.66, 0.86) ⁇ 0.53 (0.33, 0.73) 0.88 (0.77, 0.95) 0.002 8.4 (3.5, 21.4)
- German data were removed from 0.72 0.38 0.87 0.011
- NZ data set (training; SVM)
- the NZ signature had an odds ratio for disease recurrence in the NZ patients of 8.4 (95% CI 3.5-21.4).
- the DE data set was generated using Affymetrix arrays resulting in a 19-gene (22- probe) and 3-nearest neighbour (3-NN) signature (selection threshold 0.002, log 2 fold change>l.l, 84% classification rate, 85% sensitivity, 83% specificity, pO.OOOl,
- NZ and DE patients was significantly different for those predicted to recur or not recur (NZ signature, pO.OOOl, Fig. 2A; DE signature, pO.OOOl, Fig. 2B).
- Example 13 External validation of the NZ and DE prognostic signatures
- the DE signature was next validated by using the 19 genes to construct a 3-NN signature in the NZ data set by LOOCV.
- the odds ratio for recurrence in NZ patients, using the DE signature was 2.6 (95% CI 1.2-6.0).
- the reduction of the prediction rate was due to five genes from DE signature not being present in the NZ data set. This was confirmed when removal of these five genes from the DE data set resulted in a reduction of the LOOCV prediction rate from 84% to 67% (Table 7).
- PBK is a protein kinase, which is involved in the process of mitosis (26), and the only gene common to the NZ and DE signatures. Eleven of 19 genes in the DE signature (Table 4) are involved in the immune response including 4 chemokine ligands (CXCL9, CXCLlO,
- stage I/II samples could be used to predict stage III outcome. It was also surprising that many genes associated with recurrent disease are related to the immune response.
- the immune response has an important role in the progression of different cancers and T- lymphocyte infiltration in CRC patients is an indicator of good prognosis (36-38). All of the eleven immune response (Table 5) genes were down-regulated in recurrent patients which would be unexpected based on known biological mechanisms.
- chemokine genes were chosen for further analysis. Chemokine ligands not only reflect the activity of the immune system and mediate leukocyte recruitment but also are involved in chemotaxis, cell adhesion and motility, and angiogenesis (36). To investigate the role of the immune response genes, 86 genes co-expressed with the chemokine ligands were identified. Almost half of these genes had a Gene Ontology classification within the "immune response" category suggesting that the primary function of these genes in the recurrence process is the modulation of the immune response.
- CD4+ and CD8+ T cell antigens (CD8A, CD3, PRPl 5 TRA@, TRB@) or functionally related antigens, for example, major histocompatibility molecules, interferon gamma induced proteins, and IL2RB, were found in the co-expressed gene list.
- functionally related antigens for example, major histocompatibility molecules, interferon gamma induced proteins, and IL2RB
- Figure 4 shows the classification rates obtained from signatures of varying lengths.
- the classification rate is the proportion of correct relapse predictions (expressed as a percentage of total predictions), i.e., the proportion of samples correctly classified.
- the classification rates were determined using 11-fold cross validation. For this cross validation, a randomly selected stratified sample (i.e. same ratio of recurrent to non- recurrent tumours as the full data set) was removed as a validation set prior to gene selection of the genes, and model construction (using the training set of the remaining 50 samples). Cross-validation was then repeated a further ten times so that all 55 samples appeared in one validation set each. This 11 -fold cross-validation process was repeated as 10 replicates, and the results plotted in Figure 4 and Figure.
- the signatures generated in this way contained both ME2, and all but one contained FAS, which may be due to the importance of these genes in providing prediction of prognosis. It was noted that the high classification rate obtained using this approach on the in-house data set did not necessarily mean that these signatures that would be expected to perform better than those set forth in Example 12, on other data sets.
- genes comprising the signatures were themselves obtained from the list of significantly differentially expressed probes, and those from the list of genes which were found to correlate with genes from the NZ 22-gene signature. In some cases there was more than one significant (or correlated) probe per gene. In these cases, the prediction models used the median intensity data across all significant probes (i.e. those in the significant probe list, see table 1) for that gene.
- TNF Tumor necrosis factor
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Immunology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Analytical Chemistry (AREA)
- Biophysics (AREA)
- Medical Informatics (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- Organic Chemistry (AREA)
- Pathology (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Microbiology (AREA)
- Oncology (AREA)
- Biochemistry (AREA)
- Hospice & Palliative Care (AREA)
- Urology & Nephrology (AREA)
- Hematology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Food Science & Technology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Apparatus Associated With Microorganisms And Enzymes (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
Claims
Priority Applications (12)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2008547140A JP5632581B2 (en) | 2005-12-23 | 2006-12-22 | Prognosis prediction of colorectal cancer |
KR1020147036584A KR101579934B1 (en) | 2005-12-23 | 2006-12-22 | Prognosis prediction for colorectal cancer |
EP06835771.4A EP1977237B1 (en) | 2005-12-23 | 2006-12-22 | Prognosis prediction for colorectal cancer |
KR1020087018020A KR101562644B1 (en) | 2005-12-23 | 2006-12-22 | Prognosis prediction for colorectal cancer |
CN2006800529023A CN101389957B (en) | 2005-12-23 | 2006-12-22 | Prognosis prediction for colorectal cancer |
CA2640352A CA2640352C (en) | 2005-12-23 | 2006-12-22 | Prognosis prediction for colorectal cancer |
KR1020137023872A KR101530689B1 (en) | 2005-12-23 | 2006-12-22 | Prognosis prediction for colorectal cancer |
ES06835771.4T ES2525219T3 (en) | 2005-12-23 | 2006-12-22 | Prognosis prediction for colorectal cancer related application |
AU2006328023A AU2006328023B2 (en) | 2005-12-23 | 2006-12-22 | Prognosis prediction for colorectal cancer |
US12/214,782 US10422005B2 (en) | 2005-12-23 | 2008-06-20 | Methods for treating colorectal cancer using prognostic genetic markers |
US14/860,024 US20160068916A1 (en) | 2005-12-23 | 2015-09-21 | Test kits |
US16/552,737 US11180815B2 (en) | 2005-12-23 | 2019-08-27 | Methods for treating colorectal cancer using prognostic genetic markers |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
NZ544432A NZ544432A (en) | 2005-12-23 | 2005-12-23 | Prognosis prediction for colorectal cancer using a prognositc signature comprising markers ME2 and FAS |
NZ544432 | 2005-12-23 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/214,782 Continuation US10422005B2 (en) | 2005-12-23 | 2008-06-20 | Methods for treating colorectal cancer using prognostic genetic markers |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2007073220A1 true WO2007073220A1 (en) | 2007-06-28 |
WO2007073220A9 WO2007073220A9 (en) | 2007-11-08 |
Family
ID=38188869
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/NZ2006/000343 WO2007073220A1 (en) | 2005-12-23 | 2006-12-22 | Prognosis prediction for colorectal cancer |
Country Status (10)
Country | Link |
---|---|
US (3) | US10422005B2 (en) |
EP (3) | EP1977237B1 (en) |
JP (3) | JP5632581B2 (en) |
KR (3) | KR101562644B1 (en) |
CN (1) | CN101389957B (en) |
AU (1) | AU2006328023B2 (en) |
CA (2) | CA3015335A1 (en) |
ES (3) | ES2536233T3 (en) |
NZ (3) | NZ544432A (en) |
WO (1) | WO2007073220A1 (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008061525A3 (en) * | 2006-11-02 | 2008-10-09 | Signature Diagnostics Ag | Prognostic marker for classifying the three-year progression-free survival of patients with colorectal carcinoma based on expression profiles of biological samples |
WO2009071720A1 (en) | 2007-12-04 | 2009-06-11 | Universidad Autónoma de Madrid | Genomic imprinting for the prognosis of the course of colorectal adenocarcinoma |
EP2215254A1 (en) * | 2007-10-05 | 2010-08-11 | Pacific Edge Biotechnology Ltd | Proliferation signature and prognosis for gastrointestinal cancer |
US20100330079A1 (en) * | 2007-06-04 | 2010-12-30 | Curzio Ruegg | Biomarkder combinations for colorectal cancer |
WO2011006642A1 (en) | 2009-07-16 | 2011-01-20 | Roche Diagnostics Gmbh | Flap endonuclease-1 as a marker for cancer |
US8026060B2 (en) | 2006-01-11 | 2011-09-27 | Genomic Health, Inc. | Gene expression markers for colorectal cancer prognosis |
EP2634264A1 (en) * | 2006-07-21 | 2013-09-04 | Epigenomics AG | Methods and nucleic acids related to the gene PCDHGC3 for analyses of cellular proliferative disorders |
US9605318B2 (en) | 2004-04-09 | 2017-03-28 | Genomic Health, Inc. | Gene expression markers for predicting response to chemotherapy |
EP2391887A4 (en) * | 2009-02-02 | 2018-04-04 | Rutgers, The State University Of New Jersey | Image-based risk score-a prognostic predictor of survival and outcome from digital histopathology |
US10179936B2 (en) | 2009-05-01 | 2019-01-15 | Genomic Health, Inc. | Gene expression profile algorithm and test for likelihood of recurrence of colorectal cancer and response to chemotherapy |
US11725048B2 (en) | 2019-12-20 | 2023-08-15 | Hudson Institute of Medical Research | CXCL10 binding proteins and compositions thereof |
Families Citing this family (46)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2783348B1 (en) * | 1998-09-15 | 2000-10-13 | Alstom Technology | METHOD OF DISCRIMINATION BETWEEN AN INTERNAL ARC AND A CUT-OUT ARC IN A MEDIUM OR HIGH VOLTAGE CIRCUIT BREAKER |
US20090076734A1 (en) | 2007-03-22 | 2009-03-19 | Torres-Roca Javier F | Gene Signature for the Prediction of Radiation Therapy Response |
US20080234946A1 (en) * | 2007-03-22 | 2008-09-25 | University Of South Florida | Predictive radiosensitivity network model |
US8445271B2 (en) | 2010-05-03 | 2013-05-21 | Enzo Life Sciences, Inc. | Processes and kits for determining multi-drug resistance of cells |
WO2011140662A1 (en) * | 2010-05-13 | 2011-11-17 | The Royal Institution For The Advancement Of Learning / Mcgill University | Cux1 signature for determination of cancer clinical outcome |
EP2576815B1 (en) * | 2010-06-04 | 2018-02-14 | Biomérieux | Method for the prognosis of colorectal cancer |
US9539323B2 (en) | 2010-08-06 | 2017-01-10 | Beth Israel Deaconess Medical Center, Inc. | Methods and compositions for malic enzyme 2 (ME2) as a target for cancer therapy |
KR101401561B1 (en) * | 2010-12-30 | 2014-06-11 | 주식회사 바이오인프라 | Cancer Diagnosis Information Generation Method, Cancer Diagnosis System Device |
US20120184454A1 (en) * | 2011-01-14 | 2012-07-19 | Kalady Matthew F | Gene signature is associated with early stage rectal cancer recurrence |
CN102183662A (en) * | 2011-03-22 | 2011-09-14 | 浙江大学 | Method for establishing colon cancer prognosis prediction model |
CN104053788A (en) * | 2011-11-28 | 2014-09-17 | 加泰罗尼亚调查和高级研究机构 | Methods and kits for prognosis of colorectal cancer |
US9977033B2 (en) * | 2012-09-11 | 2018-05-22 | The Board Of Regents Of The University Of Texas System | Methods for assessing cancer recurrence |
KR20160045547A (en) | 2014-10-17 | 2016-04-27 | 에스케이텔레콤 주식회사 | Composition for diagnosing pancreatic cancer and method for diagnosing pancreatic cancer using the same |
CN105021828B (en) * | 2015-07-20 | 2017-07-28 | 上海交通大学医学院附属新华医院 | By detecting that three phase colorectal carcinoma patients serums predict tumor patient prognosis |
CN109072309B (en) | 2016-02-02 | 2023-05-16 | 夸登特健康公司 | Cancer evolution detection and diagnosis |
WO2017143322A1 (en) | 2016-02-18 | 2017-08-24 | University Of South Florida | Materials and methods for detecting cancer based on urinary levels of rhamm |
CA3023487A1 (en) | 2016-05-10 | 2017-11-16 | National University Corporation Tokyo Medical And Dental University | Expression inhibitor of inflammation promoting factor, screening method for active ingredient thereof, expression cassette useful for said method, diagnostic agent and diagnosis method |
US10360499B2 (en) * | 2017-02-28 | 2019-07-23 | Anixa Diagnostics Corporation | Methods for using artificial neural network analysis on flow cytometry data for cancer diagnosis |
US11164082B2 (en) | 2017-02-28 | 2021-11-02 | Anixa Diagnostics Corporation | Methods for using artificial neural network analysis on flow cytometry data for cancer diagnosis |
US9934364B1 (en) | 2017-02-28 | 2018-04-03 | Anixa Diagnostics Corporation | Methods for using artificial neural network analysis on flow cytometry data for cancer diagnosis |
US20180357361A1 (en) * | 2017-06-13 | 2018-12-13 | Feliks Frenkel | Systems and methods for identifying responders and non-responders to immune checkpoint blockade therapy |
SG11202004097RA (en) * | 2017-11-09 | 2020-06-29 | Univ Nat Corp Tokyo Medical & Dental | Inhibitor of the expression of cancer-promoting factors, screening method for active ingredient thereof, expression cassette useful in said method, diagnostic drug, and diagnostic method |
WO2019154884A1 (en) * | 2018-02-07 | 2019-08-15 | Ecole Polytechnique Federale De Lausanne (Epfl) | Method for determining cancer invasiveness and patient prognosis |
WO2019173233A1 (en) * | 2018-03-05 | 2019-09-12 | Anixa Diagnostics Corporation | Methods for using artificial neural network analysis on flow cytometry data for cancer diagnosis |
TW202018727A (en) | 2018-11-09 | 2020-05-16 | 財團法人工業技術研究院 | Ensemble learning predicting method and system |
CN111321221B (en) * | 2018-12-14 | 2022-09-23 | 中国医学科学院肿瘤医院 | Composition, microarray and computer system for predicting risk of recurrence after regional resection of rectal cancer |
CN109599181B (en) * | 2019-01-09 | 2021-04-16 | 中国医学科学院肿瘤医院 | Survival prediction system and prediction method for T3-LARC patient before treatment |
EP3935581A4 (en) | 2019-03-04 | 2022-11-30 | Iocurrents, Inc. | Data compression and communication using machine learning |
CN109762907B (en) * | 2019-03-27 | 2022-02-22 | 中山大学附属第六医院 | Application of hypoxia-related gene in kit for predicting colorectal cancer at stage I/II |
CN109811057B (en) * | 2019-03-27 | 2022-02-22 | 中山大学附属第六医院 | Application of hypoxia-related gene in colorectal cancer prediction system |
US11154240B2 (en) | 2019-04-02 | 2021-10-26 | Kpn Innovations Llc | Methods and systems for utilizing diagnostics for informed vibrant constitutional guidance |
CN110246591B (en) * | 2019-07-15 | 2021-11-05 | 中国中医科学院西苑医院 | Prognosis prediction system for traditional Chinese medicine treatment population |
CN112342295A (en) * | 2019-08-06 | 2021-02-09 | 中山大学孙逸仙纪念医院 | Tumor marker for detecting human colorectal cancer and application thereof |
CN110791565B (en) * | 2019-09-29 | 2021-09-03 | 浙江大学 | Prognostic marker gene for colorectal cancer recurrence prediction in stage II and random survival forest model |
US12020417B2 (en) * | 2020-04-24 | 2024-06-25 | Camtek Ltd. | Method and system for classifying defects in wafer using wafer-defect images, based on deep learning |
US11289206B2 (en) | 2020-06-02 | 2022-03-29 | Kpn Innovations, Llc. | Artificial intelligence methods and systems for constitutional analysis using objective functions |
US11211158B1 (en) | 2020-08-31 | 2021-12-28 | Kpn Innovations, Llc. | System and method for representing an arranged list of provider aliment possibilities |
CN112301131B (en) * | 2020-11-13 | 2021-10-26 | 中国医学科学院基础医学研究所 | Application of RNF138 as biomarker for predicting sensitivity of colorectal cancer to SC75741 treatment |
CN113591919B (en) * | 2021-06-29 | 2023-07-21 | 复旦大学附属中山医院 | Analysis method and system for prognosis of early hepatocellular carcinoma postoperative recurrence based on AI |
WO2023128419A1 (en) * | 2021-12-31 | 2023-07-06 | 주식회사 이노제닉스 | Method for screening colorectal cancer and colorectal polyps or advanced adenomas and application thereof |
IT202200001817A1 (en) | 2022-02-02 | 2023-08-02 | Aizoon S R L | Procedure for the prognosis of a disease following a therapeutic treatment, related system and IT product |
CN114652814B (en) * | 2022-03-11 | 2023-02-03 | 郑州大学第一附属医院 | Immune checkpoint inhibitor and application thereof |
IT202200005861A1 (en) | 2022-03-24 | 2023-09-24 | Aizoon S R L | PROCEDURE TO ESTIMATE A VARIABLE OF INTEREST ASSOCIATED WITH A GIVEN DISEASE AS A FUNCTION OF A PLURALITY OF DIFFERENT OMICS DATA, RELATED DEVICE AND COMPUTER PRODUCT |
AU2023208071A1 (en) | 2022-07-25 | 2024-02-08 | Aizoon S.r.l. | Method for providing a clinical decision support, corresponding system for providing a clinical decision support, and computer program product |
WO2024138106A1 (en) * | 2022-12-23 | 2024-06-27 | The Wistar Institute Of Anatomy And Biology | Compositions and methods for identification of hypomorphic p53 variants |
CN115938590B (en) * | 2023-02-09 | 2023-05-02 | 四川大学华西医院 | Construction method and prediction system of colorectal cancer postoperative LARS prediction model |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002006867A2 (en) | 2000-07-18 | 2002-01-24 | Kvh Industries, Inc. | Method of incorporating optical material into an optical fiber |
WO2004090550A2 (en) | 2003-04-08 | 2004-10-21 | Colotech A/S | A method for detection of colorectal cancer in human samples |
WO2005064009A1 (en) | 2003-12-27 | 2005-07-14 | Aros Applied Biotechnology Aps | Classification of cancer |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6582908B2 (en) * | 1990-12-06 | 2003-06-24 | Affymetrix, Inc. | Oligonucleotides |
GB9805918D0 (en) * | 1998-03-19 | 1998-05-13 | Nycomed Amersham Plc | Sequencing by hybridisation |
MXPA01009369A (en) * | 1999-03-15 | 2003-06-06 | Eos Biotechnology Inc | Novel methods of diagnosing colorectal cancer, compositions, and methods of screening for colorectal cancer modulators. |
US20030077568A1 (en) * | 2000-09-15 | 2003-04-24 | Gish Kurt C. | Methods of diagnosis of colorectal cancer, compositions and methods of screening for colorectal cancer modulators |
CA2477043A1 (en) * | 2001-02-27 | 2002-09-06 | Eos Biotechnology, Inc. | Novel methods of diagnosis of metastatic colorectal cancer, compositions and methods of screening for modulators of metastatic colorectal cancer |
US20040191782A1 (en) * | 2003-03-31 | 2004-09-30 | Yixin Wang | Colorectal cancer prognostics |
US20060019256A1 (en) * | 2003-06-09 | 2006-01-26 | The Regents Of The University Of Michigan | Compositions and methods for treating and diagnosing cancer |
US20050048494A1 (en) * | 2003-08-28 | 2005-03-03 | Yixin Wang | Colorectal cancer prognostics |
-
2005
- 2005-12-23 NZ NZ544432A patent/NZ544432A/en unknown
-
2006
- 2006-12-22 WO PCT/NZ2006/000343 patent/WO2007073220A1/en active Application Filing
- 2006-12-22 EP EP06835771.4A patent/EP1977237B1/en not_active Not-in-force
- 2006-12-22 CA CA3015335A patent/CA3015335A1/en not_active Abandoned
- 2006-12-22 ES ES11172587.5T patent/ES2536233T3/en active Active
- 2006-12-22 KR KR1020087018020A patent/KR101562644B1/en active IP Right Grant
- 2006-12-22 ES ES11172410.0T patent/ES2484702T3/en active Active
- 2006-12-22 CA CA2640352A patent/CA2640352C/en not_active Expired - Fee Related
- 2006-12-22 KR KR1020137023872A patent/KR101530689B1/en active IP Right Grant
- 2006-12-22 EP EP11172410.0A patent/EP2371972B1/en not_active Not-in-force
- 2006-12-22 ES ES06835771.4T patent/ES2525219T3/en active Active
- 2006-12-22 KR KR1020147036584A patent/KR101579934B1/en active IP Right Grant
- 2006-12-22 CN CN2006800529023A patent/CN101389957B/en not_active Expired - Fee Related
- 2006-12-22 NZ NZ586616A patent/NZ586616A/en unknown
- 2006-12-22 NZ NZ597363A patent/NZ597363A/en unknown
- 2006-12-22 AU AU2006328023A patent/AU2006328023B2/en not_active Ceased
- 2006-12-22 JP JP2008547140A patent/JP5632581B2/en not_active Expired - Fee Related
- 2006-12-22 EP EP11172587.5A patent/EP2392678B1/en not_active Not-in-force
-
2008
- 2008-06-20 US US12/214,782 patent/US10422005B2/en not_active Expired - Fee Related
-
2014
- 2014-05-19 JP JP2014103714A patent/JP6218141B2/en not_active Expired - Fee Related
-
2015
- 2015-09-21 US US14/860,024 patent/US20160068916A1/en not_active Abandoned
- 2015-12-28 JP JP2015257292A patent/JP6058780B2/en not_active Expired - Fee Related
-
2019
- 2019-08-27 US US16/552,737 patent/US11180815B2/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002006867A2 (en) | 2000-07-18 | 2002-01-24 | Kvh Industries, Inc. | Method of incorporating optical material into an optical fiber |
WO2004090550A2 (en) | 2003-04-08 | 2004-10-21 | Colotech A/S | A method for detection of colorectal cancer in human samples |
WO2005064009A1 (en) | 2003-12-27 | 2005-07-14 | Aros Applied Biotechnology Aps | Classification of cancer |
Non-Patent Citations (13)
Title |
---|
ARANGO D. ET AL.: "Gene-expressing profiling predicts recurrence in Duke's C colorectal cancer", GASTROENTEROLOGY, vol. 129, September 2005 (2005-09-01), pages 874 - 884, XP005314702 * |
BACKUS H.H.J. ET AL.: "Differential expression of cell cycle and apoptosis related proteins in colorectal mucosa, primary colon tumours, and liver metastases", JOURNAL OF CLINICAL PHARMACOLOGY, vol. 55, 2002, pages 206 - 211, XP003014770 * |
BARRIER A. ET AL.: "Colon cancer prognosis prediction by gene expression profiling", ONCOGENE, vol. 24, September 2005 (2005-09-01), pages 6155 - 6164, XP002393868 * |
BERTUCCI F. ET AL.: "Gene expression profiling of colon cancer by DNA microarrays and correlation with histoclinical parameters", ONCOGENE, vol. 23, 2004, pages 1377 - 1391, XP002414422 * |
ESCHRICH S. ET AL.: "Molecular staging for survival prediction of colorectal cancer patients", JOURNAL OF CLINICAL ONCOLOGY, vol. 23, no. 15, May 2005 (2005-05-01), pages 3526 - 3535, XP003014768 * |
GAO ET AL., CHINESE JOUNRAL OF CLINICAL REHABILITATION, 9 January 2005 (2005-01-09), pages 228 - 229 |
GAO J.-J. ET AL.: "Evaluative significance of the expression of Fas antigen and P53 protein in colorectal cancer for the biological characters and prognosis", CHINESE JOURNAL OF CLINICAL REHABILITATION, vol. 9, no. 14, April 2005 (2005-04-01), pages 228 - 229, XP008082293 * |
GREENBAUM ET AL., GENOME BIOLOGY, vol. 40, no. 9, 1 January 2003 (2003-01-01), pages 117.01 - 117.08 |
LIN ET AL.: "Journal of Sichuan University", vol. 36, July 2005, MEDICAL SCIENCE EDITION, pages: 503 - 505 |
OERNTOFT ET AL., MOLECULAR AND CELLULAR PROTEOMICS, vol. 1, no. 1, 1 January 2002 (2002-01-01), pages 37 - 45 |
OGAWA S. ET AL.: "The breakdown of apoptotic mechanism in the development and progression of colorectal carcinoma", ANTICANCER RESEARCH, vol. 24, 2004, pages 1569 - 1580, XP008082092 * |
See also references of EP1977237A4 |
WANG Y. ET AL.: "Gene expression profiles and molecular markers to predict recurrence of Dukes' B colon cancer", JOURNAL OF CLINICAL ONCOLOGY, vol. 22, no. 9, 2004, pages 1564 - 1571, XP003014769 * |
Cited By (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9605318B2 (en) | 2004-04-09 | 2017-03-28 | Genomic Health, Inc. | Gene expression markers for predicting response to chemotherapy |
US8153379B2 (en) | 2006-01-11 | 2012-04-10 | Genomic Health, Inc. | Gene expression markers for colorectal cancer prognosis |
US8367345B2 (en) | 2006-01-11 | 2013-02-05 | Genomic Health Inc. | Gene expression markers for colorectal cancer prognosis |
US8273537B2 (en) | 2006-01-11 | 2012-09-25 | Genomic Health, Inc. | Gene expression markers for colorectal cancer prognosis |
US8198024B2 (en) | 2006-01-11 | 2012-06-12 | Genomic Health, Inc. | Gene expression markers for colorectal cancer prognosis |
US8153378B2 (en) | 2006-01-11 | 2012-04-10 | Genomic Health, Inc. | Gene expression markers for colorectal cancer prognosis |
US8026060B2 (en) | 2006-01-11 | 2011-09-27 | Genomic Health, Inc. | Gene expression markers for colorectal cancer prognosis |
US8029995B2 (en) | 2006-01-11 | 2011-10-04 | Genomic Health, Inc. | Gene expression markers for colorectal cancer prognosis |
US8153380B2 (en) | 2006-01-11 | 2012-04-10 | Genomic Health, Inc. | Gene expression markers for colorectal cancer prognosis |
EP2634264A1 (en) * | 2006-07-21 | 2013-09-04 | Epigenomics AG | Methods and nucleic acids related to the gene PCDHGC3 for analyses of cellular proliferative disorders |
WO2008061525A3 (en) * | 2006-11-02 | 2008-10-09 | Signature Diagnostics Ag | Prognostic marker for classifying the three-year progression-free survival of patients with colorectal carcinoma based on expression profiles of biological samples |
US8697350B2 (en) * | 2007-06-04 | 2014-04-15 | Diagnoplex | Biomarker combinations for colorectal cancer |
US20100330079A1 (en) * | 2007-06-04 | 2010-12-30 | Curzio Ruegg | Biomarkder combinations for colorectal cancer |
JP2011517765A (en) * | 2007-06-04 | 2011-06-16 | ダイアグノプレックス エセアー | Biomarker combinations for colorectal cancer |
EP2215254A1 (en) * | 2007-10-05 | 2010-08-11 | Pacific Edge Biotechnology Ltd | Proliferation signature and prognosis for gastrointestinal cancer |
EP2215254A4 (en) * | 2007-10-05 | 2012-07-18 | Pacific Edge Biotechnology Ltd | Proliferation signature and prognosis for gastrointestinal cancer |
CN101932724A (en) * | 2007-10-05 | 2010-12-29 | 环太平洋生物技术有限公司 | The hyperplasia label and the prognosis of gastrointestinal cancer |
JP2010539973A (en) * | 2007-10-05 | 2010-12-24 | パシフィック エッジ バイオテクノロジー リミティド | Signs of growth and prognosis in gastrointestinal cancer |
JP2015165811A (en) * | 2007-10-05 | 2015-09-24 | パシフィック エッジ バイオテクノロジー リミティド | Proliferation signature and prognosis for gastrointestinal cancer |
EP2995690A1 (en) * | 2007-10-05 | 2016-03-16 | Pacific Edge Limited | Proliferation signature and prognosis for gastrointestinal cancer |
JP2017060517A (en) * | 2007-10-05 | 2017-03-30 | パシフィック エッジ バイオテクノロジー リミティド | Sign of proliferation and prognosis in gastrointestinal cancer |
JP2018126154A (en) * | 2007-10-05 | 2018-08-16 | パシフィック エッジ リミテッド | Sign of proliferation and prognosis in gastrointestinal cancer |
WO2009071720A1 (en) | 2007-12-04 | 2009-06-11 | Universidad Autónoma de Madrid | Genomic imprinting for the prognosis of the course of colorectal adenocarcinoma |
EP2391887A4 (en) * | 2009-02-02 | 2018-04-04 | Rutgers, The State University Of New Jersey | Image-based risk score-a prognostic predictor of survival and outcome from digital histopathology |
US10179936B2 (en) | 2009-05-01 | 2019-01-15 | Genomic Health, Inc. | Gene expression profile algorithm and test for likelihood of recurrence of colorectal cancer and response to chemotherapy |
WO2011006642A1 (en) | 2009-07-16 | 2011-01-20 | Roche Diagnostics Gmbh | Flap endonuclease-1 as a marker for cancer |
US11725048B2 (en) | 2019-12-20 | 2023-08-15 | Hudson Institute of Medical Research | CXCL10 binding proteins and compositions thereof |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11180815B2 (en) | Methods for treating colorectal cancer using prognostic genetic markers | |
JP5745848B2 (en) | Signs of growth and prognosis in gastrointestinal cancer | |
US10266902B2 (en) | Methods for prognosis prediction for melanoma cancer | |
AU2018200973B2 (en) | Prognosis prediction for colorectal cancer | |
AU2013263832B2 (en) | Prognosis prediction for colorectal cancer |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2008547140 Country of ref document: JP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006328023 Country of ref document: AU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006835771 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 6372/DELNP/2008 Country of ref document: IN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020087018020 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2640352 Country of ref document: CA |
|
ENP | Entry into the national phase |
Ref document number: 2006328023 Country of ref document: AU Date of ref document: 20061222 Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 200680052902.3 Country of ref document: CN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020137023872 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020147036584 Country of ref document: KR |