WO2011012136A1 - A method for classifying a human cell sample as cancerous - Google Patents

A method for classifying a human cell sample as cancerous Download PDF

Info

Publication number
WO2011012136A1
WO2011012136A1 PCT/DK2010/050191 DK2010050191W WO2011012136A1 WO 2011012136 A1 WO2011012136 A1 WO 2011012136A1 DK 2010050191 W DK2010050191 W DK 2010050191W WO 2011012136 A1 WO2011012136 A1 WO 2011012136A1
Authority
WO
WIPO (PCT)
Prior art keywords
mir
cancer
group
level
cell sample
Prior art date
Application number
PCT/DK2010/050191
Other languages
French (fr)
Inventor
Fog Jacob Ulrik
Baker Adam
Moeller Soeren
Original Assignee
Exiqon A/S
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Exiqon A/S filed Critical Exiqon A/S
Publication of WO2011012136A1 publication Critical patent/WO2011012136A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/118Prognosis of disease development
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/178Oligonucleotides characterized by their use miRNA, siRNA or ncRNA

Definitions

  • the present invention relates to the field of cancer-diagnostics.
  • the invention 5 relates to a microRNA expression signature that allows discriminating between cancer and non-cancer samples.
  • Colorectal cancer is the third most frequent cancer disease and the second most frequent cause of cancer related mortality in the Western World. There are approximately 400.000 new cases annually in these countries.
  • Solid cancers are normally diagnosed based on a histo-pathological evaluation of tissue 15 samples. It is of paramount importance to the correct treatment of patients that correct
  • MicroRNAs are an abundant class of short endogenous RNAs that act as posttranscriptional regulators of gene expression by base-pairing with their target mRNAs. 25 miRs are 19-25 nucleotide (nt) RNAs that are processed from longer endogenous hairpin transcripts (Ambros et al. 2003, RNA 9: 277-279). To date more than 6000 miRs have been identified in humans, worms, fruit flies and plants according to the miR registry database release 1 1.0 in April 2008, hosted by Sanger Institute, UK.
  • breast cancer e.g., surgeons often takes a needle aspiration biopsy, also known as fine needle aspiration biopsy to investigate superficial tumors.
  • a fine needle aspiration biopsy is safer and less traumatic than an open surgical biopsy, however the histological structures of the tumor are prone to be perturbed during the sampling making the histo-pathological evaluation difficult
  • the present invention describes a method which do not require normal test material from the patient is simple, robust and have high specificity and predictive accuracy.
  • the present inventors believed that in order to obtain a meaningful and robust miR-signature based on quantitative PCR would require that the miR- level were normalised to a reference gene, typically a house-keeping gene, or an external standard and the miR-levels compared with the similar levels in a control tissue-sample (Bustin, 2004).
  • the present inventors surprisingly, realized that traditional controls, i.e. a control tissue-sample and a reference gene, were not necessary to obtain a meaningful and robust miR-signature of cancer provided that the signature is based on the level of a few, but carefully selected miRs.
  • the invention pertains to a method for classifying a test cell sample as cancerous, the method comprising:
  • said cut-off value being estimated by calculating the level of the same miR(s) that were selected from miR-Group I in step a) relative to the level of the same miR(s) selected from miR-Group Il in step a) in a dataset comprising data on miR levels comprising at least one normal cell sample.
  • the invention pertains to a method for assessing recurrence of a specific type of cancer in a human subject who previous has been treated for said specific type of cancer, the method comprising: a) detecting the level of at least one miR selected from miR-Group I consisting of: miR- 21 , miR-34a and miR-141 , and detecting the level of at least one miR selected from miR-Group Il consisting of: miR-126, miR-143 and miR-145 in a test cell sample obtained from a site close to and a tissue similar to the site and tissue-type of said specific type of cancer of said human subject and,
  • said cut-off value being estimated by calculating the level of the same miR(s) that were selected from miR-Group I in step a) relative to the level of the same miR(s) selected from miR-Group Il in step a) in a dataset comprising data on miR levels in cell samples from a collection of humans subjects who previously were treated for cancer and did not experienced recurrence of cancer as well as subjects who previously were treated for cancer and did experienced recurrence of cancer.
  • a miR signature comprised of least one miR selected from miR-Group I consisting of: miR-21 , miR-34a and miR-141 , and at least one miR selected from miR-Group Il consisting of: miR-126, miR-143 and miR-145 in will identify stage Il colon cancer patients at high risk of recurrence.
  • a further important aspect of the invention is the provision of a kit of parts for classifying a test cell sample as cancerous or non-cancerous comprising:
  • a further advantage of the herein described methods is that they can be run on formalin fixed paraffin embedded (FFPE) tissue sections.
  • FFPE formalin fixed paraffin embedded
  • cancer and “cancerous” are used interchangeably herein to refer to cells which has acquired one or more mutations allowing them to reproduce in defiance of the normal restraints to the cells of the body and to invade and colonize territories normally reserved for other cells.
  • Cancer cells exhibit relatively autonomous growth, characterized by a significant loss of control of cell proliferation.
  • Cancer cells include pre-malignant (e.g., benign hyperplasic), malignant, metastatic, and non-metastatic cells. Cancer cells often forms tumors which frequently comprise cells which are not cancer cells. Tumors of the colon are characterized by a number of well described characteristics well-known to the skilled artesian.
  • cancer colon test cell sample
  • non-cancerous or “non-cancer” which are used interchangeably herein to refer to a test cell sample which not is forming part a tumor or neoplasm or in a number allowing it to be classified as cancer by a skilled clinician.
  • miR and “microRNA” refer to about 18-25 nt non-coding RNAs derived from endogenous genes. They are processed from longer (ca 75 nt) hairpin-like precursors termed pre-miRs. MicroRNAs assemble in complexes termed miRNPs and recognize their targets by antisense complementarity. If the microRNAs match 100% their target, i.e. the
  • miR-signature is referred to a collection of miRs, the level, expression or amount of which is characteristic of the pathological status of a test sample.
  • the term "detecting the level of a miR” refer to the quantification of said miR.
  • One way of quantification is described in the Examples.
  • the miR may be quantified in a multitude of other ways e.g. by arrays, northern blots, dot blots, RN'ase protection assays, quantitative mass spectroscopy or various quantitative PCR-based methods such as the TaqMan assay or the UniRT assay used in the examples.
  • expression refers to the transcription and/or accumulation of RNA-molecules within a call.
  • level of expression of a miR and “level of a miR” are used synonymously as a measure of the "amount of a specific miR” that is detected in the sample.
  • the “amount of a specific miR” may be expressed in either absolute or relative measures and refers to values obtained by both quantitative, as well as qualitative methods.
  • One particularly preferred measure of the "amount of a specific miR” is the Crossing point (Cp) value obtained by real-time RT-QPCR as described in the examples.
  • the level of miR-21 , miR-34a, miR-126 and (miR-143 or miR-145) refer to the level of miR-21 , miR-34a, miR-126 and miR-143, or, level of miR-21 , miR-34a, miR-126 and miR-145.
  • level designates relative as well as absolute amounts of the miRs referred to.
  • Q-PCR quantitative polymerase chain reaction.
  • Q-PCR is highly sensitive method for quantifying the amounts of specific DNA and RNA species in a test sample.
  • a thorough treatise of the Q-PCR technique can be found in Bustin, S.A. (ed.) A-Z of quantitative PCR, IUL Biotechnology Series 5 (2004) 882 pages, which hereby is incorporated herein by reference.
  • sample refers to a sample of cells, or tissue or fluid isolated from an organism or organisms, including but not limited to solid tissue samples and samples of in vitro cell culture constituents.
  • a specific cancer e.g. colon cancer is known to metastasise to various organs, for example to liver and lung
  • the “sample” may be any sample of blood, plasma, serum, spinal fluid, lymph fluid, synovial fluid, urine, tears, blood cells, organs, tumors, that contains cells or lysed cells and also to samples of in vitro cell cultures.
  • machine learning can be used to discover general rules in large data sets, machine learning can e.g. be used to extract clinical informative data from a dataset comprising miR expression in cancer and non-cancer samples.
  • a general treatise of the concept of machine learning can be found in: Tom Mitchell, Machine Learning, McGraw Hill, 1997, which hereby is incorporated herein by reference.
  • the "cut-off value” is a threshold-value above (or below) which, a value calculated to represent the level of a number of miRs indicate that a test cell sample is from a cancer.
  • LNA LNA nucleoside or LNA nucleotide
  • LNA oligomer e.g., an oligonucleotide or nucleic acid
  • LNA monomers as disclosed in PCT Publication WO 99/14226 are in general particularly desirable modified nucleic acids for incorporation into an oligonucleotide of the invention.
  • oligonucleotide By “oligonucleotide,” “oligomer,” or “oligo” is meant a successive chain of monomers (e.g., glycosides of heterocyclic bases) connected via intemucleoside linkages.
  • LNA modified oligonucleotide or "LNA substituted oligonucleotide” is meant a
  • oligonucleotide comprising at least one LNA monomer.
  • nucleobase covers naturally occurring nucleobases as well as non-naturally occurring nucleobases.
  • nucleobase includes not only the known purine and pyrimidine heterocycles, but also heterocyclic analogues and tautomers thereof.
  • Illustrative examples of nucleobases are adenine, guanine, thymine, cytosine, uracil, purine, xanthine, diaminopurine, and 5-methylcytosine..
  • LNA monomers are depicted in capitals (T, A, G) and DNA monomers in lower case (t, a, c, g). Modified LNA monomers include 5' methyl cytosine shown as "C".
  • a further central element in the invention is the use of a predetermined cut-off value to determine whether a test cell sample is drawn from a cancer or not.
  • a cut-off value established on test cell samples from one specific cancer type and corresponding normal cell test sample(s) obtained from the same type of tissue may be useful also for the assessment of test cell samples obtained from different types of tissues.
  • a cut-off value established on colon samples appear informative also for the assessment of e.g. bladder and breast cancer samples (see figure 7).
  • the cut-off value is determined using a Receiver Operator Characteristic (ROC) curve, according to the method of Sackett et al., Clinical Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985, p. 106-7.
  • ROC Receiver Operator Characteristic
  • the cut-off value may be determined from a plot of pairs of true positive rates (i.e., sensitivity) and false positive rates (100%-specificity) that correspond to each possible cut-off value for the diagnostic test result.
  • the cut-off value on the plot that is the closest to the upper left-hand corner (i.e., the value that encloses the largest area) is the most accurate cut-off value, and a sample generating a signal that is higher than the cut-off value determined by this method may be considered positive.
  • the cut-off value may be shifted to the left along the plot, to minimize the false positive rate, or to the right, to minimize the false negative rate.
  • a test cell sample generating a value of the level of the miR(s) selected from miR- Group I relative to the level of the miR(s) selected from miR-Group Il that is higher than the cut-off value determined by this method is considered positive for cancer.
  • One preferred embodiment the invention may be expressed stating that the ratio: (level of miR21) x (level of miR34a)
  • level of miR126 x (level of miff 145) is relatively large in cancer tissue versus in the corresponding normal tissue.
  • the level of miRs may conveniently be quantified by Quantitative real-time Reverse
  • RT-QPCR Transcriptase mediated Polymerase Chain Reaction method
  • RT-QPCR (Bustin, S.A. (ed.) A- Z of quantitative PCR, IUL Biotechnology Series 5 (2004) 882 pages), see also Example 1 and 2.
  • Cp crossing point
  • the read-out from a typical real-time QPCR instrument is often the so-called Cp (crossing point)-value.
  • the Cp-value is related to the level of e.g. a specific miR, by the relation:
  • a linear regression model such as:
  • Logistic regression is a widely used method for generating linear models to classify data. Performing a logistic regression analysis on the data of Example 2 results in a best fit regression model:
  • the previously obtained dataset comprise data from at least one cancer and as least one non-cancer cell sample.
  • a previously obtained dataset comprising data from at least 10 cancer cell samples 5 and at least 10 corresponding non-cancer cell samples is preferred.
  • a dataset of that size furthermore posses the advantage that a cut-off value that characterize non-cancer samples from cancer samples can be calculated by methods well know in the art.
  • comparing the levels of miR-21 and miR-34a, to the levels of miR-126 and miR- 20 145 are similar informative to comparing the levels of miR-21 and miR-34a, to the levels of miR-126 and miR-143.
  • Our data indicate that miR-143 and miR-145 can be used
  • the data in figure 3 also show that the level of miR-21 comparing the levels of miR-143 (or 25 miR-145) is remarkably informative (i.e. both stable and accurate), indicated by the AUC values of 0.991 and 0.992 and an accuracy of 94,6%. Comparing the level of miR-21 to the levels of miR-126 is almost as good having AUC values of 1 ,000 and 0,970 and an accuracy of 91 ,9%.
  • the method for detecting cancer in a tissue sample further comprises: a) obtaining a biological sample from a patient; b) determining the probability of the test cancer cell sample being cancerous by a process implying machine learning (such as decision tree, principal component analysis, neural networks, or Bayesian networks) analysis wherein the applied program has been trained on a dataset comprised of expression data of the miRs selected from Mir-Group I and the miRs selected from Mir-Group Il in cancer as well as in corresponding normal cell samples.
  • machine learning such as decision tree, principal component analysis, neural networks, or Bayesian networks
  • GMM Gaussian mixture model
  • Genetic algorithms (1950's) Genetic algorithms (1950's)
  • Decision trees(1960's) Support vector machines can be used to produce estimators useable to separate cancer test samples from non-cancerous samples based on the level of miR-21 , miR-34a, miR-126 and (miR-143 or miR-145).
  • Example 2-5 the calculation of even a simple score: Cp(miR-126) + Cp(miR- 143) - Cp (miR-21 ) - Cp(miR-34a) allow a cut-off value to be estimated and used to classify a test cell sample as cancer or non-cancer.
  • Cut-off values from -1.3 to -2 have been obtained, however other values at such as in the range from -0.5 to -4, such as within the range of -1 to -3, e.g. within the range of -1.1 to -2.5, such as within the range of -1.2 to -2.3, including the range of -1.5 to -2.2 and the range of - 1.7 to -2.1 may be obtained depending on the precise experimental setup and the type of tissues analysed.
  • the miRs may be quantified in a multitude of ways a preferred method of quantification is one wherein the the levels of the miRs are determined by Q-PCR.
  • a quantification based on the UniRT-method described in Example 1 is particularly preferred.
  • Today the standard method for histological tissue preparation in pathology is to prepare Formalin-Fixed Paraffin-Embedded (FFPE) samples. Every year millions of FFPE samples are classified and archived by pathologists. Consequently, it a significant advantage of the herein described methods that they apply to cell samples which are formalin fixed paraffin embedded (FFPE) tissue sections.
  • FFPE Formalin-Fixed Paraffin-Embedded
  • a further aspect of the invention is a kit of parts for classifying a test cell sample as cancerous or non-cancerous comprising:
  • the reagents kit contains additional means to perform the quantification by the UniRT method of example 1 such a kit would conveniently comprise a selection of UniRT- primers, according to Table 1 , that are required to detect the necessary miRs, e.g. the primers required to detect miR-21 , miR-34a, miR-126 and miR-143 by the UniRT method.
  • FIGURE 1 Show the steps involved in microRNA specific qRT-PCR by the UniRT-protocol.
  • STEP 1 is a one-tube-reaction for all microRNAs present in a sample.
  • STEP 2 is a microRNA specific qPCR using forward and reverse primer pairs for a specific microRNA.
  • An oval indicate insertion of LNAs in forward and reverse primers.
  • NV designates a 3'-end VN- degenerate sequence motif comprising the two 3'-end terminal nucleotide residues, wherein V is either an adenine residue, a guanine residue, or a cytosine residue and N is either an adenine residue, a guanine residue, a cytosine residue or a thymine residue.
  • FIGURE 2 Show the relative expression of 5 selected miRs in Colon cancer and normal (NAT) FFPE tissue. Shown are ⁇ Cp values of all 5 miR relative to miR-103 (reference). miR- 141 , miR-21 and miR-34a are upregulated in cancer samples compared to normal, whereas miR-126 and miR-143 are downregulated in cancer relative to normal tissue. Boxes indicate the distribution of expression values from the 25th to the 75th percentile for each miR.
  • Vertical lines indicate values from the 5th to the 95th percentile.
  • the value on Y-axis is the miR score.
  • FIGURE 3 Show scatterplots of the distribution of 3 different the miR based scores obtained. Data from two separate rounds of experiments performed on the same clinical samples are shown. On each panel the “spots" to the left are the scores obtained in normal colon tissue (normal adjacent tissue, NAT), the “spots” to the left are the scores obtained in colon cancer tissue.
  • normal colon tissue normal adjacent tissue, NAT
  • the numbers under the abscissa are the Area Under receiver operating characteristic Curves (AUC) for each set of data. Values on ordinates are the scores referred to above. Each "spot" in the panels refer to the score of one tissue sample. The value on Y-axis is the miR score.
  • FIGURE 4 Shows the relative level of miR-143 plotted against the relative level of miR-143 is an almost perfect 1 :1 correlation.
  • the relative level of miR-143, ⁇ Cp(miR-143), is calculated based on miR-103.
  • the relative level of miR-145, ⁇ Cp(miR-145), is calculated based on miR-103.
  • FIGURE 5 Shows reciever operator curves (ROC) of specificity versus sensitivity of miRs indicated. Two separate rounds of measuring miR's and evaluating data were performed on the clinical material consisting of an equal numbers of cancer and non-cancer samples. The curves to the left are from round 1 , those to the right are from round 2.
  • ROC reciever operator curves
  • FIGURE 6 Show scatterplots of the distribution of the miR based score
  • FIGURE 7 miR based score +Cp(miR126)+Cp(miR143)-Cp(miR21 )-Cp(miR34a) obtained on cancer specimens of different origin and compared with the score obtained in normal adjacent tissue (NAT) of the same origin.
  • the tissues are from left to right: bladder tumor, bladder NAT, breast tumor, breast NAT, gallbladder tumor, gallbladder NAT, kidney tumor, kidney NAT, lung tumor, lung NAT, pancreas tumor, pancreas NAT, small intestine tumor, small intestine NAT and uterus tumor, uterus NAT.
  • the value on Y-axis is the miR based score.
  • Example 1 The UniRT method
  • the UniRT method for amplification and quantification of small non-coding RNA molecules by use of quantitative reverse transcription polymerase chain reaction (qRT- PCR) technology is described in brief.
  • the UniRT protocol is a two step protocol.
  • the miRs present in a sample are firstly poly-A-tailed using a poly(A) polymerase, which adds adenine residues to the 3'-end of RNA molecules.
  • an extension primer which has a poly-T-core nucleotide sequence, a 3'-end VN-degenerate motif and a 5'-end tail, is annealed to the poly- A-tailed miRs through hybridization with the VN-poly-T-sequence of the extension primer. Subsequently, the extension primer is extended in a reverse transcription reaction using the miR as template. The resulting primary extension product is composed of the extension primer and the newly synthesized cDNA, which complementary to the miRs in the sample.
  • a miR-specific PCR is carried out.
  • a miR-specific forward primer is annealed to 3'-end of the newly synthesized cDNA and the upper-strand synthesis is carried out by extending the forward primer in a DNA-polymerization reaction using the primary extension product as template.
  • a miR-specific reverse primer composed of a miR-specific 3'- end sequence, a poly-T-stretch and a 5'-end tail is then hybridized to the upper-strand and the lower-strand is synthesized by extension of the reverse primer.
  • Stage I A purely exploratory stage.
  • FFPE Fixed Paraffin Embedded
  • the level of miR-498, miR-191 , miR-21 , miR-103, miR-320, miR-203, miR-106a, let-7a, miR- 15 181 b, miR-20a, miR-26a, miR-151 , miR-127, miR-141 , miR-143, miR-145, miR-342, miR- 34a, miR-200c, miR-125b, miR-126, miR-212, miR-492, miR-142-5p, miR-92a, miR-9, miR- 16, miR-25, miR-27a, miR-29c, miR-200a, miR-708 and miR-155 were quantified using the UniRT qPCR method (see, example 1 ). In brief,
  • RNA from 1x 10 ⁇ m sections was purified with the Qiagen FFPE miRNeasy kit (Qiagen Denmark, cat. No. 217404) according to manufacturer's protocol. RNA concentration was determined by use of a Nanodrop 1000 instrument (Thermo Scientific, Wilmington, DE, USA) and following the instructions of the manufacturer.
  • v designates cytosine, guanine and adenine residues
  • n designates cytosine, guanine, adenine and thymine residues
  • the RT reaction was incubated at 42°C for 1 hour, 95°C for 5 minutes.
  • the Cp-values were calculated using the Lightcycler 480 software release 1.5.0, version 1.5.0.39 accompanying the Lightcycler 480 Il instrument.
  • miR-21 A short list of 6 miRs was identified (miR-21 , miR-34a, miR-126, miR-143, miR-141 , miR- 103).
  • miR-103 was selected as reference gene as it showed uniform expression levels across the colon cancer and normal tissue and appeared to be useable as a control reference gene for qPCR quantification.
  • FFPE Formalin Fixed Paraffin Embedded
  • the six miRs described above were run on the staging set using the method described above. Based on differential expression in colon cancer and normal colon tissue of the new sample-set miRs were further selected.
  • the accuracy of the test depends on how well the test separates the group being tested into those with and without cancer. Accuracy is measured by the area under an Receiver Operating Characteristic curve, ROC-curve. An Area Under receiver operating characteristic Curves (AUC) of 1 represents a perfect test; an area of .5 represents a worthless test. A rough guide for classifying the accuracy of a diagnostic test is the traditional academic point system:
  • Example 3 Identification of further microRNA signatures.
  • TPR true positive rate
  • FPR false positive rate
  • SPC specificity
  • ACC accuracy
  • miR-21 , miR-34a, miR-126 and miR-143 was assessed using the following protocol (finalized protocol, now called miRCURY LIMATM Universal RT microRNA PCR from Exiqon, Denmark: miRCURY LNATM Universal RT microRNA PCR
  • the assessment of miRs was basically performed as described in Example 2 but using the commercial Universal cDNA synthesis kit (Exiqon Vedbaek, cat no 203300). STEP 1 of the assay was performed using the Universal cDNA synthesis kit protocol. 10 ng of total RNA was used in a 10 ⁇ l RT reaction having the composition:
  • the RT reaction was incubated at 42°C for 1 hour, 95°C for 5 minutes. Then the RT reaction was diluted 5Ox with water prior to qPCR analysis - STEP 2.
  • the Cp-values were calculated using the Lightcycler 480 software release 1.5.0, version 1.5.0.39 accompanying the Lightcycler 480 Il instrument.
  • Lower case designates natural occuring nucleotides
  • upper case designates LNA
  • Upper case C designates 5-methylcytosine LNA. All miRs are human miR sequences found in miR registry database release 12.0 hosted by Sanger Institute, UK (miR 12_0).
  • the RANX05 samples colon samples were reanalyzed with respect to the 4 miRs: miR-21 , miR-34a, miR-126 and miR-143.
  • Sections of the RANX05 FFPE blocks were stained with H&E stain (hematoxylin and eosin stain) which is often used by pathologists to examine histology. The sections were carefully assessed for percent tumor area, that is the area of tumor tissue as a percentage of the area of the tissue. The percent tumor area was binned into 5 categories; 100-75%, 75-50%, 50- 10 25%, 25-10% and 10-0%.
  • H&E stain hematoxylin and eosin stain
  • Cp(miR34 ⁇ ) can be used to identify a sample as cancerous or not, provided that sample was characterized by an area of tumor tissue that is at least 50 percentage.
  • Example 5 The microRNA signature can discriminate normal from cancer in general.
  • the 4-miR signature developed in example 2 could differentiate cancer from non-cancer in general samples from various solid cancer-types were obtained the level of miR-21 , miR-34a, miR-126 and miR-143 determined and the miR based score
  • FFPE Formalin Fixed Paraffin Embedded
  • the levels of miR-21 , miR-143, miR-126 and miR-34a was measured using the miRCURY LNATM Universal RT microRNA PCR system as described in example 4. The score was calculated as previously described, as:
  • the miR-21 score is higher in all cancer specimens tested - compared to the normal tissue of the same origin. Even more surprising, the cutoff determined in the colon experiments (-2), seems to be able to differentiate most of the cancers from their normals. When taking into consideration that the best cutoff has to be determined for each tissue from a panel of normal samples from that particular tissue these results indicate a very wide use of the 4- mi R-sco re.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Pathology (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Genetics & Genomics (AREA)
  • Wood Science & Technology (AREA)
  • Physics & Mathematics (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Hospice & Palliative Care (AREA)
  • Biophysics (AREA)
  • Oncology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

A new method for discriminating between cancer and non-cancer samples is described. The method comprising detecting the level of at least one microRNA (miR) selected from Mir- Group I consisting of: miR-21, miR-34a and miR-141, and detecting the level of at least one miR selected from Mir-Group Il consisting of: miR-126, miR-143 and miR-145 in a test cell sample and, comparing the level of expression of said selected miRs in the test cell sample with the level of expression of the same selected miRs in a previously recorded test set.

Description

A METHOD FOR CLASSIFYING A HUMAN CELL SAMPLE AS CANCEROUS
FIELD OF THE INVENTION:
The present invention relates to the field of cancer-diagnostics. In particular the invention 5 relates to a microRNA expression signature that allows discriminating between cancer and non-cancer samples.
BACKGROUND OF THE INVENTION:
10 Colorectal cancer is the third most frequent cancer disease and the second most frequent cause of cancer related mortality in the Western World. There are approximately 400.000 new cases annually in these countries.
Solid cancers are normally diagnosed based on a histo-pathological evaluation of tissue 15 samples. It is of paramount importance to the correct treatment of patients that correct
discrimination between cancerous and non-cancerous cases are made.
However the correct determination, especially when only small samples of tissue are at hand, may be problematic. The problem is even more pronounced when the cell sample is a fine needle biopsy as such a biopsy often destroys the tumor histology. Thus, there is a constant 20 need for alternative or improved methods to discriminate between cancerous and noncancerous specimens.
MicroRNAs (miRs or mir's) are an abundant class of short endogenous RNAs that act as posttranscriptional regulators of gene expression by base-pairing with their target mRNAs. 25 miRs are 19-25 nucleotide (nt) RNAs that are processed from longer endogenous hairpin transcripts (Ambros et al. 2003, RNA 9: 277-279). To date more than 6000 miRs have been identified in humans, worms, fruit flies and plants according to the miR registry database release 1 1.0 in April 2008, hosted by Sanger Institute, UK.
30 miRs represent robust and stable biomarkers in formalin fixed paraffin embedded FFPE
material. There are several publications demonstrating that miRs are stable and regulated in cancer. A recent paper by Schetter et al (JAMA, 2008) describes the use of miR as prognostic biomarkers in colon cancer. Of particular interest is that the authors were able to separate non-cancerous specimens from cancerous specimens by the use of one miR (miR-
35 21 ). This separation was supported by multivariate analyses, showing that miR-21 expression was independent of disease stage. The traditional need for control samples result in limitations. In most hospitals it is a well established practice to take out samples of the epithelium from patients that have symptoms indicating cancer. In the case of colon cancer this is done by endoscopy. During this procedure the surgeon visually inspects the colon and typically takes out samples from sites that look "suspicious" for further pathological analysis. It is not common practice that the surgeon takes out samples from normal epithelium. Thus to be compatible with the current clinical practice a new diagnostic test should not rely on normal colon samples from the patient. Furthermore in the more advanced cancers it may not be an easy task to identify and obtain normal colon tissue.
In the case of other cancers, breast cancer e.g., surgeons often takes a needle aspiration biopsy, also known as fine needle aspiration biopsy to investigate superficial tumors. A fine needle aspiration biopsy is safer and less traumatic than an open surgical biopsy, however the histological structures of the tumor are prone to be perturbed during the sampling making the histo-pathological evaluation difficult
In order to offer a valuable supplement to present day histo-pathological examinations a new diagnostic test must prove to be simple, robust and have a high specificity and predictive accuracy.
The present invention describes a method which do not require normal test material from the patient is simple, robust and have high specificity and predictive accuracy. SUMMARY OF THE INVENTION:
Prior to the present invention, the present inventors believed that in order to obtain a meaningful and robust miR-signature based on quantitative PCR would require that the miR- level were normalised to a reference gene, typically a house-keeping gene, or an external standard and the miR-levels compared with the similar levels in a control tissue-sample (Bustin, 2004).
However, the present inventors, surprisingly, realized that traditional controls, i.e. a control tissue-sample and a reference gene, were not necessary to obtain a meaningful and robust miR-signature of cancer provided that the signature is based on the level of a few, but carefully selected miRs.
Thus, in a first aspect, the invention pertains to a method for classifying a test cell sample as cancerous, the method comprising:
a) detecting the level of at least one miR selected from miR-Group I consisting of: miR- 21 , miR-34a and miR-141 , and detecting the level of at least one miR selected from miR-Group Il consisting of: miR-126, miR-143 and miR-145 in a test cell sample and, b) calculating the level of the miR(s) selected from miR-Group I relative to the level of the miR(s) selected from miR-Group Il in the test cell sample, compare it with a predetermined cut-off value and therefrom determine if the test cell sample is cancerous;
said cut-off value being estimated by calculating the level of the same miR(s) that were selected from miR-Group I in step a) relative to the level of the same miR(s) selected from miR-Group Il in step a) in a dataset comprising data on miR levels comprising at least one normal cell sample.
In a second aspect, the invention pertains to a method for assessing recurrence of a specific type of cancer in a human subject who previous has been treated for said specific type of cancer, the method comprising: a) detecting the level of at least one miR selected from miR-Group I consisting of: miR- 21 , miR-34a and miR-141 , and detecting the level of at least one miR selected from miR-Group Il consisting of: miR-126, miR-143 and miR-145 in a test cell sample obtained from a site close to and a tissue similar to the site and tissue-type of said specific type of cancer of said human subject and,
b) calculating the level of the miR(s) selected from miR-Group I relative to the level of the miR(s) selected from miR-Group Il in the test cell sample, compare it with a predetermined cut-off value and there from determine if the human subject is likely to experience recurrence of said specific type of cancer cancer;
said cut-off value being estimated by calculating the level of the same miR(s) that were selected from miR-Group I in step a) relative to the level of the same miR(s) selected from miR-Group Il in step a) in a dataset comprising data on miR levels in cell samples from a collection of humans subjects who previously were treated for cancer and did not experienced recurrence of cancer as well as subjects who previously were treated for cancer and did experienced recurrence of cancer.
It is contemplated that a miR signature comprised of least one miR selected from miR-Group I consisting of: miR-21 , miR-34a and miR-141 , and at least one miR selected from miR-Group Il consisting of: miR-126, miR-143 and miR-145 in will identify stage Il colon cancer patients at high risk of recurrence.
A further important aspect of the invention is the provision of a kit of parts for classifying a test cell sample as cancerous or non-cancerous comprising:
a) the primers that are necessary for the detection of said of selected miRs according to any of the preceding claims.
b) with the exception of polymerase all reagents necessary to perform the detection, and c) instructions on how to perform the detection.
A further advantage of the herein described methods is that they can be run on formalin fixed paraffin embedded (FFPE) tissue sections.
DEFINITIONS
Prior to a discussion of the detailed embodiments of the invention is provided a definition of specific terms related to the main aspects and embodiments of the invention. All terms are defined in accordance with the skilled person's normal understanding of the terms. In addition, all contents of US patent Nos. 6,268,490 and 6,770,748 "Bicyclonucleoside and ligonucleotide analogue" (Imanishi), International patent publication No. WO/1999/014226 "Bi- and tri-cyclic nucleoside, nucleotide and oligonucleotide analogues" (Exiqon a/s) are incorporated herein by reference. The terms "cancer" and "cancerous " are used interchangeably herein to refer to cells which has acquired one or more mutations allowing them to reproduce in defiance of the normal restraints to the cells of the body and to invade and colonize territories normally reserved for other cells. Cancer cells exhibit relatively autonomous growth, characterized by a significant loss of control of cell proliferation. Cancer cells include pre-malignant (e.g., benign hyperplasic), malignant, metastatic, and non-metastatic cells. Cancer cells often forms tumors which frequently comprise cells which are not cancer cells. Tumors of the colon are characterized by a number of well described characteristics well-known to the skilled artesian.
By a "cancerous" (or "cancer") colon test cell sample is referred to a sample which contain cancer cells forming part a tumor or neoplasm or in a number allowing it to be classified as cancer (benign as well as malignant) by a skilled clinician.
The term "non-cancerous" or "non-cancer" which are used interchangeably herein to refer to a test cell sample which not is forming part a tumor or neoplasm or in a number allowing it to be classified as cancer by a skilled clinician.
The terms "miR" and "microRNA" refer to about 18-25 nt non-coding RNAs derived from endogenous genes. They are processed from longer (ca 75 nt) hairpin-like precursors termed pre-miRs. MicroRNAs assemble in complexes termed miRNPs and recognize their targets by antisense complementarity. If the microRNAs match 100% their target, i.e. the
complementarity is complete, the target mRNA is cleaved, and the miR acts like a siRNA. If the match is incomplete, i.e. the complementarity is partial, then the translation of the target mRNA is blocked. As used herein the terms "let-7a", "miR-103", "miR-106a", "miR-125b", "miR-126", "miR-127", "miR-142-3p", "miR-141 ", "miR-143", "miR-145", "miR-151", "miR-155", "miR-16", "miR-181 b", "miR-191 ", "miR-200a", "miR-200c", "miR-203", "miR-20a", "miR-21 ", "miR-212", "miR-25", "miR-26a", "miR-27a", "miR-29c", "miR-320", "miR-342", "miR-34a", "miR-492", "miR-498", "miR-708", "miR-9" or "miR-92a" refer to the human miR sequences found in miR registry database release 12.0 or later and hosted by Sanger Institute, UK as well as their animal equivalents.
By the term "miR-signature" is referred to a collection of miRs, the level, expression or amount of which is characteristic of the pathological status of a test sample.
As used herein the term "detecting the level of a miR" refer to the quantification of said miR. One way of quantification is described in the Examples. However, the miR may be quantified in a multitude of other ways e.g. by arrays, northern blots, dot blots, RN'ase protection assays, quantitative mass spectroscopy or various quantitative PCR-based methods such as the TaqMan assay or the UniRT assay used in the examples. The term "expression", as used herein, refers to the transcription and/or accumulation of RNA-molecules within a call.
In the present context the terms "level of expression of a miR" and "level of a miR" are used synonymously as a measure of the "amount of a specific miR" that is detected in the sample. The "amount of a specific miR" may be expressed in either absolute or relative measures and refers to values obtained by both quantitative, as well as qualitative methods. One particularly preferred measure of the "amount of a specific miR" is the Crossing point (Cp) value obtained by real-time RT-QPCR as described in the examples.
As used herein the term "the level of miR-21 , miR-34a, miR-126 and (miR-143 or miR-145)" refer to the level of miR-21 , miR-34a, miR-126 and miR-143, or, level of miR-21 , miR-34a, miR-126 and miR-145. The term "level" designates relative as well as absolute amounts of the miRs referred to.
"Q-PCR" quantitative polymerase chain reaction. Q-PCR is highly sensitive method for quantifying the amounts of specific DNA and RNA species in a test sample. A thorough treatise of the Q-PCR technique can be found in Bustin, S.A. (ed.) A-Z of quantitative PCR, IUL Biotechnology Series 5 (2004) 882 pages, which hereby is incorporated herein by reference.
" UniRT" is a novel Q-PCR method. The method is described in Example 1 and in Danish Patent Application PA 2009 00156.
"Sample" refers to a sample of cells, or tissue or fluid isolated from an organism or organisms, including but not limited to solid tissue samples and samples of in vitro cell culture constituents. As a specific cancer e.g. colon cancer is known to metastasise to various organs, for example to liver and lung, the "sample" may be any sample of blood, plasma, serum, spinal fluid, lymph fluid, synovial fluid, urine, tears, blood cells, organs, tumors, that contains cells or lysed cells and also to samples of in vitro cell cultures.
By a "process implying machine learning" is referred to a process which take advantage of computer algorithms that improve automatically through experience, in the art this process of improving the algorithms is often referred to as "training". Machine learning can be used to discover general rules in large data sets, machine learning can e.g. be used to extract clinical informative data from a dataset comprising miR expression in cancer and non-cancer samples. A general treatise of the concept of machine learning can be found in: Tom Mitchell, Machine Learning, McGraw Hill, 1997, which hereby is incorporated herein by reference. In the present context the "cut-off value" is a threshold-value above (or below) which, a value calculated to represent the level of a number of miRs indicate that a test cell sample is from a cancer. From the data presented in Figure 3 it is clear that cancerous and non-cancerous samples fall in two separate groups, and accordingly that cut-off values (or scores) that differentiate between cancer and non-cancer samples can be established and used to serve as predetermined cut-off values for new test samples to be evaluated.
By "LNA" or "LNA monomer" (e.g., an LNA nucleoside or LNA nucleotide) or an LNA oligomer (e.g., an oligonucleotide or nucleic acid) is meant a nucleoside or nucleotide analogue that includes at least one LNA monomer. LNA monomers as disclosed in PCT Publication WO 99/14226) are in general particularly desirable modified nucleic acids for incorporation into an oligonucleotide of the invention.
By "oligonucleotide," "oligomer," or "oligo" is meant a successive chain of monomers (e.g., glycosides of heterocyclic bases) connected via intemucleoside linkages.
By "LNA modified oligonucleotide" or "LNA substituted oligonucleotide" is meant a
oligonucleotide comprising at least one LNA monomer.
In the present context, the terms "nucleobase" covers naturally occurring nucleobases as well as non-naturally occurring nucleobases. Thus, "nucleobase" includes not only the known purine and pyrimidine heterocycles, but also heterocyclic analogues and tautomers thereof. Illustrative examples of nucleobases are adenine, guanine, thymine, cytosine, uracil, purine, xanthine, diaminopurine, and 5-methylcytosine.. In the nucleic acid sequences described herein LNA monomers are depicted in capitals (T, A, G) and DNA monomers in lower case (t, a, c, g). Modified LNA monomers include 5' methyl cytosine shown as "C".
Embodiments of the present invention are described below, by way of examples only. DETAILED DISCLOSURE OF THE INVENTION:
Central to the present invention is the observation that the level of expression of miR-21 and miR34a are up regulated in colon as well as other cancer tissues compared to normal tissue, whereas the levels of miR126 and miR143 (or miR145) are down-regulated in colon as well as other cancer tissues compared to normal tissue. I.e. that cancer samples are
characterised by an increased level of expression of miR-21 and miR-34a relative to the expression of miR-126 and (miR-143 or miR-145). A further central element in the invention is the use of a predetermined cut-off value to determine whether a test cell sample is drawn from a cancer or not. Surprisingly, the data presented in example 5 illustrates that a cut-off value established on test cell samples from one specific cancer type and corresponding normal cell test sample(s) obtained from the same type of tissue may be useful also for the assessment of test cell samples obtained from different types of tissues. I.e. a cut-off value established on colon samples appear informative also for the assessment of e.g. bladder and breast cancer samples (see figure 7).
In a preferred embodiment, the cut-off value is determined using a Receiver Operator Characteristic (ROC) curve, according to the method of Sackett et al., Clinical Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985, p. 106-7. Briefly, in this embodiment, the cut-off value may be determined from a plot of pairs of true positive rates (i.e., sensitivity) and false positive rates (100%-specificity) that correspond to each possible cut-off value for the diagnostic test result. The cut-off value on the plot that is the closest to the upper left-hand corner (i.e., the value that encloses the largest area) is the most accurate cut-off value, and a sample generating a signal that is higher than the cut-off value determined by this method may be considered positive.
Alternatively, the cut-off value may be shifted to the left along the plot, to minimize the false positive rate, or to the right, to minimize the false negative rate.
In general, a test cell sample generating a value of the level of the miR(s) selected from miR- Group I relative to the level of the miR(s) selected from miR-Group Il that is higher than the cut-off value determined by this method is considered positive for cancer.
One preferred embodiment the invention may be expressed stating that the ratio: (level of miR21) x (level of miR34a)
(level of miff 126) x (level of miff 143) or the ratio:
(level of miR2ϊ) x (level of miff34α)
(level of miR126) x (level of miff 145) is relatively large in cancer tissue versus in the corresponding normal tissue.
The level of miRs may conveniently be quantified by Quantitative real-time Reverse
Transcriptase mediated Polymerase Chain Reaction method, RT-QPCR (Bustin, S.A. (ed.) A- Z of quantitative PCR, IUL Biotechnology Series 5 (2004) 882 pages), see also Example 1 and 2. The read-out from a typical real-time QPCR instrument is often the so-called Cp (crossing point)-value. The Cp-value is related to the level of e.g. a specific miR, by the relation:
(liniar) expression level of miRx ~ 2~Cp(-mιRx^ Wherein Cp(miRx) designates the Cp-readout from real-time QPCR instrument specifically detecting one specific miR called miRx. Example 2 describes such an assay in details.
Accordingly, when the Cp-values are used as quantifiers of miR-levels, the expression:
(level of miR21) x (level of miff34α)
(level of miRllβ) x (level of miff 143)
is equivalent to:
+Cp(miffl26) + Cp(miR143) - Cp(miR21) - Cp(miR34ά) and likewise the expression:
(level of miR21) x (level of miR34ά)
(level of miRllβ) x (level of miRIAS)
is equivalent to:
+Cp(miR126) + Cp(miRlAS) - Cp(miR21) - Cp(miR34a) Realizing that both colon as well as other cancer samples are characterised by an increased level of expression of miR-21 and miR-34a relative to the expression of miR-126 and (miR- 143 or miR-145) make it possible to formulate a wide range of estimators with accompanying cut-off values that can be used to characterise a test sample as cancerous or non-cancerous.
One example of such an estimator is the simple expression:
+Cp(miR126) + Cp(miR143) - Cp(miR21) - Cp(miR34a)
As can be seen in Example 2-5, using this simple estimator and a cut-off value of
approximately -1.3 to -2 allows discriminating between normal and cancer samples in particularly between normal colon and colon cancer samples. As indicated by the Area Under receiver operating characteristic Curves (AUC)-values of 0,998 and 0.996 the accuracy of the test is almost perfect (for further explanation see example 2). We are not aware of any other miR-based diagnostic test for any cancer that approaches such high accuracy.
Another example of a useable estimator is a linear regression model, such as:
X x Cp(miR126) + Y x Cp(miR143) + Z x Cp(miR21) + W x Cp(πιiR34a) where the coefficients X, Y, Z and W are determined by the regression-analysis according to the particular set-up.
Logistic regression is a widely used method for generating linear models to classify data. Performing a logistic regression analysis on the data of Example 2 results in a best fit regression model:
-109,3 x Cp(miR126) - 98,1378 x Cp(miR143) + 295,0965 x Cp(miR21) - 75,747 x Cp(miR34a) wherein, a value over -817,297 is indicative of a non-cancer colon test sample, and a value less that -817,297 is indicative of a colon cancer test sample. It is an important feature of the present invention that once the test has been implemented in the actual test laboratory that are to perform the tests the need for additional control colon cell samples diminishes as illustrated by the data in example 5. To validate the particular experimental set up it is an advantage that the previously obtained dataset comprise data from at least one cancer and as least one non-cancer cell sample. However, a previously obtained dataset comprising data from at least 10 cancer cell samples 5 and at least 10 corresponding non-cancer cell samples is preferred. A dataset of that size furthermore posses the advantage that a cut-off value that characterize non-cancer samples from cancer samples can be calculated by methods well know in the art.
As can be seen from the examples comparing the levels of miR-21 and miR-34a, to the levels 10 of miR-126 and miR-143 are particularly informative (i.e. stable and accurate), indicated by the AUC values of 0.998 and 0.996, and an accuracy of 97,3 % (see table 2), see also example 4.
The data depicted in Figure 4 shows that there is an almost perfect 1 :1 correlation between 15 the level of miR-143 and miR-145. This is indeed to be expected as the genes for miR-143 and miR-145 are localized very close to each other in the genome, and are probably expressed from the same transcript.
Accordingly, comparing the levels of miR-21 and miR-34a, to the levels of miR-126 and miR- 20 145 are similar informative to comparing the levels of miR-21 and miR-34a, to the levels of miR-126 and miR-143. Our data indicate that miR-143 and miR-145 can be used
interchangeably.
The data in figure 3 also show that the level of miR-21 comparing the levels of miR-143 (or 25 miR-145) is remarkably informative (i.e. both stable and accurate), indicated by the AUC values of 0.991 and 0.992 and an accuracy of 94,6%. Comparing the level of miR-21 to the levels of miR-126 is almost as good having AUC values of 1 ,000 and 0,970 and an accuracy of 91 ,9%.
30 Further comparing the levels of miR-34a and miR-141 to the levels of miR-126 and miR-143 (miR-145) are also both stable and accurate as indicated by the AUC values of 0.977 and 0.968.
Also comparing the levels of miR-21 and miR-141 to the levels of miR-126 and miR-143 (or 35 miR-145) are surprisingly accurate AUC = 0.987, see Example 3. It is particularly noteworthy that the preferred signature: miR-21 and miR-34a, relative to miR- 126 and miR-143 not is improved by the addition yet a miR, in casu miR-141.
In a further embodiment of the invention the method for detecting cancer in a tissue sample according further comprises: a) obtaining a biological sample from a patient; b) determining the probability of the test cancer cell sample being cancerous by a process implying machine learning (such as decision tree, principal component analysis, neural networks, or Bayesian networks) analysis wherein the applied program has been trained on a dataset comprised of expression data of the miRs selected from Mir-Group I and the miRs selected from Mir-Group Il in cancer as well as in corresponding normal cell samples. In addition to logistic regression, it is contemplated that various techniques in the art referred to as: principal component analysis, nearest neighbour analysis, neural network and
Gaussian mixture model (GMM), Genetic algorithms (1950's), Decision trees(1960's) and Support vector machines can be used to produce estimators useable to separate cancer test samples from non-cancerous samples based on the level of miR-21 , miR-34a, miR-126 and (miR-143 or miR-145).
As illustrated in Example 2-5 the calculation of even a simple score: Cp(miR-126) + Cp(miR- 143) - Cp (miR-21 ) - Cp(miR-34a) allow a cut-off value to be estimated and used to classify a test cell sample as cancer or non-cancer.
Cut-off values from -1.3 to -2 have been obtained, however other values at such as in the range from -0.5 to -4, such as within the range of -1 to -3, e.g. within the range of -1.1 to -2.5, such as within the range of -1.2 to -2.3, including the range of -1.5 to -2.2 and the range of - 1.7 to -2.1 may be obtained depending on the precise experimental setup and the type of tissues analysed.
Using the technique of linear regression X*Cp(miR126) + Y*Cp(miR-143) + Z*Cp(miR-21 ) + W*Cp(miR-34a) wherein X, Y, Z and W are coefficients determined by linear regression. Whereas the miRs may be quantified in a multitude of ways a preferred method of quantification is one wherein the the levels of the miRs are determined by Q-PCR. A quantification based on the UniRT-method described in Example 1 is particularly preferred. Today the standard method for histological tissue preparation in pathology is to prepare Formalin-Fixed Paraffin-Embedded (FFPE) samples. Every year millions of FFPE samples are classified and archived by pathologists. Consequently, it a significant advantage of the herein described methods that they apply to cell samples which are formalin fixed paraffin embedded (FFPE) tissue sections.
A further aspect of the invention is a kit of parts for classifying a test cell sample as cancerous or non-cancerous comprising:
a) the primers that are necessary for the detection of said of selected miRs according to any of the preceding claims.
b) with the exception of polymerase all reagents necessary to perform the detection, and c) instructions on how to perform the detection.
It is preferable if the reagents kit contains additional means to perform the quantification by the UniRT method of example 1 such a kit would conveniently comprise a selection of UniRT- primers, according to Table 1 , that are required to detect the necessary miRs, e.g. the primers required to detect miR-21 , miR-34a, miR-126 and miR-143 by the UniRT method.
The invention is further illustrated in the following non-limiting examples and the figures wherein
LEGENDS
FIGURE 1 : Show the steps involved in microRNA specific qRT-PCR by the UniRT-protocol. STEP 1 is a one-tube-reaction for all microRNAs present in a sample. STEP 2 is a microRNA specific qPCR using forward and reverse primer pairs for a specific microRNA. An oval indicate insertion of LNAs in forward and reverse primers. NV designates a 3'-end VN- degenerate sequence motif comprising the two 3'-end terminal nucleotide residues, wherein V is either an adenine residue, a guanine residue, or a cytosine residue and N is either an adenine residue, a guanine residue, a cytosine residue or a thymine residue. FIGURE 2: Show the relative expression of 5 selected miRs in Colon cancer and normal (NAT) FFPE tissue. Shown are ΔCp values of all 5 miR relative to miR-103 (reference). miR- 141 , miR-21 and miR-34a are upregulated in cancer samples compared to normal, whereas miR-126 and miR-143 are downregulated in cancer relative to normal tissue. Boxes indicate the distribution of expression values from the 25th to the 75th percentile for each miR.
Vertical lines indicate values from the 5th to the 95th percentile. The value on Y-axis is the miR score.
FIGURE 3: Show scatterplots of the distribution of 3 different the miR based scores obtained. Data from two separate rounds of experiments performed on the same clinical samples are shown. On each panel the "spots" to the left are the scores obtained in normal colon tissue (normal adjacent tissue, NAT), the "spots" to the left are the scores obtained in colon cancer tissue. Panel A, "miR-126 vs miR-21 " show the score calculated as Cp(miR-126)-Cp(miR- 21 ), B: "miR-21 vs miR-143" show the score calculated as Cp(miR-143)-Cp(miR-21 ), and C "miR-21 , miR-34, miR-143 and miR-126" refer to the score calculated as Cp(miR-
126)+Cp(miR-143)-(miR-21 )-Cp(miR-34a). The numbers under the abscissa (x-axis) are the Area Under receiver operating characteristic Curves (AUC) for each set of data. Values on ordinates are the scores referred to above. Each "spot" in the panels refer to the score of one tissue sample. The value on Y-axis is the miR score.
FIGURE 4: Shows the relative level of miR-143 plotted against the relative level of miR-143 is an almost perfect 1 :1 correlation. The relative level of miR-143, ΔCp(miR-143), is calculated based on miR-103. Also the relative level of miR-145, ΔCp(miR-145), is calculated based on miR-103.
FIGURE 5: Shows reciever operator curves (ROC) of specificity versus sensitivity of miRs indicated. Two separate rounds of measuring miR's and evaluating data were performed on the clinical material consisting of an equal numbers of cancer and non-cancer samples. The curves to the left are from round 1 , those to the right are from round 2.
FIGURE 6: Show scatterplots of the distribution of the miR based score
+Cp(miR126)+Cp(miR143)-Cp(miR21 )-Cp(miR34a) described in example 4. Data from 1 ) reevaluation of the 74 Proteogenex colon cancer and normal samples used in example 2 (discovery set), 2) an independent validation set of 40 Protegenex colon cancer and normal samples and 3) reevaluation of the RanxO5 samples used in example 1 now with percentage tumor information. The value on Y-axis is the miR based score. FIGURE 7: miR based score +Cp(miR126)+Cp(miR143)-Cp(miR21 )-Cp(miR34a) obtained on cancer specimens of different origin and compared with the score obtained in normal adjacent tissue (NAT) of the same origin. The tissues are from left to right: bladder tumor, bladder NAT, breast tumor, breast NAT, gallbladder tumor, gallbladder NAT, kidney tumor, kidney NAT, lung tumor, lung NAT, pancreas tumor, pancreas NAT, small intestine tumor, small intestine NAT and uterus tumor, uterus NAT. The value on Y-axis is the miR based score. Example 1 : The UniRT method
In this example the UniRT method for amplification and quantification of small non-coding RNA molecules by use of quantitative reverse transcription polymerase chain reaction (qRT- PCR) technology is described in brief. In brief, see figure 1 , the UniRT protocol is a two step protocol. In STEP 1 the miRs present in a sample are firstly poly-A-tailed using a poly(A) polymerase, which adds adenine residues to the 3'-end of RNA molecules. Secondly, an extension primer, which has a poly-T-core nucleotide sequence, a 3'-end VN-degenerate motif and a 5'-end tail, is annealed to the poly- A-tailed miRs through hybridization with the VN-poly-T-sequence of the extension primer. Subsequently, the extension primer is extended in a reverse transcription reaction using the miR as template. The resulting primary extension product is composed of the extension primer and the newly synthesized cDNA, which complementary to the miRs in the sample.
In the next step, STEP 2, a miR-specific PCR is carried out. A miR-specific forward primer is annealed to 3'-end of the newly synthesized cDNA and the upper-strand synthesis is carried out by extending the forward primer in a DNA-polymerization reaction using the primary extension product as template. A miR-specific reverse primer composed of a miR-specific 3'- end sequence, a poly-T-stretch and a 5'-end tail is then hybridized to the upper-strand and the lower-strand is synthesized by extension of the reverse primer.
In both STEP 1 and STEP 2 the LNA's help to ensure a specific and efficient annealing of the primers to their respective targets. Example 2: Identification of microRNA signatures.
The identification of the miRs that constitute a meaningful and robust miR-signature that differentiate cancerous from non-cancerous Formalin Fixed Paraffin Embedded (FFPE) Colon Cancers tissues were carried out in two separate stages.
5
Stage I. A purely exploratory stage.
In this stage Formalin Fixed Paraffin Embedded (FFPE) Colon Cancers tissues (133 stage Il colon cancer specimens) from the RANX05 set of the The Danish RANX05 Colorectal Cancer 10 Study Group (Werther K, Christensen IJ, Brϋnner N and Nielsen HJ (2000) Eur J Surg Oncol.
26:657-62.) and FFPE-samples from Colon normal tissues from ProteoGenex Inc., Culver City, CA, USA were evaluated.
The level of miR-498, miR-191 , miR-21 , miR-103, miR-320, miR-203, miR-106a, let-7a, miR- 15 181 b, miR-20a, miR-26a, miR-151 , miR-127, miR-141 , miR-143, miR-145, miR-342, miR- 34a, miR-200c, miR-125b, miR-126, miR-212, miR-492, miR-142-5p, miR-92a, miR-9, miR- 16, miR-25, miR-27a, miR-29c, miR-200a, miR-708 and miR-155 were quantified using the UniRT qPCR method (see, example 1 ). In brief,
20 RNA Purification:
RNA from 1x 10 μm sections was purified with the Qiagen FFPE miRNeasy kit (Qiagen Denmark, cat. No. 217404) according to manufacturer's protocol. RNA concentration was determined by use of a Nanodrop 1000 instrument (Thermo Scientific, Wilmington, DE, USA) and following the instructions of the manufacturer.
25
UniRT
In STEP 1 of the UniRT protocol 10 ng of total RNA was used in a 10 μl RT reaction having the composition:
1x PoIy(A) polymerase buffer (E CoIi PoIy A Polymerase, NEB M0276L, New England 30 Biolabs, Ipswich MA).
0.1 mM ATP
0, 1 μM RT-primer (L2TA: 5'-ggtactagtttttttttttttttvn (SEQ lD NO. I)), or
(v designates cytosine, guanine and adenine residues, n designates cytosine, guanine, adenine and thymine residues).
35 0.1 mM dATP, 0.1 mM dCTP, 0.1 mM dGTP, 0.1 mM dTTP
0.25 unit of Transcriptor (Roche Diagnostics GmbH, Mannheim, Germany) 2 unit of PoIy(A) polymerase (E CoIi Poly A Polymerase, NEB M0276L, New England Biolabs, Ipswich MA)
The RT reaction was incubated at 42°C for 1 hour, 95°C for 5 minutes.
Then the RT reaction was diluted 5Ox in water prior to qPCR analysis - STEP 2.
In STEP 1 of the UniRT protocol 1 μl of the diluted RT reaction was mixed with the PCR primer sets of table 1 (final concentration of each primer is 0.125 μM) for each miR and Fast start SYBR Green Master mix (Roche Diagnostics GmbH, Mannheim, Germany) according to protocol by the providers.
All qPCRs were run in duplicate and in 10 μl volume. The qPCR reactions were run in 384 well plates in a Roche Lightcycler 480 Il (Roche Diagnostics GmbH, Mannheim, Germany). All real-time PCR data were analyzed using the Cp(Crossing point) method calculating the relative expression ratios of the specific target miRs as the crossing point difference (ΔCp) of the specific miRs relative to a reference gene as described by Bustin (2004).
The Cp-values were calculated using the Lightcycler 480 software release 1.5.0, version 1.5.0.39 accompanying the Lightcycler 480 Il instrument.
TABLE 1
Figure imgf000019_0001
Figure imgf000020_0001
Lower case designates natural occuring nucleotides, upper case designates LNA. Upper case C designates 5-methylcytosine LNA. All miRs are human miR sequences found in miR registry database release 12.0 hosted by Sanger Institute, UK (miR 12_0). RESULT
A short list of 6 miRs was identified (miR-21 , miR-34a, miR-126, miR-143, miR-141 , miR- 103).
These were 5 miRs that showed differential expression between colon cancer and normal colon tissue. miR-103 was selected as reference gene as it showed uniform expression levels across the colon cancer and normal tissue and appeared to be useable as a control reference gene for qPCR quantification.
Relative expression ratios of the 5 remaining miRs (miR-21 , miR-34a, miR-126, miR-143, miR-141 ) expressed at the crossing point difference (-ΔCp) of the specific miRs relative to miR-103 ( -ACp(miRx) = Cp(miRx) - Cp(miR103) ) are showed in Figure 2.
Stage II. Discovery of RUO signature.
A completely independent set of Formalin Fixed Paraffin Embedded (FFPE) Colon Cancers tissues samples was obtained. The sample set was a staging set purchased from Proteogenex, Ca. USA. (74 samples in total including all stage I-IV and matched NATs (normal adjacent tissue).
The six miRs described above were run on the staging set using the method described above. Based on differential expression in colon cancer and normal colon tissue of the new sample-set miRs were further selected.
In order to find the miRs with a potential to distinguish between normal and cancer tissue, a simplified version of linear regression was used.
While performing these analysis we surprisingly realized that a better differentiation between cancerous and non-cancerous samples were obtained when the level of different miRs were normalized to other miR's instead to a traditional control gene. To our surprise we also observed that it was not necessary to relate the miR-level to the level in normal control tissue. Accordingly, this possibility was explored.
For each pair of miRs the difference in Cp values were calculated for all samples and a t-test between the groups and used to determine the miR pair's having the most promising potential as a biomarker.
Two separate rounds of measuring miR's and evaluating data were performed on the clinical material consisting of an equal numbers of cancer and non-cancer samples (approximately 40 of each, depending on the round). RESULTS
The results are presented in figure 3.
Receiver Operating Characteristic curves, ROC-curves, From 1st and 2nd round are presented in figure 5.
From the data presented in Figure 3 it is clear that scores of cancerous and non-cancerous samples fall in two separate groups. Using the calculated scores as a test score cut-off values can be determined on the scatter plots shown. In case of the data presented in Figure 3, C and 5, C the cut-off value were calculated to - 1 ,3 and used as the predetermined cut-off value to determine if the test colon cell sample is cancerous or not; l.e a value of: +Cp(miR126) + Cp(miR143) - Cp(miR21) - Cp(miR34a) < -1,3 is indicative of a non- cancereous sample, whereas a value of:
+Cp(miR126) + Cp(miR143) - Cp(miR21) - Cp(miR34a)≥ -1,3 is indicative of a cancereous sample.
The accuracy of the test depends on how well the test separates the group being tested into those with and without cancer. Accuracy is measured by the area under an Receiver Operating Characteristic curve, ROC-curve. An Area Under receiver operating characteristic Curves (AUC) of 1 represents a perfect test; an area of .5 represents a worthless test. A rough guide for classifying the accuracy of a diagnostic test is the traditional academic point system:
0.90-1 = excellent
.80-.9O = good
.70-.8O = fair
.60-.7O = poor
.50-.6O = fail
(ref.: Tape, Thomas G. : "Interpreting Diagnostic Tests", http://gim.unmc.edu
/dxtests/Default.htm)
As indicated by the Area Under receiver operating characteristic Curves, the AUC-values, excellent separation could be obtained with miR-21 and miR-126. However almost perfect separation (AUC=O, 998 and 0,996) was found when the four miR signature was made.
In conclusion
The unexpected result of this experiment is that direct, non-normalized values of the miR expression levels of only few, but carefully selected miRs even in a single FFPE tumor section of 10um without the use of normal tissue or fresh frozen tissue as controls can be used to produce an almost perfect distinction between cancer and non-cancer colon tissue.
Example 3: Identification of further microRNA signatures.
Closely following the experimental setup described in Example 1 and 2 a number of values were calculated for two separated runs on rounds of miR quantification performed on the same set of test samples.
Receiver Operating Characteristic curves, ROC-curves, From 1st and 2nd round are presented in figure 5.
Detailed data are presented in table 2 below.
Table 2
Figure imgf000024_0001
TPR=true positive rate, FPR=false positive rate, SPC=specificity, ACC=accuracy,
PPV=positive predictive value, NPV=negative predictive value, FDR=False discovery rate Example 4: Robustness of microRNA signature.
To determine the cutoff value with new reagents, a new set of experiments were performed. Following the new experimental setup the level of miR21 , miR34a, miR126 and miR143 were determined as described in example 1 and 2, and the score:
+Cp(miR126) + Cp(miR143) - Cp(miR21) - Cp(miR34a) for each sample calculated.
The levels of miR-21 , miR-34a, miR-126 and miR-143 was assessed using the following protocol (finalized protocol, now called miRCURY LIMA™ Universal RT microRNA PCR from Exiqon, Denmark: miRCURY LNA™ Universal RT microRNA PCR
The assessment of miRs was basically performed as described in Example 2 but using the commercial Universal cDNA synthesis kit (Exiqon Vedbaek, cat no 203300). STEP 1 of the assay was performed using the Universal cDNA synthesis kit protocol. 10 ng of total RNA was used in a 10 μl RT reaction having the composition:
1x Reaction buffer, Exiqon cat no 203300-04.
1 μl of RT enzymes, Exiqon cat no 203300-03
Water
RNA (10 ng)
The RT reaction was incubated at 42°C for 1 hour, 95°C for 5 minutes. Then the RT reaction was diluted 5Ox with water prior to qPCR analysis - STEP 2.
In STEP 1 of the UniRT protocol 1 μl of the diluted RT reaction was mixed with the PCR primer sets of table 3 (final concentration of each primer is 0.3 μM) for each miR and Fast start SYBR Green Master mix (Roche Diagnostics GmbH, Mannheim, Germany) according to protocol by the providers.
All qPCRs were run in 10 μl volume. The qPCR reactions were run in 384 well plates in a Roche Lightcycler 480 Il (Roche Diagnostics GmbH, Mannheim, Germany).
All real-time PCR data were analyzed using the Cp(Crossing point, 2nd derivative method) method .
The Cp-values were calculated using the Lightcycler 480 software release 1.5.0, version 1.5.0.39 accompanying the Lightcycler 480 Il instrument.
TABLE 3
Figure imgf000026_0001
Lower case designates natural occuring nucleotides, upper case designates LNA. Upper case C designates 5-methylcytosine LNA. All miRs are human miR sequences found in miR registry database release 12.0 hosted by Sanger Institute, UK (miR 12_0).
Cutoff determination
The 74 Proteogenex colon normal and colon cancer samples used in example 2 were evaluated and the best cutoff determined to be -2 (see figure 6) using the score determined as in example 1 and 2;
In case of the data from the training set presented in Figure 6, left 2 scatter-plot panels, the cut-off value were calculated to - 2. l.e a value of: +Cp(miR126) + Cp(miR143) - Cp(miR21) - Cp(miR34a) < -2 is indicative of a non- cancereous sample, whereas a value of:
+Cp(miR126) + Cp(miR143) - Cp(miR21) - Cp(miR34a)≥ -2 is indicative of a cancereous sample.
Validation set
An additional independent set of 40 colon normal and colon (purchased from Proteogenex, Ca. USA.) were also analysed and showed perfect separation using the determined cutoff of - 2 (see figure 6) on the score determined in example 1 and 2.
+Cp(miR126) + Cp(miR143) - Cp(miR21) - Cp(miR34ά) Reexamination of the RANX05 sample set of samples incorporating percentage tumor information
Furthermore, the RANX05 samples colon samples were reanalyzed with respect to the 4 miRs: miR-21 , miR-34a, miR-126 and miR-143.
5
Sections of the RANX05 FFPE blocks were stained with H&E stain (hematoxylin and eosin stain) which is often used by pathologists to examine histology. The sections were carefully assessed for percent tumor area, that is the area of tumor tissue as a percentage of the area of the tissue. The percent tumor area was binned into 5 categories; 100-75%, 75-50%, 50- 10 25%, 25-10% and 10-0%.
By showing the score;
+Cp(miR126) + Cp(miR143) - Cp(miR21) - Cp(miR34ά) for each of the percentage tumor groups (see figure 6), it is clear that the score and cutoff is 15 able to separate colon tumor and colon normal with high accuracy with more than 50% tumor, whereas the accuracy is only mildly decreased on >25% tumor. Even most of the <10 % tumor samples are classified as tumor.
Result
20 This experiment show that the score: +Cp(miR126) + Cp(miR143) - Cp(miR21) -
Cp(miR34ά) can be used to identify a sample as cancerous or not, provided that sample was characterized by an area of tumor tissue that is at least 50 percentage.
Surprisingly, the majority of samples were correctly identified as cancerous when the score 25 were calculated and the predetermined cut-off value used to determine their cancer-status, even when the percentage of tumor cells in the sample was as low as 25%.
This demonstrate that score based on the 4 miR-signature is unexpectedly robust.
30 Example 5: The microRNA signature can discriminate normal from cancer in general.
To investigate if the 4-miR signature developed in example 2 could differentiate cancer from non-cancer in general samples from various solid cancer-types were obtained the level of miR-21 , miR-34a, miR-126 and miR-143 determined and the the miR based score
+Cp(miR126)+Cp(miR143)-Cp(miR21 )-Cp(miR34a) calculated
RNA was purified from 16 samples were Formalin Fixed Paraffin Embedded (FFPE) blocks from different human tissues using the Roche High Pure miR Isolation Kit. All the FFPE blocks were from NDRI (National Development and Research Institutes, Inc. New york ).The specimens were: Bladder and Bladder cancer, Breast and breast cancer, gallbladder cancer (50%) and gallbladder cancer (100%), kidney and kidney cancer, lung and lung cancer, pancreas and pancreas cancer, small intestine and small intestine cancer, stomach and stomach cancer and uterus and uterus cancer (Patient IDS: 28478, 25017, 19560, 20876, 26169, 26545, 23220, 21521 ).
The levels of miR-21 , miR-143, miR-126 and miR-34a was measured using the miRCURY LNA™ Universal RT microRNA PCR system as described in example 4. The score was calculated as previously described, as:
+Cp(miR126) + Cp(miR143) - Cp(miR21) - Cp(miR34a)
Result
Surprisingly, the miR-21 score is higher in all cancer specimens tested - compared to the normal tissue of the same origin. Even more surprising, the cutoff determined in the colon experiments (-2), seems to be able to differentiate most of the cancers from their normals. When taking into consideration that the best cutoff has to be determined for each tissue from a panel of normal samples from that particular tissue these results indicate a very wide use of the 4- mi R-sco re.
REFERENCES:
1. Ambros, V. et al. (2003), A uniform system for microRNA annotation. RNA 9: 277-279. 2. Bustin, S.A. (ed.) A-Z of quantitative PCR, IUL Biotechnology Series 5 (2004) 882 pages .
3. Mitchell, Tom. Machine Learning, McGraw Hill, 1997.
4. Schetter, Aaron J. et al. (2008) MicroRNA Expression Profiles Associated With Prognosis and Therapeutic Outcome in Colon Adenocarcinoma. JAMA. 299(4):425-436.
5. Tape, Thomas G.: "Interpreting Diagnostic Tests", http://gim.unmc.edu/dxtests/Default.htm 6. Sackett et al., Clinical Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985,
7. Werther K, Christensen IJ, Brϋnner N and Nielsen HJ (2000) Soluble vascular endothelial growth factor levels in patients with primary colorectal carcinoma. The Danish RANX05
Colorectal Cancer Study Group. Eur J Surg Oncol. 26:657-62.

Claims

1. A method for classifying a test cell sample as cancerous, the method comprising:
a) detecting the level of at least one miR selected from miR-Group I consisting of: miR- 21 , miR-34a and miR-141 , and detecting the level of at least one miR selected from miR-Group Il consisting of: miR-126, miR-143 and miR-145 in a test cell sample and, b) calculating the level of the miR(s) selected from miR-Group I relative to the level of the miR(s) selected from miR-Group Il in the test cell sample, compare it with a predetermined cut-off value and therefrom determine if the test cell sample is cancerous;
said cut-off value being estimated by calculating the level of the same miR(s) that were selected from miR-Group I in step a) relative to the level of the same miR(s) selected from miR-Group Il in step a) in a dataset comprising data on miR levels comprising at least one normal cell sample.
2. The method according to claim 1 , wherein the test cell sample is drawn from the group of tissues consisting of tumors of epithelial origin,
3. The method according to claim 1 , wherein the test cell sample is drawn from the group of tissues consisting of solid tumors.
4. The method according to claim 1 , wherein the test cell sample is drawn from the group of tissues consisting of bladder, breast, gallbladder, kidney, lung, pancreas, colon, small intestine and uterus.
5. The method according to any of claims 1 - 4, wherein the previously obtained dataset comprise data from at least one cancer and as least one non-cancer cell sample.
6. The method according to any of claims 1 - 5, wherein the test cell sample and the normal cell sample are obtained from the same type of tissue.
7. The method according to any of claims 1 - 6, wherein the previously obtained dataset comprise data from at least 10 cancer cell samples and at least 10 non-cancer cell samples.
8. The method according to any of claims 1 - 7, wherein the miRs selected from Mir-Group I are miR-21 and miR-34a, and the miRs selected from Mir-Group Il are miR-126 and miR-143 or miR-126 and miR-145.
9. The method according to any of claims 1 - 7, wherein the miR selected from Mir-Group I is miR-21 , and the miR selected from Mir-Group Il is miR-143 or miR-145.
5 10. The method according to claims 1 - 7, wherein the miRs selected from Mir-Group I are miR-21 and miR-141 , and the miRs selected from Mir-Group Il are miR-126 and miR-143 or miR-126 and miR-145.
11. The method according to claims 1 - 7, wherein the miR selected from Mir-Group I is miR- 10 21 , and the miR selected from Mir-Group Il is miR-126.
12. The method according to claims 1 - 7, wherein the miRs selected from Mir-Group I are miR-34a and miR-141 , and the miRs selected from Mir-Group Il are miR-126 and miR-143 or miR-126 and miR-145.
15
13. The method for detecting cancer in a tissue sample according to claims 1 - 12 which further comprises: a) obtaining a biological sample from a patient;
20
b) determining the probability of the test cancer cell sample being cancerous by a process implying machine learning (such as decision tree, principal component analysis, neural networks, or Bayesian networks) analysis wherein the applied program has been trained on a dataset comprised of expression data of the miRs
25 selected from Mir-Group I and the miRs selected from Mir-Group Il in cancer as well as in normal cell samples.
14. The method of claims 1 - 7, wherein the score: Cp(miR-126) + Cp(miR-143) - Cp (miR- 21 ) - Cp(miR-34a) is calculated, a cut-off value estimated and used to classify a test cell
30 sample as cancer or non-cancer.
15. The method of claims claims 1 - 7, wherein the score: (miR-126) + Cp(miR-145) - Cp (miR-21 ) - Cp(miR-34a) is calculated, a cut-off value estimated and used to classify a test cell sample as cancer or non-cancer.
35
16. The method of claims 1 - 7, which imply the calculation of the score: X*Cp(miR126) + Y*Cp(miR-143) + Z*Cp(miR-21 ) + W*Cp(miR-34a) wherein X, Y, Z and W are coefficients determined by linear regression.
17. The method of claims 1 - 7, which imply the calculation of the score: X*Cp(miR126) + Y*Cp(miR-145) + Z*Cp(miR-21 ) + W*Cp(miR-34a) wherein X, Y, Z and W are coefficients determined by linear regression.
18. The method of claim 14 or 15, wherein the cut-off value is from -1.3 to -2.
19. The method of any of the preceding claims, wherein the levels of the miRs are determined by Q-PCR.
20. The method of claim 19, wherein the Q-PCR method applied is the UniRT-method according to Example 1.
21. A method for assessing recurrence of a specific type of cancer in a human subject who previous has been treated for said specific type of cancer, the method comprising: a) detecting the level of at least one miR selected from miR-Group I consisting of: miR- 21 , miR-34a and miR-141 , and detecting the level of at least one miR selected from miR-Group Il consisting of: miR-126, miR-143 and miR-145 in a test cell sample obtained from a site close to and a tissue similar to the site and tissue-type of said specific type of cancer of said human subject and,
b) calculating the level of the miR(s) selected from miR-Group I relative to the level of the miR(s) selected from miR-Group Il in the test cell sample, compare it with a predetermined cut-off value and there from determine if the human subject is likely to experience recurrence of said specific type of cancer;
said cut-off value being estimated by calculating the level of the same miR(s) that were selected from miR-Group I in step a) relative to the level of the same miR(s) selected from miR-Group Il in step a) in a dataset comprising data on miR levels in cell samples from a collection of humans subjects who previously were treated for cancer and did not experienced recurrence of cancer as well as subjects who previously were treated for cancer and did experienced recurrence of cancer.
22. The method of claim 21 , wherein said cut-off value is estimated by calculating the level of the same miR(s) that were selected from miR-Group I in step a) relative to the level of the same miR(s) selected from miR-Group Il in step a) in a dataset comprising data on miR levels in cell samples obtained from a site close to and a tissue similar to the site and tissue-type of 5 said specific type of cancer from a collection of humans subjects who previously were treated for said specific type of cancer and did not experienced recurrence of said specific type of cancer as well as subjects who previously were treated for said specific type of cancer and did experienced recurrence of said specific type of cancer.
10 23. The method of any of the preceding claims, wherein the cell sample is formalin fixed paraffin embedded (FFPE) tissue sections.
24. The method of any of the preceding claims, wherein the cell sample is a fine needle biopsy.
15
25. A kit of parts for classifying a test cell sample as cancerous or non-cancerous comprising: a) the primers that are necessary for the detection of said of selected miRs according to any of the preceding claims.
b) with the exception of polymerase all reagents necessary to perform the detection, and 20 c) instructions on how to perform the detection.
26. The kit of parts according to claim 25, wherein the the primers that are necessary for the detection of said of selected miRs are selected form the group of primers in Table 1 ,
25 27. A kit according to claims 25 or 26, wherein the detection of miRs are performed by the the UniRT method of example 1 , and the kit comprise a selection of UniRT-primers according to Table 1 .
PCT/DK2010/050191 2009-07-28 2010-07-16 A method for classifying a human cell sample as cancerous WO2011012136A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US22906509P 2009-07-28 2009-07-28
US61/229,065 2009-07-28
US24511809P 2009-09-23 2009-09-23
US61/245,118 2009-09-23

Publications (1)

Publication Number Publication Date
WO2011012136A1 true WO2011012136A1 (en) 2011-02-03

Family

ID=42963074

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/DK2010/050191 WO2011012136A1 (en) 2009-07-28 2010-07-16 A method for classifying a human cell sample as cancerous

Country Status (1)

Country Link
WO (1) WO2011012136A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013093635A2 (en) 2011-10-21 2013-06-27 Hospital Clinic De Barcelona Plasma micrornas for the detection of early colorectal cancer
WO2015155765A1 (en) 2014-04-10 2015-10-15 Bio-Marcare Technologies Ltd. Methods and kits for identifying pre-cancerous colorectal polyps and colorectal cancer

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999014226A2 (en) 1997-09-12 1999-03-25 Exiqon A/S Bi- and tri-cyclic nucleoside, nucleotide and oligonucleotide analogues
US6268490B1 (en) 1997-03-07 2001-07-31 Takeshi Imanishi Bicyclonucleoside and oligonucleotide analogues
US6770748B2 (en) 1997-03-07 2004-08-03 Takeshi Imanishi Bicyclonucleoside and oligonucleotide analogue
WO2006137941A2 (en) * 2004-11-12 2006-12-28 Ambion, Inc. Methods and compositions involving mirna and mirna inhibitor molecules
US20070161004A1 (en) * 2004-05-28 2007-07-12 David Brown Methods and compositions involving microRNA
US20070292878A1 (en) * 2005-01-25 2007-12-20 Rosetta Inpharmatics Llc Methods for quantitating small rna molecules
WO2009059026A1 (en) * 2007-10-30 2009-05-07 Veridex, Llc Process for monitoring colorectal cancer

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6268490B1 (en) 1997-03-07 2001-07-31 Takeshi Imanishi Bicyclonucleoside and oligonucleotide analogues
US6770748B2 (en) 1997-03-07 2004-08-03 Takeshi Imanishi Bicyclonucleoside and oligonucleotide analogue
WO1999014226A2 (en) 1997-09-12 1999-03-25 Exiqon A/S Bi- and tri-cyclic nucleoside, nucleotide and oligonucleotide analogues
US20070161004A1 (en) * 2004-05-28 2007-07-12 David Brown Methods and compositions involving microRNA
WO2006137941A2 (en) * 2004-11-12 2006-12-28 Ambion, Inc. Methods and compositions involving mirna and mirna inhibitor molecules
US20070292878A1 (en) * 2005-01-25 2007-12-20 Rosetta Inpharmatics Llc Methods for quantitating small rna molecules
WO2009059026A1 (en) * 2007-10-30 2009-05-07 Veridex, Llc Process for monitoring colorectal cancer

Non-Patent Citations (17)

* Cited by examiner, † Cited by third party
Title
AMBROS ET AL., RNA, vol. 9, 2003, pages 277 - 279
AMBROS, V. ET AL.: "A uniform system for microRNA annotation", RNA, vol. 9, 2003, pages 277 - 279, XP009032091, DOI: doi:10.1261/rna.2183803
BANDRES E ET AL: "Identification by Real-time PCR of 13 mature microRNAs differentially expressed in colorectal cancer and non-tumoral tissues", MOLECULAR CANCER, vol. 5, July 2006 (2006-07-01), XP002607207, ISSN: 1476-4598 *
BUSTIN, S.A: "A-Z of quantitative PCR, IUL Biotechnology Series", vol. 5, 2004, pages: 882
CLAUDIUS FABER ET AL: "The impact of microRNAs on colorectal cancer", VIRCHOWS ARCHIV, vol. 454, no. 4, 14 March 2009 (2009-03-14), SPRINGER, BERLIN, DE, pages 359 - 367, XP019715706, ISSN: 1432-2307 *
MITCHELL: "Tom. Machine Learning", 1997, MCGRAW HILL
NAKAGAWA YOSHIHITO ET AL: "Deregulated Expression of MicroRNA-143,-145,-7,-21 and-34a in Colon Tumor Development", GASTROENTEROLOGY, vol. 136, no. 5, Suppl. 1, May 2009 (2009-05-01), & DIGESTIVE DISEASE WEEK/110TH ANNUAL MEETING OF THE AMERICAN-GASTROENTEROLOGICAL-ASSOCIATION; CHICAGO, IL, USA; MAY 30 JUNE 04, 2009, pages A620, XP002607208, ISSN: 0016-5085 *
PFAFFL M W: "A new mathematical model for relative quantification in real-time RT-PCR.", NUCLEIC ACIDS RESEARCH, vol. 29, no. 9, E45, 1 May 2001 (2001-05-01), pages 2002 - 2007, XP002607492, ISSN: 1362-4962, DOI: 10.1093/nar/29.9.e45 *
SACKETT ET AL.: "Clinical Epidemiology: A Basic Science for Clinical Medicine", 1985, LITTLE BROWN AND CO.
SACKETT ET AL.: "Clinical Epidemiology: A Basic Science for Clinical Medicine", 1985, LITTLE BROWN AND CO., pages: 106 - 7
SCHETTER ET AL., JAMA, 2008
SCHETTER, AARON J. ET AL.: "MicroRNA Expression Profiles Associated With Prognosis and Therapeutic Outcome in Colon Adenocarcinoma", JAMA, vol. 299, no. 4, 2008, pages 425 - 436, XP002529936, DOI: doi:10.1001/jama.299.4.425
SLABY O ET AL: "Altered expression of miR-21, miR-31, miR-143 and miR-145 is related to clinicopathologic features of colorectal cancer", ONCOLOGY, vol. 72, no. 5-6, 1 December 2007 (2007-12-01), S. KARGER, BASEL, CH, pages 397 - 402, XP009111358, ISSN: 0030-2414, [retrieved on 20080115], DOI: 10.1159/000113489 *
TAZAWA H ET AL: "Tumor-suppressive miR-34a induces senescence-like growth arrest through modulation of the E2F pathway in human colon cancer cells", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES (PNAS), vol. 104, no. 39, 25 September 2007 (2007-09-25), US, pages 15472 - 15477, XP002535305, ISSN: 0027-8424, DOI: 10.1073/PNAS.0707351104 *
TOM MITCHELL: "Machine Learning", 1997, MCGRAW HILL
WERTHER K; CHRISTENSEN IJ; BRUNNER N; NIELSEN HJ, EUR J SURG ONCOL., vol. 26, 2000, pages 657 - 62
WERTHER K; CHRISTENSEN IJ; BRÜNNER N; NIELSEN HJ: "Soluble vascular endothelial growth factor levels in patients with primary colorectal carcinoma. The Danish RANX05 Colorectal Cancer Study Group", EUR J SURG ONCOL, vol. 26, 2000, pages 657 - 62

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013093635A2 (en) 2011-10-21 2013-06-27 Hospital Clinic De Barcelona Plasma micrornas for the detection of early colorectal cancer
EP2944700A1 (en) 2011-10-21 2015-11-18 Hospital Clínic de Barcelona Plasma microRNAs for the detection of early colorectal cancer
WO2015155765A1 (en) 2014-04-10 2015-10-15 Bio-Marcare Technologies Ltd. Methods and kits for identifying pre-cancerous colorectal polyps and colorectal cancer
US10407737B2 (en) 2014-04-10 2019-09-10 Bio-Marcare Technologies Ltd. Methods and kits for identifying pre-cancerous colorectal polyps and colorectal cancer

Similar Documents

Publication Publication Date Title
US20210180134A1 (en) microRNAs as Biomarkers for Endometriosis
US20110160290A1 (en) Use of extracellular rna to measure disease
WO2016036994A1 (en) Method of analysis allowing avoidance of surgery
EP2732052A1 (en) Diagnostic microrna profiling in cutaneous t-cell lymphoma (ctcl)
JP2011501966A (en) Process for colorectal cancer monitoring
WO2009133915A1 (en) Cancer marker, method for evaluation of cancer by using the cancer marker, and evaluation reagent
Usó et al. miRNA detection methods and clinical implications in lung cancer
JP2011501964A (en) Prognostic prediction process of squamous cell lung cancer
WO2014055117A1 (en) Diagnostic mirnas for differential diagnosis of incidental pancreatic cystic lesions
CN101613748A (en) A kind of method that detects serum marker of pancreatic cancer
Segersten et al. The role of microRNA profiling in prognosticating progression in Ta and T1 urinary bladder cancer
KR102096498B1 (en) MicroRNA-4732-5p for diagnosing or predicting recurrence of colorectal cancer and use thereof
US20150247202A1 (en) Microrna based method for diagnosis of colorectal tumors and of metastasis
KR102096499B1 (en) MicroRNA-3960 for diagnosing or predicting recurrence of colorectal cancer and use thereof
Zhou et al. A method for extracting and characterizing RNA from urine: For downstream PCR and RNAseq analysis
WO2011012136A1 (en) A method for classifying a human cell sample as cancerous
Zhao et al. Serum microRNA-155 as a potential biomarker for breast cancer screening
KR101381894B1 (en) Serum miRNA as a marker for the diagnosis of lymph node metastasis of gastric cancer
CN109563548B (en) In vitro method for identifying pancreatic cancer or intraductal papillary myxoma of pancreas
Lee et al. MicroRNA-23a: a novel serum based diagnostic biomarker for lung adenocarcinoma
US11021756B2 (en) MiRNA markers for the diagnosis of osteosarcoma
US20130059742A1 (en) Global analysis of serum micro rnas as potential biomarkers for lung adenocarcinoma
JP7297902B2 (en) Analysis method and kit
US20150240319A1 (en) Global Analysis of Serum microRNAs as Potential Biomarkers for Lung Adenocarcinoma
CN111909999B (en) MiRNA combination and application thereof in preparation of marker for DLBCL prognosis

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10744665

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 10744665

Country of ref document: EP

Kind code of ref document: A1