WO2010086782A2 - Methods for the subclassification of breast tumours - Google Patents

Methods for the subclassification of breast tumours Download PDF

Info

Publication number
WO2010086782A2
WO2010086782A2 PCT/IB2010/050316 IB2010050316W WO2010086782A2 WO 2010086782 A2 WO2010086782 A2 WO 2010086782A2 IB 2010050316 W IB2010050316 W IB 2010050316W WO 2010086782 A2 WO2010086782 A2 WO 2010086782A2
Authority
WO
WIPO (PCT)
Prior art keywords
methylation
feature
seq
sequences
subject
Prior art date
Application number
PCT/IB2010/050316
Other languages
French (fr)
Other versions
WO2010086782A8 (en
WO2010086782A3 (en
Inventor
Sitharthan Kamalakaran
Angel Janevski
James B. Hicks
Original Assignee
Koninklijke Philips Electronics N.V.
Cold Spring Harbor Laboratory
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V., Cold Spring Harbor Laboratory filed Critical Koninklijke Philips Electronics N.V.
Priority to EP10705647A priority Critical patent/EP2391735A2/en
Priority to US13/147,105 priority patent/US20120004118A1/en
Priority to CN2010800052379A priority patent/CN102549165A/en
Priority to JP2011547032A priority patent/JP2012517215A/en
Priority to BRPI1005306A priority patent/BRPI1005306A2/en
Priority to RU2011135955/10A priority patent/RU2011135955A/en
Publication of WO2010086782A2 publication Critical patent/WO2010086782A2/en
Publication of WO2010086782A3 publication Critical patent/WO2010086782A3/en
Publication of WO2010086782A8 publication Critical patent/WO2010086782A8/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6827Hybridisation assays for detection of mutation or polymorphism
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2537/00Reactions characterised by the reaction format or use of a specific feature
    • C12Q2537/10Reactions characterised by the reaction format or use of a specific feature the purpose or use of
    • C12Q2537/165Mathematical modelling, e.g. logarithm, ratio
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/112Disease subtyping, staging or classification
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/154Methylation markers
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10TTECHNICAL SUBJECTS COVERED BY FORMER US CLASSIFICATION
    • Y10T436/00Chemistry: analytical and immunological testing
    • Y10T436/14Heterocyclic carbon compound [i.e., O, S, N, Se, Te, as only ring hetero atom]
    • Y10T436/142222Hetero-O [e.g., ascorbic acid, etc.]
    • Y10T436/143333Saccharide [e.g., DNA, etc.]

Definitions

  • This invention pertains in general to the field of biology and bioinformatics. More particularly the invention relates to the field of categorization of cancer tumours and even more particularly to identifying methylated sites, which may aid in categorization of cancer tumours.
  • breast cancer is the fifth most common cause of cancer death, after lung cancer, stomach cancer, liver cancer, and colon cancer.
  • breast cancer is the most common cancer and the most common cause of cancer death.
  • Breast cancer is diagnosed by the pathological examination of surgically removed breast tissue. Following diagnosis, it is important to analyze the tumour type in order to aid clinicians when choosing the right therapy. Within the art, such analysis is performed according to two categories.
  • the first category involves the use of immuno-histopathological variables, such as tumour size, ER/PR status, lymph node negativity, etc. to define a clinical prognostic index such as the Nottingham Prognostic Index (NPI).
  • NPI Nottingham Prognostic Index
  • the problem with such an index is that it has been shown to be very conservative, thus typically causing patients to receive aggressive therapy even when they are a low risk of disease recurrence.
  • the second category involves the measurement of the expression levels of a large number of genes, typically around 500, and calculating probability of a subtype based on the relative expression levels of the genes. This method is very costly in terms of tissue handling requirements. It is also hard to perform in a clinical setting, due to the demand of laboratory equipment.
  • DNA methylation a type of chemical modification of DNA that can be inherited and subsequently removed without changing the original DNA sequence, is the most well studied epigenetic mechanism of gene regulation. There are areas in DNA where a cytosine nucleotide occurs next to a guanine nucleotide in the linear sequence of bases called CpG islands.
  • CpG islands are generally heavily methylated in normal cells. However, during tumorigenesis, hypomethylation occurs at these islands, which may result in the expression of certain repeats. These hypomethylation events also correlate to the severity of some cancers. Under certain circumstances, which may occur in pathologies such as cancer, imprinting, development, tissue specificity, or X chromosome inactivation, gene associated islands may be heavily methylated. Specifically, in cancer, methylation of islands proximal to tumour suppressors is a frequent event, often occurring when the second allele is lost by deletion (Loss of Heterozygosity, LOH). Some tumour suppressors commonly seen with methylated islands are pi 6, Rassfla, and BRCAl.
  • the present invention preferably seeks to mitigate, alleviate or eliminate one or more of the above-identified deficiencies in the art and disadvantages singly or in any combination and solves at least the above mentioned problems by providing a method for the analysis of breast cancer disorders according to the appended patent claims.
  • a method for analysis of breast cancer disorders comprises determining the genomic methylation status of one or more CpG dinucleotides in a sequence selected from the group of sequences consisting of SEQ ID NO. 1 to SEQ ID NO. 600.
  • the method provides for improved abilities to characterize cancer tumours using methylation patterns.
  • the regions of interest of the sequences SEQ ID NO. 1 to 600 are designated in table 1 (as "start” and “end” on respective "chromosome”).
  • a computer program product is disclosed.
  • the computer program product is stored on a computer-readable medium comprising software code adapted to perform the steps of the method according to an aspect when executed on a data-processing apparatus.
  • a device in an aspect a device is disclosed.
  • the device comprises means adapted to carry out methods according to som embodiments.
  • An advantage with this is to support a clinician.
  • sequences claimed also encompass the sequences, which are reverse complement to the sequences designated.
  • Fig. 1 is a schematic illustration of a method according to some embodiments
  • Fig. 2 is a schematic illustration of a dataset 20 of five measurements 1 to 5;
  • Fig. 3 is a schematic illustration of a first subset 30 of five measurements 1 to 5;
  • Fig. 4 is a schematic illustration of a second subset 40 of five measurements 1 to 5;
  • Fig. 5 is an illustration of clusters 51, 52, 53, where Fig. 5 A is a first cluster 51 , Fig. 5B is a second cluster 52 and Fig. 5C is a third cluster 53.
  • Fig. 6 is a schematic illustration of a computer program product according to an embodiment.
  • Fig. 7 is a schematic illustration of a device according to an embodiment.
  • An idea according to some embodiments is a method using a small selection of DNA sequences to analyze breast cancer disorders.
  • the analysis is done by determining genomic methylation status of one or more CpG dinucleotides, in either sequence disclosed herein, or its reverse complement. It was surprisingly found that some DNA sequences, SEQ ID NO: 1 to
  • SEQ ID NO: 600 act as epigenetic markers that may be used to analyze breast cancer by subtyping tumours.
  • the DNA SEQ ID NO: 1 to SEQ ID NO: 600 were identified by analysing 150 000 individual genomic loci for methylation, across a set of 83 breast tumours. The availability of clinical information regarding tumour specimens allowed for an investigation of DNA methylation in the context of breast cancer subtypes, histology and tumour aggressiveness. The five major breast cancer molecular subtypes
  • a method 10 is provided, according to Fig. 1.
  • Said method 10 comprises selecting 100 a feature subset comprising at least one post from the methylation classification list according to SEQ ID NO. 1 to SEQ ID NO. 600.
  • Hi 1 of N values where In 1J provides the methylation status for the i-th sample and the j-th probe.
  • some statistical analysis of the measured signal will produce a set of probes (features) to be input to the hierarchical clustering method above.
  • the feature subset selection 100 uses a Genetic Algorithm (GA), which repetitively evaluate feature subsets based on a fitness function that in some way characterizes some property of the feature subset.
  • GA Genetic Algorithm
  • hierarchical clustering with Pearson correlation and complete linkage is used as the fitness function to assess how good a feature subset is.
  • Fig. 2 show a dataset 20 of measurements, in this case 5 samples, which are displayed as 1 to 5 are characterized with 8 features, which are displayed as letters A to H.
  • Figs. 3 and 4 show two feature subsets, generated from the measurements dataset by selecting rows (features) from the dataset.
  • Fig. 3 shows a first feature subset 30 with the 5 samples, which are displayed as 1 to 5, but only four of the features.
  • Fig. 4 shows a second subset 40 with the 5 samples, which are displayed as 1 to 5, but only six of the features.
  • Fig. 5 show clusters, or dendrograms, based on the datasets from Figs. 2 to 4, when subjected to hierarchical clustering with
  • Fig. 5 A shows a first cluster 51 based on the total dataset 20.
  • Fig. 5B shows a second cluster 52 based on the first feature subset 30 and
  • Fig. 5C shows a third cluster 53 based on the second feature subset 40.
  • a ranking of all clustering results is performed.
  • a cluster analysis method is used for the ranking. For example, it is possible to characterize and rank individual clusters based on their validity, for example in terms of cluster cohesion or separation. This may be done in one of multiple ways well known to a person skilled in the art. Thus, it is possible to rank two or more feature subsets based on the quality of the clusters they generate when used to cluster the samples.
  • the second subset 40 represented by Fig. 5C, is clearly better compared to the first feature subset 30 or the clustering based on the entire dataset 20, since it correctly cluster the subtypes together.
  • two clustering outputs Di and D 2 are compared based on the clusters.
  • N (Ci, C 2 , ... C N ) clusters are obtained based on the dendrogram, produced by the clustering.
  • a property is computed based on the clusters, such as the popular method of silhouette width - SIL(C 1 ).
  • a single-number characterization of a clustering is obtained by the formula:
  • AVGSIL(Di) and AVGSIL(D 2 ) it may be determined which clustering is preferable.
  • AVGUNIFORMITY D I
  • AVGUNIFORMITY D 2
  • all evaluated features subsets can be further filtered based on their performance during the GA execution.
  • feature subsets are sorted by the average clustering performance in stratification of the clinical samples.
  • feature subsets, in addition to the average performance are filtered based on their persistent re-evaluation. In other words, feature subsets that are repeatedly selected for further evaluation are preferred to feature subsets that are dropped from consideration only after a few iterations.
  • the final output of a GA feature subset selection is to run multiple instances with different initial conditions, and merge the filtered feature subsets from each of these instances.
  • Feature subsets from one such evaluation are listed in Table 3 A. Furthermore, a cumulative characterization of a collection of GA runs can be obtained and used to generate feature subsets that aggregate the feature subsets in single set of subsets. In one embodiment, the appearance of each feature in feature subsets is counted and a total histogram is obtained giving the degree of utilization of each of the 600 features. Based on this information and for example in one embodiment the frequencies of the pairwise occurrences of the 600 features are used to build feature subsets that summarize the GA run in a single set of subsets, a so called trend pattern. Table 3B provides such feature subset of lengths 45 and 60.
  • the feature subset comprises the CpG dinucleotides according to one of the selections listed in Table 2.
  • Each subset comprise a selection of sequences indicated by numbers corresponding to the FragID:s in table 1. 10 152494, 55649, 158649, 33381, 129193, 38485, 86866, 1601, 153363, 158646, 72675, 128850, 13583, 4109, 38815, 63267, 19926, 103295, 79123, 4823, 80726, 115442, 25715, 71104,92237, 152496, 134481, 1359,65610,55215,11111, 114219, 118132, 149792,757, 27685, 71089, 120745, 3535, 36661, 52666, 148458, 56504, 87210, 110848, 39760, 152716, 94345, 47510, 87185, 156306, 71105, 89865, 54424, 95724, 153087, 42953, 71090, 57442, 76797, 7053
  • the feature subset comprises the CpG dinucleotides according to one of the selections listed in Table 3 A.
  • Table 3 A Feature subsets. Each subset comprise a selection of sequences indicated by numbers corresponding to the FragID:s in table 1.
  • the feature subset comprises the CpG dinucleotides according to one of the selections listed in Table 3B.
  • Table 3B Feature subsets. Each subset comprise a selection of sequences indicated by numbers corresponding to the FragID:s in table 1.
  • the method 10 comprises determining 120 the methylation status of one or more CpG dinucleotides in a sequence selected from the group of sequences corresponding to the marker panel, resulting in a methylation classification list. There are numerous methods for determining 120 the methylation status of a DNA molecule of a subject, corresponding to the feature subset.
  • the DNA may be obtained by any method for purifying DNA known to a person skilled in the art.
  • the methylation status is determined 110 by means of one or more of the methods selected form the group of, bisulfite sequencing, pyrosequencing, methylation-sensitive single-strand conformation analysis (MS-SSCA), high resolution melting analysis (HRM), methylation-sensitive single nucleotide primer extension (MS- SnuPE), base-specific cleavage/MALDI-TOF, methylation-specific PCR (MSP), microarray-based methods, msp I cleavage.
  • the methods selected form the group of, bisulfite sequencing, pyrosequencing, methylation-sensitive single-strand conformation analysis (MS-SSCA), high resolution melting analysis (HRM), methylation-sensitive single nucleotide primer extension (MS- SnuPE), base-specific cleavage/MALDI-TOF, methylation-specific PCR (MSP), microarray-based methods, msp I cleavage.
  • the method 10 also comprises statistically analyzing 120 the methylation classification list, thus obtaining a category of the breast cancer of the subject. This may be done by jointly clustering the subject methylation data and the samples from the clinical study. The resulting clustering is then split in N groups (e.g. by cutting the clustering dendrogram into N sub-trees). The sub-tree containing the subject is evaluated for the categories of breast cancer present in the study samples and the subject sample is assigned the category of the majority samples in the sub-tree. In an embodiment, the method 10 further comprises classifying (130) the subject as belonging to one of the five major subtypes of breast cancers.
  • a computer program product 60 is provided.
  • the computer program product 60 is stored on a computer-readable medium, which comprises a first 61, second 62, third 63 and forth 64 code segments arranged, when run by an apparatus having computer-processing properties, for performing all of the method steps defined in some embodiments.
  • a device 70 for supporting a clinician comprising means for selecting 700 a feature subset comprising at least one post from the methylation classification list according to SEQ ID NO. 1 to SEQ ID NO. 600. Furthermore, the device 70 comprises means for determining 710 the methylation status of one or more CpG dinucleotides in DNA of a subject, corresponding to the feature subset. Furthermore, the device 70 comprises means for statistically analyzing 720 the methylation classification list, thus obtaining a category of the breast cancer of the subject. Furthermore, the device 70 comprises means for classifying 730 the subject as belonging to one of the five major subtypes of breast cancers. Said means 700, 710, 720, 730 may be operatively connected to each other.
  • the invention may be implemented in any suitable form including hardware, software, firmware or any combination of these. However, preferably, the invention is implemented as computer software running on one or more data processors and/or digital signal processors.
  • the elements and components of an embodiment of the invention may be physically, functionally and logically implemented in any suitable way. Indeed, the functionality may be implemented in a single unit, in a plurality of units or as part of other functional units. As such, the invention may be implemented in a single unit, or may be physically and functionally distributed between different units and processors.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Analytical Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Immunology (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Pathology (AREA)
  • Hospice & Palliative Care (AREA)
  • Oncology (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)
  • Medical Treatment And Welfare Office Work (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

Provided is a method for the analysis of breast cancer disorders, comprising determining the genomic methylation status of one or more CpG dinucleotides. Furthermore, a computer program product stored on a computer-readable medium comprising software code adapted to perform the steps of the method when executed on a data-processing apparatus is provided. A device comprising means for supporting a clinician is also provided.

Description

METHODS FOR THE SUBCLAS SIFIC ATION OF BREAST TUMOURS
FIELD OF THE INVENTION This invention pertains in general to the field of biology and bioinformatics. More particularly the invention relates to the field of categorization of cancer tumours and even more particularly to identifying methylated sites, which may aid in categorization of cancer tumours.
BACKGROUND OF THE INVENTION
Worldwide, breast cancer is the fifth most common cause of cancer death, after lung cancer, stomach cancer, liver cancer, and colon cancer. Among women, breast cancer is the most common cancer and the most common cause of cancer death. Breast cancer is diagnosed by the pathological examination of surgically removed breast tissue. Following diagnosis, it is important to analyze the tumour type in order to aid clinicians when choosing the right therapy. Within the art, such analysis is performed according to two categories.
The first category involves the use of immuno-histopathological variables, such as tumour size, ER/PR status, lymph node negativity, etc. to define a clinical prognostic index such as the Nottingham Prognostic Index (NPI). The problem with such an index is that it has been shown to be very conservative, thus typically causing patients to receive aggressive therapy even when they are a low risk of disease recurrence. The second category involves the measurement of the expression levels of a large number of genes, typically around 500, and calculating probability of a subtype based on the relative expression levels of the genes. This method is very costly in terms of tissue handling requirements. It is also hard to perform in a clinical setting, due to the demand of laboratory equipment. DNA methylation, a type of chemical modification of DNA that can be inherited and subsequently removed without changing the original DNA sequence, is the most well studied epigenetic mechanism of gene regulation. There are areas in DNA where a cytosine nucleotide occurs next to a guanine nucleotide in the linear sequence of bases called CpG islands.
CpG islands are generally heavily methylated in normal cells. However, during tumorigenesis, hypomethylation occurs at these islands, which may result in the expression of certain repeats. These hypomethylation events also correlate to the severity of some cancers. Under certain circumstances, which may occur in pathologies such as cancer, imprinting, development, tissue specificity, or X chromosome inactivation, gene associated islands may be heavily methylated. Specifically, in cancer, methylation of islands proximal to tumour suppressors is a frequent event, often occurring when the second allele is lost by deletion (Loss of Heterozygosity, LOH). Some tumour suppressors commonly seen with methylated islands are pi 6, Rassfla, and BRCAl.
There are reported epigenetic markers for colorectal and prostate cancer. For example, Epigenomics AG (Berlin, Germany) has the Septin 9 as a marker for colorectal cancer screening in blood plasma. A method for using methylation sites to predict differential therapy responses in cancer and recommending an appropriate therapy has been disclosed in US20050021240A1. However, the results predicted by this method are limited, since they cannot be directly applied in clinical practice. Therefore, it would advantageous to have a method for the analysis of breast cancer disorders, which is time efficient, reliable and cost-effective.
SUMMARY OF THE INVENTION
Accordingly, the present invention preferably seeks to mitigate, alleviate or eliminate one or more of the above-identified deficiencies in the art and disadvantages singly or in any combination and solves at least the above mentioned problems by providing a method for the analysis of breast cancer disorders according to the appended patent claims.
According to an aspect a method for analysis of breast cancer disorders is disclosed. The method comprises determining the genomic methylation status of one or more CpG dinucleotides in a sequence selected from the group of sequences consisting of SEQ ID NO. 1 to SEQ ID NO. 600. The method provides for improved abilities to characterize cancer tumours using methylation patterns. The regions of interest of the sequences SEQ ID NO. 1 to 600 are designated in table 1 (as "start" and "end" on respective "chromosome").
This aspect presents improvements over the state of the art in that it enables a highly specific classification of breast cell proliferative disorders. In an aspect a computer program product is disclosed. The computer program product is stored on a computer-readable medium comprising software code adapted to perform the steps of the method according to an aspect when executed on a data-processing apparatus.
In an aspect a device is disclosed. The device comprises means adapted to carry out methods according to som embodiments. An advantage with this is to support a clinician.
Herein, the sequences claimed also encompass the sequences, which are reverse complement to the sequences designated.
BRIEF DESCRIPTION OF THE DRAWINGS
These and other aspects, features and advantages of which the invention is capable of will be apparent and elucidated from the following description of embodiments of the present invention, reference being made to the accompanying drawings, in which
Fig. 1 is a schematic illustration of a method according to some embodiments;
Fig. 2 is a schematic illustration of a dataset 20 of five measurements 1 to 5; Fig. 3 is a schematic illustration of a first subset 30 of five measurements 1 to 5; Fig. 4 is a schematic illustration of a second subset 40 of five measurements 1 to 5; and Fig. 5 is an illustration of clusters 51, 52, 53, where Fig. 5 A is a first cluster 51 , Fig. 5B is a second cluster 52 and Fig. 5C is a third cluster 53. Fig. 6 is a schematic illustration of a computer program product according to an embodiment.Fig. 7 is a schematic illustration of a device according to an embodiment.
DETAILED DESCRIPTION OF EMBODIMENTS
Several embodiments of the present invention will be described in more detail below with reference to the accompanying drawings in order for those skilled in the art to be able to carry out the invention. The invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art. The embodiments do not limit the invention, but the invention is only limited by the appended patent claims. Furthermore, the terminology used in the detailed description of the particular embodiments illustrated in the accompanying drawings is not intended to be limiting of the invention.
An idea according to some embodiments is a method using a small selection of DNA sequences to analyze breast cancer disorders. The analysis is done by determining genomic methylation status of one or more CpG dinucleotides, in either sequence disclosed herein, or its reverse complement. It was surprisingly found that some DNA sequences, SEQ ID NO: 1 to
SEQ ID NO: 600 act as epigenetic markers that may be used to analyze breast cancer by subtyping tumours. In prior art, it is possible to subtype breast cancer based on gene expression. Five different subtypes have been reported; luminal A, luminal B, basal, ERBB2 overexpressing, and normal-like. The inventors have identified the same subtypes using DNA methylation.
The DNA SEQ ID NO: 1 to SEQ ID NO: 600 were identified by analysing 150 000 individual genomic loci for methylation, across a set of 83 breast tumours. The availability of clinical information regarding tumour specimens allowed for an investigation of DNA methylation in the context of breast cancer subtypes, histology and tumour aggressiveness. The five major breast cancer molecular subtypes
(luminal A and B, basal, ERBB2 overexpressing, and normal-like) were identified. First, an investigation was performed regarding however unsupervised clustering of the tumour set using methylation recapitulates the major Luminal and basal classes that were identified by expression analysis or not. A filtering criterion was used to identify the features to be used in clustering. This criterion was the top 500 loci that varied most across the 83 tumour samples. Then, the top 100 loci that distinguished tumours from normal tissues from were added. These 600 features, displayed in table 1, were used to cluster the 83 tumours for which the expression subtype data was available. Hierarchical clustering with Pearson correlation and complete linkage of the samples based on these six hundred loci gave a dendrogram that is surprisingly similar to the one produced by expression analysis.
Figure imgf000007_0001
Figure imgf000008_0001
Figure imgf000009_0001
Figure imgf000010_0001
Figure imgf000011_0001
Figure imgf000012_0001
Figure imgf000013_0001
Figure imgf000014_0001
Figure imgf000015_0001
Figure imgf000016_0001
Figure imgf000017_0001
Figure imgf000018_0001
Figure imgf000019_0001
Figure imgf000020_0001
Figure imgf000021_0001
In an embodiment a method 10 is provided, according to Fig. 1. Said method 10 comprises selecting 100 a feature subset comprising at least one post from the methylation classification list according to SEQ ID NO. 1 to SEQ ID NO. 600.
Selecting 100 a feature subset may be performed based on hierarchical clustering with Pearson correlation and complete linkage to characterize the fitness of each feature subset, given a dataset with methylation characterization for of each sample (S1, i=l..M) in a form of a vector Hi1 of N values, where In1J provides the methylation status for the i-th sample and the j-th probe. Typically, some statistical analysis of the measured signal will produce a set of probes (features) to be input to the hierarchical clustering method above.
The feature subset selection 100 uses a Genetic Algorithm (GA), which repetitively evaluate feature subsets based on a fitness function that in some way characterizes some property of the feature subset. In an embodiment, hierarchical clustering with Pearson correlation and complete linkage is used as the fitness function to assess how good a feature subset is.
The following example is used to illustrate the principle.
Fig. 2 show a dataset 20 of measurements, in this case 5 samples, which are displayed as 1 to 5 are characterized with 8 features, which are displayed as letters A to H. Figs. 3 and 4 show two feature subsets, generated from the measurements dataset by selecting rows (features) from the dataset. Fig. 3 shows a first feature subset 30 with the 5 samples, which are displayed as 1 to 5, but only four of the features. Fig. 4 shows a second subset 40 with the 5 samples, which are displayed as 1 to 5, but only six of the features.
Next, clustering may be performed. Fig. 5 show clusters, or dendrograms, based on the datasets from Figs. 2 to 4, when subjected to hierarchical clustering with
Pearson correlation and complete linkage. Fig. 5 A shows a first cluster 51 based on the total dataset 20. Fig. 5B shows a second cluster 52 based on the first feature subset 30 and Fig. 5C shows a third cluster 53 based on the second feature subset 40.
After having clustered the datasets, a ranking of all clustering results is performed. In one embodiment, a cluster analysis method is used for the ranking. For example, it is possible to characterize and rank individual clusters based on their validity, for example in terms of cluster cohesion or separation. This may be done in one of multiple ways well known to a person skilled in the art. Thus, it is possible to rank two or more feature subsets based on the quality of the clusters they generate when used to cluster the samples.
In another embodiment, some property of the samples (e.g. cancer subtype based on pathology) is used for ranking. From this property, the same or related subtypes are grouped together. For example, if the five samples from Figs. 2 to 4 have the following subtype labels associated with them (I=X, 2=X, 3=Y, 4=Y, 5=X} respectively, this would then produce the following label groupings for the three clusters shown in Fig. 5: A: (XXY, YX}; B: (XY, YXX}; C: (XXX, YY}. In this case, the second subset 40, represented by Fig. 5C, is clearly better compared to the first feature subset 30 or the clustering based on the entire dataset 20, since it correctly cluster the subtypes together.
In an embodiment, two clustering outputs Di and D2, are compared based on the clusters.. First, N (Ci, C2, ... CN) clusters are obtained based on the dendrogram, produced by the clustering. Then, a property is computed based on the clusters, such as the popular method of silhouette width - SIL(C1). Now a single-number characterization of a clustering is obtained by the formula:
AVGSIL(D) = (SUM[i=l..N] SIL(C1)VN
By comparing AVGSIL(Di) and AVGSIL(D2), it may be determined which clustering is preferable. In another embodiment, build a data structure G is built in form of a matrix with dimensions N x L, where L is the number of distinct labels available for the samples. With labels (X. Y}, L = 2, or for labels (normal, aggressive cancer, non-aggressive cancer} L = 3. Then for each cluster i (i=l ..N) L values are obtained in the following manner for each element gy from G:
gij = count(sample in cluster i and has label j)
Now, it is possible to compute uniformity of each cluster C1:
UNIFORMITY(C1) = max(countsinrowi in G)/sum(countsinrowiinG)
Finally, the clustering is characterized with: AVGUNIFORMITY(D) = SUM[i=l..N] (UNIFORMITY(CO)ZN
as a single-number characterization of a clustering. By comparing AVGUNIFORMITY (DI) and AVGUNIFORMITY (D2) it may be determined which clustering is preferable.
Iterative repetition of this selection process gradually refines the quality of the clustering of the feature subsets discovered by the GA. After a number of repetitions, all evaluated features subsets can be further filtered based on their performance during the GA execution. In one embodiment, feature subsets are sorted by the average clustering performance in stratification of the clinical samples. In another embodiment, feature subsets, in addition to the average performance, are filtered based on their persistent re-evaluation. In other words, feature subsets that are repeatedly selected for further evaluation are preferred to feature subsets that are dropped from consideration only after a few iterations. The final output of a GA feature subset selection is to run multiple instances with different initial conditions, and merge the filtered feature subsets from each of these instances. Feature subsets from one such evaluation are listed in Table 3 A. Furthermore, a cumulative characterization of a collection of GA runs can be obtained and used to generate feature subsets that aggregate the feature subsets in single set of subsets. In one embodiment, the appearance of each feature in feature subsets is counted and a total histogram is obtained giving the degree of utilization of each of the 600 features. Based on this information and for example in one embodiment the frequencies of the pairwise occurrences of the 600 features are used to build feature subsets that summarize the GA run in a single set of subsets, a so called trend pattern. Table 3B provides such feature subset of lengths 45 and 60.
Examples of feature subsets are provided in Tables 2, 3A and 3B. Thus, in an embodiment, the feature subset comprises the CpG dinucleotides according to one of the selections listed in Table 2.
Table 2. Feature subsets. Each subset comprise a selection of sequences indicated by numbers corresponding to the FragID:s in table 1.
Figure imgf000025_0001
10 152494, 55649, 158649, 33381, 129193, 38485, 86866, 1601, 153363, 158646, 72675, 128850, 13583, 4109, 38815, 63267, 19926, 103295, 79123, 4823, 80726, 115442, 25715, 71104,92237, 152496, 134481, 1359,65610,55215,11111, 114219, 118132, 149792,757, 27685, 71089, 120745, 3535, 36661, 52666, 148458, 56504, 87210, 110848, 39760, 152716, 94345, 47510, 87185, 156306, 71105, 89865, 54424, 95724, 153087, 42953, 71090, 57442, 76797, 70538, 156440, 113989, 13394, 46277, 14656, 20225, 9029, 89183
11 152494, 110545, 12301, 14289, 61152, 1650, 129193, 99554, 153362, 72675, 120416, 149794, 13583, 19926, 32667, 103295, 150393, 92237, 45338, 95107, 96587, 149788, 66071, 14254, 757, 37395, 99668, 14231, 118129, 152681, 155418, 36661, 146589, 148458, 1249, 55611, 110848, 71074, 88982, 32624, 47510, 31913, 26073, 71121, 71105, 145717, 72461, 15478, 118488, 153027, 154875, 133709, 144856, 60445, 73062, 5525, 152213, 92849, 80168, 63043, 90137, 56922
12 152494, 110545, 114218, 129193, 86495, 86866, 99554, 45501, 38815, 19926, 158648, 103295, 60291, 10427, 149824, 115442, 151564, 152496, 98223, 147018, 65670, 77777, 55218, 118491, 118132, 33338, 142017, 54824, 55941, 36661, 145238, 87210, 138677, 39760, 45409, 123890, 99150, 71121, 25164, 1324, 71105, 82920, 1534, 123955, 133709, 24273, 60445, 94051, 71090, 80169, 108016, 70538, 78440, 39539, 131234, 134630, 50444, 87698, 143864, 90137, 64684, 45650
13 152494, 110545, 55649, 1650, 102005, 158649, 129193, 86495, 86866, 128414, 128850, 146035, 1173, 19926, 153364, 4823, 149824, 14609, 72674, 56402, 118551, 45338, 65670, 114220, 61161, 118491, 130315, 18856, 118129, 148458, 87210, 110848, 134826, 145015, 93471, 48491, 80728, 125612, 46110, 110793, 99150, 71121, 96210, 10393, 2123, 15066, 152094, 27268, 28887, 1339, 133709, 111802, 76797, 42441, 145731, 26333, 147896, 63043, 87698, 11354, 73907, 27495
14 114205, 129193, 86866, 99554, 152321, 52027, 80645, 72674, 76619, 151564, 71104, 113247, 47435, 95107, 126936, 136763, 147018, 84490, 65670, 55275, 105101, 20895, 757, 99668, 50853, 27685, 148458, 56504, 110848, 145015, 144226, 89408, 99113, 158958, 125612, 144360, 7116, 26073, 99150, 96210, 71105, 124831, 152094, 71216, 1339, 14451, 88395, 142439, 71090, 92849, 103793, 57442, 119665, 88411, 46277, 10916, 134630, 11354,90137,27495
15 110545, 102005, 129193, 158646, 153362, 73586, 27115, 114138, 127886, 56402, 5104, 115442, 150632, 151564, 71104, 152496, 53338, 114207, 134481, 116804, 65670, 55275, 118132, 130315, 96227, 71581, 118129, 79207, 155418, 123180, 114108, 52666, 1249, 84518, 64725, 87210, 136153, 135257, 145015, 156308, 48491, 152480, 45409, 88982, 26073, 71121, 152094, 40505, 149461, 54424, 28887, 14451, 123955, 56289, 83839, 1391, 108016, 39539, 119665, 88411, 9278, 102061,27677, 115870, 14656, 56922
16 152494, 110545, 86939, 55649, 102005, 25023, 128737, 129193, 14197, 99554, 152321, 153362, 72675, 13583, 39470, 61003, 103295, 79123, 80726, 118551, 114139, 147620, 96587, 55218, 38714, 8273, 757, 54400, 1823, 15771, 46721, 157076, 71120, 3535, 52666, 11474, 148458, 87210, 57206, 152480, 55475, 89408, 99113, 148624, 7116, 8778, 110793, 47510, 26073, 76120, 25164, 71105, 124831, 127669, 9928, 27268, 154875, 144856, 60445, 88395, 94051, 36595, 71090, 111358, 76797, 50444, 27677, 23738, 76467, 71700
17 110545, 114140, 102005, 129193, 99554, 152321, 128850, 5455, 124390, 149824, 80726, 126928, 56402, 151564, 17697, 47435, 152496, 38417, 147018, 116804, 84490, 65670, 4389, 118491, 757, 99668, 15771, 46721, 118129, 79207, 105085, 127220, 36661, 22036, 148458, 64725, 52146, 87210, 136153, 145015, 31913, 26073, 71105, 15066, 145717, 20134, 130161, 14451, 50717, 17091, 60445, 87160, 33136, 54796, 57442, 76797, 59067, 61099, 20706, 28326, 72750, 76801, 82859, 105873, 27677, 113614, 9029
18 152494, 110545, 55649, 153365, 129193, 21537, 86866, 99554, 72675, 120581, 52027, 19926, 103295, 114138, 1340, 151564, 128857, 132985, 118551, 95107, 152748, 98223, 14203,65670, 149788,55218, 118491, 118132, 142017, 118129, 11782,27685,99472, 36661, 87210, 38910, 55611, 135107, 135257, 149787, 48491, 80728, 7116, 110793, 99150, 71105, 9928, 40858, 58680, 1534, 133709, 60445, 94051, 5525, 71090, 70538, 80112, 2643, 9937, 98985, 64684 In an embodiment, the feature subset comprises the CpG dinucleotides according to one of the selections listed in Table 3 A.
Table 3 A. Feature subsets. Each subset comprise a selection of sequences indicated by numbers corresponding to the FragID:s in table 1.
Figure imgf000027_0001
In an embodiment, the feature subset comprises the CpG dinucleotides according to one of the selections listed in Table 3B.
Table 3B. Feature subsets. Each subset comprise a selection of sequences indicated by numbers corresponding to the FragID:s in table 1.
Figure imgf000028_0001
In an embodiment the method 10 comprises determining 120 the methylation status of one or more CpG dinucleotides in a sequence selected from the group of sequences corresponding to the marker panel, resulting in a methylation classification list. There are numerous methods for determining 120 the methylation status of a DNA molecule of a subject, corresponding to the feature subset. The DNA may be obtained by any method for purifying DNA known to a person skilled in the art. In an embodiment the methylation status is determined 110 by means of one or more of the methods selected form the group of, bisulfite sequencing, pyrosequencing, methylation-sensitive single-strand conformation analysis (MS-SSCA), high resolution melting analysis (HRM), methylation-sensitive single nucleotide primer extension (MS- SnuPE), base-specific cleavage/MALDI-TOF, methylation-specific PCR (MSP), microarray-based methods, msp I cleavage.
In an embodiment, the method 10 also comprises statistically analyzing 120 the methylation classification list, thus obtaining a category of the breast cancer of the subject. This may be done by jointly clustering the subject methylation data and the samples from the clinical study. The resulting clustering is then split in N groups (e.g. by cutting the clustering dendrogram into N sub-trees). The sub-tree containing the subject is evaluated for the categories of breast cancer present in the study samples and the subject sample is assigned the category of the majority samples in the sub-tree. In an embodiment, the method 10 further comprises classifying (130) the subject as belonging to one of the five major subtypes of breast cancers.
In an embodiment according to Fig. 6, a computer program product 60 is provided. The computer program product 60 is stored on a computer-readable medium, which comprises a first 61, second 62, third 63 and forth 64 code segments arranged, when run by an apparatus having computer-processing properties, for performing all of the method steps defined in some embodiments.
In an embodiment according to Fig. 7, a device 70 for supporting a clinician is provided. Said device comprising means for selecting 700 a feature subset comprising at least one post from the methylation classification list according to SEQ ID NO. 1 to SEQ ID NO. 600. Furthermore, the device 70 comprises means for determining 710 the methylation status of one or more CpG dinucleotides in DNA of a subject, corresponding to the feature subset. Furthermore, the device 70 comprises means for statistically analyzing 720 the methylation classification list, thus obtaining a category of the breast cancer of the subject. Furthermore, the device 70 comprises means for classifying 730 the subject as belonging to one of the five major subtypes of breast cancers. Said means 700, 710, 720, 730 may be operatively connected to each other.
The invention may be implemented in any suitable form including hardware, software, firmware or any combination of these. However, preferably, the invention is implemented as computer software running on one or more data processors and/or digital signal processors. The elements and components of an embodiment of the invention may be physically, functionally and logically implemented in any suitable way. Indeed, the functionality may be implemented in a single unit, in a plurality of units or as part of other functional units. As such, the invention may be implemented in a single unit, or may be physically and functionally distributed between different units and processors. Although the present invention has been described above with reference to specific embodiments, it is not intended to be limited to the specific form set forth herein. Rather, the invention is limited only by the accompanying claims and, other embodiments than the specific above are equally possible within the scope of these appended claims. In the claims, the term "comprises/comprising" does not exclude the presence of other elements or steps. Furthermore, although individually listed, a plurality of means, elements or method steps may be implemented by e.g. a single unit or processor. Additionally, although individual features may be included in different claims, these may possibly advantageously be combined, and the inclusion in different claims does not imply that a combination of features is not feasible and/or advantageous. In addition, singular references do not exclude a plurality. The terms "a", "an", "first", "second" etc do not preclude a plurality. Reference signs in the claims are provided merely as a clarifying example and shall not be construed as limiting the scope of the claims in any way. LIST OF REFERENCE SIGNS:
10 A method
100 A selecting step
110 A determining step
120 An analyzing step
130 A classifying step
20 A dataset
30 A first feature subset
40 A second feature subset
51 A first cluster
52 A second cluster
53 A third cluster
60 A computer program product
61 A first code segment
62 A second code segment
63 A third code segment
64 A fourth code segment
70 A device
700 Selecing means
710 Determining means
720 Analyzing means
730 Classifying means
1 to 5 Sample numbers

Claims

CLAIMS:
1. Method (10) for the analysis of breast cancer disorders, comprising determining the genomic methylation status of one or more CpG dinucleotides in a sequence selected from the group of sequences consisting of SEQ ID NO. 1 to SEQ ID NO. 600.
2. Method according to claim 1, wherein the analysis is categorization of breast cancer in a subject and wherein the following steps are performed, a. selecting (100) a feature subset comprising at least one post from the methylation classification list according to SEQ ID NO. 1 to SEQ ID NO. 600; b. determining (110) the methylation status of one or more CpG dinucleotides in DNA of a subject, corresponding to the feature subset; and c. statistically analyzing (120) the methylation classification list, thus obtaining a category of the breast cancer of the subject.
3. Method according to claims 1 or 2, wherein additionally following steps are performed, d. classifying (130) the subject as belonging to one of the five major subtypes of breast cancers.
4. Method according to claims 1 to 3, wherin the methylation status is determined (110) for a subgroup of sequences where in the specific subgroup is selected from Table 2, 3A or 3B.
5. Method according to claims 1 to 3, wherein the methylation status is determined (110) for a subgroup of sequences determined by selecting (100) a feature subset.
6. Method according to claim 5, whereinthe feature subset selection (100) is a genetic algorithm with hierarchical clustering.
7. Method according to claims 1 to 3, wherein the methylation status is determined (110) for a subgroup of sequences determined by a summarization of output of feature subset selection (100).
8. Method according to claim 7, wherein the summarization of output of feature subset selection (100) is the count of appearance of each feature in feature subsets and pairwise occurrences of sequences selected from the group of sequences consisting of SEQ ID NO. 1 to SEQ ID NO. 600.
9. Method according to claim 8, wherein the the count of appearance of each feature in feature subsets and pairwise occurrences of sequences are of size 45.
10. Method according to claim 8, wherein the the count of appearance of each feature in feature subsets and pairwise occurrences of sequences are of size 60.
11. Method according to claims 1 to 4, wherein the methylation status is determined (110) by means of one or more of the methods selected form the group of, a. bisulfite sequencing b. pyrosequencing c. methylation-sensitive single-strand conformation analysis(MS-SSCA) d. high resolution melting analysis (HRM) e. methylation-sensitive single nucleotide primer extension (MS-SnuPE) f. base-specific cleavage/MALDI-TOF g. methylation-specific PCR (MSP) h. microarray-based methods and i. msp I cleavage.
12. A computer program product (60) stored on a computer-readable medium comprising software code adapted to perform the steps of the method according to claim 2, 3, 4 or 7 when executed on a data-processing apparatus.
13. A device (70) for supporting a clinician, said device comprising means for a. selecting (700) a feature subset comprising at least one post from the methylation classification list according to SEQ ID NO. 1 to SEQ ID NO. 600; b. determining (710) the methylation status of one or more CpG dinucleotides in DNA of a subject, corresponding to the feature subset; c. statistically analyzing (720) the methylation classification list, thus obtaining a category of the breast cancer of the subject; and d. classifying (730) the subject as belonging to one of the five major subtypes of breast cancers. said means being operatively connected to each other.
PCT/IB2010/050316 2009-01-30 2010-01-25 Methods for the subclassification of breast tumours WO2010086782A2 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
EP10705647A EP2391735A2 (en) 2009-01-30 2010-01-25 Methods for the subclassification of breast tumours
US13/147,105 US20120004118A1 (en) 2009-01-30 2010-01-25 Methods for the subclassification of breast tumours
CN2010800052379A CN102549165A (en) 2009-01-30 2010-01-25 Methods for the subclassification of breast tumours
JP2011547032A JP2012517215A (en) 2009-01-30 2010-01-25 Methods for subclassification of breast tumors
BRPI1005306A BRPI1005306A2 (en) 2009-01-30 2010-01-25 method for breast cancer analysis and computer program product
RU2011135955/10A RU2011135955A (en) 2009-01-30 2010-01-25 METHODS FOR CLASSIFICATION OF BREAST TUMORS WITH DIVIDING

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US14841309P 2009-01-30 2009-01-30
US61/148,413 2009-01-30

Publications (3)

Publication Number Publication Date
WO2010086782A2 true WO2010086782A2 (en) 2010-08-05
WO2010086782A3 WO2010086782A3 (en) 2010-11-25
WO2010086782A8 WO2010086782A8 (en) 2011-12-08

Family

ID=42224291

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2010/050316 WO2010086782A2 (en) 2009-01-30 2010-01-25 Methods for the subclassification of breast tumours

Country Status (8)

Country Link
US (1) US20120004118A1 (en)
EP (2) EP2391735A2 (en)
JP (1) JP2012517215A (en)
KR (1) KR20110113642A (en)
CN (1) CN102549165A (en)
BR (1) BRPI1005306A2 (en)
RU (1) RU2011135955A (en)
WO (1) WO2010086782A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6054750B2 (en) * 2011-01-28 2016-12-27 国立研究開発法人国立がん研究センター Risk assessment method for hepatocellular carcinoma

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110229913B (en) * 2019-07-19 2022-10-11 上海奕谱生物科技有限公司 Broad-spectrum marker for detecting tumor based on methylation level and application thereof
CN116121371A (en) * 2022-07-06 2023-05-16 武汉艾米森生命科技有限公司 Application of reagent for detecting methylation level of target region in TLX1 gene in preparation of esophageal cancer diagnosis product

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050021240A1 (en) 2000-11-02 2005-01-27 Epigenomics Ag Systems, methods and computer program products for guiding selection of a therapeutic treatment regimen based on the methylation status of the DNA

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8076063B2 (en) * 2000-02-07 2011-12-13 Illumina, Inc. Multiplexed methylation detection methods
ATE429511T1 (en) * 2001-03-01 2009-05-15 Epigenomics Ag METHOD FOR DEVELOPING GENE SETS FOR DIAGNOSTIC AND THERAPEUTIC PURPOSES BASED ON THE EXPRESSION AND METHYLATION STATUS OF THE GENES
JP2004033210A (en) * 2002-02-20 2004-02-05 Ncc Technology Ventures Pte Ltd Substance and method relating to diagnosing cancer
US7991557B2 (en) * 2004-06-19 2011-08-02 Genenews Corporation Computer system and methods for constructing biological classifiers and uses thereof
WO2005123945A2 (en) * 2004-06-21 2005-12-29 Epigenomics Ag Epigenetic markers for the treatment of breast cancer
EP2281902A1 (en) * 2004-07-18 2011-02-09 Epigenomics AG Epigenetic methods and nucleic acids for the detection of breast cell proliferative disorders
CA2612021A1 (en) * 2005-06-13 2006-12-28 The Regents Of The University Of Michigan Compositions and methods for treating and diagnosing cancer
WO2007019670A1 (en) * 2005-07-01 2007-02-22 Graham, Robert Method and nucleic acids for the improved treatment of breast cancers
JPWO2007026960A1 (en) * 2005-08-31 2009-03-12 リンク・ジェノミクス株式会社 Therapeutic or diagnostic use of MOCS3 gene
EP2024515B1 (en) * 2006-05-31 2012-08-22 Orion Genomics, LLC Gene methylation in cancer diagnosis
EP2052355A2 (en) * 2006-08-11 2009-04-29 Koninklijke Philips Electronics N.V. Methods and apparatus to integrate systematic data scaling into genetic algorithm-based feature subset selection
EP2097538A4 (en) * 2006-12-07 2011-11-30 Switchgear Genomics Transcriptional regulatory elements of biological pathways, tools, and methods
US20090203011A1 (en) * 2007-01-19 2009-08-13 Epigenomics Ag Methods and nucleic acids for analyses of cell proliferative disorders
CN101861398A (en) * 2007-09-17 2010-10-13 皇家飞利浦电子股份有限公司 The method that is used for analysis of breast cancer disorders
EP2279478A1 (en) * 2008-05-12 2011-02-02 Koninklijke Philips Electronics N.V. A medical analysis system
WO2010070572A1 (en) * 2008-12-18 2010-06-24 Koninklijke Philips Electronics N. V. Method for the detection of dna methylation patterns
US20120004855A1 (en) * 2008-12-23 2012-01-05 Koninklijke Philips Electronics N.V. Methylation biomarkers for predicting relapse free survival
WO2011036604A2 (en) * 2009-09-22 2011-03-31 Koninklijke Philips Electronics N.V. Method and compositions for assisting in diagnosing and/or monitoring breast cancer progression

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050021240A1 (en) 2000-11-02 2005-01-27 Epigenomics Ag Systems, methods and computer program products for guiding selection of a therapeutic treatment regimen based on the methylation status of the DNA

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6054750B2 (en) * 2011-01-28 2016-12-27 国立研究開発法人国立がん研究センター Risk assessment method for hepatocellular carcinoma

Also Published As

Publication number Publication date
WO2010086782A8 (en) 2011-12-08
EP2955235A3 (en) 2016-03-02
EP2391735A2 (en) 2011-12-07
RU2011135955A (en) 2013-03-10
KR20110113642A (en) 2011-10-17
WO2010086782A3 (en) 2010-11-25
EP2955235A2 (en) 2015-12-16
JP2012517215A (en) 2012-08-02
CN102549165A (en) 2012-07-04
US20120004118A1 (en) 2012-01-05
BRPI1005306A2 (en) 2019-03-19

Similar Documents

Publication Publication Date Title
Lee-Six et al. The landscape of somatic mutation in normal colorectal epithelial cells
Quackenbush Microarray analysis and tumor classification
Tam et al. Robust global microRNA expression profiling using next-generation sequencing technologies
US7711492B2 (en) Methods for diagnosing lymphoma types
US20020137086A1 (en) Method for the development of gene panels for diagnostic and therapeutic purposes based on the expression and methylation status of the genes
CN106795562A (en) Tissue methylation patterns analysis in DNA mixtures
EP2050035A2 (en) Methods for identifying, diagnosing, and predicting survival of lymphomas
JP2007502113A (en) Methods and compositions for differentiating tissue or cell types using epigenetic markers
JP2004528837A (en) Method and nucleic acid for analyzing abnormal proliferation of hematopoietic cells
JP2022514879A (en) Cell-free DNA terminal characteristics
CA3178302A1 (en) Methods and systems for detecting colorectal cancer via nucleic acid methylation analysis
Goldsmith et al. The microrevolution: applications and impacts of microarray technology on molecular biology and medicine
JP2014519319A (en) Methods and compositions for detecting cancer through general loss of epigenetic domain stability
US20230178181A1 (en) Methods and systems for detecting cancer via nucleic acid methylation analysis
US20120004118A1 (en) Methods for the subclassification of breast tumours
CN102348809B (en) Methylation biomarkers for predicting relapse free survival
US20140206565A1 (en) Esophageal Cancer Markers
JP2012517215A5 (en)
Rosa et al. Cluster center genes as candidate biomarkers for the classification of Leukemia
US20130090257A1 (en) Pathway analysis for providing predictive information
EP4234720A1 (en) Epigenetic biomarkers for the diagnosis of thyroid cancer
Vaidya et al. A review of bioinformatics application in breast cancer research
Tsai et al. Intelligent DNA methylation biomarker selection for colorectal cancer
Wang et al. Heritable Clustering Algorithms for Recapturing Epigenetic Progression in Breast Cancer
Khalid et al. Microarrays: Rise of Novel Technology

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201080005237.9

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10705647

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 2010705647

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2011547032

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 6015/CHENP/2011

Country of ref document: IN

ENP Entry into the national phase

Ref document number: 20117020012

Country of ref document: KR

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 2011135955

Country of ref document: RU

WWE Wipo information: entry into national phase

Ref document number: 13147105

Country of ref document: US

ENP Entry into the national phase

Ref document number: PI1005306

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20110727