EP2044214A2 - Procédé servant à déterminer le taux de méthylation d'un acide nucléique - Google Patents

Procédé servant à déterminer le taux de méthylation d'un acide nucléique

Info

Publication number
EP2044214A2
EP2044214A2 EP07785943A EP07785943A EP2044214A2 EP 2044214 A2 EP2044214 A2 EP 2044214A2 EP 07785943 A EP07785943 A EP 07785943A EP 07785943 A EP07785943 A EP 07785943A EP 2044214 A2 EP2044214 A2 EP 2044214A2
Authority
EP
European Patent Office
Prior art keywords
nucleic acid
base
sequencing
cytosine
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP07785943A
Other languages
German (de)
English (en)
Inventor
Jörn LEWIN
Ralf Lesche
Matthias Schuster
Dimo Dietrich
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Epigenomics AG
Original Assignee
Epigenomics AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Epigenomics AG filed Critical Epigenomics AG
Priority to EP07785943A priority Critical patent/EP2044214A2/fr
Publication of EP2044214A2 publication Critical patent/EP2044214A2/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6827Hybridisation assays for detection of mutation or polymorphism
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Definitions

  • the invention relates to a method for quantitative sequencing of methylated DNA according to claim 1, to an oligonucleotide according to claim 25, to kits for the realization of these methods according to claims 26 and 26, and to the use of these methods and these kits according to claims 28, 29, and 30.
  • DNA is methylated nearly exclusively at cytosine bases located 5' to guanine in the CpG dinucleotide.
  • This modification has important regulatory effects on gene expression, especially when involving CpG rich areas, known as CpG islands, located in the promoter regions of many genes. While almost all gene-associated islands are protected from methylation on autosomal chromosomes, extensive methylation of CpG islands has been associated with transcriptional inactivation of selected imprinted genes and genes on the inactive X-chromosome of females.
  • genomic DNA is treated with a chemical or enzyme leading to a conversion of the cytosine bases, which consequently allows one to distinguish between methylated and unmethylated cytosine.
  • the most common methods are a) the use of methylation-sensitive restriction enzymes capable of differentiating between methylated and unmethylated DNA and b) treatment with bisulfite.
  • the use of methylation-sensitive restriction enzymes is limited due to the selectivity of the restriction enzyme towards a specific recognition sequence.
  • methylation-specific enzymes As the use of methylation-specific enzymes is dependent on the presence of restriction sites, most methods are based on a bisulfite treatment that is conducted before a detection or amplifying step (for review: DE 100 29 915 Al, page 2, lines 35-46 or the according translated US application 10/311,661; see also WO 2004/067545).
  • the term 'bisulfite treatment' is meant to comprise treatment with a bisulfite, a disulfite or a hydrogensulfite solution.
  • the term "bisulfite” is used interchangeably for "hydrogensulfite”.
  • genomic DNA is isolated, denatured, converted several hours by a concentrated bisulfite solution, and finally desulfonated and desalted (e.g.: Frommer et al. (1992) A genomic sequencing protocol that yields a positive display of 5-methylcytosine residues in individual DNA strands. Proc Natl Acad Sci U. S. A.; 89(5): 1827-1831).
  • the treatment with bisulfite (or similar chemical agents or enzymes) with the effect of altering the base pairing behavior of one type of cytosine specifically, either the methylated or the unmethylated, thereby introducing different hybridization properties makes the treated DNA more applicable to the conventional methods of molecular biology, especially the polymerase-based amplification methods, such as the sequencing method based on Sanger F, Nicklen S, Coulson AR. (1977) DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A, 74(12): 5463-5467).
  • a quantification of the degree of methylation is necessary for different applications, e.g., for classifications of tumors, for prognostic information or for the prediction of drug effects.
  • Different methods are known for the quantification of the degree of methylation, e.g. by Ms-SNuPE, by hybridizations on microarrays, by hybridization assays in solution or with by bisulfite sequencing (for review: Fraga and Estella (2002), Biotechniques 33(3): 632, 634, 636-649.)-
  • a powerful quantification tool based on real time PCR detection is the so called "QM- Assay" described in PCT/EP2005/003793.
  • a tissue sample contains a mixture of different cells. Therefore, a proper description of methylation at a certain CpG site requires quantification of the proportion of the methylated templates at the investigated CpG. This proportion is herein referred to as the "methylation rate" of the CpG.
  • the methylation rate at a CpG can be determined by assessing the proportion of remaining cytosine bases relative to the number of thymine bases. This can be done, e.g. by hybridization to oligomer probes on DNA chips (Adorjan et al., (2002) Tumour class prediction and discovery by microarray-based DNA methylation analysis.
  • the sense strand of a native nucleic acid is the strand that serves as a template for RNA transcription.
  • the anti-sense strand of a native nucleic acid is the strand of the nucleic acid which is complementary to the sense strand and with which it forms a duplex nucleic acid. After bisulfite treatment, these two strands are no longer complementary to each other. Both of them contain low quantities of cytosine bases, since all unmethylated cytosines have been converted into uracils. For this reason, these two strands are referred to as "G- rich strands" (guanosines are more abundant in comparison to cytidines). In contrast to native DNA, these two strands are herein also called "bisulfite sense strands".
  • bisulfite sense strands may act as a template DNA for analyzing the same CpG sites. Accordingly, the strands which are generated during an amplification reaction, and which are complementary to these bisulfite sense strands, are herein referred to as "bisulfite anti- sense strands". Since bisulfite anti-sense strands are complementary to the bisulfite sense strands, they contain low quantities of guanosines but high quantities of cytidines and are therefore also called "C-rich strands".
  • Bisulfite treatment leads to the degradation of the treated nucleic acid. This is especially problematic when using genomic DNA that is already degraded. Body fluids as wells as archived sources, such as formaline-fixed and paraffin embedded tissues are well known to contain degraded DNA.. Because of the degradation by bisulfite treatment and, if the case may be, of the applied genomic DNA, only short fragments of the nucleic acid can be amplified. These short fragments are difficult to sequence because the signal resolution at the beginning of a sequencing read is of poor quality rendering the base caller unable to identify the bases accurately.
  • Cytosine signals (in the bisulfite sense strand) and guanosine signals (in the bisulfite anti-sense strand) are overscaled due to base caller artifacts.
  • the so-called “base caller” is a program of the sequencing machine supplied by the manufacturer, e.g. Applied Biosys- tems “.abi” files or the well-described ".scf files (Dear and Staden, (1992) A standard file format for data from DNA sequencing instruments. DNA Seq., 3: 107-110).
  • cytosine signals are scarce in bisulfite-treated DNA, the remaining cytosine signals are over-scaled by the base caller program, leading to too great a signal for the detected cytosine bases. Therefore, a reliable determination of the methylation rate is not feasible.
  • Bisulfite genomic DNA sequencing offers a continuous readout of the entire, detailed, base-by-base methylation map of a genomic DNA sequence (Frommer et al. (1992) Genomic sequencing protocol that yields a positive display of 5-methylcytosine residues in individual DNA strands. Proc Natl Acad Sci USA 89: 1827-1831 ; R. Feil et al. (1994) Methylation Analysis on Individual Chromosomes - Improved Protocol for Bisulfite Genomic Sequencing. Nucleic Acids Res 22: 695-696). The technique also relies on initial bisulfite modification of DNA and, as a final step, direct cycle sequencing of the resulting PCR- amplified sequence.
  • PCR primers are designed external to potential methylation sites.
  • C in the bisulfite sense strand
  • G in the bisulfite anti-sense strand
  • the problem underlying the present invention was to provide a method for sequencing of methylated DNA that could be performed in an easy and reliable manner to determine the methylation rate of a nucleic acid. Furthermore, a kit was to be supplied with which the method according to the present invention could be realized.
  • the central idea of the invention is to provide an artificially introduced base into the nucleic acid to be analyzed, that provides for a reference signal when the nucleic acid to be analyzed is sequenced. This reference signal can then be used in a normalization procedure for all the cytosine (or guanine) signals stemming from the nucleic acid to be ana- lyzed that were obtained in the sequencing reaction.
  • This newly developed data analysis method allows the use of established high-throughput sequencing technology for methylation studies using only one sequencing dye. It is furthermore useful for sequencing a target DNA molecule to be analyzed in a pool of different DNA molecules.
  • Disclosed is a method for determining the methylation rate of a nucleic acid through se- quencing.
  • a template DNA that is to be analyzed with respect to its methylation rate is treated with at least one chemical reagent or with at least one solution containing at least one enzyme, whereby the base pairing behavior of methylated cytosine bases and/or unmethylated cytosine bases of the nucleic acid are altered such that methy- lated cytosine bases become distinguishable from unmethylated cytosine bases in terms of their hybridization properties, i.e. base pairing properties.
  • the nucleic acid with the introduced at least one reference base is sequenced, a signal from each cytosine base of the nucleic acid, or a signal from each guanine base of the nucleic acid, and a reference signal from the at least on introduced base is obtained.
  • the signal obtained from each cytosine base of the nucleic acid is normalized to the reference signal from the at least one introduced base.
  • the signal obtained from each guanine base of the nucleic acid is normalized to the reference signal from the at least one introduced base.
  • a signal stemming from sequencing the introduced base is used as a reference signal for normalization of all the signals stemming from the cytosine (or guanine) bases of the nucleic acid to be analyzed.
  • the first two steps mentioned above can be performed also in reverse order, that is first introducing the reference signal into the nucleic acid to be analyzed and then treating, i.e. converting the nucleic acid such that methylated cytosine bases are distinguishable form unmethylated cytosine bases, if the introduced reference base will not be converted itself during treatment. This is the case in particular for guanine and for certain base analogs, which do not occur naturally.
  • the method according to the present invention is useful for analyzing the methylation rate of nucleic acid samples, i.e. determining to what percentage a certain CpG position of a nucleic acid population has been methylated.
  • the method can be performed using only one sequencing dye, namely for the cytosine (or guanine) signal trace, since the introduced base can also detected using the same sequencing dye.
  • the method is further advantageous, because it can be successfully used on short nucleic acid molecules, particularly those nucleic acid molecules that have been partially degraded.
  • degraded nucleic acids can be found in body fluid or in formalin-fixed tissue sam- pies.
  • the method of the invention is further advantageous with respect ot the prior art, because it is able to directly interprete raw sequencing data without any pre-processing for example by means of a base caller algorithm.
  • Pre-processing algorithm are applied to most sequenc- ings by default. This is in particular done in order to be able to compare the signal derived from one base with the signals derived from the other bases.
  • a preprocessing is necessary wherein the signals derived for at least two bases are regarded.
  • preprocessing is not necessary according to the invention, because only the signals of either cytosine or guanine are considered.
  • the methods of the state of the art are based on a comparison of at the least the cytosine signal with the thymine signal.
  • a cytosine base or guanin base is introduced, but however this is only been done in order to achieve an accurate preprocessing by an preprocessing algorithm (base caller).
  • base caller base caller
  • methylation is determined from the signals of cytosine and thymine.
  • Lewin et al. methylation is determined from the cytosine and thymine signal.
  • the normalization value according to Lewin et al. is determined by combining the signals of cytosine and thymine at CpG positions.
  • Lewin et al. discloses also an embodiment which is based on the consideration of only one base i.e. the signals derived from thymine, this embodiment allows only the direct detection of non- methylation. Thereby a normalization is carried out against the signal of thymine at positions wherein thymine occurs in genomic DNA or wherein cytosines outside of CpG positions are converted to thymine.
  • this embodiment of Lewin et al. has the disadvantage that methylation is only determinable indirectly.
  • the method of the invention has the advantage that normalization occurs with a reference signal of the same base (i.e. the introduced base(s)) as the analyzed signals (i.e. signals of cytosines or guanins). This excludes that signal noise leads an adulteration of re- suits.
  • the normalized signals at each position at which a cytosine (or guanine) signal was detected can also be integrated to obtain a normalized area of the signal.
  • the said normalized area is a more robust value of the methylation rate than one would get when determining the peak heights alone because it is based on a plurality of measured values. In either case, the methylation rate of a nucleic acid can be determined quantitatively through sequencing.
  • the height of signals (peak maxima) are used for normalization.
  • the area of signals are used for normalization.
  • the methylation rate of an analyzed nucleic acid thereby obtained can be further corrected by comparing the normalized signals (areas or heights) with normalized signals (areas or heights, respectively) of analyzed standard nucleic acids with known methylation rates (calibration). This way, one can determine the percentage of methylation at a defined position, yielding the methylation index.
  • the at least one introduced base is at least one artificial base analog, and/or at least one cytosine base, and/or at least one guanine base.
  • a base analog is a base that is not naturally occurring, and that differs from the naturally occuring A, C, G, and T.
  • Such base analogs are well known in the art and for example but not limited to described in Sismour and Brenner 2005, Johnson et al. 2004, Yang et al. 2006 or Brenner and Sismour 2005 (Sismour AM and Benner SA (2005) The use of thymidine analogs to improve the replication of an extra DNA base pair: a synthetic biological system. Nucleic Acids Res. 33(17): 5640-5646; Johnson SC, Sherrill CB, Marshall DJ, Moser MJ, Prudent JR. (2004) A third base pair for the polymerase chain reaction: inserting isoC and isoG.
  • the base analog is isocytosine, isoguanine, 2-thiothymidine, 6-amino-5-nitro-3-(r-beta-D-2'-deoxyribofura- nosyl)-2(lH)-pyridone (dZ), or 2-amino-8-(r-beta-D-2'-deoxyribofuranosyl)-imidazo[l,2- a]-l,3,5-triazin-4(8H)-one (dP).
  • this method can be used on a nucleic acid that has been treated with a chemical reagent or with a solution containing at least one enzyme, whereby the base pairing behavior of methylated cytosine bases and/or unmethylated cytosine bases of the nucleic acid are altered such that methylated cytosine bases become distinguishable from unmethylated cytosine bases in terms of their hybridization properties.
  • this method can also be applied on a nucleic acid which is reverse complementary in sequence to the treated nucleic acid and was generated by an amplification reaction, e.g. by a polymerase chain reaction (PCR). It can be necessary, however, to introduce a different at least one base for each particular nucleic acid.
  • PCR polymerase chain reaction
  • cytosine is to be introduced into the G-rich strand (bisulfite sense strand) according to the present method and a guanine is to be introduced into the C- rich strand (bisulfite anti-sense strand).
  • Base analogs can be used regardless of the base composition of the nucleic acid.
  • the at least one introduced base is introduced into the nucleic acid as part of an at least one nucleotide.
  • Introduction into the nucleic acid is easiest if the at least one nucleotide is itself part of an at least one oligonucleotide, which then comprises at least one base analog, and/or at least one cytosine base, and/or at least one guanine base to generate a reference signal in the sequencing procedure.
  • the at least on oligonucleotide comprises only one reference base, best results are achieved if the at least one oligonucleotide comprises two to four, most preferably three introduced bases for generating a sequencing signal to be used as a reference signal.
  • the at least one oligonucleotide comprises three cytosine bases or base analogs for sequencing the bisulfite sense strand or three guanine bases or base analogs for sequencing the bisulfite anti-sense strand.
  • a greater number of reference signals allows for choosing an appropriate signal to be used in the normalization calculation.
  • a mean reference value can be calculated from all reference signals stemming from the at least one oligonucleotide, and this mean reference value can then be used to normalize the signals stemming from the nucleic acid to be analyzed.
  • the nucleic acid to be analyzed is enzymatically converted.
  • cytidin deaminases These enzymes convert unmethylated cytosine faster than methylated cytosine (Bransteitter et al.: Activation-induced cy-tidine deaminase deaminates deoxycytidine on single-stranded DNA but requires the action of RNase. Proc Natl Acad Sci U S A. 2003 Apr l;100(7):4102-7; or WO 2005/005660).
  • the nucleic acid to be analyzed is incubated with a bisulfite containing solution, whereby unmethylated cytosine bases of the nucleic acid are converted into sulfon-uracil bases or uracil bases while 5-methylcytosine bases remain unchanged, yielding a G-rich nucleic acid.
  • a conversion into uracil bases occurs, wherein the bisulfite conversion com- prises the desulfonation of the treated nucleic acid by increasing the pH.
  • a conversion into sulfon-uracil bases occurs, wherein the bisulfite conversion is free of a desulfonation step. Desulfonation is then carried out subsequently by increasing the temperature for example before or in the sequencing reaction (WO 2006/040187).
  • bisulfite reagent refers to a reagent comprising bisulfite, disulfite, hydrogen sulfite or combinations thereof, useful as disclosed herein to distinguish between methylated and unmethylated CpG dinucleotide sequences. Methods of said treatment are known in the art (e.g. PCT/EP 2004/011715).
  • the bisulfite treatment is conducted in the presence of denaturing solvents, such as, but not limited to, n-alkylenglycol, particularly diethylene glycol dimethyl ether (DME), or in the presence of dioxane or dioxane derivatives.
  • denaturing solvents such as, but not limited to, n-alkylenglycol, particularly diethylene glycol dimethyl ether (DME), or in the presence of dioxane or dioxane derivatives.
  • the denaturing solvents are used in concentrations between 1 % and 35 % (v/v).
  • the bisulfite reaction is carried out in the presence of scavengers such as, but not limited to, chromane derivatives, e.g. 6-hydroxy-2,5,7,8-tetramethylchromane 2- carboxylic acid or trihydroxybenzoe acid and derivates thereof, e.g.
  • the bisulfite conversion is preferably carried out at a reaction tem- perature between 30 0 C and 70 °C, whereby the temperature is increased to over 85 °C for short periods of times during the reaction (see: PCT/EP2004/011715).
  • the bisulfite-treated DNA is preferably purified prior to the quantification. This may be conducted by any means known in the art, such as, but not limited to, ultrafiltration, preferably carried out by means of MicroconTM columns (manufactured by MilliporeTM). The purification is carried out according to a modified manufacturer's protocol (see: PCT/EP2004/011715).
  • the bisulfite reaction is carried out in the presence of scavengers such as but not limited to chromane derivatives, e.g., 6-hydroxy-2,5,7,8-tetramethylchro- mane 2-carboxylic acid or trihydroxybenzoe acid and derivates thereof, e.g. Gallic acid (see: PCT/EP2004/011715 which is incorporated by reference in its entirety).
  • scavengers such as but not limited to chromane derivatives, e.g., 6-hydroxy-2,5,7,8-tetramethylchro- mane 2-carboxylic acid or trihydroxybenzoe acid and derivates thereof, e.g. Gallic acid
  • the bisulfite conversion is preferably carried out at a reaction temperature between 30 0 C and 70 °C, whereby the temperature is increased to over 85 °C for short periods of times during the reaction (see: PCT/EP2004/011715 which is incorporated by reference in its entirety).
  • the bisulfite-treated DNA is preferably purified priori to the quantification. This may be con- ducted by any means known in the art, such as but not limited to ultrafiltration, preferably carried out by means of MicroconTM columns (manufactured by MilliporeTM). The purification is carried out according to a modified manufacturer's protocol (see: PCT/EP2004/011715).
  • the at least one base can be conveniently introduced into the nucleic acid through two possible ways, namely through an amplification reaction or a ligation reaction.
  • the bisulfite treated nucleic acid is amplified using at least one oligonucleotide that comprises at least one cytosine base and/or at least one base analog, whereby the at least one oligonucleotide is incorporated into a G-rich nucleic acid.
  • the at least one oligonucleotide may further comprises a sequencing domain for hybridization of a sequencing primer, wherein the sequencing domain is located on the 5' side of the at least one cytosine base and/or of the at least one base analog, enabling the hybridization of a sequencing primer to the amplification product.
  • the bisulfite treated nucleic acid is amplified using at least one oligonucleotide that comprises at least one guanine base and/or at least one base analog, whereby the at least one oligonucleotide is incorporated into a C-rich nucleic acid.
  • the at least one oligonucleotide can further comprise a sequencing domain for hybridization of a sequencing primer, with the sequencing domain located on the 5' side of the at least one guanine base and/or of the at least one base analog.
  • the at least one oligonucleotide serves as a primer for the amplification reaction.
  • the at least one cytosine or guanine base can be located at the 5' end of the oligonucleotide, and on the 3' side of that region a nucleotide sequence is located which hybridizes with the nucleic acid to be analyzed.
  • the oligonucleotide has a sequence part at the 5' tail end that comprises at least one cytosine and/or at least one guanine base and/or at least one base analog, which will generate a sequencing reference signal, and a 3' end that hybridizes with the nucleic acid to be analyzed and primes the nucleic acid amplification reaction. It is preferred that the 5' end of the oligonucleotide that serves as a primer in the amplification reaction comprises an asymetric sequence. This has the advantage that the occurance of primer dimers is avoided.
  • the said 5' end com- prises the sequence "CC", “CCC”, “CCCC”, “CCCCC”, or more than five cytosines in a row, or combinations of said sequences.
  • the said 5' end comprises the sequence "GG”, “GGG”, “GGGG”, “GGGGG”, or more than five guanins in a row, or combinations of said sequences.
  • the said 5' end comprises the sequence of two , three, four, five or more base analogs in a row, or combinations of said sequences.
  • the said sequence comprises, contains or is the sequence ACTCC (for the bisulfite sense strand) or AGGTG (for the bisulfite anti-sense strand). It is particularly also preferred that the said sequence compises, contains or is the sequence CGTCGTCG.
  • the at least one cytosine base, guanine base, or base analog which generates the sequencing reference signal can be embedded within a portion of the at least one oligonucleotide that hybridizes (is complementary) to the strand of the nucleic acid to be analyzed.
  • the at least one cytosine or guanine base can hybridize with a guanine or (methylated) cytosine base of the opposite strand, respectively, or can mismatch with a base on the opposite strand.
  • a base analog as an introduced base, a non Watson-Crick base pairing will occur.
  • At least one or preferably at least two nucleotides should be located on the 5' and 3' side of the corresponding nucleotide that generates a reference signal during the sequencing reaction to allow for an enzyme to bind efficiently and catalyze an amplification reaction.
  • the at least one sequence part that hybridizes with a sequence of the nucleic acid to be amplified of the at least one oligonucleotide has a length of between 10 nucleotides (nt) to 40 nt, preferably between 15 nt to 30 nt, and more preferably between 18 nt to 25 nt.
  • the primer that comprises the at least one guanine base or the at least one cytosine base or the at least one base analog further comprises a sequencing domain.
  • This sequencing domain will allow a sequencing primer to hybridize and allow sequencing of the amplified nucleic acid.
  • the sequencing domain is located on the 5' side of the at least one cytosine or guanine base, so that the reference signals used for normalization are also sequenced.
  • This embodiment has the advantage that the normalization signal is located at the beginning of a sequencing read, independent of the sequence of the analyzed nucleic acid and is therefore easily automatically detectable.
  • Base analogs can be advantageously used within an at least one oligonucleotide used as a primer for an amplification reaction.
  • a primer can be used that contains e.g. at least one iso-cytosine (isoC) or at least one iso-guanine (isoG).
  • ddNTP dye-labeled dideoxyribonucleoside triphosphates
  • ddisoCTP in case of an iso-guanine containing primer in the amplification reaction
  • ddisoGTP in case of an iso-cytosine containing primer in the amplification reaction
  • dNTP nucleotide
  • both primers for amplify- ing the sense and the anti-sense strand, respectively
  • an enzymatic amplification reaction is performed using a first oligonucleotide comprising at least one cytosine base (for amplifying the bisulfite sense strand), and a second oligonucleotide comprising at least one guanine base (for amplifying the bisulfite anti-sense strand).
  • At least one of these two primers, that is the first or the second oligonucleotide further comprises a sequencing domain.
  • This sequencing domain will allow a sequencing primer to hybridize and allow sequencing of the amplified nucleic acid.
  • the sequencing domain is located on the 5' side of the at least one cytosine or guanine base, so that the reference signals used for normalization are also sequenced.
  • This embodiment has advantages for x-axis normalization, as will be discussed below.
  • the oligonucleotides used as primers will be chosen such that they amplify a fragment of interest. It is particularly preferred that these oligonucleotides are designed to amplify a nucleic acid fragment of a template nucleic acid sample by means of a polymerase reaction, in particular a polymerase chain reaction (PCR), as known in the art.
  • the oligonucleotides are therefore designed to anneal to the template nucleic acids, to form a double strand, following the Watson-Crick base pairing rules (with the exception of introducing mismatching bases into the at least one oligonucleotide, as mentioned above).
  • the length of the two oligonucleotides used in one amplification reaction will be selected such that they anneal at approximately the same temperature.
  • the PCR can also be performed using oligonucleotides (primers) without at least one introduced base.
  • an appropriate oligonucleotide providing a reference sequencing signal can be introduced into the nucleic acid after the amplification reaction, using ligation.
  • the following embodiments of the invention make use of a ligation reaction to introduce the at least one base into the nucleic acid.
  • the at least one oligonucleotide comprises at least one cytosine base and/or at least one base analog for sequencing a G-rich strand of the nu- cleic acid, or at least one guanine base and/or at least one base analog for sequencing a C- rich strand of the nucleic acid.
  • the at least one oligonucleotide can be hybridized with a reverse complementary oligonucleotide to form a double-stranded nucleic acid.
  • This double-stranded nucleic acid is then ligated to the nucleic acid which can then be analyzed through sequencing.
  • the ligation step can be performed such that both the oligonucleotide and the nucleic acid to be analyzed are blunt ended. However, it is preferred that ligation is performed between molecules with sticky ends. Ways of performing the ligation reaction are known to a person skilled in the art.
  • the at least one oligonucleotide comprises a sequencing domain for hybridization of a sequencing primer, wherein the sequencing domain is located on the 5' side of the at least one cytosine base and/or at least one base analog or guanine base and/or at least one base analog.
  • This sequencing domain will allow a sequencing primer to hybridize to the at least one oligonucleotide and allow sequencing of the amplified nucleic acid.
  • the sequencing domain of the at least one oligonucleotide is located on the 5' side of the at least one cytosine or guanine base, so that the at least one reference signal used for normalization is sequenced together with the nucleic acid.
  • sequencing reaction of the amplified nucleic acid is performed using cycle sequencing, as known in the art.
  • the enzyme-based amplification reaction is started by heat-activation, that is, by a brief incubation at an increased temperature, which activates the enzymatic activity.
  • heat-activation that is, by a brief incubation at an increased temperature, which activates the enzymatic activity.
  • a heat stable enzyme is preferred.
  • the enzyme-based amplification reaction is a polymerase-based amplification reaction, in particular a polymerase chain reaction (PCR), it is preferred that the enzyme is a heat stable polymerase.
  • PCR polymerase chain reaction
  • other enzymatic amplification reactions known to the person skilled in the art, including, but not limited to, ligase- mediated amplifications (e.g. Ligase-Chain Reaction) or amplifications based on transcrip- tion (e.g. NASB ATM, 3SRTM, TMATM).
  • the sequencing reaction is preferably performed using one kind of labeled dideoxyribonu- cleoside triphosphates that forms base pairs with either the cytosine bases of the nucleic acid and the introduced base, or the guanine bases of the nucleic acid and the introduced base. This way, the method can be performed using only one sequencing dye and signals from other bases than from the bases of interest (guanine or cytosine) need not be recorded.
  • the at least one signal stemming from the at least one introduced base of the at least one oligonucleotide is identified. If more than one base was in- troduced into the nucleic acid, an appropriate signal has to be chosen to be used in the normalization calculation. Alternatively, a mean reference value can be calculated from all reference signals, and this mean reference value can then be used to normalize the signals stemming from the nucleic acid to be analyzed.
  • the sequencing signal applied to the method of the invention is either unprocessed (raw data) or preprocessed (for example by a base caller algorithm).
  • the sequencing signal is applied to the method of the invention as raw data.
  • the known algorithms for preprocessing of sequencing data are all based on certain assumptions about the frequencies of occurrence of the four bases A, T, C, G. A completely correct preprocessing would only be possible if the exact frequency of each of the four bases would already been known prior sequencing. Because this is not the case for methylation analysis by DNA conversion and subsequent sequencing (the frequencies or occurance of cytosine is the subject matter of the analysis), any methylation analysis, wherein a pre-processing algorithm (e.g. base caller algorithm) is used is error-associated.
  • a pre-processing algorithm e.g. base caller algorithm
  • pre-processed sequencing signals it is also possible and herewith preferred to apply the pre-processed sequencing signals to the method of the invention.
  • sequencing is performed as an auto- mated process, wherein a pre-processing is applied by default. Therefore the use of pre- processed sequencing data has the advantage, that the method of the invention can be directly applied without the need to change the sequencing process. This may lead to error- associated results, wherein the assumpted frequencies of base occurance differ from the actual frequencies, in particular in cases wherein the sequencing is performed in order to quantitatively determine methylation. A qualitative determination may be uneffected i.e. correct.
  • the sequencing signal is usually represented by a time-dependent intensity curve.
  • the area under the sequencing curve is determined for each signal stemming from a cytosine base or a guanine base of the nucleic acid and for each signal stemming from the at least one introduced base, to yield area values. Normalization is performed for each signal obtained by dividing the area value of each signal stemming from a cytosine base or a guanine base of the nucleic acid by the area value of the signal from the at least one introduced base (or, in case of more than one introduced bases, of the chosen reference signal or of the calculated mean reference signal).
  • the height of sequencing curve peaks is determined for each signal stemming from a cytosine base or a guanine base of the nucleic acid and for each signal stemming from the at least one introduced base, to yield height values. Normalization is performed for each signal obtained by dividing the height value of each signal stemming from a cytosine base or a guanine base of the nucleic acid by the height value of the signal from the at least one introduced base (or, in case of more than one introduced bases, of the chosen reference signal or of the calculated mean reference signal).
  • peaks can be narrower or broader with each run. This can be compensated for by the reference signal, if the determination of the peak area is per- formed using integration of the entire peak. It is, however, also possible to determine the area of a peak by determining the maximum height of a peak together with a certain number of measured values before and after this peak on the time line, e.g. to take 15 measured values before and after the peak value. The area determined this way might be smaller than the actual peak area. Therefore, area values of broader peaks might be smaller than those of narrower peaks. It is thus important to also normalize for the length of the peaks (x-axis normalization). In addition, this x-axis normalization enables an automated analysis, since signals of the methylated cytosine sites occur at the same point on the normalized time line (when analyzing the same nucleic acid sequence).
  • X-axis normalization can be performed by using two points on the time line that are far apart and normalizing the distance between these two points.
  • the nucleic acid that was sequenced comprises bases generating reference signals both on the 5' and 3' end of the nucleic acid (as described above), then these reference signals can be used for normalization.
  • the two signals necessary for x-axis normalization can be generated by running two internal size standards parallel with the sequencing reaction on the same gel.
  • these standards bear the same dye as the sequencing reaction of the base in question, so that the method according to the invention can be performed using only one dye.
  • the in- ternal standards could be of 220 nt and 250 nt.
  • the internal standards therefore do not interfere with the sequencing reaction but will run more slowly than the sequencing reaction products on a sequencing gel.
  • the nucleic acid is genomic DNA, which may be isolated and/or denatured and therefore present as single strands. It is also possible to use DNA from other sources, such as synthesized DNA that does not stem from a natural source.
  • the present invention also comprises an oligonucleotide (primer) for amplifying a nucleic acid.
  • an oligonucleotide comprises a first sequence part that is reverse complemen- tary to the nucleic acid to be amplified and serves for initiating an amplification reaction, a second sequence part that contains at least one base for generating a sequencing signal to be used for the normalization of sequencing signals stemming from the nucleic acid, and a third sequence part for the hybridization of a sequencing primer.
  • the second and third sequence part are not identical. Therefore, the at least one base for generating a sequencing signal will be sequenced before the nucleotides of interest of the nucleic acid to be analyzed, which simplifies the determination of the methylation rate.
  • the first sequence part is located at the 3' end of the oligonucleotide, followed, in 3' to 5' direction, by the second and then the third sequence parts. It is furthermore preferred that the at least one base of the second sequence part is a cytosine base, and/or a guanine base, and/or a base analog.
  • the present invention also comprises a kit, in particular a test kit for the realization of the method described above.
  • a test kit comprises a chemical reagent, particularly bisul- fite, or an enzyme which alters the base pairing behavior of methylated cytosine bases and/or unmethylated cytosine bases of the nucleic acid such that methylated cytosine bases become distinguishable from unmethylated cytosine bases, and at least one oligonucleotide that comprises at least one cytosine base and/or at least one guanine base and/or at least one base analog, and an enzymatic activity for amplifying a nucleic acid using the at least one oligonucleotide as a primer and/or an enzymatic activity for ligating the at least one oligonucleotide to a nucleic acid.
  • the at least one oligonucleotide of the test kit enables the correct and simple determination of the methylation rate by introducing at least one cytosine base, at least one guanine base, or at least one base analog that generates a sequencing signal which can serve as a reference signal for the normalization of the cytosine (when sequencing the bisulfite sense strand) or guanine signals (when sequencing the bisulfite anti-sense strand) of the nucleic acid that is analyzed.
  • the properties of the at least one oligonucleotide are the same as de- scribed above with reference to the method according to the present invention.
  • test kits may further comprise one or more of the additional components, such as:
  • one or more denaturing reagent and/or solution for example: dioxane or diethylene gly- col dimethylether (DME) or any substance, which is suitable as described in WO
  • scavenger for example 6-hydroxy-2,5,7,8-tetramethylchromane 2-carboxylic acid or other scavengers as described in WO 01/98528 or WO 05/038051,
  • reaction buffers which are suitable for a bisulfite treatment and/or a PCR reaction
  • nucleotides which can be dATP, dCTP, dTTG, dUTP and dGTP or any derivative of these nucleotides, - MgCl 2 as a substance or in solution and/or any other magnesium salt, which can be used to carry out a DNA polymerase replication,
  • DNA polymerase for example Taq polymerase or any other polymerase with or without proof-reading acitivity, - any reagent, solution, device and/or instruction which is useful for realization of a method according to the invention.
  • the method and test kit disclosed here are preferably used for the diagnosis and/or progno- sis of adverse events for patients or individuals, whereby diagnosis means diagnose of an adverse event, a predisposition for an adverse event and/or a progression of an adverse event.
  • adverse events belong to at least one of the following categories: undesired drug in- teractions, cancer diseases, CNS malfunctions, damage or disease, symptoms of aggression or behavioral disturbances, clinical, psychological and social consequences of brain damage, psychotic disturbances and personality disorders, dementia and/or associated syndromes, cardiovascular disease, malfunction or damage, malfunction, damage or disease of the gastrointestinal tract, malfunction, damage or disease of the respiratory system, lesion, inflammation, infection, immunity and/or convalescence, malfunction, damage or disease of the body as an abnormality in the development process, malfunction, damage or disease of the skin, of the muscles, of the connective tissue or of the bones, endocrine and metabolic malfunction, damage or disease, headaches or sexual malfunction.
  • the method and test kits also serve for distinguishing cell types and tissues or for investigating cell differentiation. They also serve for analyzing the response of a patient to a drug treatment.
  • the method and test kit of the invention can also be used to determine the DNA methyla- tion rate in that positions are methylated or non-methylated compared to normal conditions if a single defined disease exists.
  • they can serve for identifying an indication-specific target, wherein a template nucleic acid is treated according to the method of the present invention, and wherein an indication-specific target is defined as differences in the DNA methylation rate of a DNA derived from a diseased tissue in com- parison to a DNA derived from a healthy tissue.
  • the tissue samples can originate from a patient with the single defined disease and from a healthy individual. They can also originate from one patient with the single defined disease diseased only, in which case DNA from the pathological tissue will be compared to DNA from healthy tissue that was ob- tained from adjacent to the sick tissue of the patient (so-called adjacent analogous normal tissue).
  • DNA stemming from a healthy individual and an individual with a single defined disease will be analyzed with respect to its methylation rate at particular CpG sites.
  • the results are then compared to each other with the goal of identifying CpG positions in genomic DNA that allow for the diagnosis of the single defined disease in a patient and/or that allow for the prediction of likelihood of an individual becoming ill with the single defined disease and/or that allow for the prediction of likelihood of an individual surviving with the single defined disease.
  • the method and test kit of the invention can serve for identifying an indication-specific target, wherein a template nucleic acid is treated according to the method of the present invention, and wherein an indication-specific target is de- fined as differences in the DNA methylation rate of a DNA derived from a diseased tissue in comparison to a DNA derived from a healthy tissue.
  • tissue samples can originate from diseased or healthy patients or from diseased or healthy adjacent tissue of the same patient.
  • the sample nucleic acid can be obtained from serum or other body fluids of an individual. They can, in particular, be obtained from cell lines, tissue embedded in paraffin, such as tissue from eyes, intestine, kidneys, brain, heart, prostate, lungs, breast or liver, histological slides, body fluids and all possible combinations thereof.
  • body fluids is meant to comprise fluids such as whole blood, blood plasma, blood serum, urine, sputum, ejaculate, semen, tears, sweat, saliva, lymph fluid, bronchial lavage, pleural effusion, peritoneal fluid, meningal fluid, amniotic fluid, glandular fluid, fine needle aspirates, nipple aspirate fluid, spinal fluid, conjunctival fluid, vaginal fluid, duodenal juice, pancreatic juice, bile, stool and cerebrospinal fluid. It is especially preferred that said body fluids are whole blood, blood plasma, blood serum, urine, stool, ejaculate, bronchial lavage, vaginal fluid and nipple aspirate fluid.
  • the present invention can furthermore be used to determine methylation patterns of cells and tissues, both healthy and sick.
  • the nucleic acid according to the invention is genomic DNA.
  • Subject matter of the invention is a method for determining the methylation rate of a nu- cleic acid through sequencing, comprising the steps of:
  • the at least one introduced base is: at least one base analog, and/or at least one cytosine base, and/or at least one guanine base.
  • the at least one introduced base is introduced into the nucleic acid as an at least one nucleotide.
  • the at least one nucleotide is introduced into the nucleic acid as an at least one oligonucleotide.
  • the at least one oligonucleotide comprises two to four, most preferably three bases for generating a sequencing signal.
  • the nucleic acid is treated with a bisulfite containing solution, whereby unmethylated cytosine bases of the nucleic acid are converted into sulfon-uracil bases or uracil bases whereas methylated cytosine bases remain unchanged.
  • the at least one base is introduced into the nucleic acid through an amplification reaction and/or a ligation reaction.
  • the bisulfite treated nucleic acid is amplified using at least one oligonucleotide that comprises at least one cytosine base and/or at least one base analog, whereby the at least one oligonucleotide is incorporated into a G-rich nucleic acid.
  • the at least one oligonucleotide further comprises a sequencing domain for hybridization of a sequencing primer, wherein the sequencing domain is located on the 5' side of the at least one cytosine base and/or of the at least one base analog.
  • the bisulfite treated nucleic acid is amplified using at least one oligonucleotide that comprises at least one guanine base and/or at least one base analog, whereby the at least one oligonucleotide is incorporated into a C-rich nucleic acid.
  • the at least one oligonucleotide further comprises a sequencing domain for hybridization of a sequencing primer, wherein the sequencing domain is located on the 5' side of the at least one guanine base and/or of the at least one base analog.
  • At least one oligonucleotide serves as a primer for the amplification reaction.
  • the at least one oligonucleotide comprises the at least one introduced base in a sequence part of the oligonucleotide that does not hybridize with the nucleic acid.
  • the sequence part of the oligonucleotide that does not hybridize with the nucleic acid is localized at the 5' end of the oligonucleotide.
  • the at least one oligonucleotide comprises the at least one in- troduced base within a sequence part of the at least one oligonucleotide that hybridizes with the nucleic acid.
  • sequence part of the at least one oligonucleotide is reverse complementary to the sequence of the nucleic acid.
  • the sequence part of the at least one oligonucleotide that hybridizes with a sequence of the nucleic acid has a length of between 15 to 30 nucleotides.
  • the amplification reaction is mediated by a polymerase, pref- erably by a heat stable polymerase.
  • the at least one oligonucleotide is introduced into the nucleic acid through ligation, with the at least one oligonucleotide comprising:
  • the at least one oligonucleotide comprises a sequencing do- main for hybridization of a sequencing primer that is located on the 5' side of the at least one cytosine base and/or at least one base analog or guanine base and/or at least one base analog.
  • the sequencing reaction is performed using one kind of labeled dideoxyribonucleoside triphosphates that forms base pairs with either
  • the at least one signal stemming from the at least one introduced base is identified.
  • the area under the sequencing curve is determined for each signal stemming from a cytosine base or a guanine base of the nucleic acid and for each signal stemming from the at least one introduced base, to yield area values.
  • normalization occurs either by dividing the area value of each signal stemming from a cytosine base or a guanine base of the nucleic acid by the area value of the signal from the at least one introduced base, or by dividing the height value of each signal stemming from a cytosine base or a guanine base of the nucleic acid by the height value of the signal from the at least one introduced base.
  • the percentage of methylation of a specific position is obtained by calibrating the normalized signal of the nucleic acid to be analyzed against the normalized signal of a reference nucleic acid.
  • the nucleic acid is genomic DNA.
  • Subject matter of the invention is also a method for determining the clonality of a sample, comprising a) isolating genomic DNA from a sample; b) submitting the isolate genomic DNA to the method of claim 1 , wherein normalized signals for cytosine bases or guanin bases are obtained, c) deducing that a sample is of monoclonal origin wherein only cytosine bases or guanine bases are detected that are specific for either the maternal chromosome or the paternal chromosome; or deducing a sample is of polyclonal origin, wherein cytosine bases or guanine bases are detected that are specific for the maternal as well the paternal chromosome.
  • An additional subject matter of the invention is also an oligonucleotide for amplifying a nucleic acid, comprising
  • oligonucleotide is suitable for ampli- f ⁇ ction.
  • a further subject matter of the invention is also a kit for the realization of the method of the invention, with the following components:
  • a chemical reagent or an enzyme which alters the base pairing behavior of methylated cytosine bases and/or unmethylated cytosine bases of the nucleic acid such that methylated cytosine bases become distinguishable from unmethylated cytosine bases
  • At least one oligonucleotide that comprises at least one cytosine base and/or at least one guanine base and/or at least one base analog
  • a preferred kit is a kit, wherein the chemical reagent is bisulfite.
  • a preferred kit is a kit, wherein at least one oligonucleotide is an oligonucleotide comprising a first sequence part that is reverse complementary to the nucleic acid to be amplified for initiating an amplification reaction, - a second sequence part that contains at least one base for generating a sequencing signal to be used for the normalization of sequencing signals stemming from the nucleic acid, and a third sequence part for the hybridization of a sequencing primer.
  • the second and third sequence part are not identical.
  • said at least one oligonucleotide is suitable for amplifiction.
  • Subject matter of the invention is also the use of the method of the invention or of an oli- gonucleotide according to the invention or of a kit according to the invention for diagnosis and/or prognosis of adverse events for patients or individuals, whereby these adverse events belong to at least one of the following categories: undesired drug interactions; cancer diseases; CNS malfunctions; damage or disease; symptoms of aggression or behavioral disturbances; clinical; psychological and social consequences of brain damages; psychotic disturbances and personality disorders; dementia and/or associated syndromes; cardiovascular disease, malfunction or damage; malfunction, damage or disease of the gastrointestinal tract; malfunction, damage or disease of the respiratory system; lesion, inflammation , infection, immunity and/or convalescence; malfunction, damage or disease of the body as an abnormality in the development process; malfunction, damage or disease of the skin, of the muscles, of the connective tissue or of the bones; endocrine and metabolic malfunction, damage or disease; headaches or sexual malfunction.
  • adverse events belong to at least one of the following categories: undesired drug interactions; cancer diseases; CNS
  • the method of the invention or an oli- gonucleotide according to the invention or a kit according to the invention for distinguishing cell types and/or tissues and/or for investigating cell differentiation.
  • an indication-specific target in a nucleic acid is defined by a difference in the methylation rate of a nucleic acid derived from a diseased tissue in comparison to the methylation rate of a nucleic acid derived from a healthy tissue.
  • Figure 1 describes the complete conversion of unmethylated cytosine to uracil, also referred to as bisulfite conversion, which is known in the art.
  • unmethylated cytosine bases are sulfonated at position C6 at a pH around 5 through reaction with hydrogensulfite.
  • the second step is the deamination that takes place rather spontaneously in aqueous solu- tion. Thereby, cytosine sulfonate is converted into uracil sulfonate.
  • the third step is the desulfonation step, which takes place in alkaline conditions, resulting in uracil.
  • FIG. 2 shows data processing of the sequenced TFFl gene PCR product. The sequencing electorpherogram of 50% methylated, bisulfite treated DNA is depicted.
  • Figure 2 two contains three horizontal panels, denoted A, B, and C, all showing electro- pherograms.
  • the run time is represented on the x-axis and the signal intensity on the y-axis.
  • CpG sites are denoted by arrows.
  • the upper panel A of Fig. 2 shows the data that was exported from ABI trace files after completed sequencing of a nucleic acid that is to be analyzed for its methylation rate.
  • Each of the four lanes represents the signal for one base (A, C, G, T).
  • Panel B shows the identified methylation signals stemming from the nucleic acid and the signals stemming from the at least one introduced base of an at least one oligonucleotide.
  • the at least one oligonucleotide that was used was introduced into the bisulfite-treated nucleic acid as a primer in a PCR amplification and contained three cytosine bases.
  • the three cytosine bases were introduced into the nucleic acid to be analyzed in a PCR reaction. All cytosine bases of the nucleic acid as well as the three introduced cytosine bases are detected using the same sequencing dye.
  • Panel C shows the excised methylation and normalization signals. As can be seen, the use of only one dye for cytosine is sufficient to perform determination of the methylation rate of a nucleic acid.
  • Fig. 2 The process of data analysis of the TFFl gene (forward [G-rich] primer) is exemplarily depicted in Fig. 2 (together with Fig. 3). Sequence data from the 50 % methylated DNA was exported from the ".ABI" trace files generated by the software of the sequencing machine and the trace of the C signal (containing the methylation signal and the normalization tail) is depicted (Fig 2, panel A and B). The signals at the CpG sites and the normalization tail were identified and excised (Fig. 3, B and C).
  • Figure 3 shows data normalization.
  • Panel A top shows the sequencing raw data (cytosine trace of CpG sites only; Intensity over run time)
  • panel B shows the normalized sequencing data (cytosine trace of CpG sites only; normalized intensity over run time)
  • panel C shows the calibration (G-rich sequencing only; area of normalized CpG sites over methylation of input DNA in %).
  • Peak intensities at CpG sites were divided by the area of the normalization tail ([A] and [B]).
  • the integrated areas of all normalized signals (methylation index) reflect the average methylation rate of the analyzed sequence [C].
  • Figure 3 (A) shows the cytosine traces of all used standard DNAs (0 %, 5 %, 10 %, 25 %, 50 %, 75 % and 100 % methylated).
  • CpG sites and the normalization tail were already excised in Fig. 3. Normalization was performed by dividing the area of all CpG sites by the area of the normalization tail (Fig. 3, B). Quantitative methylation information is reflected by the normalized area (methylation index) of the CpG sites (Fig. 3, C).
  • Figures 4 to 16 show the results of the sequencing of the 0 %, 5 %, 10 %, 25 %, 50 %, 75 % and 100 % methylated DNA mixtures using 13 PCR amplicons (10 different genes). All amplicons were sequenced using forward and reverse primers (Table 1). The methylation rate (methylation index) is displayed by the normalized areas of the peaks of the respective methylation sites that were normalized against the normalization tail. The methylation indices of all 13 amplicons and both sequencing directions for each amplicon show a strong correlation to the level of methylation of the input DNA. AIl Figures 4 to 16 consist of three panels: panel A (top), panel B (middle), and panel C (bottom).
  • Panel A always shows C-rich sequencing (bisulfite anti-sense strand) of a particular gene (methylation index over methylation of input DNA in %).
  • Panel B always shows G-rich sequencing (bisulfite sense strand) of a particular gene (methylation index over methylation of input DNA in %).
  • Panel C always shows the concordance of G- rich and C-rich sequencing of a particular gene (methylation index of the G-rich sequencing over methylation index of the C-rich sequencing).
  • Figure 4 shows the results of the sequencing of the TFFl gene.
  • FIG. 5 shows the results of the sequencing of the SLITRKl gene.
  • Figure 6 shows the results of the sequencing of the SLIT2 gene.
  • Figure 7 shows the results of the sequencing of the RASSFlA gene.
  • Figure 8 shows the results of the sequencing of the PLAU gene.
  • Figure 9 shows the results of the sequencing of the PITX3 gene.
  • Figure 10 shows the results of the sequencing of the PITX2 (region 4) gene.
  • Figure 11 shows the results of the sequencing of the PITX2 (region 3) gene.
  • Figure 12 shows the results of the sequencing of the PITX2 (region 2) gene.
  • Figure 13 shows the results of the sequencing of the PITX2 (region 2) gene.
  • Figure 13 shows the results of the sequencing of the PITX2 (region 1) gene.
  • Figure 14 shows the results of the sequencing of the LIMKl gene.
  • FIG. 15 shows the results of the sequencing of the LHX3 gene.
  • Figure 16 shows the results of the sequencing of the HS3ST2 gene.
  • Figure 17 shows a preferred embodiment of the method for determining the methylation rate of a nucleic acid through sequencing, in which two oligonucleotides are used as primers to introduce bases into a nucleic acid to be analyzed via a PCR amplification reaction.
  • nucleic acid 1 is shown that was treated such that unmethylated cytosines were converted into uracils. This was done using bisulfite treatment, such as described with reference to figure 1.
  • the bisulfite sense strand is depicted. Due to bisulfite treatment, this DNA strand 1 is rich in guanine bases compared to cytosine bases (G-rich strand). In the middle of the nucleic acid molecule 1, three CpG sites are located 9, which will be analyzed using the method according to this invention.
  • the nucleic acid strand 1 is amplified using an enzymatic amplification reaction in the form of a polymerase chain reaction (PCR). Two oligonucleotides 2, 3 are used in the PCR.
  • PCR polymerase chain reaction
  • Both of the oligonucleotides (primers) 2, 3 shown here for amplifying a nucleic acid 1 comprise a first sequence part that is reverse complementary to the nucleic acid to be amplified which serves for initiating an amplification reaction, a second sequence part that contains at least one base (C or G in this examples) for generating a sequencing signal to be used for the normalization of sequencing signals stemming from the nucleic acid, and a third sequence part for the hybridization of a sequencing primer. Due to the fact that the second and third sequence parts are not identical, the at least one base for generating a se- quencing signal will be sequenced before the nucleotides of interest of the nucleic acid to be analyzed, which simplifies the determination of the methylation rate.
  • a first oligonucleotide 2 is a C-rich primer that comprises a sequence part at its 3' end that hybridizes to the nucleic acid strand 1 and takes part in initiating the amplification reac- tion.
  • the 5' end of the first primer 2 comprises at least one guanine base. Therefore, the first primer 2 amplifies the bisulfite anti-sense strand, introducing at least one guanine base as a reference base for sequencing.
  • a second oligonucleotide 3 is a G-rich primer that comprises a sequence part that hybrid- izes to the nucleic acid strand 1, which is located at the 3' side of the second primer 3.
  • On the 5' end of the second primer 3, a sequencing domain 4 for hybridization of a sequencing primer is located.
  • the second primer 3 amplifies the bisulfite sense strand, introducing at least one cytosine base as reference base for sequencing.
  • a sequencing primer used for sequencing the nucleic acid 1 can hybridize to the sequencing domain 4 of the second primer 3 to initiate a sequencing reaction, such as a cycle sequencing reaction.
  • a sequencing reaction such as a cycle sequencing reaction.
  • the first primer 2 contains three guanines and the second primer 3 contains three cytosine bases that generate a reference signal when sequenced. These reference signals will later be used for normalizing the signals stemming from guanine of cytosines of the nucleic acid 1.
  • both the first (reverse) primer 2 and the second (forward) primer 3 comprise at least one guanine or cytosine base as an introduced base, respectively
  • the PCR product contains at least one guanine or cytosine base at each side, i.e. at the 5' and the 3' end. This is advantageous for x-axis normalization.
  • Panel B shows an amplification product 5 of a PCR amplification reaction of a nucleic acid 1 using first and second primers 2, 3 as described with reference to figure 17 A.
  • the strand shown of the amplification product 5 is the bisulfite sense strand (G-rich strand); the bisulfite anti-sense strand (C-rich strand) is not shown.
  • the depicted amplification product 5 comprises, in 5' to 3' direction: a sequencing domain 6, cytosine reference bases 7 to be used for normalization, the sequence that the second primer 3 hybridized with 8, the analyzed sequence of the nucleic acid 9 containing the me- thylation information, the sequence 10 that the first primer 2 hybridized with, and cytosine reference bases 11 to be used for normalization.
  • the three guanine bases of the first primer 2 are now introduced into the amplification product 5 of the bisulfite sense strand as cytosines.
  • the amplification product 5 contains reference signals for normalization at both its 5' and 3' end, which is advantageous for determining the methylation rate of the nucleic acid 1.
  • Panel C depicts the sequence trace data 12 from the cytosine signal, which was generated form the sequencing data of the PCR amplification product 5 shown in panel B of figure 17.
  • Signals stemming from cytosines from the oligonucleotides in the form of primers are located on the 5' and the 3' side of the CpG sites of interest, making the determination of the methylation rate of the cytosine bases to be analyzed 9 easy, reliable and reproducible.
  • Figure 18 illustrates the principle of the method of the invention on the example of a G- rich sequencing via the forward primer.
  • Figure 19 shows the principle of normalization according to the method of the invention. Shown are the sequence histograms (only cytosine signals) of 5 different methylation mixtures - in the upper part before normalization and in the lower part after normalization. The normalization signal is identified for each sample. The intensities of each measured value are then divided by the area of the normalization signal. The heights as well as the areas of the peaks at the CpG positions correlate with the methylation of the methylation mixtures after normalization.
  • Figure 20 Dependency of the normalized methylation signals (sum of all normalized peak areas) on the methylation of the applied standard DNA. 13 different loci were sequenced by means of both a guanosin-rich forward primer (primary y-axis) and a cytosine-rich reverse primer (sekundary y-axis). This figure is equivalent to the content of figures 4-16 summerizing the data.
  • Figure 21 Comparison of the method of the invention with correspondent QM assays. Shown are the results obtained by the analysis of the three genes PITX2, uPA and TFFl. The analyzed standard DNA samples were characterized by a methylation of 0, 5, 10, 25, 50, 75 and 100%. The displayed measured values are median values of three different QM assay runs ⁇ standard deviation (secondary y-axis) and the results of a single sequencing run by means of guanine-rich primer according to the method of the invention (primary y- axis), respectively.
  • Figure 22 illustrates a preferred embodiment of the invention, wherein the clonality of a sample, in particular of tumor cells is determined.
  • the human chromosome set is a duplicate set, wherein one copy of a gene is derived from the mother (maternal allele) and the other from the father (paternal allele). One allele is only active in each cell, while the other allele is inactivated by methylation. Maternally and paternaly derived chromosomal regions can be differenciated for example by the different lengths of short tandem repeats. In case of a monoclonal sample, the sample will only generate a sequencing histogramm according to the invention that reflects the methylated cytosines of either the maternal or the parternal allele.
  • the sample will generate a sequencing histogramm according to the invention that reflects the methylated cytosines of the maternal as well the parternal allele.
  • the sample comprises tumor cells, it is possible to deter- mine the clonality of the cancerous cells. Thereby it is determined if the cancerous cells are derived from a single origin or from multiple origins.
  • the nucleic acid to be analyzed is incubated with a chemical reagent or an enzyme containing solution, whereby unmethy- lated cytosine bases are converted into uracil bases. Accordingly, bisulfite treated DNA comprises only a small number of cytosine bases and all remaining cytosine bases derive from previously methylated cytosine bases.
  • At least one base is introduce into the bisulfite-treated nucleic acid.
  • This is done in this example using enzymatic amplification reaction, which is performed with at least one oligonucleotide, which comprises at least one cytosine base for amplifying the bisulfite sense strand or at least one guanine base for amplifying the bisulfite anti-sense strand of the nucleic acid.
  • the mixture is then incubated, whereby the nucleic acid is amplified.
  • amplification is performed using the polymerase chain reaction (PCR).
  • bisulfite-treated DNA is amplified in this example using at least one oligonucleotide in the form of a tagged primer, which leads to an incorporation of additional cytosine bases at the end of the PCR product of the bisulfite sense strand. If this strand is sequenced (G-rich sequencing) in a later step, these additional cytosine bases can be used as reference signals to quantify the methylation rate of the DNA.
  • the at least one oligonucleotide comprises three cytosine bases for amplifying the bisulfite sense strand or three guanine bases for amplifying the bisulfite anti-sense strand.
  • the at least one cytosine or guanine base is located at the 5' end of the oligonucleotide, and on the 3' side to that 5' end of the oligonucleotide, a nucleotide sequence is located which specifically hybridizes with the particular target gene of the nucleic acid that is to be analyzed.
  • the oli- gonucleotides used in these examples will generate a sequencing reference signal that can be used to normalize the sequencing signals obtained for the cytosine or guanine base of the nucleic acid that is analyzed.
  • the 5' end of the oligonucleotides that serve as primers in the PCR bear either the sequence ACTCC (when used for amplifying the bisulfite sense strand) or AGGTG (when used for amplifying the bisulfite anti-sense strand) (see Table 1).
  • the bisulfite anti-sense strand which is generated during PCR amplification, contains few guanine bases and all remaining guanine bases reflect previously methylated CpG sites. Accordingly, guanine bases have to be incorporated (here using tagged primers) at one terminal end of the PCR amplification product in order for determining the methylation rate of this strand using sequencing (C-rich sequencing).
  • Unmethylated DNA was prepared by MDA (multiple displacement amplification), a genome-wide amplification method described by Dean FB, Hosono S, Fang L, Wu X, Faruqi AF, Bray- Ward P, Sun Z, Zong Q, Du Y, Du J, Driscoll M, Song W, Kingsmore SF, Eg- holm M and Lasken RS (2002) Comprehensive human genome amplification using multiple displacement amplification. Proc. Natl. Acad. Sci. U S A, 99, 5261-5266).
  • Methylated DNA was prepared by treating unmethylated DNA (obtained as described above) with Sssl methyltransferase (New England Biolabs) in the presence of S-adenosyl- methionine according to the manufacturer's instructions.
  • the at least one cytosine base or guanine bases as a mismatch within the sequence of the at least one oligonucleotide, rather than as a tail as described in these examples.
  • PCR amplification reactions were performed in a total volume of 25 ⁇ l containing 5 ng template DNA, 1 U Hotstar Taq polymerase (Qiagen), 12.5 pmol of forward and reverse primer, Ix PCR buffer (Qiagen), 0.2 mmol/1 of each dNTP (Fermentas). Cycling was performed using a Mastercycler (Eppendorf) under the following conditions: 15 min at 95 °C and 45 cycles at 95 °C for 20 s, 58 °C for 45 s and 72 °C for 30 s.
  • Digestion of remaining dNTPs and primers was performed in a total volume of 7 ⁇ l, containing 5 ⁇ l PCR product and 2 ⁇ l ExoS AP-IT (Amersham Bioscience) at 37 °C for 45 min. The enzyme was inactivated by heating to 95 °C for 15 min. ExoSAP-IT digested PCR products were subjected to the cycle sequencing reaction. Cycle sequencing reaction was carried out in a total volume of 20 ⁇ l containing 5 ⁇ l ExoSAP-ITTM digested PCR product, Ix sequencing buffer, 1 ⁇ l BigDye Terminator v3.1TM (Applied Biosystems) and 0.5 ⁇ mol/1 primer.
  • Cycle sequencing products were purified using DyeExTM 96 plates (Qiagen) according to the manufacturer's instructions. Sequence analysis of the purified cycle sequencing product was performed with the 3730 DNA Analyzer (Applied Biosystems) using POP-7TM Polymer (Applied Biosystems). The KB base caller was applied to analyze sequence trace files. Data (electropherogram) from these ABI sequence trace files were exported using Chro- mas 2.31 software (Technelysium Pty Ltd.).
  • Figure 2 (A) shows the C traces of all used standard DNAs (0 %, 5 %, 10 %, 25 %, 50 %, 75 % and 100 % methylated). CpG sites and the normalization tail were already excised in figure 2. Normalization was performed by dividing the area of all CpG sites by the area of the normalization tail (Fig. 2, B). Quantitative methylation information is reflected by the normalized area (methylation index) of the CpG sites (Fig. 2, C).
  • Table 1 Sequences of oligonucleotides used as primers. All primers contain a gene specific sequence (capital letters) and a normalization tag (lower case letters).
  • Results Figures 3 to 15 show the results of the sequencing of the O %, 5 %, 10 %, 25 %, 50 %, 75 % and 100 % methylated DNA mixtures using 13 PCR amplicons (10 different genes). All amplicons were sequenced using the forward and the reverse primer. The methylation index is displayed by the normalized areas of the peaks of the respective methylation sites, which were normalized against the normalization tail. The methylation indices of all 13 amplicons and both sequencing directions for each amplicon show a strong correlation to the level of methylation of the input DNA. In this case, the methylation index is the sum of all methylation signals within one analyzed nucleic acid, and therefore reflects the average methylation per analyzed sequence. Alternatively, this analysis can also be performed separately for each CpG site.
  • Example 2 Example 2:
  • the new method of quantification of bisulfite sequencing enables quantification of methylation with high resolution.
  • the principle of the new method is depicted in Figure 18 and is based on the introduction of a normalization signal in the PCR product.
  • the addition of bisulfite drives the conversion of the DNA in that all cytosines are being converted to uracil. Solely methylated cytosine remains unaffected from this reaction. Accordingly, the DNA contains cytosines only where methylated cytosine was previously found ( Figure 18, I). Now this DNA is PCR-amplified whereby one uses a backward-primer, which contains a gene-unspecific domain with guanosines at its 5'end ( Figure 18, II).
  • cytosines at its 3 'end which originate from the domain of the backward-primer, in addition to the cytosines originating from the methylated cytosines ( Figurel ⁇ , III).
  • the additional cytosines, at the end of the PCR product are independent of the original relative methylation and are present in each and every DNA molecule of the PCR product. They behave, therefore, like cytosines, which in the original DNA were methylated up to 100 %, and can be used for the normalization of the actual methylation signal.
  • the signal normalization is shown in Figure 19.
  • the cytosine signals are produced from five different methylated DNA standards (0, 5, 15, 50, and 100 % methylation).
  • Figure 19, top no correlation would exist of the peak-height and peak-area of the methylation signals with the methylation of the starting DNA. If, however, one normalizes the signals in the form that the cytosines, artificially inserted into the PCR product, have a uniform value, then the correlation of the peak-height and peak-area of the methylation of the starting DNA can be perceived well (Figure 19, bottom).
  • the actual normalization is performed in that the fluorescence intensities are divided into each measuring point along the peak- area of the entire normalization signal.
  • the peak-area of the normalization signal averages to exactly one for each sample after the normalization, and all samples can be directly compared to each other.
  • the actual quantitative methylation information is contained in the height of the peak, corresponding to the methylation positions, as well as in the area below the peak.
  • Figure 19 one sees this correlation clearly:
  • the peak-height as well as the peak-area at the CpG positions correlate very well with the methylation of the introduced DNA standards. In the following, only the peak-areas are used for a further analysis.
  • the peak-area is represented by the sum of several measuring points and does not constitute only one measuring point as the peak-height.
  • the peak-area should, therefore, give a more robust signal than the peak-height.
  • each unique CpG position can be separately analyzed. If not otherwise described, then in this example, the sum of the peak-areas is used within each PCR product in order to further increase the robustness of the signal.
  • the definition of the peak-area of a methylation position (CpG position) is defined as follows: Fluorescence of the local maximum plus the fluorescence intensities of the 15 preceding and the 15 subsequent measuring points. A peak is thus the sum of 31 measuring points.
  • the methylation of a total of 13 loci is examined from the micro-dissection samples. Since there is obviously not enough DNA present in the micro-dissection material, the DNA is first pre-amplified in a PCR in order to analyze the 13 loci separately from one another. For this purpose, a PCR is performed where the primers for all 13 loci are contained (multiplex-PCR, mPCR). Subsequently, each of the 13 loci is re-amplified in a separate PCR (singleplex-PCR, sPCR) whereby the product of the multiplex-PCR pre- amplification served as template. Primers with the 5 'domains for the generation of the normalization signals are first used in the sPCR re-amplification.
  • the pre-amplification with the multiplex-PCR is performed with conventional primers without domains.
  • the emphasis was placed on specificity.
  • the formation of side products and primer-dimers are hindered despite high numbers of cycles and a low concentration of template.
  • the product of this mPCR pre-amplification is, thus, suited to be re-amplified subsequently in an sPCR without the potential of side products being better amplified than the desired loci.
  • the primers being used were those, which through additional bases near the 5 'domain and in the 3 'end differ from pre-amplification primers. As far as it was possible and the primers of pre-amplification did not directly border a CpG, the primers of re-amplification were lengthened up to 3 bases in the amplificate. Thus, it was ensured that only the correct amplificate was reproduced.
  • the efficiency of the new method was tested for the 13 loci to be investigated.
  • the DNA standards (0, 5, 10, 25, 50, 75, and 100% methylated) are processed according to the described procedure.
  • the DNA standards are composed of a mixture of genome-wide amplified DNA (MDA DNA) and synthetically methylated MDADNA in the corresponding ratio.
  • 20 ng of each bisulfite-converted DNA standards are first pre- amplified in the mPCR and subsequently re-amplified in the 13 separate sPCR's.
  • the PCR product of the re-amplification was sequenced with both G-rich forward-primer as well as the C-rich backward-primer.
  • Figure 20 shows the sequencing result of the DNA standards.
  • the G-rich and C-rich sequencing leads to the same results for all 13 sequencing assays. In each of these 13 assays, it was possible to distinguish between all investigated methylation standards.
  • the methylation for three of the 13 loci was additionally determined using the corresponding QM assays (compare to this WO 2005/098035 as well as the following paragraph).
  • 20 ng of bisulfite-converted DNA from each of the DNA standards was used in each of the three QM assays.
  • the measurement with the QM assays were performed as a threefold determination so that altogether 240 ng of each of the DNA standards for the three QM assays were used (3 x 20 ng for each of the three QM assays) as opposed to only 20 ng for each of the standards for all 13 sequencing assays.
  • the result of the comparison of the techniques is depicted in Figure 21. In the case of the sequencing method, only the sequencing with the G-rich primer is shown in this diagram.
  • Each of the QM assays was specifically for the analysis of the gene PITX2 (promoter AB and C), TFFl, ABHD9 and uPA.
  • the primers and the probes for detection used for these assays are portrayed in Table 1.
  • the following regions of the genome were analyzed (after the ensemble v41): chromosome 4, Region 111777835-111777978 (PITX2 AB), chr.4, 111763501-111763655 (PITX2 C), chr.lO, 75340750-75340828 (uPA), chr.21, 42656449-42656529 (TFFl), chr.19, 15204086-15204233 (ABHD9).
  • PCRs were per- formed in 20 ul units and had the following composition: Ix PCR Puffer with passive ROX reference (Eurogentec, B), 1 U HotGoldStar Taq polymerase (Eurogentec, B), 0.2 mmol/1 each dNTP (dTTP, dATP, dGTP und dCTP, Fermentas, CDN), 0,625 ⁇ mol/1 each primer und 0,2 ⁇ mol/1 each probe (Biomers, D).
  • the magnesium concentration was different for the various assays: 3,5 mmol/1 in the TFFl assay, 3 mmol/1 each for both the PITX2 assays and in the uPA assay, as well as 2,5 mmol/1 for the ABHD9 assay.
  • the QM assays were performed in the 96-well as also in the 384-well plates. Here, optical PCR plates and optical foils (Applied Biosystems, USA) were used. The QM assays were incubated using the following temperature-time profile: initial activation of the polymerase for 10 min at 95° and 45 cycles with 15 s denaturation (95°) and 60 s annealing and extension, each. The an- nealing and the extension temperature is for the assays PITX2 (promoter AB) 62 °C, PITX2 (promoter C) 62 0 C, TFFl 58 °C, uPA 60 °C and ABHD9 60 °C.
  • CT(CG) and CT(TG) CT the CG ie. TG detection probes.
  • Table 2 Primer and probes for detection for the QM assays. All the probes carry at the 3' end a quencher- (BHQ-I) and at the 5' end a reporter-dye (6-FAM for CG-probes and HEX for TG-probes).
  • the primers and the probes for detection were purchased from Biomers (D). gene forward primer reverse primer CG-probe TG-probe
  • Figure 21 clearly shows that for both genes PITX2 and TFFl the corresponding sequence assays and the QM assays deliver comparable results.
  • the assays for the gene uPA one sees in Figure 21 that with the QM assay in the lower methylation region, a resolution is hardly possible.
  • the sequencing assay shows the equivalent good resolution in this region as for the other genes PITX2 and TFFl .
  • a QM assay yields a value between 0 and 100% due to the method of analysis by which the signals of the two probes are compared to each other. This value is equated, as a simplification, with the percentage of methylation.
  • the sequencing assays yield values that can be very different from assay to assay. Since this value reflects the sum of the normalized peak-areas at the CpG positions, such a value is dependent, for example, on the number of CpG positions being examined. Furthermore, the fluorescence intensities that a base generates during sequencing are also dependent on the position in the DNA fragment and the surroundings. A methylated cytosine can yield, thus, at one position another signal intensity than a methylated cytosine in another position.
  • the running differences of the four different bases are equalized in that newly scaling is performed along the x-axis; for another, the intensity differences are also compensated for in that the four signals are also scaled along the y-axis.
  • the latter causes the low theoretically possible resolution capacity, since for an accurate quantification an exact scaling of the T-signal to the C-signal is necessary. Since only the signals from the originally methylated cytosine still exist in the C-signal, the exact scaling is difficult.
  • the sequence histogram of unmethylated DNA for example, contains absolutely no cytosine, which could be used for scaling. Strictly speaking, the scaling can then only pre- cisely take place when the methylation of a sample was known.
  • the artificially introduced methylation signal which is introduced through the primer, is directly used for normalization. For this reason, an accurate scaling of the C to T signal is not necessary because the T-signal for a determination of methylation is not considered.
  • the third sequencing method is the pyro-sequencing.
  • sequencing is done according to the principle "sequencing through synthesis.”
  • the four different dNTP's are alternately added into the mixture and the introduction of each of the corresponding nucleotide is quantified through a biochemical measurement of arising pyrophosphates.
  • the methylation can be ascertained.
  • One disadvantage of this method is that only amplificates 100- 200 bp long can be sequenced (Ronaghi M (2001). Pyrosequencing sheds light on DNA sequencing.
  • the measured methylation values between 0% and 5% are defined as background signal (Shaw RJ, Liloglou T, Rogers SN, Brown JS, Vaughan ED, Lowe D, Field JK, Risk JM (2006).
  • Example 3 Detection of the clonalty of a sample.
  • Tail tagged primers can also be used to analyse the state of clonality of a sample. Therefore, a loci is chosen which comprises at least one methylation site and a lenght polymorphism (i.e short tandem repeat, STR). In addition, one allel of the loci is silenced by DNA methylation, whereby this silencing occurs randomly. Genes, which fulfill these criteria can be found i.e. on the X-chromosome which is randomly inactivated. As depicted in Figure 22, the sequencing of such a loci using tag tailed primers can be used to determine, whether a sample consists of cells from the same or from different progenitor cells. A gene which is located on the X-chromosome and which comprises a STR and methylation sites is the human adrogen receptor (SEQ ID NO: 47
  • the fol- lowing primers can be used to amplify this region on bisulfite converted DNA and to incorporate the tail: SEQ ID NO: 48 5'cgtcgtcgaaccccaaacacccaaa3' and SEQ ID NO: 49 5'gttttttatttaggattaggtagtttgt3', where cgtcgtcg reflects the tail.
  • the clonality of tumor can be determined. Therefore, genomic DNA is iso- lated from cancereous cells and converted by means of bisulfite. Subsequently the bisulfite converted DNA is sequenced according to the method of the invention. Wherein only either the maternal or the paternal allele is detected, the tumor is of monoclonal origin. Wherein both the maternal and paternal allele are detected, the tumor is of polyclonal origin.
  • base caller refers to, but is not limited to, an algorhithm used to process the sequencing raw data obtained directly from the detection of the fluorescence signal(s) over time.
  • the signals of all four bases are detected in- dependently from another.
  • the said algorhithm scales all signals along the x- axis, whereby the differences in the running behaviour of the different bases are compensated.
  • the signals are scaled along the y-axis. Thereby differences in the signal intensities are compensated.
  • methylation index refers to, but is not limited to, the absolute amount of methylation at an analyzed CpG dinucleotide after normalization.
  • methylation index may refer, but is not limited to, the peak area or peak height or the sum of a multiply peak areas or heights of several CpG di- nucleotides.
  • methylation rate refers to, but is not limited to, the percentage of methylation at an analyzed CpG dinucleotide. Therefore the methylation index is transformed into a percentage methylation based on a calibration curve.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Physics & Mathematics (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

L'invention concerne un procédé servant à déterminer quantitativement le taux de méthylation d'un acide nucléique grâce au séquençage. Selon l'invention, le procédé comprend au moins les étapes suivantes consistant à : a) traiter l'acide nucléique avec un réactif chimique ou une solution contenant une enzyme, ce par quoi le comportement d'appariement des bases pour les bases de type cytosines méthylées et/ou les bases de type cytosines non méthylées de l'acide nucléique est modifié de façon à ce que les bases de type cytosines méthylées deviennent distinguables des bases de type cytosines non méthylées ; b) introduire dans l'acide nucléique au moins une base servant à générer un signal de séquençage à utiliser en tant que signal de référence pour la normalisation ; c) séquencer l'acide nucléique, ce par quoi on obtient un signal provenant de chaque base de type cytosine de l'acide nucléique ou un signal provenant de chaque base de type guanine de l'acide nucléique et un signal de référence provenant de ladite ou desdites bases introduites ; et d) normaliser le signal obtenu à partir de chaque base de type cytosine de l'acide nucléique ou le signal obtenu à partir de chaque base de type guanine de l'acide nucléique par rapport au signal de référence provenant de ladite ou desdites bases introduites.
EP07785943A 2006-07-18 2007-07-09 Procédé servant à déterminer le taux de méthylation d'un acide nucléique Withdrawn EP2044214A2 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP07785943A EP2044214A2 (fr) 2006-07-18 2007-07-09 Procédé servant à déterminer le taux de méthylation d'un acide nucléique

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
EP06090125 2006-07-18
EP06120694 2006-09-14
EP07109855 2007-06-08
EP07785943A EP2044214A2 (fr) 2006-07-18 2007-07-09 Procédé servant à déterminer le taux de méthylation d'un acide nucléique
PCT/EP2007/006064 WO2008009365A2 (fr) 2006-07-18 2007-07-09 Procédé servant à déterminer le taux de méthylation d'un acide nucléique

Publications (1)

Publication Number Publication Date
EP2044214A2 true EP2044214A2 (fr) 2009-04-08

Family

ID=38935863

Family Applications (1)

Application Number Title Priority Date Filing Date
EP07785943A Withdrawn EP2044214A2 (fr) 2006-07-18 2007-07-09 Procédé servant à déterminer le taux de méthylation d'un acide nucléique

Country Status (2)

Country Link
EP (1) EP2044214A2 (fr)
WO (1) WO2008009365A2 (fr)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2150626B1 (fr) * 2007-06-07 2015-09-30 Haplomic Technologies Pty Ltd. Méthodes épigénétiques
JP5757572B2 (ja) * 2009-07-03 2015-07-29 国立大学法人 東京大学 癌細胞の存否を判定する方法および癌患者の予後を判定する方法
CN105705653A (zh) * 2013-05-10 2016-06-22 南加州大学 针对膀胱癌的dna甲基化生物标志物
SG11202103486YA (en) * 2018-10-31 2021-05-28 Guardant Health Inc Methods, compositions and systems for calibrating epigenetic partitioning assays

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO2008009365A3 *

Also Published As

Publication number Publication date
WO2008009365A2 (fr) 2008-01-24
WO2008009365A3 (fr) 2008-04-17

Similar Documents

Publication Publication Date Title
JP4484431B2 (ja) シトシンメチル化の高感度での検出方法
US8911937B2 (en) Method for detecting methylation status by using methylation-independent primers
JP4044290B2 (ja) 高処理dnaメチル化解析のための方法
JP4727670B2 (ja) 核酸の改変された前処理によって達成されるメチル化解析を標的とするdna増幅系におけるキャリーオーバー保護のための方法
EP2049684B1 (fr) Procédé d'analyse par méthylation d'acide nucléique
JP2005514035A (ja) ハイブリダイゼーションしたプローブオリゴヌクレオチド(mla)の指数的ライゲーションによるシトシン−メチル化パターンの検出方法
US8361724B2 (en) Methods for evaluating the methylation status of a polynucleotide
US20050287553A1 (en) Method for the quantification of methylated DNA
US6335165B1 (en) Methods and kits for characterizing GC-rich nucleic acid sequences
EP2699695A2 (fr) Marqueurs du cancer de la prostate
CN113186278B (zh) 甲状腺结节良恶性相关标志物及其应用
WO2024056008A1 (fr) Marqueur de méthylation pour identifier un cancer et son utilisation
EP2044214A2 (fr) Procédé servant à déterminer le taux de méthylation d'un acide nucléique
CA3173044A1 (fr) Procedes et kits pour le depistage d'un neoplasme colorectal
AU2015336938B2 (en) Genome methylation analysis
US20080286778A1 (en) Method for Investigating Cytosine Methylations in Dna
EP1590484B1 (fr) Technique de detection de configurations de methylation de cytosine a haute sensibilite
WO2022170984A1 (fr) Procédé et un kit de dépistage, d'évaluation des risques et de pronostic pour les adénomes colorectaux avancés
US20080172183A1 (en) Systems and methods for methylation prediction
US20100003680A1 (en) Method For Determining The Methylation Rate of a Nucleic Acid
EP1882747A1 (fr) Procédé d'analyse de l'état de méthylation d'une acide nucléique
EP1704255B1 (fr) Procede pour etudier la methylation de la cytosine dans l'adn au moyen d'enzymes de reparation de l'adn
US8206927B2 (en) Method for accurate assessment of DNA quality after bisulfite treatment
CN117778568A (zh) 鉴别胃癌的标志物及应用

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20090116

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC MT NL PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA HR MK RS

DAX Request for extension of the european patent (deleted)
17Q First examination report despatched

Effective date: 20110114

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20130201