WO2024038069A1 - Detection of epigenetic modifications - Google Patents

Detection of epigenetic modifications Download PDF

Info

Publication number
WO2024038069A1
WO2024038069A1 PCT/EP2023/072497 EP2023072497W WO2024038069A1 WO 2024038069 A1 WO2024038069 A1 WO 2024038069A1 EP 2023072497 W EP2023072497 W EP 2023072497W WO 2024038069 A1 WO2024038069 A1 WO 2024038069A1
Authority
WO
WIPO (PCT)
Prior art keywords
group
sequencing
5hmc
reactive group
nucleic acid
Prior art date
Application number
PCT/EP2023/072497
Other languages
French (fr)
Inventor
Frank Bergmann
Dieter Heindl
Original Assignee
F. Hoffmann-La Roche Ag
Roche Diagnostics Gmbh
Roche Molecular Systems, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by F. Hoffmann-La Roche Ag, Roche Diagnostics Gmbh, Roche Molecular Systems, Inc. filed Critical F. Hoffmann-La Roche Ag
Publication of WO2024038069A1 publication Critical patent/WO2024038069A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay

Definitions

  • the invention relates to the field mapping the epigenetic modifications of DNA and RNA, which has become increasingly important, because these modifications play a role in several biological processes and diseases including development, aging and last but not least, cancer.
  • C cytosine
  • 5mC 5-methylcytosine
  • oxidation can also be performed by chemical means using e.g. potassium perruthenate (KRuCh).
  • KRuCh potassium perruthenate
  • WO 2022/096751 also disclosed a method for generating a dihydrothymine (DHT) or a dihydrouracil (DHU) residue from a nucleoside or a polynucleotide containing 5-methylcytosine (5mC) or 5-carboxyl-cytosine (5caC) by using a radical initiator which may be used together with a nucleophilic compound.
  • DHT dihydrothymine
  • DHU dihydrouracil
  • the present invention is based on the basic idea that the introduction of a nucleophile at 5fC, 5hmC or caC and the subsequent intramolecular addition at C- 6 of a 5-substitued cytosine is enabling a deamination at C-4 of a cytosine derivative.
  • the present invention is directed to a method comprising the steps of a) providing a nucleic acid comprising 5fC, 5hmC, or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl, or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6 di-hydro Cytosine entity, and e) deaminating said 5,6 di-hydro Cytosine entity to a 5,6 di-hydro-Uracil entity.
  • step d) The newly formed cyclic structures in step d) are 5-7 membered rings
  • the method further comprises step f), i.e. reversing the ring formation of step d) thereby obtaining a 5-substituted Uracil.
  • the method may also comprise a step of converting 5mC to 5fC, 5hmC or 5caC prior to step a).
  • the present invention is facilitating the methylation status analysis of naturally occurring DNA.
  • said nucleic acid comprises at least one 5fC residue.
  • the first reactive group may be a C-H acidic group, an amine or a phosphorus ylide.
  • said nucleic acid comprises at least one 5hmC residue. Then, the first reactive group may constitute a Glycosyl Transferase substrate.
  • said nucleic acid comprises a 5caC residue. Then, the first reactive group may be an amine.
  • the said second reactive group may be a nucleophilic group selected from a group consisting of Thiol or Sulfinate.
  • All methods as disclosed above may further comprise a subsequent step of amplifying said nucleic acid, which may be preferably executed by means of PCR amplification.
  • all methods disclosed above may further comprise the step of a subsequent step of sequencing said nucleic acid either with or without a prior amplification step, which is preferably a PCR amplification.
  • Fig. 1 provides a scheme for detection of methylated sequences according to the present invention.
  • Fig. 2 shows a reaction scheme for modification of 5fC by means of a Knoevenagel condensation using a reactant with a C-H acidic group
  • Fig. 3 shows a reaction scheme for modification of 5fC by means of an aldol condensation reaction with a CH-acidic reagent comprising a sulfinate moiety
  • Fig. 4 shows a reaction scheme for modification of 5fC by means of a Wittig reaction.
  • Fig. 5 shows a specific example for a Wittig reaction according to Fig. 4 with a sulfinate modified Wittig reagent.
  • Fig. 6 shows a reaction scheme for modification of 5fC by means of reductive amination
  • Fig: 7 shows a specific example for a reductive amination as in Fig. 6, where 5fC is reacted with 2-aminomethyl sulfinate
  • Fig. 8 shows a reaction scheme for the modification of hmC using UDP-glucose substituted at the 2-position of the glucose moiety with a nucleophile as a reagent and executing a subsequent beta-glucosyl-transferase reaction.
  • Fig. 9 shows a similar reaction scheme as example 8, wherein 2-thio-glucose-UDP is used.
  • Fig. 10 shows an example for modification of the carboxyl group of caC.
  • a nucleophilic group is a functional group that is attractive to electron-poor or positively charged centers and donates electrons, in particular donating an electron pair to an electrophile or an electrophilic center to form a covalent bond.
  • Nucleophilic groups are for example alcohols, alcoholates, thiols, thiolates, amines, sulfinates or carbanions.
  • deamination refers herein to a substitution reaction of an exocyclic amino group by a hydroxyl group, in particular substitution of the amino group at the C-4 position of a cytosine nucleobase to convert into a uracil nucleobase. Deamination at C-4 position is enhanced in 5,6-dihydrocytosines to give 5,6-dihydrouracil derivatives, as e.g. being applied after bisulfite addition to the C5- C6 double bond of cytosine.
  • C-H acidic group is used herein for a carbon atom within a compound which carries at least one electron-withdrawing group and thus being more easily deprotonated by a base.
  • Electron-withdrawing groups are for example carbonyl containing groups, a nitro group, a cyano group, a sulfoxide or sulfone group.
  • Wittig reagent or phosphorus ylide refers to neutral dipolar molecules containing a formally negatively charged carbon atom (a carbanion) directly attached to a phosphorus atom with a formal positive charge.
  • phosphorus ylides phosphonium ylides
  • Phosphorus ylides are thus 1,2- dipolar compounds and a subclass of zwitterions.
  • triphenylphosphonium ylides are reacted with aldehydes or ketones to give an alkene bond.
  • a Homer-Wadsworth-Emmons reaction stabilized phosphonate carbanions are used as reagent.
  • a Glycosyl Transferase Substrate refers to a reagent comprising an activation group, e.g. a UDP moiety, and a sugar residue, for example a glucose or substituted glucose.
  • Glycosyltransferases catalyze the transfer of the sugar moiety from an activated donor sugar (the glycosyl transferase substrate) onto a saccharide or non- saccharide acceptor.
  • the non-saccharide acceptor can be for example 5- hydroxymethyl-dC.
  • T4 Phage B-glucosyltransferase for example from New England Biolabs, specifically transfers a glucose moiety of uridine diphosphoglucose to the 5-hydroxymethylcytosine (5-hmC) residues in double-stranded DNA, making beta- glucosyl-5-hydroxymethylcytosine.
  • FIG. 1 A general scheme, how methylated C-residues mC are detected is schematically shown in Fig. 1.
  • the epigenetic dC modifications 5-methyl-dC (mC), 5-hydroxymethyl-dC (hmC), 5-formyl-dC (fC) and 5-carboxyl- dC (caC) are read as unmodified dC when amplified and/or sequenced.
  • oxidation of mC by enzymatic and/or chemical means results in oxidized mC modifications hmC, fC and caC (general: xC).
  • Oxidation or conversion of 5mC to 5fC can be achieved by using TET enzymes (ten- eleven translocation (TET) methylcytosine dioxygenases).
  • TET ten- eleven translocation
  • the same class of enzymes are suitable for converting 5mC to 5hmC.
  • 5hmC can be further oxidized to 5fC enzymatically by a laccase enzyme or chemically using either KRuO4 or Cu(II)/TEMPO. Further details are disclosed in Pfeifer et al. Epigenetics & Chromatin 2013, 6: 10, p.1-9.
  • a reagent of the invention comprising two reactive groups, the first reactive group forming a covalent bond with hmC, fC or caC and thus introducing the second reactive group which comprises a nucleophile.
  • the present also provides a method for analyzing the methylation status of a naturally occurring DNA comprising the steps of a) converting 5mC to 5fC, 5hmC or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC, d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6 di-hydro Cytosine entity, and e) deaminating said 5,6 di-hydro Cytosine entity to
  • the sample is derived from a subject or a patient.
  • the sample may comprise a fragment of a solid tissue or a solid tumor derived from the subject or the patient, e.g., by biopsy.
  • the sample may also comprise body fluids that may contain nucleic acids (e.g., urine, sputum, serum, blood or blood fractions, i.e., plasma, lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid, bile, gastric fluid, intestinal fluid, or fecal samples).
  • nucleic acids e.g., urine, sputum, serum, blood or blood fractions, i.e., plasma, lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid,
  • the sample is a cultured sample, e.g., a tissue culture containing cells and fluids from which nucleic acids may be isolated.
  • the nucleic acids of interest in the sample come from infectious agents such as viruses, bacteria, protozoa or fungi.
  • the present invention involves manipulating isolated nucleic acids isolated or extracted from a sample. Methods of nucleic acid extraction are well known in the art (see, Sambrook et al., & quot; Molecular Cloning: A Laboratory Manual, & quot; 1989, 2nd Ed., Cold Spring Harbor Laboratory Press: New York, N. Y.).
  • kits are commercially available for extracting nucleic acids (DNA or RNA) from biological samples (e.g., KAPA Express Extract (Roche Sequencing Solutions, Pleasanton, Cal.) and other similar products from BD Biosciences Clontech (Palo Alto, Cal.), Epicentre Technologies (Madison, Wise.); Gentra Systems, (Minneapolis, Minn.); and Qiagen (Valencia, Cal.), Ambion (Austin, Tex.); BioRad Laboratories (Hercules, Cal.); and more.
  • the present invention involves detecting an epigenetic modification, specifically, an epigenetic cytosine modification in nucleic acids (including, but not limited to, cytosine methylation).
  • the nucleic acid sequences that are subject to conditional epigenetic modification are the target sequences analyzed by the method disclosed herein.
  • the same nucleic acid sequence may or may not have the epigenetic modification characterized by methylation of cytosines at the 5-position (5mC or 5hmC).
  • a set or a panel of target nucleic acids are probed for the presence of methylation. For example, as shown in Patai, et al.
  • the method of the instant invention includes detecting methylation in all sites throughout the genome of an organism to diagnose a disease or condition or predisposition to a disease or condition using the sequence analysis and artificial intelligence tools described, e.g., in Shull, et al. “Sequencing the cancer methylome,” Methods Mol Biol. 1238:627-635 (2015).
  • the method of detecting epigenetic modifications includes sequencing.
  • the nucleic acid processed as described herein is subjected to sequencing; preferably, massively parallel single molecule sequencing. Analyzing individual molecules by massively parallel sequencing typically requires a separate level of barcoding for sample identification and error correction.
  • the use of molecular barcodes such as described in U.S. Patent Nos. 7,393,665, 8,168,385, 8,481,292, 8,685,678, and 8,722,368.
  • a unique molecular barcode is added to each molecule to be sequenced to mark molecule and its progeny (e.g., the original molecule and its amplicons generated by PCR).
  • the unique molecular barcode has multiple uses including counting the number of original target molecules in the sample and error correction (Newman, et al., “An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage,” Nature Medicine doi: 10.1038/nm.3519 (2014)).
  • Nanopore sequencing is a unique, scalable technology that enables direct, real-time analysis of long DNA or RNA fragments. It works by monitoring changes to an electrical current as nucleic acids are passed through a protein nanopore. The resulting signal is decoded to provide the specific DNA or RNA sequence. Nanopore-based sequencing technology detects the unique electrical signals of different molecules as they pass through the nanopore with a semiconductor-based electronic detection system. This technology makes for a high throughput, cost effective sequencing solution.
  • the biological nanopore a protein pore embedded in a membrane, while the brains of the technology lie in the electronics of a semiconductor integrated circuit and proprietary chemistries.
  • the electronic sensor technology embedded in the chip enables automatic membrane assembly and nanopore insertion, while allowing for active control of individual sensors on the circuit.
  • Different sequencing chemistries can be paired with the nanopore and electronic sensor technology to enable high throughput, high accuracy sequencing with faster time-to-data.
  • unique molecular barcodes are used for sequencing error correction.
  • the entire progeny of a single target molecule is marked with the same barcode and forms a barcoded family.
  • a variation in the sequence not shared by all members of the barcoded family is discarded as an artefact.
  • Barcodes can also be used for positional deduplication and target quantification, as the entire family represents a single molecule in the original sample (Newman, et al. “Integrated digital error suppression for improved detection of circulating tumor DNA,” Nature Biotechnology 34:547 (2016)).
  • the method involves forming a library comprising nucleic acids from a sample.
  • the library consists of a plurality of nucleic acids ready for sequencing or another type of detection method, e.g., PCR.
  • a library can be stored and used multiple times for further processing such as amplification or sequencing of the nucleic acids in the library.
  • the library is the input nucleic acid in which methylation is detected by the method described herein.
  • the library is formed from nucleic acids that have undergone the methylation detection reactions described herein.
  • the nucleic acids processed for detection of epigenetic modifications according to the method described herein are sequenced. Any of a number of sequencing technologies or sequencing assays can be utilized.
  • the term “Next Generation Sequencing (NGS)” as used herein refers to sequencing methods that allow for massively parallel sequencing of clonally amplified molecules and of single nucleic acid molecules.
  • Non-limiting examples of sequence assays that are suitable for use with the methods disclosed herein include nanopore sequencing (U.S. Patent Publication Nos. US2013/0244340, US2013/0264207, US2014/0134616, US2015/0119259 and US2015/0337366), Sanger sequencing, capillary array sequencing, thermal cycle sequencing (Sears, et al., Biotechniques, 13:626-633 (1992)), solid-phase sequencing (Zimmerman, et al., Methods Mol.
  • sequencing with mass spectrometry such as matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF/MS; Fu, et al., Nature Biotech., 16:381-384 (1998)), sequencing by hybridization (Drmanac et al., Nature Biotech., 16:54-58 (1998), and NGS methods, including but not limited to sequencing by synthesis (e.g., HiSeqTM, MiSeqTM, or Genome Analyzer, each available from Illumina), sequencing by ligation (e.g., SOLiDTM, Life Technologies), ion semiconductor sequencing (e.g., Ion TorrentTM, Life Technologies), and SMRT® sequencing (e.g., Pacific Biosciences).
  • MALDI-TOF/MS matrix-assisted laser desorption/ionization time-of-flight mass spectrometry
  • sequencing-by- hybridization platforms from Affymetrix Inc. (Sunnyvale, Calif.), sequencing-by- synthesis platforms from Illumina/Solexa (San Diego, Calif.) and Helicos Biosciences (Cambridge, Mass.), sequencing-by-ligation platform from Applied Biosystems (Foster City, Calif.).
  • Other sequencing technologies include, but are not limited to, the Ion Torrent technology (ThermoFisher Scientific), and nanopore sequencing (Genia Technology, Roche Sequencing Solutions, Santa Clara, Cal.), Oxford Nanopore Technologies (Oxford, UK), and sequencing by expansion (Stratos Genomics, Roche Sequencing Solutions).
  • the sequencing step involves sequence aligning.
  • aligning is used to determine a consensus sequence from a plurality of sequences, e.g., a plurality having the same unique molecular ID (UID).
  • the molecular ID is a barcode that can be added to each molecule prior to sequencing or if amplification step is included, prior to the amplification step.
  • a UID is present in the 5 ’-portion of the RT primer.
  • a UID can be present in the 5 ’-end of the last barcode subunit to be added to the compound barcode.
  • a UID is present in an adaptor and is added to one or both ends of the target nucleic acid by ligation.
  • a consensus sequence is determined from a plurality of sequences all having an identical UID.
  • the sequences having an identical UID are presumed to derive from the same original molecule through amplification.
  • UID is used to eliminate artifacts, i.e., variations existing in the progeny of a single molecule (characterized by a particular UID). Such artifacts resulting from PCR errors or sequencing errors can be eliminated using UIDs.
  • the number of each sequence in the sample can be quantified by quantifying relative numbers of sequences with each UID among the population having the same multiplex sample ID (MID).
  • MID multiplex sample ID
  • Each UID represents a single molecule in the original sample and counting different UIDs associated with each sequence variant can determine the fraction of each sequence variant in the original sample, where all molecules share the same MID.
  • a person skilled in the art will be able to determine the number of sequence reads necessary to determine a consensus sequence.
  • the relevant number is reads per UID (“sequence depth”) necessary for an accurate quantitative result.
  • the desired depth is 5-50 reads per UID. Modification of 5fC
  • Modification of 5fC according to the present invention can be achieved if the first reactive group of the reagent according to the present invention is a C-H acidic group, an amine or a phosphorus ylide.
  • the reaction can be executed as a Knoevenagel condensation, which is schematically depicted in Fig. 2.
  • the CH-acidic reagent comprising a nucleophile group is added via Knoevenagel conditions to fC, then subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling the deamination at C4 to obtain a “T-equivalent”.
  • Yields of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by a photoisomerization of the double bond and/or by applying a radical initiation or a catalyst. If desirable, a reversal of the ring closure can be achieved under alkaline conditions.
  • an aldol condensation reaction of 5fC with a CH-acidic reagent comprising a sulfinate moiety e.g. 1 -(methyl sulfonyl)methane sulfinate
  • a subsequent addition of a sulfinate moiety at the C-6 position of the substituted cytosine followed by deamination and desulfmation (5fC to T conversion) can be executed (Fig. 3)
  • a desulfmation step under alkaline conditions may even not be required.
  • a thiol moiety can also be applied as nucleophile
  • addition reaction to the C5-C6 double bond can be achieved by means of applying thiol-ene chemistry using a radical initiator or a catalyst.
  • the yields may be increased by photoisomerization of the newly formed double bond.
  • Modification of 5fC according to the present invention can also be done by means of a Wittig reaction (Fig. 4).
  • the reagent according to the present invention is a phosphorus ylide comprising a nucleophilic group.
  • the reagent is added via Wittig conditions to 5fC, then a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling a deamination at the C4 position in order to obtain a “T-equivalent”.
  • Yields of intramolecular addition of the nucleophile to the C5-C6 double bond may be increased by photoisomerization of the newly formed double bond, and/or by applying radical initiation or a catalyst. If desirable, a reversal of the ring closure can be achieved under alkaline conditions. Alternatively, the reaction can be performed using Horner-Wadsworth-Emmons reaction conditions, in particular using a sulfinate group as nucleophile.
  • reagent of the invention comprising an amine and a further nucleophilic group is added via reductive amination to 5fC. Subsequently there is an intramolecular addition of the nucleophile to the C5-C6 double bond, which enables the deamination at the C4 position to obtain a “T- equivalent”. Reversal of ring closure can be achieved under alkaline conditions, if necessary. Yields can be increased by applying radical initiation or a catalyst.
  • a more detailed example for implementing a reductive amination reaction is illustrated in Fig. 7.
  • 5fC is reacted with 2-aminomethyl sulfinate, and subsequently addition of the sulfinate moiety at C-6 position of substituted cytosine takes place.
  • This is followed by a deamination and desulfination step, which converts 5fC to T.
  • a desulfination step under alkaline conditions may not be needed.
  • the sulfinate moiety may be replaced by a thiol moiety using thiol-ene chemistry for the addition reaction to double bond C5-C6, which optionally includes the use of a radical initiator or a catalyst.
  • Fig. 8 schematically shows a reaction scheme for the modification of hmC.
  • the reagent used according to the present invention is a UDP -glucose, which is modified at the 2-position with a nucleophilic group. It is added via a B-glucosyl transferase enzyme to hmC. Then, a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling the deamination at the C4 position in order to obtain a “T-equivalent”.
  • yields of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by applying radical initiation or a catalyst and a reversal of the ring closure can be achieved under alkaline conditions, if necessary.
  • Fig. 9 An alternative example is shown in Fig. 9.
  • 5hmC is reacted with 2-thio- glucose-UDP and B-glucosyltransferase.
  • an addition of the thiol moiety at the C6 position of substituted cytosine applying thiol-ene chemistry and optionally using a radical initiator or a catalyst is taking place, which is followed by a deamination reaction, resulting in a 5hmC to T conversion.
  • 2- Sulfinato-glucose-UDP can also be applied as substrate for B-glucosyltransferase.
  • the method of the present invention uses a reagent comprising an amine and a further nucleophilic group. It is added after an EDC/NHS activation of the carboxyl group of caC. This results in caC forming an amide bond, and a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond, thereby enabling the deamination at the C4 position in order to obtain a “T-equivalent”.
  • the yield of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by means of applying radical initiation or a catalyst and reversal of the ring closure can be achieved under alkaline conditions.

Abstract

The present invention provides a method comprising the steps of a) providing a nucleic acid comprising 5fC, 5hmC, or 5caC, b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl hydroxymethyl or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC, d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6-di-hydro Cytosine entity, and e) deaminating said 5,6-di-hydro Cytosine entity to a 5,6 di-hydro-Uracil entity.

Description

Detection of epigenetic modifications
Field of the invention
The invention relates to the field mapping the epigenetic modifications of DNA and RNA, which has become increasingly important, because these modifications play a role in several biological processes and diseases including development, aging and last but not least, cancer.
Technical background
Most relevant to identify epigenetic modifications is detection by sequencing based methods. Differentiation between cytosine (C) and 5-methylcytosine (5mC) is described by direct sequencing (without preamplification) of DNA through a nanopore (e.g. by Pacific Bioscience Single Molecule, Real-Time (SMRT) Sequencing technology) and thereby reading out the different kinetics of nucleotide incorporation opposite C vs. mC by polymerase.
Other techniques are using methods to convert C or mC into a T(U) equivalent and subsequent amplification to allow identification when comparing untreated and converted sample DNA (see review: L. Zhao et al., Protein Cell 2020, 11, 792- 808).
Most important methods are bisulfite sequencing (conversion of C to U by bisulfite treatment), NEB’s EM-seq method (oxidation of mC by TET2 enzyme and glucosylation by B-glucosyltransferase (to block enzymatic deaminase reaction) and convert C to U by APOBEC deaminase), TAPS method (TET-assisted pyridine borane sequencing) applying TET enzyme oxidation of mC and subsequent reduction of oxidized mC species by pyridine borane to obtain dihydrocytosine nucleoside which is easily deaminated to give dihydrouracil (T equivalent) and the CLEVER method which is based on oxidation of mC by TET enzyme and subsequent reaction of 5-formyl-C with malononitrile to give an adduct which acts predominantly as a T equivalent in subsequent PCR amplification (C. Zhu et al., Cell Stem Cell 2017, 20, 720-731). Instead of enzymatic oxidation of mC by TET enzymes oxidation can also be performed by chemical means using e.g. potassium perruthenate (KRuCh).
There are also methods to differentiate the modifications 5-hydroxymethyl-dC (5hmC), 5-formyl (5fC) or 5-carboxyl-C (5caC) by partly modifying the methods described above.
Recently, WO 2022/096751 also disclosed a method for generating a dihydrothymine (DHT) or a dihydrouracil (DHU) residue from a nucleoside or a polynucleotide containing 5-methylcytosine (5mC) or 5-carboxyl-cytosine (5caC) by using a radical initiator which may be used together with a nucleophilic compound.
However, all the methods disclosed have some shortcomings. Bisulfite sequencing encompasses harsh conditions which degrades the majority of input DNA, only indirect detection of methylated cytosines is possible. Likewise, Enzymatic Methyl Sequencing (EM-Seq) via enzymatic deamination which is inefficient because the majority of cytosines which are non-methylated have to be converted. TAPS requires a toxic reagent needed for reduction such as pyridine borane, and the dihydrouridine being labile under slightly acidic conditions cannot easily be converted back to the more stable Uridine derivative. For the CLEVER method, the conversion yield of the condensation reaction of 5fC and malononitrile is far below 100%, because the enzymatic incorporation of nucleotides by polymerase when the condensation product of 5fC with malononitrile acts as substrate is not highly accurate, and incorporation of dA is only favoured, but not exclusive (see also: F. Galardi et al., Biomolecules 2020, 10, 1677). The method of WO 2022/096751 is not entirely specific for caC but also converts unmodified C to some extent. Another disadvantage is that di-hydro Uridine derivative cannot easily be converted back to the more stable Uridine derivative. Summary of the invention:
The present invention is based on the basic idea that the introduction of a nucleophile at 5fC, 5hmC or caC and the subsequent intramolecular addition at C- 6 of a 5-substitued cytosine is enabling a deamination at C-4 of a cytosine derivative.
Thus, in a first aspect, the present invention is directed to a method comprising the steps of a) providing a nucleic acid comprising 5fC, 5hmC, or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl, or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6 di-hydro Cytosine entity, and e) deaminating said 5,6 di-hydro Cytosine entity to a 5,6 di-hydro-Uracil entity.
The newly formed cyclic structures in step d) are 5-7 membered rings
In one embodiment, the method further comprises step f), i.e. reversing the ring formation of step d) thereby obtaining a 5-substituted Uracil.
Since the naturally occurring methylation of DNA is 5mC, the method may also comprise a step of converting 5mC to 5fC, 5hmC or 5caC prior to step a). Thus, the present invention is facilitating the methylation status analysis of naturally occurring DNA.
In a specific embodiment, said nucleic acid comprises at least one 5fC residue. In this case, the first reactive group may be a C-H acidic group, an amine or a phosphorus ylide. In another specific embodiment, said nucleic acid comprises at least one 5hmC residue. Then, the first reactive group may constitute a Glycosyl Transferase substrate. In still another embodiment, said nucleic acid comprises a 5caC residue. Then, the first reactive group may be an amine.
In all cases, the said second reactive group may be a nucleophilic group selected from a group consisting of Thiol or Sulfinate.
All methods as disclosed above may further comprise a subsequent step of amplifying said nucleic acid, which may be preferably executed by means of PCR amplification. As well, all methods disclosed above may further comprise the step of a subsequent step of sequencing said nucleic acid either with or without a prior amplification step, which is preferably a PCR amplification.
Brief description of Figures
Fig. 1 provides a scheme for detection of methylated sequences according to the present invention.
Fig. 2 shows a reaction scheme for modification of 5fC by means of a Knoevenagel condensation using a reactant with a C-H acidic group
Fig. 3 shows a reaction scheme for modification of 5fC by means of an aldol condensation reaction with a CH-acidic reagent comprising a sulfinate moiety
Fig. 4 shows a reaction scheme for modification of 5fC by means of a Wittig reaction.
Fig. 5 shows a specific example for a Wittig reaction according to Fig. 4 with a sulfinate modified Wittig reagent.
Fig. 6 shows a reaction scheme for modification of 5fC by means of reductive amination
Fig: 7 shows a specific example for a reductive amination as in Fig. 6, where 5fC is reacted with 2-aminomethyl sulfinate Fig. 8 shows a reaction scheme for the modification of hmC using UDP-glucose substituted at the 2-position of the glucose moiety with a nucleophile as a reagent and executing a subsequent beta-glucosyl-transferase reaction.
Fig. 9 shows a similar reaction scheme as example 8, wherein 2-thio-glucose-UDP is used.
Fig. 10 shows an example for modification of the carboxyl group of caC.
Detailed description of the invention
Abbreviations
C - cytosine or cytidine
T - thymine or thymidine
U - uracil or uridine
DHU - dihydrouracil or dihydrouridine
5mC - 5-methylcytosine or 5-methylcytidine
5hmC - 5 -hydroxymethyl cytosine or 5 -hydroxymethyl cytidine
5fC - 5-formylcytosine or 5-formylcytidine
5caC - 5-carboxylcytosine or 5-carboxylcytidine dC - 2’ -deoxy cytidine dU - 2’ -deoxyuridine
TET - ten-eleven translocation dioxygenase
TAPS - TET-assisted pyridine-borane sequencing
CAPS - chemically-assisted pyridine-borane sequencing Definitions
A nucleophilic group is a functional group that is attractive to electron-poor or positively charged centers and donates electrons, in particular donating an electron pair to an electrophile or an electrophilic center to form a covalent bond. Nucleophilic groups are for example alcohols, alcoholates, thiols, thiolates, amines, sulfinates or carbanions. The term deamination refers herein to a substitution reaction of an exocyclic amino group by a hydroxyl group, in particular substitution of the amino group at the C-4 position of a cytosine nucleobase to convert into a uracil nucleobase. Deamination at C-4 position is enhanced in 5,6-dihydrocytosines to give 5,6-dihydrouracil derivatives, as e.g. being applied after bisulfite addition to the C5- C6 double bond of cytosine.
The term C-H acidic group is used herein for a carbon atom within a compound which carries at least one electron-withdrawing group and thus being more easily deprotonated by a base. Electron-withdrawing groups are for example carbonyl containing groups, a nitro group, a cyano group, a sulfoxide or sulfone group.
The term Wittig reagent or phosphorus ylide, as used herein, refers to neutral dipolar molecules containing a formally negatively charged carbon atom (a carbanion) directly attached to a phosphorus atom with a formal positive charge. In phosphorus ylides (phosphonium ylides) the two adjacent carbon and phosphorus atoms are connected by both a covalent and an ionic bond. Phosphorus ylides are thus 1,2- dipolar compounds and a subclass of zwitterions. In a Wittig reaction triphenylphosphonium ylides are reacted with aldehydes or ketones to give an alkene bond. In a Homer-Wadsworth-Emmons reaction stabilized phosphonate carbanions are used as reagent.
A Glycosyl Transferase Substrate, as used herein, refers to a reagent comprising an activation group, e.g. a UDP moiety, and a sugar residue, for example a glucose or substituted glucose. Glycosyltransferases catalyze the transfer of the sugar moiety from an activated donor sugar (the glycosyl transferase substrate) onto a saccharide or non- saccharide acceptor. The non-saccharide acceptor can be for example 5- hydroxymethyl-dC. T4 Phage B-glucosyltransferase, for example from New England Biolabs, specifically transfers a glucose moiety of uridine diphosphoglucose to the 5-hydroxymethylcytosine (5-hmC) residues in double-stranded DNA, making beta- glucosyl-5-hydroxymethylcytosine.
Methods for DNA methylation status analysis by means of modification, followed by sequencing
A general scheme, how methylated C-residues mC are detected is schematically shown in Fig. 1. Without any particular treatment, the epigenetic dC modifications 5-methyl-dC (mC), 5-hydroxymethyl-dC (hmC), 5-formyl-dC (fC) and 5-carboxyl- dC (caC) are read as unmodified dC when amplified and/or sequenced. However, oxidation of mC by enzymatic and/or chemical means results in oxidized mC modifications hmC, fC and caC (general: xC).
Oxidation or conversion of 5mC to 5fC can be achieved by using TET enzymes (ten- eleven translocation (TET) methylcytosine dioxygenases). The same class of enzymes are suitable for converting 5mC to 5hmC. 5hmC can be further oxidized to 5fC enzymatically by a laccase enzyme or chemically using either KRuO4 or Cu(II)/TEMPO. Further details are disclosed in Pfeifer et al. Epigenetics & Chromatin 2013, 6: 10, p.1-9.
Subsequently, 5fC, 5hmC or caC are reacted with a reagent of the invention comprising two reactive groups, the first reactive group forming a covalent bond with hmC, fC or caC and thus introducing the second reactive group which comprises a nucleophile. By intramolecular addition of the nucleophile to the C5-C6 double bond of hmC, fC or caC deamination at C4 position is initiated resulting in a modified dU derivative which is read in a subsequent amplification or sequencing step as a T, thus enabling detection of epigenetic modification.
Such a sequencing according to the present invention allows the generation of data in the methylation status of the DNA of interest. Thus, the present also provides a method for analyzing the methylation status of a naturally occurring DNA comprising the steps of a) converting 5mC to 5fC, 5hmC or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC, d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6 di-hydro Cytosine entity, and e) deaminating said 5,6 di-hydro Cytosine entity to a 5,6 di-hydro-Uracil entity.
In some embodiments, the sample is derived from a subject or a patient. In some embodiments the sample may comprise a fragment of a solid tissue or a solid tumor derived from the subject or the patient, e.g., by biopsy. The sample may also comprise body fluids that may contain nucleic acids (e.g., urine, sputum, serum, blood or blood fractions, i.e., plasma, lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid, bile, gastric fluid, intestinal fluid, or fecal samples). In other embodiments, the sample is a cultured sample, e.g., a tissue culture containing cells and fluids from which nucleic acids may be isolated. In some embodiments, the nucleic acids of interest in the sample come from infectious agents such as viruses, bacteria, protozoa or fungi. The present invention involves manipulating isolated nucleic acids isolated or extracted from a sample. Methods of nucleic acid extraction are well known in the art (see, Sambrook et al., & quot; Molecular Cloning: A Laboratory Manual, & quot; 1989, 2nd Ed., Cold Spring Harbor Laboratory Press: New York, N. Y.). A variety of kits are commercially available for extracting nucleic acids (DNA or RNA) from biological samples (e.g., KAPA Express Extract (Roche Sequencing Solutions, Pleasanton, Cal.) and other similar products from BD Biosciences Clontech (Palo Alto, Cal.), Epicentre Technologies (Madison, Wise.); Gentra Systems, (Minneapolis, Minn.); and Qiagen (Valencia, Cal.), Ambion (Austin, Tex.); BioRad Laboratories (Hercules, Cal.); and more. The present invention involves detecting an epigenetic modification, specifically, an epigenetic cytosine modification in nucleic acids (including, but not limited to, cytosine methylation). The nucleic acid sequences that are subject to conditional epigenetic modification are the target sequences analyzed by the method disclosed herein. The same nucleic acid sequence may or may not have the epigenetic modification characterized by methylation of cytosines at the 5-position (5mC or 5hmC). In some embodiments, a set or a panel of target nucleic acids are probed for the presence of methylation. For example, as shown in Patai, et al. “Comprehensive DNA Methylation Analysis Reveals a Common Ten-Gene Methylation Signature in Colorectal Adenomas and Carcinomas” PLOS ONE 10(8): e0133836 (2015), and in Onwuka, et al., “A panel of DNA methylation signature from peripheral blood may predict colorectal cancer susceptibility,” BMC Cancer 20, 692 (2020), methylation of biomarkers in a panel of methylation biomarkers is indicative of the presence of colorectal cancer in the patient. Accordingly, testing any known or future panels of methylation biomarkers for prognostic or diagnostic purposes is envisioned with the method disclosed herein. In some embodiments, the entire genome of an organism is probed for the presence of methylation. The method of the instant invention includes detecting methylation in all sites throughout the genome of an organism to diagnose a disease or condition or predisposition to a disease or condition using the sequence analysis and artificial intelligence tools described, e.g., in Shull, et al. “Sequencing the cancer methylome,” Methods Mol Biol. 1238:627-635 (2015).
In some embodiments, the method of detecting epigenetic modifications includes sequencing. The nucleic acid processed as described herein is subjected to sequencing; preferably, massively parallel single molecule sequencing. Analyzing individual molecules by massively parallel sequencing typically requires a separate level of barcoding for sample identification and error correction. The use of molecular barcodes such as described in U.S. Patent Nos. 7,393,665, 8,168,385, 8,481,292, 8,685,678, and 8,722,368. A unique molecular barcode is added to each molecule to be sequenced to mark molecule and its progeny (e.g., the original molecule and its amplicons generated by PCR). The unique molecular barcode (UID) has multiple uses including counting the number of original target molecules in the sample and error correction (Newman, et al., “An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage,” Nature Medicine doi: 10.1038/nm.3519 (2014)).
Nanopore sequencing is a unique, scalable technology that enables direct, real-time analysis of long DNA or RNA fragments. It works by monitoring changes to an electrical current as nucleic acids are passed through a protein nanopore. The resulting signal is decoded to provide the specific DNA or RNA sequence. Nanopore-based sequencing technology detects the unique electrical signals of different molecules as they pass through the nanopore with a semiconductor-based electronic detection system. This technology makes for a high throughput, cost effective sequencing solution. At the heart of the technology is the biological nanopore, a protein pore embedded in a membrane, while the brains of the technology lie in the electronics of a semiconductor integrated circuit and proprietary chemistries. The electronic sensor technology embedded in the chip enables automatic membrane assembly and nanopore insertion, while allowing for active control of individual sensors on the circuit. Different sequencing chemistries can be paired with the nanopore and electronic sensor technology to enable high throughput, high accuracy sequencing with faster time-to-data.
In some embodiments, unique molecular barcodes (UIDs) are used for sequencing error correction. The entire progeny of a single target molecule is marked with the same barcode and forms a barcoded family. A variation in the sequence not shared by all members of the barcoded family is discarded as an artefact. Barcodes can also be used for positional deduplication and target quantification, as the entire family represents a single molecule in the original sample (Newman, et al. “Integrated digital error suppression for improved detection of circulating tumor DNA,” Nature Biotechnology 34:547 (2016)).
In some embodiments, the method involves forming a library comprising nucleic acids from a sample. The library consists of a plurality of nucleic acids ready for sequencing or another type of detection method, e.g., PCR. A library can be stored and used multiple times for further processing such as amplification or sequencing of the nucleic acids in the library. In some embodiments, the library is the input nucleic acid in which methylation is detected by the method described herein. In other embodiments, the library is formed from nucleic acids that have undergone the methylation detection reactions described herein.
In some embodiments, the nucleic acids processed for detection of epigenetic modifications according to the method described herein are sequenced. Any of a number of sequencing technologies or sequencing assays can be utilized. The term “Next Generation Sequencing (NGS)” as used herein refers to sequencing methods that allow for massively parallel sequencing of clonally amplified molecules and of single nucleic acid molecules.
Non-limiting examples of sequence assays that are suitable for use with the methods disclosed herein include nanopore sequencing (U.S. Patent Publication Nos. US2013/0244340, US2013/0264207, US2014/0134616, US2015/0119259 and US2015/0337366), Sanger sequencing, capillary array sequencing, thermal cycle sequencing (Sears, et al., Biotechniques, 13:626-633 (1992)), solid-phase sequencing (Zimmerman, et al., Methods Mol. Cell Biol., 3:39-42 (1992)), sequencing with mass spectrometry such as matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF/MS; Fu, et al., Nature Biotech., 16:381-384 (1998)), sequencing by hybridization (Drmanac et al., Nature Biotech., 16:54-58 (1998), and NGS methods, including but not limited to sequencing by synthesis (e.g., HiSeq™, MiSeq™, or Genome Analyzer, each available from Illumina), sequencing by ligation (e.g., SOLiD™, Life Technologies), ion semiconductor sequencing (e.g., Ion Torrent™, Life Technologies), and SMRT® sequencing (e.g., Pacific Biosciences).
Commercially available sequencing technologies include: sequencing-by- hybridization platforms from Affymetrix Inc. (Sunnyvale, Calif.), sequencing-by- synthesis platforms from Illumina/Solexa (San Diego, Calif.) and Helicos Biosciences (Cambridge, Mass.), sequencing-by-ligation platform from Applied Biosystems (Foster City, Calif.). Other sequencing technologies include, but are not limited to, the Ion Torrent technology (ThermoFisher Scientific), and nanopore sequencing (Genia Technology, Roche Sequencing Solutions, Santa Clara, Cal.), Oxford Nanopore Technologies (Oxford, UK), and sequencing by expansion (Stratos Genomics, Roche Sequencing Solutions). In some embodiments, the sequencing step involves sequence aligning. In some embodiments, aligning is used to determine a consensus sequence from a plurality of sequences, e.g., a plurality having the same unique molecular ID (UID). The molecular ID is a barcode that can be added to each molecule prior to sequencing or if amplification step is included, prior to the amplification step. In some embodiments, a UID is present in the 5 ’-portion of the RT primer. Similarly, a UID can be present in the 5 ’-end of the last barcode subunit to be added to the compound barcode. In other embodiments, a UID is present in an adaptor and is added to one or both ends of the target nucleic acid by ligation.
In some embodiments, a consensus sequence is determined from a plurality of sequences all having an identical UID. The sequences having an identical UID are presumed to derive from the same original molecule through amplification. In other embodiments, UID is used to eliminate artifacts, i.e., variations existing in the progeny of a single molecule (characterized by a particular UID). Such artifacts resulting from PCR errors or sequencing errors can be eliminated using UIDs.
In some embodiments, the number of each sequence in the sample can be quantified by quantifying relative numbers of sequences with each UID among the population having the same multiplex sample ID (MID). Each UID represents a single molecule in the original sample and counting different UIDs associated with each sequence variant can determine the fraction of each sequence variant in the original sample, where all molecules share the same MID. A person skilled in the art will be able to determine the number of sequence reads necessary to determine a consensus sequence. In some embodiments, the relevant number is reads per UID (“sequence depth”) necessary for an accurate quantitative result. In some embodiments, the desired depth is 5-50 reads per UID. Modification of 5fC
Modification of 5fC according to the present invention can be achieved if the first reactive group of the reagent according to the present invention is a C-H acidic group, an amine or a phosphorus ylide.
If the first reactive group is a C-H acidic group, the reaction can be executed as a Knoevenagel condensation, which is schematically depicted in Fig. 2. The CH-acidic reagent comprising a nucleophile group is added via Knoevenagel conditions to fC, then subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling the deamination at C4 to obtain a “T-equivalent”. Yields of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by a photoisomerization of the double bond and/or by applying a radical initiation or a catalyst. If desirable, a reversal of the ring closure can be achieved under alkaline conditions.
Alternatively, an aldol condensation reaction of 5fC with a CH-acidic reagent comprising a sulfinate moiety (e.g. 1 -(methyl sulfonyl)methane sulfinate), a subsequent addition of a sulfinate moiety at the C-6 position of the substituted cytosine, followed by deamination and desulfmation (5fC to T conversion) can be executed (Fig. 3) A desulfmation step under alkaline conditions may even not be required. Instead of a sulfinate moiety, a thiol moiety can also be applied as nucleophile, addition reaction to the C5-C6 double bond can be achieved by means of applying thiol-ene chemistry using a radical initiator or a catalyst. The yields may be increased by photoisomerization of the newly formed double bond.
Modification of 5fC according to the present invention can also be done by means of a Wittig reaction (Fig. 4). I this case, the reagent according to the present invention is a phosphorus ylide comprising a nucleophilic group. The reagent is added via Wittig conditions to 5fC, then a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling a deamination at the C4 position in order to obtain a “T-equivalent”.
Details of the reaction are shown for a particular example in Fig. 5. In this case, 5fC is reacted with a sulfinate modified Wittig reagent. Subsequently there is an addition of the sulfinate moiety at the C-6 position of substituted cytosine, followed by a deamination and desulfination (5fC to T conversion). The desulfination step under alkaline conditions may not be needed necessarily. Alternatively, instead of a sulfinate moiety, a thiol moiety can also be used by means of applying thiol-ene chemistry, using a radical initiator or a catalyst. Yields of intramolecular addition of the nucleophile to the C5-C6 double bond may be increased by photoisomerization of the newly formed double bond, and/or by applying radical initiation or a catalyst. If desirable, a reversal of the ring closure can be achieved under alkaline conditions. Alternatively, the reaction can be performed using Horner-Wadsworth-Emmons reaction conditions, in particular using a sulfinate group as nucleophile.
Yet another alternative is the modification of 5fC by means of reductive amination as shown in Fig. 6. In this case, the reagent of the invention comprising an amine and a further nucleophilic group is added via reductive amination to 5fC. Subsequently there is an intramolecular addition of the nucleophile to the C5-C6 double bond, which enables the deamination at the C4 position to obtain a “T- equivalent”. Reversal of ring closure can be achieved under alkaline conditions, if necessary. Yields can be increased by applying radical initiation or a catalyst.
A more detailed example for implementing a reductive amination reaction is illustrated in Fig. 7. 5fC is reacted with 2-aminomethyl sulfinate, and subsequently addition of the sulfinate moiety at C-6 position of substituted cytosine takes place. This is followed by a deamination and desulfination step, which converts 5fC to T. Again, a desulfination step under alkaline conditions may not be needed. Also again, the sulfinate moiety may be replaced by a thiol moiety using thiol-ene chemistry for the addition reaction to double bond C5-C6, which optionally includes the use of a radical initiator or a catalyst.
Modification of hmC:
Fig. 8 schematically shows a reaction scheme for the modification of hmC. In this case, the reagent used according to the present invention is a UDP -glucose, which is modified at the 2-position with a nucleophilic group. It is added via a B-glucosyl transferase enzyme to hmC. Then, a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling the deamination at the C4 position in order to obtain a “T-equivalent”. As discussed above, yields of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by applying radical initiation or a catalyst and a reversal of the ring closure can be achieved under alkaline conditions, if necessary.
An alternative example is shown in Fig. 9. In this case, 5hmC is reacted with 2-thio- glucose-UDP and B-glucosyltransferase. Subsequently, an addition of the thiol moiety at the C6 position of substituted cytosine applying thiol-ene chemistry and optionally using a radical initiator or a catalyst is taking place, which is followed by a deamination reaction, resulting in a 5hmC to T conversion. Alternatively, 2- Sulfinato-glucose-UDP can also be applied as substrate for B-glucosyltransferase.
Modification of caC:
For this embodiment, which is shown in Fig. 10, the method of the present invention uses a reagent comprising an amine and a further nucleophilic group. It is added after an EDC/NHS activation of the carboxyl group of caC. This results in caC forming an amide bond, and a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond, thereby enabling the deamination at the C4 position in order to obtain a “T-equivalent”. Again, the yield of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by means of applying radical initiation or a catalyst and reversal of the ring closure can be achieved under alkaline conditions.

Claims

Patent Claims
1. A method comprising a) providing a nucleic acid comprising 5fC, 5hmC, or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl, or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl, or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC, d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6-di-hydro Cytosine entity, and e) deaminating said 5,6-di-hydro Cytosine entity to a 5,6-di-hydro-Uracil entity.
2. Method according to claim 1, further comprising step f) reversing the ring formation of step d) thereby obtaining a 5- substituted Uracil.
3. Method of claim 1-2, further comprising the step of converting 5mC to 5fC, 5hmC or 5caC prior to step a) .
4. A method according to claim 1-3, wherein said nucleic acid comprises 5fC.
5. Method according to claim 4, wherein the first reactive group is a C-H acidic group, an amine or a phosphorus ylide.
6. Method according to claim 1-3, wherein said nucleic acid comprises 5hmC.
7. Method according to claim 6, wherein the first reactive group is a Glycosyl Transferase substrate.
8. Method according to claim 1-3, wherein said nucleic acid comprises 5caC.
9. Method according to claim 8, wherein the first reactive group is an amine.
10. Method according to claims 1-9, wherein said second reactive group is a nucleophilic group selected from a group consisting of Thiol or Sulfinate.
11. Method according to claims 1-10, further comprising a subsequent step of amplifying said nucleic acid.
12. Method according to claims 1-11 further comprising the subsequent step of sequencing said nucleic acid.
13. A composition comprising a nucleic acid treated according to the method of claims 1-10.
14. A kit for performing a method according to claims 1-12.
PCT/EP2023/072497 2022-08-18 2023-08-16 Detection of epigenetic modifications WO2024038069A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP22191052 2022-08-18
EP22191052.4 2022-08-18

Publications (1)

Publication Number Publication Date
WO2024038069A1 true WO2024038069A1 (en) 2024-02-22

Family

ID=83004497

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2023/072497 WO2024038069A1 (en) 2022-08-18 2023-08-16 Detection of epigenetic modifications

Country Status (1)

Country Link
WO (1) WO2024038069A1 (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7393665B2 (en) 2005-02-10 2008-07-01 Population Genetics Technologies Ltd Methods and compositions for tagging and identifying polynucleotides
US8481292B2 (en) 2010-09-21 2013-07-09 Population Genetics Technologies Litd. Increasing confidence of allele calls with molecular counting
US20130244340A1 (en) 2012-01-20 2013-09-19 Genia Technologies, Inc. Nanopore Based Molecular Detection and Sequencing
US20130264207A1 (en) 2010-12-17 2013-10-10 Jingyue Ju Dna sequencing by synthesis using modified nucleotides and nanopore detection
US20140134616A1 (en) 2012-11-09 2014-05-15 Genia Technologies, Inc. Nucleic acid sequencing using tags
US20150119259A1 (en) 2012-06-20 2015-04-30 Jingyue Ju Nucleic acid sequencing by nanopore detection of tag molecules
US20150337366A1 (en) 2012-02-16 2015-11-26 Genia Technologies, Inc. Methods for creating bilayers for use with nanopore sensors
WO2022096751A1 (en) 2020-11-09 2022-05-12 Bluecat Ip Limited Methods for detection of nucleotide modification

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7393665B2 (en) 2005-02-10 2008-07-01 Population Genetics Technologies Ltd Methods and compositions for tagging and identifying polynucleotides
US8168385B2 (en) 2005-02-10 2012-05-01 Population Genetics Technologies Ltd Methods and compositions for tagging and identifying polynucleotides
US8481292B2 (en) 2010-09-21 2013-07-09 Population Genetics Technologies Litd. Increasing confidence of allele calls with molecular counting
US8685678B2 (en) 2010-09-21 2014-04-01 Population Genetics Technologies Ltd Increasing confidence of allele calls with molecular counting
US8722368B2 (en) 2010-09-21 2014-05-13 Population Genetics Technologies Ltd. Method for preparing a counter-tagged population of nucleic acid molecules
US20130264207A1 (en) 2010-12-17 2013-10-10 Jingyue Ju Dna sequencing by synthesis using modified nucleotides and nanopore detection
US20130244340A1 (en) 2012-01-20 2013-09-19 Genia Technologies, Inc. Nanopore Based Molecular Detection and Sequencing
US20150337366A1 (en) 2012-02-16 2015-11-26 Genia Technologies, Inc. Methods for creating bilayers for use with nanopore sensors
US20150119259A1 (en) 2012-06-20 2015-04-30 Jingyue Ju Nucleic acid sequencing by nanopore detection of tag molecules
US20140134616A1 (en) 2012-11-09 2014-05-15 Genia Technologies, Inc. Nucleic acid sequencing using tags
WO2022096751A1 (en) 2020-11-09 2022-05-12 Bluecat Ip Limited Methods for detection of nucleotide modification

Non-Patent Citations (14)

* Cited by examiner, † Cited by third party
Title
ANDRÃ GRAND ET AL: "Comparison of the mechanism of deamination of 5,6-dihydro-5-methylcytosine with other cytosine derivatives", THEORETICAL CHEMISTRY ACCOUNTS ; THEORY, COMPUTATION, AND MODELING, SPRINGER, BERLIN, DE, vol. 131, no. 4, 4 April 2012 (2012-04-04), pages 1 - 11, XP035046539, ISSN: 1432-2234, DOI: 10.1007/S00214-012-1187-X *
C. ZHU ET AL., CELL STEM CELL, vol. 20, 2017, pages 720 - 731
DRMANAC ET AL., NATURE BIOTECH., vol. 16, 1998, pages 381 - 384
F. GALARDI ET AL., BIOMOLECULES, vol. 10, 2020, pages 1677
L. ZHAO ET AL., PROTEIN CELL, vol. 11, 2020, pages 792 - 808
NEWMAN ET AL.: "An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage", NATURE MEDICINE, 2014
NEWMAN ET AL.: "Integrated digital error suppression for improved detection of circulating tumor DNA", NATURE BIOTECHNOLOGY, vol. 34, 2016, pages 547, XP055802486, DOI: 10.1038/nbt.3520
ONWUKA ET AL.: "A panel of DNA methylation signature from peripheral blood may predict colorectal cancer susceptibility", BMC CANCER, vol. 20, 2020, pages 692
PATAI ET AL.: "Comprehensive DNA Methylation Analysis Reveals a Common Ten-Gene Methylation Signature in Colorectal Adenomas and Carcinomas", PLOS ONE, vol. 10, no. 8, 2015, pages 0133836
PFEIFER ET AL., EPIGENETICS & CHROMATIN, vol. 6, no. 10, 2013, pages 1 - 9
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS
SEARS ET AL., BIOTECHNIQUES, vol. 13, 1992, pages 626 - 633
SHULL ET AL.: "Sequencing the cancer methylome", METHODS MOL BIOL., vol. 1238, 2015, pages 627 - 635
ZIMMERMAN ET AL., METHODS MOL. CELL BIOL., vol. 3, 1992, pages 39 - 42

Similar Documents

Publication Publication Date Title
US10808244B2 (en) Method of normalizing biological samples
DK2737085T3 (en) METHODS FOR DETECTING NUCLEOTID MODIFICATION
US20230235380A1 (en) Methods for the Epigenetic Analysis of DNA, Particularly Cell-Free DNA
EP3146075B1 (en) Ion sensor dna and rna sequencing by synthesis using nucleotide reversible terminators
US20180327828A1 (en) Ion sensor dna and rna sequencing by synthesis using nucleotide reversible terminators
WO2002021098A2 (en) A method for identifying polymorphisms
US11608518B2 (en) Methods for analyzing nucleic acids
US20210380971A1 (en) Simultaneous, sequencing-based analysis of proteins, nucleosomes, and cell-free nucleic acids from a single biological sample
EP1117838B1 (en) A method for analyzing polynucleotides
Feng et al. Recent advancements in intestinal microbiota analyses: a review for non-microbiologists
US20200040390A1 (en) Methods for Sequencing Repetitive Genomic Regions
WO2024038069A1 (en) Detection of epigenetic modifications
TW202302861A (en) Methods for accurate parallel quantification of nucleic acids in dilute or non-purified samples
WO2023242075A1 (en) Detection of epigenetic cytosine modification
Shetty et al. Introduction to nucleic acid sequencing
WO2023125898A1 (en) Nucleic acid testing method and system
CN113373207A (en) Methods for determining cytosine modifications
Tost Current and emerging technologies for the analysis of the genome-wide and locus-specific DNA methylation patterns
US20240093300A1 (en) Methylated dna fragment enrichment, methods, compositions and kits
WO2024076981A2 (en) Tet-assisted pyridine borane sequencing
Różalski et al. Recent approaches to the liquid chromatography-mass spectrometry analysis of modified deoxynucleosides as biomarkers in clinical research
Monisha et al. RNA-SEQ: A High-Revolution View of the Transcriptome
Honisch The World of Nucleic Acid-Based Mass Spectrometry for Microbial and Viral Detection
EP4291677A2 (en) Methods for base-level detection of methylation in nucleic acids
WO2023287876A1 (en) Efficient duplex sequencing using high fidelity next generation sequencing reads

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23757600

Country of ref document: EP

Kind code of ref document: A1