WO2024038069A1 - Detection of epigenetic modifications - Google Patents
Detection of epigenetic modifications Download PDFInfo
- Publication number
- WO2024038069A1 WO2024038069A1 PCT/EP2023/072497 EP2023072497W WO2024038069A1 WO 2024038069 A1 WO2024038069 A1 WO 2024038069A1 EP 2023072497 W EP2023072497 W EP 2023072497W WO 2024038069 A1 WO2024038069 A1 WO 2024038069A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- group
- sequencing
- 5hmc
- reactive group
- nucleic acid
- Prior art date
Links
- 238000001514 detection method Methods 0.000 title description 11
- 230000004049 epigenetic modification Effects 0.000 title description 10
- 238000000034 method Methods 0.000 claims abstract description 49
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 claims abstract description 36
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 34
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 32
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 32
- 229940104302 cytosine Drugs 0.000 claims abstract description 18
- OIVLITBTBDPEFK-UHFFFAOYSA-N 5,6-dihydrouracil Chemical compound O=C1CCNC(=O)N1 OIVLITBTBDPEFK-UHFFFAOYSA-N 0.000 claims abstract description 13
- 230000000269 nucleophilic effect Effects 0.000 claims abstract description 13
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 claims abstract description 10
- 125000002485 formyl group Chemical group [H]C(*)=O 0.000 claims abstract description 7
- -1 formyl hydroxymethyl Chemical group 0.000 claims abstract description 7
- 125000004029 hydroxymethyl group Chemical group [H]OC([H])([H])* 0.000 claims abstract description 7
- 239000000376 reactant Substances 0.000 claims abstract description 5
- 125000002619 bicyclic group Chemical group 0.000 claims abstract description 4
- 238000012163 sequencing technique Methods 0.000 claims description 45
- DOUHZFSGSXMPIE-UHFFFAOYSA-N hydroxidooxidosulfur(.) Chemical compound [O]SO DOUHZFSGSXMPIE-UHFFFAOYSA-N 0.000 claims description 13
- 150000001412 amines Chemical class 0.000 claims description 8
- 229910052698 phosphorus Inorganic materials 0.000 claims description 8
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 claims description 7
- 230000002378 acidificating effect Effects 0.000 claims description 7
- 239000011574 phosphorus Substances 0.000 claims description 7
- 150000003573 thiols Chemical class 0.000 claims description 7
- 239000000758 substrate Substances 0.000 claims description 6
- 102000051366 Glycosyltransferases Human genes 0.000 claims description 5
- 108700023372 Glycosyltransferases Proteins 0.000 claims description 5
- FGUUSXIOTUKUDN-IBGZPJMESA-N C1(=CC=CC=C1)N1C2=C(NC([C@H](C1)NC=1OC(=NN=1)C1=CC=CC=C1)=O)C=CC=C2 Chemical compound C1(=CC=CC=C1)N1C2=C(NC([C@H](C1)NC=1OC(=NN=1)C1=CC=CC=C1)=O)C=CC=C2 FGUUSXIOTUKUDN-IBGZPJMESA-N 0.000 claims description 2
- 238000007363 ring formation reaction Methods 0.000 claims description 2
- 238000006243 chemical reaction Methods 0.000 description 20
- 230000004048 modification Effects 0.000 description 19
- 238000012986 modification Methods 0.000 description 19
- 238000005516 engineering process Methods 0.000 description 17
- 239000012038 nucleophile Substances 0.000 description 16
- 239000000523 sample Substances 0.000 description 16
- 239000003153 chemical reaction reagent Substances 0.000 description 14
- 238000006481 deamination reaction Methods 0.000 description 14
- 230000011987 methylation Effects 0.000 description 14
- 238000007069 methylation reaction Methods 0.000 description 14
- 108020004414 DNA Proteins 0.000 description 13
- 230000009615 deamination Effects 0.000 description 13
- 239000003054 catalyst Substances 0.000 description 9
- 230000003321 amplification Effects 0.000 description 7
- 238000003199 nucleic acid amplification method Methods 0.000 description 7
- 230000003647 oxidation Effects 0.000 description 7
- 238000007254 oxidation reaction Methods 0.000 description 7
- 102000004190 Enzymes Human genes 0.000 description 6
- 108090000790 Enzymes Proteins 0.000 description 6
- 206010028980 Neoplasm Diseases 0.000 description 6
- 230000002255 enzymatic effect Effects 0.000 description 6
- NNTOJPXOCKCMKR-UHFFFAOYSA-N boron;pyridine Chemical compound [B].C1=CC=NC=C1 NNTOJPXOCKCMKR-UHFFFAOYSA-N 0.000 description 5
- 239000003999 initiator Substances 0.000 description 5
- 230000000977 initiatory effect Effects 0.000 description 5
- 238000006268 reductive amination reaction Methods 0.000 description 5
- 238000006798 ring closing metathesis reaction Methods 0.000 description 5
- OXBLVCZKDOZZOJ-UHFFFAOYSA-N 2,3-Dihydrothiophene Chemical compound C1CC=CS1 OXBLVCZKDOZZOJ-UHFFFAOYSA-N 0.000 description 4
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 description 4
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical class O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 4
- 238000007239 Wittig reaction Methods 0.000 description 4
- 229910052799 carbon Inorganic materials 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 239000012530 fluid Substances 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- RYVNIFSIEDRLSJ-UHFFFAOYSA-N 5-hydroxymethylcytosine Natural products NC=1NC(=O)N=CC=1CO RYVNIFSIEDRLSJ-UHFFFAOYSA-N 0.000 description 3
- BTIWPBKNTZFNRI-XLPZGREQSA-N 5-hydroxymethyldeoxycytidylic acid Chemical compound C1=C(CO)C(N)=NC(=O)N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 BTIWPBKNTZFNRI-XLPZGREQSA-N 0.000 description 3
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 3
- 108091035707 Consensus sequence Proteins 0.000 description 3
- 230000007067 DNA methylation Effects 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 208000034953 Twin anemia-polycythemia sequence Diseases 0.000 description 3
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Natural products O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 3
- 239000012445 acidic reagent Substances 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 239000000090 biomarker Substances 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 125000004432 carbon atom Chemical group C* 0.000 description 3
- 210000004027 cell Anatomy 0.000 description 3
- 238000012937 correction Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 238000007672 fourth generation sequencing Methods 0.000 description 3
- 238000010348 incorporation Methods 0.000 description 3
- 150000002500 ions Chemical class 0.000 description 3
- CUONGYYJJVDODC-UHFFFAOYSA-N malononitrile Chemical compound N#CCC#N CUONGYYJJVDODC-UHFFFAOYSA-N 0.000 description 3
- 238000007481 next generation sequencing Methods 0.000 description 3
- 238000007699 photoisomerization reaction Methods 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 108090000623 proteins and genes Proteins 0.000 description 3
- 239000004065 semiconductor Substances 0.000 description 3
- NBAKTGXDIBVZOO-UHFFFAOYSA-N 5,6-dihydrothymine Chemical compound CC1CNC(=O)NC1=O NBAKTGXDIBVZOO-UHFFFAOYSA-N 0.000 description 2
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 2
- 206010009944 Colon cancer Diseases 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 238000006000 Knoevenagel condensation reaction Methods 0.000 description 2
- 206010036790 Productive cough Diseases 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical group O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 238000007259 addition reaction Methods 0.000 description 2
- 238000005882 aldol condensation reaction Methods 0.000 description 2
- 125000003277 amino group Chemical group 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000001369 bisulfite sequencing Methods 0.000 description 2
- ZPTBLXKRQACLCR-XVFCMESISA-N dihydrouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)CC1 ZPTBLXKRQACLCR-XVFCMESISA-N 0.000 description 2
- 125000006575 electron-withdrawing group Chemical group 0.000 description 2
- 230000001973 epigenetic effect Effects 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 238000001840 matrix-assisted laser desorption--ionisation time-of-flight mass spectrometry Methods 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 239000002777 nucleoside Substances 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 125000004437 phosphorous atom Chemical group 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 238000006722 reduction reaction Methods 0.000 description 2
- 238000007841 sequencing by ligation Methods 0.000 description 2
- 210000003802 sputum Anatomy 0.000 description 2
- 208000024794 sputum Diseases 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000005945 translocation Effects 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- GUAHPAJOXVYFON-ZETCQYMHSA-N (8S)-8-amino-7-oxononanoic acid zwitterion Chemical compound C[C@H](N)C(=O)CCCCCC(O)=O GUAHPAJOXVYFON-ZETCQYMHSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- OCMSXKMNYAHJMU-JXOAFFINSA-N 4-amino-1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-2-oxopyrimidine-5-carbaldehyde Chemical compound C1=C(C=O)C(N)=NC(=O)N1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 OCMSXKMNYAHJMU-JXOAFFINSA-N 0.000 description 1
- SVRWPYGLQBPNNJ-UAKXSSHOSA-N 4-amino-1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-2-oxopyrimidine-5-carboxylic acid Chemical compound C1=C(C(O)=O)C(N)=NC(=O)N1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 SVRWPYGLQBPNNJ-UAKXSSHOSA-N 0.000 description 1
- FPOVCZDHZSAAIX-UHFFFAOYSA-N 4-amino-5,6-dihydro-1h-pyrimidin-2-one Chemical class NC1=NC(=O)NCC1 FPOVCZDHZSAAIX-UHFFFAOYSA-N 0.000 description 1
- SBHSUMUTJOPRIK-HPFNVAMJSA-N 5-(beta-D-glucosylmethyl)cytosine Chemical compound NC1=NC(=O)NC=C1CO[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 SBHSUMUTJOPRIK-HPFNVAMJSA-N 0.000 description 1
- NFEXJLMYXXIWPI-JXOAFFINSA-N 5-Hydroxymethylcytidine Chemical compound C1=C(CO)C(N)=NC(=O)N1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 NFEXJLMYXXIWPI-JXOAFFINSA-N 0.000 description 1
- ZAYHVCMSTBRABG-UHFFFAOYSA-N 5-Methylcytidine Natural products O=C1N=C(N)C(C)=CN1C1C(O)C(O)C(CO)O1 ZAYHVCMSTBRABG-UHFFFAOYSA-N 0.000 description 1
- BLQMCTXZEMGOJM-UHFFFAOYSA-N 5-carboxycytosine Chemical compound NC=1NC(=O)N=CC=1C(O)=O BLQMCTXZEMGOJM-UHFFFAOYSA-N 0.000 description 1
- LUCHPKXVUGJYGU-XLPZGREQSA-N 5-methyl-2'-deoxycytidine Chemical compound O=C1N=C(N)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 LUCHPKXVUGJYGU-XLPZGREQSA-N 0.000 description 1
- ZAYHVCMSTBRABG-JXOAFFINSA-N 5-methylcytidine Chemical compound O=C1N=C(N)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZAYHVCMSTBRABG-JXOAFFINSA-N 0.000 description 1
- 108091093088 Amplicon Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- JPVYNHNXODAKFH-UHFFFAOYSA-N Cu2+ Chemical compound [Cu+2] JPVYNHNXODAKFH-UHFFFAOYSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 230000030933 DNA methylation on cytosine Effects 0.000 description 1
- CKTSBUTUHBMZGZ-UHFFFAOYSA-N Deoxycytidine Natural products O=C1N=C(N)C=CN1C1OC(CO)C(O)C1 CKTSBUTUHBMZGZ-UHFFFAOYSA-N 0.000 description 1
- 102000016680 Dioxygenases Human genes 0.000 description 1
- 108010028143 Dioxygenases Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 101000653374 Homo sapiens Methylcytosine dioxygenase TET2 Proteins 0.000 description 1
- 238000006546 Horner-Wadsworth-Emmons reaction Methods 0.000 description 1
- 108010029541 Laccase Proteins 0.000 description 1
- 102100030803 Methylcytosine dioxygenase TET2 Human genes 0.000 description 1
- 108030004080 Methylcytosine dioxygenases Proteins 0.000 description 1
- 208000005228 Pericardial Effusion Diseases 0.000 description 1
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- RWQNBRDOKXIBIV-UHFFFAOYSA-N Thymine Natural products CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- HSCJRCZFDFQWRP-RDKQLNKOSA-N UDP-D-glucose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)OC1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-RDKQLNKOSA-N 0.000 description 1
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 230000032683 aging Effects 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 210000004381 amniotic fluid Anatomy 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 210000003567 ascitic fluid Anatomy 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 210000000941 bile Anatomy 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- 201000002758 colorectal adenoma Diseases 0.000 description 1
- 239000007859 condensation product Substances 0.000 description 1
- 238000006482 condensation reaction Methods 0.000 description 1
- 125000004093 cyano group Chemical group *C#N 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- MXHRCPNRJAMMIM-UHFFFAOYSA-N desoxyuridine Natural products C1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-UHFFFAOYSA-N 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 239000012039 electrophile Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002550 fecal effect Effects 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 230000002496 gastric effect Effects 0.000 description 1
- 150000002303 glucose derivatives Chemical class 0.000 description 1
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 239000012678 infectious agent Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- GDOPTJXRTPNYNR-UHFFFAOYSA-N methyl-cyclopentane Natural products CC1CCCC1 GDOPTJXRTPNYNR-UHFFFAOYSA-N 0.000 description 1
- CBKJJOFJHWJJTF-UHFFFAOYSA-N methylsulfonylmethanesulfinic acid Chemical compound CS(=O)(=O)CS(O)=O CBKJJOFJHWJJTF-UHFFFAOYSA-N 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 125000000449 nitro group Chemical group [O-][N+](*)=O 0.000 description 1
- 150000003833 nucleoside derivatives Chemical class 0.000 description 1
- 210000004912 pericardial fluid Anatomy 0.000 description 1
- 210000005259 peripheral blood Anatomy 0.000 description 1
- 239000011886 peripheral blood Substances 0.000 description 1
- UEZVMMHDMIWARA-UHFFFAOYSA-M phosphonate Chemical compound [O-]P(=O)=O UEZVMMHDMIWARA-UHFFFAOYSA-M 0.000 description 1
- XYFCBTPGUUZFHI-UHFFFAOYSA-O phosphonium Chemical compound [PH4+] XYFCBTPGUUZFHI-UHFFFAOYSA-O 0.000 description 1
- 210000002381 plasma Anatomy 0.000 description 1
- 210000004910 pleural fluid Anatomy 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000010223 real-time analysis Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 125000001174 sulfone group Chemical group 0.000 description 1
- 150000003462 sulfoxides Chemical class 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 210000004243 sweat Anatomy 0.000 description 1
- 210000001179 synovial fluid Anatomy 0.000 description 1
- 210000001138 tear Anatomy 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 150000007944 thiolates Chemical class 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 238000005820 transferase reaction Methods 0.000 description 1
- RIOQSEWOXXDEQQ-UHFFFAOYSA-O triphenylphosphanium Chemical compound C1=CC=CC=C1[PH+](C=1C=CC=CC=1)C1=CC=CC=C1 RIOQSEWOXXDEQQ-UHFFFAOYSA-O 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
Definitions
- the invention relates to the field mapping the epigenetic modifications of DNA and RNA, which has become increasingly important, because these modifications play a role in several biological processes and diseases including development, aging and last but not least, cancer.
- C cytosine
- 5mC 5-methylcytosine
- oxidation can also be performed by chemical means using e.g. potassium perruthenate (KRuCh).
- KRuCh potassium perruthenate
- WO 2022/096751 also disclosed a method for generating a dihydrothymine (DHT) or a dihydrouracil (DHU) residue from a nucleoside or a polynucleotide containing 5-methylcytosine (5mC) or 5-carboxyl-cytosine (5caC) by using a radical initiator which may be used together with a nucleophilic compound.
- DHT dihydrothymine
- DHU dihydrouracil
- the present invention is based on the basic idea that the introduction of a nucleophile at 5fC, 5hmC or caC and the subsequent intramolecular addition at C- 6 of a 5-substitued cytosine is enabling a deamination at C-4 of a cytosine derivative.
- the present invention is directed to a method comprising the steps of a) providing a nucleic acid comprising 5fC, 5hmC, or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl, or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6 di-hydro Cytosine entity, and e) deaminating said 5,6 di-hydro Cytosine entity to a 5,6 di-hydro-Uracil entity.
- step d) The newly formed cyclic structures in step d) are 5-7 membered rings
- the method further comprises step f), i.e. reversing the ring formation of step d) thereby obtaining a 5-substituted Uracil.
- the method may also comprise a step of converting 5mC to 5fC, 5hmC or 5caC prior to step a).
- the present invention is facilitating the methylation status analysis of naturally occurring DNA.
- said nucleic acid comprises at least one 5fC residue.
- the first reactive group may be a C-H acidic group, an amine or a phosphorus ylide.
- said nucleic acid comprises at least one 5hmC residue. Then, the first reactive group may constitute a Glycosyl Transferase substrate.
- said nucleic acid comprises a 5caC residue. Then, the first reactive group may be an amine.
- the said second reactive group may be a nucleophilic group selected from a group consisting of Thiol or Sulfinate.
- All methods as disclosed above may further comprise a subsequent step of amplifying said nucleic acid, which may be preferably executed by means of PCR amplification.
- all methods disclosed above may further comprise the step of a subsequent step of sequencing said nucleic acid either with or without a prior amplification step, which is preferably a PCR amplification.
- Fig. 1 provides a scheme for detection of methylated sequences according to the present invention.
- Fig. 2 shows a reaction scheme for modification of 5fC by means of a Knoevenagel condensation using a reactant with a C-H acidic group
- Fig. 3 shows a reaction scheme for modification of 5fC by means of an aldol condensation reaction with a CH-acidic reagent comprising a sulfinate moiety
- Fig. 4 shows a reaction scheme for modification of 5fC by means of a Wittig reaction.
- Fig. 5 shows a specific example for a Wittig reaction according to Fig. 4 with a sulfinate modified Wittig reagent.
- Fig. 6 shows a reaction scheme for modification of 5fC by means of reductive amination
- Fig: 7 shows a specific example for a reductive amination as in Fig. 6, where 5fC is reacted with 2-aminomethyl sulfinate
- Fig. 8 shows a reaction scheme for the modification of hmC using UDP-glucose substituted at the 2-position of the glucose moiety with a nucleophile as a reagent and executing a subsequent beta-glucosyl-transferase reaction.
- Fig. 9 shows a similar reaction scheme as example 8, wherein 2-thio-glucose-UDP is used.
- Fig. 10 shows an example for modification of the carboxyl group of caC.
- a nucleophilic group is a functional group that is attractive to electron-poor or positively charged centers and donates electrons, in particular donating an electron pair to an electrophile or an electrophilic center to form a covalent bond.
- Nucleophilic groups are for example alcohols, alcoholates, thiols, thiolates, amines, sulfinates or carbanions.
- deamination refers herein to a substitution reaction of an exocyclic amino group by a hydroxyl group, in particular substitution of the amino group at the C-4 position of a cytosine nucleobase to convert into a uracil nucleobase. Deamination at C-4 position is enhanced in 5,6-dihydrocytosines to give 5,6-dihydrouracil derivatives, as e.g. being applied after bisulfite addition to the C5- C6 double bond of cytosine.
- C-H acidic group is used herein for a carbon atom within a compound which carries at least one electron-withdrawing group and thus being more easily deprotonated by a base.
- Electron-withdrawing groups are for example carbonyl containing groups, a nitro group, a cyano group, a sulfoxide or sulfone group.
- Wittig reagent or phosphorus ylide refers to neutral dipolar molecules containing a formally negatively charged carbon atom (a carbanion) directly attached to a phosphorus atom with a formal positive charge.
- phosphorus ylides phosphonium ylides
- Phosphorus ylides are thus 1,2- dipolar compounds and a subclass of zwitterions.
- triphenylphosphonium ylides are reacted with aldehydes or ketones to give an alkene bond.
- a Homer-Wadsworth-Emmons reaction stabilized phosphonate carbanions are used as reagent.
- a Glycosyl Transferase Substrate refers to a reagent comprising an activation group, e.g. a UDP moiety, and a sugar residue, for example a glucose or substituted glucose.
- Glycosyltransferases catalyze the transfer of the sugar moiety from an activated donor sugar (the glycosyl transferase substrate) onto a saccharide or non- saccharide acceptor.
- the non-saccharide acceptor can be for example 5- hydroxymethyl-dC.
- T4 Phage B-glucosyltransferase for example from New England Biolabs, specifically transfers a glucose moiety of uridine diphosphoglucose to the 5-hydroxymethylcytosine (5-hmC) residues in double-stranded DNA, making beta- glucosyl-5-hydroxymethylcytosine.
- FIG. 1 A general scheme, how methylated C-residues mC are detected is schematically shown in Fig. 1.
- the epigenetic dC modifications 5-methyl-dC (mC), 5-hydroxymethyl-dC (hmC), 5-formyl-dC (fC) and 5-carboxyl- dC (caC) are read as unmodified dC when amplified and/or sequenced.
- oxidation of mC by enzymatic and/or chemical means results in oxidized mC modifications hmC, fC and caC (general: xC).
- Oxidation or conversion of 5mC to 5fC can be achieved by using TET enzymes (ten- eleven translocation (TET) methylcytosine dioxygenases).
- TET ten- eleven translocation
- the same class of enzymes are suitable for converting 5mC to 5hmC.
- 5hmC can be further oxidized to 5fC enzymatically by a laccase enzyme or chemically using either KRuO4 or Cu(II)/TEMPO. Further details are disclosed in Pfeifer et al. Epigenetics & Chromatin 2013, 6: 10, p.1-9.
- a reagent of the invention comprising two reactive groups, the first reactive group forming a covalent bond with hmC, fC or caC and thus introducing the second reactive group which comprises a nucleophile.
- the present also provides a method for analyzing the methylation status of a naturally occurring DNA comprising the steps of a) converting 5mC to 5fC, 5hmC or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC, d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6 di-hydro Cytosine entity, and e) deaminating said 5,6 di-hydro Cytosine entity to
- the sample is derived from a subject or a patient.
- the sample may comprise a fragment of a solid tissue or a solid tumor derived from the subject or the patient, e.g., by biopsy.
- the sample may also comprise body fluids that may contain nucleic acids (e.g., urine, sputum, serum, blood or blood fractions, i.e., plasma, lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid, bile, gastric fluid, intestinal fluid, or fecal samples).
- nucleic acids e.g., urine, sputum, serum, blood or blood fractions, i.e., plasma, lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid,
- the sample is a cultured sample, e.g., a tissue culture containing cells and fluids from which nucleic acids may be isolated.
- the nucleic acids of interest in the sample come from infectious agents such as viruses, bacteria, protozoa or fungi.
- the present invention involves manipulating isolated nucleic acids isolated or extracted from a sample. Methods of nucleic acid extraction are well known in the art (see, Sambrook et al., & quot; Molecular Cloning: A Laboratory Manual, & quot; 1989, 2nd Ed., Cold Spring Harbor Laboratory Press: New York, N. Y.).
- kits are commercially available for extracting nucleic acids (DNA or RNA) from biological samples (e.g., KAPA Express Extract (Roche Sequencing Solutions, Pleasanton, Cal.) and other similar products from BD Biosciences Clontech (Palo Alto, Cal.), Epicentre Technologies (Madison, Wise.); Gentra Systems, (Minneapolis, Minn.); and Qiagen (Valencia, Cal.), Ambion (Austin, Tex.); BioRad Laboratories (Hercules, Cal.); and more.
- the present invention involves detecting an epigenetic modification, specifically, an epigenetic cytosine modification in nucleic acids (including, but not limited to, cytosine methylation).
- the nucleic acid sequences that are subject to conditional epigenetic modification are the target sequences analyzed by the method disclosed herein.
- the same nucleic acid sequence may or may not have the epigenetic modification characterized by methylation of cytosines at the 5-position (5mC or 5hmC).
- a set or a panel of target nucleic acids are probed for the presence of methylation. For example, as shown in Patai, et al.
- the method of the instant invention includes detecting methylation in all sites throughout the genome of an organism to diagnose a disease or condition or predisposition to a disease or condition using the sequence analysis and artificial intelligence tools described, e.g., in Shull, et al. “Sequencing the cancer methylome,” Methods Mol Biol. 1238:627-635 (2015).
- the method of detecting epigenetic modifications includes sequencing.
- the nucleic acid processed as described herein is subjected to sequencing; preferably, massively parallel single molecule sequencing. Analyzing individual molecules by massively parallel sequencing typically requires a separate level of barcoding for sample identification and error correction.
- the use of molecular barcodes such as described in U.S. Patent Nos. 7,393,665, 8,168,385, 8,481,292, 8,685,678, and 8,722,368.
- a unique molecular barcode is added to each molecule to be sequenced to mark molecule and its progeny (e.g., the original molecule and its amplicons generated by PCR).
- the unique molecular barcode has multiple uses including counting the number of original target molecules in the sample and error correction (Newman, et al., “An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage,” Nature Medicine doi: 10.1038/nm.3519 (2014)).
- Nanopore sequencing is a unique, scalable technology that enables direct, real-time analysis of long DNA or RNA fragments. It works by monitoring changes to an electrical current as nucleic acids are passed through a protein nanopore. The resulting signal is decoded to provide the specific DNA or RNA sequence. Nanopore-based sequencing technology detects the unique electrical signals of different molecules as they pass through the nanopore with a semiconductor-based electronic detection system. This technology makes for a high throughput, cost effective sequencing solution.
- the biological nanopore a protein pore embedded in a membrane, while the brains of the technology lie in the electronics of a semiconductor integrated circuit and proprietary chemistries.
- the electronic sensor technology embedded in the chip enables automatic membrane assembly and nanopore insertion, while allowing for active control of individual sensors on the circuit.
- Different sequencing chemistries can be paired with the nanopore and electronic sensor technology to enable high throughput, high accuracy sequencing with faster time-to-data.
- unique molecular barcodes are used for sequencing error correction.
- the entire progeny of a single target molecule is marked with the same barcode and forms a barcoded family.
- a variation in the sequence not shared by all members of the barcoded family is discarded as an artefact.
- Barcodes can also be used for positional deduplication and target quantification, as the entire family represents a single molecule in the original sample (Newman, et al. “Integrated digital error suppression for improved detection of circulating tumor DNA,” Nature Biotechnology 34:547 (2016)).
- the method involves forming a library comprising nucleic acids from a sample.
- the library consists of a plurality of nucleic acids ready for sequencing or another type of detection method, e.g., PCR.
- a library can be stored and used multiple times for further processing such as amplification or sequencing of the nucleic acids in the library.
- the library is the input nucleic acid in which methylation is detected by the method described herein.
- the library is formed from nucleic acids that have undergone the methylation detection reactions described herein.
- the nucleic acids processed for detection of epigenetic modifications according to the method described herein are sequenced. Any of a number of sequencing technologies or sequencing assays can be utilized.
- the term “Next Generation Sequencing (NGS)” as used herein refers to sequencing methods that allow for massively parallel sequencing of clonally amplified molecules and of single nucleic acid molecules.
- Non-limiting examples of sequence assays that are suitable for use with the methods disclosed herein include nanopore sequencing (U.S. Patent Publication Nos. US2013/0244340, US2013/0264207, US2014/0134616, US2015/0119259 and US2015/0337366), Sanger sequencing, capillary array sequencing, thermal cycle sequencing (Sears, et al., Biotechniques, 13:626-633 (1992)), solid-phase sequencing (Zimmerman, et al., Methods Mol.
- sequencing with mass spectrometry such as matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF/MS; Fu, et al., Nature Biotech., 16:381-384 (1998)), sequencing by hybridization (Drmanac et al., Nature Biotech., 16:54-58 (1998), and NGS methods, including but not limited to sequencing by synthesis (e.g., HiSeqTM, MiSeqTM, or Genome Analyzer, each available from Illumina), sequencing by ligation (e.g., SOLiDTM, Life Technologies), ion semiconductor sequencing (e.g., Ion TorrentTM, Life Technologies), and SMRT® sequencing (e.g., Pacific Biosciences).
- MALDI-TOF/MS matrix-assisted laser desorption/ionization time-of-flight mass spectrometry
- sequencing-by- hybridization platforms from Affymetrix Inc. (Sunnyvale, Calif.), sequencing-by- synthesis platforms from Illumina/Solexa (San Diego, Calif.) and Helicos Biosciences (Cambridge, Mass.), sequencing-by-ligation platform from Applied Biosystems (Foster City, Calif.).
- Other sequencing technologies include, but are not limited to, the Ion Torrent technology (ThermoFisher Scientific), and nanopore sequencing (Genia Technology, Roche Sequencing Solutions, Santa Clara, Cal.), Oxford Nanopore Technologies (Oxford, UK), and sequencing by expansion (Stratos Genomics, Roche Sequencing Solutions).
- the sequencing step involves sequence aligning.
- aligning is used to determine a consensus sequence from a plurality of sequences, e.g., a plurality having the same unique molecular ID (UID).
- the molecular ID is a barcode that can be added to each molecule prior to sequencing or if amplification step is included, prior to the amplification step.
- a UID is present in the 5 ’-portion of the RT primer.
- a UID can be present in the 5 ’-end of the last barcode subunit to be added to the compound barcode.
- a UID is present in an adaptor and is added to one or both ends of the target nucleic acid by ligation.
- a consensus sequence is determined from a plurality of sequences all having an identical UID.
- the sequences having an identical UID are presumed to derive from the same original molecule through amplification.
- UID is used to eliminate artifacts, i.e., variations existing in the progeny of a single molecule (characterized by a particular UID). Such artifacts resulting from PCR errors or sequencing errors can be eliminated using UIDs.
- the number of each sequence in the sample can be quantified by quantifying relative numbers of sequences with each UID among the population having the same multiplex sample ID (MID).
- MID multiplex sample ID
- Each UID represents a single molecule in the original sample and counting different UIDs associated with each sequence variant can determine the fraction of each sequence variant in the original sample, where all molecules share the same MID.
- a person skilled in the art will be able to determine the number of sequence reads necessary to determine a consensus sequence.
- the relevant number is reads per UID (“sequence depth”) necessary for an accurate quantitative result.
- the desired depth is 5-50 reads per UID. Modification of 5fC
- Modification of 5fC according to the present invention can be achieved if the first reactive group of the reagent according to the present invention is a C-H acidic group, an amine or a phosphorus ylide.
- the reaction can be executed as a Knoevenagel condensation, which is schematically depicted in Fig. 2.
- the CH-acidic reagent comprising a nucleophile group is added via Knoevenagel conditions to fC, then subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling the deamination at C4 to obtain a “T-equivalent”.
- Yields of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by a photoisomerization of the double bond and/or by applying a radical initiation or a catalyst. If desirable, a reversal of the ring closure can be achieved under alkaline conditions.
- an aldol condensation reaction of 5fC with a CH-acidic reagent comprising a sulfinate moiety e.g. 1 -(methyl sulfonyl)methane sulfinate
- a subsequent addition of a sulfinate moiety at the C-6 position of the substituted cytosine followed by deamination and desulfmation (5fC to T conversion) can be executed (Fig. 3)
- a desulfmation step under alkaline conditions may even not be required.
- a thiol moiety can also be applied as nucleophile
- addition reaction to the C5-C6 double bond can be achieved by means of applying thiol-ene chemistry using a radical initiator or a catalyst.
- the yields may be increased by photoisomerization of the newly formed double bond.
- Modification of 5fC according to the present invention can also be done by means of a Wittig reaction (Fig. 4).
- the reagent according to the present invention is a phosphorus ylide comprising a nucleophilic group.
- the reagent is added via Wittig conditions to 5fC, then a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling a deamination at the C4 position in order to obtain a “T-equivalent”.
- Yields of intramolecular addition of the nucleophile to the C5-C6 double bond may be increased by photoisomerization of the newly formed double bond, and/or by applying radical initiation or a catalyst. If desirable, a reversal of the ring closure can be achieved under alkaline conditions. Alternatively, the reaction can be performed using Horner-Wadsworth-Emmons reaction conditions, in particular using a sulfinate group as nucleophile.
- reagent of the invention comprising an amine and a further nucleophilic group is added via reductive amination to 5fC. Subsequently there is an intramolecular addition of the nucleophile to the C5-C6 double bond, which enables the deamination at the C4 position to obtain a “T- equivalent”. Reversal of ring closure can be achieved under alkaline conditions, if necessary. Yields can be increased by applying radical initiation or a catalyst.
- a more detailed example for implementing a reductive amination reaction is illustrated in Fig. 7.
- 5fC is reacted with 2-aminomethyl sulfinate, and subsequently addition of the sulfinate moiety at C-6 position of substituted cytosine takes place.
- This is followed by a deamination and desulfination step, which converts 5fC to T.
- a desulfination step under alkaline conditions may not be needed.
- the sulfinate moiety may be replaced by a thiol moiety using thiol-ene chemistry for the addition reaction to double bond C5-C6, which optionally includes the use of a radical initiator or a catalyst.
- Fig. 8 schematically shows a reaction scheme for the modification of hmC.
- the reagent used according to the present invention is a UDP -glucose, which is modified at the 2-position with a nucleophilic group. It is added via a B-glucosyl transferase enzyme to hmC. Then, a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling the deamination at the C4 position in order to obtain a “T-equivalent”.
- yields of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by applying radical initiation or a catalyst and a reversal of the ring closure can be achieved under alkaline conditions, if necessary.
- Fig. 9 An alternative example is shown in Fig. 9.
- 5hmC is reacted with 2-thio- glucose-UDP and B-glucosyltransferase.
- an addition of the thiol moiety at the C6 position of substituted cytosine applying thiol-ene chemistry and optionally using a radical initiator or a catalyst is taking place, which is followed by a deamination reaction, resulting in a 5hmC to T conversion.
- 2- Sulfinato-glucose-UDP can also be applied as substrate for B-glucosyltransferase.
- the method of the present invention uses a reagent comprising an amine and a further nucleophilic group. It is added after an EDC/NHS activation of the carboxyl group of caC. This results in caC forming an amide bond, and a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond, thereby enabling the deamination at the C4 position in order to obtain a “T-equivalent”.
- the yield of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by means of applying radical initiation or a catalyst and reversal of the ring closure can be achieved under alkaline conditions.
Abstract
The present invention provides a method comprising the steps of a) providing a nucleic acid comprising 5fC, 5hmC, or 5caC, b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl hydroxymethyl or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC, d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6-di-hydro Cytosine entity, and e) deaminating said 5,6-di-hydro Cytosine entity to a 5,6 di-hydro-Uracil entity.
Description
Detection of epigenetic modifications
Field of the invention
The invention relates to the field mapping the epigenetic modifications of DNA and RNA, which has become increasingly important, because these modifications play a role in several biological processes and diseases including development, aging and last but not least, cancer.
Technical background
Most relevant to identify epigenetic modifications is detection by sequencing based methods. Differentiation between cytosine (C) and 5-methylcytosine (5mC) is described by direct sequencing (without preamplification) of DNA through a nanopore (e.g. by Pacific Bioscience Single Molecule, Real-Time (SMRT) Sequencing technology) and thereby reading out the different kinetics of nucleotide incorporation opposite C vs. mC by polymerase.
Other techniques are using methods to convert C or mC into a T(U) equivalent and subsequent amplification to allow identification when comparing untreated and converted sample DNA (see review: L. Zhao et al., Protein Cell 2020, 11, 792- 808).
Most important methods are bisulfite sequencing (conversion of C to U by bisulfite treatment), NEB’s EM-seq method (oxidation of mC by TET2 enzyme and glucosylation by B-glucosyltransferase (to block enzymatic deaminase reaction) and convert C to U by APOBEC deaminase), TAPS method (TET-assisted pyridine borane sequencing) applying TET enzyme oxidation of mC and subsequent reduction of oxidized mC species by pyridine borane to obtain dihydrocytosine nucleoside which is easily deaminated to give dihydrouracil (T equivalent) and the CLEVER method which is based on oxidation of mC by TET enzyme and subsequent reaction of 5-formyl-C with malononitrile to give an adduct which acts predominantly as a T equivalent in subsequent PCR amplification (C. Zhu et al., Cell Stem Cell 2017, 20, 720-731). Instead of
enzymatic oxidation of mC by TET enzymes oxidation can also be performed by chemical means using e.g. potassium perruthenate (KRuCh).
There are also methods to differentiate the modifications 5-hydroxymethyl-dC (5hmC), 5-formyl (5fC) or 5-carboxyl-C (5caC) by partly modifying the methods described above.
Recently, WO 2022/096751 also disclosed a method for generating a dihydrothymine (DHT) or a dihydrouracil (DHU) residue from a nucleoside or a polynucleotide containing 5-methylcytosine (5mC) or 5-carboxyl-cytosine (5caC) by using a radical initiator which may be used together with a nucleophilic compound.
However, all the methods disclosed have some shortcomings. Bisulfite sequencing encompasses harsh conditions which degrades the majority of input DNA, only indirect detection of methylated cytosines is possible. Likewise, Enzymatic Methyl Sequencing (EM-Seq) via enzymatic deamination which is inefficient because the majority of cytosines which are non-methylated have to be converted. TAPS requires a toxic reagent needed for reduction such as pyridine borane, and the dihydrouridine being labile under slightly acidic conditions cannot easily be converted back to the more stable Uridine derivative. For the CLEVER method, the conversion yield of the condensation reaction of 5fC and malononitrile is far below 100%, because the enzymatic incorporation of nucleotides by polymerase when the condensation product of 5fC with malononitrile acts as substrate is not highly accurate, and incorporation of dA is only favoured, but not exclusive (see also: F. Galardi et al., Biomolecules 2020, 10, 1677). The method of WO 2022/096751 is not entirely specific for caC but also converts unmodified C to some extent. Another disadvantage is that di-hydro Uridine derivative cannot easily be converted back to the more stable Uridine derivative.
Summary of the invention:
The present invention is based on the basic idea that the introduction of a nucleophile at 5fC, 5hmC or caC and the subsequent intramolecular addition at C- 6 of a 5-substitued cytosine is enabling a deamination at C-4 of a cytosine derivative.
Thus, in a first aspect, the present invention is directed to a method comprising the steps of a) providing a nucleic acid comprising 5fC, 5hmC, or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl, or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6 di-hydro Cytosine entity, and e) deaminating said 5,6 di-hydro Cytosine entity to a 5,6 di-hydro-Uracil entity.
The newly formed cyclic structures in step d) are 5-7 membered rings
In one embodiment, the method further comprises step f), i.e. reversing the ring formation of step d) thereby obtaining a 5-substituted Uracil.
Since the naturally occurring methylation of DNA is 5mC, the method may also comprise a step of converting 5mC to 5fC, 5hmC or 5caC prior to step a). Thus, the present invention is facilitating the methylation status analysis of naturally occurring DNA.
In a specific embodiment, said nucleic acid comprises at least one 5fC residue. In this case, the first reactive group may be a C-H acidic group, an amine or a
phosphorus ylide. In another specific embodiment, said nucleic acid comprises at least one 5hmC residue. Then, the first reactive group may constitute a Glycosyl Transferase substrate. In still another embodiment, said nucleic acid comprises a 5caC residue. Then, the first reactive group may be an amine.
In all cases, the said second reactive group may be a nucleophilic group selected from a group consisting of Thiol or Sulfinate.
All methods as disclosed above may further comprise a subsequent step of amplifying said nucleic acid, which may be preferably executed by means of PCR amplification. As well, all methods disclosed above may further comprise the step of a subsequent step of sequencing said nucleic acid either with or without a prior amplification step, which is preferably a PCR amplification.
Brief description of Figures
Fig. 1 provides a scheme for detection of methylated sequences according to the present invention.
Fig. 2 shows a reaction scheme for modification of 5fC by means of a Knoevenagel condensation using a reactant with a C-H acidic group
Fig. 3 shows a reaction scheme for modification of 5fC by means of an aldol condensation reaction with a CH-acidic reagent comprising a sulfinate moiety
Fig. 4 shows a reaction scheme for modification of 5fC by means of a Wittig reaction.
Fig. 5 shows a specific example for a Wittig reaction according to Fig. 4 with a sulfinate modified Wittig reagent.
Fig. 6 shows a reaction scheme for modification of 5fC by means of reductive amination
Fig: 7 shows a specific example for a reductive amination as in Fig. 6, where 5fC is reacted with 2-aminomethyl sulfinate
Fig. 8 shows a reaction scheme for the modification of hmC using UDP-glucose substituted at the 2-position of the glucose moiety with a nucleophile as a reagent and executing a subsequent beta-glucosyl-transferase reaction.
Fig. 9 shows a similar reaction scheme as example 8, wherein 2-thio-glucose-UDP is used.
Fig. 10 shows an example for modification of the carboxyl group of caC.
Detailed description of the invention
Abbreviations
C - cytosine or cytidine
T - thymine or thymidine
U - uracil or uridine
DHU - dihydrouracil or dihydrouridine
5mC - 5-methylcytosine or 5-methylcytidine
5hmC - 5 -hydroxymethyl cytosine or 5 -hydroxymethyl cytidine
5fC - 5-formylcytosine or 5-formylcytidine
5caC - 5-carboxylcytosine or 5-carboxylcytidine dC - 2’ -deoxy cytidine dU - 2’ -deoxyuridine
TET - ten-eleven translocation dioxygenase
TAPS - TET-assisted pyridine-borane sequencing
CAPS - chemically-assisted pyridine-borane sequencing
Definitions
A nucleophilic group is a functional group that is attractive to electron-poor or positively charged centers and donates electrons, in particular donating an electron pair to an electrophile or an electrophilic center to form a covalent bond. Nucleophilic groups are for example alcohols, alcoholates, thiols, thiolates, amines, sulfinates or carbanions. The term deamination refers herein to a substitution reaction of an exocyclic amino group by a hydroxyl group, in particular substitution of the amino group at the C-4 position of a cytosine nucleobase to convert into a uracil nucleobase. Deamination at C-4 position is enhanced in 5,6-dihydrocytosines to give 5,6-dihydrouracil derivatives, as e.g. being applied after bisulfite addition to the C5- C6 double bond of cytosine.
The term C-H acidic group is used herein for a carbon atom within a compound which carries at least one electron-withdrawing group and thus being more easily deprotonated by a base. Electron-withdrawing groups are for example carbonyl containing groups, a nitro group, a cyano group, a sulfoxide or sulfone group.
The term Wittig reagent or phosphorus ylide, as used herein, refers to neutral dipolar molecules containing a formally negatively charged carbon atom (a carbanion) directly attached to a phosphorus atom with a formal positive charge. In phosphorus ylides (phosphonium ylides) the two adjacent carbon and phosphorus atoms are connected by both a covalent and an ionic bond. Phosphorus ylides are thus 1,2- dipolar compounds and a subclass of zwitterions. In a Wittig reaction triphenylphosphonium ylides are reacted with aldehydes or ketones to give an alkene bond. In a Homer-Wadsworth-Emmons reaction stabilized phosphonate carbanions are used as reagent.
A Glycosyl Transferase Substrate, as used herein, refers to a reagent comprising an activation group, e.g. a UDP moiety, and a sugar residue, for example a glucose or substituted glucose. Glycosyltransferases catalyze the transfer of the sugar moiety from an activated donor sugar (the glycosyl transferase substrate) onto a saccharide or non- saccharide acceptor. The non-saccharide acceptor can be for example 5- hydroxymethyl-dC. T4 Phage B-glucosyltransferase, for example from New England Biolabs, specifically transfers a glucose moiety of uridine diphosphoglucose to the
5-hydroxymethylcytosine (5-hmC) residues in double-stranded DNA, making beta- glucosyl-5-hydroxymethylcytosine.
Methods for DNA methylation status analysis by means of modification, followed by sequencing
A general scheme, how methylated C-residues mC are detected is schematically shown in Fig. 1. Without any particular treatment, the epigenetic dC modifications 5-methyl-dC (mC), 5-hydroxymethyl-dC (hmC), 5-formyl-dC (fC) and 5-carboxyl- dC (caC) are read as unmodified dC when amplified and/or sequenced. However, oxidation of mC by enzymatic and/or chemical means results in oxidized mC modifications hmC, fC and caC (general: xC).
Oxidation or conversion of 5mC to 5fC can be achieved by using TET enzymes (ten- eleven translocation (TET) methylcytosine dioxygenases). The same class of enzymes are suitable for converting 5mC to 5hmC. 5hmC can be further oxidized to 5fC enzymatically by a laccase enzyme or chemically using either KRuO4 or Cu(II)/TEMPO. Further details are disclosed in Pfeifer et al. Epigenetics & Chromatin 2013, 6: 10, p.1-9.
Subsequently, 5fC, 5hmC or caC are reacted with a reagent of the invention comprising two reactive groups, the first reactive group forming a covalent bond with hmC, fC or caC and thus introducing the second reactive group which comprises a nucleophile. By intramolecular addition of the nucleophile to the C5-C6 double bond of hmC, fC or caC deamination at C4 position is initiated resulting in a modified dU derivative which is read in a subsequent amplification or sequencing step as a T, thus enabling detection of epigenetic modification.
Such a sequencing according to the present invention allows the generation of data in the methylation status of the DNA of interest. Thus, the present also provides a method for analyzing the methylation status of a naturally occurring DNA comprising the steps of a) converting 5mC to 5fC, 5hmC or 5caC
b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC, d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6 di-hydro Cytosine entity, and e) deaminating said 5,6 di-hydro Cytosine entity to a 5,6 di-hydro-Uracil entity.
In some embodiments, the sample is derived from a subject or a patient. In some embodiments the sample may comprise a fragment of a solid tissue or a solid tumor derived from the subject or the patient, e.g., by biopsy. The sample may also comprise body fluids that may contain nucleic acids (e.g., urine, sputum, serum, blood or blood fractions, i.e., plasma, lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid, bile, gastric fluid, intestinal fluid, or fecal samples). In other embodiments, the sample is a cultured sample, e.g., a tissue culture containing cells and fluids from which nucleic acids may be isolated. In some embodiments, the nucleic acids of interest in the sample come from infectious agents such as viruses, bacteria, protozoa or fungi. The present invention involves manipulating isolated nucleic acids isolated or extracted from a sample. Methods of nucleic acid extraction are well known in the art (see, Sambrook et al., & quot; Molecular Cloning: A Laboratory Manual, & quot; 1989, 2nd Ed., Cold Spring Harbor Laboratory Press: New York, N. Y.). A variety of kits are commercially available for extracting nucleic acids (DNA or RNA) from biological samples (e.g., KAPA Express Extract (Roche Sequencing Solutions, Pleasanton, Cal.) and other similar products from BD Biosciences Clontech (Palo Alto, Cal.), Epicentre Technologies (Madison, Wise.); Gentra Systems, (Minneapolis, Minn.); and Qiagen (Valencia, Cal.), Ambion (Austin, Tex.); BioRad Laboratories (Hercules, Cal.); and more.
The present invention involves detecting an epigenetic modification, specifically, an epigenetic cytosine modification in nucleic acids (including, but not limited to, cytosine methylation). The nucleic acid sequences that are subject to conditional epigenetic modification are the target sequences analyzed by the method disclosed herein. The same nucleic acid sequence may or may not have the epigenetic modification characterized by methylation of cytosines at the 5-position (5mC or 5hmC). In some embodiments, a set or a panel of target nucleic acids are probed for the presence of methylation. For example, as shown in Patai, et al. “Comprehensive DNA Methylation Analysis Reveals a Common Ten-Gene Methylation Signature in Colorectal Adenomas and Carcinomas” PLOS ONE 10(8): e0133836 (2015), and in Onwuka, et al., “A panel of DNA methylation signature from peripheral blood may predict colorectal cancer susceptibility,” BMC Cancer 20, 692 (2020), methylation of biomarkers in a panel of methylation biomarkers is indicative of the presence of colorectal cancer in the patient. Accordingly, testing any known or future panels of methylation biomarkers for prognostic or diagnostic purposes is envisioned with the method disclosed herein. In some embodiments, the entire genome of an organism is probed for the presence of methylation. The method of the instant invention includes detecting methylation in all sites throughout the genome of an organism to diagnose a disease or condition or predisposition to a disease or condition using the sequence analysis and artificial intelligence tools described, e.g., in Shull, et al. “Sequencing the cancer methylome,” Methods Mol Biol. 1238:627-635 (2015).
In some embodiments, the method of detecting epigenetic modifications includes sequencing. The nucleic acid processed as described herein is subjected to sequencing; preferably, massively parallel single molecule sequencing. Analyzing individual molecules by massively parallel sequencing typically requires a separate level of barcoding for sample identification and error correction. The use of molecular barcodes such as described in U.S. Patent Nos. 7,393,665, 8,168,385, 8,481,292, 8,685,678, and 8,722,368. A unique molecular barcode is added to each molecule to be sequenced to mark molecule and its progeny (e.g., the original molecule and its amplicons generated by PCR). The unique molecular barcode (UID) has multiple uses including counting the number of original target molecules in the sample and error correction (Newman, et al., “An ultrasensitive method for
quantitating circulating tumor DNA with broad patient coverage,” Nature Medicine doi: 10.1038/nm.3519 (2014)).
Nanopore sequencing is a unique, scalable technology that enables direct, real-time analysis of long DNA or RNA fragments. It works by monitoring changes to an electrical current as nucleic acids are passed through a protein nanopore. The resulting signal is decoded to provide the specific DNA or RNA sequence. Nanopore-based sequencing technology detects the unique electrical signals of different molecules as they pass through the nanopore with a semiconductor-based electronic detection system. This technology makes for a high throughput, cost effective sequencing solution. At the heart of the technology is the biological nanopore, a protein pore embedded in a membrane, while the brains of the technology lie in the electronics of a semiconductor integrated circuit and proprietary chemistries. The electronic sensor technology embedded in the chip enables automatic membrane assembly and nanopore insertion, while allowing for active control of individual sensors on the circuit. Different sequencing chemistries can be paired with the nanopore and electronic sensor technology to enable high throughput, high accuracy sequencing with faster time-to-data.
In some embodiments, unique molecular barcodes (UIDs) are used for sequencing error correction. The entire progeny of a single target molecule is marked with the same barcode and forms a barcoded family. A variation in the sequence not shared by all members of the barcoded family is discarded as an artefact. Barcodes can also be used for positional deduplication and target quantification, as the entire family represents a single molecule in the original sample (Newman, et al. “Integrated digital error suppression for improved detection of circulating tumor DNA,” Nature Biotechnology 34:547 (2016)).
In some embodiments, the method involves forming a library comprising nucleic acids from a sample. The library consists of a plurality of nucleic acids ready for sequencing or another type of detection method, e.g., PCR. A library can be stored and used multiple times for further processing such as amplification or sequencing of the nucleic acids in the library. In some embodiments, the library is the input nucleic acid in which methylation is detected by the method described herein. In
other embodiments, the library is formed from nucleic acids that have undergone the methylation detection reactions described herein.
In some embodiments, the nucleic acids processed for detection of epigenetic modifications according to the method described herein are sequenced. Any of a number of sequencing technologies or sequencing assays can be utilized. The term “Next Generation Sequencing (NGS)” as used herein refers to sequencing methods that allow for massively parallel sequencing of clonally amplified molecules and of single nucleic acid molecules.
Non-limiting examples of sequence assays that are suitable for use with the methods disclosed herein include nanopore sequencing (U.S. Patent Publication Nos. US2013/0244340, US2013/0264207, US2014/0134616, US2015/0119259 and US2015/0337366), Sanger sequencing, capillary array sequencing, thermal cycle sequencing (Sears, et al., Biotechniques, 13:626-633 (1992)), solid-phase sequencing (Zimmerman, et al., Methods Mol. Cell Biol., 3:39-42 (1992)), sequencing with mass spectrometry such as matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF/MS; Fu, et al., Nature Biotech., 16:381-384 (1998)), sequencing by hybridization (Drmanac et al., Nature Biotech., 16:54-58 (1998), and NGS methods, including but not limited to sequencing by synthesis (e.g., HiSeq™, MiSeq™, or Genome Analyzer, each available from Illumina), sequencing by ligation (e.g., SOLiD™, Life Technologies), ion semiconductor sequencing (e.g., Ion Torrent™, Life Technologies), and SMRT® sequencing (e.g., Pacific Biosciences).
Commercially available sequencing technologies include: sequencing-by- hybridization platforms from Affymetrix Inc. (Sunnyvale, Calif.), sequencing-by- synthesis platforms from Illumina/Solexa (San Diego, Calif.) and Helicos Biosciences (Cambridge, Mass.), sequencing-by-ligation platform from Applied Biosystems (Foster City, Calif.). Other sequencing technologies include, but are not limited to, the Ion Torrent technology (ThermoFisher Scientific), and nanopore sequencing (Genia Technology, Roche Sequencing Solutions, Santa Clara, Cal.), Oxford Nanopore Technologies (Oxford, UK), and sequencing by expansion (Stratos Genomics, Roche Sequencing Solutions).
In some embodiments, the sequencing step involves sequence aligning. In some embodiments, aligning is used to determine a consensus sequence from a plurality of sequences, e.g., a plurality having the same unique molecular ID (UID). The molecular ID is a barcode that can be added to each molecule prior to sequencing or if amplification step is included, prior to the amplification step. In some embodiments, a UID is present in the 5 ’-portion of the RT primer. Similarly, a UID can be present in the 5 ’-end of the last barcode subunit to be added to the compound barcode. In other embodiments, a UID is present in an adaptor and is added to one or both ends of the target nucleic acid by ligation.
In some embodiments, a consensus sequence is determined from a plurality of sequences all having an identical UID. The sequences having an identical UID are presumed to derive from the same original molecule through amplification. In other embodiments, UID is used to eliminate artifacts, i.e., variations existing in the progeny of a single molecule (characterized by a particular UID). Such artifacts resulting from PCR errors or sequencing errors can be eliminated using UIDs.
In some embodiments, the number of each sequence in the sample can be quantified by quantifying relative numbers of sequences with each UID among the population having the same multiplex sample ID (MID). Each UID represents a single molecule in the original sample and counting different UIDs associated with each sequence variant can determine the fraction of each sequence variant in the original sample, where all molecules share the same MID. A person skilled in the art will be able to determine the number of sequence reads necessary to determine a consensus sequence. In some embodiments, the relevant number is reads per UID (“sequence depth”) necessary for an accurate quantitative result. In some embodiments, the desired depth is 5-50 reads per UID.
Modification of 5fC
Modification of 5fC according to the present invention can be achieved if the first reactive group of the reagent according to the present invention is a C-H acidic group, an amine or a phosphorus ylide.
If the first reactive group is a C-H acidic group, the reaction can be executed as a Knoevenagel condensation, which is schematically depicted in Fig. 2. The CH-acidic reagent comprising a nucleophile group is added via Knoevenagel conditions to fC, then subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling the deamination at C4 to obtain a “T-equivalent”. Yields of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by a photoisomerization of the double bond and/or by applying a radical initiation or a catalyst. If desirable, a reversal of the ring closure can be achieved under alkaline conditions.
Alternatively, an aldol condensation reaction of 5fC with a CH-acidic reagent comprising a sulfinate moiety (e.g. 1 -(methyl sulfonyl)methane sulfinate), a subsequent addition of a sulfinate moiety at the C-6 position of the substituted cytosine, followed by deamination and desulfmation (5fC to T conversion) can be executed (Fig. 3) A desulfmation step under alkaline conditions may even not be required. Instead of a sulfinate moiety, a thiol moiety can also be applied as nucleophile, addition reaction to the C5-C6 double bond can be achieved by means of applying thiol-ene chemistry using a radical initiator or a catalyst. The yields may be increased by photoisomerization of the newly formed double bond.
Modification of 5fC according to the present invention can also be done by means of a Wittig reaction (Fig. 4). I this case, the reagent according to the present invention is a phosphorus ylide comprising a nucleophilic group. The reagent is added via Wittig conditions to 5fC, then a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling a deamination at the C4 position in order to obtain a “T-equivalent”.
Details of the reaction are shown for a particular example in Fig. 5. In this case, 5fC is reacted with a sulfinate modified Wittig reagent. Subsequently there is an addition
of the sulfinate moiety at the C-6 position of substituted cytosine, followed by a deamination and desulfination (5fC to T conversion). The desulfination step under alkaline conditions may not be needed necessarily. Alternatively, instead of a sulfinate moiety, a thiol moiety can also be used by means of applying thiol-ene chemistry, using a radical initiator or a catalyst. Yields of intramolecular addition of the nucleophile to the C5-C6 double bond may be increased by photoisomerization of the newly formed double bond, and/or by applying radical initiation or a catalyst. If desirable, a reversal of the ring closure can be achieved under alkaline conditions. Alternatively, the reaction can be performed using Horner-Wadsworth-Emmons reaction conditions, in particular using a sulfinate group as nucleophile.
Yet another alternative is the modification of 5fC by means of reductive amination as shown in Fig. 6. In this case, the reagent of the invention comprising an amine and a further nucleophilic group is added via reductive amination to 5fC. Subsequently there is an intramolecular addition of the nucleophile to the C5-C6 double bond, which enables the deamination at the C4 position to obtain a “T- equivalent”. Reversal of ring closure can be achieved under alkaline conditions, if necessary. Yields can be increased by applying radical initiation or a catalyst.
A more detailed example for implementing a reductive amination reaction is illustrated in Fig. 7. 5fC is reacted with 2-aminomethyl sulfinate, and subsequently addition of the sulfinate moiety at C-6 position of substituted cytosine takes place. This is followed by a deamination and desulfination step, which converts 5fC to T. Again, a desulfination step under alkaline conditions may not be needed. Also again, the sulfinate moiety may be replaced by a thiol moiety using thiol-ene chemistry for the addition reaction to double bond C5-C6, which optionally includes the use of a radical initiator or a catalyst.
Modification of hmC:
Fig. 8 schematically shows a reaction scheme for the modification of hmC. In this case, the reagent used according to the present invention is a UDP -glucose, which is
modified at the 2-position with a nucleophilic group. It is added via a B-glucosyl transferase enzyme to hmC. Then, a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond takes place, thereby enabling the deamination at the C4 position in order to obtain a “T-equivalent”. As discussed above, yields of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by applying radical initiation or a catalyst and a reversal of the ring closure can be achieved under alkaline conditions, if necessary.
An alternative example is shown in Fig. 9. In this case, 5hmC is reacted with 2-thio- glucose-UDP and B-glucosyltransferase. Subsequently, an addition of the thiol moiety at the C6 position of substituted cytosine applying thiol-ene chemistry and optionally using a radical initiator or a catalyst is taking place, which is followed by a deamination reaction, resulting in a 5hmC to T conversion. Alternatively, 2- Sulfinato-glucose-UDP can also be applied as substrate for B-glucosyltransferase.
Modification of caC:
For this embodiment, which is shown in Fig. 10, the method of the present invention uses a reagent comprising an amine and a further nucleophilic group. It is added after an EDC/NHS activation of the carboxyl group of caC. This results in caC forming an amide bond, and a subsequent intramolecular addition of the nucleophile to the C5-C6 double bond, thereby enabling the deamination at the C4 position in order to obtain a “T-equivalent”. Again, the yield of intramolecular addition of the nucleophile to the C5-C6 double bond can be increased by means of applying radical initiation or a catalyst and reversal of the ring closure can be achieved under alkaline conditions.
Claims
1. A method comprising a) providing a nucleic acid comprising 5fC, 5hmC, or 5caC b) providing one reactant comprising two reactive groups wherein the first reactive group is capable of reacting with the formyl, hydroxymethyl, or carboxyl group, and the second reactive group is a nucleophilic group, c) reacting said first reactive group with the formyl, hydroxymethyl, or carboxyl group thereby resulting in a modified 5fC, 5hmC, or 5caC, d) reacting said second reactive group with the C6 position of said modified 5fC, 5hmC, or 5caC, thereby obtaining a bicyclic or tricyclic molecule comprising a 5,6-di-hydro Cytosine entity, and e) deaminating said 5,6-di-hydro Cytosine entity to a 5,6-di-hydro-Uracil entity.
2. Method according to claim 1, further comprising step f) reversing the ring formation of step d) thereby obtaining a 5- substituted Uracil.
3. Method of claim 1-2, further comprising the step of converting 5mC to 5fC, 5hmC or 5caC prior to step a) .
4. A method according to claim 1-3, wherein said nucleic acid comprises 5fC.
5. Method according to claim 4, wherein the first reactive group is a C-H acidic group, an amine or a phosphorus ylide.
6. Method according to claim 1-3, wherein said nucleic acid comprises 5hmC.
7. Method according to claim 6, wherein the first reactive group is a Glycosyl Transferase substrate.
8. Method according to claim 1-3, wherein said nucleic acid comprises 5caC.
9. Method according to claim 8, wherein the first reactive group is an amine.
10. Method according to claims 1-9, wherein said second reactive group is a nucleophilic group selected from a group consisting of Thiol or Sulfinate.
11. Method according to claims 1-10, further comprising a subsequent step of amplifying said nucleic acid.
12. Method according to claims 1-11 further comprising the subsequent step of sequencing said nucleic acid.
13. A composition comprising a nucleic acid treated according to the method of claims 1-10.
14. A kit for performing a method according to claims 1-12.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP22191052 | 2022-08-18 | ||
EP22191052.4 | 2022-08-18 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024038069A1 true WO2024038069A1 (en) | 2024-02-22 |
Family
ID=83004497
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2023/072497 WO2024038069A1 (en) | 2022-08-18 | 2023-08-16 | Detection of epigenetic modifications |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024038069A1 (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7393665B2 (en) | 2005-02-10 | 2008-07-01 | Population Genetics Technologies Ltd | Methods and compositions for tagging and identifying polynucleotides |
US8481292B2 (en) | 2010-09-21 | 2013-07-09 | Population Genetics Technologies Litd. | Increasing confidence of allele calls with molecular counting |
US20130244340A1 (en) | 2012-01-20 | 2013-09-19 | Genia Technologies, Inc. | Nanopore Based Molecular Detection and Sequencing |
US20130264207A1 (en) | 2010-12-17 | 2013-10-10 | Jingyue Ju | Dna sequencing by synthesis using modified nucleotides and nanopore detection |
US20140134616A1 (en) | 2012-11-09 | 2014-05-15 | Genia Technologies, Inc. | Nucleic acid sequencing using tags |
US20150119259A1 (en) | 2012-06-20 | 2015-04-30 | Jingyue Ju | Nucleic acid sequencing by nanopore detection of tag molecules |
US20150337366A1 (en) | 2012-02-16 | 2015-11-26 | Genia Technologies, Inc. | Methods for creating bilayers for use with nanopore sensors |
WO2022096751A1 (en) | 2020-11-09 | 2022-05-12 | Bluecat Ip Limited | Methods for detection of nucleotide modification |
-
2023
- 2023-08-16 WO PCT/EP2023/072497 patent/WO2024038069A1/en unknown
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7393665B2 (en) | 2005-02-10 | 2008-07-01 | Population Genetics Technologies Ltd | Methods and compositions for tagging and identifying polynucleotides |
US8168385B2 (en) | 2005-02-10 | 2012-05-01 | Population Genetics Technologies Ltd | Methods and compositions for tagging and identifying polynucleotides |
US8481292B2 (en) | 2010-09-21 | 2013-07-09 | Population Genetics Technologies Litd. | Increasing confidence of allele calls with molecular counting |
US8685678B2 (en) | 2010-09-21 | 2014-04-01 | Population Genetics Technologies Ltd | Increasing confidence of allele calls with molecular counting |
US8722368B2 (en) | 2010-09-21 | 2014-05-13 | Population Genetics Technologies Ltd. | Method for preparing a counter-tagged population of nucleic acid molecules |
US20130264207A1 (en) | 2010-12-17 | 2013-10-10 | Jingyue Ju | Dna sequencing by synthesis using modified nucleotides and nanopore detection |
US20130244340A1 (en) | 2012-01-20 | 2013-09-19 | Genia Technologies, Inc. | Nanopore Based Molecular Detection and Sequencing |
US20150337366A1 (en) | 2012-02-16 | 2015-11-26 | Genia Technologies, Inc. | Methods for creating bilayers for use with nanopore sensors |
US20150119259A1 (en) | 2012-06-20 | 2015-04-30 | Jingyue Ju | Nucleic acid sequencing by nanopore detection of tag molecules |
US20140134616A1 (en) | 2012-11-09 | 2014-05-15 | Genia Technologies, Inc. | Nucleic acid sequencing using tags |
WO2022096751A1 (en) | 2020-11-09 | 2022-05-12 | Bluecat Ip Limited | Methods for detection of nucleotide modification |
Non-Patent Citations (14)
Title |
---|
ANDRÃ GRAND ET AL: "Comparison of the mechanism of deamination of 5,6-dihydro-5-methylcytosine with other cytosine derivatives", THEORETICAL CHEMISTRY ACCOUNTS ; THEORY, COMPUTATION, AND MODELING, SPRINGER, BERLIN, DE, vol. 131, no. 4, 4 April 2012 (2012-04-04), pages 1 - 11, XP035046539, ISSN: 1432-2234, DOI: 10.1007/S00214-012-1187-X * |
C. ZHU ET AL., CELL STEM CELL, vol. 20, 2017, pages 720 - 731 |
DRMANAC ET AL., NATURE BIOTECH., vol. 16, 1998, pages 381 - 384 |
F. GALARDI ET AL., BIOMOLECULES, vol. 10, 2020, pages 1677 |
L. ZHAO ET AL., PROTEIN CELL, vol. 11, 2020, pages 792 - 808 |
NEWMAN ET AL.: "An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage", NATURE MEDICINE, 2014 |
NEWMAN ET AL.: "Integrated digital error suppression for improved detection of circulating tumor DNA", NATURE BIOTECHNOLOGY, vol. 34, 2016, pages 547, XP055802486, DOI: 10.1038/nbt.3520 |
ONWUKA ET AL.: "A panel of DNA methylation signature from peripheral blood may predict colorectal cancer susceptibility", BMC CANCER, vol. 20, 2020, pages 692 |
PATAI ET AL.: "Comprehensive DNA Methylation Analysis Reveals a Common Ten-Gene Methylation Signature in Colorectal Adenomas and Carcinomas", PLOS ONE, vol. 10, no. 8, 2015, pages 0133836 |
PFEIFER ET AL., EPIGENETICS & CHROMATIN, vol. 6, no. 10, 2013, pages 1 - 9 |
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS |
SEARS ET AL., BIOTECHNIQUES, vol. 13, 1992, pages 626 - 633 |
SHULL ET AL.: "Sequencing the cancer methylome", METHODS MOL BIOL., vol. 1238, 2015, pages 627 - 635 |
ZIMMERMAN ET AL., METHODS MOL. CELL BIOL., vol. 3, 1992, pages 39 - 42 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10808244B2 (en) | Method of normalizing biological samples | |
DK2737085T3 (en) | METHODS FOR DETECTING NUCLEOTID MODIFICATION | |
US20230235380A1 (en) | Methods for the Epigenetic Analysis of DNA, Particularly Cell-Free DNA | |
EP3146075B1 (en) | Ion sensor dna and rna sequencing by synthesis using nucleotide reversible terminators | |
US20180327828A1 (en) | Ion sensor dna and rna sequencing by synthesis using nucleotide reversible terminators | |
WO2002021098A2 (en) | A method for identifying polymorphisms | |
US11608518B2 (en) | Methods for analyzing nucleic acids | |
US20210380971A1 (en) | Simultaneous, sequencing-based analysis of proteins, nucleosomes, and cell-free nucleic acids from a single biological sample | |
EP1117838B1 (en) | A method for analyzing polynucleotides | |
Feng et al. | Recent advancements in intestinal microbiota analyses: a review for non-microbiologists | |
US20200040390A1 (en) | Methods for Sequencing Repetitive Genomic Regions | |
WO2024038069A1 (en) | Detection of epigenetic modifications | |
TW202302861A (en) | Methods for accurate parallel quantification of nucleic acids in dilute or non-purified samples | |
WO2023242075A1 (en) | Detection of epigenetic cytosine modification | |
Shetty et al. | Introduction to nucleic acid sequencing | |
WO2023125898A1 (en) | Nucleic acid testing method and system | |
CN113373207A (en) | Methods for determining cytosine modifications | |
Tost | Current and emerging technologies for the analysis of the genome-wide and locus-specific DNA methylation patterns | |
US20240093300A1 (en) | Methylated dna fragment enrichment, methods, compositions and kits | |
WO2024076981A2 (en) | Tet-assisted pyridine borane sequencing | |
Różalski et al. | Recent approaches to the liquid chromatography-mass spectrometry analysis of modified deoxynucleosides as biomarkers in clinical research | |
Monisha et al. | RNA-SEQ: A High-Revolution View of the Transcriptome | |
Honisch | The World of Nucleic Acid-Based Mass Spectrometry for Microbial and Viral Detection | |
EP4291677A2 (en) | Methods for base-level detection of methylation in nucleic acids | |
WO2023287876A1 (en) | Efficient duplex sequencing using high fidelity next generation sequencing reads |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23757600 Country of ref document: EP Kind code of ref document: A1 |