WO2024102659A1 - Directed acetylation of mrna through engineered snorna adapters - Google Patents
Directed acetylation of mrna through engineered snorna adapters Download PDFInfo
- Publication number
- WO2024102659A1 WO2024102659A1 PCT/US2023/078830 US2023078830W WO2024102659A1 WO 2024102659 A1 WO2024102659 A1 WO 2024102659A1 US 2023078830 W US2023078830 W US 2023078830W WO 2024102659 A1 WO2024102659 A1 WO 2024102659A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sequence
- nucleotide sequence
- disease
- engineered nucleotide
- disorder
- Prior art date
Links
- 230000021736 acetylation Effects 0.000 title claims description 45
- 238000006640 acetylation reaction Methods 0.000 title claims description 45
- 108020004999 messenger RNA Proteins 0.000 title description 83
- 102000042773 Small Nucleolar RNA Human genes 0.000 title description 63
- 108020003224 Small Nucleolar RNA Proteins 0.000 title description 63
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 103
- 201000010099 disease Diseases 0.000 claims abstract description 54
- 208000035475 disorder Diseases 0.000 claims abstract description 49
- 238000000034 method Methods 0.000 claims abstract description 46
- 239000013598 vector Substances 0.000 claims abstract description 36
- 239000008194 pharmaceutical composition Substances 0.000 claims abstract description 13
- 206010028980 Neoplasm Diseases 0.000 claims abstract description 12
- 201000011510 cancer Diseases 0.000 claims abstract description 10
- 108090000623 proteins and genes Proteins 0.000 claims description 102
- 239000002773 nucleotide Substances 0.000 claims description 86
- 125000003729 nucleotide group Chemical group 0.000 claims description 86
- 230000008685 targeting Effects 0.000 claims description 44
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 42
- 102000004169 proteins and genes Human genes 0.000 claims description 38
- 230000014509 gene expression Effects 0.000 claims description 29
- 102100037011 RNA cytidine acetyltransferase Human genes 0.000 claims description 28
- 108010011536 PTEN Phosphohydrolase Proteins 0.000 claims description 18
- 102000014160 PTEN Phosphohydrolase Human genes 0.000 claims description 18
- 101001055144 Homo sapiens Interleukin-2 receptor subunit alpha Proteins 0.000 claims description 17
- 102100026878 Interleukin-2 receptor subunit alpha Human genes 0.000 claims description 17
- 101001057504 Homo sapiens Interferon-stimulated gene 20 kDa protein Proteins 0.000 claims description 14
- -1 NHF1 B Proteins 0.000 claims description 14
- 230000000295 complement effect Effects 0.000 claims description 14
- 238000013519 translation Methods 0.000 claims description 12
- 108091026890 Coding region Proteins 0.000 claims description 10
- 108091026898 Leader sequence (mRNA) Proteins 0.000 claims description 7
- 108010025464 Cyclin-Dependent Kinase 4 Proteins 0.000 claims description 6
- 102100036252 Cyclin-dependent kinase 4 Human genes 0.000 claims description 6
- 108020004414 DNA Proteins 0.000 claims description 6
- 101000777277 Homo sapiens Serine/threonine-protein kinase Chk2 Proteins 0.000 claims description 6
- NIDVTARKFBZMOT-PEBGCTIMSA-N N(4)-acetylcytidine Chemical compound O=C1N=C(NC(=O)C)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 NIDVTARKFBZMOT-PEBGCTIMSA-N 0.000 claims description 6
- 102100031075 Serine/threonine-protein kinase Chk2 Human genes 0.000 claims description 6
- 230000004048 modification Effects 0.000 claims description 6
- 238000012986 modification Methods 0.000 claims description 6
- 238000012545 processing Methods 0.000 claims description 6
- 238000011282 treatment Methods 0.000 claims description 6
- 101000631760 Homo sapiens Sodium channel protein type 1 subunit alpha Proteins 0.000 claims description 4
- KDCGOANMDULRCW-UHFFFAOYSA-N Purine Natural products N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 claims description 4
- 102100028910 Sodium channel protein type 1 subunit alpha Human genes 0.000 claims description 4
- 230000003247 decreasing effect Effects 0.000 claims description 4
- 125000000561 purinyl group Chemical group N1=C(N=C2N=CNC2=C1)* 0.000 claims description 4
- WVAKRQOMAINQPU-UHFFFAOYSA-N 2-[4-[2-[5-(2,2-dimethylbutyl)-1h-imidazol-2-yl]ethyl]phenyl]pyridine Chemical compound N1C(CC(C)(C)CC)=CN=C1CCC1=CC=C(C=2N=CC=CC=2)C=C1 WVAKRQOMAINQPU-UHFFFAOYSA-N 0.000 claims description 3
- 102100033793 ALK tyrosine kinase receptor Human genes 0.000 claims description 3
- 102000000872 ATM Human genes 0.000 claims description 3
- 102100034540 Adenomatous polyposis coli protein Human genes 0.000 claims description 3
- 108010004586 Ataxia Telangiectasia Mutated Proteins Proteins 0.000 claims description 3
- 102100037152 BAG family molecular chaperone regulator 1 Human genes 0.000 claims description 3
- 101700002522 BARD1 Proteins 0.000 claims description 3
- 108700020463 BRCA1 Proteins 0.000 claims description 3
- 102000036365 BRCA1 Human genes 0.000 claims description 3
- 101150072950 BRCA1 gene Proteins 0.000 claims description 3
- 102100028048 BRCA1-associated RING domain protein 1 Human genes 0.000 claims description 3
- 102000052609 BRCA2 Human genes 0.000 claims description 3
- 108700020462 BRCA2 Proteins 0.000 claims description 3
- 102100037674 Bis(5'-adenosyl)-triphosphatase Human genes 0.000 claims description 3
- 102100025423 Bone morphogenetic protein receptor type-1A Human genes 0.000 claims description 3
- 101150008921 Brca2 gene Proteins 0.000 claims description 3
- 102100028226 COUP transcription factor 2 Human genes 0.000 claims description 3
- 102100021975 CREB-binding protein Human genes 0.000 claims description 3
- 102100028914 Catenin beta-1 Human genes 0.000 claims description 3
- 108091007854 Cdh1/Fizzy-related Proteins 0.000 claims description 3
- 102000038594 Cdh1/Fizzy-related Human genes 0.000 claims description 3
- 102100031265 Chromodomain-helicase-DNA-binding protein 2 Human genes 0.000 claims description 3
- 102100038165 Chromodomain-helicase-DNA-binding protein 8 Human genes 0.000 claims description 3
- 102100033601 Collagen alpha-1(I) chain Human genes 0.000 claims description 3
- 102100036213 Collagen alpha-2(I) chain Human genes 0.000 claims description 3
- 108010024682 Core Binding Factor Alpha 1 Subunit Proteins 0.000 claims description 3
- 102000015775 Core Binding Factor Alpha 1 Subunit Human genes 0.000 claims description 3
- 108010043471 Core Binding Factor Alpha 2 Subunit Proteins 0.000 claims description 3
- 108010009392 Cyclin-Dependent Kinase Inhibitor p16 Proteins 0.000 claims description 3
- 102000004480 Cyclin-Dependent Kinase Inhibitor p57 Human genes 0.000 claims description 3
- 108010017222 Cyclin-Dependent Kinase Inhibitor p57 Proteins 0.000 claims description 3
- 108010079245 Cystic Fibrosis Transmembrane Conductance Regulator Proteins 0.000 claims description 3
- 102100034157 DNA mismatch repair protein Msh2 Human genes 0.000 claims description 3
- 102100021147 DNA mismatch repair protein Msh6 Human genes 0.000 claims description 3
- 102100037799 DNA-binding protein Ikaros Human genes 0.000 claims description 3
- 241000702421 Dependoparvovirus Species 0.000 claims description 3
- 102100031480 Dual specificity mitogen-activated protein kinase kinase 1 Human genes 0.000 claims description 3
- 102000012804 EPCAM Human genes 0.000 claims description 3
- 101150084967 EPCAM gene Proteins 0.000 claims description 3
- 102100031785 Endothelial transcription factor GATA-2 Human genes 0.000 claims description 3
- 101710105178 F-box/WD repeat-containing protein 7 Proteins 0.000 claims description 3
- 102100028138 F-box/WD repeat-containing protein 7 Human genes 0.000 claims description 3
- 101150026630 FOXG1 gene Proteins 0.000 claims description 3
- 102100037008 Factor in the germline alpha Human genes 0.000 claims description 3
- 108010067741 Fanconi Anemia Complementation Group N protein Proteins 0.000 claims description 3
- 102100034553 Fanconi anemia group J protein Human genes 0.000 claims description 3
- 102100031509 Fibrillin-1 Human genes 0.000 claims description 3
- 102100023600 Fibroblast growth factor receptor 2 Human genes 0.000 claims description 3
- 101710182389 Fibroblast growth factor receptor 2 Proteins 0.000 claims description 3
- 108010010285 Forkhead Box Protein L2 Proteins 0.000 claims description 3
- 102100020871 Forkhead box protein G1 Human genes 0.000 claims description 3
- 102100035137 Forkhead box protein L2 Human genes 0.000 claims description 3
- 102100028115 Forkhead box protein P2 Human genes 0.000 claims description 3
- 102100039820 Frizzled-4 Human genes 0.000 claims description 3
- 102100029974 GTPase HRas Human genes 0.000 claims description 3
- 102100039788 GTPase NRas Human genes 0.000 claims description 3
- 102100025334 Guanine nucleotide-binding protein G(q) subunit alpha Human genes 0.000 claims description 3
- 102100032610 Guanine nucleotide-binding protein G(s) subunit alpha isoforms XLas Human genes 0.000 claims description 3
- 102100022054 Hepatocyte nuclear factor 4-alpha Human genes 0.000 claims description 3
- 102100035108 High affinity nerve growth factor receptor Human genes 0.000 claims description 3
- 102100033071 Histone acetyltransferase KAT6A Human genes 0.000 claims description 3
- 102100038885 Histone acetyltransferase p300 Human genes 0.000 claims description 3
- 102100022103 Histone-lysine N-methyltransferase 2A Human genes 0.000 claims description 3
- 102100029768 Histone-lysine N-methyltransferase SETD1A Human genes 0.000 claims description 3
- 102100029239 Histone-lysine N-methyltransferase, H3 lysine-36 specific Human genes 0.000 claims description 3
- 102100021088 Homeobox protein Hox-B13 Human genes 0.000 claims description 3
- 102100040227 Homeobox protein Hox-D13 Human genes 0.000 claims description 3
- 102100034826 Homeobox protein Meis2 Human genes 0.000 claims description 3
- 102100027875 Homeobox protein Nkx-2.5 Human genes 0.000 claims description 3
- 102100027345 Homeobox protein SIX3 Human genes 0.000 claims description 3
- 101000779641 Homo sapiens ALK tyrosine kinase receptor Proteins 0.000 claims description 3
- 101000924577 Homo sapiens Adenomatous polyposis coli protein Proteins 0.000 claims description 3
- 101000740062 Homo sapiens BAG family molecular chaperone regulator 1 Proteins 0.000 claims description 3
- 101000934638 Homo sapiens Bone morphogenetic protein receptor type-1A Proteins 0.000 claims description 3
- 101000860860 Homo sapiens COUP transcription factor 2 Proteins 0.000 claims description 3
- 101000896987 Homo sapiens CREB-binding protein Proteins 0.000 claims description 3
- 101000916173 Homo sapiens Catenin beta-1 Proteins 0.000 claims description 3
- 101000777079 Homo sapiens Chromodomain-helicase-DNA-binding protein 2 Proteins 0.000 claims description 3
- 101000883545 Homo sapiens Chromodomain-helicase-DNA-binding protein 8 Proteins 0.000 claims description 3
- 101000875067 Homo sapiens Collagen alpha-2(I) chain Proteins 0.000 claims description 3
- 101001134036 Homo sapiens DNA mismatch repair protein Msh2 Proteins 0.000 claims description 3
- 101000968658 Homo sapiens DNA mismatch repair protein Msh6 Proteins 0.000 claims description 3
- 101000599038 Homo sapiens DNA-binding protein Ikaros Proteins 0.000 claims description 3
- 101000880945 Homo sapiens Down syndrome cell adhesion molecule Proteins 0.000 claims description 3
- 101001066265 Homo sapiens Endothelial transcription factor GATA-2 Proteins 0.000 claims description 3
- 101000878291 Homo sapiens Factor in the germline alpha Proteins 0.000 claims description 3
- 101000848171 Homo sapiens Fanconi anemia group J protein Proteins 0.000 claims description 3
- 101000846893 Homo sapiens Fibrillin-1 Proteins 0.000 claims description 3
- 101001059881 Homo sapiens Forkhead box protein P2 Proteins 0.000 claims description 3
- 101000885581 Homo sapiens Frizzled-4 Proteins 0.000 claims description 3
- 101000584633 Homo sapiens GTPase HRas Proteins 0.000 claims description 3
- 101000744505 Homo sapiens GTPase NRas Proteins 0.000 claims description 3
- 101000857888 Homo sapiens Guanine nucleotide-binding protein G(q) subunit alpha Proteins 0.000 claims description 3
- 101001014590 Homo sapiens Guanine nucleotide-binding protein G(s) subunit alpha isoforms XLas Proteins 0.000 claims description 3
- 101001014594 Homo sapiens Guanine nucleotide-binding protein G(s) subunit alpha isoforms short Proteins 0.000 claims description 3
- 101001045740 Homo sapiens Hepatocyte nuclear factor 4-alpha Proteins 0.000 claims description 3
- 101000596894 Homo sapiens High affinity nerve growth factor receptor Proteins 0.000 claims description 3
- 101000944179 Homo sapiens Histone acetyltransferase KAT6A Proteins 0.000 claims description 3
- 101000882390 Homo sapiens Histone acetyltransferase p300 Proteins 0.000 claims description 3
- 101001045846 Homo sapiens Histone-lysine N-methyltransferase 2A Proteins 0.000 claims description 3
- 101000865038 Homo sapiens Histone-lysine N-methyltransferase SETD1A Proteins 0.000 claims description 3
- 101000634050 Homo sapiens Histone-lysine N-methyltransferase, H3 lysine-36 specific Proteins 0.000 claims description 3
- 101001041145 Homo sapiens Homeobox protein Hox-B13 Proteins 0.000 claims description 3
- 101001037168 Homo sapiens Homeobox protein Hox-D13 Proteins 0.000 claims description 3
- 101001019057 Homo sapiens Homeobox protein Meis2 Proteins 0.000 claims description 3
- 101000632197 Homo sapiens Homeobox protein Nkx-2.5 Proteins 0.000 claims description 3
- 101000651928 Homo sapiens Homeobox protein SIX3 Proteins 0.000 claims description 3
- 101001011446 Homo sapiens Interferon regulatory factor 6 Proteins 0.000 claims description 3
- 101001027628 Homo sapiens Kinesin-like protein KIF21A Proteins 0.000 claims description 3
- 101001017592 Homo sapiens Mediator of RNA polymerase II transcription subunit 13-like Proteins 0.000 claims description 3
- 101001030211 Homo sapiens Myc proto-oncogene protein Proteins 0.000 claims description 3
- 101001014610 Homo sapiens Neuroendocrine secretory protein 55 Proteins 0.000 claims description 3
- 101000572986 Homo sapiens POU domain, class 3, transcription factor 2 Proteins 0.000 claims description 3
- 101000613577 Homo sapiens Paired box protein Pax-2 Proteins 0.000 claims description 3
- 101000613490 Homo sapiens Paired box protein Pax-3 Proteins 0.000 claims description 3
- 101000601724 Homo sapiens Paired box protein Pax-5 Proteins 0.000 claims description 3
- 101000601664 Homo sapiens Paired box protein Pax-8 Proteins 0.000 claims description 3
- 101000741790 Homo sapiens Peroxisome proliferator-activated receptor gamma Proteins 0.000 claims description 3
- 101000605639 Homo sapiens Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit alpha isoform Proteins 0.000 claims description 3
- 101000595669 Homo sapiens Pituitary homeobox 2 Proteins 0.000 claims description 3
- 101001126417 Homo sapiens Platelet-derived growth factor receptor alpha Proteins 0.000 claims description 3
- 101000610107 Homo sapiens Pre-B-cell leukemia transcription factor 1 Proteins 0.000 claims description 3
- 101000611614 Homo sapiens Proline-rich protein PRCC Proteins 0.000 claims description 3
- 101000797903 Homo sapiens Protein ALEX Proteins 0.000 claims description 3
- 101000579425 Homo sapiens Proto-oncogene tyrosine-protein kinase receptor Ret Proteins 0.000 claims description 3
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 claims description 3
- 101000932478 Homo sapiens Receptor-type tyrosine-protein kinase FLT3 Proteins 0.000 claims description 3
- 101000654718 Homo sapiens SET-binding protein Proteins 0.000 claims description 3
- 101000984753 Homo sapiens Serine/threonine-protein kinase B-raf Proteins 0.000 claims description 3
- 101000628562 Homo sapiens Serine/threonine-protein kinase STK11 Proteins 0.000 claims description 3
- 101000616761 Homo sapiens Single-minded homolog 2 Proteins 0.000 claims description 3
- 101000595526 Homo sapiens T-box brain protein 1 Proteins 0.000 claims description 3
- 101000713590 Homo sapiens T-box transcription factor TBX1 Proteins 0.000 claims description 3
- 101000837845 Homo sapiens Transcription factor E3 Proteins 0.000 claims description 3
- 101000813738 Homo sapiens Transcription factor ETV6 Proteins 0.000 claims description 3
- 101000819074 Homo sapiens Transcription factor GATA-4 Proteins 0.000 claims description 3
- 101000664703 Homo sapiens Transcription factor SOX-10 Proteins 0.000 claims description 3
- 101000687905 Homo sapiens Transcription factor SOX-2 Proteins 0.000 claims description 3
- 101000711846 Homo sapiens Transcription factor SOX-9 Proteins 0.000 claims description 3
- 101001074042 Homo sapiens Transcriptional activator GLI3 Proteins 0.000 claims description 3
- 101000808011 Homo sapiens Vascular endothelial growth factor A Proteins 0.000 claims description 3
- 101000931371 Homo sapiens Zinc finger protein ZFPM2 Proteins 0.000 claims description 3
- 101000976643 Homo sapiens Zinc finger protein ZIC 2 Proteins 0.000 claims description 3
- 101001026573 Homo sapiens cAMP-dependent protein kinase type I-alpha regulatory subunit Proteins 0.000 claims description 3
- 102100030130 Interferon regulatory factor 6 Human genes 0.000 claims description 3
- 102100037688 Kinesin-like protein KIF21A Human genes 0.000 claims description 3
- 241000713666 Lentivirus Species 0.000 claims description 3
- 108010068342 MAP Kinase Kinase 1 Proteins 0.000 claims description 3
- 101150083522 MECP2 gene Proteins 0.000 claims description 3
- 108010018650 MEF2 Transcription Factors Proteins 0.000 claims description 3
- 229910015837 MSH2 Inorganic materials 0.000 claims description 3
- 102100034164 Mediator of RNA polymerase II transcription subunit 13-like Human genes 0.000 claims description 3
- 102100039124 Methyl-CpG-binding protein 2 Human genes 0.000 claims description 3
- 108010074346 Mismatch Repair Endonuclease PMS2 Proteins 0.000 claims description 3
- 102000008071 Mismatch Repair Endonuclease PMS2 Human genes 0.000 claims description 3
- 102100025725 Mothers against decapentaplegic homolog 4 Human genes 0.000 claims description 3
- 101710143112 Mothers against decapentaplegic homolog 4 Proteins 0.000 claims description 3
- 102000013609 MutL Protein Homolog 1 Human genes 0.000 claims description 3
- 108010026664 MutL Protein Homolog 1 Proteins 0.000 claims description 3
- 102100038895 Myc proto-oncogene protein Human genes 0.000 claims description 3
- 102100039229 Myocyte-specific enhancer factor 2C Human genes 0.000 claims description 3
- 102000007530 Neurofibromin 1 Human genes 0.000 claims description 3
- 108010085793 Neurofibromin 1 Proteins 0.000 claims description 3
- 108010032788 PAX6 Transcription Factor Proteins 0.000 claims description 3
- 102100026459 POU domain, class 3, transcription factor 2 Human genes 0.000 claims description 3
- 102100040852 Paired box protein Pax-2 Human genes 0.000 claims description 3
- 102100040891 Paired box protein Pax-3 Human genes 0.000 claims description 3
- 102100037504 Paired box protein Pax-5 Human genes 0.000 claims description 3
- 102100037506 Paired box protein Pax-6 Human genes 0.000 claims description 3
- 102100037502 Paired box protein Pax-8 Human genes 0.000 claims description 3
- 102100040884 Partner and localizer of BRCA2 Human genes 0.000 claims description 3
- 102100038825 Peroxisome proliferator-activated receptor gamma Human genes 0.000 claims description 3
- 102100038332 Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit alpha isoform Human genes 0.000 claims description 3
- 102100036090 Pituitary homeobox 2 Human genes 0.000 claims description 3
- 102100030485 Platelet-derived growth factor receptor alpha Human genes 0.000 claims description 3
- 102100040171 Pre-B-cell leukemia transcription factor 1 Human genes 0.000 claims description 3
- 102100040829 Proline-rich protein PRCC Human genes 0.000 claims description 3
- 102100028286 Proto-oncogene tyrosine-protein kinase receptor Ret Human genes 0.000 claims description 3
- 108010068097 Rad51 Recombinase Proteins 0.000 claims description 3
- 102000002490 Rad51 Recombinase Human genes 0.000 claims description 3
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 claims description 3
- 102100020718 Receptor-type tyrosine-protein kinase FLT3 Human genes 0.000 claims description 3
- 102000002278 Ribosomal Proteins Human genes 0.000 claims description 3
- 108010000605 Ribosomal Proteins Proteins 0.000 claims description 3
- 102100025373 Runt-related transcription factor 1 Human genes 0.000 claims description 3
- 102100032741 SET-binding protein Human genes 0.000 claims description 3
- 108091007634 SLC52A2 Proteins 0.000 claims description 3
- 102000001332 SRC Human genes 0.000 claims description 3
- 108060006706 SRC Proteins 0.000 claims description 3
- 102100027103 Serine/threonine-protein kinase B-raf Human genes 0.000 claims description 3
- 102100026715 Serine/threonine-protein kinase STK11 Human genes 0.000 claims description 3
- 102100021825 Single-minded homolog 2 Human genes 0.000 claims description 3
- 102100036862 Solute carrier family 52, riboflavin transporter, member 2 Human genes 0.000 claims description 3
- 102100032889 Sortilin Human genes 0.000 claims description 3
- 108010048349 Steroidogenic Factor 1 Proteins 0.000 claims description 3
- 102100029856 Steroidogenic factor 1 Human genes 0.000 claims description 3
- 102100036083 T-box brain protein 1 Human genes 0.000 claims description 3
- 108010014480 T-box transcription factor 5 Proteins 0.000 claims description 3
- 102100036771 T-box transcription factor TBX1 Human genes 0.000 claims description 3
- 102100024755 T-box transcription factor TBX5 Human genes 0.000 claims description 3
- 101150057140 TACSTD1 gene Proteins 0.000 claims description 3
- 102100028507 Transcription factor E3 Human genes 0.000 claims description 3
- 102100039580 Transcription factor ETV6 Human genes 0.000 claims description 3
- 102100021380 Transcription factor GATA-4 Human genes 0.000 claims description 3
- 102100038808 Transcription factor SOX-10 Human genes 0.000 claims description 3
- 102100024270 Transcription factor SOX-2 Human genes 0.000 claims description 3
- 102100034204 Transcription factor SOX-9 Human genes 0.000 claims description 3
- 102100035559 Transcriptional activator GLI3 Human genes 0.000 claims description 3
- 108010078814 Tumor Suppressor Protein p53 Proteins 0.000 claims description 3
- 102100033254 Tumor suppressor ARF Human genes 0.000 claims description 3
- 102100039037 Vascular endothelial growth factor A Human genes 0.000 claims description 3
- 108010088665 Zinc Finger Protein Gli2 Proteins 0.000 claims description 3
- 102100035558 Zinc finger protein GLI2 Human genes 0.000 claims description 3
- 102100020996 Zinc finger protein ZFPM2 Human genes 0.000 claims description 3
- 102100023492 Zinc finger protein ZIC 2 Human genes 0.000 claims description 3
- 108010029483 alpha 1 Chain Collagen Type I Proteins 0.000 claims description 3
- 108010005713 bis(5'-adenosyl)triphosphatase Proteins 0.000 claims description 3
- 102100037490 cAMP-dependent protein kinase type I-alpha regulatory subunit Human genes 0.000 claims description 3
- 102000052116 epidermal growth factor receptor activity proteins Human genes 0.000 claims description 3
- 108700015053 epidermal growth factor receptor activity proteins Proteins 0.000 claims description 3
- 102000008371 intracellularly ATP-gated chloride channel activity proteins Human genes 0.000 claims description 3
- YOHYSYJDKVYCJI-UHFFFAOYSA-N n-[3-[[6-[3-(trifluoromethyl)anilino]pyrimidin-4-yl]amino]phenyl]cyclopropanecarboxamide Chemical compound FC(F)(F)C1=CC=CC(NC=2N=CN=C(NC=3C=C(NC(=O)C4CC4)C=CC=3)C=2)=C1 YOHYSYJDKVYCJI-UHFFFAOYSA-N 0.000 claims description 3
- 230000007111 proteostasis Effects 0.000 claims description 3
- 108010014657 sortilin Proteins 0.000 claims description 3
- 238000013518 transcription Methods 0.000 claims description 3
- 230000035897 transcription Effects 0.000 claims description 3
- 241000701161 unidentified adenovirus Species 0.000 claims description 3
- 241001430294 unidentified retrovirus Species 0.000 claims description 3
- 102100022057 Hepatocyte nuclear factor 1-alpha Human genes 0.000 claims description 2
- 101001045751 Homo sapiens Hepatocyte nuclear factor 1-alpha Proteins 0.000 claims description 2
- 101001088739 Homo sapiens Probable inactive ribonuclease-like protein 12 Proteins 0.000 claims description 2
- 101000779418 Homo sapiens RAC-alpha serine/threonine-protein kinase Proteins 0.000 claims description 2
- 101001024635 Homo sapiens RNA cytidine acetyltransferase Proteins 0.000 claims description 2
- 101001099922 Homo sapiens Retinoic acid-induced protein 1 Proteins 0.000 claims description 2
- 102100033810 RAC-alpha serine/threonine-protein kinase Human genes 0.000 claims description 2
- 102100038470 Retinoic acid-induced protein 1 Human genes 0.000 claims description 2
- 241000700605 Viruses Species 0.000 claims description 2
- 102000012163 WW Domain-Containing Oxidoreductase Human genes 0.000 claims description 2
- 108010036639 WW Domain-Containing Oxidoreductase Proteins 0.000 claims description 2
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 2
- 101100127296 Dictyostelium discoideum kif1 gene Proteins 0.000 claims 1
- 101100127288 Mus musculus Kif1a gene Proteins 0.000 claims 1
- 102000015098 Tumor Suppressor Protein p53 Human genes 0.000 claims 1
- 108091028043 Nucleic acid sequence Proteins 0.000 abstract description 7
- 210000004027 cell Anatomy 0.000 description 59
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 34
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 34
- 101710160924 RNA cytidine acetyltransferase Proteins 0.000 description 27
- 239000012528 membrane Substances 0.000 description 25
- 230000014616 translation Effects 0.000 description 22
- 150000007523 nucleic acids Chemical class 0.000 description 17
- 238000001890 transfection Methods 0.000 description 16
- 239000008267 milk Substances 0.000 description 15
- 210000004080 milk Anatomy 0.000 description 15
- 235000013336 milk Nutrition 0.000 description 15
- 102000039446 nucleic acids Human genes 0.000 description 13
- 108020004707 nucleic acids Proteins 0.000 description 13
- 239000013612 plasmid Substances 0.000 description 13
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 11
- 238000000338 in vitro Methods 0.000 description 11
- 239000000872 buffer Substances 0.000 description 10
- 239000000203 mixture Substances 0.000 description 10
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 9
- 229930185560 Pseudouridine Natural products 0.000 description 9
- PTJWIQPHWPFNBW-UHFFFAOYSA-N Pseudouridine C Natural products OC1C(O)C(CO)OC1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-UHFFFAOYSA-N 0.000 description 9
- WGDUUQDYDIIBKT-UHFFFAOYSA-N beta-Pseudouridine Natural products OC1OC(CN2C=CC(=O)NC2=O)C(O)C1O WGDUUQDYDIIBKT-UHFFFAOYSA-N 0.000 description 9
- PTJWIQPHWPFNBW-GBNDHIKLSA-N pseudouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-GBNDHIKLSA-N 0.000 description 9
- 239000003153 chemical reaction reagent Substances 0.000 description 8
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 8
- 238000004949 mass spectrometry Methods 0.000 description 8
- 108700043045 nanoluc Proteins 0.000 description 8
- 239000002953 phosphate buffered saline Substances 0.000 description 8
- 241000699666 Mus <mouse, genus> Species 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 239000002609 medium Substances 0.000 description 7
- 239000000758 substrate Substances 0.000 description 7
- 238000012384 transportation and delivery Methods 0.000 description 7
- 239000003656 tris buffered saline Substances 0.000 description 7
- 238000001262 western blot Methods 0.000 description 7
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 6
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 6
- 241000283973 Oryctolagus cuniculus Species 0.000 description 6
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 6
- 230000000692 anti-sense effect Effects 0.000 description 6
- 230000027455 binding Effects 0.000 description 6
- 244000309466 calf Species 0.000 description 6
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 6
- 238000011068 loading method Methods 0.000 description 6
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 5
- 229930182816 L-glutamine Natural products 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 238000001727 in vivo Methods 0.000 description 5
- 238000010348 incorporation Methods 0.000 description 5
- 210000002966 serum Anatomy 0.000 description 5
- 102100025064 Cellular tumor antigen p53 Human genes 0.000 description 4
- 201000007547 Dravet syndrome Diseases 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- 238000012156 HITS-CLIP Methods 0.000 description 4
- 108091092195 Intron Proteins 0.000 description 4
- 239000007993 MOPS buffer Substances 0.000 description 4
- 208000036572 Myoclonic epilepsy Diseases 0.000 description 4
- 229920001213 Polysorbate 20 Polymers 0.000 description 4
- 206010073677 Severe myoclonic epilepsy of infancy Diseases 0.000 description 4
- 239000000074 antisense oligonucleotide Substances 0.000 description 4
- 238000012230 antisense oligonucleotides Methods 0.000 description 4
- 125000002680 canonical nucleotide group Chemical group 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 239000003814 drug Substances 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000000977 initiatory effect Effects 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 4
- 238000003752 polymerase chain reaction Methods 0.000 description 4
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 4
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 4
- 230000035945 sensitivity Effects 0.000 description 4
- 229910021642 ultra pure water Inorganic materials 0.000 description 4
- 239000012498 ultrapure water Substances 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- 108020000948 Antisense Oligonucleotides Proteins 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 108700024394 Exon Proteins 0.000 description 3
- 239000012097 Lipofectamine 2000 Substances 0.000 description 3
- 208000001826 Marfan syndrome Diseases 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 3
- 239000012083 RIPA buffer Substances 0.000 description 3
- 208000006289 Rett Syndrome Diseases 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 239000011324 bead Substances 0.000 description 3
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 3
- OWMVSZAMULFTJU-UHFFFAOYSA-N bis-tris Chemical compound OCCN(CCO)C(CO)(CO)CO OWMVSZAMULFTJU-UHFFFAOYSA-N 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 238000012155 cross-linking immunoprecipitation Methods 0.000 description 3
- 238000009472 formulation Methods 0.000 description 3
- 210000005260 human cell Anatomy 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 238000004020 luminiscence type Methods 0.000 description 3
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 238000011002 quantification Methods 0.000 description 3
- 230000002829 reductive effect Effects 0.000 description 3
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 3
- 229940045145 uridine Drugs 0.000 description 3
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- 108020004463 18S ribosomal RNA Proteins 0.000 description 2
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 101000971697 Homo sapiens Kinesin-like protein KIF1B Proteins 0.000 description 2
- 102100021524 Kinesin-like protein KIF1B Human genes 0.000 description 2
- 239000000020 Nitrocellulose Substances 0.000 description 2
- 108700020796 Oncogene Proteins 0.000 description 2
- 102000043276 Oncogene Human genes 0.000 description 2
- 239000012124 Opti-MEM Substances 0.000 description 2
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 2
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 108091036066 Three prime untranslated region Proteins 0.000 description 2
- 238000002679 ablation Methods 0.000 description 2
- 239000002787 antisense oligonuctleotide Substances 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 230000002939 deleterious effect Effects 0.000 description 2
- 238000012159 eCLIP Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 235000019253 formic acid Nutrition 0.000 description 2
- 238000001415 gene therapy Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 229920001220 nitrocellulos Polymers 0.000 description 2
- 239000002777 nucleoside Substances 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 238000001243 protein synthesis Methods 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 230000014621 translational initiation Effects 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- YIMATHOGWXZHFX-WCTZXXKLSA-N (2r,3r,4r,5r)-5-(hydroxymethyl)-3-(2-methoxyethoxy)oxolane-2,4-diol Chemical compound COCCO[C@H]1[C@H](O)O[C@H](CO)[C@H]1O YIMATHOGWXZHFX-WCTZXXKLSA-N 0.000 description 1
- UCKYOOZPSJFJIZ-FMDGEEDCSA-N (4r)-1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-4-hydroxy-1,3-diazinan-2-one Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)N[C@H](O)CC1 UCKYOOZPSJFJIZ-FMDGEEDCSA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- ZAYHVCMSTBRABG-UHFFFAOYSA-N 5-Methylcytidine Natural products O=C1N=C(N)C(C)=CN1C1C(O)C(O)C(CO)O1 ZAYHVCMSTBRABG-UHFFFAOYSA-N 0.000 description 1
- ZAYHVCMSTBRABG-JXOAFFINSA-N 5-methylcytidine Chemical compound O=C1N=C(N)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZAYHVCMSTBRABG-JXOAFFINSA-N 0.000 description 1
- 238000000035 BCA protein assay Methods 0.000 description 1
- 239000004322 Butylated hydroxytoluene Substances 0.000 description 1
- NLZUEZXRPGMBCV-UHFFFAOYSA-N Butylhydroxytoluene Chemical compound CC1=CC(C(C)(C)C)=C(O)C(C(C)(C)C)=C1 NLZUEZXRPGMBCV-UHFFFAOYSA-N 0.000 description 1
- 101100322915 Caenorhabditis elegans akt-1 gene Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 102100031780 Endonuclease Human genes 0.000 description 1
- 206010016334 Feeling hot Diseases 0.000 description 1
- 102100021519 Hemoglobin subunit beta Human genes 0.000 description 1
- 108091005904 Hemoglobin subunit beta Proteins 0.000 description 1
- 102100022123 Hepatocyte nuclear factor 1-beta Human genes 0.000 description 1
- 206010050469 Holt-Oram syndrome Diseases 0.000 description 1
- 101001045758 Homo sapiens Hepatocyte nuclear factor 1-beta Proteins 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 102000006890 Methyl-CpG-Binding Protein 2 Human genes 0.000 description 1
- 108010072388 Methyl-CpG-Binding Protein 2 Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- VQAYFKKCNSOZKM-IOSLPCCCSA-N N(6)-methyladenosine Chemical class C1=NC=2C(NC)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O VQAYFKKCNSOZKM-IOSLPCCCSA-N 0.000 description 1
- 208000012902 Nervous system disease Diseases 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 239000002033 PVDF binder Substances 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 102100024544 SURP and G-patch domain-containing protein 1 Human genes 0.000 description 1
- 108010034546 Serratia marcescens nuclease Proteins 0.000 description 1
- 201000001388 Smith-Magenis syndrome Diseases 0.000 description 1
- 108010052164 Sodium Channels Proteins 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 1
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 208000026724 Waardenburg syndrome Diseases 0.000 description 1
- 206010049644 Williams syndrome Diseases 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 150000001413 amino acids Chemical group 0.000 description 1
- 230000003042 antagnostic effect Effects 0.000 description 1
- 230000003443 anti-oncogenic effect Effects 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- UDSAIICHUKSCKT-UHFFFAOYSA-N bromophenol blue Chemical compound C1=C(Br)C(O)=C(Br)C=C1C1(C=2C=C(Br)C(O)=C(Br)C=2)C2=CC=CC=C2S(=O)(=O)O1 UDSAIICHUKSCKT-UHFFFAOYSA-N 0.000 description 1
- 235000010354 butylated hydroxytoluene Nutrition 0.000 description 1
- 229940095259 butylated hydroxytoluene Drugs 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 238000011072 cell harvest Methods 0.000 description 1
- 230000005754 cellular signaling Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000009109 curative therapy Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 238000012377 drug delivery Methods 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000002500 effect on skin Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 238000007672 fourth generation sequencing Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 239000012737 fresh medium Substances 0.000 description 1
- 238000010362 genome editing Methods 0.000 description 1
- 210000004602 germ cell Anatomy 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 238000000589 high-performance liquid chromatography-mass spectrometry Methods 0.000 description 1
- 102000048904 human NAT10 Human genes 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 239000002105 nanoparticle Substances 0.000 description 1
- 239000006199 nebulizer Substances 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 102000042567 non-coding RNA Human genes 0.000 description 1
- 108091027963 non-coding RNA Proteins 0.000 description 1
- 238000010899 nucleation Methods 0.000 description 1
- 238000011330 nucleic acid test Methods 0.000 description 1
- 150000003833 nucleoside derivatives Chemical class 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 239000000825 pharmaceutical preparation Substances 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 238000002731 protein assay Methods 0.000 description 1
- 230000002685 pulmonary effect Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000001718 repressive effect Effects 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 150000003290 ribose derivatives Chemical class 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 238000007790 scraping Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000007493 shaping process Methods 0.000 description 1
- 108010062513 snake venom phosphodiesterase I Proteins 0.000 description 1
- 239000012475 sodium chloride buffer Substances 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- VVLFAAMTGMGYBS-UHFFFAOYSA-M sodium;4-[[4-(ethylamino)-3-methylphenyl]-(4-ethylimino-3-methylcyclohexa-2,5-dien-1-ylidene)methyl]-3-sulfobenzenesulfonate Chemical compound [Na+].C1=C(C)C(NCC)=CC=C1C(C=1C(=CC(=CC=1)S([O-])(=O)=O)S(O)(=O)=O)=C1C=C(C)C(=NCC)C=C1 VVLFAAMTGMGYBS-UHFFFAOYSA-M 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000004885 tandem mass spectrometry Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/111—General methods applicable to biologically active non-coding nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/67—General methods for enhancing the expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/11—Antisense
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/30—Chemical structure
- C12N2310/35—Nature of the modification
- C12N2310/351—Conjugate
- C12N2310/3519—Fusion with another nucleic acid
Definitions
- compositions and methods related to directed acetylation by engineered snoRNA molecules are provided.
- Small nucleolar RNAs are non-coding RNAs that primarily accumulate in the nucleoli and typically comprise about 60 to 300 nucleotides. Based on the characteristic nucleotide motifs and association with canonical partner proteins, snoRNAs are classified into either C/D- (SNORD) or H/ACA-box (SNORA) subfamilies. Eukaryotic box C/D RNAs are typically 70 to 120 nucleotides in length and contain two conserved motif elements: box C (RUGAUGA; wherein R is a purine) and box D (CUGA) located at the 5’- and 3’-termini of the RNA molecule, respectively (see Figure 1).
- NAT10 Human N-acetyltransferase 10
- Traditional gene therapy approaches rely on eliminating deleterious transcripts.
- strategies that involve genome manipulation carry inherent risks and ablation does not address diseases that result from inadequate mRNA translation.
- there is a need fortreatment of any disease or disorder where moderate protein expression can lead to alleviation of disease phenotype (/.e., a disease or disorder with an associated haploinsufficiency).
- Provided herein are tools and methods that function in human cells for precise RNA epigenetic engineering.
- the compositions and methods provided herein can modulate gene regulation, target diseases, and serve as the basis of curative therapies for many previously untreatable diseases and/or disorders.
- snoRNA-guided acetylation of select mRNA The inventors examined the feasibility of capitalizing on NATIO’s property to direct acetylation, particularly the N4-acetylcytidine (ac4C) modification, to specific transcript substrates to regulate protein synthesis.
- engineered chimeric snoRNA guides can recruit NAT10 to a target transcript that causes directed acetylation of the target transcript, which regulates protein production.
- the inventors generated a chimeric snoRNA composed of the Box C/D snoRNA SNORD13 and antisense elements directed to, for example, GAPDH.
- the snoRNA was delivered into HeLa cells through incorporation into a minigene vector. Recovery of GAPDH transcripts and mass spectrometry confirmed successful targeted acetylation in human cells lines. As further described below, this disclosure provides directed acetylation through engineered snoRNA adapters and can be useful for the treatment of any disease or disorder where moderate protein expression can lead to alleviation of disease phenotype (/.e., a disease or disorder with an associated haploinsufficiency).
- this disclosure provides an engineered nucleotide sequence comprising, 5’ to 3’:
- the first targeting sequence comprises about 15 to about 30 nucleotides.
- the second targeting sequence comprises about 15 to about 30 nucleotides.
- the first targeting sequence and the second targeting sequence are complementary to separate regions of a target sequence.
- the target sequence comprises an RNA transcript.
- the target sequence of the RNA transcript comprises a 5’ untranslated region (UTR) of the RNA transcript.
- the target sequence of the RNA transcript comprises a coding sequence (CDS) of the RNA transcript.
- the target sequence comprises a known acetylation site or a putative acetylation site.
- the target sequence is within about 30-50 nucleotides of a known acetylation site or a putative acetylation site.
- the C box motif containing sequence comprises a 5’ RUGAUGA-3’ sequence (wherein R is a purine).
- the C box motif containing sequence comprises about 40 to about 200 nucleotides.
- the D box motif containing sequence comprises a 5’ CUGA-3’ sequence.
- the D box motif containing sequence comprises about 15 to about 50 nucleotides.
- the engineered nucleotide sequence recruits NAT10 to the target sequence and causes acetylation (e g., N4-acetylcytidine (ac4C) modification) of the target sequence.
- acetylation e g., N4-acetylcytidine (ac4C) modification
- the targeting sequences are complementary to a target sequence in a desired target gene, and the gene is involved or implicated in any disease or disorder where moderate protein expression can lead to alleviation of the disease or disorder phenotype.
- the desired target gene encodes a ribosomal protein, a gene involved in RNA processing, a gene involved in gene transcription and/or translation, or a gene involved in proteostasis.
- the desired target gene may be selected from the group consisting of, for example, but not limited to, CD25, CHD2, CFTR, CHD8, COL1A1 , COL1A2, FBN1, FIGLA, FOXG1 , FOXP2, FZD4, GATA2, GATA4, GLI2, GLI3, HNF1 B, HNF4A, HOXD13, IKZF1 , IL2RA, IRF6, KIF1B, KIF21A, MECP2, MED13L, MEF2C, MEIS2, NHF1B, NKX2-5, NSD1, NR2F2, NR5A1 , PAX2, PAX3, PAX5, PAX6, PBX1, PITX2, PPARG, POU3F2, RA11 , RUNX1, RUNX2, SCN1A, SETBP1, SETD1A, SIM, SIX3, SLC52A2, SOX2, SOX2, SOX2, SOX
- the targeting sequences are complementary to a target sequence in a desired target gene, and the desired target gene is involved or implicated in cancer.
- the desired target gene may be selected from the group consisting of, for example, but not limited to, AKT1 , ALK, APC, ATM, BAG1 , BARD1 , BMPR1A, BRAF, BRCA1 , BRCA2, BRIP1 , CDK4, CDH1 , CDK4, CDKN1C, CDKN2A, CHEK2, CHK2, CREBBP, CTNNB1 , EGFR, EP300, EPCAM, ERBB2, ETV6, FBXW7, FGFR2, FHIT, FLT3, FOXL2, GNAQ, GNAS, HOXB13, HRAS, KIT, KRAS, MAP2K1 , MET, MLH1 , MLL, MSH2, MSH6, MYC, NF1 , NTR3, NTRK1, NRAS, PALB2, PAX8, PDGFRA, PIK3CA, PMS
- the disclosure provides a vector comprising the engineered nucleotide sequence disclosed herein.
- the vector is a virus, a lentivirus, an adenovirus, a retrovirus, an adeno-associated virus (AAV), a transposon, a DNA vector, or a RNA vector.
- AAV adeno-associated virus
- the disclosure provides a pharmaceutical composition comprising the engineered nucleotide sequence disclosed herein, or the vector disclosed herein, and a pharmaceutically acceptable excipient.
- the engineered nucleotide sequence disclosed herein can be administered via a pharmaceutical composition comprising the engineered nucleotide sequence disclosed herein in a lipid nanoparticle composition.
- the disclosure provides a method for directing acetylation to a target sequence, wherein the method comprises administering the engineered nucleotide sequence disclosed herein, the vector disclosed herein, or the pharmaceutical composition disclosed herein.
- the disclosure provides a method for treating a disease or disorder, wherein the method comprises administering the engineered nucleotide sequence disclosed herein, the vector disclosed herein, or the pharmaceutical composition disclosed herein to a subject in need thereof.
- the disease or disorder comprises any disease or disorder where moderate protein expression can lead to alleviation of the disease or disorder phenotype (/.e., a disease or disorder with an associated haploinsufficiency).
- the disease or disorder comprises a cancer.
- the disclosure provides a use of the engineered nucleotide sequence disclosed herein, the vector disclosed herein, or the pharmaceutical composition disclosed herein in the treatment of a disease or disorder in a subject in need thereof.
- the disease or disorder comprises any disease or disorder where moderate protein expression can lead to alleviation of the disease or disorder phenotype (/.e., a disease or disorder with an associated haploinsufficiency).
- the disease or disorder is associated with a haploinsuffciency and may be selected from the group consisting of, for example, but not limited to, Rett syndrome, Marfan syndrome, Dravet syndrome, and cancers (for example, but not limited to, PTEN or p53 null).
- the method and/or use results in increased protein levels of the protein encoded by the target sequence.
- the method and/or use results in decreased protein levels of the protein encoded by the target sequence.
- FIG. 1 shows a schematic of the unnatural and engineered snoRNA, SAS-49, (5’- GUCCGACCUUCACCUUCCGAGCGUGAUGAUUGGGUGUUCAUACGCUUGUGUGAGAUGU GCCACCCUUUGUCUGAGCGAUGUGGUGGGCACAUUACCCGUCUGACC-3’; SEQ ID NO:9) designed to target the Kozak sequence of GAPDH mRNA (5’ CACAUCGCUCAGACACCAUGGGGAAGGUGAAGGUCGGA-3’; SEQ ID NO: 10). The box indicates the putative acetylation site.
- FIG. 2 shows a schematic of an excerpt of the pCMV [3-globin plasmid used for the delivery of the synthetic snoRNA analogue of SNORD13 (U13).
- Exons 1-3, as well as the intervening introns sequences stem from the natural p-globin gene and enable transfer and expression of unnatural snoRNAs in transfected cells.
- the snoRNA is inserted into the intron through utilization of the restriction enzymes Clal and Xhol.
- FIG. 3 shows expression of SAS-49 increases N4-acetylcytidine levels in GAPDH mRNA. Quantitative data for N4-acetylcytidine levels in isolated GAPDH mRNA determined by UHPLC-MS/MS analysis. The control sample represents results from HeLa WT cells transfected with the empty pCMV [3-globin plasmid. SAS-49 represents HeLa WT cells transfected with the pCMV p-globin plasmid containing the optimal SNORD13 analogue, SAS-49, targeting the Kozak sequence of GAPDH mRNA.
- FIG. 4A and 4B show a schematic of the NAT10/snoRNA scaffold (FIG. 4A) and positional information derived from CLIP and CLASH-seq (FIG. 4B)
- FIG. 5 shows a schematic for NAT10 adapters for mRNA acetylation.
- FIG. 6 shows that snoRNAs are a preferred substrate for NAT10 and establish their utility as flexible adapters.
- NAT10 CLIP-seq revealed 115 interactions with snoRNAs, of which 108 are of the Box C/D variety.
- FIG. 7 shows that depletion of NAT10 associated SNORD14B (U14B) in HeLa cells results in decreased RNA acetylation.
- NAT10 CLIP-seq identified strong binding to U14B.
- ASOs antisense oligonucleotides
- FIG. 8 shows that chimeric U13 reproducibly increases GAPDH mRNA acetylation. Mass spectrometric quantification of ac4C levels normalized against cytidine after targeting of GAPDH mRNA with chimeric SNORD13 in wildtype HeLa cells.
- FIG. 9A and 9B show the haploinsufficient mRNA CD25 exhibits enhanced sensitivity to ac4C as compared to PTEN mRNA.
- FIG. 9A Western blot for PTEN with GAPDH as loading control upon transfection of in vitro transcribed PTEN mRNA with varying levels of ac4C incorporation into wildtype HeLa cells.
- FIG. 9B Western blot for CD25 with GAPDH as loading control upon transfection of in vitro transcribed CD25 mRNA +/- ac4C into wild-type HeLa cells.
- FIG. 10A and 10B show that ac4C stimulates mRNA translation as compared to unmodified mRNA and mRNA containing the uridine modification pseudouridine.
- FIG. 10A A Western blot for Nanoluciferase (Nanoluc) protein levels in HeLa cells transfected with in vitro transcribed Nanoluc mRNA with either canonical nucleotides, or nucleoside triphosphates (NTPs), in which ac4C or pseudouridine (PseudoU) have been substituted for C or U, respectively. Endogenous GAPDH serves as a loading control.
- NTPs nucleoside triphosphates
- the term “about” is used to provide literal support for the exact number that it precedes, as well as a number that is near to or approximately the number that the term precedes. In determining whether a number is near to or approximately a specifically recited number, the near or approximating unrecited number may be a number which, in the context in which it is presented, provides the substantial equivalent of the specifically recited number. Ranges and amounts can be expressed as “about” a particular value or range. About can also include the exact amount. Typically, the term “about” includes an amount that would be expected to be within experimental error. The term “about” includes values that are within 10% less to 10% greater of the value provided.
- nucleic acid can be used interchangeably to refer to nucleic acid comprising DNA, cDNA, RNA, mRNA, derivatives thereof, or combinations thereof.
- Methods well known to those skilled in the art can be used to construct genetic expression constructs and recombinant cells according to this invention. These methods include in vitro recombinant DNA techniques, synthetic techniques, in vivo recombination techniques, and polymerase chain reaction (PCR) techniques.
- PCR polymerase chain reaction
- the inventors engineered chimeric snoRNA guides to recruit NAT10 to a target transcript that causes directed acetylation of the target transcript, which can regulate protein production.
- the inventors aim to target acetylation to specific mRNAs in cells to regulate protein synthesis (e.g., enhanced expression of tumor suppressor genes or reduced expression of oncogenes).
- snoRNAs small nucleolar RNAs
- NAT10 is the sole enzyme to mediate acetylation in human RNA and snoRNAs are established guides for RNA-modifying enzymes. The inventors aim to determine the potential of these snoRNA to target a transcript of interest for acetylation in in vivo experiments. Endogenous NAT10 would subsequently be recruited to the targeted transcript and produce the acetylated cytidine. Mass spectrometric analysis of the transcript, and (semi-)quantitative protein expression will be used to validate formation of ac4C and effects on protein biosynthesis. As proof of principle, the inventors modulated the NAT10-interacting snoRNA SNORD13 to target GAPDH mRNA.
- the engineered nucleotides as disclosed herein comprise (5’ to 3’) a first targeting sequence, a C box motif, a second targeting sequence, and a D box motif.
- the first targeting sequence can be from about 15 to about 30 nucleotides in length, for example 18 - 28 nucleotides, or 21-25 nucleotides.
- the C box motif can be about 40 to about 200 nucleotides in length, and comprises the sequence 5’ RUGAUGA-3’ (wherein R is a purine) or a portion thereof.
- the second targeting sequence can be from about 15 to about 30 nucleotides in length, for example 18 - 28 nucleotides, or 21-25 nucleotides.
- the D box motif can be about 15 to about 50 nucleotides in length (e.g., and can be substantially complementary to a portion of a target sequence). In some embodiments, the D box motif comprises a 5’-CUGA-3’ sequence.
- first targeting sequence and the second targeting sequence are substantially complementary to separate regions of a target sequence.
- first targeting sequence and the second targeting sequence are substantially complementary to separate regions of a target sequence in the same desired target gene.
- a target sequence serves to hybridize to a target nucleic acid sequence (/.e., the mRNA of a desired target gene for which modified expression is the result).
- target nucleic acid sequence /.e., the mRNA of a desired target gene for which modified expression is the result.
- the first targeting sequence and the second targeting sequence are substantially complementary to a portion of target nucleic acid sequence that is about is 15-30 nucleotides in length, but may be longer depending upon sequence and base composition.
- substantially complementary means that there does not need to be exact complementarity between the target nucleic acid sequence and first targeting sequence and the second targeting sequence of the engineered nucleotide sequence. However, it is to be understood that there should be a high degree of sequence identity, typically greater than 90% or 95%, which is sufficient to ensure specific binding according to known RNA or DNA/RNA base pairing rules.
- hybridize refers to the sequence specific non-covalent binding interaction with a complementary nucleic acid. In one embodiment, the hybridization is under high stringency conditions. Appropriate stringency conditions that promote hybridization are known to those skilled in the art.
- the desired target gene can be a gene involved or implicated in any disease or disorder where moderate protein expression can lead to alleviation of the disease or disorder phenotype (/.e., a disease or disorder with an associated haploinsufficiency).
- the disease or disorder is associated with a haploinsuffciency and may be selected from the group consisting of, for example, but not limited to, Holt-Oram syndrome, Rett syndrome, Marfan syndrome, Dravet syndrome, Smith-Magenis syndrome, Waardenburg syndrome, Williams syndrome, and cancers (for example, but not limited to, PTEN or p53 null).
- the desired target gene encodes a ribosomal protein, or the desired target gene encodes a gene involved in RNA processing, or a gene involved in gene transcription and/or translation, or a gene involved in proteostasis.
- the desired target gene can be one or more of, but not limited to, CD25, CHD2, CFTR, CHD8, COL1A1 , COL1A2, FBN1 , FIGLA, FOXG1, FOXP2, FZD4, GATA2, GATA4, GLI2, GLI3, HNF1B, HNF4A, HOXD13, IKZF1 , IL2RA, IRF6, KIF1B, KIF21A, MECP2, MED13L, MEF2C, MEIS2, NHF1 B, NKX2-5, NSD1, NR2F2, NR5A1 , PAX2, PAX3, PAX5, PAX6, PBX1, PITX2, PPARG, POU3F2, RAI1 , RAI1 , R
- the desired target gene can be a gene involved or implicated in cancer.
- the desired target gene can be one or more of, but not limited to, AKT 1 , ALK, APC, ATM, BAG1 , BARD1 , BMPR1A, BRAF, BRCA1 , BRCA2, BRIP1, CDK4, CDH1 , CDK4, CDKN1C, CDKN2A, CHEK2, CHK2, CREBBP, CTNNB1 , EGFR, EP300, EPCAM, ERBB2, ETV6, FBXW7, FGFR2, FHIT, FLT3, FOXL2, GNAQ, GNAS, HOXB13, HRAS, KIT, KRAS, MAP2K1 , MET, MLH1 , MLL, MSH2, MSH6, MYC, NF1 , NTR3, NTRK1, NRAS, PALB2, PAX8, PDGFRA, PIK3CA, PMS2,
- the target sequence is an RNA sequence, typically an mRNA, tRNA, miRNA or rRNA sequence, or an RNA virus genomic sequence or transcript derived thereof of the desired target gene.
- the target sequence is an mRNA sequence.
- the target sequence is a coding sequence (CDS) within the mRNA sequence.
- CDS coding sequence within a target nucleic acid sequence the complementary region can be either in the intron, or exon, or particularly at intron - exon junctions, but can also reside in 5' or 3' untranslated regions.
- targeting of the engineered nucleic acids as disclosed herein to the CDS of a desired target gene can increase protein production of the desired target gene.
- the target sequence is in the 5’ untranslated region of an mRNA transcript. In certain embodiments, the target sequence is in the 3’ untranslated region of an mRNA transcript. In certain embodiments, targeting of the engineered nucleic acids as disclosed herein to the 5’ untranslated region of a desired target gene can repress protein production of the desired target gene.
- the target sequence comprises a known acetylation site or a putative acetylation site. In some embodiments, the target sequence is within about 30 to about 50 nucleotides of a known acetylation site or a putative acetylation site.
- this disclosure provides a vector or other nucleic acid construct capable of expressing at least one engineered nucleotide sequence as disclosed herein.
- the vector may further comprise common nucleic acid features often found in conventional nucleic acid vectors and well known to the skilled artisan.
- the vector may comprise a selection marker gene for facilitating identification of cells into which the nucleic acid construct has been transformed or transfected.
- the vector comprises at least one promoter, such as a constitutive or controllable promoter known in the art, for facilitating expression of the nucleic acid encoding engineered nucleotide sequences disclosed herein.
- the vectors as disclosed herein may also be delivered into a cell using viral vectors, such as lentivirus, retrovirus or adenovirus-derived vectors known in the art.
- compositions comprising the engineered nucleotides as disclosed herein, or the vectors capable of the engineered nucleotides as disclosed herein and a pharmaceutically acceptable carrier therefore.
- compositions include those suitable for oral, topical (including dermal, buccal and sublingual), rectal or parenteral (including subcutaneous, intradermal, intramuscular and intravenous), nasal and pulmonary administration e.g., by inhalation.
- the formulation may, where appropriate, be conveniently presented in discrete dosage units and may be prepared by any of the methods well known in the art of pharmacy. All methods include the step of bringing into association an active compound with liquid carriers or finely divided solid carriers or both and then, if necessary, shaping the product into the desired formulation.
- This disclosure also provides methods of directing acetylation to a target sequence of a desired target gene, which can regulate gene expression of the desired target gene.
- the engineered nucleotide sequence as disclosed herein can be used to treat a disease or condition in a subject where it is desirable to modulate (either up-regulate or down-regulate) expression of a desired target gene or genes.
- acetylation of the CDS of a desired target gene can increase protein production (in particular, beneficial for haploinsufficiency disorders).
- targeting to a 5’-UTR of a desired target gene can be stimulatory in upstream/alternative CUG initiation sites (for example, stimulate anti-oncogenic protein isoforms) and repressive in annotated AUG initiation sites or surrounding Kozak sequence (for example, potential target to suppress oncogenes in cancer).
- CUG initiation sites for example, stimulate anti-oncogenic protein isoforms
- repressive in annotated AUG initiation sites or surrounding Kozak sequence for example, potential target to suppress oncogenes in cancer.
- This disclosure also provides methods comprising the step of administering to a subject in need thereof, a therapeutically effective amount of the engineered nucleotide sequence, the vector, or the pharmaceutical composition as disclosed herein to a subject in need thereof.
- the phrase “treating” refers to inhibiting or arresting the development of a disease, disorder or condition and/or causing the reduction, remission, or regression of a disease, disorder or condition in an individual suffering from, or diagnosed with, the disease, disorder or condition.
- the disease or disorder is associated with a haploinsuffciency.
- the disease or disorder is selected from the group consisting of, for example, but not limited to Rett syndrome, Marfan syndrome, and Dravet syndrome.
- the disease or disorder is cancer.
- targeting of the engineered nucleic acids as disclosed herein to a CDS of a desired target gene can increase protein production of the desired target gene.
- targeting of the engineered nucleic acids as disclosed herein to a 5’ untranslated region of a desired target gene can repress protein production of the desired target gene.
- an engineered chimeric snoRNA composed of the Box C/D snoRNA SNORD13 and antisense elements directed to, as proof of principle, GAPDH mRNA.
- the engineered snoRNA was delivered into human cells through incorporation into a minigene vector. Recovery of GAPDH transcripts and mass spectrometry confirmed successful targeted acetylation in vitro.
- SNORD13 was identified to exhibit two imperfect targeting sequences surrounding the target cytidine in helix 45 of 18S rRNA.
- the inventors designed a homologue of this snoRNA (Figure 1) with two targeting sequences around the Kozak cytidine that match the GAPDH mRNA sequence to 100%. (Sequence SAS-49):
- the plasmid provided by the Filipowicz lab (3) was digested using the Clal and Xhol restriction enzymes and the ends were dephosphorylated using the phosphatase rSAP.
- the snoRNA analogue was purchased from IDT with the respective restriction sites on the 5’ and 3’ end and digested to yield compatible ends. Both fragments were purified via column purification (PCR Clean-Up kit, Qiagen). Utilizing T4 DNA ligase, the snoRNA was inserted into the plasmid overnight at 16 °C and the product transformed into DH5a cells. Colony-PCR with T7 FWD and SAS-51/BGH polyA REV primers revealed good candidates which were confirmed by Sanger sequencing. The material was amplified through a standard midi-prep protocol (ZymoResearch).
- 3-globin gene with introns harboring the snoRNA sequence was transfected.
- HeLa WT cells cultured routinely in DMEM medium supplemented with 1% L-glutamine and 10% bovine calf serum were seeded to 2.2x10 6 cells into a 10 cm culture dish and allowed to attach at 37 °C in atmosphere containing 5% CO2.
- the plasmid DNA was transfected according to the standard Lipofectamine 2000 (Invitrogen) protocol using 10 pg of plasmid DNA and 60 pL of Lipofectamine 2000 solution. The medium was replaced with fresh medium after 4-5 h and the cells were cultured for a total of 24 h after transfection.
- RNA was harvested for RNA after 24 h by scraping with 1 mL TRIZOL (Invitrogen).
- the total RNA was isolated according to the standard protocol. From total RNA the GAPDH mRNA was enriched utilizing a biotinylated antisense oligo (biotin-ASO) according to a previously published protocol (5).
- biotin-ASO biotinylated antisense oligo
- the RNA up to 150 pg was diluted in 50 pL ultrapure water.
- the biotin-ASO complementary to the GAPDH Kozak sequence SAS-41 , 1 pL, 100 pmol
- Magnetically separated DynaBeads My One Streptavidin-T1 (25 pL) were washed three times with buffer A (1 M NaCI, 0.5 mM Na 2 [EDTA], 5 mM TRIS-HCI, pH 7.5), once with 5xSSC buffer (25 pL) before resuspension in 5xSSC buffer (25 pL).
- the RNA was heated for 3 min at 90 °C, then at 65 °C for 10 min and finally allowed to cool to room temperature.
- the prepared beads (25 pL) were added to the sample and incubated for 30 min at 600 rpm and 25 °C.
- the beads were washed once with 1xSSC buffer (50 pL) and three times with O.IxSSC buffer (50 pL). Finally, the beads were resuspended in ultrapure water (15 pL), incubated at 75 °C for 2 min and the RNA transferred to a new tube.
- enriched GAPDH mRNA was subjected to mass spectrometric analysis. To this end, enriched GAPDH mRNA (100 ng) was diluted to a total volume of 20 pL with nuclease-free water.
- a master mix of snake venom phosphodiesterase (0.2 U), calf intestinal phosphatase (2 U) and benzonase (2 U) was prepared with 1 mM MgCh, 5 mM TRIS (pH 8), 10 nmol butylated hydroxytoluene and 5 iig tetrahydrouridine (modified from Heiss (2021); (5)) containing isotope analogues of N 6 -methyladenosine, 5-Methylcytidine and N 4 -acetylcytidine as internal standards. After addition of 15 gL of the mastermix, samples were incubated at 37 °C for 2 h.
- the gradient settings and nucleoside-specific parameters in the mass spectrometer are detailed in Table 2 and 3.
- the parameters of the MS were the following: 35 °C column temperature, 90 °C gas temperature, 19 L/min gas flow, 166 V fragmentor voltage, 30 psi nebulizer, 150 °C sheath gas temperature, 12 L/min sheath gas flow, 0 V nozzle voltage, positive high pressure RF 80, positive low pressure RF 40, negative high pressure RF 80, negative low pressure RF 40, 2500 V and - 2000 capillary voltage, respectively. All data were analyzed according to a published protocol utilizing the isotope dilution technique (6).
- Figure 3 confirm an increase of acetylation in GAPDH mRNA after expression of an engineered snoRNA based on SNORD13 (SAS-49).
- a chimeric U13 reproducibly increased GAPDH mRNA acetylation (based on mass spectrometric quantification of ac4C levels normalized against cytidine after targeting of GAPDH mRNA with chimeric SNORD13 in wild-type HeLa cells).
- the snoRNA guide disclosed herein demonstrates proof of principle in cell lines through transfection of a DNA-vector.
- methods could include the use of Adeno-associated virus (AAV) or lentiviral vectors.
- AAV Adeno-associated virus
- lentiviral vectors Such vectors are ideally suited for the delivery of the snoRNA guides.
- AAV and lentiviral vectors are already being used/investigated clinically, so application of this therapy using such vectors appears feasible.
- these snoRNA guides and methods of delivery as disclosed herein can be developed as a patient-specific treatment, as well as a generic treatment. In vivo studies are performed to identify whether protein expression of a particular haploinsufficiency gene(s) increases upon acetylation. Subsequently, the optimized snoRNA guide are identified and engineered for those specific haploinsufficiency gene(s). Once the guide is identified, the vector- encoded snoRNA could be made broadly available to any patient suffering from that specific haploinsufficiency. snoRNAs for guided mRNA acetylation
- NAT10 activity can be manipulated to develop novel therapeutics for specific defects in mRNA translation.
- Traditional gene therapy approaches rely on eliminating deleterious transcripts.
- strategies that involve genome manipulation carry inherent risks, and mRNA ablation does not address the numerous human diseases that result for inadequate mRNA translation, such as haploinsufficiency disorders (8).
- Haploinsufficiency is a genetic condition that arises when a single wild-type allele in combination with a mutant allele produces insufficient protein, manifesting in disease (9).
- haploinsufficiency is a genetic condition that arises when a single wild-type allele in combination with a mutant allele produces insufficient protein, manifesting in disease (9).
- haploinsufficiencies 10, 11). This class of disease is generally recalcitrant to therapeutic interventions as technologies for enhancing the abundance of specific proteins are lacking.
- RNA-based therapeutics have aimed to address this unmet need through direct delivery of codon optimized mRNAs and through ASO technologies that promote protein expression via exclusion of toxic-exons (13).
- mRNA delivery is size-constrained and involves uniform incorporation of modified RNA nucleotides that may exert antagonistic influences and ASO technologies are limited to pre- mRNAs with toxic exons (14).
- Preliminary quantification detected a ⁇ 20% increase in ac4C within chimeric U13 targeted GAPDH mRNA as compared to control transfected cells (Fig. 3).
- haploinsufficiencies can display dosage sensitivity, wherein small increases in protein expression are beneficial, while large increases are detrimental (15), this initial value is significant.
- ac4C influenced total protein abundance within this stoichiometry range in HeLa mRNAs (1).
- the overall impact on GAPDH expression in cell lysates is also examined. Considering that NAT10 binding to U13 was relatively reduced as compared to other snoRNAs in HeLa CLIP-seq, these promising preliminary findings bolster the development of improved snoRNA scaffolds.
- the remainder of the scaffold and specific antisense placement was guided by CLASH-seq determination of substrate-interacting regions.
- Antisense targeting sequences were directed to GAPDH as proof of principle and other abundant mRNAs (e.g., CD25 and PTEN) to confirm acetylation through mass spectrometry and base-resolution RNA-sequencing. The net impact on protein expression as compared to non-target mRNAs was also examined.
- the scaffolds were optimized through sequence randomization. In parallel, optimized U13 snoRNA scaffolds were pursued through mutagenesis.
- NAT10 CLIP/CLASH-seq was performed to verify NAT10/snoRNA association and assess interacting mRNAs.
- the engineered snoRNA guides were tested by mass spectrometric analysis for inadvertent targeting of 2’-0-methylation to non-substrates.
- This disclosure provides a reliable snoRNA scaffold to target NAT10 to specific mRNAs through a flexible antisense platform.
- NAT10 CLIP-seq was performed and revealed 2460 CLIP sites that are enriched in wild-type HeLa as compared to control.
- HeLa WT cells maintained in complete (containing 1 % L-glutamine and 10% bovine calf serum (BCS)) Dulbecco’s modified Eagle’s medium (DMEM) at 37 °C in atmosphere containing 5% CO2 were trypsinized and counted. For each condition, 200,000 cells were seeded into 6 wells. The medium was completely replaced with OptiMEM 24 hours after seeding, and for each condition 50 nM antisense-oligo (ASO1-5, Table 1) was transfected using Lipofectamine 2000 reagent in a total volume of 200 pL according to manufacturer’s instructions. The medium was replaced with fresh complete DMEM after 4 hours. The cells were lysed directly in 500 pL TRIZOL reagent (Invitrogen) 48 hours after transfection.
- DMEM Dulbecco’s modified Eagle’s medium
- 2x formamide loading dye 95% formamide, 0.5 mM EDTA, traces of bromophenol blue and xylene cyanol FF
- RNA was transferred to a HyBond nylon membrane with 20x sodium citrate/chloride buffer for 3 hours at room temperature and cross-linked twice to the membrane with the auto-crosslink function of the Stratalinker (120 mJ/cm2).
- PBS phosphate buffered saline
- the membrane was blocked for 15 minutes in 5% milk and PBS containing 0.01% Tween-20 (PBS- T).
- PBS- T PBS containing 0.01% Tween-20
- the membrane was incubated with primary antibody against ac4C (monoclonal rabbit, 1:1000 in 1 %milk/PBS-T) overnight at 4 °C.
- the membrane was incubated with secondary horseradish peroxidase (HRP) conjugated antibody against rabbit IgG (monoclonal mouse, 1 :10,000 in 5% milk/PBS-T, Jackson ImmunoResearch 211-032-171).
- HRP horseradish peroxidase
- the membrane was washed three times with PBS-T, incubated for 2 minutes with Femto reagent (Thermo Fisher Scientific) and the chemiluminescent signal captured with the Biorad ChemiDoc machine.
- RNA was converted into cDNA with superscript IV reverse transcriptase using random hexamers.
- the cDNA was subsequently subjected to qPCR on an LightCycler 480 Instrument (Roche Diagnostics) using the SYBRGreen 2x mastermix from Roche Diagnostics and SNORD14B- specific primers (forward: 5’-TCACTGTGATGATGGTTTTC-3’ (SEQ ID NO:7) and reverse: 5’-CTCACTCAGACATCCAAGGA-3’ (SEQ ID NO:8)).
- haploinsufficient CD25 mRNA shows enhanced sensitivity to ac4C as compared to PTEN mRNA.
- the transcripts were in vitro transcribed from a template containing no cytidines in the 5’-untranslated region, Kozak sequence and first 15 nucleotides of the coding sequence, as well as a 20 nucleotide 3’-UTR.
- the acetylated and pseudouridylated mRNAs were achieved by omission or reduction of cytidine and uridine, respectively.
- the transcripts were simultaneously capped with AG-CleanCap (Tri Link) according to manufacturer’s protocol.
- PTEN and CD25 were subsequently polyadenylated with E. coli polyA-polymerase (according to manufacturer’s protocol).
- the construct for nanoluciferase included templated poly(A)-sequence making polyadenylation with E. Coli polyA-polymerase obsolete.
- HeLa WT cells maintained in complete (containing 1 % L-glutamine and 10% bovine calf serum (BCS)) Dulbecco’s modified Eagle’s medium (DMEM) at 37 °C in atmosphere containing 5% CO 2 were trypsinized and counted. For each condition, 200,000 cells were seeded in 12 wells in biological triplicates and grown for 8 hours at 37 °C in atmosphere containing 5% CO 2 . The transfections were performed at different time points, either 24 hours, 12 hours, 6 hours, 3 hours or 0 hours before cell harvest.
- BCS bovine calf serum
- DMEM Dulbecco’s modified Eagle’s medium
- the protein concentration was determined with the Pierce BCA Protein Assay (Thermo Fisher Scientific) and 25 pg separated for 45 minutes at 200 on 4-12% BisTris gels (Invitrogen) using MOPS buffer (Invitrogen).
- the proteins were transferred to PVDF membranes using the Biorad TurboBlot midi gel mixed molecular weight settings for 7 minutes.
- the membranes were blocked with 5% milk in TRIS-buffered saline (TBS) containing 0.01% Tween-20 (TBS-T).
- TRIS-buffered saline TRIS-buffered saline
- TBS-T TRIS-buffered saline
- TBS-T TRIS-buffered saline
- TBS-T Tween-20
- the membrane was incubated with primary antibody against PTEN (monoclonal rabbit, 1 :1000 in 5% milk/TBS-T, Cell Signaling Technologies 9559S) overnight at 4 °C on a rocking device.
- the membrane was washed three times for 5 minutes with TBS-T and then incubated for 45 minutes at room temperature with secondary horseradish peroxidase (HRP)- conjugated antibody against rabbit IgG (monoclonal mouse, 1 :10,000 in 5% milk/TBS-T, Jackson ImmunoResearch 211-032-171).
- HRP horseradish peroxidase
- the membrane was washed three times with TBS-T, incubated for 2 minutes with ECL-Select reagent (Amersham), rinsed quickly with TBS and the chemiluminescent signal captured with the Biorad ChemiDoc machine.
- the membrane was rinsed again and blocked for 1 hour with 5% milk/TBS-T.
- the membrane was incubated with primary antibody against GAPDH (monoclonal mouse, 1 :10,000 in 5% milk/TBS-T, Santa Cruz Biotechnology SC32233) overnight at 4 °C on a rocking device. The following day, the membrane was washed three times for 5 minutes with TBS-T and then incubated for 1 hour at room temperature with secondary horseradish peroxidase (HRP)-conjugated antibody against mouse IgG (polyclonal goat, 1 :10,000 in 5% milk/TBS-T, abeam ab6789). The membrane was washed three times with TBS-T, incubated for 2 minutes with ECL reagent (Amersham) and the chemiluminescent signal captured with the Biorad ChemiDoc machine.
- GAPDH monoclonal mouse, 1 :10,000 in 5% milk/TBS-T, Santa Cruz Biotechnology SC32233
- HeLa WT cells routinely maintained in complete DMEM at 37 °C in atmosphere containing 5% CO 2 were trypsinized and counted. For each condition, 1x106 cells were seeded into 6 wells the day before the transfection. For each condition, 2.5 pg of mRNA utilizing 3.75 j L Lipofectamine MessengerMax reagent in a total volume of 250 pL were transfected according to manufacturer’s instructions. The medium was replaced with fresh complete DMEM containing 1% L-glutamine and 10% bovine calf serum (BCS) after 4 hours. After 24 h, the cells were trypsinized, washed twice with PBS and lysed in RIPA buffer containing 1 % protease inhibitor.
- RIPA buffer containing 1 % protease inhibitor.
- the protein concentration was determined with the Pierce BOA Protein Assay (Thermo Fisher Scientific) and 40 pg of protein separated on 4-12% BisTris gels using MOPS buffer.
- the proteins were transferred to a nitrocellulose membrane using the Biorad TurboBlot mixed molecular weight settings for 7 minutes.
- the membranes were blocked with 5% milk in TRIS-buffered saline (TBS) containing 0.01 % Tween-20 (TBS-T).
- TRIS-buffered saline TRIS-buffered saline
- TBS-T TRIS-buffered saline
- the membrane was incubated with primary antibody against CD25 (monoclonal rabbit, 1 :1000 in 5% milk/TBS-T, abeam ab231441) overnight at 4 °C on a rocking device.
- the membrane was washed three times for 5 minutes with TBS-T and then incubated for 1 hour at room temperature with secondary horseradish peroxidase (HRP)- conjugated antibody against rabbit IgG (monoclonal mouse, 1 :10,000 in 5% milk/TBS-T, Jackson ImmunoResearch 211-032-171).
- HRP-substrate horseradish peroxidase
- HeLa WT cells routinely maintained in complete (containing 1% L-glutamine and 10% bovine calf serum (BCS)) Dulbecco’s modified Eagle’s medium (DMEM) at 37 °C in atmosphere containing 5% CO2 were trypsinized and counted. For each condition in biological triplicates, 1x106 cells were transfected with 2.5 pg of mRNA utilizing 3.75 pL Lipofectamine MessengerMax reagent in a total volume of 250 pL (according to manufacturer’s instructions).
- complete containing 1% L-glutamine and 10% bovine calf serum (BCS)
- DMEM Dulbecco’s modified Eagle’s medium
- Proteins were transferred to a nitrocellulose membrane using the Biorad TurboBlot mixed molecular weight settings for 7 min.
- the membranes were blocked with 5% milk in TRIS-buffered saline (TBS) containing 0.01 % Tween-20 (TBS-T).
- TRIS-buffered saline TRIS-buffered saline
- TBS-T TRIS-buffered saline
- the membrane was incubated with primary antibody against Nanoluciferase (monoclonal mouse, 1 :500 in 5% milk/TBS-T, R&D Systems MAB10026) overnight at 4 °C on a rocking device.
- the membrane was washed three times for 5 min with TBS-T and then incubated for 1 h at room temperature with secondary secondary HRP-conjugated antibody against mouse IgG (polyclonal goat, 1 :10,000 in 5% milk/TBS-T, abeam ab6789).
- the membrane was washed three times with TBS-T, incubated with HRP- substrate and the chemiluminescent signal captured with the Biorad ChemiDoc machine.
- ac4C stimulates mRNA translation as compared to unmodified mRNA and mRNA containing the uridine modification pseudouridine.
Landscapes
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- Organic Chemistry (AREA)
- Chemical & Material Sciences (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
The disclosure provides engineered nucleotide sequences, vectors, and pharmaceutical compositions that can be used in methods for treating a disease or disorder with an associated haploinsufficiency and/or cancer.
Description
Directed Acetylation of mRNA Through Engineered snoRNA Adapters
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefit of U.S. Provisional Application No. 63/423,714, filed November 8, 2022, which is incorporated by reference herein in its entirety.
GOVERNMENT RIGHTS
[0002] This invention was made with Government support. The National Cancer Institute and the National Institutes of Health funded the subject matter of this disclosure (ZIA BC 011239). The Federal Government has certain rights to this invention.
REFERENCE TO AN ELECTRONIC SEQUENCE LISTING
[0003] The instant application contains an electronic Sequence Listing that has been submitted electronically and is hereby incorporated by reference in its entirety. The sequence listing was created on November 6, 2023, is named “22-1953- WO_Sequence-Listing. xml” and is 9,934 bytes in size.
BACKGROUND OF THE INVENTION
Field of the Disclosure
[0004] This disclosure provides compositions and methods related to directed acetylation by engineered snoRNA molecules.
Description of Related Art
[0005] Small nucleolar RNAs (snoRNAs) are non-coding RNAs that primarily accumulate in the nucleoli and typically comprise about 60 to 300 nucleotides. Based on the characteristic nucleotide motifs and association with canonical partner proteins, snoRNAs are classified into either C/D- (SNORD) or H/ACA-box (SNORA) subfamilies. Eukaryotic box C/D RNAs are typically 70 to 120 nucleotides in length and contain two conserved motif elements: box C (RUGAUGA; wherein R is a purine) and box D (CUGA) located at the 5’- and 3’-termini of the RNA molecule, respectively (see Figure 1). Human N-acetyltransferase 10 (NAT10) acetylates RNA through association with snoRNA guides. NAT10 activity can be manipulated to develop novel therapeutics for specific defects in mRNA translation. Traditional gene therapy approaches rely on eliminating deleterious transcripts. However, strategies that involve genome manipulation
carry inherent risks and ablation does not address diseases that result from inadequate mRNA translation. Thus, there is a need fortreatment of any disease or disorder where moderate protein expression can lead to alleviation of disease phenotype (/.e., a disease or disorder with an associated haploinsufficiency). Provided herein are tools and methods that function in human cells for precise RNA epigenetic engineering. The compositions and methods provided herein can modulate gene regulation, target diseases, and serve as the basis of curative therapies for many previously untreatable diseases and/or disorders.
SUMMARY OF THE INVENTION
[0006] snoRNA-guided acetylation of select mRNA. The inventors examined the feasibility of capitalizing on NATIO’s property to direct acetylation, particularly the N4-acetylcytidine (ac4C) modification, to specific transcript substrates to regulate protein synthesis. As reported herein, engineered chimeric snoRNA guides can recruit NAT10 to a target transcript that causes directed acetylation of the target transcript, which regulates protein production. As a proof of principle, the inventors generated a chimeric snoRNA composed of the Box C/D snoRNA SNORD13 and antisense elements directed to, for example, GAPDH. The snoRNA was delivered into HeLa cells through incorporation into a minigene vector. Recovery of GAPDH transcripts and mass spectrometry confirmed successful targeted acetylation in human cells lines. As further described below, this disclosure provides directed acetylation through engineered snoRNA adapters and can be useful for the treatment of any disease or disorder where moderate protein expression can lead to alleviation of disease phenotype (/.e., a disease or disorder with an associated haploinsufficiency).
[0007] It is against the above background that the present invention provides certain advantages over the prior art.
[0008] In one aspect, this disclosure provides an engineered nucleotide sequence comprising, 5’ to 3’:
(a) a first targeting sequence;
(b) a C box motif containing sequence;
(c) a second targeting sequence; and
(d) a D box motif containing sequence.
[0009] In certain embodiments of the engineered nucleotide sequence disclosed herein, the first targeting sequence comprises about 15 to about 30 nucleotides.
[0010] In some embodiments of the engineered nucleotide sequence disclosed herein, the second targeting sequence comprises about 15 to about 30 nucleotides.
[0011] In some embodiments of the engineered nucleotide sequence disclosed herein, the first targeting sequence and the second targeting sequence are complementary to separate regions of a target sequence.
[0012] In some embodiments of the engineered nucleotide sequence disclosed herein, the target sequence comprises an RNA transcript.
[0013] In some embodiments of the engineered nucleotide sequence disclosed herein, the target sequence of the RNA transcript comprises a 5’ untranslated region (UTR) of the RNA transcript.
[0014] In some embodiments of the engineered nucleotide sequence disclosed herein, the target sequence of the RNA transcript comprises a coding sequence (CDS) of the RNA transcript.
[0015] In some embodiments of the engineered nucleotide sequence disclosed herein, the target sequence comprises a known acetylation site or a putative acetylation site.
[0016] In some embodiments of the engineered nucleotide sequence disclosed herein, the target sequence is within about 30-50 nucleotides of a known acetylation site or a putative acetylation site.
[0017] In some embodiments of the engineered nucleotide sequence disclosed herein, the C box motif containing sequence comprises a 5’ RUGAUGA-3’ sequence (wherein R is a purine).
[0018] In some embodiments of the engineered nucleotide sequence disclosed herein, the C box motif containing sequence comprises about 40 to about 200 nucleotides.
[0019] In some embodiments of the engineered nucleotide sequence disclosed herein, the D box motif containing sequence comprises a 5’ CUGA-3’ sequence.
[0020] In some embodiments of the engineered nucleotide sequence disclosed herein, the D box motif containing sequence comprises about 15 to about 50 nucleotides.
[0021] In some embodiments of the engineered nucleotide sequence disclosed herein, the engineered nucleotide sequence recruits NAT10 to the target sequence and causes acetylation (e g., N4-acetylcytidine (ac4C) modification) of the target sequence.
[0022] In some embodiments of the engineered nucleotide sequence disclosed herein, the targeting sequences are complementary to a target sequence in a desired target gene, and the
gene is involved or implicated in any disease or disorder where moderate protein expression can lead to alleviation of the disease or disorder phenotype.
[0023] In some embodiments of the engineered nucleotide sequence disclosed herein, the desired target gene encodes a ribosomal protein, a gene involved in RNA processing, a gene involved in gene transcription and/or translation, or a gene involved in proteostasis.
[0024] In some embodiments of the engineered nucleotide sequence disclosed herein, the desired target gene may be selected from the group consisting of, for example, but not limited to, CD25, CHD2, CFTR, CHD8, COL1A1 , COL1A2, FBN1, FIGLA, FOXG1 , FOXP2, FZD4, GATA2, GATA4, GLI2, GLI3, HNF1 B, HNF4A, HOXD13, IKZF1 , IL2RA, IRF6, KIF1B, KIF21A, MECP2, MED13L, MEF2C, MEIS2, NHF1B, NKX2-5, NSD1, NR2F2, NR5A1 , PAX2, PAX3, PAX5, PAX6, PBX1, PITX2, PPARG, POU3F2, RA11 , RUNX1, RUNX2, SCN1A, SETBP1, SETD1A, SIM, SIX3, SLC52A2, SOX2, SOX9, SOX10, TBX1 , TBX5, TBR1 , ZIC2, and ZFPM2.
[0025] In some embodiments of the engineered nucleotide sequence disclosed herein, the targeting sequences are complementary to a target sequence in a desired target gene, and the desired target gene is involved or implicated in cancer.
[0026] In some embodiments of the engineered nucleotide sequence disclosed herein, the desired target gene may be selected from the group consisting of, for example, but not limited to, AKT1 , ALK, APC, ATM, BAG1 , BARD1 , BMPR1A, BRAF, BRCA1 , BRCA2, BRIP1 , CDK4, CDH1 , CDK4, CDKN1C, CDKN2A, CHEK2, CHK2, CREBBP, CTNNB1 , EGFR, EP300, EPCAM, ERBB2, ETV6, FBXW7, FGFR2, FHIT, FLT3, FOXL2, GNAQ, GNAS, HOXB13, HRAS, KIT, KRAS, MAP2K1 , MET, MLH1 , MLL, MSH2, MSH6, MYC, NF1 , NTR3, NTRK1, NRAS, PALB2, PAX8, PDGFRA, PIK3CA, PMS2, PPARy, PRCC, PRKAR1A, PTEN, RAD51 D, RET, RUNXBP2, SMAD4, SRC, STK11 , TFE3, TGF- , TGF-pRII, TP53, VEGFA, and VWVOX.
[0027] In another aspect, the disclosure provides a vector comprising the engineered nucleotide sequence disclosed herein.
[0028] In certain embodiments of the vector disclosed herein, the vector is a virus, a lentivirus, an adenovirus, a retrovirus, an adeno-associated virus (AAV), a transposon, a DNA vector, or a RNA vector.
[0029] In another aspect, the disclosure provides a pharmaceutical composition comprising the engineered nucleotide sequence disclosed herein, or the vector disclosed herein, and a pharmaceutically acceptable excipient. In some embodiments, the engineered nucleotide
sequence disclosed herein can be administered via a pharmaceutical composition comprising the engineered nucleotide sequence disclosed herein in a lipid nanoparticle composition.
[0030] In another aspect, the disclosure provides a method for directing acetylation to a target sequence, wherein the method comprises administering the engineered nucleotide sequence disclosed herein, the vector disclosed herein, or the pharmaceutical composition disclosed herein.
[0031] In another aspect, the disclosure provides a method for treating a disease or disorder, wherein the method comprises administering the engineered nucleotide sequence disclosed herein, the vector disclosed herein, or the pharmaceutical composition disclosed herein to a subject in need thereof.
[0032] In certain embodiments of the methods disclosed herein, the disease or disorder comprises any disease or disorder where moderate protein expression can lead to alleviation of the disease or disorder phenotype (/.e., a disease or disorder with an associated haploinsufficiency).
[0033] In certain embodiments of the methods disclosed herein, the disease or disorder comprises a cancer.
[0034] In another aspect, the disclosure provides a use of the engineered nucleotide sequence disclosed herein, the vector disclosed herein, or the pharmaceutical composition disclosed herein in the treatment of a disease or disorder in a subject in need thereof.
[0035] In certain embodiments of the use disclosed herein, the disease or disorder comprises any disease or disorder where moderate protein expression can lead to alleviation of the disease or disorder phenotype (/.e., a disease or disorder with an associated haploinsufficiency).
[0036] In certain embodiments of the methods and use disclosed herein, the disease or disorder is associated with a haploinsuffciency and may be selected from the group consisting of, for example, but not limited to, Rett syndrome, Marfan syndrome, Dravet syndrome, and cancers (for example, but not limited to, PTEN or p53 null).
[0037] In certain embodiments of the methods and use disclosed herein, the method and/or use results in increased protein levels of the protein encoded by the target sequence.
[0038] In certain embodiments of the methods and use disclosed herein, the method and/or use results in decreased protein levels of the protein encoded by the target sequence.
[0039] These and other features and advantages of the present invention will be more fully understood from the following detailed description taken together with the accompanying claims.
It is noted that the scope of the claims is defined by the recitations therein and not by the specific discussion of features and advantages set forth in the present description.
BRIEF DESCRIPTION OF THE DRAWINGS
[0040] The following detailed description of the embodiments of the present invention can be best understood when read in conjunction with the following drawings, where like structure is indicated with like reference numerals and in which:
[0041] FIG. 1 shows a schematic of the unnatural and engineered snoRNA, SAS-49, (5’- GUCCGACCUUCACCUUCCGAGCGUGAUGAUUGGGUGUUCAUACGCUUGUGUGAGAUGU GCCACCCUUUGUCUGAGCGAUGUGGUGGGCACAUUACCCGUCUGACC-3’; SEQ ID NO:9) designed to target the Kozak sequence of GAPDH mRNA (5’ CACAUCGCUCAGACACCAUGGGGAAGGUGAAGGUCGGA-3’; SEQ ID NO: 10). The box indicates the putative acetylation site.
[0042] FIG. 2 shows a schematic of an excerpt of the pCMV [3-globin plasmid used for the delivery of the synthetic snoRNA analogue of SNORD13 (U13). Exons 1-3, as well as the intervening introns sequences stem from the natural p-globin gene and enable transfer and expression of unnatural snoRNAs in transfected cells. The snoRNA is inserted into the intron through utilization of the restriction enzymes Clal and Xhol.
[0043] FIG. 3 shows expression of SAS-49 increases N4-acetylcytidine levels in GAPDH mRNA. Quantitative data for N4-acetylcytidine levels in isolated GAPDH mRNA determined by UHPLC-MS/MS analysis. The control sample represents results from HeLa WT cells transfected with the empty pCMV [3-globin plasmid. SAS-49 represents HeLa WT cells transfected with the pCMV p-globin plasmid containing the optimal SNORD13 analogue, SAS-49, targeting the Kozak sequence of GAPDH mRNA.
[0044] FIG. 4A and 4B show a schematic of the NAT10/snoRNA scaffold (FIG. 4A) and positional information derived from CLIP and CLASH-seq (FIG. 4B)
[0045] FIG. 5 shows a schematic for NAT10 adapters for mRNA acetylation.
[0046] FIG. 6 shows that snoRNAs are a preferred substrate for NAT10 and establish their utility as flexible adapters. NAT10 CLIP-seq revealed 115 interactions with snoRNAs, of which 108 are of the Box C/D variety.
[0047] FIG. 7 shows that depletion of NAT10 associated SNORD14B (U14B) in HeLa cells results in decreased RNA acetylation. NAT10 CLIP-seq identified strong binding to U14B. To assess its involvement in RNA acetylation, HeLa cells were transfected with 5 individual antisense oligonucleotides (ASOs) directed against U14B. Total RNA was isolated to assess the efficiency of knockdown through RTqPCR (top) and ac4C levels through immunoNorthern (bottom). A decrease in ac4C levels was observed that correlated with knockdown efficiency.
[0048] FIG. 8 shows that chimeric U13 reproducibly increases GAPDH mRNA acetylation. Mass spectrometric quantification of ac4C levels normalized against cytidine after targeting of GAPDH mRNA with chimeric SNORD13 in wildtype HeLa cells.
[0049] FIG. 9A and 9B show the haploinsufficient mRNA CD25 exhibits enhanced sensitivity to ac4C as compared to PTEN mRNA. (FIG. 9A) Western blot for PTEN with GAPDH as loading control upon transfection of in vitro transcribed PTEN mRNA with varying levels of ac4C incorporation into wildtype HeLa cells. (FIG. 9B) Western blot for CD25 with GAPDH as loading control upon transfection of in vitro transcribed CD25 mRNA +/- ac4C into wild-type HeLa cells.
[0050] FIG. 10A and 10B show that ac4C stimulates mRNA translation as compared to unmodified mRNA and mRNA containing the uridine modification pseudouridine. (FIG. 10A) A Western blot for Nanoluciferase (Nanoluc) protein levels in HeLa cells transfected with in vitro transcribed Nanoluc mRNA with either canonical nucleotides, or nucleoside triphosphates (NTPs), in which ac4C or pseudouridine (PseudoU) have been substituted for C or U, respectively. Endogenous GAPDH serves as a loading control. (FIG. 10B) Luminescence for Nanoluc in transfected HeLa cells with in vitro transcribed Nanoluc mRNA with either canonical nucleotides, or mixtures containing ac4C or PseudoU as in FIG. 10A.
[0051] Skilled artisans will appreciate that elements in the Figures are illustrated for simplicity and clarity and have not necessarily been drawn to scale. For example, the dimensions of some of the elements in the Figures can be exaggerated relative to other elements to help improve understanding of the embodiment(s) of the present invention.
DETAILED DESCRIPTION OF THE DISCLOSURE
[0052] All patents, patent applications, and other publications, including all sequences disclosed within these references, referred to herein are expressly incorporated herein by reference, to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated by reference. All documents cited are,
in relevant part, incorporated herein by reference in their entireties for the purposes indicated by the context of their citation herein. However, the citation of any document is not to be construed as an admission that it is prior art with respect to the present disclosure.
[0053] Before describing the present invention in detail, a number of terms will be defined. Unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular. For example, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise.
[0054] It is noted that terms like “preferably,” “commonly,” and “typically” are not utilized herein to limit the scope of the claimed invention or to imply that certain features are critical, essential, or even important to the structure or function of the claimed invention. Rather, these terms are merely intended to highlight alternative or additional features that can or cannot be utilized in a particular embodiment of the present invention.
[0055] The terms “comprises” and variations thereof do not have a limiting meaning where these terms appear in the description and claims.
[0056] For the purposes of describing and defining the present invention it is noted that the term “substantially” is utilized herein to represent the inherent degree of uncertainty that can be attributed to any quantitative comparison, value, measurement, or other representation. The term “substantially” is also utilized herein to represent the degree by which a quantitative representation can vary from a stated reference without resulting in a change in the basic function of the subject matter at issue.
[0057] As used herein, the term “about” is used to provide literal support for the exact number that it precedes, as well as a number that is near to or approximately the number that the term precedes. In determining whether a number is near to or approximately a specifically recited number, the near or approximating unrecited number may be a number which, in the context in which it is presented, provides the substantial equivalent of the specifically recited number. Ranges and amounts can be expressed as “about” a particular value or range. About can also include the exact amount. Typically, the term “about” includes an amount that would be expected to be within experimental error. The term “about” includes values that are within 10% less to 10% greater of the value provided.
[0058] The words “preferred” and “preferably” refer to embodiments of the disclosure that may afford certain benefits, under certain circumstances. However, other embodiments may also be preferred, under the same or other circumstances. Furthermore, the recitation of one or more
preferred embodiments does not imply that other embodiments are not useful, and is not intended to exclude other embodiments from the scope of the disclosure.
[0059] As used herein, the terms "polynucleotide," "nucleotide," "oligonucleotide," and "nucleic acid" can be used interchangeably to refer to nucleic acid comprising DNA, cDNA, RNA, mRNA, derivatives thereof, or combinations thereof.
[0060] As utilized in accordance with the present disclosure, unless otherwise indicated, all technical and scientific terms shall be understood to have the same meaning as commonly understood by one of ordinary skill in the art.
[0061] Methods well known to those skilled in the art can be used to construct genetic expression constructs and recombinant cells according to this invention. These methods include in vitro recombinant DNA techniques, synthetic techniques, in vivo recombination techniques, and polymerase chain reaction (PCR) techniques. See, for example, techniques as described in Green & Sambrook, 2012, MOLECULAR CLONING: A LABORATORY MANUAL, Fourth Edition, Cold Spring Harbor Laboratory, New York; Ausubel et al., 1989, CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Greene Publishing Associates and Wiley Interscience, New York, and PCR Protocols: A Guide to Methods and Applications (Innis et al., 1990, Academic Press, San Diego, CA). Standard recombinant DNA methodologies are used to construct the polynucleotides of the disclosure, incorporate such polynucleotides into recombinant expression vectors, and introduce such vectors into host cells to produce the amino acid sequences, proteins, and polypeptides of the disclosure. See e.g., Sambrook et al., 2001 , MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor Laboratory Press, 3rd ed.). Unless specific definitions are provided, the nomenclature utilized in connection with, and the laboratory procedures and techniques of, analytical chemistry, synthetic organic chemistry, and medicinal and pharmaceutical chemistry described herein are those well-known and commonly used in the art. Similarly, conventional techniques may be used for chemical syntheses, chemical analyses, pharmaceutical preparation, formulation, delivery, and treatment of patients.
[0062] The inventors engineered chimeric snoRNA guides to recruit NAT10 to a target transcript that causes directed acetylation of the target transcript, which can regulate protein production. The inventors aim to target acetylation to specific mRNAs in cells to regulate protein synthesis (e.g., enhanced expression of tumor suppressor genes or reduced expression of oncogenes). Through transcriptome-wide analysis of RNAs that interact with the NAT10 enzyme, the inventors identified 97 novel interactions with small nucleolar RNAs (snoRNAs) that showed stronger interaction with NAT10 as compared to the previously established association with
SNORD13. NAT10 is the sole enzyme to mediate acetylation in human RNA and snoRNAs are established guides for RNA-modifying enzymes. The inventors aim to determine the potential of these snoRNA to target a transcript of interest for acetylation in in vivo experiments. Endogenous NAT10 would subsequently be recruited to the targeted transcript and produce the acetylated cytidine. Mass spectrometric analysis of the transcript, and (semi-)quantitative protein expression will be used to validate formation of ac4C and effects on protein biosynthesis. As proof of principle, the inventors modulated the NAT10-interacting snoRNA SNORD13 to target GAPDH mRNA.
[0063] The engineered nucleotides (e.g., snoRNAs) as disclosed herein comprise (5’ to 3’) a first targeting sequence, a C box motif, a second targeting sequence, and a D box motif. The first targeting sequence can be from about 15 to about 30 nucleotides in length, for example 18 - 28 nucleotides, or 21-25 nucleotides. The C box motif can be about 40 to about 200 nucleotides in length, and comprises the sequence 5’ RUGAUGA-3’ (wherein R is a purine) or a portion thereof. The second targeting sequence can be from about 15 to about 30 nucleotides in length, for example 18 - 28 nucleotides, or 21-25 nucleotides. The D box motif can be about 15 to about 50 nucleotides in length (e.g., and can be substantially complementary to a portion of a target sequence). In some embodiments, the D box motif comprises a 5’-CUGA-3’ sequence.
[0064] In certain embodiments, first targeting sequence and the second targeting sequence are substantially complementary to separate regions of a target sequence. In certain embodiments, first targeting sequence and the second targeting sequence are substantially complementary to separate regions of a target sequence in the same desired target gene. A target sequence serves to hybridize to a target nucleic acid sequence (/.e., the mRNA of a desired target gene for which modified expression is the result). Typically the first targeting sequence and the second targeting sequence are substantially complementary to a portion of target nucleic acid sequence that is about is 15-30 nucleotides in length, but may be longer depending upon sequence and base composition. The term "substantially" complementary means that there does not need to be exact complementarity between the target nucleic acid sequence and first targeting sequence and the second targeting sequence of the engineered nucleotide sequence. However, it is to be understood that there should be a high degree of sequence identity, typically greater than 90% or 95%, which is sufficient to ensure specific binding according to known RNA or DNA/RNA base pairing rules. As used herein, the term "hybridize" refers to the sequence specific non-covalent binding interaction with a complementary nucleic acid. In one embodiment, the
hybridization is under high stringency conditions. Appropriate stringency conditions that promote hybridization are known to those skilled in the art.
[0065] In certain embodiments, the desired target gene can be a gene involved or implicated in any disease or disorder where moderate protein expression can lead to alleviation of the disease or disorder phenotype (/.e., a disease or disorder with an associated haploinsufficiency). In certain embodiments, the disease or disorder is associated with a haploinsuffciency and may be selected from the group consisting of, for example, but not limited to, Holt-Oram syndrome, Rett syndrome, Marfan syndrome, Dravet syndrome, Smith-Magenis syndrome, Waardenburg syndrome, Williams syndrome, and cancers (for example, but not limited to, PTEN or p53 null). In some embodiments, the desired target gene encodes a ribosomal protein, or the desired target gene encodes a gene involved in RNA processing, or a gene involved in gene transcription and/or translation, or a gene involved in proteostasis. For example, the desired target gene can be one or more of, but not limited to, CD25, CHD2, CFTR, CHD8, COL1A1 , COL1A2, FBN1 , FIGLA, FOXG1, FOXP2, FZD4, GATA2, GATA4, GLI2, GLI3, HNF1B, HNF4A, HOXD13, IKZF1 , IL2RA, IRF6, KIF1B, KIF21A, MECP2, MED13L, MEF2C, MEIS2, NHF1 B, NKX2-5, NSD1, NR2F2, NR5A1 , PAX2, PAX3, PAX5, PAX6, PBX1, PITX2, PPARG, POU3F2, RAI1 , RUNX1 , RUNX2, SCN1A, SETBP1 , SETD1A, SIM, SIX3, SLC52A2, SOX2, SOX9, SOX10, TBX1, TBX5, TBR1 , ZIC2, and ZFPM2.
[0066] In certain embodiments, the desired target gene can be a gene involved or implicated in cancer. For example, the desired target gene can be one or more of, but not limited to, AKT 1 , ALK, APC, ATM, BAG1 , BARD1 , BMPR1A, BRAF, BRCA1 , BRCA2, BRIP1, CDK4, CDH1 , CDK4, CDKN1C, CDKN2A, CHEK2, CHK2, CREBBP, CTNNB1 , EGFR, EP300, EPCAM, ERBB2, ETV6, FBXW7, FGFR2, FHIT, FLT3, FOXL2, GNAQ, GNAS, HOXB13, HRAS, KIT, KRAS, MAP2K1 , MET, MLH1 , MLL, MSH2, MSH6, MYC, NF1 , NTR3, NTRK1, NRAS, PALB2, PAX8, PDGFRA, PIK3CA, PMS2, PPARy, PRCC, PRKAR1A, PTEN, RAD51 D, RET, RUNXBP2, SMAD4, SRC, STK11 , TFE3, TGF- , TGF-[3RII, TP53, VEGFA, and WWOX.
[0067] In some embodiments, the target sequence is an RNA sequence, typically an mRNA, tRNA, miRNA or rRNA sequence, or an RNA virus genomic sequence or transcript derived thereof of the desired target gene. In certain embodiments, the target sequence is an mRNA sequence. In certain embodiments, the target sequence is a coding sequence (CDS) within the mRNA sequence. Within a target nucleic acid sequence the complementary region can be either in the intron, or exon, or particularly at intron - exon junctions, but can also reside in 5' or 3' untranslated
regions. In certain embodiments, targeting of the engineered nucleic acids as disclosed herein to the CDS of a desired target gene can increase protein production of the desired target gene.
[0068] In certain embodiments, the target sequence is in the 5’ untranslated region of an mRNA transcript. In certain embodiments, the target sequence is in the 3’ untranslated region of an mRNA transcript. In certain embodiments, targeting of the engineered nucleic acids as disclosed herein to the 5’ untranslated region of a desired target gene can repress protein production of the desired target gene.
[0069] In some embodiments, the target sequence comprises a known acetylation site or a putative acetylation site. In some embodiments, the target sequence is within about 30 to about 50 nucleotides of a known acetylation site or a putative acetylation site.
[0070] In a further aspect, this disclosure provides a vector or other nucleic acid construct capable of expressing at least one engineered nucleotide sequence as disclosed herein. The vector may further comprise common nucleic acid features often found in conventional nucleic acid vectors and well known to the skilled artisan. For example, the vector may comprise a selection marker gene for facilitating identification of cells into which the nucleic acid construct has been transformed or transfected. In some embodiments, the vector comprises at least one promoter, such as a constitutive or controllable promoter known in the art, for facilitating expression of the nucleic acid encoding engineered nucleotide sequences disclosed herein. In certain embodiments, the vectors as disclosed herein may also be delivered into a cell using viral vectors, such as lentivirus, retrovirus or adenovirus-derived vectors known in the art.
[0071] In a further aspect, this disclosure provides pharmaceutical compositions comprising the engineered nucleotides as disclosed herein, or the vectors capable of the engineered nucleotides as disclosed herein and a pharmaceutically acceptable carrier therefore.
[0072] Pharmaceutical formulations include those suitable for oral, topical (including dermal, buccal and sublingual), rectal or parenteral (including subcutaneous, intradermal, intramuscular and intravenous), nasal and pulmonary administration e.g., by inhalation. The formulation may, where appropriate, be conveniently presented in discrete dosage units and may be prepared by any of the methods well known in the art of pharmacy. All methods include the step of bringing into association an active compound with liquid carriers or finely divided solid carriers or both and then, if necessary, shaping the product into the desired formulation.
[0073] This disclosure also provides methods of directing acetylation to a target sequence of a desired target gene, which can regulate gene expression of the desired target gene. Thus, the
engineered nucleotide sequence as disclosed herein can be used to treat a disease or condition in a subject where it is desirable to modulate (either up-regulate or down-regulate) expression of a desired target gene or genes. For example, acetylation of the CDS of a desired target gene can increase protein production (in particular, beneficial for haploinsufficiency disorders). For example, targeting to a 5’-UTR of a desired target gene can be stimulatory in upstream/alternative CUG initiation sites (for example, stimulate anti-oncogenic protein isoforms) and repressive in annotated AUG initiation sites or surrounding Kozak sequence (for example, potential target to suppress oncogenes in cancer).
[0074] This disclosure also provides methods comprising the step of administering to a subject in need thereof, a therapeutically effective amount of the engineered nucleotide sequence, the vector, or the pharmaceutical composition as disclosed herein to a subject in need thereof.
[0075] As used herein the phrase “treating” refers to inhibiting or arresting the development of a disease, disorder or condition and/or causing the reduction, remission, or regression of a disease, disorder or condition in an individual suffering from, or diagnosed with, the disease, disorder or condition. In some embodiments, the disease or disorder is associated with a haploinsuffciency. In some embodiments, the disease or disorder is selected from the group consisting of, for example, but not limited to Rett syndrome, Marfan syndrome, and Dravet syndrome. In some embodiments, the disease or disorder is cancer. In certain embodiments, targeting of the engineered nucleic acids as disclosed herein to a CDS of a desired target gene can increase protein production of the desired target gene. In certain embodiments, targeting of the engineered nucleic acids as disclosed herein to a 5’ untranslated region of a desired target gene can repress protein production of the desired target gene.
[0076] Without limiting the disclosure, a number of embodiments of the disclosure are described below for purpose of illustration.
[0077] The invention will be further described in the following examples, which do not limit the scope of the invention described in the claims.
EXAMPLES
[0078] The Examples that follow are illustrative of specific embodiments of the disclosure, and various uses thereof. They are set forth for explanatory purposes only, and should not be construed as limiting the scope of the invention in any way.
[0079] Previous results have identified a dual role of ac4C in mRNA translation elongation and initiation (1 , 2). In elongation, acetylation within coding sequences (CDS) enhances mRNA stability and protein translation (2). In initiation, acetylation of Kozak sequences reduced translation, whereas ac4C in upstream translation initiation (upTIS) CUG sites enhanced alternative translation initiation. Provided herein is an engineered chimeric snoRNA composed of the Box C/D snoRNA SNORD13 and antisense elements directed to, as proof of principle, GAPDH mRNA. The engineered snoRNA was delivered into human cells through incorporation into a minigene vector. Recovery of GAPDH transcripts and mass spectrometry confirmed successful targeted acetylation in vitro.
Engineering for the targeting of the Kozak region of GAPDH mRNA
[0080] SNORD13 was identified to exhibit two imperfect targeting sequences surrounding the target cytidine in helix 45 of 18S rRNA. The inventors designed a homologue of this snoRNA (Figure 1) with two targeting sequences around the Kozak cytidine that match the GAPDH mRNA sequence to 100%. (Sequence SAS-49):
5 ' GTCCGACCTTCACCTTCCGAGCGTGATGATTGGGTGTTCATACGCTTGTGTGAGATGTGCCACCCTTTGTCTGAG CGATGTGGTGGGCACATTACCCGTCTGACC 3 ' (SEQ ID NO : 1)
Expression of targeting snoRNAs
[0081] In previous studies (3) it was shown that snoRNAs inserted into an intron of the P-globin gene can be processed to mature, functional snoRNAs. Kiss et al. developed a plasmid (Figure 2) containing introns and exons of the -globin gene with restriction sites for insertion of any kind of snoRNA.
[0082] Despite the fact that the processing of SNORD13 in nature occurs differently because it is regulated and transcribed from an independent gene (4), the inventors hypothesize that a synthetic snoRNA might be expressed in vivo via the same pathway as snoRNAs that are naturally processed from introns.
Engineering of pCMV [3-globin plasmid containing the synthetic snoRNA
[0083] The plasmid provided by the Filipowicz lab (3) was digested using the Clal and Xhol restriction enzymes and the ends were dephosphorylated using the phosphatase rSAP. The snoRNA analogue was purchased from IDT with the respective restriction sites on the 5’ and 3’ end and digested to yield compatible ends. Both fragments were purified via column purification (PCR Clean-Up kit, Qiagen). Utilizing T4 DNA ligase, the snoRNA was inserted into the plasmid overnight at 16 °C and the product transformed into DH5a cells. Colony-PCR with T7 FWD and
SAS-51/BGH polyA REV primers revealed good candidates which were confirmed by Sanger sequencing. The material was amplified through a standard midi-prep protocol (ZymoResearch).
Transfection into HeLa WT cells
[0084] To study the expression of the synthetic SNORD13 derivative that targets the Kozak sequence of GAPDH mRNA, a plasmid containing the |3-globin gene with introns harboring the snoRNA sequence was transfected. To this end HeLa WT cells cultured routinely in DMEM medium supplemented with 1% L-glutamine and 10% bovine calf serum were seeded to 2.2x106 cells into a 10 cm culture dish and allowed to attach at 37 °C in atmosphere containing 5% CO2. After 24 h the plasmid DNA was transfected according to the standard Lipofectamine 2000 (Invitrogen) protocol using 10 pg of plasmid DNA and 60 pL of Lipofectamine 2000 solution. The medium was replaced with fresh medium after 4-5 h and the cells were cultured for a total of 24 h after transfection.
Isolation of GAPDH mRNA after transfection of the pCMV P-globin plasmid containing the synthetic snoRNA
[0085] The cells were harvested for RNA after 24 h by scraping with 1 mL TRIZOL (Invitrogen). The total RNA was isolated according to the standard protocol. From total RNA the GAPDH mRNA was enriched utilizing a biotinylated antisense oligo (biotin-ASO) according to a previously published protocol (5). In brief, the RNA (up to 150 pg) was diluted in 50 pL ultrapure water. The biotin-ASO complementary to the GAPDH Kozak sequence (SAS-41 , 1 pL, 100 pmol) was added and the volume brought to 100 pL with ultrapure water and 25 pL 20xSSC buffer. Magnetically separated DynaBeads My One Streptavidin-T1 (25 pL) were washed three times with buffer A (1 M NaCI, 0.5 mM Na2[EDTA], 5 mM TRIS-HCI, pH 7.5), once with 5xSSC buffer (25 pL) before resuspension in 5xSSC buffer (25 pL). The RNA was heated for 3 min at 90 °C, then at 65 °C for 10 min and finally allowed to cool to room temperature. The prepared beads (25 pL) were added to the sample and incubated for 30 min at 600 rpm and 25 °C. Subsequently, the beads were washed once with 1xSSC buffer (50 pL) and three times with O.IxSSC buffer (50 pL). Finally, the beads were resuspended in ultrapure water (15 pL), incubated at 75 °C for 2 min and the RNA transferred to a new tube.
HPLC-MS/MS analysis of GAPDH mRNA
[0086] As quality check, enriched GAPDH mRNA was subjected to mass spectrometric analysis. To this end, enriched GAPDH mRNA (100 ng) was diluted to a total volume of 20 pL
with nuclease-free water. A master mix of snake venom phosphodiesterase (0.2 U), calf intestinal phosphatase (2 U) and benzonase (2 U) was prepared with 1 mM MgCh, 5 mM TRIS (pH 8), 10 nmol butylated hydroxytoluene and 5 iig tetrahydrouridine (modified from Heiss (2021); (5)) containing isotope analogues of N6-methyladenosine, 5-Methylcytidine and N4-acetylcytidine as internal standards. After addition of 15 gL of the mastermix, samples were incubated at 37 °C for 2 h. The samples were diluted with 15 jiL LC-MS buffer A (0.0075% formic acid in ultrapure water) and filtered for 35 min at 3750 g and 4 C through 0.2 ,um Super AcroPrep Advance 96-well plates (Pall corporation). Of the filtrate, 39 piL were subjected to mass spectrometric analysis on an Agilent triple quad 6495C mass spectrometer. For HPLC, buffer A (described above) and buffer B (0.0075% formic acid in acetonitrile) were used in combination with a C-8 reversed phase column (Agilent Poroshell 120 SB-C8, 2.1 x 150 mm, 2.7 urn LC column - BC). The gradient settings and nucleoside-specific parameters in the mass spectrometer (MS) are detailed in Table 2 and 3. The parameters of the MS were the following: 35 °C column temperature, 90 °C gas temperature, 19 L/min gas flow, 166 V fragmentor voltage, 30 psi nebulizer, 150 °C sheath gas temperature, 12 L/min sheath gas flow, 0 V nozzle voltage, positive high pressure RF 80, positive low pressure RF 40, negative high pressure RF 80, negative low pressure RF 40, 2500 V and - 2000 capillary voltage, respectively. All data were analyzed according to a published protocol utilizing the isotope dilution technique (6). Data (Figure 3) confirm an increase of acetylation in GAPDH mRNA after expression of an engineered snoRNA based on SNORD13 (SAS-49). A chimeric U13 reproducibly increased GAPDH mRNA acetylation (based on mass spectrometric quantification of ac4C levels normalized against cytidine after targeting of GAPDH mRNA with chimeric SNORD13 in wild-type HeLa cells).
Validation controls
[0087] Various controls will ensure integrity of the data. To rule out transfection artifacts nontransfected cells serve as negative control. The effect of the addition of the [3-globin plasmid will be evaluated by transfecting the empty pCMV -globin plasmid. As non-targeting control the inventors seek to use the natural SNORD13 sequence. Due to its natural processing pathway and abundance (~2 x 105 copies/cell in HeLa; (7)) in the cells, the inventors do not expect major effects, but this control is important to assess the effect of aberrant SNORD13 processing. snoRNAs for directed mRNA acetylation in mammalian subjects
[0088] The snoRNA guide disclosed herein demonstrates proof of principle in cell lines through transfection of a DNA-vector. For delivery into patients, methods could include the use
of Adeno-associated virus (AAV) or lentiviral vectors. Such vectors are ideally suited for the delivery of the snoRNA guides. AAV and lentiviral vectors are already being used/investigated clinically, so application of this therapy using such vectors appears feasible.
[0089] Further, these snoRNA guides and methods of delivery as disclosed herein can be developed as a patient-specific treatment, as well as a generic treatment. In vivo studies are performed to identify whether protein expression of a particular haploinsufficiency gene(s) increases upon acetylation. Subsequently, the optimized snoRNA guide are identified and engineered for those specific haploinsufficiency gene(s). Once the guide is identified, the vector- encoded snoRNA could be made broadly available to any patient suffering from that specific haploinsufficiency. snoRNAs for guided mRNA acetylation
[0090] NAT10 activity can be manipulated to develop novel therapeutics for specific defects in mRNA translation. Traditional gene therapy approaches rely on eliminating deleterious transcripts. However, strategies that involve genome manipulation carry inherent risks, and mRNA ablation does not address the numerous human diseases that result for inadequate mRNA translation, such as haploinsufficiency disorders (8). Haploinsufficiency is a genetic condition that arises when a single wild-type allele in combination with a mutant allele produces insufficient protein, manifesting in disease (9). There are over 300 known, and perhaps thousands of additional, human haploinsufficiencies (10, 11). This class of disease is generally recalcitrant to therapeutic interventions as technologies for enhancing the abundance of specific proteins are lacking. In addition, many haploinsufficiencies arise from spontaneous germline mutations (12), rendering gene editing approaches untenable. Recent RNA-based therapeutics have aimed to address this unmet need through direct delivery of codon optimized mRNAs and through ASO technologies that promote protein expression via exclusion of toxic-exons (13). Though promising in principle, mRNA delivery is size-constrained and involves uniform incorporation of modified RNA nucleotides that may exert antagonistic influences and ASO technologies are limited to pre- mRNAs with toxic exons (14). Provided herein are engineered snoRNA guides for targeting endogenous NAT10 to specific mRNA substrates in vivo as a versatile and mutation-agnostic platform for fine-tuning protein expression at the level of mRNA translation. snoRNA guides
[0091] The successful development of these engineered snoRNA guides hinges on the ability to bind NAT10 and mRNA in cells. As proof-of-concept, the inventors re-engineered U13 snoRNA
to target GAPDH mRNA proximal to its start codon, rather than 18S rRNA, through mutating the two antisense guide elements (Fig. 4). Chimeric U13 was cloned into the beta-globin minigene and transfected into HeLa cells. GAPDH mRNA was recovered from cell lysates through biotinylated antisense probe and subjected to mass spectrometric analysis for ac4C prevalence. Preliminary quantification detected a ~20% increase in ac4C within chimeric U13 targeted GAPDH mRNA as compared to control transfected cells (Fig. 3). Considering that haploinsufficiencies can display dosage sensitivity, wherein small increases in protein expression are beneficial, while large increases are detrimental (15), this initial value is significant. Importantly, ac4C influenced total protein abundance within this stoichiometry range in HeLa mRNAs (1). To next gauge specificity with respect to precisely acetylated residues RetraC:T and nanopore sequencing is performed on the recovered transcripts. The overall impact on GAPDH expression in cell lysates is also examined. Considering that NAT10 binding to U13 was relatively reduced as compared to other snoRNAs in HeLa CLIP-seq, these promising preliminary findings bolster the development of improved snoRNA scaffolds.
[0092] To design optimal NAT10-binding snoRNAs, HeLa NAT10 eCLIP-seq and CLASH- seq results were interrogated for generalized principles. In brief, the presence of abrupt RT-stops in eCLIP enriched sequencing reads are often indicative of direct RBP binding. Homology search of NAT 10-associated snoRNAs revealed several conserved sequences that generally overlapped eCLIP-seq RT-stops, strongly suggestive that the associated sequences are involved in NAT10- binding. These regions formed the basis of an optimized NAT10 scaffold along with consensus Box C and Box D motifs several nucleotides from the 5’ and 3’ ends, respectively (Fig. 4A). The remainder of the scaffold and specific antisense placement was guided by CLASH-seq determination of substrate-interacting regions. Antisense targeting sequences were directed to GAPDH as proof of principle and other abundant mRNAs (e.g., CD25 and PTEN) to confirm acetylation through mass spectrometry and base-resolution RNA-sequencing. The net impact on protein expression as compared to non-target mRNAs was also examined. The scaffolds were optimized through sequence randomization. In parallel, optimized U13 snoRNA scaffolds were pursued through mutagenesis. To evaluate specificity, a pool of the top candidates was expressed in HeLa and NAT10 eCLIP/CLASH-seq was performed to verify NAT10/snoRNA association and assess interacting mRNAs. The engineered snoRNA guides were tested by mass spectrometric analysis for inadvertent targeting of 2’-0-methylation to non-substrates. This disclosure provides a reliable snoRNA scaffold to target NAT10 to specific mRNAs through a flexible antisense platform.
[0093] NAT10 CLIP-seq was performed and revealed 2460 CLIP sites that are enriched in wild-type HeLa as compared to control. Examination of the substrate biotypes discovered a strong prevalence of snoRNAs demonstrating that NAT10-CLIP targets are enriched for snoRNAs (FIG. 6). Furthermore, depletion of NAT10 associated SNORD14B in HeLa cells resulted in decreased RNA acetylation. HeLa cells were transfected with 5 individual ASOs directed against U14B. Total RNA was isolated to assess the efficiency of knockdown through RTqPCR and ac4C levels through immunoNorthern. A decrease in ac4C levels was observed that correlated with knockdown efficiency (FIG.7).
Knockdown of SNORD14B - cell culture
[0094] HeLa WT cells maintained in complete (containing 1 % L-glutamine and 10% bovine calf serum (BCS)) Dulbecco’s modified Eagle’s medium (DMEM) at 37 °C in atmosphere containing 5% CO2 were trypsinized and counted. For each condition, 200,000 cells were seeded into 6 wells. The medium was completely replaced with OptiMEM 24 hours after seeding, and for each condition 50 nM antisense-oligo (ASO1-5, Table 1) was transfected using Lipofectamine 2000 reagent in a total volume of 200 pL according to manufacturer’s instructions. The medium was replaced with fresh complete DMEM after 4 hours. The cells were lysed directly in 500 pL TRIZOL reagent (Invitrogen) 48 hours after transfection.
Table 1 : Antisense-oligos utilized in SNORD14B knockdown experiments. PO=phosphate backbone, 2’-MOE= 2’-O-methoxyethyl-modified ribose.
Knockdown of SNORD14B - RNA isolation and analysis
[0095] RNA was extracted according to manufacturer’s instructions. Briefly, 2 pg of RNA were diluted to a total volume of 10 pL and 2x formamide loading dye (95% formamide, 0.5 mM EDTA, traces of bromophenol blue and xylene cyanol FF) was added to a volume of 20 uL. The RNA
was heated for 5 minutes at 65 °C and separated on an 1% agarose gel containing 1x MOPS (NorthernMax™ Denaturing Gel Buffer (10X), Catalog number: AM8676). Subsequently, the RNA was transferred to a HyBond nylon membrane with 20x sodium citrate/chloride buffer for 3 hours at room temperature and cross-linked twice to the membrane with the auto-crosslink function of the Stratalinker (120 mJ/cm2). After quick rinse with phosphate buffered saline (PBS), the membrane was blocked for 15 minutes in 5% milk and PBS containing 0.01% Tween-20 (PBS- T). The membrane was incubated with primary antibody against ac4C (monoclonal rabbit, 1:1000 in 1 %milk/PBS-T) overnight at 4 °C. After three washes for 5 minutes with PBS-T, the membrane was incubated with secondary horseradish peroxidase (HRP) conjugated antibody against rabbit IgG (monoclonal mouse, 1 :10,000 in 5% milk/PBS-T, Jackson ImmunoResearch 211-032-171). The membrane was washed three times with PBS-T, incubated for 2 minutes with Femto reagent (Thermo Fisher Scientific) and the chemiluminescent signal captured with the Biorad ChemiDoc machine.
[0096] For quantitative reverse transcription polymerase chain reaction (qRTPCR), 1 pg of RNA was converted into cDNA with superscript IV reverse transcriptase using random hexamers. The cDNA was subsequently subjected to qPCR on an LightCycler 480 Instrument (Roche Diagnostics) using the SYBRGreen 2x mastermix from Roche Diagnostics and SNORD14B- specific primers (forward: 5’-TCACTGTGATGATGGTTTTC-3’ (SEQ ID NO:7) and reverse: 5’-CTCACTCAGACATCCAAGGA-3’ (SEQ ID NO:8)).
[0097] As shown in FIG. 9A and 9B, haploinsufficient CD25 mRNA shows enhanced sensitivity to ac4C as compared to PTEN mRNA. Western blot for PTEN with GAPDH as loading control upon transfection of in vitro transcribed PTEN mRNA with varying levels of ac4C incorporation into wildtype HeLa cells. Western blot for CD25 with GAPDH as loading control upon transfection of in vitro transcribed CD25 mRNA +/- ac4C into wild-type HeLa cells.
Generation of reporter mRNA
[0098] To compare the effect of ac4C on nanoluciferase, PTEN and CD25 expression with pseudouridine and unmodified mRNA, the transcripts were in vitro transcribed from a template containing no cytidines in the 5’-untranslated region, Kozak sequence and first 15 nucleotides of the coding sequence, as well as a 20 nucleotide 3’-UTR. The acetylated and pseudouridylated mRNAs were achieved by omission or reduction of cytidine and uridine, respectively. The transcripts were simultaneously capped with AG-CleanCap (Tri Link) according to manufacturer’s protocol. PTEN and CD25 were subsequently polyadenylated with E. coli polyA-polymerase
(according to manufacturer’s protocol). The construct for nanoluciferase included templated poly(A)-sequence making polyadenylation with E. Coli polyA-polymerase obsolete.
Transfection of PTEN mRNA into HeLa WT cells
[0099] HeLa WT cells maintained in complete (containing 1 % L-glutamine and 10% bovine calf serum (BCS)) Dulbecco’s modified Eagle’s medium (DMEM) at 37 °C in atmosphere containing 5% CO2 were trypsinized and counted. For each condition, 200,000 cells were seeded in 12 wells in biological triplicates and grown for 8 hours at 37 °C in atmosphere containing 5% CO2. The transfections were performed at different time points, either 24 hours, 12 hours, 6 hours, 3 hours or 0 hours before cell harvest. For each condition and time point, 333 ng of PTEN mRNA (0-100% ac4C) and 167 ng of unmodified Nanoluciferase mRNA were transfected utilizing 0.75 jiL Lipofectamine MessengerMax reagent with a total volume of 30 pL OptiMEM according to manufacturer’s instructions. At time point 0 hours after transfection, the cells were washed with PBS, trypsinized for 3 minutes, quenched with complete DMEM medium, washed twice with PBS and lysed in RIPA buffer containing 1 % protease inhibitor.
Western blot to determine PTEN levels
[0100] The protein concentration was determined with the Pierce BCA Protein Assay (Thermo Fisher Scientific) and 25 pg separated for 45 minutes at 200 on 4-12% BisTris gels (Invitrogen) using MOPS buffer (Invitrogen). The proteins were transferred to PVDF membranes using the Biorad TurboBlot midi gel mixed molecular weight settings for 7 minutes. The membranes were blocked with 5% milk in TRIS-buffered saline (TBS) containing 0.01% Tween-20 (TBS-T). The membrane was incubated with primary antibody against PTEN (monoclonal rabbit, 1 :1000 in 5% milk/TBS-T, Cell Signaling Technologies 9559S) overnight at 4 °C on a rocking device. The following day, the membrane was washed three times for 5 minutes with TBS-T and then incubated for 45 minutes at room temperature with secondary horseradish peroxidase (HRP)- conjugated antibody against rabbit IgG (monoclonal mouse, 1 :10,000 in 5% milk/TBS-T, Jackson ImmunoResearch 211-032-171). The membrane was washed three times with TBS-T, incubated for 2 minutes with ECL-Select reagent (Amersham), rinsed quickly with TBS and the chemiluminescent signal captured with the Biorad ChemiDoc machine. The membrane was rinsed again and blocked for 1 hour with 5% milk/TBS-T. The membrane was incubated with primary antibody against GAPDH (monoclonal mouse, 1 :10,000 in 5% milk/TBS-T, Santa Cruz Biotechnology SC32233) overnight at 4 °C on a rocking device. The following day, the membrane was washed three times for 5 minutes with TBS-T and then incubated for 1 hour at room temperature with secondary horseradish peroxidase (HRP)-conjugated antibody against mouse
IgG (polyclonal goat, 1 :10,000 in 5% milk/TBS-T, abeam ab6789). The membrane was washed three times with TBS-T, incubated for 2 minutes with ECL reagent (Amersham) and the chemiluminescent signal captured with the Biorad ChemiDoc machine.
Transfection of CD25 mRNA into HeLa WT cells
[0101] HeLa WT cells routinely maintained in complete DMEM at 37 °C in atmosphere containing 5% CO2 were trypsinized and counted. For each condition, 1x106 cells were seeded into 6 wells the day before the transfection. For each condition, 2.5 pg of mRNA utilizing 3.75 j L Lipofectamine MessengerMax reagent in a total volume of 250 pL were transfected according to manufacturer’s instructions. The medium was replaced with fresh complete DMEM containing 1% L-glutamine and 10% bovine calf serum (BCS) after 4 hours. After 24 h, the cells were trypsinized, washed twice with PBS and lysed in RIPA buffer containing 1 % protease inhibitor. The protein concentration was determined with the Pierce BOA Protein Assay (Thermo Fisher Scientific) and 40 pg of protein separated on 4-12% BisTris gels using MOPS buffer. The proteins were transferred to a nitrocellulose membrane using the Biorad TurboBlot mixed molecular weight settings for 7 minutes. The membranes were blocked with 5% milk in TRIS-buffered saline (TBS) containing 0.01 % Tween-20 (TBS-T). The membrane was incubated with primary antibody against CD25 (monoclonal rabbit, 1 :1000 in 5% milk/TBS-T, abeam ab231441) overnight at 4 °C on a rocking device. The following day, the membrane was washed three times for 5 minutes with TBS-T and then incubated for 1 hour at room temperature with secondary horseradish peroxidase (HRP)- conjugated antibody against rabbit IgG (monoclonal mouse, 1 :10,000 in 5% milk/TBS-T, Jackson ImmunoResearch 211-032-171). The membrane was washed three times with TBS-T, incubated with HRP-substrate and the chemiluminescent signal captured with the Biorad ChemiDoc machine.
Transfection of nanoluciferase mRNA into HeLa WT cells
[0102] HeLa WT cells routinely maintained in complete (containing 1% L-glutamine and 10% bovine calf serum (BCS)) Dulbecco’s modified Eagle’s medium (DMEM) at 37 °C in atmosphere containing 5% CO2 were trypsinized and counted. For each condition in biological triplicates, 1x106 cells were transfected with 2.5 pg of mRNA utilizing 3.75 pL Lipofectamine MessengerMax reagent in a total volume of 250 pL (according to manufacturer’s instructions). After incubation of cells with mRNA for 5 min, the medium was removed, the cells washed twice with PBS, and 10,000 cells each in triplicate were seeded into luciferase plate and at the indicated time points NanoGio reagent (Promega) was added and the luminescence recorded with the SpectraMax
iD3. The remaining cells were plated into 6 wells each. At the indicated time points, the cells were scraped, washed twice with PBS and counted. 250,000 cells were lysed in RIPA buffer containing 1% protease inhibitor. The protein concentration was determined with the BCA protein quantitation assay and 10 pg separated on 4-12% BisTris gels using MOPS buffer. Proteins were transferred to a nitrocellulose membrane using the Biorad TurboBlot mixed molecular weight settings for 7 min. The membranes were blocked with 5% milk in TRIS-buffered saline (TBS) containing 0.01 % Tween-20 (TBS-T). The membrane was incubated with primary antibody against Nanoluciferase (monoclonal mouse, 1 :500 in 5% milk/TBS-T, R&D Systems MAB10026) overnight at 4 °C on a rocking device. The following day, the membrane was washed three times for 5 min with TBS-T and then incubated for 1 h at room temperature with secondary secondary HRP-conjugated antibody against mouse IgG (polyclonal goat, 1 :10,000 in 5% milk/TBS-T, abeam ab6789). The membrane was washed three times with TBS-T, incubated with HRP- substrate and the chemiluminescent signal captured with the Biorad ChemiDoc machine.
[0103] As shown in FIG. 10A and 10B, ac4C stimulates mRNA translation as compared to unmodified mRNA and mRNA containing the uridine modification pseudouridine. Western blot for Nanoluciferase (Nanoluc) protein levels in HeLa cells transfected with in vitro transcribed Nanoluc mRNA with either canonical nucleotides, or NTPs in which ac4C or pseudouridine (PseudoU) have been substituted for C or U, respectively. Luminescence for Nanoluc in transfected HeLa cells with in vitro transcribed Nanoluc mRNA with either canonical nucleotides, or mixtures containing ac4C or PseudoU as in (10A).
[0104]
References
1. Arango et al., Mol Cell 2022.
2. Arango et al., Cell 2018, 175, 1872-1886 e1824.
3. Kiss and Filipowicz, Genes Dev 1995, 9, 1411-1424.
4. Reichow et al., Nucleic Acids Res 2007, 35, 1452-1464.
5. Heiss et al., Methods Mol Biol 2021 , 2298, 279-306.
6. Traube et al., , Nat Protoc 2019, 14, 283-312.
7. Tyc and Steitz, EM BO J 1989, 8, 3113-3119.
8. Veitia, Exploring the etiology of haploinsufficiency. Bioessays 2022, 24, 175-184.
9. Fisher, and Scambier, Human haploinsufficiency--one for sorrow, two for joy. Nat Genet 1994, 7, 5-7.
10. Dang et al., Identification of human haploinsufficient genes and their genomic proximity to segmental duplications. Eur J Hum Genet 2008, 16, 1350-1357.
11. Collins et al., A cross-disorder dosage sensitivity map of the human genome. Cell 2022, 185, 3041 -3055. e3025.
12. Claes et al., De Novo Mutations in the Sodium-Channel Gene SCN1A Cause Severe Myoclonic Epilepsy of Infancy. The American Journal of Human Genetics 2001 ,68, 1327- 1332.
13. Moumne et al., Oligonucleotide Therapeutics: From Discovery and Development to Patentability. Pharmaceutics 2022, 14.
14. Gagliardi, M., and Ashizawa, A. T. The Challenges and Strategies of Antisense Oligonucleotide Drug Delivery. Biomedicines 2021 , 9.
15. Collins et al., Mild overexpression of MeCP2 causes a progressive neurological disorder in mice. Hum Mol Genet 2004, 13, 2679-2689.
[0105] Having described the invention in detail and by reference to specific embodiments thereof, it will be apparent that modifications and variations are possible without departing from the scope of the invention defined in the appended claims. More specifically, although some aspects of the present invention are identified herein as particularly advantageous, it is contemplated that the present invention is not necessarily limited to these particular aspects of the invention.
Claims
1. An engineered nucleotide sequence, comprising, 5’ to 3’:
(a) a first targeting sequence;
(b) a C box motif containing sequence;
(c) a second targeting sequence; and
(d) a D box motif containing sequence.
2. The engineered nucleotide sequence of claim 1 , wherein the first targeting sequence comprises about 15 to about 30 nucleotides.
3. The engineered nucleotide sequence of either claim 1 or claim 2, wherein the second targeting sequence comprises about 15 to about 30 nucleotides.
4. The engineered nucleotide sequence of any one of claims 1-3, wherein the first targeting sequence and the second targeting sequence are complementary to separate regions of a target sequence.
5. The engineered nucleotide sequence of claim 4, wherein the target sequence comprises an RNA transcript.
6. The engineered nucleotide sequence of claim 5, wherein the target sequence of the RNA transcript comprises a 5’ untranslated region of the RNA transcript.
7. The engineered nucleotide sequence of claim 5, wherein the target sequence of the RNA transcript comprises a coding sequence of the RNA transcript.
8. The engineered nucleotide sequence of any one of claims 4-7, wherein the target sequence comprises a known acetylation site or a putative acetylation site, optionally wherein the target sequence is within about 30-50 nucleotides of the known acetylation site or the putative acetylation site.
9. The engineered nucleotide sequence of any one of claims 1-8, wherein the C box motif containing sequence comprises a 5’ RUGAUGA-3’ sequence (wherein R is a purine).
The engineered nucleotide sequence of any one of claims 1-9, wherein the C box motif containing sequence comprises about 40 to about 200 nucleotides. The engineered nucleotide sequence of any one of claims 1-10, wherein the D box motif containing sequence comprises a 5’ CUGA-3’ sequence. The engineered nucleotide sequence of any one of claims 1-11, wherein the D box motif containing sequence comprises about 15 to about 50 nucleotides. The engineered nucleotide sequence of any one of claims 1-12, wherein the engineered nucleotide sequence recruits NAT10 to the target sequence and causes acetylation (e.g., N4-acetylcytidine (ac4C) modification) of the target sequence. The engineered nucleotide sequence of any one of claims 1-13, wherein the targeting sequences are complementary to a target sequence in a desired target gene, and the gene is involved or implicated in any disease or disorder where moderate protein expression can lead to alleviation of the disease or disorder phenotype. The engineered nucleotide sequence of claim 14, wherein the desired target gene encodes a ribosomal protein, a gene involved in RNA processing, a gene involved in gene transcription and/or translation, or a gene involved in proteostasis. The engineered nucleotide sequence of claim 15, wherein the desired target gene is selected from the group consisting of CD25, CHD2, CFTR, CHD8, COL1A1 , COL1A2, FBN1, FIGLA, FOXG1 , FOXP2, FZD4, GATA2, GATA4, GLI2, GLI3, HNF1 B, HNF4A, HOXD13, IKZF1 , IL2RA, IRF6, KIF1 B, KIF21A, MECP2, MED13L, MEF2C, MEIS2, NHF1 B, NKX2-5, NSD1 , NR2F2, NR5A1 , PAX2, PAX3, PAX5, PAX6, PBX1 , PITX2, PPARG, POU3F2, RAI1, RUNX1 , RUNX2, SCN1A, SETBP1 , SETD1A, SIM, SIX3, SLC52A2, SOX2, SOX9, SOX10, TBX1 , TBX5, TBR1 , ZIC2, and ZFPM2. The engineered nucleotide sequence of any one of claims 1-13, wherein the targeting sequences are complementary to a target sequence in a desired target gene, and the desired target gene is involved or implicated in cancer.
The engineered nucleotide sequence of claim 17, wherein the desired target gene is selected from the group consisting of AKT1, ALK, APC, ATM, BAG1 , BARD1, BMPR1A, BRAF, BRCA1 , BRCA2, BRIP1, CDK4, CDH1 , CDK4, CDKN1C, CDKN2A, CHEK2, CHK2, CREBBP, CTNNB1 , EGFR, EP300, EPCAM, ERBB2, ETV6, FBXW7, FGFR2, FHIT, FLT3, FOXL2, GNAQ, GNAS, HOXB13, HRAS, KIT, KRAS, MAP2K1 , MET, MLH1 , MLL, MSH2, MSH6, MYC, NF1 , NTR3, NTRK1 , NRAS, PALB2, PAX8, PDGFRA, PIK3CA, PMS2, PPARy, PRCC, PRKAR1A, PTEN, RAD51 D, RET, RUNXBP2, SMAD4, SRC, STK11, TFE3, TGF- , TGF-(3RII, TP53, VEGFA, and WWOX. A vector comprising the engineered nucleotide sequence of any one of claims 1 to 18. The vector of claim 19, wherein the vector is a virus, a lentivirus, an adenovirus, a retrovirus, an adeno-associated virus (AAV), a transposon, a DNA vector, or a RNA vector. A pharmaceutical composition, comprising the engineered nucleotide sequence of any one of claims 1 to 18, or the vector of either claim 19 or 20, and a pharmaceutically acceptable excipient. A method for directing acetylation to a target sequence, wherein the method comprises administering the engineered nucleotide sequence of any of claims 1 to 18, the vector of either claim 19 or 20, or the pharmaceutical composition of claim 21. A method for treating a disease or disorder, wherein the method comprises administering the engineered nucleotide sequence of any of claims 1 to 18, the vector of either claim 19 or 20, or the pharmaceutical composition of claim 21 to a subject in need thereof. The method of claim 23, wherein the disease or disorder comprises any disease or disorder where moderate protein expression can lead to alleviation of the disease or disorder phenotype (/.e., a disease or disorder with an associated haploinsufficiency). The method of claim 23, wherein the disease or disorder is cancer.
Use of the engineered nucleotide sequence of any of claims 1 to 18, the vector of either claim 19 or 20, or the pharmaceutical composition of claim 21 in the treatment of a disease or disorder in a subject in need thereof. The use of claim 26, wherein the disease or disorder comprises any disease or disorder where moderate protein expression can lead to alleviation of the disease or disorder phenotype (/.e., a disease or disorder with an associated haploinsufficiency). The use of claim 26, wherein the disease or disorder is cancer. The method and/or use of any of claims 23 to 27, wherein the disease or disorder is associated with a haploinsuffciency. The method and/or use of any of claims 23 to 27, wherein the disease or disorder is cancer. The method and/or use of any of claims 22 to 30, wherein the method and/or use results in increased protein levels of the protein encoded by the desired target gene. The method and/or use of any of claims 22 to 30, wherein the method and/or use results in decreased protein levels of the protein encoded by the desired target gene.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202263423714P | 2022-11-08 | 2022-11-08 | |
US63/423,714 | 2022-11-08 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024102659A1 true WO2024102659A1 (en) | 2024-05-16 |
Family
ID=89222875
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2023/078830 WO2024102659A1 (en) | 2022-11-08 | 2023-11-06 | Directed acetylation of mrna through engineered snorna adapters |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024102659A1 (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009037490A1 (en) * | 2007-09-22 | 2009-03-26 | University Court Of The University Of Dundee | Targeted modulation of gene expression |
WO2012009667A2 (en) * | 2010-07-16 | 2012-01-19 | University Of Rochester | Targeted 2'-o-methylation of telomerase non-coding rna |
-
2023
- 2023-11-06 WO PCT/US2023/078830 patent/WO2024102659A1/en unknown
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009037490A1 (en) * | 2007-09-22 | 2009-03-26 | University Court Of The University Of Dundee | Targeted modulation of gene expression |
WO2012009667A2 (en) * | 2010-07-16 | 2012-01-19 | University Of Rochester | Targeted 2'-o-methylation of telomerase non-coding rna |
Non-Patent Citations (21)
Title |
---|
ARANGO DANIEL ET AL: "Acetylation of Cytidine in mRNA Promotes Translation Efficiency", CELL, vol. 175, no. 7, 13 December 2018 (2018-12-13), pages 1872 - 1886, XP085560737, ISSN: 0092-8674, DOI: 10.1016/J.CELL.2018.10.030 * |
ARANGO DANIEL ET AL: "Direct epitranscriptomic regulation of mammalian translation initiation through N4-acetylcytidine", MOLECULAR CELL, vol. 82, no. 15, 8 June 2022 (2022-06-08), pages 2797 - 2814, XP087138513, ISSN: 1097-2765, [retrieved on 20220608], DOI: 10.1016/J.MOLCEL.2022.05.016 * |
ARANGO ET AL., CELL, vol. 175, 2018, pages 1872 - 1886 |
ARANGO ET AL., MOL CELL, 2022 |
CLAES ET AL.: "De Novo Mutations in the Sodium-Channel Gene SCN1A Cause Severe Myoclonic Epilepsy of Infancy", THE AMERICAN JOURNAL OF HUMAN GENETICS, vol. 68, 2001, pages 1327 - 1332, XP002973534, DOI: 10.1086/320609 |
COLLINS ET AL.: "A cross-disorder dosage sensitivity map of the human genome", CELL, vol. 185, 2022, pages 3041 - 3055 |
COLLINS ET AL.: "Mild overexpression of MeCP2 causes a progressive neurological disorder in mice", HUM MOL GENET, vol. 13, 2004, pages 2679 - 2689 |
DALHAT MAHMOOD HASSAN ET AL: "NAT10: An RNA cytidine transferase regulates fatty acid metabolism in cancer cells", CLINICAL AND TRANSLATIONAL MEDICINE, vol. 12, no. 9, E1045, 1 September 2022 (2022-09-01), SE, pages 1 - 20, XP093130166, ISSN: 2001-1326, Retrieved from the Internet <URL:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9505754/pdf/CTM2-12-e1045.pdf> DOI: 10.1002/ctm2.1045 * |
DANG ET AL.: "Identification of human haploinsufficient genes and their genomic proximity to segmental duplications", EUR J HUM GENET, vol. 16, 2008, pages 1350 - 1357 |
FISHERSCAMBLER: "Human haploinsufficiency--one for sorrow, two for joy", NAT GENET, vol. 7, 1994, pages 5 - 7 |
GAGLIARDI, M.ASHIZAWA, A. T.: "The Challenges and Strategies of Antisense Oligonucleotide Drug Delivery", BIOMEDICINES, vol. 9, 2021, pages 15 |
HEISS ET AL., METHODS MOL BIOL, vol. 2298, 2021, pages 279 - 306 |
INNIS ET AL.: "PCR Protocols: A Guide to Methods and Applications", 1990, ACADEMIC PRESS |
KARTHIYA R ET AL: "Emerging role of N4-acetylcytidine modification of RNA in gene regulation and cellular functions", MOLECULAR BIOLOGY REPORTS, vol. 47, no. 11, 2020, pages 9189 - 9199, XP037297938, ISSN: 0301-4851, DOI: 10.1007/S11033-020-05963-W * |
KISSFILIPOWICZ, GENES DEV, vol. 9, 1995, pages 1411 - 1424 |
MOUMNE: "Oligonucleotide Therapeutics: From Discovery and Development to Patentability", PHARMACEUTICS, 2022, pages 14 |
REICHOW ET AL., NUCLEIC ACIDS RES, vol. 35, 2007, pages 1452 - 1464 |
SUPUNI THALALLA ET AL: "Antisense pairing and SNORD13 structure guide RNA cytidine acetylation", RNA, vol. 28, 20 September 2022 (2022-09-20), pages 1582 - 1596, XP093130250, DOI: 10.1261/rna * |
TRAUBE ET AL., NAT PROTOC, vol. 14, 2019, pages 283 - 312 |
TYCSTEITZ, EM BO J, vol. 8, 1989, pages 3113 - 3119 |
VEITIA: "Exploring the etiology of haploinsufficiency", BIOESSAYS, vol. 24, 2022, pages 175 - 184, XP071524190, DOI: 10.1002/bies.10023 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2018352221B2 (en) | Peptides and nanoparticles for intracellular delivery of mRNA | |
JP7007304B2 (en) | Nucleic acid molecule for reducing mRNA of PAPD5 or PAPD7 for the treatment of hepatitis B infection | |
AU2015227733B2 (en) | Antisense nucleic acids | |
JP7416852B2 (en) | Antisense oligonucleotides for regulating HTRA1 expression | |
JP2023165986A (en) | Peptides and nanoparticles for intracellular delivery of genome-editing molecules | |
KR102302679B1 (en) | Pharmaceutical composition for treating cancers comprising guide rna and endonuclease | |
EP3778892A1 (en) | Novel small activating rna | |
CN111511914B (en) | Nucleic acid molecules that reduce PAPD5 and PAPD7 mRNA for the treatment of hepatitis B infection | |
JP2024041845A (en) | Gapmer oligonucleotides comprising phosphorodithioate internucleoside linkage | |
JP2023139252A (en) | Novel thiophosphoramidites | |
WO2023240261A1 (en) | Nucleobase editing system and method of using same for modifying nucleic acid sequences | |
JP2023002469A (en) | Novel mRNA composition used for antiviral and anticancer vaccines and method for producing the same | |
JP2024041886A (en) | Oligonucleotides comprising hosphorodithioate internucleoside linkage | |
WO2010032704A1 (en) | Pharmaceutical preparation comprising micro-rna-143 derivative | |
WO2024102659A1 (en) | Directed acetylation of mrna through engineered snorna adapters | |
WO2022138929A1 (en) | Guide rna for editing polyadenylation signal sequence of target rna | |
US20230313231A1 (en) | Rna and dna base editing via engineered adar | |
CN114426960A (en) | PDA @ CRISPR/Cas9/sgHMGA2NPs complex, preparation method and application | |
CN114901823A (en) | Antisense nucleic acids inducing skipping of exon 50 | |
CN107446012B (en) | Tumor fluorescent imaging agent, preparation and application thereof | |
CN114026234A (en) | Antisense oligonucleotides altering DUX4pre-mRNA splicing | |
WO2024002045A1 (en) | Oligonucleotide delivery agents, pharmaceutical compositions and methods using the same | |
CN105803053B (en) | Application of human RBM17 gene and related medicine thereof | |
KR100966333B1 (en) | Agent for Inhibiting Large Intestine Cancer Cell Proliferation Comprising Gene Delivery Vehicle Containing a GABRA1 Gene | |
WO2019181305A1 (en) | CANCER PROLIFERATION INHIBITOR COMPRISING snoRNA EXPRESSION INHIBITOR AS ACTIVE INGREDIENT |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23825138 Country of ref document: EP Kind code of ref document: A1 |