WO2023205637A1 - Compositions ciblant l'arn et procédés pour traiter les maladies c9/orf72 - Google Patents
Compositions ciblant l'arn et procédés pour traiter les maladies c9/orf72 Download PDFInfo
- Publication number
- WO2023205637A1 WO2023205637A1 PCT/US2023/065889 US2023065889W WO2023205637A1 WO 2023205637 A1 WO2023205637 A1 WO 2023205637A1 US 2023065889 W US2023065889 W US 2023065889W WO 2023205637 A1 WO2023205637 A1 WO 2023205637A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sequence
- seq
- rna
- vector
- promoter
- Prior art date
Links
- 239000000203 mixture Substances 0.000 title claims abstract description 112
- 238000000034 method Methods 0.000 title claims abstract description 79
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 title abstract description 36
- 201000010099 disease Diseases 0.000 title abstract description 24
- 231100000331 toxic Toxicity 0.000 claims abstract description 31
- 230000002588 toxic effect Effects 0.000 claims abstract description 31
- 206010002026 amyotrophic lateral sclerosis Diseases 0.000 claims abstract description 30
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 181
- 239000013598 vector Substances 0.000 claims description 180
- 108020005004 Guide RNA Proteins 0.000 claims description 156
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 154
- 108090000623 proteins and genes Proteins 0.000 claims description 133
- 150000007523 nucleic acids Chemical group 0.000 claims description 129
- 239000013607 AAV vector Substances 0.000 claims description 115
- 102000004169 proteins and genes Human genes 0.000 claims description 114
- 230000014509 gene expression Effects 0.000 claims description 105
- 230000000692 anti-sense effect Effects 0.000 claims description 64
- 125000006850 spacer group Chemical group 0.000 claims description 62
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 57
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 54
- 229920001184 polypeptide Polymers 0.000 claims description 47
- 150000001413 amino acids Chemical group 0.000 claims description 41
- 230000004570 RNA-binding Effects 0.000 claims description 37
- 230000027455 binding Effects 0.000 claims description 34
- 201000011240 Frontotemporal dementia Diseases 0.000 claims description 19
- 102000040430 polynucleotide Human genes 0.000 claims description 16
- 108091033319 polynucleotide Proteins 0.000 claims description 16
- 239000002157 polynucleotide Substances 0.000 claims description 16
- 241000702421 Dependoparvovirus Species 0.000 claims description 9
- 241000713666 Lentivirus Species 0.000 claims description 8
- 230000009467 reduction Effects 0.000 claims description 8
- 241000124008 Mammalia Species 0.000 claims description 7
- 208000024891 symptom Diseases 0.000 claims description 7
- 241000701161 unidentified adenovirus Species 0.000 claims description 6
- 101710135281 DNA polymerase III PolC-type Proteins 0.000 claims description 5
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 claims description 5
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 5
- -1 polyplex Substances 0.000 claims description 5
- 241001430294 unidentified retrovirus Species 0.000 claims description 5
- 239000004474 valine Substances 0.000 claims description 5
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 3
- 239000000412 dendrimer Substances 0.000 claims description 3
- 229920000736 dendritic polymer Polymers 0.000 claims description 3
- 239000002479 lipoplex Substances 0.000 claims description 3
- 239000002502 liposome Substances 0.000 claims description 3
- 239000000693 micelle Substances 0.000 claims description 3
- 239000002105 nanoparticle Substances 0.000 claims description 3
- 229920000575 polymersome Polymers 0.000 claims description 3
- 102100021244 Integral membrane protein GPR180 Human genes 0.000 claims 4
- 238000001415 gene therapy Methods 0.000 abstract description 7
- 230000000903 blocking effect Effects 0.000 abstract description 3
- 235000018102 proteins Nutrition 0.000 description 103
- 210000004027 cell Anatomy 0.000 description 88
- 230000008685 targeting Effects 0.000 description 64
- 125000003729 nucleotide group Chemical group 0.000 description 54
- 239000002773 nucleotide Substances 0.000 description 53
- 239000013612 plasmid Substances 0.000 description 37
- 102000044126 RNA-Binding Proteins Human genes 0.000 description 36
- 101710159080 Aconitate hydratase A Proteins 0.000 description 35
- 101710159078 Aconitate hydratase B Proteins 0.000 description 35
- 101710105008 RNA-binding protein Proteins 0.000 description 35
- 108091033409 CRISPR Proteins 0.000 description 30
- 210000001519 tissue Anatomy 0.000 description 30
- 238000010354 CRISPR gene editing Methods 0.000 description 29
- 102000039446 nucleic acids Human genes 0.000 description 28
- 108020004707 nucleic acids Proteins 0.000 description 28
- 108020001507 fusion proteins Proteins 0.000 description 24
- 102000037865 fusion proteins Human genes 0.000 description 24
- 238000002347 injection Methods 0.000 description 24
- 239000007924 injection Substances 0.000 description 24
- 230000001105 regulatory effect Effects 0.000 description 24
- 239000013603 viral vector Substances 0.000 description 24
- 241000699666 Mus <mouse, genus> Species 0.000 description 16
- 241000699670 Mus sp. Species 0.000 description 15
- 239000003623 enhancer Substances 0.000 description 15
- 210000004962 mammalian cell Anatomy 0.000 description 15
- 108091081021 Sense strand Proteins 0.000 description 14
- 238000011282 treatment Methods 0.000 description 14
- 108020004414 DNA Proteins 0.000 description 13
- 108090000331 Firefly luciferases Proteins 0.000 description 13
- 230000000694 effects Effects 0.000 description 13
- 230000035772 mutation Effects 0.000 description 13
- 108020004705 Codon Proteins 0.000 description 12
- 101710163270 Nuclease Proteins 0.000 description 12
- 230000000295 complement effect Effects 0.000 description 12
- 208000035475 disorder Diseases 0.000 description 12
- 108010016626 Dipeptides Proteins 0.000 description 11
- 108010029485 Protein Isoforms Proteins 0.000 description 11
- 102000001708 Protein Isoforms Human genes 0.000 description 11
- 230000001124 posttranscriptional effect Effects 0.000 description 11
- 230000010076 replication Effects 0.000 description 11
- 230000008030 elimination Effects 0.000 description 10
- 238000003379 elimination reaction Methods 0.000 description 10
- 210000002950 fibroblast Anatomy 0.000 description 10
- 238000001727 in vivo Methods 0.000 description 10
- 108010042407 Endonucleases Proteins 0.000 description 9
- 102000004533 Endonucleases Human genes 0.000 description 9
- 108010083644 Ribonucleases Proteins 0.000 description 9
- 102000006382 Ribonucleases Human genes 0.000 description 9
- 230000009977 dual effect Effects 0.000 description 9
- 230000002068 genetic effect Effects 0.000 description 9
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 8
- 241000725303 Human immunodeficiency virus Species 0.000 description 8
- 108010052090 Renilla Luciferases Proteins 0.000 description 8
- 108020004566 Transfer RNA Proteins 0.000 description 8
- 241000700605 Viruses Species 0.000 description 8
- 235000001014 amino acid Nutrition 0.000 description 8
- 229940024606 amino acid Drugs 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 238000003491 array Methods 0.000 description 8
- 210000004899 c-terminal region Anatomy 0.000 description 8
- 239000013604 expression vector Substances 0.000 description 8
- 210000005260 human cell Anatomy 0.000 description 8
- 238000010172 mouse model Methods 0.000 description 8
- 210000002569 neuron Anatomy 0.000 description 8
- 230000001717 pathogenic effect Effects 0.000 description 8
- 238000001890 transfection Methods 0.000 description 8
- 101150020662 ATP5F1B gene Proteins 0.000 description 7
- 108020005544 Antisense RNA Proteins 0.000 description 7
- 238000011529 RT qPCR Methods 0.000 description 7
- 238000009396 hybridization Methods 0.000 description 7
- 238000000338 in vitro Methods 0.000 description 7
- 210000001082 somatic cell Anatomy 0.000 description 7
- 210000000278 spinal cord Anatomy 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 241000702423 Adeno-associated virus - 2 Species 0.000 description 6
- 108700030955 C9orf72 Proteins 0.000 description 6
- 102000043334 C9orf72 Human genes 0.000 description 6
- 102100023882 Endoribonuclease ZC3H12A Human genes 0.000 description 6
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 6
- 101000976212 Homo sapiens Endoribonuclease ZC3H12A Proteins 0.000 description 6
- 210000000234 capsid Anatomy 0.000 description 6
- 239000003184 complementary RNA Substances 0.000 description 6
- 210000000981 epithelium Anatomy 0.000 description 6
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 6
- 210000000115 thoracic cavity Anatomy 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- 239000003981 vehicle Substances 0.000 description 6
- 230000003612 virological effect Effects 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 5
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 5
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 5
- 239000000090 biomarker Substances 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 210000002919 epithelial cell Anatomy 0.000 description 5
- 210000004907 gland Anatomy 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- 238000004020 luminiscence type Methods 0.000 description 5
- 230000001575 pathological effect Effects 0.000 description 5
- 238000003259 recombinant expression Methods 0.000 description 5
- 241000713704 Bovine immunodeficiency virus Species 0.000 description 4
- 241000713756 Caprine arthritis encephalitis virus Species 0.000 description 4
- 108700010070 Codon Usage Proteins 0.000 description 4
- 241000713730 Equine infectious anemia virus Species 0.000 description 4
- 241000713800 Feline immunodeficiency virus Species 0.000 description 4
- 108060001084 Luciferase Proteins 0.000 description 4
- 239000005089 Luciferase Substances 0.000 description 4
- 108010059343 MM Form Creatine Kinase Proteins 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 241000713311 Simian immunodeficiency virus Species 0.000 description 4
- 108020003224 Small Nucleolar RNA Proteins 0.000 description 4
- 102000042773 Small Nucleolar RNA Human genes 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 238000011065 in-situ storage Methods 0.000 description 4
- 208000015181 infectious disease Diseases 0.000 description 4
- 238000001990 intravenous administration Methods 0.000 description 4
- 108020004999 messenger RNA Proteins 0.000 description 4
- 230000030648 nucleus localization Effects 0.000 description 4
- 239000002245 particle Substances 0.000 description 4
- 230000003950 pathogenic mechanism Effects 0.000 description 4
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 238000010361 transduction Methods 0.000 description 4
- 230000026683 transduction Effects 0.000 description 4
- 241001655883 Adeno-associated virus - 1 Species 0.000 description 3
- 241000202702 Adeno-associated virus - 3 Species 0.000 description 3
- 241000580270 Adeno-associated virus - 4 Species 0.000 description 3
- 241001634120 Adeno-associated virus - 5 Species 0.000 description 3
- 241000972680 Adeno-associated virus - 6 Species 0.000 description 3
- 241001164823 Adeno-associated virus - 7 Species 0.000 description 3
- 241001164825 Adeno-associated virus - 8 Species 0.000 description 3
- 108700028369 Alleles Proteins 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 3
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 3
- 102000016621 Focal Adhesion Protein-Tyrosine Kinases Human genes 0.000 description 3
- 108010067715 Focal Adhesion Protein-Tyrosine Kinases Proteins 0.000 description 3
- 241000287828 Gallus gallus Species 0.000 description 3
- 208000032818 Microsatellite Instability Diseases 0.000 description 3
- 102100038931 Proenkephalin-A Human genes 0.000 description 3
- 238000012338 Therapeutic targeting Methods 0.000 description 3
- 108700019146 Transgenes Proteins 0.000 description 3
- 241001492404 Woodchuck hepatitis virus Species 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000002457 bidirectional effect Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 230000006378 damage Effects 0.000 description 3
- 230000002950 deficient Effects 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 239000001257 hydrogen Substances 0.000 description 3
- 229910052739 hydrogen Inorganic materials 0.000 description 3
- 238000007913 intrathecal administration Methods 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 239000013642 negative control Substances 0.000 description 3
- 230000016273 neuron death Effects 0.000 description 3
- 210000004940 nucleus Anatomy 0.000 description 3
- 108010074732 preproenkephalin Proteins 0.000 description 3
- 238000012552 review Methods 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- KISWVXRQTGLFGD-UHFFFAOYSA-N 2-[[2-[[6-amino-2-[[2-[[2-[[5-amino-2-[[2-[[1-[2-[[6-amino-2-[(2,5-diamino-5-oxopentanoyl)amino]hexanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]-3-hydroxypropanoyl]amino]-5-oxopentanoyl]amino]-5-(diaminomethylideneamino)p Chemical compound C1CCN(C(=O)C(CCCN=C(N)N)NC(=O)C(CCCCN)NC(=O)C(N)CCC(N)=O)C1C(=O)NC(CO)C(=O)NC(CCC(N)=O)C(=O)NC(CCCN=C(N)N)C(=O)NC(CO)C(=O)NC(CCCCN)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 KISWVXRQTGLFGD-UHFFFAOYSA-N 0.000 description 2
- FZWGECJQACGGTI-UHFFFAOYSA-N 2-amino-7-methyl-1,7-dihydro-6H-purin-6-one Chemical compound NC1=NC(O)=C2N(C)C=NC2=N1 FZWGECJQACGGTI-UHFFFAOYSA-N 0.000 description 2
- OIVLITBTBDPEFK-UHFFFAOYSA-N 5,6-dihydrouracil Chemical compound O=C1CCNC(=O)N1 OIVLITBTBDPEFK-UHFFFAOYSA-N 0.000 description 2
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 description 2
- LRFVTYWOQMYALW-UHFFFAOYSA-N 9H-xanthine Chemical compound O=C1NC(=O)NC2=C1NC=N2 LRFVTYWOQMYALW-UHFFFAOYSA-N 0.000 description 2
- 238000010453 CRISPR/Cas method Methods 0.000 description 2
- 108090000565 Capsid Proteins Proteins 0.000 description 2
- 241000282556 Cercocebus atys Species 0.000 description 2
- 102100023321 Ceruloplasmin Human genes 0.000 description 2
- 241000282552 Chlorocebus aethiops Species 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 101100239628 Danio rerio myca gene Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 2
- 102000053171 Glial Fibrillary Acidic Human genes 0.000 description 2
- 101710193519 Glial fibrillary acidic protein Proteins 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101001111338 Homo sapiens Neurofilament heavy polypeptide Proteins 0.000 description 2
- 101000979333 Homo sapiens Neurofilament light polypeptide Proteins 0.000 description 2
- 241001496637 Listeria weihenstephanensis FSL R9-0317 Species 0.000 description 2
- 102100036837 Metabotropic glutamate receptor 2 Human genes 0.000 description 2
- 108010072388 Methyl-CpG-Binding Protein 2 Proteins 0.000 description 2
- 102100039124 Methyl-CpG-binding protein 2 Human genes 0.000 description 2
- 108091092878 Microsatellite Proteins 0.000 description 2
- 102000047918 Myelin Basic Human genes 0.000 description 2
- 101710107068 Myelin basic protein Proteins 0.000 description 2
- 102100024007 Neurofilament heavy polypeptide Human genes 0.000 description 2
- 102100023057 Neurofilament light polypeptide Human genes 0.000 description 2
- 108020003217 Nuclear RNA Proteins 0.000 description 2
- 102000043141 Nuclear RNA Human genes 0.000 description 2
- 102000011931 Nucleoproteins Human genes 0.000 description 2
- 108010061100 Nucleoproteins Proteins 0.000 description 2
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 2
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 2
- 108010038512 Platelet-Derived Growth Factor Proteins 0.000 description 2
- 102000010780 Platelet-Derived Growth Factor Human genes 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 241000125945 Protoparvovirus Species 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 102000001435 Synapsin Human genes 0.000 description 2
- 108050009621 Synapsin Proteins 0.000 description 2
- 102000017299 Synapsin-1 Human genes 0.000 description 2
- 108050005241 Synapsin-1 Proteins 0.000 description 2
- 102000002248 Thyroxine-Binding Globulin Human genes 0.000 description 2
- 108010000259 Thyroxine-Binding Globulin Proteins 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 241000713325 Visna/maedi virus Species 0.000 description 2
- 230000004931 aggregating effect Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000002146 bilateral effect Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 210000002777 columnar cell Anatomy 0.000 description 2
- 210000003618 cortical neuron Anatomy 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- 210000001671 embryonic stem cell Anatomy 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 210000004602 germ cell Anatomy 0.000 description 2
- 210000002980 germ line cell Anatomy 0.000 description 2
- 210000005046 glial fibrillary acidic protein Anatomy 0.000 description 2
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 210000004263 induced pluripotent stem cell Anatomy 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000002458 infectious effect Effects 0.000 description 2
- 238000001802 infusion Methods 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- DRAVOWXCEBXPTN-UHFFFAOYSA-N isoguanine Chemical compound NC1=NC(=O)NC2=C1NC=N2 DRAVOWXCEBXPTN-UHFFFAOYSA-N 0.000 description 2
- 238000003670 luciferase enzyme activity assay Methods 0.000 description 2
- 210000004705 lumbosacral region Anatomy 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 108010038421 metabotropic glutamate receptor 2 Proteins 0.000 description 2
- 238000007069 methylation reaction Methods 0.000 description 2
- 210000002161 motor neuron Anatomy 0.000 description 2
- 238000004806 packaging method and process Methods 0.000 description 2
- 239000008194 pharmaceutical composition Substances 0.000 description 2
- 108010079892 phosphoglycerol kinase Proteins 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 229950010131 puromycin Drugs 0.000 description 2
- 230000003252 repetitive effect Effects 0.000 description 2
- 230000001177 retroviral effect Effects 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- XUIIKFGFIJCVMT-UHFFFAOYSA-N thyroxine-binding globulin Natural products IC1=CC(CC([NH3+])C([O-])=O)=CC(I)=C1OC1=CC(I)=C(O)C(I)=C1 XUIIKFGFIJCVMT-UHFFFAOYSA-N 0.000 description 2
- 231100000419 toxicity Toxicity 0.000 description 2
- 230000001988 toxicity Effects 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- XQCZBXHVTFVIFE-UHFFFAOYSA-N 2-amino-4-hydroxypyrimidine Chemical compound NC1=NC=CC(O)=N1 XQCZBXHVTFVIFE-UHFFFAOYSA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 1
- ZAYHVCMSTBRABG-UHFFFAOYSA-N 5-Methylcytidine Natural products O=C1N=C(N)C(C)=CN1C1C(O)C(O)C(CO)O1 ZAYHVCMSTBRABG-UHFFFAOYSA-N 0.000 description 1
- ZAYHVCMSTBRABG-JXOAFFINSA-N 5-methylcytidine Chemical compound O=C1N=C(N)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZAYHVCMSTBRABG-JXOAFFINSA-N 0.000 description 1
- OGHAROSJZRTIOK-KQYNXXCUSA-O 7-methylguanosine Chemical compound C1=2N=C(N)NC(=O)C=2[N+](C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OGHAROSJZRTIOK-KQYNXXCUSA-O 0.000 description 1
- 241000649045 Adeno-associated virus 10 Species 0.000 description 1
- 241000649046 Adeno-associated virus 11 Species 0.000 description 1
- 241000649047 Adeno-associated virus 12 Species 0.000 description 1
- 102000055025 Adenosine deaminases Human genes 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 102100026031 Beta-glucuronidase Human genes 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 102000004657 Calcium-Calmodulin-Dependent Protein Kinase Type 2 Human genes 0.000 description 1
- 108010003721 Calcium-Calmodulin-Dependent Protein Kinase Type 2 Proteins 0.000 description 1
- 101710116137 Calcium/calmodulin-dependent protein kinase II Proteins 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 101150044789 Cap gene Proteins 0.000 description 1
- 241000210552 Carnobacterium gallinarum DSM 4847 Species 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- 241000557626 Corvus corax Species 0.000 description 1
- 241000918600 Corynebacterium ulcerans Species 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- 238000000116 DAPI staining Methods 0.000 description 1
- 102100036912 Desmin Human genes 0.000 description 1
- 108010044052 Desmin Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 102100031562 Excitatory amino acid transporter 2 Human genes 0.000 description 1
- 241000282324 Felis Species 0.000 description 1
- 102000004437 G-Protein-Coupled Receptor Kinase 1 Human genes 0.000 description 1
- 241001123946 Gaga Species 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- 102000034575 Glutamate transporters Human genes 0.000 description 1
- 108091006151 Glutamate transporters Proteins 0.000 description 1
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 1
- 208000009889 Herpes Simplex Diseases 0.000 description 1
- 108091027305 Heteroduplex Proteins 0.000 description 1
- 101000933465 Homo sapiens Beta-glucuronidase Proteins 0.000 description 1
- 101000866287 Homo sapiens Excitatory amino acid transporter 2 Proteins 0.000 description 1
- 101000611338 Homo sapiens Rhodopsin Proteins 0.000 description 1
- 241000714260 Human T-lymphotropic virus 1 Species 0.000 description 1
- 206010062767 Hypophysitis Diseases 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- UBORTCNDUKBEOP-UHFFFAOYSA-N L-xanthosine Natural products OC1C(O)C(CO)OC1N1C(NC(=O)NC2=O)=C2N=C1 UBORTCNDUKBEOP-UHFFFAOYSA-N 0.000 description 1
- 241000904817 Lachnospiraceae bacterium Species 0.000 description 1
- 241000254158 Lampyridae Species 0.000 description 1
- 241000029590 Leptotrichia wadei Species 0.000 description 1
- 241000778057 Leptotrichia wadei F0279 Species 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 241000390917 Listeria newyorkensis Species 0.000 description 1
- 241000186807 Listeria seeligeri Species 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 108700011259 MicroRNAs Proteins 0.000 description 1
- 208000026072 Motor neurone disease Diseases 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 208000008238 Muscle Spasticity Diseases 0.000 description 1
- 208000010428 Muscle Weakness Diseases 0.000 description 1
- 206010028372 Muscular weakness Diseases 0.000 description 1
- 102000005604 Myosin Heavy Chains Human genes 0.000 description 1
- VQAYFKKCNSOZKM-IOSLPCCCSA-N N(6)-methyladenosine Chemical compound C1=NC=2C(NC)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O VQAYFKKCNSOZKM-IOSLPCCCSA-N 0.000 description 1
- 208000012902 Nervous system disease Diseases 0.000 description 1
- 208000025966 Neurological disease Diseases 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 108010066154 Nuclear Export Signals Proteins 0.000 description 1
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 1
- 102000010175 Opsin Human genes 0.000 description 1
- 108050001704 Opsin Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 101000768120 Oryza nivara Uncharacterized protein ycf76 Proteins 0.000 description 1
- 241000007215 Paludibacter propionicigenes WB4 Species 0.000 description 1
- 206010033799 Paralysis Diseases 0.000 description 1
- 241000701945 Parvoviridae Species 0.000 description 1
- 102000002508 Peptide Elongation Factors Human genes 0.000 description 1
- 108010068204 Peptide Elongation Factors Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 102100037596 Platelet-derived growth factor subunit A Human genes 0.000 description 1
- 101800001494 Protease 2A Proteins 0.000 description 1
- 101800001066 Protein 2A Proteins 0.000 description 1
- 229930185560 Pseudouridine Natural products 0.000 description 1
- PTJWIQPHWPFNBW-UHFFFAOYSA-N Pseudouridine C Natural products OC1C(O)C(CO)OC1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-UHFFFAOYSA-N 0.000 description 1
- 108020005067 RNA Splice Sites Proteins 0.000 description 1
- 108700020471 RNA-Binding Proteins Proteins 0.000 description 1
- 241000242739 Renilla Species 0.000 description 1
- 208000004756 Respiratory Insufficiency Diseases 0.000 description 1
- 206010038802 Reticuloendothelial system stimulated Diseases 0.000 description 1
- 241000730262 Rhodobacter capsulatus DE442 Species 0.000 description 1
- 241000730265 Rhodobacter capsulatus R121 Species 0.000 description 1
- 241000433126 Rhodobacter capsulatus SB 1003 Species 0.000 description 1
- 241000220317 Rosa Species 0.000 description 1
- 241000714474 Rous sarcoma virus Species 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 206010042566 Superinfection Diseases 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- 108010051583 Ventricular Myosins Proteins 0.000 description 1
- UBORTCNDUKBEOP-HAVMAKPUSA-N Xanthosine Natural products O[C@@H]1[C@H](O)[C@H](CO)O[C@H]1N1C(NC(=O)NC2=O)=C2N=C1 UBORTCNDUKBEOP-HAVMAKPUSA-N 0.000 description 1
- 241000274840 [Clostridium] aminophilum DSM 10710 Species 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 210000004100 adrenal gland Anatomy 0.000 description 1
- 210000004504 adult stem cell Anatomy 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 230000000735 allogeneic effect Effects 0.000 description 1
- 102000015395 alpha 1-Antitrypsin Human genes 0.000 description 1
- 108010050122 alpha 1-Antitrypsin Proteins 0.000 description 1
- 229940024142 alpha 1-antitrypsin Drugs 0.000 description 1
- VREFGVBLTWBCJP-UHFFFAOYSA-N alprazolam Chemical compound C12=CC(Cl)=CC=C2N2C(C)=NN=C2CN=C1C1=CC=CC=C1 VREFGVBLTWBCJP-UHFFFAOYSA-N 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 210000000040 apocrine gland Anatomy 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- WGDUUQDYDIIBKT-UHFFFAOYSA-N beta-Pseudouridine Natural products OC1OC(CN2C=CC(=O)NC2=O)C(O)C1O WGDUUQDYDIIBKT-UHFFFAOYSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 210000004204 blood vessel Anatomy 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 239000007975 buffered saline Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 210000003169 central nervous system Anatomy 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- 239000007979 citrate buffer Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- CVSVTCORWBXHQV-UHFFFAOYSA-N creatine Chemical compound NC(=[NH2+])N(C)CC([O-])=O CVSVTCORWBXHQV-UHFFFAOYSA-N 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 230000007850 degeneration Effects 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 210000005045 desmin Anatomy 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- ZPTBLXKRQACLCR-XVFCMESISA-N dihydrouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)CC1 ZPTBLXKRQACLCR-XVFCMESISA-N 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- 208000037765 diseases and disorders Diseases 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 210000000232 gallbladder Anatomy 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 238000002743 insertional mutagenesis Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 230000002601 intratumoral effect Effects 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 238000002684 laminectomy Methods 0.000 description 1
- 210000000867 larynx Anatomy 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 231100000053 low toxicity Toxicity 0.000 description 1
- 238000003468 luciferase reporter gene assay Methods 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 230000002132 lysosomal effect Effects 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 125000000956 methoxy group Chemical group [H]C([H])([H])O* 0.000 description 1
- 238000001000 micrograph Methods 0.000 description 1
- 230000033607 mismatch repair Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000009126 molecular therapy Methods 0.000 description 1
- 208000005264 motor neuron disease Diseases 0.000 description 1
- 210000002894 multi-fate stem cell Anatomy 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 210000004498 neuroglial cell Anatomy 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 108091027963 non-coding RNA Proteins 0.000 description 1
- 102000042567 non-coding RNA Human genes 0.000 description 1
- 230000030147 nuclear export Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 230000002023 papillomaviral effect Effects 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 210000001428 peripheral nervous system Anatomy 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 210000003800 pharynx Anatomy 0.000 description 1
- 239000002953 phosphate buffered saline Substances 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 210000004560 pineal gland Anatomy 0.000 description 1
- 210000003635 pituitary gland Anatomy 0.000 description 1
- 108010017843 platelet-derived growth factor A Proteins 0.000 description 1
- 210000001778 pluripotent stem cell Anatomy 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000001566 pro-viral effect Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- PTJWIQPHWPFNBW-GBNDHIKLSA-N pseudouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-GBNDHIKLSA-N 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 101150066583 rep gene Proteins 0.000 description 1
- 201000004193 respiratory failure Diseases 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 125000000548 ribosyl group Chemical group C1([C@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 210000001732 sebaceous gland Anatomy 0.000 description 1
- 210000001044 sensory neuron Anatomy 0.000 description 1
- 230000037432 silent mutation Effects 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 210000001685 thyroid gland Anatomy 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 210000003462 vein Anatomy 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 229940075420 xanthine Drugs 0.000 description 1
- UBORTCNDUKBEOP-UUOKFMHZSA-N xanthosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(NC(=O)NC2=O)=C2N=C1 UBORTCNDUKBEOP-UUOKFMHZSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14141—Use of virus, viral particle or viral elements as a vector
- C12N2750/14143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
Definitions
- the disclosure is directed to molecular biology, gene therapy, and compositions and methods for modifying expression and activity of RNA molecules.
- MRE microsatellite repeat expansion
- a GGGGCC (G4C2) hexanucleotide repeat expansion (HRE) in the first intron of C9ORF72 gene is the most common genetic cause of frontotemporal dementia and amyotrophic lateral sclerosis (c9ALS/FTD).
- Bidirectional transcription at the C9ORF72 repeat locus produces both sense G4C2 and antisense C4G2 containing transcripts.
- the disclosure provides a composition comprising a nucleic acid sequence encoding an RNA-binding polypeptide capable of binding a first target RNA sequence and a second target RNA sequence, wherein the first target RNA comprises a toxic hexanucleotide repeat RNA sequence.
- the toxic hexanucleotide repeat RNA sequence is GGGGCC (sense) or CCCCGG (antisense).
- the second target RNA sequence is GGGGCC, CCCCGG, or a flanking sequence thereof.
- the RNA-binding polypeptide is a guided RNA-binding polypeptide
- the composition comprises one or more guide RNAs (gRNAs), wherein the one or more gRNAs comprise a direct repeat (DR) sequence capable of binding to the guided RNA-binding polypeptide and a spacer sequence capable of binding to the first or second target RNA sequence.
- gRNAs guide RNAs
- DR direct repeat
- the spacer sequence is a sequence that comprises, consists essentially of, or consists of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to at least one of SEQ ID NO: 13 - SEQ ID NO: 131 or SEQ ID NO: 158 - SEQ ID NO: 169.
- the DR sequence comprises, consists essentially of, or consists of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to at least one of SEQ ID NO: 157, or SEQ ID NO: 170, SEQ ID NO: 171.
- the guided RNA-binding polypeptide is a Cas protein.
- the Cas protein is Casl3d.
- the Casl3d protein is encoded by a nucleic acid sequence that comprises, consists essentially of, or consists of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, or SEQ ID NO: 147.
- the nucleic acid sequence comprises one or more promoters.
- the one or more promoters are selected from a human U6 promoter, a mouse U6 promoter, a 7SK promoter, a tRAN valine promoter, an EFS promoter, or a CMV promoter.
- the disclosure provides a vector comprising the composition of any one of the preceding claims.
- the vector is selected from the group consisting of: adeno- associated virus (AAV), retrovirus, lentivirus, adenovirus, nanoparticle, micelle, liposome, lipoplex, polymersome, polyplex, and dendrimer.
- AAV adeno-associated virus
- retrovirus retrovirus
- lentivirus lentivirus
- adenovirus lentivirus
- nanoparticle micelle
- liposome lipoplex
- polymersome lipoplex
- polymersome polymersome
- polyplex polyplex
- dendrimer dendrimer
- the disclosure provides an AAV vector comprising a nucleic acid sequence encoding an RNA-binding polypeptide capable of binding a first target RNA sequence and a second target RNA sequence, wherein the first target RNA sequence comprises a toxic hexanucleotide repeat RNA sequence, wherein the AAV vector comprises: a first AAV ITR sequence; a first promoter sequence; a first gRNA comprising a spacer sequence capable of binding to the first target RNA sequence; a second gRNA comprising a spacer sequence capable of binding to the second target RNA sequence; a second promoter sequence; a polynucleotide sequence encoding for at least one repeat RNA-binding polypeptide; and a second AAV ITR sequence.
- the RNA-binding polypeptide is a Cas protein.
- the first promoter sequence is a PolIII promoter sequence.
- the first promoter sequence is a U6 promoter sequence.
- the U6 promoter sequence comprises, consists essentially of, or consists of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NO: 156 or SEQ ID NO: 172 - SEQ ID NO: 174.
- the spacer sequence of the first gRNA comprises comprises, consists essentially of, or consists of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NO: 13 - SEQ ID NO: 131 or SEQ ID NO: 158 - SEQ ID NO: 169.
- the spacer sequence of the second gRNA comprises comprises, consists essentially of, or consists of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NO: 13 - SEQ ID NO: 131 or SEQ ID NO: 158 - SEQ ID NO: 169.
- the second promoter sequence is a constitutive promoter or a tissue-specific promoter.
- the second promoter sequence comprises, consists essentially of, or consists of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NO: 155 or SEQ ID NO: 186.
- the Casl3d protein comprises, consists essentially of, or consists of an amino acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, or SEQ ID NO: 146.
- the Casl3d protein is encoded by a nucleic acid sequences that comprises, consists essentially of, or consists of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, or SEQ ID NO: 147.
- the first ITR sequence comprises, consists essentially of, or consists of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NO: 149 or SEQ ID NO: 150.
- the second ITR sequence comprises, consists essentially of, or consists of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NO: 149 or SEQ ID NO: 150.
- the AAV vector comprises, consists essentially of, or consists of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NO: 202 - SEQ ID NO: 218.
- the disclosure provides a cell comprising the vector of any one of the preceding claims.
- the disclosure provides a method of treating frontotemporal dementia (FTD) or amyotrophic lateral sclerosis (ALS) in a mammal having a toxic hexanucleotide repeat comprising administering a composition or AAV vector according to any one of the preceding claims to a toxic target repeat expansion RNA sequence comprising C4G2 and/or G4C2 in tissues of the mammal whereby the level of expression of the toxic target RNA is reduced.
- FTD frontotemporal dementia
- ALS amyotrophic lateral sclerosis
- the composition or AAV vector is administered to the subject intravenously, intrathecally, intracerebrally, intraventricularly, or subpially. In some aspects, the composition or AAV vector is administered to the subject intravenously. [0027] In some aspects, the reduced level of expression of the toxic target RNA thereby ameliorates symptoms of ALS or FTD in the mammal. In some aspects, the level of expression of the toxic target RNA is reduced compared to the level of expression of untreated toxic target repeat RNA. In some aspects, the level of reduction is between 1-fold and 20-fold.
- compositions of the disclosure or vectors of the disclosure further comprise a third target RNA, wherein the third target RNA is selected from the group consisting of GGGGCC, CCCCGG, flanking sequences adjacent to GGGGCC, and flanking sequences adjacent to CCCCGG.
- the Casl3d protei is a Casl3d variant engineered to improve on- target knockdown of G4C2 sense and C4G2 antisense flanking region.
- FIGS. 1A-D show a series of schematics depicting exemplary CRISPR/Casl3d targeting systems and RNA expression reporter constructs used herein for the targeting of sense and anti-sense strands of C9ORF72 hexanucleotide repeat-containing RNA.
- FIG. 1A depicts an exemplary CRISPR/Casl3d targeting system with a single guide target of hexanucleotide repeat sense strand G4C2.
- FIG. IB depicts an exemplary CRISPR/Casl3d targeting system with a dual array target of sense strand G4C2 and flanking sequence adjacent of anti-sense C4G2.
- FIG. 1C depicts an exemplary G4C2 sense strand RNA expression reporter construct.
- FIG. ID depicts an exemplary anti-sense C4G2 flanking sequence RNA expression reporter construct.
- FIGS. 2A-D show a series of bar graphs depicting knockdown of expression of RNA containing sense strand G4C2 and anti-sense C4G2 flanking regions in mammalian cells using the exemplary CRISPR/Casl3d targeting system in vitro.
- Mammalian cells were transfected with a first exemplary expression reporter construct expressing sense or anti-sense target RNA and a first exemplary CRISPR/Casl3d targeting system.
- Expression of target G4C2 RNA was determined using qPCR and normalized to GAPDH and cells treated with the non-targeting (NT) guide.
- FIG. 2A shows a bar graph depicting expression of sense strand G4C2 RNA in cells transfected with the NT Guide (P000MJ Single Lambda Guide) or Single Guide (P000II Single G4C2gl) vector.
- the x-axis depicts the vector.
- the y-axis depicts normalized G4C2 expression.
- FIG. 2B shows a bar graph depicting Casl3d expression in cells transfected with the NT Guide (P000MJ Single Lambda Guide), Single Guide (P000II Single G4C2gl), Dual Array #1 (P02928C9flankg20_G4C2gl) or Dual Array #2 (P02929C9flankgl9_G4C2gl) vector.
- the x-axis depicts the vector.
- the y-axis depicts normalized Casl3d expression.
- 2C shows a bar graph depicting expression of anti-sense C4G2 flanking sequence in cells transfected with the Luciferase antisense reporter, NT Guide (P000MJ Single Lambda Guide), Dual Array #1 (P02928C9flankg20_G4C2gl) or Dual Array #2
- FIG. 2D shows a bar graph depicting expression of sense strand G4C2 in cells transfected with the NT Guide (P000MJ Single Lambda Guide), Single Guide (P000II Single G4C2gl), Dual Array #1 (P02928C9flankg20_G4C2gl) or Dual Array #2 (P02929C9flankgl9_G4C2gl) vector.
- the x-axis depicts the vector.
- the y-axis depicts G4C2 normalized expression.
- FIGS. 3A-B show the elimination (knockdown) of sense strand G4C2 expression foci in host cells transfected with exemplary CRISPR/Casl3d targeting vectors.
- FIG. 3A shows a series of microscopy images of cells transfected with non-targeting (P000MJ Single Lambda Guide), Single Guide-G4C2 (P000II Single G4C2gl), Dual Array #1 (P02928C9flankg20_G4C2gl), or Dual Array #2 (P02929C9flankgl9_G4C2gl) vector. Expression of sense strand G4C2 was determined using RNA-FISH analysis. Cells were visualized by DAPI staining.
- FIG. 3B shows a bar graph depicting the average G4C2 expression foci per cell in the exemplary images depicted in FIG. 3A.
- FIGS. 4A-B show an exemplary model of CRISPR/Casl3d based RNA targeting knockdown of sense hexanucleotide repeat expansions in vivo.
- FIG. 4A shows a table depicting mouse strain, dosage of vehicle or AAV9-Casl3d-G4C2 vector (targeting the sense G4C2 repeats) administered and sample collection timepoint of the exemplary mouse model.
- FIG. 4B shows a schematic of the exemplary subpial injection, the tissue sites of collection (cervical, thoracic, lumbar) and corresponding samples collected (RNA, protein) of the exemplary in vivo Cast 3d based RNA targeting knockdown model.
- FIGS. 4A shows a table depicting mouse strain, dosage of vehicle or AAV9-Casl3d-G4C2 vector (targeting the sense G4C2 repeats) administered and sample collection timepoint of the exemplary mouse model.
- FIG. 4B shows a schematic of the exemplary sub
- FIG. 5A-D show a series of bar graphs depicting exemplary Casl3d based RNA targeting knockdown of sense G4C2 RNA hexanucleotide repeat expansions in vivo using the exemplary mouse model depicted in FIG. 4B.
- FIG. 5A shows a bar graph depicting Cast 3d RNA expression in tissue following subpial injection of vector or AAV9-Casl3d-G4C2.
- the x-axis depicts the tissue.
- the y-axis depicts Casl3 expression. Casl3 expression was determined using ddPCR and normalized to Atp5b.
- FIG. 5A shows a bar graph depicting Cast 3d RNA expression in tissue following subpial injection of vector or AAV9-Casl3d-G4C2.
- the x-axis depicts the tissue.
- the y-axis depicts Casl3 expression. Casl3 expression was determined using ddPCR and normalized to
- 5B shows a bar graph depicting Cast 3 protein levels in tissue following subpial injection of vector or AAV9-Casl3d-G4C2.
- the x- axis depicts the tissue.
- the y-axis depicts Casl3 protein signal determined using Meso Scale Discovery (MSD).
- FIG. 5C shows a bar graph depicting G4C2 sense strand (pathological variant) RNA expression relative to total C9ORF72 gene expression in mouse tissue following subpial injection of vector or AAV9-Casl3d-G4C2.
- the x-axis depicts the tissue.
- the y-axis depicts relative G4C2 RNA expression.
- FIG. 5D shows a bar graph depicting expression of C9 total RNA relative to Atp5b reference gene expression in mouse tissue following subpial injection of vector or AAV9-Casl3d-G4C2.
- the x-axis depicts the tissue.
- the y-axis depicts relative C9 total RNA expression.
- FIGS. 6A-D show a series of bar graphs depicting knockdown of expression of RNA containing sense G4C2 and antisense C4G2 repeats, as well as decrease in dipeptide repeats (DPRs) poly-GP aggregates in C9-ALS derived fibroblasts after treatment with Casl3d multi-targeting (MT) construct.
- C9-ALS derived fibroblasts were transduced with lentivirus expressing Casl3d and selected with puromycin 48 h after transduction. After selection cells were transduced with the lentivirus for guide expression targeting sense and antisense transcripts and harvested 72 h after the second transduction.
- FIG. 6A Bar graph depicting expression of sense strand G4C2 RNA (FIG. 6A) and antisense RNA (FIG. 6B) in C9-ALS fibroblasts transduced with Cast 3d (Seq212) and with the NT guide or multi -targeting guide array (MT).
- the x-axis depicts the treatment.
- the y-axis depicts normalized sense or antisense expression.
- FIGs. 6C-D shows bar graphs depicting Cast 3d protein expression (FIG. 6C) or poly-GP DPR (FIG.
- FIG. 6D by MSD in C9-ALS fibroblasts transduced with Casl3d and NT guide or multi -targeting (MT) guide array.
- the y-axis depicts firnol of Casl3d per total mg of protein (FIG. 6C) or raw signal for poly-GP (FIG. 6D).
- the x-axis depicts the treatment.
- FIGS. 7A-F show a series of bar graphs depicting RNA knockdown of sense G4C2 and antisense C4G2 RNAs, as well as decrease in dipeptide repeats (DPRs) poly-GP aggregates in C9-ALS mouse model C9 B AC-500 (containing 500 G4C2 repeats) after 6 weeks subpial injection of AAV9 Casl3d multi-targeting (MT) (A02562).
- FIG. 7A shows a table depicting mouse strain, dosage of vehicle or AAV9-Casl3d MT vector administered and sample collection timepoint of the exemplary mouse model.
- FIG. 7B shows a schematic of the exemplary subpial injection, the tissue sites of collection (cervical, thoracic, lumbar) and corresponding samples collected (RNA, protein) of the exemplary in vivo Casl3d-MT based RNA targeting knockdown model.
- Decrease on sense RNA (V3 pathological isoform) (FIG. 7C) and antisense RNA (FIG. 7D) is shown by ddPCR as copies of the respective transcript per 1000 copies of the Atp5b reference gene.
- RNA was extracted from the lumbar region, close to the site of injection.
- the x-axis depicts the treatment.
- the y-axis depicts relative sense RNA expression (FIG. 7C) or antisense (FIG. 7D).
- Casl3d protein expression in tissue (thoracic region) following subpial injection of vector (Vehicle) or AAV9-Casl3d-MT (A02562) is shown in (FIG. 7E) and poly-GP DPRs in (FIG. 7F).
- the x-axis depicts the tissue (T represents Thoracic T9-12) and treatment, and the y-axis depicts Casl3 expression (fmol per mg of total protein) (FIG. 7E) or raw signal for poly-GP (FIG. 7F) using Meso Scale Discovery (MSD) assay.
- FIGS. 8A-E show a series of bar graphs depicting RNA knockdown of G4C2 sense and antisense C4G2 repeats, as well as decrease in dipeptide repeats (DPRs) poly-GP aggregates in C9-ALS mouse model C9 BAC-500 (containing 500 G4C2 repeats) after 6 weeks intrastriatal injection of AAV9 Casl3d multi -targeting (MT) (A02562). Mice were injected unilaterally using AAV9 Casl3d MT with the non-injected side used as a negative control tissue, and two age groups were used, 17 weeks and 24 weeks-old mice, for evaluation.
- FIG. 8A shows a table depicting mouse strain, dosage of AAV9-Casl3d MT vector administered, mouse age at the treatment, and sample collection timepoint of the exemplary mouse model.
- sense RNA pathological isoform
- FIG. 8C antisense RNA
- the x-axis depicts the groups and treatment.
- the y-axis depicts relative G4C2 sense RNA expression (FIG. 8B) or antisense RNA (FIG. 8C).
- FIGS. 8D and 8E Casl3d expression in tissue following intrastriatal injection of AAV9-Casl3d-MT (A02562) is shown in (FIG. 8D) and poly-GP DPRs in (FIG. 8E) relative to untreated tissue (contralateral control).
- the x-axis depicts the groups and treatment, and the y-axis depicts Cast 3 expression (fmol per mg of total protein; in FIG. 8D) or relative poly-GP DPRs normalized to control untreated groups (FIG. 8E) using Meso Scale Discovery (MSD) assay.
- MSD Meso Scale Discovery
- FIG. 9A-B shows a schematic of the reporter construct containing the sense flanking region adjacent to the G4C2 repeats added downstream to firefly luciferase and a bar graph depicting knockdown of the sense flanking RNA in mammalian cells using the Cast 3d targeting system in vitro.
- FIG. 9A shows the reporter construct expressing the sense flanking region adjacent to the G4C2 repeats added downstream to firefly luciferase (FLUC) for screening purposes.
- FIG. 9B is bar graph depicting expression of sense flanking strand RNA in cells transfected with Casl3d and NT control or targeting guides.
- Mammalian cells were transfected with the luciferase reporter containing the sense flanking region, Cast 3d and the NT Guide (P000MJ Single Lambda Guide; negative control), firefly luciferase Guide (positive control), and guides targeting the sense flanking region.
- the x-axis depicts the guides used with Casl3d.
- the y-axis depicts expression of target RNA determined by firefly luciferase luminescence normalized to renilla luciferase (RLUC) transfection control and NT (nontargeting condition).
- FIGS. 10A-E show optimized Casl3d variants that have been engineered to improved on-target knockdown of both G4C2 sense and C4G2 antisense flanking region.
- FIG. 10A is a schematic of reporter plasmid used to express G4C2 repeats driven by the CMV promoter.
- FIG. 10B is a bar graph showing quantitation of G4C2 RNA levels (y-axis) normalized to reference gene GAPDH and NT guide control condition after multiple treatments (x-axis). Wildtype (wt) Casl3d was co-transfected with non-targeting (NT) and G4C2- targeting guide (red and green, respectively).
- FIG. 10C is a schematic of C4G2 flanking sequence reporter plasmid cloned downstream of firefly luciferase ORF (FLUC). Renilla luciferase (RLUC) used as transfection control.
- FLUC firefly luciferase ORF
- RLUC Renilla luciferase
- DZE Bar graphs of two independent experiments testing C4G2 antisense flank knockdown in luciferase assay. Antisense C4G2 flanking level on y-axis, Seq212 variants on x-axis.
- FIG. 10D shows Seq212 point mutation variants
- FIG. 10E shows a separate experiment testing RNA binding domains (RBD) tethered to Seq212. Dotted lines denote wildtype Seq212 knockdown levels.
- RNA-targeting gene therapy compositions and methods for treating HRE causing diseases and/or disorders such as ALS and FD are provided.
- a GGGGCC (G4C2) hexanucleotide repeat expansion (HRE) in the first intron of C9ORF72 gene is the most common genetic cause of frontotemporal dementia (FTD) and amyotrophic lateral sclerosis (c9ALS/FTD).
- FTD frontotemporal dementia
- c9ALS/FTD amyotrophic lateral sclerosis
- Bidirectional transcription at the C9ORF72 repeat locus produces both sense G4C2 and antisense C4G2 containing transcripts.
- ALS is the most common motor neuron disease.
- the disease pathology involves the degeneration of upper and lower motor neurons, causing muscle weakness, spasticity, and paralysis leading to respiratory failure.
- Patients afflicted with ALS typically have about 700- 5000 repeats whereas a healthy patient has less than 30 repeats. See Ling, S.C. et al., Neuron (2013); Dejesus-Hernandez, M. et al., Neuron (2011); Renton et al., Neuron (2011); van Bittlerswijk et al. Curr Opin Neurol (2013).
- RNA toxicity of the sense and antisense hexanucleotide repeats and DPR (dipeptide proteins) toxicity caused by repeat associated non-ATG (RAN) translation which leads to expression of DPR.
- RAN repeat associated non-ATG
- G4C2 targeting CRISPR/Casl3d compositions showed no overt safety concerns in wildtype mice at 8 weeks post-subpial delivery.
- these constructs reduced the G4C2-HRE containing isoforms of C9ORF72 in both cultured neonatal cortical neurons and in the spinal cord of transgenic C9ORF72 mice containing 500- G4C2 repeats following subpial delivery, while largely preserving total C9ORF72 transcript (transcripts from both alleles and including normal non-expanded isoforms) levels.
- MT Casl3d-based multi-targeting
- AAV9 Casl3d MT RNA knockdown of G4C2 sense and C4G2 antisense transcripts, as well as decrease in dipeptide repeats (DPRs) poly- GP aggregates were shown in in vivo.
- DPRs dipeptide repeats
- the disclosure provides RNA-guided RNA-binding systems.
- the RNA-guided RNA-binding system is an RNase Cas-based RNA-guided RNA-binding polypeptide.
- a nucleic acid sequence encodes an RNA- guided RNA-binding polypeptide which is an RNase Cas protein (or a deactivated RNase Cas protein).
- the nucleic acid sequence further comprises a gRNA sequence comprising a spacer sequence which binds to a toxic target repeat RNA and a direct repeat (DR) sequence which binds to the RNase Cas protein.
- DR direct repeat
- a Casl3d system is catalytically active, in which case, the Casl3d nucleoprotein complex cleaves and destroys toxic RNA repeats.
- a Casl3d system is catalytically inactive, in which case, the Casl3d nucleoprotein complex binds and blocks (but does not cleave) the RNA repeats.
- a Cas 13d comprises a catalytically inactive Casl3d fused to an endonuclease which is capable of cleaving the toxic RNA repeats.
- the endonuclease is an active RNase.
- Exemplary endonucleases with RNase activity can be found herein, and these include, for example, a domain from a ZC3H12A zinc-finger (also referred herein as E17) or a PIN endonuclease.
- the RNase Cas protein is a Casl3 protein.
- the Casl3 protein is a Casl3d protein.
- the Casl3d protein is a deactivated RNase Casl3d protein (dCasl3d).
- the dCasl3d protein is a fusion protein comprising 1) dCasl3d and 2) a polypeptide encoding a protein or fragment thereof having nuclease activity.
- the dCasl3d protein is a fusion protein comprising 1) dCasl3d and 2) a nuclease domain of ZC3H12A, a zinc-finger endonuclease, (referred to as El 7 herein).
- the Cas configuration comprises a signal sequence(s) such as NLS(s) and/or NES(s).
- an SV-40 NLS can comprise, consist essentially of, or consist of an amino acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 148.
- an SV-40 NLS can be encoded a nucleic acid sequence that can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 153.
- a cMyc NLS can comprise, consist essentially of, or consist of an amino acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 179.
- a cMyc NLS can be encoded a nucleic acid sequence that can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 188.
- a focal adhesion kinase (FAK) NES can comprise, consist essentially of, or consist of an amino acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 178.
- an FAK NES can be encoded a nucleic acid sequence that can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 187.
- the dCasl3d is linked to E17 via a linker sequence.
- the linker sequence is VDTANGS.
- the nucleic acid sequence encoding the Cast 3d or dCasl3d fusion proteins are operably linked to at least one promoter sequence.
- the promoter sequence comprises an enhancer and/or an intron.
- the promoter sequence is a constitutive promoter such as, EIFla or its truncated form, the EFS promoter sequence, or the full length or truncated (tCAG) promoter sequence or the EFS/UBB promoter sequence.
- the promoter is a tissue specific promoter such as the neuron specific synapsin promoter sequence.
- the nucleic acid sequence comprises a first promoter sequence that controls expression of a Cast 3d protein or Cast 3d fusion protein and a second promoter sequence that controls expression of the at least one guide RNA sequence.
- the sequence encoding the RNA-guided RNA binding protein comprises a sequence isolated or derived from a protein with no DNA nuclease activity.
- the sequence encoding the RNA-guided RNA binding protein disclosed herein comprises a sequence isolated or derived from a CRISPR Cas protein.
- the CRISPR Cas protein is not a Type II CRISPR Cas protein.
- the CRISPR Cas protein is not a Cas9 protein.
- the sequence encoding the RNA-guided RNA binding protein comprises a Type VI CRISPR Cas protein or portion thereof.
- the Type VI CRISPR Cas protein comprises a Cas 13 protein or portion thereof.
- Exemplary Casl3 proteins of the disclosure may be isolated or derived from any species, including, but not limited to, bacteria or archaea.
- Exemplary Cast 3 proteins of the disclosure may be isolated or derived from any species, including, but not limited to, Leptotrichia wadei, Listeria seeligeri serovar l/2b (strain ATCC 35967 / DSM 20751 / CIP 100100 / SLCC 3954), Lachnospiraceae bacterium, Clostridium aminophilum DSM 10710, Carnobacterium gallinarum DSM 4847, Paludibacter propionicigenes WB4, Listeria weihenstephanensis FSL R9-0317, Listeria weihenstephanensis FSL R9-0317, bacterium FSL M6-0635 (Listeria newyorkensis), Leptotrichia wadei F0279, Rhodobacter capsulatus SB 1003, Rhodobacter capsulatus R121, Rhodobacter capsulatus DE442 and Corynebacterium ulcerans.
- Leptotrichia wadei Listeria see
- Exemplary Casl3 proteins of the disclosure may be DNA nuclease inactivated.
- Exemplary Casl3 proteins of the disclosure include, but are not limited to, Casl3a, Casl3b, Casl3c, Casl3d and orthologs thereof.
- Exemplary Casl3b proteins of the disclosure include, but are not limited to, subtypes 1 and 2 referred to herein as Csx27 and Csx28, respectively.
- the sequence encoding the RNA binding protein comprises a sequence isolated or derived from a Casl3d protein.
- Casl3d is an effector of the type VI-D CRISPR-Cas systems.
- the Casl3d protein is an RNA-guided RNA endonuclease enzyme that can cut or bind RNA.
- the Cast 3d protein can include one or more higher eukaryotes and prokaryotes nucleotide-binding (HEPN) domains.
- the Cast 3d protein can include either a wild-type or mutated HEPN domain.
- the Cast 3d protein includes a mutated HEPN domain that cannot cut RNA but can process guide RNA. In some embodiments, the Cast 3d protein does not require a protospacer flanking sequence. Also see WO Publication No. W02019/040664 & US2019/0062724, which is incorporated herein by reference in its entirety, for further examples and sequences of Cast 3d protein, without limitation.
- Casl3d sequences of the disclosure include without limitation SEQ ID NOS: 1-296 of WO 2019/040664, so numbered herein and included herewith.
- Yan et al. (2016) Mol Cell. 70(2):327-339 (doi: 10.1016/j.molcel.2018.02.2018) and Konermann et al. (2016) Cell 173(3):665-676 (doi: 10.1016/j .cell/2018.02.033) have described Casl3d proteins and both of which are incorporated by reference herein in their entireties.
- a Casl3d seq212 protein can comprise, consist essentially of, or consist of an amino acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 146.
- a Casl3d seq212 protein can be encoded a nucleic acid sequence that can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 147.
- the disclosure provides Casl3d variants for enhanced C9 G4C2 sense and antisense knockdown.
- Exemplary Casl3d variants are referred to herein as Seq212.
- Exemplary Casl3d Seq212 variants can comprise, consist essentially of, or consist of a nucleotide sequence or amino acid sequences as set forth below.
- Cas polypeptides are fused to at least one protein and/or enzyme. In some aspects, Cas polypeptides of the disclosure are fused to adenosine deaminase acting on RNA (ADAR) or a domain thereof. In some aspects, the ADAR is an AD ARI . In some aspects, the domain of ADAR can be the Z alpha (Za) or Z beta (Zb) domain.
- Za and Zb ADAR binding domains can be tethered to Cas polypeptides, including Cast 3d Seq212, in order to enhance recruitment of target RNAs for cleavage. Without wishing to be bound by theory, positioning of the ADAR binding domains close to the active Cas 13d endonuclease site improves on -target RNA knockdown.
- the Za ADAR binding domain can comprise, consist essentially of, or consist of the nucleotide sequence:
- the Za ADAR binding domain can comprise, consist essentially of, or consist of the amino acid sequence: GSHMLSIYQDQEQRILKFLEELGEGKATTAHDLSGKLGTPKKEINRVLYSLAKKGKL QKEAGTPPLWKIAVSTQAWNQHSG (SEQ ID NO: 199).
- the Zb ADAR binding domain can comprise, consist essentially of, or consist of the nucleotide sequence:
- the Zb ADAR binding domain can comprise, consist essentially of, or consist of the amino acid sequence:
- GSHMASLDMAEIKEKICDYLFNVSDSSALNLAKNIGLTKARDINAVLIDMERQGDVY RQGTTPPIWHLTDKKRERMQIK (SEQ ID NO: 201).
- a Casl3d seq212 protein can comprise, consist essentially of, or consist of an amino acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 146.
- a Cast 3d seq212 protein can be encoded a nucleic acid sequence that can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 147.
- a Casl3d variant having mutations D823A, D830A, T832A, K827A, T213A can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 1.
- a Casl3d variant having mutations D823A, D830A, T832A, K827A, T213A can be encoded a nucleic acid sequence that can comprise, consist essentially of, or consist of anamino acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 2.
- a Casl3d variant having mutations D823A, D830A, T832A, K827A, Y215A can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 3.
- a Casl3d variant having mutations D823A, D830A, T832A, K827A, Y215A can be encoded a nucleic acid sequence that can comprise, consist essentially of, or consist of an amino acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 4.
- a Cast 3d variant having mutations S808A, K825A, K826A, K827A can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 5.
- a Cast 3d variant having mutations S808A, K825A, K826A, K827A can be encoded a nucleic acid sequence that can comprise, consist essentially of, or consist of an amino acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 6.
- a Casl3d variant having mutations A784S can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 7.
- a Casl3d variant having mutations A784S can be encoded a nucleic acid sequence that can comprise, consist essentially of, or consist of an amino acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 8.
- a Cast 3d seq 212 sequencing having a Za ADAR domain tethered between L228 and K229 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 9.
- a Cast 3d seq 212 sequencing having a Za ADAR domain tethered between L228 and K229 can be encoded a nucleic acid sequence that can comprise, consist essentially of, or consist of an amino acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 10.
- a Cast 3d seq 212 sequencing having a Zb ADAR domain tethered between L228 and K229 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 11.
- a Cast 3d seq 212 sequencing having a Zb ADAR domain tethered between L228 and K229 can be encoded a nucleic acid sequence that can comprise, consist essentially of, or consist of an amino acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 12.
- gRNA guide RNA
- sgRNA single guide RNA
- Guide RNAs of the disclosure may comprise of a “direct repeat” (DR) sequence and a spacer sequence.
- a guide RNA is a single guide RNA (sgRNA) comprising a contiguous DR sequence and spacer sequence.
- the spacer sequence and the DR sequence are not contiguous.
- the gRNA comprises a DR sequence.
- DR sequences refer to the repetitive sequences in the CRISPR locus (naturally-occurring in a bacterial genome or plasmid) that are interspersed with the spacer sequences.
- a guide RNA comprises a direct repeat (DR) sequence and a spacer sequence.
- a sequence encoding a guide RNA or single guide RNA of the disclosure comprises or consists of a spacer sequence and a DR sequence, that are separated by a linker sequence.
- the linker sequence may comprise or consist of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50 or any number of nucleotides (nt) in between.
- the linker sequence may comprise at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50 or any number of nucleotides in between.
- the DR sequence is a Casl3d DR sequence.
- the gRNA that hybridizes with the one or more target RNA molecules in a Casl3d-mediated manner includes one or more direct repeat (DR) sequences, one or more spacer sequences, such as, e.g., one or more sequences comprising an array of DR- spacer-DR-spacer and so on.
- DR direct repeat
- spacer sequences such as, e.g., one or more sequences comprising an array of DR- spacer-DR-spacer and so on.
- a plurality of gRNAs are generated from a single array, wherein each gRNA can be different, for example target different RNAs or target multiple regions of a single RNA, or combinations thereof.
- an isolated gRNA includes one or more direct repeat sequences, such as an unprocessed (e.g., about 36 nt) or processed DR (e.g., about 30 nt).
- a gRNA can further include one or more spacer sequences specific for (e.g., is complementary to) the target RNA.
- multiple polIII promoters can be used to drive multiple gRNAs, spacers and/or DRs.
- a guide array comprises a DR (about 36nt)-spacer (about 30nt)-DR (about 36nt)-spacer (about 30nt).
- Guide RNAs (gRNAs) of the disclosure may comprise non-naturally occurring nucleotides.
- a guide RNA of the disclosure or a sequence encoding the guide RNA comprises or consists of modified or synthetic RNA nucleotides.
- RNA nucleotides include, but are not limited to, pseudouridine ( ), dihydrouridine (D), inosine (I), and 7-m ethylguanosine (m7G), hypoxanthine, xanthine, xanthosine, 7- methylguanine, 5, 6-Dihydrouracil, 5-methylcytosine, 5-methylcytidine, 5- hydropxymethylcytosine, isoguanine, and isocytosine.
- pseudouridine ), dihydrouridine (D), inosine (I), and 7-m ethylguanosine (m7G), hypoxanthine, xanthine, xanthosine, 7- methylguanine, 5, 6-Dihydrouracil, 5-methylcytosine, 5-methylcytidine, 5- hydropxymethylcytosine, isoguanine, and isocytosine.
- Guide RNAs (gRNAs) of the disclosure may bind modified RNA within a target sequence.
- guide RNAs (gRNAs) of the disclosure may bind modified or mutated (e.g., pathogenic) RNA.
- exemplary epigenetically or post-transcriptionally modified RNA include, but are not limited to, 2’-O-Methylation (2’-0Me) (2’-O-methylation occurs on the oxygen of the free 2’-OH of the ribose moiety), N6-methyladenosine (m6A), and 5-methylcytosine (m5C).
- a guide RNA of the disclosure comprises at least one sequence encoding a non-coding C/D box small nucleolar RNA (snoRNA) sequence.
- the snoRNA sequence comprises at least one sequence that is complementary to the target RNA, wherein the target sequence of the RNA molecule comprises at least one 2’-OMe.
- the snoRNA sequence comprises at least one sequence that is complementary to the target RNA, wherein the at least one sequence that is complementary to the target RNA comprises a box C motif (RUGAUGA) and a box D motif (CUGA).
- Spacer sequences of the disclosure bind to the target sequence of an RNA molecule. In some embodiments, spacer sequences of the disclosure bind to pathogenic target RNA.
- the sequence comprising the gRNA further comprises a spacer sequence that specifically binds to the target RNA sequence.
- the spacer sequence has at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 87%, 90%, 95%, 97%, 99% or any percentage in between of complementarity to the target RNA sequence.
- the spacer sequence has 100% complementarity to the target RNA sequence.
- the spacer sequence comprises or consists of 20 nucleotides.
- the spacer sequence comprises or consists of 21 nucleotides, 22 nucleotides, 23 nucleotides, 24 nucleotides, 25 nucleotides, 26 nucleotides, 27 nucleotides, 28 nucleotides, 29 nucleotides, 30 nucleotides, 31 nucleotides, 32 nucleotides, 33 nucleotides, 34 nucleotides, 35 nucloetides, 36 nucleotides, 37 nucleotdes, 38 nucleotides, 39 nucleotides, or 40 nucleotides.
- the spacer sequence comprises or consists of 26 nucleotides.
- the spacer sequence is nonprocessed and comprises or consists of 30 nucleotides.
- the nonprocessed spacer sequence comprises or consists of 30-36 nucleotides.
- DR sequences of the disclosure bind the Cas polypeptide of the disclosure.
- the Cas protein bound to the DR sequence of the gRNA is positioned at the target RNA sequence.
- a DR sequence binds to the Cas protein via a nucleic acid - amino acid interaction, typically through hydrogen bonds and salt bridges.
- DR sequences disclosed herein have a sequence % identity of at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96, 97%, 98%, 99%, or any percentage identity in between. In some embodiments, a DR sequence disclosed herein has 100% sequence identity.
- DR sequences of the disclosure comprise a secondary structure or a tertiary structure.
- Exemplary secondary structures include, but are not limited to, a helix, a stem loop, a bulge, a tetraloop and a pseudoknot.
- Exemplary tertiary structures include, but are not limited to, an A-form of a helix, a B-form of a helix, and a Z- form of a helix.
- Exemplary tertiary structures include, but are not limited to, a twisted or helicized stem loop.
- Exemplary tertiary structures include, but are not limited to, a twisted or helicized pseudoknot.
- DR sequences of the disclosure comprise at least one secondary structure or at least one tertiary structure.
- DR sequences of the disclosure comprise one or more secondary structure(s) or one or more tertiary structure(s).
- a guide RNA or a portion thereof selectively binds to a tetraloop motif in an RNA molecule of the disclosure.
- a target sequence of an RNA molecule comprises a tetraloop motif.
- the tetraloop motif is a “GRNA” motif comprising or consisting of one or more of the sequences of GAAA, GUGA, GCAA or GAGA.
- a guide RNA or a portion thereof that binds to a target sequence of an RNA molecule hybridizes to the target sequence of the RNA molecule.
- a guide RNA or a portion thereof that binds to a first RNA binding protein or to a second RNA binding protein covalently binds to the first RNA binding protein or to the second RNA binding protein.
- a guide RNA or a portion thereof that binds to a first RNA binding protein or to a second RNA binding protein non-covalently binds to the first RNA binding protein or to the second RNA binding protein.
- a guide RNA or a portion thereof comprises or consists of between 10 and 100 nucleotides, inclusive of the endpoints.
- a spacer sequence of the disclosure comprises or consists of between 10 and 30 nucleotides, inclusive of the endpoints.
- a spacer sequence of the disclosure comprises or consists of 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, or 40 nucleotides.
- the spacer sequence of the disclosure comprises or consists of 20 nucleotides.
- the spacer sequence of the disclosure comprises or consists of 21 nucleotides.
- the spacer sequence of the disclosure comprises or consists of 26 nucleotides.
- Guide molecules generally exist in various states of processing.
- an unprocessed guide RNA is 36nt of DR followed by 30-32 nt of spacer.
- the guide RNA is processed (truncated/modified) by Cas 13d itself or other RNases into the shorter "mature" form.
- an unprocessed guide sequence is about, or at least about 30, 35, 40, 45, 50, 55, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150 or more nucleotides (nt) in length.
- an unprocessed guide sequence is 102 nt in length, e.g., DR 36 nt, spacer 30 nt and DR is 36 nt.
- a processed guide sequence is about 44 to 70 nt (such as 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150 nt).
- an unprocessed spacer is about 28-40 nt long (such as 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, or 40 nt) while the mature (processed) spacer can be about 10 to 30 nt, 10 to 25 nt, 14 to 25 nt, 20 to 22 nt, or 14-30 nt (such as 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, or 35 nt).
- an unprocessed DR is about 36 nt (such as 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or 41 nt), while the processed DR is about 30 nt (such as 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, or 35 nt).
- a DR sequence is truncated by 1-10 nucleotides (such as 1, 2, 3, 4, 5, 6, 7, 8, 9, to 10 nucleotides at e.g., the 5’ end in order to be expressed as mature pre-processed guide RNAs.
- a guide RNA or a portion thereof does not comprise a nuclear localization sequence (NLS).
- NLS nuclear localization sequence
- a guide RNA or a portion thereof comprises a sequence complementary to a protospacer flanking sequence (PFS).
- PFS protospacer flanking sequence
- the first RNA binding protein may comprise a sequence isolated or derived from a Cas 13 protein.
- the first RNA binding protein may comprise a sequence encoding a Cas 13 protein or an RNA-binding portion thereof.
- the guide RNA or a portion thereof does not comprise a sequence complementary to a PFS.
- compositions and vectors comprising two guide RNA molecules comprising spacer sequences binding distinct target RNA sequences.
- the first spacer sequence binds a G4C2 (sense) target RNA sequence and the second spacer sequence binds a sense flanking sequence.
- the first spacer sequence binds a C4G2 (antisense) target RNA sequence and the second spacer sequence binds an antisense flanking sequence.
- two targets are selected from the group consisting of GGGGCC (G4C2) (sense), flanking sequences adjacent to GGGGCC (G4C2) (sense), and flanking sequences adjacent to CCCCGG (C4G2) (antisense).
- two targets are selected from the group consisting of CCCCGG (C4G2) (antisense), flanking sequences adjacent to CCCCGG (C4G2) (antisense), and flanking sequences adjacent to GGGGCC (G4C2).
- target sequences can comprise intronic or exonic sequences.
- the intronic or exonic sequence can flank an HRE target sequence of the disclosure.
- G4C2 spacers for gRNA constructs disclosed herein are:
- Exemplary C4G2 spacers for gRNA constructs disclosed herein are:
- Exemplary spacers for gRNAs comprising C9 C4G2 flanking regions are:
- Table 3 Exemplary spacers for gRNAs comprising C9 C4G2 flanking regions
- Exemplary guide array sequences targeting the sense intronic flanking region are:
- Table 4 Exemplary guide array sequences targeting the sense intronic flanking region
- a guide array comprises at least two guide RNA sequences. In some aspects, the at least two guide RNA sequences bind non-overlapping target RNA sequences. In some aspects, guide arrays of the disclosure comprise at least three guide RNA sequences. In some aspects, guide arrays of the disclosure comprise one gRNA that binds a sense strand and a second gRNA that binds an anti-sense strand. In some aspects, guide arrays of the disclosure comprise one gRNA that binds a sense strand and a second gRNA that also binds a sense strand.
- guide arrays of the disclosure comprise one gRNA that binds an anti-sense strand and a second gRNA that binds an anti-sense strand.
- Guide arrays of the disclosure can be delivered to a cell or subject as part of an AAV vector of the disclosure. In some aspects, following delivery to the subject or cell, the guide array is processed to yield individual gRNA sequences.
- Guide arrays of the disclosure can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to any one of SEQ ID NO: 132 - SEQ ID NO: 143.
- an AAV vector of the disclosure comprises a guide array that can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to any one of SEQ ID NO: 132 - SEQ ID NO: 143.
- Exemplary guide arrays i.e. gRNA sequences disclosed herein are:
- compositions of the disclosure bind and destroy a target sequence of an RNA molecule comprising a pathogenic repeat sequence.
- the target RNA comprises a sequence motif corresponding to or complementary to a spacer sequence of the guide RNA corresponding to the RNA-guided RNA-binding protein.
- one or more spacer sequences are used to target one or more target sequences.
- multiple spacers are used to target multiple target RNAs.
- Such target RNAs can be different target sites within the same RNA molecule or can be different target sites within different RNA molecules.
- Spacer sequences can also target non-coding RNA.
- multiple promoters e.g., Pol III promoters can be used to drive multiple spacers in a gRNA for targeting multiple target RNAs.
- the destruction or blocking of the target RNA(s) or target sequence motif(s) reduces expression of pathogenic HRE repeat RNA.
- the destruction or blocking of the target repeat RNA(s) or target sequence motif(s) reduces expression of pathogenic HRE repeat RNA (C4G2 and/or G4C2, and/or flanking sequences thereof) thereby treating C9ORF72 disease such as ALS or FTD and/or ameliorating one or more symptoms associated with the diseases.
- the sequence motif of the target RNA is a signature of a disease or disorder.
- a sequence motif of the disclosure may be isolated or derived from a sequence of foreign or exogenous sequence found in a genomic sequence, and therefore translated into an mRNA molecule of the disclosure or a sequence of foreign or exogenous sequence found in an RNA sequence of the disclosure.
- a target sequence motif of the disclosure may comprise or consist of a repeated sequence.
- the repeated sequence may be associated with a microsatellite instability (MSI).
- MSI microsatellite instability
- a hypervariable sequence of DNA may be transcribed into an mRNA of the disclosure comprising a target sequence comprising or consisting of the hypervariable sequence.
- a target sequence motif of the disclosure may comprise or consist of a biomarker.
- the biomarker may indicate a risk of developing a disease or disorder.
- the biomarker may indicate a healthy gene (and a low or no determinable risk of developing a disease or disorder).
- the biomarker may indicate an edited gene.
- Exemplary biomarkers include, but are not limited to, single nucleotide polymorphisms (SNPs), sequence variations or mutations, epigenetic marks, splice acceptor sites, exogenous sequences, heterologous sequences, and any combination thereof.
- a target sequence motif of the disclosure may comprise or consist of a secondary, tertiary or quaternary structure.
- the secondary, tertiary or quaternary structure may be endogenous or naturally occurring.
- the secondary, tertiary or quaternary structure may be induced or non-naturally occurring.
- the secondary, tertiary or quaternary structure may be encoded by an endogenous, exogenous, or heterologous sequence.
- a target sequence of an RNA molecule comprises or consists of between 2 and 100 nucleotides or nucleic acid bases, inclusive of the endpoints. In some embodiments, the target sequence of an RNA molecule comprises or consists of between 2 and 50 nucleotides or nucleic acid bases, inclusive of the endpoints. In some embodiments, the target sequence of an RNA molecule comprises or consists of between 2 and 20 nucleotides or nucleic acid bases, inclusive of the endpoints. In some embodiments, the target sequence of an RNA molecule comprises or consists of between 20-30 nucleotides or nucleic acid bases, inclusive of the endpoints. In some embodiments, the target sequence of an RNA molecule comprises or consists of about 26 nucleotides or nucleic acid bases, inclusive of the endpoints.
- a target sequence of an RNA molecule is continuous.
- the target sequence of an RNA molecule is discontinuous.
- the target sequence of an RNA molecule may comprise or consist of one or more nucleotides or nucleic acid bases that are not contiguous because one or more intermittent nucleotides are positioned in between the nucleotides of the target sequence.
- a target sequence of an RNA molecule is naturally occurring.
- the target sequence of an RNA molecule is non-naturally occurring.
- Exemplary non-naturally occurring target sequences may comprise or consist of sequence variations or mutations, chimeric sequences, exogenous sequences, heterologous sequences, chimeric sequences, recombinant sequences, sequences comprising a modified or synthetic nucleotide or any combination thereof.
- a target sequence of an RNA molecule binds to a guide RNA of the disclosure. In some embodiments of the compositions and methods of the disclosure, one or more target sequences of an RNA molecule binds to one or more guide RNA spacer sequences of the disclosure.
- a target sequence of an RNA molecule binds to a first RNA binding protein of the disclosure.
- a target sequence of an RNA molecule binds to a second RNA binding protein of the disclosure.
- compositions of the disclosure comprise a gRNA comprising a spacer sequence that specifically binds to a target toxic RNA repeat sequence.
- the spacer which binds the target RNA repeat sequence comprises or consists of about 20-30 nucleotides.
- a gRNA comprises one or more spacer sequences.
- AAV vector refers to a vector comprising, consisting essentially of, or consisting of one or more nucleic acid molecules and one or more AAV inverted terminal repeat sequences (ITRs).
- ITRs AAV inverted terminal repeat sequences
- the nucleic acid molecule encodes for a repeat targeting protein and/or composition of the disclosure.
- AAV vectors can be replicated and packaged into infectious viral particles when present in a host cell that provides the functionality of rep and cap gene products, for example, by transfection of the host cell.
- AAV vectors contain a promoter, at least one nucleic acid that may encode at least one protein or RNA, and/or an enhancer and/or a terminator within the flanking ITRs that is packaged into the infectious AAV particle.
- the encapsidated nucleic acid portion may be referred to as the AAV vector genome.
- Plasmids containing AAV vectors may also contain elements for manufacturing purposes, e.g., antibiotic resistance genes, origin of replication sequences etc., but these are not encapsidated and thus do not form part of the AAV particle.
- an AAV vector can comprise at least one nucleic acid molecule encoding a repeat targeting composition of the disclosure. In some aspects, an AAV vector can comprise at least one regulatory sequence. In some aspects, an AAV vector can comprise at least one AAV inverted terminal (ITR) sequence. In some aspects, an AAV vector can comprise a first ITR sequence and a second ITR sequence.
- a first ITR sequence can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 149 or SEQ ID NO: 150.
- a second ITR sequence can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 149 or SEQ ID NO: 150.
- an AAV vector can comprise at least one promoter sequence. In some aspects, an AAV vector can comprise at least one enhancer sequence. In some aspects, an AAV vector can comprise at least one polyA sequence. In some aspects, an AAV vector can comprise at least one linker sequence. In some aspects, an AAV vector of the disclosure can comprise at least one nuclear localization signal, or nuclear export signal and or both.
- an AAV vector of the disclosure will comprise a WPRE (woodchuck hepatitis virus post-trancriptional regulatory element) or portion thereof.
- a WPRE sequence can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 151 or SEQ ID NO: 191.
- an AAV vector of the disclosure can comprise a Cas protein, peptide, or fragment thereof.
- an AAV vector of the disclosure can comprise a polyadenylation (poly A) sequence.
- a polyA sequence can comprise an SV- 40 polyA sequence
- an SV-40 polyA sequence can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 152.
- an AAV vector of the disclosure can comprise an endonuclease protein, peptide, or fragment thereof.
- an AAV vector of the disclosure can comprise a guide RNA, in some cases a repeat targeting guide RNA.
- AAV vectors of the disclosure can comprise a fusion protein comprising one or more elements of the disclosure, including, but not limited to, an RNA-targeting protein (such as a Cas, PUF, or PUMBY) and an endonuclease.
- fusion proteins of the AAV vector can further comprise a linker amino acid sequence between the one or more elements of the disclosure.
- a AAV vector can comprise a first AAV ITR sequence, a promoter sequence, a RNA-targeting composition nucleic acid molecule, a regulatory sequence and a second AAV ITR sequence.
- an AAV vector can comprise, in the 5’ to 3’ direction, a first AAV ITR sequence, a promoter sequence, a transgene nucleic acid molecule, and a second AAV ITR sequence.
- a vector comprises a guide RNA of the disclosure.
- the vector comprises at least one guide RNA of the disclosure.
- the vector comprises one or more guide RNA(s) of the disclosure.
- the vector comprises two or more guide RNAs of the disclosure.
- the vector comprises three guide RNAs.
- the vector comprises four guide RNAs.
- the vector comprises a PolIII promoter, one or multiple guides, a PolII promoter, the Cas protein, a regulatory element and a polyA sequence.
- the vector further comprises a guided or non-guided RNA-binding protein of the disclosure.
- the vector further comprises an RNA-binding fusion protein of the disclosure.
- the fusion protein comprises a first RNA binding protein and a second RNA binding protein.
- the RNA-guided RNA-binding systems comprising an RNA-binding protein and a gRNA are in a single vector.
- the single vector comprises the RNA-guided RNA-binding systems which are Casl3d RNA-guided RNA-binding systems or catalytic deactivated Cast 3d (dCasl3d) RNA-guided RNA-binding systems.
- the single vector comprises the Casl3d RNA-guided RNA- binding systems which are CasRx or dCasRx RNA-guided RNA-binding systems. In another embodiment, the single vector comprises the Casl3d RNA-guided RNA-binding systems which are Seq212 Cast 3d or Seq212 dCasl3d or Seq212 Cast 3d variants RNA-guided RNA- binding systems. In another embodiment, the single vector comprises a non-guided RNA- binding system comprising a PUF or PUMBY-based protein.
- the single vector comprises a non-guided RNA-binding system comprising a PUF or PUMBY- based protein fused with a nuclease domain from ZC3H12A, such as E17 (SEQ ID NO: 358).
- the single vector comprises a dCasl3d RNA-binding system fused with a nuclease domain from ZC3H12A, such as E17 (SEQ ID NO: 359).
- nuclease domain from ZC3H12A, E17 can comprise, consist essentially of, or consist of the amino acid sequence:
- nuclease domain from ZC3H12A, E17 can comprise, consist essentially of, or consist of the amino acid sequence:
- a first vector comprises a guide RNA of the disclosure and a second vector comprises an RNA- binding protein or RNA-binding fusion protein of the disclosure.
- the first vector comprises at least one guide RNA of the disclosure.
- the first vector comprises one or more guide RNA(s) of the disclosure.
- the first vector comprises two or more guide RNA(s) of the disclosure.
- the fusion protein comprises a first RNA binding protein and a second RNA binding protein.
- the first vector and the second vector are identical vectors or vector serotypes.
- the first vector and the second vector are not identical vectors or vector serotypes.
- the RNA-binding systems capable of targeting toxic RNA repeats are in a single vector.
- vectors refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques.
- viral vector e.g., retroviruses, replication defective retroviruses, adenoviruses, replication defective adenoviruses, and adeno- associated viruses.
- Viral vectors also include polynucleotides carried by a virus for transfection into a host cell.
- the vector is a lentivirus (such as an integration-deficient lentiviral vector) or adeno-associated viral (AAV) vector.
- Vectors are capable of autonomous replication in a host cell into which they are introduced such as e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors and other vectors such as, e.g., non-episomal mammalian vectors, are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome.
- vectors such as e.g., expression vectors
- Common expression vectors are often in the form of plasmids.
- recombinant expression vectors comprise a nucleic acid provided herein such as e.g., a guide RNA which can be expressed from a DNA sequence, and a nucleic acid encoding a Cas 13d protein, in a form suitable for expression of a protein in a host cell.
- Recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence to be expressed.
- operably linked is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence such as e.g., in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell. Certain embodiments of a vector depend on factors such as the choice of the host cell to be transformed, and the level of expression desired.
- a vector can be introduced into host cells to thereby produce transcripts, proteins, or peptides, including fusion proteins or peptides, encoded by nucleic acids as described herein such as, e.g., CRISPR transcripts, proteins, enzymes, mutant forms thereof, fusion proteins thereof, etc.
- a vector of the disclosure is a viral vector.
- the viral vector comprises a sequence isolated or derived from a retrovirus.
- the viral vector comprises a sequence isolated or derived from a lentivirus.
- the viral vector comprises a sequence isolated or derived from an adenovirus.
- the viral vector comprises a sequence isolated or derived from an adeno-associated virus (AAV).
- AAV adeno-associated virus
- the viral vector is replication incompetent.
- the viral vector is isolated or recombinant.
- the viral vector is self- complementary.
- Adeno-associated virus refers to a member of the class of viruses associated with this name and belonging to the genus Dependoparvovirus, family Parvoviridae.
- Adeno-associated virus is a single-stranded DNA virus that grows in cells in which certain functions are provided by a co-infecting helper virus.
- General information and reviews of AAV can be found in, for example, Carter, 1989, Handbook of Parvoviruses, Vol. 1, pp. 169- 228, and Berns, 1990, Virology, pp. 1743-1764, Raven Press, (New York).
- AAV possesses unique features that make it attractive as a vector for delivering foreign DNA to cells, for example, in gene therapy. AAV infection of cells in culture is noncytopathic, and natural infection of humans and other animals is silent and asymptomatic.
- AAV infects many mammalian cells allowing the possibility of targeting many different tissues in vivo.
- AAV transduces slowly dividing and non-dividing cells, and can persist essentially for the lifetime of those cells as a transcriptionally active nuclear episome (extrachromosomal element).
- the AAV proviral genome is inserted as cloned DNA in plasmids, which makes construction of recombinant genomes feasible.
- the signals directing AAV replication and genome encapsidation are contained within the ITRs of the AAV genome, some or all of the internal approximately 4.3 kb of the genome (encoding replication and structural capsid proteins, rep-cap) may be replaced with foreign DNA to generate AAV vectors.
- the rep and cap proteins may be provided in trans.
- Another significant feature of AAV is that it is an extremely stable and hearty virus. It easily withstands the conditions used to inactivate adenovirus (56° to 65°C for several hours), making cold preservation of AAV less critical. AAV may even be lyophilized. Finally, AAV-infected cells are not resistant to superinfection.
- Recombinant AAV (rAAV) genomes of the invention comprise, consist essentially of, or consist of a nucleic acid molecule encoding a repeat targeting composition (such as a PUF, PUMB Y, or RNA-guided protein) and one or more AAV ITRs flanking the nucleic acid molecule.
- a repeat targeting composition such as a PUF, PUMB Y, or RNA-guided protein
- AAV ITRs flanking the nucleic acid molecule e.g., W02001083692.
- Other types of rAAV variants, for example rAAV with capsid mutations are also contemplated. See, e.g., Marsic et al., Molecular Therapy, 22(11): 1900-1909 (2014).
- the nucleotide sequences of the genomes of various AAV serotypes are known in the art.
- the viral vector comprises a sequence isolated or derived from an adeno-associated virus (AAV).
- the viral vector comprises an inverted terminal repeat sequence or a capsid sequence that is isolated or derived from an AAV of serotype AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10 AAVrhlO, AAV11, AAV12, AAVrhlO, or AAVrh74.
- the AAV serotype is AAVrh74.
- the AAV serotype is AAV9.
- the AAV vector comprises a modified capsid.
- the AAV vector is an AAV2-Tyr mutant vector.
- the AAV vector comprises a capsid with a non-tyrosine amino acid at a position that corresponds to a surface-exposed tyrosine residue in position Tyr252, Tyr272, Tyr275, Tyr281, Tyr508, Tyr612, Tyr704, Tyr720, Tyr730 or Tyr673 of wild-type AAV2. See also WO 2008/124724 incorporated herein in its entirety.
- the AAV vector comprises an engineered capsid.
- AAV vectors comprising engineered capsids include without limitation, AAV2.7m8, AAV9.7m8, AAV2 2tYF, and AAV8 Y733F).
- the viral vector is replication incompetent.
- the viral vector is isolated or recombinant (rAAV).
- the viral vector is self-complementary (scAAV).
- a vector of the disclosure is a non-viral vector.
- the vector comprises or consists of a nanoparticle, a micelle, a liposome or lipoplex, a polymersome, a polyplex or a dendrimer.
- the vector is an expression vector or recombinant expression system.
- the term “recombinant expression system” refers to a genetic construct for the expression of certain genetic material formed by recombination.
- an expression vector, viral vector or non-viral vector provided herein includes without limitation, an expression control element.
- An “expression control element” as used herein refers to any sequence that regulates the expression of a coding sequence, such as a gene.
- Exemplary expression control elements include but are not limited to promoters, enhancers, microRNAs, post-transcriptional regulatory elements, polyadenylation signal sequences, introns, and 5’ and 3’ UTR sequences (to provide RNA stability and/or improve transcription/ translation or promote RNA nuclear export).
- Expression control elements may be constitutive, inducible, repressible, or tissue-specific, for example.
- a “promoter” is a control sequence that is a region of a polynucleotide sequence at which initiation and rate of transcription are controlled. It may contain genetic elements at which regulatory proteins and molecules may bind such as RNA polymerase and other transcription factors. In some embodiments, expression control by a promoter is tissue-specific. In some embodiments, expression control by a promoter is constitutive or ubiquitous.
- Non-limiting exemplary promoters include a Pol III promoter such as, e.g., U6 and Hl promoters and/or a Pol II promoter e.g., SV40, CMV (optionally including the CMV enhancer), RSV (Rous Sarcoma Virus LTR promoter (optionally including RSV enhancer), CBA (hybrid CMV enhancer/ chicken B-actin), CAG (hybrid CMV enhancer fused to chicken B-actin), truncated CAG, Cbh (hybrid CBA), EF-la (human elongation factor alpha- 1) or EFS (short intron-less EF-1 alpha), PGK (phosphoglycerol kinase), CEF (chicken embryo fibroblasts), UBC (ubiquitinC), GUSB (lysosomal enzyme beta-glucuronidase), UCOE (ubiquitous chromatin opening element), hAAT (alpha- 1 antitryps
- Enhancer is a region of DNA that can be bound by activating proteins to increase the likelihood or frequency of transcription.
- Non-limiting exemplary enhancers and posttranscriptional regulatory elements include the CMV enhancer, MCK enhancer, R-U5’ segment in LTR of HTLV-1, SV40 enhancer, the intron sequence between exons 2 and 3 of rabbit B-globin, and Woodchuck Hepatitis Virus (WHP) Posttranscriptional Regulatory Element (WPRE).
- WPRE Woodchuck Hepatitis Virus
- an intron is used to enhance promoter activity such as a UBB intron.
- the UBB intron is used with an EFS promoter.
- a CMV enhancer/promoter sequence can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 186.
- an EFS promoter sequence can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 155.
- an expression vector, viral vector or non-viral vector includes without limitation, vector elements such as an IRES or 2A peptide sites for configuration of “multicistronic” or “polycistronic” or “bicistronic” or tricistronic” constructs, i.e., having double or triple or multiple coding areas or exons, and as such will have the capability to express from mRNA two or more proteins from a single construct.
- Multicistronic vectors simultaneously express two or more separate proteins from the same mRNA.
- the two strategies most widely used for constructing multicistronic configurations are through the use of an IRES or a 2A self-cleaving site.
- an “IRES” refers to an internal ribosome entry site or portion thereof of viral, prokaryotic, or eukaryotic origin which are used within polycistronic vector constructs.
- an IRES is an RNA element that allows for translation initiation in a cap- independent manner.
- self-cleaving peptides or “sequences encoding self-cleaving peptides” or “2A self-cleaving site” refer to linking sequences which are used within vector constructs to incorporate sites to promote ribosomal skipping and thus to generate two polypeptides from a single promoter, such self-cleaving peptides include without limitation, T2A, and P2A peptides or other sequences encoding the self-cleaving peptides.
- exemplary vector configurations are shown in Figures 1A-1B.
- Exemplary vector configurations comprise a promoter or regulatory sequence (promoter/enhancer combination) driving the expression of the nucleic acid encoding the RNA- targeting Casl3d-based system.
- a vector configuration comprises a promoter driving expression of the RNA-guided Cas RNase RNA-binding protein, or dCas protein fusion in operable linkage with a second promoter driving expressing of a cognate gRNA.
- the vector configuration comprises a linker and one or more tags.
- vectors comprising guide RNA sequences of the disclosure comprises a promoter sequence to drive expression of the guide RNA.
- a vector comprising a guide RNA sequence of the disclosure comprises a promoter sequence to drive expression of the guide RNA.
- the promoter to drive expression of the guide RNA is a constitutive promoter.
- the promoter sequence is an inducible promoter.
- the promoter is a sequence is a tissue-specific and/or cell-type specific promoter.
- the promoter is a hybrid or a recombinant promoter.
- the promoter is a promoter capable of expressing the guide RNA in a mammalian cell. In some embodiments, the promoter is a promoter capable of expressing the guide RNA in a human cell. In some embodiments, the promoter is a promoter capable of expressing the guide RNA and restricting the guide RNA to the nucleus of the cell. In some embodiments, the promoter is a human RNA polymerase promoter or a sequence isolated or derived from a sequence encoding a human RNA polymerase promoter. In some embodiments, the promoter is a U6 promoter or a sequence isolated or derived from a sequence encoding a U6 promoter.
- the U6 promoter is a human or mouse U6 promoter.
- the promoter is a human or mouse tRNA promoter or a sequence isolated or derived from a sequence encoding a human or mouse tRNA promoter.
- the promoter is a human or mouse valine tRNA promoter or a sequence isolated or derived from a sequence encoding a human or mouse valine tRNA promoter.
- a promoter to drive expression of the guide RNA further comprises a regulatory element.
- a vector comprising a promoter sequence to drive expression of the guide RNA further comprises a regulatory element.
- a regulatory element enhances expression of the guide RNA.
- Exemplary regulatory elements include, but are not limited to, an enhancer element, an intron, an exon, or a combination thereof.
- a vector of the disclosure comprises one or more of a sequence encoding a guide RNA, a promoter sequence to drive expression of the guide RNA and a sequence encoding a regulatory element. In some embodiments of the compositions of the disclosure, the vector further comprises a sequence encoding a fusion protein of the disclosure.
- the vector is a viral vector.
- the vector is an adenoviral vector, an adeno-associated viral (AAV) vector, or a lentiviral vector.
- the vector is a retroviral vector, an adenoviral/retroviral chimera vector, a herpes simplex viral I or II vector, a parvoviral vector, a reticuloendotheliosis viral vector, a polioviral vector, a papillomaviral vector, a vaccinia viral vector, or any hybrid or chimeric vector incorporating favorable aspects of two or more viral vectors.
- the vector further comprises one or more expression control elements operably linked to the polynucleotide. In some embodiments, the vector further comprises one or more selectable markers. In some embodiments, the AAV vector has low toxicity. In some embodiments, the AAV vector does not incorporate into the host genome, thereby having a low probability of causing insertional mutagenesis. In some embodiments, the AAV vector can encode a range of total polynucleotides from 4.5 kb to 4.75 kb.
- exemplary AAV vectors that may be used in any of the herein described compositions, systems, methods, and kits can include an AAV1 vector, a modified AAV1 vector, an AAV2 vector, a modified AAV2 vector, an AAV2-Tyr mutant vector, an AAV3 vector, a modified AAV3 vector, an AAV4 vector, a modified AAV4 vector, an AAV5 vector, a modified AAV5 vector, an AAV6 vector, a modified AAV6 vector, an AAV7 vector, a modified AAV7 vector, an AAV8 vector, an AAV9 vector, an AAV.rhlO vector, a modified AAV.rhlO vector, an AAVrh.74, an AAV.rh32/33 vector, a modified AAV.rh32/33 vector, an AAV.rh43 vector, a modified AAV.rh43 vector, an AAV.rh64Rl vector, and a modified AAV.rh64Rl vector, an AAV
- the lentiviral vector is an integrase-competent lentiviral vector (ICLV).
- the lentiviral vector can refer to the transgene plasmid vector as well as the transgene plasmid vector in conjunction with related plasmids (e.g., a packaging plasmid, a rev expressing plasmid, an envelope plasmid) as well as a lentiviral-based particle capable of introducing exogenous nucleic acid into a cell through a viral or viral-like entry mechanism.
- Lentiviral vectors are well-known in the art (see, e.g., Trono D.
- exemplary lentiviral vectors that may be used in any of the herein described compositions, systems, methods, and kits can include a human immunodeficiency virus (HIV) 1 vector, a modified human immunodeficiency virus (HIV) 1 vector, a human immunodeficiency virus (HIV) 2 vector, a modified human immunodeficiency virus (HIV) 2 vector, a sooty mangabey simian immunodeficiency virus (SIVSM) vector, a modified sooty mangabey simian immunodeficiency virus (SIVSM) vector, a African green monkey simian immunodeficiency virus (SIVAGM) vector, a modified African green monkey simian immunodeficiency virus (SIVAGM) vector, an equine infectious an HIV 1 vector
- a modified human immunodeficiency virus (HIV) 1 vector a human immunodeficiency virus (HIV) 2 vector
- AAV vectors comprising RNA- binding polypeptides capable of binding toxic hexanucleotide repeat RNA sequences.
- AAV vetors of the disclosure compise Casl3d constructs capable of binding tergat RNA sequences disclosed herein.
- an AAV vector of the disclosure comprises from 5’ to 3’ : a first ITR sequence, a first promoter sequence, at least one gRNA sequence, a second promoter sequence, a cas polypeptide, a posttranscriptional regulatory element sequence, a polyA sequence, and a second ITR sequence.
- an AAV vector of the disclosure comprises from 5’ to 3’: a first ITR sequence, a first promoter sequence, a first gRNA sequence, a second gRNA sequence, a second promoter sequence, a cas polypeptide, a posttranscriptional regulatory element sequence, a polyA sequence, and a second ITR sequence.
- an AAV vector of the disclosure comprises from 5’ to 3’: a first ITR sequence, a first promoter sequence, a first gRNA sequence, a second gRNA sequence, a third gRNA sequence, a second promoter sequence, a cas polypeptide, a posttranscriptional regulatory element sequence, a polyA sequence, and a second ITR sequence.
- an AAV vector of the disclosure comprises from 5’ to 3’ : a first ITR sequence, a human U6 promoter sequence, at least one gRNA sequence, an EFS promoter sequence, a cas polypeptide, a WPRE sequence, an SV-40 polyA sequence, and a second ITR sequence.
- an AAV vector of the disclosure comprises from 5’ to 3’ : a first ITR sequence, a human U6 promoter sequence, a first gRNA sequence, a second gRNA sequence, an EFS promoter sequence, a cas polypeptide, a WPRE sequence, an SV-40 polyA sequence, and a second ITR sequence.
- an AAV vector of the disclosure comprises from 5’ to 3’ : a first ITR sequence, a human U6 promoter sequence, a first gRNA sequence, a second gRNA sequence, an EFS promoter sequence, a cas polypeptide, a linker sequence, an SV-40 nuclear localization sequence, a WPRE sequence, an SV-40 polyA sequence, and a second ITR sequence.
- an AAV vector of the disclosure comprises from 5’ to 3’ : a first ITR sequence, a first promoter sequence, a cas polypeptide, a posttranscriptional regulatory element sequence, a polyA sequence, a second promoter sequence, at least one gRNA sequence, and a second ITR sequence.
- an AAV vector of the disclosure comprises from 5’ to 3’ : a first ITR sequence, a first promoter sequence, a cas polypeptide, a posttranscriptional regulatory element sequence, a polyA sequence, a second promoter sequence, a first gRNA sequence, a second gRNA sequence and a second ITR sequence.
- an AAV vector of the disclosure comprises from 5’ to 3 ’a first promoter sequence, a cas polypeptide, a posttranscriptional regulatory element sequence, a polyA sequence, a second promoter sequence, a first gRNA sequence, and a second gRNA sequence.
- an AAV vector of the disclosure comprises from 5’ to 3’ : a first ITR sequence, an EFS promoter sequence, a cas polypeptide, a linker sequence, an SV-40 nuclear localization sequence, a WPRE sequence, an SV-40 polyA sequence, a human U6 promoter sequence, a first gRNA sequence, a second gRNA sequence, and a second ITR sequence.
- an AAV vector of the disclosure comprises from 5’ to 3 ’a first promoter sequence, a cas polypeptide, a posttranscriptional regulatory element sequence, a polyA sequence, a second promoter sequence, a first gRNA sequence, a second gRNA sequence, and a third gRNA sequence.
- an AAV vector of the disclosure comprises from 5’ to 3’ a first gRNA sequence, a second gRNA sequence, and a third gRNA sequence, a first promoter sequence (reverse strand), a second promoter sequence, a cas polypeptide, a posttranscriptional regulatory element sequence, a polyA sequence.
- an AAV vector of the disclosure comprises from 5’ to 3’: a first ITR sequence, an EFS promoter sequence, a cas polypeptide, a linker sequence, an SV-40 nuclear localization sequence, a WPRE sequence, an SV-40 polyA sequence, a first gRNA sequence, a second gRNA sequence, a human U6 promoter sequence (reverse strand), and a second ITR sequence.
- AAV vector A02557 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector A02557 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 202.
- AAV vector A02558 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector A02558 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 203.
- AAV vector A02559 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector A02559 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 204.
- AAV vector A02560 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector A02560 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 205.
- AAV vector A02561 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector A02561 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 206.
- AAV vector A02562 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector A02562 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 207.
- AAV vector A02563 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector A02563 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 208.
- AAV vector A02564 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector A02564 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 209.
- AAV vector A02565 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector A02565 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 210.
- AAV vector A02566 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector A02566 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 211.
- AAV vector A02567 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector A02567 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 212.
- AAV vector A02568 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector A02568 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 213.
- the disclosure provides constructs having PolIII promoter and guide expression cassette orientated upstream or downstream to the Cas and on the sense or antisense strands.
- these can comprise constructs: P03189, P03190, and P03191.
- AAV vector P03189 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector P03189 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 214.
- AAV vector P03190 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector P03190 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 215.
- AAV vector P03191 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector P03191 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 216.
- HRE targeting compositions and AAV vectors of the disclosure can comprise promoter sequences which regulate expression of RNA-targeting and/or binding polypeptides of the disclosure, such as Casl3d polypeptides, and guide RNA sequences of the disclosure.
- vectors of the disclosure can comprise two Pol III promoters.
- Exemplary PolIII promoters can comprise, consist essentially of, or consist of a nucleic acid as set forth below.
- a human U6 promoter can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 156.
- a human U6 promoter (reverse strand) can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO:
- a human U6 promoter with a point mutation can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO:
- a mouse U6 promoter can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 174.
- an Hl promoter can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 175.
- a 7SK promoter can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 176.
- a valine tRNA promoter can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 177.
- AAV vector P03235 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector P03235 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 217.
- P03236 pcDNA3.1_hU6_G4C2gl _hU6 C9 C4G2 flanking_ g20_EFS_Seq212-
- AAV vector P03236 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector P03236 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 218.
- the disclosure provides lentiviral vectors comprising RNA repeat targeting compositions of the disclosure.
- L04473 Seq212 with C-terminal NES and C-terminal NLS x2 for nucleus/cytoplasm shuttling
- Lentiviral AAV vector L04473 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector L04473 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 219.
- Lentiviral AAV vector L04474 can comprise from 5’ to 3’ the elements set forth below.
- AAV vector L04474 can comprise, consist essentially of, or consist of a nucleic acid sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% (or any percentage in between) identical to SEQ ID NO: 220.
- nucleic acid sequences encoding RNA-binding G4C2 repeattargeting systems disclosed herein for use in gene transfer and expression techniques described herein. It should be understood, although not always explicitly stated that the sequences provided herein can be used to provide the expression product as well as substantially identical sequences that produce a protein that has the same biological properties. These “biologically equivalent” or “biologically active” or “equivalent” polypeptides are encoded by equivalent polynucleotides as described herein.
- They may possess at least 60%, or alternatively, at least 65%, or alternatively, at least 70%, or alternatively, at least 75%, or alternatively, at least 80%, or alternatively at least 85%, or alternatively at least 90%, or alternatively at least 95% or alternatively at least 98%, identical primary amino acid sequence to the reference polypeptide when compared using sequence identity methods run under default conditions.
- Specific polypeptide sequences are provided as examples of particular embodiments. Modifications to the sequences to amino acids with alternate amino acids that have similar charge.
- an equivalent polynucleotide is one that hybridizes under stringent conditions to the reference polynucleotide or its complement or in reference to a polypeptide, a polypeptide encoded by a polynucleotide that hybridizes to the reference encoding polynucleotide under stringent conditions or its complementary strand.
- an equivalent polypeptide or protein is one that is expressed from an equivalent polynucleotide.
- nucleic acid sequences e.g., polynucleotide sequences
- exemplary Cas sequences are codon optimized for expression in human cells. Codon optimization refers to the fact that different cells differ in their usage of particular codons. This codon bias corresponds to a bias in the relative abundance of particular tRNAs in the cell type. By altering the codons in the sequence to match with the relative abundance of corresponding tRNAs, it is possible to increase expression. It is also possible to decrease expression by deliberately choosing codons for which the corresponding tRNAs are known to be rare in a particular cell type.
- Codon usage tables are known in the art for mammalian cells, as well as for a variety of other organisms. Based on the genetic code, nucleic acid sequences coding for, e.g., a Cas protein, can be generated. In some embodiments, such a sequence is optimized for expression in a host or target cell, such as a host cell used to express the Cas protein or a cell in which the disclosed methods are practiced (such as in a mammalian cell, e.g., a human cell).
- a host or target cell such as a host cell used to express the Cas protein or a cell in which the disclosed methods are practiced (such as in a mammalian cell, e.g., a human cell).
- Codon preferences and codon usage tables for a particular species can be used to engineer isolated nucleic acid molecules encoding a Cas protein (such as one encoding a protein having at least 80%, at least 85%, at least 90%, at least 92%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to its corresponding wild-type protein) that takes advantage of the codon usage preferences of that particular species.
- the Cas proteins disclosed herein can be designed to have codons that are preferentially used by a particular organism of interest.
- a Cas nucleic acid sequence is optimized for expression in human cells, such as one having at least 70%, at least 80%, at least 85%, at least 90%, at least 92%, at least 95%, at least 98%, or at least 99% sequence identity to its corresponding wild-type or originating nucleic acid sequence.
- an isolated nucleic acid molecule encoding at least one Cas protein (which can be part of a vector) includes at least one Cas protein coding sequence that is codon optimized for expression in a eukaryotic cell, or at least one Cas protein coding sequence codon optimized for expression in a human cell.
- such a codon optimized Cas coding sequence has at least 80%, at least 85%, at least 90%, at least 92%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to its corresponding wild-type or originating sequence.
- a eukaryotic cell codon optimized nucleic acid sequence encodes a Cas protein having at least 85%, at least 90%, at least 92%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to its corresponding wild-type or originating protein.
- clones containing functionally equivalent nucleic acids may be routinely generated, such as nucleic acids which differ in sequence but which encode the same Cas protein sequence.
- Silent mutations in the coding sequence result from the degeneracy (i.e., redundancy) of the genetic code, whereby more than one codon can encode the same amino acid residue.
- leucine can be encoded by CTT, CTC, CTA, CTG, TTA, or TTG; serine can be encoded by TCT, TCC, TCA, TCG, AGT, or AGC; asparagine can be encoded by AAT or AAC; aspartic acid can be encoded by GAT or GAC; cysteine can be encoded by TGT or TGC; alanine can be encoded by GCT, GCC, GCA, or GCG; glutamine can be encoded by CAA or CAG; tyrosine can be encoded by TAT or TAC; and isoleucine can be encoded by ATT, ATC, or ATA. Tables showing the standard genetic code can be found in various sources (see, for example, Stryer, 1988, Biochemistry, 3 rd Edition, W.H. 5 Freeman and Co., NY).
- RNA sequences disclosed herein can be depicted as DNA sequences.
- a uracil (U) may be denoted as a threonine (T).
- an RNA sequence will be expressed as a DNA sequence encoding said RNA sequence.
- Hybridization refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues.
- the hydrogen bonding may occur by Watson-Crick base pairing, Hoogstein binding, or in any other sequence-specific manner.
- the complex may comprise two strands forming a duplex structure, three or more strands forming a multi -stranded complex, a single self- hybridizing strand, or any combination of these.
- a hybridization reaction may constitute a step in a more extensive process, such as the initiation of a PC reaction, or the enzymatic cleavage of a polynucleotide by a ribozyme.
- Examples of stringent hybridization conditions include: incubation temperatures of about 25°C to about 37°C; hybridization buffer concentrations of about 6x SSC to about lOx SSC; formamide concentrations of about 0% to about 25%; and wash solutions from about 4x SSC to about 8x SSC.
- Examples of moderate hybridization conditions include: incubation temperatures of about 40°C to about 50°C; buffer concentrations of about 9x SSC to about 2x SSC; formamide concentrations of about 30% to about 50%; and wash solutions of about 5x SSC to about 2x SSC.
- Examples of high stringency conditions include: incubation temperatures of about 55°C to about 68°C; buffer concentrations of about lx SSC to about 0.
- lx SSC formamide concentrations of about 55% to about 75%
- wash solutions of about lx SSC, O. lx SSC, or deionized water.
- hybridization incubation times are from 5 minutes to 24 hours, with 1, 2, or more washing steps, and wash incubation times are about 1, 2, or 15 minutes.
- SSC is 0.15 M NaCl and 15 mM citrate buffer. It is understood that equivalents of SSC using other buffer systems can be employed.
- Homology refers to sequence similarity between two peptides or between two nucleic acid molecules. Homology can be determined by comparing a position in each sequence which may be aligned for purposes of comparison. When a position in the compared sequence is occupied by the same base or amino acid, then the molecules are homologous at that position. A degree of homology between sequences is a function of the number of matching or homologous positions shared by the sequences. An “unrelated” or “non-homologous” sequence shares less than 40% identity, or alternatively less than 25% identity, with one of the sequences of the present invention.
- a cell of the disclosure is a prokaryotic cell.
- a cell of the disclosure is a eukaryotic cell.
- the cell is a mammalian cell.
- the cell is a bovine, murine, feline, equine, porcine, canine, simian, or human cell.
- the cell is a non-human mammalian cell such as a non-human primate cell.
- a cell of the disclosure is a somatic cell.
- a cell of the disclosure is a germline cell. In some embodiments, a germline cell of the disclosure is not a human cell.
- a cell of the disclosure is a stem cell.
- a cell of the disclosure is an embryonic stem cell.
- an embryonic stem cell of the disclosure is not a human cell.
- a cell of the disclosure is a multipotent stem cell or a pluripotent stem cell.
- a cell of the disclosure is an adult stem cell.
- a cell of the disclosure is an induced pluripotent stem cell (iPSC).
- iPSC induced pluripotent stem cell
- a cell of the disclosure is a hematopoietic stem cell (HSC).
- a somatic cell of the disclosure is a neuronal cell.
- a cell or cells of a patient treated with compositions disclosed herein include, without limitation, central nervous system (neurons), peripheral nervous system (neurons), peripheral motor neurons, and/or sensory neurons.
- a neuronal cell is a glial cell.
- a somatic cell of the disclosure is a fibroblast or an epithelial cell.
- an epithelial cell of the disclosure forms a squamous cell epithelium, a cuboidal cell epithelium, a columnar cell epithelium, a stratified cell epithelium, a pseudostratified columnar cell epithelium or a transitional cell epithelium.
- an epithelial cell of the disclosure forms a gland including, but not limited to, a pineal gland, a thymus gland, a pituitary gland, a thyroid gland, an adrenal gland, an apocrine gland, a holocrine gland, a merocrine gland, a serous gland, a mucous gland and a sebaceous gland.
- an epithelial cell of the disclosure contacts an outer surface of an organ including, but not limited to, a lung, a spleen, a stomach, a pancreas, a bladder, an intestine, a kidney, a gallbladder, a liver, a larynx or a pharynx.
- an epithelial cell of the disclosure contacts an outer surface of a blood vessel or a vein.
- a somatic cell of the disclosure is a primary cell.
- a somatic cell of the disclosure is a cultured cell.
- a somatic cell of the disclosure is in vivo, in vitro, ex vivo or in situ.
- a somatic cell of the disclosure is autologous or allogeneic.
- the disclosure provides a method of modifying level of expression of an RNA molecule of the disclosure or a protein encoded by the RNA molecule comprising contacting the composition of the disclosure and the RNA molecule under conditions suitable for binding of one or more of the guide RNA or the RNA-binding protein or RNA-binding fusion protein (or a portion thereof) to the RNA molecule.
- the disclosure provides a method of modifying an activity of a protein encoded by an RNA molecule comprising contacting the composition of the disclosure and the RNA molecule under conditions suitable for binding of one or more of the guide RNA or the RNA- binding protein or the fusion protein (or a portion thereof) to the RNA molecule.
- the disclosure provides a method of modifying level of expression of an RNA molecule of the disclosure or a protein encoded by the RNA molecule comprising contacting the composition of the disclosure and a cell comprising the RNA molecule under conditions suitable for binding of one or more of the guide RNA or the RNA-binding protein or fusion protein (or a portion thereof) to the RNA molecule.
- the cell is in vivo, in vitro, ex vivo or in situ.
- the composition of the disclosure comprises a vector comprising a guide RNA of the disclosure and an RNA-binding protein or fusion protein of the disclosure.
- the vector is an AAV.
- the disclosure provides a method of modifying an activity of a protein encoded by an RNA molecule comprising contacting the composition of the disclosure and a cell comprising the RNA molecule under conditions suitable for binding of one or more of the guide RNA or the RNA-binding protein or fusion protein (or a portion thereof) to the RNA molecule.
- the disclosure provides a method of modifying the level of expression of an RNA molecule of the disclosure or a protein encoded by the RNA molecule comprising contacting the composition of the disclosure and the RNA molecule under conditions suitable for RNA nuclease activity wherein the RNA-binding protein or fusion protein induces a break in the RNA molecule.
- the disclosure provides a method of modifying an activity of a protein encoded by an RNA molecule comprising contacting the composition of the disclosure and the RNA molecule under conditions suitable for RNA nuclease activity wherein the RNA-binding protein or fusion protein induces a break in the RNA molecule.
- the disclosure provides a method of modifying a level of expression of an RNA molecule of the disclosure or a protein encoded by the RNA molecule comprising contacting the composition of the disclosure and a cell comprising the RNA molecule under conditions suitable for RNA nuclease activity wherein the RNA-binding protein or fusion protein induces a break in the RNA molecule.
- the cell is in vivo, in vitro, ex vivo or in situ.
- the composition comprises a vector comprising composition comprising a guide RNA of the disclosure and an RNA-binding fusion protein of the disclosure.
- the vector is an AAV.
- the disclosure provides a method of modifying an activity of a protein encoded by an RNA molecule comprising contacting the composition and a cell comprising the RNA molecule under conditions suitable for RNA nuclease activity wherein the RNA-binding protein or fusion protein induces a break in the RNA molecule.
- the cell is in vivo, in vitro, ex vivo or in situ.
- the composition comprises a vector comprising composition comprising a guide RNA or a single guide RNA of the disclosure and a nucleic acid sequence encoding an RNA-binding protein or fusion protein of the disclosure.
- the vector is an AAV.
- the disclosure provides a method of treating a disease or disorder comprising administering to a subject a therapeutically effective amount of a composition of the disclosure.
- the disclosure provides a method of treating toxic hexanucleotide repeat diseases.
- the repeat disorder is ALS and FTD.
- the disclosure provides a method of treating a repeat disease, wherein the composition modifies, reduces, destroys, knocks down, blocks or ablates a level of expression of a toxic repeat RNA (compared to the level of expression of a toxic repeat RNA treated with a non-targeting (NT) control or compared to no treatment).
- the level of reduction is 1-fold or greater.
- the level of reduction is 2-fold, 3- fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold or 10-fold.
- the level of reduction is 10-fold or greater.
- the level of reduction is between 10- fold and 20-fold.
- the level of reduction is 11-fold, 12-fold, 13-fold, 14-fold, 15-fold, 16-fold, 17-fold, 18-fold, 19-fold, or 20-fold.
- the gene therapy compositions disclosed herein when administered to a patient lead to 20%-100% destruction of the toxic G4C2 repeat RNA.
- the % elimination of the toxic repeat RNA is any of 20-99%, 25%-99%, 50%-99%, 80%-99%, 90%-99%, 95%-99%.
- the % elimination is 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%.
- % elimination is complete elimination or 100% elimination of the toxic repeat RNA.
- a subject of the disclosure is female. In some embodiments of the methods of the disclosure, a subject of the disclosure is male. In some embodiments, a subject of the disclosure has two XX or XY chromosomes. In some embodiments, a subject of the disclosure has two XX or XY chromosomes and a third chromosome, either an X or a Y.
- a subject of the disclosure is a neonate, an infant, a child, an adult, a senior adult, or an elderly adult. In some embodiments of the methods of the disclosure, a subject of the disclosure is at least 1, 2, 3, 4, 5,6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27,28, 29, 30 or 31 days old. In some embodiments of the methods of the disclosure, a subject of the disclosure is at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12 months old.
- a subject of the disclosure is at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 or any number of years or partial years in between of age.
- a subject of the disclosure is a mammal. In some embodiments, a subject of the disclosure is a non-human mammal.
- a subject of the disclosure is a human.
- a therapeutically effective amount comprises a single dose of a composition of the disclosure. In some embodiments, a therapeutically effective amount comprises a therapeutically effective amount comprises at least one dose of a composition of the disclosure. In some embodiments, a therapeutically effective amount comprises a therapeutically effective amount comprises one or more dose(s) of a composition of the disclosure.
- a therapeutically effective amount eliminates a sign or symptom of the disease or disorder. In some embodiments, a therapeutically effective amount reduces a severity of a sign or symptom of the disease or disorder.
- a therapeutically effective amount eliminates the disease or disorder.
- a therapeutically effective amount prevents an onset of a disease or disorder. In some embodiments, a therapeutically effective amount delays the onset of a disease or disorder. In some embodiments, a therapeutically effective amount reduces the severity of a sign or symptom of the disease or disorder. In some embodiments, a therapeutically effective amount improves a prognosis for the subject.
- a composition of the disclosure is administered to the subject via intrathecal, intravenous (IV), or subpial administration.
- a composition of the disclosure is administered to the subject via intracerebral administration.
- the composition of the disclosure is administered to the subj ect by an intrastriatal route.
- the composition of the disclosure is administered to the subject by a stereotaxic injection or an infusion.
- the composition is administered to the brain.
- a composition of the disclosure is administered to the subject locally.
- compositions disclosed herein are formulated as pharmaceutical compositions.
- pharmaceutical compositions for use as disclosed herein may comprise a protein(s) or a polynucleotide encoding the protein(s), optionally comprised in an AAV, which is optionally also immune orthogonal, in combination with one or more pharmaceutically or physiologically acceptable carriers, diluents or excipients.
- compositions may comprise buffers such as neutral buffered saline, phosphate buffered saline and the like; carbohydrates such as glucose, mannose, sucrose or dextrans, mannitol; proteins; polypeptides or amino acids such as glycine; antioxidants; chelating agents such as EDTA or glutathione; adjuvants (e.g., aluminum hydroxide); and preservatives.
- buffers such as neutral buffered saline, phosphate buffered saline and the like
- carbohydrates such as glucose, mannose, sucrose or dextrans, mannitol
- proteins such as glucose, mannose, sucrose or dextrans, mannitol
- proteins such as glucose, mannose, sucrose or dextrans, mannitol
- proteins such as glucose, mannose, sucrose or dextrans, mannitol
- proteins such as glucose, mannose, sucrose or dextrans, mannitol
- proteins such as glucose, mannose
- compositions of the disclosure may be formulated for routes of administration, such as e.g., oral, enteral, topical, transdermal, intranasal, and/or inhalation; and for routes of administration via injection or infusion such as, e.g., intravenous, intramuscular, subpial, intrathecal, intraparenchymal, intrathecal, intrastriatal, subcutaneous, intradermal, intraperitoneal, intratumoral, intravenous, intraocular, and/or parenteral administration.
- the compositions of the present disclosure are formulated for intracerebral or intrastriatal administration.
- EXAMPLE 1 AAV9 MEDIATED DELIVERY OF RNA TARGETING SYSTEMS
- GGGGCC G4C2 hexanucleotide repeat expansions (HRE) in the first intron of C9ORF72 gene are the most common genetic cause of frontotemporal dementia (FTD) and amyotrophic lateral sclerosis (ALS).
- FTD frontotemporal dementia
- ALS amyotrophic lateral sclerosis
- Bidirectional transcription at the C9ORF72 repeat locus produces both sense G4C2 and antisense C4G2 containing transcripts.
- Studies to date have elucidated multiple pathogenic mechanisms including RNA-gain of function of both sense and antisense HREs, dipeptide repeats generated by RAN translation of both sense and antisense HREs, and haploinsufficiency of the C9ORF72 gene.
- FIGS. 1A-B show qPCR and FIGS. 3A-B show RNA-FISH analyses of cells transfected with G4C2 and C4G2 reporters showed efficient elimination of HREs.
- FIGS. 2A-D show qPCR
- FIGS. 3A-B show RNA-FISH analyses of cells transfected with G4C2 and C4G2 reporters showed efficient elimination of HREs.
- G4C2 targeting guide RNA with two different orthologues of CRISPR/Casl3d in AAV9 with high yields.
- FIG. 4B shows a schematic depicting the subpial delivery model. Further, these constructs reduced the HRE containing isoforms of C9ORF72 in both cultured neonatal cortical neurons and in the spinal cord of BAC C9-500 poly-HRE mice, following subpial delivery, while largely preserving total C9ORF72 transcript (transcripts from both alleles and including normal non-expanded isoforms) levels.
- 5C-D show the reduced levels of HRE containing isoforms of C9ORF72 in the spinal cord and the preserved total C9ORF72 transcript in BAC C9-500-poly-HRE mice, following subpial delivery of the exemplary CRISPR/Casl3d based RNA targeting system.
- Mammalian cells (COSM6, HEK293T) were transfected with CRISPR/Casl3d based RNA targeting vectors and reporters expressing the G4C2 (60 to 90 repeats) or C4G2 flanking region added downstream of firefly luciferase. Then, for quantification of G4C2 expression RNA was extracted from the cells and expression of G4C2 was analyzed by qRT-PCR normalizing to a reference gene and non-targeting condition (NT). For the antisense flanking sequence RNA knockdown was analyzed by Firefly Luciferase luminescence and normalized to Renilla Luciferase transfection control and NT condition.
- FIGS. 3A-B show RNA- FISH analyses of cells transfected with G4C2 and showed efficient elimination of HRE.
- RNA was analyzed using qRT-PCR. Results are shown as G4C2 expression normalized to GAPDH and relative to non-targeting (NT) guide in untreated cells (n 2).
- FIG. 4B shows a schematic depicting the subpial delivery model, and the samples that were taken (RNA, protein) and the associated mouse tissue following subpial delivery of the vehicle or AAV9-packaged G4C2 targeting CRISPR/Casl3d targeting system.
- FIG. 4A shows the vector, mouse strain, and dosage of vector administered per animal.
- the vector was administered at a dose of 5E10 vg/animal (5E12 vg/ml, 5 pL/injection site).
- the indicated samples were collected ten weeks following administration of the treatment.
- RNA was extracted from the cervical and lumbar tissue and quantified by ddPCR.
- Protein was extracted from the cervical and thoracic tissue and quantified by MSD.
- FIGS. 5C-D show the reduced levels of HRE containing isoforms of C9ORF72 in the spinal cord and the preserved total C9ORF72 transcript in BAC C9-500-poly- HRE mice, following subpial delivery of the exemplary CRISPR/Casl3d based RNA targeting system.
- FIGS. 5A-B show the relative RNA and protein levels of Cast 3d in mouse tissue following subpial injection of the AAV9-packaged G4C2 targeting CRISPR/Casl3d vector or administration of a vehicle. Casl3d expression was normalized to Atp5b reference gene expression.
- mice were administered either vehicle or AAV9-Casl3d-G4C2 vector by bilateral subpial injection targeting spinal cord.
- a Th 13 Laminectomy was used to expose LI spinal cord.
- the vector was administered at a dose of 5E10 vg/animal (5E12 vg/ml, 5 pL/injection site).
- EXAMPLE 4 REDUCTION IN SENSE AND ANTISENSE RNA AND POLY-GP IN MULTI-TARGETING (MT) CAS13D TREATED C9-ALS FIBROBLASTS
- MT construct targeting sense strand G4C2 and antisense C4G2 flanking region was used to evaluate knockdown of both sense and antisense transcripts and decrease of DPRS in C9-ALS derived fibroblasts.
- DPRs dipeptide repeats
- the fibroblasts were transduced with lentivirus expressing Cast 3d and selected with puromycin 48 hrs after transduction. After selection cells were transduced with the lentivirus for MT guide expression targeting sense and antisense transcripts and harvested 72 hrs after the second transduction.
- DPRs dipeptide repeats
- MSD Meso Scale Discovery
- RNA knockdown of G4C2 sense and antisense transcripts, as well as decrease in dipeptide repeats (DPRs) poly-GP aggregates in C9-ALS mouse model C9 BAC-500 (containing 500 G4C2 repeats) were observed after 6 weeks intrastriatal injection of AAV9 Casl3d MT (A02562). See FIG. 8A-8E.
- DPRs dipeptide repeats
- mice were injected unilaterally with the non-injected side used as a control, and two age groups were used 17 weeks and 24 weeks-old mice for evaluation. Decrease on sense and antisense RNAs (pathological isoforms) were shown by ddPCR as copies of the respective transcript per 1000 copies of the Atp5b reference gene.
- Casl3d expression in tissue following intrastriatal injection of AAV9-Casl3d-MT (A02562) and poly-GP DPRs relative to untreated tissue (contralateral control) were measured using the Meso Scale Discovery (MSD) assay.
- MSD Meso Scale Discovery
- EXAMPLE 7 KNOCKDOWN OF SENSE G4C2 FLANKING REGION WITH SEQ212 USING LUCIFERASE REPORTER ASSAY
- Mammalian cells were transfected with the luciferase reporter containing the sense flanking region, Cast 3d, and the NT Guide (P000MJ Single Lambda Guide; negative control), firefly luciferase Guide (positive control), and guides targeting the sense flanking region. Knockdown of sense G4C2 flanking region in the mammalian cells was observed. See FIG.
- EXAMPLE 8 SEQ212 VARIANTS FOR IMPROVED G4C2 AND ANTISENSE
- Casl3d variants were engineered to improve on-target knockdown of both G4C2 sense and C4G2 antisense flanking region.
- Two reporter plasmids were constructed to express G4C2 repeats driven by the CMV promoter (10A) and a Luciferase reporter containing the antisense flanking region (10C). See FIG. 10A-10E.
- Wildtype (wt) Cast 3d was co-transfected with non-targeting (NT) and G4C2- targeting guide.
- Engineered Cast 3d variants were co-transfected with G4C2-targeting guide.
- C4G2 flanking sequence reporter plasmid was cloned downstream of firefly luciferase ORF (FLUC). Renilla luciferase (RLUC) was used as a transfection control.
- FLUC firefly luciferase ORF
- RLUC Renilla luciferase
- Two independent experiments tested C4G2 antisense flank knockdown in luciferase assay.
- a separate experiment tested RNA binding domains (RBD) tethered to Seq212.
Abstract
L'invention porte sur des compositions de thérapie génique ciblant l'ARN et des procédés permettant de détruire ou de bloquer l'ARN répétitif hexanucléotidique toxique afin de traiter des maladies telles que la SLA et la DFT.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202263332097P | 2022-04-18 | 2022-04-18 | |
US63/332,097 | 2022-04-18 | ||
US202363442271P | 2023-01-31 | 2023-01-31 | |
US63/442,271 | 2023-01-31 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2023205637A1 true WO2023205637A1 (fr) | 2023-10-26 |
Family
ID=86332193
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2023/065889 WO2023205637A1 (fr) | 2022-04-18 | 2023-04-18 | Compositions ciblant l'arn et procédés pour traiter les maladies c9/orf72 |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2023205637A1 (fr) |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001083692A2 (fr) | 2000-04-28 | 2001-11-08 | The Trustees Of The University Of Pennsylvania | Vecteurs aav recombinants dotes de capsides aav5 et vecteurs aav5 pseudotypes dans des capsides heterologues |
WO2008124724A1 (fr) | 2007-04-09 | 2008-10-16 | University Of Florida Research Foundation, Inc. | Compositions à base de vecteurs raav comprenant des protéines de capside modifiées par la tyrosine et procédés d'utilisation correspondants |
WO2017091630A1 (fr) * | 2015-11-23 | 2017-06-01 | The Regents Of The University Of California | Suivi et manipulation d'arn cellulaire par distribution nucléaire de crispr/cas9 |
WO2017216771A2 (fr) * | 2016-06-17 | 2017-12-21 | Genesis Technologies Limited | Système crips-cas, matériels et procédés |
WO2018183403A1 (fr) | 2017-03-28 | 2018-10-04 | Caribou Biosciences, Inc. | Protéine associée à crispr (cas) |
WO2019006471A2 (fr) | 2017-06-30 | 2019-01-03 | Arbor Biotechnologies, Inc. | Nouveaux enzymes de ciblage d'arn crispr, systèmes et utilisations associés |
WO2019040664A1 (fr) | 2017-08-22 | 2019-02-28 | Salk Institute For Biological Studies | Méthodes et compositions de ciblage d'arn |
WO2020160150A1 (fr) * | 2019-01-29 | 2020-08-06 | The Regents Of The University Of California | Enzymes cas ciblant l'arn |
WO2021101980A1 (fr) * | 2019-11-19 | 2021-05-27 | The Regents Of The University Of California | Compositions et procédés d'utilisation de protéines de fusion modifiées qui se lient à des répétitions humaines g4c2 |
WO2022219200A1 (fr) * | 2021-04-16 | 2022-10-20 | Ucl Business Ltd | Systèmes casrx/cas13d ciblant c9orf72 |
-
2023
- 2023-04-18 WO PCT/US2023/065889 patent/WO2023205637A1/fr unknown
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001083692A2 (fr) | 2000-04-28 | 2001-11-08 | The Trustees Of The University Of Pennsylvania | Vecteurs aav recombinants dotes de capsides aav5 et vecteurs aav5 pseudotypes dans des capsides heterologues |
WO2008124724A1 (fr) | 2007-04-09 | 2008-10-16 | University Of Florida Research Foundation, Inc. | Compositions à base de vecteurs raav comprenant des protéines de capside modifiées par la tyrosine et procédés d'utilisation correspondants |
WO2017091630A1 (fr) * | 2015-11-23 | 2017-06-01 | The Regents Of The University Of California | Suivi et manipulation d'arn cellulaire par distribution nucléaire de crispr/cas9 |
WO2017216771A2 (fr) * | 2016-06-17 | 2017-12-21 | Genesis Technologies Limited | Système crips-cas, matériels et procédés |
WO2018183403A1 (fr) | 2017-03-28 | 2018-10-04 | Caribou Biosciences, Inc. | Protéine associée à crispr (cas) |
WO2019006471A2 (fr) | 2017-06-30 | 2019-01-03 | Arbor Biotechnologies, Inc. | Nouveaux enzymes de ciblage d'arn crispr, systèmes et utilisations associés |
WO2019040664A1 (fr) | 2017-08-22 | 2019-02-28 | Salk Institute For Biological Studies | Méthodes et compositions de ciblage d'arn |
US20190062724A1 (en) | 2017-08-22 | 2019-02-28 | Salk Institute For Biological Studies | Rna targeting methods and compositions |
WO2020160150A1 (fr) * | 2019-01-29 | 2020-08-06 | The Regents Of The University Of California | Enzymes cas ciblant l'arn |
WO2021101980A1 (fr) * | 2019-11-19 | 2021-05-27 | The Regents Of The University Of California | Compositions et procédés d'utilisation de protéines de fusion modifiées qui se lient à des répétitions humaines g4c2 |
WO2022219200A1 (fr) * | 2021-04-16 | 2022-10-20 | Ucl Business Ltd | Systèmes casrx/cas13d ciblant c9orf72 |
Non-Patent Citations (16)
Title |
---|
BATRA RANJAN ET AL: "Elimination of Toxic Microsatellite Repeat Expansion RNA by RNA-Targeting Cas9", CELL, ELSEVIER, AMSTERDAM NL, vol. 170, no. 5, 10 August 2017 (2017-08-10), pages 899, XP085170516, ISSN: 0092-8674, DOI: 10.1016/J.CELL.2017.07.010 * |
BERNS: "Virology", 1990, RAVEN PRESS, pages: 1743 - 1764 |
BLACKLOWE: "Parvoviruses and Human Disease", 1988, pages: 165 - 174 |
BRAEMS ET AL., ACTA NEUROPATH, 2020 |
CARTER: "Handbook of Parvoviruses", vol. 1, 1989, pages: 169 - 228 |
CHEW JEANNIE ET AL: "AAV9 mediated delivery of RNA targeting systems eliminate hexanucleotide repeat expansions in C9ORF72 ALS/FTD models", 25TH ANNUAL MEETING OF THE AMERICAN-SOCIETY-OF-GENE-AND-CELL-THERAPY (ASGCT); WASHINGTON, DC, USA; MAY 16 -19, 2022, 19 May 2022 (2022-05-19), XP093071675, Retrieved from the Internet <URL:https://locanabio.com/wp-content/uploads/2022/05/2022-ASGCT.pdf> [retrieved on 20230808] * |
DEJESUS-HERNANDEZ, M. ET AL., NEURON, 2011 |
DURAND ET AL., VIRUSES, vol. 3, no. 2, 2011, pages 132 - 159 |
KONERMANN ET AL., CELL, vol. 173, no. 3, 2018, pages 665 - 676 |
LING, S.C. ET AL., NEURON, 2013 |
MARSIC ET AL., MOLECULAR THERAPY, vol. 22, no. 11, 2014, pages 1900 - 1909 |
ROSE: "Comprehensive Virology", vol. 3, 1974, pages: 1 - 61 |
TRONO D: "Lentiviral vectors", 2002, SPRING-VERLAG BERLIN HEIDELBERG |
VAN BITTLERSWIJK ET AL., CURR OPIN NEUROL, 2013 |
YAN ET AL., MOL CELL, vol. 70, no. 2, 2018, pages 327 - 339 |
ZHANG NAN ET AL: "A CRISPR-Cas13a Based Strategy That Tracks and Degrades Toxic RNA in Myotonic Dystrophy Type 1", FRONTIERS IN GENETICS, vol. 11, 10 December 2020 (2020-12-10), XP055938561, DOI: 10.3389/fgene.2020.594576 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3298134B1 (fr) | Édition génique de mutations introniques profondes | |
KR20220004680A (ko) | 신규 aav 캡시드 및 이를 포함하는 조성물 | |
US10016514B2 (en) | Polynucleotides, vectors and methods for insertion and expression of transgenes | |
US20240000972A1 (en) | Rna-targeting compositions and methods for treating cag repeat diseases | |
US20240108751A1 (en) | Rna-targeting compositions and methods for treating myotonic dystrophy type 1 | |
CN112805382A (zh) | 针对α-突触核蛋白的变体RNAi | |
CN113557243A (zh) | 用于神经变性疾病的基因疗法 | |
US20190256867A1 (en) | Adeno-associated virus compositions for restoring pah gene function and methods of use thereof | |
EP3997241A1 (fr) | Compositions et méthodes pour le traitement de la maladie des urines à odeur de sirop d'érable | |
WO2023205637A1 (fr) | Compositions ciblant l'arn et procédés pour traiter les maladies c9/orf72 | |
US20240024514A1 (en) | Recombinant adeno-associated viruses with enhanced liver tropism and uses thereof | |
KR20230026504A (ko) | 폴리뉴클레오티드 | |
US20230279397A1 (en) | Compositions and methods comprising engineered short nuclear rna (snrna) | |
WO2020223308A2 (fr) | Méthodes et compositions pour la reprogrammation de müller glia | |
WO2022221278A1 (fr) | Compositions et procédés comprenant des promoteurs hybrides | |
WO2024086650A1 (fr) | Compositions et procédés comprenant des arnsn programmables pour l'édition d'arn | |
WO2023154807A2 (fr) | Compositions et procédés de modulation d'épissage de pré-arnm | |
WO2023185861A1 (fr) | Acide nucléique guide ciblant ube3a-ats et ses utilisations | |
CN117320741A (zh) | 用于治疗cag重复疾病的靶向rna的组合物和方法 | |
CN116801901A (zh) | 用于治疗1型强直性肌营养不良的靶向rna的组合物和方法 | |
WO2023235726A2 (fr) | Agents thérapeutiques d'interférence crispr pour une maladie d'expansion de répétition c9orf72 | |
WO2024069144A1 (fr) | Vecteur d'édition d'arn | |
CN116670159A (zh) | 组合物及其用于治疗安格尔曼综合征的用途 | |
JP2023551911A (ja) | アンジェルマン症候群の治療のための組成物及びその使用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23723384 Country of ref document: EP Kind code of ref document: A1 |