WO2023172995A1 - Systems and methods for genetic modulation to treat ocular diseases - Google Patents
Systems and methods for genetic modulation to treat ocular diseases Download PDFInfo
- Publication number
- WO2023172995A1 WO2023172995A1 PCT/US2023/064004 US2023064004W WO2023172995A1 WO 2023172995 A1 WO2023172995 A1 WO 2023172995A1 US 2023064004 W US2023064004 W US 2023064004W WO 2023172995 A1 WO2023172995 A1 WO 2023172995A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- fold
- cell
- prpf31
- expression
- nucleic acid
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 168
- 230000002068 genetic effect Effects 0.000 title description 3
- 208000022873 Ocular disease Diseases 0.000 title description 2
- 230000014509 gene expression Effects 0.000 claims abstract description 295
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 70
- 201000010099 disease Diseases 0.000 claims abstract description 67
- 108090000623 proteins and genes Proteins 0.000 claims description 546
- 210000004027 cell Anatomy 0.000 claims description 421
- 101000610557 Homo sapiens U4/U6 small nuclear ribonucleoprotein Prp31 Proteins 0.000 claims description 217
- 102100040118 U4/U6 small nuclear ribonucleoprotein Prp31 Human genes 0.000 claims description 216
- 102000004169 proteins and genes Human genes 0.000 claims description 209
- 102000040430 polynucleotide Human genes 0.000 claims description 191
- 108091033319 polynucleotide Proteins 0.000 claims description 191
- 239000002157 polynucleotide Substances 0.000 claims description 191
- 150000007523 nucleic acids Chemical class 0.000 claims description 146
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 145
- 102000039446 nucleic acids Human genes 0.000 claims description 138
- 108020004707 nucleic acids Proteins 0.000 claims description 138
- 150000001413 amino acids Chemical class 0.000 claims description 137
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 137
- 229920001184 polypeptide Polymers 0.000 claims description 134
- 230000000694 effects Effects 0.000 claims description 103
- 108700028369 Alleles Proteins 0.000 claims description 75
- 230000001965 increasing effect Effects 0.000 claims description 62
- 125000006850 spacer group Chemical group 0.000 claims description 39
- 210000003583 retinal pigment epithelium Anatomy 0.000 claims description 30
- 230000027455 binding Effects 0.000 claims description 27
- 210000004081 cilia Anatomy 0.000 claims description 26
- 108091006106 transcriptional activators Proteins 0.000 claims description 25
- 230000008685 targeting Effects 0.000 claims description 21
- 210000001519 tissue Anatomy 0.000 claims description 20
- 230000007018 DNA scission Effects 0.000 claims description 19
- 230000002829 reductive effect Effects 0.000 claims description 14
- 208000007014 Retinitis pigmentosa Diseases 0.000 claims description 13
- 108700009124 Transcription Initiation Site Proteins 0.000 claims description 11
- 230000001747 exhibiting effect Effects 0.000 claims description 11
- 230000009870 specific binding Effects 0.000 claims description 9
- 238000002347 injection Methods 0.000 claims description 8
- 239000007924 injection Substances 0.000 claims description 8
- 108091008695 photoreceptors Proteins 0.000 claims description 8
- 230000002207 retinal effect Effects 0.000 claims description 8
- 230000009471 action Effects 0.000 claims description 3
- 206010057249 Phagocytosis Diseases 0.000 claims description 2
- 230000008782 phagocytosis Effects 0.000 claims description 2
- 101001109965 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) 60S ribosomal protein L7-A Proteins 0.000 claims 1
- 101001109960 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) 60S ribosomal protein L7-B Proteins 0.000 claims 1
- 235000018102 proteins Nutrition 0.000 description 184
- 235000001014 amino acid Nutrition 0.000 description 139
- 229940024606 amino acid Drugs 0.000 description 136
- 125000003729 nucleotide group Chemical group 0.000 description 102
- 239000002773 nucleotide Substances 0.000 description 100
- 101710163270 Nuclease Proteins 0.000 description 83
- 125000003275 alpha amino acid group Chemical group 0.000 description 67
- 238000006467 substitution reaction Methods 0.000 description 65
- 108020005004 Guide RNA Proteins 0.000 description 63
- 102000053602 DNA Human genes 0.000 description 62
- 108020004414 DNA Proteins 0.000 description 60
- 239000000203 mixture Substances 0.000 description 58
- 239000012636 effector Substances 0.000 description 46
- 125000000539 amino acid group Chemical group 0.000 description 27
- 210000000349 chromosome Anatomy 0.000 description 25
- 239000012634 fragment Substances 0.000 description 25
- 108010033040 Histones Proteins 0.000 description 24
- 229920002477 rna polymer Polymers 0.000 description 24
- 239000013598 vector Substances 0.000 description 24
- 108091033409 CRISPR Proteins 0.000 description 23
- 238000003776 cleavage reaction Methods 0.000 description 21
- 230000007017 scission Effects 0.000 description 21
- 230000028327 secretion Effects 0.000 description 21
- 108091028043 Nucleic acid sequence Proteins 0.000 description 20
- 230000001105 regulatory effect Effects 0.000 description 20
- 210000000130 stem cell Anatomy 0.000 description 20
- -1 diTP Chemical compound 0.000 description 19
- 239000003607 modifier Substances 0.000 description 19
- 230000035772 mutation Effects 0.000 description 18
- 238000011144 upstream manufacturing Methods 0.000 description 18
- 230000003612 virological effect Effects 0.000 description 16
- 238000010459 TALEN Methods 0.000 description 15
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 15
- 230000002103 transcriptional effect Effects 0.000 description 15
- 238000010453 CRISPR/Cas method Methods 0.000 description 14
- 102100026140 TCF3 fusion partner Human genes 0.000 description 14
- 101710112485 TCF3 fusion partner Proteins 0.000 description 14
- 230000008859 change Effects 0.000 description 14
- 108010073062 Transcription Activator-Like Effectors Proteins 0.000 description 13
- 230000001973 epigenetic effect Effects 0.000 description 13
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 12
- 230000008901 benefit Effects 0.000 description 12
- 230000000295 complement effect Effects 0.000 description 12
- 108020004999 messenger RNA Proteins 0.000 description 12
- 241000196324 Embryophyta Species 0.000 description 11
- 239000002245 particle Substances 0.000 description 11
- 239000013608 rAAV vector Substances 0.000 description 11
- 239000013607 AAV vector Substances 0.000 description 10
- 102000004533 Endonucleases Human genes 0.000 description 10
- 108010042407 Endonucleases Proteins 0.000 description 10
- 230000003247 decreasing effect Effects 0.000 description 10
- 229950010342 uridine triphosphate Drugs 0.000 description 10
- 241000193996 Streptococcus pyogenes Species 0.000 description 9
- 238000000338 in vitro Methods 0.000 description 9
- 238000001727 in vivo Methods 0.000 description 9
- 238000004806 packaging method and process Methods 0.000 description 9
- 239000013603 viral vector Substances 0.000 description 9
- 230000004568 DNA-binding Effects 0.000 description 8
- 102000006947 Histones Human genes 0.000 description 8
- 102100021244 Integral membrane protein GPR180 Human genes 0.000 description 8
- 102000044126 RNA-Binding Proteins Human genes 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 230000004927 fusion Effects 0.000 description 8
- 230000004048 modification Effects 0.000 description 8
- 238000012986 modification Methods 0.000 description 8
- 229920000642 polymer Polymers 0.000 description 8
- 208000001957 retinitis pigmentosa 11 Diseases 0.000 description 8
- 241000702421 Dependoparvovirus Species 0.000 description 7
- 241000124008 Mammalia Species 0.000 description 7
- 102100025169 Max-binding protein MNT Human genes 0.000 description 7
- 102000009572 RNA Polymerase II Human genes 0.000 description 7
- 108010009460 RNA Polymerase II Proteins 0.000 description 7
- 235000004279 alanine Nutrition 0.000 description 7
- 230000037430 deletion Effects 0.000 description 7
- 238000012217 deletion Methods 0.000 description 7
- 230000005782 double-strand break Effects 0.000 description 7
- 230000037431 insertion Effects 0.000 description 7
- 238000003780 insertion Methods 0.000 description 7
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 7
- 241000701022 Cytomegalovirus Species 0.000 description 6
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 6
- 230000001594 aberrant effect Effects 0.000 description 6
- 239000004126 brilliant black BN Substances 0.000 description 6
- 125000002091 cationic group Chemical group 0.000 description 6
- 210000002919 epithelial cell Anatomy 0.000 description 6
- 210000003527 eukaryotic cell Anatomy 0.000 description 6
- 150000002632 lipids Chemical class 0.000 description 6
- 238000004519 manufacturing process Methods 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 230000037426 transcriptional repression Effects 0.000 description 6
- 108091006107 transcriptional repressors Proteins 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- 241000283984 Rodentia Species 0.000 description 5
- 241000700605 Viruses Species 0.000 description 5
- 239000000908 ammonium hydroxide Substances 0.000 description 5
- 230000003197 catalytic effect Effects 0.000 description 5
- 230000001886 ciliary effect Effects 0.000 description 5
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 description 5
- VYXSBFYARXAAKO-UHFFFAOYSA-N ethyl 2-[3-(ethylamino)-6-ethylimino-2,7-dimethylxanthen-9-yl]benzoate;hydron;chloride Chemical compound [Cl-].C1=2C=C(C)C(NCC)=CC=2OC2=CC(=[NH+]CC)C(C)=CC2=C1C1=CC=CC=C1C(=O)OCC VYXSBFYARXAAKO-UHFFFAOYSA-N 0.000 description 5
- 108020001507 fusion proteins Proteins 0.000 description 5
- 102000037865 fusion proteins Human genes 0.000 description 5
- 210000004209 hair Anatomy 0.000 description 5
- VEXZGXHMUGYJMC-UHFFFAOYSA-N hydrochloric acid Substances Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 5
- 210000004263 induced pluripotent stem cell Anatomy 0.000 description 5
- 230000000670 limiting effect Effects 0.000 description 5
- 210000004072 lung Anatomy 0.000 description 5
- 239000000347 magnesium hydroxide Substances 0.000 description 5
- 210000003097 mucus Anatomy 0.000 description 5
- 239000002105 nanoparticle Substances 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 230000001177 retroviral effect Effects 0.000 description 5
- 239000004055 small Interfering RNA Substances 0.000 description 5
- 208000024891 symptom Diseases 0.000 description 5
- 230000001225 therapeutic effect Effects 0.000 description 5
- 239000001226 triphosphate Substances 0.000 description 5
- 235000011178 triphosphate Nutrition 0.000 description 5
- OAKPWEUQDVLTCN-NKWVEPMBSA-N 2',3'-Dideoxyadenosine-5-triphosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1CC[C@@H](CO[P@@](O)(=O)O[P@](O)(=O)OP(O)(O)=O)O1 OAKPWEUQDVLTCN-NKWVEPMBSA-N 0.000 description 4
- 101710159080 Aconitate hydratase A Proteins 0.000 description 4
- 101710159078 Aconitate hydratase B Proteins 0.000 description 4
- 102000008682 Argonaute Proteins Human genes 0.000 description 4
- 108010088141 Argonaute Proteins Proteins 0.000 description 4
- 102100036279 DNA (cytosine-5)-methyltransferase 1 Human genes 0.000 description 4
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 4
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 102100022846 Histone acetyltransferase KAT2B Human genes 0.000 description 4
- 241000282412 Homo Species 0.000 description 4
- 101000931098 Homo sapiens DNA (cytosine-5)-methyltransferase 1 Proteins 0.000 description 4
- 101001047006 Homo sapiens Histone acetyltransferase KAT2B Proteins 0.000 description 4
- 208000026350 Inborn Genetic disease Diseases 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- 241000699670 Mus sp. Species 0.000 description 4
- 108700020471 RNA-Binding Proteins Proteins 0.000 description 4
- 101710105008 RNA-binding protein Proteins 0.000 description 4
- 241000194020 Streptococcus thermophilus Species 0.000 description 4
- 108091027544 Subgenomic mRNA Proteins 0.000 description 4
- 108020004566 Transfer RNA Proteins 0.000 description 4
- 108700019146 Transgenes Proteins 0.000 description 4
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 4
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 4
- ARLKCWCREKRROD-POYBYMJQSA-N [[(2s,5r)-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)CC1 ARLKCWCREKRROD-POYBYMJQSA-N 0.000 description 4
- 239000004480 active ingredient Substances 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 4
- 230000002950 deficient Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 230000002255 enzymatic effect Effects 0.000 description 4
- 229940088598 enzyme Drugs 0.000 description 4
- 238000010362 genome editing Methods 0.000 description 4
- 210000004907 gland Anatomy 0.000 description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 4
- 210000004962 mammalian cell Anatomy 0.000 description 4
- 230000011987 methylation Effects 0.000 description 4
- 238000007069 methylation reaction Methods 0.000 description 4
- 239000008194 pharmaceutical composition Substances 0.000 description 4
- 239000000546 pharmaceutical excipient Substances 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 230000014493 regulation of gene expression Effects 0.000 description 4
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 4
- 210000004918 root sheath Anatomy 0.000 description 4
- 210000000106 sweat gland Anatomy 0.000 description 4
- ABZLKHKQJHEPAX-UHFFFAOYSA-N tetramethylrhodamine Chemical compound C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=CC=C1C([O-])=O ABZLKHKQJHEPAX-UHFFFAOYSA-N 0.000 description 4
- 210000001685 thyroid gland Anatomy 0.000 description 4
- 238000011282 treatment Methods 0.000 description 4
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 4
- 241000701161 unidentified adenovirus Species 0.000 description 4
- 241001430294 unidentified retrovirus Species 0.000 description 4
- 239000003981 vehicle Substances 0.000 description 4
- 239000011701 zinc Substances 0.000 description 4
- 229910052725 zinc Inorganic materials 0.000 description 4
- 102100023971 ADP-ribosylation factor-like protein 13B Human genes 0.000 description 3
- 101710096649 ADP-ribosylation factor-like protein 13B Proteins 0.000 description 3
- ZKHQWZAMYRWXGA-KQYNXXCUSA-J ATP(4-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-J 0.000 description 3
- ZKHQWZAMYRWXGA-UHFFFAOYSA-N Adenosine triphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O ZKHQWZAMYRWXGA-UHFFFAOYSA-N 0.000 description 3
- 241000271566 Aves Species 0.000 description 3
- 241000283690 Bos taurus Species 0.000 description 3
- 102000004127 Cytokines Human genes 0.000 description 3
- 108090000695 Cytokines Proteins 0.000 description 3
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 3
- 101710096438 DNA-binding protein Proteins 0.000 description 3
- AHCYMLUZIRLXAA-SHYZEUOFSA-N Deoxyuridine 5'-triphosphate Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C=C1 AHCYMLUZIRLXAA-SHYZEUOFSA-N 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- 102100022893 Histone acetyltransferase KAT5 Human genes 0.000 description 3
- 102100038885 Histone acetyltransferase p300 Human genes 0.000 description 3
- 102100029768 Histone-lysine N-methyltransferase SETD1A Human genes 0.000 description 3
- 101000865038 Homo sapiens Histone-lysine N-methyltransferase SETD1A Proteins 0.000 description 3
- 101000610640 Homo sapiens U4/U6 small nuclear ribonucleoprotein Prp3 Proteins 0.000 description 3
- 108700026244 Open Reading Frames Proteins 0.000 description 3
- 241000288906 Primates Species 0.000 description 3
- 241000700584 Simplexvirus Species 0.000 description 3
- 108020004459 Small interfering RNA Proteins 0.000 description 3
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 3
- 102100040374 U4/U6 small nuclear ribonucleoprotein Prp3 Human genes 0.000 description 3
- HDRRAMINWIWTNU-NTSWFWBYSA-N [[(2s,5r)-5-(2-amino-6-oxo-3h-purin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@H]1CC[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HDRRAMINWIWTNU-NTSWFWBYSA-N 0.000 description 3
- 230000021736 acetylation Effects 0.000 description 3
- 238000006640 acetylation reaction Methods 0.000 description 3
- 239000012190 activator Substances 0.000 description 3
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 3
- 210000004102 animal cell Anatomy 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 210000000270 basal cell Anatomy 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- URGJWIFLBWJRMF-JGVFFNPUSA-N ddTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)CC1 URGJWIFLBWJRMF-JGVFFNPUSA-N 0.000 description 3
- 208000035475 disorder Diseases 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 210000003743 erythrocyte Anatomy 0.000 description 3
- 210000001035 gastrointestinal tract Anatomy 0.000 description 3
- 238000012239 gene modification Methods 0.000 description 3
- 208000016361 genetic disease Diseases 0.000 description 3
- 238000010353 genetic engineering Methods 0.000 description 3
- 210000002175 goblet cell Anatomy 0.000 description 3
- 210000004919 hair shaft Anatomy 0.000 description 3
- 238000005734 heterodimerization reaction Methods 0.000 description 3
- 210000005260 human cell Anatomy 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 239000004615 ingredient Substances 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 210000003734 kidney Anatomy 0.000 description 3
- 239000000252 konjac Substances 0.000 description 3
- 210000000440 neutrophil Anatomy 0.000 description 3
- 210000002394 ovarian follicle Anatomy 0.000 description 3
- 230000002085 persistent effect Effects 0.000 description 3
- 230000001817 pituitary effect Effects 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 230000008439 repair process Effects 0.000 description 3
- 210000002345 respiratory system Anatomy 0.000 description 3
- 210000001525 retina Anatomy 0.000 description 3
- 238000003757 reverse transcription PCR Methods 0.000 description 3
- 108020004418 ribosomal RNA Proteins 0.000 description 3
- 210000003079 salivary gland Anatomy 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 210000001324 spliceosome Anatomy 0.000 description 3
- 210000002784 stomach Anatomy 0.000 description 3
- 210000002105 tongue Anatomy 0.000 description 3
- 238000010361 transduction Methods 0.000 description 3
- 230000026683 transduction Effects 0.000 description 3
- 230000010415 tropism Effects 0.000 description 3
- 230000002485 urinary effect Effects 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- VGIRNWJSIRVFRT-UHFFFAOYSA-N 2',7'-difluorofluorescein Chemical compound OC(=O)C1=CC=CC=C1C1=C2C=C(F)C(=O)C=C2OC2=CC(O)=C(F)C=C21 VGIRNWJSIRVFRT-UHFFFAOYSA-N 0.000 description 2
- WCKQPPQRFNHPRJ-UHFFFAOYSA-N 4-[[4-(dimethylamino)phenyl]diazenyl]benzoic acid Chemical compound C1=CC(N(C)C)=CC=C1N=NC1=CC=C(C(O)=O)C=C1 WCKQPPQRFNHPRJ-UHFFFAOYSA-N 0.000 description 2
- SJQRQOKXQKVJGJ-UHFFFAOYSA-N 5-(2-aminoethylamino)naphthalene-1-sulfonic acid Chemical compound C1=CC=C2C(NCCN)=CC=CC2=C1S(O)(=O)=O SJQRQOKXQKVJGJ-UHFFFAOYSA-N 0.000 description 2
- 241001164823 Adeno-associated virus - 7 Species 0.000 description 2
- 241001164825 Adeno-associated virus - 8 Species 0.000 description 2
- 241000272517 Anseriformes Species 0.000 description 2
- 241000195940 Bryophyta Species 0.000 description 2
- 108091079001 CRISPR RNA Proteins 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108020004635 Complementary DNA Proteins 0.000 description 2
- 241000218631 Coniferophyta Species 0.000 description 2
- 102220605874 Cytosolic arginine sensor for mTORC1 subunit 2_D10A_mutation Human genes 0.000 description 2
- 108010024491 DNA Methyltransferase 3A Proteins 0.000 description 2
- 108010024985 DNA methyltransferase 3B Proteins 0.000 description 2
- 108010053770 Deoxyribonucleases Proteins 0.000 description 2
- 102000016911 Deoxyribonucleases Human genes 0.000 description 2
- 206010013801 Duchenne Muscular Dystrophy Diseases 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- 241000192016 Finegoldia magna Species 0.000 description 2
- 241000589599 Francisella tularensis subsp. novicida Species 0.000 description 2
- 241000287828 Gallus gallus Species 0.000 description 2
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 2
- 241000713813 Gibbon ape leukemia virus Species 0.000 description 2
- 102000003886 Glycoproteins Human genes 0.000 description 2
- 108090000288 Glycoproteins Proteins 0.000 description 2
- XKMLYUALXHKNFT-UUOKFMHZSA-N Guanosine-5'-triphosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XKMLYUALXHKNFT-UUOKFMHZSA-N 0.000 description 2
- 208000028782 Hereditary disease Diseases 0.000 description 2
- 108010074870 Histone Demethylases Proteins 0.000 description 2
- 102000008157 Histone Demethylases Human genes 0.000 description 2
- 101710116149 Histone acetyltransferase KAT5 Proteins 0.000 description 2
- 102100021467 Histone acetyltransferase type B catalytic subunit Human genes 0.000 description 2
- 108090000246 Histone acetyltransferases Proteins 0.000 description 2
- 102000003893 Histone acetyltransferases Human genes 0.000 description 2
- 102100038720 Histone deacetylase 9 Human genes 0.000 description 2
- 108010016918 Histone-Lysine N-Methyltransferase Proteins 0.000 description 2
- 102000000581 Histone-lysine N-methyltransferase Human genes 0.000 description 2
- 102100022103 Histone-lysine N-methyltransferase 2A Human genes 0.000 description 2
- 102100022102 Histone-lysine N-methyltransferase 2B Human genes 0.000 description 2
- 102100038970 Histone-lysine N-methyltransferase EZH2 Human genes 0.000 description 2
- 102100032742 Histone-lysine N-methyltransferase SETD2 Human genes 0.000 description 2
- 102100029239 Histone-lysine N-methyltransferase, H3 lysine-36 specific Human genes 0.000 description 2
- 101001046967 Homo sapiens Histone acetyltransferase KAT2A Proteins 0.000 description 2
- 101000882390 Homo sapiens Histone acetyltransferase p300 Proteins 0.000 description 2
- 101000898976 Homo sapiens Histone acetyltransferase type B catalytic subunit Proteins 0.000 description 2
- 101001045846 Homo sapiens Histone-lysine N-methyltransferase 2A Proteins 0.000 description 2
- 101001045848 Homo sapiens Histone-lysine N-methyltransferase 2B Proteins 0.000 description 2
- 101001008894 Homo sapiens Histone-lysine N-methyltransferase 2D Proteins 0.000 description 2
- 101000882127 Homo sapiens Histone-lysine N-methyltransferase EZH2 Proteins 0.000 description 2
- 101000634050 Homo sapiens Histone-lysine N-methyltransferase, H3 lysine-36 specific Proteins 0.000 description 2
- 101000613625 Homo sapiens Lysine-specific demethylase 4A Proteins 0.000 description 2
- 101001088893 Homo sapiens Lysine-specific demethylase 4C Proteins 0.000 description 2
- 101000687346 Homo sapiens PR domain zinc finger protein 2 Proteins 0.000 description 2
- 101000893100 Homo sapiens Protein fantom Proteins 0.000 description 2
- 241000725303 Human immunodeficiency virus Species 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- 241000713666 Lentivirus Species 0.000 description 2
- 102100024640 Low-density lipoprotein receptor Human genes 0.000 description 2
- 102100040863 Lysine-specific demethylase 4A Human genes 0.000 description 2
- 102100033230 Lysine-specific demethylase 4C Human genes 0.000 description 2
- 108060004795 Methyltransferase Proteins 0.000 description 2
- 241001529936 Murinae Species 0.000 description 2
- 241000714177 Murine leukemia virus Species 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- 101000978776 Mus musculus Neurogenic locus notch homolog protein 1 Proteins 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 102100024885 PR domain zinc finger protein 2 Human genes 0.000 description 2
- 241001494479 Pecora Species 0.000 description 2
- 102100040970 Protein fantom Human genes 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 102100038247 Retinol-binding protein 3 Human genes 0.000 description 2
- 101710137010 Retinol-binding protein 3 Proteins 0.000 description 2
- 241000713311 Simian immunodeficiency virus Species 0.000 description 2
- 108010003165 Small Nuclear Ribonucleoproteins Proteins 0.000 description 2
- 102000004598 Small Nuclear Ribonucleoproteins Human genes 0.000 description 2
- 108091027967 Small hairpin RNA Proteins 0.000 description 2
- 241000191967 Staphylococcus aureus Species 0.000 description 2
- 241000187191 Streptomyces viridochromogenes Species 0.000 description 2
- 241000203587 Streptosporangium roseum Species 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- 102100035222 Transcription initiation factor TFIID subunit 1 Human genes 0.000 description 2
- 241000605939 Wolinella succinogenes Species 0.000 description 2
- 240000008042 Zea mays Species 0.000 description 2
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 210000004504 adult stem cell Anatomy 0.000 description 2
- 210000002383 alveolar type I cell Anatomy 0.000 description 2
- 210000002588 alveolar type II cell Anatomy 0.000 description 2
- 210000002255 anal canal Anatomy 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 229940009098 aspartate Drugs 0.000 description 2
- 230000003190 augmentative effect Effects 0.000 description 2
- 210000003719 b-lymphocyte Anatomy 0.000 description 2
- 210000003651 basophil Anatomy 0.000 description 2
- 210000002228 beta-basophil Anatomy 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 210000001772 blood platelet Anatomy 0.000 description 2
- 210000000988 bone and bone Anatomy 0.000 description 2
- 210000000233 bronchiolar non-ciliated Anatomy 0.000 description 2
- 238000010804 cDNA synthesis Methods 0.000 description 2
- 230000032823 cell division Effects 0.000 description 2
- 108091092259 cell-free RNA Proteins 0.000 description 2
- 210000003169 central nervous system Anatomy 0.000 description 2
- 235000013330 chicken meat Nutrition 0.000 description 2
- 210000001612 chondrocyte Anatomy 0.000 description 2
- 210000003737 chromaffin cell Anatomy 0.000 description 2
- 238000010668 complexation reaction Methods 0.000 description 2
- 210000004087 cornea Anatomy 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 description 2
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 2
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 238000002716 delivery method Methods 0.000 description 2
- 210000004443 dendritic cell Anatomy 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000006471 dimerization reaction Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 210000003979 eosinophil Anatomy 0.000 description 2
- 230000004049 epigenetic modification Effects 0.000 description 2
- 210000000981 epithelium Anatomy 0.000 description 2
- 210000003238 esophagus Anatomy 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 238000002825 functional assay Methods 0.000 description 2
- 210000001156 gastric mucosa Anatomy 0.000 description 2
- 238000003633 gene expression assay Methods 0.000 description 2
- 238000001415 gene therapy Methods 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 210000002443 helper t lymphocyte Anatomy 0.000 description 2
- 210000003630 histaminocyte Anatomy 0.000 description 2
- 210000000987 immune system Anatomy 0.000 description 2
- 238000003364 immunohistochemistry Methods 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 210000000936 intestine Anatomy 0.000 description 2
- 230000031910 intraflagellar transport Effects 0.000 description 2
- 210000002510 keratinocyte Anatomy 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 210000001756 lactotroph Anatomy 0.000 description 2
- 238000001638 lipofection Methods 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 239000000314 lubricant Substances 0.000 description 2
- 210000002540 macrophage Anatomy 0.000 description 2
- 210000001730 macula densa epithelial cell Anatomy 0.000 description 2
- 210000005075 mammary gland Anatomy 0.000 description 2
- 210000003593 megakaryocyte Anatomy 0.000 description 2
- 210000002752 melanocyte Anatomy 0.000 description 2
- 210000003584 mesangial cell Anatomy 0.000 description 2
- 108091070501 miRNA Proteins 0.000 description 2
- 239000002679 microRNA Substances 0.000 description 2
- 238000000520 microinjection Methods 0.000 description 2
- 210000001616 monocyte Anatomy 0.000 description 2
- 210000000214 mouth Anatomy 0.000 description 2
- 210000003550 mucous cell Anatomy 0.000 description 2
- 210000003205 muscle Anatomy 0.000 description 2
- 210000000581 natural killer T-cell Anatomy 0.000 description 2
- 210000000822 natural killer cell Anatomy 0.000 description 2
- 210000004498 neuroglial cell Anatomy 0.000 description 2
- 210000001719 neurosecretory cell Anatomy 0.000 description 2
- 230000006780 non-homologous end joining Effects 0.000 description 2
- 230000009437 off-target effect Effects 0.000 description 2
- 238000006384 oligomerization reaction Methods 0.000 description 2
- 238000012014 optical coherence tomography Methods 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 210000002997 osteoclast Anatomy 0.000 description 2
- 210000001711 oxyntic cell Anatomy 0.000 description 2
- 210000003889 oxyphil cell of parathyroid gland Anatomy 0.000 description 2
- 210000003134 paneth cell Anatomy 0.000 description 2
- 210000002655 parathyroid chief cell Anatomy 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 210000001778 pluripotent stem cell Anatomy 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 230000000069 prophylactic effect Effects 0.000 description 2
- 210000002307 prostate Anatomy 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 210000001995 reticulocyte Anatomy 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 210000000582 semen Anatomy 0.000 description 2
- 210000001625 seminal vesicle Anatomy 0.000 description 2
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 2
- 210000000717 sertoli cell Anatomy 0.000 description 2
- 210000001764 somatotrope Anatomy 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000004094 surface-active agent Substances 0.000 description 2
- 210000001550 testis Anatomy 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000010474 transient expression Effects 0.000 description 2
- 210000003708 urethra Anatomy 0.000 description 2
- 210000001215 vagina Anatomy 0.000 description 2
- 201000010653 vesiculitis Diseases 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- ZLOIGESWDJYCTF-XVFCMESISA-N 4-thiouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=S)C=C1 ZLOIGESWDJYCTF-XVFCMESISA-N 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- LQLQRFGHAALLLE-UHFFFAOYSA-N 5-bromouracil Chemical compound BrC1=CNC(=O)NC1=O LQLQRFGHAALLLE-UHFFFAOYSA-N 0.000 description 1
- NJYVEMPWNAYQQN-UHFFFAOYSA-N 5-carboxyfluorescein Chemical compound C12=CC=C(O)C=C2OC2=CC(O)=CC=C2C21OC(=O)C1=CC(C(=O)O)=CC=C21 NJYVEMPWNAYQQN-UHFFFAOYSA-N 0.000 description 1
- OSXFATOLZGZLSK-UHFFFAOYSA-N 6,7-dimethoxy-2-(4-methyl-1,4-diazepan-1-yl)-N-[1-(phenylmethyl)-4-piperidinyl]-4-quinazolinamine Chemical compound C=12C=C(OC)C(OC)=CC2=NC(N2CCN(C)CCC2)=NC=1NC(CC1)CCN1CC1=CC=CC=C1 OSXFATOLZGZLSK-UHFFFAOYSA-N 0.000 description 1
- WQZIDRAQTRIQDX-UHFFFAOYSA-N 6-carboxy-x-rhodamine Chemical compound OC(=O)C1=CC=C(C([O-])=O)C=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 WQZIDRAQTRIQDX-UHFFFAOYSA-N 0.000 description 1
- 230000005730 ADP ribosylation Effects 0.000 description 1
- 102100030089 ATP-dependent RNA helicase DHX8 Human genes 0.000 description 1
- 241001430193 Absiella dolichum Species 0.000 description 1
- 241000007910 Acaryochloris marina Species 0.000 description 1
- 241001135192 Acetohalobium arabaticum Species 0.000 description 1
- 241000604451 Acidaminococcus Species 0.000 description 1
- 241001464929 Acidithiobacillus caldus Species 0.000 description 1
- 241000605222 Acidithiobacillus ferrooxidans Species 0.000 description 1
- 241001134630 Acidothermus cellulolyticus Species 0.000 description 1
- 241000460100 Acidovorax ebreus Species 0.000 description 1
- 102100032746 Actin-histidine N-methyltransferase Human genes 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 241001655883 Adeno-associated virus - 1 Species 0.000 description 1
- 241000702423 Adeno-associated virus - 2 Species 0.000 description 1
- 241000202702 Adeno-associated virus - 3 Species 0.000 description 1
- 241000580270 Adeno-associated virus - 4 Species 0.000 description 1
- 241001634120 Adeno-associated virus - 5 Species 0.000 description 1
- 241000972680 Adeno-associated virus - 6 Species 0.000 description 1
- 241000649046 Adeno-associated virus 11 Species 0.000 description 1
- 241000649047 Adeno-associated virus 12 Species 0.000 description 1
- 235000001674 Agaricus brunnescens Nutrition 0.000 description 1
- 235000016626 Agrimonia eupatoria Nutrition 0.000 description 1
- 241000702462 Akkermansia muciniphila Species 0.000 description 1
- 241000190857 Allochromatium vinosum Species 0.000 description 1
- 241001621924 Aminomonas paucivorans Species 0.000 description 1
- 241000147155 Ammonifex degensii Species 0.000 description 1
- 108090000672 Annexin A5 Proteins 0.000 description 1
- 102000004121 Annexin A5 Human genes 0.000 description 1
- 101100218322 Arabidopsis thaliana ATXR3 gene Proteins 0.000 description 1
- 101100443354 Arabidopsis thaliana DME gene Proteins 0.000 description 1
- 101100331657 Arabidopsis thaliana DML2 gene Proteins 0.000 description 1
- 101100091498 Arabidopsis thaliana ROS1 gene Proteins 0.000 description 1
- 241000620196 Arthrospira maxima Species 0.000 description 1
- 240000002900 Arthrospira platensis Species 0.000 description 1
- 235000016425 Arthrospira platensis Nutrition 0.000 description 1
- 241001495183 Arthrospira sp. Species 0.000 description 1
- 239000000592 Artificial Cell Substances 0.000 description 1
- 241000512259 Ascophyllum nodosum Species 0.000 description 1
- 206010003594 Ataxia telangiectasia Diseases 0.000 description 1
- 229930192334 Auxin Natural products 0.000 description 1
- 235000000832 Ayote Nutrition 0.000 description 1
- 241000589941 Azospirillum Species 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 241000906059 Bacillus pseudomycoides Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000606124 Bacteroides fragilis Species 0.000 description 1
- 241000218495 Bactrocera correcta Species 0.000 description 1
- 102100022794 Bestrophin-1 Human genes 0.000 description 1
- 102100022548 Beta-hexosaminidase subunit alpha Human genes 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-M Bicarbonate Chemical compound OC([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-M 0.000 description 1
- 241000186016 Bifidobacterium bifidum Species 0.000 description 1
- 241000186020 Bifidobacterium dentium Species 0.000 description 1
- 241001608472 Bifidobacterium longum Species 0.000 description 1
- 241001474374 Blennius Species 0.000 description 1
- 206010005176 Blindness congenital Diseases 0.000 description 1
- 208000019838 Blood disease Diseases 0.000 description 1
- 208000005692 Bloom Syndrome Diseases 0.000 description 1
- 241001536303 Botryococcus braunii Species 0.000 description 1
- 241000589173 Bradyrhizobium Species 0.000 description 1
- 241000823281 Burkholderiales bacterium Species 0.000 description 1
- 101150018129 CSF2 gene Proteins 0.000 description 1
- 101150069031 CSN2 gene Proteins 0.000 description 1
- 101100002344 Caenorhabditis elegans arid-1 gene Proteins 0.000 description 1
- 101100026251 Caenorhabditis elegans atf-2 gene Proteins 0.000 description 1
- 241000589875 Campylobacter jejuni Species 0.000 description 1
- 241001496650 Candidatus Desulforudis Species 0.000 description 1
- 241000327160 Candidatus Puniceispirillum marinum Species 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 241000190885 Capnocytophaga ochracea Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 108090000565 Capsid Proteins Proteins 0.000 description 1
- 208000020446 Cardiac disease Diseases 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 241001443867 Catenibacterium mitsuokai Species 0.000 description 1
- 102100023321 Ceruloplasmin Human genes 0.000 description 1
- 241000195597 Chlamydomonas reinhardtii Species 0.000 description 1
- 244000249214 Chlorella pyrenoidosa Species 0.000 description 1
- 235000007091 Chlorella pyrenoidosa Nutrition 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- 102100038641 Cleavage and polyadenylation specificity factor subunit 1 Human genes 0.000 description 1
- 241000193163 Clostridioides difficile Species 0.000 description 1
- 241000193155 Clostridium botulinum Species 0.000 description 1
- 241000193468 Clostridium perfringens Species 0.000 description 1
- 241000243321 Cnidaria Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 102100024079 Coiled-coil and C2 domain-containing protein 2A Human genes 0.000 description 1
- 241000907165 Coleofasciculus chthonoplastes Species 0.000 description 1
- 206010010356 Congenital anomaly Diseases 0.000 description 1
- 206010053138 Congenital aplastic anaemia Diseases 0.000 description 1
- 241000220677 Coprococcus catus Species 0.000 description 1
- KQLDDLUWUFBQHP-UHFFFAOYSA-N Cordycepin Natural products C1=NC=2C(N)=NC=NC=2N1C1OCC(CO)C1O KQLDDLUWUFBQHP-UHFFFAOYSA-N 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 108091029523 CpG island Proteins 0.000 description 1
- 241000065716 Crocosphaera watsonii Species 0.000 description 1
- 240000004244 Cucurbita moschata Species 0.000 description 1
- 235000009854 Cucurbita moschata Nutrition 0.000 description 1
- 235000009804 Cucurbita pepo subsp pepo Nutrition 0.000 description 1
- 241000159506 Cyanothece Species 0.000 description 1
- 201000003883 Cystic fibrosis Diseases 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 101150064551 DML1 gene Proteins 0.000 description 1
- 230000007067 DNA methylation Effects 0.000 description 1
- 230000033616 DNA repair Effects 0.000 description 1
- 101150117307 DRM3 gene Proteins 0.000 description 1
- 101001095965 Dictyostelium discoideum Phospholipid-inositol phosphatase Proteins 0.000 description 1
- 241001595867 Dinoroseobacter shibae Species 0.000 description 1
- 108010028143 Dioxygenases Proteins 0.000 description 1
- 102000016680 Dioxygenases Human genes 0.000 description 1
- 102100032049 E3 ubiquitin-protein ligase LRSAM1 Human genes 0.000 description 1
- 241000258955 Echinodermata Species 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 241001338691 Elusimicrobium minutum Species 0.000 description 1
- 102100038132 Endogenous retrovirus group K member 6 Pro protein Human genes 0.000 description 1
- 101710091045 Envelope protein Proteins 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 239000001856 Ethyl cellulose Substances 0.000 description 1
- 241000326311 Exiguobacterium sibiricum Species 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 201000004939 Fanconi anemia Diseases 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241000605896 Fibrobacter succinogenes Species 0.000 description 1
- 241001282092 Filifactor alocis Species 0.000 description 1
- 241000604777 Flavobacterium columnare Species 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- 241000605986 Fusobacterium nucleatum Species 0.000 description 1
- 210000000712 G cell Anatomy 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 241000219146 Gossypium Species 0.000 description 1
- 102100039214 Guanine nucleotide-binding protein G(t) subunit alpha-2 Human genes 0.000 description 1
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 1
- 108091005772 HDAC11 Proteins 0.000 description 1
- 102000029812 HNH nuclease Human genes 0.000 description 1
- 108060003760 HNH nuclease Proteins 0.000 description 1
- 241000590006 Helicobacter mustelae Species 0.000 description 1
- 208000031220 Hemophilia Diseases 0.000 description 1
- 208000009292 Hemophilia A Diseases 0.000 description 1
- 208000002972 Hepatolenticular Degeneration Diseases 0.000 description 1
- 108010068250 Herpes Simplex Virus Protein Vmw65 Proteins 0.000 description 1
- 102100033071 Histone acetyltransferase KAT6A Human genes 0.000 description 1
- 102100033070 Histone acetyltransferase KAT6B Human genes 0.000 description 1
- 102100033068 Histone acetyltransferase KAT7 Human genes 0.000 description 1
- 102100033069 Histone acetyltransferase KAT8 Human genes 0.000 description 1
- 102100039996 Histone deacetylase 1 Human genes 0.000 description 1
- 102100039385 Histone deacetylase 11 Human genes 0.000 description 1
- 102100039999 Histone deacetylase 2 Human genes 0.000 description 1
- 102100021455 Histone deacetylase 3 Human genes 0.000 description 1
- 102100021454 Histone deacetylase 4 Human genes 0.000 description 1
- 102100021453 Histone deacetylase 5 Human genes 0.000 description 1
- 102100038715 Histone deacetylase 8 Human genes 0.000 description 1
- 102100027755 Histone-lysine N-methyltransferase 2C Human genes 0.000 description 1
- 102100026265 Histone-lysine N-methyltransferase ASH1L Human genes 0.000 description 1
- 102100035043 Histone-lysine N-methyltransferase EHMT1 Human genes 0.000 description 1
- 102100035042 Histone-lysine N-methyltransferase EHMT2 Human genes 0.000 description 1
- 102100027770 Histone-lysine N-methyltransferase KMT5B Human genes 0.000 description 1
- 102100027788 Histone-lysine N-methyltransferase KMT5C Human genes 0.000 description 1
- 102100029234 Histone-lysine N-methyltransferase NSD2 Human genes 0.000 description 1
- 102100029235 Histone-lysine N-methyltransferase NSD3 Human genes 0.000 description 1
- 102100024594 Histone-lysine N-methyltransferase PRDM16 Human genes 0.000 description 1
- 102100029129 Histone-lysine N-methyltransferase PRDM7 Human genes 0.000 description 1
- 102100029144 Histone-lysine N-methyltransferase PRDM9 Human genes 0.000 description 1
- 102100030095 Histone-lysine N-methyltransferase SETD1B Human genes 0.000 description 1
- 102100027711 Histone-lysine N-methyltransferase SETD5 Human genes 0.000 description 1
- 102100027704 Histone-lysine N-methyltransferase SETD7 Human genes 0.000 description 1
- 102100023696 Histone-lysine N-methyltransferase SETDB1 Human genes 0.000 description 1
- 102100023676 Histone-lysine N-methyltransferase SETDB2 Human genes 0.000 description 1
- 102100028998 Histone-lysine N-methyltransferase SUV39H1 Human genes 0.000 description 1
- 102100028988 Histone-lysine N-methyltransferase SUV39H2 Human genes 0.000 description 1
- 101000864666 Homo sapiens ATP-dependent RNA helicase DHX8 Proteins 0.000 description 1
- 101000901099 Homo sapiens Achaete-scute homolog 1 Proteins 0.000 description 1
- 101000654703 Homo sapiens Actin-histidine N-methyltransferase Proteins 0.000 description 1
- 101000903449 Homo sapiens Bestrophin-1 Proteins 0.000 description 1
- 101000957603 Homo sapiens Cleavage and polyadenylation specificity factor subunit 1 Proteins 0.000 description 1
- 101000910414 Homo sapiens Coiled-coil and C2 domain-containing protein 2A Proteins 0.000 description 1
- 101000888142 Homo sapiens Guanine nucleotide-binding protein G(t) subunit alpha-2 Proteins 0.000 description 1
- 101001046996 Homo sapiens Histone acetyltransferase KAT5 Proteins 0.000 description 1
- 101000944179 Homo sapiens Histone acetyltransferase KAT6A Proteins 0.000 description 1
- 101000944166 Homo sapiens Histone acetyltransferase KAT7 Proteins 0.000 description 1
- 101000944170 Homo sapiens Histone acetyltransferase KAT8 Proteins 0.000 description 1
- 101001035024 Homo sapiens Histone deacetylase 1 Proteins 0.000 description 1
- 101001035011 Homo sapiens Histone deacetylase 2 Proteins 0.000 description 1
- 101000899282 Homo sapiens Histone deacetylase 3 Proteins 0.000 description 1
- 101000899259 Homo sapiens Histone deacetylase 4 Proteins 0.000 description 1
- 101000899255 Homo sapiens Histone deacetylase 5 Proteins 0.000 description 1
- 101001032113 Homo sapiens Histone deacetylase 7 Proteins 0.000 description 1
- 101001032118 Homo sapiens Histone deacetylase 8 Proteins 0.000 description 1
- 101001032092 Homo sapiens Histone deacetylase 9 Proteins 0.000 description 1
- 101001008892 Homo sapiens Histone-lysine N-methyltransferase 2C Proteins 0.000 description 1
- 101000785963 Homo sapiens Histone-lysine N-methyltransferase ASH1L Proteins 0.000 description 1
- 101000877314 Homo sapiens Histone-lysine N-methyltransferase EHMT1 Proteins 0.000 description 1
- 101000877312 Homo sapiens Histone-lysine N-methyltransferase EHMT2 Proteins 0.000 description 1
- 101001028782 Homo sapiens Histone-lysine N-methyltransferase EZH1 Proteins 0.000 description 1
- 101001008821 Homo sapiens Histone-lysine N-methyltransferase KMT5B Proteins 0.000 description 1
- 101001008824 Homo sapiens Histone-lysine N-methyltransferase KMT5C Proteins 0.000 description 1
- 101000634048 Homo sapiens Histone-lysine N-methyltransferase NSD2 Proteins 0.000 description 1
- 101000634046 Homo sapiens Histone-lysine N-methyltransferase NSD3 Proteins 0.000 description 1
- 101000686942 Homo sapiens Histone-lysine N-methyltransferase PRDM16 Proteins 0.000 description 1
- 101001124898 Homo sapiens Histone-lysine N-methyltransferase PRDM7 Proteins 0.000 description 1
- 101001124887 Homo sapiens Histone-lysine N-methyltransferase PRDM9 Proteins 0.000 description 1
- 101000864672 Homo sapiens Histone-lysine N-methyltransferase SETD1B Proteins 0.000 description 1
- 101000654725 Homo sapiens Histone-lysine N-methyltransferase SETD2 Proteins 0.000 description 1
- 101000650669 Homo sapiens Histone-lysine N-methyltransferase SETD5 Proteins 0.000 description 1
- 101000650682 Homo sapiens Histone-lysine N-methyltransferase SETD7 Proteins 0.000 description 1
- 101000684609 Homo sapiens Histone-lysine N-methyltransferase SETDB1 Proteins 0.000 description 1
- 101000684615 Homo sapiens Histone-lysine N-methyltransferase SETDB2 Proteins 0.000 description 1
- 101000696705 Homo sapiens Histone-lysine N-methyltransferase SUV39H1 Proteins 0.000 description 1
- 101000696699 Homo sapiens Histone-lysine N-methyltransferase SUV39H2 Proteins 0.000 description 1
- 101001008896 Homo sapiens Inactive histone-lysine N-methyltransferase 2E Proteins 0.000 description 1
- 101001010724 Homo sapiens Intraflagellar transport protein 88 homolog Proteins 0.000 description 1
- 101100019690 Homo sapiens KAT6B gene Proteins 0.000 description 1
- 101000613629 Homo sapiens Lysine-specific demethylase 4B Proteins 0.000 description 1
- 101001088895 Homo sapiens Lysine-specific demethylase 4D Proteins 0.000 description 1
- 101001088883 Homo sapiens Lysine-specific demethylase 5B Proteins 0.000 description 1
- 101001088887 Homo sapiens Lysine-specific demethylase 5C Proteins 0.000 description 1
- 101001088879 Homo sapiens Lysine-specific demethylase 5D Proteins 0.000 description 1
- 101000653360 Homo sapiens Methylcytosine dioxygenase TET1 Proteins 0.000 description 1
- 101001008816 Homo sapiens N-lysine methyltransferase KMT5A Proteins 0.000 description 1
- 101000650674 Homo sapiens N-lysine methyltransferase SETD6 Proteins 0.000 description 1
- 101000602926 Homo sapiens Nuclear receptor coactivator 1 Proteins 0.000 description 1
- 101001123306 Homo sapiens PR domain zinc finger protein 10 Proteins 0.000 description 1
- 101001123302 Homo sapiens PR domain zinc finger protein 12 Proteins 0.000 description 1
- 101001123300 Homo sapiens PR domain zinc finger protein 13 Proteins 0.000 description 1
- 101001123298 Homo sapiens PR domain zinc finger protein 14 Proteins 0.000 description 1
- 101001123296 Homo sapiens PR domain zinc finger protein 15 Proteins 0.000 description 1
- 101000687340 Homo sapiens PR domain zinc finger protein 4 Proteins 0.000 description 1
- 101001124906 Homo sapiens PR domain zinc finger protein 5 Proteins 0.000 description 1
- 101001124900 Homo sapiens PR domain zinc finger protein 8 Proteins 0.000 description 1
- 101000738757 Homo sapiens Phosphatidylglycerophosphatase and protein-tyrosine phosphatase 1 Proteins 0.000 description 1
- 101001122801 Homo sapiens Pre-mRNA-processing factor 17 Proteins 0.000 description 1
- 101001125496 Homo sapiens Pre-mRNA-processing factor 19 Proteins 0.000 description 1
- 101001105692 Homo sapiens Pre-mRNA-processing factor 6 Proteins 0.000 description 1
- 101001105683 Homo sapiens Pre-mRNA-processing-splicing factor 8 Proteins 0.000 description 1
- 101001122792 Homo sapiens Pre-mRNA-splicing factor 18 Proteins 0.000 description 1
- 101000907912 Homo sapiens Pre-mRNA-splicing factor ATP-dependent RNA helicase DHX16 Proteins 0.000 description 1
- 101001122811 Homo sapiens Pre-mRNA-splicing factor ATP-dependent RNA helicase PRP16 Proteins 0.000 description 1
- 101000912686 Homo sapiens Probable ATP-dependent RNA helicase DDX23 Proteins 0.000 description 1
- 101000874142 Homo sapiens Probable ATP-dependent RNA helicase DDX46 Proteins 0.000 description 1
- 101000686031 Homo sapiens Proto-oncogene tyrosine-protein kinase ROS Proteins 0.000 description 1
- 101000651467 Homo sapiens Proto-oncogene tyrosine-protein kinase Src Proteins 0.000 description 1
- 101001124901 Homo sapiens Putative histone-lysine N-methyltransferase PRDM6 Proteins 0.000 description 1
- 101000755643 Homo sapiens RIMS-binding protein 2 Proteins 0.000 description 1
- 101000756365 Homo sapiens Retinol-binding protein 2 Proteins 0.000 description 1
- 101000829506 Homo sapiens Rhodopsin kinase GRK1 Proteins 0.000 description 1
- 101000670189 Homo sapiens Ribulose-phosphate 3-epimerase Proteins 0.000 description 1
- 101000650667 Homo sapiens SET domain-containing protein 4 Proteins 0.000 description 1
- 101100149326 Homo sapiens SETD2 gene Proteins 0.000 description 1
- 101000707546 Homo sapiens Splicing factor 3A subunit 1 Proteins 0.000 description 1
- 101000707561 Homo sapiens Splicing factor 3A subunit 2 Proteins 0.000 description 1
- 101000707569 Homo sapiens Splicing factor 3A subunit 3 Proteins 0.000 description 1
- 101000707567 Homo sapiens Splicing factor 3B subunit 1 Proteins 0.000 description 1
- 101000596093 Homo sapiens Transcription initiation factor TFIID subunit 1 Proteins 0.000 description 1
- 101000836339 Homo sapiens Transposon Hsmar1 transposase Proteins 0.000 description 1
- 101000577737 Homo sapiens U4/U6 small nuclear ribonucleoprotein Prp4 Proteins 0.000 description 1
- 101001104102 Homo sapiens X-linked retinitis pigmentosa GTPase regulator Proteins 0.000 description 1
- 101000818735 Homo sapiens Zinc finger protein 10 Proteins 0.000 description 1
- 208000023105 Huntington disease Diseases 0.000 description 1
- 108010003272 Hyaluronate lyase Proteins 0.000 description 1
- 102000001974 Hyaluronidases Human genes 0.000 description 1
- 208000000563 Hyperlipoproteinemia Type II Diseases 0.000 description 1
- 241000411974 Ilyobacter polytropus Species 0.000 description 1
- 102100027767 Inactive histone-lysine N-methyltransferase 2E Human genes 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 102100030007 Intraflagellar transport protein 88 homolog Human genes 0.000 description 1
- 241001430080 Ktedonobacter racemifer Species 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- 108010001831 LDL receptors Proteins 0.000 description 1
- 241000186842 Lactobacillus coryniformis Species 0.000 description 1
- 241000186673 Lactobacillus delbrueckii Species 0.000 description 1
- 241000186606 Lactobacillus gasseri Species 0.000 description 1
- 241000218588 Lactobacillus rhamnosus Species 0.000 description 1
- 241000186869 Lactobacillus salivarius Species 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 241000589242 Legionella pneumophila Species 0.000 description 1
- 241000270322 Lepidosauria Species 0.000 description 1
- 241000029603 Leptotrichia shahii Species 0.000 description 1
- 208000009625 Lesch-Nyhan syndrome Diseases 0.000 description 1
- 102000003752 Lipocalin 1 Human genes 0.000 description 1
- 108010057281 Lipocalin 1 Proteins 0.000 description 1
- 241000186805 Listeria innocua Species 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- 241000195947 Lycopodium Species 0.000 description 1
- 241001134698 Lyngbya Species 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 102100040860 Lysine-specific demethylase 4B Human genes 0.000 description 1
- 102100033231 Lysine-specific demethylase 4D Human genes 0.000 description 1
- 102100033246 Lysine-specific demethylase 5A Human genes 0.000 description 1
- 102100033247 Lysine-specific demethylase 5B Human genes 0.000 description 1
- 102100033249 Lysine-specific demethylase 5C Human genes 0.000 description 1
- 102100033143 Lysine-specific demethylase 5D Human genes 0.000 description 1
- 239000007993 MOPS buffer Substances 0.000 description 1
- 241000218922 Magnoliophyta Species 0.000 description 1
- 240000003183 Manihot esculenta Species 0.000 description 1
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 1
- 241000196323 Marchantiophyta Species 0.000 description 1
- 241000501784 Marinobacter sp. Species 0.000 description 1
- 206010027145 Melanocytic naevus Diseases 0.000 description 1
- 208000024556 Mendelian disease Diseases 0.000 description 1
- 241000204637 Methanohalobium evestigatum Species 0.000 description 1
- 102100030819 Methylcytosine dioxygenase TET1 Human genes 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- 241000192710 Microcystis aeruginosa Species 0.000 description 1
- 241000190928 Microscilla marina Species 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 101000969137 Mus musculus Metallothionein-1 Proteins 0.000 description 1
- 101100494762 Mus musculus Nedd9 gene Proteins 0.000 description 1
- 208000023178 Musculoskeletal disease Diseases 0.000 description 1
- 241001148552 Mycoplasma canis Species 0.000 description 1
- 241000204022 Mycoplasma gallisepticum Species 0.000 description 1
- 241000202964 Mycoplasma mobile Species 0.000 description 1
- 241001148556 Mycoplasma ovipneumoniae Species 0.000 description 1
- 241000202942 Mycoplasma synoviae Species 0.000 description 1
- KWYHDKDOAIKMQN-UHFFFAOYSA-N N,N,N',N'-tetramethylethylenediamine Chemical compound CN(C)CCN(C)C KWYHDKDOAIKMQN-UHFFFAOYSA-N 0.000 description 1
- LZHSWRWIMQRTOP-UHFFFAOYSA-N N-(furan-2-ylmethyl)-3-[4-[methyl(propyl)amino]-6-(trifluoromethyl)pyrimidin-2-yl]sulfanylpropanamide Chemical compound CCCN(C)C1=NC(=NC(=C1)C(F)(F)F)SCCC(=O)NCC2=CC=CO2 LZHSWRWIMQRTOP-UHFFFAOYSA-N 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- 102100027771 N-lysine methyltransferase KMT5A Human genes 0.000 description 1
- 102100027709 N-lysine methyltransferase SETD6 Human genes 0.000 description 1
- 102100031455 NAD-dependent protein deacetylase sirtuin-1 Human genes 0.000 description 1
- 102100022913 NAD-dependent protein deacetylase sirtuin-2 Human genes 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 108091061960 Naked DNA Proteins 0.000 description 1
- 241001250129 Nannochloropsis gaditana Species 0.000 description 1
- 241000167285 Natranaerobius thermophilus Species 0.000 description 1
- 241000588650 Neisseria meningitidis Species 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 101100385413 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) csm-3 gene Proteins 0.000 description 1
- 208000007256 Nevus Diseases 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 241000135933 Nitratifractor salsuginis Species 0.000 description 1
- 241000605156 Nitrobacter hamburgensis Species 0.000 description 1
- 241000919925 Nitrosococcus halophilus Species 0.000 description 1
- 241001515112 Nitrosococcus watsonii Species 0.000 description 1
- 241000203619 Nocardiopsis dassonvillei Species 0.000 description 1
- 241001223105 Nodularia spumigena Species 0.000 description 1
- 241000192673 Nostoc sp. Species 0.000 description 1
- 108090001145 Nuclear Receptor Coactivator 3 Proteins 0.000 description 1
- 102100022883 Nuclear receptor coactivator 3 Human genes 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 241000385061 Oenococcus kitaharae Species 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000927555 Olsenella uli Species 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 241000192520 Oscillatoria sp. Species 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 102100024894 PR domain zinc finger protein 1 Human genes 0.000 description 1
- 102100028955 PR domain zinc finger protein 10 Human genes 0.000 description 1
- 102100028958 PR domain zinc finger protein 12 Human genes 0.000 description 1
- 102100028973 PR domain zinc finger protein 13 Human genes 0.000 description 1
- 102100028974 PR domain zinc finger protein 14 Human genes 0.000 description 1
- 102100028975 PR domain zinc finger protein 15 Human genes 0.000 description 1
- 102100024890 PR domain zinc finger protein 4 Human genes 0.000 description 1
- 102100029132 PR domain zinc finger protein 5 Human genes 0.000 description 1
- 102100029128 PR domain zinc finger protein 8 Human genes 0.000 description 1
- 241000260425 Parasutterella excrementihominis Species 0.000 description 1
- 241001386755 Parvibaculum lavamentivorans Species 0.000 description 1
- 241000606856 Pasteurella multocida Species 0.000 description 1
- 241000142651 Pelotomaculum thermopropionicum Species 0.000 description 1
- 108010047320 Pepsinogen A Proteins 0.000 description 1
- 108010077524 Peptide Elongation Factor 1 Proteins 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- 241000374256 Peptoniphilus duerdenii Species 0.000 description 1
- 241000983938 Petrotoga mobilis Species 0.000 description 1
- 241000286209 Phasianidae Species 0.000 description 1
- 102100028251 Phosphoglycerate kinase 1 Human genes 0.000 description 1
- 101710139464 Phosphoglycerate kinase 1 Proteins 0.000 description 1
- 101100533304 Plasmodium falciparum (isolate 3D7) SETVS gene Proteins 0.000 description 1
- 241001599925 Polaromonas naphthalenivorans Species 0.000 description 1
- 241001472610 Polaromonas sp. Species 0.000 description 1
- 108010009975 Positive Regulatory Domain I-Binding Factor 1 Proteins 0.000 description 1
- 102100028730 Pre-mRNA-processing factor 17 Human genes 0.000 description 1
- 102100029522 Pre-mRNA-processing factor 19 Human genes 0.000 description 1
- 102100021232 Pre-mRNA-processing factor 6 Human genes 0.000 description 1
- 102100021231 Pre-mRNA-processing-splicing factor 8 Human genes 0.000 description 1
- 102100028731 Pre-mRNA-splicing factor 18 Human genes 0.000 description 1
- 102100023390 Pre-mRNA-splicing factor ATP-dependent RNA helicase DHX16 Human genes 0.000 description 1
- 102100028729 Pre-mRNA-splicing factor ATP-dependent RNA helicase PRP16 Human genes 0.000 description 1
- 241001141020 Prevotella micans Species 0.000 description 1
- 241000605860 Prevotella ruminicola Species 0.000 description 1
- 102100026136 Probable ATP-dependent RNA helicase DDX23 Human genes 0.000 description 1
- 102100035725 Probable ATP-dependent RNA helicase DDX46 Human genes 0.000 description 1
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 1
- 108700040121 Protein Methyltransferases Proteins 0.000 description 1
- 102000055027 Protein Methyltransferases Human genes 0.000 description 1
- 101710188315 Protein X Proteins 0.000 description 1
- 102100023347 Proto-oncogene tyrosine-protein kinase ROS Human genes 0.000 description 1
- 241000590028 Pseudoalteromonas haloplanktis Species 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 102100029134 Putative histone-lysine N-methyltransferase PRDM6 Human genes 0.000 description 1
- 102000015097 RNA Splicing Factors Human genes 0.000 description 1
- 108010039259 RNA Splicing Factors Proteins 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 241001135508 Ralstonia syzygii Species 0.000 description 1
- 102000018120 Recombinases Human genes 0.000 description 1
- 108010091086 Recombinases Proteins 0.000 description 1
- 108090000783 Renin Proteins 0.000 description 1
- 102100028255 Renin Human genes 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 201000000582 Retinoblastoma Diseases 0.000 description 1
- 241000190950 Rhodopseudomonas palustris Species 0.000 description 1
- 102100023742 Rhodopsin kinase GRK1 Human genes 0.000 description 1
- 241000190984 Rhodospirillum rubrum Species 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102000004389 Ribonucleoproteins Human genes 0.000 description 1
- 108010081734 Ribonucleoproteins Proteins 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 108060007030 Ribulose-phosphate 3-epimerase Proteins 0.000 description 1
- 102100039270 Ribulose-phosphate 3-epimerase Human genes 0.000 description 1
- 241000398180 Roseburia intestinalis Species 0.000 description 1
- 241000192029 Ruminococcus albus Species 0.000 description 1
- 102100027707 SET domain-containing protein 4 Human genes 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 241000593524 Sargassum patens Species 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 101150117538 Set2 gene Proteins 0.000 description 1
- 101001010097 Shigella phage SfV Bactoprenol-linked glucose translocase Proteins 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 108010041191 Sirtuin 1 Proteins 0.000 description 1
- 108010041216 Sirtuin 2 Proteins 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 241001464874 Solobacterium moorei Species 0.000 description 1
- 241000639167 Sphaerochaeta globosa Species 0.000 description 1
- 102100031713 Splicing factor 3A subunit 1 Human genes 0.000 description 1
- 102100031712 Splicing factor 3A subunit 2 Human genes 0.000 description 1
- 102100031710 Splicing factor 3A subunit 3 Human genes 0.000 description 1
- 102100031711 Splicing factor 3B subunit 1 Human genes 0.000 description 1
- 241000794282 Staphylococcus pseudintermedius Species 0.000 description 1
- 241000194019 Streptococcus mutans Species 0.000 description 1
- 241000194022 Streptococcus sp. Species 0.000 description 1
- 241001518258 Streptomyces pristinaespiralis Species 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 241000123713 Sutterella wadsworthensis Species 0.000 description 1
- 241000192560 Synechococcus sp. Species 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- 208000022292 Tay-Sachs disease Diseases 0.000 description 1
- 208000002903 Thalassemia Diseases 0.000 description 1
- 241000206213 Thermosipho africanus Species 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 208000035317 Total hypoxanthine-guanine phosphoribosyl transferase deficiency Diseases 0.000 description 1
- 102100037116 Transcription elongation factor 1 homolog Human genes 0.000 description 1
- 108050004072 Transcription initiation factor TFIID subunit 1 Proteins 0.000 description 1
- 102000008579 Transposases Human genes 0.000 description 1
- 108010020764 Transposases Proteins 0.000 description 1
- 102100027172 Transposon Hsmar1 transposase Human genes 0.000 description 1
- 241000589892 Treponema denticola Species 0.000 description 1
- 241000078013 Trichormus variabilis Species 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- 206010045261 Type IIa hyperlipidaemia Diseases 0.000 description 1
- 102100028852 U4/U6 small nuclear ribonucleoprotein Prp4 Human genes 0.000 description 1
- 241001148134 Veillonella Species 0.000 description 1
- 241001447269 Verminephrobacter eiseniae Species 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 102220504024 Wilms tumor protein_K73R_mutation Human genes 0.000 description 1
- 208000018839 Wilson disease Diseases 0.000 description 1
- JCZSFCLRSONYLH-UHFFFAOYSA-N Wyosine Natural products N=1C(C)=CN(C(C=2N=C3)=O)C=1N(C)C=2N3C1OC(CO)C(O)C1O JCZSFCLRSONYLH-UHFFFAOYSA-N 0.000 description 1
- 102100040092 X-linked retinitis pigmentosa GTPase regulator Human genes 0.000 description 1
- 201000006083 Xeroderma Pigmentosum Diseases 0.000 description 1
- 101000771024 Zea mays DNA (cytosine-5)-methyltransferase 1 Proteins 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- 102100021112 Zinc finger protein 10 Human genes 0.000 description 1
- 241001673106 [Bacillus] selenitireducens Species 0.000 description 1
- 241001531188 [Eubacterium] rectale Species 0.000 description 1
- NOXMCJDDSWCSIE-DAGMQNCNSA-N [[(2R,3S,4R,5R)-5-(2-amino-4-oxo-3H-pyrrolo[2,3-d]pyrimidin-7-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound C1=2NC(N)=NC(=O)C=2C=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O NOXMCJDDSWCSIE-DAGMQNCNSA-N 0.000 description 1
- AZJLCKAEZFNJDI-DJLDLDEBSA-N [[(2r,3s,5r)-5-(4-aminopyrrolo[2,3-d]pyrimidin-7-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound C1=CC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 AZJLCKAEZFNJDI-DJLDLDEBSA-N 0.000 description 1
- AZRNEVJSOSKAOC-VPHBQDTQSA-N [[(2r,3s,5r)-5-[5-[(e)-3-[6-[5-[(3as,4s,6ar)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]pentanoylamino]hexanoylamino]prop-1-enyl]-2,4-dioxopyrimidin-1-yl]-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C(\C=C\CNC(=O)CCCCCNC(=O)CCCC[C@H]2[C@H]3NC(=O)N[C@H]3CS2)=C1 AZRNEVJSOSKAOC-VPHBQDTQSA-N 0.000 description 1
- PGAVKCOVUIYSFO-UHFFFAOYSA-N [[5-(2,4-dioxopyrimidin-1-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound OC1C(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 PGAVKCOVUIYSFO-UHFFFAOYSA-N 0.000 description 1
- ZXZIQGYRHQJWSY-NKWVEPMBSA-N [hydroxy-[[(2s,5r)-5-(6-oxo-3h-purin-9-yl)oxolan-2-yl]methoxy]phosphoryl] phosphono hydrogen phosphate Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(=O)O)CC[C@@H]1N1C(NC=NC2=O)=C2N=C1 ZXZIQGYRHQJWSY-NKWVEPMBSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 201000010275 acute porphyria Diseases 0.000 description 1
- 230000004721 adaptive immunity Effects 0.000 description 1
- 210000001789 adipocyte Anatomy 0.000 description 1
- 230000001919 adrenal effect Effects 0.000 description 1
- 210000004100 adrenal gland Anatomy 0.000 description 1
- 210000001132 alveolar macrophage Anatomy 0.000 description 1
- 210000000411 amacrine cell Anatomy 0.000 description 1
- 210000001053 ameloblast Anatomy 0.000 description 1
- 150000003862 amino acid derivatives Chemical class 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 238000002617 apheresis Methods 0.000 description 1
- 230000001640 apoptogenic effect Effects 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 210000004396 apud cell Anatomy 0.000 description 1
- 229940011019 arthrospira platensis Drugs 0.000 description 1
- 210000001130 astrocyte Anatomy 0.000 description 1
- 210000002453 autonomic neuron Anatomy 0.000 description 1
- 239000002363 auxin Substances 0.000 description 1
- 210000000979 axoneme Anatomy 0.000 description 1
- 210000004082 barrier epithelial cell Anatomy 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- 210000002947 bartholin's gland Anatomy 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 229940002008 bifidobacterium bifidum Drugs 0.000 description 1
- 229940009291 bifidobacterium longum Drugs 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 210000002449 bone cell Anatomy 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 210000000465 brunner gland Anatomy 0.000 description 1
- 239000006172 buffering agent Substances 0.000 description 1
- 210000002533 bulbourethral gland Anatomy 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 230000000747 cardiac effect Effects 0.000 description 1
- 210000004413 cardiac myocyte Anatomy 0.000 description 1
- 210000000845 cartilage Anatomy 0.000 description 1
- 210000003321 cartilage cell Anatomy 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 238000010822 cell death assay Methods 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 238000002659 cell therapy Methods 0.000 description 1
- 210000000250 cementoblast Anatomy 0.000 description 1
- 210000003793 centrosome Anatomy 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 210000004691 chief cell of stomach Anatomy 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 210000000254 ciliated cell Anatomy 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- HISOCSRUFLPKDE-KLXQUTNESA-N cmt-2 Chemical compound C1=CC=C2[C@](O)(C)C3CC4C(N(C)C)C(O)=C(C#N)C(=O)[C@@]4(O)C(O)=C3C(=O)C2=C1O HISOCSRUFLPKDE-KLXQUTNESA-N 0.000 description 1
- 230000008045 co-localization Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 210000002808 connective tissue Anatomy 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 210000000555 contractile cell Anatomy 0.000 description 1
- 230000008094 contradictory effect Effects 0.000 description 1
- 101150055601 cops2 gene Proteins 0.000 description 1
- OFEZSBMBBKLLBJ-BAJZRUMYSA-N cordycepin Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)C[C@H]1O OFEZSBMBBKLLBJ-BAJZRUMYSA-N 0.000 description 1
- OFEZSBMBBKLLBJ-UHFFFAOYSA-N cordycepine Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(CO)CC1O OFEZSBMBBKLLBJ-UHFFFAOYSA-N 0.000 description 1
- 239000011258 core-shell material Substances 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 210000004246 corpus luteum Anatomy 0.000 description 1
- 230000001054 cortical effect Effects 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000005549 deoxyribonucleoside Substances 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 239000005546 dideoxynucleotide Substances 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 102000038379 digestive enzymes Human genes 0.000 description 1
- 108091007734 digestive enzymes Proteins 0.000 description 1
- 210000002249 digestive system Anatomy 0.000 description 1
- ZPTBLXKRQACLCR-XVFCMESISA-N dihydrouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)CC1 ZPTBLXKRQACLCR-XVFCMESISA-N 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 239000002612 dispersion medium Substances 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 210000001198 duodenum Anatomy 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 210000001671 embryonic stem cell Anatomy 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 210000000750 endocrine system Anatomy 0.000 description 1
- 210000004696 endometrium Anatomy 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 210000002322 enterochromaffin cell Anatomy 0.000 description 1
- 210000004188 enterochromaffin-like cell Anatomy 0.000 description 1
- 210000003158 enteroendocrine cell Anatomy 0.000 description 1
- 239000002532 enzyme inhibitor Substances 0.000 description 1
- 229940125532 enzyme inhibitor Drugs 0.000 description 1
- 210000001339 epidermal cell Anatomy 0.000 description 1
- 210000005175 epidermal keratinocyte Anatomy 0.000 description 1
- 210000003426 epidermal langerhans cell Anatomy 0.000 description 1
- 238000012236 epigenome editing Methods 0.000 description 1
- LYCAIKOWRPUZTN-UHFFFAOYSA-N ethylene glycol Natural products OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 1
- 210000003499 exocrine gland Anatomy 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 208000030533 eye disease Diseases 0.000 description 1
- 210000000744 eyelid Anatomy 0.000 description 1
- 201000001386 familial hypercholesterolemia Diseases 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 210000004905 finger nail Anatomy 0.000 description 1
- 210000004904 fingernail bed Anatomy 0.000 description 1
- 238000000684 flow cytometry Methods 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 238000002509 fluorescent in situ hybridization Methods 0.000 description 1
- 210000001650 focal adhesion Anatomy 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000000574 ganglionic effect Effects 0.000 description 1
- 230000027119 gastric acid secretion Effects 0.000 description 1
- 210000002618 gastric chief cell Anatomy 0.000 description 1
- 238000003500 gene array Methods 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 210000004602 germ cell Anatomy 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 239000003163 gonadal steroid hormone Substances 0.000 description 1
- 210000003714 granulocyte Anatomy 0.000 description 1
- 210000003772 granulosa lutein cell Anatomy 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 208000019622 heart disease Diseases 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 208000014951 hematologic disease Diseases 0.000 description 1
- 208000018706 hematopoietic system disease Diseases 0.000 description 1
- 230000010224 hepatic metabolism Effects 0.000 description 1
- 208000033552 hepatic porphyria Diseases 0.000 description 1
- 208000006359 hepatoblastoma Diseases 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 208000002557 hidradenitis Diseases 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 210000002287 horizontal cell Anatomy 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 229960002773 hyaluronidase Drugs 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 238000012744 immunostaining Methods 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 238000002513 implantation Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000005462 in vivo assay Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- SEOVTRFCIGRIMH-UHFFFAOYSA-N indole-3-acetic acid Chemical compound C1=CC=C2C(CC(=O)O)=CNC2=C1 SEOVTRFCIGRIMH-UHFFFAOYSA-N 0.000 description 1
- 239000003701 inert diluent Substances 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 239000012212 insulator Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 210000002570 interstitial cell Anatomy 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 239000007951 isotonicity adjuster Substances 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 210000001865 kupffer cell Anatomy 0.000 description 1
- 210000004561 lacrimal apparatus Anatomy 0.000 description 1
- 230000001381 lactotroph Effects 0.000 description 1
- 229940115932 legionella pneumophila Drugs 0.000 description 1
- 210000003644 lens cell Anatomy 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 210000002332 leydig cell Anatomy 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 239000010687 lubricating oil Substances 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 210000003563 lymphoid tissue Anatomy 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 108010021853 m(5)C rRNA methyltransferase Proteins 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 240000004308 marijuana Species 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000012083 mass cytometry Methods 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 208000030159 metabolic disease Diseases 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 210000003632 microfilament Anatomy 0.000 description 1
- 210000000274 microglia Anatomy 0.000 description 1
- 230000002025 microglial effect Effects 0.000 description 1
- 210000000110 microvilli Anatomy 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 239000003226 mitogen Substances 0.000 description 1
- 239000003068 molecular probe Substances 0.000 description 1
- 125000004573 morpholin-4-yl group Chemical group N1(CCOCC1)* 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 210000000663 muscle cell Anatomy 0.000 description 1
- 210000000066 myeloid cell Anatomy 0.000 description 1
- 210000000107 myocyte Anatomy 0.000 description 1
- 210000000653 nervous system Anatomy 0.000 description 1
- 210000005155 neural progenitor cell Anatomy 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 210000001331 nose Anatomy 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 210000001915 nurse cell Anatomy 0.000 description 1
- 210000001706 olfactory mucosa Anatomy 0.000 description 1
- 210000000287 oocyte Anatomy 0.000 description 1
- 210000002380 oogonia Anatomy 0.000 description 1
- 210000001328 optic nerve Anatomy 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 210000002220 organoid Anatomy 0.000 description 1
- 210000000963 osteoblast Anatomy 0.000 description 1
- 210000004409 osteocyte Anatomy 0.000 description 1
- 210000004681 ovum Anatomy 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 210000000277 pancreatic duct Anatomy 0.000 description 1
- 230000000849 parathyroid Effects 0.000 description 1
- 210000002990 parathyroid gland Anatomy 0.000 description 1
- 229940051027 pasteurella multocida Drugs 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 210000003668 pericyte Anatomy 0.000 description 1
- 210000002856 peripheral neuron Anatomy 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 210000000608 photoreceptor cell Anatomy 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 210000004694 pigment cell Anatomy 0.000 description 1
- 210000001127 pigmented epithelial cell Anatomy 0.000 description 1
- 210000000793 pinealocyte Anatomy 0.000 description 1
- 210000004043 pneumocyte Anatomy 0.000 description 1
- 210000000557 podocyte Anatomy 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 235000012015 potatoes Nutrition 0.000 description 1
- 244000144977 poultry Species 0.000 description 1
- 235000013594 poultry meat Nutrition 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000001141 propulsive effect Effects 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 210000000512 proximal kidney tubule Anatomy 0.000 description 1
- 235000015136 pumpkin Nutrition 0.000 description 1
- 210000001747 pupil Anatomy 0.000 description 1
- 239000002096 quantum dot Substances 0.000 description 1
- QQXQGKSPIMGUIZ-AEZJAUAXSA-N queuosine Chemical compound C1=2C(=O)NC(N)=NC=2N([C@H]2[C@@H]([C@H](O)[C@@H](CO)O2)O)C=C1CN[C@H]1C=C[C@H](O)[C@@H]1O QQXQGKSPIMGUIZ-AEZJAUAXSA-N 0.000 description 1
- 108700022487 rRNA Genes Proteins 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 210000003289 regulatory T cell Anatomy 0.000 description 1
- 238000007634 remodeling Methods 0.000 description 1
- 238000009256 replacement therapy Methods 0.000 description 1
- 210000004994 reproductive system Anatomy 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 230000004243 retinal function Effects 0.000 description 1
- 210000003994 retinal ganglion cell Anatomy 0.000 description 1
- 239000002342 ribonucleoside Substances 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 210000003786 sclera Anatomy 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 210000001732 sebaceous gland Anatomy 0.000 description 1
- 210000002374 sebum Anatomy 0.000 description 1
- 230000001953 sensory effect Effects 0.000 description 1
- 210000000697 sensory organ Anatomy 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 210000003728 serous cell Anatomy 0.000 description 1
- 208000007056 sickle cell anemia Diseases 0.000 description 1
- 230000005783 single-strand break Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 210000002363 skeletal muscle cell Anatomy 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 210000004927 skin cell Anatomy 0.000 description 1
- 210000000813 small intestine Anatomy 0.000 description 1
- 210000001622 small lutein cell Anatomy 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 210000002325 somatostatin-secreting cell Anatomy 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 210000004336 spermatogonium Anatomy 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 210000004500 stellate cell Anatomy 0.000 description 1
- 210000000352 storage cell Anatomy 0.000 description 1
- 230000010741 sumoylation Effects 0.000 description 1
- 230000008093 supporting effect Effects 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000009182 swimming Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 210000001779 taste bud Anatomy 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 210000002435 tendon Anatomy 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- MPLHNVLQVRSVEE-UHFFFAOYSA-N texas red Chemical compound [O-]S(=O)(=O)C1=CC(S(Cl)(=O)=O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 MPLHNVLQVRSVEE-UHFFFAOYSA-N 0.000 description 1
- IBVCSSOEYUMRLC-GABYNLOESA-N texas red-5-dutp Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C(C#CCNS(=O)(=O)C=2C=C(C(C=3C4=CC=5CCCN6CCCC(C=56)=C4OC4=C5C6=[N+](CCC5)CCCC6=CC4=3)=CC=2)S([O-])(=O)=O)=C1 IBVCSSOEYUMRLC-GABYNLOESA-N 0.000 description 1
- 210000003684 theca cell Anatomy 0.000 description 1
- 230000008719 thickening Effects 0.000 description 1
- 239000002562 thickening agent Substances 0.000 description 1
- 125000003396 thiol group Chemical class [H]S* 0.000 description 1
- ANRHNWWPFJCPAZ-UHFFFAOYSA-M thionine Chemical compound [Cl-].C1=CC(N)=CC2=[S+]C3=CC(N)=CC=C3N=C21 ANRHNWWPFJCPAZ-UHFFFAOYSA-M 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 210000004906 toe nail Anatomy 0.000 description 1
- 210000000515 tooth Anatomy 0.000 description 1
- 210000003014 totipotent stem cell Anatomy 0.000 description 1
- 231100000440 toxicity profile Toxicity 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 238000003146 transient transfection Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 210000002014 trichocyte Anatomy 0.000 description 1
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 1
- 230000034512 ubiquitination Effects 0.000 description 1
- 238000010798 ubiquitination Methods 0.000 description 1
- 210000003932 urinary bladder Anatomy 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 239000000277 virosome Substances 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 210000004127 vitreous body Anatomy 0.000 description 1
- 210000001849 von ebner gland Anatomy 0.000 description 1
- JCZSFCLRSONYLH-QYVSTXNMSA-N wyosin Chemical compound N=1C(C)=CN(C(C=2N=C3)=O)C=1N(C)C=2N3[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O JCZSFCLRSONYLH-QYVSTXNMSA-N 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P27/00—Drugs for disorders of the senses
- A61P27/02—Ophthalmic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/0075—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the delivery route, e.g. oral, subcutaneous
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4702—Regulators; Modulating activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/06—Animal cells or tissues; Human cells or tissues
- C12N5/0602—Vertebrate cells
- C12N5/0618—Cells of the nervous system
- C12N5/0621—Eye cells, e.g. cornea, iris pigmented cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2320/00—Applications; Uses
- C12N2320/10—Applications; Uses in screening processes
- C12N2320/11—Applications; Uses in screening processes for the determination of target sites, i.e. of active nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2330/00—Production
- C12N2330/50—Biochemical production, i.e. in a transformed host cell
- C12N2330/51—Specially adapted vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2506/00—Differentiation of animal cells from one lineage to another; Differentiation of pluripotent cells
- C12N2506/45—Differentiation of animal cells from one lineage to another; Differentiation of pluripotent cells from artificially induced pluripotent stem cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2510/00—Genetically modified cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14141—Use of virus, viral particle or viral elements as a vector
- C12N2750/14143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B40/00—Libraries per se, e.g. arrays, mixtures
- C40B40/04—Libraries containing only organic compounds
- C40B40/06—Libraries containing nucleotides or polynucleotides, or derivatives thereof
Definitions
- Aberrant expression of one or more genes can lead to a disease or a condition in a subject.
- aberrant expression of an enzyme regulator e.g., an enzyme inhibitor, such as a protease inhibitor
- an enzyme regulator e.g., an enzyme inhibitor, such as a protease inhibitor
- the aberrant expression can be due to one or more hereditary genetic mutations in a gene encoding the enzyme regulator.
- mutation of PRPF31 can lead to retinitis pigmentosa or autosomal congenital blindness in a subj ect.
- Modifying aberrant expression of a mutant allele (e.g., a disease-causing allele) in a cell may not be sufficient to treat or cure a disease that is manifested by the aberrant expression of the mutant allele.
- a non-disease causing allele e.g., a wild-type allele
- the present disclosure provides a system comprising: a heterologous polypeptide comprising an actuator moiety, wherein the actuator moiety is for binding an endogenous target gene encoding PRPF31 in a cell, to increase expression level of the PRPF31 in the cell, wherein: (i) the actuator moiety substantially lacks DNA cleavage activity; and/or (ii) the actuator moiety is coupled to a transcriptional activator.
- the present disclosure provides one or more polynucleotides encoding any of the system provided herein.
- the present disclosure provides a method comprising administrating any of the system or the one or more polynucleotides provided herein to a subject in need thereof.
- the present disclosure provides a method comprising: increasing expression level of an endogenous target gene encoding PRPF31 in a cell, via binding of a heterologous polypeptide comprising an actuator moiety to bind the endogenous target gene, wherein: (a) the actuator moiety substantially lacks DNA cleavage activity; and/or (b) the actuator moiety is coupled to a transcriptional activator.
- FIG. 1 illustrates an exemplary construct encoding the dCas and the actuator moiety (effector).
- Promoter photoreceptor specific or ubiquitous promoter
- dCas small dead Cas molecule such as dCasMini or equivalent effector: effector to activate expression such as VPR or equivalent thereof
- Pr Promoter for gRNA such as Hl or U6 or equivalent thereof
- gRNA gRNA targeting the endogenous target gene comprising PRPF31.
- FIG. 2 illustrates a schematic for treating retinitis pigmentosa 11 (RP11) with the system described herein.
- AAV can be engineered to deliver an exemplary construct via subretinal or intravitreal injection to a subject in need thereof, where the expression of the exemplary construct can activate expression of PRPF31 encoded by the heterologous CDS of the construct.
- FIG. 3 illustrates exemplary genomic loci NM_015629 (ENST00000419967.5) and (ENST00000321030.9) transcripts that can be targeted by the gRNA of the system and the method described herein.
- FIGs. 4A-4F schematically illustrate example vectors encoding the system of the present disclosure.
- FIG. 5A schematically illustrates an example experimental procedure to screen for target endogenous polynucleotide sequences to modulate expression ofPRPF31
- FIG. 5B shows change in endogenous PRPF31 by different guide nucleic acid molecules.
- FIG. 6A shows a fluorescent image of normal retinal pigment epithelium (RPE) cells
- FIG. 6B shows a fluorescent image of diseased RPE cells.
- the term “about” or “approximately” generally mean within an acceptable error range for the particular value as determined by one of ordinary skill in the art, which will depend in part on how the value is measured or determined, i.e., the limitations of the measurement system. For example, “about” can mean within 1 or more than 1 standard deviation, per the practice in the art. Alternatively, “about” can mean a range of up to 20%, up to 10%, up to 5%, or up to 1% of a given value. Alternatively, particularly with respect to biological systems or processes, the term can mean within an order of magnitude, preferably within 5-fold, and more preferably within 2- fold, of a value. Where particular values are described in the application and claims, unless otherwise stated, the term “about” meaning within an acceptable error range for the particular value should be assumed.
- cell generally refers to a biological cell.
- a cell can be the basic structural, functional and/or biological unit of a living organism.
- a cell can originate from any organism having one or more cells. Some non-limiting examples include: a prokaryotic cell, eukaryotic cell, a bacterial cell, an archaeal cell, a cell of a single-cell eukaryotic organism, a protozoa cell, a cell from a plant (e.g.
- algal cells from plant crops, fruits, vegetables, grains, soy bean, corn, maize, wheat, seeds, tomatoes, rice, cassava, sugarcane, pumpkin, hay, potatoes, cotton, cannabis, tobacco, flowering plants, conifers, gymnosperms, fems, clubmosses, hornworts, liverworts, mosses), an algal cell, (e.g., Botryococcus braunii, Chlamydomonas reinhardtii, Nannochloropsis gaditana, Chlorella pyrenoidosa, Sargassum patens C. Agardh, and the like), seaweeds (e.g.
- a fungal cell e.g., a yeast cell, a cell from a mushroom
- an animal cell e.g. fruit fly, cnidarian, echinoderm, nematode, etc.
- a cell from a vertebrate animal e.g., fish, amphibian, reptile, bird, mammal
- a cell from a mammal e.g., a pig, a cow, a goat, a sheep, a rodent, a rat, a mouse, a non-human primate, a human, etc.
- a cell is not originating from a natural organism (e.g. a cell can be a synthetically made, sometimes termed an artificial cell).
- nucleotide generally refers to a base-sugar-phosphate combination.
- a nucleotide can comprise a synthetic nucleotide.
- a nucleotide can comprise a synthetic nucleotide analog.
- Nucleotides can be monomeric units of a nucleic acid sequence (e.g. deoxyribonucleic acid (DNA) and ribonucleic acid (RNA)).
- nucleotide can include ribonucleoside triphosphates adenosine triphosphate (ATP), uridine triphosphate (UTP), cytosine triphosphate (CTP), guanosine triphosphate (GTP) and deoxyribonucleoside triphosphates such as dATP, dCTP, diTP, dUTP, dGTP, dTTP, or derivatives thereof.
- ATP ribonucleoside triphosphates adenosine triphosphate
- UDP uridine triphosphate
- CTP cytosine triphosphate
- GTP guanosine triphosphate
- deoxyribonucleoside triphosphates such as dATP, dCTP, diTP, dUTP, dGTP, dTTP, or derivatives thereof.
- derivatives can include, for example, [aS]dATP, 7-deaza-dGTP and 7-deaza-dATP, and nucleot
- nucleotide as used herein can refer to dideoxyribonucleoside triphosphates (ddNTPs) and their derivatives.
- ddNTPs dideoxyribonucleoside triphosphates
- Illustrative examples of dideoxyribonucleoside triphosphates can include, but are not limited to, ddATP, ddCTP, ddGTP, ddITP, and ddTTP.
- a nucleotide may be unlabeled or detectably labeled by well-known techniques. Labeling can also be carried out with quantum dots.
- Detectable labels can include, for example, radioactive isotopes, fluorescent labels, chemiluminescent labels, bioluminescent labels and enzyme labels.
- Fluorescent labels of nucleotides may include but are not limited fluorescein, 5-carboxyfluorescein (FAM), 2'7'-dimethoxy-4'5-dichloro-6- carboxyfluorescein (JOE), rhodamine, 6-carboxyrhodamine (R6G), N,N,N',N'-tetramethyl-6- carboxyrhodamine (TAMRA), 6-carboxy-X-rhodamine (ROX), 4-(4 'dimethylaminophenylazo) benzoic acid (DABCYL), Cascade Blue, Oregon Green, Texas Red, Cyanine and 5-(2'- aminoethyl)aminonaphthalene-l -sulfonic acid (EDANS).
- FAM 5-carboxyfluorescein
- JE 2'7'-dimethoxy-4'5-dichloro-6- carboxyfluorescein
- rhodamine 6-carboxyrh
- fluorescently labeled nucleotides can include [R6G]dUTP, [TAMRA]dUTP, [R110]dCTP, [R6G] dCTP, [TAMRA] dCTP, [JOE] ddATP, [R6G] ddATP, [FAM] ddCTP, [R110]ddCTP, [TAMRA]ddGTP, [ROX]ddTTP, [dR6G]ddATP, [dRl 10]ddCTP, [dTAMRA]ddGTP, and [dROX]ddTTP available from Perkin Elmer, Foster City, Calif.
- Nucleotides can also be labeled or marked by chemical modification.
- a chemically-modified single nucleotide can be biotin-dNTP.
- biotinylated dNTPs can include, biotin-dATP (e.g., bio-N6-ddATP, biotin- 14-dATP), biotin- dCTP (e.g., biotin- 11 -dCTP, biotin-14-dCTP), and biotin-dUTP (e.g. biotin- 11-dUTP, biotin-16- dUTP, biotin-20-dUTP).
- polynucleotide generally refers to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof, either in single-, double-, or multistranded form.
- a polynucleotide can be exogenous or endogenous to a cell.
- a polynucleotide can exist in a cell-free environment.
- a polynucleotide can be a gene or fragment thereof.
- a polynucleotide can be DNA.
- a polynucleotide can be RNA.
- a polynucleotide can have any three dimensional structure, and can perform any function, known or unknown.
- a polynucleotide can comprise one or more analogs (e.g. altered backbone, sugar, or nucleobase). If present, modifications to the nucleotide structure can be imparted before or after assembly of the polymer.
- analogs include: 5-bromouracil, peptide nucleic acid, xeno nucleic acid, morpholinos, locked nucleic acids, glycol nucleic acids, threose nucleic acids, dideoxynucleotides, cordycepin, 7-deaza-GTP, florophores (e g.
- thiol containing nucleotides thiol containing nucleotides, biotin linked nucleotides, fluorescent base analogs, CpG islands, methyl-7-guanosine, methylated nucleotides, inosine, thiouridine, pseudourdine, dihydrouridine, queuosine, and wyosine.
- Non-limiting examples of polynucleotides include coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA (tRNA), ribosomal RNA (rRNA), short interfering RNA (siRNA), short-hairpin RNA (shRNA), micro-RNA (miRNA), ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, cell-free polynucleotides including cell-free DNA (cfDNA) and cell-free RNA (cfRNA), nucleic acid probes, and primers.
- the sequence of nucleotides can be interrupted by non-nucleotide components.
- sequence identity generally refers to an exact nucleotide-to-nucleotide or amino acid-to-amino acid correspondence of two polynucleotides or polypeptide sequences, respectively.
- techniques for determining sequence identity include determining the nucleotide sequence of a polynucleotide and/or determining the amino acid sequence encoded thereby, and comparing these sequences to a second nucleotide or amino acid sequence.
- Two or more sequences can be compared by determining their “percent identity .”
- the percent identity of two sequences, whether nucleic acid or amino acid sequences is the number of exact matches between two aligned sequences divided by the length of the longer sequence and multiplied by 100. Percent identity may also be determined, for example, by comparing sequence information using the advanced BLAST computer program, including version 2.2.9, available from the National Institutes of Health.
- the BLAST program is based on the alignment method of Karlin and Altschul, Proc. Natl. Acad. Sci. USA, 87:2264-2268 (1990) and as discussed in Altschul, et al., J. Mol.
- the program may be used to determine percent identity over the entire length of the proteins being compared. Default parameters are provided to optimize searches with short query sequences in, for example, with the blastp program.
- the program also allows use of an SEG filter to mask-off segments of the query sequences as determined by the SEG program of Wootton and Federhen, Computers and Chemistry 17: 149-163 (1993). Ranges of desired degrees of sequence identity are approximately 50% to 100% and integer values therebetween.
- this disclosure encompasses sequences with at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 98% sequence identity with any sequence provided herein.
- the term “gene” generally refers to a nucleic acid (e g., DNA such as genomic DNA and cDNA) and its corresponding nucleotide sequence that is involved in encoding an RNA transcript.
- genomic DNA includes intervening, noncoding regions as well as regulatory regions and can include 5' and 3' ends.
- the term encompasses the transcribed sequences, including 5' and 3' untranslated regions (5'-UTR and 3'-UTR), exons and introns.
- the transcribed region will contain “open reading frames” that encode polypeptides.
- a “gene” comprises only the coding sequences (e g., an “open reading frame” or “coding region”) necessary for encoding a polypeptide.
- genes do not encode a polypeptide, for example, ribosomal RNA genes (rRNA) and transfer RNA (tRNA) genes.
- rRNA ribosomal RNA genes
- tRNA transfer RNA
- the term “gene” includes not only the transcribed sequences, but in addition, also includes non-transcribed regions including upstream and downstream regulatory regions, enhancers and promoters.
- a gene can refer to a portion of the gene that is near or adjacent to a transcription start site (TSS) of the gene.
- TSS transcription start site
- the gene (e.g., that is targeted as disclosed herein) can be at least or up to about 2,000 nucleobases, at least or up to about 1,800 nucleobases, at least or up to about 1,600 nucleobases, at least or up to about 1,500 nucleobases, at least or up to about 1,400 nucleobases, at least or up to about 1,200 nucleobases, at least or up to about 1,000 nucleobases, at least or up to about 900 nucleobases, at least or up to about 800 nucleobases, at least or up to about 700 nucleobases, at least or up to about 600 nucleobases, at least or up to about 500 nucleobases, at least or up to about 400 nucleobases, at least or up to about 300 nucleobases, at least or up to about 200 nucleobases, at least or up to about 100 nucleobases, or at least or up to about 50 nucleobases away from the TSS of the gene
- a gene can refer to an “endogenous gene” or a native gene in its natural location in the genome of an organism.
- a gene can refer to an “exogenous gene” or a non-native gene.
- a nonnative gene can refer to a gene not normally found in the host organism but which is introduced into the host organism by gene transfer.
- a non-native gene can also refer to a gene not in its natural location in the genome of an organism.
- a non-native gene can also refer to a naturally occurring nucleic acid or polypeptide sequence that comprises mutations, insertions and/or deletions (e.g., non-native sequence).
- the term “expression” generally refers to one or more processes by which a polynucleotide is transcribed from a DNA template (such as into an mRNA or other RNA transcript) and/or the process by which a transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins.
- Transcripts and encoded polypeptides can be collectively referred to as “gene product.” If the polynucleotide is derived from genomic DNA, expression can include splicing of the mRNA in a eukaryotic cell.
- Up-regulated generally refers to an increased expression level of a polynucleotide (e.g., RNA such as mRNA) and/or polypeptide sequence relative to its expression level in a wild-type state while “down-regulated” generally refers to a decreased expression level of a polynucleotide (e.g., RNA such as mRNA) and/or polypeptide sequence relative to its expression in a wild-type state.
- Expression of a transfected gene can occur transiently or stably in a cell. During “transient expression” the transfected gene is not transferred to the daughter cell during cell division. Since its expression is restricted to the transfected cell, expression of the gene is lost over time.
- stable expression of a transfected gene can occur when the gene is co-transfected with another gene that confers a selection advantage to the transfected cell.
- a selection advantage may be a resistance towards a certain toxin that is presented to the cell.
- expression profile generally refers to quantitative (e.g., abundance) and qualitative expression of one or more genes in a sample (e.g., a cell).
- the one or more genes can be expressed and ascertained in the form of a nucleic acid molecule (e.g., an mRNA or other RNA transcript).
- the one or more genes can be expressed and ascertained in the form of a polypeptide (e.g., a protein measured via Western blot).
- An expression profile of a gene may be defined as a shape of an expression level of the gene over a time period (e.g., at least or up to about 1 hour, at least or up to about 2 hours, at least or up to about 3 hours, at least or up to about 4 hours, at least or up to about 5 hours, at least or up to about 6 hours, at least or up to about 7 hours, at least or up to about 8 hours, at least or up to about 9 hours, at least or up to about 10 hours, at least or up to about 11 hours, at least or up to about 12 hours, at least or up to about 16 hours, at least or up to about 18 hours, at least or up to about 24 hours, at least or up to about 36 hours, at least or up to about 48 hours, at least up to about 3 days, at least up to about 4 days, at least up to about 5 days, at least up to about 6 days, at least up to about 7 days, at least up to about 8 days, at least up to about 9 days, at least up to about 10 days, at least up to about
- an expression profile of a gene may be defined as an expression level of the gene at a time point of interest (e.g., the expression level of the gene measured at least or up to about 1 hour, at least or up to about 2 hours, at least or up to about 3 hours, at least or up to about 4 hours, at least or up to about 5 hours, at least or up to about 6 hours, at least or up to about 7 hours, at least or up to about 8 hours, at least or up to about 9 hours, at least or up to about 10 hours, at least or up to about 11 hours, at least or up to about 12 hours, at least or up to about 16 hours, at least or up to about 18 hours, at least or up to about 24 hours, at least or up to about 36 hours, at least or up to about 48 hours, at least up to about 3 days, at least up to about 4 days, at least up to about 5 days, at least up to about 6 days, at least up to about 7 days, at least up to about 8 days, at least up to about 9 days, at least up to about 10
- polymer does not connote a specific length of polymer, nor is it intended to imply or distinguish whether the peptide is produced using recombinant techniques, chemical or enzymatic synthesis, or is naturally occurring.
- the terms apply to naturally occurring amino acid polymers as well as amino acid polymers comprising at least one modified amino acid.
- the polymer can be interrupted by non-amino acids.
- the terms include amino acid chains of any length, including full length proteins, and proteins with or without secondary and/or tertiary structure (e.g., domains).
- amino acid polymer that has been modified, for example, by disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, oxidation, and any other manipulation such as conjugation with a labeling component.
- amino acid and amino acids generally refer to natural and non-natural amino acids, including, but not limited to, modified amino acids and amino acid analogues.
- Modified amino acids can include natural amino acids and non-natural amino acids, which have been chemically modified to include a group or a chemical moiety not naturally present on the amino acid.
- Amino acid analogues can refer to amino acid derivatives.
- amino acid includes both D- amino acids and L-amino acids.
- derivative generally refers to a polypeptide related to a wild type polypeptide, for example either by amino acid sequence, structure (e.g., secondary and/or tertiary), activity (e g., enzymatic activity) and/or function.
- Derivatives, variants and fragments of a polypeptide can comprise one or more amino acid variations (e.g., mutations, insertions, and deletions), truncations, modifications, or combinations thereof compared to a wild type polypeptide.
- polypeptide molecule e.g., a protein
- engineered generally refers to a polypeptide molecule having a heterologous amino acid sequence or an altered amino acid sequence as a result of the application of genetic engineering techniques to nucleic acids which encode the polypeptide molecule, as well as cells or organisms which express the polypeptide molecule.
- engineered or “recombinant,” as used herein with respect to a polynucleotide molecule (e.g., a DNA or RNA molecule), generally refers to a polynucleotide molecule having a heterologous nucleic acid sequence or an altered nucleic acid sequence as a result of the application of genetic engineering techniques. Genetic engineering techniques include, but are not limited to, PCR and DNA cloning technologies; transfection, transformation and other gene transfer technologies, homologous recombination; site-directed mutagenesis; and gene fusion. In some cases, an engineered or recombinant polynucleotide (e.g., a genomic DNA sequence) can be modified or altered by a gene editing moiety.
- Genetic engineering techniques include, but are not limited to, PCR and DNA cloning technologies; transfection, transformation and other gene transfer technologies, homologous recombination; site-directed mutagenesis; and gene fusion.
- engineered and “modified” are used interchangeably herein.
- engineing and “modifying” are used interchangeably herein.
- engineered cell or “modified cell” are used interchangeably herein.
- engineered characteristic and “modified characteristic” are used interchangeably herein.
- the term “enhanced expression,” “increased expression,” or “upregulated expression” generally refers to production of a moiety of interest (e.g., a polynucleotide or a polypeptide) to a level that is above a normal level of expression of the moiety of interest in a host strain (e.g., a host cell).
- the normal level of expression can be substantially zero (or null) or higher than zero.
- the moiety of interest can comprise an endogenous gene or polypeptide construct of the host strain.
- the moiety of interest can comprise a heterologous gene or polypeptide construct that is introduced to or into the host strain.
- a heterologous gene encoding a polypeptide of interest can be knocked-in (KI) to a genome of the host strain for enhanced expression of the polypeptide of interest in the host strain.
- the term “enhanced activity,” “increased activity,” or “upregulated activity” generally refers to activity of a moiety of interest (e.g., a polynucleotide or a polypeptide) that is modified to a level that is above a normal level of activity of the moiety of interest in a host strain (e.g., a host cell).
- the normal level of activity can be substantially zero (or null) or higher than zero.
- the moiety of interest can comprise a polypeptide construct of the host strain.
- the moiety of interest can comprise a heterologous polypeptide construct that is introduced to or into the host strain.
- a heterologous gene encoding a polypeptide of interest can be knocked-in (KI) to a genome of the host strain for enhanced activity of the polypeptide of interest in the host strain.
- the term “reduced expression,” “decreased expression,” or “downregulated expression” generally refers to a production of a moiety of interest (e g., a polynucleotide or a polypeptide) to a level that is below a normal level of expression of the moiety of interest in a host strain (e.g., a host cell). The normal level of expression is higher than zero.
- the moiety of interest can comprise an endogenous gene or polypeptide construct of the host strain.
- the moiety of interest can be knocked-out or knocked-down in the host strain.
- reduced expression of the moiety of interest can include a complete inhibition of such expression in the host strain.
- the term “reduced activity,” “decreased activity,” or “downregulated activity” generally refers to activity of a moiety of interest (e g., a polynucleotide or a polypeptide) that is modified to a level that is below a normal level of activity of the moiety of interest in a host strain (e.g., a host cell). The normal level of activity is higher than zero.
- the moiety of interest can comprise an endogenous gene or polypeptide construct of the host strain.
- the moiety of interest can be knocked-out or knocked-down in the host strain.
- reduced activity of the moiety of interest can include a complete inhibition of such activity in the host strain.
- the term “subject,” “individual,” or “patient,” as used interchangeably herein, generally refers to a vertebrate, preferably a mammal such as a human. Mammals include, but are not limited to, murines, simians, humans, farm animals, sport animals, and pets. Tissues, cells and their progeny of a biological entity obtained in vivo or cultured in vitro are also encompassed.
- treatment or “treating” generally refers to an approach for obtaining beneficial or desired results including but not limited to a therapeutic benefit and/or a prophylactic benefit.
- a treatment can comprise administering a system or cell population disclosed herein.
- composition can be administered to a subject at risk of developing a particular disease, condition, or symptom, or to a subject reporting one or more of the physiological symptoms of a disease, even though the disease, condition, or symptom may not have yet been manifested.
- the term “effective amount” or “therapeutically effective amount” generally refers to the quantity of a composition, for example a composition comprising heterologous polypeptides, heterologous polynucleotides, and/or modified cells (e.g., modified stem cells), that is sufficient to result in a desired activity upon administration to a subject in need thereof.
- therapeutically effective generally refers to that quantity of a composition that is sufficient to delay the manifestation, arrest the progression, relieve or alleviate at least one symptom of a disorder treated by the methods of the present disclosure.
- the target gene is encoded from a heterologous polynucleotide described herein.
- the target gene encoded by the heterologous polynucleotide is a non-disease causing allele.
- the present disclosure provides a system comprising: a heterologous polypeptide comprising an actuator moiety, wherein the actuator moiety is for binding an endogenous target gene encoding PRPF31 in a cell, to increase expression level of the PRPF31 in the cell, wherein: the actuator moiety substantially lacks DNA cleavage activity; and/or the actuator moiety is coupled to a transcriptional activator.
- described herein is one or more polynucleotides encoding a system described herein.
- a method comprising administrating the system described herein to a subject in need thereof.
- described herein is a method comprising: increasing expression level of an endogenous target gene encoding PRPF31 in a cell, via binding of a heterologous polypeptide comprising an actuator moiety to bind the endogenous target gene, wherein: the actuator moiety substantially lacks DNA cleavage activity; and/or the actuator moiety is coupled to a transcriptional activator.
- the systems, compositions, or methods described herein increase expression ofPRPF31 in a cell.
- the cell is an eye cell.
- the cell is a retinal cell.
- the cell is a retinal pigment epithelium (RPE) cell.
- the PRPF31 expression being increased by the systems, compositions, or methods described herein is encoded from a non-disease allele of PRPF31.
- the PRPF31 expression being increased by the systems, compositions, or methods described herein is encoded from a non-disease allele of PRPF31, where the non-disease allele of PRPF31 is encoded by a heterologous polynucleotide described herein.
- the systems, compositions, or methods described herein increase expression of PRPF31 in a cell by contacting the cell with an actuator moiety described herein.
- the actuator moiety is not capable of binding an additional target endogenous gene encoding a mutant allele of the PRPF31.
- the system comprises a heterologous polypeptide comprising an actuator moiety, wherein the actuator moiety is for binding an endogenous target gene encoding a target protein in a cell, to decrease expression level of the target protein, and wherein the actuator moiety substantially lacks DNA cleavage activity; and a heterologous polynucleotide encoding a non-disease causing variant of the endogenous target gene.
- the endogenous target gene is a non-disease causing variant.
- the non-disease causing variant is a wild type variant.
- the endogenous target gene is a disease causing variant.
- the system comprises the heterologous polynucleotide not integrated into the endogenous target gene.
- the system comprises the heterologous polypeptide that is under the control of a tissue-specific promoter.
- the tissuespecific promoter can be a rod cell specific promoter, a cone cell specific promoter, a retina cell specific promoter, a photoreceptor specific promoter, or a combination thereof.
- the tissue-specific promoter is a photoreceptor specific promoter.
- the tissue-specific promoter is a PRPF31 promoter.
- Non-limiting example pf tissue-specific promoter can include MOPS promoter, GRK1 promoter, IRBP promoter, PR2.1 promoter, IRBP/GNAT2 promoter, VMD2 promoter, VEcad/VEcadherin promoter, or a combination thereof.
- the system comprises the heterologous polypeptide that is under the control of a constitutive promoter.
- constitutive promoter can include CMV promoter, EFla promoter, CAG promoter, PGK promoter, TRE promoter, U6 promoter, or UAS promoter.
- the constitutive promoter can be a Pol III promoter (e.g., 7SK, U6, Hl, etc.).
- the constitutive promoter can be a Pol II promoter (e.g., CMV, RSV, etc.).
- the actuator moiety comprises a nuclease such as an endonuclease (e.g., a heterologous endonuclease).
- the nuclease can be a deactivated nuclease such as a deactivated endonuclease, where the deactivated endonuclease does not cleave nucleic acid.
- the system comprises a guide nucleic acid.
- a guide nucleic acid capable of forming a complex with the actuator moiety, wherein the complex binds the endogenous target gene.
- the guide nucleic acid comprises a plurality of different guide nucleic acids capable of targeting different regions of the endogenous target gene.
- the system comprises an actuator moiety or a heterologous polynucleotide encoding the actuator moiety.
- the actuator moiety is coupled to a transcriptional repressor.
- the actuator moiety is fused to the transcriptional repressor.
- the system modulates a gene expression of an endogenous target gene in a cell.
- the cell is an eye cell such as a rod cell, a cone cell, or a retina cell.
- the cell is a photoreceptor cell, a bipolar cell, a retinal ganglion cell, a horizontal cell, or an amacrine cells.
- the cell is a cell of pigmented layer.
- the cell is a cell of layer of rods and cones.
- the cell is a cell of membrana limitans externa.
- the cell is a cell of outer nuclear layer.
- the cell is a cell of outer plexiform layer. In some embodiments, the cell is a cell of inner nuclear layer. In some embodiments, the cell is a cell of inner plexiform layer. In some embodiments, the cell is a cell of ganglionic layer. In some embodiments, the cell is a cell of stratum opticum. In some embodiments, the cell is a cell of membrana limitans interna. In some embodiments, the cell a retinal pigment epithelium (RPE) cell.
- RPE retinal pigment epithelium
- described herein is one or more polynucleotides encoding the system described herein.
- the one or more polynucleotides comprise a single polynucleotide encoding at least the heterologous polypeptide and the heterologous polynucleotide.
- the single polynucleotide further encodes the guide nucleic acid.
- the single polynucleotide has a size of less than or equal to about 5 kilobases (kb). In some embodiments, the single polynucleotide has a size of less than or equal to about 4.7 kilobases.
- the single polynucleotide has a size of less than or equal to about 0.1 kb to about 10 kb. In some embodiments, the single polynucleotide has a size of less than or equal to about 10 kb to about 9 kb, about 10 kb to about 8 kb, about 10 kb to about 7 kb, about 10 kb to about 6 kb, about 10 kb to about 5 kb, about 10 kb to about 4.7 kb, about 10 kb to about 4 kb, about 10 kb to about 3 kb, about 10 kb to about 2 kb, about 10 kb to about 1 kb, about 10 kb to about 0.1 kb, about 9 kb to about 8 kb, about 9 kb to about 7 kb, about 9 kb to about 6 kb, about 9 kb to about 5 kb, about 9 kb to about 4.7 kb
- the single polynucleotide has a size of less than or equal to about 10 kb, about 9 kb, about 8 kb, about 7 kb, about 6 kb, about 5 kb, about 4.7 kb, about 4 kb, about 3 kb, about 2 kb, about 1 kb, or about 0.1 kb.
- the single polynucleotide has a size of less than or equal to at least about 10 kb, about 9 kb, about 8 kb, about 7 kb, about 6 kb, about 5 kb, about 4.7 kb, about 4 kb, about 3 kb, about 2 kb, or about 1 kb. In some embodiments, the single polynucleotide has a size of less than or equal to at most about 9 kb, about 8 kb, about 7 kb, about 6 kb, about 5 kb, about 4.7 kb, about 4 kb, about 3 kb, about 2 kb, about 1 kb, or about 0.1 kb.
- the system comprises a guide nucleic acid or one or more polynucleotides encoding a guide nucleic acid, where the guide nucleic acid targets an endogenous target gene described herein.
- the guide nucleic acid can be complexed with an actuator moiety described herein.
- the guide nucleic acid can direct the actuator moiety to the endogenous target gene in the cell.
- compositions comprising any component or any combination of components of the system described herein.
- the composition comprises at least one of the heterologous polypeptide described herein.
- the compositions comprises at least one of the heterologous polynucleotide described herein.
- the composition comprises at least one of the heterologous polypeptide described herein and at least one of the heterologous polynucleotide described herein.
- the composition can be further formulated into a pharmaceutical composition.
- the composition can comprise at least one pharmaceutically acceptable carrier.
- Described herein, in some aspects, is a method comprising: increasing expression level of an endogenous target gene encoding a target protein in a cell, via action of a heterologous polypeptide comprising an actuator moiety, wherein the actuator moiety is for binding the endogenous target gene, and wherein the actuator moiety substantially lacks DNA cleavage activity; and contacting the cell with a heterologous polynucleotide encoding a non-disease causing variant of the endogenous target gene.
- the method comprises determining that the subject has certain condition.
- the method comprises selecting for the subject to be treated by the method and the system described herein by determining if the subject harbors a mutant allele or a disease-causing allele of the endogenous target gene. In some embodiments, once the subject is determined to harbor the mutant allele or the disease-causing allele of the endogenous target gene, a system described herein, any component of the system described herein, or any combination of the component of the system described herein can be administered to the subject to treat the disease or condition.
- the endogenous target gene comprises a disease causing allele of the target protein. In some embodiments, the endogenous target gene comprises a non-disease causing allele of the endogenous target protein.
- the non-disease causing allele is a wild type allele.
- the endogenous target gene is a member of Pre-mRNA-processing- splicing factor (PRPF) family.
- the PRPF comprises PRPF1, PRPF2, PRPF3, PRPF4, PRPF5, PRPF6, PRPF7, PRPF8, PRPF9, PRPF10, PRPF11, PRPF12, PRPF13, PRPF14, PRPF15, PRPF16, PRPF17, PRPF18, PRPF19, PRPF20, PRPF21, PRPF22, PRPF23, PRPF24, PRPF25, PRPF26, PRPF27, PRPF28, PRPF29, PRPF30, PRPF31, a fragment thereof, or a combination thereof .
- the PRPF comprises PRPF31.
- the endogenous target gene is PRPF31.
- described herein is a method for administering the system descried herein to a subject in need thereof. In some embodiments, the method comprises determining whether the subject has or is suspected of having retinitis pigmentosa 11 (RP11).
- the increased expression level of the non-disease causing allele of the PRPF31 can be sufficient to treat or ameliorate a condition (e.g., retinitis pigmentosa (RP), such as RP11) of a cell or a subject comprising the cell.
- a condition e.g., retinitis pigmentosa (RP), such as RP11
- the method increases the expression of the endogenous target gene (e.g., non-disease causing allele thereof) encoding the target protein by at least about 0.01 fold to about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid).
- the endogenous target gene e.g., non-disease causing allele thereof
- the method increases the expression of the endogenous target gene encoding the target protein by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0 05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.1 fold to about 1 fold, about 0.1 fold to about 5 fold, about 0 05 fold to
- the method increases the expression of the endogenous target gene encoding the target protein by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid).
- the method increases the expression of the endogenous target gene encoding the target protein by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid).
- the method increases the expression of the endogenous target gene encoding the target protein by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid).
- the method increases the expression of the endogenous target gene encoding the target protein, where the target protein is a PRPF31 (e g , a non-disease causing PRPF31 variant).
- the method increases the expression of PRPF31 by at least about 0.01 fold to about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid). In some embodiments, the method increases the expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 100
- the method increases the expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid).
- the method increases the expression of PRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold (e g , as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid).
- the method increases the expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid).
- the method increases the expression of PRPF31 (e.g., a nondisease variant of PRPF31) without increasing expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31.
- the method without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 5,000 fold.
- the method without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to
- the method without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
- the method without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold.
- the method without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
- the method increases the expression of PRPF31 (e.g., a nondisease causing PRPF31 variant) without increasing expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19.
- the method without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression ofPRPF31 by at least about 0.01 fold to about 5,000 fold.
- the method without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold
- the method without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold. In some embodiments, the method, without increasing the expression of a gene neighboring
- PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold.
- the method without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression ofPRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
- the method increases the expression of PRPF31 (e.g., a nondisease causing PRPF31 variant) without increasing expression of TCF3 Fusion Partner (TFPT).
- the method without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 5,000 fold.
- the method without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about
- the method without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression ofPRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold. In some embodiments, the method, without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold.
- the method without increasing the expression of TFPT, increases the expression ofPRPF31 compared to endogenous expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
- the increase in the expression level of the PRPF31 in the (e.g., non-disease causing PRPF31 variant) cell as provided herein effects enhanced (or increased) cilium length and/or incidence, as compared to that in a control cell comprising a mutant allele of the PRPF31 that is in absence of the heterologous polypeptide and/or the guide nucleic acid.
- the increase in the cilium length can be at least or up to about 1%, at least or up to about 2%, at least or up to about 5%, at least or up to about 10%, at least or up to about 15%, at least or up to about 20%, at least or up to about 25%, at least or up to about 30%, at least or up to about 40%, at least or up to about 50%, at least or up to about 60%, at least or up to about 70%, at least or up to about 80%, at least or up to about 90%, at least or up to about 100%, at least or up to about 120%, at least or up to about 150%, at least or up to about 200%, at least or up to about 300%, at least or up to about 400%, or at least or up to about 500%, as compared to such control cell.
- the increase in the cilium length can be at least or up to about 0.1-fold, at least or up to about 0.2-fold, at least or up to about 0.5- fold, at least or up to about 1-fold, at least or up to about 2-fold, at least or up to about 3-fold, at least or up to about 4-fold, at least or up to about 5-fold, at least or up to about 6-fold, at least or up to about 7-fold, at least or up to about 8-fold, at least or up to about 9-fold, at least or up to about 10-fold, at least or up to about 15-fold, at least or up to about 20-fold, at least or up to about 30-fold, at least or up to about 40-fold, at least or up to about 50-fold, at least or up to about 60-fold, at least or up to about 70-fold, at least or up to about 80-fold, at least or up to about 90-fold, at least or up to about 100-fold, at least or up to about 150-
- the increase in the cilium incidence can be at least or up to about 1%, at least or up to about 2%, at least or up to about 5%, at least or up to about 10%, at least or up to about 15%, at least or up to about 20%, at least or up to about 25%, at least or up to about 30%, at least or up to about 40%, at least or up to about 50%, at least or up to about 60%, at least or up to about 70%, at least or up to about 80%, at least or up to about 90%, at least or up to about 100%, at least or up to about 120%, at least or up to about 150%, at least or up to about 200%, at least or up to about 300%, at least or up to about 400%, or at least or up to about 500%, as compared to such control cell.
- the increase in the cilium incidence can be at least or up to about 0. 1 -fold, at least or up to about 0.2-fold, at least or up to about 0.5-fold, at least or up to about 1-fold, at least or up to about 2-fold, at least or up to about 3-fold, at least or up to about 4-fold, at least or up to about 5-fold, at least or up to about 6-fold, at least or up to about 7-fold, at least or up to about 8-fold, at least or up to about 9-fold, at least or up to about 10-fold, at least or up to about 15-fold, at least or up to about 20-fold, at least or up to about 30-fold, at least or up to about 40-fold, at least or up to about 50-fold, at least or up to about 60-fold, at least or up to about 70-fold, at least or up to about 80-fold, at least or up to about 90-fold, at least or up to about 100-fold, at least or up to about 150
- the increase in the expression level of the PRPF31 (e.g., nondisease causing PRPF31 variant) in the cell as provided herein effects enhanced localization of a ciliary protein to at least a portion of a cilium, as compared to that in a control cell comprising a mutant allele of the PRPF31 that is in absence of the heterologous polypeptide and/or the guide nucleic acid.
- the ciliary protein can be IFT88 and the at least a portion of the cilium can be a ciliary tip.
- the ciliary protein can be a transition zone (TZ) protein (e.g., CC2D2A, RPGRIP1L, etc.), and the at least the portion of the cilium can be a ciliary axoneme.
- TZ transition zone
- the heterologous polypeptide e.g., the dCas-transcriptional effector complexed with a guide RNA
- the systems and the methods of the present disclosure may not and need not require a heterologous PRPF31 and/or a heterologous gene encoding thereof to increase the expression level of the PRPF31 in the cell.
- Table 4 provides an exemplary list of gRNA (e.g., Cas9 gRNAs) spacer sequence that can bind to an endogenous target gene described herein (e.g., PRPF31).
- a spacer sequence of gRNA as described herein can comprise a polynucleotide sequence (e.g., a consecutive polynucleotide sequence) that exhibits at least or up to about 50%, at least or up to about 55%, at least or up to about 60%, at least or up to about 65%, at least or up to about 70%, at least or up to about 75%, at least or up to about 80%, at least or up to about 85%, at least or up to about 90%, at least or up to about 91%, at least or up to about 92%, at least or up to about 93%, at least or up to about 94%, at least or up to about 95%, at least or up to about 96%, at least or up to about 97%, at least or up to about 98%, at least or up to
- Table 5 provides an additional exemplary list of gRNA (e.g., Cas gRNAs) spacer sequence that can bind to an endogenous target gene described herein (e.g., PRPF31).
- a spacer sequence of gRNA as described herein can comprise a polynucleotide sequence (e.g., a consecutive polynucleotide sequence) that exhibits at least or up to about 50%, at least or up to about 55%, at least or up to about 60%, at least or up to about 65%, at least or up to about 70%, at least or up to about 75%, at least or up to about 80%, at least or up to about 85%, at least or up to about 90%, at least or up to about 91%, at least or up to about 92%, at least or up to about 93%, at least or up to about 94%, at least or up to about 95%, at least or up to about 96%, at least or up to about 97%, at least or up to about 98%, at least or up to to
- the spacer sequence of the guide nucleic acid can target a positive-sense strand (+) of the endogenous target gene. In some cases, the spacer sequence of the guide nucleic acid can target a negative-sense strand (-) of the endogenous target gene.
- the systems e.g., the heterologous polypeptide and/or a guide nucleic acid
- methods thereof as provided herein can target (e.g., bind) at least one target polynucleotide sequence (e.g., a consecutive polynucleotide sequence) found in the polynucleotide sequence of one or more members in Table 6.
- the at least one target polynucleotide sequence can comprise at least or up to about 1, at least or up to about 2, at least or up to about 3, at least or up to about 4, at least or up to about 5, at least or up to about 6, at least or up to about 7, at least or up to about 8, at least or up to about 9, at least or up to about 10, at least or up to about 15, or at least or up to about 20 target polynucleotide sequence(s).
- the at least one target polynucleotide sequence can have a length of at least or up to about 6 nucleobases, at least or up to about 8 nucleobases, at least or up to about 10 nucleobases, at least or up to about 12 nucleobases, at least or up to about 16 nucleobases, at least or up to about 18 nucleobases, at least or up to about 20 nucleobases, at least or up to about 22 nucleobases, at least or up to about 24 nucleobases, at least or up to about 26 nucleobases, at least or up to about 28 nucleobases, at least or up to about 30 nucleobases, at least or up to about 32 nucleobases, at least or up to about 34 nucleobases, at least or up to about 36 nucleobases, at least or up to about 38 nucleobases, at least or up to about 40 nucleobases, at least or up to about 45 nucleobases, or
- At least a portion of a positive-sense strand (+) of the endogenous target gene can be targeted.
- the at least the portion of the positive-sense strand can comprise a polynucleotide sequence that exhibits at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or substantially about 100% sequence identity to a consecutive polynucleotide sequence found in
- At least a portion of a negative- sense strand (-) of the endogenous target gene can be targeted.
- the at least the portion of the negative-sense strand can comprise a polynucleotide sequence that exhibits at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or substantially about 100% sequence identity to a consecutive polynucleotide sequence found in
- the heterologous polypeptide comprising the actuator moiety can be utilized for binding a target gene, such as an endogenous target gene (e.g., a chromosomal DNA sequence).
- the actuator moiety can be a nuclease, such as an endonuclease (e.g., a heterologous endonuclease).
- Suitable nucleases include, but are not limited to, CRISPR-associated (Cas) proteins or Cas nucleases including type I CRISPR-associated (Cas) polypeptides, type II CRISPR-associated (Cas) polypeptides, type III CRISPR-associated (Cas) polypeptides, type IV CRISPR-associated (Cas) polypeptides, type V CRISPR-associated (Cas) polypeptides, and type VI CRISPR-associated (Cas) polypeptides; zinc finger nucleases (ZFN); transcription activator-like effector nucleases (TALEN); meganucleases; RNA-binding proteins (RBP); CRISPR-associated RNA binding proteins; recombinases; flippases; transposases; Argonaute (Ago) proteins (e.g., prokaryotic Argonaute (pAgo), archaeal Argonaute (aAgo), and eukaryotic Argon
- the actuator moiety can comprise a DNA nuclease such as an engineered (e.g., programmable or targetable) DNA nuclease that is nuclease-deficient.
- the actuator moiety can comprise a nuclease-null DNA binding protein derived from a DNA nuclease that does not induce transcriptional activation or repression of a target DNA sequence unless it is present in a complex with one or more heterologous gene effectors of the disclosure.
- the actuator moiety can comprise a nuclease-null DNA binding protein derived from a DNA nuclease that can induce transcriptional activation or repression of a target DNA sequence (e.g., which can be altered or augmented by the presence of a heterologous gene effector of the disclosure).
- the actuator moiety can comprise an RNA nuclease such as an engineered (e.g., programmable or targetable) RNA nuclease.
- the actuator moiety can comprise a nuclease-null RNA binding protein derived from an RNA nuclease that does not induce transcriptional activation or repression of a target RNA sequence unless it is present in a complex with one or more heterologous gene effectors of the disclosure.
- the actuator moiety can comprise a nuclease-null RNA binding protein derived from a RNA nuclease that can induce transcriptional activation or repression of a target RNA sequence (e g., which can be altered or augmented by the presence of a heterologous gene effector of the disclosure).
- the actuator moiety can comprise a nucleic acid-guided targeting system.
- the actuator moiety can comprise a DNA-guided targeting system.
- the actuator moiety can comprise an RNA-guided targeting system.
- the nucleic acid-guided targeting system can comprise and utilize, for example, a guide nucleic acid sequence that facilitates specific binding of a CRISPR-Cas system (e.g., a nuclease deficient form thereof, such as dCas9 or dCasl4) to a target gene (e.g., target endogenous gene) or target gene regulatory sequence.
- Binding specificity can be determined by use of a guide nucleic acid, such as a single guide RNA (sgRNA) or a part thereof.
- a guide nucleic acid such as a single guide RNA (sgRNA) or a part thereof.
- sgRNA single guide RNA
- the use of different sgRNAs allows the compositions and methods of the disclosure to be used with (e.g., targeted to) different target genes (e.g., target endogenous genes) or target gene regulatory sequences.
- Prokaryotic CRISPR-Cas (Clustered regularly interspaced short palindromic repeats- CRISPR associated) systems, for example, Class II CRISPR-Cas systems such as Cas9 and Cpfl, can be repurposed as a tool for regulation of gene expression, epigenome editing, and chromatin looping in compositions and methods of the disclosure.
- Nucl ease-deactivated Cas (dCas) proteins complexed with heterologous gene effectors can allow for regulation of expression of target genes (e.g., target endogenous genes) adjacent to a site bound by the dCas.
- the actuator moiety can comprise a CRISPR-associated (Cas) protein or a Cas nuclease that functions in a non-naturally occurring CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)/Cas (CRISPR-associated) system.
- CRISPR-associated CRISPR-associated protein
- this system can provide adaptive immunity against foreign DNA.
- a CRISPR/Cas system e.g., modified and/or unmodified
- a CRISPR/Cas system can comprise a guide nucleic acid such as a guide RNA (gRNA) complexed with a Cas protein for targeted regulation of gene expression and/or activity or nucleic acid binding.
- gRNA guide RNA
- RNA-guided Cas protein e g., a Cas nuclease such as a Cas9 nuclease
- a target polynucleotide e.g., DNA
- the Cas protein if possessing nuclease activity, can cleave the DNA.
- the Cas protein is mutated and/or modified to yield a nuclease deficient protein or a protein with decreased nuclease activity relative to a wild-type Cas protein.
- a nuclease deficient protein can retain the ability to bind DNA, but may lack or have reduced nucleic acid cleavage activity.
- the actuator moiety can comprise a Cas protein that forms a complex with a guide nucleic acid, such as a guide RNA or a part thereof.
- the actuator moiety can comprise a Cas protein that forms a complex with a single guide nucleic acid, such as a single guide RNA (sgRNA).
- the actuator moiety can comprise a RNA-binding protein (RBP) optionally complexed with a guide nucleic acid, such as a guide RNA (e g., sgRNA), which is able to form a complex with a Cas protein.
- a guide nucleic acid such as a guide RNA or a part thereof.
- the actuator moiety can comprise a Cas protein that forms a complex with a single guide nucleic acid, such as a single guide RNA (sgRNA).
- RBP RNA-binding protein
- the actuator moiety can comprise a nuclease-null DNA binding protein derived from a DNA nuclease that can induce transcriptional activation or repression of a target DNA sequence. In some embodiments, the actuator moiety can comprise a nuclease-null RNA binding protein derived from a RNA.
- a guide nucleic acid used in compositions and methods of the disclosure can comprise a spacer sequence that can bind to an endogenous target gene described herein.
- the spacer sequence can be, for example, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least
- a spacer sequence of a guide nucleic acid used in compositions and methods of the disclosure is at most at most 10, at most 11, at most 12, at most 13, at most 14, at most 15, at most 16, at most 17, at most 18, at most 19, at most 20, at most 21, at most 22, at most 23, at most 24, at most 25, at most 26, at most 27, at most 28, at most 29, at most 30, at most 31, at most 32, at most 33, at most 34, at most 35, at most 36, at most 37, at most 38, at most 39, or at most 40 nucleotides.
- a spacer sequence of a guide nucleic acid used in compositions and methods of the disclosure is between about 8 and about 40 nucleotides, between about 10 and about 40 nucleotides, between about 11 and about 40 nucleotides, between about 12 and about 40 nucleotides, between about 13 and about 40 nucleotides, between about 14 and about 40 nucleotides, between about 15 and about 40 nucleotides, between about 16 and about 40 nucleotides, between about 17 and about 40 nucleotides, between about 18 and about 40 nucleotides, between about 19 and about 40 nucleotides, between about 20 and about 40 nucleotides, between about 22 and about 40 nucleotides, between about 24 and about 40 nucleotides, between about 26 and about 40 nucleotides, between about 28 and about 40 nucleotides, between about 30 and about 40 nucleotides, between about 8 and about 30 nucleotides, between about 10 and about 30 nucleotides, between about 10
- Non-limiting examples of a guide RNA scaffold sequence are provided in Table 2.
- the guide RNA scaffold sequence can comprise a polynucleotide sequence (e.g., a consecutive polynucleotide sequence) that exhibits at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or substantially about 100% sequence identity to the polynucleotide sequence of one or more members selected from Table 2 (e.g., one or more members selected from the group consisting of SEQ ID NOs. 500-596).
- a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of one or more members selected from Table 2 and (ii) a spacer sequence. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 530, (ii) a spacer sequence, and (ii) the polynucleotide sequence of TT. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 532, (ii) a spacer sequence, and (ii) the polynucleotide sequence of TTTTA.
- a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 534, (ii) a spacer sequence, and (ii) the polynucleotide sequence of TTTTG. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 536, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 537.
- a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 538, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 539. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 541, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 542.
- a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 543, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 544. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 549, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 550.
- a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 551, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 552. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 554, (ii) a spacer sequence, and (ii) the polynucleotide sequence of TTTTA.
- a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 564, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 550. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 565, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 550.
- Non-limiting examples of a guide RNA scaffold fragment sequence are provided in Table 3.
- the guide RNA scaffold sequence can comprise a polynucleotide sequence (e.g, a consecutive polynucleotide sequence) that exhibits at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or substantially about 100% sequence identity to the polynucleotide sequence of one or more members selected from Table 3 (e.g., one or more members selected from the group consisting of SEQ ID NOs. 597-601).
- a CRISPR/Cas system can be referred to using a variety of naming systems.
- a CRISPR/Cas system can be a type I, a type II, a type III, a type IV, a type V, a type VI system, or any other suitable CRISPR/Cas system.
- a CRISPR/Cas system as used herein can be a Class 1, Class 2, or any other suitably classified CRISPR/Cas system. Class 1 or Class 2 determination can be based upon the genes encoding the effector module.
- Class 1 systems generally have a multi-subunit crRNA-effector complex
- Class 2 systems generally have a single protein, such as Cas9, Cpfl, C2cl, C2c2, C2c3 or a crRNA- effector complex
- a Class 1 CRISPR/Cas system can use a complex of multiple Cas proteins to effect regulation.
- a Class 1 CRISPR/Cas system can comprise, for example, type I (e.g., I, IA, IB, IC, ID, IE, IF, IU), type III (e.g. III, IIIA, IIIB, IIIC, IIID), and type IV (e g, IV, IVA, IVB) CRISPR/Cas type.
- a Class 2 CRISPR/Cas system can use a single large Cas protein to effect regulation.
- a Class 2 CRISPR/Cas systems can comprise, for example, type II (e.g, II, IIA, IIB) and type V CRISPR/Cas type.
- CRISPR systems can be complementary to each other, and/or can lend functional units in trans to facilitate CRISPR locus targeting.
- a actuator moiety can comprise a Cas protein or derivative thereof
- the Cas protein or derivative thereof can be a Class 1 or a Class 2 Cas protein.
- a Cas protein can be a type I, type II, type III, type IV, type V Cas protein, or type VI Cas protein.
- a Cas protein can comprise one or more domains. Non-limiting examples of domains include, guide nucleic acid recognition and/or binding domain, nuclease domains (e.g., DNase or RNase domains, RuvC, HNH), DNA binding domain, RNA binding domain, helicase domains, protein-protein interaction domains, and dimerization domains.
- a guide nucleic acid recognition and/or binding domain can interact with a guide nucleic acid.
- a nuclease domain can comprise catalytic activity for nucleic acid cleavage.
- a nuclease domain can lack catalytic activity to prevent nucleic acid cleavage.
- a Cas protein can be a chimeric Cas protein or fragment thereof that is fused to other proteins or polypeptides.
- a Cas protein can be a chimera of various Cas proteins, for example, comprising domains from different Cas proteins.
- Non-limiting examples of Cas proteins include c2cl, C2c2, c2c3, Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas5e (CasD), Cash, Cas6e, Cas6f, Cas7, Cas8a, Cas8al, Cas8a2, Cas8b, Cas8c, Cas9 (Csnl or Csxl2), CaslO, CaslOd, CaslO, CaslOd, CasF, CasG, CasH, Cpfl, Csyl, Csy2, Csy3, Csel (CasA), Cse2 (CasB), Cse3 (CasE), Cse4 (CasC), Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4,
- a Cas protein or fragment or derivative thereof can be from any suitable organism.
- Nonlimiting examples include Streptococcus pyogenes, Streptococcus thermophilus, Streptococcus sp., Staphylococcus aureus, Nocardiopsis rougevillei, Streptomyces pristinae spiralis, Streptomyces viridochromo genes, Streptomyces viridochromogenes, Streptosporangium roseum, Streptosporangium roseum, AlicyclobacHlus acidocaldarius, Bacillus pseudomycoides, Bacillus selenitireducens, Exiguobacterium sibiricum, Lactobacillus delbrueckii, Lactobacillus salivarius, Microscilla marina, Burkholderiales bacterium, Polaromonas nap hthalenivorans, Polaromonas sp., Crocosphaera watsonii, Cyanothece s
- the organism is Streptococcus pyogenes (S. pyogenes) In some aspects, the organism is Staphylococcus aureus (S. aureus). In some aspects, the organism is Streptococcus thermophilus (S. therm ophilus).
- a Cas protein can be derived from a variety of bacterial species including, but not limited to, Veillonella atypical, Fusobacterium nucleatum, Filifactor alocis, Solobacterium moorei, Coprococcus catus, Treponema denticola, Peptoniphilus duerdenii, Catenibacterium mitsuokai, Streptococcus mutans, Listeria innocua, Staphylococcus pseudintermedius, Acidaminococcus intestine, Olsenella uli, Oenococcus kitaharae, Bifidobacterium bifidum, Lactobacillus rhamnosus, Lactobacillus gasseri, Finegoldia magna, Mycoplasma mobile, Mycoplasma gallisepticum, Mycoplasma ovipneumoniae, Mycoplasma canis, Mycoplasma synoviae, Eubacterium rectale, Streptoc
- Torquens Ilyobacter polytropus, Ruminococcus albus, Akkermansia muciniphila, Acidothermus cellulolyticus, Bifidobacterium longum, Bifidobacterium dentium, Corynebacterium diphtheria, Elusimicrobium minutum, Nitratifractorsalsuginis, Sphaerochaeta globus, Fibrobacter succinogenes subsp.
- Succinogenes Bacteroides fragilis, Capnocytophaga ochracea, Rhodopseudomonas palustris, Prevotella micans, Prevotella ruminicola, Flavobacterium columnare, Aminomonas paucivorans, Rhodospirillum rubrum, Candidatus Puniceispirillum marinum, Verminephrobacter eiseniae, Ralstonia syzygii, Dinoroseobacter shibae, Azospirillum, Nitrobacter hamburgensis, Bradyrhizobium, Wolinellasuccinogenes, Campylobacter jejuni subsp.
- Jejuni Helicobacter mustelae, Bacillus cereus, Acidovorax ebreus, Clostridium perfringens, Parvibaculum lavamentivorans, Roseburia intestinalis, Neisseria meningitidis, Pasteurella multocida subsp. Multocida, Sutterella wadsworthensis, proteobacterium, Legionella pneumophila, Parasutterella excrementihominis, Wolinella succinogenes, and Francisella novicida.
- a Cas protein as used herein can be a wildtype or a modified form of a Cas protein.
- a Cas protein can be an active variant, inactive variant, or fragment of a wild type or modified Cas protein.
- a Cas protein can comprise an amino acid change such as a deletion, insertion, substitution, variant, mutation, fusion, chimera, or any combination thereof relative to a wildtype version of the Cas protein (e.g., a wild-type version of Casl4).
- a Cas protein can be a polypeptide with at least about 5%, at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100% sequence identity or sequence similarity to a wild type Cas protein.
- a Cas protein can be a polypeptide with at most about 5%, at most about 10%, at most about 20%, at most about 30%, at most about 40%, at most about 50%, at most about 60%, at most about 70%, at most about 80%, at most about 90%, or at most about 100% sequence identity and/or sequence similarity to a wild type exemplary Cas protein.
- Variants or fragments can comprise at least about 5%, at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100% sequence identity or sequence similarity to a wild type or modified Cas protein or a portion thereof. Variants or fragments can be targeted to a nucleic acid locus in complex with a guide nucleic acid while lacking nucleic acid cleavage activity.
- a Cas protein can comprise one or more nuclease domains, such as DNase domains.
- a Cas9 protein can comprise a RuvC-like nuclease domain and/or an HNH-like 20 nuclease domain.
- the in a nuclease active form of Cas9, RuvC and HNH domains can each cut a different strand of double-stranded DNA to make a double-stranded break in the DNA.
- a Cas protein can comprise only one nuclease domain (e.g., Cpfl comprises RuvC domain but lacks HNH domain).
- nuclease domains are absent.
- nuclease domains are present but inactive or have reduced or minimal activity.
- nuclease domains are present and active.
- One or a plurality of the nuclease domains (e.g., RuvC, HNH) of a Cas protein can be deleted or mutated so that they are no longer functional or comprise reduced nuclease activity.
- a Cas protein comprising at least two nuclease domains (e.g., Cas9)
- the resulting Cas protein known as a nickase, can generate a single-strand break at a CRISPR RNA (crRNA) recognition sequence within a doublestranded DNA but not a double-strand break.
- crRNA CRISPR RNA
- Such a nickase can cleave the complementary strand or the non-complementary strand, but may not cleave both. If all of the nuclease domains of a Cas protein (e.g., both RuvC and HNH nuclease domains in a Cas9 protein; RuvC nuclease domain in a Cpfl protein) are deleted or mutated, the resulting Cas protein can have a reduced or no ability to cleave both strands of a double-stranded DNA.
- a Cas protein e.g., both RuvC and HNH nuclease domains in a Cas9 protein; RuvC nuclease domain in a Cpfl protein
- An example of a mutation that can convert a Cas9 protein into a nickase is a D10A (aspartate to alanine at position 10 of Cas9) mutation in the RuvC domain of Cas9 from S. pyogenes.
- H939A histidine to alanine at amino acid position 839) or H840A (histidine to alanine at amino acid position 840) in the HNH domain of Cas9 from S. pyogenes can convert the Cas9 into a nickase.
- An example of a mutation that can convert a Cas9 protein into a dead Cas9 is a D10A (aspartate to alanine at position 10 of Cas9) mutation in the RuvC domain and H939A (histidine to alanine at amino acid position 839) or H840A (histidine to alanine at amino acid position 840) in the HNH domain of Cas9 from S. pyogenes.
- a nuclease dead Cas protein can comprise one or more mutations relative to a wild-type version of the protein.
- the mutation can result in no more than 90%, no more than 80%, no more than 70%, no more than 60%, no more than 50%, no more than 40%, no more than 30%, no more than 20%, no more than 10%, no more than 5%, or no more than 1% of the nucleic acid-cleaving activity in one or more of the plurality of nucleic acid-cleaving domains of the wild-type Cas protein.
- the mutation can result in one or more of the plurality of nucleic acid-cleaving domains retaining the ability to cleave the complementary strand of the target nucleic acid but reducing its ability to cleave the non-complementary strand of the target nucleic acid.
- the mutation can result in one or more of the plurality of nucleic acid-cleaving domains retaining the ability to cleave the non-complementary strand of the target nucleic acid but reducing its ability to cleave the complementary strand of the target nucleic acid.
- the mutation can result in one or more of the plurality of nucleic acid-cleaving domains lacking the ability to cleave the complementary strand and the non-complementary strand of the target nucleic acid.
- the residues to be mutated in a nuclease domain can correspond to one or more catalytic residues of the nuclease.
- residues in the wild type exemplary S. pyogenes Cas9 polypeptide such as AsplO, His840, Asn854 and Asn856 can be mutated to inactivate one or more of the plurality of nucleic acidcleaving domains (e.g., nuclease domains).
- the residues to be mutated in a nuclease domain of a Cas protein can correspond to residues AsplO, His840, Asn854 and Asn856 in the wild type S.
- a Cas protein can comprise an amino acid sequence having at least about 5%, at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100% sequence identity or sequence similarity to a nuclease domain (e.g., RuvC domain, HNH domain) of a wild-type Cas protein.
- a nuclease domain e.g., RuvC domain, HNH domain
- a Cas protein, variant or derivative thereof can be modified to enhance regulation of gene expression by compositions and methods of the disclosure, e.g., as part of a complex disclosed herein.
- a Cas protein can be modified to increase or decrease nucleic acid binding affinity, nucleic acid binding specificity, enzymatic activity, and/or binding to other factors, such as heterodimerization or oligomerization domains and induce ligands.
- Cas proteins can also be modified to change any other activity or property of the protein, such as stability. For example, one or more nuclease domains of the Cas protein can be modified, deleted, or inactivated, or a Cas protein can be truncated to remove domains that are not essential for the desired function of the protein or complex.
- a Cas protein can be modified to modulate (e.g., enhance or reduce) the activity of the Cas protein for regulating gene expression by a complex of the disclosure that comprises a heterologous gene effector.
- a Cas protein can be coupled (e.g., fused, covalently coupled, or non- covalently coupled) to a heterologous gene effector (e.g., an epigenetic modification domain, a transcriptional activation domain, and/or a transcriptional repressor domain).
- a Cas protein can be coupled (e.g., fused, covalently coupled, or non-covalently coupled) to an oligomerization or dimerization domain as disclosed herein (e.g., a heterodimerization domain).
- a Cas protein can be coupled (e.g., fused, covalently coupled, or non-covalently coupled) to a heterologous polypeptide that provides increased or decreased stability.
- a Cas protein can be coupled (e g., fused, covalently coupled, or non-covalently coupled) to a sequence that can facilitate degradation of the Cas protein or a complex containing the Cas protein, for example, a degron, such as an inducible degron (e.g., auxin inducible).
- a degron such as an inducible degron (e.g., auxin inducible).
- a Cas protein can be coupled (e.g., fused, covalently coupled, or non-covalently coupled) to any suitable number of partners, for example, at least one, at least two, at least three, at least four, or at least five, at least six, at least seven, or at least 8 partners.
- a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to at most two, at most three, at most four, at most five, at most six, at most seven, at most eight, or at most ten partners.
- a Cas protein of the disclosure is coupled (e g., fused, covalently coupled, or non-covalently coupled) to 1 - 5, 1 - 4, 1 - 3, 1 - 2, 2 - 5, 2 - 4, 2 - 3, 3 - 5, 3 - 4, or 4 - 5 partners.
- a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to one partner.
- a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non- covalently coupled) to two partners.
- a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to three partners. In some embodiments, a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non- covalently coupled) to four partners. In some embodiments, a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to five partners. In some embodiments, a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non- covalently coupled) to six partners.
- a Cas protein can be a fusion protein, e.g., a fusion comprising the Cas protein and one or more of the partners as disclosed herein
- the fused domain or heterologous polypeptide can be located at the N-terminus, the C-terminus, or internally within the Cas protein.
- a partner of the Cas protein e.g., covalently or non-covalently coupled to a dCas protein as disclosed herein
- a transcriptional effector e.g., a transcriptional activator or a transcriptional repressor.
- the transcriptional effector can be heterologous to the cell as provided herein.
- the Cas protein and the transcriptional effector can be fused in a single polypeptide sequence.
- the Cas protein and the transcriptional effector can be fused directly to one another.
- the Cas protein and the transcriptional effector can be fused via a peptide linker (or an amino acid linker) that is heterologous to the Cas protein and the transcriptional activator.
- the peptide linker can be derived from a natural polypeptide sequence.
- the peptide linker can be a synthetic sequence.
- the peptide linker can have a length of at least or up to about 1 amino acid residue, at least or up to about 2 amino acid residues, at least or up to about 3 amino acid residues, at least or up to about 4 amino acid residues, at least or up to about 5 amino acid residues, at least or up to about 10 amino acid residues, at least or up to about 15 amino acid residues, at least or up to about 20 amino acid residues, at least or up to about 25 amino acid residues, at least or up to about 30 amino acid residues, at least or up to about 35 amino acid residues, at least or up to about 40 amino acid residues, at least or up to about 45 amino acid residues, at least or up to about 50 amino acid residues, at least or up to about 60 amino acid residues, at least or up to about 70 amino acid residues, at least or up to about 80 amino acid residues, at least or up to about 90 amino acid residues, or at least or up to about 100 amino acid residues.
- the peptide linker can be a length of
- GS linker or “GS linker sequence,” as used interchangeably herein, generally refers to a peptide linker that mainly comprises glycine and serine residues. Particularly, at least or up to about 60%, at least or up to about 65%, at least or up to about 70%, at least or up to about 75%, at least or up to about 80%, at least or up to about 85%, at least or up to about 90%, at least or up to about 95% or substantially about 100% of the amino acid residues in the GS linker sequence can be selected from glycine and serine residues.
- the GS linker sequence according to the present invention can, for example, comprise from about 1 to about 50 amino acid residues, from about 1 to about 45 amino acid residues, from about 1 to about 40 amino acid residues, from about 1 to about 35 amino acid residues, or from about 1 to about 30 amino acid residues, in total. In some cases, the GS linker sequence may not comprise about 10, about 5, about 4, about 3, about 2 or about 1 amino acid residue (s) other than glycine or serine.
- the transcriptional effector can be a histone epigenetic modifier (or a histone modifier).
- the histone epigenetic modifier can modulate histones through methylation (e.g., a histone methylation modifier, such as an amino acid methyltransferase, e.g., KRAB).
- the histone epigenetic modifier can modulate histones through acetylation.
- the histone epigenetic modifier can modulate histones through phosphorylation.
- the histone epigenetic modifier can modulate histones through ADP-ribosylation.
- the histone epigenetic modifier can modulate histones through glycosylation.
- the histone epigenetic modifier can modulate histones through SUMOylation. In some cases, the histone epigenetic modifier can modulate histones through ubiquitination. In some cases, the histone epigenetic modifier can modulate histones by remodeling histone structure, e.g., via an ATP hydrolysis-dependent process.
- the transcriptional effector can be a gene epigenetic modifier (or a gene modifier).
- a gene modifier can modulate genes through methylation (e.g., a gene methylation modifier, such as a DNA methyltransferase or DNMT).
- a gene modifier can modulate genes through acetylation.
- the transcriptional effector is from a family of related histone acetyltransferases.
- histone acetyltransferases include GNAT subfamily, MYST subfamily, p300/CBP subfamily, HAT1 subfamily, GCN5, PCAF, Tip60, MOZ, MORF, MOF, HBO1, p300, CBP, HAT1, ATF-2, SRC1, and TAFII250.
- the transcriptional effector is from a histone epigenetic modifier (e.g., a histone lysine methyltransferase, a histone lysine demethylase, or a DNA methylase).
- a histone epigenetic modifier e.g., a histone lysine methyltransferase, a histone lysine demethylase, or a DNA methylase.
- Non-limiting examples of histone epigenetic modifier include EZH subfamily, Non-SET subfamily, Other SET subfamily, PRDM subfamily, SET1 subfamily, SET2 subfamily, SUV39 subfamily, SYMD subfamily, ASH IL, EHMT1, EHMT2, EZH1, EZH2, MLL, MLL2, MLL3, MLL4, MLL5, NSD1, NSD2, NSD3, PRDM1, PRDM10, PRDM 11, PRDM12, PRDM13, PRDM14, PRDM15, PRDM16, PRDM2, PRDM4, PRDM5, PRDM6, PRDM7, PRDM8, PRDM9, SET1, SET1L, SET2L, SETD2, SETD3, SETD4, SETD5, SETD6, SETD7, SETD8, SETDB1, SETDB2, SETMAR, SUV39H1, SUV39H2, SUV420H1, SUV420H2, SYMD1, SYMD2, SYMD3, SYMD4, and SYMD5.
- proteins (or fragments thereof) that can be used as a fusion partner to increase transcription include but are not limited to: transcriptional activators such as VP16, VP64, VP48, VP 160, p65 subdomain (e.g., from NFkB), and activation domain of EDLL and/or TAL activation domain (e.g., for activity in plants); and histone epigenetic modifier such as SET1A, SET1B, MLL1 to 5, ASH1, SYMD2, NSD1, IHDM2a/b, UTX, JMID3, GCN5, PCAF, CBP, p300, TAF1, TIP60/PLIP, M0ZMYST3, M0RFMYST4, SRC1, ACTR, PI 60, CLOCK, Ten-Eleven Translocation (TET) dioxygenase 1 (TET1CD), TET1, DME, DML1, DML2, ROS1, and the like.
- transcriptional activators such as VP16, VP64, VP48
- proteins (or fragments thereof) that can be used as a fusion partner to decrease transcription include but are not limited to: transcriptional repressors such as the Kruppel associated box (KRAB or SKD); KOX1 repression domain; the Mad mSIN3 interaction domain (SID); the ERF repressor domain (ERD), the SRDX repression domain (e.g., for repression in plants), and the like; histone lysine methyltransferases such as Pr-SET7/8, SUV4- 20H1, RIZ1, and the like; histone lysine demethylases such as JMJD2A/JHDM3A, JMJD2B, JMJD2C/GASC1, JMJD2D, J ARID 1 A/RBP2, JARID1B/PLU- 1 , J ARID 1C/SMCX, JARIDID/SMCY, and the like; histone lysine deacetylases such as HDAC1, H, HDAC1, H
- a Cas protein can be provided in any form.
- a Cas protein can be provided in the form of a protein, such as a Cas protein alone or complexed with a guide nucleic acid as a ribonucleoprotein.
- a Cas protein can be provided in a complex, for example, complexed with a guide nucleic acid and/or one or more heterologous gene effectors of the disclosure.
- a Cas protein can be provided in the form of a nucleic acid encoding the Cas protein, such as an RNA (e.g., messenger RNA (mRNA)), or DNA.
- the nucleic acid encoding the Cas protein can be codon optimized for efficient translation into protein in a particular cell or organism.
- Nucleic acids encoding Cas proteins, fragments, or derivatives thereof can be stably integrated in the genome of a cell.
- Nucleic acids encoding Cas proteins can be operably linked to a promoter, for example, a promoter that is constitutively or inducibly active in the cell.
- Nucleic acids encoding Cas proteins can be operably linked to a promoter in an expression construct.
- Expression constructs can include any nucleic acid constructs capable of directing expression of a gene or other nucleic acid sequence of interest (e.g., a Cas gene) and which can transfer such a nucleic acid sequence of interest to a target cell.
- a Cas protein, variant or derivative thereof is a nuclease dead Cas (dCas) protein.
- a dead Cas protein can be a protein that lacks nucleic acid cleavage activity [OHl]
- a Cas protein can comprise a modified form of a wild type Cas protein.
- the modified form of the wild type Cas protein can comprise an amino acid change (e.g., deletion, insertion, or substitution) that reduces the nucleic acid-cleaving activity of the Cas protein.
- the modified form of the Cas protein can have no more than 90%, no more than 80%, no more than 70%, no more than 60%, no more than 50%, no more than 40%, no more than 30%, no more than 20%, no more than 10%, no more than 5%, or no more than 1% of the nucleic acid-cleaving activity of the wild-type Cas protein (e g., Cas9 from S. pyogenes).
- the modified form of Cas protein can have no substantial nucleic acid-cleaving activity.
- a Cas protein is a modified form that has no substantial nucleic acid-cleaving activity, it can be referred to as enzymatically inactive, “deactivated” and/or “dead” (abbreviated by “d”).
- a dead Cas protein (e.g., dCas, dCas9, dCasl4) can bind to a target polynucleotide but may not cleave or minimally cleaves the target polynucleotide.
- a dead Cas protein is a dead Casl4 protein.
- a dead Cas protein is a not a dead Casl4 protein.
- a dCas polypeptide (e.g., dCasl4 polypeptide) can associate with a single guide RNA (sgRNA) to activate or repress transcription of a target gene (e.g., target endogenous gene), for example, in combination with heterologous gene effector(s) disclosed herein.
- sgRNAs can be introduced into cells expressing the Cas or variant thereof, as provided herein. In some cases, such cells can contain one or more different sgRNAs that target the same target gene (e.g., target endogenous gene) or target gene regulatory sequence. In other cases, the sgRNAs target different nucleic acids in the cell (e.g., different target genes, different target gene regulatory sequences, or different sequences within the same target gene or target gene regulatory sequence).
- Enzymatically inactive can refer to a nuclease that can bind to a nucleic acid sequence in a polynucleotide in a sequence-specific manner, but will not cleave a target polynucleotide or will cleave it at a substantially reduced frequency.
- An enzymatically inactive guide moiety can comprise an enzymatically inactive domain (e.g. nuclease domain).
- Enzymatically inactive can refer to no activity.
- Enzymatically inactive can refer to substantially no activity.
- Enzymatically inactive can refer to essentially no activity.
- Enzymatically inactive can refer to an activity no more than 1%, no more than 2%, no more than 3%, no more than 4%, no more than 5%, no more than 6%, no more than 7%, no more than 8%, no more than 9%, or no more than 10% activity compared to a comparable wild-type activity (e.g., nucleic acid cleaving activity, wild-type Cas9 or wild-type Cas 14 activity).
- the actuator moiety as disclosed herein does not contain a nucleic acid-guided targeting system.
- the actuator moiety can include proteins that bind to a target gene (e.g., target endogenous gene) or target gene regulatory sequence based on protein structural features, such as certain nucleases disclosed herein.
- the wild-type Cas protein that the engineered Cas protein is a modification of has a native amino acid sequence with a length of less than 800 amino acids (e.g., Casl4 or a variant thereof).
- This relatively small size provides several advantages to the provided engineered Cas protein. For example, the small size can allow the Cas protein to be delivered to a host cell, e.g., a cell of a human patient, via a single adeno-associated virus delivery system that would be otherwise incapable of delivering a larger protein.
- the native amino acid sequence can have a length that is, for example, between 500 amino acids and 700 amino acids, e.g., between 500 amino acids and 620 amino acids, between 540 amino acids and 660 amino acids, between 560 amino acids and 680 amino acids, or between 580 amino acids and 700 amino acids.
- the native amino acid sequence can have a length that is less than 700 amino acids, e.g., less than 680 amino acids, less than 660 amino acids, less than 640 amino acids, less than 620 amino acids, less than 600 amino acids, less than 580 amino acids, less than 560 amino acids, less than 540 amino acids, or less than 520 amino acids.
- the native amino acid sequence can have an length that is greater than 500 amino acids, e g., greater than 520 amino acids, greater than 540 amino acid, greater than 560 amino acids, greater than 580 amino acids, greater than 600 amino acids, greater than 620 amino acids, greater than 640 amino acids, greater than 660 amino acids, or greater than 700 amino acids. Larger lengths, e.g., greater than 700 amino acids, and smaller lengths, e.g., less than 500 amino acids, are also contemplated.
- the modified amino acid sequence of the engineered Cas protein includes one or more substitutions in the native amino acid sequence, where the positions of at least some of these substitutions follow one or more particular rules determined to have surprising advantages for the characteristics of the engineered Cas protein.
- the particular substitution rules have been selected for their ability to produce engineered Cas proteins capable of functioning within eukaryotic cells.
- all or some of the one or more substitutions in the native amino acid sequence are either (1) within or no more than 30 amino acids downstream of a (D/E/K/N)X(R/F)(E/K)N motif of the native amino acid sequence, (2) at or no more than 30 amino acids upstream or downstream of position 241 of the native amino acid sequence, (3) at or no more than 30 amino acids upstream or downstream of position 516 of the native amino acid sequence, and/or (4) having an electrically charged amino acid in the native amino acid sequence.
- the native amino acid sequence includes a (D/E/K/N)X(R/F)(E/K)N motif
- the modified amino acid sequence includes one or more substitutions at positions within or no more than 30 amino acids upstream or downstream of the motif.
- the modified amino acid sequence can include, for example, one, two, three, four, five, six, seven, eight, nine, ten, or more than ten substitutions within or no more than 30 amino acids upstream or downstream of the motif.
- At least one of the one or more substitutions to the native amino acid sequence can be, for example, within or no more than 28 amino acids, 26 amino acids, 24 amino acids, 22 amino acids, 20 amino acids, 18 amino acids, 16 amino acids, 14 amino acids, 12 amino acids, or 10 amino acids of the motif.
- at least one of the one or more substitutions within or no more than 30 amino acids upstream or downstream of the motif is to an R, A, S, or G.
- each of the one or more substitutions within or no more than 30 amino acids upstream or downstream of the motif is independently to an R, A, S, or G.
- all of the substitutions to the native amino acid sequence are at positions within or no more than 30 amino acids upstream or downstream of the motif.
- Some embodiments of the present disclosure are directed to a Cas protein that is not a variant of CasX.
- Some embodiments of the present disclosure are directed to small Cas-based regulation of gene expression, such as at the transcriptional and/or translational level.
- Small Cas proteins can be targeted to DNA and/or RNA, and are much smaller than typical CRISPR effectors, e.g., ranging in size from about 400 amino acids to about 700 amino acids.
- the small size of can allow such Cas proteins proteins and/or effector domain fusions thereof to be paired with a CRISPR array encoding multiple guide RNAs while remaining under the packaging size limit of various delivery vehicles, such as the versatile adeno-associated virus (AAV) delivery vehicle or non-viral delivery vehicles (e.g., lipid nanoparticles), for primary cell and in vivo delivery.
- AAV versatile adeno-associated virus
- non-viral delivery vehicles e.g., lipid nanoparticles
- the Cas protein or a variant thereof as provided herein can have a size of at most about 800 amino acids, at most about 780 amino acids, at most about 760 amino acids, at most about 750 amino acids, at most about 740 amino acids, at most about 720 amino acids, at most about 700 amino acids, at most about 680 amino acids, at most about 660 amino acids, at most about 650 amino acids, at most about 640 amino acids, at most about 620 amino acids, at most about 600 amino acids, at most about 580 amino acids, at most about 560 amino acids, at most about 550 amino acids, at most about 540 amino acids, at most about 520 amino acids, at most about 500 amino acids, 480 amino acids, at most about 460 amino acids, at most about 450 amino acids, at most about 440 amino acids, at most about 420 amino acids, at most about 400 amino acids, or less.
- Non-limiting examples of Cas protein are provided in Table 1.
- the Cas protein or the deactivated Cas protein (dCas) as provided herein can comprise a polypeptide sequence (e.g., a consecutive polypeptide sequence) that exhibits at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or substantially about 100% sequence identity to the polypeptide sequence of one or more members selected from Table 1 (e.g., one or more members selected from the group consisting of SEQ ID NOs. 1-201).
- the Cas protein or a variant thereof, as provided herein can comprise the amino acid sequence having at least about 60%, at least about 65%, at least about 70%, at least about 71%, at least about 72%, at least about 73%, at least about 74%, at least about
- Cas protein or a variant thereof, as provided herein, can comprise the amino acid sequence having at most about 100%, at most about 99%, at most about 98%, at most about 97%, at most about 96%, at most about 95%, at most about 94%, at most about 93%, at most about 92%, at most about 91%, at most about 90%, at most about 89%, at most about 88%, at most about 87%, at most about 86%, at most about 85%, at most about 84%, at most about 83%, at most about 82%, at most about 81%, at most about 80%, at most about 79%, at most about 78%, at most about 77%, at most about 76%, at most about 75%, at most about 74%, at most about 73%, at most about 72%, at most about 71%, at most about 70%, at most about 65%, at most most
- a Cas protein or a variant thereof as disclosed herein can exhibit a greater cationic charge (e.g., at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, or more cationic charges) as compared to the wild-type Casl4.
- the enhanced cationic charge can (i) enhance complexation of the Cas protein to the guide nucleic acid and/or (ii) enhance complexation of the Cas protein to the target polynucleotide sequence (e g., endogenous target polynucleotide sequence).
- the Cas protein can comprise one or more substitutions for the enhanced cationic charge.
- the one or more substitutions at positions within or no more than 30 amino acids upstream or downstream of the (D/E/K/N)X(R/F)(E/K)N motif of the native amino acid sequence can include, for example, one or more substitutions at positions selected from positions 143, 147, 151, and 154 of the native amino acid sequence.
- the one or more substitutions include substitutions are at one or more positions selected from D143, T147, E151, and K154.
- the one or more substitutions include one or more substitutions selected from D143R, T147R, E151R, and K154R.
- the modified amino acid sequence includes one or more substitutions at or no more than 30 amino acids upstream or downstream of position 241 of the native amino acid sequence.
- the modified amino acid sequence can include, for example, one, two, three, four, five, six, seven, eight, nine, ten, or more than ten substitutions within or no more than 30 amino acids upstream or downstream of position 241.
- At least one of the one or more substitutions to the native amino acid sequence can be, for example, within or no more than 28 amino acids, 26 amino acids, 24 amino acids, 22 amino acids, 20 amino acids, 18 amino acids, 16 amino acids, 14 amino acids, 12 amino acids, or 10 amino acids of position 241.
- At least one of the one or more substitutions within or no more than 30 amino acids upstream or downstream of position 241 is to an R, A, S, or G. In some embodiments, each of the one or more substitutions within or no more than 30 amino acids upstream or downstream of position 241 is independently to an R, A, S, or G. In some embodiments, all of the substitutions to the native amino acid sequence are at positions within or no more than 30 amino acids upstream or downstream of position 241.
- the one or more substitutions at positions having an electrically charged amino include substitutions are at one or more positions selected from KI 1, K73, D143, E151, K154, E241, D318, K330, K457, E425, E462, E507, E527, and E528.
- the one or more substitutions include one or more substitutions selected from KI IR, K73R, D143R, E151R, K154R, E241R, D318R, K330R, E425N, K457R, E462R, E507R, E527R, and E528R.
- the modified amino acid sequence includes a D143R substitution. In some embodiments, the only substitution in the modified amino acid sequence is D143R.
- the modified amino acid sequence of the engineered Cas protein includes two substitutions in the native amino acid sequence. In some embodiments, the modified amino acid sequence has exactly two substitutions in the native amino acid sequence. In some embodiments, the modified amino acid sequence includes two substitutions at positions selected from positions 143, 147, 151, 154, 241, 330, 425, 504, 507, 516, 519, 527, and 528. In some embodiments, the modified amino acid sequence has exactly two substitutions, where the exactly two substitutions are at positions selected from positions 143, 147, 151, 154, 241, 330, 425, 504, 507, 516, 519, 527, and 528.
- the modified amino acid sequence when the native amino acid sequence is the sequence of SEQ ID NO: 1, the modified amino acid sequence includes two substitutions at positions selected from D143, T147, E151, K154, E241, K330, E425, N504, E507, N516, N519, E527, and E528. In some embodiments, e g, when the native amino acid sequence is the sequence of SEQ ID NO: 1, the modified amino acid sequence has exactly two substitutions, where the exactly two substitutions are at positions selected from D143, T147, E151, K154, E241, K330, E425, N504, E507, N516, N519, E527, and E528.
- the modified amino acid sequence includes a substitution at position 143 and a substitution at a position selected from positions 147, 151, 154, 241, 330, 425, 504, 507, 516, 519, 527, and 528.
- the modified amino acid includes a substitution at position 143 and exactly one other substitution, where the exactly one other substitution is at a position selected from positions 147, 151, 154, 241, 330, 425, 504, 507, 516, 519, 527, and 528.
- the modified amino acid sequence includes a substitution at position D143 and a substitution at a position selected from positions T147, E151, K154, E241, K330R, E425N, N504, E507, N516, N519, E527, and E528.
- the modified amino acid includes a substitution at position D143 and exactly one other substitution, where the exactly one other substitution is at a position selected from positions T147, E151, K154, E241, K330R, E425N, N504, E507, N516, N519, E527, and E528.
- the modified amino acid includes two substitutions selected from D143R, T147R, E151R, E151A, K154R, E241R, N504R, E507R, N516R, N519R, E527R, and E528R.
- the modified amino acid includes exactly two substitutions, where the two substitutions are selected from D143R, T147R, E151R, E151A, K154R, E241R, N504R, E507R, N516R, N519R, E527R, and E528R.
- the modified amino acid includes two substitutions selected from D143R/T147R, D143R/E151R, D143R/E241R, D143R/E425N, D143R/E507R, D143R/N519R, D143R/E527R,, D143R/E528R, D143R/R151S, D143/R151G, and D143R/E151A.
- D143R/T147R D143R/E151R, D143R/E241R, D143R/E425N, D143R/E507R, D143R/N519R, D143R/E527R,, D143R/E528R, D143R/R151S, D143/R151G, and D143R/E151A.
- the modified amino acid when the native amino acid sequence is the sequence of SEQ ID NO: 1, the modified amino acid includes exactly two substitutions, where the two substitutions are selected from D143R/T147R, D143R/E151R, D143R/E241R, D143R/E425N, D143R/E507R, D143R/N519R, D143R/E527R,, D143R/E528R, D143R/R151S, D143/R151G, and D143R/E151A.
- the modified amino acid sequence includes a D143R substitution and a T147R substitution.
- the only substitutions in the modified amino acid sequence are a D143R substitution and a T147R substitution.
- a dCas protein or a variant thereof where one or more amino acids of the parental Cas protein from which it is derived have been altered or otherwise removed to reduce or eliminate its nuclease activity.
- the amino acids include D326 and D510 with respect to SEQ ID NO: 1.
- one or both of D326 and D510 are substituted with an amino acid that reduces, substantially eliminates, or eliminates nuclease activity.
- one or both of D326 and D510 are substituted with alanine (e.g., D326A and/or D510A based on SEQ ID NO: 1).
- the dCas protein exhibits reduced or eliminated nuclease activity, or nuclease activity is absent or substantially absent within levels of detection.
- the dCas protein or a variant thereof comprises the amino acid sequence of SEQ ID NO: 1 or a variant thereof having at least about 70%, at least about 71%, at least about 72%, at least about 73%, at least about 74%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or greater sequence identity to the amino acid sequence of SEQ ID NO: 1.
- the target nucleic acid is dsDNA.
- dsDNA-targeting specificity is determined, at least in part, by two parameters: the gRNA spacer targeting a protospacer in the target dsDNA (the sequence in the target dsDNA corresponding to the gRNA spacer on the non-complementary DNA strand) and a short sequence, the protospacer-adjacent motif (PAM), located immediately 5' (upstream) of the protospacer on the non-complementary DNA strand.
- the PAM is 5'-TTTG-3' or 5'-TTTA-3'.
- the PAM is 5'-TTTG-3'. In some embodiments, the PAM is 5'-TTTA-3'.
- the target nucleic acid is RNA.
- RNA-targeting specificity is determined, at least in part, by the gRNA spacer targeting a protospacer-like sequence in the target RNA (the sequence in the target RNA complementary to the gRNA spacer), and is independent of the sequence located immediately 5' (upstream) of the protospacer-like sequence.
- the Cas protein system is also capable of targeting a dsDNA molecule, wherein the gRNA spacer is selected such that it targets a protospacer in the target dsDNA molecule having a PAM selected from 5'-TTTG-3' and 5 -TTTA-3'.
- the Cas protein system is incapable of targeting a dsDNA molecule, wherein the gRNA spacer is selected such that any protospacers in the dsDNA molecule targeted by the gRNA spacer do not have a PAM selected from 5'-TTTG-3' and 5'-TTTA-3'.
- a actuator moiety can comprise a zinc finger nuclease (ZFN) or a variant, fragment, or derivative thereof.
- ZFN can refer to a fusion between a cleavage domain, such as a cleavage domain of Fokl, and at least one zinc finger motif (e.g., at least 2, at least 3, at least 4, or at least 5 zinc finger motifs) which can bind polynucleotides such as DNA and RNA.
- a ZFN is used in a targeting moiety of the disclosure to bind a polynucleotide (e.g., target gene or target gene regulatory sequence), but the ZFN does not cleave or substantially does not cleave the polynucleotide, e.g., a nuclease dead ZFN.
- a ZFN or a variant, fragment, or derivative thereof can be fused to or associated with one of more heterologous gene effectors to form a complex of the disclosure.
- the heterodimerization at certain positions in a polynucleotide of two individual ZFNs in certain orientation and spacing can lead to cleavage of the polynucleotide in nuclease-active ZFN.
- a ZFN binding to DNA can induce a double-strand break in the DNA.
- two individual ZFNs can bind opposite strands of DNA with their C-termini at a certain distance apart.
- linker sequences between the zinc finger domain and the cleavage domain can require the 5' edge of each binding site to be separated by about 5-7 base pairs.
- a cleavage domain is fused to the C-terminus of each zinc finger domain.
- the cleavage domain of an actuator moiety comprising a ZFN comprises a modified form of a wild type cleavage domain.
- the modified form of the cleavage domain can comprise an amino acid change (e.g., deletion, insertion, or substitution) that reduces the nucleic acid-cleaving activity of the cleavage domain.
- the modified form of the cleavage domain can have no more than 90%, no more than 80%, no more than 70%, no more than 60%, no more than 50%, no more than 40%, no more than 30%, no more than 20%, no more than 10%, no more than 5%, or no more than 1% of the nucleic acid-cleaving activity of the corresponding wild-type cleavage domain.
- the modified form of the cleavage domain can have no substantial nucleic acid-cleaving activity.
- the cleavage domain is enzymatically inactive.
- a actuator moiety can comprise a “TALEN” or “TAL-effector nuclease” or a variant, fragment, or derivative thereof.
- TALENs refer to engineered transcription activator-like effector nucleases that generally contain a central domain of DNA-binding tandem repeats and a cleavage domain. TALENs can be produced by fusing a TAL effector DNA binding domain to a DNA cleavage domain.
- a DNA-binding tandem repeat comprises 33-35 amino acids in length and contains two hypervariable amino acid residues at positions 12 and 13 that can recognize at least one specific DNA base pair.
- a transcription activator-like effector (TALE) protein can be fused to a nuclease such as a wild-type or mutated Fokl endonuclease or the catalytic domain ofFokl.
- a TALEN is used in a targeting moiety of the disclosure to bind a polynucleotide (e.g., target gene or target gene regulatory sequence), but the TALEN does not cleave or substantially does not cleave the polynucleotide, e.g., a nuclease dead TALEN.
- a TALEN or a variant, fragment, or derivative thereof can be fused to or associated with one of more heterologous gene effectors to form a complex of the disclosure.
- a TALEN is engineered for reduced nuclease activity.
- the nuclease domain of a TALEN comprises a modified form of a wild type nuclease domain.
- the modified form of the nuclease domain can comprise an amino acid change (e.g., deletion, insertion, or substitution) that reduces the nucleic acid-cleaving activity of the nuclease domain.
- the modified form of the nuclease domain can have no more than 90%, no more than 80%, no more than 70%, no more than 60%, no more than 50%, no more than 40%, no more than 30%, no more than 20%, no more than 10%, no more than 5%, or no more than 1% of the nucleic acid-cleaving activity of the wild-type nuclease domain.
- the modified form of the nuclease domain can have no substantial nucleic acid-cleaving activity.
- the nuclease domain is enzymatically inactive.
- a TALEN or a variant, fragment, or derivative thereof can be fused to or associated with one of more heterologous gene effectors to form a complex of the disclosure.
- TALENs which, for example, improve cleavage specificity or activity.
- Such TALENs can be engineered to bind any desired DNA sequence.
- TALENs can be used to generate gene modifications (e.g., nucleic acid sequence editing) by creating a double-strand break in a target DNA sequence, which in turn, undergoes NHF.J or HDR
- a TALE or a variant, fragment, or derivative thereof can be fused to or associated with one of more heterologous gene effectors to form a complex of the disclosure.
- the transcription activator-like effector (TALE) protein is fused to a heterologous gene effector and does not comprise a nuclease.
- a TALEN does not cleave or substantially does not cleave the polynucleotide, e.g., a nuclease dead TALE.
- a TALE or a variant, fragment, or derivative thereof can be fused to or associated with one of more heterologous gene effectors to form a complex of the disclosure.
- the complex of the transcription activator-like effector (TALE) protein and the heterologous gene effector is designed to function as a transcriptional activator.
- the complex of the transcription activator-like effector (TALE) protein and the heterologous gene effector is designed to function as a transcriptional repressor.
- the DNA-binding domain of the transcription activator-like effector (TALE) protein can be fused (e.g., linked) to one or more heterologous gene effectors that comprise transcriptional activation domains, or to one or more heterologous gene effectors that comprise transcriptional repression domains.
- a actuator moiety can comprise a meganuclease.
- Meganucleases generally refer to rare-cutting endonucleases or homing endonucleases that can be highly sequence specific. Meganucleases can recognize DNA target sites ranging from at least 12 base pairs in length, e g., from 12 to 40 base pairs, 12 to 50 base pairs, or 12 to 60 base pairs in length. Meganucleases can be modular DNA-binding nucleases such as any fusion protein comprising at least one catalytic domain of an endonuclease and at least one DNA binding domain or protein specifying a nucleic acid target sequence.
- the DNA-binding domain can contain at least one motif that recognizes single- or double-stranded DNA.
- a nuclease-active meganuclease can generate a double-stranded break.
- a meganuclease is used in a targeting moiety of the disclosure to bind a polynucleotide (e.g., target gene or target gene regulatory sequence), but the meganuclease does not cleave or substantially does not cleave the polynucleotide, e.g., a nuclease dead meganuclease.
- a meganuclease or a variant, fragment, or derivative thereof can be fused to or associated with one of more heterologous gene effectors to form a complex of the disclosure.
- the meganuclease can be monomeric or dimeric. In some embodiments, the meganuclease is naturally-occurring (found in nature) or wild-type, and in other instances, the meganuclease is non-natural, artificial, engineered, synthetic, rationally designed, or man-made. In some embodiments, the meganuclease of the present disclosure includes an I-Crel meganuclease, I-Ceul meganuclease, I-Msol meganuclease, I-Scel meganuclease, variants thereof, derivatives thereof, and fragments thereof.
- the nuclease domain of a meganuclease comprises a modified form of a wild type nuclease domain.
- the modified form of the nuclease domain can comprise an amino acid change (e.g., deletion, insertion, or substitution) that reduces or eliminates the nucleic acid-cleaving activity of the nuclease domain.
- the modified form of the nuclease domain can have no more than 90%, no more than 80%, no more than 70%, no more than 60%, no more than 50%, no more than 40%, no more than 30%, no more than 20%, no more than 10%, no more than 5%, or no more than 1% of the nucleic acid-cleaving activity of the wild-type nuclease domain.
- the modified form of the nuclease domain can have no substantial nucleic acid-cleaving activity.
- the nuclease domain is enzymatically inactive.
- a meganuclease can bind DNA but cannot cleave the DNA.
- a nuclease-inactive meganuclease is fused to or associated with one or more heterologous gene effectors to generate a complex of the disclosure.
- the heterologous polypeptide comprising the actuator moiety can regulate expression and/or activity of a target gene (e.g., target endogenous gene).
- the heterologous polypeptide and/or a complex thereof can edit the sequence of a nucleic acid (e.g., a gene and/or gene product).
- a nuclease-active Cas protein can edit a nucleic acid sequence by generating a double-stranded break or single-stranded break in a target polynucleotide.
- the heterologous polypeptide comprising the actuator moiety can generate a double-strand break in a target polynucleotide, such as DNA.
- a double-strand break in DNA can result in DNA break repair which allows for the introduction of gene modification(s) (e.g., nucleic acid editing).
- a nuclease induces site-specific single-strand DNA breaks or nicks, thus resulting in HDR.
- a double-strand break in DNA can result in DNA break repair which allows for the introduction of gene modification(s) (e.g., nucleic acid editing).
- DNA break repair can occur via non-homologous end joining (NHEJ) or homology-directed repair (HDR).
- NHEJ non-homologous end joining
- HDR homology-directed repair
- a donor DNA repair template or template polynucleotide that contains homology arms flanking sites of the target DNA can be provided.
- the heterologous polypeptide comprising the actuator moiety does not generate a double-strand break in a target polynucleotide, such as DNA. Binding of the heterologous polypeptide of the complex comprising the heterologous polypeptide (e.g., a complex comprising a dCas-effector and a guide RNA) without a nucleic acid break can be sufficient to regulate expression (e.g., enhance or suppress) of a target gene (e.g., endogenous target gene).
- a target polynucleotide such as DNA.
- the disclosure provides compositions, methods, and systems for modulating expression of target genes.
- the target genes can be one or more endogenous target genes, such as a disease causing allele, e.g., a mutant allele.
- a disease causing allele e.g., a mutant allele.
- disclosed herein are complexes that comprise a guide moiety and one or more heterologous polypeptides comprising an actuator moiety that can increase or decrease an activity or expression level of a target gene.
- a target gene or regulatory sequence thereof is endogenous to a cell, for example, present in the cell’s genome, or endogenous to a subject, for example, present in the subject’s genome. In some embodiments, a target gene or regulatory sequence thereof is not part of an engineered reporter system.
- a target gene is exogenous to a host subject, for example, a pathogen target gene or an exogenous gene expressed as a result of a therapeutic intervention, such as a gene therapy and/or cell therapy.
- a target gene is an exogenous reporter gene.
- a target gene is an exogenous synthetic gene.
- an expression level is an RNA expression level can be measured by, for example, RNAseq, qPCR, microarray, gene array, FISH, etc.
- an expression level is a protein expression level can be measured by, for example, Western Blot, ELISA, multiplex immunoassay, mass spectrometry, NMR, proteomics, flow cytometry, mass cytometry, etc.
- the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells) by at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 2-fold, at least about 3 fold, at least about 4 fold, at least about 5 fold, at least about 6 fold, at least about 7 fold, at least about 8 fold, at least about 9 fold, at least about 10 fold, at least about 11 fold, at least about 12 fold, at least about 13 fold, at least about 14, at least fold about 15 fold, at least about 20 fold, at least about 30 fold, at least about 40 fold, at least about 50 fold, at least about 60 fold, at least about 70 fold, at least about 80 fold, at least about 90 fold, at least about 100 fold, at least about 150 fold
- the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells) by at most about 50%, at most about 60%, at most about 70%, at most about 80%, at most about 90%, at most about 2-fold, at most about 3 fold, at most about 4 fold, at most about 5 fold, at most about 6 fold, at most about 7 fold, at most about 8 fold, at most about 9 fold, at most about 10 fold, at most about 11 fold, at most about 12 fold, at most about 13 fold, at most about 14, at most fold about 15 fold, at most about 20 fold, at most about 30 fold, at most about 40 fold, at most about 50 fold, at most about 60 fold, at most about 70 fold, at most about 80 fold, at most about 90 fold, at most about 100 fold, at most about 150 fold, at most about 200 fold, at most about 250 fold, at most about 300 fold, at most about 500 fold, at most about
- the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells) by about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, about 2-fold, about 3 fold, about 4 fold, about 5 fold, about 6 fold, about 7 fold, about 8 fold, about 9 fold, about 10 fold, about 11 fold, about 12 fold, about 13 fold, about 14, about 15 fold, about 20 fold, about 30 fold, about 40 fold, about 50 fold, about 60 fold, about 70 fold, about 80 fold, about 90 fold, about 100 fold, about 150 fold, about 200 fold, about 250 fold, about 300 fold, about 350 fold, about 400 fold, about 500 fold, about 600 fold, about 700 fold, about 800 fold, about 900 fold, about 1000 fold, about 1500 fold, about 2000 fold, about 3000 fold, about 5000 fold, or about 10
- the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells) from below a limit of detection to a detectable level.
- the degree in change of expression is relative to before introducing the system of the present disclosure (e.g., a complex comprising the heterologous polypeptide) into the cell or population of cells.
- the degree in change of expression is relative to a corresponding control cell or population of cells that are not treated with the system of the present disclosure.
- the degree in change of expression is relative to a corresponding control cell or population of cells that are treated with an alternative to the system of the present disclosure.
- the system and method as disclosed herein can modulate (e.g., increase or decrease) an activity level of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells).
- An activity level can be determined by a suitable functional assay for the target gene in question depending on the functional characteristics of the target gene. For example, an activity level of a target gene that is a mitogen could be determined by measuring cell proliferation; an activity level of a target gene that induces apoptosis could be measured by an annexin V assay or other suitable cell death assay; an activity level of an anti-inflammatory cytokine could be measured by an LPS-induced cytokine release assay.
- the system and method as disclosed herein can modulate (e.g., increase or decrease) the activity of the target gene by at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 2-fold, at least about 3 fold, at least about 4 fold, at least about 5 fold, at least about 6 fold, at least about 7 fold, at least about 8 fold, at least about 9 fold, at least about 10 fold, at least about 11 fold, at least about 12 fold, at least about 13 fold, at least about 14, at least about 15 fold, at least about 20 fold, at least about 30 fold, at least about 40 fold, at least about 50 fold, at least about 60 fold, at least about 70 fold, at least about 80 fold, at least about 90 fold, at least about 100 fold, at least about 150 fold, at least about 200 fold, at least about 250 fold, at least about 300 fold, at least about 350 fold, at least about 400 fold,
- the system and method as disclosed herein can modulate (e.g., increase or decrease) the activity of the target gene by at most 50%, at most 60%, at most 70%, at most 80%, at most 90%, at most about 2-fold, at most about 3 fold, at most about 4 fold, at most about 5 fold, at most about 6 fold, at most about 7 fold, at most about 8 fold, at most about 9 fold, at most about 10 fold, at most about 11 fold, at most about 12 fold, at most about 13 fold, at most about 14, at most about 15 fold, at most about 20 fold, at most about 30 fold, at most about 40 fold, at most about 50 fold, at most about 60 fold, at most about 70 fold, at most about 80 fold, at most about 90 fold, at most about 100 fold, at most about 150 fold, at most about 200 fold, at most about 250 fold, at most about 300 fold, at most about 350 fold, at most about 400 fold, at most about 500 fold, at most about 600 fold, at most about 700 fold, at most about 800 fold, at most about 2-fold, at most about
- the system to method increases the expression of the endogenous target gene encoding the target protein by at least about 0.01 fold to about 5,000 fold. In some embodiments, the system to method increases the expression of the endogenous target gene encoding the target protein by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.01 fold to about
- the system to method increases the expression of the endogenous target gene encoding the target protein by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold. In some embodiments, the system to method increases the expression of the endogenous target gene encoding the target protein by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold.
- the system to method increases the expression of the endogenous target gene encoding the target protein by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
- the system to method increases the expression of the endogenous target gene encoding the target protein, where the target protein is a PRPF31. In some embodiments, the system to method increases the expression of PRPF31 by at least about 0.01 fold to about 5,000 fold.
- the system to method increases the expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.1 fold to about 1 fold, about 0.1 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50
- the system to method increases the expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold. In some embodiments, the system to method increases the expression of PRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold.
- the system to method increases the expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
- the system to method increases the expression of PRPF31 without increasing expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31. In some embodiments, the system to method, without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 5,000 fold.
- the system to method without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold, about 0.01
- the system to method without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
- the system to method without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression ofPRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold.
- the system to method without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
- the system to method increases the expression of PRPF31 without increasing expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19. In some embodiments, the system to method, without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 5,000 fold.
- the system to method without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0 01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about
- the system to method without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
- the system to method without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression ofPRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold.
- the system to method without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
- the system to method increases the expression of PRPF31 without increasing expression of TCF3 Fusion Partner (TFPT). In some embodiments, the system to method, without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 5,000 fold.
- TFPT TCF3 Fusion Partner
- the system to method without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression ofPRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold
- the system to method without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
- the system to method without increasing the expression of TFPT, increases the expression ofPRPF31 compared to endogenous expression ofPRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold.
- the system to method without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
- the systems and methods as disclosed herein can modulate (e.g., increase or decrease) the activity of the target gene by about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, about 2-fold, about 3 fold, about 4 fold, about 5 fold, about 6 fold, about 7 fold, about 8 fold, about 9 fold, about 10 fold, about 11 fold, about 12 fold, about 13 fold, about 14, about 15 fold, about 20 fold, about 30 fold, about 40 fold, about 50 fold, about 60 fold, about 70 fold, about 80 fold, about 90 fold, about 100 fold, about 150 fold, about 200 fold, about 250 fold, about 300 fold, about 350 fold, about 400 fold, about 500 fold, about 600 fold, about 700 fold, about 800 fold, about 900 fold, about 1000 fold, about 1500 fold, about 2000 fold, about 3000 fold, about 5000 fold, or about 10000 fold.
- the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells) from below a limit of detection to a detectable level.
- the degree in change of an activity level is relative to before introducing the system of the present disclosure (e.g., a complex comprising the heterologous polypeptide) into the cell or population of cells. In some embodiments, the degree in change of an activity level is relative to a corresponding control cell or population of cells that are not treated with the system of the present disclosure. In some embodiments, the degree in change of an activity level is relative to a corresponding control cell or population of cells that are treated with an alternative to the system of the present disclosure.
- the system of the present disclosure e.g., a complex comprising the heterologous polypeptide
- the systems and methods of the present disclosure can, in some cases, elicit changes in expression and/or activity level of a target gene (e.g., target endogenous gene) that persists for longer than can be achieved with alternative compositions and methods (e.g., suppression via RNAi, e.g., using siRNA).
- a target gene e.g., target endogenous gene
- alternative compositions and methods e.g., suppression via RNAi, e.g., using siRNA.
- persistent modulation of gene expression is advantageous as compared to transient modulation
- the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression and/or activity level of a target gene for at least about 1 hour, at least about 2 hours, at least about 3 hours, at least about 4 hours, at least about 5 hours, at least about 6 hours, at least about 7 hours, at least about 8 hours, at least about 9 hours, at least about 10 hours, at least about 12 hours, at least about 14 hours, at least about 18 hours, at least about 20 hours, at least about 1 day, at least about 2 days, at least about 3 days, at least about 4 days, at least about 5 days, at least about 6 days, at least about 7 days, at least about 8 days, at least about
- 9 days at least about 10 days, at least about 14 days, at least about 21 days, at least about 28 days, at least about 5 weeks, at least about 6 weeks, at least about 7 weeks, at least about 8 weeks, at least about 9 weeks, at least about 10 weeks, at least about 12 weeks, at least about 14 weeks, at least about 18 weeks, at least about 20 weeks, at least about 26 weeks, or at least about 5 months, at least about 6 months, at least about 9 months, at least about 12 months, or more.
- the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression and/or activity level of a target gene (e.g., target endogenous gene) to above a certain threshold for at most about 1 hour, at most about 2 hours, at most about 3 hours, at most about 4 hours, at most about 5 hours, at most about 6 hours, at most about 7 hours, at most about 8 hours, at most about 9 hours, at most about 10 hours, at most about 12 hours, at most about 14 hours, at most about 18 hours, at most about 20 hours, at most about 1 day, at most about 2 days, at most about 3 days, at most about 4 days, at most about 5 days, at most about 6 days, at most about 7 days, at most about 8 days, at most about 9 days, at most about 10 days, at most about 14 days, at most about 21 days, at most about 28 days, at most about 5 weeks, at most about 6 weeks, at most about 7 weeks, at most about 8 weeks, at most about 9 weeks, at most about 10
- a target gene
- the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression and/or activity level of a target gene (e.g., target endogenous gene) to above a certain threshold for about 1 hour, about 2 hours, about 3 hours, about 4 hours, about 5 hours, about 6 hours, about 7 hours, about 8 hours, about 9 hours, about 10 hours, about 12 hours, about 14 hours, about 18 hours, about 20 hours, about 1 day, about 2 days, about 3 days, about 4 days, about 5 days, about 6 days, about 7 days, about 8 days, about 9 days, about
- a target gene e.g., target endogenous gene
- the target gene (e.g., endogenous target gene) can be a diseasecausing allele, such as a mutant variant of a wild type allele.
- the disease can be a genetic disease, such as a hereditary disorder.
- Non-limiting examples of the genetic disorder can include Duchenne muscular dystrophy (DMD), hemophilia, cystic fibrosis, Huntington's chorea, familial hypercholesterolemia (LDL receptor defect), hepatoblastoma, Wilson's disease, congenital hepatic porphyria, inherited disorders of hepatic metabolism, Lesch Nyhan syndrome, sickle cell anemia, thalassaemias, xeroderma pigmentosum, Fanconi's anemia, retinitis pigmentosa, ataxia telangiectasia, Bloom's syndrome, retinoblastoma, and Tay-Sachs disease.
- the target gene can be a gene encoding a protein.
- the target gene can be a gene regulatory sequence (e.g., promoters, enhancers, repressors, silencers, insulators, cis-regulatory elements, trans-regulatory elements, epigenetic modification (e.g., DNA methylation) sites, etc.) that can influence expression of a gene encoding a protein of interest as provided herein.
- a gene regulatory sequence e.g., promoters, enhancers, repressors, silencers, insulators, cis-regulatory elements, trans-regulatory elements, epigenetic modification (e.g., DNA methylation) sites, etc.
- target gene regulatory sequences can be physically located outside of the transcriptional unit or open reading frame that encodes a product of the target gene.
- a target gene regulatory sequence does not contain a nucleotide sequence that is exogenous to the subject or host cell. In some embodiments, a target gene regulatory sequence does not contain an engineered or artificially generated or introduced nucleotide sequence.
- a target gene (e.g., target endogenous gene) is a gene that is overexpressed or under-expressed in a disease or condition.
- a target gene is a gene that is over-expressed or under-expressed in a heritable genetic disease.
- the disease is retinitis pigmentosa 11 (RP11).
- a target gene e.g., an endogenous target gene
- a disease causing gene e.g., a mutant allele
- the systems and compositions of the present disclosure can further comprise a heterologous polynucleotide encoding a non-disease causing gene thereof (e.g., a wild type allele), e.g., as a gene replacement therapy.
- the methods as disclosed herein can comprise introducing such system or compositions to a cell or to a subject, e.g., contacting the cell with such systems or compositions (e.g., via delivery or expression of such systems or compositions in the cell).
- the systems and compositions can comprise the non-disease causing wild type or variant of the target gene, as abovementioned.
- the systems and compositions can comprise a heterologous polynucleotide sequence encoding (or comprising) at least the non-disease causing wild type or variant of the target gene (e.g., that of the endogenous target gene) as disclosed herein.
- the present disclosure provides a composition comprising at least a portion of the system as described, e.g., (i) the heterologous polypeptide comprising the actuator moiety or a heterologous polynucleotide encoding the heterologous polypeptide, (ii) the guide nucleic acid or a heterologous polynucleotide encoding the guide nucleic acid, as disclosed herein, (iii) the heterologous polynucleotide encoding a non-disease causing allele of a gene, for use in any of the methods as disclosed herein.
- the subject composition can be usable for modifying a cell in vitro, ex vivo, or in vivo.
- the subject composition can be usable for treating or enhancing a condition of a subject, as disclosed herein.
- composition as disclosed herein can comprise an active ingredient (e.g., the heterologous polypeptide comprising the actuator moiety, the guide nucleic acid, the heterologous polynucleotide encoding the non-disease causing allele of a gene, etc.) and optionally an additional ingredient (e.g., excipient). If necessary and/or desirable, the composition can be divided, shaped and/or packaged into a desired single- or multi-dose unit or single-or multi-implantation unit.
- an active ingredient e.g., the heterologous polypeptide comprising the actuator moiety, the guide nucleic acid, the heterologous polynucleotide encoding the non-disease causing allele of a gene, etc.
- an additional ingredient e.g., excipient
- the composition can comprise one or more heterologous polynucleotides encoding the active ingredients as disclosed herein.
- each member can be encoded by a different heterologous polynucleotide.
- two or more (e.g., all of) the ingredients can be encoded by a single heterologous polynucleotide.
- a single heterologous polynucleotide an encode (i) the heterologous polypeptide comprising the actuator moiety (e.g., dCas- transcriptional effector fusion protein, such as dCas-KRAB or dCas-DNMT) and (ii) one or more guide nucleic acids (e g., at least 1, at least 2, at least 3, at least 4, at least 5, or more guide nucleic acids) for targeting specific region(s) or sequence(s) of the target gene.
- the actuator moiety e.g., dCas- transcriptional effector fusion protein, such as dCas-KRAB or dCas-DNMT
- guide nucleic acids e g., at least 1, at least 2, at least 3, at least 4, at least 5, or more guide nucleic acids
- a single heterologous polynucleotide an encode (i) the heterologous polypeptide comprising the actuator moiety (e.g., dCas-transcriptional effector fusion protein, such as dCas-KRAB, dCas- DNMT, or dCas-VPR), (ii) one or more guide nucleic acids (e.g., at least 1, at least 2, at least 3, at least 4, at least 5, or more guide nucleic acids) for targeting specific region(s) or sequence(s) of the target gene, and (iii) the heterologous polynucleotide encoding a non-disease causing allele of a gene.
- the actuator moiety e.g., dCas-transcriptional effector fusion protein, such as dCas-KRAB, dCas- DNMT, or dCas-VPR
- guide nucleic acids e.g., at least 1, at least
- the one or more heterologous polynucleotides can further comprise one or more promoters (or one or more transcriptional control elements, as used interchangeably herein). Different active ingredients encoded by the one or more heterologous polynucleotides can be under the control of the same promoter or different promoters.
- a promoter as disclosed herein can be active in a eukaryotic, mammalian, non-human mammalian or human cell.
- the promoter can be an inducible or constitutively active promoter. Alternatively or additionally, the promoter can be tissue or cell specific.
- suitable eukaryotic promoters i.e.
- promoters functional in a eukaryotic cell can include those from cytomegalovirus (CMV) immediate early, herpes simplex virus (HSV) thymidine kinase, early and late SV40, long terminal repeats (LTRs) from retrovirus, human elongation factor-1 promoter (EFl), a hybrid construct comprising the cytomegalovirus (CMV) enhancer fused to the chicken beta-active promoter (CAG), murine stem cell virus promoter (MSCV), phosphoglycerate kinase- 1 locus promoter (PGK) and mouse metallothionein-I.
- the promoter can be a fungi promoter.
- the promoter can be a plant promoter.
- a database of plant promoters can be found (e.g., PlantProm).
- the expression vector may also contain a ribosome binding site for translation initiation and a transcription terminator.
- the expression vector may also include appropriate sequences for amplifying expression.
- a promoter as disclosed herein can be a promoter specific for any of the tissues provided herein, or a promoter specific for any of the cell types provided herein.
- a heterologous polynucleotide of the one or more heterologous polynucleotides can have a size of at least or up to about 2.5 kilobases, at least or up to about 2.6 kilobases, at least or up to about 2.7 kilobases, at least or up to about 2.8 kilobases, at least or up to about 2.9 kilobases, at least or up to about 3.0 kilobases, at least or up to about 3.1 kilobases, at least or up to about 3.2 kilobases, at least or up to about 3.3 kilobases, at least or up to about 3.4 kilobases, at least or up to about 3.5 kilobases, at least or up to about 3.6 kilobases, at least or up to about 3.7 kilobases, at least or up to about 3.8 kilobases, at least or up to
- the heterologous polynucleotide of the one or more heterologous polynucleotides can have a size of between about 3 kilobases and about 5 kilobases, between about 3 kilobases and about 4.8 kilobases, between about 3 kilobases and about 4.6 kilobases, between about 3 kilobases and about 4.4 kilobases, between about 3 kilobases and about 4.2 kilobases, between about 3 kilobases and about 4.0 kilobases, between about 3 kilobases and about 3.5 kilobases, between about 3.5 kilobases and about 5 kilobases, between about 3.5 kilobases and about 4.8 kilobases, between about 3.5 kilobases and about 4.6 kilobases, between about 3.5 kilobases and about 4.4 kilobases,
- a method of delivery of the one or more heterologous polynucleotides provided herein to the cell can involve viral delivery methods or non-viral delivery methods.
- the one or more heterologous polynucleotides can be one or more viral vectors (e.g., one or more AAV vectors).
- the one or more heterologous polynucleotides can be non-viral vectors that are complexed with or encapsulated by non-viral delivery moieties, such as cationic lipids and/or lipid particles (e.g., lipid nanoparticles (LNP)).
- LNP lipid nanoparticles
- Methods of non-viral delivery of nucleic acids can include lipofection, nucleofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipidmucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA.
- Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides can be used. Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration).
- the compositions and systems provided herein are delivered to a subject using a viral vector.
- the viral vector is an adeno-associated viral (AAV) vector.
- AAV adeno-associated viral
- rAAV refers to recombinant adeno-associated virus, also referred to as a recombinant AAV vector (or “rAAV vector”).
- AAV includes AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, A AVI 0, AAV11, AAV12, rhlO, and hybrids thereof, avian AAV, bovine AAV, canine AAV, equine AAV, primate AAV, non-primate AAV, and ovine AAV.
- TRs native terminal repeats
- Rep proteins Rep proteins
- capsid subunits are known in the art. Such sequences may be found in the literature or in public databases such as GenBank.
- rAAV vector refers to an AAV vector comprising a polynucleotide sequence not of AAV origin (i.e., a polynucleotide heterologous to AAV), typically a sequence of interest for the genetic transformation of a cell.
- the heterologous polynucleotide is flanked by at least one, and generally by two, AAV inverted terminal repeat sequences (ITRs).
- ITRs AAV inverted terminal repeat sequences
- the term rAAV vector encompasses both rAAV vector particles and rAAV vector plasmids.
- An rAAV vector may either be single-stranded (ssAAV) or self-complementary (scAAV).
- An “AAV virus” or “AAV viral particle” or “rAAV vector particle” refers to a viral particle composed of at least one AAV capsid protein and an encapsidated polynucleotide rAAV vector. If the particle comprises a heterologous polynucleotide (i.e., a polynucleotide other than a wild-type AAV genome such as a transgene to be delivered to a mammalian cell), it is typically referred to as an “rAAV vector particle” or simply an “rAAV vector”. Thus, production of rAAV particle necessarily includes production of rAAV vector, as such a vector is contained within an rAAV particle.
- a heterologous polynucleotide i.e., a polynucleotide other than a wild-type AAV genome such as a transgene to be delivered to a mammalian cell
- the AAV vector is selected based on the tropism of viral vector.
- an AAV vector with tropism for the target tissue e.g., eye
- may be used e.g., AAV7, AAV8, AAV9 to deliver polynucleotides encoding the compositions and systems provided herein to the target tissue (e.g., eye).
- RNA or DNA viral based systems can be used to target specific cells in the body and trafficking the viral payload to the nucleus of the cell.
- Viral vectors can be administered directly (in vivo) or they can be used to treat cells in vitro, and the modified cells can optionally be administered (ex vivo).
- Viral based systems can include retroviral, lentivirus, adenoviral, adeno- associated and herpes simplex virus vectors for gene transfer. Integration in the host genome can occur with the retrovirus, lentivirus, and adeno-associated virus gene transfer methods, which can result in long term expression of the inserted transgene. High transduction efficiencies can be observed in many different cell types and target tissues.
- Lentiviral vectors are retroviral vectors that can transduce or infect non-dividing cells and produce high viral titers. Selection of a retroviral gene transfer system can depend on the target tissue. Retroviral vectors can comprise cis-acting long terminal repeats with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis-acting LTRs can be sufficient for replication and packaging of the vectors, which can be used to integrate the therapeutic gene into the target cell to provide permanent transgene expression.
- Retroviral vectors can include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), Simian Immuno deficiency virus (SIV), human immuno deficiency virus (HIV), and combinations thereof.
- MiLV murine leukemia virus
- GaLV gibbon ape leukemia virus
- SIV Simian Immuno deficiency virus
- HAV human immuno deficiency virus
- Adenoviral-based systems can be used. Adenoviral-based systems can lead to transient expression of the transgene. Adenoviral based vectors can have high transduction efficiency in cells and may not require cell division. High titer and levels of expression can be obtained with adenoviral based vectors. Adeno-associated virus (“AAV”) vectors can be used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures.
- AAV Adeno-associated virus
- Packaging cells can be used to form virus particles capable of infecting a host cell.
- Such cells can include 293 cells, (e.g., for packaging adenovirus), and Psi2 cells or PA317 cells (e.g., for packaging retrovirus).
- Viral vectors can be generated by producing a cell line that packages a nucleic acid vector into a viral particle.
- the vectors can contain the minimal viral sequences required for packaging and subsequent integration into a host.
- the vectors can contain other viral sequences being replaced by an expression cassette for the polynucleotide(s) to be expressed.
- the missing viral functions can be supplied in trans by the packaging cell line.
- AAV vectors can comprise ITR sequences from the AAV genome which are required for packaging and integration into the host genome.
- Viral DNA can be packaged in a cell line, which can contain a helper plasmid encoding the other AAV genes, namely rep and cap, while lacking ITR sequences.
- the cell line can also be infected with adenovirus as a helper.
- the helper virus can promote replication of the AAV vector and expression of AAV genes from the helper plasmid. Contamination with adenovirus can be reduced by, e g., heat treatment to which adenovirus is more sensitive than AAV.
- a host cell can be transiently or non-transiently transfected with one or more vectors described herein.
- a cell can be transfected as it naturally occurs in a subject.
- a cell can be taken or derived from a subject and transfected.
- a cell can be derived from cells taken from a subject, such as a cell line.
- a cell transfected with one or more vectors described herein is used to establish a new cell line comprising one or more vector-derived sequences.
- a cell transiently transfected with the compositions of the disclosure (such as by transient transfection of one or more vectors, or transfection with RNA), and modified through the activity of a an actuator moiety such as a CRISPR complex, is used to establish a new cell line comprising cells containing the modification but lacking any other exogenous sequence.
- a an actuator moiety such as a CRISPR complex
- Any suitable vector compatible with the host cell can be used with the methods of the disclosure.
- vectors for eukaryotic host cells include pXTl, pSG5 (StratageneTM), pSVK3, pBPV, pMSG, and pSVLSV40 (PharmaciaTM).
- the additional ingredient of the composition as disclosed herein can comprise an excipient.
- the excipient can include solvents, dispersion media, diluents, or other liquid vehicles, dispersion or suspension aids, surface active agents, isotonic agents, thickening or emulsifying agents, preservatives, lipidoids, liposomes, lipid nanoparticles, polymers, lipoplexes, core-shell nanoparticles, peptides, proteins, hyaluronidase, nanoparticle mimics, inert diluents, buffering agents, lubricating agents, oils, and combinations thereof.
- the composition as disclosed herein can include one or more excipients, each in an amount that together increases the stability of (i) the heterologous polypeptide or the heterologous gene encoding thereof and/or (ii) cells or modified cells.
- the present disclosure provides a kit comprising such composition and instructions directing (i) contacting the cell with the composition (e.g., in vitro, ex vivo, or in vivo), or (ii) administration of cells comprising any one of the compositions disclosed herein to a subject.
- the subject may have or may be suspected of having a condition, such as a hereditary disease.
- any of the compositions as disclosed herein can be administered to the subject via orally, intraperitoneally, intravenously, intraarterially, transdermally, intramuscularly, liposomally, via local delivery by catheter or stent, subcutaneously, intraadiposally, or intrathecally.
- the compositions and systems provided herein can be administered to a subject via intravenous administration.
- compositions e.g., pharmaceutical compositions
- compositions can be suitable for administration to humans.
- such compositions can be suitable for administration to any other animal, e.g., to non-human animals, e.g. non-human mammals.
- Modification of pharmaceutical compositions suitable for administration to humans in order to render the compositions suitable for administration to various animals is well understood, and the ordinarily skilled veterinary pharmacologist can design and/or perform such modification with merely ordinary, if any, experimentation.
- Subjects to which administration of the pharmaceutical compositions is contemplated include, but are not limited to, humans and/or other primates; mammals, including commercially relevant mammals such as cattle, pigs, horses, sheep, cats, dogs, mice, and/or rats; and/or birds, including commercially relevant birds such as poultry, chickens, ducks, geese, and/or turkeys.
- a cell as provided herein may be referred to as a target cell.
- the systems, compositions, and methods as provided herein can be applied to modify a target cell (e.g., modify expression profile of a target gene of the target cell).
- a target cell can include a wide variety of cell types.
- a target cell can be in vitro.
- a target cell can be in vivo.
- a target cell can be ex vivo.
- a target cell can be an isolated cell.
- a target cell can be a cell inside of an organism.
- a target cell can be an organism.
- a target cell can be a cell in a cell culture.
- a target cell can be one of a collection of cells.
- a target cell can be a mammalian cell or derived from a mammalian cell.
- a target cell can be a rodent cell or derived from a rodent cell.
- a target cell can be a human cell or derived from a human cell.
- a target cell can be a prokaryotic cell or derived from a prokaryotic cell.
- a target cell can be a bacterial cell or can be derived from a bacterial cell.
- a target cell can be an archaeal cell or derived from an archaeal cell.
- a target cell can be a eukaryotic cell or derived from a eukaryotic cell.
- a target cell can be a pluripotent stem cell.
- a target cell can be a plant cell or derived from a plant cell.
- a target cell can be an animal cell or derived from an animal cell.
- a target cell can be an invertebrate cell or derived from an invertebrate cell.
- a target cell can be a vertebrate cell or derived from a vertebrate cell.
- a target cell can be a microbe cell or derived from a microbe cell.
- a target cell can be a fungi cell or derived from a fungi cell.
- a target cell can be from a specific organ or tissue.
- a target cell can be a stem cell or progenitor cell.
- Target cells can include stem cells (e.g., adult stem cells, embryonic stem cells, induced pluripotent stem (iPS) cells) and progenitor cells (e.g., cardiac progenitor cells, neural progenitor cells, etc.).
- Target cells can include mammalian stem cells and progenitor cells, including rodent stem cells, rodent progenitor cells, human stem cells, human progenitor cells, etc.
- Clonal cells can comprise the progeny of a cell.
- a target cell can comprise a target nucleic acid.
- a target cell can be in a living organism.
- a target cell can be a genetically modified cell.
- a target cell can be a host cell.
- a target cell can be a primary cell.
- cultures of primary cells can be passaged 0 times, 1 time, 2 times, 4 times, 5 times, 10 times, 15 times or more.
- Cells can be unicellular organisms. Cells can be grown in culture.
- a target cell can be a diseased cell.
- a diseased cell can have altered metabolic, gene expression, and/or morphologic features.
- a diseased cell can be a cancer cell, a diabetic cell, and a apoptotic cell.
- a diseased cell can be a cell from a diseased subject. Exemplary diseases can include blood disorders, cancers, metabolic disorders, eye disorders, organ disorders, musculoskeletal disorders, cardiac disease, and the like.
- the target cells are primary cells, they may be harvested from an individual by any method.
- leukocytes may be harvested by apheresis, leukocytapheresis, density gradient separation, etc.
- Cells from tissues such as skin, muscle, bone marrow, spleen, liver, pancreas, lung, intestine, stomach, etc. can be harvested by biopsy.
- Non-limiting examples of cells which can be target cells include, but are not limited to, lymphoid cells, such as B cell, T cell (Cytotoxic T cell, Natural Killer T cell, Regulatory T cell, T helper cell), Natural killer cell, cytokine induced killer (CIK) cells; myeloid cells, such as granulocytes (Basophil granulocyte, Eosinophil granulocyte, Neutrophil granulocyte/Hypersegmented neutrophil), Monocyte/Macrophage, Red blood cell (Reticulocyte), Mast cell, Thrombocyte/Megakaryocyte, Dendritic cell; cells from the endocrine system, including thyroid (Thyroid epithelial cell, Parafollicular cell), parathyroid (Parathyroid chief cell, Oxyphil cell), adrenal (Chromaffin cell), pineal (Pinealocyte) cells; cells of the nervous system, including glial cells (Astrocyte, Microglia), Magnocellular neurosecretory cell, Stellate cell,
- Apocrine sweat gland cell odoriferous secretion, sex -hormone sensitive
- Gland of Moll cell in eyelid specialized sweat gland
- Sebaceous gland cell lipid-rich sebum secretion
- Bowman's gland cell in nose washes olfactory epithelium
- Brunner's gland cell in duodenum enzymes and alkaline mucus
- Seminal vesicle cell secretes seminal fluid components, including fructose for swimming sperm), Prostate gland cell (secretes seminal fluid components), Bulbourethral gland cell (mucus secretion), Bartholin's gland cell (vaginal lubricant secretion), Gland of Littre cell (mucus secretion), Uterus endometrium cell (carbohydrate secretion), Isolated goblet cell of respiratory and digestive tracts (mucus secretion), Stomach lining mucous cell (mucus secretion), Gas
- the cell can be engineered to comprise (or exhibit) any one of the systems or compositions as disclosed herein or can be treated by any one of the methods disclosed herein in vitro or ex vivo, then administered to the subject, e.g., to treat a condition of the subject.
- any subject modified cell product can be administered to the subject to treat a condition of a bodily tissue of the subject.
- the cell can be resident inside the subject’s body, and any of the systems or compositions thereof can be administered to the subject, to contact the cell by the systems/compositions (e.g., to engineer the cell with the systems/compositions).
- Gene expression can be modulated in a cell by utilizing a system or a method described herein.
- the gene being modulated by the system or the method can be a mutant allele that can cause a disease or condition in a subject.
- the gene being modulated can be a non-disease causing variant (e.g. a wild-type allele).
- the gene expression can modulated by the system or the method described herein by both decreasing the expression of the mutant allele in a cell and simultaneously increasing expression of the wildtype allele.
- the wild-type allele is encoded by at least one of the heterologous polynucleotides described herein.
- FIG. 1 illustrates an exemplary construct encoding the dCas and the actuator moiety (effector).
- dCas can be coupled with a transcription repressor for decreasing expression of the expression of the mutant allele of the endogenous target gene in the cell.
- FIG. 2 illustrates a schematic for treating retinitis pigmentosa with the system described herein.
- AAV can be engineered to deliver an exemplary construct via subretinal or intravitreal injection to a subject in need thereof.
- the modulation of the endogenous target gene expression by the system or method described herein can be used to treat a disease or condition in a subject.
- a subject suspected of having a disease or condition associated with mutation of the endogenous target gene can be first screened for the presence of a mutant allele of the endogenous target gene.
- the system described herein can be administered to the subject to simultaneously decrease expression of the mutant allele of endogenous target gene and increase expression of the non-disease causing allele of endogenous target gene.
- PRPF31 expression can be modulated in a cell by utilizing a system or a method described herein.
- the PRPF31 is a mutant allele ofPRPF31.
- the mutant PRPF31 can cause disease in a subject.
- the PRPF31 is a non-disease causing variant of PRPF31 (e.g. wild type allele of PRPF31).
- the PRPF31 expression can modulated by the system or the method described herein by both decreasing the expression of the mutant allele of PRPF31 in a cell and simultaneously increasing expression of the wild type allele of PRPF31.
- the wild type allele of PRPF31 is encoded by at least one of the heterologous polynucleotides described herein.
- FIG. 3 illustrates exemplary target endogenous polynucleotide sequences (e.g., transcripts) that can be targeted by the gRNA of the system and the method described herein for decreasing or increasing the expression of PRPF31.
- the modulation of the PRPF31 expression by the system or method described herein can be used to treat a disease or condition in a subject.
- a subject suspected of having a disease or condition associated with PRPF31 mutation can be first screened for the presence of PRPF31 variant (e.g., a mutant allele of PRPF31).
- the systems as provided herein can be delivered to a cell via one or more expression cassettes encoding one or more components of the systems.
- the one or more expression cassettes can comprise a vector, such as a viral vector (e.g., an AAV vector comprising two Inverted terminal repeats (ITRs)).
- FIGs. 4A-4F schematically illustrate examples of a single vector encoding one or more components the system as provided herein.
- FIG. 4A shows a schematic representation of a construct in which a RNA Pol II promoter drives the expression of a nuclear localization signal (NSL), a deactivated Cas (dCas), a protein linker (e.g., a GS linker), a modulator (e.g., a transcriptional effector), and Poly A signal placed at the 3 ’-end of the modulator.
- the construct includes a RNA Pol III promoter that drives the expression of a guide RNA (gRNA) exhibiting specific binding against an endogenous target gene (e.g., encoding PRPF3 l)-terminator sequence, which is placed at the 3’-end of the poly A signal.
- gRNA guide RNA
- FIG. 4B shows a schematic representation of a construct in which a RNA Pol III promoter that drives the expression of a guide RNA (gRNA) exhibiting specific binding against an endogenous target gene (e.g., encoding PRPF31)-terminator sequence, and a terminator sequence placed at the 3 ’-end of the gRNA.
- gRNA guide RNA
- the construct includes, downstream of the 3 ’-end of the terminator sequence, a RNA Pol II promoter drives the expression of a nuclear localization signal (NSL), a deactivated Cas (dCas), a protein linker (e.g., a GS linker), a modulator (e.g., a transcriptional effector), and Poly A signal placed at the 3’-end of the modulator.
- NSL nuclear localization signal
- dCas deactivated Cas
- protein linker e.g., a GS linker
- modulator e.g., a transcriptional effector
- Poly A signal placed at the 3’-end of the modulator.
- FIG. 4C shows a schematic representation of a construct in which a RNA Pol II promoter drives the expression of a nuclear localization signal (NSL), a deactivated Cas (dCas), a protein linker (e.g., a GS linker), a modulator (e.g., a transcriptional effector), and Poly A signal placed at the 3 ’-end of the modulator.
- the construct includes a RNA Pol III promoter that drives the expression of a guide RNA (gRNA) exhibiting specific binding against an endogenous target gene (e.g., encoding PRPF3 l)-terminator sequence.
- the RNA Pol III promoter (and the components under the control of such promoter) is downstream and on the opposite strand of the RNA Pol II promoter (and the components under the control of such promoter).
- FIG. 4D shows a schematic representation of a construct similar to that shown in FIG. 4A.
- the RNA Pol II promoter (and the components under the control of such promoter) in the construct of FIG. 4D is further downstream away from the 5’ ITR and towards the 3 ’ ITR.
- FIG. 4E shows a schematic representation of a construct similar to that shown in FIG. 4B. As compared to that in FIG. 4B, the RNA Pol II promoter (and the components under the control of such promoter) in the construct of FIG. 4E is further upstream towards the 5’ ITR and away from the 3 ’ ITR.
- FIG. 4F shows a schematic representation of a construct similar to that shown in FIG. 4C.
- the RNA Pol II promoter (and the components under the control of such promoter) in the construct of FIG. 4F is further upstream towards the 5’ ITR and away from the 3 ’ ITR.
- RPE retinal pigment epithelium
- gRNAs e.g., with different spacer sequences and/or different viral vector construct designs
- certain gRNAs yielded greater expression of PRPF31 in the RPE cells.
- Example 5 In vitro functional assay.
- Induced pluripotent stem cells derived from cells of healthy or non-diseased subjects/patients (control iPSC) and iPSCs derived from cells of a RPl l subjects/patients (e.g., RP11-iPSCs) can be differentiated into RPE cells (control RPE cells and RP11-RPE cells, respectively) using an established differentiation protocol.
- PRPF31 co-localization with small nuclear ribonucleoproteins (snRNPs) and/or ADP-ribosylation factor-like protein 13B (ARL13B) can be assessed for the iPSC-derived RPEs, to confirm PRPF31 localization in both splicing complexes and cilia.
- immunostaining against the cilia marker ARL13B can be performed on one or more members of the following: (i) wild type RPE cells (e.g., control RPE cells or isolated RPE cells) that is not engineered with the system of the present disclosure (e.g., a complex comprising dCas-activator and a guide RNA against PRPF31), (ii) RP11-RPE cells (e.g., RP11-iPSCs or isolated mutant RPE cells) that are not engineered with the system of the present disclosure, and (iii) RP11-RPE cells that are engineered with the system of the present disclosure. Both cilia incidence and cilia length can be measured via imaging (e g., ImagExpress Micro) for statistical analysis.
- imaging e g., ImagExpress Micro
- FIGs. 6A and 6B show examples images of spliceosome normal RPE cells (FIG. 6A, with cilium indicated by four arrows) and RPE cells with defected spliceosome (FIG. 6B).
- PRPF31 mutations can lead to impaired pre-mRNA splicing of key components involved in the splicing process itself.
- gene expression assays e.g., RT-PCR
- RPE cells can be assessed accordingly.
- RT-PCR retinal organoids derived from either control iPSCs or RP11-iPSCs.
- Expression level and change thereof of one or more key genes involved in cilia formation and/or outer segments of photoreceptors can be assessed accordingly.
- Non-limiting examples of such key genes can include RPGR, RPGRIP1L, CN0T3, intraflagellar transport (IFT122), actin filament organization, centrosome and focal adhesion (SORB SI), and pre-mRNA 3 '-end processing (CPSF1).
- IFT122 intraflagellar transport
- SORB SI centrosome and focal adhesion
- CPSF1 pre-mRNA 3 '-end processing
- Example 8 In vivo assay.
- Gene delivery constructs e.g., AAV constructs as shown in Example 3 in saline or a blank saline can be injected into the eye in vivo to treat or ameliorate an ocular disease (e.g., RP11).
- an ocular disease e.g., RP11
- mice e.g., age 4-5 weeks
- Mice can be anaesthetized and their pupils can be dilated. Mice can then be placed under an operating microscope.
- a guide hole can be made through the sclera behind the iris (e.g., at an angle) towards the optic nerve.
- a micro-injection syringe can then be inserted into the hole and AAV (or blank saline) can be delivered to the vitreous body (for intravitreal injections), or the needle can be used to pierce the temporal retina and inject AAV to the subretinal space (for subretinal injections).
- Effects of the AAV treatment can be assessed in later time points via visualization (e.g., fundoscopy, optical coherence tomography (OCT)), retinal function assay (e.g., electroretinogram (ERG)), immunohistochemistry (IHC), gene expression analysis or sequencing of retinal cells extracted from the mice, etc.
- visualization e.g., fundoscopy, optical coherence tomography (OCT)
- retinal function assay e.g., electroretinogram (ERG)
- IHC immunohistochemistry
- gene expression analysis or sequencing of retinal cells extracted from the mice etc.
- Table 3 Exemplary list of guide RNA scaffold fragment sequences.
- Table 6 Exemplary human genomic region to be targeted.
- Embodiment 1 A system comprising: a heterologous polypeptide comprising an actuator moiety, wherein the actuator moiety is for binding an endogenous target gene encoding PRPF31 in a cell, to increase expression level of the PRPF31 in the cell, wherein:
- the actuator moiety substantially lacks DNA cleavage activity
- the actuator moiety is coupled to a transcriptional activator, optionally wherein:
- the actuator moiety substantially lacks DNA cleavage activity
- the actuator moiety is coupled to the transcriptional activator
- the actuator moiety is fused to the transcriptional activator;
- the cell is a retinal cell
- the cell is a retinal pigment epithelium (RPE) cell; and/or
- the endogenous target gene comprises a non-disease causing allele of the PRPF31;
- the actuator moiety is not capable of binding an additional endogenous target gene encoding a mutant allele of the PRPF31;
- the target endogenous gene further comprises a disease causing allele of the PRPF31;
- the actuator moiety is a deactivated Cas (dCas) protein, further optionally wherein:
- a size of the dCas is less than or equal to about 800 amino acids
- a size of the dCas is less than or equal to about 600 amino acids;
- the dCas protein comprises a polynucleotide sequence exhibiting at least about 90% sequence identity to the polynucleotide sequence selected from Table 1;
- the system further comprises a guide nucleic acid capable of forming a complex with the actuator moiety, wherein the complex binds the endogenous target gene, further optionally wherein the guide nucleic acid comprises a plurality of different guide nucleic acids capable of targeting different polynucleotide sequences of the endogenous target gene; and/or
- the heterologous polypeptide and/or the guide nucleic acid is under control of a tissue-specific promoter;
- the heterologous polypeptide and/or the guide nucleic acid is under control of a photoreceptor-specific promoter;
- heterologous polypeptide and/or the guide nucleic acid is under control of a constitutive promoter
- the increase in the expression level of the PRPF31 in the cell effects enhanced cilium length and/or incidence, as compared to that in a control cell comprising a mutant allele of the PRPF31 in absence of the heterologous polypeptide and/or the guide nucleic acid; and/or
- a portion of the endogenous target gene that is targeted by the actuator moiety and/or the guide nucleic acid is at most about 1,000 nucleobases away from a transcription start site (TSS) of the endogenous target gene; and/or
- the guide nucleic acid comprises (i) a scaffold sequence for forming the complex with the actuator moiety and (ii) a spacer sequence exhibiting specific binding to the endogenous target gene, wherein the spacer sequence exhibits at least about 80%, at least about 90%, at least about 95%, or substantially about 100% sequence identity to the polynucleotide sequence selected from Table 5.
- Embodiment 2 One or more polynucleotides encoding the system of Embodiment 1, optionally wherein the one or more polynucleotides comprise a single polynucleotide encoding at least the heterologous polypeptide and the guide nucleic acid, further optionally wherein:
- the single polynucleotide has a size of less than or equal to about 5 kilobases;
- the single polynucleotide has a size of less than or equal to about 4.7 kilobases;
- the single polynucleotide has a size of less than or equal to about 4.2 kilobases;
- the guide nucleic acid comprises the plurality of different guide nucleic acids.
- Embodiment 3 A method comprising administrating the system or the one or more polynucleotides of Embodiment 1 or Embodiment 2 to a subject in need thereof, optionally wherein: (1) the administrating comprises intravitreal injection or subretinal injection; and/or
- RP retinitis pigmentosa
- the method further comprises, prior to the administrating, determining that the subject has the RP; and/or
- Embodiment 4 A method comprising: increasing expression level of an endogenous target gene encoding PRPF31 in a cell, via binding of a heterologous polypeptide comprising an actuator moiety to bind the endogenous target gene, wherein:
- the actuator moiety substantially lacks DNA cleavage activity
- the actuator moiety is coupled to a transcriptional activator, optionally wherein:
- the actuator moiety substantially lacks DNA cleavage activity
- the actuator moiety is coupled to the transcriptional activator
- the actuator moiety is fused to the transcriptional activator;
- the cell is a retinal cell
- the cell is a retinal pigment epithelium (RPE) cell; and/or
- the endogenous target gene comprises a non-disease causing allele of the PRPF31;
- the actuator moiety is not capable of binding an additional endogenous target gene encoding a mutant allele of the PRPF31;
- the endogenous target gene further comprises a disease causing allele of the PRPF31;
- the actuator moiety is a deactivated Cas (dCas) protein, further optionally wherein:
- a size of the dCas is less than or equal to about 800 amino acids
- a size of the dCas is less than or equal to about 600 amino acids;
- the dCas protein comprises a polynucleotide sequence exhibiting at least about 90% sequence identity to the polynucleotide sequence selected from Table 1.
- the increasing is via action of a complex comprising the actuator moiety and a guide nucleic acid, wherein the complex binds the endogenous target gene, further optionally wherein the guide nucleic acid comprises a plurality of different guide nucleic acids capable of targeting different regions of the endogenous target gene; and/or (12) the heterologous polypeptide is under control of a tissue-specific promoter; and/or
- heterologous polypeptide is under control of a photoreceptor-specific promoter
- heterologous polypeptide and/or the guide nucleic acid is under control of a constitutive promoter
- expression level of the non-disease causing allele of the PRPF31 is enhanced relative to that of a mutant allele of the PRPF31 by at least about 0.1 -fold, at least about 1-fold, at least about 5-fold, at least about 10-fold, at least about 50-fold, at least about 100-fold, or at least about 500-fold; and/or
- a phagocytosis level of the cell is reduced by at least about 0.1- fold, at least about 1-fold, at least about 5-fold, at least about 10-fold, at least about 50-fold, at least about 100-fold, or at least about 500-fold, as compared to that of a control cell comprising a mutant allele of the PRPF31 in absence of the heterologous polypeptide and/or the guide nucleic acid; and/or
- a length of a primary cilia of the cell is longer by at least about 0.1-fold, at least about 1-fold, at least about 5-fold, at least about 10-fold, at least about 50-fold, at least about 100-fold, or at least about 500-fold, as compared to that of a control cell comprising a mutant allele of the PRPF31 in absence of the heterologous polypeptide and/or the guide nucleic acid; and/or
- the expression level of the non-disease causing allele of the PRPF31 is substantially the same as that in a cell of a non-diseased cell;
- a portion of the endogenous target gene that is targeted by the actuator moiety and/or the guide nucleic acid is at most about 1,000 nucleobases away from a transcription start site (TSS) of the endogenous target gene; and/or
- the guide nucleic acid comprises (i) a scaffold sequence for forming the complex with the actuator moiety and (ii) a spacer sequence exhibiting specific binding to the endogenous target gene, wherein the spacer sequence exhibits at least about 80%, at least about 90%, at least about 95%, or substantially about 100% sequence identity to the polynucleotide sequence selected from the group consisting of Table 5.
- compositions of matter disclosed herein in the composition section of the present disclosure may be utilized in the method section including methods of use and production disclosed herein, or vice versa.
Abstract
Described herein are systems and methods for modulating gene expression. Also described herein are systems and methods for treating a disease or a condition by modulating gene expression.
Description
SYSTEMS AND METHODS FOR GENETIC MODULATION TO TREAT OCULAR
DISEASES
CROSS-REFERENCE
[0001] This application claims the benefit of U.S. Provisional Application No. 63/319,088, filed March 11, 2022, which is incorporated herein by reference in its entirety.
BACKGROUND
[0002] Aberrant expression of one or more genes (e.g., endogenous genes) can lead to a disease or a condition in a subject. In some cases, aberrant expression of an enzyme regulator (e.g., an enzyme inhibitor, such as a protease inhibitor) in a cell in the subject can lead to irregular enzymatic activity within the cell, thereby effecting various diseases. The aberrant expression can be due to one or more hereditary genetic mutations in a gene encoding the enzyme regulator. For example, mutation of PRPF31 can lead to retinitis pigmentosa or autosomal congenital blindness in a subj ect.
SUMMARY
[0003] Modifying aberrant expression of a mutant allele (e.g., a disease-causing allele) in a cell may not be sufficient to treat or cure a disease that is manifested by the aberrant expression of the mutant allele. Thus, there remains a substantial need for systems, compositions, and methods to modify the aberrant expression of the mutant allele and introduce expression of a non-disease causing allele (e.g., a wild-type allele) in a cell.
[0004] In an aspect, the present disclosure provides a system comprising: a heterologous polypeptide comprising an actuator moiety, wherein the actuator moiety is for binding an endogenous target gene encoding PRPF31 in a cell, to increase expression level of the PRPF31 in the cell, wherein: (i) the actuator moiety substantially lacks DNA cleavage activity; and/or (ii) the actuator moiety is coupled to a transcriptional activator.
[0005] In another aspect, the present disclosure provides one or more polynucleotides encoding any of the system provided herein.
[0006] In another aspect, the present disclosure provides a method comprising administrating any of the system or the one or more polynucleotides provided herein to a subject in need thereof.
[0007] In another aspect, the present disclosure provides a method comprising: increasing expression level of an endogenous target gene encoding PRPF31 in a cell, via binding of a heterologous polypeptide comprising an actuator moiety to bind the endogenous target gene, wherein: (a) the actuator moiety substantially lacks DNA cleavage activity; and/or (b) the actuator moiety is coupled to a transcriptional activator.
[0008] Additional aspects and advantages of the present disclosure will become readily apparent to those skilled in this art from the following detailed description, wherein only illustrative embodiments of the present disclosure are shown and described. As will be realized, the present disclosure is capable of other and different embodiments, and its several details are capable of modifications in various obvious respects, all without departing from the disclosure.
Accordingly, the drawings and description are to be regarded as illustrative in nature, and not as restrictive.
INCORPORATION BY REFERENCE
[0009] All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. To the extent publications and patents or patent applications incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.
BRIEF DESCRIPTION OF THE DRAWINGS
[0010] The novel features of the invention are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention are utilized, and the accompanying drawings (also “Figure” and “FIG.” herein), of which:
[0011] FIG. 1 illustrates an exemplary construct encoding the dCas and the actuator moiety (effector). Legend: Promoter: photoreceptor specific or ubiquitous promoter; dCas: small dead Cas molecule such as dCasMini or equivalent effector: effector to activate expression such as VPR or equivalent thereof; Pr: Promoter for gRNA such as Hl or U6 or equivalent thereof; gRNA: gRNA targeting the endogenous target gene comprising PRPF31.
[0012] FIG. 2 illustrates a schematic for treating retinitis pigmentosa 11 (RP11) with the system described herein. AAV can be engineered to deliver an exemplary construct via subretinal or intravitreal injection to a subject in need thereof, where the expression of the exemplary construct can activate expression of PRPF31 encoded by the heterologous CDS of the construct.
[0013] FIG. 3 illustrates exemplary genomic loci NM_015629 (ENST00000419967.5) and (ENST00000321030.9) transcripts that can be targeted by the gRNA of the system and the method described herein.
[0014] FIGs. 4A-4F schematically illustrate example vectors encoding the system of the present disclosure.
[0015] FIG. 5A schematically illustrates an example experimental procedure to screen for target endogenous polynucleotide sequences to modulate expression ofPRPF31, and FIG. 5B shows change in endogenous PRPF31 by different guide nucleic acid molecules.
[0016] FIG. 6A shows a fluorescent image of normal retinal pigment epithelium (RPE) cells, and FIG. 6B shows a fluorescent image of diseased RPE cells.
DETAILED DESCRIPTION
[0017] While various embodiments of the invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions may occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed.
[0018] Whenever the term “at least,” “greater than,” or “greater than or equal to” precedes the first numerical value in a series of two or more numerical values, the term “at least,” “greater than” or “greater than or equal to” applies to each of the numerical values in that series of numerical values. For example, greater than or equal to 1, 2, or 3 is equivalent to greater than or equal to 1, greater than or equal to 2, or greater than or equal to 3.
[0019] Whenever the term “no more than,” “less than,” or “less than or equal to” precedes the first numerical value in a series of two or more numerical values, the term “no more than,” “less than,” or “less than or equal to” applies to each of the numerical values in that series of numerical values. For example, less than or equal to 3, 2, or 1 is equivalent to less than or equal to 3, less than or equal to 2, or less than or equal to 1.
[0020] The term “about” or “approximately” generally mean within an acceptable error range for the particular value as determined by one of ordinary skill in the art, which will depend in part on how the value is measured or determined, i.e., the limitations of the measurement system. For example, “about” can mean within 1 or more than 1 standard deviation, per the practice in the art. Alternatively, “about” can mean a range of up to 20%, up to 10%, up to 5%, or up to 1% of a given value. Alternatively, particularly with respect to biological systems or processes, the term can mean within an order of magnitude, preferably within 5-fold, and more preferably within 2- fold, of a value. Where particular values are described in the application and claims, unless otherwise stated, the term “about” meaning within an acceptable error range for the particular value should be assumed.
[0021] The use of the alternative (e.g., “or”) should be understood to mean either one, both, or
any combination thereof of the alternatives. The term “and/or” should be understood to mean either one, or both of the alternatives.
[0022] The term “cell” generally refers to a biological cell. A cell can be the basic structural, functional and/or biological unit of a living organism. A cell can originate from any organism having one or more cells. Some non-limiting examples include: a prokaryotic cell, eukaryotic cell, a bacterial cell, an archaeal cell, a cell of a single-cell eukaryotic organism, a protozoa cell, a cell from a plant (e.g. cells from plant crops, fruits, vegetables, grains, soy bean, corn, maize, wheat, seeds, tomatoes, rice, cassava, sugarcane, pumpkin, hay, potatoes, cotton, cannabis, tobacco, flowering plants, conifers, gymnosperms, fems, clubmosses, hornworts, liverworts, mosses), an algal cell, (e.g., Botryococcus braunii, Chlamydomonas reinhardtii, Nannochloropsis gaditana, Chlorella pyrenoidosa, Sargassum patens C. Agardh, and the like), seaweeds (e.g. kelp), a fungal cell (e.g., a yeast cell, a cell from a mushroom), an animal cell, a cell from an invertebrate animal (e.g. fruit fly, cnidarian, echinoderm, nematode, etc.), a cell from a vertebrate animal (e.g., fish, amphibian, reptile, bird, mammal), a cell from a mammal (e.g., a pig, a cow, a goat, a sheep, a rodent, a rat, a mouse, a non-human primate, a human, etc.), and etcetera. Sometimes a cell is not originating from a natural organism (e.g. a cell can be a synthetically made, sometimes termed an artificial cell).
[0023] The term “nucleotide,” as used herein, generally refers to a base-sugar-phosphate combination. A nucleotide can comprise a synthetic nucleotide. A nucleotide can comprise a synthetic nucleotide analog. Nucleotides can be monomeric units of a nucleic acid sequence (e.g. deoxyribonucleic acid (DNA) and ribonucleic acid (RNA)). The term nucleotide can include ribonucleoside triphosphates adenosine triphosphate (ATP), uridine triphosphate (UTP), cytosine triphosphate (CTP), guanosine triphosphate (GTP) and deoxyribonucleoside triphosphates such as dATP, dCTP, diTP, dUTP, dGTP, dTTP, or derivatives thereof. Such derivatives can include, for example, [aS]dATP, 7-deaza-dGTP and 7-deaza-dATP, and nucleotide derivatives that confer nuclease resistance on the nucleic acid molecule containing them. The term nucleotide as used herein can refer to dideoxyribonucleoside triphosphates (ddNTPs) and their derivatives. Illustrative examples of dideoxyribonucleoside triphosphates can include, but are not limited to, ddATP, ddCTP, ddGTP, ddITP, and ddTTP. A nucleotide may be unlabeled or detectably labeled by well-known techniques. Labeling can also be carried out with quantum dots. Detectable labels can include, for example, radioactive isotopes, fluorescent labels, chemiluminescent labels, bioluminescent labels and enzyme labels. Fluorescent labels of nucleotides may include but are not limited fluorescein, 5-carboxyfluorescein (FAM), 2'7'-dimethoxy-4'5-dichloro-6- carboxyfluorescein (JOE), rhodamine, 6-carboxyrhodamine (R6G), N,N,N',N'-tetramethyl-6-
carboxyrhodamine (TAMRA), 6-carboxy-X-rhodamine (ROX), 4-(4 'dimethylaminophenylazo) benzoic acid (DABCYL), Cascade Blue, Oregon Green, Texas Red, Cyanine and 5-(2'- aminoethyl)aminonaphthalene-l -sulfonic acid (EDANS). Specific examples of fluorescently labeled nucleotides can include [R6G]dUTP, [TAMRA]dUTP, [R110]dCTP, [R6G] dCTP, [TAMRA] dCTP, [JOE] ddATP, [R6G] ddATP, [FAM] ddCTP, [R110]ddCTP, [TAMRA]ddGTP, [ROX]ddTTP, [dR6G]ddATP, [dRl 10]ddCTP, [dTAMRA]ddGTP, and [dROX]ddTTP available from Perkin Elmer, Foster City, Calif. FluoroLink DeoxyNucleotides, FluoroLink Cy3-dCTP, FluoroLink Cy5-dCTP, FluoroLink Fluor X-dCTP, FluoroLink Cy3- dUTP, and FluoroLink Cy5-dUTP available from Amersham, Arlington Heights, Ill.; Fluorescein- 15-dATP, Fluorescein- 12-dUTP, Tetramethyl-rodamine-6-dUTP, IR770-9-dATP, Fluorescein- 12-ddUTP, Fluorescein- 12-UTP, and Fluorescein- 15 -2 '-dATP available from Boehringer Mannheim, Indianapolis, Ind.; and Chromosome Labeled Nucleotides, BODIPY-FL- 14-UTP, BODIPY-FL-4-UTP, BODIPY-TMR-14-UTP, BODIPY-TMR-14-dUTP, BODIPY- TR-14-UTP, BODIPY-TR-14-dUTP, Cascade Blue-7-UTP, Cascade Blue-7-dUTP, fluorescein- 12-UTP, fluorescein- 12-dUTP, Oregon Green 488-5-dUTP, Rhodamine Green-5-UTP, Rhodamine Green-5-dUTP, tetramethylrhodamine-6-UTP, tetramethylrhodamine-6-dUTP, Texas Red-5-UTP, Texas Red-5-dUTP, and Texas Red-12-dUTP available from Molecular Probes, Eugene, Oreg. Nucleotides can also be labeled or marked by chemical modification. A chemically-modified single nucleotide can be biotin-dNTP. Some non-limiting examples of biotinylated dNTPs can include, biotin-dATP (e.g., bio-N6-ddATP, biotin- 14-dATP), biotin- dCTP (e.g., biotin- 11 -dCTP, biotin-14-dCTP), and biotin-dUTP (e.g. biotin- 11-dUTP, biotin-16- dUTP, biotin-20-dUTP).
[0024] The term “polynucleotide,” “oligonucleotide,” or “nucleic acid,” as used interchangeably herein, generally refers to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof, either in single-, double-, or multistranded form. A polynucleotide can be exogenous or endogenous to a cell. A polynucleotide can exist in a cell-free environment. A polynucleotide can be a gene or fragment thereof. A polynucleotide can be DNA. A polynucleotide can be RNA. A polynucleotide can have any three dimensional structure, and can perform any function, known or unknown. A polynucleotide can comprise one or more analogs (e.g. altered backbone, sugar, or nucleobase). If present, modifications to the nucleotide structure can be imparted before or after assembly of the polymer. Some non-limiting examples of analogs include: 5-bromouracil, peptide nucleic acid, xeno nucleic acid, morpholinos, locked nucleic acids, glycol nucleic acids, threose nucleic acids, dideoxynucleotides, cordycepin, 7-deaza-GTP, florophores (e g. rhodamine or flurescein linked
to the sugar), thiol containing nucleotides, biotin linked nucleotides, fluorescent base analogs, CpG islands, methyl-7-guanosine, methylated nucleotides, inosine, thiouridine, pseudourdine, dihydrouridine, queuosine, and wyosine. Non-limiting examples of polynucleotides include coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA (tRNA), ribosomal RNA (rRNA), short interfering RNA (siRNA), short-hairpin RNA (shRNA), micro-RNA (miRNA), ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, cell-free polynucleotides including cell-free DNA (cfDNA) and cell-free RNA (cfRNA), nucleic acid probes, and primers. The sequence of nucleotides can be interrupted by non-nucleotide components.
[0025] The term “sequence identity” generally refers to an exact nucleotide-to-nucleotide or amino acid-to-amino acid correspondence of two polynucleotides or polypeptide sequences, respectively. Typically, techniques for determining sequence identity include determining the nucleotide sequence of a polynucleotide and/or determining the amino acid sequence encoded thereby, and comparing these sequences to a second nucleotide or amino acid sequence. Two or more sequences (polynucleotide or amino acid) can be compared by determining their “percent identity .” The percent identity of two sequences, whether nucleic acid or amino acid sequences, is the number of exact matches between two aligned sequences divided by the length of the longer sequence and multiplied by 100. Percent identity may also be determined, for example, by comparing sequence information using the advanced BLAST computer program, including version 2.2.9, available from the National Institutes of Health. The BLAST program is based on the alignment method of Karlin and Altschul, Proc. Natl. Acad. Sci. USA, 87:2264-2268 (1990) and as discussed in Altschul, et al., J. Mol. Biol., 215:403-410 (1990); Karlin And Altschul, Proc. Natl. Acad. Sci. USA, 90:5873-5877 (1993); and Altschul et al., Nucleic Acids Res., 25:3389- 3402 (1997). The program may be used to determine percent identity over the entire length of the proteins being compared. Default parameters are provided to optimize searches with short query sequences in, for example, with the blastp program. The program also allows use of an SEG filter to mask-off segments of the query sequences as determined by the SEG program of Wootton and Federhen, Computers and Chemistry 17: 149-163 (1993). Ranges of desired degrees of sequence identity are approximately 50% to 100% and integer values therebetween. In general, this disclosure encompasses sequences with at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 98% sequence identity with any sequence provided herein.
[0026] The term “gene” generally refers to a nucleic acid (e g., DNA such as genomic DNA and
cDNA) and its corresponding nucleotide sequence that is involved in encoding an RNA transcript. The term as used herein with reference to genomic DNA includes intervening, noncoding regions as well as regulatory regions and can include 5' and 3' ends. In some uses, the term encompasses the transcribed sequences, including 5' and 3' untranslated regions (5'-UTR and 3'-UTR), exons and introns. In some genes, the transcribed region will contain “open reading frames” that encode polypeptides. In some uses of the term, a “gene” comprises only the coding sequences (e g., an “open reading frame” or “coding region”) necessary for encoding a polypeptide. In some cases, genes do not encode a polypeptide, for example, ribosomal RNA genes (rRNA) and transfer RNA (tRNA) genes. In some cases, the term “gene” includes not only the transcribed sequences, but in addition, also includes non-transcribed regions including upstream and downstream regulatory regions, enhancers and promoters. For example, a gene can refer to a portion of the gene that is near or adjacent to a transcription start site (TSS) of the gene. The gene (e.g., that is targeted as disclosed herein) can be at least or up to about 2,000 nucleobases, at least or up to about 1,800 nucleobases, at least or up to about 1,600 nucleobases, at least or up to about 1,500 nucleobases, at least or up to about 1,400 nucleobases, at least or up to about 1,200 nucleobases, at least or up to about 1,000 nucleobases, at least or up to about 900 nucleobases, at least or up to about 800 nucleobases, at least or up to about 700 nucleobases, at least or up to about 600 nucleobases, at least or up to about 500 nucleobases, at least or up to about 400 nucleobases, at least or up to about 300 nucleobases, at least or up to about 200 nucleobases, at least or up to about 100 nucleobases, or at least or up to about 50 nucleobases away from the TSS of the gene.
[0027] A gene can refer to an “endogenous gene” or a native gene in its natural location in the genome of an organism. A gene can refer to an “exogenous gene” or a non-native gene. A nonnative gene can refer to a gene not normally found in the host organism but which is introduced into the host organism by gene transfer. A non-native gene can also refer to a gene not in its natural location in the genome of an organism. A non-native gene can also refer to a naturally occurring nucleic acid or polypeptide sequence that comprises mutations, insertions and/or deletions (e.g., non-native sequence).
[0028] The term “expression” generally refers to one or more processes by which a polynucleotide is transcribed from a DNA template (such as into an mRNA or other RNA transcript) and/or the process by which a transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins. Transcripts and encoded polypeptides can be collectively referred to as “gene product.” If the polynucleotide is derived from genomic DNA, expression can include splicing of the mRNA in a eukaryotic cell. “Up-regulated,” with reference to
expression, generally refers to an increased expression level of a polynucleotide (e.g., RNA such as mRNA) and/or polypeptide sequence relative to its expression level in a wild-type state while “down-regulated” generally refers to a decreased expression level of a polynucleotide (e.g., RNA such as mRNA) and/or polypeptide sequence relative to its expression in a wild-type state. Expression of a transfected gene can occur transiently or stably in a cell. During “transient expression” the transfected gene is not transferred to the daughter cell during cell division. Since its expression is restricted to the transfected cell, expression of the gene is lost over time. In contrast, stable expression of a transfected gene can occur when the gene is co-transfected with another gene that confers a selection advantage to the transfected cell. Such a selection advantage may be a resistance towards a certain toxin that is presented to the cell.
[0029] The term “expression profile” generally refers to quantitative (e.g., abundance) and qualitative expression of one or more genes in a sample (e.g., a cell). The one or more genes can be expressed and ascertained in the form of a nucleic acid molecule (e.g., an mRNA or other RNA transcript). Alternatively or in addition to, the one or more genes can be expressed and ascertained in the form of a polypeptide (e.g., a protein measured via Western blot). An expression profile of a gene may be defined as a shape of an expression level of the gene over a time period (e.g., at least or up to about 1 hour, at least or up to about 2 hours, at least or up to about 3 hours, at least or up to about 4 hours, at least or up to about 5 hours, at least or up to about 6 hours, at least or up to about 7 hours, at least or up to about 8 hours, at least or up to about 9 hours, at least or up to about 10 hours, at least or up to about 11 hours, at least or up to about 12 hours, at least or up to about 16 hours, at least or up to about 18 hours, at least or up to about 24 hours, at least or up to about 36 hours, at least or up to about 48 hours, at least up to about 3 days, at least up to about 4 days, at least up to about 5 days, at least up to about 6 days, at least up to about 7 days, at least up to about 8 days, at least up to about 9 days, at least up to about 10 days, at least up to about 11 days, at least up to about 12 days, at least up to about 13 days, at least up to about 14 days, etc.). Alternatively, an expression profile of a gene may be defined as an expression level of the gene at a time point of interest (e.g., the expression level of the gene measured at least or up to about 1 hour, at least or up to about 2 hours, at least or up to about 3 hours, at least or up to about 4 hours, at least or up to about 5 hours, at least or up to about 6 hours, at least or up to about 7 hours, at least or up to about 8 hours, at least or up to about 9 hours, at least or up to about 10 hours, at least or up to about 11 hours, at least or up to about 12 hours, at least or up to about 16 hours, at least or up to about 18 hours, at least or up to about 24 hours, at least or up to about 36 hours, at least or up to about 48 hours, at least up to about 3 days, at least up to about 4 days, at least up to about 5 days, at least up to about 6 days, at
least up to about 7 days, at least up to about 8 days, at least up to about 9 days, at least up to about 10 days, at least up to about 11 days, at least up to about 12 days, at least up to about 13 days, or at least up to about 14 days after treating a cell to induce such expression level.) [0030] The term “peptide,” “polypeptide,” or “protein,” as used interchangeably herein, generally refers to a polymer of at least two amino acid residues joined by peptide bond(s). This term does not connote a specific length of polymer, nor is it intended to imply or distinguish whether the peptide is produced using recombinant techniques, chemical or enzymatic synthesis, or is naturally occurring. The terms apply to naturally occurring amino acid polymers as well as amino acid polymers comprising at least one modified amino acid. In some cases, the polymer can be interrupted by non-amino acids. The terms include amino acid chains of any length, including full length proteins, and proteins with or without secondary and/or tertiary structure (e.g., domains). The terms also encompass an amino acid polymer that has been modified, for example, by disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, oxidation, and any other manipulation such as conjugation with a labeling component. The terms “amino acid” and “amino acids,” as used herein, generally refer to natural and non-natural amino acids, including, but not limited to, modified amino acids and amino acid analogues. Modified amino acids can include natural amino acids and non-natural amino acids, which have been chemically modified to include a group or a chemical moiety not naturally present on the amino acid. Amino acid analogues can refer to amino acid derivatives. The term “amino acid” includes both D- amino acids and L-amino acids.
[0031] The term “derivative,” “variant,” or “fragment,” as used herein with reference to a polypeptide, generally refers to a polypeptide related to a wild type polypeptide, for example either by amino acid sequence, structure (e.g., secondary and/or tertiary), activity (e g., enzymatic activity) and/or function. Derivatives, variants and fragments of a polypeptide can comprise one or more amino acid variations (e.g., mutations, insertions, and deletions), truncations, modifications, or combinations thereof compared to a wild type polypeptide.
[0032] The term “engineered,” “chimeric,” or “recombinant,” as used herein with respect to a polypeptide molecule (e.g., a protein), generally refers to a polypeptide molecule having a heterologous amino acid sequence or an altered amino acid sequence as a result of the application of genetic engineering techniques to nucleic acids which encode the polypeptide molecule, as well as cells or organisms which express the polypeptide molecule. The term “engineered” or “recombinant,” as used herein with respect to a polynucleotide molecule (e.g., a DNA or RNA molecule), generally refers to a polynucleotide molecule having a heterologous nucleic acid sequence or an altered nucleic acid sequence as a result of the application of genetic engineering
techniques. Genetic engineering techniques include, but are not limited to, PCR and DNA cloning technologies; transfection, transformation and other gene transfer technologies, homologous recombination; site-directed mutagenesis; and gene fusion. In some cases, an engineered or recombinant polynucleotide (e.g., a genomic DNA sequence) can be modified or altered by a gene editing moiety.
[0033] The terms “engineered” and “modified” are used interchangeably herein. The terms “engineering” and “modifying” are used interchangeably herein. The terms “engineered cell” or “modified cell” are used interchangeably herein. The terms “engineered characteristic” and “modified characteristic” are used interchangeably herein.
[0034] The term “enhanced expression,” “increased expression,” or “upregulated expression” generally refers to production of a moiety of interest (e.g., a polynucleotide or a polypeptide) to a level that is above a normal level of expression of the moiety of interest in a host strain (e.g., a host cell). The normal level of expression can be substantially zero (or null) or higher than zero. The moiety of interest can comprise an endogenous gene or polypeptide construct of the host strain. The moiety of interest can comprise a heterologous gene or polypeptide construct that is introduced to or into the host strain. For example, a heterologous gene encoding a polypeptide of interest can be knocked-in (KI) to a genome of the host strain for enhanced expression of the polypeptide of interest in the host strain.
[0035] The term “enhanced activity,” “increased activity,” or “upregulated activity” generally refers to activity of a moiety of interest (e.g., a polynucleotide or a polypeptide) that is modified to a level that is above a normal level of activity of the moiety of interest in a host strain (e.g., a host cell). The normal level of activity can be substantially zero (or null) or higher than zero. The moiety of interest can comprise a polypeptide construct of the host strain. The moiety of interest can comprise a heterologous polypeptide construct that is introduced to or into the host strain. For example, a heterologous gene encoding a polypeptide of interest can be knocked-in (KI) to a genome of the host strain for enhanced activity of the polypeptide of interest in the host strain. [0036] The term “reduced expression,” “decreased expression,” or “downregulated expression” generally refers to a production of a moiety of interest (e g., a polynucleotide or a polypeptide) to a level that is below a normal level of expression of the moiety of interest in a host strain (e.g., a host cell). The normal level of expression is higher than zero. The moiety of interest can comprise an endogenous gene or polypeptide construct of the host strain. In some cases, the moiety of interest can be knocked-out or knocked-down in the host strain. In some examples, reduced expression of the moiety of interest can include a complete inhibition of such expression in the host strain.
[0037] The term “reduced activity,” “decreased activity,” or “downregulated activity” generally refers to activity of a moiety of interest (e g., a polynucleotide or a polypeptide) that is modified to a level that is below a normal level of activity of the moiety of interest in a host strain (e.g., a host cell). The normal level of activity is higher than zero. The moiety of interest can comprise an endogenous gene or polypeptide construct of the host strain. In some cases, the moiety of interest can be knocked-out or knocked-down in the host strain. In some examples, reduced activity of the moiety of interest can include a complete inhibition of such activity in the host strain.
[0038] The term “subject,” “individual,” or “patient,” as used interchangeably herein, generally refers to a vertebrate, preferably a mammal such as a human. Mammals include, but are not limited to, murines, simians, humans, farm animals, sport animals, and pets. Tissues, cells and their progeny of a biological entity obtained in vivo or cultured in vitro are also encompassed. [0039] The term “treatment” or “treating” generally refers to an approach for obtaining beneficial or desired results including but not limited to a therapeutic benefit and/or a prophylactic benefit. For example, a treatment can comprise administering a system or cell population disclosed herein. By therapeutic benefit is meant any therapeutically relevant improvement in or effect on one or more diseases, conditions, or symptoms under treatment. For prophylactic benefit, a composition can be administered to a subject at risk of developing a particular disease, condition, or symptom, or to a subject reporting one or more of the physiological symptoms of a disease, even though the disease, condition, or symptom may not have yet been manifested.
[0040] The term “effective amount” or “therapeutically effective amount” generally refers to the quantity of a composition, for example a composition comprising heterologous polypeptides, heterologous polynucleotides, and/or modified cells (e.g., modified stem cells), that is sufficient to result in a desired activity upon administration to a subject in need thereof. Within the context of the present disclosure, the term “therapeutically effective” generally refers to that quantity of a composition that is sufficient to delay the manifestation, arrest the progression, relieve or alleviate at least one symptom of a disorder treated by the methods of the present disclosure.
Overview
[0041] Gene expression underpins various physiological and pathological effects in cells and tissues, contributing to many diseases and conditions, thus agents that modulate expression of specific genes in a desirable way could have therapeutic benefit.
[0042] Developing agents that elicit robust, persistent, and/or reversible changes in gene expression has proven challenging, however, as many candidate therapeutics achieve only modest or short lived effects, or conversely result in off-target effects. Additionally, many current
approaches to gene editing and genome engineering can result in off-target effects that can be associated with undesirable toxicity profiles, and in some cases, undesirable effects can be permanent. There is thus a need for novel strategies to regulate gene expression that allow robust, persistent, and/or reversible modulation of target gene expression and activity, for example, expression of genes that impact human disease.
[0043] For instance, described herein are systems and methods for increasing expression of a target gene in a cell. In some embodiments, the target gene is encoded from a heterologous polynucleotide described herein. In some embodiments, the target gene encoded by the heterologous polynucleotide is a non-disease causing allele.
Systems, compositions, and methods thereof
[0044] In an aspect, the present disclosure provides a system comprising: a heterologous polypeptide comprising an actuator moiety, wherein the actuator moiety is for binding an endogenous target gene encoding PRPF31 in a cell, to increase expression level of the PRPF31 in the cell, wherein: the actuator moiety substantially lacks DNA cleavage activity; and/or the actuator moiety is coupled to a transcriptional activator. In some embodiments, described herein is one or more polynucleotides encoding a system described herein. Also described herein, in some aspects, is a method comprising administrating the system described herein to a subject in need thereof. In some aspects, described herein is a method comprising: increasing expression level of an endogenous target gene encoding PRPF31 in a cell, via binding of a heterologous polypeptide comprising an actuator moiety to bind the endogenous target gene, wherein: the actuator moiety substantially lacks DNA cleavage activity; and/or the actuator moiety is coupled to a transcriptional activator.
[0045] In some embodiments, the systems, compositions, or methods described herein increase expression ofPRPF31 in a cell. In some embodiments, the cell is an eye cell. In some embodiments, the cell is a retinal cell. In some embodiments, the cell is a retinal pigment epithelium (RPE) cell. In some embodiments, the PRPF31 expression being increased by the systems, compositions, or methods described herein is encoded from a non-disease allele of PRPF31. In some embodiments, the PRPF31 expression being increased by the systems, compositions, or methods described herein is encoded from a non-disease allele of PRPF31, where the non-disease allele of PRPF31 is encoded by a heterologous polynucleotide described herein. In some embodiments, the systems, compositions, or methods described herein increase expression of PRPF31 in a cell by contacting the cell with an actuator moiety described herein. In some embodiments, the actuator moiety is not capable of binding an additional target endogenous gene encoding a mutant allele of the PRPF31.
[0046] In some embodiments, described herein is a system for modulating a gene expression of an endogenous target gene described herein In some embodiments, the system comprises a heterologous polypeptide comprising an actuator moiety, wherein the actuator moiety is for binding an endogenous target gene encoding a target protein in a cell, to decrease expression level of the target protein, and wherein the actuator moiety substantially lacks DNA cleavage activity; and a heterologous polynucleotide encoding a non-disease causing variant of the endogenous target gene. In some embodiments, the endogenous target gene is a non-disease causing variant. In some embodiments, the non-disease causing variant is a wild type variant. In some embodiments, the endogenous target gene is a disease causing variant. In some embodiments, the system comprises the heterologous polynucleotide not integrated into the endogenous target gene. In some embodiments, the system comprises the heterologous polypeptide that is under the control of a tissue-specific promoter. For example, the tissuespecific promoter can be a rod cell specific promoter, a cone cell specific promoter, a retina cell specific promoter, a photoreceptor specific promoter, or a combination thereof. In some embodiments, the tissue-specific promoter is a photoreceptor specific promoter. In some embodiments, the tissue-specific promoter is a PRPF31 promoter. Non-limiting example pf tissue-specific promoter can include MOPS promoter, GRK1 promoter, IRBP promoter, PR2.1 promoter, IRBP/GNAT2 promoter, VMD2 promoter, VEcad/VEcadherin promoter, or a combination thereof.
[0047] In some embodiments, the system comprises the heterologous polypeptide that is under the control of a constitutive promoter. Non-limiting example of constitutive promoter can include CMV promoter, EFla promoter, CAG promoter, PGK promoter, TRE promoter, U6 promoter, or UAS promoter. For example, the constitutive promoter can be a Pol III promoter (e.g., 7SK, U6, Hl, etc.). In another example, the constitutive promoter can be a Pol II promoter (e.g., CMV, RSV, etc.). In some embodiments, the actuator moiety comprises a nuclease such as an endonuclease (e.g., a heterologous endonuclease). In some embodiments, the nuclease can be a deactivated nuclease such as a deactivated endonuclease, where the deactivated endonuclease does not cleave nucleic acid.
[0048] In some embodiments, the system comprises a guide nucleic acid. In some embodiments, a guide nucleic acid capable of forming a complex with the actuator moiety, wherein the complex binds the endogenous target gene. In some embodiments, the guide nucleic acid comprises a plurality of different guide nucleic acids capable of targeting different regions of the endogenous target gene.
[0049] In some embodiments, the system comprises an actuator moiety or a heterologous polynucleotide encoding the actuator moiety. In some embodiments, the actuator moiety is coupled to a transcriptional repressor. In some embodiments, the actuator moiety is fused to the transcriptional repressor.
[0050] In some embodiments, the system modulates a gene expression of an endogenous target gene in a cell. In some embodiments, the cell is an eye cell such as a rod cell, a cone cell, or a retina cell. In some embodiments, the cell is a photoreceptor cell, a bipolar cell, a retinal ganglion cell, a horizontal cell, or an amacrine cells. In some embodiments, the cell is a cell of pigmented layer. In some embodiments, the cell is a cell of layer of rods and cones. In some embodiments, the cell is a cell of membrana limitans externa. In some embodiments, the cell is a cell of outer nuclear layer. In some embodiments, the cell is a cell of outer plexiform layer. In some embodiments, the cell is a cell of inner nuclear layer. In some embodiments, the cell is a cell of inner plexiform layer. In some embodiments, the cell is a cell of ganglionic layer. In some embodiments, the cell is a cell of stratum opticum. In some embodiments, the cell is a cell of membrana limitans interna. In some embodiments, the cell a retinal pigment epithelium (RPE) cell.
[0051] In some embodiments, described herein is one or more polynucleotides encoding the system described herein. In some embodiments, the one or more polynucleotides comprise a single polynucleotide encoding at least the heterologous polypeptide and the heterologous polynucleotide. In some embodiments, the single polynucleotide further encodes the guide nucleic acid. In some embodiments, the single polynucleotide has a size of less than or equal to about 5 kilobases (kb). In some embodiments, the single polynucleotide has a size of less than or equal to about 4.7 kilobases. In some embodiments, the single polynucleotide has a size of less than or equal to about 0.1 kb to about 10 kb. In some embodiments, the single polynucleotide has a size of less than or equal to about 10 kb to about 9 kb, about 10 kb to about 8 kb, about 10 kb to about 7 kb, about 10 kb to about 6 kb, about 10 kb to about 5 kb, about 10 kb to about 4.7 kb, about 10 kb to about 4 kb, about 10 kb to about 3 kb, about 10 kb to about 2 kb, about 10 kb to about 1 kb, about 10 kb to about 0.1 kb, about 9 kb to about 8 kb, about 9 kb to about 7 kb, about 9 kb to about 6 kb, about 9 kb to about 5 kb, about 9 kb to about 4.7 kb, about 9 kb to about 4 kb, about 9 kb to about 3 kb, about 9 kb to about 2 kb, about 9 kb to about 1 kb, about 9 kb to about 0.1 kb, about 8 kb to about 7 kb, about 8 kb to about 6 kb, about 8 kb to about 5 kb, about 8 kb to about 4.7 kb, about 8 kb to about 4 kb, about 8 kb to about 3 kb, about 8 kb to about 2 kb, about 8 kb to about 1 kb, about 8 kb to about 0.1 kb, about 7 kb to about 6 kb, about 7 kb to about 5 kb, about 7 kb to about 4.7 kb, about 7 kb to about 4 kb, about 7 kb to about 3 kb, about 7 kb to
about 2 kb, about 7 kb to about 1 kb, about 7 kb to about 0.1 kb, about 6 kb to about 5 kb, about 6 kb to about 4.7 kb, about 6 kb to about 4 kb, about 6 kb to about 3 kb, about 6 kb to about 2 kb, about 6 kb to about 1 kb, about 6 kb to about 0.1 kb, about 5 kb to about 4.7 kb, about 5 kb to about 4 kb, about 5 kb to about 3 kb, about 5 kb to about 2 kb, about 5 kb to about 1 kb, about 5 kb to about 0.1 kb, about 4.7 kb to about 4 kb, about 4.7 kb to about 3 kb, about 4.7 kb to about 2 kb, about 4.7 kb to about 1 kb, about 4.7 kb to about 0.1 kb, about 4 kb to about 3 kb, about 4 kb to about 2 kb, about 4 kb to about 1 kb, about 4 kb to about 0.1 kb, about 3 kb to about 2 kb, about 3 kb to about 1 kb, about 3 kb to about 0.1 kb, about 2 kb to about 1 kb, about 2 kb to about 0.1 kb, or about 1 kb to about 0.1 kb. In some embodiments, the single polynucleotide has a size of less than or equal to about 10 kb, about 9 kb, about 8 kb, about 7 kb, about 6 kb, about 5 kb, about 4.7 kb, about 4 kb, about 3 kb, about 2 kb, about 1 kb, or about 0.1 kb. In some embodiments, the single polynucleotide has a size of less than or equal to at least about 10 kb, about 9 kb, about 8 kb, about 7 kb, about 6 kb, about 5 kb, about 4.7 kb, about 4 kb, about 3 kb, about 2 kb, or about 1 kb. In some embodiments, the single polynucleotide has a size of less than or equal to at most about 9 kb, about 8 kb, about 7 kb, about 6 kb, about 5 kb, about 4.7 kb, about 4 kb, about 3 kb, about 2 kb, about 1 kb, or about 0.1 kb.
[0052] In some embodiments, the system comprises a guide nucleic acid or one or more polynucleotides encoding a guide nucleic acid, where the guide nucleic acid targets an endogenous target gene described herein. In some embodiments, the guide nucleic acid can be complexed with an actuator moiety described herein. In some embodiments the guide nucleic acid can direct the actuator moiety to the endogenous target gene in the cell.
[0053] In some embodiments, described herein is a composition comprising any component or any combination of components of the system described herein. In some embodiments, the composition comprises at least one of the heterologous polypeptide described herein. In some embodiments, the compositions comprises at least one of the heterologous polynucleotide described herein. In some embodiments, the composition comprises at least one of the heterologous polypeptide described herein and at least one of the heterologous polynucleotide described herein. In some embodiments, the composition can be further formulated into a pharmaceutical composition. For example, the composition can comprise at least one pharmaceutically acceptable carrier.
[0054] Described herein, in some aspects, is a method comprising: increasing expression level of an endogenous target gene encoding a target protein in a cell, via action of a heterologous polypeptide comprising an actuator moiety, wherein the actuator moiety is for binding the endogenous target gene, and wherein the actuator moiety substantially lacks DNA cleavage
activity; and contacting the cell with a heterologous polynucleotide encoding a non-disease causing variant of the endogenous target gene. In some embodiments, the method comprises determining that the subject has certain condition. In some embodiments, the method comprises selecting for the subject to be treated by the method and the system described herein by determining if the subject harbors a mutant allele or a disease-causing allele of the endogenous target gene. In some embodiments, once the subject is determined to harbor the mutant allele or the disease-causing allele of the endogenous target gene, a system described herein, any component of the system described herein, or any combination of the component of the system described herein can be administered to the subject to treat the disease or condition. In some embodiments, the endogenous target gene comprises a disease causing allele of the target protein. In some embodiments, the endogenous target gene comprises a non-disease causing allele of the endogenous target protein. In some embodiments, the non-disease causing allele is a wild type allele. In some embodiments, the endogenous target gene is a member of Pre-mRNA-processing- splicing factor (PRPF) family. In some embodiments, the PRPF comprises PRPF1, PRPF2, PRPF3, PRPF4, PRPF5, PRPF6, PRPF7, PRPF8, PRPF9, PRPF10, PRPF11, PRPF12, PRPF13, PRPF14, PRPF15, PRPF16, PRPF17, PRPF18, PRPF19, PRPF20, PRPF21, PRPF22, PRPF23, PRPF24, PRPF25, PRPF26, PRPF27, PRPF28, PRPF29, PRPF30, PRPF31, a fragment thereof, or a combination thereof . In some embodiments, the PRPF comprises PRPF31. In In some embodiments, the endogenous target gene is PRPF31. In some embodiments, described herein is a method for administering the system descried herein to a subject in need thereof. In some embodiments, the method comprises determining whether the subject has or is suspected of having retinitis pigmentosa 11 (RP11). Without wishing to be bound by theory, even when the expression levels of both the non-disease causing allele and the disease causing allele of the PRPF31 are increased, the increased expression level of the non-disease causing allele of the PRPF31 can be sufficient to treat or ameliorate a condition (e.g., retinitis pigmentosa (RP), such as RP11) of a cell or a subject comprising the cell.
[0055] In some embodiments, the method increases the expression of the endogenous target gene (e.g., non-disease causing allele thereof) encoding the target protein by at least about 0.01 fold to about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid). In some embodiments, the method increases the expression of the endogenous target gene encoding the target protein by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about
1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0 05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.1 fold to about 1 fold, about 0.1 fold to about 5 fold, about 0.1 fold to about 10 fold, about 0.1 fold to about 50 fold, about 0.1 fold to about 100 fold, about 0.1 fold to about 500 fold, about 0.1 fold to about 1,000 fold, about 0.1 fold to about 5,000 fold, about 0.5 fold to about 1 fold, about 0.5 fold to about 5 fold, about 0.5 fold to about 10 fold, about 0.5 fold to about 50 fold, about 0.5 fold to about 100 fold, about 0.5 fold to about 500 fold, about 0.5 fold to about 1,000 fold, about 0.5 fold to about 5,000 fold, about 1 fold to about 5 fold, about 1 fold to about 10 fold, about 1 fold to about 50 fold, about 1 fold to about 100 fold, about 1 fold to about 500 fold, about 1 fold to about 1,000 fold, about 1 fold to about 5,000 fold, about 5 fold to about 10 fold, about 5 fold to about 50 fold, about 5 fold to about 100 fold, about 5 fold to about 500 fold, about 5 fold to about 1,000 fold, about 5 fold to about 5,000 fold, about 10 fold to about 50 fold, about 10 fold to about 100 fold, about 10 fold to about 500 fold, about 10 fold to about 1,000 fold, about 10 fold to about 5,000 fold, about 50 fold to about 100 fold, about 50 fold to about 500 fold, about 50 fold to about 1,000 fold, about 50 fold to about 5,000 fold, about 100 fold to about 500 fold, about 100 fold to about 1,000 fold, about 100 fold to about 5,000 fold, about 500 fold to about 1,000 fold, about 500 fold to about 5,000 fold, or about 1,000 fold to about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid). In some embodiments, the method increases the expression of the endogenous target gene encoding the target protein by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid). In some embodiments, the method increases the expression of the endogenous target gene encoding the target protein by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid). In some embodiments, the method increases the expression of the endogenous target gene encoding the target protein by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid).
[0056] In some embodiments, the method increases the expression of the endogenous target gene encoding the target protein, where the target protein is a PRPF31 (e g , a non-disease causing PRPF31 variant). In some embodiments, the method increases the expression of PRPF31 by at least about 0.01 fold to about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid). In some embodiments, the method increases the expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.1 fold to about 1 fold, about 0.1 fold to about 5 fold, about 0.1 fold to about 10 fold, about 0.1 fold to about 50 fold, about 0.1 fold to about 100 fold, about 0.1 fold to about 500 fold, about 0.1 fold to about 1,000 fold, about 0.1 fold to about 5,000 fold, about 0.5 fold to about 1 fold, about 0.5 fold to about 5 fold, about 0.5 fold to about 10 fold, about 0.5 fold to about 50 fold, about 0.5 fold to about 100 fold, about 0.5 fold to about 500 fold, about 0.5 fold to about 1,000 fold, about 0.5 fold to about 5,000 fold, about 1 fold to about 5 fold, about 1 fold to about 10 fold, about 1 fold to about 50 fold, about 1 fold to about 100 fold, about 1 fold to about 500 fold, about 1 fold to about 1,000 fold, about 1 fold to about 5,000 fold, about 5 fold to about 10 fold, about 5 fold to about 50 fold, about 5 fold to about 100 fold, about 5 fold to about 500 fold, about 5 fold to about 1,000 fold, about 5 fold to about 5,000 fold, about 10 fold to about 50 fold, about 10 fold to about 100 fold, about 10 fold to about 500 fold, about 10 fold to about 1,000 fold, about 10 fold to about 5,000 fold, about 50 fold to about 100 fold, about 50 fold to about 500 fold, about 50 fold to about 1,000 fold, about 50 fold to about 5,000 fold, about 100 fold to about 500 fold, about 100 fold to about 1,000 fold, about 100 fold to about 5,000 fold, about 500 fold to about 1,000 fold, about 500 fold to about 5,000 fold, or about 1,000 fold to about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid). In some embodiments, the method increases the expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid). In some embodiments, the method increases the expression of PRPF31 by at
least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold (e g , as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid). In some embodiments, the method increases the expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold (e.g., as compared to a control cell lacking the heterologous polypeptide and/or the guide nucleic acid).
[0057] In some embodiments, the method increases the expression of PRPF31 (e.g., a nondisease variant of PRPF31) without increasing expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31. In some embodiments, the method, without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 5,000 fold. In some embodiments, the method, without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.1 fold to about 1 fold, about 0.1 fold to about 5 fold, about 0.1 fold to about 10 fold, about 0.1 fold to about 50 fold, about 0.1 fold to about 100 fold, about 0.1 fold to about 500 fold, about 0.1 fold to about 1,000 fold, about 0.1 fold to about 5,000 fold, about 0.5 fold to about 1 fold, about 0.5 fold to about 5 fold, about 0.5 fold to about 10 fold, about 0.5 fold to about 50 fold, about 0.5 fold to about 100 fold, about 0.5 fold to about 500 fold, about 0.5 fold to about 1,000 fold, about 0.5 fold to about 5,000 fold, about 1 fold to about 5 fold, about 1 fold to about 10 fold, about 1 fold to about 50 fold, about 1 fold to about 100 fold, about 1 fold to about 500 fold, about 1 fold to about 1,000 fold, about 1 fold to about 5,000 fold, about 5 fold to about 10 fold, about 5 fold to about 50 fold, about 5 fold to about 100 fold, about 5 fold to about 500 fold, about 5 fold to about 1,000 fold, about 5 fold to about 5,000 fold, about 10 fold to about 50 fold, about 10 fold to about 100 fold, about 10 fold to about 500 fold, about 10 fold to about 1,000 fold, about 10 fold to about
5,000 fold, about 50 fold to about 100 fold, about 50 fold to about 500 fold, about 50 fold to about 1,000 fold, about 50 fold to about 5,000 fold, about 100 fold to about 500 fold, about 100 fold to about 1,000 fold, about 100 fold to about 5,000 fold, about 500 fold to about 1,000 fold, about 500 fold to about 5,000 fold, or about 1,000 fold to about 5,000 fold. In some embodiments, the method, without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold. In some embodiments, the method, without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold. In some embodiments, the method, without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
[0058] In some embodiments, the method increases the expression of PRPF31 (e.g., a nondisease causing PRPF31 variant) without increasing expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19. In some embodiments, the method, without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression ofPRPF31 by at least about 0.01 fold to about 5,000 fold. In some embodiments, the method, without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05
fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.1 fold to about 1 fold, about 0.1 fold to about 5 fold, about 0.1 fold to about 10 fold, about 0.1 fold to about 50 fold, about 0.1 fold to about 100 fold, about 0.1 fold to about 500 fold, about 0.1 fold to about 1,000 fold, about 0.1 fold to about 5,000 fold, about 0.5 fold to about 1 fold, about 0.5 fold to about 5 fold, about 0.5 fold to about 10 fold, about 0.5 fold to about 50 fold, about 0.5 fold to about 100 fold, about 0.5 fold to about 500 fold, about 0.5 fold to about 1,000 fold, about 0.5 fold to about 5,000 fold, about 1 fold to about 5 fold, about 1 fold to about 10 fold, about 1 fold to about 50 fold, about 1 fold to about 100 fold, about 1 fold to about 500 fold, about 1 fold to about 1,000 fold, about 1 fold to about 5,000 fold, about 5 fold to about 10 fold, about 5 fold to about 50 fold, about 5 fold to about 100 fold, about 5 fold to about 500 fold, about 5 fold to about 1,000 fold, about 5 fold to about 5,000 fold, about 10 fold to about 50 fold, about 10 fold to about 100 fold, about 10 fold to about 500 fold, about 10 fold to about 1,000 fold, about 10 fold to about 5,000 fold, about 50 fold to about 100 fold, about 50 fold to about 500 fold, about 50 fold to about 1,000 fold, about 50 fold to about 5,000 fold, about 100 fold to about 500 fold, about 100 fold to about 1,000 fold, about 100 fold to about 5,000 fold, about 500 fold to about 1,000 fold, about 500 fold to about 5,000 fold, or about 1,000 fold to about 5,000 fold. In some embodiments, the method, without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold. In some embodiments, the method, without increasing the expression of a gene neighboring
PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold. In some embodiments, the method, without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression ofPRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
[0059] In some embodiments, the method increases the expression of PRPF31 (e.g., a nondisease causing PRPF31 variant) without increasing expression of TCF3 Fusion Partner (TFPT). In some embodiments, the method, without increasing the expression of TFPT, increases the
expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 5,000 fold. In some embodiments, the method, without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.1 fold to about 1 fold, about 0.1 fold to about 5 fold, about 0.1 fold to about 10 fold, about 0.1 fold to about 50 fold, about 0.1 fold to about 100 fold, about 0.1 fold to about 500 fold, about 0.1 fold to about 1,000 fold, about 0.1 fold to about 5,000 fold, about 0.5 fold to about 1 fold, about 0.5 fold to about 5 fold, about 0.5 fold to about 10 fold, about 0.5 fold to about 50 fold, about 0.5 fold to about 100 fold, about 0.5 fold to about 500 fold, about 0.5 fold to about 1,000 fold, about 0.5 fold to about 5,000 fold, about 1 fold to about 5 fold, about 1 fold to about 10 fold, about 1 fold to about 50 fold, about 1 fold to about 100 fold, about 1 fold to about 500 fold, about 1 fold to about 1,000 fold, about 1 fold to about 5,000 fold, about 5 fold to about 10 fold, about 5 fold to about 50 fold, about 5 fold to about 100 fold, about 5 fold to about 500 fold, about 5 fold to about 1,000 fold, about 5 fold to about 5,000 fold, about 10 fold to about 50 fold, about 10 fold to about 100 fold, about 10 fold to about 500 fold, about 10 fold to about 1,000 fold, about 10 fold to about 5,000 fold, about 50 fold to about 100 fold, about 50 fold to about 500 fold, about 50 fold to about 1,000 fold, about 50 fold to about 5,000 fold, about 100 fold to about 500 fold, about 100 fold to about 1,000 fold, about 100 fold to about 5,000 fold, about 500 fold to about 1,000 fold, about 500 fold to about 5,000 fold, or about 1,000 fold to about 5,000 fold. In some embodiments, the method, without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression ofPRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold. In some embodiments, the method, without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold. In some embodiments, the method, without
increasing the expression of TFPT, increases the expression ofPRPF31 compared to endogenous expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
[0060] In some embodiments, the increase in the expression level of the PRPF31 in the (e.g., non-disease causing PRPF31 variant) cell as provided herein effects enhanced (or increased) cilium length and/or incidence, as compared to that in a control cell comprising a mutant allele of the PRPF31 that is in absence of the heterologous polypeptide and/or the guide nucleic acid. [0061] The increase in the cilium length (e.g., average cilium length) can be at least or up to about 1%, at least or up to about 2%, at least or up to about 5%, at least or up to about 10%, at least or up to about 15%, at least or up to about 20%, at least or up to about 25%, at least or up to about 30%, at least or up to about 40%, at least or up to about 50%, at least or up to about 60%, at least or up to about 70%, at least or up to about 80%, at least or up to about 90%, at least or up to about 100%, at least or up to about 120%, at least or up to about 150%, at least or up to about 200%, at least or up to about 300%, at least or up to about 400%, or at least or up to about 500%, as compared to such control cell. The increase in the cilium length (e.g., average cilium length) can be at least or up to about 0.1-fold, at least or up to about 0.2-fold, at least or up to about 0.5- fold, at least or up to about 1-fold, at least or up to about 2-fold, at least or up to about 3-fold, at least or up to about 4-fold, at least or up to about 5-fold, at least or up to about 6-fold, at least or up to about 7-fold, at least or up to about 8-fold, at least or up to about 9-fold, at least or up to about 10-fold, at least or up to about 15-fold, at least or up to about 20-fold, at least or up to about 30-fold, at least or up to about 40-fold, at least or up to about 50-fold, at least or up to about 60-fold, at least or up to about 70-fold, at least or up to about 80-fold, at least or up to about 90-fold, at least or up to about 100-fold, at least or up to about 150-fold, at least or up to about 200-fold, at least or up to about 300-fold, at least or up to about 400-fold, or at least or up to about 500-fold, as compared to such control cell.
[0062] The increase in the cilium incidence (e.g., average cilium incidence) can be at least or up to about 1%, at least or up to about 2%, at least or up to about 5%, at least or up to about 10%, at least or up to about 15%, at least or up to about 20%, at least or up to about 25%, at least or up to about 30%, at least or up to about 40%, at least or up to about 50%, at least or up to about 60%, at least or up to about 70%, at least or up to about 80%, at least or up to about 90%, at least or up to about 100%, at least or up to about 120%, at least or up to about 150%, at least or up to about 200%, at least or up to about 300%, at least or up to about 400%, or at least or up to about 500%, as compared to such control cell. The increase in the cilium incidence (e.g., average cilium
incidence) can be at least or up to about 0. 1 -fold, at least or up to about 0.2-fold, at least or up to about 0.5-fold, at least or up to about 1-fold, at least or up to about 2-fold, at least or up to about 3-fold, at least or up to about 4-fold, at least or up to about 5-fold, at least or up to about 6-fold, at least or up to about 7-fold, at least or up to about 8-fold, at least or up to about 9-fold, at least or up to about 10-fold, at least or up to about 15-fold, at least or up to about 20-fold, at least or up to about 30-fold, at least or up to about 40-fold, at least or up to about 50-fold, at least or up to about 60-fold, at least or up to about 70-fold, at least or up to about 80-fold, at least or up to about 90-fold, at least or up to about 100-fold, at least or up to about 150-fold, at least or up to about 200-fold, at least or up to about 300-fold, at least or up to about 400-fold, or at least or up to about 500-fold, as compared to such control cell.
[0063] In some embodiments, the increase in the expression level of the PRPF31 (e.g., nondisease causing PRPF31 variant) in the cell as provided herein effects enhanced localization of a ciliary protein to at least a portion of a cilium, as compared to that in a control cell comprising a mutant allele of the PRPF31 that is in absence of the heterologous polypeptide and/or the guide nucleic acid. For example, the ciliary protein can be IFT88 and the at least a portion of the cilium can be a ciliary tip. In another example, the ciliary protein can be a transition zone (TZ) protein (e.g., CC2D2A, RPGRIP1L, etc.), and the at least the portion of the cilium can be a ciliary axoneme.
[0064] In some embodiments, the heterologous polypeptide (e.g., the dCas-transcriptional effector complexed with a guide RNA) can be sufficient to increase the expression level of the PRPF31 in the cell to render the cell functional. Thus, the systems and the methods of the present disclosure may not and need not require a heterologous PRPF31 and/or a heterologous gene encoding thereof to increase the expression level of the PRPF31 in the cell.
[0065] Table 4 provides an exemplary list of gRNA (e.g., Cas9 gRNAs) spacer sequence that can bind to an endogenous target gene described herein (e.g., PRPF31). A spacer sequence of gRNA as described herein can comprise a polynucleotide sequence (e.g., a consecutive polynucleotide sequence) that exhibits at least or up to about 50%, at least or up to about 55%, at least or up to about 60%, at least or up to about 65%, at least or up to about 70%, at least or up to about 75%, at least or up to about 80%, at least or up to about 85%, at least or up to about 90%, at least or up to about 91%, at least or up to about 92%, at least or up to about 93%, at least or up to about 94%, at least or up to about 95%, at least or up to about 96%, at least or up to about 97%, at least or up to about 98%, at least or up to about 99%, or substantially about 100% sequence identity to a polynucleotide sequence selected from Table 4 or a complementary
sequence thereof (e.g., one or more members selected from the group consisting of SEQ ID NOs: 700-785).
[0066] Table 5 provides an additional exemplary list of gRNA (e.g., Cas gRNAs) spacer sequence that can bind to an endogenous target gene described herein (e.g., PRPF31). A spacer sequence of gRNA as described herein can comprise a polynucleotide sequence (e.g., a consecutive polynucleotide sequence) that exhibits at least or up to about 50%, at least or up to about 55%, at least or up to about 60%, at least or up to about 65%, at least or up to about 70%, at least or up to about 75%, at least or up to about 80%, at least or up to about 85%, at least or up to about 90%, at least or up to about 91%, at least or up to about 92%, at least or up to about 93%, at least or up to about 94%, at least or up to about 95%, at least or up to about 96%, at least or up to about 97%, at least or up to about 98%, at least or up to about 99%, or substantially about 100% sequence identity to a polynucleotide sequence selected from Table 5 or a complementary sequence thereof (e.g., one or more members selected from the group consisting of SEQ ID NOs: 786-1040). In some cases, the spacer sequence of the guide nucleic acid can target a positive-sense strand (+) of the endogenous target gene. In some cases, the spacer sequence of the guide nucleic acid can target a negative-sense strand (-) of the endogenous target gene.
[0067] The systems (e.g., the heterologous polypeptide and/or a guide nucleic acid) and methods thereof as provided herein can target (e.g., bind) at least one target polynucleotide sequence (e.g., a consecutive polynucleotide sequence) found in the polynucleotide sequence of one or more members in Table 6. The at least one target polynucleotide sequence can comprise at least or up to about 1, at least or up to about 2, at least or up to about 3, at least or up to about 4, at least or up to about 5, at least or up to about 6, at least or up to about 7, at least or up to about 8, at least or up to about 9, at least or up to about 10, at least or up to about 15, or at least or up to about 20 target polynucleotide sequence(s). The at least one target polynucleotide sequence can have a length of at least or up to about 6 nucleobases, at least or up to about 8 nucleobases, at least or up to about 10 nucleobases, at least or up to about 12 nucleobases, at least or up to about 16 nucleobases, at least or up to about 18 nucleobases, at least or up to about 20 nucleobases, at least or up to about 22 nucleobases, at least or up to about 24 nucleobases, at least or up to about 26 nucleobases, at least or up to about 28 nucleobases, at least or up to about 30 nucleobases, at least or up to about 32 nucleobases, at least or up to about 34 nucleobases, at least or up to about 36 nucleobases, at least or up to about 38 nucleobases, at least or up to about 40 nucleobases, at least or up to about 45 nucleobases, or at least or up to about 50 nucleobases.
[0068] In some cases, at least a portion of a positive-sense strand (+) of the endogenous target gene can be targeted. The at least the portion of the positive-sense strand can comprise a polynucleotide sequence that exhibits at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or substantially about 100% sequence identity to a consecutive polynucleotide sequence found in
SEQ ID NO: 1041 or SEQ ID NO: 1042.
[0069] In some cases, at least a portion of a negative- sense strand (-) of the endogenous target gene can be targeted. The at least the portion of the negative-sense strand can comprise a polynucleotide sequence that exhibits at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or substantially about 100% sequence identity to a consecutive polynucleotide sequence found in
SEQ ID NO: 1043 or SEQ ID NO: 1044.
Heterologous polypeptide comprising an actuator moiety
[0070] In various aspects of the present disclosure, the heterologous polypeptide comprising the actuator moiety, as disclosed herein, can be utilized for binding a target gene, such as an endogenous target gene (e.g., a chromosomal DNA sequence). The actuator moiety can be a nuclease, such as an endonuclease (e.g., a heterologous endonuclease). Suitable nucleases include, but are not limited to, CRISPR-associated (Cas) proteins or Cas nucleases including type I CRISPR-associated (Cas) polypeptides, type II CRISPR-associated (Cas) polypeptides, type III CRISPR-associated (Cas) polypeptides, type IV CRISPR-associated (Cas) polypeptides, type V CRISPR-associated (Cas) polypeptides, and type VI CRISPR-associated (Cas) polypeptides; zinc finger nucleases (ZFN); transcription activator-like effector nucleases (TALEN); meganucleases; RNA-binding proteins (RBP); CRISPR-associated RNA binding proteins; recombinases; flippases; transposases; Argonaute (Ago) proteins (e.g., prokaryotic Argonaute (pAgo), archaeal Argonaute (aAgo), and eukaryotic Argonaute (eAgo)); any derivative thereof; any variant thereof and any fragment thereof.
[0071] In some embodiments, the actuator moiety can comprise a DNA nuclease such as an engineered (e.g., programmable or targetable) DNA nuclease that is nuclease-deficient. In some embodiments, the actuator moiety can comprise a nuclease-null DNA binding protein derived from a DNA nuclease that does not induce transcriptional activation or repression of a target
DNA sequence unless it is present in a complex with one or more heterologous gene effectors of the disclosure. In some embodiments, the actuator moiety can comprise a nuclease-null DNA binding protein derived from a DNA nuclease that can induce transcriptional activation or repression of a target DNA sequence (e.g., which can be altered or augmented by the presence of a heterologous gene effector of the disclosure).
[0072] In some embodiments, the actuator moiety can comprise an RNA nuclease such as an engineered (e.g., programmable or targetable) RNA nuclease. In some embodiments, the actuator moiety can comprise a nuclease-null RNA binding protein derived from an RNA nuclease that does not induce transcriptional activation or repression of a target RNA sequence unless it is present in a complex with one or more heterologous gene effectors of the disclosure. In some embodiments, the actuator moiety can comprise a nuclease-null RNA binding protein derived from a RNA nuclease that can induce transcriptional activation or repression of a target RNA sequence (e g., which can be altered or augmented by the presence of a heterologous gene effector of the disclosure).
[0073] In some embodiments, the actuator moiety can comprise a nucleic acid-guided targeting system. In some embodiments, the actuator moiety can comprise a DNA-guided targeting system. In some embodiments, the actuator moiety can comprise an RNA-guided targeting system. The nucleic acid-guided targeting system can comprise and utilize, for example, a guide nucleic acid sequence that facilitates specific binding of a CRISPR-Cas system (e.g., a nuclease deficient form thereof, such as dCas9 or dCasl4) to a target gene (e.g., target endogenous gene) or target gene regulatory sequence. Binding specificity can be determined by use of a guide nucleic acid, such as a single guide RNA (sgRNA) or a part thereof. In some embodiments, the use of different sgRNAs allows the compositions and methods of the disclosure to be used with (e.g., targeted to) different target genes (e.g., target endogenous genes) or target gene regulatory sequences.
[0074] Prokaryotic CRISPR-Cas (Clustered regularly interspaced short palindromic repeats- CRISPR associated) systems, for example, Class II CRISPR-Cas systems such as Cas9 and Cpfl, can be repurposed as a tool for regulation of gene expression, epigenome editing, and chromatin looping in compositions and methods of the disclosure. Nucl ease-deactivated Cas (dCas) proteins complexed with heterologous gene effectors can allow for regulation of expression of target genes (e.g., target endogenous genes) adjacent to a site bound by the dCas.
[0075] In some embodiments, the actuator moiety can comprise a CRISPR-associated (Cas) protein or a Cas nuclease that functions in a non-naturally occurring CRISPR (Clustered
Regularly Interspaced Short Palindromic Repeats)/Cas (CRISPR-associated) system. In bacteria, this system can provide adaptive immunity against foreign DNA.
[0076] In a wide variety of organisms including diverse mammals, animals, plants, microbes, and yeast, a CRISPR/Cas system (e.g., modified and/or unmodified) can be utilized as a genome engineering tool, or can be modified to direct specific binding of engineered proteins to target loci as disclosed herein. A CRISPR/Cas system can comprise a guide nucleic acid such as a guide RNA (gRNA) complexed with a Cas protein for targeted regulation of gene expression and/or activity or nucleic acid binding. An RNA-guided Cas protein (e g., a Cas nuclease such as a Cas9 nuclease) can specifically bind a target polynucleotide (e.g., DNA) in a sequencedependent manner. The Cas protein, if possessing nuclease activity, can cleave the DNA.
[0077] In some cases, the Cas protein is mutated and/or modified to yield a nuclease deficient protein or a protein with decreased nuclease activity relative to a wild-type Cas protein. A nuclease deficient protein can retain the ability to bind DNA, but may lack or have reduced nucleic acid cleavage activity.
[0078] In some embodiments, the actuator moiety can comprise a Cas protein that forms a complex with a guide nucleic acid, such as a guide RNA or a part thereof. In some embodiments, the actuator moiety can comprise a Cas protein that forms a complex with a single guide nucleic acid, such as a single guide RNA (sgRNA). In some embodiments, the actuator moiety can comprise a RNA-binding protein (RBP) optionally complexed with a guide nucleic acid, such as a guide RNA (e g., sgRNA), which is able to form a complex with a Cas protein. In some embodiments, the actuator moiety can comprise a nuclease-null DNA binding protein derived from a DNA nuclease that can induce transcriptional activation or repression of a target DNA sequence. In some embodiments, the actuator moiety can comprise a nuclease-null RNA binding protein derived from a RNA.
[0079] A guide nucleic acid used in compositions and methods of the disclosure can comprise a spacer sequence that can bind to an endogenous target gene described herein. The spacer sequence can be, for example, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least
30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, at least 39, or at least 40 nucleotides.
[0080] In some embodiments, a spacer sequence of a guide nucleic acid used in compositions and methods of the disclosure is at most at most 10, at most 11, at most 12, at most 13, at most 14, at most 15, at most 16, at most 17, at most 18, at most 19, at most 20, at most 21, at most 22,
at most 23, at most 24, at most 25, at most 26, at most 27, at most 28, at most 29, at most 30, at most 31, at most 32, at most 33, at most 34, at most 35, at most 36, at most 37, at most 38, at most 39, or at most 40 nucleotides.
[0081] In some embodiments, a spacer sequence of a guide nucleic acid used in compositions and methods of the disclosure is between about 8 and about 40 nucleotides, between about 10 and about 40 nucleotides, between about 11 and about 40 nucleotides, between about 12 and about 40 nucleotides, between about 13 and about 40 nucleotides, between about 14 and about 40 nucleotides, between about 15 and about 40 nucleotides, between about 16 and about 40 nucleotides, between about 17 and about 40 nucleotides, between about 18 and about 40 nucleotides, between about 19 and about 40 nucleotides, between about 20 and about 40 nucleotides, between about 22 and about 40 nucleotides, between about 24 and about 40 nucleotides, between about 26 and about 40 nucleotides, between about 28 and about 40 nucleotides, between about 30 and about 40 nucleotides, between about 8 and about 30 nucleotides, between about 10 and about 30 nucleotides, between about 11 and about 30 nucleotides, between about 12 and about 30 nucleotides, between about 13 and about 30 nucleotides, between about 14 and about 30 nucleotides, between about 15 and about 30 nucleotides, between about 16 and about 30 nucleotides, between about 17 and about 30 nucleotides, between about 18 and about 30 nucleotides, between about 19 and about 30 nucleotides, between about 20 and about 30 nucleotides, between about 22 and about 30 nucleotides, between about 24 and about 30 nucleotides, between about 26 and about 30 nucleotides, between about 28 and about 30 nucleotides, between about 8 and about 25 nucleotides, between about 10 and about 25 nucleotides, between about 11 and about 25 nucleotides, between about 12 and about 25 nucleotides, between about 13 and about 25 nucleotides, between about 14 and about 25 nucleotides, between about 15 and about 25 nucleotides, between about 16 and about 25 nucleotides, between about 17 and about 25 nucleotides, between about 18 and about 25 nucleotides, between about 19 and about 25 nucleotides, between about 20 and about 25 nucleotides, between about 22 and about 25 nucleotides, between about 24 and about 25 nucleotides, between about 8 and about 20 nucleotides, between about 10 and about 20 nucleotides, between about 11 and about 20 nucleotides, between about 12 and about 20 nucleotides, between about 13 and about 20 nucleotides, between about 14 and about 20 nucleotides, between about 15 and about 20 nucleotides, between about 16 and about 20 nucleotides, between about 17 and about 20 nucleotides, between about 18 and about 20 nucleotides, between about 19 and about 20 nucleotides, between about 8 and about 18 nucleotides, between about 10 and about 18
nucleotides, between about 11 and about 18 nucleotides, between about 12 and about 18 nucleotides, between about 13 and about 18 nucleotides, between about 14 and about 18 nucleotides, between about 15 and about 18 nucleotides, between about 16 and about 18 nucleotides, between about 8 and about 16 nucleotides, between about 10 and about 16 nucleotides, between about 11 and about 16 nucleotides, between about 12 and about 16 nucleotides, between about 13 and about 16 nucleotides, between about 14 and about 16 nucleotides, or between about 15 and about 16 nucleotides. In some embodiments, a guide nucleic acid can be a guide RNA or a part thereof.
[0082] Non-limiting examples of a guide RNA scaffold sequence are provided in Table 2. In some embodiments, the guide RNA scaffold sequence can comprise a polynucleotide sequence (e.g., a consecutive polynucleotide sequence) that exhibits at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or substantially about 100% sequence identity to the polynucleotide sequence of one or more members selected from Table 2 (e.g., one or more members selected from the group consisting of SEQ ID NOs. 500-596).
[0083] In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of one or more members selected from Table 2 and (ii) a spacer sequence. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 530, (ii) a spacer sequence, and (ii) the polynucleotide sequence of TT. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 532, (ii) a spacer sequence, and (ii) the polynucleotide sequence of TTTTA. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 534, (ii) a spacer sequence, and (ii) the polynucleotide sequence of TTTTG. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 536, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 537. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 538, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 539. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 541, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 542. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 543, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 544. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 549, (ii) a spacer
sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 550. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 551, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 552. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 554, (ii) a spacer sequence, and (ii) the polynucleotide sequence of TTTTA. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 564, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 550. In some cases, a guide RNA may comprise, from 5' to 3', (i) the polynucleotide sequence of SEQ ID NO: 565, (ii) a spacer sequence, and (ii) the polynucleotide sequence of SEQ ID NO: 550.
[0084] Non-limiting examples of a guide RNA scaffold fragment sequence are provided in Table 3. In some embodiments, the guide RNA scaffold sequence can comprise a polynucleotide sequence (e.g, a consecutive polynucleotide sequence) that exhibits at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or substantially about 100% sequence identity to the polynucleotide sequence of one or more members selected from Table 3 (e.g., one or more members selected from the group consisting of SEQ ID NOs. 597-601).
[0085] Any suitable CRISPR/Cas system can be used. A CRISPR/Cas system can be referred to using a variety of naming systems. A CRISPR/Cas system can be a type I, a type II, a type III, a type IV, a type V, a type VI system, or any other suitable CRISPR/Cas system. A CRISPR/Cas system as used herein can be a Class 1, Class 2, or any other suitably classified CRISPR/Cas system. Class 1 or Class 2 determination can be based upon the genes encoding the effector module. Class 1 systems generally have a multi-subunit crRNA-effector complex, whereas Class 2 systems generally have a single protein, such as Cas9, Cpfl, C2cl, C2c2, C2c3 or a crRNA- effector complex. A Class 1 CRISPR/Cas system can use a complex of multiple Cas proteins to effect regulation. A Class 1 CRISPR/Cas system can comprise, for example, type I (e.g., I, IA, IB, IC, ID, IE, IF, IU), type III (e.g. III, IIIA, IIIB, IIIC, IIID), and type IV (e g, IV, IVA, IVB) CRISPR/Cas type. A Class 2 CRISPR/Cas system can use a single large Cas protein to effect regulation. A Class 2 CRISPR/Cas systems can comprise, for example, type II (e.g, II, IIA, IIB) and type V CRISPR/Cas type. CRISPR systems can be complementary to each other, and/or can lend functional units in trans to facilitate CRISPR locus targeting.
[0086] When a actuator moiety can comprise a Cas protein or derivative thereof, the Cas protein or derivative thereof can be a Class 1 or a Class 2 Cas protein. A Cas protein can be a type I, type
II, type III, type IV, type V Cas protein, or type VI Cas protein. A Cas protein can comprise one or more domains. Non-limiting examples of domains include, guide nucleic acid recognition and/or binding domain, nuclease domains (e.g., DNase or RNase domains, RuvC, HNH), DNA binding domain, RNA binding domain, helicase domains, protein-protein interaction domains, and dimerization domains. A guide nucleic acid recognition and/or binding domain can interact with a guide nucleic acid. A nuclease domain can comprise catalytic activity for nucleic acid cleavage. A nuclease domain can lack catalytic activity to prevent nucleic acid cleavage. A Cas protein can be a chimeric Cas protein or fragment thereof that is fused to other proteins or polypeptides. A Cas protein can be a chimera of various Cas proteins, for example, comprising domains from different Cas proteins.
[0087] Non-limiting examples of Cas proteins include c2cl, C2c2, c2c3, Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas5e (CasD), Cash, Cas6e, Cas6f, Cas7, Cas8a, Cas8al, Cas8a2, Cas8b, Cas8c, Cas9 (Csnl or Csxl2), CaslO, CaslOd, CaslO, CaslOd, CasF, CasG, CasH, Cpfl, Csyl, Csy2, Csy3, Csel (CasA), Cse2 (CasB), Cse3 (CasE), Cse4 (CasC), Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, Csf4, Cul966, Casl3a, Casl3b, Casl3c, Casl3d, Casl3X, Casl3Y, Casl4 (e.g., Casl4 variants, such as Casl4a, Casl4b, Casl4c, etc.) and homologs or modified versions thereof.
[0088] A Cas protein or fragment or derivative thereof can be from any suitable organism. Nonlimiting examples include Streptococcus pyogenes, Streptococcus thermophilus, Streptococcus sp., Staphylococcus aureus, Nocardiopsis dassonvillei, Streptomyces pristinae spiralis, Streptomyces viridochromo genes, Streptomyces viridochromogenes, Streptosporangium roseum, Streptosporangium roseum, AlicyclobacHlus acidocaldarius, Bacillus pseudomycoides, Bacillus selenitireducens, Exiguobacterium sibiricum, Lactobacillus delbrueckii, Lactobacillus salivarius, Microscilla marina, Burkholderiales bacterium, Polaromonas nap hthalenivorans, Polaromonas sp., Crocosphaera watsonii, Cyanothece sp., Microcystis aeruginosa, Pseudomonas aeruginosa, Synechococcus sp., Acetohalobium arabaticum, Ammonifex degensii, Caldicelulosiruptor becscii, Candidatus Desulforudis, Clostridium botulinum, Clostridium difficile, Finegoldia magna, Natranaerobius thermophilus, Pelotomaculum thermopropionicum, Acidithiobacillus caldus, Acidithiobacillus ferrooxidans, Allochromatium vinosum, Marinobacter sp., Nitrosococcus halophilus, Nitrosococcus watsoni, Pseudoalteromonas haloplanktis, Ktedonobacter racemifer, Methanohalobium evestigatum, Anabaena variabilis, Nodularia spumigena, Nostoc sp., Arthrospira maxima, Arthrospira platensis, Arthrospira sp., Lyngbya sp., Microcoleus chthonoplastes, Oscillatoria sp., Petrotoga mobilis, Thermosipho africanus,
Acaryochloris marina, Leptotrichia shahii, and Francisella novicida. In some aspects, the organism is Streptococcus pyogenes (S. pyogenes) In some aspects, the organism is Staphylococcus aureus (S. aureus). In some aspects, the organism is Streptococcus thermophilus (S. therm ophilus).
[0089] A Cas protein can be derived from a variety of bacterial species including, but not limited to, Veillonella atypical, Fusobacterium nucleatum, Filifactor alocis, Solobacterium moorei, Coprococcus catus, Treponema denticola, Peptoniphilus duerdenii, Catenibacterium mitsuokai, Streptococcus mutans, Listeria innocua, Staphylococcus pseudintermedius, Acidaminococcus intestine, Olsenella uli, Oenococcus kitaharae, Bifidobacterium bifidum, Lactobacillus rhamnosus, Lactobacillus gasseri, Finegoldia magna, Mycoplasma mobile, Mycoplasma gallisepticum, Mycoplasma ovipneumoniae, Mycoplasma canis, Mycoplasma synoviae, Eubacterium rectale, Streptococcus thermophilus, Eubacterium dolichum, Lactobacillus coryniformis subsp. Torquens, Ilyobacter polytropus, Ruminococcus albus, Akkermansia muciniphila, Acidothermus cellulolyticus, Bifidobacterium longum, Bifidobacterium dentium, Corynebacterium diphtheria, Elusimicrobium minutum, Nitratifractorsalsuginis, Sphaerochaeta globus, Fibrobacter succinogenes subsp. Succinogenes, Bacteroides fragilis, Capnocytophaga ochracea, Rhodopseudomonas palustris, Prevotella micans, Prevotella ruminicola, Flavobacterium columnare, Aminomonas paucivorans, Rhodospirillum rubrum, Candidatus Puniceispirillum marinum, Verminephrobacter eiseniae, Ralstonia syzygii, Dinoroseobacter shibae, Azospirillum, Nitrobacter hamburgensis, Bradyrhizobium, Wolinellasuccinogenes, Campylobacter jejuni subsp. Jejuni, Helicobacter mustelae, Bacillus cereus, Acidovorax ebreus, Clostridium perfringens, Parvibaculum lavamentivorans, Roseburia intestinalis, Neisseria meningitidis, Pasteurella multocida subsp. Multocida, Sutterella wadsworthensis, proteobacterium, Legionella pneumophila, Parasutterella excrementihominis, Wolinella succinogenes, and Francisella novicida.
[0090] A Cas protein as used herein can be a wildtype or a modified form of a Cas protein. A Cas protein can be an active variant, inactive variant, or fragment of a wild type or modified Cas protein. A Cas protein can comprise an amino acid change such as a deletion, insertion, substitution, variant, mutation, fusion, chimera, or any combination thereof relative to a wildtype version of the Cas protein (e.g., a wild-type version of Casl4). A Cas protein can be a polypeptide with at least about 5%, at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%,
or 100% sequence identity or sequence similarity to a wild type Cas protein. A Cas protein can be a polypeptide with at most about 5%, at most about 10%, at most about 20%, at most about 30%, at most about 40%, at most about 50%, at most about 60%, at most about 70%, at most about 80%, at most about 90%, or at most about 100% sequence identity and/or sequence similarity to a wild type exemplary Cas protein. Variants or fragments can comprise at least about 5%, at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100% sequence identity or sequence similarity to a wild type or modified Cas protein or a portion thereof. Variants or fragments can be targeted to a nucleic acid locus in complex with a guide nucleic acid while lacking nucleic acid cleavage activity.
[0091] A Cas protein can comprise one or more nuclease domains, such as DNase domains. For example, a Cas9 protein can comprise a RuvC-like nuclease domain and/or an HNH-like 20 nuclease domain. The in a nuclease active form of Cas9, RuvC and HNH domains can each cut a different strand of double-stranded DNA to make a double-stranded break in the DNA. A Cas protein can comprise only one nuclease domain (e.g., Cpfl comprises RuvC domain but lacks HNH domain). In some embodiments, nuclease domains are absent. In some embodiments, nuclease domains are present but inactive or have reduced or minimal activity. In some embodiments, nuclease domains are present and active.
[0092] One or a plurality of the nuclease domains (e.g., RuvC, HNH) of a Cas protein can be deleted or mutated so that they are no longer functional or comprise reduced nuclease activity. For example, in a Cas protein comprising at least two nuclease domains (e.g., Cas9), if one of the nuclease domains is deleted or mutated, the resulting Cas protein, known as a nickase, can generate a single-strand break at a CRISPR RNA (crRNA) recognition sequence within a doublestranded DNA but not a double-strand break. Such a nickase can cleave the complementary strand or the non-complementary strand, but may not cleave both. If all of the nuclease domains of a Cas protein (e.g., both RuvC and HNH nuclease domains in a Cas9 protein; RuvC nuclease domain in a Cpfl protein) are deleted or mutated, the resulting Cas protein can have a reduced or no ability to cleave both strands of a double-stranded DNA. An example of a mutation that can convert a Cas9 protein into a nickase is a D10A (aspartate to alanine at position 10 of Cas9) mutation in the RuvC domain of Cas9 from S. pyogenes. H939A (histidine to alanine at amino acid position 839) or H840A (histidine to alanine at amino acid position 840) in the HNH domain of Cas9 from S. pyogenes can convert the Cas9 into a nickase. An example of a mutation that can
convert a Cas9 protein into a dead Cas9 is a D10A (aspartate to alanine at position 10 of Cas9) mutation in the RuvC domain and H939A (histidine to alanine at amino acid position 839) or H840A (histidine to alanine at amino acid position 840) in the HNH domain of Cas9 from S. pyogenes.
[0093] A nuclease dead Cas protein can comprise one or more mutations relative to a wild-type version of the protein. The mutation can result in no more than 90%, no more than 80%, no more than 70%, no more than 60%, no more than 50%, no more than 40%, no more than 30%, no more than 20%, no more than 10%, no more than 5%, or no more than 1% of the nucleic acid-cleaving activity in one or more of the plurality of nucleic acid-cleaving domains of the wild-type Cas protein. The mutation can result in one or more of the plurality of nucleic acid-cleaving domains retaining the ability to cleave the complementary strand of the target nucleic acid but reducing its ability to cleave the non-complementary strand of the target nucleic acid. The mutation can result in one or more of the plurality of nucleic acid-cleaving domains retaining the ability to cleave the non-complementary strand of the target nucleic acid but reducing its ability to cleave the complementary strand of the target nucleic acid. The mutation can result in one or more of the plurality of nucleic acid-cleaving domains lacking the ability to cleave the complementary strand and the non-complementary strand of the target nucleic acid. The residues to be mutated in a nuclease domain can correspond to one or more catalytic residues of the nuclease. For example, residues in the wild type exemplary S. pyogenes Cas9 polypeptide such as AsplO, His840, Asn854 and Asn856 can be mutated to inactivate one or more of the plurality of nucleic acidcleaving domains (e.g., nuclease domains). The residues to be mutated in a nuclease domain of a Cas protein can correspond to residues AsplO, His840, Asn854 and Asn856 in the wild type S. pyogenes Cas9 polypeptide, for example, as determined by sequence and/or structural alignment. [0094] A Cas protein can comprise an amino acid sequence having at least about 5%, at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100% sequence identity or sequence similarity to a nuclease domain (e.g., RuvC domain, HNH domain) of a wild-type Cas protein.
[0095] A Cas protein, variant or derivative thereof can be modified to enhance regulation of gene expression by compositions and methods of the disclosure, e.g., as part of a complex disclosed herein. A Cas protein can be modified to increase or decrease nucleic acid binding affinity, nucleic acid binding specificity, enzymatic activity, and/or binding to other factors, such as heterodimerization or oligomerization domains and induce ligands. Cas proteins can also be
modified to change any other activity or property of the protein, such as stability. For example, one or more nuclease domains of the Cas protein can be modified, deleted, or inactivated, or a Cas protein can be truncated to remove domains that are not essential for the desired function of the protein or complex. A Cas protein can be modified to modulate (e.g., enhance or reduce) the activity of the Cas protein for regulating gene expression by a complex of the disclosure that comprises a heterologous gene effector.
[0096] For example, a Cas protein can be coupled (e.g., fused, covalently coupled, or non- covalently coupled) to a heterologous gene effector (e.g., an epigenetic modification domain, a transcriptional activation domain, and/or a transcriptional repressor domain). A Cas protein can be coupled (e.g., fused, covalently coupled, or non-covalently coupled) to an oligomerization or dimerization domain as disclosed herein (e.g., a heterodimerization domain). A Cas protein can be coupled (e.g., fused, covalently coupled, or non-covalently coupled) to a heterologous polypeptide that provides increased or decreased stability. A Cas protein can be coupled (e g., fused, covalently coupled, or non-covalently coupled) to a sequence that can facilitate degradation of the Cas protein or a complex containing the Cas protein, for example, a degron, such as an inducible degron (e.g., auxin inducible).
[0097] A Cas protein can be coupled (e.g., fused, covalently coupled, or non-covalently coupled) to any suitable number of partners, for example, at least one, at least two, at least three, at least four, or at least five, at least six, at least seven, or at least 8 partners. In some embodiments, a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to at most two, at most three, at most four, at most five, at most six, at most seven, at most eight, or at most ten partners. In some embodiments, a Cas protein of the disclosure is coupled (e g., fused, covalently coupled, or non-covalently coupled) to 1 - 5, 1 - 4, 1 - 3, 1 - 2, 2 - 5, 2 - 4, 2 - 3, 3 - 5, 3 - 4, or 4 - 5 partners. In some embodiments, a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to one partner. In some embodiments, a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non- covalently coupled) to two partners. In some embodiments, a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to three partners. In some embodiments, a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non- covalently coupled) to four partners. In some embodiments, a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to five partners. In some embodiments, a Cas protein of the disclosure is coupled (e.g., fused, covalently coupled, or non- covalently coupled) to six partners.
[0098] A Cas protein can be a fusion protein, e.g., a fusion comprising the Cas protein and one or more of the partners as disclosed herein The fused domain or heterologous polypeptide can be located at the N-terminus, the C-terminus, or internally within the Cas protein.
[0099] A partner of the Cas protein (e.g., covalently or non-covalently coupled to a dCas protein as disclosed herein) can be a transcriptional effector (e.g., a transcriptional activator or a transcriptional repressor). The transcriptional effector can be heterologous to the cell as provided herein.
[0100] In some embodiments, the Cas protein and the transcriptional effector (e.g., transcriptional activator) can be fused in a single polypeptide sequence. The Cas protein and the transcriptional effector can be fused directly to one another. Alternatively, the Cas protein and the transcriptional effector can be fused via a peptide linker (or an amino acid linker) that is heterologous to the Cas protein and the transcriptional activator. The peptide linker can be derived from a natural polypeptide sequence. Alternatively, the peptide linker can be a synthetic sequence. The peptide linker can have a length of at least or up to about 1 amino acid residue, at least or up to about 2 amino acid residues, at least or up to about 3 amino acid residues, at least or up to about 4 amino acid residues, at least or up to about 5 amino acid residues, at least or up to about 10 amino acid residues, at least or up to about 15 amino acid residues, at least or up to about 20 amino acid residues, at least or up to about 25 amino acid residues, at least or up to about 30 amino acid residues, at least or up to about 35 amino acid residues, at least or up to about 40 amino acid residues, at least or up to about 45 amino acid residues, at least or up to about 50 amino acid residues, at least or up to about 60 amino acid residues, at least or up to about 70 amino acid residues, at least or up to about 80 amino acid residues, at least or up to about 90 amino acid residues, or at least or up to about 100 amino acid residues. In some cases, the peptide linker can be a GS linker.
[0101] The term “GS linker” or “GS linker sequence,” as used interchangeably herein, generally refers to a peptide linker that mainly comprises glycine and serine residues. Particularly, at least or up to about 60%, at least or up to about 65%, at least or up to about 70%, at least or up to about 75%, at least or up to about 80%, at least or up to about 85%, at least or up to about 90%, at least or up to about 95% or substantially about 100% of the amino acid residues in the GS linker sequence can be selected from glycine and serine residues. The GS linker sequence according to the present invention can, for example, comprise from about 1 to about 50 amino acid residues, from about 1 to about 45 amino acid residues, from about 1 to about 40 amino acid residues, from about 1 to about 35 amino acid residues, or from about 1 to about 30 amino
acid residues, in total. In some cases, the GS linker sequence may not comprise about 10, about 5, about 4, about 3, about 2 or about 1 amino acid residue (s) other than glycine or serine.
[0102] In some embodiments, the transcriptional effector can be a histone epigenetic modifier (or a histone modifier). In some cases, the histone epigenetic modifier can modulate histones through methylation (e.g., a histone methylation modifier, such as an amino acid methyltransferase, e.g., KRAB). In some cases, the histone epigenetic modifier can modulate histones through acetylation. In some cases, the histone epigenetic modifier can modulate histones through phosphorylation. In some cases, the histone epigenetic modifier can modulate histones through ADP-ribosylation. In some cases, the histone epigenetic modifier can modulate histones through glycosylation. In some cases, the histone epigenetic modifier can modulate histones through SUMOylation. In some cases, the histone epigenetic modifier can modulate histones through ubiquitination. In some cases, the histone epigenetic modifier can modulate histones by remodeling histone structure, e.g., via an ATP hydrolysis-dependent process.
[0103] In some embodiments, the transcriptional effector can be a gene epigenetic modifier (or a gene modifier). In some cases, a gene modifier can modulate genes through methylation (e.g., a gene methylation modifier, such as a DNA methyltransferase or DNMT). In some cases, a gene modifier can modulate genes through acetylation.
[0104] In some embodiments, the transcriptional effector is from a family of related histone acetyltransferases. Non-limiting examples of histone acetyltransferases include GNAT subfamily, MYST subfamily, p300/CBP subfamily, HAT1 subfamily, GCN5, PCAF, Tip60, MOZ, MORF, MOF, HBO1, p300, CBP, HAT1, ATF-2, SRC1, and TAFII250.
[0105] In some embodiments, the transcriptional effector is from a histone epigenetic modifier (e.g., a histone lysine methyltransferase, a histone lysine demethylase, or a DNA methylase). Non-limiting examples of histone epigenetic modifier include EZH subfamily, Non-SET subfamily, Other SET subfamily, PRDM subfamily, SET1 subfamily, SET2 subfamily, SUV39 subfamily, SYMD subfamily, ASH IL, EHMT1, EHMT2, EZH1, EZH2, MLL, MLL2, MLL3, MLL4, MLL5, NSD1, NSD2, NSD3, PRDM1, PRDM10, PRDM 11, PRDM12, PRDM13, PRDM14, PRDM15, PRDM16, PRDM2, PRDM4, PRDM5, PRDM6, PRDM7, PRDM8, PRDM9, SET1, SET1L, SET2L, SETD2, SETD3, SETD4, SETD5, SETD6, SETD7, SETD8, SETDB1, SETDB2, SETMAR, SUV39H1, SUV39H2, SUV420H1, SUV420H2, SYMD1, SYMD2, SYMD3, SYMD4, and SYMD5.
[0106] Examples of proteins (or fragments thereof) that can be used as a fusion partner to increase transcription include but are not limited to: transcriptional activators such as VP16, VP64, VP48, VP 160, p65 subdomain (e.g., from NFkB), and activation domain of EDLL and/or
TAL activation domain (e.g., for activity in plants); and histone epigenetic modifier such as SET1A, SET1B, MLL1 to 5, ASH1, SYMD2, NSD1, IHDM2a/b, UTX, JMID3, GCN5, PCAF, CBP, p300, TAF1, TIP60/PLIP, M0ZMYST3, M0RFMYST4, SRC1, ACTR, PI 60, CLOCK, Ten-Eleven Translocation (TET) dioxygenase 1 (TET1CD), TET1, DME, DML1, DML2, ROS1, and the like. An additional example of such gene activating modulator is VP64-p65-Rta fusion polypeptide (VPR).
[0107] Examples of proteins (or fragments thereof) that can be used as a fusion partner to decrease transcription include but are not limited to: transcriptional repressors such as the Kruppel associated box (KRAB or SKD); KOX1 repression domain; the Mad mSIN3 interaction domain (SID); the ERF repressor domain (ERD), the SRDX repression domain (e.g., for repression in plants), and the like; histone lysine methyltransferases such as Pr-SET7/8, SUV4- 20H1, RIZ1, and the like; histone lysine demethylases such as JMJD2A/JHDM3A, JMJD2B, JMJD2C/GASC1, JMJD2D, J ARID 1 A/RBP2, JARID1B/PLU- 1 , J ARID 1C/SMCX, JARIDID/SMCY, and the like; histone lysine deacetylases such as HDAC1, HDAC2, HDAC3, HDAC8, HDAC4, HDAC5, HDAC7, HDAC9, SIRT1, SIRT2, HDAC11, and the like; DNA methylases such as Hhal DNA m5c-methyltransferase (M.Hhal), DNA methyltransferase 1 (DNMT1), DNA methyltransferase 3a (DNMT3a), DNA methyltransferase 3b (DNMT3b), METI, DRM3 (plants), ZMET2, CMT1, CMT2 (plants), and the like; and periphery recruitment elements such as Lamin A, Lamin B, and the like.
[0108] A Cas protein can be provided in any form. For example, a Cas protein can be provided in the form of a protein, such as a Cas protein alone or complexed with a guide nucleic acid as a ribonucleoprotein. A Cas protein can be provided in a complex, for example, complexed with a guide nucleic acid and/or one or more heterologous gene effectors of the disclosure. A Cas protein can be provided in the form of a nucleic acid encoding the Cas protein, such as an RNA (e.g., messenger RNA (mRNA)), or DNA. The nucleic acid encoding the Cas protein can be codon optimized for efficient translation into protein in a particular cell or organism.
[0109] Nucleic acids encoding Cas proteins, fragments, or derivatives thereof can be stably integrated in the genome of a cell. Nucleic acids encoding Cas proteins can be operably linked to a promoter, for example, a promoter that is constitutively or inducibly active in the cell. Nucleic acids encoding Cas proteins can be operably linked to a promoter in an expression construct. Expression constructs can include any nucleic acid constructs capable of directing expression of a gene or other nucleic acid sequence of interest (e.g., a Cas gene) and which can transfer such a nucleic acid sequence of interest to a target cell.
[0110] In some embodiments, a Cas protein, variant or derivative thereof is a nuclease dead Cas (dCas) protein. A dead Cas protein can be a protein that lacks nucleic acid cleavage activity [OHl] A Cas protein can comprise a modified form of a wild type Cas protein. The modified form of the wild type Cas protein can comprise an amino acid change (e.g., deletion, insertion, or substitution) that reduces the nucleic acid-cleaving activity of the Cas protein. For example, the modified form of the Cas protein can have no more than 90%, no more than 80%, no more than 70%, no more than 60%, no more than 50%, no more than 40%, no more than 30%, no more than 20%, no more than 10%, no more than 5%, or no more than 1% of the nucleic acid-cleaving activity of the wild-type Cas protein (e g., Cas9 from S. pyogenes). The modified form of Cas protein can have no substantial nucleic acid-cleaving activity. When a Cas protein is a modified form that has no substantial nucleic acid-cleaving activity, it can be referred to as enzymatically inactive, “deactivated” and/or “dead” (abbreviated by “d”). A dead Cas protein (e.g., dCas, dCas9, dCasl4) can bind to a target polynucleotide but may not cleave or minimally cleaves the target polynucleotide. In some aspects, a dead Cas protein is a dead Casl4 protein. In some aspects, a dead Cas protein is a not a dead Casl4 protein.
[0112] A dCas polypeptide (e.g., dCasl4 polypeptide) can associate with a single guide RNA (sgRNA) to activate or repress transcription of a target gene (e.g., target endogenous gene), for example, in combination with heterologous gene effector(s) disclosed herein. sgRNAs can be introduced into cells expressing the Cas or variant thereof, as provided herein. In some cases, such cells can contain one or more different sgRNAs that target the same target gene (e.g., target endogenous gene) or target gene regulatory sequence. In other cases, the sgRNAs target different nucleic acids in the cell (e.g., different target genes, different target gene regulatory sequences, or different sequences within the same target gene or target gene regulatory sequence).
[0113] Enzymatically inactive can refer to a nuclease that can bind to a nucleic acid sequence in a polynucleotide in a sequence-specific manner, but will not cleave a target polynucleotide or will cleave it at a substantially reduced frequency. An enzymatically inactive guide moiety can comprise an enzymatically inactive domain (e.g. nuclease domain). Enzymatically inactive can refer to no activity. Enzymatically inactive can refer to substantially no activity. Enzymatically inactive can refer to essentially no activity. Enzymatically inactive can refer to an activity no more than 1%, no more than 2%, no more than 3%, no more than 4%, no more than 5%, no more than 6%, no more than 7%, no more than 8%, no more than 9%, or no more than 10% activity compared to a comparable wild-type activity (e.g., nucleic acid cleaving activity, wild-type Cas9 or wild-type Cas 14 activity).
[0114] In some embodiments, the actuator moiety as disclosed herein does not contain a nucleic acid-guided targeting system. For example, the actuator moiety can include proteins that bind to a target gene (e.g., target endogenous gene) or target gene regulatory sequence based on protein structural features, such as certain nucleases disclosed herein.
[0115] In some embodiments, the wild-type Cas protein that the engineered Cas protein is a modification of has a native amino acid sequence with a length of less than 800 amino acids (e.g., Casl4 or a variant thereof). This relatively small size provides several advantages to the provided engineered Cas protein. For example, the small size can allow the Cas protein to be delivered to a host cell, e.g., a cell of a human patient, via a single adeno-associated virus delivery system that would be otherwise incapable of delivering a larger protein. The native amino acid sequence can have a length that is, for example, between 500 amino acids and 700 amino acids, e.g., between 500 amino acids and 620 amino acids, between 540 amino acids and 660 amino acids, between 560 amino acids and 680 amino acids, or between 580 amino acids and 700 amino acids. In terms of upper limits, the native amino acid sequence can have a length that is less than 700 amino acids, e.g., less than 680 amino acids, less than 660 amino acids, less than 640 amino acids, less than 620 amino acids, less than 600 amino acids, less than 580 amino acids, less than 560 amino acids, less than 540 amino acids, or less than 520 amino acids. In terms of lower limits, the native amino acid sequence can have an length that is greater than 500 amino acids, e g., greater than 520 amino acids, greater than 540 amino acid, greater than 560 amino acids, greater than 580 amino acids, greater than 600 amino acids, greater than 620 amino acids, greater than 640 amino acids, greater than 660 amino acids, or greater than 700 amino acids. Larger lengths, e.g., greater than 700 amino acids, and smaller lengths, e.g., less than 500 amino acids, are also contemplated. [0116] In some embodiments, the modified amino acid sequence of the engineered Cas protein includes one or more substitutions in the native amino acid sequence, where the positions of at least some of these substitutions follow one or more particular rules determined to have surprising advantages for the characteristics of the engineered Cas protein. For example, the particular substitution rules have been selected for their ability to produce engineered Cas proteins capable of functioning within eukaryotic cells. According to these particular rules, all or some of the one or more substitutions in the native amino acid sequence are either (1) within or no more than 30 amino acids downstream of a (D/E/K/N)X(R/F)(E/K)N motif of the native amino acid sequence, (2) at or no more than 30 amino acids upstream or downstream of position 241 of the native amino acid sequence, (3) at or no more than 30 amino acids upstream or downstream of position 516 of the native amino acid sequence, and/or (4) having an electrically charged amino acid in the native amino acid sequence.
[0117] In some embodiments, the native amino acid sequence includes a (D/E/K/N)X(R/F)(E/K)N motif, and the modified amino acid sequence includes one or more substitutions at positions within or no more than 30 amino acids upstream or downstream of the motif. The modified amino acid sequence can include, for example, one, two, three, four, five, six, seven, eight, nine, ten, or more than ten substitutions within or no more than 30 amino acids upstream or downstream of the motif. At least one of the one or more substitutions to the native amino acid sequence can be, for example, within or no more than 28 amino acids, 26 amino acids, 24 amino acids, 22 amino acids, 20 amino acids, 18 amino acids, 16 amino acids, 14 amino acids, 12 amino acids, or 10 amino acids of the motif. In some embodiments, at least one of the one or more substitutions within or no more than 30 amino acids upstream or downstream of the motif is to an R, A, S, or G. In some embodiments, each of the one or more substitutions within or no more than 30 amino acids upstream or downstream of the motif is independently to an R, A, S, or G. In some embodiments, all of the substitutions to the native amino acid sequence are at positions within or no more than 30 amino acids upstream or downstream of the motif.
[0118] Some embodiments of the present disclosure are directed to a Cas protein that is not a variant of CasX.
[0119] Some embodiments of the present disclosure are directed to small Cas-based regulation of gene expression, such as at the transcriptional and/or translational level. Small Cas proteins can be targeted to DNA and/or RNA, and are much smaller than typical CRISPR effectors, e.g., ranging in size from about 400 amino acids to about 700 amino acids. The small size of can allow such Cas proteins proteins and/or effector domain fusions thereof to be paired with a CRISPR array encoding multiple guide RNAs while remaining under the packaging size limit of various delivery vehicles, such as the versatile adeno-associated virus (AAV) delivery vehicle or non-viral delivery vehicles (e.g., lipid nanoparticles), for primary cell and in vivo delivery.
[0120] In some embodiments, the Cas protein or a variant thereof as provided herein (e.g., a variant of SEQ ID NO: 1 as disclosed herein) can have a size of at most about 800 amino acids, at most about 780 amino acids, at most about 760 amino acids, at most about 750 amino acids, at most about 740 amino acids, at most about 720 amino acids, at most about 700 amino acids, at most about 680 amino acids, at most about 660 amino acids, at most about 650 amino acids, at most about 640 amino acids, at most about 620 amino acids, at most about 600 amino acids, at most about 580 amino acids, at most about 560 amino acids, at most about 550 amino acids, at most about 540 amino acids, at most about 520 amino acids, at most about 500 amino acids, 480 amino acids, at most about 460 amino acids, at most about 450 amino acids, at most about 440 amino acids, at most about 420 amino acids, at most about 400 amino acids, or less.
[0121] Non-limiting examples of Cas protein are provided in Table 1. In some embodiments, the Cas protein or the deactivated Cas protein (dCas) as provided herein can comprise a polypeptide sequence (e.g., a consecutive polypeptide sequence) that exhibits at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or substantially about 100% sequence identity to the polypeptide sequence of one or more members selected from Table 1 (e.g., one or more members selected from the group consisting of SEQ ID NOs. 1-201).
[0122] In some embodiments, the Cas protein or a variant thereof, as provided herein, can comprise the amino acid sequence having at least about 60%, at least about 65%, at least about 70%, at least about 71%, at least about 72%, at least about 73%, at least about 74%, at least about
75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about
80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about
85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about
90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about
95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or greater sequence identity to the amino acid sequence of SEQ ID NO: 1. Cas protein or a variant thereof, as provided herein, can comprise the amino acid sequence having at most about 100%, at most about 99%, at most about 98%, at most about 97%, at most about 96%, at most about 95%, at most about 94%, at most about 93%, at most about 92%, at most about 91%, at most about 90%, at most about 89%, at most about 88%, at most about 87%, at most about 86%, at most about 85%, at most about 84%, at most about 83%, at most about 82%, at most about 81%, at most about 80%, at most about 79%, at most about 78%, at most about 77%, at most about 76%, at most about 75%, at most about 74%, at most about 73%, at most about 72%, at most about 71%, at most about 70%, at most about 65%, at most about 60%, or less sequence identity to the amino acid sequence of SEQ ID NO: 1.
[0123] In some embodiments, a Cas protein or a variant thereof as disclosed herein can exhibit a greater cationic charge (e.g., at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, or more cationic charges) as compared to the wild-type Casl4. The enhanced cationic charge can (i) enhance complexation of the Cas protein to the guide nucleic acid and/or (ii) enhance complexation of the Cas protein to the target polynucleotide sequence (e g., endogenous target polynucleotide sequence). In some cases, the Cas protein can comprise one or more substitutions for the enhanced cationic charge. The one or more
substitutions at positions within or no more than 30 amino acids upstream or downstream of the (D/E/K/N)X(R/F)(E/K)N motif of the native amino acid sequence can include, for example, one or more substitutions at positions selected from positions 143, 147, 151, and 154 of the native amino acid sequence. In some embodiments, e.g., when the native amino acid sequence is the sequence of SEQ ID NO: 1, the one or more substitutions include substitutions are at one or more positions selected from D143, T147, E151, and K154. In some embodiments, e.g., when the native amino acid sequence is the sequence of SEQ ID NO: 1, the one or more substitutions include one or more substitutions selected from D143R, T147R, E151R, and K154R.
[0124] In some embodiments, the modified amino acid sequence includes one or more substitutions at or no more than 30 amino acids upstream or downstream of position 241 of the native amino acid sequence. The modified amino acid sequence can include, for example, one, two, three, four, five, six, seven, eight, nine, ten, or more than ten substitutions within or no more than 30 amino acids upstream or downstream of position 241. At least one of the one or more substitutions to the native amino acid sequence can be, for example, within or no more than 28 amino acids, 26 amino acids, 24 amino acids, 22 amino acids, 20 amino acids, 18 amino acids, 16 amino acids, 14 amino acids, 12 amino acids, or 10 amino acids of position 241. In some embodiments, at least one of the one or more substitutions within or no more than 30 amino acids upstream or downstream of position 241 is to an R, A, S, or G. In some embodiments, each of the one or more substitutions within or no more than 30 amino acids upstream or downstream of position 241 is independently to an R, A, S, or G. In some embodiments, all of the substitutions to the native amino acid sequence are at positions within or no more than 30 amino acids upstream or downstream of position 241.
[0125] In some embodiments, e.g., when the native amino acid sequence is the sequence of SEQ ID NO: 1, the one or more substitutions at positions having an electrically charged amino include substitutions are at one or more positions selected from KI 1, K73, D143, E151, K154, E241, D318, K330, K457, E425, E462, E507, E527, and E528. In some embodiments, e.g., when the native amino acid sequence is the sequence of SEQ ID NO: 1, the one or more substitutions include one or more substitutions selected from KI IR, K73R, D143R, E151R, K154R, E241R, D318R, K330R, E425N, K457R, E462R, E507R, E527R, and E528R. In some embodiments, the modified amino acid sequence includes a D143R substitution. In some embodiments, the only substitution in the modified amino acid sequence is D143R.
[0126] In some embodiments, the modified amino acid sequence of the engineered Cas protein includes two substitutions in the native amino acid sequence. In some embodiments, the modified amino acid sequence has exactly two substitutions in the native amino acid sequence. In some
embodiments, the modified amino acid sequence includes two substitutions at positions selected from positions 143, 147, 151, 154, 241, 330, 425, 504, 507, 516, 519, 527, and 528. In some embodiments, the modified amino acid sequence has exactly two substitutions, where the exactly two substitutions are at positions selected from positions 143, 147, 151, 154, 241, 330, 425, 504, 507, 516, 519, 527, and 528. In some embodiments, e g, when the native amino acid sequence is the sequence of SEQ ID NO: 1, the modified amino acid sequence includes two substitutions at positions selected from D143, T147, E151, K154, E241, K330, E425, N504, E507, N516, N519, E527, and E528. In some embodiments, e g, when the native amino acid sequence is the sequence of SEQ ID NO: 1, the modified amino acid sequence has exactly two substitutions, where the exactly two substitutions are at positions selected from D143, T147, E151, K154, E241, K330, E425, N504, E507, N516, N519, E527, and E528.
[0127] In some embodiments, the modified amino acid sequence includes a substitution at position 143 and a substitution at a position selected from positions 147, 151, 154, 241, 330, 425, 504, 507, 516, 519, 527, and 528. In some embodiments, the modified amino acid includes a substitution at position 143 and exactly one other substitution, where the exactly one other substitution is at a position selected from positions 147, 151, 154, 241, 330, 425, 504, 507, 516, 519, 527, and 528. In some embodiments, e.g, when the native amino acid sequence is the sequence of SEQ ID NO: 1, the modified amino acid sequence includes a substitution at position D143 and a substitution at a position selected from positions T147, E151, K154, E241, K330R, E425N, N504, E507, N516, N519, E527, and E528. In some embodiments, e.g., when the native amino acid sequence is the sequence of SEQ ID NO: 1, the modified amino acid includes a substitution at position D143 and exactly one other substitution, where the exactly one other substitution is at a position selected from positions T147, E151, K154, E241, K330R, E425N, N504, E507, N516, N519, E527, and E528.
[0128] In some embodiments, e.g., when the native amino acid sequence is the sequence of SEQ ID NO: 1, the modified amino acid includes two substitutions selected from D143R, T147R, E151R, E151A, K154R, E241R, N504R, E507R, N516R, N519R, E527R, and E528R. In some embodiments, e.g., when the native amino acid sequence is the sequence of SEQ ID NO: 1, the modified amino acid includes exactly two substitutions, where the two substitutions are selected from D143R, T147R, E151R, E151A, K154R, E241R, N504R, E507R, N516R, N519R, E527R, and E528R. In some embodiments, e.g., when the native amino acid sequence is the sequence of SEQ ID NO: 1, the modified amino acid includes two substitutions selected from D143R/T147R, D143R/E151R, D143R/E241R, D143R/E425N, D143R/E507R, D143R/N519R, D143R/E527R,, D143R/E528R, D143R/R151S, D143/R151G, and D143R/E151A. In some embodiments, e.g.
when the native amino acid sequence is the sequence of SEQ ID NO: 1, the modified amino acid includes exactly two substitutions, where the two substitutions are selected from D143R/T147R, D143R/E151R, D143R/E241R, D143R/E425N, D143R/E507R, D143R/N519R, D143R/E527R,, D143R/E528R, D143R/R151S, D143/R151G, and D143R/E151A. In some embodiments, the modified amino acid sequence includes a D143R substitution and a T147R substitution. In some embodiments, the only substitutions in the modified amino acid sequence are a D143R substitution and a T147R substitution.
[0129] In some embodiments, provide herein is a dCas protein or a variant thereof where one or more amino acids of the parental Cas protein from which it is derived have been altered or otherwise removed to reduce or eliminate its nuclease activity. In some embodiments, the amino acids include D326 and D510 with respect to SEQ ID NO: 1. In some embodiments, one or both of D326 and D510 are substituted with an amino acid that reduces, substantially eliminates, or eliminates nuclease activity. In some embodiments, one or both of D326 and D510 are substituted with alanine (e.g., D326A and/or D510A based on SEQ ID NO: 1). In some embodiments, the dCas protein exhibits reduced or eliminated nuclease activity, or nuclease activity is absent or substantially absent within levels of detection.
[0130] In some embodiments, the dCas protein or a variant thereof comprises the amino acid sequence of SEQ ID NO: 1 or a variant thereof having at least about 70%, at least about 71%, at least about 72%, at least about 73%, at least about 74%, at least about 75%, at least about 76%, at least about 77%, at least about 78%, at least about 79%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or greater sequence identity to the amino acid sequence of SEQ ID NO: 1.
[0131] In some embodiments, according to any of the Cas protein systems described herein, the target nucleic acid is dsDNA. In such embodiments, dsDNA-targeting specificity is determined, at least in part, by two parameters: the gRNA spacer targeting a protospacer in the target dsDNA (the sequence in the target dsDNA corresponding to the gRNA spacer on the non-complementary DNA strand) and a short sequence, the protospacer-adjacent motif (PAM), located immediately 5' (upstream) of the protospacer on the non-complementary DNA strand. In some embodiments, the PAM is 5'-TTTG-3' or 5'-TTTA-3'. In some embodiments, the PAM is 5'-TTTG-3'. In some embodiments, the PAM is 5'-TTTA-3'.
[0132] In some embodiments, according to any of the Cas protein systems described herein, the target nucleic acid is RNA. In such embodiments, RNA-targeting specificity is determined, at least in part, by the gRNA spacer targeting a protospacer-like sequence in the target RNA (the sequence in the target RNA complementary to the gRNA spacer), and is independent of the sequence located immediately 5' (upstream) of the protospacer-like sequence. In some embodiments, the Cas protein system is also capable of targeting a dsDNA molecule, wherein the gRNA spacer is selected such that it targets a protospacer in the target dsDNA molecule having a PAM selected from 5'-TTTG-3' and 5 -TTTA-3'. In other embodiments, the Cas protein system is incapable of targeting a dsDNA molecule, wherein the gRNA spacer is selected such that any protospacers in the dsDNA molecule targeted by the gRNA spacer do not have a PAM selected from 5'-TTTG-3' and 5'-TTTA-3'.
[0133] In some embodiments, a actuator moiety can comprise a zinc finger nuclease (ZFN) or a variant, fragment, or derivative thereof. ZFN can refer to a fusion between a cleavage domain, such as a cleavage domain of Fokl, and at least one zinc finger motif (e.g., at least 2, at least 3, at least 4, or at least 5 zinc finger motifs) which can bind polynucleotides such as DNA and RNA. In some embodiments, a ZFN is used in a targeting moiety of the disclosure to bind a polynucleotide (e.g., target gene or target gene regulatory sequence), but the ZFN does not cleave or substantially does not cleave the polynucleotide, e.g., a nuclease dead ZFN. A ZFN or a variant, fragment, or derivative thereof can be fused to or associated with one of more heterologous gene effectors to form a complex of the disclosure.
[0134] The heterodimerization at certain positions in a polynucleotide of two individual ZFNs in certain orientation and spacing can lead to cleavage of the polynucleotide in nuclease-active ZFN. For example, a ZFN binding to DNA can induce a double-strand break in the DNA. In order to allow two cleavage domains to dimerize and cleave DNA, two individual ZFNs can bind opposite strands of DNA with their C-termini at a certain distance apart. In some cases, linker sequences between the zinc finger domain and the cleavage domain can require the 5' edge of each binding site to be separated by about 5-7 base pairs. In some cases, a cleavage domain is fused to the C-terminus of each zinc finger domain.
[0135] In some embodiments, the cleavage domain of an actuator moiety comprising a ZFN comprises a modified form of a wild type cleavage domain. The modified form of the cleavage domain can comprise an amino acid change (e.g., deletion, insertion, or substitution) that reduces the nucleic acid-cleaving activity of the cleavage domain. For example, the modified form of the cleavage domain can have no more than 90%, no more than 80%, no more than 70%, no more than 60%, no more than 50%, no more than 40%, no more than 30%, no more than 20%, no more
than 10%, no more than 5%, or no more than 1% of the nucleic acid-cleaving activity of the corresponding wild-type cleavage domain. The modified form of the cleavage domain can have no substantial nucleic acid-cleaving activity. In some embodiments, the cleavage domain is enzymatically inactive.
[0136] In some embodiments, a actuator moiety can comprise a “TALEN” or “TAL-effector nuclease” or a variant, fragment, or derivative thereof. TALENs refer to engineered transcription activator-like effector nucleases that generally contain a central domain of DNA-binding tandem repeats and a cleavage domain. TALENs can be produced by fusing a TAL effector DNA binding domain to a DNA cleavage domain. In some cases, a DNA-binding tandem repeat comprises 33-35 amino acids in length and contains two hypervariable amino acid residues at positions 12 and 13 that can recognize at least one specific DNA base pair. A transcription activator-like effector (TALE) protein can be fused to a nuclease such as a wild-type or mutated Fokl endonuclease or the catalytic domain ofFokl. In some embodiments, a TALEN is used in a targeting moiety of the disclosure to bind a polynucleotide (e.g., target gene or target gene regulatory sequence), but the TALEN does not cleave or substantially does not cleave the polynucleotide, e.g., a nuclease dead TALEN. A TALEN or a variant, fragment, or derivative thereof can be fused to or associated with one of more heterologous gene effectors to form a complex of the disclosure.
[0137] In some embodiments, a TALEN is engineered for reduced nuclease activity. In some embodiments, the nuclease domain of a TALEN comprises a modified form of a wild type nuclease domain. The modified form of the nuclease domain can comprise an amino acid change (e.g., deletion, insertion, or substitution) that reduces the nucleic acid-cleaving activity of the nuclease domain. For example, the modified form of the nuclease domain can have no more than 90%, no more than 80%, no more than 70%, no more than 60%, no more than 50%, no more than 40%, no more than 30%, no more than 20%, no more than 10%, no more than 5%, or no more than 1% of the nucleic acid-cleaving activity of the wild-type nuclease domain. The modified form of the nuclease domain can have no substantial nucleic acid-cleaving activity. In some embodiments, the nuclease domain is enzymatically inactive. A TALEN or a variant, fragment, or derivative thereof can be fused to or associated with one of more heterologous gene effectors to form a complex of the disclosure.
[0138] Several mutations to Fokl have been made for its use in TALENs, which, for example, improve cleavage specificity or activity. Such TALENs can be engineered to bind any desired DNA sequence. TALENs can be used to generate gene modifications (e.g., nucleic acid sequence
editing) by creating a double-strand break in a target DNA sequence, which in turn, undergoes NHF.J or HDR
[0139] A TALE or a variant, fragment, or derivative thereof can be fused to or associated with one of more heterologous gene effectors to form a complex of the disclosure. In some embodiments, the transcription activator-like effector (TALE) protein is fused to a heterologous gene effector and does not comprise a nuclease. In some embodiments, a TALEN does not cleave or substantially does not cleave the polynucleotide, e.g., a nuclease dead TALE. A TALE or a variant, fragment, or derivative thereof can be fused to or associated with one of more heterologous gene effectors to form a complex of the disclosure.
[0140] In some embodiments, the complex of the transcription activator-like effector (TALE) protein and the heterologous gene effector is designed to function as a transcriptional activator. In some embodiments, the complex of the transcription activator-like effector (TALE) protein and the heterologous gene effector is designed to function as a transcriptional repressor. For example, the DNA-binding domain of the transcription activator-like effector (TALE) protein can be fused (e.g., linked) to one or more heterologous gene effectors that comprise transcriptional activation domains, or to one or more heterologous gene effectors that comprise transcriptional repression domains.
[0141] In some embodiments, a actuator moiety can comprise a meganuclease. Meganucleases generally refer to rare-cutting endonucleases or homing endonucleases that can be highly sequence specific. Meganucleases can recognize DNA target sites ranging from at least 12 base pairs in length, e g., from 12 to 40 base pairs, 12 to 50 base pairs, or 12 to 60 base pairs in length. Meganucleases can be modular DNA-binding nucleases such as any fusion protein comprising at least one catalytic domain of an endonuclease and at least one DNA binding domain or protein specifying a nucleic acid target sequence. The DNA-binding domain can contain at least one motif that recognizes single- or double-stranded DNA. A nuclease-active meganuclease can generate a double-stranded break. In some embodiments, a meganuclease is used in a targeting moiety of the disclosure to bind a polynucleotide (e.g., target gene or target gene regulatory sequence), but the meganuclease does not cleave or substantially does not cleave the polynucleotide, e.g., a nuclease dead meganuclease. A meganuclease or a variant, fragment, or derivative thereof can be fused to or associated with one of more heterologous gene effectors to form a complex of the disclosure.
[0142] The meganuclease can be monomeric or dimeric. In some embodiments, the meganuclease is naturally-occurring (found in nature) or wild-type, and in other instances, the meganuclease is non-natural, artificial, engineered, synthetic, rationally designed, or man-made.
In some embodiments, the meganuclease of the present disclosure includes an I-Crel meganuclease, I-Ceul meganuclease, I-Msol meganuclease, I-Scel meganuclease, variants thereof, derivatives thereof, and fragments thereof.
[0143] In some embodiments, the nuclease domain of a meganuclease comprises a modified form of a wild type nuclease domain. The modified form of the nuclease domain can comprise an amino acid change (e.g., deletion, insertion, or substitution) that reduces or eliminates the nucleic acid-cleaving activity of the nuclease domain. For example, the modified form of the nuclease domain can have no more than 90%, no more than 80%, no more than 70%, no more than 60%, no more than 50%, no more than 40%, no more than 30%, no more than 20%, no more than 10%, no more than 5%, or no more than 1% of the nucleic acid-cleaving activity of the wild-type nuclease domain. The modified form of the nuclease domain can have no substantial nucleic acid-cleaving activity. In some embodiments, the nuclease domain is enzymatically inactive. In some embodiments, a meganuclease can bind DNA but cannot cleave the DNA. In some embodiments, a nuclease-inactive meganuclease is fused to or associated with one or more heterologous gene effectors to generate a complex of the disclosure.
[0144] In some embodiments, the heterologous polypeptide comprising the actuator moiety (e.g., and/or a complex comprising the heterologous polypeptide) can regulate expression and/or activity of a target gene (e.g., target endogenous gene). In some embodiments, the heterologous polypeptide and/or a complex thereof can edit the sequence of a nucleic acid (e.g., a gene and/or gene product). A nuclease-active Cas protein can edit a nucleic acid sequence by generating a double-stranded break or single-stranded break in a target polynucleotide.
[0145] In some embodiments, the heterologous polypeptide comprising the actuator moiety (e.g., and/or a complex comprising the heterologous polypeptide) can generate a double-strand break in a target polynucleotide, such as DNA. A double-strand break in DNA can result in DNA break repair which allows for the introduction of gene modification(s) (e.g., nucleic acid editing). In some embodiments, a nuclease induces site-specific single-strand DNA breaks or nicks, thus resulting in HDR.
[0146] A double-strand break in DNA can result in DNA break repair which allows for the introduction of gene modification(s) (e.g., nucleic acid editing). DNA break repair can occur via non-homologous end joining (NHEJ) or homology-directed repair (HDR). In HDR, a donor DNA repair template or template polynucleotide that contains homology arms flanking sites of the target DNA can be provided.
[0147] In some embodiments, the heterologous polypeptide comprising the actuator moiety (e.g., and/or a complex comprising the heterologous polypeptide) does not generate a double-strand
break in a target polynucleotide, such as DNA. Binding of the heterologous polypeptide of the complex comprising the heterologous polypeptide (e.g., a complex comprising a dCas-effector and a guide RNA) without a nucleic acid break can be sufficient to regulate expression (e.g., enhance or suppress) of a target gene (e.g., endogenous target gene).
Target gene
[0148] The disclosure provides compositions, methods, and systems for modulating expression of target genes. The target genes can be one or more endogenous target genes, such as a disease causing allele, e.g., a mutant allele. For example, disclosed herein are complexes that comprise a guide moiety and one or more heterologous polypeptides comprising an actuator moiety that can increase or decrease an activity or expression level of a target gene.
[0149] In some embodiments, a target gene or regulatory sequence thereof is endogenous to a cell, for example, present in the cell’s genome, or endogenous to a subject, for example, present in the subject’s genome. In some embodiments, a target gene or regulatory sequence thereof is not part of an engineered reporter system.
[0150] In some embodiments, a target gene is exogenous to a host subject, for example, a pathogen target gene or an exogenous gene expressed as a result of a therapeutic intervention, such as a gene therapy and/or cell therapy. In some embodiments, a target gene is an exogenous reporter gene. In some embodiments, a target gene is an exogenous synthetic gene.
[0151] In some embodiments, the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells). In some embodiments, an expression level is an RNA expression level can be measured by, for example, RNAseq, qPCR, microarray, gene array, FISH, etc. In some embodiments, an expression level is a protein expression level can be measured by, for example, Western Blot, ELISA, multiplex immunoassay, mass spectrometry, NMR, proteomics, flow cytometry, mass cytometry, etc.
[0152] In some embodiments, the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells) by at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 2-fold, at least about 3 fold, at least about 4 fold, at least about 5 fold, at least about 6 fold, at least about 7 fold, at least about 8 fold, at least about 9 fold, at least about 10 fold, at least about 11 fold, at least about 12 fold, at least about 13 fold, at least about 14, at least fold about 15 fold, at least about 20 fold, at least about 30 fold, at least about 40 fold, at least about 50 fold, at least about 60 fold, at least about 70
fold, at least about 80 fold, at least about 90 fold, at least about 100 fold, at least about 150 fold, at least about 200 fold, at least about 250 fold, at least about 300 fold, at least about 350 fold, at least about 400 fold, at least about 500 fold, at least about 600 fold, at least about 700 fold, at least about 800 fold, at least about 900 fold, at least about 1000 fold, at least about 1500 fold, at least about 2000 fold, or at least about 3000 fold.
[0153] In some embodiments, the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells) by at most about 50%, at most about 60%, at most about 70%, at most about 80%, at most about 90%, at most about 2-fold, at most about 3 fold, at most about 4 fold, at most about 5 fold, at most about 6 fold, at most about 7 fold, at most about 8 fold, at most about 9 fold, at most about 10 fold, at most about 11 fold, at most about 12 fold, at most about 13 fold, at most about 14, at most fold about 15 fold, at most about 20 fold, at most about 30 fold, at most about 40 fold, at most about 50 fold, at most about 60 fold, at most about 70 fold, at most about 80 fold, at most about 90 fold, at most about 100 fold, at most about 150 fold, at most about 200 fold, at most about 250 fold, at most about 300 fold, at most about 350 fold, at most about 400 fold, at most about 500 fold, at most about 600 fold, at most about 700 fold, at most about 800 fold, at most about 900 fold, at most about 1000 fold, at most about 1500 fold, at most about 2000 fold, at most about 3000 fold, at most about
5000 fold, or at most about 10000 fold.
[0154] In some embodiments, the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells) by about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, about 2-fold, about 3 fold, about 4 fold, about 5 fold, about 6 fold, about 7 fold, about 8 fold, about 9 fold, about 10 fold, about 11 fold, about 12 fold, about 13 fold, about 14, about 15 fold, about 20 fold, about 30 fold, about 40 fold, about 50 fold, about 60 fold, about 70 fold, about 80 fold, about 90 fold, about 100 fold, about 150 fold, about 200 fold, about 250 fold, about 300 fold, about 350 fold, about 400 fold, about 500 fold, about 600 fold, about 700 fold, about 800 fold, about 900 fold, about 1000 fold, about 1500 fold, about 2000 fold, about 3000 fold, about 5000 fold, or about 10000 fold.
[0155] In some embodiments, the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells) from below a limit of detection to a detectable level.
[0156] In some embodiments, the degree in change of expression is relative to before introducing the system of the present disclosure (e.g., a complex comprising the heterologous polypeptide) into the cell or population of cells. In some embodiments, the degree in change of expression is relative to a corresponding control cell or population of cells that are not treated with the system of the present disclosure. In some embodiments, the degree in change of expression is relative to a corresponding control cell or population of cells that are treated with an alternative to the system of the present disclosure.
[0157] In some embodiments, the system and method as disclosed herein can modulate (e.g., increase or decrease) an activity level of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells). An activity level can be determined by a suitable functional assay for the target gene in question depending on the functional characteristics of the target gene. For example, an activity level of a target gene that is a mitogen could be determined by measuring cell proliferation; an activity level of a target gene that induces apoptosis could be measured by an annexin V assay or other suitable cell death assay; an activity level of an anti-inflammatory cytokine could be measured by an LPS-induced cytokine release assay.
[0158] In some embodiments, the system and method as disclosed herein can modulate (e.g., increase or decrease) the activity of the target gene by at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 2-fold, at least about 3 fold, at least about 4 fold, at least about 5 fold, at least about 6 fold, at least about 7 fold, at least about 8 fold, at least about 9 fold, at least about 10 fold, at least about 11 fold, at least about 12 fold, at least about 13 fold, at least about 14, at least about 15 fold, at least about 20 fold, at least about 30 fold, at least about 40 fold, at least about 50 fold, at least about 60 fold, at least about 70 fold, at least about 80 fold, at least about 90 fold, at least about 100 fold, at least about 150 fold, at least about 200 fold, at least about 250 fold, at least about 300 fold, at least about 350 fold, at least about 400 fold, at least about 500 fold, at least about 600 fold, at least about 700 fold, at least about 800 fold, at least about 900 fold, at least about 1000 fold, at least about 1500 fold, at least about 2000 fold, or at least about 3000 fold.
[0159] In some embodiments, the system and method as disclosed herein can modulate (e.g., increase or decrease) the activity of the target gene by at most 50%, at most 60%, at most 70%, at most 80%, at most 90%, at most about 2-fold, at most about 3 fold, at most about 4 fold, at most about 5 fold, at most about 6 fold, at most about 7 fold, at most about 8 fold, at most about 9 fold, at most about 10 fold, at most about 11 fold, at most about 12 fold, at most about 13 fold, at most
about 14, at most about 15 fold, at most about 20 fold, at most about 30 fold, at most about 40 fold, at most about 50 fold, at most about 60 fold, at most about 70 fold, at most about 80 fold, at most about 90 fold, at most about 100 fold, at most about 150 fold, at most about 200 fold, at most about 250 fold, at most about 300 fold, at most about 350 fold, at most about 400 fold, at most about 500 fold, at most about 600 fold, at most about 700 fold, at most about 800 fold, at most about 900 fold, at most about 1000 fold, at most about 1500 fold, at most about 2000 fold, at most about 3000 fold, at most about 5000 fold, or at most about 10000 fold.
[0160] In some embodiments, the system to method increases the expression of the endogenous target gene encoding the target protein by at least about 0.01 fold to about 5,000 fold. In some embodiments, the system to method increases the expression of the endogenous target gene encoding the target protein by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.1 fold to about 1 fold, about 0.1 fold to about 5 fold, about 0.1 fold to about 10 fold, about 0.1 fold to about 50 fold, about 0.1 fold to about 100 fold, about 0.1 fold to about 500 fold, about 0.1 fold to about 1,000 fold, about 0.1 fold to about 5,000 fold, about 0.5 fold to about 1 fold, about 0.5 fold to about 5 fold, about 0.5 fold to about 10 fold, about 0.5 fold to about 50 fold, about 0.5 fold to about 100 fold, about 0.5 fold to about 500 fold, about 0.5 fold to about 1,000 fold, about 0.5 fold to about 5,000 fold, about 1 fold to about 5 fold, about 1 fold to about 10 fold, about 1 fold to about 50 fold, about 1 fold to about 100 fold, about 1 fold to about 500 fold, about 1 fold to about 1,000 fold, about 1 fold to about 5,000 fold, about 5 fold to about 10 fold, about 5 fold to about 50 fold, about 5 fold to about 100 fold, about 5 fold to about 500 fold, about 5 fold to about 1,000 fold, about 5 fold to about 5,000 fold, about 10 fold to about 50 fold, about 10 fold to about 100 fold, about 10 fold to about 500 fold, about 10 fold to about 1,000 fold, about 10 fold to about 5,000 fold, about 50 fold to about 100 fold, about 50 fold to about 500 fold, about 50 fold to about 1,000 fold, about 50 fold to about 5,000 fold, about 100 fold to about 500 fold, about 100 fold to about 1,000 fold, about 100 fold to about 5,000 fold, about 500 fold to about 1,000 fold, about 500 fold to about 5,000 fold, or about 1,000 fold to about 5,000 fold. In some embodiments, the system to method increases the expression of the endogenous
target gene encoding the target protein by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold. In some embodiments, the system to method increases the expression of the endogenous target gene encoding the target protein by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold. In some embodiments, the system to method increases the expression of the endogenous target gene encoding the target protein by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
[0161] In some embodiments, the system to method increases the expression of the endogenous target gene encoding the target protein, where the target protein is a PRPF31. In some embodiments, the system to method increases the expression of PRPF31 by at least about 0.01 fold to about 5,000 fold. In some embodiments, the system to method increases the expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.1 fold to about 1 fold, about 0.1 fold to about 5 fold, about 0.1 fold to about 10 fold, about 0.1 fold to about 50 fold, about 0.1 fold to about 100 fold, about 0.1 fold to about 500 fold, about 0.1 fold to about 1,000 fold, about 0.1 fold to about 5,000 fold, about 0.5 fold to about 1 fold, about 0.5 fold to about 5 fold, about 0.5 fold to about 10 fold, about 0.5 fold to about 50 fold, about 0.5 fold to about 100 fold, about 0.5 fold to about 500 fold, about 0.5 fold to about 1,000 fold, about 0.5 fold to about 5,000 fold, about 1 fold to about 5 fold, about 1 fold to about 10 fold, about 1 fold to about 50 fold, about 1 fold to about 100 fold, about 1 fold to about 500 fold, about 1 fold to about 1,000 fold, about 1 fold to about 5,000 fold, about 5 fold to about 10 fold, about 5 fold to about 50 fold, about 5 fold to about 100 fold, about 5 fold to about 500 fold, about 5 fold to about 1,000 fold, about 5 fold to about 5,000 fold, about 10 fold to about 50 fold, about 10 fold to about 100 fold, about 10 fold to about 500 fold, about 10 fold to about 1,000 fold, about 10 fold to about 5,000 fold, about 50 fold to about 100 fold, about 50 fold to about
500 fold, about 50 fold to about 1,000 fold, about 50 fold to about 5,000 fold, about 100 fold to about 500 fold, about 100 fold to about 1,000 fold, about 100 fold to about 5,000 fold, about 500 fold to about 1,000 fold, about 500 fold to about 5,000 fold, or about 1,000 fold to about 5,000 fold. In some embodiments, the system to method increases the expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold. In some embodiments, the system to method increases the expression of PRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold. In some embodiments, the system to method increases the expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
[0162] In some embodiments, the system to method increases the expression of PRPF31 without increasing expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31. In some embodiments, the system to method, without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 5,000 fold. In some embodiments, the system to method, without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.1 fold to about 1 fold, about 0.1 fold to about 5 fold, about 0.1 fold to about 10 fold, about 0.1 fold to about 50 fold, about 0.1 fold to about 100 fold, about 0.1 fold to about 500 fold, about 0.1 fold to about 1,000 fold, about 0.1 fold to about 5,000 fold, about 0.5 fold to about 1 fold, about 0.5 fold to about 5 fold, about 0.5 fold to about 10 fold, about 0.5 fold to about 50 fold, about 0.5 fold to about 100 fold, about 0.5 fold to about 500 fold, about 0.5 fold to about 1,000 fold, about 0.5 fold to about 5,000 fold, about 1 fold to about 5 fold, about 1 fold to about 10 fold, about 1 fold to
about 50 fold, about 1 fold to about 100 fold, about 1 fold to about 500 fold, about 1 fold to about 1,000 fold, about 1 fold to about 5,000 fold, about 5 fold to about 10 fold, about 5 fold to about 50 fold, about 5 fold to about 100 fold, about 5 fold to about 500 fold, about 5 fold to about 1,000 fold, about 5 fold to about 5,000 fold, about 10 fold to about 50 fold, about 10 fold to about 100 fold, about 10 fold to about 500 fold, about 10 fold to about 1,000 fold, about 10 fold to about 5,000 fold, about 50 fold to about 100 fold, about 50 fold to about 500 fold, about 50 fold to about 1,000 fold, about 50 fold to about 5,000 fold, about 100 fold to about 500 fold, about 100 fold to about 1,000 fold, about 100 fold to about 5,000 fold, about 500 fold to about 1,000 fold, about 500 fold to about 5,000 fold, or about 1,000 fold to about 5,000 fold. In some embodiments, the system to method, without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold. In some embodiments, the system to method, without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression ofPRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold. In some embodiments, the system to method, without increasing the expression of a gene neighboring PRPF31 or expression of a gene associated with PRPF31, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
[0163] In some embodiments, the system to method increases the expression of PRPF31 without increasing expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19. In some embodiments, the system to method, without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 5,000 fold. In some embodiments, the system to method, without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about
0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0 01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.1 fold to about 1 fold, about 0.1 fold to about 5 fold, about 0.1 fold to about 10 fold, about 0.1 fold to about 50 fold, about 0.1 fold to about 100 fold, about 0.1 fold to about 500 fold, about 0.1 fold to about 1,000 fold, about 0.1 fold to about 5,000 fold, about 0.5 fold to about 1 fold, about 0.5 fold to about 5 fold, about 0.5 fold to about 10 fold, about 0.5 fold to about 50 fold, about 0.5 fold to about 100 fold, about 0.5 fold to about 500 fold, about 0.5 fold to about 1,000 fold, about 0.5 fold to about 5,000 fold, about 1 fold to about 5 fold, about 1 fold to about 10 fold, about 1 fold to about 50 fold, about 1 fold to about 100 fold, about 1 fold to about 500 fold, about 1 fold to about 1,000 fold, about 1 fold to about 5,000 fold, about 5 fold to about 10 fold, about 5 fold to about 50 fold, about 5 fold to about 100 fold, about 5 fold to about 500 fold, about 5 fold to about 1,000 fold, about 5 fold to about 5,000 fold, about 10 fold to about 50 fold, about 10 fold to about 100 fold, about 10 fold to about 500 fold, about 10 fold to about 1,000 fold, about 10 fold to about 5,000 fold, about 50 fold to about 100 fold, about 50 fold to about 500 fold, about 50 fold to about 1,000 fold, about 50 fold to about 5,000 fold, about 100 fold to about 500 fold, about 100 fold to about 1,000 fold, about 100 fold to about 5,000 fold, about 500 fold to about 1,000 fold, about 500 fold to about 5,000 fold, or about 1,000 fold to about 5,000 fold. In some embodiments, the system to method, without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold. In some embodiments, the system to method, without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous expression ofPRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold. In some embodiments, the system to method, without increasing the expression of a gene neighboring PRPF31 on chromosome 19 or expression of a gene associated with PRPF31 on chromosome 19, increases the expression of PRPF31 compared to endogenous
expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
[0164] In some embodiments, the system to method increases the expression of PRPF31 without increasing expression of TCF3 Fusion Partner (TFPT). In some embodiments, the system to method, without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold to about 5,000 fold. In some embodiments, the system to method, without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression ofPRPF31 by at least about 0.01 fold to about 0.05 fold, about 0.01 fold to about 0.1 fold, about 0.01 fold to about 0.5 fold, about 0.01 fold to about 1 fold, about 0.01 fold to about 5 fold, about 0.01 fold to about 10 fold, about 0.01 fold to about 50 fold, about 0.01 fold to about 100 fold, about 0.01 fold to about 500 fold, about 0.01 fold to about 1,000 fold, about 0.01 fold to about 5,000 fold, about 0.05 fold to about 0.1 fold, about 0.05 fold to about 0.5 fold, about 0.05 fold to about 1 fold, about 0.05 fold to about 5 fold, about 0.05 fold to about 10 fold, about 0.05 fold to about 50 fold, about 0.05 fold to about 100 fold, about 0.05 fold to about 500 fold, about 0.05 fold to about 1,000 fold, about 0.05 fold to about 5,000 fold, about 0.1 fold to about 0.5 fold, about 0.1 fold to about 1 fold, about 0.1 fold to about 5 fold, about 0.1 fold to about 10 fold, about 0.1 fold to about 50 fold, about 0.1 fold to about 100 fold, about 0.1 fold to about 500 fold, about 0.1 fold to about 1,000 fold, about 0.1 fold to about 5,000 fold, about 0.5 fold to about 1 fold, about 0.5 fold to about 5 fold, about 0.5 fold to about 10 fold, about 0.5 fold to about 50 fold, about 0.5 fold to about 100 fold, about 0.5 fold to about 500 fold, about 0.5 fold to about 1,000 fold, about 0.5 fold to about 5,000 fold, about 1 fold to about 5 fold, about 1 fold to about 10 fold, about 1 fold to about 50 fold, about 1 fold to about 100 fold, about 1 fold to about 500 fold, about 1 fold to about 1,000 fold, about 1 fold to about 5,000 fold, about 5 fold to about 10 fold, about 5 fold to about 50 fold, about 5 fold to about 100 fold, about 5 fold to about 500 fold, about 5 fold to about 1,000 fold, about 5 fold to about 5,000 fold, about 10 fold to about 50 fold, about 10 fold to about 100 fold, about 10 fold to about 500 fold, about 10 fold to about 1,000 fold, about 10 fold to about 5,000 fold, about 50 fold to about 100 fold, about 50 fold to about 500 fold, about 50 fold to about 1,000 fold, about 50 fold to about 5,000 fold, about 100 fold to about 500 fold, about 100 fold to about 1,000 fold, about 100 fold to about 5,000 fold, about 500 fold to about 1,000 fold, about 500 fold to about 5,000 fold, or about 1,000 fold to about 5,000 fold. In some embodiments, the system to method, without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least about 0.01 fold, about 0.05 fold, about 0.1 fold,
about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold. In some embodiments, the system to method, without increasing the expression of TFPT, increases the expression ofPRPF31 compared to endogenous expression ofPRPF31 by at least at least about 0.01 fold, about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, or about 1,000 fold. In some embodiments, the system to method, without increasing the expression of TFPT, increases the expression of PRPF31 compared to endogenous expression of PRPF31 by at least at most about 0.05 fold, about 0.1 fold, about 0.5 fold, about 1 fold, about 5 fold, about 10 fold, about 50 fold, about 100 fold, about 500 fold, about 1,000 fold, or about 5,000 fold.
[0165] In some embodiments, the systems and methods as disclosed herein can modulate (e.g., increase or decrease) the activity of the target gene by about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, about 2-fold, about 3 fold, about 4 fold, about 5 fold, about 6 fold, about 7 fold, about 8 fold, about 9 fold, about 10 fold, about 11 fold, about 12 fold, about 13 fold, about 14, about 15 fold, about 20 fold, about 30 fold, about 40 fold, about 50 fold, about 60 fold, about 70 fold, about 80 fold, about 90 fold, about 100 fold, about 150 fold, about 200 fold, about 250 fold, about 300 fold, about 350 fold, about 400 fold, about 500 fold, about 600 fold, about 700 fold, about 800 fold, about 900 fold, about 1000 fold, about 1500 fold, about 2000 fold, about 3000 fold, about 5000 fold, or about 10000 fold. [0166] In some embodiments, the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression of a target gene (e.g., upon introducing a complex comprising the heterologous polypeptide into a cell or population of cells) from below a limit of detection to a detectable level.
[0167] In some embodiments, the degree in change of an activity level is relative to before introducing the system of the present disclosure (e.g., a complex comprising the heterologous polypeptide) into the cell or population of cells. In some embodiments, the degree in change of an activity level is relative to a corresponding control cell or population of cells that are not treated with the system of the present disclosure. In some embodiments, the degree in change of an activity level is relative to a corresponding control cell or population of cells that are treated with an alternative to the system of the present disclosure.
[0168] The systems and methods of the present disclosure can, in some cases, elicit changes in expression and/or activity level of a target gene (e.g., target endogenous gene) that persists for longer than can be achieved with alternative compositions and methods (e.g., suppression via
RNAi, e.g., using siRNA). In some embodiments, persistent modulation of gene expression is advantageous as compared to transient modulation
[0169] In some embodiments, the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression and/or activity level of a target gene for at least about 1 hour, at least about 2 hours, at least about 3 hours, at least about 4 hours, at least about 5 hours, at least about 6 hours, at least about 7 hours, at least about 8 hours, at least about 9 hours, at least about 10 hours, at least about 12 hours, at least about 14 hours, at least about 18 hours, at least about 20 hours, at least about 1 day, at least about 2 days, at least about 3 days, at least about 4 days, at least about 5 days, at least about 6 days, at least about 7 days, at least about 8 days, at least about
9 days, at least about 10 days, at least about 14 days, at least about 21 days, at least about 28 days, at least about 5 weeks, at least about 6 weeks, at least about 7 weeks, at least about 8 weeks, at least about 9 weeks, at least about 10 weeks, at least about 12 weeks, at least about 14 weeks, at least about 18 weeks, at least about 20 weeks, at least about 26 weeks, or at least about 5 months, at least about 6 months, at least about 9 months, at least about 12 months, or more. [0170] In some embodiments the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression and/or activity level of a target gene (e.g., target endogenous gene) to above a certain threshold for at most about 1 hour, at most about 2 hours, at most about 3 hours, at most about 4 hours, at most about 5 hours, at most about 6 hours, at most about 7 hours, at most about 8 hours, at most about 9 hours, at most about 10 hours, at most about 12 hours, at most about 14 hours, at most about 18 hours, at most about 20 hours, at most about 1 day, at most about 2 days, at most about 3 days, at most about 4 days, at most about 5 days, at most about 6 days, at most about 7 days, at most about 8 days, at most about 9 days, at most about 10 days, at most about 14 days, at most about 21 days, at most about 28 days, at most about 5 weeks, at most about 6 weeks, at most about 7 weeks, at most about 8 weeks, at most about 9 weeks, at most about 10 weeks, at most about 12 weeks, at most about 14 weeks, at most about 18 weeks, at most about 20 weeks, at most about 26 weeks, or at most about 5 months, at most about 6 months, at most about 9 months, at most about 12 months, or more.
[0171] In some embodiments, the systems and methods as disclosed herein can modulate (e.g., increase or decrease) expression and/or activity level of a target gene (e.g., target endogenous gene) to above a certain threshold for about 1 hour, about 2 hours, about 3 hours, about 4 hours, about 5 hours, about 6 hours, about 7 hours, about 8 hours, about 9 hours, about 10 hours, about 12 hours, about 14 hours, about 18 hours, about 20 hours, about 1 day, about 2 days, about 3 days, about 4 days, about 5 days, about 6 days, about 7 days, about 8 days, about 9 days, about
10 days, about 14 days, about 21 days, about 28 days, about 5 weeks, about 6 weeks, about 7
weeks, about 8 weeks, about 9 weeks, about 10 weeks, about 12 weeks, about 14 weeks, about 18 weeks, about 20 weeks, about 26 weeks, about 5 months, about 6 months, about 9 months, or about 12 months.
[0172] In some embodiments, the target gene (e.g., endogenous target gene) can be a diseasecausing allele, such as a mutant variant of a wild type allele. The disease can be a genetic disease, such as a hereditary disorder. Non-limiting examples of the genetic disorder can include Duchenne muscular dystrophy (DMD), hemophilia, cystic fibrosis, Huntington's chorea, familial hypercholesterolemia (LDL receptor defect), hepatoblastoma, Wilson's disease, congenital hepatic porphyria, inherited disorders of hepatic metabolism, Lesch Nyhan syndrome, sickle cell anemia, thalassaemias, xeroderma pigmentosum, Fanconi's anemia, retinitis pigmentosa, ataxia telangiectasia, Bloom's syndrome, retinoblastoma, and Tay-Sachs disease. In some cases, the target gene can be a gene encoding a protein. In some cases, the target gene can be a gene regulatory sequence (e.g., promoters, enhancers, repressors, silencers, insulators, cis-regulatory elements, trans-regulatory elements, epigenetic modification (e.g., DNA methylation) sites, etc.) that can influence expression of a gene encoding a protein of interest as provided herein. For example, target gene regulatory sequences can be physically located outside of the transcriptional unit or open reading frame that encodes a product of the target gene.
[0173] In some embodiments, a target gene regulatory sequence does not contain a nucleotide sequence that is exogenous to the subject or host cell. In some embodiments, a target gene regulatory sequence does not contain an engineered or artificially generated or introduced nucleotide sequence.
[0174] In some embodiments, a target gene (e.g., target endogenous gene) is a gene that is overexpressed or under-expressed in a disease or condition. In some embodiments, a target gene is a gene that is over-expressed or under-expressed in a heritable genetic disease. In some embodiments, the disease is retinitis pigmentosa 11 (RP11).
Heterologous polynucleotide
[0175] In some embodiments, a target gene (e.g., an endogenous target gene) can be a disease causing gene (e.g., a mutant allele), and the systems and compositions of the present disclosure can further comprise a heterologous polynucleotide encoding a non-disease causing gene thereof (e.g., a wild type allele), e.g., as a gene replacement therapy. Accordingly, the methods as disclosed herein can comprise introducing such system or compositions to a cell or to a subject, e.g., contacting the cell with such systems or compositions (e.g., via delivery or expression of such systems or compositions in the cell).
[0176] Thus, the systems and compositions can comprise the non-disease causing wild type or variant of the target gene, as abovementioned. Alternatively or in addition to, the systems and compositions can comprise a heterologous polynucleotide sequence encoding (or comprising) at least the non-disease causing wild type or variant of the target gene (e.g., that of the endogenous target gene) as disclosed herein.
Composition
[0177] In some aspects, the present disclosure provides a composition comprising at least a portion of the system as described, e.g., (i) the heterologous polypeptide comprising the actuator moiety or a heterologous polynucleotide encoding the heterologous polypeptide, (ii) the guide nucleic acid or a heterologous polynucleotide encoding the guide nucleic acid, as disclosed herein, (iii) the heterologous polynucleotide encoding a non-disease causing allele of a gene, for use in any of the methods as disclosed herein. The subject composition can be usable for modifying a cell in vitro, ex vivo, or in vivo. The subject composition can be usable for treating or enhancing a condition of a subject, as disclosed herein.
[0178] The composition as disclosed herein can comprise an active ingredient (e.g., the heterologous polypeptide comprising the actuator moiety, the guide nucleic acid, the heterologous polynucleotide encoding the non-disease causing allele of a gene, etc.) and optionally an additional ingredient (e.g., excipient). If necessary and/or desirable, the composition can be divided, shaped and/or packaged into a desired single- or multi-dose unit or single-or multi-implantation unit.
[0179] In some embodiments, the composition can comprise one or more heterologous polynucleotides encoding the active ingredients as disclosed herein. When there are different members within the active ingredients, each member can be encoded by a different heterologous polynucleotide. Alternatively, two or more (e.g., all of) the ingredients can be encoded by a single heterologous polynucleotide. In some cases, a single heterologous polynucleotide an encode (i) the heterologous polypeptide comprising the actuator moiety (e.g., dCas- transcriptional effector fusion protein, such as dCas-KRAB or dCas-DNMT) and (ii) one or more guide nucleic acids (e g., at least 1, at least 2, at least 3, at least 4, at least 5, or more guide nucleic acids) for targeting specific region(s) or sequence(s) of the target gene. In some cases, a single heterologous polynucleotide an encode (i) the heterologous polypeptide comprising the actuator moiety (e.g., dCas-transcriptional effector fusion protein, such as dCas-KRAB, dCas- DNMT, or dCas-VPR), (ii) one or more guide nucleic acids (e.g., at least 1, at least 2, at least 3, at least 4, at least 5, or more guide nucleic acids) for targeting specific region(s) or sequence(s) of the target gene, and (iii) the heterologous polynucleotide encoding a non-disease causing allele of
a gene.
[0180] The one or more heterologous polynucleotides can further comprise one or more promoters (or one or more transcriptional control elements, as used interchangeably herein). Different active ingredients encoded by the one or more heterologous polynucleotides can be under the control of the same promoter or different promoters. A promoter as disclosed herein can be active in a eukaryotic, mammalian, non-human mammalian or human cell. The promoter can be an inducible or constitutively active promoter. Alternatively or additionally, the promoter can be tissue or cell specific. Non-limiting examples of suitable eukaryotic promoters (i.e. promoters functional in a eukaryotic cell) can include those from cytomegalovirus (CMV) immediate early, herpes simplex virus (HSV) thymidine kinase, early and late SV40, long terminal repeats (LTRs) from retrovirus, human elongation factor-1 promoter (EFl), a hybrid construct comprising the cytomegalovirus (CMV) enhancer fused to the chicken beta-active promoter (CAG), murine stem cell virus promoter (MSCV), phosphoglycerate kinase- 1 locus promoter (PGK) and mouse metallothionein-I. The promoter can be a fungi promoter. The promoter can be a plant promoter. A database of plant promoters can be found (e.g., PlantProm). The expression vector may also contain a ribosome binding site for translation initiation and a transcription terminator. The expression vector may also include appropriate sequences for amplifying expression. In some cases, a promoter as disclosed herein can be a promoter specific for any of the tissues provided herein, or a promoter specific for any of the cell types provided herein.
[0181] A heterologous polynucleotide of the one or more heterologous polynucleotides (e.g., the single heterologous polynucleotide) can have a size of at least or up to about 2.5 kilobases, at least or up to about 2.6 kilobases, at least or up to about 2.7 kilobases, at least or up to about 2.8 kilobases, at least or up to about 2.9 kilobases, at least or up to about 3.0 kilobases, at least or up to about 3.1 kilobases, at least or up to about 3.2 kilobases, at least or up to about 3.3 kilobases, at least or up to about 3.4 kilobases, at least or up to about 3.5 kilobases, at least or up to about 3.6 kilobases, at least or up to about 3.7 kilobases, at least or up to about 3.8 kilobases, at least or up to about 3.9 kilobases, at least or up to about 4.0 kilobases, at least or up to about 4.1 kilobases, at least or up to about 4.2 kilobases, at least or up to about 4.3 kilobases, at least or up to about 4.4 kilobases, at least or up to about 4.5 kilobases, at least or up to about 4.6 kilobases, at least or up to about 4.7 kilobases, at least or up to about 4.8 kilobases, at least or up to about 4.9 kilobases, at least or up to about 5.0 kilobases, at least or up to about 5.5 kilobases, at least or up to about 6.0 kilobases, at least or up to about 6.5 kilobases, at least or up to about 7.0 kilobases, at least or up to about 7.5 kilobases, at least or up to about 8.0 kilobases, at least or up
to about 9.0 kilobases, or at least or up to about 10 kilobases. In some cases, the heterologous polynucleotide of the one or more heterologous polynucleotides (e.g., the single heterologous polynucleotide) can have a size of between about 3 kilobases and about 5 kilobases, between about 3 kilobases and about 4.8 kilobases, between about 3 kilobases and about 4.6 kilobases, between about 3 kilobases and about 4.4 kilobases, between about 3 kilobases and about 4.2 kilobases, between about 3 kilobases and about 4.0 kilobases, between about 3 kilobases and about 3.5 kilobases, between about 3.5 kilobases and about 5 kilobases, between about 3.5 kilobases and about 4.8 kilobases, between about 3.5 kilobases and about 4.6 kilobases, between about 3.5 kilobases and about 4.4 kilobases, between about 3.5 kilobases and about 4.2 kilobases, between about 3.5 kilobases and about 4 kilobases, between about 4 kilobases and about 5 kilobases, between about 4 kilobases and about 4.9 kilobases, between about 4 kilobases and about 4.8 kilobases, between about 4 kilobases and about 4.7 kilobases, between about 4 kilobases and about 4.6 kilobases, between about 4 kilobases and about 4.5 kilobases, between about 4 kilobases and about 4.4 kilobases, between about 4 kilobases and about 4.3 kilobases, between about 4 kilobases and about 4.2 kilobases, or between about 4 kilobases and about 4.1 kilobases.
[0182] A method of delivery of the one or more heterologous polynucleotides provided herein to the cell can involve viral delivery methods or non-viral delivery methods. Thus, the one or more heterologous polynucleotides can be one or more viral vectors (e.g., one or more AAV vectors). Alternatively, the one or more heterologous polynucleotides can be non-viral vectors that are complexed with or encapsulated by non-viral delivery moieties, such as cationic lipids and/or lipid particles (e.g., lipid nanoparticles (LNP)).
[0183] Methods of non-viral delivery of nucleic acids can include lipofection, nucleofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipidmucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA. Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides can be used. Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration).
[0184] In some embodiments, the compositions and systems provided herein are delivered to a subject using a viral vector. In some cases, the viral vector is an adeno-associated viral (AAV) vector. The term “AAV” is an abbreviation for adeno-associated virus, and may be used to refer to the virus itself or a derivative thereof. The term covers all serotypes, subtypes, and both naturally occurring and recombinant forms, except where required otherwise. The abbreviation “rAAV” refers to recombinant adeno-associated virus, also referred to as a recombinant AAV
vector (or “rAAV vector”). The term “AAV” includes AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, A AVI 0, AAV11, AAV12, rhlO, and hybrids thereof, avian AAV, bovine AAV, canine AAV, equine AAV, primate AAV, non-primate AAV, and ovine AAV. The genomic sequences of various serotypes of AAV, as well as the sequences of the native terminal repeats (TRs), Rep proteins, and capsid subunits are known in the art. Such sequences may be found in the literature or in public databases such as GenBank. An “rAAV vector” as used herein refers to an AAV vector comprising a polynucleotide sequence not of AAV origin (i.e., a polynucleotide heterologous to AAV), typically a sequence of interest for the genetic transformation of a cell. In general, the heterologous polynucleotide is flanked by at least one, and generally by two, AAV inverted terminal repeat sequences (ITRs). The term rAAV vector encompasses both rAAV vector particles and rAAV vector plasmids. An rAAV vector may either be single-stranded (ssAAV) or self-complementary (scAAV). An “AAV virus” or “AAV viral particle” or “rAAV vector particle” refers to a viral particle composed of at least one AAV capsid protein and an encapsidated polynucleotide rAAV vector. If the particle comprises a heterologous polynucleotide (i.e., a polynucleotide other than a wild-type AAV genome such as a transgene to be delivered to a mammalian cell), it is typically referred to as an “rAAV vector particle” or simply an “rAAV vector”. Thus, production of rAAV particle necessarily includes production of rAAV vector, as such a vector is contained within an rAAV particle. In some cases, the AAV vector is selected based on the tropism of viral vector. In some embodiments, an AAV vector with tropism for the target tissue (e.g., eye) may be used (e.g., AAV7, AAV8, AAV9) to deliver polynucleotides encoding the compositions and systems provided herein to the target tissue (e.g., eye).
[0185] RNA or DNA viral based systems can be used to target specific cells in the body and trafficking the viral payload to the nucleus of the cell. Viral vectors can be administered directly (in vivo) or they can be used to treat cells in vitro, and the modified cells can optionally be administered (ex vivo). Viral based systems can include retroviral, lentivirus, adenoviral, adeno- associated and herpes simplex virus vectors for gene transfer. Integration in the host genome can occur with the retrovirus, lentivirus, and adeno-associated virus gene transfer methods, which can result in long term expression of the inserted transgene. High transduction efficiencies can be observed in many different cell types and target tissues.
[0186] The tropism of a retrovirus can be altered by incorporating foreign envelope proteins, expanding the potential target population of target cells. Lentiviral vectors are retroviral vectors that can transduce or infect non-dividing cells and produce high viral titers. Selection of a retroviral gene transfer system can depend on the target tissue. Retroviral vectors can comprise
cis-acting long terminal repeats with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis-acting LTRs can be sufficient for replication and packaging of the vectors, which can be used to integrate the therapeutic gene into the target cell to provide permanent transgene expression. Retroviral vectors can include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), Simian Immuno deficiency virus (SIV), human immuno deficiency virus (HIV), and combinations thereof.
[0187] An adenoviral-based systems can be used. Adenoviral-based systems can lead to transient expression of the transgene. Adenoviral based vectors can have high transduction efficiency in cells and may not require cell division. High titer and levels of expression can be obtained with adenoviral based vectors. Adeno-associated virus (“AAV”) vectors can be used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures.
[0188] Packaging cells can be used to form virus particles capable of infecting a host cell. Such cells can include 293 cells, (e.g., for packaging adenovirus), and Psi2 cells or PA317 cells (e.g., for packaging retrovirus). Viral vectors can be generated by producing a cell line that packages a nucleic acid vector into a viral particle. The vectors can contain the minimal viral sequences required for packaging and subsequent integration into a host. The vectors can contain other viral sequences being replaced by an expression cassette for the polynucleotide(s) to be expressed. The missing viral functions can be supplied in trans by the packaging cell line. For example, AAV vectors can comprise ITR sequences from the AAV genome which are required for packaging and integration into the host genome. Viral DNA can be packaged in a cell line, which can contain a helper plasmid encoding the other AAV genes, namely rep and cap, while lacking ITR sequences. The cell line can also be infected with adenovirus as a helper. The helper virus can promote replication of the AAV vector and expression of AAV genes from the helper plasmid. Contamination with adenovirus can be reduced by, e g., heat treatment to which adenovirus is more sensitive than AAV.
[0189] A host cell can be transiently or non-transiently transfected with one or more vectors described herein. A cell can be transfected as it naturally occurs in a subject. A cell can be taken or derived from a subject and transfected. A cell can be derived from cells taken from a subject, such as a cell line. In some embodiments, a cell transfected with one or more vectors described herein is used to establish a new cell line comprising one or more vector-derived sequences. In some embodiments, a cell transiently transfected with the compositions of the disclosure (such as by transient transfection of one or more vectors, or transfection with RNA), and modified through the activity of a an actuator moiety such as a CRISPR complex, is used to establish a
new cell line comprising cells containing the modification but lacking any other exogenous sequence.
[0190] Any suitable vector compatible with the host cell can be used with the methods of the disclosure. Non-limiting examples of vectors for eukaryotic host cells include pXTl, pSG5 (Stratagene™), pSVK3, pBPV, pMSG, and pSVLSV40 (Pharmacia™).
[0191] In some embodiments, the additional ingredient of the composition as disclosed herein can comprise an excipient. Non-limiting examples of the excipient can include solvents, dispersion media, diluents, or other liquid vehicles, dispersion or suspension aids, surface active agents, isotonic agents, thickening or emulsifying agents, preservatives, lipidoids, liposomes, lipid nanoparticles, polymers, lipoplexes, core-shell nanoparticles, peptides, proteins, hyaluronidase, nanoparticle mimics, inert diluents, buffering agents, lubricating agents, oils, and combinations thereof. In some examples, the composition as disclosed herein can include one or more excipients, each in an amount that together increases the stability of (i) the heterologous polypeptide or the heterologous gene encoding thereof and/or (ii) cells or modified cells.
[0192] In some aspects, the present disclosure provides a kit comprising such composition and instructions directing (i) contacting the cell with the composition (e.g., in vitro, ex vivo, or in vivo), or (ii) administration of cells comprising any one of the compositions disclosed herein to a subject. The subject may have or may be suspected of having a condition, such as a hereditary disease.
[0193] In some embodiments, any of the compositions as disclosed herein, can be administered to the subject via orally, intraperitoneally, intravenously, intraarterially, transdermally, intramuscularly, liposomally, via local delivery by catheter or stent, subcutaneously, intraadiposally, or intrathecally. In particular aspects, the compositions and systems provided herein (including polynucleotides encoding said compositions and systems, e.g., contained in an AAV vector) can be administered to a subject via intravenous administration.
[0194] The compositions (e.g., pharmaceutical compositions) as disclosed herein can be suitable for administration to humans. In addition, such compositions can be suitable for administration to any other animal, e.g., to non-human animals, e.g. non-human mammals. Modification of pharmaceutical compositions suitable for administration to humans in order to render the compositions suitable for administration to various animals is well understood, and the ordinarily skilled veterinary pharmacologist can design and/or perform such modification with merely ordinary, if any, experimentation. Subjects to which administration of the pharmaceutical compositions is contemplated include, but are not limited to, humans and/or other primates; mammals, including commercially relevant mammals such as cattle, pigs, horses, sheep, cats,
dogs, mice, and/or rats; and/or birds, including commercially relevant birds such as poultry, chickens, ducks, geese, and/or turkeys.
Cells
[0195] In some embodiments, a cell as provided herein may be referred to as a target cell. In some embodiments, the systems, compositions, and methods as provided herein can be applied to modify a target cell (e.g., modify expression profile of a target gene of the target cell). A target cell can include a wide variety of cell types. A target cell can be in vitro. A target cell can be in vivo. A target cell can be ex vivo. A target cell can be an isolated cell. A target cell can be a cell inside of an organism. A target cell can be an organism. A target cell can be a cell in a cell culture. A target cell can be one of a collection of cells. A target cell can be a mammalian cell or derived from a mammalian cell. A target cell can be a rodent cell or derived from a rodent cell. A target cell can be a human cell or derived from a human cell. A target cell can be a prokaryotic cell or derived from a prokaryotic cell. A target cell can be a bacterial cell or can be derived from a bacterial cell. A target cell can be an archaeal cell or derived from an archaeal cell. A target cell can be a eukaryotic cell or derived from a eukaryotic cell. A target cell can be a pluripotent stem cell. A target cell can be a plant cell or derived from a plant cell. A target cell can be an animal cell or derived from an animal cell. A target cell can be an invertebrate cell or derived from an invertebrate cell. A target cell can be a vertebrate cell or derived from a vertebrate cell. A target cell can be a microbe cell or derived from a microbe cell. A target cell can be a fungi cell or derived from a fungi cell. A target cell can be from a specific organ or tissue.
[0196] A target cell can be a stem cell or progenitor cell. Target cells can include stem cells (e.g., adult stem cells, embryonic stem cells, induced pluripotent stem (iPS) cells) and progenitor cells (e.g., cardiac progenitor cells, neural progenitor cells, etc.). Target cells can include mammalian stem cells and progenitor cells, including rodent stem cells, rodent progenitor cells, human stem cells, human progenitor cells, etc. Clonal cells can comprise the progeny of a cell. A target cell can comprise a target nucleic acid. A target cell can be in a living organism. A target cell can be a genetically modified cell. A target cell can be a host cell.
[0197] A target cell can be a primary cell. For example, cultures of primary cells can be passaged 0 times, 1 time, 2 times, 4 times, 5 times, 10 times, 15 times or more. Cells can be unicellular organisms. Cells can be grown in culture.
[0198] A target cell can be a diseased cell. A diseased cell can have altered metabolic, gene expression, and/or morphologic features. A diseased cell can be a cancer cell, a diabetic cell, and a apoptotic cell. A diseased cell can be a cell from a diseased subject. Exemplary diseases can include blood disorders, cancers, metabolic disorders, eye disorders, organ disorders,
musculoskeletal disorders, cardiac disease, and the like.
[0199] If the target cells are primary cells, they may be harvested from an individual by any method. For example, leukocytes may be harvested by apheresis, leukocytapheresis, density gradient separation, etc. Cells from tissues such as skin, muscle, bone marrow, spleen, liver, pancreas, lung, intestine, stomach, etc. can be harvested by biopsy.
[0200] Non-limiting examples of cells which can be target cells include, but are not limited to, lymphoid cells, such as B cell, T cell (Cytotoxic T cell, Natural Killer T cell, Regulatory T cell, T helper cell), Natural killer cell, cytokine induced killer (CIK) cells; myeloid cells, such as granulocytes (Basophil granulocyte, Eosinophil granulocyte, Neutrophil granulocyte/Hypersegmented neutrophil), Monocyte/Macrophage, Red blood cell (Reticulocyte), Mast cell, Thrombocyte/Megakaryocyte, Dendritic cell; cells from the endocrine system, including thyroid (Thyroid epithelial cell, Parafollicular cell), parathyroid (Parathyroid chief cell, Oxyphil cell), adrenal (Chromaffin cell), pineal (Pinealocyte) cells; cells of the nervous system, including glial cells (Astrocyte, Microglia), Magnocellular neurosecretory cell, Stellate cell, Boettcher cell, and pituitary (Gonadotrope, Corticotrope, Thyrotrope, Somatotrope, Lactotroph); cells of the Respiratory system, including Pneumocyte (Type I pneumocyte, Type II pneumocyte), Clara cell, Goblet cell, Dust cell; cells of the circulatory system, including Myocardiocyte, Pericyte; cells of the digestive system, including stomach (Gastric chief cell, Parietal cell), Goblet cell, Paneth cell, G cells, D cells, ECL cells, I cells, K cells, S cells; enteroendocrine cells, including enterochromaffin cell, APUD cell, liver (Hepatocyte, Kupffer cell), Cartilage/bone/muscle; bone cells, including Osteoblast, Osteocyte, Osteoclast, teeth (Cementoblast, Ameloblast); cartilage cells, including Chondroblast, Chondrocyte; skin cells, including Trichocyte, Keratinocyte, Melanocyte (Nevus cell); muscle cells, including Myocyte; urinary system cells, including Podocyte, Juxtaglomerular cell, Intraglomerular mesangial cell/Extraglomerular mesangial cell, Kidney proximal tubule brush border cell, Macula densa cell; reproductive system cells, including Spermatozoon, Sertoli cell, Leydig cell, Ovum; and other cells, including Adipocyte, Fibroblast, Tendon cell, Epidermal keratinocyte (differentiating epidermal cell), Epidermal basal cell (stem cell), Keratinocyte of fingernails and toenails, Nail bed basal cell (stem cell), Medullary hair shaft cell, Cortical hair shaft cell, Cuticular hair shaft cell, Cuticular hair root sheath cell, Hair root sheath cell of Huxley's layer, Hair root sheath cell of Henle's layer, External hair root sheath cell, Hair matrix cell (stem cell), Wet stratified barrier epithelial cells, Surface epithelial cell of stratified squamous epithelium of cornea, tongue, oral cavity, esophagus, anal canal, distal urethra and vagina, basal cell (stem cell) of epithelia of cornea, tongue, oral cavity, esophagus, anal canal, distal urethra and vagina, Urinary epithelium
cell (lining urinary bladder and urinary ducts), Exocrine secretory epithelial cells, Salivary gland mucous cell (polysaccharide-rich secretion), Salivary gland serous cell (glycoprotein enzymerich secretion), Von Ebner's gland cell in tongue (washes taste buds), Mammary gland cell (milk secretion), Lacrimal gland cell (tear secretion), Ceruminous gland cell in ear (wax secretion), Eccrine sweat gland dark cell (glycoprotein secretion), Eccrine sweat gland clear cell (small molecule secretion). Apocrine sweat gland cell (odoriferous secretion, sex -hormone sensitive), Gland of Moll cell in eyelid (specialized sweat gland), Sebaceous gland cell (lipid-rich sebum secretion), Bowman's gland cell in nose (washes olfactory epithelium), Brunner's gland cell in duodenum (enzymes and alkaline mucus), Seminal vesicle cell (secretes seminal fluid components, including fructose for swimming sperm), Prostate gland cell (secretes seminal fluid components), Bulbourethral gland cell (mucus secretion), Bartholin's gland cell (vaginal lubricant secretion), Gland of Littre cell (mucus secretion), Uterus endometrium cell (carbohydrate secretion), Isolated goblet cell of respiratory and digestive tracts (mucus secretion), Stomach lining mucous cell (mucus secretion), Gastric gland zymogenic cell (pepsinogen secretion), Gastric gland oxyntic cell (hydrochloric acid secretion), Pancreatic acinar cell (bicarbonate and digestive enzyme secretion), Paneth cell of small intestine (lysozyme secretion), Type II pneumocyte of lung (surfactant secretion), Clara cell of lung, Hormone secreting cells, Anterior pituitary cells, Somatotropes, Lactotropes, Thyrotropes, Gonadotropes, Corticotropes, Intermediate pituitary cell, Magnocellular neurosecretory cells, Gut and respiratory tract cells, Thyroid gland cells, thyroid epithelial cell, parafollicular cell, Parathyroid gland cells, Parathyroid chief cell, Oxyphil cell, Adrenal gland cells, chromaffin cells, Ley dig cell of testes, Theca interna cell of ovarian follicle, Corpus luteum cell of ruptured ovarian follicle, Granulosa lutein cells, Theca lutein cells, Juxtaglomerular cell (renin secretion), Macula densa cell of kidney, Metabolism and storage cells, Barrier function cells (Lung, Gut, Exocrine Glands and Urogenital Tract), Kidney, Type I pneumocyte (lining air space of lung), Pancreatic duct cell (centroacinar cell), Nonstriated duct cell (of sweat gland, salivary gland, mammary gland, etc ), Duct cell (of seminal vesicle, prostate gland, etc.), Epithelial cells lining closed internal body cavities, Ciliated cells with propulsive function, Extracellular matrix secretion cells, Contractile cells; Skeletal muscle cells, stem cell, Heart muscle cells, Blood and immune system cells, Erythrocyte (red blood cell), Megakaryocyte (platelet precursor), Monocyte, Connective tissue macrophage (various types), Epidermal Langerhans cell, Osteoclast (in bone), Dendritic cell (in lymphoid tissues), Microglial cell (in central nervous system), Neutrophil granulocyte, Eosinophil granulocyte, Basophil granulocyte, Mast cell, Helper T cell, Suppressor T cell, Cytotoxic T cell, Natural Killer T cell, B cell, Natural killer cell, Reticulocyte, Stem cells
and committed progenitors for the blood and immune system (various types), Pluripotent stem cells, Totipotent stem cells, Induced pluripotent stem cells, adult stem cells, Sensory transducer cells, Autonomic neuron cells, Sense organ and peripheral neuron supporting cells, Central nervous system neurons and glial cells, Lens cells, Pigment cells, Melanocyte, Retinal pigmented epithelial cell, Germ cells, Oogonium/Oocyte, Spermatid, Spermatocyte, Spermatogonium cell (stem cell for spermatocyte), Spermatozoon, Nurse cells, Ovarian follicle cell, Sertoli cell (in testis), Thymus epithelial cell, Interstitial cells, and Interstitial kidney cells.
[0201] The cell (or target cell) can be engineered to comprise (or exhibit) any one of the systems or compositions as disclosed herein or can be treated by any one of the methods disclosed herein in vitro or ex vivo, then administered to the subject, e.g., to treat a condition of the subject. For example, any subject modified cell product can be administered to the subject to treat a condition of a bodily tissue of the subject. In some cases, the cell can be resident inside the subject’s body, and any of the systems or compositions thereof can be administered to the subject, to contact the cell by the systems/compositions (e.g., to engineer the cell with the systems/compositions).
EXAMPLES
Example 1. Gene expression modulation
[0202] Gene expression can be modulated in a cell by utilizing a system or a method described herein. In some cases, the gene being modulated by the system or the method can be a mutant allele that can cause a disease or condition in a subject. In some cases, the gene being modulated can be a non-disease causing variant (e.g. a wild-type allele). In some embodiments, the gene expression can modulated by the system or the method described herein by both decreasing the expression of the mutant allele in a cell and simultaneously increasing expression of the wildtype allele. In some cases, the wild-type allele is encoded by at least one of the heterologous polynucleotides described herein. FIG. 1 illustrates an exemplary construct encoding the dCas and the actuator moiety (effector). dCas can be coupled with a transcription repressor for decreasing expression of the expression of the mutant allele of the endogenous target gene in the cell. FIG. 2 illustrates a schematic for treating retinitis pigmentosa with the system described herein. AAV can be engineered to deliver an exemplary construct via subretinal or intravitreal injection to a subject in need thereof.
[0203] The modulation of the endogenous target gene expression by the system or method described herein can be used to treat a disease or condition in a subject. A subject suspected of having a disease or condition associated with mutation of the endogenous target gene can be first screened for the presence of a mutant allele of the endogenous target gene. Afterwards, the system described herein can be administered to the subject to simultaneously decrease expression
of the mutant allele of endogenous target gene and increase expression of the non-disease causing allele of endogenous target gene.
Example 2. PRPF31 expression modulation
[0204] PRPF31 expression can be modulated in a cell by utilizing a system or a method described herein. In some cases, the PRPF31 is a mutant allele ofPRPF31. In some cases, the mutant PRPF31 can cause disease in a subject. In some cases, the PRPF31 is a non-disease causing variant of PRPF31 (e.g. wild type allele of PRPF31). In some embodiments, the PRPF31 expression can modulated by the system or the method described herein by both decreasing the expression of the mutant allele of PRPF31 in a cell and simultaneously increasing expression of the wild type allele of PRPF31. In some cases, the wild type allele of PRPF31 is encoded by at least one of the heterologous polynucleotides described herein. FIG. 3 illustrates exemplary target endogenous polynucleotide sequences (e.g., transcripts) that can be targeted by the gRNA of the system and the method described herein for decreasing or increasing the expression of PRPF31. The modulation of the PRPF31 expression by the system or method described herein can be used to treat a disease or condition in a subject. A subject suspected of having a disease or condition associated with PRPF31 mutation can be first screened for the presence of PRPF31 variant (e.g., a mutant allele of PRPF31).
Example 3. Expression cassettes (or constructs).
[0205] The systems as provided herein can be delivered to a cell via one or more expression cassettes encoding one or more components of the systems. The one or more expression cassettes can comprise a vector, such as a viral vector (e.g., an AAV vector comprising two Inverted terminal repeats (ITRs)). FIGs. 4A-4F schematically illustrate examples of a single vector encoding one or more components the system as provided herein.
[0206] FIG. 4A shows a schematic representation of a construct in which a RNA Pol II promoter drives the expression of a nuclear localization signal (NSL), a deactivated Cas (dCas), a protein linker (e.g., a GS linker), a modulator (e.g., a transcriptional effector), and Poly A signal placed at the 3 ’-end of the modulator. Additionally, the construct includes a RNA Pol III promoter that drives the expression of a guide RNA (gRNA) exhibiting specific binding against an endogenous target gene (e.g., encoding PRPF3 l)-terminator sequence, which is placed at the 3’-end of the poly A signal.
[0207] FIG. 4B shows a schematic representation of a construct in which a RNA Pol III promoter that drives the expression of a guide RNA (gRNA) exhibiting specific binding against
an endogenous target gene (e.g., encoding PRPF31)-terminator sequence, and a terminator sequence placed at the 3 ’-end of the gRNA. Additionally, the construct includes, downstream of the 3 ’-end of the terminator sequence, a RNA Pol II promoter drives the expression of a nuclear localization signal (NSL), a deactivated Cas (dCas), a protein linker (e.g., a GS linker), a modulator (e.g., a transcriptional effector), and Poly A signal placed at the 3’-end of the modulator.
[0208] FIG. 4C shows a schematic representation of a construct in which a RNA Pol II promoter drives the expression of a nuclear localization signal (NSL), a deactivated Cas (dCas), a protein linker (e.g., a GS linker), a modulator (e.g., a transcriptional effector), and Poly A signal placed at the 3 ’-end of the modulator. Additionally, the construct includes a RNA Pol III promoter that drives the expression of a guide RNA (gRNA) exhibiting specific binding against an endogenous target gene (e.g., encoding PRPF3 l)-terminator sequence. The RNA Pol III promoter (and the components under the control of such promoter) is downstream and on the opposite strand of the RNA Pol II promoter (and the components under the control of such promoter).
[0209] FIG. 4D shows a schematic representation of a construct similar to that shown in FIG. 4A. As compared to that in FIG. 4A, the RNA Pol II promoter (and the components under the control of such promoter) in the construct of FIG. 4D is further downstream away from the 5’ ITR and towards the 3 ’ ITR.
[0210] FIG. 4E shows a schematic representation of a construct similar to that shown in FIG. 4B. As compared to that in FIG. 4B, the RNA Pol II promoter (and the components under the control of such promoter) in the construct of FIG. 4E is further upstream towards the 5’ ITR and away from the 3 ’ ITR.
[0211] FIG. 4F shows a schematic representation of a construct similar to that shown in FIG. 4C. As compared to that in FIG. 4C, the RNA Pol II promoter (and the components under the control of such promoter) in the construct of FIG. 4F is further upstream towards the 5’ ITR and away from the 3 ’ ITR.
Example 4. gRNA screening.
[0212] Populations of retinal pigment epithelium (RPE) cells (e g., human RPE cells) were engineered with a library of gRNAs against different portions of PRPF31 gene, along with a dCas-modulator (e g., transcription activator such as VPR) fusion protein (e.g., via transduction with AAV vectors comprising of one of the constructs discussed in Example 3). Subsequently, expression level of PRPF31 (e.g., a non-disease causing PRPF31 variant) in the RPE cells was measured (e.g., via a gene expression assay such as enzyme-linked immunoassay (ELISA) or
reverse transcription-polymerase chain reaction (RT-PCR)) (FIG. 5A), to rank different gRNAs (e.g., with different spacer sequences and/or different viral vector construct designs) for the ability to induce the expression level of PRPF31. As shown in FIG. 5B, certain gRNAs (e.g., from a library of gRNAs derived from Table 5) yielded greater expression of PRPF31 in the RPE cells.
Example 5. In vitro functional assay.
[0213] Induced pluripotent stem cells (iPSCs) derived from cells of healthy or non-diseased subjects/patients (control iPSC) and iPSCs derived from cells of a RPl l subjects/patients (e.g., RP11-iPSCs) can be differentiated into RPE cells (control RPE cells and RP11-RPE cells, respectively) using an established differentiation protocol. PRPF31 co-localization with small nuclear ribonucleoproteins (snRNPs) and/or ADP-ribosylation factor-like protein 13B (ARL13B) can be assessed for the iPSC-derived RPEs, to confirm PRPF31 localization in both splicing complexes and cilia.
Example 6. Ciliogenesis.
[0214] To measure cilia length and incidence in RP11-RPE cells, immunostaining against the cilia marker ARL13B can be performed on one or more members of the following: (i) wild type RPE cells (e.g., control RPE cells or isolated RPE cells) that is not engineered with the system of the present disclosure (e.g., a complex comprising dCas-activator and a guide RNA against PRPF31), (ii) RP11-RPE cells (e.g., RP11-iPSCs or isolated mutant RPE cells) that are not engineered with the system of the present disclosure, and (iii) RP11-RPE cells that are engineered with the system of the present disclosure. Both cilia incidence and cilia length can be measured via imaging (e g., ImagExpress Micro) for statistical analysis.
[0215] FIGs. 6A and 6B show examples images of spliceosome normal RPE cells (FIG. 6A, with cilium indicated by four arrows) and RPE cells with defected spliceosome (FIG. 6B). The RPE cells with spliceosome defect display decreased cilium.
Example 7. Detection of splicing events.
[0216] PRPF31 mutations can lead to impaired pre-mRNA splicing of key components involved in the splicing process itself. To evaluate the restoration of splicing events after transducing the system of the present disclosure (e.g., a complex comprising dCas-activator and a guide RNA against PRPF31) to mutant RPE, gene expression assays (e.g., RT-PCR) can be performed in RPE cells and retinal organoids derived from either control iPSCs or RP11-iPSCs. Expression
level and change thereof of one or more key genes involved in cilia formation and/or outer segments of photoreceptors can be assessed accordingly. Non-limiting examples of such key genes can include RPGR, RPGRIP1L, CN0T3, intraflagellar transport (IFT122), actin filament organization, centrosome and focal adhesion (SORB SI), and pre-mRNA 3 '-end processing (CPSF1).
Example 8. In vivo assay.
[0217] Gene delivery constructs (e.g., AAV constructs as shown in Example 3) in saline or a blank saline can be injected into the eye in vivo to treat or ameliorate an ocular disease (e.g., RP11).
[0218] For example, mice (e.g., age 4-5 weeks) can be anaesthetized and their pupils can be dilated. Mice can then be placed under an operating microscope. Upon visualizing the fundus of the mouse eye, a guide hole can be made through the sclera behind the iris (e.g., at an angle) towards the optic nerve. A micro-injection syringe can then be inserted into the hole and AAV (or blank saline) can be delivered to the vitreous body (for intravitreal injections), or the needle can be used to pierce the temporal retina and inject AAV to the subretinal space (for subretinal injections).
[0219] Effects of the AAV treatment can be assessed in later time points via visualization (e.g., fundoscopy, optical coherence tomography (OCT)), retinal function assay (e.g., electroretinogram (ERG)), immunohistochemistry (IHC), gene expression analysis or sequencing of retinal cells extracted from the mice, etc.
Table 4. Exemplary gRNA spacer nucleic acid sequence
Table 5. Exemplary gRNA spacer nucleic acid sequence
EMBODIMENTS
[0220] The following non-limiting embodiments provide illustrative examples of the invention, but do not limit the scope of the invention.
[0221] Embodiment 1. A system comprising: a heterologous polypeptide comprising an actuator moiety, wherein the actuator moiety is for binding an endogenous target gene encoding PRPF31 in a cell, to increase expression level of the PRPF31 in the cell, wherein:
(i) the actuator moiety substantially lacks DNA cleavage activity; and/or
(ii) the actuator moiety is coupled to a transcriptional activator, optionally wherein:
(1) the actuator moiety substantially lacks DNA cleavage activity; and/or
(2) the actuator moiety is coupled to the transcriptional activator; and/or
(3) (i) the actuator moiety substantially lacks DNA cleavage activity and (ii) the actuator moiety is coupled to the transcriptional activator; and/or
(4) the actuator moiety is fused to the transcriptional activator; and/or
(5) the cell is a retinal cell; and/or
(6) the cell is a retinal pigment epithelium (RPE) cell; and/or
(7) the endogenous target gene comprises a non-disease causing allele of the PRPF31; and/or
(8) the actuator moiety is not capable of binding an additional endogenous target gene encoding a mutant allele of the PRPF31; and/or
(9) the target endogenous gene further comprises a disease causing allele of the PRPF31; and/or
(10) the actuator moiety is a deactivated Cas (dCas) protein, further optionally wherein:
(i) a size of the dCas is less than or equal to about 800 amino acids; and/or
(ii) a size of the dCas is less than or equal to about 600 amino acids; and/or
(iii) the dCas protein comprises a polynucleotide sequence exhibiting at least about 90% sequence identity to the polynucleotide sequence selected from Table 1; and/or
(11) the system further comprises a guide nucleic acid capable of forming a complex with the actuator moiety, wherein the complex binds the endogenous target gene,
further optionally wherein the guide nucleic acid comprises a plurality of different guide nucleic acids capable of targeting different polynucleotide sequences of the endogenous target gene; and/or
(12) the heterologous polypeptide and/or the guide nucleic acid is under control of a tissue-specific promoter; and/or
(13) the heterologous polypeptide and/or the guide nucleic acid is under control of a photoreceptor-specific promoter; and/or
(14) the heterologous polypeptide and/or the guide nucleic acid is under control of a constitutive promoter; and/or
(15) the increase in the expression level of the PRPF31 in the cell effects enhanced cilium length and/or incidence, as compared to that in a control cell comprising a mutant allele of the PRPF31 in absence of the heterologous polypeptide and/or the guide nucleic acid; and/or
(16) a portion of the endogenous target gene that is targeted by the actuator moiety and/or the guide nucleic acid is at most about 1,000 nucleobases away from a transcription start site (TSS) of the endogenous target gene; and/or
(17) the guide nucleic acid comprises (i) a scaffold sequence for forming the complex with the actuator moiety and (ii) a spacer sequence exhibiting specific binding to the endogenous target gene, wherein the spacer sequence exhibits at least about 80%, at least about 90%, at least about 95%, or substantially about 100% sequence identity to the polynucleotide sequence selected from Table 5.
[0222] Embodiment 2. One or more polynucleotides encoding the system of Embodiment 1, optionally wherein the one or more polynucleotides comprise a single polynucleotide encoding at least the heterologous polypeptide and the guide nucleic acid, further optionally wherein:
(i) the single polynucleotide has a size of less than or equal to about 5 kilobases; and/or
(ii) the single polynucleotide has a size of less than or equal to about 4.7 kilobases; and/or
(iii) the single polynucleotide has a size of less than or equal to about 4.2 kilobases; and/or
(iv) the guide nucleic acid comprises the plurality of different guide nucleic acids. [0223] Embodiment 3. A method comprising administrating the system or the one or more polynucleotides of Embodiment 1 or Embodiment 2 to a subject in need thereof, optionally wherein:
(1) the administrating comprises intravitreal injection or subretinal injection; and/or
(2) the subject has or is suspected of having retinitis pigmentosa (RP); and/or
(3) the method further comprises, prior to the administrating, determining that the subject has the RP; and/or
(4) the RP is RP1E
[0224] Embodiment 4. A method comprising: increasing expression level of an endogenous target gene encoding PRPF31 in a cell, via binding of a heterologous polypeptide comprising an actuator moiety to bind the endogenous target gene, wherein:
(a) the actuator moiety substantially lacks DNA cleavage activity; and/or
(b) the actuator moiety is coupled to a transcriptional activator, optionally wherein:
(1) the actuator moiety substantially lacks DNA cleavage activity; and/or
(2) the actuator moiety is coupled to the transcriptional activator; and/or
(3) (i) the actuator moiety substantially lacks DNA cleavage activity and (ii) the actuator moiety is coupled to the transcriptional activator; and/or
(4) the actuator moiety is fused to the transcriptional activator; and/or
(5) the cell is a retinal cell; and/or
(6) the cell is a retinal pigment epithelium (RPE) cell; and/or
(7) the endogenous target gene comprises a non-disease causing allele of the PRPF31; and/or
(8) the actuator moiety is not capable of binding an additional endogenous target gene encoding a mutant allele of the PRPF31; and/or
(9) the endogenous target gene further comprises a disease causing allele of the PRPF31; and/or
(10) the actuator moiety is a deactivated Cas (dCas) protein, further optionally wherein:
(i) a size of the dCas is less than or equal to about 800 amino acids; and/or
(ii) a size of the dCas is less than or equal to about 600 amino acids; and/or
(iii) the dCas protein comprises a polynucleotide sequence exhibiting at least about 90% sequence identity to the polynucleotide sequence selected from Table 1.
(11) the increasing is via action of a complex comprising the actuator moiety and a guide nucleic acid, wherein the complex binds the endogenous target gene, further optionally wherein the guide nucleic acid comprises a plurality of different guide nucleic acids capable of targeting different regions of the endogenous target gene; and/or
(12) the heterologous polypeptide is under control of a tissue-specific promoter; and/or
(13) the heterologous polypeptide is under control of a photoreceptor-specific promoter; and/or
(14) the heterologous polypeptide and/or the guide nucleic acid is under control of a constitutive promoter; and/or
(15) expression level of the non-disease causing allele of the PRPF31 is enhanced relative to that of a mutant allele of the PRPF31 by at least about 0.1 -fold, at least about 1-fold, at least about 5-fold, at least about 10-fold, at least about 50-fold, at least about 100-fold, or at least about 500-fold; and/or
(16) upon the increasing, a phagocytosis level of the cell is reduced by at least about 0.1- fold, at least about 1-fold, at least about 5-fold, at least about 10-fold, at least about 50-fold, at least about 100-fold, or at least about 500-fold, as compared to that of a control cell comprising a mutant allele of the PRPF31 in absence of the heterologous polypeptide and/or the guide nucleic acid; and/or
(17) the increasing effects enhanced cilium length and/or incidence, as compared to that in a control cell comprising a mutant allele of the PRPF31 in absence of the heterologous polypeptide and/or the guide nucleic acid; and/or
(18) upon the increasing, a length of a primary cilia of the cell is longer by at least about 0.1-fold, at least about 1-fold, at least about 5-fold, at least about 10-fold, at least about 50-fold, at least about 100-fold, or at least about 500-fold, as compared to that of a control cell comprising a mutant allele of the PRPF31 in absence of the heterologous polypeptide and/or the guide nucleic acid; and/or
(19) the expression level of the non-disease causing allele of the PRPF31 is substantially the same as that in a cell of a non-diseased cell; and/or
(20) a portion of the endogenous target gene that is targeted by the actuator moiety and/or the guide nucleic acid is at most about 1,000 nucleobases away from a transcription start site (TSS) of the endogenous target gene; and/or
(21) the guide nucleic acid comprises (i) a scaffold sequence for forming the complex with the actuator moiety and (ii) a spacer sequence exhibiting specific binding to the endogenous target gene, wherein the spacer sequence exhibits at least about 80%, at least about 90%, at least about 95%, or substantially about 100% sequence identity to the polynucleotide sequence selected from the group consisting of Table 5.
[0225] It shall be understood that different aspects of the invention can be appreciated individually, collectively, or in combination with each other. Various aspects of the invention
described herein may be applied to any of the particular applications disclosed herein. The compositions of matter disclosed herein in the composition section of the present disclosure may be utilized in the method section including methods of use and production disclosed herein, or vice versa.
[0226] While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. It is not intended that the invention be limited by the specific examples provided within the specification. While the invention has been described with reference to the aforementioned specification, the descriptions and illustrations of the embodiments herein are not meant to be construed in a limiting sense. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. Furthermore, it shall be understood that all aspects of the invention are not limited to the specific depictions, configurations or relative proportions set forth herein which depend upon a variety of conditions and variables. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is therefore contemplated that the invention shall also cover any such alternatives, modifications, variations or equivalents. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.
Claims
1. A system comprising: a heterologous polypeptide comprising an actuator moiety, wherein the actuator moiety is for binding an endogenous target gene encoding PRPF31 in a cell, to increase expression level of the PRPF31 in the cell, wherein:
(i) the actuator moiety substantially lacks DNA cleavage activity; and/or
(ii) the actuator moiety is coupled to a transcriptional activator.
2. The system of claim 1, wherein the actuator moiety substantially lacks DNA cleavage activity.
3. The system of claim 1, wherein the actuator moiety is coupled to the transcriptional activator.
4. The system of claim 1, wherein (i) the actuator moiety substantially lacks DNA cleavage activity and (ii) the actuator moiety is coupled to the transcriptional activator.
5. The system of any one of the preceding claims, wherein the actuator moiety is fused to the transcriptional activator.
6. The system of any one of the preceding claims, wherein the cell is a retinal cell.
7. The system of any one of the preceding claims, wherein the cell is a retinal pigment epithelium (RPE) cell.
8. The system of any one of the preceding claims, wherein the endogenous target gene comprises a non-disease causing allele of the PRPF31.
9. The system of any one of the preceding claims, wherein the actuator moiety is not capable of binding an additional endogenous target gene encoding a mutant allele of the PRPF31.
10. The system of any one of the preceding claims, wherein the target endogenous gene further comprises a disease causing allele of the PRPF31.
11. The system of any one of the preceding claims, wherein the actuator moiety is a deactivated Cas (dCas) protein.
12. The system of claim 11, wherein a size of the dCas is less than or equal to about 800 amino acids.
13. The system of claim 11, wherein a size of the dCas is less than or equal to about 600 amino acids.
14. The system of claim 11, wherein the dCas protein comprises a polynucleotide sequence exhibiting at least about 90% sequence identity to the polynucleotide sequence selected from Table 1.
15. The system of any one of the preceding claims, further comprising a guide nucleic acid capable of forming a complex with the actuator moiety, wherein the complex binds the endogenous target gene.
16. The system of claim 15, wherein the guide nucleic acid comprises a plurality of different guide nucleic acids capable of targeting different polynucleotide sequences of the endogenous target gene.
17. The system of any one of the preceding claims, wherein the heterologous polypeptide and/or the guide nucleic acid is under control of a tissue-specific promoter.
18. The system of any one of the preceding claims, wherein the heterologous polypeptide and/or the guide nucleic acid is under control of a photoreceptor-specific promoter.
19. The system of any one of the preceding claims, wherein the heterologous polypeptide and/or the guide nucleic acid is under control of a constitutive promoter.
20. The system of any one of the preceding claims, wherein the increase in the expression level of the PRPF31 in the cell effects enhanced cilium length and/or incidence, as compared to that in a control cell comprising a mutant allele of the PRPF31 in absence of the heterologous polypeptide and/or the guide nucleic acid.
21. The system of any one of the preceding claims, wherein a portion of the endogenous target gene that is targeted by the actuator moiety and/or the guide nucleic acid is at most about 1,000 nucleobases away from a transcription start site (TSS) of the endogenous target gene.
22. The system of any one of the preceding claims, wherein the guide nucleic acid comprises (i) a scaffold sequence for forming the complex with the actuator moiety and (ii) a spacer sequence exhibiting specific binding to the endogenous target gene, wherein the spacer sequence exhibits at least about 80%, at least about 90%, at least about 95%, or substantially about 100% sequence identity to the polynucleotide sequence selected from Table 5.
23. One or more polynucleotides encoding the system of any one of the preceding claims.
24. The one or more polynucleotides of claim 23, wherein the one or more polynucleotides comprise a single polynucleotide encoding at least the heterologous polypeptide and the guide nucleic acid.
25. The one or more polynucleotides of claim 24, wherein the single polynucleotide has a size of less than or equal to about 5 kilobases.
26. The one or more polynucleotides of claim 24, wherein the single polynucleotide has a size of less than or equal to about 4.7 kilobases.
27. The one or more polynucleotides of claim 24, wherein the single polynucleotide has a size of less than or equal to about 4.2 kilobases.
28. The one or more polynucleotides of claim 24, wherein the guide nucleic acid comprises the plurality of different guide nucleic acids.
29. A method comprising administrating the system or the one or more polynucleotides of any one of the preceding claims to a subject in need thereof.
30. The method of claim 29, wherein the administrating comprises intravitreal injection or subretinal injection.
31. The method of any one of the preceding claims, wherein the subject has or is suspected of having retinitis pigmentosa (RP).
32. The method of any one of the preceding claims, further comprising, prior to the administrating, determining that the subject has the RP.
33. The method of any one of the preceding claims, wherein the RP is RP11.
34. A method comprising: increasing expression level of an endogenous target gene encoding PRPF31 in a cell, via binding of a heterologous polypeptide comprising an actuator moiety to bind the endogenous target gene, wherein:
(a) the actuator moiety substantially lacks DNA cleavage activity; and/or
(b) the actuator moiety is coupled to a transcriptional activator.
35. The method of claim 34, wherein the actuator moiety substantially lacks DNA cleavage activity.
36. The method of claim 34, wherein the actuator moiety is coupled to the transcriptional activator.
37. The method of claim 34, wherein (i) the actuator moiety substantially lacks DNA cleavage activity and (ii) the actuator moiety is coupled to the transcriptional activator.
38. The method of any one of the preceding claims, wherein the actuator moiety is fused to the transcriptional activator.
39. The method of any one of the preceding claims, wherein the cell is a retinal cell.
40. The method of any one of the preceding claims, wherein the cell is a retinal pigment epithelium (RPE) cell.
41. The method of any one of the preceding claims, wherein the endogenous target gene comprises a non-disease causing allele of the PRPF31.
42. The method of any one of the preceding claims, wherein the actuator moiety is not capable of binding an additional endogenous target gene encoding a mutant allele of the PRPF31.
43. The method of any one of the preceding claims, wherein the endogenous target gene further comprises a disease causing allele of the PRPF31.
44. The method of any one of the preceding claims, wherein the actuator moiety is a deactivated Cas (dCas) protein.
45. The method of claim 44, wherein a size of the dCas is less than or equal to about 800 amino acids.
46. The method of claim 44, wherein a size of the dCas is less than or equal to about 600 amino acids.
47. The method of claim 44, wherein the dCas protein comprises a polynucleotide sequence exhibiting at least about 90% sequence identity to the polynucleotide sequence selected from Table 1.
48. The method of any one of the preceding claims, wherein the increasing is via action of a complex comprising the actuator moiety and a guide nucleic acid, wherein the complex binds the endogenous target gene.
49. The method of claim 48, wherein the guide nucleic acid comprises a plurality of different guide nucleic acids capable of targeting different regions of the endogenous target gene.
50. The method of any one of the preceding claims, wherein the heterologous polypeptide is under control of a tissue-specific promoter.
51. The method of any one of the preceding claims, wherein the heterologous polypeptide is under control of a photoreceptor-specific promoter.
52. The method of any one of the preceding claims, wherein the heterologous polypeptide and/or the guide nucleic acid is under control of a constitutive promoter.
53. The method of any one of the preceding claims, wherein expression level of the non-disease causing allele of the PRPF31 is enhanced relative to that of a mutant allele of the PRPF31 by at least about 0.1-fold, at least about 1-fold, at least about 5-fold, at least about 10-fold, at least about 50-fold, at least about 100-fold, or at least about 500-fold.
54. The method of any one of the preceding claims, wherein, upon the increasing, a phagocytosis level of the cell is reduced by at least about 0.1-fold, at least about 1-fold, at least about 5-fold, at least about 10-fold, at least about 50-fold, at least about 100-fold, or at least about 500-fold, as compared to that of a control cell comprising a mutant allele of the PRPF31 in absence of the heterologous polypeptide and/or the guide nucleic acid.
55. The method of any one of the preceding claims, wherein the increasing effects enhanced cilium length and/or incidence, as compared to that in a control cell comprising a mutant allele of the PRPF31 in absence of the heterologous polypeptide and/or the guide nucleic acid.
56. The method of any one of the preceding claims, wherein, upon the increasing, a length of a primary cilia of the cell is longer by at least about 0.1 -fold, at least about 1-fold, at least about 5- fold, at least about 10-fold, at least about 50-fold, at least about 100-fold, or at least about 500- fold, as compared to that of a control cell comprising a mutant allele of the PRPF31 in absence of the heterologous polypeptide and/or the guide nucleic acid.
57. The method of any one of the preceding claims, wherein the expression level of the nondisease causing allele of the PRPF31 is substantially the same as that in a cell of a non-diseased cell.
58. The method of any one of the preceding claims, wherein a portion of the endogenous target gene that is targeted by the actuator moiety and/or the guide nucleic acid is at most about 1,000 nucleobases away from a transcription start site (TSS) of the endogenous target gene.
59. The method of any one of the preceding claims, wherein the guide nucleic acid comprises (i) a scaffold sequence for forming the complex with the actuator moiety and (ii) a spacer sequence exhibiting specific binding to the endogenous target gene, wherein the spacer sequence exhibits at least about 80%, at least about 90%, at least about 95%, or substantially about 100% sequence identity to the polynucleotide sequence selected from the group consisting of Table 5.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202263319088P | 2022-03-11 | 2022-03-11 | |
US63/319,088 | 2022-03-11 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2023172995A1 true WO2023172995A1 (en) | 2023-09-14 |
Family
ID=87935941
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2023/064004 WO2023172995A1 (en) | 2022-03-11 | 2023-03-09 | Systems and methods for genetic modulation to treat ocular diseases |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2023172995A1 (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019210305A1 (en) * | 2018-04-27 | 2019-10-31 | The Trustees Of Columbia University In The City Of New York | Methods of inactivating gene editing machineries |
US20190351074A1 (en) * | 2017-02-07 | 2019-11-21 | The Regents Of The University Of California | Gene therapy for haploinsufficiency |
US10934536B2 (en) * | 2018-12-14 | 2021-03-02 | Pioneer Hi-Bred International, Inc. | CRISPR-CAS systems for genome editing |
WO2022020393A1 (en) * | 2020-07-20 | 2022-01-27 | Mammoth Biosciences, Inc. | High-throughput single-chamber programmable nuclease assay |
-
2023
- 2023-03-09 WO PCT/US2023/064004 patent/WO2023172995A1/en unknown
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190351074A1 (en) * | 2017-02-07 | 2019-11-21 | The Regents Of The University Of California | Gene therapy for haploinsufficiency |
WO2019210305A1 (en) * | 2018-04-27 | 2019-10-31 | The Trustees Of Columbia University In The City Of New York | Methods of inactivating gene editing machineries |
US10934536B2 (en) * | 2018-12-14 | 2021-03-02 | Pioneer Hi-Bred International, Inc. | CRISPR-CAS systems for genome editing |
WO2022020393A1 (en) * | 2020-07-20 | 2022-01-27 | Mammoth Biosciences, Inc. | High-throughput single-chamber programmable nuclease assay |
Non-Patent Citations (1)
Title |
---|
BUSKIN ADRIANA, ZHU LILI, CHICHAGOVA VALERIA, BASU BASUDHA, MOZAFFARI-JOVIN SINA, DOLAN DAVID, DROOP ALASTAIR, COLLIN JOSEPH, BRON: "Disrupted alternative splicing for genes implicated in splicing and ciliogenesis causes PRPF31 retinitis pigmentosa", NATURE COMMUNICATIONS, vol. 9, no. 1, 1 December 2018 (2018-12-01), XP055829450, DOI: 10.1038/s41467-018-06448-y * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11591622B2 (en) | Method of making and using mammalian liver cells for treating hemophilia or lysosomal storage disorder | |
Suzuki et al. | In vivo genome editing via CRISPR/Cas9 mediated homology-independent targeted integration | |
EP4186921A1 (en) | Gene editing for autosomal dominant diseases | |
ES2898917T3 (en) | Gene therapy for autosomal dominant diseases | |
JP6646573B2 (en) | Delivery methods and compositions for nuclease-mediated genomic engineering | |
JP2019525756A (en) | Therapeutic application of genome editing based on CPF1 | |
KR20230169449A (en) | Rna-guided nucleic acid modifying enzymes and methods of use thereof | |
CA2932478A1 (en) | Delivery, use and therapeutic applications of the crispr-cas systems and compositions for genome editing | |
CN113423831A (en) | Nuclease-mediated repeat amplification | |
US20220315948A1 (en) | Aav vectors encoding mini-pcdh15 and uses thereof | |
CN110249051A (en) | Enhance the method and composition that functional myelin generates | |
WO2019134561A1 (en) | High efficiency in vivo knock-in using crispr | |
WO2023173110A1 (en) | Compositions, systems, and methods for treating familial hypercholesterolemia by targeting pcsk9 | |
EP3640334A1 (en) | Genome editing system for repeat expansion mutation | |
WO2023172995A1 (en) | Systems and methods for genetic modulation to treat ocular diseases | |
CN112805012A (en) | Genetic modification of mitochondrial genome | |
WO2023173120A1 (en) | Systems and methods for genetic modulation to treat ocular diseases | |
WO2023173072A1 (en) | Systems and methods for genetic modulation to treat liver disease | |
JP2023545132A (en) | CRISPR/CAS-based base editing compositions to restore dystrophin function | |
Molinari et al. | Gene and epigenetic editing in the treatment of primary ciliopathies | |
WO2023168242A1 (en) | Engineered nucleases, compositions, and methods of use thereof | |
US20220380756A1 (en) | Methods and compositions for treating thalassemia or sickle cell disease | |
Kantor | Development of CRISPR-Cas genome and epigenome engineering tools towards retinal degenerations | |
LLADO SANTAEULARIA | THERAPEUTIC GENOME EDITING IN RETINA AND LIVER |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23767678 Country of ref document: EP Kind code of ref document: A1 |