WO2017035416A2 - Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases - Google Patents
Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases Download PDFInfo
- Publication number
- WO2017035416A2 WO2017035416A2 PCT/US2016/048798 US2016048798W WO2017035416A2 WO 2017035416 A2 WO2017035416 A2 WO 2017035416A2 US 2016048798 W US2016048798 W US 2016048798W WO 2017035416 A2 WO2017035416 A2 WO 2017035416A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- grna
- protospacer
- dna
- target
- cas9
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 234
- 239000000203 mixture Substances 0.000 title description 48
- 108010042407 Endonucleases Proteins 0.000 title description 15
- 102000004533 Endonucleases Human genes 0.000 title description 15
- 108020005004 Guide RNA Proteins 0.000 claims abstract description 869
- 230000027455 binding Effects 0.000 claims abstract description 203
- 230000001965 increasing effect Effects 0.000 claims abstract description 31
- 108091033409 CRISPR Proteins 0.000 claims description 409
- 108090000623 proteins and genes Proteins 0.000 claims description 361
- 210000004027 cell Anatomy 0.000 claims description 269
- 239000002773 nucleotide Substances 0.000 claims description 224
- 125000003729 nucleotide group Chemical group 0.000 claims description 205
- 230000000694 effects Effects 0.000 claims description 177
- 230000009545 invasion Effects 0.000 claims description 140
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 112
- 102000040430 polynucleotide Human genes 0.000 claims description 105
- 108091033319 polynucleotide Proteins 0.000 claims description 105
- 239000002157 polynucleotide Substances 0.000 claims description 105
- 239000013598 vector Substances 0.000 claims description 96
- 150000007523 nucleic acids Chemical class 0.000 claims description 86
- 238000003776 cleavage reaction Methods 0.000 claims description 85
- 230000007017 scission Effects 0.000 claims description 85
- 230000014509 gene expression Effects 0.000 claims description 80
- 230000000295 complement effect Effects 0.000 claims description 79
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 70
- 102000039446 nucleic acids Human genes 0.000 claims description 68
- 108020004707 nucleic acids Proteins 0.000 claims description 68
- 101710163270 Nuclease Proteins 0.000 claims description 67
- 108020001507 fusion proteins Proteins 0.000 claims description 64
- 102000037865 fusion proteins Human genes 0.000 claims description 64
- 230000035897 transcription Effects 0.000 claims description 58
- 238000013518 transcription Methods 0.000 claims description 58
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 37
- 108020004414 DNA Proteins 0.000 claims description 35
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 34
- 230000008685 targeting Effects 0.000 claims description 34
- 238000001727 in vivo Methods 0.000 claims description 32
- 229920001184 polypeptide Polymers 0.000 claims description 32
- 102000053602 DNA Human genes 0.000 claims description 31
- 238000013461 design Methods 0.000 claims description 29
- 238000010362 genome editing Methods 0.000 claims description 29
- 230000006780 non-homologous end joining Effects 0.000 claims description 28
- 108010033040 Histones Proteins 0.000 claims description 27
- 230000004913 activation Effects 0.000 claims description 25
- 230000003993 interaction Effects 0.000 claims description 25
- 102100024108 Dystrophin Human genes 0.000 claims description 22
- 101000615488 Homo sapiens Methyl-CpG-binding domain protein 2 Proteins 0.000 claims description 20
- 102100021299 Methyl-CpG-binding domain protein 2 Human genes 0.000 claims description 20
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 claims description 20
- 238000000137 annealing Methods 0.000 claims description 19
- 238000003556 assay Methods 0.000 claims description 18
- 238000012360 testing method Methods 0.000 claims description 18
- 230000007018 DNA scission Effects 0.000 claims description 17
- 230000001404 mediated effect Effects 0.000 claims description 16
- 230000004048 modification Effects 0.000 claims description 16
- 238000012986 modification Methods 0.000 claims description 16
- 238000004422 calculation algorithm Methods 0.000 claims description 15
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 14
- 230000003247 decreasing effect Effects 0.000 claims description 14
- 230000002950 deficient Effects 0.000 claims description 12
- 238000012163 sequencing technique Methods 0.000 claims description 12
- 230000004568 DNA-binding Effects 0.000 claims description 11
- 101001053946 Homo sapiens Dystrophin Proteins 0.000 claims description 11
- 238000000342 Monte Carlo simulation Methods 0.000 claims description 11
- 101001053942 Saccharolobus solfataricus (strain ATCC 35092 / DSM 1617 / JCM 11322 / P2) Diphosphomevalonate decarboxylase Proteins 0.000 claims description 11
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 11
- 101001048956 Homo sapiens Homeobox protein EMX1 Proteins 0.000 claims description 10
- 230000001939 inductive effect Effects 0.000 claims description 10
- 201000010099 disease Diseases 0.000 claims description 9
- 102100023823 Homeobox protein EMX1 Human genes 0.000 claims description 8
- 108700019146 Transgenes Proteins 0.000 claims description 8
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 6
- 101150030763 Vegfa gene Proteins 0.000 claims description 6
- 238000010354 CRISPR gene editing Methods 0.000 claims description 5
- 230000015556 catabolic process Effects 0.000 claims description 5
- 238000006731 degradation reaction Methods 0.000 claims description 5
- 102100025169 Max-binding protein MNT Human genes 0.000 claims description 3
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 3
- 108091006106 transcriptional activators Proteins 0.000 claims description 3
- 108091006107 transcriptional repressors Proteins 0.000 claims description 3
- 230000002829 reductive effect Effects 0.000 abstract description 7
- 102000004169 proteins and genes Human genes 0.000 description 130
- 235000018102 proteins Nutrition 0.000 description 116
- 239000000758 substrate Substances 0.000 description 55
- 239000003623 enhancer Substances 0.000 description 51
- 241000282414 Homo sapiens Species 0.000 description 37
- 238000002474 experimental method Methods 0.000 description 37
- 230000002068 genetic effect Effects 0.000 description 36
- 230000001105 regulatory effect Effects 0.000 description 31
- 108091027544 Subgenomic mRNA Proteins 0.000 description 28
- 238000004630 atomic force microscopy Methods 0.000 description 28
- 239000013612 plasmid Substances 0.000 description 27
- 239000003795 chemical substances by application Substances 0.000 description 26
- 238000009826 distribution Methods 0.000 description 26
- 101000882390 Homo sapiens Histone acetyltransferase p300 Proteins 0.000 description 25
- 230000008859 change Effects 0.000 description 23
- 230000006870 function Effects 0.000 description 23
- 230000035772 mutation Effects 0.000 description 23
- 108091079001 CRISPR RNA Proteins 0.000 description 22
- 108091026890 Coding region Proteins 0.000 description 22
- 235000001014 amino acid Nutrition 0.000 description 22
- 230000007246 mechanism Effects 0.000 description 22
- 230000008488 polyadenylation Effects 0.000 description 22
- 102100038885 Histone acetyltransferase p300 Human genes 0.000 description 21
- 238000010494 dissociation reaction Methods 0.000 description 21
- 230000005593 dissociations Effects 0.000 description 21
- 108010069091 Dystrophin Proteins 0.000 description 20
- 101000978776 Mus musculus Neurogenic locus notch homolog protein 1 Proteins 0.000 description 20
- 229940024606 amino acid Drugs 0.000 description 20
- 108010090804 Streptavidin Proteins 0.000 description 18
- 150000001413 amino acids Chemical class 0.000 description 18
- 230000008439 repair process Effects 0.000 description 18
- 108020004485 Nonsense Codon Proteins 0.000 description 17
- -1 silencers Substances 0.000 description 17
- 108020005067 RNA Splice Sites Proteins 0.000 description 15
- 230000001594 aberrant effect Effects 0.000 description 15
- 230000009437 off-target effect Effects 0.000 description 15
- 239000013607 AAV vector Substances 0.000 description 14
- 241000702423 Adeno-associated virus - 2 Species 0.000 description 14
- 238000004458 analytical method Methods 0.000 description 14
- 239000012636 effector Substances 0.000 description 14
- 238000001890 transfection Methods 0.000 description 14
- 206010013801 Duchenne Muscular Dystrophy Diseases 0.000 description 13
- 101001000998 Homo sapiens Protein phosphatase 1 regulatory subunit 12C Proteins 0.000 description 13
- 102100035620 Protein phosphatase 1 regulatory subunit 12C Human genes 0.000 description 13
- 241000193996 Streptococcus pyogenes Species 0.000 description 13
- 239000012212 insulator Substances 0.000 description 13
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 12
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 12
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 12
- 238000012937 correction Methods 0.000 description 12
- 230000007423 decrease Effects 0.000 description 12
- 230000001419 dependent effect Effects 0.000 description 12
- 230000009438 off-target cleavage Effects 0.000 description 12
- 108010077544 Chromatin Proteins 0.000 description 11
- 108020004705 Codon Proteins 0.000 description 11
- 102000006947 Histones Human genes 0.000 description 11
- 208000026350 Inborn Genetic disease Diseases 0.000 description 11
- 241000124008 Mammalia Species 0.000 description 11
- 210000003483 chromatin Anatomy 0.000 description 11
- 208000016361 genetic disease Diseases 0.000 description 11
- 239000005090 green fluorescent protein Substances 0.000 description 11
- 230000007704 transition Effects 0.000 description 11
- 238000011144 upstream manufacturing Methods 0.000 description 11
- 102100039819 Actin, alpha cardiac muscle 1 Human genes 0.000 description 10
- 241000701022 Cytomegalovirus Species 0.000 description 10
- 102000003893 Histone acetyltransferases Human genes 0.000 description 10
- 108090000246 Histone acetyltransferases Proteins 0.000 description 10
- 101000959247 Homo sapiens Actin, alpha cardiac muscle 1 Proteins 0.000 description 10
- 238000012217 deletion Methods 0.000 description 10
- 230000037430 deletion Effects 0.000 description 10
- 230000005782 double-strand break Effects 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 10
- 108020004999 messenger RNA Proteins 0.000 description 10
- 239000000523 sample Substances 0.000 description 10
- 238000004088 simulation Methods 0.000 description 10
- 210000001519 tissue Anatomy 0.000 description 10
- 241000700605 Viruses Species 0.000 description 9
- 230000001580 bacterial effect Effects 0.000 description 9
- 238000005520 cutting process Methods 0.000 description 9
- 238000003384 imaging method Methods 0.000 description 9
- 230000036961 partial effect Effects 0.000 description 9
- 210000002027 skeletal muscle Anatomy 0.000 description 9
- 102000004190 Enzymes Human genes 0.000 description 8
- 108090000790 Enzymes Proteins 0.000 description 8
- 239000002299 complementary DNA Substances 0.000 description 8
- 230000002596 correlated effect Effects 0.000 description 8
- CVSVTCORWBXHQV-UHFFFAOYSA-N creatine Chemical compound NC(=[NH2+])N(C)CC([O-])=O CVSVTCORWBXHQV-UHFFFAOYSA-N 0.000 description 8
- 238000004520 electroporation Methods 0.000 description 8
- 239000012634 fragment Substances 0.000 description 8
- 230000037433 frameshift Effects 0.000 description 8
- 230000004927 fusion Effects 0.000 description 8
- 238000000338 in vitro Methods 0.000 description 8
- 230000000977 initiatory effect Effects 0.000 description 8
- 238000003780 insertion Methods 0.000 description 8
- 210000003205 muscle Anatomy 0.000 description 8
- 239000008194 pharmaceutical composition Substances 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 125000006850 spacer group Chemical group 0.000 description 8
- 241000701161 unidentified adenovirus Species 0.000 description 8
- 108020004635 Complementary DNA Proteins 0.000 description 7
- 241000702421 Dependoparvovirus Species 0.000 description 7
- 102000040945 Transcription factor Human genes 0.000 description 7
- 108091023040 Transcription factor Proteins 0.000 description 7
- 230000008901 benefit Effects 0.000 description 7
- 238000010804 cDNA synthesis Methods 0.000 description 7
- 238000012350 deep sequencing Methods 0.000 description 7
- 239000000499 gel Substances 0.000 description 7
- 238000009396 hybridization Methods 0.000 description 7
- 230000037431 insertion Effects 0.000 description 7
- 239000002502 liposome Substances 0.000 description 7
- 210000004962 mammalian cell Anatomy 0.000 description 7
- 239000000463 material Substances 0.000 description 7
- 230000037361 pathway Effects 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 6
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 6
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 6
- 239000004472 Lysine Substances 0.000 description 6
- 241000714474 Rous sarcoma virus Species 0.000 description 6
- 108091081024 Start codon Proteins 0.000 description 6
- 210000005260 human cell Anatomy 0.000 description 6
- 230000006872 improvement Effects 0.000 description 6
- 229910001629 magnesium chloride Inorganic materials 0.000 description 6
- 238000005259 measurement Methods 0.000 description 6
- 239000011780 sodium chloride Substances 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- 230000014616 translation Effects 0.000 description 6
- 230000003612 virological effect Effects 0.000 description 6
- 108091092724 Noncoding DNA Proteins 0.000 description 5
- 238000013459 approach Methods 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 210000000349 chromosome Anatomy 0.000 description 5
- 210000004165 myocardium Anatomy 0.000 description 5
- 239000002105 nanoparticle Substances 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 230000029058 respiratory gaseous exchange Effects 0.000 description 5
- 230000003584 silencer Effects 0.000 description 5
- 230000002103 transcriptional effect Effects 0.000 description 5
- 230000001052 transient effect Effects 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 239000013603 viral vector Substances 0.000 description 5
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 4
- 108010085238 Actins Proteins 0.000 description 4
- 102000007469 Actins Human genes 0.000 description 4
- 241000713826 Avian leukosis virus Species 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- 241000713704 Bovine immunodeficiency virus Species 0.000 description 4
- 102220605874 Cytosolic arginine sensor for mTORC1 subunit 2_D10A_mutation Human genes 0.000 description 4
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 4
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 4
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 4
- 108010054147 Hemoglobins Proteins 0.000 description 4
- 102000001554 Hemoglobins Human genes 0.000 description 4
- 102000008157 Histone Demethylases Human genes 0.000 description 4
- 108010074870 Histone Demethylases Proteins 0.000 description 4
- 108010000521 Human Growth Hormone Proteins 0.000 description 4
- 102000002265 Human Growth Hormone Human genes 0.000 description 4
- 239000000854 Human Growth Hormone Substances 0.000 description 4
- 241000725303 Human immunodeficiency virus Species 0.000 description 4
- 241000713666 Lentivirus Species 0.000 description 4
- 238000007476 Maximum Likelihood Methods 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 102000003505 Myosin Human genes 0.000 description 4
- 108060008487 Myosin Proteins 0.000 description 4
- 108091005461 Nucleic proteins Proteins 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 4
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 108010006025 bovine growth hormone Proteins 0.000 description 4
- 210000000234 capsid Anatomy 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 230000008876 conformational transition Effects 0.000 description 4
- 229960003624 creatine Drugs 0.000 description 4
- 239000006046 creatine Substances 0.000 description 4
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 4
- 239000003814 drug Substances 0.000 description 4
- 230000001747 exhibiting effect Effects 0.000 description 4
- 230000002349 favourable effect Effects 0.000 description 4
- 238000009472 formulation Methods 0.000 description 4
- 231100000221 frame shift mutation induction Toxicity 0.000 description 4
- 238000001415 gene therapy Methods 0.000 description 4
- 102000044787 human EP300 Human genes 0.000 description 4
- 230000001976 improved effect Effects 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 150000002632 lipids Chemical class 0.000 description 4
- 238000002844 melting Methods 0.000 description 4
- 230000008018 melting Effects 0.000 description 4
- 230000000813 microbial effect Effects 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 230000000869 mutational effect Effects 0.000 description 4
- 210000001087 myotubule Anatomy 0.000 description 4
- 230000037434 nonsense mutation Effects 0.000 description 4
- 229920000447 polyanionic polymer Polymers 0.000 description 4
- 229920002643 polyglutamic acid Polymers 0.000 description 4
- 230000023603 positive regulation of transcription initiation, DNA-dependent Effects 0.000 description 4
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 4
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 4
- 230000009870 specific binding Effects 0.000 description 4
- 229940031439 squalene Drugs 0.000 description 4
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 4
- 238000010561 standard procedure Methods 0.000 description 4
- 239000007858 starting material Substances 0.000 description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 4
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 3
- 241000203069 Archaea Species 0.000 description 3
- 102100029822 B- and T-lymphocyte attenuator Human genes 0.000 description 3
- 102100022005 B-lymphocyte antigen CD20 Human genes 0.000 description 3
- 241000283690 Bos taurus Species 0.000 description 3
- 102100039498 Cytotoxic T-lymphocyte protein 4 Human genes 0.000 description 3
- 230000007067 DNA methylation Effects 0.000 description 3
- 230000033616 DNA repair Effects 0.000 description 3
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 3
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 3
- 102100039869 Histone H2B type F-S Human genes 0.000 description 3
- 102000003964 Histone deacetylase Human genes 0.000 description 3
- 108090000353 Histone deacetylase Proteins 0.000 description 3
- 101000897405 Homo sapiens B-lymphocyte antigen CD20 Proteins 0.000 description 3
- 101000899111 Homo sapiens Hemoglobin subunit beta Proteins 0.000 description 3
- 101001035372 Homo sapiens Histone H2B type F-S Proteins 0.000 description 3
- 101000808011 Homo sapiens Vascular endothelial growth factor A Proteins 0.000 description 3
- 206010028289 Muscle atrophy Diseases 0.000 description 3
- 241000588650 Neisseria meningitidis Species 0.000 description 3
- 108010057163 Ribonuclease III Proteins 0.000 description 3
- 102000003661 Ribonuclease III Human genes 0.000 description 3
- 241000282898 Sus scrofa Species 0.000 description 3
- 102100039037 Vascular endothelial growth factor A Human genes 0.000 description 3
- 230000005856 abnormality Effects 0.000 description 3
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 3
- 230000021736 acetylation Effects 0.000 description 3
- 238000006640 acetylation reaction Methods 0.000 description 3
- 230000003213 activating effect Effects 0.000 description 3
- 230000006978 adaptation Effects 0.000 description 3
- 101150063416 add gene Proteins 0.000 description 3
- 239000002671 adjuvant Substances 0.000 description 3
- 230000004888 barrier function Effects 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 210000001072 colon Anatomy 0.000 description 3
- 239000003086 colorant Substances 0.000 description 3
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 3
- 238000002337 electrophoretic mobility shift assay Methods 0.000 description 3
- 238000012236 epigenome editing Methods 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 238000002744 homologous recombination Methods 0.000 description 3
- 230000006801 homologous recombination Effects 0.000 description 3
- 210000003917 human chromosome Anatomy 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 210000003734 kidney Anatomy 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 3
- 230000011987 methylation Effects 0.000 description 3
- 238000007069 methylation reaction Methods 0.000 description 3
- 210000000663 muscle cell Anatomy 0.000 description 3
- 230000009871 nonspecific binding Effects 0.000 description 3
- 244000052769 pathogen Species 0.000 description 3
- 239000000546 pharmaceutical excipient Substances 0.000 description 3
- 239000002953 phosphate buffered saline Substances 0.000 description 3
- 229920002401 polyacrylamide Polymers 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 238000007634 remodeling Methods 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- KIUKXJAPPMFGSW-DNGZLQJQSA-N (2S,3S,4S,5R,6R)-6-[(2S,3R,4R,5S,6R)-3-Acetamido-2-[(2S,3S,4R,5R,6R)-6-[(2R,3R,4R,5S,6R)-3-acetamido-2,5-dihydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-2-carboxy-4,5-dihydroxyoxan-3-yl]oxy-5-hydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-3,4,5-trihydroxyoxane-2-carboxylic acid Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@H](O)[C@H](O)[C@H](O3)C(O)=O)O)[C@H](O)[C@@H](CO)O2)NC(C)=O)[C@@H](C(O)=O)O1 KIUKXJAPPMFGSW-DNGZLQJQSA-N 0.000 description 2
- LRFVTYWOQMYALW-UHFFFAOYSA-N 9H-xanthine Chemical compound O=C1NC(=O)NC2=C1NC=N2 LRFVTYWOQMYALW-UHFFFAOYSA-N 0.000 description 2
- 241001634120 Adeno-associated virus - 5 Species 0.000 description 2
- 241000972680 Adeno-associated virus - 6 Species 0.000 description 2
- 241001164825 Adeno-associated virus - 8 Species 0.000 description 2
- 102000007471 Adenosine A2A receptor Human genes 0.000 description 2
- 108010085277 Adenosine A2A receptor Proteins 0.000 description 2
- 244000303258 Annona diversifolia Species 0.000 description 2
- 235000002198 Annona diversifolia Nutrition 0.000 description 2
- 241000282817 Bovidae Species 0.000 description 2
- 229940045513 CTLA4 antagonist Drugs 0.000 description 2
- BHPQYMZQTOCNFJ-UHFFFAOYSA-N Calcium cation Chemical compound [Ca+2] BHPQYMZQTOCNFJ-UHFFFAOYSA-N 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- 238000011238 DNA vaccination Methods 0.000 description 2
- 241000701832 Enterobacteria phage T3 Species 0.000 description 2
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- 241000589601 Francisella Species 0.000 description 2
- 101710113436 GTPase KRas Proteins 0.000 description 2
- 241000287828 Gallus gallus Species 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- 108010034791 Heterochromatin Proteins 0.000 description 2
- 108010036115 Histone Methyltransferases Proteins 0.000 description 2
- 102000011787 Histone Methyltransferases Human genes 0.000 description 2
- 102100022846 Histone acetyltransferase KAT2B Human genes 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101000889276 Homo sapiens Cytotoxic T-lymphocyte protein 4 Proteins 0.000 description 2
- 101001047006 Homo sapiens Histone acetyltransferase KAT2B Proteins 0.000 description 2
- 101001137987 Homo sapiens Lymphocyte activation gene 3 protein Proteins 0.000 description 2
- 101000946889 Homo sapiens Monocyte differentiation antigen CD14 Proteins 0.000 description 2
- 101000666896 Homo sapiens V-type immunoglobulin domain-containing suppressor of T-cell activation Proteins 0.000 description 2
- 206010020751 Hypersensitivity Diseases 0.000 description 2
- 108010043610 KIR Receptors Proteins 0.000 description 2
- 102000002698 KIR Receptors Human genes 0.000 description 2
- 241001112693 Lachnospiraceae Species 0.000 description 2
- 102100024640 Low-density lipoprotein receptor Human genes 0.000 description 2
- 102100020862 Lymphocyte activation gene 3 protein Human genes 0.000 description 2
- 102100035877 Monocyte differentiation antigen CD14 Human genes 0.000 description 2
- 241000713333 Mouse mammary tumor virus Species 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- 101100494762 Mus musculus Nedd9 gene Proteins 0.000 description 2
- 102100030217 Myocardin Human genes 0.000 description 2
- 108010081823 Myocardin Proteins 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 241001494479 Pecora Species 0.000 description 2
- 229920002873 Polyethylenimine Polymers 0.000 description 2
- 241000605861 Prevotella Species 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- 102000004389 Ribonucleoproteins Human genes 0.000 description 2
- 108010081734 Ribonucleoproteins Proteins 0.000 description 2
- 108020005038 Terminator Codon Proteins 0.000 description 2
- 108090000848 Ubiquitin Proteins 0.000 description 2
- 102000044159 Ubiquitin Human genes 0.000 description 2
- 102100038929 V-set domain-containing T-cell activation inhibitor 1 Human genes 0.000 description 2
- 102100038282 V-type immunoglobulin domain-containing suppressor of T-cell activation Human genes 0.000 description 2
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 2
- 108010067390 Viral Proteins Proteins 0.000 description 2
- 239000000370 acceptor Substances 0.000 description 2
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 2
- 102000005421 acetyltransferase Human genes 0.000 description 2
- 108020002494 acetyltransferase Proteins 0.000 description 2
- 239000012190 activator Substances 0.000 description 2
- 210000005006 adaptive immune system Anatomy 0.000 description 2
- 230000002411 adverse Effects 0.000 description 2
- 230000032683 aging Effects 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 2
- 238000000089 atomic force micrograph Methods 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 2
- 210000003719 b-lymphocyte Anatomy 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 238000001369 bisulfite sequencing Methods 0.000 description 2
- 229910001424 calcium ion Inorganic materials 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 230000022131 cell cycle Effects 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 210000002230 centromere Anatomy 0.000 description 2
- 210000003763 chloroplast Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 230000000875 corresponding effect Effects 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 241001493065 dsRNA viruses Species 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 230000001973 epigenetic effect Effects 0.000 description 2
- 230000004049 epigenetic modification Effects 0.000 description 2
- 210000003238 esophagus Anatomy 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 239000000835 fiber Substances 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 238000012239 gene modification Methods 0.000 description 2
- 230000030279 gene silencing Effects 0.000 description 2
- 230000037442 genomic alteration Effects 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 210000004458 heterochromatin Anatomy 0.000 description 2
- 229920002674 hyaluronan Polymers 0.000 description 2
- 229960003160 hyaluronic acid Drugs 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 102000006639 indoleamine 2,3-dioxygenase Human genes 0.000 description 2
- 108020004201 indoleamine 2,3-dioxygenase Proteins 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 238000002743 insertional mutagenesis Methods 0.000 description 2
- 238000007689 inspection Methods 0.000 description 2
- DRAVOWXCEBXPTN-UHFFFAOYSA-N isoguanine Chemical compound NC1=NC(=O)NC2=C1NC=N2 DRAVOWXCEBXPTN-UHFFFAOYSA-N 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 210000003470 mitochondria Anatomy 0.000 description 2
- 239000003607 modifier Substances 0.000 description 2
- 229940035032 monophosphoryl lipid a Drugs 0.000 description 2
- 125000001446 muramyl group Chemical group N[C@@H](C=O)[C@@H](O[C@@H](C(=O)*)C)[C@H](O)[C@H](O)CO 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 210000000107 myocyte Anatomy 0.000 description 2
- 108091027963 non-coding RNA Proteins 0.000 description 2
- 102000042567 non-coding RNA Human genes 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 230000030648 nucleus localization Effects 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- XIBUKSQTWSKJMQ-QTNFYWBSSA-M potassium;(4s)-4-amino-5-hydroxy-5-oxopentanoate;hydrate Chemical compound O.[K+].[O-]C(=O)[C@@H](N)CCC(O)=O XIBUKSQTWSKJMQ-QTNFYWBSSA-M 0.000 description 2
- 230000003389 potentiating effect Effects 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 230000000750 progressive effect Effects 0.000 description 2
- 210000002307 prostate Anatomy 0.000 description 2
- 230000004952 protein activity Effects 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 150000004053 quinones Chemical class 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 230000000754 repressing effect Effects 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 238000013515 script Methods 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 210000002784 stomach Anatomy 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000004094 surface-active agent Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 230000009885 systemic effect Effects 0.000 description 2
- 108091035539 telomere Proteins 0.000 description 2
- 102000055501 telomere Human genes 0.000 description 2
- 210000003411 telomere Anatomy 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 238000004627 transmission electron microscopy Methods 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- 230000029663 wound healing Effects 0.000 description 2
- IIZPXYDJLKNOIY-JXPKJXOSSA-N 1-palmitoyl-2-arachidonoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCC\C=C/C\C=C/C\C=C/C\C=C/CCCCC IIZPXYDJLKNOIY-JXPKJXOSSA-N 0.000 description 1
- XQCZBXHVTFVIFE-UHFFFAOYSA-N 2-amino-4-hydroxypyrimidine Chemical compound NC1=NC=CC(O)=N1 XQCZBXHVTFVIFE-UHFFFAOYSA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- 108010013043 Acetylesterase Proteins 0.000 description 1
- 241000604451 Acidaminococcus Species 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 241001655883 Adeno-associated virus - 1 Species 0.000 description 1
- 102100032157 Adenylate cyclase type 10 Human genes 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 108091093088 Amplicon Proteins 0.000 description 1
- 206010003594 Ataxia telangiectasia Diseases 0.000 description 1
- 102100039341 Atrial natriuretic peptide receptor 2 Human genes 0.000 description 1
- 101710102159 Atrial natriuretic peptide receptor 2 Proteins 0.000 description 1
- 101710144268 B- and T-lymphocyte attenuator Proteins 0.000 description 1
- 101150068685 BTG2 gene Proteins 0.000 description 1
- 102100022548 Beta-hexosaminidase subunit alpha Human genes 0.000 description 1
- 241000283726 Bison Species 0.000 description 1
- 208000005692 Bloom Syndrome Diseases 0.000 description 1
- 241000030939 Bubalus bubalis Species 0.000 description 1
- 101001042337 Burkholderia plantarii Lipase-specific foldase Proteins 0.000 description 1
- 108010014064 CCCTC-Binding Factor Proteins 0.000 description 1
- 125000006519 CCH3 Chemical group 0.000 description 1
- 101150052909 CCL2 gene Proteins 0.000 description 1
- 101150102665 CCL20 gene Proteins 0.000 description 1
- 101150111331 CCL5 gene Proteins 0.000 description 1
- 108010040163 CREB-Binding Protein Proteins 0.000 description 1
- 102100021975 CREB-binding protein Human genes 0.000 description 1
- 238000010454 CRISPR gRNA design Methods 0.000 description 1
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 1
- 238000010453 CRISPR/Cas method Methods 0.000 description 1
- 108091011896 CSF1 Proteins 0.000 description 1
- 101100371682 Caenorhabditis elegans cyk-3 gene Proteins 0.000 description 1
- 101100402853 Caenorhabditis elegans mtd-1 gene Proteins 0.000 description 1
- 241000282836 Camelus dromedarius Species 0.000 description 1
- 241000700199 Cavia porcellus Species 0.000 description 1
- 101150009911 Ccl7 gene Proteins 0.000 description 1
- 101150023376 Cebpd gene Proteins 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 241000282994 Cervidae Species 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 102000008186 Collagen Human genes 0.000 description 1
- 108010035532 Collagen Proteins 0.000 description 1
- 208000032170 Congenital Abnormalities Diseases 0.000 description 1
- 206010010356 Congenital anomaly Diseases 0.000 description 1
- 206010053138 Congenital aplastic anaemia Diseases 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 101150031350 Cxcl2 gene Proteins 0.000 description 1
- 101150097262 Cxcl5 gene Proteins 0.000 description 1
- 201000003883 Cystic fibrosis Diseases 0.000 description 1
- 102220605872 Cytosolic arginine sensor for mTORC1 subunit 2_D16A_mutation Human genes 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- 238000010442 DNA editing Methods 0.000 description 1
- 230000008301 DNA looping mechanism Effects 0.000 description 1
- 241000252212 Danio rerio Species 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- SNRUBQQJIBEYMU-UHFFFAOYSA-N Dodecane Natural products CCCCCCCCCCCC SNRUBQQJIBEYMU-UHFFFAOYSA-N 0.000 description 1
- 102100025682 Dystroglycan 1 Human genes 0.000 description 1
- 108010071885 Dystroglycans Proteins 0.000 description 1
- 101710147147 Dystrophin-1 Proteins 0.000 description 1
- 101150081946 ENPP2 gene Proteins 0.000 description 1
- 101150068427 EP300 gene Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 101150070878 Ereg gene Proteins 0.000 description 1
- 241000289659 Erinaceidae Species 0.000 description 1
- 108700039887 Essential Genes Proteins 0.000 description 1
- 108010022894 Euchromatin Proteins 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 101150064015 FAS gene Proteins 0.000 description 1
- 208000037149 Facioscapulohumeral dystrophy Diseases 0.000 description 1
- 201000004939 Fanconi anemia Diseases 0.000 description 1
- CWYNVVGOOAEACU-UHFFFAOYSA-N Fe2+ Chemical compound [Fe+2] CWYNVVGOOAEACU-UHFFFAOYSA-N 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 108090000386 Fibroblast Growth Factor 1 Proteins 0.000 description 1
- 102100031706 Fibroblast growth factor 1 Human genes 0.000 description 1
- 108700005088 Fungal Genes Proteins 0.000 description 1
- 230000010337 G2 phase Effects 0.000 description 1
- 101150111008 Gbp2 gene Proteins 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 108700023863 Gene Components Proteins 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 101150043233 H2-D1 gene Proteins 0.000 description 1
- 101150102784 H2-K1 gene Proteins 0.000 description 1
- 101150086355 HBG gene Proteins 0.000 description 1
- 108050008753 HNH endonucleases Proteins 0.000 description 1
- 102000000310 HNH endonucleases Human genes 0.000 description 1
- 102100031573 Hematopoietic progenitor cell antigen CD34 Human genes 0.000 description 1
- 102100021519 Hemoglobin subunit beta Human genes 0.000 description 1
- 108091005904 Hemoglobin subunit beta Proteins 0.000 description 1
- 102100039894 Hemoglobin subunit delta Human genes 0.000 description 1
- 102100030826 Hemoglobin subunit epsilon Human genes 0.000 description 1
- 208000031220 Hemophilia Diseases 0.000 description 1
- 208000009292 Hemophilia A Diseases 0.000 description 1
- 208000002972 Hepatolenticular Degeneration Diseases 0.000 description 1
- 108091027305 Heteroduplex Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- 108010030471 Histamine N-methyltransferase Proteins 0.000 description 1
- 101710155878 Histone acetyltransferase p300 Proteins 0.000 description 1
- 108010016918 Histone-Lysine N-Methyltransferase Proteins 0.000 description 1
- 102000000581 Histone-lysine N-methyltransferase Human genes 0.000 description 1
- 102100023696 Histone-lysine N-methyltransferase SETDB1 Human genes 0.000 description 1
- 101000775498 Homo sapiens Adenylate cyclase type 10 Proteins 0.000 description 1
- 101100389547 Homo sapiens EP300 gene Proteins 0.000 description 1
- 101000777663 Homo sapiens Hematopoietic progenitor cell antigen CD34 Proteins 0.000 description 1
- 101001035503 Homo sapiens Hemoglobin subunit delta Proteins 0.000 description 1
- 101001083591 Homo sapiens Hemoglobin subunit epsilon Proteins 0.000 description 1
- 101001046967 Homo sapiens Histone acetyltransferase KAT2A Proteins 0.000 description 1
- 101000684609 Homo sapiens Histone-lysine N-methyltransferase SETDB1 Proteins 0.000 description 1
- 101000843809 Homo sapiens Hydroxycarboxylic acid receptor 2 Proteins 0.000 description 1
- 101001076407 Homo sapiens Interleukin-1 receptor antagonist protein Proteins 0.000 description 1
- 101001057159 Homo sapiens Melanoma-associated antigen C3 Proteins 0.000 description 1
- 101001023043 Homo sapiens Myoblast determination protein 1 Proteins 0.000 description 1
- 101000601661 Homo sapiens Paired box protein Pax-7 Proteins 0.000 description 1
- 101001050288 Homo sapiens Transcription factor Jun Proteins 0.000 description 1
- 101000955999 Homo sapiens V-set domain-containing T-cell activation inhibitor 1 Proteins 0.000 description 1
- 101001074035 Homo sapiens Zinc finger protein GLI2 Proteins 0.000 description 1
- 208000023105 Huntington disease Diseases 0.000 description 1
- 208000000563 Hyperlipoproteinemia Type II Diseases 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- 101150061340 Ier2 gene Proteins 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 102100026018 Interleukin-1 receptor antagonist protein Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 101150026829 JUNB gene Proteins 0.000 description 1
- 101150092727 KLF10 gene Proteins 0.000 description 1
- 101150055061 LCN2 gene Proteins 0.000 description 1
- 108010001831 LDL receptors Proteins 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 208000009625 Lesch-Nyhan syndrome Diseases 0.000 description 1
- 239000012097 Lipofectamine 2000 Substances 0.000 description 1
- 102000007077 Lysine Acetyltransferases Human genes 0.000 description 1
- 108010033293 Lysine Acetyltransferases Proteins 0.000 description 1
- 101150049386 MMP3 gene Proteins 0.000 description 1
- 101150013665 MT1 gene Proteins 0.000 description 1
- 241000282560 Macaca mulatta Species 0.000 description 1
- 102100028123 Macrophage colony-stimulating factor 1 Human genes 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 108091027974 Mature messenger RNA Proteins 0.000 description 1
- 208000024556 Mendelian disease Diseases 0.000 description 1
- 201000009906 Meningitis Diseases 0.000 description 1
- 108010059724 Micrococcal Nuclease Proteins 0.000 description 1
- 101000957678 Mus musculus Cytochrome P450 7B1 Proteins 0.000 description 1
- 101000981253 Mus musculus GPI-linked NAD(P)(+)-arginine ADP-ribosyltransferase 1 Proteins 0.000 description 1
- 101100450096 Mus musculus H2-L gene Proteins 0.000 description 1
- 101100396696 Mus musculus Nfkbie gene Proteins 0.000 description 1
- 101001095964 Mus musculus Patatin-like phospholipase domain-containing protein 7 Proteins 0.000 description 1
- 101100519207 Mus musculus Pdcd1 gene Proteins 0.000 description 1
- 101100533945 Mus musculus Serpina3g gene Proteins 0.000 description 1
- 101100369238 Mus musculus Tgtp2 gene Proteins 0.000 description 1
- 101100369986 Mus musculus Tnfaip2 gene Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 208000010428 Muscle Weakness Diseases 0.000 description 1
- 206010028372 Muscular weakness Diseases 0.000 description 1
- 102100035077 Myoblast determination protein 1 Human genes 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 101100058191 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) bcp-1 gene Proteins 0.000 description 1
- 101150091206 Nfkbia gene Proteins 0.000 description 1
- 102000007399 Nuclear hormone receptor Human genes 0.000 description 1
- 108020005497 Nuclear hormone receptor Proteins 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 102000043276 Oncogene Human genes 0.000 description 1
- 239000012124 Opti-MEM Substances 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 108010008164 PAX7 Transcription Factor Proteins 0.000 description 1
- 102000006989 PAX7 Transcription Factor Human genes 0.000 description 1
- 102100035423 POU domain, class 5, transcription factor 1 Human genes 0.000 description 1
- 101710126211 POU domain, class 5, transcription factor 1 Proteins 0.000 description 1
- 101150000187 PTGS2 gene Proteins 0.000 description 1
- 102100037503 Paired box protein Pax-7 Human genes 0.000 description 1
- 241000282577 Pan troglodytes Species 0.000 description 1
- 241000701945 Parvoviridae Species 0.000 description 1
- 102000005877 Peptide Initiation Factors Human genes 0.000 description 1
- 108010044843 Peptide Initiation Factors Proteins 0.000 description 1
- 241000009328 Perro Species 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- 241000283080 Proboscidea <mammal> Species 0.000 description 1
- 102000007568 Proto-Oncogene Proteins c-fos Human genes 0.000 description 1
- 108010071563 Proto-Oncogene Proteins c-fos Proteins 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 230000026279 RNA modification Effects 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 101000957679 Rattus norvegicus 25-hydroxycholesterol 7-alpha-hydroxylase Proteins 0.000 description 1
- 208000007014 Retinitis pigmentosa Diseases 0.000 description 1
- 201000000582 Retinoblastoma Diseases 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 230000018199 S phase Effects 0.000 description 1
- 101150059392 SLC25A25 gene Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 208000022292 Tay-Sachs disease Diseases 0.000 description 1
- 208000002903 Thalassemia Diseases 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 208000035317 Total hypoxanthine-guanine phosphoribosyl transferase deficiency Diseases 0.000 description 1
- 108091028113 Trans-activating crRNA Proteins 0.000 description 1
- 102100023132 Transcription factor Jun Human genes 0.000 description 1
- 102100027671 Transcriptional repressor CTCF Human genes 0.000 description 1
- 206010045261 Type IIa hyperlipidaemia Diseases 0.000 description 1
- 108010079206 V-Set Domain-Containing T-Cell Activation Inhibitor 1 Proteins 0.000 description 1
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 1
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 1
- 206010047139 Vasoconstriction Diseases 0.000 description 1
- 241001416177 Vicugna pacos Species 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 208000018839 Wilson disease Diseases 0.000 description 1
- 201000006083 Xeroderma Pigmentosum Diseases 0.000 description 1
- 101150108009 ZFP36 gene Proteins 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 1
- 102100035558 Zinc finger protein GLI2 Human genes 0.000 description 1
- 108091007916 Zinc finger transcription factors Proteins 0.000 description 1
- 102000038627 Zinc finger transcription factors Human genes 0.000 description 1
- 230000000397 acetylating effect Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 201000010275 acute porphyria Diseases 0.000 description 1
- 230000004721 adaptive immunity Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 1
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 1
- 230000037444 atrophy Effects 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 238000007413 biotinylation Methods 0.000 description 1
- 230000006287 biotinylation Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 230000000747 cardiac effect Effects 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 101150038500 cas9 gene Proteins 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000024245 cell differentiation Effects 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 101150016727 ch25h gene Proteins 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 210000001136 chorion Anatomy 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000003081 coactivator Effects 0.000 description 1
- 229920001436 collagen Polymers 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000010219 correlation analysis Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 230000001351 cycling effect Effects 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 210000004292 cytoskeleton Anatomy 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 210000003785 decidua Anatomy 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 230000000994 depressogenic effect Effects 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 101150015424 dmd gene Proteins 0.000 description 1
- 238000009509 drug development Methods 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 210000001671 embryonic stem cell Anatomy 0.000 description 1
- 210000004696 endometrium Anatomy 0.000 description 1
- 230000007608 epigenetic mechanism Effects 0.000 description 1
- 210000000267 erythroid cell Anatomy 0.000 description 1
- 229950005470 eteplirsen Drugs 0.000 description 1
- 210000000632 euchromatin Anatomy 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 208000008570 facioscapulohumeral muscular dystrophy Diseases 0.000 description 1
- 201000001386 familial hypercholesterolemia Diseases 0.000 description 1
- 229940029303 fibroblast growth factor-1 Drugs 0.000 description 1
- IJJVMEJXYNJXOJ-UHFFFAOYSA-N fluquinconazole Chemical compound C=1C=C(Cl)C=C(Cl)C=1N1C(=O)C2=CC(F)=CC=C2N=C1N1C=NC=N1 IJJVMEJXYNJXOJ-UHFFFAOYSA-N 0.000 description 1
- 210000005153 frontal cortex Anatomy 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 238000010363 gene targeting Methods 0.000 description 1
- 231100000722 genetic damage Toxicity 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 210000002064 heart cell Anatomy 0.000 description 1
- 230000010224 hepatic metabolism Effects 0.000 description 1
- 208000033552 hepatic porphyria Diseases 0.000 description 1
- 208000006359 hepatoblastoma Diseases 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 238000012165 high-throughput sequencing Methods 0.000 description 1
- 230000006195 histone acetylation Effects 0.000 description 1
- 230000006197 histone deacetylation Effects 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 230000009610 hypersensitivity Effects 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000007913 intrathecal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 239000000644 isotonic solution Substances 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 210000003125 jurkat cell Anatomy 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 229940067606 lecithin Drugs 0.000 description 1
- 239000000787 lecithin Substances 0.000 description 1
- 235000010445 lecithin Nutrition 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 210000005229 liver cell Anatomy 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 102100031622 mRNA decay activator protein ZFP36 Human genes 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 239000000155 melt Substances 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 239000010445 mica Substances 0.000 description 1
- 229910052618 mica group Inorganic materials 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 235000013919 monopotassium glutamate Nutrition 0.000 description 1
- 210000005088 multinucleated cell Anatomy 0.000 description 1
- 201000000585 muscular atrophy Diseases 0.000 description 1
- 201000006938 muscular dystrophy Diseases 0.000 description 1
- 210000003098 myoblast Anatomy 0.000 description 1
- 210000000653 nervous system Anatomy 0.000 description 1
- 108020004017 nuclear receptors Proteins 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000005022 packaging material Substances 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 1
- 229940021222 peritoneal dialysis isotonic solution Drugs 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 238000002205 phenol-chloroform extraction Methods 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 238000000053 physical method Methods 0.000 description 1
- 230000009894 physiological stress Effects 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 230000002028 premature Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 210000002097 psoas muscle Anatomy 0.000 description 1
- 239000002510 pyrogen Substances 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 102000005912 ran GTP Binding Protein Human genes 0.000 description 1
- 238000005295 random walk Methods 0.000 description 1
- 238000009790 rate-determining step (RDS) Methods 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 238000001454 recorded image Methods 0.000 description 1
- 230000008263 repair mechanism Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000001718 repressive effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 102200082946 rs33948578 Human genes 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000004621 scanning probe microscopy Methods 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- 208000007056 sickle cell anemia Diseases 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 230000022379 skeletal muscle tissue development Effects 0.000 description 1
- 210000000813 small intestine Anatomy 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 101150082727 sod-3 gene Proteins 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 238000002948 stochastic simulation Methods 0.000 description 1
- 239000012536 storage buffer Substances 0.000 description 1
- 210000003699 striated muscle Anatomy 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 230000010741 sumoylation Effects 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 230000008093 supporting effect Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000003419 tautomerization reaction Methods 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 210000002435 tendon Anatomy 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 230000017423 tissue regeneration Effects 0.000 description 1
- 230000005100 tissue tropism Effects 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 238000010798 ubiquitination Methods 0.000 description 1
- 238000002604 ultrasonography Methods 0.000 description 1
- 238000009827 uniform distribution Methods 0.000 description 1
- 230000025033 vasoconstriction Effects 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 210000003501 vero cell Anatomy 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 229940075420 xanthine Drugs 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/111—General methods applicable to biologically active non-coding nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
- C12N15/907—Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
- G16B15/10—Nucleic acid folding
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
- G16B15/30—Drug targeting using structural data; Docking or binding prediction
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/10—Sequence alignment; Homology search
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/15011—Lentivirus, not HIV, e.g. FIV, SIV
- C12N2740/15041—Use of virus, viral particle or viral elements as a vector
Definitions
- the present disclosure is directed to optimized guide RNAs (gRNAs) and methods of designing and using said gRNAs that have increased target binding specificity and reduced off-target binding.
- gRNAs optimized guide RNAs
- RNA-guided endonucleases notably the protein Cas9
- Cas9 have been hailed as a potential "perfect genomic engineering tool" because they can be directed by a single 'guide RNA' molecule to cut DNA with nearly any sequence. This ability has been recently exploited for a number of emerging biological and medical applications, generating tremendous excitement and promise for their future use.
- practical genomic engineering requires extremely precise control over the ability to target selectively and cut precise DNA sequences, lest off-target DNA become inadvertently damaged and mutated.
- Cas9 is the endonuclease of the prokaryotic type II CRISPR (clustered, regularly interspaced, short palindromic repeats) - CRISPR-associated (Cas) response to invasive foreign DNA.
- Cas9 is first bound by a CRISPR RNA (crRNA) : tram'-activating crRNA (tracrRNA) duplex, and then directed to cleave DNA that contain 20 basepair (bp) 'protospacer' sites complementary to a variable 20 bp segment of the crRNA (FIG. 1 A).
- the Cas9-sgRNA complex binds to 20 bp 'protospacer' sequences in targeted DNA, provided that the protospacer is directly followed by a protospacer adjacent motif (PAM, here 'TGG'). Following binding, the Cas9 endonuclease produces double- strand breaks (triangles) within the protospacer.
- a short protospacer adjacent motif such as 'NGG' in the case of S. pyogenes Cas9, must immediately follow the protospacer sites in the foreign DNA molecule.
- RNA hairpin a single- guide RNA (sgRNA) hairpin which combine the essential components of crRNA : tracrRNA duplexes into single functional molecules.
- sgRNA single- guide RNA
- Cas9 can be introduced into a variety of organisms to produce targeted double strand breaks in vivo for remarkably facile genomic engineering.
- Nuclease-null Cas9 (D10A/H840A, known as 'dCas9') and chimeric dCas9 derivatives have also been used to alter gene expression via targeted binding at or near promoter sites in vivo as well as to introduce targeted epigenetic modifications.
- Cas9/dCas9 activity First, the most widespread effort is largely accomplished through intelligent selection of target sequences without similar other sequences in the genome, although a recent survey found that these methods performed poorly in their ability to predict off-target cleavage. Additionally, efforts have also been made to directly engineer the protein itself, through introduction of point mutations which were found to modulate or increase specificity in PAM or protospacer binding. Cas9 derivatives which only nick a single strand of DNA rather than perform double stranded DNA cleavage are also used in pairs ('paired nickases'), with the assumption that the probability that off-target nicking at multiple sites that are close enough to each other to produce a double-strand break would be extremely rare.
- the present invention is directed to a method of generating an optimized guide RNA (gRNA).
- the method comprises: a) identifying a target region of interest, the target region of interest comprising a protospacer sequence; b) determining a polynucleotide sequence of a full- length gRNA that targets the target region of interest, the full-length gRNA comprising a protospacer-targeting sequence or segment; c) determining at least one or more off-target sites for the full-length gRNA; d) generating a polynucleotide sequence of a first gRNA, the first gRNA comprising the polynucleotide sequence of the full-length gRNA and a RNA segment, the RNA segment comprising a polynucleotide sequence having a length of M nucleotides that is complementary to a nucleotide segment of the protospacer-targeting sequence or segment, the RNA segment is at the 5' end of the polynucleotide
- the present invention is directed to a method of generating an optimized guide RNA (gRNA).
- the method comprises: a) identifying a target region of interest, the target region of interest comprising a protospacer sequence; b) determining a polynucleotide sequence of a full- length gRNA that targets the target region of interest, the full-length gRNA comprising a protospacer-targeting sequence or segment; c) determining at least one or more off-target sites for the full-length gRNA; d) generating a polynucleotide sequence of a first gRNA, the first gRNA comprising the polynucleotide sequence of the full-length gRNA and a RNA segment, the RNA segment comprising a polynucleotide sequence having a length of M nucleotides that is complementary to a nucleotide segment of the protospacer-targeting sequence or segment, the RNA segment is at the 3' end of the polynucleotide
- the present invention is directed to an optimized gRNA generated by the methods described above.
- the present invention is directed to an isolated polynucleotide encoding the optimized gRNA described above.
- the present invention is directed to a vector comprising the isolated polynucleotide described above.
- the present invention is directed to a cell comprising the isolated polynucleotide described above or the vector described above.
- the present invention is directed to a kit comprising the isolated polynucleotide described above, the vector described above, or the cell described above.
- the present invention is directed to a method of epigenomic editing in a target cell or a subject.
- the method comprises contacting a cell or a subject with an effective amount of the optimized gRNA molecule described above and a fusion protein, the fusion protein comprising a first polypeptide domain comprising a nuclease-deficient Cas9 and a second polypeptide domain having an activity selected from the group consisting of transcription activation activity, transcription repression activity, nuclease activity, transcription release factor activity, histone modification activity, nucleic acid association activity, DNA methylase activity, and direct or indirect DNA demethylase activity.
- the present invention is directed to a method of site specific DNA cleavage in a target cell or a subject.
- the method comprises contacting a cell or a subject with an effective amount of the optimized gRNA molecule described above and a fusion protein or Cas9 protein, the fusion protein comprising a first polypeptide domain comprising a nuclease-deficient Cas9 and a second polypeptide domain having an activity selected from the group consisting of transcription activation activity, transcription repression activity, nuclease activity, transcription release factor activity, histone modification activity, nucleic acid association activity, DNA methylase activity, and direct or indirect DNA demethylase activity.
- the present invention is directed to a method of genome editing in a cell.
- the method comprises administering to the cell an effective amount of the optimized gRNA molecule described above and a fusion protein, the fusion protein comprising a first polypeptide domain comprising a nuclease-deficient Cas9 and a second polypeptide domain having an activity selected from the group consisting of transcription activation activity, transcription repression activity, nuclease activity, transcription release factor activity, histone modification activity, nucleic acid association activity, DNA methylase activity, and direct or indirect DNA demethylase activity.
- the present invention is directed to a method of modulating gene expression in a cell.
- the method comprises contacting the cell with an effective amount of the optimized gRNA described above and a fusion protein, the fusion protein comprising a first polypeptide domain comprising a nuclease-deficient Cas9 and a second polypeptide domain having an activity selected from the group consisting of transcription activation activity, transcription repression activity, nuclease activity, transcription release factor activity, histone modification activity, nucleic acid association activity, DNA methylase activity, and direct or indirect DNA demethylase activity.
- FIG. 1 A shows a schematic representation of Cas9 activity.
- FIG. IB shows an atomic force microscopy (AFM) image of dCas9-sgRNA bound at the protospacer sequence within a single streptavidin-labeled DNA molecule derived from the human AAVSl locus.
- AFM atomic force microscopy
- FIGS. 1C-1D show fraction of bound DNA occupied by Cas9/dCas9-sgRNA along an AAVSl -derived (FIG. 1C) or an engineered DNA substrate (FIG. ID) designed with a series of fully-complementary and partially-complementary protospacer sequences. Vertical lines represent the (23 bp) segments where each significant feature is located on the respective substrates.
- FIGS. 2A-2D show modulation of binding affinity and specificity by guide RNA variants.
- FIG. 2A shows a schematic of dCas9 bound to a single-guide RNA with a two nucleotide truncation from its 5' - end (tru-gRNA, purple).
- FIG. 2B shows a schematic and proposed mechanism of dCas9 bound to a single-guide RNA with 5'- end extension that forms a hairpin with the PAM-distal binding segment of its targeting region (hp-gRNA, blue).
- FIG. 2D shows single-site binding affinities (K A ) for dCas9 with guide RNAs with 5' - hairpins that overlap the nucleotides complementary to the last six (hp6-gRNA, blue) or ten (hplO-gRNA, green) PAM-distal nucleotides of the protospacer.
- FIGS. 3 A-3D show Cas9 undergoes a progressive conformational transition as it binds to sites that increasingly match the protospacer sequence.
- FIG. 3 A shows fraction of bound DNA occupied by Cas9/dCas9 along the DNA substrates, with colours representing populations of Cas9/dCas9 clustered according to their structures (by mean-squared difference after alignment, see text).
- 3D shows mean volumes and heights of Cas9/dCas9 with sgRNAs (red circles, with red labels for Cas9 and blue labels for dCas9) or tru-gRNAs (purple circles) bound at each feature on the substrates. Note that dCas9 with tru-gRNAs are only expected to interact the first 3 or 8 PAM-distal mismatches of the 5MM and 10MM sites (labelled '3MM' and '8MM' here, respectively). For standard errors of mean volumes and heights, see Table 2.
- FIGS. 4A-4D show Kinetic Monte Carlo (KMC) experiments revealing differences in the stability of the R-loop, or the structure formed by the protospacer duplex with an invading guide RNA, within stably bound Cas9 for different guide RNA variants.
- FIG. 4A shows a schematic of strand invasion of the protospacer (green) by the guide RNA (red) for KMC experiments. The R-loop is highlighted.
- Transition rates for invasion (v/for the rate of m ⁇ m + 1, where m is the extent of the strand invasion or, equivalently, the length of the R-loop) or duplex re-annealing (v r for the rate of m ⁇ m - 1) are a function of the nearest-neighbour DNA:DNA and RNA:DNA hybridization energies. See text and Supplementary Methods for details.
- Asterisks highlight the starting position for the simulation, (insert) Histogram of the respective lifetimes during which the R- loop is > 16 bp long.
- FIG. 4D shows proposed model for the mechanisms governing Cas9/dCas9 specificity, based on results of AFM imaging and kinetic Monte Carlo (KMC) experiments (see main text).
- Cas9/dCas9 binds to the PAM and the guide RNA invades into the PAM-adjacent protospacer duplex. During this strand invasion, the guide RNA must displace the
- FIGS. 5A-5C show Kinetic Monte Carlo (KMC) experiments reveal differences in ability to traverse mismatches (MM) and invade the protospacer depending on guide RNA structure.
- White X's indicate positions of mismatches. Simulation run until t > 10,000 (arbitrary units) and the results are averaged over 100 trials.
- FIG. 5C shows
- FIGS. 6A-6B show experimental (Hsu et ol. (2013) Nature biotechnology, 31, 827- 832) cutting frequencies at target sites containing a single rG dG, rC dC, rA dA, and rU dT mismatch in the PAM-distal region (> 10 th protospacer site) are correlated with stabilities of the R-loop determined from kinetic Monte Carlo experiments.
- FIG. 6A shows logio(p-value) of the correlations between Cas9 cutting frequency and stability of R-loop at sites m (fraction of time the guide RNA remains bound to the protospacer at site m, see text) during strand invasion initiated at site m.
- Colour corresponds to the correlation coefficient.
- FIGS. 7A-7C show a summary of proposed mechanisms by which the structure of the guide RNA affects Cas9/dCas9 specificity.
- FIG. 7A shows that for the single guide RNA
- sgRNA the first few nucleotides of the RNA (which bind to the 18th - 20th sites of the protospacer) stabilize R-loop breathing and binding at the 14th - 17th sites of the protospacer, allow efficient conformational transition to the active state to permit cleavage.
- this increased stability imparted by these bases allows for transient stabilization at mismatched sites and the conformational change permitting cleavage. In many cases, having traversed a mismatch, R-loops remain stably fully-invaded.
- FIG. 7B shows that for guide RNAs with the first few (here 2) nucleotides truncated (tru-gRNA), the reduced stability of the R-loop (characterized by significant volatility) decreases the probability of maintaining the active conformation. When there are mismatched sites in the protospacer, the volatility of the R-loop ensures that it will becomes quickly and repeatedly 're-trapped' behind the mismatch and greatly hindered at those sites.
- FIG. 7C shows that while 'simple' extensions of the 5' - end of the guide RNA to target the protospacer and adjoining sites beyond the protospacer was found to be digested back to approximately sgRNA length in vivo (FIG.
- hp-gRNAs guide RNAs with 5' - hairpins complementary to 'PAM-distal' -targeting segments
- hp-gRNAs guide RNAs with 5' - hairpins complementary to 'PAM-distal' -targeting segments
- FIGS. 8A-8B show purity of expressed Cas9 and dCas9 in SDS gel of purified Cas9 (FIG. 8 A) and dCas9 (FIG. 8B) products (nominal molecular weight: 160 kDa). Eluted bands show product is -95% pure.
- FIGS. 9A-9C show additional images of Cas9 / dCas9 bound to DNA.
- A) Binding distribution of dCas9 to substrate containing no homology to the AAVsl protospacer sequence (compare with FIG. 1) (n 443). Overlaid is the cumulative distribution (CDF) of PAM sites (CDFPAM, black) and CDF of bases bound by dCas9 (red, CDFc AS 9)- Comparison begins 100 bases from each end to avoid artifacts introduced by overlap with streptavidin tag (a criteria for DNA selection) and binding to exposed blunt ends of DNA (resulting in expected increase in non-specific binding).
- Dashed line is Kolmogorov-Smirnov criterion for goodness-of-fit of two distributions.
- Vertical red line is experimental Sup(D n ), indicating that experimental dCas9 binding more closely matches the experimental PAM distribution than it does to 71.20% of generated sequences.
- FIGS. lOA-lOC show binding to 'nonsense' substrate containing no homology (>3 bp) to protospacer sequence.
- A Images of dCas9 alone.
- mean height is 1.746 nm (95% confidence: 1.689 nm - 1.802 nm) with standard deviation 0.441 nm
- mean volume is 1302 nm 3 (95% confidence: 1266 nm 3 - 1337 nm 3 ) with standard deviation 259.1 nm 3 (note that because the dCas9 here do not have a DNA within its binding channel, their recorded volumes may appear artificially low because of decreased mechanical resistance to the AFM probe).
- the heights were measured relative to the median value of a 10- pixel area surrounding each protein, and the volumes recorded as the contiguous features greater than twice the standard deviation of the local background heights.
- C Additional representative images of dCas9 bound to DNA which has been labeled at one end with a monovalent streptavidin.
- FIGS. 1 1 A-l ID show a representative figure of dCas9-sgRNA bound to RNA and example of processing of protein structural properties.
- FIG. 1 1 A shows a representative wide- field image of dCas9 bound to engineered DNA.
- FIG. 1 IB shows a close-up of boxed region.
- White arrows are monovalent streptavidin and red arrows are dCas9 proteins.
- FIGS. 1 lC-1 ID show an example of extraction from original image (FIG. 11C) and isolation (FIG. 1 ID) of Cas9/dCas9 structures.
- FIGS. 12A-12B show properties of Cas9/dCas9-sgRNAs mapped to their respective binding sites.
- the binding distribution of extracted Cas9/dCas9 molecules (FIGS. lOA-lOC) closely matches that of the entire dataset (FIG. 1C-1D, FIGS. 8A- 8B), indicating that the selection procedure is unbiased and the selected proteins are
- FIG. 13 shows structural properties of Cas9/dCas9 with tru-gRNA and hp-gRNAs at their respective binding sites. Fraction of bound DNA occupied by Cas9/dCas9 with along the engineered DNA substrate, with colors representing populations of Cas9/dCas9 clustered according to their structures (see FIG. 3C). Protein structures were classified according to the dCas9/Cas9 with sgRNA that they most closely resembled (by mean-squared difference after alignment, see text).
- FIGS. 14A-14C show model of the strand invasion of DNA protospacers by guide RNAs, and estimated binding stabilities of RNA invaded into protospacers with PAM-distal mismatches.
- the invaded RNAs are highly unstable at protospacer sites with 15 PAM-distal mismatches (15MM), and experimentally we rarely observe Cas9/dCas9 bound at these sites (FIG. ID).
- the invaded RNA (prior to dissociation) at protospacer sites with 10 or 5 PAM-distal mismatches (10MM and 5MM) are calculated to remain for significantly longer than those at 15MM sites, but within an order of magnitude of each other; we find their binding propensity to be approximately equal and lower than full protospacer sites (0MM) in AFM experiments.
- FIG. 15 shows simulated mean first passage times to traverse the mismatched site during strand invasion by sgRNA and tru-gRNA. Simulated (kinetic Monte Carlo) mean first passage times to traverse the mismatched site during strand invasion by sgRNA (blue) and tru- gRNA (red) for different positions of the mismatched site. Error bars are standard deviations of recorded first passage times. Sequence of protospacer (AAVS1 site) in box. [0037] FIGS. 16A-16B shows correlations between Cas9 cleavage frequency (Hsu et al. (2013) Nature Biotechnology, 31, 827-832) and measures of R-loop stability derived from kinetic Monte Carlo. FIG.
- FIG. 16B shows correlation between fractions of time the R-loop is of size m vs. the probability that the kinetic Monte Carlo trial predicts that the invading strand will dissociate before traversing the mismatch.
- Binding at sites 10 ⁇ 14-15 is very strongly anti-correlated (-0.5-0.85) with the probability of dissociation before traversing the mismatch, while from the AFM imaging experiments we find that binding at sites ⁇ >16 are associated with a conformational change in the Cas9/dCas9.
- FIG. 17 shows a summary of Deep-Seq data, comparing ontarget activities.
- FIG. 18 shows a summary of Deep-Seq data, comparing specificity increases.
- FIG. 19 shows protospacerl, Dystrophin; Lane 1 shows GFP Control; Lane 2 shows Full gRNA; Lane 3 shows Tru-gRNA 19 nt; Lane 4 shows Tru-gRNA 18 nt; Lane 5 shows Tru- gRNA 17 nt; Lane 6 shows Tru-gRNA 16 nt; Lane 7 shows Hp-gRNA 4 bp; Lane 8 shows Hp- gRNA 5 bp; Lane 9 shows Hp-gRNA 6 bp; Lane 10 shows Hp-gRNA 7 bp; Lane 11 shows Hp- gRNA 8 bp; and Lane 12 shows Hp-gRNA 9 bp, hairpin 1 (Lane 12, 9nt hp) - GtgagtoggttcgCCTACTCAGACTGTTACTC (SEQ ID NO: 335), wherein italicized is part of hairpin and underlined is the hairpin loop.
- Lane 1 shows GFP Control
- Lane 3 shows Tru-gRNA 19 nt
- Lane 4 shows Tru
- FIG. 20 shows protospacerl, Dystrophin, internal loops
- FIG. 21 shows Calculated secondary structures of the 5' - ends of the protospacer- targeting segments of hp-gRNAs used for Deep Seq experiments (using NuPack software suite). Colors are probability of each nucleotide existing in that secondary structure at equilibrium.
- FIG. 22 shows Dystrophin, indel rates, all sites
- FIG. 23 shows Dystrophin, ontarget/sum(offtargets).
- FIGS. 26A-26C show hairpin structures.
- FIG. 26A shows hairpin 1 which is a 6 bp 5'- hairpin.
- FIG. 26B shows hairpin 2 which is a 5 bp 5' - hairpin on 18 nt (truncated) gRNA.
- FIG. 26A shows hairpin 1 which is a 6 bp 5'- hairpin.
- FIG. 26B shows hairpin 2 which is a 5 bp 5' - hairpin on 18 nt (truncated) gRNA.
- FIG. 26A shows hairpin 1 which is a 6 bp 5'- hairpin.
- FIG. 26B shows hairpin 2 which is a 5 bp 5' - hairpin on 18 nt (truncated) gRNA.
- 26C shows hairpin 3 which is a 3 bp 5' - hairpin.
- FIG. 27 shows EMX1, Indel rates, all sites.
- FIG. 28 shows EMX1, indel rates, low-rate offtargets.
- FIG. 29 shows EMX1, ontarget/sum(offtargets).
- FIG. 30 shows protospacer3, VEGFAl .
- Lane 1 shows GFP Control
- Lane 2 shows Full gRNA
- Lane 3 shows Tru-gRNA
- Lane 4 shows 10-bp hp-gRNA
- Lane 5 shows 6-bp hp-gRNA.
- FIG. 31 shows protospacer3, VEGFAl : pam proximal hairpins.
- Lane 1 shows GFP control; Lane 2 shows Full gRNA; Lane 3 shows hp-gRNAl; Lane 4 shows hp-gRNA2; Lane 5 shows hp-gRNA3; Lane 6 shows hp-gRNA4; Lane 7 shows hp-gRNA5; and Lane 8 shows hp- gRNA6.
- FIG. 32 shows protospacer3, VEGFAl : pam proximal hairpins.
- FIG. 33 shows protospacer3, VEGFl, internal loops.
- Lane 1 shows Control; lane 2 shows Full; lane 3 shows 2nt hp; lane 4 shows 3nt hp, hairpin 5; and lane 5 shows 4nt hp.
- FIGS. 34A and 34B show Deep-seq Experiments for hairpins 1, 2, and 3 failed.
- FIG. 25A shows Hairpin 4 - Computationally-derived hairpin designed to discriminate against Off- target site 2 while maintaining on-target activity.
- FIG. 25B shows Hairpin 5-4 bp 5' - hairpin (gRNA normally has significant 3' secondary structure).
- FIG. 35 shows VEGFl, indel rates, all sites.
- FIG. 36 shows VEGFl, indel rates, low-rate offtargets.
- FIG. 37 shows VEGFl, ontarget/sum(offtargets).
- FIG. 38 shows protospacer 4, VEGFA3.
- Lane 1 shows GFP Control
- Lane 2 shows Full gRNA
- Lane 3 shows Tru-gRNA
- Lane 4 shows 3 -bp hp-gRNA
- Lane 5 shows 4-bp hp- gRNA
- Lane 6 shows 5-bp hp-gRNA
- Lane 7 shows 6-bp hp-gRNA
- Lane 8 shows 10-bp hp-gRNA.
- FIG. 39 shows gRNA4, VEGFA3 : pam proximal hairpins.
- Lane 1 shows GFP control; Lane 2 shows Full gRNA; Lane 3 shows hp-gRNAl; Lane 4 shows hp-gRNA2; Lane 5 shows hp-gRNA3; Lane 6 shows hp-gRNA4; Lane 7 shows hp-gRNA5; and Lane 8 shows hp-gRNA6.
- FIG. 40A shows Hairpin 1- 4 bp hairpin targeting 3'- region.
- FIG. 40B shows Hairpin 2-4 bp hairpin targeting 3' - region with G-U wobble pairs.
- FIG. 40 shows Hairpin 3- 4 bp hairpin targeting 3'- region with G-U wobble pair (variant design).
- FIG. 41 shows VEGF3, indel rates, all sites.
- FIG. 42 shows VEGF3, indel rates, low-rate offtargets.
- FIG. 43 shows VEGF3, ontarget/sum(offtargets).
- FIG. 44 A shows a hairpin designed to target EMX1 gene.
- FIG. 44B shows the EMXl-sgl sequence of the hairpin of FIG. 44A.
- FIG. 44C shows the effect of decreasing protospacer length and increasing hairpin length on specificity.
- FIG. 45 A-45D show DNA/RNA Sequences.
- FIG. 46 shows a figure that describes the Surveyor assays.
- FIG. 47 shows tolerance of AsCpfl and LbCpfl to mismatched or truncated crRNAs and endogenous gene modification by AsCpfl and LbCpfl using crRNAs that contain singly mismatched bases.
- FIG. 48 shows surveyor assay results for hp-gRNAs used with a Type V CRISPR system in which a hairpin is added to the 3' end of a full-length gRNA to abolish off-target activity.
- composition and methods for site specific DNA targeting and epigenomic gene editing and/or transcriptional regulation such as DNA cleavage and gene activation or repression.
- the present invention is directed to a modular method for designing and using optimized guide RNAs that have hairpin structures (hpgRNA) that can be easily incorporated into the existing biotechnology infrastructure and which results in a controlled decrease of off-target activity, all while maintaining the ability to target the correct DNA sequence specifically.
- the methods described herein provide a novel approach to engineering the optimized gRNA to perform significantly better than other available methods and can be used in combination with other protein-specific means of improving increasing specifically for highly improved performance.
- the disclosed methods and optimized gRNAs have the great advantage of being easily adapted to current methodologies and infrastructures already in place to perform RNA-guided genomic engineering.
- Cas9, dCas9, or Cpfl are delivered into a cell using viral vectors along with vectors coding for the transcription of the optimized gRNAs in the cell.
- the current invention would require only a few additional nucleotides to the vector coding for the optimized gRNA, which can be easily accommodated by the current and standard practices.
- the optimized gRNAs or hpgRNAs can be used in combination with paired nickases, for example, or other modifications of the
- the optimized gRNA abolishes or significantly weakens activity at targets containing only a few mismatched DNA sequences, which tend to be the sites at which off-target activity by RNA-guided endonucleases occurs.
- the optimized gRNA also provide specificity of cleavage activity in mammalian cells at sites which are known to induce off-target activity even in the best known improvements to the guide RNAs.
- the invention is a generally-applicable method to decrease off-target activity by RNA-guided endonucleases, particularly Cas9, by engineering changes the structural design of the guide RNA.
- each intervening number there between with the same degree of precision is explicitly contemplated.
- the numbers 7 and 8 are contemplated in addition to 6 and 9, and for the range 6.0-7.0, the number 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, and 7.0 are explicitly contemplated.
- Adeno-associated virus or "AAV” as used interchangeably herein refers to a small virus belonging to the genus Dependovirus of the Parvoviridae family that infects humans and some other primate species. AAV is not currently known to cause disease and consequently the virus causes a very mild immune response.
- Binding region refers to the region within a nuclease target region that is recognized and bound by the nuclease, such as Cas9.
- Chromatin refers to an organized complex of chromosomal DNA associated with histones.
- CREs Crohn's disease
- CRISPRs Clustered Regularly Interspaced Short Palindromic Repeats
- CRISPRs CRISPRs
- Coding sequence or "encoding nucleic acid” as used herein means the nucleic acids (RNA or DNA molecule) that comprise a nucleotide sequence which encodes a protein.
- the coding sequence can further include initiation and termination signals operably linked to regulatory elements including a promoter and polyadenylation signal capable of directing expression in the cells of an individual or mammal to which the nucleic acid is administered.
- the coding sequence may be codon optimize.
- “Complement” or “complementary” as used herein means a nucleic acid can mean Watson-Crick (e.g., A-T/U and C-G) or Hoogsteen base pairing between nucleotides or nucleotide analogs of nucleic acid molecules. "Complementarity” refers to a property shared between two nucleic acid sequences, such that when they are aligned antiparallel to each other, the nucleotide bases at each position will be complementary.
- Correcting or restoring a mutant gene may include replacing the region of the gene that has the mutation or replacing the entire mutant gene with a copy of the gene that does not have the mutation with a repair mechanism such as homology-directed repair (HDR).
- HDR homology-directed repair
- Correcting or restoring a mutant gene may also include repairing a frameshift mutation that causes a premature stop codon, an aberrant splice acceptor site or an aberrant splice donor site, by generating a double stranded break in the gene that is then repaired using non-homologous end joining (HEJ).
- HEJ may add or delete at least one base pair during repair which may restore the proper reading frame and eliminate the premature stop codon.
- Correcting or restoring a mutant gene may also include disrupting an aberrant splice acceptor site or splice donor sequence.
- Correcting or restoring a mutant gene may also include deleting a non-essential gene segment by the simultaneous action of two nucleases on the same DNA strand in order to restore the proper reading frame by removing the DNA between the two nuclease target sites and repairing the DNA break by NHEJ.
- Demethylases refers to an enzyme that removes methy (CH3-) groups from nucleic acids, proteins (in particular histones), and other molecules. Demethylase enzymes are important in epigenetic modification mechanisms. The demethylase proteins alter transcriptional regulation of the genome by controlling the methylation levels that occur on DNA and histones and, in turn, regulate the chromatin state at specific gene loci within organisms. "Histone demethylase” refers to a methylase that removes methy groups from histones. There are several families of histone demethylases, which act on different substrates and play different roles in cellular function. The Fe(II)-dependent lysine demethylases may be a JMJC
- a JMJC demethylase is a histone demethylase containing a JumonjiC (JmjC) domain.
- the JMJC demethylase may be a member of the KDM3, KDM4, KDM5, or KDM6 family of histone demethylases.
- DNase I hypersensitive sites or "DHS” as used interchangeably herein refers to docking sites for the transcription factors and chromatin modifiers, including p300 that coordinate distal target gene expression.
- Donor DNA refers to a double-stranded DNA fragment or molecule that includes at least a portion of the gene of interest.
- the donor DNA may encode a full-functional protein or a partially-functional protein.
- Endogenous gene refers to a gene that originates from within an organism, tissue, or cell.
- An endogenous gene is native to a cell, which is in its normal genomic and chromatin context, and which is not heterologous to the cell.
- Such cellular genes include, e.g., animal genes, plant genes, bacterial genes, protozoal genes, fungal genes, mitochondrial genes, and chloroplastic genes.
- An "endogenous target gene” as used herein refers to an endogenous gene that is targeted by an optimized gRNA and CRISPR/Cas9-based system or CRISPR/Cpfl -based system.
- Enhancer refers to non-coding DNA sequences containing multiple activator and repressor binding sites. Enhancers range from 50 bp to 1500 bp in length and may be either proximal, 5' upstream to the promoter, within any intron of the regulated gene, or distal, in introns of neighboring genes, or intergenic regions far away from the locus, or on regions on different chromosomes. More than one enhancer may interact with a promoter.
- enhancers may regulate more than one gene without linkage restriction and may "skip" neighboring genes to regulate more distant ones.
- Transcriptional regulation may involve elements located in a chromosome different to one where the promoter resides.
- Proximal enhancers or promoters of neighboring genes may serve as platforms to recruit more distal elements.
- DMD Duchenne Muscular Dystrophy
- DMD is a common hereditary monogenic disease and occurs in 1 in 3500 males. DMD is the result of inherited or spontaneous mutations that cause nonsense or frame shift mutations in the dystrophin gene. The majority of dystrophin mutations that cause DMD are deletions of exons that disrupt the reading frame and cause premature translation termination in the dystrophin gene. DMD patients typically lose the ability to physically support themselves during childhood, become progressively weaker during the teenage years, and die in their twenties.
- Dystrophin refers to a rod-shaped cytoplasmic protein which is a part of a protein complex that connects the cytoskeleton of a muscle fiber to the surrounding extracellular matrix through the cell membrane.
- Dystrophin provides structural stability to the dystroglycan complex of the cell membrane that is responsible for regulating muscle cell integrity and function.
- the dystrophin gene or "DMD gene” as used interchangeably herein is 2.2 megabases at locus Xp21. The primary transcription measures about 2,400 kb with the mature mRNA being about 14 kb. 79 exons code for the protein which is over 3500 amino acids.
- Exon 51 refers to the 51 st exon of the dystrophin gene. Exon 51 is frequently adjacent to frame-disrupting deletions in DMD patients and has been targeted in clinical trials for oligonucleotide-based exon skipping. A clinical trial for the exon 51 skipping compound eteplirsen recently reported a significant functional benefit across 48 weeks, with an average of 47% dystrophin positive fibers compared to baseline. Mutations in exon 51 are ideally suited for permanent correction by NHEJ-based genome editing.
- “Frameshift” or “frameshift mutation” as used interchangeably herein refers to a type of gene mutation wherein the addition or deletion of one or more nucleotides causes a shift in the reading frame of the codons in the mRNA.
- the shift in reading frame may lead to the alteration in the amino acid sequence at protein translation, such as a missense mutation or a premature stop codon.
- Full-length gRNA or “standard gRNA” as used interchangeably herein refers to a gRNA that includes a “scaffold” and a protospacer-targeting sequence or segment that is typically 20 nucleotides in length.
- a “functional gene” refers to a gene transcribed to mRNA, which is translated to a functional protein.
- Fusion protein refers to a chimeric protein created through the joining of two or more genes that originally coded for separate proteins. The translation of the fusion gene results in a single polypeptide with functional properties derived from each of the original proteins.
- Genetic construct refers to the DNA or RNA molecules that comprise a nucleotide sequence that encodes a protein. The coding sequence includes initiation and termination signals operably linked to regulatory elements including a promoter and
- the term "expressible form” refers to gene constructs that contain the necessary regulatory elements operable linked to a coding sequence that encodes a protein such that when present in the cell of the individual, the coding sequence will be expressed.
- Genetic disease refers to a disease, partially or completely, directly or indirectly, caused by one or more abnormalities in the genome, especially a condition that is present from birth.
- the abnormality may be a mutation, an insertion or a deletion.
- the abnormality may affect the coding sequence of the gene or its regulatory sequence.
- the genetic disease may be, but not limited to DMD, hemophilia, cystic fibrosis, Huntington's chorea, familial hypercholesterolemia (LDL receptor defect), hepatoblastoma, Wilson's disease, congenital hepatic porphyria, inherited disorders of hepatic metabolism, Lesch Nyhan syndrome, sickle cell anemia, thalassaemias, xeroderma pigmentosum, Fanconi's anemia, retinitis pigmentosa, ataxia telangiectasia, Bloom's syndrome, retinoblastoma, and Tay-Sachs disease.
- DMD hemophilia
- cystic fibrosis Huntington's chorea
- hepatoblastoma Wilson's disease
- congenital hepatic porphyria congenital hepatic porphyria
- inherited disorders of hepatic metabolism Lesch Nyhan
- Gene refers to the complete set of genes or genetic material present in a cell or organism.
- the genome includes DNA or RNA in RNA viruses.
- the genome includes both the genes, (the coding regions), the noncoding DNA and the genomes of the mitochondria and chloroplasts.
- guide RNA refers to a short synthetic RNA composed of a “scaffold” sequence necessary for Cas9-binding or Cpfl -binding and a user-defined “spacer” or “targeting sequence " (also referred to herein as a protospacer-targeting sequence or segment) which defines the genomic target to be modified.
- hpgRNA refers to a gRNA that has additional nucleotides at either the 5'- end or 3' - end that can form a secondary structure with all or part of the protospacer-targeting sequence or segment.
- Histone acetyltransferases or “HATs” are used interchangeably herein refers to enzymes that acetylate conserved lysine amino acids on histone proteins by transferring an acetyl group from acetyl CoA to form ⁇ - ⁇ -acetyllysine. DNA is wrapped around histones, and, by transferring an acetyl group to the histones, genes can be turned on and off. In general, histone acetylation increases gene expression as it is linked to transcriptional activation and associated with euchromatin. Histone acetyltransferases can also acetylate non-histone proteins, such as nuclear receptors and other transcription factors to facilitate gene expression.
- KDAC lysine deacetylases
- Histone methyltransferase or “HMTs” as used interchangeably herein refers to hi stone-modifying enzymes (e.g., histone-lysine N-methyltransferases and histone-arginine N- methyltransferases), that catalyze the transfer of one, two, or three methyl groups
- HDR Homology-directed repair
- HDR uses a donor DNA template to guide repair and may be used to create specific sequence changes to the genome, including the targeted addition of whole genes. If a donor template is provided along with the site specific nuclease, such as with a CRISPR/Cas9-based system or CRISPR/Cpfl -based system, then the cellular machinery will repair the break by homologous recombination, which is enhanced several orders of magnitude in the presence of DNA cleavage. When the homologous DNA piece is absent, non-homologous end joining may take place instead.
- Gene refers to the complete set of genes or genetic material present in a cell or organism.
- the genome includes DNA or RNA in RNA viruses.
- the genome includes both the genes, (the coding regions), the noncoding DNA and the genomes of the mitochondria and chloroplasts.
- Genome editing refers to changing a gene. Genome editing may include correcting or restoring a mutant gene. Genome editing may include knocking out a gene, such as a mutant gene or a normal gene. Genome editing may be used to treat disease or enhance muscle repair by changing the gene of interest. [00109] "Identical” or “identity” as used herein in the context of two or more nucleic acids or polypeptide sequences means that the sequences have a specified percentage of residues that are the same over a specified region.
- the percentage may be calculated by optimally aligning the two sequences, comparing the two sequences over the specified region, determining the number of positions at which the identical residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the specified region, and multiplying the result by 100 to yield the percentage of sequence identity.
- the residues of single sequence are included in the denominator but not the numerator of the calculation.
- thymine (T) and uracil (U) may be considered equivalent. Identity may be performed manually or by using a computer sequence algorithm such as BLAST or BLAST 2.0.
- Insulators refers to a genetic boundary element that blocks the interaction between enhancers and promoters. By residing between the enhancer and promoter, the insulator may inhibit their subsequent interactions. Insulators can determine the set of genes an enhancer can influence. Insulators are needed where two adjacent genes on a chromosome have very different transcription patterns and the inducing or repressing mechanisms of one does not interfere with the neighboring gene. Insulators have also been found to cluster at the boundaries of topological association domains (TADs) and may have a role in partitioning the genome into "chromosome neighborhoods" - genomic regions within which regulation occurs. Insulator activity is thought to occur primarily through the 3D structure of DNA mediated by proteins including CTCF.
- TADs topological association domains
- Insulators are likely to function through multiple mechanisms. Many enhancers form DNA loops that put them in close physical proximity to promoter regions during transcriptional activation. Insulators may promote the formation of DNA loops that prevent the promoter-enhancer loops from forming. Barrier insulators may prevent the spread of
- “Invasion” as used herein refers to the disruption of a DNA duplex at a protospacer region in a target region of a target gene, such as by a gRNA that binds to the DNA sequence that is complementary to the protospacer.
- “Invasion kinetics” as used herein refers to the rate at which invasion proceeds.
- Invasion kinetics can refer to the rate at which the guide RNA invades the duplex, either to "full invasion” such that the protospacer is completely invaded, or the rate at which the segment of protospacer DNA bound to the guide RNA expands as it is displaced from its complementary strand and bound to the guide RNA nucleotide-by-nucleotide from its PAM-proximal region through to full invasion.
- Lifetime refers to period of time that a gRNA remains invaded in the region in a target region of a target gene.
- Locus control regions refers to a long-range cis-regulatory element that enhances expression of linked genes at distal chromatin sites. It functions in a copy number-dependent manner and is tissue-specific, as seen in the selective expression of ⁇ - globin genes in erythroid cells. Expression levels of genes can be modified by the LCR and gene-proximal elements, such as promoters, enhancers, and silencers. The LCR functions by recruiting chromatin-modifying, coactivator, and transcription complexes. Its sequence is conserved in many vertebrates, and conservation of specific sites may suggest importance in function.
- mismatched or “MM” as used interchangeably herein refers to mismatched bases that include a G/T or A/C pairing. Mismatches are commonly due to tautomerization of bases during G2. The damage is repaired by recognition of the deformity caused by the mismatch, determining the template and non-template strand, and excising the wrongly incorporated base and replacing it with the correct nucleotide.
- Modulate as used herein may mean any altering of activity, such as regulate, down regulate, upregulate, reduce, inhibit, increase, decrease, deactivate, or activate.
- mutant gene or “mutated gene” as used interchangeably herein refers to a gene that has undergone a detectable mutation.
- a mutant gene has undergone a change, such as the loss, gain, or exchange of genetic material, which affects the normal transmission and expression of the gene.
- a "disrupted gene” as used herein refers to a mutant gene that has a mutation that causes a premature stop codon. The disrupted gene product is truncated relative to a full-length undisrupted gene product.
- NHEJ Non-homologous end joining pathway
- the template-independent re-ligation of DNA ends by NHEJ is a stochastic, error-prone repair process that introduces random micro-insertions and micro- deletions (indels) at the DNA breakpoint. This method may be used to intentionally disrupt, delete, or alter the reading frame of targeted gene sequences.
- NHEJ typically uses short homologous DNA sequences called microhomologies to guide repair. These microhomologies are often present in single-stranded overhangs on the end of double-strand breaks. When the overhangs are perfectly compatible, NHEJ usually repairs the break accurately, yet imprecise repair leading to loss of nucleotides may also occur, but is much more common when the overhangs are not compatible.
- Normal gene refers to a gene that has not undergone a change, such as a loss, gain, or exchange of genetic material. The normal gene undergoes normal gene transmission and gene expression.
- Nuclease mediated NHEJ refers to NHEJ that is initiated after a nuclease, such as a cas9, cuts double stranded DNA.
- nucleic acid or "oligonucleotide” or “polynucleotide” as used herein means at least two nucleotides covalently linked together. The depiction of a single strand also defines the sequence of the complementary strand. Thus, a nucleic acid also encompasses the
- nucleic acid also encompasses
- nucleic acids substantially identical nucleic acids and complements thereof.
- a single strand provides a probe that may hybridize to a target sequence under stringent hybridization conditions.
- a nucleic acid also encompasses a probe that hybridizes under stringent hybridization conditions.
- Nucleic acids may be single stranded or double stranded, or may contain portions of both double stranded and single stranded sequence.
- the nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and combinations of bases including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine hypoxanthine, isocytosine and isoguanine.
- Nucleic acids may be obtained by chemical synthesis methods or by recombinant methods.
- On-target site refers to the target region or sequence in a genome to which the gRNA is intended to target. Ideally, the on-target site has perfect homology (100% identity or homology) to the target DNA sequence with no homology elsewhere in the genome.
- Off-target site refers to a region of the genome which has partial homology or partial identity to the on-target site or target region of the gRNA, but which the gRNA is not intended or designed to target.
- operably linked means that expression of a gene is under the control of a promoter with which it is spatially connected.
- a promoter may be positioned 5' (upstream) or 3' (downstream) of a gene under its control.
- the distance between the promoter and a gene may be approximately the same as the distance between that promoter and the gene it controls in the gene from which the promoter is derived. As is known in the art, variation in this distance may be accommodated without loss of promoter function.
- p300 protein ⁇ 300
- El A binding protein p300 refers to the adenovirus El A-associated cellular p300 transcriptional co-activator protein encoded by the EP300 gene.
- p300 is a highly conserved acetyltransferase involved in a wide range of cellular processes.
- p300 functions as a histone acetyltransferase that regulates transcription via chromatin remodeling and is involved with the processes of cell proliferation and cell differentiation.
- Partially-functional as used herein describes a protein that is encoded by a mutant gene and has less biological activity than a functional protein but more than a non-functional protein.
- Premature stop codon or "out-of-frame stop codon” as used interchangeably herein refers to nonsense mutation in a sequence of DNA, which results in a stop codon at location not normally found in the wild-type gene.
- a premature stop codon may cause a protein to be truncated or shorter compared to the full-length version of the protein.
- Primary cell refers to cells taken directly from living tissue (e.g. biopsy material). Primary cells can be established for growth in vitro. These cells have undergone very few population doublings and are therefore more representative of the main functional component of the tissue from which they are derived in comparison to continuous (tumor or artificially immortalized) cell lines thus representing a more representative model to the in vivo state. Primary cells may be taken from different species, such as mouse or humans.
- Protospacer sequence or “protospacer segment” as used interchangeably herein refers to a DNA sequence targeted by the Cas9 nuclease or Cpf 1 nuclease in the CRISPR bacterial adaptive immune system.
- the protospacer sequence is typically followed by a protospacer-adjacent motif (PAM); the PAM is at the 5' - end.
- PAM protospacer-adjacent motif
- the protospacer sequence is followed by the protospacer sequence; the PAM is at the 3' - end.
- Protospacer-targeting sequence or “protospacer-targeting segment” as used interchangeably herein refers to a nucleotide sequence of a gRNA that corresponds to the protospacer sequence and facilitates targeting of the CRISPR/Cas9-based system or
- Promoter means a synthetic or naturally-derived molecule which is capable of conferring, activating or enhancing expression of a nucleic acid in a cell.
- a promoter may comprise one or more specific transcriptional regulatory sequences to further enhance expression and/or to alter the spatial expression and/or temporal expression of same.
- a promoter may also comprise distal enhancer or repressor elements, which may be located as much as several thousand base pairs, or anywhere in the genome, from the start site of transcription.
- a promoter may be derived from sources including viral, bacterial, fungal, plants, insects, and animals.
- a promoter may regulate the expression of a gene component constitutively, or differentially with respect to cell, the tissue or organ in which expression occurs or, with respect to the developmental stage at which expression occurs, or in response to external stimuli such as physiological stresses, hormones, toxins, drugs, pathogens, metal ions, or inducing agents.
- promoters include the bacteriophage T7 promoter, bacteriophage T3 promoter, SP6 promoter, lac operator-promoter, tac promoter, SV40 late promoter, SV40 early promoter, RSV-LTR promoter, CMV IE promoter, SV40 early promoter or SV40 late promoter and the CMV IE promoter.
- PAM Protospacer adjacent motif or "PAM” as used herein refers to a DNA sequence immediately following the DNA sequence targeted by the Cas9 or immediately before the DNA sequence targeted by the Cpfl nuclease in the CRISPR bacterial adaptive immune system.
- PAM is a component of the invading virus or plasmid, but is not a component of the bacterial CRISPR locus. Cas9 and Cpfl will not successfully bind to or cleave the target DNA sequence if it is not followed by or preceded by the PAM sequence, respectively.
- PAM is an essential targeting component (not found in bacterial genome) which distinguishes bacterial self from non-self DNA, thereby preventing the CRISPR locus from being targeted and destroyed by nuclease.
- recombinant when used with reference, e.g., to a cell, or nucleic acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic acid or protein, or that the cell is derived from a cell so modified.
- recombinant cells express genes that are not found within the native (naturally occurring) form of the cell or express a second copy of a native gene that is otherwise normally or abnormally expressed, under expressed or not expressed at all.
- silencers or "repressors” as used interchangeably herein refer to a DNA sequence capable of binding transcription regulation factors and preventing genes from being expressed as proteins.
- a silencer is a sequence-specific element that induces a negative effect on the transcription of its particular gene. There are many positions in which a silencer element can be located in DNA. The most common position is found upstream of the target gene where it can help repress the transcription of the gene. This distance can vary greatly between approximately -20 bp to -2000 bp upstream of a gene. Certain silencers can be found downstream of a promoter located within the intron or exon of the gene itself. Silencers have also been found within the 3 prime untranslated region (3' UTR) of mRNA.
- 3' UTR 3 prime untranslated region
- silencers in DNA There are two main types of silencers in DNA, which are the classical silencer element and the non-classical negative regulatory element (NRE).
- classical silencers the gene is actively repressed by the silencer element, mostly by interfering with general transcription factor (GTF) assembly.
- GTF general transcription factor
- NREs passively repress the gene, usually by inhibiting other elements that are upstream of the gene.
- Skeletal muscle refers to a type of striated muscle, which is under the control of the somatic nervous system and attached to bones by bundles of collagen fibers known as tendons. Skeletal muscle is made up of individual components known as myocytes, or “muscle cells”, sometimes colloquially called “muscle fibers.” Myocytes are formed from the fusion of developmental myoblasts (a type of embryonic progenitor cell that gives rise to a muscle cell) in a process known as myogenesis. These long, cylindrical, multinucleated cells are also called myofibers.
- Sketal muscle condition refers to a condition related to the skeletal muscle, such as muscular dystrophies, aging, muscle degeneration, wound healing, and muscle weakness or atrophy.
- a mammal e.g., cow, pig, camel, llama, horse, goat, rabbit, sheep, hamsters, guinea pig, cat, dog, rat, and mouse
- a non-human primate for example, a monkey, such as a cynomolgous or rhesus monkey, chimpanzee, etc.
- the subject may be a human or a non-human.
- the subject or patient may be undergoing other
- Super enhancer refers to a region of the mammalian genome comprising multiple enhancers that is collectively bound by an array of transcription factor proteins to drive transcriptionof genes involved in cell identity.
- Super-enhancers are frequently identified near genes important for controlling and defining cell identity and can be used to quickly identify key nodes regulating cell identity. Enhancers have several quantifiable traits that have a range of values, and these traits are generally elevated at super-enhancers.
- Super- enhancers are bound by higher levels of transcription-regulating proteins and are associated with genes that are more highly expressed. Expression of genes associated with super-enhancers is particularly sensitive to perturbations, which may facilitate cell state transitions or explain sensitivity of super-enhancer— associated genes to small molecules that target transcription.
- Target enhancer refers to enhancer that is targeted by a gRNA and CRISPR/Cas9-based system.
- the target enhancer may be within the target region.
- Target gene refers to any nucleotide sequence encoding a known or putative gene product.
- the target gene may be a mutated gene involved in a genetic disease.
- target region refers to the region of the target gene to which the CRISPR/Cas9- based system or CRISPR/Cpfl -based system targets.
- Transcribed region refers to the region of DNA that is transcribed into single-stranded RNA molecule, known as messenger RNA, resulting in the transfer of genetic information from the DNA molecule to the messenger RNA.
- messenger RNA RNA
- RNA polymerase reads the template strand in the 3' to 5' direction and synthesizes the RNA from 5' to 3' .
- the mRNA sequence is complementary to the DNA strand.
- Target regulatory element refers to a regulatory element that is targeted by a gRNA and CRISPR/Cas9-based system.
- the target regulatory element may be within the target region.
- Transcribed region refers to the region of DNA that is transcribed into single-stranded RNA molecule, known as messenger RNA, resulting in the transfer of genetic information from the DNA molecule to the messenger RNA.
- messenger RNA RNA
- RNA polymerase reads the template strand in the 3' to 5' direction and synthesizes the RNA from 5' to 3' .
- the mRNA sequence is complementary to the DNA strand.
- TSS Transcriptional Start Site
- Transgene refers to a gene or genetic material containing a gene sequence that has been isolated from one organism and is introduced into a different organism. This non-native segment of DNA may retain the ability to produce RNA or protein in the transgenic organism, or it may alter the normal function of the transgenic organism's genetic code. The introduction of a transgene has the potential to change the phenotype of an organism.
- true gRNA refers to a full-length guide RNA with nucleotides truncated from their 5'- end, typically 2 nucleotides.
- Trans-regulatory elements refers to regions of non-coding DNA which regulate the transcription of genes distant from the gene from which they were
- Trans-regulatory elements may be on the same or different chromosome from the target gene. Examples of trans-regulatory elements include enhancers, super-enhancers, silencers, insulators, and locus control regions.
- Variant used herein with respect to a nucleic acid means (i) a portion or fragment of a referenced nucleotide sequence (including nucleotide sequences that have insertions or deletions as compared to the referenced nucleotide sequences); (ii) the complement of a referenced nucleotide sequence or portion thereof; (iii) a nucleic acid that is substantially identical to a referenced nucleic acid or the complement thereof; or (iv) a nucleic acid that hybridizes under stringent conditions to the referenced nucleic acid, complement thereof, or a sequences substantially identical thereto.
- Variant with respect to a peptide or polypeptide that differs in amino acid sequence by the insertion, deletion, or conservative substitution of amino acids, but retain at least one biological activity.
- Variant may also mean a protein with an amino acid sequence that is substantially identical to a referenced protein with an amino acid sequence that retains at least one biological activity.
- a conservative substitution of an amino acid i.e., replacing an amino acid with a different amino acid of similar properties ⁇ e.g., hydrophilicity, degree and
- hydropathic index of amino acids is based on a consideration of its hydrophobicity and charge. It is known in the art that amino acids of similar hydropathic indexes may be substituted and still retain protein function. In one aspect, amino acids having hydropathic indexes of ⁇ 2 are substituted. The hydrophilicity of amino acids may also be used to reveal substitutions that would result in proteins retaining biological function.
- hydrophilicity of amino acids in the context of a peptide permits calculation of the greatest local average hydrophilicity of that peptide.
- Substitutions may be performed with amino acids having hydrophilicity values within ⁇ 2 of each other. Both the hydrophobicity index and the hydrophilicity value of amino acids are influenced by the particular side chain of that amino acid. Consistent with that observation, amino acid substitutions that are compatible with biological function are understood to depend on the relative similarity of the amino acids, and particularly the side chains of those amino acids, as revealed by the hydrophobicity, hydrophilicity, charge, size, and other properties.
- Vector as used herein means a nucleic acid sequence containing an origin of replication.
- a vector may be a viral vector, bacteriophage, bacterial artificial chromosome or yeast artificial chromosome.
- a vector may be a DNA or RNA vector.
- a vector may be a self- replicating extrachromosomal vector, and preferably, is a DNA plasmid.
- the vector may encode Cas9 and at least one optimized gRNA nucleotide sequence of any one of SEQ ID NOs: 149-315, 321-323, and 326-329.
- the CRISPR system is a microbial nuclease system involved in defense against invading phages and plasmids that provides a form of acquired immunity.
- the CRISPR loci in microbial hosts can contain a combination of CRISPR-associated (Cas) genes as well as non- coding RNA elements capable of programming the specificity of the CRISPR-mediated nucleic acid cleavage. Short segments of foreign DNA, called spacers, are incorporated into the genome between CRISPR repeats, and serve as a 'memory' of past exposures.
- Cas9 forms a complex with the 3' end of the single guide RNA ("sgRNA”), and the protein-RNA pair recognizes its genomic target by complementary base pairing between the 5' end of the sgRNA sequence and a predefined 20 bp DNA sequence, known as the protospacer.
- This complex is directed to homologous loci of pathogen DNA via regions encoded within the CRISPR RNA ("crRNA”), i.e., the protospacers, and protospacer-adjacent motifs (PAMs) within the pathogen genome.
- the non-coding CRISPR array is transcribed and cleaved within direct repeats into short crRNAs containing individual spacer sequences, which direct Cas nucleases to the target site
- CRISPR spacer By simply exchanging the 20 bp recognition sequence of the expressed chimeric sgRNA, the Cas9 nuclease can be directed to new genomic targets. CRISPR spacers are used to recognize and silence exogenous genetic elements in a manner analogous to RNAi in eukaryotic organisms.
- Type II effector system carries out targeted DNA double-strand break in four sequential steps, using a single effector enzyme, Cas9, to cleave dsDNA.
- Cas9 effector enzyme
- the Type II effector system may function in alternative contexts such as eukaryotic cells.
- the Type II effector system consists of a long pre-crRNA, which is transcribed from the spacer-containing CRISPR locus, the Cas9 protein, and a tracrRNA, which is involved in pre-crRNA processing.
- the tracrRNAs hybridize to the repeat regions separating the spacers of the pre-crRNA, thus initiating dsRNA cleavage by endogenous RNase III. This cleavage is followed by a second cleavage event within each spacer by Cas9, producing mature crRNAs that remain associated with the tracrRNA and Cas9, forming a Cas9:crRNA-tracrRNA complex.
- gRNA guide RNA
- the Cas9:crRNA-tracrRNA complex unwinds the DNA duplex and searches for sequences matching the crRNA to cleave.
- Target recognition occurs upon detection of complementarity between a "protospacer" sequence in the target DNA and the remaining spacer sequence in the crRNA.
- Cas9 mediates cleavage of target DNA if a correct protospacer-adjacent motif (PAM) is also present at the 3' end of the protospacer.
- PAM protospacer-adjacent motif
- the sequence must be immediately followed by the protospacer-adjacent motif (PAM), a short sequence recognized by the Cas9 nuclease that is required for DNA cleavage.
- PAM protospacer-adjacent motif
- Different Type II systems have differing PAM requirements. The S.
- pyogenes CRISPR system may have the PAM sequence for this Cas9 (SpCas9) as 5'-NRG-3', where R is either A or G, and characterized the specificity of this system in human cells.
- SpCas9 the PAM sequence for this Cas9
- R is either A or G
- a unique capability of the CRISPR/Cas9-based system is the straightforward ability to simultaneously target multiple distinct genomic loci by co- expressing a single Cas9 protein with two or more sgRNAs.
- the Streptococcus pyogenes Type II system naturally prefers to use an "NGG" sequence, where "N” can be any nucleotide, but also accepts other PAM sequences, such as "NAG” in engineered systems (Hsu et al. (2013) Nature Biotechnology, 31, 827-832).
- the Cas9 derived from Neisseria meningitidis normally has a native PAM of NNNNGATT, but has activity across a variety of PAMs, including a highly degenerate NNNNGNNN PAM (Esvelt et al. Nature Methods (2013) doi: 10.1038/nmeth.2681).
- CRISPR/Cas9 systems that include an optimized gRNA, such as a hairpin gRNA (also referred herein as “hpgRNA” or “hp-gRNA”), that allow improved DNA targeting for use in epigenomic editing and transcriptional regulation, such as specifically cleaving a target region of interest, such as a target gene, or activating or repressing gene expression of a target gene.
- the optimized gRNAs provide increased target binding specificity, while having decreased off-target binding and off-target activity of the CRISPR/Cas9-based and CRISPR/Cpfl -based systems by modulating lifetimes at off-target locations so as to minimize any activity at those off-target sites.
- the optimized gRNA can modulate the Cas9-fusion protein activities by modulating the Cas9 lifetime at these locations and modulating the overall invasion kinetics without regard to second domain activity.
- gRNA binding to the protospacer at the 5'- end of the protospacer targeting segment may also be involved with Cas9 cleavage. The decreased binding to off-target sites would limit the potential for full invasion/cleavage at these off-target sites.
- An engineered form of the Type II effector system of Streptococcus pyogenes was shown to function in human cells for genome engineering.
- the Cas9 protein was directed to genomic target sites by a synthetically reconstituted "guide RNA” (“gRNA”, also used interchangeably herein as a chimeric single guide RNA (“sgRNA”)), which for Cas9 is a crRNA-tracrRNA fusion that obviates the need for RNase III and crRNA processing in general.
- gRNA guide RNA
- sgRNA chimeric single guide RNA
- CRISPR/Cas9-based systems for use in genome editing and treating genetic diseases.
- the CRISPR/Cas9-based systems may be designed to target any gene, including genes involved in a genetic disease, aging, tissue regeneration, or wound healing.
- the CRISPR/Cas9-based systems may include a Cas9 protein or Cas9 fusion protein and at least one optimized gRNA, as described below.
- the Cas9 fusion protein may, for example, include a domain that has a different activity that what is endogenous to Cas9, such as a transactivation domain.
- the target gene may have a mutation such as a frameshift mutation or a nonsense mutation. If the target gene has a mutation that causes a premature stop codon, an aberrant splice acceptor site or an aberrant splice donor site, the CRISPR/Cas9-based system may be designed to recognize and bind a nucleotide sequence upstream or downstream from the premature stop codon, the aberrant splice acceptor site or the aberrant splice donor site. The CRISPR-Cas9- based system may also be used to disrupt normal gene splicing by targeting splice acceptors and donors to induce skipping of premature stop codons or restore a disrupted reading frame. The CRISPR/Cas9-based system may or may not mediate off-target changes to protein-coding regions of the genome,
- the CRISPR/Cas9-based system may include a Cas9 protein or a Cas9 fusion protein.
- Cas9 protein is an endonuclease that cleaves nucleic acid and is encoded by the CRISPR loci and is involved in the Type II CRISPR system.
- the Cas9 protein may be from any bacterial or archaea species, such as Streptococcus pyogenes.
- the Cas9 protein may be mutated so that the nuclease activity is inactivated.
- iCas9 Streptococcus pyogenes
- dCas9 An inactivated Cas9 protein from Streptococcus pyogenes
- iCas9 and dCas9 both refer to a Cas9 protein that has the amino acid substitutions D10A and H840A and has its nuclease activity inactivated.
- an inactivated Cas9 protein from Neisseria meningitides such as NmCas9
- the CRISPR/Cas9-based system may include a iCas9 of SEQ ID NO: 1.
- the CRISPR/Cas9-based system may include a fusion protein of a Cas9 protein that does not have nuclease activity, such as dCas9, and a second domain.
- the second domain may include a transcription activation domain, such as a VP64 domain or p300 domain, transcription repression domain, such as KRAB domain, nuclease domain, transcription release factor domain, histone modification domain, nucleic acid association domain, acetylase domain, deacetylase domain, methylase domain, such as a DNA methylase domain, demethylase domain,
- the second domain may be a modifier of DNA methylation or chromatin looping.
- the fusion protein can include a dCas9 domain and a transcriptional activator.
- the fusion protein can include the amino acid sequence of SEQ ID NO: 2.
- the fusion protein can include a dCas9 domain and a transcriptional repressor.
- the fusion protein comprises the amino acid sequence of SEQ ID NO:3.
- the fusion protein can include a dCas9 domain and a site- specific nuclease that is different from Cas9 nuclease activity.
- the fusion protein may comprise two heterologous polypeptide domains, wherein the first polypeptide domain comprises a Cas protein and the second polypeptide domain has does not have nuclease activity.
- the fusion protein may include a Cas9 protein or a mutated Cas9 protein, as described above, fused to a second polypeptide domain that has nuclease activity.
- the second polypeptide domain may have nuclease activity that is different from the nuclease activity of the Cas9 protein.
- a nuclease, or a protein having nuclease activity is an enzyme capable of cleaving the phosphodiester bonds between the nucleotide subunits of nucleic acids.
- Nucleases are usually further divided into endonucleases and exonucleases, although some of the enzymes may fall in both categories.
- Well known nucleases are deoxyribonuclease and ribonuclease.
- the CRISPR/Cas9-based system can be a CRISPR/Cas9-based gene activation system that can activate regulatory element function with exceptional specificity of epigenome editing.
- the CRISPR/Cas9-based gene activation system can be used to screen for enhancers, insulators, silencers, and locus control regions that can be targeted to increase or decrease target gene expression. This technology can be used to assign function to putative regulatory elements identified through genomic studies such as the ENCODE and the Roadmap Epigenomics projects.
- the CRISPR/Cas9-based gene activation system may activate gene expression by modifying DNA methylation, chromatin looping or catalyzing acetylation of histone H3 lysine 27 at its target sites, leading to robust transcriptional activation of target genes from promoters and proximal and distal enhancers.
- the CRISPR/Cas9-based gene activation system is highly specific and may be guided to the target gene using as few as one guide RNA.
- CRISPR/Cas9-based gene activation system may activate the expression of one gene or a family of genes by targeting enhancers at distant locations in the genome.
- HAT Histone acetyltransferase
- the CRISPR/Cas9-based gene activation system may include a histone
- acetyltransferase protein such as a p300 protein, CREB binding protein (CBP; an analog of p300), GCN5, or PCAF, or fragment thereof.
- CBP CREB binding protein
- GCN5 GCN5
- PCAF PCAF
- Acetylating histones in regulatory elements using a programmable CRISPR/Cas9-based fusion protein is an effective strategy to increase the expression of target genes.
- a CRISPR/Cas9-based histone acetyltransferase that can be targeted to any site in the genome is uniquely capable of activating distal regulatory elements.
- the histone acetyltransferase protein may include a human p300 protein or a fragment thereof.
- the histone acetyltransferase protein may include a wild-type human p300 protein or a mutant human p300 protein, or fragments thereof.
- the histone acetyltransferase protein may include the core lysine-acetyltransferase domain of the human p300 protein, i.e., the p300 HAT Core (also known as "p300 Core").
- the p300 protein regulates the activity of many genes in tissues throughout the body.
- the p300 protein plays a role in regulating cell growth and division, prompting cells to mature and assume specialized functions (differentiate) and preventing the growth of cancerous tumors.
- the p300 protein may activate transcription by connecting transcription factors with a complex of proteins that carry out transcription in the cell's nucleus.
- the p300 protein also functions as a histone acetyltransferase that regulates transcription via chromatin remodeling.
- the dCas9 p300 Core fusion protein is a potent and easily programmable tool to synthetically manipulate acetylation at targeted endogenous loci, leading to regulation of proximal and distal enhancer-regulated genes.
- the p300 Core acetylates lysine 27 on histone H3 (H3K27ac) and may provide H3K27ac enrichment.
- the fusion of the catalytic core domain of p300 to dCas9 may result in substantially higher transactivation of downstream genes than the direct fusion of full-length p300 protein despite robust protein expression.
- the dCas9 p300 Core fusion protein may also exhibit an increased transactivation capacity relative to dCas9 W64 , including in the context of the Nw-dCas9 scaffold, especially at distal enhancer regions, at which dCas9 W64 displayed little, if any, measurable downstream transcriptional activity. Additionally, the dCas9 p300 Core displays precise and robust genome-wide transcriptional specificity. dCas9 p300 Core may be capable of potent transcriptional activation and co-enrichment of acetylation at promoters targeted by the epigenetically modified enhancer.
- the dCas9 p300 Core may activate gene expression through a single gRNA that target and bind a promoter and/or a characterized enhancer. This technology also affords the ability to synthetically transactivate distal genes from putative and known regulatory regions and simplifies transactivation via the application of a single programmable effector and single target site. These capabilities allow multiplexing to target several promoters and/or enhancers simultaneously.
- the mammalian origin of p300 may provide advantages over virally-derived effector domains for in vivo applications by minimizing potential immunogenicity.
- Gene activation by dCas9 p300"Core is highly specific for the target gene.
- the p300 Core includes amino acids 1048-1664 of SEQ ID NO: 2 (i.e., SEQ ID NO: 4).
- the CRISPR/Cas9-based gene activation system includes a dCas9 p300 Core fusion protein of SEQ ID NO: 2 or an Nm-dCas9 p30 ° Core fusion protein of SEQ ID NO: 5.
- the CRISPR/Cas9-based system can be a CRISPR/Cas9-based gene repression system which can inhibit regulatory element function with exceptional specificity of epigenome editing.
- the CRISPR/Cas9-based gene repression system such as one that include dCas9 KRAB , can interfere with distal enhancer activity by highly specific remodeling of the epigenetic state of targeted genetic loci.
- the dCas9 KRAB repressor is a highly specific epigenome editing tool that can be used in loss-of-function screens to study gene function and discover targets for drug development.
- the dCas9 KRAB has exceptional specificity to target a particular enhancer, silence only the target genes of that enhancer, and create a repressive heterochromatin environment at that site.
- dCas9- KRAB can be used to screen for novel regulatory elements within the endogenous genomic context by silencing proximal or distal regulatory elements and corresponding gene targets.
- the specificity of dCas9-KRAB repressors allows it to be used for transcriptome-wide specificity for silencing endogenous genes. Epigenetic mechanisms for disruption at targeted locus such as histone methylation.
- KRAB domain a common heterochromatin-forming motif in naturally occurring zinc finger transcription factors, has been genetically linked to dCas9 to create an RNA-guided synthetic repressor, dCas9 KRAB .
- the Kruppel-associated box (“KRAB”) recruits
- KRAB-based synthetic repressors can effectively silence the expression of single genes and have been employed to repress oncogenes, inhibit viral replication, and treat dominant negative diseases.
- the disclosed optimized gRNA may be used with a Clustered Regularly Interspaced Short Palindromic Repeats from Prevotella and Francisella 1 or (“CRISPR/Cpfl") system.
- CRISPR/Cpfl Clustered Regularly Interspaced Short Palindromic Repeats from Prevotella and Francisella 1 or
- CRISPR/Cpfl system a DNA-editing technology analogous to the CRISPR/Cas9 system, is found in Prevotella and Francisella bacteria and prevents genetic damage from viruses.
- Cpfl is an RNA-guided endonuclease of a class II CRISPR/Cas system containing a 1,300 amino acid protein.
- Cpfl genes are associated with the CRISPR locus, coding for an endonuclease that use a guide RNA to find and cleave viral DNA.
- Cpfl is a smaller and simpler endonuclease than Cas9 and has a smaller sgRNA molecule (proximately half as many nucleotides as Cas9) as functional Cpfl does not need the tracrRNA and only crRNA is required.
- Examples of Cpfl that can be used with the optimized gRNA include Cpfl from Acidaminococcus and Lachnospiraceae bacterial.
- the Cpfl loci encode Casl, Cas2 and Cas4 proteins more similar to types I and III than from type II systems.
- the Cpfl locus contains a mixed alpha/beta domain, a RuvC-I followed by a helical region, a RuvC-II and a zinc finger-like domain.
- the Cpfl protein has a RuvC-like endonuclease domain that is similar to the RuvC domain of Cas9.
- Cpfl does not have a HNH endonuclease domain, and the N-terminal of Cpfl does not have the alfa-helical recognition lobe of Cas9.
- Cpfl CRISPR-Cas domain architecture shows that Cpfl is
- the CRISPR/Cpfl system consists of a Cpfl enzyme and a guide RNA that finds and positions the complex at the correct spot on the double helix to cleave target DNA.
- CRISPR/Cpfl systems activity has three stages: adaptation, formation of crRNAs and interference.
- adaptation stage Casl and Cas2 proteins facilitate the adaptation of small fragments of DNA into the CRISPR array.
- the formation of crRNAs stage involves processing of pre-cr-RNAs producing of mature crRNAs to guide the Cas protein.
- the Cpfl is bound to a crRNA to form a binary complex to identify and cleave a target DNA sequence.
- the Cpfl -crRNA complex cleaves target DNA or RNA by identification of a protospacer adjacent motif 5'-YTN-3' (where "Y” is a pyrimidine and “N” is any nucleobase) or 5'-TTN-3', in contrast to the G-rich PAM targeted by Cas9.
- the PAM targeted by Cpfl is on the 5' side of the guide RNA, in contrast to the PAM targeted by Cas9, which is on the 3 ' side of the guide RNA.
- Cpfl introduces a sticky-end-like DNA double- stranded break of 4 or 5 nucleotides overhang in contrast to the blunt end cuts of Cas9 thereby enhancing the efficiency of genetic insertions and specificity during NHEJ or HDR.
- TTN PAM sites are more useful for human genomic engineering than GGN PAM sites because the human genome is more T-rich than G-rich.
- Protospacer-targeting segment of the gRNA for Cpfl is at its extreme 3'- end, while Cas9 gRNAs are at its extreme 5' end.
- the CRISPR/Cas9-based system or CRISPR/Cpfl -based system may include at least one gRNA, such as an optimized gRNA as described herein, which targets a nucleic acid sequence.
- the gRNA provides the specific targeting of the CRISPR/Cas9-based system or CRISPR/Cpfl -based system to a target region or gene.
- the gRNA is a fusion of two noncoding RNAs: a crRNA and a tracrRNA.
- the gRNA or sgRNA may target any desired DNA sequence by exchanging the sequence encoding a 20 bp protospacer which confers targeting specificity through complementary base pairing with the desired DNA target.
- gRNA mimics the naturally occurring crRNA: tracrRNA duplex involved in the Type II Effector system.
- This duplex which may include, for example, a 42-nucleotide crRNA and a 75-nucleotide tracrRNA, acts as a guide for the Cas9 to cleave the target nucleic acid.
- the gRNA may target and bind a target region of a target gene.
- the gRNA is a crRNA.
- the CRISPR/Cas9-based system or CRISPR/Cpfl -based system may include at least one gRNA, such as an optimized gRNA described herein, wherein the gRNAs target different DNA sequences.
- the target DNA sequences may be overlapping.
- the target sequence or protospacer is followed by a PAM sequence at the 3' end of the protospacer.
- Different Type II systems have differing PAM requirements.
- the Streptococcus pyogenes Type II system uses an "NGG" sequence, where "N" can be any nucleotide.
- the present disclosure is directed towards methods of generating optimized gRNAs, such as hairpin gRNAs (also referred to herein as "hpgRNA” and “hp-gRNA”).
- the optimized gRNA includes a nucleotide sequence of a full-length gRNA and nucleotides added to the 5' end or the 3' end of the full-length gRNA.
- the full-length gRNA can be designed using a program such as SgRNA designer, CRISPR MultiTargeter, or SSFinder.
- the nucleotides added to the 5' end for the CRISPR/Cas9 system or the 3' end for the CRISPR/Cpfl system of the full-length gRNA can form secondary structures by hybridizing or partially hybridizing to the nucleotides in the protospacer-targeting sequence of the full-length gRNA.
- the secondary structure modulates DNA binding or cleavage by disrupting invasion of the DNA duplex by the gRNA.
- the secondary structure influences the invasion kinetics of the gRNA rather than the binding energy of the gRNA with the complementary DNA strand.
- guide RNAs of type II CRISPR-Cas systems bind to protospacers through a Cas9-facilitated process known as 'strand invasion,' where the Cas9 protein itself first binds to and melts the protospacer adjacent motif (PAM) through direct interactions, followed by base-pairing of the 3' - end of the gRNA with the PAM-adjacent nucleotides (the 'seed' region) then proceeding nucleotide-by-nucleotide from the 3'- of the gRNA to the 5' - end base-pairing with the protospacer.
- PAM protospacer adjacent motif
- a similar mechanism is used with the CRISPR/Cpfl system.
- nucleotides added to the 5' end or 3' end of the full-length gRNA are not merely added to hybridize with the protospacer-targeting segment of the guide RNA (hairpins) to block access to the protospacer at thermodynamic equilibrium.
- the equilibrium thermodynamic secondary structure properties (such as melting temperature of the gRNA secondary structure) are not at all correlated with the specificity of the guide RNA.
- secondary structure elements which are designed to be stable at equilibrium may become rapidly destabilized during strand invasion (for example, as the rG-rU wobble pairs become the terminal base-pair of the stem as adjacent nucleotides invade the protospacer, incurring a significant energetic penalty on the RNA secondary structure, modulating the strand invasion and binding kinetics by an entirely separate mechanism than by merely blocking access to the protospacer at thermodynamic equilibrium.
- nucleotides may be added to the 5' end or 3' end of a full-length gRNA to disrupt a 'naturally-occurring' secondary structure on the protospacer targeting segment of the gRNA in the 'seed' region to enhance the initiation of strand invasion by the guide RNA.
- these nucleotides which form secondary structures that alter strand invasion by hybridizing partially hybridizing nucleotides in the protospacer-targeting sequence to modulate DNA binding or cleavage represent a different class of guide RNA modification.
- the optimized gRNAs are designed to minimize binding at an off-target site and to allow binding to a protospacer sequence.
- the off-target site is a known or predicted off-target site.
- the methods involve identifying a target region of interest, the target region of interest comprising a protospacer sequence; determining a polynucleotide sequence of a full-length gRNA that targets the target region of interest, the full- length gRNA comprising a protospacer-targeting sequence or segment; determining at least one or more off-target sites for the full-length gRNA; generating a polynucleotide sequence of a first gRNA, the first gRNA comprising the polynucleotide sequence of the full-length gRNA and a RNA segment, the RNA segment comprising a polynucleotide sequence having a length of M nucleotides that is complementary to a nucleotide segment of the protospacer-targeting sequence or
- the methods involve identifying a target region of interest, the target region of interest comprising a protospacer sequence; determining a polynucleotide sequence of a full-length gRNA that targets the target region of interest, the full-length gRNA comprising a protospacer-targeting sequence or segment; determining at least one or more off- target sites for the full-length gRNA; generating a polynucleotide sequence of a first gRNA, the first gRNA comprising the polynucleotide sequence of the full-length gRNA and a RNA segment, the RNA segment comprising a polynucleotide sequence having a length of M nucleotides that is complementary to a nucleotide segment of the protospacer-targeting sequence or segment, the RNA segment is at the 3' end of the polynucleotide sequence of the full-length gRNA, the first gRNA optionally comprising a linker between the 3' end of
- the energetics of further invasion of a different gRNA is determined by determining the energetics of at least one of (I) breaking a DNA-DNA base- pairing, (II) forming an RNA-DNA base-pair, (III) energetic difference resulting from disrupting or forming different secondary structure within the uninvaded guide RNA, and (IV) forming or disrupting interactions between the displaced DNA strand that is complementary to the protospacer and any unpaired guide RNA nucleotides which are not involved in secondary structures.
- the energetics of re-annealing of the first gRNA to the DNA sequence that is complementary to the protospacer sequence is determined by determining the energetics of at least one of (I) forming a DNA-DNA base-pairing, (II) breaking an RNA-DNA base-pair, (III) energetic difference resulting from disrupting or forming different secondary structure within the newly uninvaded guide RNA, and (IV) forming or disrupting interactions between the displaced DNA strand that is complementary to the protospacer and any unpaired guide RNA nucleotides which are not involved in secondary structures.
- the method further comprises determining the energetic considerations from at least one of (V) base-pairing across mismatches, (VI) interactions with the Cas9 protein, and/or (VII) additional heuristics, wherein the additional heuristics relate to binding lifetime, extent of invasion, stability of invading guide RNA, or other calculated / simulated properties of gRNA invasion to Cas9 cleavage activity.
- the CRISPR/Cas9-based system or CRISPR/Cpfl -based system can use gRNA, such as an optimized gRNA described herein, of varying sequences and lengths.
- gRNA such as an optimized gRNA described herein, of varying sequences and lengths.
- a full-length gRNA may comprise a protospacer-targeting segment which corresponds to the polynucleotide sequence of the target DNA sequence (i.e., protospacer).
- the protospacer-targeting segment may have at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 16 nucleotides, at least 17 nucleotides, at least 18 nucleotides, at least 19 nucleotides, at least 20 nucleotides, at least 21 nucleotides, at least 22 nucleotides, at least 23 nucleotides, at least 24 nucleotides, at least 25 nucleotides, at least 30 nucleotides, or at least 35 nucleotides.
- the gRNA may target at least one of a promoter region, an enhancer region, a repressor region, an insulator region, a silencer region, a region involved in DNA looping with the promoter region, a gene splicing region, or the transcribed region of the target gene.
- the full-length gRNA comprises a protospacer-targeting segment having between about 15 and 20 nucleotides.
- the RNA segment comprises between 2 and 20 nucleotides, between 3 and 10 nucleotides, or between 5 and 8 nucleotides. In some embodiments, the RNA segment comprises between 2 and 20 nucleotides, between 3 and 10 nucleotides, or between 5 and 8 nucleotides that complement the protospacer-targeting sequence.
- M is between 1 and 20, between 1 and 19, between 1 and 18, between 1 and 17, between 1 and 16, between 1 and 15, between 1 and 14, between 1 and 13, between 1 and 12, between 1 and 11, between 1 and 10, between 1 and 9, between 1 and 8, between 1 and 7, between 1 and 6, between 1 and 5, between 2 and 20, between 2 and 19, between 2 and 18, between 2 and 17, between 2 and 16, between 2 and 15, between 2 and 14, between 2 and 13, between 2 and 12, between 2 and 11, between 2 and 10, between 2 and 9, between 2 and 8, between 2 and 7, between 2 and 6, between 2 and 5, between 3 and 20, between 3 and 19, between 3 and 18, between 3 and 17, between 3 and 16, between 3 and 15, between 3 and 14, between 3 and 13, between 3 and 12, between 3 and 11, between 3 and 10, between 3 and 9, between 3 and 8, between 3 and 7, between 3 and 6, between 3 and 5, between 4 and 20, between 4 and 19, between 4 and 18, between 4 and 17, between 4 and 16, between 4 and 15, between 4 and 14, between 4 and 13, between 4 and 12, between 4 and 11, between 4 and 10, between 4 and 9, between 4 and 8, between 4 and 7,
- M can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20.
- the RNA segment can have between 1 and 20, between 1 and 19, between 1 and 18, between 1 and 17, between 1 and 16, between 1 and 15, between 1 and 14, between 1 and 13, between 1 and 12, between 1 and 11, between 1 and 10, between 1 and 9, between 1 and 8, between 1 and 7, between 1 and 6, between 1 and 5, between 2 and 20, between 2 and 19, between 2 and 18, between 2 and 17, between 2 and 16, between 2 and 15, between 2 and 14, between 2 and 13, between 2 and 12, between 2 and 11, between 2 and 10, between 2 and 9, between 2 and 8, between 2 and 7, between 2 and 6, between 2 and 5, between 3 and 20, between 3 and 19, between 3 and 18, between 3 and 17, between 3 and 16, between 3 and 15, between 3 and 14, between 3 and 13, between 3 and 12, between 3 and 11, between 3 and 10, between 3 and 9, between 3 and 8, between 3 and 7, between 3 and 6, between 3 and 5, between 4 and 20, between 4 and 19, between 4 and 18, between 4 and 17, between 4 and 16, between 4 and 15, between 4 and 20, between
- the RNA segment can have 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotides.
- N is between 1 and 20, between 1 and 19, between 1 and 18, between 1 and 17, between 1 and 16, between 1 and 15, between 1 and 14, between 1 and 13, between 1 and 12, between 1 and 11, between 1 and 10, between 1 and 9, between 1 and 8, between 1 and 7, between 1 and 6, between 1 and 5, between 2 and 20, between 2 and 19, between 2 and 18, between 2 and 17, between 2 and 16, between 2 and 15, between 2 and 14, between 2 and 13, between 2 and 12, between 2 and 11, between 2 and 10, between 2 and 9, between 2 and 8, between 2 and 7, between 2 and 6, between 2 and 5, between 3 and 20, between 3 and 19, between 3 and 18, between 3 and 17, between 3 and 16, between 3 and 15, between 3 and 14, between 3 and 13, between 3 and 12, between 3 and 11, between 3 and 10, between 3 and 9, between 3 and 8, between 3 and 7, between 3 and 6, between 3 and 5, between 4 and 20, between 4 and 19, between 4 and 18, between 4
- N can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20.
- the linker comprises between 1 and 20 nucleotides, between 3 and 10 nucleotides, or between 5 and 8 nucleotides.
- the linker can have between 1 and 20, between 1 and 19, between 1 and 18, between 1 and 17, between 1 and 16, between 1 and 15, between 1 and 14, between 1 and 13, between 1 and 12, between 1 and 11, between 1 and 10, between 1 and 9, between 1 and 8, between 1 and 7, between 1 and 6, between 1 and 5, between 2 and 20, between 2 and 19, between 2 and 18, between 2 and 17, between 2 and 16, between 2 and 15, between 2 and 14, between 2 and 13, between 2 and 12, between 2 and 11, between 2 and 10, between 2 and 9, between 2 and 8, between 2 and 7, between 2 and 6, between 2 and 5, between 3 and 20, between 3 and 19, between 3 and 18, between 3 and 17, between 3 and 16, between 3 and 15, between 3 and 14, between 3 and 13, between 3 and 12, between 3 and 11, between 3 and 10, between 3 and 9, between 3 and 8, between 3 and 7, between 3 and 6, between 3 and 5, between 4 and 20, between 4 and 19, between 4 and 18, between 4 and 17, between 4 and 16, between 4 and 15, between 4 and 14, between 4 and 13, between 4 and 12, between 4 and 11, between 4 and 10, between 4 and 9, between 4 and 8, between 4 and
- the linker can have 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotides.
- the linker can include a stabilizing linker, such as a tetraloop. Examples of tetraloop, include but are not limited to ANY A, CUYG, GNRA, UMAC and UNCG.
- the RNA segment and/or protospacer-targeting sequence provide a secondary structure.
- the secondary structure is formed by partially hybridizing the protospacer-targeting sequence with the RNA segment.
- the secondary structure modulates DNA binding or cleavage by Cas9 by disrupting invasion of the protospacer duplex or off-target duplex by the optimized gRNA.
- the secondary structure keeps the 5' - end of the gRNA stably within the protein and protects the optimized gRNA within the Cas9 to prevent degradation
- the secondary structure is formed by hybridizing all or part of the RNA segment to nucleotides in the 5' end of the protospacer-targeting sequence or segment, nucleotides in the middle of the protospacer-targeting sequence or segment, and/or nucleotides in the 3'- end of the protospacer-targeting sequence or segment.
- contiguous segments of the RNA segment hybridize to the protospacer-targeting sequence or segment.
- non-contiguous segment of the RNA segment hybridize to the protospacer- targeting sequence or segment.
- the secondary structure is a hairpin.
- the secondary structure is stable at room temperature or 37°C. In some embodiments, overall equilibrium free energy of the secondary structure is less than about 2 kcal/mol at a temperature between about 4°C and about 50°C, such as room temperature or 37°C.
- the overall equilibrium free energy of the secondary structure can be less than about 10 kcal/mol, less than about 5 kcal/mol, less than about 4 kcal/mol , less than about 3 kcal/mol, less than about 2 kcal/mol, less than about 1 kcal/mol, or less than about 0.5 kcal/mol at a temperature between about 4°C and about 50°C, between about 4°C and about 40°C, between about 4°C and about 37°C, between about 4°C and about 30°C, between about 4°C and about 25°C , between about 4°C and about 20°C, between about 4°C and about 10°C , between about 5°C and about 50°C, between about 5°C and about 40°C, between about 5°C and about 37°C, between about 5°C and about 30°C, between about 5°C and about 25°C , between about 5°C and about 20°C, between about 5°C and about 10°C , between about 10°C and about and about
- the RNA segment hybridizes or forms non-canonical base pairs with at least two nucleotides of the protospacer-targeting sequence or segment.
- the non-canonical base pair is rU-rG.
- between 1 and 20 nucleotides are randomized in the linker. For example, between 1 and 20, between 1 and 15, between 1 and 10, between 1 and 9, between 1 and 8, between 1 and 7, between 1 and 6, between 1 and 5, between 1 and 4, between 1 and 3, between 1 and 2, between 2 and 20, between 2 and 15, between 2 and 10, between 2 and 9, between 2 and 8, between 2 and 7, between 2 and 6, between 2 and 5, between 2 and 4, between
- the between 1 and 20 nucleotides are randomized in the RNA segment.
- the between 1 and 20 nucleotides are randomized in the RNA segment.
- step (g) is repeated X number of times, thereby generating X number of gRNAs and repeating step (e) with each X number of gRNAs, wherein X is between 0 to 20.
- X can be is between 1 and 20, between 1 and 19, between 1 and 18, between 1 and 17, between 1 and 16, between 1 and 15, between 1 and 14, between 1 and 13, between 1 and 12, between 1 and 11, between 1 and 10, between 1 and 9, between 1 and 8, between 1 and 7, between 1 and 6, between 1 and 5, between 2 and 20, between 2 and 19, between 2 and 18, between 2 and 17, between 2 and 16, between 2 and 15, between 2 and 14, between 2 and 13, between 2 and 12, between 2 and 11, between 2 and 10, between 2 and 9, between 2 and 8, between 2 and 7, between 2 and 6, between 2 and 5, between 3 and 20, between 3 and 19, between 3 and 18, between 3 and 17, between 3 and 16, between 3 and 15, between 3 and 14, between 3 and 13, between 3 and 12, between 3 and 11, between 3 and 10, between 3 and 9, between 3 and 8, between 3 and
- the invasion kinetics and lifetime are calculated using kinetic Monte Carlo method or Gillespie algorithm.
- the invasion kinetics and lifetime can be determined using 'deterministic' methods such as differential equations which model strand invasion, which are known to one of skill in the art.
- the kinetic Monte Carlo (KMC) method is a Monte Carlo method computer simulation intended to simulate the time evolution of some processes occurring in nature. The processes are typically processes that occur with known transition rates among states. These known transition rates are inputs to the KMC algorithm.
- the Gillespie algorithm also known as the Doob-Gillespie algorithm
- the Gillespie algorithm generates a statistically correct trajectory (possible solution) of a stochastic equation.
- the Gillespie algorithm can be used to simulate increasingly complex systems.
- the algorithm is particularly useful for simulating reactions within cells where the number of reagents typically number in the tens of molecules (or less). Mathematically, it is a variety of a dynamic Monte Carlo method and similar to the kinetic Monte Carlo methods.
- the Gillespie algorithm allows a discrete and stochastic simulation of a system with few reactants because every reaction is explicitly simulated. A trajectory corresponding to a single Gillespie simulation represents an exact sample from the probability mass function that is the solution of the master equation.
- the design criteria can be specificity, modulation of binding lifetime, and/or estimated cleavage specificity.
- the optimized gRNA may be designed to have a binding lifetime greater than or equal to that of the full gRNA at an on-target site, and/or a binding lifetime less than or equal to that of the full-length gRNA at an off-target site.
- the optimized gRNA is selected to have a binding lifetime less than or equal to that of the full-length gRNA to at least three off-target sites, wherein the off-target sites are predicted to be the closest off-target sites or predicted to have the highest identity to the on-target sites.
- the design criteria comprises a lifetime or cleavage rate at an off-target site that is less than or equal to the lifetime or cleavage rate of a full-length gRNA or truncated gRNA at the off-target site and/or a predicted on-target activity rate that is greater than 10% of the predicted on-target activity rate of a full-length gRNA or truncated gRNA.
- the optimized gRNA is tested in step i) using a mismatch- sensitive nuclease to determine CRISPR activity, such as using surveyor assay or T7
- the optimized gRNA is tested in step i) using a reporter assay, wherein the Cas9-fusion protein activity alters the expression of a reporter protein, such as GFP.
- GUIDE-Seq is an assay that has been devised to assay off-target cleavages.
- the target region can be determined based on a sequence's proximity to a PAM sequence using a program, such as CRISPR design (Ran, et al.
- the target sites can include promoters, DNAse I hypersensitivity sites, Transposase- Accessible Chromatin sites, DNA methylation sites, transcription factor binding sites, epigenetic marks, expression quantitative trait loci, and/or regions associated with human traits or phenotypes in genetic association studies.
- the target sites can be determined by DNase- sequencing (DNase-seq), Assay for Transposase- Accessible Chromatin with high throughput sequencing (ATAC-seq), ChlP-sequencing, self-transcribing active regulatory region sequencing (STARR-Seq), single molecule real time sequencing (SMRT), Formaldehyde- Assisted Isolation of Regulatory Elements sequencing (FAIRE-seq), micrococcal nuclease sequencing (MNase- seq), reduced representation bisulfite sequencing (RRBS-seq), whole genome bisulfite sequencing, methyl-binding DNA immunoprecipitation (MEDIP-seq), or genetic association studies.
- the off-target site can be determined using CasOT (PKU
- the CRISPR/Cas9-based system or CRISPR/Cpfl -based system may be designed to target and cleave any target gene.
- the gRNA such as the optimized gRNA described herein, may target and bind a target region in a target gene.
- the target gene may be an endogenous gene, a transgene, or a viral gene in a cell line.
- the target gene may be a known gene.
- the target gene is an unknown gene.
- the gRNA may target any nucleic acid sequence.
- the nucleic acid sequence target may be DNA.
- the DNA may be any gene.
- the gRNA may target a gene, such as DMD, EMX1, or VEGFA.
- the target gene is a disease-relevant gene.
- the target cell is a mammalian cell.
- the genome includes a human genome.
- the target gene may be a prokaryotic gene or a eukaryotic gene, such as a mammalian gene.
- the CRISPR/Cas9-based system or CRISPR/Cpfl -based system may target a mammalian gene, such as DMD (dystrophin gene), EMXl, VEGFA, IL1RN, MYOD1, OCT4, HBE, HBG, HBD, HBB, MYOCD (Myocardin), PAX7 (Paired box protein Pax- 7), / ⁇ ( -7 7 (fibroblast growth factor- 1) genes, such as FGF1A, FGF1B, and FGF1C.
- target genes include, but not limited to, Atf3, Axudl, Btg2, c-Fos, c-Jun, Cxcll, Cxcl2, Ednl, Ereg, Fos, Gadd45b, Ier2, Ier3, Ifrdl, Illb, 116, Irfl, Junb, Lif, Nfkbia, Nfkbiz, Ptgs2, Slc25a25, Sqstml, Tieg, Tnf, Tnfaip3, Zfp36, Birc2, Ccl2, Ccl20, Ccl7, Cebpd, Ch25h, CSF1, Cx3cll, CxcllO, Cxcl5, Gch, Icaml, Ifi47, Ifngr2, MmplO, Nfkbie, Npall, p21, Relb, Ripk2, Rndl, Slpr3, Stxl l, Tgtp, Tlr2, Tmeml
- compositions for genome editing, genomic alteration or altering gene expression of a target gene include an optimized gRNA generated by the disclosed method with a a CRISPR/Cas9-based system or
- the gRNA can discriminate between on- and off-target sites with minimal thermodynamic energetic differences between the sites and provide increased specificity.
- the optimized gRNA modulates strand invasion into the protospacer.
- the increase in specificity is achieved by adding an extension to the 5'-end or 3' - end of a full-length or standard gRNA such that it forms a 'hairpin' structure that is self- complementary to the segment of the full-length or standard gRNA which targets the
- the hairpins serve as a kinetic barrier to strand invasion of the protospacer, but the hairpins are displaced during strand invasion of the full target sites so full invasion can occur.
- Optimized gRNAs with 5'-hairpins or 3 '-hairpins significantly enhanced specificity in binding compared to both standard guide RNAs and the best available guide RNA variants (see examples), and abolished or significantly weakened binding at protospacer sites containing mismatches.
- Increasing lengths of the hairpin increased the specificity of dCas9 binding.
- Optimized gRNA and hpgRNAs can be used to tune Cas9/dCas9 or Cpfl binding affinities and specificity. Based on the size and structure of the hairpin, the hairpin of hpgRNAs could be accommodated within the DNA-binding channel of Cas9/dCas9 molecule and protected from degradation.
- the hairpin length, loop length, and loop composition may be changed to allow for more fine control of these properties.
- the hairpin length can be between about 1 and about 20 nucleotides or between about 3 to about 10 nucleotides.
- the hairpin length can be between 1 and 20, between 1 and 19, between 1 and 18, between 1 and 17, between 1 and 16, between 1 and 15, between 1 and 14, between 1 and 13, between 1 and 12, between 1 and 11, between 1 and 10, between 1 and 9, between 1 and 8, between 1 and 7, between 1 and 6, between 1 and 5, between 2 and 20, between 2 and 19, between 2 and 18, between 2 and 17, between 2 and 16, between 2 and 15, between 2 and 14, between 2 and 13, between 2 and 12, between 2 and 11, between 2 and 10, between 2 and 9, between 2 and 8, between 2 and 7, between 2 and 6, between 2 and 5, between 3 and 20, between 3 and 19, between 3 and 18, between 3 and 17, between 3 and 16, between 3 and 15, between 3 and 14, between 3 and 13, between 3 and 12, between 3 and 11, between 3 and 10, between 3 and 9, between 3 and 8, between 3 and 7, between 3 and 6, between 3 and 5, between 3 and 20, between 3 and 19, between 3 and 18, between 3 and 17, between 3 and 16, between 3 and 15, between 3 and 14, between 3 and 13, between 3 and 12, between 3 and 11, between 3 and 10, between 3 and 9, between 3 and 8, between 3 and 7,
- 4 and 20 between 4 and 19, between 4 and 18, between 4 and 17, between 4 and 16, between 4 and 15, between 4 and 14, between 4 and 13, between 4 and 12, between 4 and 11, between 4 and 10, between 4 and 9, between 4 and 8, between 4 and 7, between 4 and 6, between 4 and 5, between 5 and 20, between 5 and 19, between 5 and 18, between 5 and 17, between 5 and 16, between 5 and 15, between 5 and 14, between 5 and 13, between 5 and 12, between 5 and 11, between 5 and 10, between 5 and 9, between 5 and 8, between 5 and 7, between 5 and 6, between 6 and 20, between 6 and 19, between 6 and 18, between 6 and 17, between 6 and 16, between 6 and 15, between 6 and 14, between 6 and 13, between 6 and 12, between 6 and 11, between 6 and 10, between 6 and 9, between 6 and 8, between 6 and 7, between 7 and 20, between 7 and 19, between 7 and 18, between 7 and 17, between 7 and 16, between 7 and 15, between 7 and 14, between 7 and 13, between 7 and 12, between 7 and 11, between 7 and 10, between 7 and 9, between 7 and 8, between 8 and 20, between 8 and 19, between 8 and 18, between 8 and 17, between 8 and 16, between 8 and
- the loop length can be between about 1 and about 20 nucleotides, between about 3 to about 10 nucleotides, or between about 5 to about 8 nucleotides.
- the loop length can be between 1 and 20, between 1 and 19, between 1 and 18, between 1 and 17, between 1 and 16, between 1 and 15, between 1 and 14, between 1 and 13, between 1 and 12, between 1 and 11, between 1 and 10, between 1 and 9, between 1 and 8, between 1 and 7, between 1 and 6, between 1 and 5, between 2 and 20, between 2 and 19, between 2 and 18, between 2 and 17, between 2 and 16, between 2 and 15, between 2 and 14, between 2 and 13, between 2 and 12, between 2 and 11, between 2 and 10, between 2 and 9, between 2 and 8, between 2 and 7, between 2 and 6, between 2 and 5, between 3 and 20, between 3 and 19, between 3 and 18, between 3 and 17, between 3 and 16, between 3 and 15, between 3 and 14, between 3 and 13, between 3 and 12, between 3 and 11, between 3 and 10, between 3 and 9, between 3 and 8, between 3 and 7, between 3 and 6, between 3 and 5, between 4 and 20,
- the loop composition can be between about 1 and about 20 nucleotides, between about 3 to about 10 nucleotides, or about 5 to about 8 nucleotides.
- the loop composition can be between 1 and 20, between 1 and 19, between 1 and 18, between 1 and 17, between 1 and 16, between 1 and 15, between 1 and 14, between 1 and 13, between 1 and 12, between 1 and 11, between 1 and 10, between 1 and 9, between 1 and 8, between 1 and 7, between 1 and 6, between 1 and 5, between 2 and 20, between 2 and 19, between 2 and 18, between 2 and 17, between 2 and 16, between 2 and 15, between 2 and 14, between 2 and 13, between 2 and 12, between 2 and 11, between 2 and 10, between 2 and 9, between 2 and 8, between 2 and 7, between 2 and 6, between 2 and 5, between 3 and 20, between 3 and 19, between 3 and 18, between 3 and 17, between 3 and 16, between 3 and 15, between 3 and 14, between 3 and 13, between 3 and 12, between 3 and 11, between 3 and 10, between 3 and 9, between 3 and 8, between 3 and 7, between 3 and 6, between 3 and 5, between 4 and 20, between 1 and 20, between 1 and 19, between 1
- compositions may include a may include viral vector and a CRISPR/Cas9-based system or CRISPR/Cpfl -based system with at least one gRNA, such as an optimized gRNA described herein.
- the composition includes a modified AAV vector and a nucleotide sequence encoding a CRISPR/Cas9-based system with at least one gRNA, such as an optimized gRNA described herein.
- the composition may further comprise a donor DNA or a transgene.
- the target gene may be involved in differentiation of a cell or any other process in which activation, repression, or disruption of a gene may be desired, or may have a mutation such as a deletion, frameshift mutation, or a nonsense mutation.
- the CRISPR/Cas9-based system or CRISPR/Cpfl -based system with at least one gRNA, such as an optimized gRNA described herein may be designed to recognize and bind a nucleotide sequence upstream or downstream from the premature stop codon, the aberrant splice acceptor site or the aberrant splice donor site.
- CRISPR/Cpfl -based system with at least one gRNA may also be used to disrupt normal gene splicing by targeting splice acceptors and donors to induce skipping of premature stop codons or restore a disrupted reading frame.
- CRISPR/Cas9-based system or CRISPR/Cpfl -based system with at least one gRNA, such as an optimized gRNA described herein, may or may not mediate off-target changes to protein-coding regions of the genome.
- the CRISPR/Cas9-based system induces or represses the gene expression of a target gene by at least about 1 fold, at least about 2 fold, at least about 3 fold, at least about 4 fold, at least about 5 fold, at least about 6 fold, at least about 7 fold, at least about 8 fold, at least about 9 fold, at least about 10 fold, at least 15 fold, at least 20 fold, at least 30 fold, at least 40 fold, at least 50 fold, at least 60 fold, at least 70 fold, at least 80 fold, at least 90 fold, at least 100 fold, at least about 110 fold, at least 120 fold, at least 130 fold, at least 140 fold, at least 150 fold, at least 160 fold, at least 170 fold, at least 180 fold, at least 190 fold, at least 200 fold, at least about 300 fold, at least 400 fold, at least 500 fold, at least 600 fold, at least 700 fold, at least 800 fold, at least 900 fold, at least 1000 fold, at least 1500 fold, at least 2000 fold, at least
- compositions for genome editing, genomic alteration or altering gene expression of a target gene may include a modified lentiviral vector.
- the modified lentiviral vector includes a first polynucleotide sequence encoding a DNA targeting system and a second polynucleotide sequence encoding at least one sgRNA.
- the first polynucleotide sequence may be operably linked to a promoter.
- the promoter may be a constitutive promoter, an inducible promoter, a repressible promoter, or a regulatable promoter.
- the second polynucleotide sequence encodes at least 1 gRNA, such as an optimized gRNA described herein.
- the second polynucleotide sequence may encode at least 1 gRNA, at least 2 gRNAs, at least 3 gRNAs, at least 4 gRNAs, at least 5 gRNAs, at least 6 gRNAs, at least 7 gRNAs, at least 8 gRNAs, at least 9 gRNAs, at least 10 gRNAs, at least 11 gRNA, at least 12 gRNAs, at least 13 gRNAs, at least 14 gRNAs, at least 15 gRNAs, at least 16 gRNAs, at least 17 gRNAs, at least 18 gRNAs, at least 19 gRNAs, at least 20 gRNAs, at least 25 gRNA, at least 30 gRNAs, at least 35 gRNAs, at least 40 gRNAs, at least 45 gRNAs, or at least 50 gRNAs.
- the second polynucleotide sequence may encode between 1 gRNA and 50 gRNAs, between 1 gRNA and 45 gRNAs, between 1 gRNA and 40 gRNAs, between 1 gRNA and 35 gRNAs, between 1 gRNA and 30 gRNAs, between 1 gRNA and 25 different gRNAs, between 1 gRNA and 20 gRNAs, between 1 gRNA and 16 gRNAs, between 1 gRNA and 8 different gRNAs, between 4 different gRNAs and 50 different gRNAs, between 4 different gRNAs and 45 different gRNAs, between 4 different gRNAs and 40 different gRNAs, between 4 different gRNAs and 35 different gRNAs, between 4 different gRNAs and 30 different gRNAs, between 4 different gRNAs and 25 different gRNAs, between 4 different gRNAs and 20 different gRNAs, between 4 different gRNAs and 16 different gRNAs, between 4 different gRNAs and 8 different g
- Each of the polynucleotide sequences encoding the different gRNAs may be operably linked to a promoter.
- the promoters that are operably linked to the different gRNAs may be the same promoter.
- the promoters that are operably linked to the different gRNAs may be different promoters.
- the promoter may be a constitutive promoter, an inducible promoter, a repressible promoter, or a regulatable promoter.
- At least one gRNA may bind to a target gene or loci. If more than one gRNA is included, each of the gRNAs binds to a different target region within one target loci or each of the gRNA binds to a different target region within different gene loci.
- AAV may be used to deliver the compositions to the cell using various construct configurations.
- AAV may deliver a CRISPR/Cas9-based system or CRISPR/Cpfl- based system and gRNA expression cassettes on separate vectors.
- the small Cas9 proteins derived from species such as Staphylococcus aureus or Neisseria meningitidis, are used then both the Cas9 and up to two gRNA expression cassettes may be combined in a single AAV vector within the 4.7 kb packaging limit.
- the composition includes a modified adeno-associated virus (AAV) vector.
- the modified AAV vector may be capable of delivering and expressing the CRISPR/Cas9-based system or CRISPR/Cpfl -based system in the cell of a mammal.
- the modified AAV vector may be an AAV-SASTG vector (Piacentino et al. (2012) Human Gene Therapy 23 :635-646).
- the modified AAV vector may be based on one or more of several capsid types, including AAVl, AAV2, AAV5, AAV6, AAV8, and AAV9.
- the modified AAV vector may be based on AAV2 pseudotype with alternative muscle-tropic AAV capsids, such as AAV2/1, AAV2/6, AAV2/7, AAV2/8, AAV2/9, AAV2.5 and AAV/SASTG vectors that efficiently transduce skeletal muscle or cardiac muscle by systemic and local delivery (Seto et al. Current Gene Therapy (2012) 12: 139-151).
- AAV2 pseudotype with alternative muscle-tropic AAV capsids such as AAV2/1, AAV2/6, AAV2/7, AAV2/8, AAV2/9, AAV2.5 and AAV/SASTG vectors that efficiently transduce skeletal muscle or cardiac muscle by systemic and local delivery (Seto et al. Current Gene Therapy (2012) 12: 139-151).
- the gRNA such as an optimized gRNA described herein, may be used with a CRISPR/Cas9 system with any type of cell.
- the cell is a bacterial cell, a fungal cell, an archaea cell, a plant cell or an animal cell, such as a mammalian cell. In some embodiments, this may be an organ or an animal organism.
- the cell may be any cell type or cell line, including but not limited to, 293-T cells, 3T3 cells, 721 cells, 9L cells, A2780 cells, A2780ADR cells, A2780cis cells, A172 cells, A20 cells, A253 cells, A431 cells, A-549 cells, ALC cells, B 16 cells, B35 cells, BCP-1 cells, BEAS-2B cells, bEnd.3 cells, BHK-21 cells, BR 293 cells, BxPC3 cells, C2C12 cells, C3H-10T1/2 cells, C6/36 cells, Cal-27 cells, CHO cells, COR-L23 cells, COR-L23/CPR cells, COR-L23/5010 cells, COR- L23/R23 cells, COS-7 cells, COV-434 cells, CML Tl cells, CMT cells, CT26 cells, D17 cells, DH82 cells, DU145 cells, DuCaP cells, EL4 cells, EM2 cells, EM3
- GM12878, K562 HI human embryonic stem cells, HeLa-S3, HepG2, HUVEC, SK-N-SH, IMR90, A549, MCF7, HMEC or LHCM, CD14+, CD20+, primary heart or liver cells, differentiated HI cells, 8988T, Adult_CD4_naive, Adult_CD4_ThO, Adult_CD4_Thl,
- BC Adipose UHNOOOOl BC_Adrenal_Gland_H12803N, BC_Bladder_01-l 1002,
- BC_Spleen_H12817N BC Stomach O 1-11002, BC_Stomach_H12817N, BC_Testis_N30, BC_Uterus_BN0765, BE2 C, BG02ES, BG02ES-EBD, BJ, bone_marrow_HS27a,
- HA0AF 61 1 1301.9 HAoEC, HAoEC_7071706.1, HA0EC 8O61 102.1, HA-sp, HBMEC, HBVP, HBVSMC, HCF, HCFaa, HCH, HCH OOl 1308.2P, HCH 8100808.2, HCM, HConF, HCPEpiC, HCT-1 16, Heart OC, Heart_STL003, HEEpiC, HEK293, HEK293T, HEK293-T- REx, Hepatocytes, HFDPC, HFDPC 0100503.2, HFDPC 0102703.3, HFF,HFF-Myc,
- HPIEpC_9041503.2 HRCEpiC, HRE, HRGEC, HRPEpiC, HSaVEC, HSaVEC_0022202.16, HSaVEC_9100101.15, HSMM, HSMM emb, HSMM FSHD, HSMMtube, HSMMtube emb, HSMMtube F SHD, HT-1080, HTR8svn, Huh-7, Huh-7.5, HVMF, HVMF 6091203.3, HVMF_6100401.3, HWP, HWP_0092205, HWP_8120201.5, iPS, iPS_CWRUl,
- Skeletal_Muscle_BC SkMC, SKMC, SkMC_8121902.17, SkMC_9011302, SK-N-MC, SK-N- SH RA, Small intestine OC, Spleen OC, Stellate, Stomach BC, T_cells_CD4+, T-47D, T98G, TBEC, Thl, Thl_Wb33676984, Thl_Wb54553204, Thl7, Th2, Th2_Wb33676984,
- the target cell can be any cell, such as a primary cell, a HEK293 cell, 293Ts cell, SKBR3 cell, A431 cell, K562 cell, HCT116 cell, HepG2 cell, or K-Ras-dependent and K-Ras-independent cell groups.
- the present disclosure relates to a method of epigenomic editing in a target cell or a subject with a CRISPR/Cas9-based system or CRISPR/Cpfl -based system.
- the method can be used to activate or repress a target gene.
- the method includes contacting a cell or a subject with an effective amount of the optimized gRNA molecule, as described herein, and a CRISPR/Cas9- based system or CRISPR/Cpfl -based system.
- the optimized gRNA is encoded by a polynucleotide sequence and packaged into a lentiviral vector.
- the lentiviral vector comprises an expression cassette comprising a promoter operably linked to the polynucleotide sequence encoding the sgRNA.
- the promoter operably linked to the polynucleotide encoding the optimized gRNA is inducible.
- the present disclosure relates to a method of site specific DNA cleavage in a target cell or a subject with a CRISPR/Cas9-based system or CRISPR/Cpfl -based system.
- the method includes contacting a cell or a subject with an effective amount of the optimized gRNA molecule, as described herein, and a CRISPR/Cas9-based system or CRISPR/Cpfl -based system.
- the optimized gRNA is encoded by a polynucleotide sequence and packaged into a lentiviral vector.
- the lentiviral vector comprises an expression cassette comprising a promoter operably linked to the polynucleotide sequence encoding the sgRNA.
- the promoter operably linked to the polynucleotide encoding the optimized gRNA is inducible.
- the number of gRNA administered to the cell or sample may be at least 1 gRNA, at least 2 different gRNA, at least 3 different gRNA at least 4 different gRNA, at least 5 different gRNA, at least 6 different gRNA, at least 7 different gRNA, at least 8 different gRNA, at least 9 different gRNA, at least 10 different gRNAs, at least 11 different gRNAs, at least 12 different gRNAs, at least 13 different gRNAs, at least 14 different gRNAs, at least 15 different gRNAs, at least 16 different gRNAs, at least 17 different gRNAs, at least 18 different gRNAs, at least 18 different gRNAs, at least 20 different gRNAs, at least 25 different gRNAs, at least 30 different gRNAs, at least 35 different gRNAs, at least 40 different gRNAs, at least 45 different gRNAs, or at least 50 different gRNAs.
- the number of gRNA administered to the cell may be between at least 1 gRNA to at least 50 different gRNAs, at least 1 gRNA to at least 45 different gRNAs, at least 1 gRNA to at least 40 different gRNAs, at least 1 gRNA to at least 35 different gRNAs, at least 1 gRNA to at least 30 different gRNAs, at least 1 gRNA to at least 25 different gRNAs, at least 1 gRNA to at least 20 different gRNAs, at least 1 gRNA to at least 16 different gRNAs, at least 1 gRNA to at least 12 different gRNAs, at least 1 gRNA to at least 8 different gRNAs, at least 1 gRNA to at least 4 different gRNAs, at least 4 gRNAs to at least 50 different gRNAs, at least 4 different gRNAs to at least 45 different gRNAs, at least 4 different gRNAs to at least 40 different gRNAs, at least 4 different gRNAs to at least 35
- the gRNA may comprise a complementary polynucleotide sequence of the target DNA sequence followed by a PAM sequence.
- the gRNA may comprise a "G" at the 5' end of the complementary polynucleotide sequence.
- the gRNA may comprise at least a 10 base pair, at least a l l base pair, at least a 12 base pair, at least a 13 base pair, at least a 14 base pair, at least a 15 base pair, at least a 16 base pair, at least a 17 base pair, at least a 18 base pair, at least a 19 base pair, at least a 20 base pair, at least a 21 base pair, at least a 22 base pair, at least a 23 base pair, at least a 24 base pair, at least a 25 base pair, at least a 30 base pair, or at least a 35 base pair complementary polynucleotide sequence of the target DNA sequence followed by a PAM sequence.
- the PAM sequence may be "NGG", where "N” can be any nucleotide.
- the gRNA may target at least one of the promoter region, the enhancer region or the transcribed region of the target gene. In some embodiments, the gRNA targets a nucleic acid sequence having a polynucleotide sequence of at least one of SEQ ID NOs: 13-148, 316, 317, or 320.
- the gRNA may include a nucleic acid sequence of at least one of SEQ ID NOs: 149-315, 321-323, or 326- 329.
- the present disclosure is also directed to a method of correcting a mutant gene in a subject.
- the method comprises administering to a cell of the subject the composition, as described above.
- CRISPR/Cpfl -based system with at least one gRNA such as an optimized gRNA described herein, to the cell may restore the expression of a full-functional or partially-functional protein with a repair template or donor DNA, which can replace the entire gene or the region containing the mutation.
- the CRISPR/Cas9-based system or CRISPR/Cpfl -based system with at least one gRNA, such as an optimized gRNA described herein, may be used to introduce site-specific double strand breaks at targeted genomic loci.
- Site-specific double-strand breaks are created when the CRISPR/Cas9-based system or CRISPR/Cpfl -based system with at least one gRNA, such as an optimized gRNA described herein, binds to a target DNA sequences, thereby permitting cleavage of the target DNA.
- This DNA cleavage may stimulate the natural DNA- repair machinery, leading to one of two possible repair pathways: homology-directed repair (HDR) or the non-homologous end joining (NHEJ) pathway.
- HDR homology-directed repair
- NHEJ non-homologous end joining
- the present disclosure is directed to genome editing with a CRISPR/Cas9-based system or CRISPR/Cpfl -based system with at least one gRNA, such as an optimized gRNA described herein, without a repair template, which can efficiently correct the reading frame and restore the expression of a functional protein involved in a genetic disease.
- the disclosed CRISPR/Cas9-based system or CRISPR/Cpfl -based system with at least one gRNA, such as an optimized gRNA described herein, may involve using homology-directed repair or nuclease- mediated non-homologous end joining (HEJ)-based correction approaches, which enable efficient correction in proliferation-limited primary cell lines that may not be amenable to homologous recombination or selection-based gene correction.
- HEJ nuclease- mediated non-homologous end joining
- This strategy integrates the rapid and robust assembly of active the CRISPR/Cas9-based system or CRISPR/Cpfl -based system with at least one gRNA, such as an optimized gRNA described herein, with an efficient gene editing method for the treatment of genetic diseases caused by mutations in nonessential coding regions that cause frameshifts, premature stop codons, aberrant splice donor sites or aberrant splice acceptor sites.
- at least one gRNA such as an optimized gRNA described herein
- Restoration of protein expression from an endogenous mutated gene may be through template-free NHEJ-mediated DNA repair.
- the correction of the target gene reading frame in the genome by a transiently expressed CRISPR/Cas9-based system or CRISPR/Cpfl -based system with at least one gRNA, such as an optimized gRNA described herein, may lead to permanently restored target gene expression by each modified cell and all of its progeny.
- Nuclease mediated NHEJ gene correction may correct the mutated target gene and offers several potential advantages over the HDR pathway.
- NHEJ does not require a donor template, which may cause nonspecific insertional mutagenesis.
- NHEJ operates efficiently in all stages of the cell cycle and therefore may be effectively exploited in both cycling and post-mitotic cells, such as muscle fibers. This provides a robust, permanent gene restoration alternative to oligonucleotide-based exon skipping or pharmacologic forced read-through of stop codons and could theoretically require as few as one drug treatment.
- NHEJ-based gene correction using a CRISPR/Cas9-based system or CRISPR/Cpfl -based system may be combined with other existing ex vivo and in vivo platforms for cell- and gene- based therapies, in addition to the plasmid electroporation approach described here.
- delivery of a CRISPR/Cas9-based system or CRISPR/Cpfl -based system by mRNA- based gene transfer or as purified cell permeable proteins could enable a DNA-free genome editing approach that would circumvent any possibility of insertional mutagenesis.
- the method as described above further includes administrating a donor template to the cell.
- the donor template may include a nucleotide sequence encoding a full-functional protein or a partially-functional protein.
- the donor template may include a miniaturized dystrophin construct, termed minidystrophin ("minidys”), a full- functional dystrophin construct for restoring a mutant dystrophin gene, or a fragment of the dystrophin gene that after homology-directed repair leads to restoration of the mutant dystrophin gene.
- minidystrophin minidystrophin
- the present disclosure is also directed to genome editing with the CRISPR/Cas9-based system or CRISPR/Cpfl -based system described above to restore the expression of a full- functional or partially-functional protein with a repair template or donor DNA, which can replace the entire gene or the region containing the mutation.
- the CRISPR/Cas9-based system or CRISPR/Cpfl -based system may be used to introduce site-specific double strand breaks at targeted genomic loci. Site-specific double-strand breaks are created when the CRISPR/Cas9- based system or CRISPR/Cpfl -based system binds to a target DNA sequences using the gRNA, thereby permitting cleavage of the target DNA.
- CRISPR/Cpfl -based system has the advantage of advanced genome editing due to their high rate of successful and efficient genetic modification.
- This DNA cleavage may stimulate the natural DNA-repair machinery, leading to one of two possible repair pathways: homology-directed repair (HDR) or the non-homologous end joining (NHEJ) pathway.
- HDR homology-directed repair
- NHEJ non-homologous end joining
- the present disclosure is directed to genome editing with CRISPR/Cas9-based system or CRISPR/Cpfl -based system without a repair template, which can efficiently correct the reading frame and restore the expression of a functional protein involved in a genetic disease.
- the disclosed CRISPR/Cas9-based system or CRISPR/Cpfl -based system and methods may involve using homology-directed repair or nuclease-mediated non-homologous end joining (NHEJ)-based correction approaches, which enable efficient correction in proliferation-limited primary cell lines that may not be amenable to homologous recombination or selection-based gene correction.
- NHEJ nuclease-mediated non-homologous end joining
- This strategy integrates the rapid and robust assembly of active CRISPR/Cas9- based system or CRISPR/Cpfl -based system with an efficient gene editing method for the treatment of genetic diseases caused by mutations in nonessential coding regions that cause frameshifts, premature stop codons, aberrant splice donor sites or aberrant splice acceptor sites.
- the present disclosure provides methods of correcting a mutant gene in a cell and treating a subject suffering from a genetic disease, such as DMD.
- the method may include administering to a cell or subject a CRISPR/Cas9-based system or CRISPR/Cpfl -based system, a polynucleotide or vector encoding said CRISPR/Cas9-based system or CRISPR/Cpfl -based system, or composition of said CRISPR/Cas9-based system or CRISPR/Cpfl -based system as described above.
- the method may include administering a CRISPR/Cas9-based system or CRISPR/Cpfl -based system, such as administering a Cas9 protein, a Cpfl protein, a Cas9 fusion protein containing a second domain, a nucleotide sequence encoding said Cas9 protein, Cpfl protein, or Cas9 fusion protein, and/or at least one gRNA, wherein the gRNAs target different DNA sequences.
- the target DNA sequences may be overlapping.
- the number of gRNA administered to the cell may be at least 1 gRNA, at least 2 different gRNA, at least 3 different gRNA at least 4 different gRNA, at least 5 different gRNA, at least 6 different gRNA, at least 7 different gRNA, at least 8 different gRNA, at least 9 different gRNA, at least 10 different gRNA, at least 15 different gRNA, at least 20 different gRNA, at least 30 different gRNA, or at least 50 different gRNA, as described above.
- the gRNA may include a nucleic acid sequence of at least one of SEQ ID NOs: 149-315, 321-323, or 326-329.
- the method may involve homology- directed repair or non-homologous end joining.
- compositions may comprise genetic constructs that encodes the CRISPR/Cas9-based system or CRISPR/Cpfl -based system, as disclosed herein.
- the genetic construct such as a plasmid, may comprise a nucleic acid that encodes the
- compositions may comprise genetic constructs that encodes the modified AAV vector and a nucleic acid sequence that encodes the CRISPR/Cas9-based system or CRISPR/Cpfl -based system with at least one gRNA, such as an optimized gRNA described herein.
- the genetic construct such as a plasmid, may comprise a nucleic acid that encodes the CRISPR/Cas9-based system or CRISPR/Cpfl -based system with at least one gRNA, such as an optimized gRNA described herein.
- the compositions, as described above, may comprise genetic constructs that encodes the modified lentiviral vector, as disclosed herein.
- the genetic construct, such as a plasmid may comprise a nucleic acid that encodes a Cas9-fusion protein and at least one sgRNA.
- the genetic construct may be present in the cell as a functioning extrachromosomal molecule.
- the genetic construct may be a linear minichromosome including centromere, telomeres or plasmids or cosmids.
- the genetic construct may also be part of a genome of a recombinant viral vector, including recombinant lentivirus, recombinant adenovirus, and recombinant adenovirus associated virus.
- the genetic construct may be part of the genetic material in attenuated live microorganisms or recombinant microbial vectors which live in cells.
- the genetic constructs may comprise regulatory elements for gene expression of the coding sequences of the nucleic acid.
- the regulatory elements may be a promoter, an enhancer, an initiation codon, a stop codon, or a polyadenylation signal.
- the nucleic acid sequences may make up a genetic construct that may be a vector.
- the vector may be capable of expressing the fusion protein, such as a Cas9-fusion protein, in the cell of a mammal.
- the vector may be recombinant.
- the vector may comprise heterologous nucleic acid encoding the Cas9-fusion protein.
- the vector may be a plasmid.
- the vector may be useful for transfecting cells with nucleic acid encoding the Cas9-fusion protein, which the transformed host cell is cultured and maintained under conditions wherein expression of the Cas9-fusion protein system takes place.
- Coding sequences may be optimized for stability and high levels of expression.
- codons are selected to reduce secondary structure formation of the RNA such as that formed due to intramolecular bonding.
- the vector may comprise heterologous nucleic acid encoding the CRISPR/Cas9-based system or CRISPR/Cpfl -based system and may further comprise an initiation codon, which may be upstream of the CRISPR/Cas9-based system or CRISPR/Cpfl -based system coding sequence, and a stop codon, which may be downstream of the CRISPR/Cas9-based system or
- the initiation and termination codon may be in frame with the CRISPR/Cas9-based system or CRISPR/Cpfl -based system coding sequence.
- the vector may also comprise a promoter that is operably linked to the CRISPR/Cas9-based system or CRISPR/Cpfl -based system coding sequence.
- the promoter operably linked to the CRISPR/Cas9-based system or CRISPR/Cpfl -based system coding sequence may be a promoter from simian virus 40 (SV40), a mouse mammary tumor virus (MMTV) promoter, a human immunodeficiency virus (HIV) promoter such as the bovine immunodeficiency virus (BIV) long terminal repeat (LTR) promoter, a Moloney virus promoter, an avian leukosis virus (ALV) promoter, a cytomegalovirus (CMV) promoter such as the CMV immediate early promoter, Epstein Barr virus (EBV) promoter, or a Rous sarcoma virus (RSV) promoter.
- SV40 simian virus 40
- MMTV mouse mammary tumor virus
- HSV human immunodeficiency virus
- HSV human immunodeficiency virus
- BIV bovine immunodeficiency virus
- LTR long terminal repeat
- Moloney virus promoter
- the promoter may also be a promoter from a human gene such as human ubiquitin C (hUbC), human actin, human myosin, human hemoglobin, human muscle creatine, or human metalothionein.
- the promoter may also be a tissue specific promoter, such as a muscle or skin specific promoter, natural or synthetic. Examples of such promoters are described in US Patent Application Publication No. US20040175727, the contents of which are incorporated herein in its entirety.
- the vector may also comprise a polyadenylation signal, which may be downstream of the CRISPR/Cas9-based system or CRISPR/Cpfl -based system.
- the polyadenylation signal may be a SV40 polyadenylation signal, LTR polyadenylation signal, bovine growth hormone (bGH) polyadenylation signal, human growth hormone (hGH) polyadenylation signal, or human ⁇ -globin polyadenylation signal.
- the SV40 polyadenylation signal may be a polyadenylation signal from a pCEP4 vector (Invitrogen, San Diego, CA).
- the vector may also comprise an enhancer upstream of the CRISPR/Cas9-based system or CRISPR/Cpfl -based system, i.e., the Cas9 protein, the Cpfl protein, or Cas9 fusion protein coding sequence or sgRNA, such as an optimized gRNA described herein.
- the enhancer may be necessary for DNA expression.
- the enhancer may be human actin, human myosin, human hemoglobin, human muscle creatine or a viral enhancer such as one from CMV, HA, RSV or EBV.
- Polynucleotide function enhancers are described in U.S. Patent Nos. 5,593,972, 5,962,428, and WO94/016737, the contents of each are fully incorporated by reference.
- the vector may also comprise a mammalian origin of replication in order to maintain the vector extrachromosomally and produce multiple copies of the vector in a cell.
- the vector may also comprise a regulatory sequence, which may be well suited for gene expression in a mammalian or human cell into which the vector is administered.
- the vector may also comprise a reporter gene, such as green fluorescent protein ("GFP") and/or a selectable marker, such as hygromycin ("Hygro").
- the vector may be expression vectors or systems to produce protein by routine techniques and readily available starting materials including Sambrook et a/., Molecular Cloning and Laboratory Manual, Second Ed., Cold Spring Harbor (1989), which is incorporated fully by reference.
- the vector may comprise the nucleic acid sequence encoding the CRISPR/Cas9-based system or CRISPR/Cpfl -based system, including the nucleic acid sequence encoding the Cas9 protein, the Cpf 1 protein, or Cas9 fusion protein and the nucleic acid sequence encoding the at least one gRNA comprising the nucleic acid sequence of at least one of SEQ ID NOs: 149-315, 321-323, or 326-329.
- the composition may be in a pharmaceutical composition.
- the pharmaceutical composition may comprise about 1 ng to about 10 mg of DNA encoding the CRISPR/Cas9- based system, CRISPR/Cpfl -based system, or CRISPR/Cas9-based system protein component, i.e., the Cas9 protein, the Cpfl protein, or Cas9 fusion protein.
- the pharmaceutical composition may comprise about 1 ng to about 10 mg of the DNA of the modified AAV vector and nucleotide sequence encoding the CRISPR/Cas9-based system with at least one gRNA, such as an optimized gRNA described herein.
- the pharmaceutical composition may comprise about 1 ng to about 10 mg of the DNA of the modified lentiviral vector.
- compositions according to the present invention are formulated according to the mode of administration to be used.
- pharmaceutical compositions are injectable pharmaceutical compositions, they are sterile, pyrogen free and particulate free.
- An isotonic formulation is preferably used.
- additives for isotonicity may include sodium chloride, dextrose, mannitol, sorbitol and lactose.
- isotonic solutions such as phosphate buffered saline are preferred.
- Stabilizers include gelatin and albumin.
- a vasoconstriction agent is added to the formulation.
- the composition may further comprise a pharmaceutically acceptable excipient.
- the pharmaceutically acceptable excipient may be functional molecules as vehicles, adjuvants, carriers, or diluents.
- the pharmaceutically acceptable excipient may be a transfection facilitating agent, which may include surface active agents, such as immune-stimulating complexes
- the transfection facilitating agent is a polyanion, polycation, including poly-L- glutamate (LGS), or lipid.
- the transfection facilitating agent is poly-L-glutamate, and more preferably, the poly-L-glutamate is present in the composition for genome editing at a concentration less than 6 mg/ml.
- the transfection facilitating agent may also include surface active agents such as immune-stimulating complexes (ISCOMS), Freunds incomplete adjuvant, LPS analog including monophosphoryl lipid A, muramyl peptides, quinone analogs and vesicles such as squalene and squalene, and hyaluronic acid may also be used administered in conjunction with the genetic construct.
- surface active agents such as immune-stimulating complexes (ISCOMS), Freunds incomplete adjuvant, LPS analog including monophosphoryl lipid A, muramyl peptides, quinone analogs and vesicles such as squalene and squalene, and hyaluronic acid may also be used administered in conjunction with the genetic construct.
- the DNA vector encoding the composition may also include a transfection facilitating agent such as lipids, liposomes, including lecithin liposomes or other liposomes known in the art, as a DNA-liposome mixture (see for example W09324640), calcium ions, viral proteins, polyanions, polycations, or nanoparticles, or other known transfection facilitating agents.
- a transfection facilitating agent such as lipids, liposomes, including lecithin liposomes or other liposomes known in the art, as a DNA-liposome mixture (see for example W09324640), calcium ions, viral proteins, polyanions, polycations, or nanoparticles, or other known transfection facilitating agents.
- the transfection facilitating agent is a polyanion, polycation, including poly-L-glutamate (LGS), or lipid.
- compositions may comprise genetic constructs that encodes the CRISPR/Cas9-based system or CRISPR/Cpfl -based system, as disclosed herein.
- the genetic construct such as a plasmid or expression vector, may comprise a nucleic acid that encodes the CRISPR/Cas9-based system or CRISPR/Cpfl -based system, and/or at least one gRNA, such as an optimized gRNA described herein.
- the compositions, as described above may comprise genetic constructs that encodes the modified lentiviral vector and a nucleic acid sequence that encodes the CRISPR/Cas9-based system or CRISPR/Cpfl -based system, as disclosed herein.
- the genetic construct such as a plasmid, may comprise a nucleic acid that encodes the CRISPR/Cas9-based system or CRISPR/Cpfl -based system.
- the compositions, as described above, may comprise genetic constructs that encodes a modified lentiviral vector.
- the genetic construct, such as a plasmid, may comprise a nucleic acid that encodes the
- the genetic construct may be present in the cell as a functioning extrachromosomal molecule.
- the genetic construct may be a linear
- minichromosome including centromere, telomeres or plasmids or cosmids.
- the genetic construct may also be part of a genome of a recombinant viral vector, including recombinant lentivirus, recombinant adenovirus, and recombinant adenovirus associated virus.
- the genetic construct may be part of the genetic material in attenuated live microorganisms or recombinant microbial vectors which live in cells.
- the genetic constructs may comprise regulatory elements for gene expression of the coding sequences of the nucleic acid.
- the regulatory elements may be a promoter, an enhancer, an initiation codon, a stop codon, or a polyadenylation signal.
- the nucleic acid sequences may make up a genetic construct that may be a vector.
- the vector may be capable of expressing the fusion protein, such as the CRISPR/Cas9-based system or CRISPR/Cpfl -based system, in the cell of a mammal.
- the vector may be
- the vector may comprise heterologous nucleic acid encoding the fusion protein, such as the CRISPR/Cas9-based system.
- the vector may be a plasmid.
- the vector may be useful for transfecting cells with nucleic acid encoding the CRISPR/Cas9-based system or CRISPR/Cpfl -based system, which the transformed host cell is cultured and maintained under conditions wherein expression of the CRISPR/Cas9-based system or CRISPR/Cpfl -based system takes place.
- Coding sequences may be optimized for stability and high levels of expression. In some instances, codons are selected to reduce secondary structure formation of the RNA such as that formed due to intramolecular bonding.
- the vector may comprise heterologous nucleic acid encoding the CRISPR/Cas9-based system or CRISPR/Cpfl -based system and may further comprise an initiation codon, which may be upstream of the CRISPR/Cas9-based system or CRISPR/Cpfl -based system coding sequence, and a stop codon, which may be downstream of the CRISPR/Cas9-based system or
- the initiation and termination codon may be in frame with the CRISPR/Cas9-based system or CRISPR/Cpfl -based system coding sequence.
- the vector may also comprise a promoter that is operably linked to the CRISPR/Cas9-based system or CRISPR/Cpfl -based system coding sequence.
- the CRISPR/Cas9-based system or CRISPR/Cpfl -based system may be under the light-inducible or chemically inducible control to enable the dynamic control of in space and time.
- CRISPR/Cas9-based system or CRISPR/Cpfl -based system coding sequence may be a promoter from simian virus 40 (SV40), a mouse mammary tumor virus (MMTV) promoter, a human immunodeficiency virus (HIV) promoter such as the bovine immunodeficiency virus (BIV) long terminal repeat (LTR) promoter, a Moloney virus promoter, an avian leukosis virus (ALV) promoter, a cytomegalovirus (CMV) promoter such as the CMV immediate early promoter, Epstein Barr virus (EBV) promoter, or a Rous sarcoma virus (RSV) promoter.
- SV40 simian virus 40
- MMTV mouse mammary tumor virus
- HSV human immunodeficiency virus
- HSV human immunodeficiency virus
- BIV bovine immunodeficiency virus
- LTR long terminal repeat
- Moloney virus promoter an avian leukosis
- the promoter may also be a promoter from a human gene such as human ubiquitin C (hUbC), human actin, human myosin, human hemoglobin, human muscle creatine, or human metalothionein.
- the promoter may also be a tissue specific promoter, such as a muscle or skin specific promoter, natural or synthetic. Examples of such promoters are described in US Patent Application Publication No. US20040175727, the contents of which are incorporated herein in its entirety.
- the vector may also comprise a polyadenylation signal, which may be downstream of the CRISPR/Cas9-based system or CRISPR/Cpfl -based system.
- the polyadenylation signal may be a SV40 polyadenylation signal, LTR polyadenylation signal, bovine growth hormone (bGH) polyadenylation signal, human growth hormone (hGH) polyadenylation signal, or human ⁇ -globin polyadenylation signal.
- the SV40 polyadenylation signal may be a polyadenylation signal from a pCEP4 vector (Invitrogen, San Diego, CA).
- the vector may also comprise an enhancer upstream of the CRISPR/Cas9-based system or CRISPR/Cpfl -based system and/or sgRNA, such as an optimized gRNA described herein.
- the enhancer may be necessary for DNA expression.
- the enhancer may be human actin, human myosin, human hemoglobin, human muscle creatine or a viral enhancer such as one from CMV, HA, RSV or EBV.
- Polynucleotide function enhancers are described in U.S. Patent Nos. 5,593,972, 5,962,428, and WO94/016737, the contents of each are fully incorporated by reference.
- the vector may also comprise a mammalian origin of replication in order to maintain the vector extrachromosomally and produce multiple copies of the vector in a cell.
- the vector may also comprise a regulatory sequence, which may be well suited for gene expression in a mammalian or human cell into which the vector is administered.
- the vector may also comprise a reporter gene, such as green fluorescent protein ("GFP") and/or a selectable marker, such as hygromycin ("Hygro").
- the vector may be expression vectors or systems to produce protein by routine techniques and readily available starting materials including Sambrook et al., Molecular Cloning and Laboratory Manual, Second Ed., Cold Spring Harbor (1989), which is incorporated fully by reference.
- the vector may comprise the nucleic acid sequence encoding the CRISPR/Cas9-based system or CRISPR/Cpfl -based system and the nucleic acid sequence encoding the at least one gRNA, such as an optimized gRNA described herein.
- the gRNA such as an optimized gRNA described herein, is encoded by a polynucleotide sequence and packaged into a lentiviral vector.
- the lentiviral vector includes an expression cassette.
- the expression cassette can includes a promoter operably linked to the polynucleotide sequence encoding the gRNA, such as an optimized gRNA described herein.
- the promoter operably linked to the polynucleotide encoding the gRNA is inducible.
- the composition includes a modified adeno-associated virus (AAV) vector.
- the modified AAV vector may have enhanced cardiac and skeletal muscle tissue tropism.
- the modified AAV vector may be capable of delivering and expressing the
- the modified AAV vector may be an AAV-SASTG vector (Piacentino et al. (2012) Human Gene Therapy 23 :635- 646).
- the modified AAV vector may deliver nucleases to skeletal and cardiac muscle in vivo.
- the modified AAV vector may be based on one or more of several capsid types, including AAV1, AAV2, AAV5, AAV6, AAV8, and AAV9.
- the modified AAV vector may be based on AAV2 pseudotype with alternative muscle-tropic AAV capsids, such as AAV2/1, AAV2/6, AAV2/7, AAV2/8, AAV2/9, AAV2.5 and AAV/SASTG vectors that efficiently transduce skeletal muscle or cardiac muscle by systemic and local delivery (Seto et al. Current Gene Therapy (2012) 12: 139-151).
- AAV2 pseudotype with alternative muscle-tropic AAV capsids such as AAV2/1, AAV2/6, AAV2/7, AAV2/8, AAV2/9, AAV2.5 and AAV/SASTG vectors that efficiently transduce skeletal muscle or cardiac muscle by systemic and local delivery (Seto et al. Current Gene Therapy (2012) 12: 139-151).
- a method for delivering the CRISPR/Cas9-based system or CRISPR/Cpfl -based system and the optimized gRNA described herein for providing genetic constructs and/or proteins of the CRISPR/Cas9-based system or CRISPR/Cpfl -based system may be the transfection or electroporation of the
- the CRISPR/Cas9-based system or CRISPR/Cpfl -based system protein may be delivered to the cell.
- the nucleic acid molecules may be electroporated using BioRad Gene Pulser Xcell or Amaxa Nucleofector lib devices or other electroporation device.
- Transfections may include a transfection reagent, such as Lipofectamine 2000.
- the vector encoding a CRISPR/Cas9-based system or CRISPR/Cpfl -based system protein may be delivered to the modified target cell in a tissue or subject by DNA injection (also referred to as DNA vaccination) with and without in vivo electroporation, liposome mediated, nanoparticle facilitated, and/or recombinant vectors.
- the recombinant vector may be delivered by any viral mode.
- the viral mode may be recombinant lentivirus, recombinant adenovirus, and/or recombinant adeno-associated virus.
- the nucleotide encoding a CRISPR/Cas9-based system or CRISPR/Cpfl -based system protein may be introduced into a cell to induce gene expression of the target gene.
- one or more nucleotide sequences encoding the CRISPR/Cas9-based system or CRISPR/Cpfl -based system directed towards a target gene may be introduced into a mammalian cell.
- the transfected cells Upon delivery of the CRISPR/Cas9-based system or CRISPR/Cpfl -based system to the cell, and thereupon the vector into the cells of the mammal, the transfected cells will express the CRISPR/Cas9-based system or CRISPR/Cpfl -based system.
- the CRISPR/Cas9-based system or CRISPR/Cpfl -based system may be administered to a mammal to induce or modulate gene expression of the target gene in a mammal.
- the mammal may be human, non-human primate, cow, pig, sheep, goat, antelope, bison, water buffalo, bovids, deer, hedgehogs, elephants, llama, alpaca, mice, rats, or chicken, and preferably human, cow, pig, or chicken.
- Methods of introducing a nucleic acid into a host cell are known in the art, and any known method can be used to introduce a nucleic acid (e.g., an expression construct) into a cell. Suitable methods include, include e.g., viral or bacteriophage infection, transfection,
- composition may be delivered by mRNA delivery and ribonucleoprotein (RNP) complex delivery.
- RNP ribonucleoprotein
- compositions may be administered to a subject by different routes including orally, parenterally, sublingually, transdermally, rectally, transmucosally, topically, via inhalation, via buccal administration, intrapleurally, intravenous, intraarterial, intraperitoneal, subcutaneous, intramuscular, intranasal intrathecal, and intraarticular or combinations thereof.
- the composition may be administered as a suitably acceptable formulation in accordance with normal veterinary practice. The veterinarian may readily determine the dosing regimen and route of administration that is most appropriate for a particular animal.
- the compositions may be administered by traditional syringes, needleless injection devices,
- microprojectile bombardment gone guns or other physical methods such as electroporation (“EP), “hydrodynamic method”, or ultrasound.
- EP electroporation
- hydrodynamic method or ultrasound.
- the composition may be delivered to the mammal by several technologies including DNA injection (also referred to as DNA vaccination) with and without in vivo electroporation, liposome mediated, nanoparticle facilitated, recombinant vectors such as recombinant lentivirus, recombinant adenovirus, and recombinant adenovirus associated virus.
- DNA injection also referred to as DNA vaccination
- liposome mediated such as recombinant lentivirus, recombinant adenovirus, and recombinant adenovirus associated virus.
- the composition may be injected into the skeletal muscle or cardiac muscle.
- the composition may be injected into the tibialis anterior muscle.
- kits which may be used for site-specific DNA binding.
- the kit comprises a composition, as described above, and instructions for using said composition.
- kits may be affixed to packaging material or may be included as a package insert. While the instructions are typically written or printed materials they are not limited to such. Any medium capable of storing such instructions and communicating them to an end user is contemplated by this disclosure. Such media include, but are not limited to, electronic storage media ⁇ e.g., magnetic discs, tapes, cartridges, chips), optical media ⁇ e.g., CD ROM), and the like. As used herein, the term "instructions" may include the address of an internet site that provides the instructions.
- the composition may include a modified lentiviral vector and a nucleotide sequence encoding a CRISPR/Cas9-based system and the optimized gRNA, as described above.
- the CRISPR/Cas9-based system as described above, may be included in the kit to specifically bind and target a particular regulatory region of the target gene. 20. Examples
- Tris-HCl (pH 7.6) buffer was obtained from Corning Life Sciences.
- L- glutamic acid monopotassium salt monohydrate, dithiothreitol (DTT), and magnesium chloride were obtained from Sigma Aldrich Co., LLC.
- the DNA substrates used for imaging— (i) a 1198 bp substrate derived from a segment of the AAVS1 locus of human chromosome 19; (ii) an 'engineered' 989 bp DNA substrate containing a series of six full, partial, or mismatched target sites; and (iii) a 1078 bp 'nonsense' substrate containing no homology to the protospacer (> 3 bp)— were also generated using standard techniques.
- the plasmids encoding wild-type Cas9 and dCas9 were obtained from Addgene (plasmid 39312 and plasmid 47106).
- Plasmids for the expression of Cas9 and dCas9 in bacteria were cloned using Gateway Cloning (Life Technologies). Briefly, PCR was used to amplify Cas9 and dCas9 genes and to add flanking attLl and attL2 sites. BP recombination was performed to transfer these genes to a shuttle vector, after which LP recombination was performed to transfer these genes to pDestl7, which adds an N-terminal hexa-histidine tag (Life Technologies). The plasmids encoding the chimeric sgRNA and sgRNA variants (described below) were cloned as previously described ( Perez-Pinera et al., (2013) Nature methods, 10, 973-976).
- Cell pellets were resuspended in 30 mL of Lysis Buffer (50mM Tris-HCl, 500 mM NaCl, 10 mM MgCl 2 , 10 % v/v glycerol, 0.2% Triton-1000, and ImM PMSF).
- Lysis Buffer 50mM Tris-HCl, 500 mM NaCl, 10 mM MgCl 2 , 10 % v/v glycerol, 0.2% Triton-1000, and ImM PMSF.
- the cell suspension was lysed by sonication at 30% duty cycle for 5 minutes.
- the suspension was then centrifuged for 30 minutes at 12,000xg.
- the supernatant was then taken and incubated with Ni- NTA resin (Qiagen) for 30 minutes under gentle agitation.
- the resin was then loaded onto a column, washed with Wash Buffer (35 mM imidizole, 50mM Tris-HCl, 500 mM NaCl, 10 mM MgCl 2 , 10 % v/v glycerol), and eluted with Elution Buffer (120 mM imidizole, 50mM Tris-HCl, 500 mM NaCl, 10 mM MgCl 2 , 10 % v/v glycerol). Ultracel-30k centrifugal filters were then used to exchange solvents to the Storage Buffer (50mM Tris-HCl, 500 mM NaCl, 10 mM
- RNA and guide RNA variants were in vitro transcribed using the MEGAshortscript T7 Transcription Kit (Life Technologies. DNA templates with a T7 promoter were generated via PCR from guide RNA plasmids and reactions were set up following the manufacturer's instructions. The T7 templates for the guide RNAs with 2 nucleotides truncated from their 5' - ends (tru-gRNAs) and those with 5' extensions that form hairpins (hp-gRNAs) were generated by PCR off of the standard gRNA plasmids. The RNA was then purified using phenol-chloroform extraction using standard techniques
- Genomic DNA was extracted and purified from HEK293T cell line using the DNeasy kit (Qiagen), following the manufacturer's protocol.
- the AAVS1 locus was then amplified using PCR.
- the 1198 bp AAVS1 -derived substrate was constructed via direct PCR from genomic DNA using primers from Integrated DNA
- GAGCTCT ACTGGCTTCTGCG-3 ' where ⁇ Bt ⁇ represents a biotinylation of the primer at the 5'- end.
- the 'engineered' DNA substrate which contains a series of PAMs and full or partial protospacer sites, was ordered as two gBlock fragments each containing an EcoRI restriction site on one end. Substrates were digested, ligated together, and then enriched via PCR with primers (Integrated DNA Technologies, IDT): 5 ' - ⁇ Bt ⁇ -C ATGACGTGC AGC AAGC-3 ' and 5'- CGACGATGCGCTGAATC-3' .
- a 'nonsense' substrate containing no sites exhibiting homology (greater than 3 bp) to the protospacer a 690 bp DNA construct was synthesized (GeneScript, Inc.) containing a series of restriction sites, and an addition length of DNA from lambda DNA (New England Biolabs) was sub-cloned into the construct; the 1078 bp substrate was then PCR amplified using primers (IDT): 5'- ⁇ Bt ⁇ -
- Atomic Force Microscopy Atomic force microscopy (AFM) was performed in air using a Bruker (nee Veeco) Nanoscope V Multimode with RTSEP (Bruker) probes (nominal spring constant 40 N/m, resonance frequency, 300 kHz). Prior to experiments, protein and guide RNAs were mixed in 1 : 1.5 ratio for 10 minutes. Protein and DNA were mixed in a solution of working buffer for at least 10 minutes (up to 35 minutes) at room temperature, deposited for 8 seconds on freshly cleaved mica (Ted Pella, Inc.) that had been treated with 3- aminopropylsiloxane (prepared as previously described (24)), rinsed with ultra-pure (> 17 ⁇ ) water, and dried in air.
- AFM Atomic force microscopy
- Proteins were centrifuged briefly prior to incubation with DNA. When the standard sgRNA was used, at least four preparations for each experimental condition were imaged, and at least two for experiments with the other guide RNA variants. In general images were acquired with pixel resolution of 1024 x 1024 over 2.75 micron square areas or 2048 x 2048 over 5.5 micron square areas at 1-1.5 line/s for each sample. Images of several thousand (-2500 - 6000) DNA molecules were resolved for each experimental condition.
- the position of next element of the backbone (x 2 ) is estimated by stepping 2.5 nm toward the nearest hand-traced points beyond the estimated boundary of the streptavidin.
- An 1 1 -pixel line is drawn on a two-fold linear interpolation of the image of the DNA perpendicularly to the (xi-x 2 ) line segment at x 2 .
- x 2 is relocated to the position on the normal line with the maximum topographical height then adjusted to the 2.5 nm from xi on the new (xi-x 2 ) line.
- x n are then iteratively estimated using the nearest hand-traced points to generate the initial guess for the next backbone position then corrected as before, and the correction process continues until the point xicide is less than 2.5 nm from the end of the traced DNA molecule.
- the position of the DNA is instead estimated as the point on a cubic Hermite spline (using points 3 ⁇ 4, Xi, X j , and X j +i, where X j is the first point of the hand-drawn trace beyond the estimated Cas9/dCas9 boundary) located 2.5 nm from Xi.
- the height of the DNA along the contour is extracted (relative to the median pixel height of the local region).
- the estimated boundaries of the streptavidin and Cas9/dCas9 were iteratively expanded or retracted around the original estimate until they expanded to a contiguous region greater than ( ⁇ + c ), where [i d and c are the mean and standard deviation of the height of the traced DNA beyond the estimated positions of bound proteins, and the estimate converges.
- KMC Kinetic Monte Carlo
- the guide RNA is base-paired with the protospacer up to protospacer site m (1 ⁇ m ⁇ 20 for sgRNA and 1 ⁇ m ⁇ 18 for a truncated sgRNA (tru-gRNA)) and, to first-order, the forward rate (rate of additional guide RNA invasion) v is estimated using the symmetric approximation to be exp(-(AG°(w +1) RN A :DN A - AG°(m +1) DN A: DN A)/2RT), where R is Boltzmann's constant, T is the temperature (here 37°C to correspond with parameter set that was used), AG°(m + 1) RN A: DN A 1S free energy of the base- pairing between the RNA and protospacer at site m + 1 and AG°(m + 1) DN A :DN A is the free energy of the base-pairing between the protospacer and its complementary DNA strand (the 1/2 corrective term is included to satisfy detailed balance), v at state
- Free energy parameters are derived from the literature from experiments at 1M NaCl at 37°C. Sequence-dependent DNA:DNA hybridization free energies AG°(X) DN A: DN A were obtained from SantaLucia et al. (1996) Biochemistry, 35, 3555-3562; sequence-dependent RNA:DNA hybridization free energies AG°(X) RN A: DN A were obtained from Sugimoto et al. (1995) Biochemistry, 34, 11211-11216; and AG°(X) RN A: DN A values in cases of introduced point mismatches rG dG, rC dC, rA dA, and rU dT were obtained from Watkins et al.
- the sequence of the protospacer used is 'ATCCTGTCCCTAGTGGCCCC (SEQ ID NO: 336), the AAVS1 target site as in the AFM experiments;
- the sequence of the protospacer complementary DNA is ' GGGGCC ACT AGGGAC AGGAT ' (SEQ ID NO: 337), and the sequence of the guide RNA was either ' GGGGCC ACUAGGGACAGGAU' (SEQ ID NO: 338) for sgRNA or
- dCas9-tru-gRNA and dCas9-hp-gRNA data for comparison with dCas9-sgRNA structural properties.
- the AFM imaging conditions should remain mostly consistent so as not to introduce artifacts. This does not generally present an issue, for example, when comparing heights and volumes of dCas9 bound to different sites on the engineered DNA molecules, but presents a challenge when comparing the structural properties of dCas9/Cas9 when using different guide RNAs or DNA substrates.
- the heights and volumes of the streptavidin proteins used to label the ends of the traced DNA molecules were used, which should remain unchanged across all experimental conditions, for the different experiments.
- mean heights of the streptavidins differed by less than 0.1 nm (mean difference: 0.087 nm; standard deviation of differences: 0.052 nm) and their mean volumes (1098 nm 3 ) differed by less than 15 nm 3 (mean difference: 14.461nm 3 ; standard deviation of differences: 10.419 nm 3 ).
- the mean heights and volumes between the experiments with tru-gRNA and the hp-gRNAs differed from those with sgRNAs by up to 0.14 nm and 225 nm 3 , respectively.
- the heights of dCas9 with tru- gRNA and hp-gRNAs on engineered DNA were shifted by their difference in mean heights relative to those with sgRNAs and the volumes scaled by the percent difference of the mean volumes.
- Atomic force microscopy captures Cas9/dCas9 binding specifically and non-specifically along engineered DNA substrates with high resolution
- MM mismatched
- S. pyogenes Cas9 is a 160 kDa monomeric protein approximately 10 nm x 10 nm x 5 nm (from crystal structures), roughly divided into two lobe-like halves each containing a nuclease domain. Consistent with the x-ray structures, dCas9 - sgRNA imaged via AFM appears as large ovular structures (FIGS. 1 OA- IOC), after incubating Cas9 or dCas9 with DNA these structures bound along DNA were observed and assigned to be Cas9 or dCas9, respectively (FIG. IB, FIGS. 1 OA- IOC, and FIGS. 11A-11D).
- the biotinylated DNA molecules were labelled at one end with monovalent streptavidin tag prior to AFM imaging.
- DNA molecules that were observed with bound Cas9 or dCas9 proteins were selected for further analysis and traced with sub-pixel resolution according to a modified protocol adapted from that of Wiggins et al. (25), and the sites bound by Cas9/dCas9 were extracted (see Supplementary Methods for details).
- Table 1 Peaks recorded in binding histograms of FIGS. 1C-D for Cas9/dCas9-sgRNA and FIG. 2C for dCas9 with sgRNAs possessing 2 nt truncation at 5'- end (tru- gRNA), based on empirical fit to Gaussian o
- tru-gRNA is expected to interact with only the first 3 of the
- Peak width is A/2G, with ⁇ as the standard deviation
- kNumber of dCas9 molecules observed within 1 peak width (V2o) of binding site. If Cas9/dCas9 appeared to contact DNA at n sites, that molecule is weighted by 1/n. If molecules overlapped both 10MM and 5 MM sites, # was weighted by an additional 1/2.
- Table 2 Apparent dissociation constants for dCas9 with different guide RNA variants from the 989 bp 'engineered' DNA substrates (e.g., FIGS. ID, 2C, and 2D) that contain a series of fully- and partially- complementary protospacer sites
- sgRNA Full-length single-guide RNA
- Truncated sgRNA (first two nt at 5' - truncated)
- dCas9 is relatively tolerant to distal mismatches
- the mean single-site dissociation constant for dCas9 binding along the 'nonspecific' substrate was estimated to be approximately 867 nM (standard deviation ⁇ 209 nM). This can be understood as an estimate of the dCas9 binding dissociation constant on DNA with no protospacer homology.
- sgRNAs with a two nucleotide truncation at their 5'- ends do not increase binding specificity of dCas9 in vitro
- Cas9 was found to still exhibit cleavage activity even if up to four nucleotides of the guide (protospacer-targeting) segment of the sgRNA or crRNA were truncated from their 5 '-ends and Fu et al. (21) recently showed that use of sgRNAs with these 5' - truncations (optimally by 2 - 3 nucleotides) can actually result in orders-of-magnitude increase in Cas9 cleavage fidelity in vivo. It was suggested that the increased sensitivity to mismatched sites (MM) using these truncated sgRNAs (termed 'tru-gRNAs', FIG.
- MM mismatched sites
- dCas9 was imaged with a tru-gRNA with a two nucleotide 5'- truncation relative to the sgRNA used previously.
- the dCas9-tru-gRNA complexes were incubated with the engineered substrates that contained a series of full and partial protospacer sites. Again a distinct peak was found precisely at the full protospacer site (FIG. 2C and Table 1), although the apparent association constant relative to dCas9 with a full sgRNA at this site decreases considerably ⁇ i.e., dissociation constant increases, see Table 2).
- FIG. 2C shows the binding affinity of dCas9 with a standard gRNA (dashed line) compared with the binding affinity of a dCas9 with a tru-gRNA (papRNA, purple line) on a DNA molecule which contains a full protospacer (site i) as well as protospacer sites with 5 and 10 PAM-distal mismatches (sites ii and iii, respectively).
- 2C shows the standard guide RNAs retain significant ability to bind to these off-target sites (containing mismatches), and that gruRNAs exhibit no relative enhancement in binding specificity at sites which contained mismatches in the 5 10 nucleotides at the PAM distal end of the protospacer.
- the binding distribution of dCas9 with tru-gRNAs exhibits distinct peaks in its affinity exactly at the protospacer sites with 10 PAM-distal mismatches and 5 PAM-distal mismatches, demonstrating that it does not have increased binding specificity relative to full sgRNAs (see Table 1).
- the 'peaks' in the binding histogram are indicative of specific, stable binding at these off-target sites.
- binding at the off-target sites by dCas9-gruRNAs actually increases relative to binding to the protospacer compared to the standard guide RNA.
- This promiscuous binding may limit their utility for dCas9 and chimeric dCas9 derivatives. It may also reflect the off-target cleavage reported for this system which, while improved relative to the standard guide RNAs, was still significant at some off-target sites. For comparison, we found no specific binding of the hpgRNAs at these sites with mismatches (FIG. 2D).
- hpgRNAs bound at these sites with approximately the same affinity as they do nonspecifically to DNA with no homology to the protospacer, with a -22% decrease in the maximum observed off-target binding affinities relative to the truncated gRNAs. Additionally, based on the narrow geometry of the Cas9 DNA-binding channel, we expect that the presence of an unopened hairpin at mismatched protospacers may inhibit the conformational change in Cas9 necessary to perform cleavage (FIG. IB).
- hp- gRNAs Guide RNAs with 5'- hairpins complementary to 'PAM-distaP -targeting segments (hp- gRNAs) modulate the absolute binding propensity and profile of dCas9s bound to DNA with mismatched protospacers in vitro
- dCas9 specificity may be increased by extending the 5'- end of the sgRNA such that it formed a hairpin structure which overlapped the 'PAM-distal' -targeting (or 'non-seed') segment of the sgRNA (FIG. 2B).
- the hairpin is opened upon binding to a full protospacer and full strand invasion can occur. If there are PAM-distal mismatches at the target site, then it is more energetically favourable for the hairpin to remain closed and strand invasion is hindered. Similar topologies have been used recently for 'dynamic DNA circuits' which are driven by strand invasion.
- the hairpins serve as kinetic barriers to invasion, with oligonucleotide invasion rates slowed several orders of magnitude in cases of attempted invasion by targets with mismatches.
- the hairpins here may be displaced during invasion of the full target sites, but inhibit invasion if there were mismatches between the target and the non-seed targeting region of the guide RNA (FIG. 2B). In those cases, it is more energetically favourable for the hairpins to remain closed. While previous efforts which had added 5'- extensions to sgRNAs in order to complement additional nucleotides beyond the protospacer, these guide RNAs did not show increased Cas9 cleavage specificity in vivo. Rather, they were digested back approximately to their standard length in living cells. Based on the size and structure of the hairpin, the hairpin may be accommodated within the DNA-binding channel of Cas9/dCas9 molecule and protected from degradation.
- sgRNAs were generated with 5' - hairpins (hp-gRNAs) which overlapped the nucleotides complementary to the last six (hp6-gRNA) or ten (hplO-gRNA) PAM-distal sites of the protospacer.
- hp-gRNAs 5' - hairpins
- hplO-gRNA ten
- dCas9 with hp-gRNAs show a similar drop in affinity for the target site as with tru-gRNAs, however, in contrast to dCas9 with tru-gRNAs, dCas9 with hp- RNAs do not present any sharp binding peaks at off-target sites which would otherwise indicate strong, specific binding.
- hp6-gRNA there was an enrichment of binding around the sites of protospacers with 5 or 10 mismatched PAM-distal sites.
- binding to these mismatched sites is approximately at the level of the non-specific binding elsewhere on the substrate, representing a 22% decrease in the maximum observed off-target binding affinity relative to the tru-gRNAs (decrease in the maximum observed association constant from to 3.18 10 6 M to 2.48 10 6 M, FIG. 2D).
- This increase in specificity of hplO-gRNA is also reflected by a similar binding dissociation constant as hp6-gRNA to the protospacer sites but a significant increase in the overall dissociation constant to the entire (specific + non-specific) engineered substrate relative (Table 2).
- Cas9 and dCas9 undergo a progressive structural transition as they bind to DNA sites that increasingly match the targeted protospacer sequence
- TEM negative-stain transmission electron microscopy
- dCas9 molecules with sgRNAs were predominately smaller and egg-shaped (FIG. 3C(iii), and Table 3). But as dCas9 proteins bind to increasingly complementary target sequences (FIG. 3(a - ⁇ )), their height and volume significantly increase (FIGS. 3D and 12A-12B, Table 2) relative to non-specific binding, reaching a maximum size at the protospacer sequence. This increase is likewise accompanied by a shift in the population of dCas9 (FIG. 3A, and 12A-12B, Table 2) from structures clustering with the flatter and egg-shaped conformations (FIGS.
- Catalytically active Cas9 undergoes a significant increase in size as it binds to the protospacer sequence as well (FIG. 3( ⁇ )); however there is a small, but statistically significant, decrease in size relative to dCas9, and the conformation of Cas9 at full protospacer sites tends to cluster with the flatter (green) structures.
- tru-gRNA is expected to interact with only the first 8 of the 10 PAM-distal mismatched nucleotides at the 10MM site (labelled '8MM' in FIG. 3D).
- tru-gRNA is expected to interact with only the first 3 of the 5 PAM-distal mismatched nucleotides at the 5MM site (labelled '3MM' in FIG. 3D).
- AFM imaging directly reveals that although dCas9/Cas9 retains a significant propensity to bind protospacer sites with up to ten distal mismatches, binding to DNA sites that are increasingly complementary to the protospacer drives an increasing shift in the population of dCas9/Cas9 proteins toward what appear to be the active conformation. Notably, we see similar shift in structure between off-target sites and perfectly-matched sites for dCas9 with hp-gRNAs as well (Table 2 and FIG. 13). The presence of complementary PAM-distal sequences is known to be associated with increased stability of Cas9 on DNA.
- v r is estimated similarly as proportional to exp(-(AG°(w) D NA:DNA - AG° (W)RNA:DNA)/ 2RT) . Transition rates of this type have been previously used for computational studies of nucleotide base-pairing and stability, and here they allowed us to capture the general dynamics of the R-loop in a sequence- dependent manner.
- RNA:DNA base-pairs are energetically stronger than DNA:DNA base- pairs, and at equilibrium we see from the KMC trajectories that the guide RNAs are stably bound to the protospacer, as expected (FIG. 4C).
- sgRNA is quite stable and remains nearly totally invaded— during 95% of simulated time course, the strand remains invaded up to the 19 th protospacer site (FIG. 4B)— tru-gRNA exhibits significant fluctuations of protospacer re-annealing at PAM-distal sites (FIGS. 4B and 4C).
- the KMC experiments show that the mean lifetime between full invasion and re- annealing of the DNA back to the 16 th site is decreased by two orders of magnitude when replacing the sgRNA with the tru-gRNA (FIG. 4C inset).
- This result is consistent with the earlier finding that while Cas9 activity with tru-gRNA variants with 2 or 3 nucleotide (nt) truncations was modulated depending on sequence context, and that cleavage in all tested cases was dramatically reduced by -90% - 100% by 4 nt truncations and abolished after a 5 nt truncation.
- the conformational change to the protein activate state is stabilized by these interactions at or near the 16 th site of the protospacer.
- TTCTTCTTCTGCTCGG 83 GAGACCGAGCAGAAGA 219
- Truncation of the guide RNA by 4 or 5 nucleotides may abolish cleavage activity by sufficiently destabilizing the R-loop at the 14 th or 15 th position in much the same way that the tru-gRNA destabilized the R-loop at the 16 th - 17 th sites.
- 14 th and 15 th sites are necessarily invaded whenever the 16 th site is bound by sgRNA, it is likely that these positions are additionally informative because they are also more strongly anti -correlated with the probability of sgRNA dissociation from the duplex prior to bypassing the mismatched site (FIG. 6Ai and FIG. 16B), another mechanism by which cleavage would fail to occur.
- the R-loop as a dynamic structure in competition between strand invasion and DNA re-annealing can be useful in understanding mechanisms of off-target cleavage and mismatch tolerance.
- No statistically significant correlation was found between cleavage rate and the predicted DNA-RNA binding energies alone (FIG. 6B), suggesting that the kinetics of strand invasion can be considered when attempting to determine Cas9 activity at off- target sites. While cleavage is abolished when 4 or 5 nucleotides are truncated from the guide RNA, Cas9 is still able to cleave DNA with up to 6 distal-mismatch sites.
- hp-gRNAs were found to significantly weakened or abolished specific binding at homeologous targets. hp-gRNAs may be valuable for modulating dCas9 binding affinity and specificity in their potential applications in biology and medicine. Specifically, based on the narrow geometry of the Cas9 binding channel, the presence of an unopened hairpin at mismatched protospacers may inhibit the conformational change by Cas9 to the active state. The opening of the hairpin in hp-gRNAs upon binding could also be used as a binding-dependent signal in vivo, for example, to nucleate dynamic DNA/RNA structures only upon binding to specific sites.
- gRNA activity was tested in living cells to investigate dCas9 binding specificity.
- hp-gRNAs hairpin gRNAs
- Several hairpin gRNAs were designed for each of four target locations (protospacers) in the human genome (FIGS. 17 and 18).
- two targets were in the VEGFA gene, labeled VEGFA 1 (FIGS. 30-37) and VEGFA3 (FIGS. 38-43). All experiments were done in HEK293T cells.
- nt additional nucleotides (nt) were added to the 5' - end of full guide RNA (gRNAs, full length 20 nt) and designed to form Hairpins and secondary structures by hybridizing with the 5'- protospacer-targeting nucleotides, or nucleotides in the middle or the 3' - end of the protospacer- targeting region, in order to modulate binding and cleavage activity of Cas9 to protospacers.
- gRNAs full guide RNA
- VEGF A 1 -targeting hp-gRNA was computationally designed using the methods described herein to prevent binding at a known off-target site while allowing binding to the full protospacer (FIG. 44A-44C).
- the hp-gRNA was selected to have a binding lifetime greater than or equal to that of the full gRNA at the on-target site, and a binding lifetime less than or equal to that of the full-length gRNA at the top 3 off-target sites.
- 5'- structures were designed to include dG-rU wobble pairs to modulate the energetics of the secondary structures of the hp-gRNAs, or added to the end of truncated gRNAs (tru-gRNAs, ⁇ 20 nt) which themselves have been shown to promote higher specificity of Cas9 activity.
- Surveyor Assay Compared to Deep-Sequencing, the surveyor assay is lower in throughput and less sensitive. However the surveyor assay is faster and less technical in data analysis, providing gel images. Thus surveyors were done as a first pass, and the best conditions were analyzed in triplicated with Deep-sequencing. Both DeepSequencing and Surveyor are methods to quantify mutational events caused by Cas9+gRNA.
- the cell work for Surveyor was the same as described above. After genomic DNA was purified, primers were designed to amplify the targeted site. A pool of 200k cells was used in this experiment and each one of them had a different mutation since DNA repair is stochastic. The site across 200k cells was amplified to generate a heterogenous PCR product: some amplicons had deletions, some had insertions, and some were wild-type and unmodified, due to each cell stochastically(i.e. randomly, error prone) repairing Cas9 cut sites.
- the heterogenous-PCR pool was heated and repaired, and in some cases different strands annealed to each other: a wild-type DNA strand might bind to DNA with an insertion, or an insertion might bind to a deletion. When this happens a little "bubble” formed and this structure is called a DNA heteroduplex (see FIG. 46).
- the surveyor nuclease was used to detect these heteroduplexes by digesting and cleaving them. DNA cleavage was then a proxy for Cas9's mutational activity. The PCR pool was separated on a gel and the intensity of these digested bands was used to quantify the rate of Cas9 activity.
- a hp-gRNA hairpin targeting EMXl exhibited >6000-fold improvement in specificity over full gRNA (vs. tru-gRNA with ⁇ 100-fold improvement over gRNA).
- the VEGFAl -targeting hp-gRNA with a computationally-designed secondary structure using an in- house algorithm greatly outperformed the tru-gRNA activity in terms of specificity (18-fold vs. 3-fold improvement over gRNA).
- These hp-gRNAs were tested in conjunction with S. pyogenes Cas9.
- FIG. 44A-44C shows Surveyor assays of EMXl -targeting hp-gRNAs with Cas9 from S. aureous exhibiting on-target activity and no detectable off-target activity, in contrast to tru- gRNAs which show significant off-target activity.
- Kleinstiver et al. used full-length gRNAs to show that Lachnospiraceae Cpfl is susceptible to cut at off-target sites with mismatches at the 8-9 nucleotides in addition to PAM-distal sites, by using gRNAs which had mismatches with the target site at different locations (FIG. 47).
- hairpin guide RNAs used with the Type V CRISPR-Cas system CRISPR-Cpfl were designed and tested as described above using the methods of the present invention.
- the D MT1 gene (TTTC CTGATGGGTCCATGTCTGTTACTC (SEQ ID NO: 330)) was targeted for cleavage by Cpfl .
- "Off-target activity” was tested by using guide RNAs which had a mismatched nucleotide at position 9, e.g. CTGATGGTgCATGTCTGTTA (SEQ ID NO: 331), using full-length guide RNAs 20 nucleotides long or truncated gRNAs 17 nucleotides long CTGATGGTgCATGTCTG (SEQ ID NO: 332).
- CTGATGGTgCATG TCTG CATGcACCA (SEQ ID NO: 334).
- a Surveyor assay shows that that inclusion of these additional 3'- elements decreased or abolished the off-target activity at the DNMTl site exhibited by the full or truncated gRNAs.
- hp-gRNAs were designed with an "internal" hairpin design in which the PAM-distal 4 nucleotide served as the loop. The hairpin was added to the 3' - end of the gRNA.
- Table 7 shows the sequences of the hp-gRNA with a space in the sequences that separates this region. The mismatch is shown in lower case.
- FIG. 48 Surveyor results of these hp-gRNAs are shown in FIG. 48 and show that the addition of the hairpin to the 3 '-end abolished off-target activity.
- Lane 1 shows the control;
- lane 2 shows a full-length gRNA containing a mismatched nucleotide at position 9;
- lane 3 shows the full- length gRNA containing a mismatched nucleotide at position 9 and an additional 3'- hairpin structure;
- lane 4 shows a truncated gRNA containing a mismatched nucleotide at position 9;
- lane 5 shows the truncated gRNA containing a mismatched nucleotide at position 9 and an additional 3'- hairpin structure.
- the Surveyor primers used are also shown in Table 7.
- Cpfl tolerates mismatches at nucleotides 8-10 when using normal guide RNAs and cleaves DNA at those off-target sites (FIG. 47). As shown in FIG. 48, the Cpfl hp-gRNA were able to abolish the off-target activity shown in the Kleinstiver, while the truncated gRNAs could not. Table 7
- a method of generating an optimized guide RNA comprising: a) identifying a target region of interest, the target region of interest comprising a protospacer sequence; b) determining a polynucleotide sequence of a full-length gRNA that targets the target region of interest, the full-length gRNA comprising a protospacer-targeting sequence or segment; c) determining at least one or more off-target sites for the full-length gRNA; d) generating a polynucleotide sequence of a first gRNA, the first gRNA comprising the polynucleotide sequence of the full-length gRNA and a RNA segment, the RNA segment comprising a polynucleotide sequence having a length of M nucleotides that is complementary to a nucleotide segment of the protospacer-targeting sequence or segment, the RNA segment is at the 5' end of the polynucleotide sequence of
- a method of generating an optimized guide RNA comprising: a) identifying a target region of interest, the target region of interest comprising a protospacer sequence; b) determining a polynucleotide sequence of a full-length gRNA that targets the target region of interest, the full-length gRNA comprising a protospacer-targeting sequence or segment; c) determining at least one or more off-target sites for the full-length gRNA; d) generating a polynucleotide sequence of a first gRNA, the first gRNA comprising the polynucleotide sequence of the full-length gRNA and a RNA segment, the RNA segment comprising a polynucleotide sequence having a length of M nucleotides that is complementary to a nucleotide segment of the protospacer-targeting sequence or segment, the RNA segment is at the 3' end of the polynucleotide sequence of
- Clause 3 The method of clause 1 or 2, wherein the energetics of further invasion of a different gRNA is determined by determining the energetics of at least one of (I) breaking a DNA-DNA base-pairing, (II) forming an RNA-DNA base-pair, (III) energetic difference resulting from disrupting or forming different secondary structure within the uninvaded guide RNA, and (IV) forming or disrupting interactions between the displaced DNA strand that is complementary to the protospacer and any unpaired guide RNA nucleotides which are not involved in secondary structures.
- Clause 13 The method of clause 12, wherein the secondary structure is formed by partially hybridizing the protospacer-targeting sequence with the RNA segment.
- Clause 14 The method of clause 13, wherein the secondary structure modulates DNA binding or cleavage by Cas9 by disrupting invasion of the protospacer duplex or off-target duplex by the optimized gRNA.
- Clause 15 The method of any one of clauses 12-14, wherein the secondary structure is formed by hybridizing all or part of the RNA segment to nucleotides in the 5' - end of the protospacer-targeting sequence or segment, nucleotides in the middle of the protospacer- targeting sequence or segment, and/or nucleotides in the 3' - end of the protospacer-targeting sequence or segment.
- Clause 16 The method of any one of clauses 12-15, wherein the secondary structure is a hairpin.
- Clause 17 The method of any one of clauses 12-16, wherein the secondary structure is stable at room temperature or 37°C.
- Clause 18 The method of any one of clauses 12-17, wherein the overall equilibrium free energy of the secondary structure is less than about 2 kcal/mol at room temperature or 37°C.
- Clause 19 The method of any one of clauses 1-18, wherein the RNA segment hybridizes or forms non-canonical base pairs with at least two nucleotides of the protospacer- targeting sequence or segment.
- Clause 20 The method of clause 19, wherein the non-canonical base pair is rU-rG.
- Clause 21 The method of any one of clauses 1-20, wherein the optimized gRNA is used with a CRISPR/Cas9-based system or CRISPR/Cpfl -based system in a cell.
- Clause 22 The method of any one of clauses 1-21, wherein the secondary structure protects the optimized gRNA within the CRISPR/Cas9-based system or CRISPR/Cpfl -based system to prevent degradation within the cell.
- Clause 23 The method of any one of clauses 1-22, wherein 1-20 nucleotides are randomized in the linker.
- Clause 24 The method of any one of clauses 1-23, wherein 1-20 nucleotides are randomized in the RNA segment.
- Clause 25 The method of any one of clauses 1-24, wherein step (g) is repeated X number of times, thereby generating X number of gRNAs and repeating step (e) with each X number of gRNAs, wherein X is between 0 to 20.
- Clause 26 The method of any one of clauses 1-25, wherein the invasion kinetics and lifetime are calculated using kinetic Monte Carlo method or Gillespie algorithm.
- Clause 27 The method of any one of clauses 1-26, wherein the invasion kinetics is the rate at which the guide RNA invades the protospacer duplex to full invasion such that the protospacer is completely invaded and/or the rate at which the segment of protospacer DNA bound to the gRNA expands as it is displaced from its complementary strand and bound to the gRNA nucleotide-by-nucleotide from its PAM proximal region through to full invasion.
- Clause 28 The method of any one of clauses 1-27, wherein the design criteria comprises specificity, modulation of binding lifetime, and/or estimated cleavage specificity.
- Clause 29 The method of clause 28, wherein the design criteria comprises an optimized gRNA having a binding lifetime greater than or equal to the binding lifetime of a full- length gRNA to the on-target site and/or a binding lifetime less than or equal to the binding lifetime of a full-length gRNA to an off-target site.
- Clause 30 The method of clause 29, wherein the design criteria comprises an optimized gRNA having a binding lifetime less than or equal to the binding lifetime of a full- length gRNA to at least three off-target sites, wherein the off-target sites are predicted to be the closest off-target sites or predicted to have the highest identity to the on-target sites.
- Clause 31 The method of clause 28, wherein the design criteria comprises a lifetime or cleavage rate at an off-target site that is less than or equal to the lifetime or cleavage rate of a full-length gRNA or truncated gRNA at the off-target site and/or a predicted on-target activity rate that is greater than 10% of the predicted on-target activity rate of a full-length gRNA or truncated gRNA.
- Clause 32 The method of any one of clauses 1-31, wherein the optimized gRNA is tested in step i) using surveyor assay, next-gen sequencing techniques, or GU DE-Seq.
- Clause 33 The method of any one of clauses 1-32, wherein the optimized gRNA is designed to minimize binding at an off-target site and allow binding to a protospacer sequence.
- Clause 34 The method of any one of clauses 1-33, wherein the off-target site is a known or predicted off-target site.
- Clause 35 The method of any one of clauses 1-34, wherein the full-length gRNA targets a mammalian gene.
- Clause 36 The method of any one of clauses 1-35, wherein the target gene comprises an endogenous target gene or a transgene.
- Clause 37 The method of any one of clauses 1-36, wherein the target gene comprises a disease-relevant gene.
- Clause 38 The method of any one of clauses 1-37, wherein the target gene is a DMD, EMX1, or VEGFA gene.
- Clause 42 The optimized gRNA of clause 40 or 41, wherein the optimized gRNA modulates strand invasion into the protospacer.
- Clause 43 The optimized gRNA of any one of clauses 40-42, wherein the optimized gRNA comprises a nucleotide sequence of at least one of SEQ ID NOs: 149-315, 321-323, and 326-329.
- Clause 45 A vector comprising the isolated polynucleotide of clause 44.
- Clause 46 A cell comprising the isolated polynucleotide of clause 44 or the vector of clause 45.
- Clause 47 A kit comprising the isolated polynucleotide of clause 44, the vector of clause 45, or the cell of clause 46.
- Clause 48 A method of epigenomic editing in a target cell or a subject, the method comprising contacting a cell or a subject with an effective amount of the optimized gRNA molecule of any one of clauses 40-43 or the isolated polynucleotide of clause 44 and a fusion protein, the fusion protein comprising a first polypeptide domain comprising a nuclease-deficient Cas9 and a second polypeptide domain having an activity selected from the group consisting of transcription activation activity, transcription repression activity, nuclease activity, transcription release factor activity, histone modification activity, nucleic acid association activity, DNA methylase activity, and direct or indirect DNA demethylase activity.
- Clause 49 A method of site specific DNA cleavage in a target cell or a subject, the method comprising contacting a cell or a subject with an effective amount of the optimized gRNA molecule of any one of clauses 40-43 or the isolated polynucleotide of clause 44 and a fusion protein or Cas9 protein, the fusion protein comprising a first polypeptide domain comprising a nuclease-deficient Cas9 and a second polypeptide domain having an activity selected from the group consisting of transcription activation activity, transcription repression activity, nuclease activity, transcription release factor activity, histone modification activity, nucleic acid association activity, DNA methylase activity, and direct or indirect DNA
- Clause 50 A method of genome editing in a cell, the method comprising
- the fusion protein comprising a first polypeptide domain comprising a nuclease-deficient Cas9 and a second polypeptide domain having an activity selected from the group consisting of transcription activation activity, transcription repression activity, nuclease activity, transcription release factor activity, histone modification activity, nucleic acid association activity, DNA methylase activity, and direct or indirect DNA demethylase activity.
- Clause 51 The method of clause 50, wherein the genome editing comprises correcting a mutant gene or inserting a transgene.
- Clause 52 The method of clause 51, wherein correcting a mutant gene comprises deleting, rearranging, or replacing the mutant gene.
- Clause 53 The method of any one of clauses 51 or 52, wherein correcting the mutant gene comprises nuclease-mediated non-homologous end joining or homology-directed repair.
- Clause 54 A method of modulating gene expression in a cell, the method comprising contacting the cell with an effective amount of the optimized gRNA molecule of any one of clauses 40-43 or the isolated polynucleotide of clause 44 and a fusion protein, the fusion protein comprising a first polypeptide domain comprising a nuclease-deficient Cas9 and a second polypeptide domain having an activity selected from the group consisting of transcription activation activity, transcription repression activity, nuclease activity, transcription release factor activity, histone modification activity, nucleic acid association activity, DNA methylase activity, and direct or indirect DNA demethylase activity.
- Clause 55 The method of clause 54, wherein the gene expression of the at least one target gene is modulated when gene expression levels of the at least one target gene are increased or decreased compared to normal gene expression levels for the at least one target gene.
- Clause 56 The method of clause 54 or 55, wherein the fusion protein comprises a dCas9 domain and a transcriptional activator.
- Clause 58 The method of clause 54 or 55, wherein the fusion protein comprises a dCas9 domain and a transcriptional repressor.
- Clause 60 The method of clause 54 or 55, wherein the fusion protein comprises a dCas9 domain and a site-specific nuclease.
- Clause 61 The method of any one of clauses 48-60 wherein the optimized gRNA is encoded by a polynucleotide sequence and packaged into a lentiviral vector.
- Clause 62 The method of clause 61, wherein the lentiviral vector comprises an expression cassette comprising a promoter operably linked to the polynucleotide sequence encoding the gRNA.
- Clause 63 The method of clause 62, wherein the promoter operably linked to the polynucleotide encoding the optimized gRNA is inducible.
- Clause 64 The method of any one of clauses 61-63, herein the lentiviral vector further comprises a polynucleotide sequence encoding the Cas9 protein or fusion protein.
- Clause 65 The method of any one of clauses 48-64, wherein the at least one target gene is a disease-relevant gene.
- Clause 66 The method of any one of clauses 48-65, wherein the target cell is a eukaryotic cell.
- Clause 67 The method of any one of clauses 48-66, wherein the target cell is a mammalian cell.
- Streptococcus pyogenes Cas 9 (with D10A, H840A) (SEP ID NO: 1)
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Organic Chemistry (AREA)
- Physics & Mathematics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Theoretical Computer Science (AREA)
- Medical Informatics (AREA)
- Evolutionary Biology (AREA)
- Medicinal Chemistry (AREA)
- Crystallography & Structural Chemistry (AREA)
- Pharmacology & Pharmacy (AREA)
- Cell Biology (AREA)
- Mycology (AREA)
- Analytical Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Virology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
Description
Claims
Priority Applications (20)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP24154252.1A EP4345455A3 (en) | 2015-08-25 | 2016-08-25 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases |
CA2996001A CA2996001A1 (en) | 2015-08-25 | 2016-08-25 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases |
EP24154247.1A EP4345454A3 (en) | 2015-08-25 | 2016-08-25 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases |
IL288662A IL288662B2 (en) | 2015-08-25 | 2016-08-25 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases |
CN202210083840.3A CN114634930A (en) | 2015-08-25 | 2016-08-25 | Compositions and methods for improving genome engineering specificity using RNA-guided endonucleases |
KR1020187008282A KR20180038558A (en) | 2015-08-25 | 2016-08-25 | Compositions and methods for improving specificity in genomic manipulation using RNA-guided endonuclease |
MX2018002339A MX2018002339A (en) | 2015-08-25 | 2016-08-25 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases. |
BR112018003625A BR112018003625A2 (en) | 2015-08-25 | 2016-08-25 | genomic engineering specificity enhancement compositions and methods using rna-guided endonucleases |
CN201680061639.8A CN108351350B (en) | 2015-08-25 | 2016-08-25 | Compositions and methods for improving genome engineering specificity using RNA-guided endonucleases |
EP22181872.7A EP4177346A3 (en) | 2015-08-25 | 2016-08-25 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases |
EP16840171.9A EP3341727B1 (en) | 2015-08-25 | 2016-08-25 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases |
AU2016311454A AU2016311454C1 (en) | 2015-08-25 | 2016-08-25 | Compositions and methods of improving specificity in genomic engineering using RNA-guided endonucleases |
JP2018510820A JP6905755B2 (en) | 2015-08-25 | 2016-08-25 | Compositions and Methods to Improve Specificity in Genomic Engineering Using RNA-Guided Endonucleases |
EA201890565A EA201890565A1 (en) | 2015-08-25 | 2016-08-25 | COMPOSITIONS AND METHODS OF IMPROVING SPECIFICITY IN GENOMIC ENGINEERING WITH THE USE OF RNA-DIRECTED ENDONUCLEAS |
ES16840171T ES2929110T3 (en) | 2015-08-25 | 2016-08-25 | Compositions and methods to improve the specificity in genetic engineering using RNA-guided endonucleases |
US15/754,861 US11427817B2 (en) | 2015-08-25 | 2016-08-25 | Compositions and methods of improving specificity in genomic engineering using RNA-guided endonucleases |
IL257677A IL257677B (en) | 2015-08-25 | 2018-02-22 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases |
US17/879,506 US20230047669A1 (en) | 2015-08-25 | 2022-08-02 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases |
US17/897,033 US20230304000A1 (en) | 2015-08-25 | 2022-08-26 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases |
AU2023241327A AU2023241327A1 (en) | 2015-08-25 | 2023-10-05 | Compositions and methods of improving specificity in genomic engineering using RNA-guided endonucleases |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201562209466P | 2015-08-25 | 2015-08-25 | |
US62/209,466 | 2015-08-25 |
Related Child Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/754,861 A-371-Of-International US11427817B2 (en) | 2015-08-25 | 2016-08-25 | Compositions and methods of improving specificity in genomic engineering using RNA-guided endonucleases |
US17/879,506 Division US20230047669A1 (en) | 2015-08-25 | 2022-08-02 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases |
US17/897,033 Continuation US20230304000A1 (en) | 2015-08-25 | 2022-08-26 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2017035416A2 true WO2017035416A2 (en) | 2017-03-02 |
WO2017035416A3 WO2017035416A3 (en) | 2017-04-06 |
Family
ID=58101224
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2016/048798 WO2017035416A2 (en) | 2015-08-25 | 2016-08-25 | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases |
Country Status (13)
Country | Link |
---|---|
US (3) | US11427817B2 (en) |
EP (4) | EP4345454A3 (en) |
JP (3) | JP6905755B2 (en) |
KR (1) | KR20180038558A (en) |
CN (2) | CN108351350B (en) |
AU (2) | AU2016311454C1 (en) |
BR (1) | BR112018003625A2 (en) |
CA (1) | CA2996001A1 (en) |
EA (1) | EA201890565A1 (en) |
ES (1) | ES2929110T3 (en) |
IL (2) | IL288662B2 (en) |
MX (1) | MX2018002339A (en) |
WO (1) | WO2017035416A2 (en) |
Cited By (45)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9999671B2 (en) | 2013-09-06 | 2018-06-19 | President And Fellows Of Harvard College | Delivery of negatively charged proteins using cationic lipids |
US10077453B2 (en) | 2014-07-30 | 2018-09-18 | President And Fellows Of Harvard College | CAS9 proteins including ligand-dependent inteins |
US10113163B2 (en) | 2016-08-03 | 2018-10-30 | President And Fellows Of Harvard College | Adenosine nucleobase editors and uses thereof |
US10167457B2 (en) | 2015-10-23 | 2019-01-01 | President And Fellows Of Harvard College | Nucleobase editors and uses thereof |
WO2018209158A3 (en) * | 2017-05-10 | 2019-01-03 | Editas Medicine, Inc. | Crispr/rna-guided nuclease systems and methods |
WO2019089761A1 (en) * | 2017-10-31 | 2019-05-09 | The Broad Institute, Inc. | Crispr protein inhibitors |
US10323236B2 (en) | 2011-07-22 | 2019-06-18 | President And Fellows Of Harvard College | Evaluation and improvement of nuclease cleavage specificity |
WO2019066490A3 (en) * | 2017-09-28 | 2019-06-27 | 주식회사 툴젠 | Artificial genome manipulation for gene expression regulation |
WO2019157326A1 (en) | 2018-02-08 | 2019-08-15 | Zymergen Inc. | Genome editing using crispr in corynebacterium |
CN110382695A (en) * | 2016-11-28 | 2019-10-25 | 得克萨斯州大学系统董事会 | Prevent muscular dystrophy by the gene editing of CRISPR/CPF1 mediation |
US10465176B2 (en) | 2013-12-12 | 2019-11-05 | President And Fellows Of Harvard College | Cas variants for gene editing |
US10508298B2 (en) | 2013-08-09 | 2019-12-17 | President And Fellows Of Harvard College | Methods for identifying a target site of a CAS9 nuclease |
US10597679B2 (en) | 2013-09-06 | 2020-03-24 | President And Fellows Of Harvard College | Switchable Cas9 nucleases and uses thereof |
US10745677B2 (en) | 2016-12-23 | 2020-08-18 | President And Fellows Of Harvard College | Editing of CCR5 receptor gene to protect against HIV infection |
WO2020210776A1 (en) * | 2019-04-12 | 2020-10-15 | Duke University | Crispr/cas-based base editing composition for restoring dystrophin function |
US10858639B2 (en) | 2013-09-06 | 2020-12-08 | President And Fellows Of Harvard College | CAS9 variants and uses thereof |
US20210040460A1 (en) | 2012-04-27 | 2021-02-11 | Duke University | Genetic correction of mutated genes |
US11028388B2 (en) | 2014-03-05 | 2021-06-08 | Editas Medicine, Inc. | CRISPR/Cas-related methods and compositions for treating Usher syndrome and retinitis pigmentosa |
US11046948B2 (en) | 2013-08-22 | 2021-06-29 | President And Fellows Of Harvard College | Engineered transcription activator-like effector (TALE) domains and uses thereof |
US11141493B2 (en) | 2014-03-10 | 2021-10-12 | Editas Medicine, Inc. | Compositions and methods for treating CEP290-associated disease |
US11155796B2 (en) | 2015-02-09 | 2021-10-26 | Duke University | Compositions and methods for epigenome editing |
US11180793B2 (en) | 2015-04-24 | 2021-11-23 | Editas Medicine, Inc. | Evaluation of Cas9 molecule/guide RNA molecule complexes |
US11242525B2 (en) | 2014-03-26 | 2022-02-08 | Editas Medicine, Inc. | CRISPR/CAS-related methods and compositions for treating sickle cell disease |
US11268082B2 (en) | 2017-03-23 | 2022-03-08 | President And Fellows Of Harvard College | Nucleobase editors comprising nucleic acid programmable DNA binding proteins |
US11268086B2 (en) | 2014-03-10 | 2022-03-08 | Editas Medicine, Inc. | CRISPR/CAS-related methods and compositions for treating Leber's Congenital Amaurosis 10 (LCA10) |
US20220088083A1 (en) * | 2019-01-18 | 2022-03-24 | Osaka University | Composition for use in treating dystrophic epidermolysis bullosa |
US11306324B2 (en) | 2016-10-14 | 2022-04-19 | President And Fellows Of Harvard College | AAV delivery of nucleobase editors |
US11319532B2 (en) | 2017-08-30 | 2022-05-03 | President And Fellows Of Harvard College | High efficiency base editors comprising Gam |
US11339437B2 (en) | 2014-03-10 | 2022-05-24 | Editas Medicine, Inc. | Compositions and methods for treating CEP290-associated disease |
US11421251B2 (en) | 2015-10-13 | 2022-08-23 | Duke University | Genome engineering with type I CRISPR systems in eukaryotic cells |
US11447770B1 (en) | 2019-03-19 | 2022-09-20 | The Broad Institute, Inc. | Methods and compositions for prime editing nucleotide sequences |
US11512311B2 (en) | 2016-03-25 | 2022-11-29 | Editas Medicine, Inc. | Systems and methods for treating alpha 1-antitrypsin (A1AT) deficiency |
US11542509B2 (en) | 2016-08-24 | 2023-01-03 | President And Fellows Of Harvard College | Incorporation of unnatural amino acids into proteins using base editing |
US11542496B2 (en) | 2017-03-10 | 2023-01-03 | President And Fellows Of Harvard College | Cytosine to guanine base editor |
US11560566B2 (en) | 2017-05-12 | 2023-01-24 | President And Fellows Of Harvard College | Aptazyme-embedded guide RNAs for use with CRISPR-Cas9 in genome editing and transcriptional activation |
US11566263B2 (en) | 2016-08-02 | 2023-01-31 | Editas Medicine, Inc. | Compositions and methods for treating CEP290 associated disease |
US11572574B2 (en) | 2017-09-28 | 2023-02-07 | Toolgen Incorporated | Artificial genome manipulation for gene expression regulation |
US11661590B2 (en) | 2016-08-09 | 2023-05-30 | President And Fellows Of Harvard College | Programmable CAS9-recombinase fusion proteins and uses thereof |
US11732274B2 (en) | 2017-07-28 | 2023-08-22 | President And Fellows Of Harvard College | Methods and compositions for evolving base editors using phage-assisted continuous evolution (PACE) |
US11795443B2 (en) | 2017-10-16 | 2023-10-24 | The Broad Institute, Inc. | Uses of adenosine base editors |
US11851690B2 (en) | 2017-03-14 | 2023-12-26 | Editas Medicine, Inc. | Systems and methods for the treatment of hemoglobinopathies |
US11898179B2 (en) | 2017-03-09 | 2024-02-13 | President And Fellows Of Harvard College | Suppression of pain by gene editing |
US11912985B2 (en) | 2020-05-08 | 2024-02-27 | The Broad Institute, Inc. | Methods and compositions for simultaneous editing of both strands of a target double-stranded nucleotide sequence |
US12031132B2 (en) | 2018-03-14 | 2024-07-09 | Editas Medicine, Inc. | Systems and methods for the treatment of hemoglobinopathies |
US12084663B2 (en) | 2022-11-14 | 2024-09-10 | President And Fellows Of Harvard College | Incorporation of unnatural amino acids into proteins using base editing |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11380420B1 (en) * | 2018-05-07 | 2022-07-05 | X Development Llc | Data structure, compilation service, and graphical user interface for rapid simulation generation |
EP3794130A4 (en) | 2018-05-16 | 2022-07-27 | Synthego Corporation | Methods and systems for guide rna design and use |
US10839937B1 (en) * | 2018-07-19 | 2020-11-17 | X Development Llc | Whole cell circular delta viewer and navigator |
US20220220468A1 (en) * | 2019-05-17 | 2022-07-14 | Beth Israel Deaconess Medical Center, Inc. | Compositions and methods for homology directed repair |
US20220340969A1 (en) * | 2019-09-04 | 2022-10-27 | Edigene Inc. | Method for evaluating gene editing therapy based on off-target assessment |
CN110777159B (en) * | 2019-12-03 | 2023-05-05 | 苏州缔因安生物科技有限公司 | CRISPR-Cas 9-based nucleic acid detection system and application thereof |
CN111172164B (en) * | 2020-03-09 | 2022-11-15 | 中国药科大学 | Composition and method for any nucleic acid editing |
CN111781343B (en) * | 2020-07-17 | 2023-03-21 | 北京林业大学 | Method for detecting influence of DNA methylation modification on combination of transcription factor and DNA sequence |
JP2023540285A (en) * | 2020-08-31 | 2023-09-22 | エタロン ダイアグノスティックス | Systems and methods for producing or identifying non-human animals with predetermined phenotypes or genotypes |
CN112233730B (en) * | 2020-10-16 | 2023-11-28 | 南京大学 | Construction method for distinguishing effect model of PBDEs derivative on activity of enoyl-ACP reductase |
CN112662674B (en) * | 2021-01-12 | 2023-04-11 | 广州瑞风生物科技有限公司 | gRNA for targeted editing of VEGFA gene exon region and application thereof |
CN118556126A (en) * | 2021-11-17 | 2024-08-27 | 康奈尔大学 | Modulation of CRISPR disorders |
CN114231596B (en) * | 2021-12-12 | 2023-11-21 | 徐州医科大学 | Gene detection method based on CRISPR/dcas9 and magnetic nano material and application thereof |
WO2023225349A2 (en) * | 2022-05-20 | 2023-11-23 | Helex Inc. | Tissue specific methods and compositions for gene editing |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1993024640A2 (en) | 1992-06-04 | 1993-12-09 | The Regents Of The University Of California | Methods and compositions for in vivo gene therapy |
WO1994016737A1 (en) | 1993-01-26 | 1994-08-04 | Weiner David B | Compositions and methods for delivery of genetic material |
US5593972A (en) | 1993-01-26 | 1997-01-14 | The Wistar Institute | Genetic immunization |
US5962428A (en) | 1995-03-30 | 1999-10-05 | Apollon, Inc. | Compositions and methods for delivery of genetic material |
US20040175727A1 (en) | 2002-11-04 | 2004-09-09 | Advisys, Inc. | Synthetic muscle promoters with activities exceeding naturally occurring regulatory sequences in cardiac cells |
WO2014018423A2 (en) | 2012-07-25 | 2014-01-30 | The Broad Institute, Inc. | Inducible dna binding proteins and genome perturbation tools and applications thereof |
WO2014093709A1 (en) | 2012-12-12 | 2014-06-19 | The Broad Institute, Inc. | Methods, models, systems, and apparatus for identifying target sequences for cas enzymes or crispr-cas systems for target sequences and conveying results thereof |
WO2014204728A1 (en) | 2013-06-17 | 2014-12-24 | The Broad Institute Inc. | Delivery, engineering and optimization of systems, methods and compositions for targeting and modeling diseases and disorders of post mitotic cells |
WO2015048690A1 (en) | 2013-09-27 | 2015-04-02 | The Regents Of The University Of California | Optimized small guide rnas and methods of use |
Family Cites Families (112)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5139941A (en) | 1985-10-31 | 1992-08-18 | University Of Florida Research Foundation, Inc. | AAV transduction vectors |
US5436146A (en) | 1989-09-07 | 1995-07-25 | The Trustees Of Princeton University | Helper-free stocks of recombinant adeno-associated virus vectors |
GB9206016D0 (en) | 1992-03-19 | 1992-04-29 | Sandoz Ltd | Improvements in or relating to organic compounds |
US6268213B1 (en) | 1992-06-03 | 2001-07-31 | Richard Jude Samulski | Adeno-associated virus vector and cis-acting regulatory and promoter elements capable of expressing at least one gene and method of using same for gene therapy |
US5869305A (en) | 1992-12-04 | 1999-02-09 | The University Of Pittsburgh | Recombinant viral vector system |
US5658784A (en) | 1994-04-14 | 1997-08-19 | Dana-Farber Cancer Institute, Inc. | Nucleic acid encoding transcription factor p300 and uses of p300 |
US6204059B1 (en) | 1994-06-30 | 2001-03-20 | University Of Pittsburgh | AAV capsid vehicles for molecular transfer |
US6093570A (en) | 1995-06-07 | 2000-07-25 | The University Of North Carolina At Chapel Hill | Helper virus-free AAV production |
US5741683A (en) | 1995-06-07 | 1998-04-21 | The Research Foundation Of State University Of New York | In vitro packaging of adeno-associated virus DNA |
EP1007637B1 (en) | 1997-04-14 | 2004-06-30 | Cell Genesys, Inc. | Methods for increasing the efficiency of recombinant aav product |
WO1999061643A1 (en) | 1998-05-27 | 1999-12-02 | University Of Florida | Method of preparing recombinant adeno-associated virus compositions by using an iodixananol gradient |
JP2002538770A (en) | 1998-11-10 | 2002-11-19 | ユニバーシティ オブ ノース カロライナ アット チャペル ヒル | Viral vectors and methods for their production and administration |
ES2478635T3 (en) | 1999-08-09 | 2014-07-22 | Targeted Genetics Corporation | Increased expression of a single stranded heterologous nucleotide sequence of recombinant viral vectors by designing the sequence so that it forms intracatenary base pairs |
WO2001083783A2 (en) | 2000-04-28 | 2001-11-08 | Genzyme Corporation | In vivo loading of mhc |
AU2001268149B2 (en) | 2000-06-01 | 2005-08-18 | University Of North Carolina At Chapel Hill | Methods and compounds for controlled release of recombinant parvovirus vectors |
ATE520707T1 (en) | 2001-11-13 | 2011-09-15 | Univ Pennsylvania | METHOD FOR THE DETECTION AND/OR IDENTIFICATION OF ADENO-ASSOCIATED VIRUS (AAV) SEQUENCES AND ISOLATION OF NEW SEQUENCES IDENTIFIED THEREFROM |
AU2002360291A1 (en) | 2001-12-17 | 2003-06-30 | The Trustees Of The University Of Pennsylvania | Adeno-associated virus (aav) serotype 8 sequences |
WO2003104413A2 (en) | 2002-06-05 | 2003-12-18 | University Of Florida | Production of pseudotyped recombinant aav virions |
PL3211085T3 (en) | 2003-09-30 | 2021-11-15 | The Trustees Of The University Of Pennsylvania | Adeno-associated virus (aav) clades, sequences, vectors containing same, and uses therefor |
DE602005026386D1 (en) | 2004-06-28 | 2011-03-31 | Univ Western Australia | ANTISENSE OLIGONUCLEOTIDES FOR THE INDUCTION OF EXON-SKIPPING AND METHOD OF USE THEREOF |
EP4282956A3 (en) | 2005-04-07 | 2024-03-13 | The Trustees of the University of Pennsylvania | Method of increasing the function of an aav vector |
WO2006132118A1 (en) | 2005-06-09 | 2006-12-14 | Matsushita Electric Industrial Co., Ltd. | Amplitude error compensating apparatus and orthogonality error compensating apparatus |
US7588772B2 (en) | 2006-03-30 | 2009-09-15 | Board Of Trustees Of The Leland Stamford Junior University | AAV capsid library and AAV capsid proteins |
US8685687B2 (en) | 2006-07-05 | 2014-04-01 | The Scripps Research Institute | Chimeric zinc finger recombinases optimized for catalysis by directed evolution |
NZ584793A (en) | 2007-10-26 | 2012-05-25 | Academisch Ziekenhuis Leiden | Means and methods for counteracting muscle disorders |
CN102083878B (en) | 2008-05-13 | 2013-07-10 | 华盛顿大学 | Diblock copolymers and polynucleotide complexes thereof for delivery into cells |
ES2560281T3 (en) | 2008-10-17 | 2016-02-18 | Joule Unlimited Technologies, Inc. | Ethanol production by microorganisms |
CA2742880A1 (en) | 2008-11-06 | 2010-05-14 | University Of Washington | Multiblock copolymers |
AU2009311667B2 (en) | 2008-11-07 | 2016-04-14 | Massachusetts Institute Of Technology | Aminoalcohol lipidoids and uses thereof |
EP2206723A1 (en) | 2009-01-12 | 2010-07-14 | Bonas, Ulla | Modular DNA-binding domains |
EP2281579A1 (en) | 2009-08-05 | 2011-02-09 | BioNTech AG | Vaccine composition comprising 5'-Cap modified RNA |
US8586526B2 (en) | 2010-05-17 | 2013-11-19 | Sangamo Biosciences, Inc. | DNA-binding proteins and uses thereof |
EP2480659A2 (en) | 2009-09-24 | 2012-08-01 | Cellectis | Meganuclease reagents of uses thereof for treating genetic diseases caused by frame shift/non sense mutations |
US20110197290A1 (en) | 2010-02-11 | 2011-08-11 | Fahrenkrug Scott C | Methods and materials for producing transgenic artiodactyls |
CA2793633A1 (en) | 2010-03-29 | 2011-10-13 | The Trustees Of The University Of Pennsylvania | Pharmacologically induced transgene ablation system |
US9315825B2 (en) | 2010-03-29 | 2016-04-19 | The Trustees Of The University Of Pennsylvania | Pharmacologically induced transgene ablation system |
CA2799095A1 (en) | 2010-05-12 | 2011-11-17 | Cellectis | Meganuclease variants cleaving a dna target sequence from the dystrophin gene and uses thereof |
IT1400425B1 (en) | 2010-06-08 | 2013-05-31 | Amsterdam Molecular Therapeutics Bv | MODIFIED SNRNAS FOR USE IN THERAPY. |
CN103534417B (en) | 2011-05-12 | 2016-03-23 | 日立建机株式会社 | Engineering machinery |
EP2717893B1 (en) | 2011-06-08 | 2019-05-08 | Translate Bio, Inc. | Lipid nanoparticle compositions and methods for mrna delivery |
US9540623B2 (en) | 2011-07-08 | 2017-01-10 | Cellectis | Method for increasing the efficiency of double-strand-break induced mutagenesis |
JP2015509922A (en) | 2012-01-27 | 2015-04-02 | プロセンサ テクノロジーズ ビー.ブイ.Prosensa Technologies B.V. | RNA regulatory oligonucleotides with improved characteristics for the treatment of Duchenne and Becker muscular dystrophy |
WO2013143555A1 (en) | 2012-03-26 | 2013-10-03 | Biontech Ag | Rna formulation for immunotherapy |
EP2834357B1 (en) | 2012-04-04 | 2017-12-27 | Life Technologies Corporation | Tal-effector assembly platform, customized services, kits and assays |
WO2013163628A2 (en) | 2012-04-27 | 2013-10-31 | Duke University | Genetic correction of mutated genes |
US9738879B2 (en) | 2012-04-27 | 2017-08-22 | Duke University | Genetic correction of mutated genes |
TR201806812T4 (en) * | 2012-05-25 | 2018-06-21 | Charpentier Emmanuelle | Methods and compositions for RNA-directed target DNA modification and for RNA-directed transcription modification. |
EP3800254A1 (en) | 2012-06-08 | 2021-04-07 | Ethris GmbH | Pulmonary delivery of messenger rna |
EP4397759A2 (en) * | 2012-10-23 | 2024-07-10 | Toolgen Incorporated | Composition for cleaving a target dna comprising a guide rna specific for the target dna and cas protein-encoding nucleic acid or cas protein, and use thereof |
CA3150985A1 (en) | 2012-11-01 | 2014-05-08 | Factor Bioscience Inc. | Methods and products for expressing proteins in cells |
US20140140969A1 (en) | 2012-11-20 | 2014-05-22 | Sangamo Biosciences, Inc. | Methods and compositions for muscular dystrophies |
KR102243092B1 (en) | 2012-12-06 | 2021-04-22 | 시그마-알드리치 컴퍼니., 엘엘씨 | Crispr-based genome modification and regulation |
US20140189896A1 (en) | 2012-12-12 | 2014-07-03 | Feng Zhang | Crispr-cas component systems, methods and compositions for sequence manipulation |
WO2014093701A1 (en) | 2012-12-12 | 2014-06-19 | The Broad Institute, Inc. | Functional genomics using crispr-cas systems, compositions, methods, knock out libraries and applications thereof |
WO2014093694A1 (en) | 2012-12-12 | 2014-06-19 | The Broad Institute, Inc. | Crispr-cas nickase systems, methods and compositions for sequence manipulation in eukaryotes |
KR20150105956A (en) * | 2012-12-12 | 2015-09-18 | 더 브로드 인스티튜트, 인코퍼레이티드 | Delivery, engineering and optimization of systems, methods and compositions for sequence manipulation and therapeutic applications |
US8697359B1 (en) | 2012-12-12 | 2014-04-15 | The Broad Institute, Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
US20140242664A1 (en) * | 2012-12-12 | 2014-08-28 | The Broad Institute, Inc. | Engineering of systems, methods and optimized guide compositions for sequence manipulation |
DK2931898T3 (en) | 2012-12-12 | 2016-06-20 | Massachusetts Inst Technology | CONSTRUCTION AND OPTIMIZATION OF SYSTEMS, PROCEDURES AND COMPOSITIONS FOR SEQUENCE MANIPULATION WITH FUNCTIONAL DOMAINS |
WO2014144592A2 (en) | 2013-03-15 | 2014-09-18 | The General Hospital Corporation | Using truncated guide rnas (tru-grnas) to increase specificity for rna-guided genome editing |
US10760064B2 (en) | 2013-03-15 | 2020-09-01 | The General Hospital Corporation | RNA-guided targeting of genetic and epigenomic regulatory proteins to specific genomic loci |
CA3159715A1 (en) | 2013-03-15 | 2014-09-18 | Edward J. Britt | Energy conversion device and method for making and using same |
US9828582B2 (en) | 2013-03-19 | 2017-11-28 | Duke University | Compositions and methods for the induction and tuning of gene expression |
US20160186208A1 (en) | 2013-04-16 | 2016-06-30 | Whitehead Institute For Biomedical Research | Methods of Mutating, Modifying or Modulating Nucleic Acid in a Cell or Nonhuman Mammal |
CA2910489A1 (en) | 2013-05-15 | 2014-11-20 | Sangamo Biosciences, Inc. | Methods and compositions for treatment of a genetic condition |
EP3309248B1 (en) | 2013-05-29 | 2021-06-09 | Cellectis | Methods for engineering t cells for immunotherapy by using rna-guided cas nuclease system |
US9267135B2 (en) | 2013-06-04 | 2016-02-23 | President And Fellows Of Harvard College | RNA-guided transcriptional regulation |
CN114230675A (en) | 2013-06-05 | 2022-03-25 | 杜克大学 | RNA-guided gene editing and gene regulation |
BR112015031608A2 (en) * | 2013-06-17 | 2017-08-22 | Massachusetts Inst Technology | APPLICATION AND USE OF CRISPR-CAS SYSTEMS, VECTORS AND COMPOSITIONS FOR LIVER TARGETING AND THERAPY |
US10011850B2 (en) | 2013-06-21 | 2018-07-03 | The General Hospital Corporation | Using RNA-guided FokI Nucleases (RFNs) to increase specificity for RNA-Guided Genome Editing |
US10421957B2 (en) | 2013-07-29 | 2019-09-24 | Agilent Technologies, Inc. | DNA assembly using an RNA-programmable nickase |
EP3027221A1 (en) | 2013-07-30 | 2016-06-08 | PhaseRx, Inc. | Block copolymers and their conjugates or complexes with oligonucleotides |
US9340799B2 (en) * | 2013-09-06 | 2016-05-17 | President And Fellows Of Harvard College | MRNA-sensing switchable gRNAs |
DK3066201T3 (en) | 2013-11-07 | 2018-06-06 | Editas Medicine Inc | CRISPR-RELATED PROCEDURES AND COMPOSITIONS WITH LEADING GRADES |
KR20160089530A (en) | 2013-12-12 | 2016-07-27 | 더 브로드 인스티튜트, 인코퍼레이티드 | Delivery, use and therapeutic applications of the crispr-cas systems and compositions for hbv and viral diseases and disorders |
CA2932478A1 (en) * | 2013-12-12 | 2015-06-18 | Massachusetts Institute Of Technology | Delivery, use and therapeutic applications of the crispr-cas systems and compositions for genome editing |
KR20160089526A (en) * | 2013-12-12 | 2016-07-27 | 더 브로드 인스티튜트, 인코퍼레이티드 | Delivery, use and therapeutic applications of the crispr-cas systems and compositions for targeting disorders and diseases using particle delivery components |
EP3080271B1 (en) | 2013-12-12 | 2020-02-12 | The Broad Institute, Inc. | Systems, methods and compositions for sequence manipulation with optimized functional crispr-cas systems |
US9840699B2 (en) | 2013-12-12 | 2017-12-12 | President And Fellows Of Harvard College | Methods for nucleic acid editing |
BR112016019068A2 (en) | 2014-02-18 | 2017-10-10 | Univ Duke | construct, recombinant vector, pharmaceutical composition, method of inhibiting viral replication or expression of a target sequence in a cell infected with a virus, recombinant sau cas9 polypeptide, recombinant sau cas9 construct, recombinant construct for expression of an individual guide and kit |
WO2016022866A1 (en) | 2014-08-07 | 2016-02-11 | Agilent Technologies, Inc. | Cis-blocked guide rna |
WO2016094880A1 (en) | 2014-12-12 | 2016-06-16 | The Broad Institute Inc. | Delivery, use and therapeutic applications of crispr systems and compositions for genome editing as to hematopoietic stem cells (hscs) |
EP3889260A1 (en) | 2014-12-12 | 2021-10-06 | The Broad Institute, Inc. | Protected guide rnas (pgrnas) |
US10190106B2 (en) | 2014-12-22 | 2019-01-29 | Univesity Of Massachusetts | Cas9-DNA targeting unit chimeras |
US20180002706A1 (en) | 2014-12-30 | 2018-01-04 | University Of South Florida | Methods and compositions for cloning into large vectors |
US10676726B2 (en) | 2015-02-09 | 2020-06-09 | Duke University | Compositions and methods for epigenome editing |
EP3265559B1 (en) | 2015-03-03 | 2021-01-06 | The General Hospital Corporation | Engineered crispr-cas9 nucleases with altered pam specificity |
WO2017015637A1 (en) | 2015-07-22 | 2017-01-26 | Duke University | High-throughput screening of regulatory element function with epigenome editing technologies |
EP3362571A4 (en) | 2015-10-13 | 2019-07-10 | Duke University | Genome engineering with type i crispr systems in eukaryotic cells |
AU2016362282B2 (en) | 2015-11-30 | 2023-03-16 | Duke University | Therapeutic targets for the correction of the human dystrophin gene by gene editing and methods of use |
WO2017180915A2 (en) | 2016-04-13 | 2017-10-19 | Duke University | Crispr/cas9-based repressors for silencing gene targets in vivo and methods of use |
US20190134221A1 (en) | 2016-05-05 | 2019-05-09 | Duke University | Crispr/cas-related methods and compositions for treating duchenne muscular dystrophy |
EP3487523B1 (en) | 2016-07-19 | 2023-09-06 | Duke University | Therapeutic applications of cpf1-based genome editing |
WO2018031762A1 (en) | 2016-08-10 | 2018-02-15 | Duke University | Compositions, systems and methods for programming immune cell function through targeted gene regulation |
AU2017313912B2 (en) | 2016-08-19 | 2024-01-04 | Whitehead Institute For Biomedical Research | Methods of editing DNA methylation |
WO2018148256A1 (en) | 2017-02-07 | 2018-08-16 | The Regents Of The University Of California | Gene therapy for haploinsufficiency |
AU2018390957A1 (en) | 2017-12-22 | 2020-07-02 | KWS SAAT SE & Co. KGaA | Targeted transcriptional regulation using synthetic transcription factors |
US20200347105A1 (en) | 2018-01-19 | 2020-11-05 | Duke University | Genome engineering with crispr-cas systems in eukaryotes |
SG11202103314QA (en) | 2018-10-02 | 2021-04-29 | Sangamo Therapeutics Inc | Engineered genetic modulators |
JP2022526669A (en) | 2019-04-12 | 2022-05-25 | デューク ユニバーシティ | A CRISPR / Cas-based base editing composition for repairing dystrophin function |
AR118668A1 (en) | 2019-04-14 | 2021-10-20 | Univ Duke | COMPOSITIONS TO EDIT GENOME BASED ON CRISPR / CAS TO RESTORE DISTROPHIN FUNCTION |
US20220184229A1 (en) | 2019-04-14 | 2022-06-16 | Duke University | Aav vector-mediated deletion of large mutational hotspot for treatment of duchenne muscular dystrophy |
US20220396790A1 (en) | 2019-08-08 | 2022-12-15 | Duke University | High-throughput screening platform for engineering next-generation gene therapy vectors |
US20220307015A1 (en) | 2019-08-19 | 2022-09-29 | Duke University | Compositions and methods for identifying regulators of cell type fate specification |
WO2021034984A2 (en) | 2019-08-19 | 2021-02-25 | Duke University | Skeletal myoblast progenitor cell lineage specification by crispr/cas9-based transcriptional activators |
US20220364124A1 (en) | 2019-10-02 | 2022-11-17 | Duke University | Epigenetic modulation of genomic targets to control expression of pws-associated genes |
AR122347A1 (en) | 2019-12-03 | 2022-09-07 | Univ Duke | SYSTEMS AND METHODS FOR THE DISTRIBUTION OF GENE EDITING MACHINERY BY LIPID NANOPARTICLES |
EP4126224A4 (en) | 2020-04-27 | 2024-07-03 | Univ Duke | A high-throughput screening method to discover optimal grna pairs for crispr-mediated exon deletion |
WO2021222268A1 (en) | 2020-04-27 | 2021-11-04 | Duke University | Crispr/cas9 therapies for correcting duchenne muscular dystrophy by targeted genomic integration |
JP2023515709A (en) | 2020-04-27 | 2023-04-13 | デューク ユニバーシティ | Gene editing of satellite cells in vivo using AAV vectors encoding muscle-specific promoters |
WO2021222328A1 (en) | 2020-04-27 | 2021-11-04 | Duke University | Targeted genomic integration to restore neurofibromin coding sequence in neurofibromatosis type 1 (nf1) |
EP4127179A2 (en) | 2020-05-08 | 2023-02-08 | Duke University | Chromatin remodelers to enhance targeted gene activation |
-
2016
- 2016-08-25 CA CA2996001A patent/CA2996001A1/en active Pending
- 2016-08-25 CN CN201680061639.8A patent/CN108351350B/en active Active
- 2016-08-25 JP JP2018510820A patent/JP6905755B2/en active Active
- 2016-08-25 CN CN202210083840.3A patent/CN114634930A/en active Pending
- 2016-08-25 EP EP24154247.1A patent/EP4345454A3/en active Pending
- 2016-08-25 ES ES16840171T patent/ES2929110T3/en active Active
- 2016-08-25 IL IL288662A patent/IL288662B2/en unknown
- 2016-08-25 BR BR112018003625A patent/BR112018003625A2/en active Search and Examination
- 2016-08-25 US US15/754,861 patent/US11427817B2/en active Active
- 2016-08-25 MX MX2018002339A patent/MX2018002339A/en unknown
- 2016-08-25 WO PCT/US2016/048798 patent/WO2017035416A2/en active Application Filing
- 2016-08-25 KR KR1020187008282A patent/KR20180038558A/en active Application Filing
- 2016-08-25 EP EP16840171.9A patent/EP3341727B1/en active Active
- 2016-08-25 EA EA201890565A patent/EA201890565A1/en unknown
- 2016-08-25 EP EP24154252.1A patent/EP4345455A3/en active Pending
- 2016-08-25 AU AU2016311454A patent/AU2016311454C1/en active Active
- 2016-08-25 EP EP22181872.7A patent/EP4177346A3/en not_active Withdrawn
-
2018
- 2018-02-22 IL IL257677A patent/IL257677B/en unknown
-
2021
- 2021-06-21 JP JP2021102505A patent/JP2021166523A/en active Pending
-
2022
- 2022-08-02 US US17/879,506 patent/US20230047669A1/en active Pending
- 2022-08-26 US US17/897,033 patent/US20230304000A1/en active Pending
-
2023
- 2023-09-20 JP JP2023151951A patent/JP2023179511A/en active Pending
- 2023-10-05 AU AU2023241327A patent/AU2023241327A1/en active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1993024640A2 (en) | 1992-06-04 | 1993-12-09 | The Regents Of The University Of California | Methods and compositions for in vivo gene therapy |
WO1994016737A1 (en) | 1993-01-26 | 1994-08-04 | Weiner David B | Compositions and methods for delivery of genetic material |
US5593972A (en) | 1993-01-26 | 1997-01-14 | The Wistar Institute | Genetic immunization |
US5962428A (en) | 1995-03-30 | 1999-10-05 | Apollon, Inc. | Compositions and methods for delivery of genetic material |
US20040175727A1 (en) | 2002-11-04 | 2004-09-09 | Advisys, Inc. | Synthetic muscle promoters with activities exceeding naturally occurring regulatory sequences in cardiac cells |
WO2014018423A2 (en) | 2012-07-25 | 2014-01-30 | The Broad Institute, Inc. | Inducible dna binding proteins and genome perturbation tools and applications thereof |
WO2014093709A1 (en) | 2012-12-12 | 2014-06-19 | The Broad Institute, Inc. | Methods, models, systems, and apparatus for identifying target sequences for cas enzymes or crispr-cas systems for target sequences and conveying results thereof |
WO2014204728A1 (en) | 2013-06-17 | 2014-12-24 | The Broad Institute Inc. | Delivery, engineering and optimization of systems, methods and compositions for targeting and modeling diseases and disorders of post mitotic cells |
WO2015048690A1 (en) | 2013-09-27 | 2015-04-02 | The Regents Of The University Of California | Optimized small guide rnas and methods of use |
Non-Patent Citations (21)
Title |
---|
"GenBank", Database accession no. BC026642.1 |
ESVELT ET AL., NATURE METHODS, 2013 |
GILLESPIE, JOURNAL OF COMPUTATIONAL PHYSICS, vol. 22, 1976, pages 403 - 434 |
HOWARTH ET AL., NATURE METHODS, vol. 3, 2006, pages 267 - 273 |
HSU ET AL., NATURE BIOTECHNOLOGY, vol. 31, 2013, pages 827 - 832 |
HSU ET AL., NATURE BIOTECHNOLOGY, vol. 31, pages 827 - 832 |
KLEINSTIVER ET AL., NAT. BIOTECH., vol. 34, 2016, pages 869 - 874 |
KYTE ET AL., J. MOL. BIOL., vol. 157, 1982, pages 105 - 132 |
LAIORODRIGUEZ, SCIENCE, vol. 344, 2014, pages 1492 - 1496 |
PEREZ-PINERA ET AL., NATURE METHODS, vol. 10, 2013, pages 973 - 976 |
PIACENTINO ET AL., HUMAN GENE THERAPY, vol. 23, 2012, pages 635 - 646 |
PINELLO ET AL., NAT BIOTECHNOL, vol. 34, no. 7, 2016, pages 695 - 697 |
RAN ET AL., NATURE PROTOCOLS, vol. 8, 2013, pages 2281 - 2308 |
SAMBROOK, J.FRITSCH, E.F.MANIATIS, T.: "Molecular Cloning and Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS |
SANTALUCIA ET AL., BIOCHEMISTRY, vol. 35, 1996, pages 3555 - 3562 |
SETO ET AL., CURRENT GENE THERAPY, vol. 12, 2012, pages 139 - 151 |
STEMMER, PLOS ONE, vol. 10, 2015, pages e0124633 |
SUGIMOTO ET AL., BIOCHEMISTRY, vol. 34, 1995, pages 11211 - 11216 |
WATKINS ET AL., NUCLEIC ACIDS RESEARCH, vol. 39, 2011, pages 1894 - 1902 |
WIGGINS ET AL., NATURE NANOTECHNOLOGY, vol. 1, 2006, pages 137 - 141 |
YANG ET AL., NUCLEIC ACIDS RES, vol. 33, 2005, pages 4322 - 4334 |
Cited By (69)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10323236B2 (en) | 2011-07-22 | 2019-06-18 | President And Fellows Of Harvard College | Evaluation and improvement of nuclease cleavage specificity |
US12006520B2 (en) | 2011-07-22 | 2024-06-11 | President And Fellows Of Harvard College | Evaluation and improvement of nuclease cleavage specificity |
US20210040460A1 (en) | 2012-04-27 | 2021-02-11 | Duke University | Genetic correction of mutated genes |
US11976307B2 (en) | 2012-04-27 | 2024-05-07 | Duke University | Genetic correction of mutated genes |
US11920181B2 (en) | 2013-08-09 | 2024-03-05 | President And Fellows Of Harvard College | Nuclease profiling system |
US10954548B2 (en) | 2013-08-09 | 2021-03-23 | President And Fellows Of Harvard College | Nuclease profiling system |
US10508298B2 (en) | 2013-08-09 | 2019-12-17 | President And Fellows Of Harvard College | Methods for identifying a target site of a CAS9 nuclease |
US11046948B2 (en) | 2013-08-22 | 2021-06-29 | President And Fellows Of Harvard College | Engineered transcription activator-like effector (TALE) domains and uses thereof |
US11299755B2 (en) | 2013-09-06 | 2022-04-12 | President And Fellows Of Harvard College | Switchable CAS9 nucleases and uses thereof |
US9999671B2 (en) | 2013-09-06 | 2018-06-19 | President And Fellows Of Harvard College | Delivery of negatively charged proteins using cationic lipids |
US10912833B2 (en) | 2013-09-06 | 2021-02-09 | President And Fellows Of Harvard College | Delivery of negatively charged proteins using cationic lipids |
US10858639B2 (en) | 2013-09-06 | 2020-12-08 | President And Fellows Of Harvard College | CAS9 variants and uses thereof |
US10597679B2 (en) | 2013-09-06 | 2020-03-24 | President And Fellows Of Harvard College | Switchable Cas9 nucleases and uses thereof |
US10682410B2 (en) | 2013-09-06 | 2020-06-16 | President And Fellows Of Harvard College | Delivery system for functional nucleases |
US11053481B2 (en) | 2013-12-12 | 2021-07-06 | President And Fellows Of Harvard College | Fusions of Cas9 domains and nucleic acid-editing domains |
US10465176B2 (en) | 2013-12-12 | 2019-11-05 | President And Fellows Of Harvard College | Cas variants for gene editing |
US11124782B2 (en) | 2013-12-12 | 2021-09-21 | President And Fellows Of Harvard College | Cas variants for gene editing |
US11028388B2 (en) | 2014-03-05 | 2021-06-08 | Editas Medicine, Inc. | CRISPR/Cas-related methods and compositions for treating Usher syndrome and retinitis pigmentosa |
US11141493B2 (en) | 2014-03-10 | 2021-10-12 | Editas Medicine, Inc. | Compositions and methods for treating CEP290-associated disease |
US11339437B2 (en) | 2014-03-10 | 2022-05-24 | Editas Medicine, Inc. | Compositions and methods for treating CEP290-associated disease |
US11268086B2 (en) | 2014-03-10 | 2022-03-08 | Editas Medicine, Inc. | CRISPR/CAS-related methods and compositions for treating Leber's Congenital Amaurosis 10 (LCA10) |
US11242525B2 (en) | 2014-03-26 | 2022-02-08 | Editas Medicine, Inc. | CRISPR/CAS-related methods and compositions for treating sickle cell disease |
US11578343B2 (en) | 2014-07-30 | 2023-02-14 | President And Fellows Of Harvard College | CAS9 proteins including ligand-dependent inteins |
US10077453B2 (en) | 2014-07-30 | 2018-09-18 | President And Fellows Of Harvard College | CAS9 proteins including ligand-dependent inteins |
US10704062B2 (en) | 2014-07-30 | 2020-07-07 | President And Fellows Of Harvard College | CAS9 proteins including ligand-dependent inteins |
US11155796B2 (en) | 2015-02-09 | 2021-10-26 | Duke University | Compositions and methods for epigenome editing |
US11180793B2 (en) | 2015-04-24 | 2021-11-23 | Editas Medicine, Inc. | Evaluation of Cas9 molecule/guide RNA molecule complexes |
US11421251B2 (en) | 2015-10-13 | 2022-08-23 | Duke University | Genome engineering with type I CRISPR systems in eukaryotic cells |
US11214780B2 (en) | 2015-10-23 | 2022-01-04 | President And Fellows Of Harvard College | Nucleobase editors and uses thereof |
US12043852B2 (en) | 2015-10-23 | 2024-07-23 | President And Fellows Of Harvard College | Evolved Cas9 proteins for gene editing |
US10167457B2 (en) | 2015-10-23 | 2019-01-01 | President And Fellows Of Harvard College | Nucleobase editors and uses thereof |
US11512311B2 (en) | 2016-03-25 | 2022-11-29 | Editas Medicine, Inc. | Systems and methods for treating alpha 1-antitrypsin (A1AT) deficiency |
US11566263B2 (en) | 2016-08-02 | 2023-01-31 | Editas Medicine, Inc. | Compositions and methods for treating CEP290 associated disease |
US10947530B2 (en) | 2016-08-03 | 2021-03-16 | President And Fellows Of Harvard College | Adenosine nucleobase editors and uses thereof |
US11702651B2 (en) | 2016-08-03 | 2023-07-18 | President And Fellows Of Harvard College | Adenosine nucleobase editors and uses thereof |
US10113163B2 (en) | 2016-08-03 | 2018-10-30 | President And Fellows Of Harvard College | Adenosine nucleobase editors and uses thereof |
US11999947B2 (en) | 2016-08-03 | 2024-06-04 | President And Fellows Of Harvard College | Adenosine nucleobase editors and uses thereof |
US11661590B2 (en) | 2016-08-09 | 2023-05-30 | President And Fellows Of Harvard College | Programmable CAS9-recombinase fusion proteins and uses thereof |
US11542509B2 (en) | 2016-08-24 | 2023-01-03 | President And Fellows Of Harvard College | Incorporation of unnatural amino acids into proteins using base editing |
US11306324B2 (en) | 2016-10-14 | 2022-04-19 | President And Fellows Of Harvard College | AAV delivery of nucleobase editors |
CN110382695A (en) * | 2016-11-28 | 2019-10-25 | 得克萨斯州大学系统董事会 | Prevent muscular dystrophy by the gene editing of CRISPR/CPF1 mediation |
US11820969B2 (en) | 2016-12-23 | 2023-11-21 | President And Fellows Of Harvard College | Editing of CCR2 receptor gene to protect against HIV infection |
US10745677B2 (en) | 2016-12-23 | 2020-08-18 | President And Fellows Of Harvard College | Editing of CCR5 receptor gene to protect against HIV infection |
US11898179B2 (en) | 2017-03-09 | 2024-02-13 | President And Fellows Of Harvard College | Suppression of pain by gene editing |
US11542496B2 (en) | 2017-03-10 | 2023-01-03 | President And Fellows Of Harvard College | Cytosine to guanine base editor |
US11851690B2 (en) | 2017-03-14 | 2023-12-26 | Editas Medicine, Inc. | Systems and methods for the treatment of hemoglobinopathies |
US11268082B2 (en) | 2017-03-23 | 2022-03-08 | President And Fellows Of Harvard College | Nucleobase editors comprising nucleic acid programmable DNA binding proteins |
US11963982B2 (en) | 2017-05-10 | 2024-04-23 | Editas Medicine, Inc. | CRISPR/RNA-guided nuclease systems and methods |
WO2018209158A3 (en) * | 2017-05-10 | 2019-01-03 | Editas Medicine, Inc. | Crispr/rna-guided nuclease systems and methods |
US11560566B2 (en) | 2017-05-12 | 2023-01-24 | President And Fellows Of Harvard College | Aptazyme-embedded guide RNAs for use with CRISPR-Cas9 in genome editing and transcriptional activation |
US11732274B2 (en) | 2017-07-28 | 2023-08-22 | President And Fellows Of Harvard College | Methods and compositions for evolving base editors using phage-assisted continuous evolution (PACE) |
US11932884B2 (en) | 2017-08-30 | 2024-03-19 | President And Fellows Of Harvard College | High efficiency base editors comprising Gam |
US11319532B2 (en) | 2017-08-30 | 2022-05-03 | President And Fellows Of Harvard College | High efficiency base editors comprising Gam |
US11572574B2 (en) | 2017-09-28 | 2023-02-07 | Toolgen Incorporated | Artificial genome manipulation for gene expression regulation |
WO2019066490A3 (en) * | 2017-09-28 | 2019-06-27 | 주식회사 툴젠 | Artificial genome manipulation for gene expression regulation |
RU2767201C2 (en) * | 2017-09-28 | 2022-03-16 | Тулджен Инкорпорейтед | Artificial genome modification for gene expression regulation |
US11795443B2 (en) | 2017-10-16 | 2023-10-24 | The Broad Institute, Inc. | Uses of adenosine base editors |
US11760984B2 (en) | 2017-10-31 | 2023-09-19 | The Broad Institute, Inc. | CRISPR protein inhibitors |
WO2019089761A1 (en) * | 2017-10-31 | 2019-05-09 | The Broad Institute, Inc. | Crispr protein inhibitors |
WO2019157326A1 (en) | 2018-02-08 | 2019-08-15 | Zymergen Inc. | Genome editing using crispr in corynebacterium |
US12031132B2 (en) | 2018-03-14 | 2024-07-09 | Editas Medicine, Inc. | Systems and methods for the treatment of hemoglobinopathies |
US20220088083A1 (en) * | 2019-01-18 | 2022-03-24 | Osaka University | Composition for use in treating dystrophic epidermolysis bullosa |
US11643652B2 (en) | 2019-03-19 | 2023-05-09 | The Broad Institute, Inc. | Methods and compositions for prime editing nucleotide sequences |
US11447770B1 (en) | 2019-03-19 | 2022-09-20 | The Broad Institute, Inc. | Methods and compositions for prime editing nucleotide sequences |
US11795452B2 (en) | 2019-03-19 | 2023-10-24 | The Broad Institute, Inc. | Methods and compositions for prime editing nucleotide sequences |
WO2020210776A1 (en) * | 2019-04-12 | 2020-10-15 | Duke University | Crispr/cas-based base editing composition for restoring dystrophin function |
US11912985B2 (en) | 2020-05-08 | 2024-02-27 | The Broad Institute, Inc. | Methods and compositions for simultaneous editing of both strands of a target double-stranded nucleotide sequence |
US12031126B2 (en) | 2020-05-08 | 2024-07-09 | The Broad Institute, Inc. | Methods and compositions for simultaneous editing of both strands of a target double-stranded nucleotide sequence |
US12084663B2 (en) | 2022-11-14 | 2024-09-10 | President And Fellows Of Harvard College | Incorporation of unnatural amino acids into proteins using base editing |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230304000A1 (en) | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases | |
JP7094323B2 (en) | Optimization Function Systems, Methods and Compositions for Sequence Manipulation with CRISPR-Cas Systems | |
US20230287373A1 (en) | Engineering and optimization of systems, methods, enzymes and guide scaffolds of cas9 orthologs and variants for sequence manipulation | |
RU2766685C2 (en) | Rna-guided human genome engineering | |
JP2022527740A (en) | Editing Methods and Compositions for Editing Nucleotide Sequences | |
CA3026110A1 (en) | Novel crispr enzymes and systems | |
EP3325018A1 (en) | High-throughput screening of regulatory element function with epigenome editing technologies | |
Macarrón Palacios et al. | Revolutionizing in vivo therapy with CRISPR/Cas genome editing: breakthroughs, opportunities and challenges | |
KR20240132120A (en) | Compositions and methods of improving specificity in genomic engineering using rna-guided endonucleases | |
EA046214B1 (en) | COMPOSITIONS AND METHODS FOR IMPROVING SPECIFICITY IN GENOMIC ENGINEERING USING RNA-GUIDED ENDONUCLEASES |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16840171 Country of ref document: EP Kind code of ref document: A2 |
|
ENP | Entry into the national phase |
Ref document number: 2996001 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 257677 Country of ref document: IL |
|
WWE | Wipo information: entry into national phase |
Ref document number: MX/A/2018/002339 Country of ref document: MX |
|
ENP | Entry into the national phase |
Ref document number: 2018510820 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2016311454 Country of ref document: AU Date of ref document: 20160825 Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 201890565 Country of ref document: EA |
|
ENP | Entry into the national phase |
Ref document number: 20187008282 Country of ref document: KR Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2016840171 Country of ref document: EP |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112018003625 Country of ref document: BR |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16840171 Country of ref document: EP Kind code of ref document: A2 |
|
ENP | Entry into the national phase |
Ref document number: 112018003625 Country of ref document: BR Kind code of ref document: A2 Effective date: 20180223 |