WO2023215831A1 - Guide rna compositions for programmable gene insertion - Google Patents
Guide rna compositions for programmable gene insertion Download PDFInfo
- Publication number
- WO2023215831A1 WO2023215831A1 PCT/US2023/066609 US2023066609W WO2023215831A1 WO 2023215831 A1 WO2023215831 A1 WO 2023215831A1 US 2023066609 W US2023066609 W US 2023066609W WO 2023215831 A1 WO2023215831 A1 WO 2023215831A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- site
- atgrna
- sequence
- integration
- dna sequence
- Prior art date
Links
- 108020005004 Guide RNA Proteins 0.000 title claims abstract description 145
- 108090000623 proteins and genes Proteins 0.000 title claims description 281
- 239000000203 mixture Substances 0.000 title abstract description 64
- 238000003780 insertion Methods 0.000 title description 19
- 230000037431 insertion Effects 0.000 title description 19
- 230000010354 integration Effects 0.000 claims abstract description 327
- 102100034343 Integrase Human genes 0.000 claims abstract description 238
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 199
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 claims abstract description 190
- 230000027455 binding Effects 0.000 claims abstract description 95
- 125000006850 spacer group Chemical group 0.000 claims abstract description 30
- 210000004027 cell Anatomy 0.000 claims description 204
- 239000002773 nucleotide Substances 0.000 claims description 198
- 125000003729 nucleotide group Chemical group 0.000 claims description 198
- 102000040430 polynucleotide Human genes 0.000 claims description 135
- 108091033319 polynucleotide Proteins 0.000 claims description 135
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 135
- 239000002157 polynucleotide Substances 0.000 claims description 134
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 130
- 229920001184 polypeptide Polymers 0.000 claims description 127
- 230000000295 complement effect Effects 0.000 claims description 115
- 102000007260 Deoxyribonuclease I Human genes 0.000 claims description 99
- 238000000034 method Methods 0.000 claims description 97
- 108010008532 Deoxyribonuclease I Proteins 0.000 claims description 92
- 230000004568 DNA-binding Effects 0.000 claims description 77
- 102000053602 DNA Human genes 0.000 claims description 66
- 108020004414 DNA Proteins 0.000 claims description 66
- 102000004190 Enzymes Human genes 0.000 claims description 63
- 108090000790 Enzymes Proteins 0.000 claims description 63
- 108020004682 Single-Stranded DNA Proteins 0.000 claims description 41
- 230000002441 reversible effect Effects 0.000 claims description 21
- 210000004962 mammalian cell Anatomy 0.000 claims description 17
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 claims description 11
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 claims description 11
- 150000007523 nucleic acids Chemical group 0.000 abstract description 61
- 230000001413 cellular effect Effects 0.000 abstract description 17
- 230000003834 intracellular effect Effects 0.000 abstract description 3
- 108091033409 CRISPR Proteins 0.000 description 130
- 150000002632 lipids Chemical class 0.000 description 107
- 125000005647 linker group Chemical group 0.000 description 85
- 102000004169 proteins and genes Human genes 0.000 description 74
- 235000018102 proteins Nutrition 0.000 description 66
- 108010061833 Integrases Proteins 0.000 description 53
- 101710163270 Nuclease Proteins 0.000 description 52
- 239000013598 vector Substances 0.000 description 50
- 230000008685 targeting Effects 0.000 description 49
- 102000039446 nucleic acids Human genes 0.000 description 44
- 108020004707 nucleic acids Proteins 0.000 description 44
- 101000941029 Homo sapiens Endoplasmic reticulum junction formation protein lunapark Proteins 0.000 description 36
- 101000991410 Homo sapiens Nucleolar and spindle-associated protein 1 Proteins 0.000 description 36
- 102100030991 Nucleolar and spindle-associated protein 1 Human genes 0.000 description 36
- 239000002245 particle Substances 0.000 description 34
- 230000014509 gene expression Effects 0.000 description 31
- 230000000694 effects Effects 0.000 description 28
- 230000035772 mutation Effects 0.000 description 27
- 238000000926 separation method Methods 0.000 description 27
- 238000012384 transportation and delivery Methods 0.000 description 27
- 239000002105 nanoparticle Substances 0.000 description 26
- 238000010354 CRISPR gene editing Methods 0.000 description 24
- 108020004999 messenger RNA Proteins 0.000 description 24
- 241000701161 unidentified adenovirus Species 0.000 description 24
- 239000012634 fragment Substances 0.000 description 20
- -1 polymerase Proteins 0.000 description 19
- 102000018120 Recombinases Human genes 0.000 description 18
- 108010091086 Recombinases Proteins 0.000 description 18
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 18
- 238000009472 formulation Methods 0.000 description 18
- 238000010362 genome editing Methods 0.000 description 17
- 230000003993 interaction Effects 0.000 description 17
- 239000013612 plasmid Substances 0.000 description 17
- 229920001223 polyethylene glycol Polymers 0.000 description 16
- 238000010839 reverse transcription Methods 0.000 description 16
- 239000013603 viral vector Substances 0.000 description 16
- 241000196324 Embryophyta Species 0.000 description 15
- 150000001413 amino acids Chemical class 0.000 description 14
- 239000013604 expression vector Substances 0.000 description 14
- 230000004927 fusion Effects 0.000 description 14
- 229910052739 hydrogen Inorganic materials 0.000 description 14
- 239000001257 hydrogen Substances 0.000 description 14
- 238000001727 in vivo Methods 0.000 description 14
- 238000010348 incorporation Methods 0.000 description 14
- 239000002502 liposome Substances 0.000 description 14
- 230000001404 mediated effect Effects 0.000 description 14
- 238000004806 packaging method and process Methods 0.000 description 14
- 150000003904 phospholipids Chemical class 0.000 description 14
- 102100031437 Cell cycle checkpoint protein RAD1 Human genes 0.000 description 13
- 108091027305 Heteroduplex Proteins 0.000 description 13
- 101001130384 Homo sapiens Cell cycle checkpoint protein RAD1 Proteins 0.000 description 13
- 235000001014 amino acid Nutrition 0.000 description 13
- 230000036961 partial effect Effects 0.000 description 13
- 239000000243 solution Substances 0.000 description 13
- 101000607560 Homo sapiens Ubiquitin-conjugating enzyme E2 variant 3 Proteins 0.000 description 12
- 102100039936 Ubiquitin-conjugating enzyme E2 variant 3 Human genes 0.000 description 12
- 210000001519 tissue Anatomy 0.000 description 12
- 230000003612 virological effect Effects 0.000 description 12
- 241000702421 Dependoparvovirus Species 0.000 description 11
- 125000002091 cationic group Chemical group 0.000 description 11
- 238000012217 deletion Methods 0.000 description 11
- 230000037430 deletion Effects 0.000 description 11
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 11
- 239000012636 effector Substances 0.000 description 11
- 230000033607 mismatch repair Effects 0.000 description 11
- 238000002156 mixing Methods 0.000 description 11
- 241001430294 unidentified retrovirus Species 0.000 description 11
- 241000713666 Lentivirus Species 0.000 description 10
- 241000713869 Moloney murine leukemia virus Species 0.000 description 10
- 230000000670 limiting effect Effects 0.000 description 10
- 102100033934 DNA repair protein RAD51 homolog 2 Human genes 0.000 description 9
- 101001132307 Homo sapiens DNA repair protein RAD51 homolog 2 Proteins 0.000 description 9
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 9
- 230000015572 biosynthetic process Effects 0.000 description 9
- 235000012000 cholesterol Nutrition 0.000 description 9
- 201000010099 disease Diseases 0.000 description 9
- 230000007246 mechanism Effects 0.000 description 9
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 9
- 230000006798 recombination Effects 0.000 description 9
- 238000005215 recombination Methods 0.000 description 9
- 230000002829 reductive effect Effects 0.000 description 9
- 230000010076 replication Effects 0.000 description 9
- 239000013607 AAV vector Substances 0.000 description 8
- 108700004991 Cas12a Proteins 0.000 description 8
- 102100022641 Coagulation factor IX Human genes 0.000 description 8
- 230000007018 DNA scission Effects 0.000 description 8
- 108010076282 Factor IX Proteins 0.000 description 8
- 101001003581 Homo sapiens Lamin-B1 Proteins 0.000 description 8
- 206010020365 Homocystinuria Diseases 0.000 description 8
- 208000010159 IgA glomerulonephritis Diseases 0.000 description 8
- 206010021263 IgA nephropathy Diseases 0.000 description 8
- 102100021244 Integral membrane protein GPR180 Human genes 0.000 description 8
- 102100026517 Lamin-B1 Human genes 0.000 description 8
- 208000000599 Ornithine Carbamoyltransferase Deficiency Disease Diseases 0.000 description 8
- 208000035903 Ornithine transcarbamylase deficiency Diseases 0.000 description 8
- 239000003795 chemical substances by application Substances 0.000 description 8
- 238000001514 detection method Methods 0.000 description 8
- 108020001507 fusion proteins Proteins 0.000 description 8
- 102000037865 fusion proteins Human genes 0.000 description 8
- 201000008350 membranous glomerulonephritis Diseases 0.000 description 8
- 201000011278 ornithine carbamoyltransferase deficiency Diseases 0.000 description 8
- 230000002085 persistent effect Effects 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- WGYKZJWCGVVSQN-UHFFFAOYSA-N propylamine Chemical compound CCCN WGYKZJWCGVVSQN-UHFFFAOYSA-N 0.000 description 8
- 238000010200 validation analysis Methods 0.000 description 8
- 108060002716 Exonuclease Proteins 0.000 description 7
- 206010018372 Glomerulonephritis membranous Diseases 0.000 description 7
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerol Natural products OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 7
- 102000012330 Integrases Human genes 0.000 description 7
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 7
- 241000700605 Viruses Species 0.000 description 7
- 238000013459 approach Methods 0.000 description 7
- 210000003719 b-lymphocyte Anatomy 0.000 description 7
- 230000003197 catalytic effect Effects 0.000 description 7
- 230000002950 deficient Effects 0.000 description 7
- 102000013165 exonuclease Human genes 0.000 description 7
- 229960004222 factor ix Drugs 0.000 description 7
- 238000001415 gene therapy Methods 0.000 description 7
- 230000002068 genetic effect Effects 0.000 description 7
- 231100000855 membranous nephropathy Toxicity 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 230000007935 neutral effect Effects 0.000 description 7
- 244000052769 pathogen Species 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 208000024891 symptom Diseases 0.000 description 7
- 238000001890 transfection Methods 0.000 description 7
- 108010042407 Endonucleases Proteins 0.000 description 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 6
- 101000860092 Francisella tularensis subsp. novicida (strain U112) CRISPR-associated endonuclease Cas12a Proteins 0.000 description 6
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 6
- 208000005777 Lupus Nephritis Diseases 0.000 description 6
- 102100028200 Ornithine transcarbamylase, mitochondrial Human genes 0.000 description 6
- 201000011252 Phenylketonuria Diseases 0.000 description 6
- 229930182558 Sterol Natural products 0.000 description 6
- 230000008901 benefit Effects 0.000 description 6
- 210000004369 blood Anatomy 0.000 description 6
- 239000008280 blood Substances 0.000 description 6
- 238000003776 cleavage reaction Methods 0.000 description 6
- MWRBNPKJOOWZPW-CLFAGFIQSA-N dioleoyl phosphatidylethanolamine Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC(COP(O)(=O)OCCN)OC(=O)CCCCCCC\C=C/CCCCCCCC MWRBNPKJOOWZPW-CLFAGFIQSA-N 0.000 description 6
- 238000011304 droplet digital PCR Methods 0.000 description 6
- 230000009977 dual effect Effects 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 210000003494 hepatocyte Anatomy 0.000 description 6
- 239000012528 membrane Substances 0.000 description 6
- 210000000822 natural killer cell Anatomy 0.000 description 6
- 125000002347 octyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 6
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 6
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 6
- 230000008439 repair process Effects 0.000 description 6
- 230000007017 scission Effects 0.000 description 6
- 150000003432 sterols Chemical class 0.000 description 6
- 235000003702 sterols Nutrition 0.000 description 6
- 201000000596 systemic lupus erythematosus Diseases 0.000 description 6
- 210000002700 urine Anatomy 0.000 description 6
- 108010079245 Cystic Fibrosis Transmembrane Conductance Regulator Proteins 0.000 description 5
- 201000003883 Cystic fibrosis Diseases 0.000 description 5
- 102100023419 Cystic fibrosis transmembrane conductance regulator Human genes 0.000 description 5
- 101000986595 Homo sapiens Ornithine transcarbamylase, mitochondrial Proteins 0.000 description 5
- 108010066154 Nuclear Export Signals Proteins 0.000 description 5
- 108700026244 Open Reading Frames Proteins 0.000 description 5
- 206010052450 Ornithine transcarbamoylase deficiency Diseases 0.000 description 5
- 108010069013 Phenylalanine Hydroxylase Proteins 0.000 description 5
- 102100038223 Phenylalanine-4-hydroxylase Human genes 0.000 description 5
- 239000002202 Polyethylene glycol Substances 0.000 description 5
- 241000191967 Staphylococcus aureus Species 0.000 description 5
- 206010047115 Vasculitis Diseases 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 239000007864 aqueous solution Substances 0.000 description 5
- 210000004899 c-terminal region Anatomy 0.000 description 5
- 230000015556 catabolic process Effects 0.000 description 5
- 210000000349 chromosome Anatomy 0.000 description 5
- 230000007812 deficiency Effects 0.000 description 5
- 238000006731 degradation reaction Methods 0.000 description 5
- 238000003113 dilution method Methods 0.000 description 5
- 150000004665 fatty acids Chemical class 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 238000000338 in vitro Methods 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 230000002401 inhibitory effect Effects 0.000 description 5
- 210000003734 kidney Anatomy 0.000 description 5
- 230000007115 recruitment Effects 0.000 description 5
- 230000001177 retroviral effect Effects 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 230000001225 therapeutic effect Effects 0.000 description 5
- 230000009261 transgenic effect Effects 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 4
- FVXDQWZBHIXIEJ-LNDKUQBDSA-N 1,2-di-[(9Z,12Z)-octadecadienoyl]-sn-glycero-3-phosphocholine Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCC\C=C/C\C=C/CCCCC FVXDQWZBHIXIEJ-LNDKUQBDSA-N 0.000 description 4
- 102100022712 Alpha-1-antitrypsin Human genes 0.000 description 4
- 208000016323 C3 glomerulonephritis Diseases 0.000 description 4
- 208000029574 C3 glomerulopathy Diseases 0.000 description 4
- 102000053642 Catalytic RNA Human genes 0.000 description 4
- 108090000994 Catalytic RNA Proteins 0.000 description 4
- 108010073644 Cystathionine beta-synthase Proteins 0.000 description 4
- 108010069091 Dystrophin Proteins 0.000 description 4
- 102100031780 Endonuclease Human genes 0.000 description 4
- 101000911390 Homo sapiens Coagulation factor VIII Proteins 0.000 description 4
- FFFHZYDWPBMWHY-VKHMYHEASA-N L-homocysteine Chemical compound OC(=O)[C@@H](N)CCS FFFHZYDWPBMWHY-VKHMYHEASA-N 0.000 description 4
- 206010028980 Neoplasm Diseases 0.000 description 4
- 101800001494 Protease 2A Proteins 0.000 description 4
- 101800001066 Protein 2A Proteins 0.000 description 4
- 102000008579 Transposases Human genes 0.000 description 4
- 108010020764 Transposases Proteins 0.000 description 4
- 125000000217 alkyl group Chemical group 0.000 description 4
- 239000013078 crystal Substances 0.000 description 4
- 238000010790 dilution Methods 0.000 description 4
- 239000012895 dilution Substances 0.000 description 4
- 239000013024 dilution buffer Substances 0.000 description 4
- 210000002950 fibroblast Anatomy 0.000 description 4
- 235000011187 glycerol Nutrition 0.000 description 4
- 210000004263 induced pluripotent stem cell Anatomy 0.000 description 4
- 238000002347 injection Methods 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- 238000001638 lipofection Methods 0.000 description 4
- 238000009126 molecular therapy Methods 0.000 description 4
- 210000000663 muscle cell Anatomy 0.000 description 4
- 210000002569 neuron Anatomy 0.000 description 4
- 208000027134 non-immunoglobulin-mediated membranoproliferative glomerulonephritis Diseases 0.000 description 4
- 230000030648 nucleus localization Effects 0.000 description 4
- WTJKGGKOPKCXLL-RRHRGVEJSA-N phosphatidylcholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCC=CCCCCCCCC WTJKGGKOPKCXLL-RRHRGVEJSA-N 0.000 description 4
- 239000013600 plasmid vector Substances 0.000 description 4
- LXNHXLLTXMVWPM-UHFFFAOYSA-N pyridoxine Chemical compound CC1=NC=C(CO)C(CO)=C1O LXNHXLLTXMVWPM-UHFFFAOYSA-N 0.000 description 4
- 230000002040 relaxant effect Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 108091092562 ribozyme Proteins 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 208000007056 sickle cell anemia Diseases 0.000 description 4
- 150000003431 steroids Chemical class 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 230000005030 transcription termination Effects 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- OILXMJHPFNGGTO-UHFFFAOYSA-N (22E)-(24xi)-24-methylcholesta-5,22-dien-3beta-ol Natural products C1C=C2CC(O)CCC2(C)C2C1C1CCC(C(C)C=CC(C)C(C)C)C1(C)CC2 OILXMJHPFNGGTO-UHFFFAOYSA-N 0.000 description 3
- LDGWQMRUWMSZIU-LQDDAWAPSA-M 2,3-bis[(z)-octadec-9-enoxy]propyl-trimethylazanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCCOCC(C[N+](C)(C)C)OCCCCCCCC\C=C/CCCCCCCC LDGWQMRUWMSZIU-LQDDAWAPSA-M 0.000 description 3
- HXVVOLDXHIMZJZ-UHFFFAOYSA-N 3-[2-[2-[2-[bis[3-(dodecylamino)-3-oxopropyl]amino]ethyl-[3-(dodecylamino)-3-oxopropyl]amino]ethylamino]ethyl-[3-(dodecylamino)-3-oxopropyl]amino]-n-dodecylpropanamide Chemical compound CCCCCCCCCCCCNC(=O)CCN(CCC(=O)NCCCCCCCCCCCC)CCN(CCC(=O)NCCCCCCCCCCCC)CCNCCN(CCC(=O)NCCCCCCCCCCCC)CCC(=O)NCCCCCCCCCCCC HXVVOLDXHIMZJZ-UHFFFAOYSA-N 0.000 description 3
- 241001655883 Adeno-associated virus - 1 Species 0.000 description 3
- 102100027211 Albumin Human genes 0.000 description 3
- 108010088751 Albumins Proteins 0.000 description 3
- 208000023275 Autoimmune disease Diseases 0.000 description 3
- 102100034976 Cystathionine beta-synthase Human genes 0.000 description 3
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 3
- 108700024394 Exon Proteins 0.000 description 3
- 229940123611 Genome editing Drugs 0.000 description 3
- 208000009292 Hemophilia A Diseases 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- 241000699666 Mus <mouse, genus> Species 0.000 description 3
- 229910019142 PO4 Inorganic materials 0.000 description 3
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 3
- 238000010459 TALEN Methods 0.000 description 3
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 3
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 3
- 230000002159 abnormal effect Effects 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 125000004429 atom Chemical group 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 239000007853 buffer solution Substances 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 108010041758 cleavase Proteins 0.000 description 3
- 230000004186 co-expression Effects 0.000 description 3
- 239000011248 coating agent Substances 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 230000001086 cytosolic effect Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 235000014113 dietary fatty acids Nutrition 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 210000002889 endothelial cell Anatomy 0.000 description 3
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 3
- 210000001808 exosome Anatomy 0.000 description 3
- 210000001508 eye Anatomy 0.000 description 3
- 239000000194 fatty acid Substances 0.000 description 3
- 229930195729 fatty acid Natural products 0.000 description 3
- 230000009368 gene silencing by RNA Effects 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 239000004615 ingredient Substances 0.000 description 3
- 230000017730 intein-mediated protein splicing Effects 0.000 description 3
- 210000004185 liver Anatomy 0.000 description 3
- 210000000066 myeloid cell Anatomy 0.000 description 3
- GLGLUQVVDHRLQK-WRBBJXAJSA-N n,n-dimethyl-2,3-bis[(z)-octadec-9-enoxy]propan-1-amine Chemical compound CCCCCCCC\C=C/CCCCCCCCOCC(CN(C)C)OCCCCCCCC\C=C/CCCCCCCC GLGLUQVVDHRLQK-WRBBJXAJSA-N 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 230000001717 pathogenic effect Effects 0.000 description 3
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 3
- 239000010452 phosphate Substances 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 230000016434 protein splicing Effects 0.000 description 3
- 230000012743 protein tagging Effects 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 210000000130 stem cell Anatomy 0.000 description 3
- 210000004881 tumor cell Anatomy 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- YWWVWXASSLXJHU-AATRIKPKSA-N (9E)-tetradecenoic acid Chemical compound CCCC\C=C\CCCCCCCC(O)=O YWWVWXASSLXJHU-AATRIKPKSA-N 0.000 description 2
- PORPENFLTBBHSG-MGBGTMOVSA-N 1,2-dihexadecanoyl-sn-glycerol-3-phosphate Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP(O)(O)=O)OC(=O)CCCCCCCCCCCCCCC PORPENFLTBBHSG-MGBGTMOVSA-N 0.000 description 2
- BUOBCSGIAFXNKP-KWXKLSQISA-N 1-[2,2-bis[(9z,12z)-octadeca-9,12-dienyl]-1,3-dioxolan-4-yl]-n,n-dimethylmethanamine Chemical compound CCCCC\C=C/C\C=C/CCCCCCCCC1(CCCCCCCC\C=C/C\C=C/CCCCC)OCC(CN(C)C)O1 BUOBCSGIAFXNKP-KWXKLSQISA-N 0.000 description 2
- NCYCYZXNIZJOKI-IOUUIBBYSA-N 11-cis-retinal Chemical compound O=C/C=C(\C)/C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C NCYCYZXNIZJOKI-IOUUIBBYSA-N 0.000 description 2
- KSXTUUUQYQYKCR-LQDDAWAPSA-M 2,3-bis[[(z)-octadec-9-enoyl]oxy]propyl-trimethylazanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCC(=O)OCC(C[N+](C)(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC KSXTUUUQYQYKCR-LQDDAWAPSA-M 0.000 description 2
- WALUVDCNGPQPOD-UHFFFAOYSA-M 2,3-di(tetradecoxy)propyl-(2-hydroxyethyl)-dimethylazanium;bromide Chemical compound [Br-].CCCCCCCCCCCCCCOCC(C[N+](C)(C)CCO)OCCCCCCCCCCCCCC WALUVDCNGPQPOD-UHFFFAOYSA-M 0.000 description 2
- LRFJOIPOPUJUMI-KWXKLSQISA-N 2-[2,2-bis[(9z,12z)-octadeca-9,12-dienyl]-1,3-dioxolan-4-yl]-n,n-dimethylethanamine Chemical compound CCCCC\C=C/C\C=C/CCCCCCCCC1(CCCCCCCC\C=C/C\C=C/CCCCC)OCC(CCN(C)C)O1 LRFJOIPOPUJUMI-KWXKLSQISA-N 0.000 description 2
- WRMNZCZEMHIOCP-UHFFFAOYSA-N 2-phenylethanol Chemical compound OCCC1=CC=CC=C1 WRMNZCZEMHIOCP-UHFFFAOYSA-N 0.000 description 2
- JGNSLMSDBLEHCK-UHFFFAOYSA-N 4-[2-(didodecylamino)ethyl]-n,n,1-tridodecylpiperazin-2-amine Chemical compound CCCCCCCCCCCCN(CCCCCCCCCCCC)CCN1CCN(CCCCCCCCCCCC)C(N(CCCCCCCCCCCC)CCCCCCCCCCCC)C1 JGNSLMSDBLEHCK-UHFFFAOYSA-N 0.000 description 2
- BBGNINPPDHJETF-UHFFFAOYSA-N 5-heptadecylresorcinol Chemical compound CCCCCCCCCCCCCCCCCC1=CC(O)=CC(O)=C1 BBGNINPPDHJETF-UHFFFAOYSA-N 0.000 description 2
- OQMZNAMGEHIHNN-UHFFFAOYSA-N 7-Dehydrostigmasterol Natural products C1C(O)CCC2(C)C(CCC3(C(C(C)C=CC(CC)C(C)C)CCC33)C)C3=CC=C21 OQMZNAMGEHIHNN-UHFFFAOYSA-N 0.000 description 2
- RSWGJHLUYNHPMX-UHFFFAOYSA-N Abietic-Saeure Natural products C12CCC(C(C)C)=CC2=CCC2C1(C)CCCC2(C)C(O)=O RSWGJHLUYNHPMX-UHFFFAOYSA-N 0.000 description 2
- 241000604451 Acidaminococcus Species 0.000 description 2
- 241000702423 Adeno-associated virus - 2 Species 0.000 description 2
- 241001634120 Adeno-associated virus - 5 Species 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- 208000027412 CDKL5-deficiency disease Diseases 0.000 description 2
- 102000000844 Cell Surface Receptors Human genes 0.000 description 2
- 108010001857 Cell Surface Receptors Proteins 0.000 description 2
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 2
- XULFJDKZVHTRLG-JDVCJPALSA-N DOSPA trifluoroacetate Chemical compound [O-]C(=O)C(F)(F)F.CCCCCCCC\C=C/CCCCCCCCOCC(C[N+](C)(C)CCNC(=O)C(CCCNCCCN)NCCCN)OCCCCCCCC\C=C/CCCCCCCC XULFJDKZVHTRLG-JDVCJPALSA-N 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 2
- 102000001039 Dystrophin Human genes 0.000 description 2
- 102000004533 Endonucleases Human genes 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- 108010054218 Factor VIII Proteins 0.000 description 2
- 102000001690 Factor VIII Human genes 0.000 description 2
- 201000003542 Factor VIII deficiency Diseases 0.000 description 2
- 108010010803 Gelatin Proteins 0.000 description 2
- 241000713813 Gibbon ape leukemia virus Species 0.000 description 2
- 108010078851 HIV Reverse Transcriptase Proteins 0.000 description 2
- 241000257232 Haematobia irritans Species 0.000 description 2
- 101000756632 Homo sapiens Actin, cytoplasmic 1 Proteins 0.000 description 2
- 101000823116 Homo sapiens Alpha-1-antitrypsin Proteins 0.000 description 2
- 101001047341 Homo sapiens Heat shock protein beta-8 Proteins 0.000 description 2
- 101001109620 Homo sapiens Nucleolar and coiled-body phosphoprotein 1 Proteins 0.000 description 2
- 101001000998 Homo sapiens Protein phosphatase 1 regulatory subunit 12C Proteins 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- 101710203526 Integrase Proteins 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 241001529936 Murinae Species 0.000 description 2
- 241000714177 Murine leukemia virus Species 0.000 description 2
- 108020004485 Nonsense Codon Proteins 0.000 description 2
- 102100022726 Nucleolar and coiled-body phosphoprotein 1 Human genes 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- ZTHYODDOHIVTJV-UHFFFAOYSA-N Propyl gallate Chemical compound CCCOC(=O)C1=CC(O)=C(O)C(O)=C1 ZTHYODDOHIVTJV-UHFFFAOYSA-N 0.000 description 2
- 102100035620 Protein phosphatase 1 regulatory subunit 12C Human genes 0.000 description 2
- 208000007014 Retinitis pigmentosa Diseases 0.000 description 2
- 102100040756 Rhodopsin Human genes 0.000 description 2
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 description 2
- KHPCPRHQVVSZAH-HUOMCSJISA-N Rosin Natural products O(C/C=C/c1ccccc1)[C@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 KHPCPRHQVVSZAH-HUOMCSJISA-N 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 240000003768 Solanum lycopersicum Species 0.000 description 2
- 241000193996 Streptococcus pyogenes Species 0.000 description 2
- RAHZWNYVWXNFOC-UHFFFAOYSA-N Sulphur dioxide Chemical compound O=S=O RAHZWNYVWXNFOC-UHFFFAOYSA-N 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- 241000723792 Tobacco etch virus Species 0.000 description 2
- 108700019146 Transgenes Proteins 0.000 description 2
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 2
- 108020005202 Viral DNA Proteins 0.000 description 2
- ATBOMIWRCZXYSZ-XZBBILGWSA-N [1-[2,3-dihydroxypropoxy(hydroxy)phosphoryl]oxy-3-hexadecanoyloxypropan-2-yl] (9e,12e)-octadeca-9,12-dienoate Chemical compound CCCCCCCCCCCCCCCC(=O)OCC(COP(O)(=O)OCC(O)CO)OC(=O)CCCCCCC\C=C\C\C=C\CCCCC ATBOMIWRCZXYSZ-XZBBILGWSA-N 0.000 description 2
- HCAJCMUKLZSPFT-KWXKLSQISA-N [3-(dimethylamino)-2-[(9z,12z)-octadeca-9,12-dienoyl]oxypropyl] (9z,12z)-octadeca-9,12-dienoate Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(=O)OCC(CN(C)C)OC(=O)CCCCCCC\C=C/C\C=C/CCCCC HCAJCMUKLZSPFT-KWXKLSQISA-N 0.000 description 2
- 210000001789 adipocyte Anatomy 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 2
- 150000001345 alkine derivatives Chemical class 0.000 description 2
- MBMBGCFOFBJSGT-KUBAVDMBSA-N all-cis-docosa-4,7,10,13,16,19-hexaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O MBMBGCFOFBJSGT-KUBAVDMBSA-N 0.000 description 2
- 108010050122 alpha 1-Antitrypsin Proteins 0.000 description 2
- 229940024142 alpha 1-antitrypsin Drugs 0.000 description 2
- AWUCVROLDVIAJX-UHFFFAOYSA-N alpha-glycerophosphate Natural products OCC(O)COP(O)(O)=O AWUCVROLDVIAJX-UHFFFAOYSA-N 0.000 description 2
- DTOSIQBPPRVQHS-PDBXOOCHSA-N alpha-linolenic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCC(O)=O DTOSIQBPPRVQHS-PDBXOOCHSA-N 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 125000000129 anionic group Chemical group 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 description 2
- LGJMUZUPVCAVPU-UHFFFAOYSA-N beta-Sitostanol Natural products C1CC2CC(O)CCC2(C)C2C1C1CCC(C(C)CCC(CC)C(C)C)C1(C)CC2 LGJMUZUPVCAVPU-UHFFFAOYSA-N 0.000 description 2
- 230000008238 biochemical pathway Effects 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 210000003995 blood forming stem cell Anatomy 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 210000004413 cardiac myocyte Anatomy 0.000 description 2
- 210000003169 central nervous system Anatomy 0.000 description 2
- 229940106189 ceramide Drugs 0.000 description 2
- OSASVXMJTNOKOY-UHFFFAOYSA-N chlorobutanol Chemical compound CC(C)(O)C(Cl)(Cl)Cl OSASVXMJTNOKOY-UHFFFAOYSA-N 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 238000011109 contamination Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000002716 delivery method Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 150000001982 diacylglycerols Chemical class 0.000 description 2
- 125000005265 dialkylamine group Chemical group 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 238000006471 dimerization reaction Methods 0.000 description 2
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- UKMSUNONTOPOIO-UHFFFAOYSA-N docosanoic acid Chemical compound CCCCCCCCCCCCCCCCCCCCCC(O)=O UKMSUNONTOPOIO-UHFFFAOYSA-N 0.000 description 2
- POULHZVOKOAJMA-UHFFFAOYSA-N dodecanoic acid Chemical compound CCCCCCCCCCCC(O)=O POULHZVOKOAJMA-UHFFFAOYSA-N 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 210000002919 epithelial cell Anatomy 0.000 description 2
- CBOQJANXLMLOSS-UHFFFAOYSA-N ethyl vanillin Chemical compound CCOC1=CC(C=O)=CC=C1O CBOQJANXLMLOSS-UHFFFAOYSA-N 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 229960000301 factor viii Drugs 0.000 description 2
- 239000008273 gelatin Substances 0.000 description 2
- 229920000159 gelatin Polymers 0.000 description 2
- 235000019322 gelatine Nutrition 0.000 description 2
- 235000011852 gelatine desserts Nutrition 0.000 description 2
- 229960004956 glycerylphosphorylcholine Drugs 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 2
- 208000009429 hemophilia B Diseases 0.000 description 2
- NRLNQCOGCKAESA-UHFFFAOYSA-N heptatriaconta-6,9,28,31-tetraen-19-yl 4-(dimethylamino)butanoate Chemical compound CCCCCC=CCC=CCCCCCCCCC(OC(=O)CCCN(C)C)CCCCCCCCC=CCC=CCCCCC NRLNQCOGCKAESA-UHFFFAOYSA-N 0.000 description 2
- IPCSVZSSVZVIGE-UHFFFAOYSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 2
- 102000046085 human HSPB8 Human genes 0.000 description 2
- 229920001477 hydrophilic polymer Polymers 0.000 description 2
- VKOBVWXKNCXXDE-UHFFFAOYSA-N icosanoic acid Chemical compound CCCCCCCCCCCCCCCCCCCC(O)=O VKOBVWXKNCXXDE-UHFFFAOYSA-N 0.000 description 2
- 229940099472 immunoglobulin a Drugs 0.000 description 2
- 238000007918 intramuscular administration Methods 0.000 description 2
- 208000017169 kidney disease Diseases 0.000 description 2
- 210000000265 leukocyte Anatomy 0.000 description 2
- 230000004777 loss-of-function mutation Effects 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 239000003471 mutagenic agent Substances 0.000 description 2
- XVUQPECVOGMPRU-ZPPAUJSGSA-N n,n-dimethyl-1,2-bis[(9z,12z)-octadeca-9,12-dienoxy]propan-1-amine Chemical compound CCCCC\C=C/C\C=C/CCCCCCCCOC(C)C(N(C)C)OCCCCCCCC\C=C/C\C=C/CCCCC XVUQPECVOGMPRU-ZPPAUJSGSA-N 0.000 description 2
- 229910052754 neon Inorganic materials 0.000 description 2
- GKAOGPIIYCISHV-UHFFFAOYSA-N neon atom Chemical compound [Ne] GKAOGPIIYCISHV-UHFFFAOYSA-N 0.000 description 2
- 238000006384 oligomerization reaction Methods 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 210000002220 organoid Anatomy 0.000 description 2
- SECPZKHBENQXJG-FPLPWBNLSA-N palmitoleic acid Chemical compound CCCCCC\C=C/CCCCCCCC(O)=O SECPZKHBENQXJG-FPLPWBNLSA-N 0.000 description 2
- 239000008194 pharmaceutical composition Substances 0.000 description 2
- 239000000546 pharmaceutical excipient Substances 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 239000002953 phosphate buffered saline Substances 0.000 description 2
- 150000008103 phosphatidic acids Chemical class 0.000 description 2
- 150000008105 phosphatidylcholines Chemical class 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 230000000379 polymerizing effect Effects 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 125000001436 propyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])[H] 0.000 description 2
- 230000000306 recurrent effect Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 230000011664 signaling Effects 0.000 description 2
- NLQLSVXGSXCXFE-UHFFFAOYSA-N sitosterol Natural products CC=C(/CCC(C)C1CC2C3=CCC4C(C)C(O)CCC4(C)C3CCC2(C)C1)C(C)C NLQLSVXGSXCXFE-UHFFFAOYSA-N 0.000 description 2
- 150000003408 sphingolipids Chemical class 0.000 description 2
- 238000007920 subcutaneous administration Methods 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 239000000375 suspending agent Substances 0.000 description 2
- 230000008961 swelling Effects 0.000 description 2
- KHPCPRHQVVSZAH-UHFFFAOYSA-N trans-cinnamyl beta-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OCC=CC1=CC=CC=C1 KHPCPRHQVVSZAH-UHFFFAOYSA-N 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000010361 transduction Methods 0.000 description 2
- 230000026683 transduction Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 230000017105 transposition Effects 0.000 description 2
- 229940011671 vitamin b6 Drugs 0.000 description 2
- KZJWDPNRJALLNS-VPUBHVLGSA-N (-)-beta-Sitosterol Natural products O[C@@H]1CC=2[C@@](C)([C@@H]3[C@H]([C@H]4[C@@](C)([C@H]([C@H](CC[C@@H](C(C)C)CC)C)CC4)CC3)CC=2)CC1 KZJWDPNRJALLNS-VPUBHVLGSA-N 0.000 description 1
- IHNKQIMGVNPMTC-UHFFFAOYSA-N (2-hydroxy-3-octadecanoyloxypropyl) 2-(trimethylazaniumyl)ethyl phosphate Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(O)COP([O-])(=O)OCC[N+](C)(C)C IHNKQIMGVNPMTC-UHFFFAOYSA-N 0.000 description 1
- XLKQWAMTMYIQMG-SVUPRYTISA-N (2-{[(2r)-2,3-bis[(4z,7z,10z,13z,16z,19z)-docosa-4,7,10,13,16,19-hexaenoyloxy]propyl phosphonato]oxy}ethyl)trimethylazanium Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CC XLKQWAMTMYIQMG-SVUPRYTISA-N 0.000 description 1
- CSVWWLUMXNHWSU-UHFFFAOYSA-N (22E)-(24xi)-24-ethyl-5alpha-cholest-22-en-3beta-ol Natural products C1CC2CC(O)CCC2(C)C2C1C1CCC(C(C)C=CC(CC)C(C)C)C1(C)CC2 CSVWWLUMXNHWSU-UHFFFAOYSA-N 0.000 description 1
- RQOCXCFLRBRBCS-UHFFFAOYSA-N (22E)-cholesta-5,7,22-trien-3beta-ol Natural products C1C(O)CCC2(C)C(CCC3(C(C(C)C=CCC(C)C)CCC33)C)C3=CC=C21 RQOCXCFLRBRBCS-UHFFFAOYSA-N 0.000 description 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- WCGUUGGRBIKTOS-GPOJBZKASA-N (3beta)-3-hydroxyurs-12-en-28-oic acid Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@@]5(C(O)=O)CC[C@@H](C)[C@H](C)[C@H]5C4=CC[C@@H]3[C@]21C WCGUUGGRBIKTOS-GPOJBZKASA-N 0.000 description 1
- YUFFSWGQGVEMMI-JLNKQSITSA-N (7Z,10Z,13Z,16Z,19Z)-docosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCCCC(O)=O YUFFSWGQGVEMMI-JLNKQSITSA-N 0.000 description 1
- OYHQOLUKZRVURQ-NTGFUMLPSA-N (9Z,12Z)-9,10,12,13-tetratritiooctadeca-9,12-dienoic acid Chemical compound C(CCCCCCC\C(=C(/C\C(=C(/CCCCC)\[3H])\[3H])\[3H])\[3H])(=O)O OYHQOLUKZRVURQ-NTGFUMLPSA-N 0.000 description 1
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 1
- LVNGJLRDBYCPGB-LDLOPFEMSA-N (R)-1,2-distearoylphosphatidylethanolamine Chemical compound CCCCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[NH3+])OC(=O)CCCCCCCCCCCCCCCCC LVNGJLRDBYCPGB-LDLOPFEMSA-N 0.000 description 1
- GVJHHUAWPYXKBD-IEOSBIPESA-N (R)-alpha-Tocopherol Natural products OC1=C(C)C(C)=C2O[C@@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-IEOSBIPESA-N 0.000 description 1
- SSCDRSKJTAQNNB-DWEQTYCFSA-N 1,2-di-(9Z,12Z-octadecadienoyl)-sn-glycero-3-phosphoethanolamine Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(=O)OC[C@H](COP(O)(=O)OCCN)OC(=O)CCCCCCC\C=C/C\C=C/CCCCC SSCDRSKJTAQNNB-DWEQTYCFSA-N 0.000 description 1
- LZLVZIFMYXDKCN-QJWFYWCHSA-N 1,2-di-O-arachidonoyl-sn-glycero-3-phosphocholine Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCC\C=C/C\C=C/C\C=C/C\C=C/CCCCC LZLVZIFMYXDKCN-QJWFYWCHSA-N 0.000 description 1
- XXKFQTJOJZELMD-JICBSJGISA-N 1,2-di-[(9Z,12Z,15Z)-octadecatrienoyl]-sn-glycero-3-phosphocholine Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCC\C=C/C\C=C/C\C=C/CC XXKFQTJOJZELMD-JICBSJGISA-N 0.000 description 1
- KILNVBDSWZSGLL-KXQOOQHDSA-N 1,2-dihexadecanoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCCCCCCCCC KILNVBDSWZSGLL-KXQOOQHDSA-N 0.000 description 1
- SLKDGVPOSSLUAI-PGUFJCEWSA-N 1,2-dihexadecanoyl-sn-glycero-3-phosphoethanolamine zwitterion Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP(O)(=O)OCCN)OC(=O)CCCCCCCCCCCCCCC SLKDGVPOSSLUAI-PGUFJCEWSA-N 0.000 description 1
- DSNRWDQKZIEDDB-SQYFZQSCSA-N 1,2-dioleoyl-sn-glycero-3-phospho-(1'-sn-glycerol) Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@H](COP(O)(=O)OC[C@@H](O)CO)OC(=O)CCCCCCC\C=C/CCCCCCCC DSNRWDQKZIEDDB-SQYFZQSCSA-N 0.000 description 1
- SNKAWJBJQDLSFF-NVKMUCNASA-N 1,2-dioleoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC SNKAWJBJQDLSFF-NVKMUCNASA-N 0.000 description 1
- MWRBNPKJOOWZPW-NYVOMTAGSA-N 1,2-dioleoyl-sn-glycero-3-phosphoethanolamine zwitterion Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@H](COP(O)(=O)OCCN)OC(=O)CCCCCCC\C=C/CCCCCCCC MWRBNPKJOOWZPW-NYVOMTAGSA-N 0.000 description 1
- NRJAVPSFFCBXDT-HUESYALOSA-N 1,2-distearoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCCCCCCCCCCC NRJAVPSFFCBXDT-HUESYALOSA-N 0.000 description 1
- TZCPCKNHXULUIY-RGULYWFUSA-N 1,2-distearoyl-sn-glycero-3-phosphoserine Chemical compound CCCCCCCCCCCCCCCCCC(=O)OC[C@H](COP(O)(=O)OC[C@H](N)C(O)=O)OC(=O)CCCCCCCCCCCCCCCCC TZCPCKNHXULUIY-RGULYWFUSA-N 0.000 description 1
- JFBCSFJKETUREV-LJAQVGFWSA-N 1,2-ditetradecanoyl-sn-glycerol Chemical compound CCCCCCCCCCCCCC(=O)OC[C@H](CO)OC(=O)CCCCCCCCCCCCC JFBCSFJKETUREV-LJAQVGFWSA-N 0.000 description 1
- WTJKGGKOPKCXLL-VYOBOKEXSA-N 1-hexadecanoyl-2-(9Z-octadecenoyl)-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC WTJKGGKOPKCXLL-VYOBOKEXSA-N 0.000 description 1
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 1
- CHHHXKFHOYLYRE-UHFFFAOYSA-M 2,4-Hexadienoic acid, potassium salt (1:1), (2E,4E)- Chemical compound [K+].CC=CC=CC([O-])=O CHHHXKFHOYLYRE-UHFFFAOYSA-M 0.000 description 1
- XLPHMKQBBCKEFO-DHYROEPTSA-N 2-azaniumylethyl [(2r)-2,3-bis(3,7,11,15-tetramethylhexadecanoyloxy)propyl] phosphate Chemical compound CC(C)CCCC(C)CCCC(C)CCCC(C)CC(=O)OC[C@H](COP(O)(=O)OCCN)OC(=O)CC(C)CCCC(C)CCCC(C)CCCC(C)C XLPHMKQBBCKEFO-DHYROEPTSA-N 0.000 description 1
- CFWRDBDJAOHXSH-SECBINFHSA-N 2-azaniumylethyl [(2r)-2,3-diacetyloxypropyl] phosphate Chemical compound CC(=O)OC[C@@H](OC(C)=O)COP(O)(=O)OCCN CFWRDBDJAOHXSH-SECBINFHSA-N 0.000 description 1
- SPCKHVPPRJWQRZ-UHFFFAOYSA-N 2-benzhydryloxy-n,n-dimethylethanamine;2-hydroxypropane-1,2,3-tricarboxylic acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O.C=1C=CC=CC=1C(OCCN(C)C)C1=CC=CC=C1 SPCKHVPPRJWQRZ-UHFFFAOYSA-N 0.000 description 1
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 1
- SLQKYSPHBZMASJ-QKPORZECSA-N 24-methylene-cholest-8-en-3β-ol Chemical compound C([C@@]12C)C[C@H](O)C[C@@H]1CCC1=C2CC[C@]2(C)[C@@H]([C@H](C)CCC(=C)C(C)C)CC[C@H]21 SLQKYSPHBZMASJ-QKPORZECSA-N 0.000 description 1
- KLEXDBGYSOIREE-UHFFFAOYSA-N 24xi-n-propylcholesterol Natural products C1C=C2CC(O)CCC2(C)C2C1C1CCC(C(C)CCC(CCC)C(C)C)C1(C)CC2 KLEXDBGYSOIREE-UHFFFAOYSA-N 0.000 description 1
- ZISVTYVLWSZJAL-UHFFFAOYSA-N 3,6-bis[4-[bis(2-hydroxydodecyl)amino]butyl]piperazine-2,5-dione Chemical compound CCCCCCCCCCC(O)CN(CC(O)CCCCCCCCCC)CCCCC1NC(=O)C(CCCCN(CC(O)CCCCCCCCCC)CC(O)CCCCCCCCCC)NC1=O ZISVTYVLWSZJAL-UHFFFAOYSA-N 0.000 description 1
- RLCKHJSFHOZMDR-PWCSWUJKSA-N 3,7R,11R,15-tetramethyl-hexadecanoic acid Chemical compound CC(C)CCC[C@@H](C)CCC[C@@H](C)CCCC(C)CC(O)=O RLCKHJSFHOZMDR-PWCSWUJKSA-N 0.000 description 1
- HNTKPUXXCNQLFR-KWXKLSQISA-N 3-[2,2-bis[(9z,12z)-octadeca-9,12-dienyl]-1,3-dioxolan-4-yl]-n,n-dimethylpropan-1-amine Chemical compound CCCCC\C=C/C\C=C/CCCCCCCCC1(CCCCCCCC\C=C/C\C=C/CCCCC)OCC(CCCN(C)C)O1 HNTKPUXXCNQLFR-KWXKLSQISA-N 0.000 description 1
- ILBCSMHIEBDGJY-UHFFFAOYSA-N 3-[4-(3-aminopropylamino)butylamino]propylcarbamic acid Chemical compound NCCCNCCCCNCCCNC(O)=O ILBCSMHIEBDGJY-UHFFFAOYSA-N 0.000 description 1
- WXNZTHHGJRFXKQ-UHFFFAOYSA-N 4-chlorophenol Chemical compound OC1=CC=C(Cl)C=C1 WXNZTHHGJRFXKQ-UHFFFAOYSA-N 0.000 description 1
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 1
- YWWVWXASSLXJHU-UHFFFAOYSA-N 9E-tetradecenoic acid Natural products CCCCC=CCCCCCCCC(O)=O YWWVWXASSLXJHU-UHFFFAOYSA-N 0.000 description 1
- 101000860090 Acidaminococcus sp. (strain BV3L6) CRISPR-associated endonuclease Cas12a Proteins 0.000 description 1
- 241000580270 Adeno-associated virus - 4 Species 0.000 description 1
- 241001164825 Adeno-associated virus - 8 Species 0.000 description 1
- 102100032157 Adenylate cyclase type 10 Human genes 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 229930188104 Alkylresorcinol Natural products 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 244000144730 Amygdalus persica Species 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 102000008682 Argonaute Proteins Human genes 0.000 description 1
- 108010088141 Argonaute Proteins Proteins 0.000 description 1
- 241000713838 Avian myeloblastosis virus Species 0.000 description 1
- 235000021357 Behenic acid Nutrition 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 208000019838 Blood disease Diseases 0.000 description 1
- OILXMJHPFNGGTO-NRHJOKMGSA-N Brassicasterol Natural products O[C@@H]1CC=2[C@@](C)([C@@H]3[C@H]([C@H]4[C@](C)([C@H]([C@@H](/C=C/[C@H](C(C)C)C)C)CC4)CC3)CC=2)CC1 OILXMJHPFNGGTO-NRHJOKMGSA-N 0.000 description 1
- DPUOLQHDNGRHBS-UHFFFAOYSA-N Brassidinsaeure Natural products CCCCCCCCC=CCCCCCCCCCCCC(O)=O DPUOLQHDNGRHBS-UHFFFAOYSA-N 0.000 description 1
- YDNKGFDKKRUKPY-JHOUSYSJSA-N C16 ceramide Natural products CCCCCCCCCCCCCCCC(=O)N[C@@H](CO)[C@H](O)C=CCCCCCCCCCCCCC YDNKGFDKKRUKPY-JHOUSYSJSA-N 0.000 description 1
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 1
- 101100285688 Caenorhabditis elegans hrg-7 gene Proteins 0.000 description 1
- 102100033093 Calcium/calmodulin-dependent protein kinase type II subunit alpha Human genes 0.000 description 1
- SGNBVLSWZMBQTH-FGAXOLDCSA-N Campesterol Natural products O[C@@H]1CC=2[C@@](C)([C@@H]3[C@H]([C@H]4[C@@](C)([C@H]([C@H](CC[C@H](C(C)C)C)C)CC4)CC3)CC=2)CC1 SGNBVLSWZMBQTH-FGAXOLDCSA-N 0.000 description 1
- 241000589875 Campylobacter jejuni Species 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 102100035673 Centrosomal protein of 290 kDa Human genes 0.000 description 1
- 101710198317 Centrosomal protein of 290 kDa Proteins 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- ZKLPARSLTMPFCP-UHFFFAOYSA-N Cetirizine Chemical compound C1CN(CCOCC(=O)O)CCN1C(C=1C=CC(Cl)=CC=1)C1=CC=CC=C1 ZKLPARSLTMPFCP-UHFFFAOYSA-N 0.000 description 1
- 108091006146 Channels Proteins 0.000 description 1
- 208000016718 Chromosome Inversion Diseases 0.000 description 1
- LPZCCMIISIBREI-MTFRKTCUSA-N Citrostadienol Natural products CC=C(CC[C@@H](C)[C@H]1CC[C@H]2C3=CC[C@H]4[C@H](C)[C@@H](O)CC[C@]4(C)[C@H]3CC[C@]12C)C(C)C LPZCCMIISIBREI-MTFRKTCUSA-N 0.000 description 1
- 102100023804 Coagulation factor VII Human genes 0.000 description 1
- 102100026735 Coagulation factor VIII Human genes 0.000 description 1
- 101100007328 Cocos nucifera COS-1 gene Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 102000016918 Complement C3 Human genes 0.000 description 1
- 108010028780 Complement C3 Proteins 0.000 description 1
- 102100027591 Copper-transporting ATPase 2 Human genes 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- YPWSLBHSMIKTPR-UHFFFAOYSA-N Cystathionine Natural products OC(=O)C(N)CCSSCC(N)C(O)=O YPWSLBHSMIKTPR-UHFFFAOYSA-N 0.000 description 1
- 102220607024 Cytosolic arginine sensor for mTORC1 subunit 2_R66A_mutation Human genes 0.000 description 1
- 102220606910 Cytosolic arginine sensor for mTORC1 subunit 2_R70A_mutation Human genes 0.000 description 1
- 102220606911 Cytosolic arginine sensor for mTORC1 subunit 2_R74A_mutation Human genes 0.000 description 1
- 102220606905 Cytosolic arginine sensor for mTORC1 subunit 2_R78A_mutation Human genes 0.000 description 1
- AUNGANRZJHBGPY-UHFFFAOYSA-N D-Lyxoflavin Natural products OCC(O)C(O)C(O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-UHFFFAOYSA-N 0.000 description 1
- ILRYLPWNYFXEMH-UHFFFAOYSA-N D-cystathionine Natural products OC(=O)C(N)CCSCC(N)C(O)=O ILRYLPWNYFXEMH-UHFFFAOYSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 230000033616 DNA repair Effects 0.000 description 1
- 101710096438 DNA-binding protein Proteins 0.000 description 1
- 206010011953 Decreased activity Diseases 0.000 description 1
- ARVGMISWLZPBCH-UHFFFAOYSA-N Dehydro-beta-sitosterol Natural products C1C(O)CCC2(C)C(CCC3(C(C(C)CCC(CC)C(C)C)CCC33)C)C3=CC=C21 ARVGMISWLZPBCH-UHFFFAOYSA-N 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- GZDFHIJNHHMENY-UHFFFAOYSA-N Dimethyl dicarbonate Chemical compound COC(=O)OC(=O)OC GZDFHIJNHHMENY-UHFFFAOYSA-N 0.000 description 1
- 235000021294 Docosapentaenoic acid Nutrition 0.000 description 1
- 241000255598 Drosophila mauritiana Species 0.000 description 1
- 206010013801 Duchenne Muscular Dystrophy Diseases 0.000 description 1
- 239000001692 EU approved anti-caking agent Substances 0.000 description 1
- 102100038132 Endogenous retrovirus group K member 6 Pro protein Human genes 0.000 description 1
- 101710091045 Envelope protein Proteins 0.000 description 1
- DNVPQKQSNYMLRS-NXVQYWJNSA-N Ergosterol Natural products CC(C)[C@@H](C)C=C[C@H](C)[C@H]1CC[C@H]2C3=CC=C4C[C@@H](O)CC[C@]4(C)[C@@H]3CC[C@]12C DNVPQKQSNYMLRS-NXVQYWJNSA-N 0.000 description 1
- URXZXNYJPAJJOQ-UHFFFAOYSA-N Erucic acid Natural products CCCCCCC=CCCCCCCCCCCCC(O)=O URXZXNYJPAJJOQ-UHFFFAOYSA-N 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 206010016077 Factor IX deficiency Diseases 0.000 description 1
- 108010023321 Factor VII Proteins 0.000 description 1
- 241000713800 Feline immunodeficiency virus Species 0.000 description 1
- 241000714165 Feline leukemia virus Species 0.000 description 1
- 102000008857 Ferritin Human genes 0.000 description 1
- 108050000784 Ferritin Proteins 0.000 description 1
- 238000008416 Ferritin Methods 0.000 description 1
- 241000710198 Foot-and-mouth disease virus Species 0.000 description 1
- 241000589601 Francisella Species 0.000 description 1
- 108700036482 Francisella novicida Cas9 Proteins 0.000 description 1
- 241000589599 Francisella tularensis subsp. novicida Species 0.000 description 1
- 241000588088 Francisella tularensis subsp. novicida U112 Species 0.000 description 1
- 241000223221 Fusarium oxysporum Species 0.000 description 1
- 101150014889 Gad1 gene Proteins 0.000 description 1
- 102100035902 Glutamate decarboxylase 1 Human genes 0.000 description 1
- 102100035857 Glutamate decarboxylase 2 Human genes 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- ZWZWYGMENQVNFU-UHFFFAOYSA-N Glycerophosphorylserin Natural products OC(=O)C(N)COP(O)(=O)OCC(O)CO ZWZWYGMENQVNFU-UHFFFAOYSA-N 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 101150013707 HBB gene Proteins 0.000 description 1
- 102000029812 HNH nuclease Human genes 0.000 description 1
- 108060003760 HNH nuclease Proteins 0.000 description 1
- BTEISVKTSQLKST-UHFFFAOYSA-N Haliclonasterol Natural products CC(C=CC(C)C(C)(C)C)C1CCC2C3=CC=C4CC(O)CCC4(C)C3CCC12C BTEISVKTSQLKST-UHFFFAOYSA-N 0.000 description 1
- 102100021519 Hemoglobin subunit beta Human genes 0.000 description 1
- 208000031220 Hemophilia Diseases 0.000 description 1
- 208000002972 Hepatolenticular Degeneration Diseases 0.000 description 1
- SQUHHTBVTRBESD-UHFFFAOYSA-N Hexa-Ac-myo-Inositol Natural products CC(=O)OC1C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C1OC(C)=O SQUHHTBVTRBESD-UHFFFAOYSA-N 0.000 description 1
- 101000775498 Homo sapiens Adenylate cyclase type 10 Proteins 0.000 description 1
- 101000944249 Homo sapiens Calcium/calmodulin-dependent protein kinase type II subunit alpha Proteins 0.000 description 1
- 101000936280 Homo sapiens Copper-transporting ATPase 2 Proteins 0.000 description 1
- 101000907783 Homo sapiens Cystic fibrosis transmembrane conductance regulator Proteins 0.000 description 1
- 101000873786 Homo sapiens Glutamate decarboxylase 2 Proteins 0.000 description 1
- 101000843809 Homo sapiens Hydroxycarboxylic acid receptor 2 Proteins 0.000 description 1
- 101000952182 Homo sapiens Max-like protein X Proteins 0.000 description 1
- 101001057159 Homo sapiens Melanoma-associated antigen C3 Proteins 0.000 description 1
- 101001018717 Homo sapiens Mitofusin-2 Proteins 0.000 description 1
- 101001105486 Homo sapiens Proteasome subunit alpha type-7 Proteins 0.000 description 1
- 101000600434 Homo sapiens Putative uncharacterized protein encoded by MIR7-3HG Proteins 0.000 description 1
- 101000738771 Homo sapiens Receptor-type tyrosine-protein phosphatase C Proteins 0.000 description 1
- 101000801643 Homo sapiens Retinal-specific phospholipid-transporting ATPase ABCA4 Proteins 0.000 description 1
- 101000742373 Homo sapiens Vesicular inhibitory amino acid transporter Proteins 0.000 description 1
- 101001104102 Homo sapiens X-linked retinitis pigmentosa GTPase regulator Proteins 0.000 description 1
- 101001074035 Homo sapiens Zinc finger protein GLI2 Proteins 0.000 description 1
- CPELXLSAUQHCOX-UHFFFAOYSA-N Hydrogen bromide Chemical compound Br CPELXLSAUQHCOX-UHFFFAOYSA-N 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 1
- 102000017727 Immunoglobulin Variable Region Human genes 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 206010051792 Infusion related reaction Diseases 0.000 description 1
- 208000032578 Inherited retinal disease Diseases 0.000 description 1
- 102100034349 Integrase Human genes 0.000 description 1
- 102100024383 Integrator complex subunit 10 Human genes 0.000 description 1
- 101710149805 Integrator complex subunit 10 Proteins 0.000 description 1
- 201000006347 Intellectual Disability Diseases 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ILRYLPWNYFXEMH-WHFBIAKZSA-N L-cystathionine Chemical compound [O-]C(=O)[C@@H]([NH3+])CCSC[C@H]([NH3+])C([O-])=O ILRYLPWNYFXEMH-WHFBIAKZSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 241001112693 Lachnospiraceae Species 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 239000005639 Lauric acid Substances 0.000 description 1
- 208000004609 Leber congenital amaurosis 10 Diseases 0.000 description 1
- 239000000232 Lipid Bilayer Substances 0.000 description 1
- 241001417045 Lophius litulon Species 0.000 description 1
- 208000019693 Lung disease Diseases 0.000 description 1
- 101150083522 MECP2 gene Proteins 0.000 description 1
- 102220486635 Mannose-1-phosphate guanyltransferase beta_S56A_mutation Human genes 0.000 description 1
- 102100037423 Max-like protein X Human genes 0.000 description 1
- 201000009906 Meningitis Diseases 0.000 description 1
- 108091060294 Messenger RNP Proteins 0.000 description 1
- 102100039124 Methyl-CpG-binding protein 2 Human genes 0.000 description 1
- 229920000168 Microcrystalline cellulose Polymers 0.000 description 1
- 241001024304 Mino Species 0.000 description 1
- 102100033703 Mitofusin-2 Human genes 0.000 description 1
- 101000981253 Mus musculus GPI-linked NAD(P)(+)-arginine ADP-ribosyltransferase 1 Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 101000606416 Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) Acyltransferase PE Proteins 0.000 description 1
- CRJGESKKUOMBCT-VQTJNVASSA-N N-acetylsphinganine Chemical compound CCCCCCCCCCCCCCC[C@@H](O)[C@H](CO)NC(C)=O CRJGESKKUOMBCT-VQTJNVASSA-N 0.000 description 1
- 108091061960 Naked DNA Proteins 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 108700019961 Neoplasm Genes Proteins 0.000 description 1
- 102000048850 Neoplasm Genes Human genes 0.000 description 1
- 101100058191 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) bcp-1 gene Proteins 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 1
- 101710198224 Ornithine carbamoyltransferase, mitochondrial Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- WIHSZOXPODIZSW-KJIWEYRQSA-N PE(18:3(9Z,12Z,15Z)/18:3(9Z,12Z,15Z)) Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCC(=O)OC[C@H](COP(O)(=O)OCCN)OC(=O)CCCCCCC\C=C/C\C=C/C\C=C/CC WIHSZOXPODIZSW-KJIWEYRQSA-N 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 235000021319 Palmitoleic acid Nutrition 0.000 description 1
- 235000019483 Peanut oil Nutrition 0.000 description 1
- 101000882917 Penaeus paulensis Hemolymph clottable protein Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 244000302661 Phyllostachys pubescens Species 0.000 description 1
- 235000003570 Phyllostachys pubescens Nutrition 0.000 description 1
- 241000605861 Prevotella Species 0.000 description 1
- XBDQKXXYIPTUBI-UHFFFAOYSA-N Propionic acid Chemical class CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 102100021201 Proteasome subunit alpha type-7 Human genes 0.000 description 1
- 101710188315 Protein X Proteins 0.000 description 1
- 235000006040 Prunus persica var persica Nutrition 0.000 description 1
- 241000221301 Puccinia graminis Species 0.000 description 1
- 102100037401 Putative uncharacterized protein encoded by MIR7-3HG Human genes 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 102100037422 Receptor-type tyrosine-protein phosphatase C Human genes 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 208000006265 Renal cell carcinoma Diseases 0.000 description 1
- 102100033617 Retinal-specific phospholipid-transporting ATPase ABCA4 Human genes 0.000 description 1
- 208000006289 Rett Syndrome Diseases 0.000 description 1
- 108090000820 Rhodopsin Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 101100170553 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) DLD2 gene Proteins 0.000 description 1
- 208000026552 Severe hemophilia A Diseases 0.000 description 1
- 241000713311 Simian immunodeficiency virus Species 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 108010003723 Single-Domain Antibodies Proteins 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 241001134656 Staphylococcus lugdunensis Species 0.000 description 1
- 208000027073 Stargardt disease Diseases 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- 108091027544 Subgenomic mRNA Proteins 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 1
- 102000001435 Synapsin Human genes 0.000 description 1
- 108050009621 Synapsin Proteins 0.000 description 1
- 108010017842 Telomerase Proteins 0.000 description 1
- 241000906446 Theraps Species 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- XYNPYHXGMWJBLV-VXPJTDKGSA-N Tomatidine Chemical compound O([C@@H]1[C@@H]([C@]2(CC[C@@H]3[C@@]4(C)CC[C@H](O)C[C@@H]4CC[C@H]3[C@@H]2C1)C)[C@@H]1C)[C@@]11CC[C@H](C)CN1 XYNPYHXGMWJBLV-VXPJTDKGSA-N 0.000 description 1
- QMGSCYSTMWRURP-UHFFFAOYSA-N Tomatine Natural products CC1CCC2(NC1)OC3CC4C5CCC6CC(CCC6(C)C5CCC4(C)C3C2C)OC7OC(CO)C(OC8OC(CO)C(O)C(OC9OCC(O)C(O)C9OC%10OC(CO)C(O)C(O)C%10O)C8O)C(O)C7O QMGSCYSTMWRURP-UHFFFAOYSA-N 0.000 description 1
- 108091028113 Trans-activating crRNA Proteins 0.000 description 1
- 241000589892 Treponema denticola Species 0.000 description 1
- 101800005109 Triakontatetraneuropeptide Proteins 0.000 description 1
- DTQVDTLACAAQTR-UHFFFAOYSA-M Trifluoroacetate Chemical compound [O-]C(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-M 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- OILXMJHPFNGGTO-ZRUUVFCLSA-N UNPD197407 Natural products C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)C=C[C@H](C)C(C)C)[C@@]1(C)CC2 OILXMJHPFNGGTO-ZRUUVFCLSA-N 0.000 description 1
- HZYXFRGVBOPPNZ-UHFFFAOYSA-N UNPD88870 Natural products C1C=C2CC(O)CCC2(C)C2C1C1CCC(C(C)=CCC(CC)C(C)C)C1(C)CC2 HZYXFRGVBOPPNZ-UHFFFAOYSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 102100038170 Vesicular inhibitory amino acid transporter Human genes 0.000 description 1
- 208000018839 Wilson disease Diseases 0.000 description 1
- 102100040092 X-linked retinitis pigmentosa GTPase regulator Human genes 0.000 description 1
- 102100035558 Zinc finger protein GLI2 Human genes 0.000 description 1
- NJFCSWSRXWCWHV-USYZEHPZSA-N [(2R)-2,3-bis(octadec-1-enoxy)propyl] 2-(trimethylazaniumyl)ethyl phosphate Chemical compound CCCCCCCCCCCCCCCCC=COC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC=CCCCCCCCCCCCCCCCC NJFCSWSRXWCWHV-USYZEHPZSA-N 0.000 description 1
- HIHOWBSBBDRPDW-PTHRTHQKSA-N [(3s,8s,9s,10r,13r,14s,17r)-10,13-dimethyl-17-[(2r)-6-methylheptan-2-yl]-2,3,4,7,8,9,11,12,14,15,16,17-dodecahydro-1h-cyclopenta[a]phenanthren-3-yl] n-[2-(dimethylamino)ethyl]carbamate Chemical compound C1C=C2C[C@@H](OC(=O)NCCN(C)C)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HIHOWBSBBDRPDW-PTHRTHQKSA-N 0.000 description 1
- ZKSPKDDUPMUGBG-KWXKLSQISA-N [(9z,12z)-octadeca-9,12-dienyl] 3-(dimethylamino)-2-[(9z,12z)-octadeca-9,12-dienoxy]propanoate Chemical compound CCCCC\C=C/C\C=C/CCCCCCCCOC(CN(C)C)C(=O)OCCCCCCCC\C=C/C\C=C/CCCCC ZKSPKDDUPMUGBG-KWXKLSQISA-N 0.000 description 1
- FGYYWCMRFGLJOB-MQWKRIRWSA-N [2,3-dihydroxypropoxy(hydroxy)phosphoryl] (2s)-2,6-diaminohexanoate Chemical compound NCCCC[C@H](N)C(=O)OP(O)(=O)OCC(O)CO FGYYWCMRFGLJOB-MQWKRIRWSA-N 0.000 description 1
- NYDLOCKCVISJKK-WRBBJXAJSA-N [3-(dimethylamino)-2-[(z)-octadec-9-enoyl]oxypropyl] (z)-octadec-9-enoate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC(CN(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC NYDLOCKCVISJKK-WRBBJXAJSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- HMNZFMSWFCAGGW-XPWSMXQVSA-N [3-[hydroxy(2-hydroxyethoxy)phosphoryl]oxy-2-[(e)-octadec-9-enoyl]oxypropyl] (e)-octadec-9-enoate Chemical compound CCCCCCCC\C=C\CCCCCCCC(=O)OCC(COP(O)(=O)OCCO)OC(=O)CCCCCCC\C=C\CCCCCCCC HMNZFMSWFCAGGW-XPWSMXQVSA-N 0.000 description 1
- 150000001242 acetic acid derivatives Chemical class 0.000 description 1
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 235000010419 agar Nutrition 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 125000003342 alkenyl group Chemical group 0.000 description 1
- 125000002355 alkine group Chemical group 0.000 description 1
- 125000003282 alkyl amino group Chemical group 0.000 description 1
- JAZBEHYOTPTENJ-JLNKQSITSA-N all-cis-5,8,11,14,17-icosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O JAZBEHYOTPTENJ-JLNKQSITSA-N 0.000 description 1
- 229940087168 alpha tocopherol Drugs 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 235000020661 alpha-linolenic acid Nutrition 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 125000003368 amide group Chemical group 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 230000000931 anti-neutrophil effect Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- 235000021342 arachidonic acid Nutrition 0.000 description 1
- 229940114079 arachidonic acid Drugs 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 150000001540 azides Chemical class 0.000 description 1
- 210000000227 basophil cell of anterior lobe of hypophysis Anatomy 0.000 description 1
- SLQKYSPHBZMASJ-UHFFFAOYSA-N bastadin-1 Natural products CC12CCC(O)CC1CCC1=C2CCC2(C)C(C(C)CCC(=C)C(C)C)CCC21 SLQKYSPHBZMASJ-UHFFFAOYSA-N 0.000 description 1
- 229940116226 behenic acid Drugs 0.000 description 1
- XMQFTWRPUQYINF-UHFFFAOYSA-N bensulfuron-methyl Chemical compound COC(=O)C1=CC=CC=C1CS(=O)(=O)NC(=O)NC1=NC(OC)=CC(OC)=N1 XMQFTWRPUQYINF-UHFFFAOYSA-N 0.000 description 1
- 150000001558 benzoic acid derivatives Chemical class 0.000 description 1
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 1
- MJVXAPPOFPTTCA-UHFFFAOYSA-N beta-Sistosterol Natural products CCC(CCC(C)C1CCC2C3CC=C4C(C)C(O)CCC4(C)C3CCC12C)C(C)C MJVXAPPOFPTTCA-UHFFFAOYSA-N 0.000 description 1
- NJKOMDUNNDKEAI-UHFFFAOYSA-N beta-sitosterol Natural products CCC(CCC(C)C1CCC2(C)C3CC=C4CC(O)CCC4C3CCC12C)C(C)C NJKOMDUNNDKEAI-UHFFFAOYSA-N 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 210000004204 blood vessel Anatomy 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 210000002449 bone cell Anatomy 0.000 description 1
- 210000004271 bone marrow stromal cell Anatomy 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 210000004958 brain cell Anatomy 0.000 description 1
- OILXMJHPFNGGTO-ZAUYPBDWSA-N brassicasterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)/C=C/[C@H](C)C(C)C)[C@@]1(C)CC2 OILXMJHPFNGGTO-ZAUYPBDWSA-N 0.000 description 1
- 235000004420 brassicasterol Nutrition 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- SGNBVLSWZMBQTH-PODYLUTMSA-N campesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CC[C@@H](C)C(C)C)[C@@]1(C)CC2 SGNBVLSWZMBQTH-PODYLUTMSA-N 0.000 description 1
- 235000000431 campesterol Nutrition 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- 229920003123 carboxymethyl cellulose sodium Polymers 0.000 description 1
- 229940063834 carboxymethylcellulose sodium Drugs 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 208000015114 central nervous system disease Diseases 0.000 description 1
- ZVEQCJWYRWKARO-UHFFFAOYSA-N ceramide Natural products CCCCCCCCCCCCCCC(O)C(=O)NC(CO)C(O)C=CCCC=C(C)CCCCCCCCC ZVEQCJWYRWKARO-UHFFFAOYSA-N 0.000 description 1
- 150000001783 ceramides Chemical class 0.000 description 1
- 229930183167 cerebroside Natural products 0.000 description 1
- 150000001784 cerebrosides Chemical class 0.000 description 1
- 229960001803 cetirizine Drugs 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 150000003841 chloride salts Chemical class 0.000 description 1
- 229960004926 chlorobutanol Drugs 0.000 description 1
- WLNARFZDISHUGS-MIXBDBMTSA-N cholesteryl hemisuccinate Chemical compound C1C=C2C[C@@H](OC(=O)CCC(O)=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 WLNARFZDISHUGS-MIXBDBMTSA-N 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- SECPZKHBENQXJG-UHFFFAOYSA-N cis-palmitoleic acid Natural products CCCCCCC=CCCCCCCCC(O)=O SECPZKHBENQXJG-UHFFFAOYSA-N 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000005354 coacervation Methods 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 238000006352 cycloaddition reaction Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- CYQFCXCEBYINGO-IAGOWNOFSA-N delta1-THC Chemical compound C1=C(C)CC[C@H]2C(C)(C)OC3=CC(CCCCC)=CC(O)=C3[C@@H]21 CYQFCXCEBYINGO-IAGOWNOFSA-N 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- UREBDLICKHMUKA-CXSFZGCWSA-N dexamethasone Chemical compound C1CC2=CC(=O)C=C[C@]2(C)[C@]2(F)[C@@H]1[C@@H]1C[C@@H](C)[C@@](C(=O)CO)(O)[C@@]1(C)C[C@@H]2O UREBDLICKHMUKA-CXSFZGCWSA-N 0.000 description 1
- 229960003957 dexamethasone Drugs 0.000 description 1
- 125000004663 dialkyl amino group Chemical group 0.000 description 1
- 150000001985 dialkylglycerols Chemical class 0.000 description 1
- UMGXUWVIJIQANV-UHFFFAOYSA-M didecyl(dimethyl)azanium;bromide Chemical compound [Br-].CCCCCCCCCC[N+](C)(C)CCCCCCCCCC UMGXUWVIJIQANV-UHFFFAOYSA-M 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- PSLWZOIUBRXAQW-UHFFFAOYSA-M dimethyl(dioctadecyl)azanium;bromide Chemical compound [Br-].CCCCCCCCCCCCCCCCCC[N+](C)(C)CCCCCCCCCCCCCCCCCC PSLWZOIUBRXAQW-UHFFFAOYSA-M 0.000 description 1
- UAKOZKUVZRMOFN-JDVCJPALSA-M dimethyl-bis[(z)-octadec-9-enyl]azanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCC[N+](C)(C)CCCCCCCC\C=C/CCCCCCCC UAKOZKUVZRMOFN-JDVCJPALSA-M 0.000 description 1
- 229960000520 diphenhydramine Drugs 0.000 description 1
- ZGSPNIOCEDOHGS-UHFFFAOYSA-L disodium [3-[2,3-di(octadeca-9,12-dienoyloxy)propoxy-oxidophosphoryl]oxy-2-hydroxypropyl] 2,3-di(octadeca-9,12-dienoyloxy)propyl phosphate Chemical compound [Na+].[Na+].CCCCCC=CCC=CCCCCCCCC(=O)OCC(OC(=O)CCCCCCCC=CCC=CCCCCC)COP([O-])(=O)OCC(O)COP([O-])(=O)OCC(OC(=O)CCCCCCCC=CCC=CCCCCC)COC(=O)CCCCCCCC=CCC=CCCCCC ZGSPNIOCEDOHGS-UHFFFAOYSA-L 0.000 description 1
- 101150015424 dmd gene Proteins 0.000 description 1
- 235000020669 docosahexaenoic acid Nutrition 0.000 description 1
- 229940090949 docosahexaenoic acid Drugs 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 231100000673 dose–response relationship Toxicity 0.000 description 1
- 230000005782 double-strand break Effects 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 235000020673 eicosapentaenoic acid Nutrition 0.000 description 1
- 229960005135 eicosapentaenoic acid Drugs 0.000 description 1
- JAZBEHYOTPTENJ-UHFFFAOYSA-N eicosapentaenoic acid Natural products CCC=CCC=CCC=CCC=CCC=CCCCC(O)=O JAZBEHYOTPTENJ-UHFFFAOYSA-N 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 230000003511 endothelial effect Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000011067 equilibration Methods 0.000 description 1
- DNVPQKQSNYMLRS-SOWFXMKYSA-N ergosterol Chemical compound C1[C@@H](O)CC[C@]2(C)[C@H](CC[C@]3([C@H]([C@H](C)/C=C/[C@@H](C)C(C)C)CC[C@H]33)C)C3=CC=C21 DNVPQKQSNYMLRS-SOWFXMKYSA-N 0.000 description 1
- DPUOLQHDNGRHBS-KTKRTIGZSA-N erucic acid Chemical compound CCCCCCCC\C=C/CCCCCCCCCCCC(O)=O DPUOLQHDNGRHBS-KTKRTIGZSA-N 0.000 description 1
- 230000010856 establishment of protein localization Effects 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- BEFDCLMNVWHSGT-UHFFFAOYSA-N ethenylcyclopentane Chemical compound C=CC1CCCC1 BEFDCLMNVWHSGT-UHFFFAOYSA-N 0.000 description 1
- 229940073505 ethyl vanillin Drugs 0.000 description 1
- 230000002964 excitative effect Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 229940012413 factor vii Drugs 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 239000013022 formulation composition Substances 0.000 description 1
- 210000001222 gaba-ergic neuron Anatomy 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000012224 gene deletion Methods 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 230000037440 gene silencing effect Effects 0.000 description 1
- 238000012226 gene silencing method Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 150000002314 glycerols Chemical class 0.000 description 1
- 150000002327 glycerophospholipids Chemical class 0.000 description 1
- 150000002339 glycosphingolipids Chemical class 0.000 description 1
- 210000003714 granulocyte Anatomy 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 210000002064 heart cell Anatomy 0.000 description 1
- 210000005003 heart tissue Anatomy 0.000 description 1
- 208000014951 hematologic disease Diseases 0.000 description 1
- 208000018706 hematopoietic system disease Diseases 0.000 description 1
- 208000006750 hematuria Diseases 0.000 description 1
- 150000002423 hopanoids Chemical class 0.000 description 1
- 102000056427 human CFTR Human genes 0.000 description 1
- 210000005119 human aortic smooth muscle cell Anatomy 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 239000003906 humectant Substances 0.000 description 1
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 229960000367 inositol Drugs 0.000 description 1
- CDAISMWEOUEBRE-GPIVLXJGSA-N inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 1
- 238000002743 insertional mutagenesis Methods 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 210000004020 intracellular membrane Anatomy 0.000 description 1
- 238000010255 intramuscular injection Methods 0.000 description 1
- 239000007927 intramuscular injection Substances 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000010253 intravenous injection Methods 0.000 description 1
- 239000001573 invertase Substances 0.000 description 1
- 235000011073 invertase Nutrition 0.000 description 1
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 1
- 230000003907 kidney function Effects 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 229960004488 linolenic acid Drugs 0.000 description 1
- 150000002634 lipophilic molecules Chemical class 0.000 description 1
- 208000019423 liver disease Diseases 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 231100000053 low toxicity Toxicity 0.000 description 1
- 210000005265 lung cell Anatomy 0.000 description 1
- 206010025135 lupus erythematosus Diseases 0.000 description 1
- VLBPIWYTPAXCFJ-XMMPIXPASA-N lysophosphatidylcholine O-16:0/0:0 Chemical compound CCCCCCCCCCCCCCCCOC[C@@H](O)COP([O-])(=O)OCC[N+](C)(C)C VLBPIWYTPAXCFJ-XMMPIXPASA-N 0.000 description 1
- 239000011777 magnesium Substances 0.000 description 1
- 229910052749 magnesium Inorganic materials 0.000 description 1
- 230000007257 malfunction Effects 0.000 description 1
- 150000002690 malonic acid derivatives Chemical class 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000002483 medication Methods 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 210000002901 mesenchymal stem cell Anatomy 0.000 description 1
- 206010061289 metastatic neoplasm Diseases 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 239000004530 micro-emulsion Substances 0.000 description 1
- 239000008108 microcrystalline cellulose Substances 0.000 description 1
- 235000019813 microcrystalline cellulose Nutrition 0.000 description 1
- 229940016286 microcrystalline cellulose Drugs 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000004005 microsphere Substances 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 230000025608 mitochondrion localization Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 210000001616 monocyte Anatomy 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 201000006938 muscular dystrophy Diseases 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- MAFHEURJBRFHIT-YEUCEMRASA-N n,n-dimethyl-1,2-bis[(z)-octadec-9-enoxy]propan-1-amine Chemical compound CCCCCCCC\C=C/CCCCCCCCOC(C)C(N(C)C)OCCCCCCCC\C=C/CCCCCCCC MAFHEURJBRFHIT-YEUCEMRASA-N 0.000 description 1
- BVKCQBBZBGYNOP-UHFFFAOYSA-N n-Octatriacontane Natural products CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC BVKCQBBZBGYNOP-UHFFFAOYSA-N 0.000 description 1
- WQEPLUUGTLDZJY-UHFFFAOYSA-N n-Pentadecanoic acid Natural products CCCCCCCCCCCCCCC(O)=O WQEPLUUGTLDZJY-UHFFFAOYSA-N 0.000 description 1
- 210000003061 neural cell Anatomy 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 210000000440 neutrophil Anatomy 0.000 description 1
- VVGIYYKRAMHVLU-UHFFFAOYSA-N newbouldiamide Natural products CCCCCCCCCCCCCCCCCCCC(O)C(O)C(O)C(CO)NC(=O)CCCCCCCCCCCCCCCCC VVGIYYKRAMHVLU-UHFFFAOYSA-N 0.000 description 1
- 230000030147 nuclear export Effects 0.000 description 1
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 1
- 235000021313 oleic acid Nutrition 0.000 description 1
- 235000020660 omega-3 fatty acid Nutrition 0.000 description 1
- 229940012843 omega-3 fatty acid Drugs 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 210000000963 osteoblast Anatomy 0.000 description 1
- 230000002611 ovarian Effects 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 239000006179 pH buffering agent Substances 0.000 description 1
- 229940090668 parachlorophenol Drugs 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000000312 peanut oil Substances 0.000 description 1
- 239000001814 pectin Substances 0.000 description 1
- 235000010987 pectin Nutrition 0.000 description 1
- 229920001277 pectin Polymers 0.000 description 1
- 230000006320 pegylation Effects 0.000 description 1
- 208000027232 peripheral nervous system disease Diseases 0.000 description 1
- 208000033808 peripheral neuropathy Diseases 0.000 description 1
- 229940067107 phenylethyl alcohol Drugs 0.000 description 1
- 229940067605 phosphatidylethanolamines Drugs 0.000 description 1
- 150000003905 phosphatidylinositols Chemical class 0.000 description 1
- 229940067626 phosphatidylinositols Drugs 0.000 description 1
- 150000008106 phosphatidylserines Chemical class 0.000 description 1
- 150000003019 phosphosphingolipids Chemical class 0.000 description 1
- 229940068065 phytosterols Drugs 0.000 description 1
- 230000001817 pituitary effect Effects 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 239000000244 polyoxyethylene sorbitan monooleate Substances 0.000 description 1
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 1
- 229920000053 polysorbate 80 Polymers 0.000 description 1
- 229940068968 polysorbate 80 Drugs 0.000 description 1
- 231100000683 possible toxicity Toxicity 0.000 description 1
- 235000010241 potassium sorbate Nutrition 0.000 description 1
- 239000004302 potassium sorbate Substances 0.000 description 1
- 229940069338 potassium sorbate Drugs 0.000 description 1
- 230000036515 potency Effects 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- 239000000473 propyl gallate Substances 0.000 description 1
- 235000010388 propyl gallate Nutrition 0.000 description 1
- 229940075579 propyl gallate Drugs 0.000 description 1
- 201000001474 proteinuria Diseases 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- RADKZDMFGJYCBB-UHFFFAOYSA-N pyridoxal hydrochloride Natural products CC1=NC=C(CO)C(C=O)=C1O RADKZDMFGJYCBB-UHFFFAOYSA-N 0.000 description 1
- 235000008160 pyridoxine Nutrition 0.000 description 1
- 239000011677 pyridoxine Substances 0.000 description 1
- 239000013608 rAAV vector Substances 0.000 description 1
- VMXUWOKSQNHOCA-LCYFTJDESA-N ranitidine Chemical compound [O-][N+](=O)/C=C(/NC)NCCSCC1=CC=C(CN(C)C)O1 VMXUWOKSQNHOCA-LCYFTJDESA-N 0.000 description 1
- 229960000620 ranitidine Drugs 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 208000015347 renal cell adenocarcinoma Diseases 0.000 description 1
- 230000028617 response to DNA damage stimulus Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 210000001995 reticulocyte Anatomy 0.000 description 1
- 210000001525 retina Anatomy 0.000 description 1
- 229960002477 riboflavin Drugs 0.000 description 1
- 235000019192 riboflavin Nutrition 0.000 description 1
- 239000002151 riboflavin Substances 0.000 description 1
- 238000007363 ring formation reaction Methods 0.000 description 1
- 102200070544 rs202198133 Human genes 0.000 description 1
- 102200111286 rs2234704 Human genes 0.000 description 1
- 102220242537 rs762217448 Human genes 0.000 description 1
- 102220340490 rs782578166 Human genes 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 150000004671 saturated fatty acids Chemical group 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- CDAISMWEOUEBRE-UHFFFAOYSA-N scyllo-inosotol Natural products OC1C(O)C(O)C(O)C(O)C1O CDAISMWEOUEBRE-UHFFFAOYSA-N 0.000 description 1
- 239000008159 sesame oil Substances 0.000 description 1
- 235000011803 sesame oil Nutrition 0.000 description 1
- PWRIIDWSQYQFQD-UHFFFAOYSA-N sisunine Natural products CC1CCC2(NC1)OC3CC4C5CCC6CC(CCC6(C)C5CCC4(C)C3C2C)OC7OC(CO)C(OC8OC(CO)C(O)C(OC9OC(CO)C(O)C(O)C9OC%10OC(CO)C(O)C(O)C%10O)C8O)C(O)C7O PWRIIDWSQYQFQD-UHFFFAOYSA-N 0.000 description 1
- KZJWDPNRJALLNS-VJSFXXLFSA-N sitosterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CC[C@@H](CC)C(C)C)[C@@]1(C)CC2 KZJWDPNRJALLNS-VJSFXXLFSA-N 0.000 description 1
- 235000015500 sitosterol Nutrition 0.000 description 1
- 229950005143 sitosterol Drugs 0.000 description 1
- 210000002363 skeletal muscle cell Anatomy 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- IUVFCFQZFCOKRC-IPKKNMRRSA-M sodium;[(2r)-2,3-bis[[(z)-octadec-9-enoyl]oxy]propyl] 2,3-dihydroxypropyl phosphate Chemical compound [Na+].CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC(O)CO)OC(=O)CCCCCCC\C=C/CCCCCCCC IUVFCFQZFCOKRC-IPKKNMRRSA-M 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 235000010199 sorbic acid Nutrition 0.000 description 1
- 229940075582 sorbic acid Drugs 0.000 description 1
- 239000004334 sorbic acid Substances 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 125000002328 sterol group Chemical group 0.000 description 1
- 229940032091 stigmasterol Drugs 0.000 description 1
- HCXVJBMSMIARIN-PHZDYDNGSA-N stigmasterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)/C=C/[C@@H](CC)C(C)C)[C@@]1(C)CC2 HCXVJBMSMIARIN-PHZDYDNGSA-N 0.000 description 1
- 235000016831 stigmasterol Nutrition 0.000 description 1
- BFDNMXAIBMJLBB-UHFFFAOYSA-N stigmasterol Natural products CCC(C=CC(C)C1CCCC2C3CC=C4CC(O)CCC4(C)C3CCC12C)C(C)C BFDNMXAIBMJLBB-UHFFFAOYSA-N 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 229940044609 sulfur dioxide Drugs 0.000 description 1
- 235000010269 sulphur dioxide Nutrition 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 210000002437 synoviocyte Anatomy 0.000 description 1
- 238000012385 systemic delivery Methods 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 150000003512 tertiary amines Chemical class 0.000 description 1
- 230000002381 testicular Effects 0.000 description 1
- TUNFSRHWOTWDNC-HKGQFRNVSA-N tetradecanoic acid Chemical compound CCCCCCCCCCCCC[14C](O)=O TUNFSRHWOTWDNC-HKGQFRNVSA-N 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- AOBORMOPSGHCAX-DGHZZKTQSA-N tocofersolan Chemical compound OCCOC(=O)CCC(=O)OC1=C(C)C(C)=C2O[C@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C AOBORMOPSGHCAX-DGHZZKTQSA-N 0.000 description 1
- 229960000984 tocofersolan Drugs 0.000 description 1
- XYNPYHXGMWJBLV-OFMODGJOSA-N tomatidine Natural products O[C@@H]1C[C@H]2[C@@](C)([C@@H]3[C@H]([C@H]4[C@@](C)([C@H]5[C@@H](C)[C@]6(O[C@H]5C4)NC[C@@H](C)CC6)CC3)CC2)CC1 XYNPYHXGMWJBLV-OFMODGJOSA-N 0.000 description 1
- REJLGAUYTKNVJM-SGXCCWNXSA-N tomatine Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@H]1[C@@H](CO)O[C@H]([C@@H]([C@H]1O)O)O[C@@H]1C[C@@H]2CC[C@H]3[C@@H]4C[C@H]5[C@@H]([C@]4(CC[C@@H]3[C@@]2(C)CC1)C)[C@@H]([C@@]1(NC[C@@H](C)CC1)O5)C)[C@@H]1OC[C@@H](O)[C@H](O)[C@H]1O REJLGAUYTKNVJM-SGXCCWNXSA-N 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 150000003626 triacylglycerols Chemical class 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 230000010415 tropism Effects 0.000 description 1
- NMEHNETUFHBYEG-IHKSMFQHSA-N tttn Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 NMEHNETUFHBYEG-IHKSMFQHSA-N 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 150000004670 unsaturated fatty acids Chemical group 0.000 description 1
- PLSAJKYPRJGMHO-UHFFFAOYSA-N ursolic acid Natural products CC1CCC2(CCC3(C)C(C=CC4C5(C)CCC(O)C(C)(C)C5CCC34C)C2C1C)C(=O)O PLSAJKYPRJGMHO-UHFFFAOYSA-N 0.000 description 1
- 229940096998 ursolic acid Drugs 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 210000004509 vascular smooth muscle cell Anatomy 0.000 description 1
- 230000002861 ventricular Effects 0.000 description 1
- 210000003501 vero cell Anatomy 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 239000000277 virosome Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 235000019158 vitamin B6 Nutrition 0.000 description 1
- 239000011726 vitamin B6 Substances 0.000 description 1
- 210000004127 vitreous body Anatomy 0.000 description 1
- 238000009736 wetting Methods 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 239000002076 α-tocopherol Substances 0.000 description 1
- 235000004835 α-tocopherol Nutrition 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
- C12N15/907—Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
- C12N9/1276—RNA-directed DNA polymerase (2.7.7.49), i.e. reverse transcriptase or telomerase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/80—Fusion polypeptide containing a DNA binding domain, e.g. Lacl or Tet-repressor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/30—Chemical structure
- C12N2310/35—Nature of the modification
- C12N2310/351—Conjugate
- C12N2310/3519—Fusion with another nucleic acid
Definitions
- This disclosure features attachment site containing guide RNA (atgRNA) and guide RNA complex compositions capable of binding to a gene editor protein composition and mediating incorporation of an integration recognition site into a target DNA sequence.
- the integration recognition site also referred to as a beacon
- the integration recognition site is placed in the genome by partial genomic replacement using single guide RNA or dual guide RNAs.
- the atgRNA comprises, from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a reverse transcriptase (RT) template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed the sequence (reverse Attorney Ref: 38909-52322/WO Client Ref: 005WO complement clamp “rcClamp”) is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; and a primer binding site complementary to a third genomic site of the double-stranded target DNA sequence, wherein the third genomic site is on a second strand of the target DNA sequence.
- RT reverse transcriptase
- a clamp segment in the atgRNA allows for a reduction in the number of nucleotides in the primer binding site needed to bind the third genomic site and facilitate reverse transcription.
- this disclosure provides for atgRNA compositions which are capable of being reduced in total sequence length as compared to standard prime editing guide RNAs or gene editing RNAs capable of integration recognition site insertion.
- the clamp segment binding site (i.e., second genomic site) can be selected from within a large genomic window, wherein for example, the genomic window frame can be shifted to screen for variable sequence distance from the third genomic site and/or constrained to select for efficient (e.g., melting temperature, rate of binding, low off target, etc.) hybridization to a defined length clamp segment.
- the present disclosure provides for single guide RNA, split guide RNA, and dual guide RNA methods for programmable gene insertion by partial genomic replacement.
- the present disclosure provides nucleic acid compositions, methods, and an overall platform for site-specific genetic engineering using Programmable Addition via Site-Specific Targeting Elements (PASTE) (see Ionnidi et al.; doi: 10.1101/2021.11.01.466786; the entirety of which is incorporated herein by reference), transposon-mediated gene editing, or other suitable gene editing, or gene incorporation technology.
- PASTE Programmable Addition via Site-Specific Targeting Elements
- this disclosure features an attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a reverse transcriptase template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed the sequence is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; and a primer binding site complementary to a third genomic site of the double-stranded target DNA sequence, wherein the third genomic site is on a second strand of the target DNA Attorney Ref: 38909-52322/WO Client Ref: 005WO sequence, wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site and the clamp
- this disclosure features an attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a reverse transcriptase template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed comprises a sequence at least partially complementary to a second genomic site of first strand of the double-stranded target DNA sequence; and a primer binding site comprising: a sequence complementary to a third genomic site of the double-stranded DNA target sequence, wherein the third genomic site is on a second strand of the target DNA sequence; and an anti-reverse transcriptase template (anti-RTT) sequence comprising a sequence at least partially complementary to all or a portion of the RT template, wherein the reverse transcript
- atgRNA guide
- this disclosure features an attachment site containing guide RNA (atgRNA), comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a clamp segment at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; a reverse transcriptase (RT) template comprising an integration recognition site; and a primer binding site complementary to a third genomic site of the double-stranded target DNA sequence, wherein the third genomic site is on a second strand of the target DNA sequence; wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site.
- atgRNA guide RNA
- this disclosure features an attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a clamp segment at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; a reverse transcriptase (RT) template comprising an integration recognition site; and a primer binding site comprising: a sequence complementary to a third genomic site of the double-stranded DNA target sequence, wherein the Attorney Ref: 38909-52322/WO Client Ref: 005WO third genomic site is on a second strand of the target DNA sequence; and an anti-reverse transcriptase template (anti-RTT) sequence comprising a sequence at least partially complementary to all or a portion of the RT template,
- atgRNA guide
- the atgRNA upon introducing the atgRNA into a cell with a gene editor polypeptide, all or a portion of the single-stranded DNA sequence comprising the integration recognition site is site-specifically integrated into the double-stranded target DNA sequence. [0013] In some embodiments, upon introducing the atgRNA into a cell with a gene editor polypeptide, the atgRNA comprising the clamp segment enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment. [0014] In some embodiments, the atgRNA has a length of 150 to 190 nucleotides.
- the atgRNA has a length of 160 to 180 nucleotides.
- the clamp segment has a length of 1 nucleotide to 40 nucleotides. In some embodiments, the clamp segment has a length of 1 to 10 nucleotides, 11 to 20 nucleotides, 21 to 30 nucleotides, or 31 to 40 nucleotides. In some embodiments, the clamp segment has a length of 10 nucleotides. [0016] In some embodiments, the RT template has a length of 10 nucleotides to 200 nucleotides.
- the RT template has a length of 30, 32, 34, 36, 38, 40, 42, 44, 46, 48 or 50 nucleotides.
- the second genomic site and the third genomic site are separated by one or more genomic bases in the double-stranded target DNA sequence.
- the second genomic site and the third genomic site are separated by 1 to 10 genomic bases, 11 to 20 genomic bases, 21 to30 genomic bases, 31 to 40 genomic bases, or 41 to 45 genomic bases in the double-stranded target DNA sequence.
- the second genomic site and the third genomic site are separated by 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, or 45 genomic bases in the double-stranded target DNA sequence.
- Attorney Ref: 38909-52322/WO Client Ref: 005WO In some embodiments, the second genomic site and the third genomic site are adjacent in the double-stranded target DNA sequence.
- the clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity to the second genomic site.
- the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with at least a first nucleotide mismatch.
- the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with a first nucleotide mismatch and a second nucleotide mismatch.
- the anti-RTT sequence comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity to all or a portion the RT template.
- the anti-RTT sequence has a length between 1 nucleotide to 20 nucleotides, 5 nucleotides to 15 nucleotides, or 10 nucleotides to 20 nucleotides. In some embodiments, the anti-RTT sequence has a length of 10 nucleotides. [0025] In some embodiments, upon introducing the atgRNA into a cell, the anti-RTT sequence increases stability of the atgRNA as compared to an atgRNA that does not comprise the anti-RTT sequence.
- this disclosure features a system for site-specifically incorporating an integration recognition site into a mammalian cell genome at a target DNA sequence, comprising: any of the attachment site containing gRNAs (atgRNAs) described herein; and a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain capable of incorporating the integration recognition site into the target DNA sequence, wherein, upon introducing into the cell, the system integrates the integration recognition site into the target DNA sequence.
- atgRNAs gRNAs
- the atgRNA comprising the clamp segment enhances integration efficiency of the integration recognition site at the target Attorney Ref: 38909-52322/WO Client Ref: 005WO DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment.
- this disclosure features a method for site-specifically incorporating an integration recognition site into a mammalian cell genome at a target DNA sequence, comprising: incorporating an integration target recognition site into the genome by delivering into the cell: any of the attachment site containing gRNAs (atgRNAs) described herein; and a gene editor polypeptide or polynucleotide encoding a gene editor polypeptide, wherein the gene editor polypeptide comprises a DNA binding nickase domain linked to a reverse transcriptase domain, wherein the integration recognition site is incorporated into the target DNA sequence by the gene editor polypeptide; and optionally, a nicking gRNA.
- atgRNAs gRNAs
- the method using the atgRNA comprising the clamp segment enhances integration efficiency of the integration recognition site at the target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment.
- this disclosure features a system for site-specifically integrating a donor polynucleotide template into a mammalian cell genome at a target DNA sequence, comprising: any of the attachment site containing gRNAs (atgRNAs) described herein; a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain capable of incorporating the integration recognition site into the target DNA sequence, an integration enzyme; and a donor polynucleotide template linked to a sequence that is an integration cognate of the integration recognition site present in the atgRNA, whereby the integration enzyme integrates the donor polynucleotide template into the target DNA sequence.
- atgRNAs gRNAs
- the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment.
- this disclosure features a method for site-specifically integrating a donor polynucleotide template into a mammalian cell genome at a target DNA sequence, comprising: incorporating an integration recognition site into the genome by delivering into the cell: any of the attachment site containing gRNAs (atgRNAs) described herein; and a gene editor polypeptide or polynucleotide encoding a gene editor polypeptide, wherein the gene editor polypeptide comprises Attorney Ref: 38909-52322/WO Client Ref: 005WO a DNA binding nickase domain linked to a reverse transcriptase domain and is capable of incorporating the integration recognition site into the target DNA sequence; and optionally, a nicking gRNA; and integrating the donor polynucleotide template into the genome by delivering into the cell: an integration enzyme; and a donor polynucleotide template, wherein the donor polynucleotide template is linked to a sequence that is an integration cogn
- the method enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment. 5.
- FIG.1A-1E shows analysis of AttP variants.
- FIG.1A shows a non-limiting schematic of AttP mutations tested for improving integration efficiency (SEQ ID NOS: 394 and 540-542, respectively, in order of appearance).
- FIG.1A-1E shows analysis of AttP variants.
- FIG.1A shows a non-limiting schematic of AttP mutations tested for improving integration efficiency (SEQ ID NOS: 394 and 540-542, respectively, in order of appearance).
- FIG. 1B shows integration efficiencies of wildtype and mutant AttP sites across a panel of AttB lengths.
- FIG. 1C shows a non-limiting schematic of multiplexed integration of different cargo sets at specific genomic loci.
- Three fluorescent cargos (GFP, mCherry, and YFP) are inserted orthogonally at three different loci (ACTB, LMNB1, NOLC1) for in-frame gene tagging.
- FIG.1D shows orthogonality of top 4 AttB/AttP dinucleotide pairs evaluated for GFP integration with PASTE at the ACTB locus.
- FIG.1E shows efficiency of multiplexed PASTE insertion of combinations of fluorophores at ACTB, LMNB1, and NOLC1 loci.
- FIG. 2 illustrates a schematic of single atgRNA and dual atgRNA approaches for step 1: beacon placement (“integration recognition site”) and step 2: gene insertion, thereby illustrating PASTE.
- FIGs. 3A-3C illustrates a version 1 attachment site-containing guide RNA (atgRNAv1) (FIG.3A) and a mechanism by which the atRNAv1 can be used for site-specifically integrating an integration recognition site into a mammalian cell genome at a desired target site (e.g., a target Attorney Ref: 38909-52322/WO Client Ref: 005WO DNA sequence) (FIGs.3B-3C).
- a target attorney Ref 38909-52322/WO Client Ref: 005WO DNA sequence
- FIG.3A shows an atgRNA-v1 comprising from 5’ to 3’: a spacer sequence that binds to a first genomic site, a scaffold sequence, a clamp segment comprising a sequence that is complementary (binds) to a second genomic site or comprising a sequence (see rcClamp) where upon being reverse transcribed the sequence is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence, a sequence encoding an integration recognition site (“AttB”), and a primer binding site (PBS) that binds to a third genomic site.
- a spacer sequence that binds to a first genomic site
- a scaffold sequence comprising a sequence that is complementary (binds) to a second genomic site or comprising a sequence (see rcClamp) where upon being reverse transcribed the sequence is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence
- AttB integration recognition site
- FIG.3B shows atgRNA v1 with the spacer hybridized to the first genomic site and the PBS hybridized to the third genomic site, where the nick is indicated by the inverted triangle.
- FIG.3C shows a mechanism by which the atgRNA-v1 as shown in FIGs.3A-3B can be used for site-specifically integrating an integration recognition site into a mammalian cell genome at a desired target site (e.g., a target DNA sequence).
- a nick as indicated by the inverted triangle
- a nickase e.g., nCas9
- a nickase e.g., nCas9
- FIGs. 4A-4C illustrate a version 2 attachment site-containing guide RNA (atgRNAv2) (FIG.4A) and a mechanism by which the atRNAv2 can be used for site-specifically integrating an integration recognition site into a mammalian cell genome at a desired target site (e.g., a target DNA sequence) (FIGs. 4B-4C).
- FIG. 4A illustrates a version 2 attachment site-containing guide RNA (atgRNAv2) (FIG.4A) and a mechanism by which the atRNAv2 can be used for site-specifically integrating an integration recognition site into a mammalian cell genome at a desired target site (e.g., a target DNA sequence) (FIGs. 4B-4C).
- FIG. 4A shows an atgRNA-v2 comprising 5’ to 3’: a spacer sequence that binds a first genomic site, a scaffold sequence, a clamp comprising a sequence that has complementarity (binds) to a second genomic site which is 3’ to the nick on the targeting strand (i.e., a one or more separation between the second genomic site and a third genomic site) or comprising a sequence (see rcClamp) where upon being reverse transcribed the sequence is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence, a sequence encoding an integration recognition site (e.g., AttB), and a primer binding site (PBS) that binds to a third genomic site.
- a spacer sequence that binds a first genomic site
- a scaffold sequence comprising a sequence that has complementarity (binds) to a second genomic site which is 3’ to the nick on the targeting strand (i.e., a
- FIG.4B shows atgRNA v2 with the spacer hybridized to the first genomic site and the PBS hybridized to the third genomic site, where the nick is indicated by the inverted triangle.
- FIG.4C shows the mechanism by which the atgRNA-v2 as shown in FIGs. 4A-4B can be used for site-specifically integrating an integration recognition site into a genome of a cell.
- a nick (as indicated by the inverted triangle) on the non- targeting strand induced by a nickase (e.g., nCas9) facilitates binding of the PBS to a sequence 5’ to the nick site on the non-targeting strand.
- the clamp i.e., reverse complement of the clamp
- the sequence to which the clamp (i.e., the reverse complement of the clamp) binds on the targeting strand is greater than 1 nucleotide 5’ to the site of the nick on the non-targeting strand.
- the integration recognition site e.g., AttB is integrated into the genome at the target sequence.
- FIGs.6A-6B illustrate the use of an atgRNA comprising an anti-reverse transcription (RT) sequence that is at least partially complementary to all or a portion of the RT template sequence.
- FIG.6A shows exonuclease-mediated degradation of the 3’ end of an atgRNA with the result being an atgRNA where the 3’ end including the PBS, AttB, and clamp has been degraded.
- RT anti-reverse transcription
- FIGs.7A-7C shows percent beacon placement data for atgRNA capable of site-specifically incorporating an integrase recognition site into a genomic locus.
- the AtgRNA were designed to target the LMNB1 locus (FIG.7A), the hF9 locus (FIG.7B), and the ACTB locus (FIG.7C) and each capable of site-specifically incorporating an integrase recognition site into the LMBN1 locus hF9 locus, and ACTB locus, respectively.
- Each atgRNA included a 10 nucleotide clamp and the indicated separation between the second and third genomic sites.
- FIG.7A shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNA targeting the LMNB1 locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site and the third genomic site are separated by 0 to 36 base pairs (see FIG.7A “clamp+0” (i.e., 0 base pair separation) to “clamp+36” (i.e., 36 base pair separation).
- FIG.7B shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNA targeting the hF9 locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site and the third genomic site are separated by 0 to 42 base pairs (see FIG.7B “clamp+0” (i.e., 0 base pair separation) to “clamp+42” (i.e., 42 base pair separation).
- FIG.7B shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNA targeting the hF9 locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site and the third genomic site are separated by 0 to 42 base pairs (see FIG.7B “clamp+0” (i.e., 0 base pair separation) to “clamp+42” (i.e., 42 base pair separation).
- FIG. 7C shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNA targeting the ACTB locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site Attorney Ref: 38909-52322/WO Client Ref: 005WO and the third genomic site are separated by 0 to 33 base pairs (see FIG.7C “clamp+0” (i.e., 0 base pair separation) to “clamp+33” (i.e., 33 base pair separation). 6.
- attachment site containing guide RNA (atgRNA) and guide RNA complex compositions capable of binding to a gene editor protein composition and mediating incorporation of an integration recognition site into a target DNA sequence.
- the integration recognition site also referred to as a beacon
- the integration recognition site is placed in the genome by partial genomic replacement using a single guide RNA or dual guide RNAs.
- the guide RNA comprises, in 5’ to 3’ order, comprises a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a reverse transcriptase template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed the sequence (reverse complement clamp “rcClamp”) is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; and a primer binding site complementary to a third genomic site of the double-stranded target DNA sequence, wherein the third genomic site is on a second strand of the target DNA sequence.
- rcClamp reverse complement clamp
- this disclosure is based in part on the finding that by relaxing the requirement that the clamp binding site (i.e., second genomic site) be immediately adjacent to the primer binding site (i.e., third genomic site) the size limitations associated with atgRNAs can be reduced as compared to standard prime editing guide RNAs or gene editing RNAs capable of integration recognition site insertion.
- This finding enables the clamp segment binding site (i.e., second genomic site) to be selected from within a large genomic window, wherein for example, the genomic window frame can be shifted to screen for variable sequence distance from the third genomic site.
- Gene editor is a protein that that can be used to perform gene editing, gene modification, gene insertion, gene deletion, or gene inversion.
- the terms Attorney Ref: 38909-52322/WO Client Ref: 005WO “gene editor polynucleotide” refers to polynucleotide sequence encoding the gene editor protein.
- Such an enzyme or enzyme fusion may contain DNA or RNA targetable nuclease protein (i.e., Cas protein, ADAR, or ADAT), wherein target specificity is mediated by a complexed nucleic acid (i.e., guide RNA).
- Such an enzyme or enzyme fusion may be a DNA/RNA targetable protein, wherein target specificity is mediated by internal, conjugated, fused, or linked amino acids, such as within TALENs, ZFNs, or meganucleases.
- target specificity is mediated by internal, conjugated, fused, or linked amino acids, such as within TALENs, ZFNs, or meganucleases.
- the gene editor can demonstrate targeted nuclease activity, targeted binding with no nuclease activity, or targeted nickase activity (or cleaves activity).
- a gene editor comprising a targetable protein may be fused, linked, complexed, operate in cis or trans to one or more proteins or protein fragment motifs.
- Gene editors may be fused or linked to one or more integrase, recombinase, polymerase, telomerase, reverse transcriptase, or invertase.
- a gene editor can be a prime editor fusion protein or a gene writer fusion protein.
- Prime editor fusion protein describes a protein that is used in prime editing.
- Prime editor system describes the components used in prime editing.
- Prime editing uses CRISPR enzyme that nicks or cuts only single strand of double stranded DNA, i.e., a nickase; the nickase can occur either naturally or by mutation or modification of a nuclease that makes double stranded cuts.
- the nickase is programmed (directed) with a prime-editing guide RNA (pegRNA).
- pegRNA prime-editing guide RNA
- the skilled person in the art would appreciate that the pegRNA both specifies the target site and encodes the desired edit. Described herein are attachment site containing guide RNA (atgRNA) that both specifies the target and encodes for the desired integrase target recognition site.
- the nickase may be programmed (directed) with an atgRNA.
- the nickase is a catalytically-impaired Cas9 endonuclease, a Cas9 nickase, that is fused to the reverse transcriptase.
- the Cas9 nickase part of the protein is guided to the DNA target site by the atgRNA (pegRNA), whereby a nick or single stranded cut occurs.
- the reverse transcriptase domain then uses the atgRNA (pegRNA) to template reverse transcription of the desired edit, directly polymerizing DNA onto the nicked target DNA strand.
- the edited DNA strand replaces the original DNA strand, creating a heteroduplex containing one edited strand and one unedited strand.
- the prime editor (PE) guides resolution of the heteroduplex to favor copying the edit onto the unedited strand, completing the process (typically achieved with a nickase gRNA).
- cleavase e.g., cleavase I enzyme
- an additional agent or agents may be added that improve the efficiency and outcome purity of the prime edit.
- the agent may be chemical Attorney Ref: 38909-52322/WO Client Ref: 005WO or biological and disrupt DNA mismatch repair (MMR) processes at or near the edit site (i.e., PE4 and PE5 and PEmax architecture by Chen et al. Cell, 184, 1-18, October 28, 2021; Chen et al. is incorporated herein by reference).
- the agent is a MMR-inhibiting protein.
- the MMR-inhibiting protein is dominant negative MMR protein. In certain embodiments, the dominant negative MMR protein is MLH1dn. In particular embodiments, the MMR-inhibiting agent is incorporated into the single nucleic acid construct design described herein that encodes any of the gene editor polypeptides described herein. In some embodiments, the MMR-inhibiting agent is linked or fused to the prime editor protein fusion, which may or may not have a linked or fused integrase. In some embodiments, the MMR-inhibiting agent is linked or fused to the Gene WriterTM protein, which may or may not have a linked or fused integrase. [0048] The prime editor or gene editor system can be used to achieve DNA deletion and replacement.
- the DNA deletion replacement is induced using a pair of atgRNA or pegRNA that target opposite DNA strands, programming not only the sites that are nicked but also the outcome of the repair (i.e., PrimeDel by Choi et al. Nat. Biotechnology, October 14, 2021; Choi et al. is incorporated herein by reference and TwinPE by Anzalone et al.BioRxiv, November 2, 2021; Anzalone et al. is incorporated herein by reference).
- the DNA deletion is induced using a single atgRNA.
- the DNA deletion and replacement is induced using a wild type Cas9 prime editor (PE-Cas9) system (i.e., PEDAR by Jiang et al. Nat. Biotechnology, October 14, 2021; Jiang et al. is incorporated herein by reference in its entirety)
- the DNA replacement is an integrase target recognition site or recombinase target recognition site.
- the constructs and methods described herein may be utilized to incorporate the pair of pegRNAs used in PrimeDel, TwinPE (WO2021226558 incorporated by reference herein in its entirety), or PEDAR, the prime editor fusion protein or Gene Writer protein, optionally a nickase guide RNA (ngRNA), an integrase, a nucleic acid cargo, and optionally a recombinase into a single nucleic acid construct described herein.
- the integrase may be directly linked, for example by a peptide linker, to the prime editor fusion or gene writer protein.
- the prime editors can refer to a retrovirus or lentivirus reverse transcriptase such as a Moloney Murine Leukemia Virus (M-MLV) reverse transcriptase (RT) fused to a CRISPR enzyme nickase such as a Cas9 H840A nickase, a Cas9nickase.
- the prime editors can refer to a retrovirus or lentivirus reverse transcriptase such as a Moloney Murine Leukemia Virus (M-MLV) reverse transcriptase (RT) fused to a cleavase.
- the RT can be fused at, near or to the C-terminus of a Cas9nickase, e.g., Cas9 H840A. Fusing the RT to the C-terminus region, e.g., to the C-terminus, of the Cas9 nickase may result in higher editing efficiency.
- a Cas9nickase e.g., Cas9 H840A.
- Fusing the RT to the C-terminus region, e.g., to the C-terminus, of the Cas9 nickase may result in higher editing efficiency.
- Such a complex is called PEI.
- the CRISPR enzyme nickase e.g., Cas9(H840A), i.e., a Cas9nickase
- a Cas9nickase can be linked to a non-M-MLV reverse transcriptase such as an AMV-RT or XRT (Cas9(H840A)-AMV-RT or XRT).
- the CRISPR enzyme nickase instead of being a Cas9 (H840A), i.e., instead of being a Cas9 nickase, the CRISPR enzyme nickase instead can be a CRISPR enzyme that naturally is a nickase or cuts a single strand of double stranded DNA; for instance, the CRISPR enzyme nickase can be Casl2a/b. Alternatively, the CRISPR enzyme nickase can be another mutation of Cas9, such as Cas9(Dl0A).
- a CRISPR enzyme such as a CRISPR enzyme nickase, such as Cas9 (wild type), Cas9(H840A), Cas9(Dl0A) or Cas 12a/b nickase can be fused in some embodiments to a pentamutant of M-MLV RT (D200N/ L603W/ T330P/ T306K/ W313F), whereby there can be up to about 45-fold higher efficiency, and this is called PE2.
- a CRISPR enzyme nickase such as Cas9 (wild type), Cas9(H840A), Cas9(Dl0A) or Cas 12a/b nickase
- a pentamutant of M-MLV RT D200N/ L603W/ T330P/ T306K/ W313F
- the M-MLV RT comprise one or more of the mutations Y8H, P51L, S56A, S67R, E69K, Vl29P, L139P, Tl97A, H204R, V223H, T246E, N249D, E286R, Q2911, E302K, E302R, F309N, M320L, P330E, L435G, L435R, N454K, D524A, D524G, D524N, E562Q, D583N, H594Q, E607K, D653N, and L671P. Specific M-MLV RT mutations are shown in Table 1.
- the reverse transcriptase can also be a wild-type or modified transcription xenopolymerase (RTX), avian myeloblastosis virus reverse transcriptase (AMV RT), Feline Immunodeficiency Virus reverse transcriptase (FIV-RT), FeLV-RT (Feline leukemia virus reverse transcriptase), HIV-RT (Human Immunodeficiency Virus reverse transcriptase).
- RTX transcription xenopolymerase
- AMV RT avian myeloblastosis virus reverse transcriptase
- FV-RT Feline Immunodeficiency Virus reverse transcriptase
- FeLV-RT FeLV-RT
- Feline leukemia virus reverse transcriptase FeLV-RT
- HIV-RT Human Immunodeficiency Virus reverse transcriptase
- the reverse transcriptase can be a fusion of MMuLV to the Sto7d DNA binding domain (see Ionnidi et al.;
- PE3, PE3b, PE4, PE5, and/or PEmax which a skilled person can incorporate into a single nucleic acid construct described herein, involves nicking the non-edited strand, potentially causing the cell to remake that strand using the edited strand as the template to induce HR.
- the nicking of the non-edited strand can involve the use of a nicking guide RNA (ngRNA).
- ngRNA nicking guide RNA
- Prime editors or gene editors can be found in the following: WO2020/191153, WO2020/191171, WO2020/191233, WO2020/191234, WO2020/191239, WO2020/191241, WO2020/191242, WO2020/191243, WO2020/191245, WO2020/191246, WO2020/191248, WO2020/191249, each of which is incorporated by reference herein in its entirety.
- the skilled person can incorporate the selected CRISPR enzyme, as part of the gene editor, gene editor fusion, prime editor, or prime editor fusion compositions described herein.
- the prime editor protein (1) site- specifically targets a genomic locus and (2) performs a catalytic cut or nick. These steps are typically performed by a CRISPR-Cas.
- the Cas protein may be substituted by other nucleic acid programmable DNA binding proteins (napDNAbp) such as zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), or meganucleases.
- ZFNs zinc finger nucleases
- TALENs transcription activator-like effector nucleases
- a Gene Writer can introduce novel DNA elements, such as an integration target site, into a DNA locus.
- a Gene Writer protein comprises: (A) a polypeptide or a nucleic acid encoding a polypeptide, wherein the polypeptide comprises (i) a reverse transcriptase domain, and either (x) an endonuclease domain that contains DNA binding functionality or (y) an endonuclease domain and separate DNA binding domain; and (B) a template RNA comprising (i) a sequence that binds the polypeptide and (ii) a heterologous insert sequence. Examples of such Gene WriterTM proteins and related systems can be found in US20200109398, which is incorporated by reference herein in its entirety.
- the prime editor or Gene Writer protein fusion or prime editor protein linked or fused to an integrase is expressed as a split construct.
- the split construct in reconstituted in a cell.
- the split construct can be fused or ligated via intein protein splicing.
- the split construct can be reconstituted via protein-protein inter-molecular bonding and/or interactions.
- the split construct can be reconstituted via chemical, biological, or environmental induced oligomerization.
- the split construct can be reconstituted via nanobody binding ALFA-tagged proteins.
- the split construct can be adapted into one or more single nucleic acid polynucleotides.
- an integrase or recombinase is directly linked or fused, for example by a peptide linker, which may be cleavable or non-cleavable, to the prime editor fusion Attorney Ref: 38909-52322/WO Client Ref: 005WO protein (i.e., fused Cas9 nickase-reverse transcriptase) or Gene Writer protein.
- Suitable linkers for example between the Cas9, RT, and integrase, may be selected from Table 3: Attorney Ref: 38909-52322/WO Client Ref: 005WO [0057]
- the prime editor or Gene Writer protein fusion or prime editor protein linked or fused to an integrase is expressed as a split construct.
- the split construct in reconstituted in a cell.
- the split construct can be fused or ligated via intein protein splicing.
- the split construct can be reconstituted via protein-protein inter-molecular bonding and/or interactions.
- the split construct can be reconstituted via chemical, biological, or environmental induced oligomerization.
- the split construct can be adapted into one or more nucleic acid constructs described herein.
- Type II CRISPR proteins The skilled person can incorporate a selected CRISPR enzyme, described below, as part of the prime editor fusion, as a component of the gene editor polypeptide described herein.
- Streptococcus pyogenes Cas9 (SpCas9), the most common enzyme used in genome-editing applications, is a large nuclease of 1368 amino acid residues.
- Advantages of SpCas9 include its short, 5′-NGG-3′ PAM and very high average editing efficiency.
- SpCas9 consists of two lobes: a recognition (REC) lobe and a nuclease (NUC) lobe.
- the REC lobe can be divided into three regions, a long a helix referred to as the bridge helix (residues 60–93), the REC1 (residues 94–179 and 308–713) domain, and the REC2 (residues 180–307) domain.
- the NUC lobe consists of the RuvC (residues 1–59, 718–769, and 909–1098), HNH (residues 775–908), and PAM-interacting (PI) (residues 1099–1368) domains.
- the negatively charged sgRNA:target DNA heteroduplex is accommodated in a positively charged groove at the interface between the REC and NUC lobes.
- the RuvC domain is assembled from the three split RuvC motifs (RuvC I–III) and interfaces with the PI domain to form a positively charged surface that interacts with the 30 tail of the sgRNA.
- the HNH domain lies between the RuvC II–III motifs and forms only a few contacts with the rest of the protein. Structural aspects of SpCas9 are described by Nishimasu et al., Crystal Structure of Cas9 in Complex with Guide RNA and Target DNA, Cell 156, 935-949, February 27, 2014. [0059] REC lobe: The REC lobe includes the REC1 and REC2 domains. The REC2 domain does not contact the bound guide:target heteroduplex, indicating that truncation of REC lobe may be tolerated by SpCas9.
- SpCas9 mutant lacking the REC2 domain retained ⁇ 50% of the wild-type Cas9 activity, indicating that the REC2 domain is not critical for DNA cleavage.
- the deletion of either the repeat-interacting region (D97–150) or the anti-repeat-interacting region (D312–409) of the REC1 domain abolished the DNA cleavage activity, indicating that the recognition of the repeat:anti-repeat duplex by the REC1 domain is critical for the Cas9 function.
- PAM-Interacting domain The NUC lobe contains the PAM-interacting (PI) domain that is positioned to recognize the PAM sequence on the noncomplementary DNA strand.
- RuvC domain The RuvC nucleases of SpCas9 have an RNase H fold and four catalytic residues, Asp10 (Ala), Glu762, His983, and Asp986, that are critical for the two-metal cleavage of the noncomplementary strand of the target DNA.
- the Cas9 RuvC domain has other structural elements involved in interactions with the guide:target heteroduplex (an end-capping loop between ⁇ 42 and ⁇ 43) and the PI domain/stem loop 3 ( ⁇ hairpin formed by ⁇ 3 and ⁇ 4).
- HNH domain SpCas9 HNH nucleases have three catalytic residues, Asp839, His840, and Asn863 and cleave the complementary strand of the target DNA through a single-metal mechanism.
- the backbone phosphate groups of the guide region interact with the REC1 domain (Arg165, Gly166, Arg403, Asn407, Lys510, Tyr515, and Arg661) and the Attorney Ref: 38909-52322/WO Client Ref: 005WO bridge helix (Arg63, Arg66, Arg70, Arg71, Arg74, and Arg78).
- the 20-hydroxyl groups of G1, C15, U16, and G19 hydrogen bond with Val1009, Tyr450, Arg447/Ile448, and Thr404, respectively.
- RNA-guided DNA targeting: SpCas9 recognizes the guide:target heteroduplex in a sequence-independent manner.
- the backbone phosphate groups of the target DNA interact with the REC1 (Asn497, Trp659, Arg661, and Gln695), RuvC (Gln926), and PI (Glu1108) domains.
- the C2’ atoms of the target DNA form van der Waals interactions with the REC1 domain (Leu169, Tyr450, Met495, Met694, and His698) and the RuvC domain (Ala728).
- the terminal base pair of the guide:target heteroduplex (G1:C20’) is recognized by the RuvC domain via end-capping interactions; the sgRNA G1 and target DNA C20’ nucleobases interact with the Tyr1013 and Val1015 side chains, respectively, whereas the 20-hydroxyl and phosphate groups of sgRNA G1 interact with Val1009 and Gln926, respectively.
- the nucleobase of the flipped U44 is sandwiched between Tyr325 and His328, with its N3 atom hydrogen bonded with Tyr325, whereas the nucleobase of the unpaired G43 stacks with Tyr359 and hydrogen bonds with Asp364.
- the nucleobases of G21 and U50 in the G21:U50 wobble pair stack with the terminal C20:G10 pair in the guide:target heteroduplex and Tyr72 on the bridge helix, respectively, with the U50 O4 atom hydrogen bonded with Arg75.
- A51 adopts the syn conformation and is oriented in the direction opposite to U50.
- Stem-loop recognition Stem loop 1 is primarily recognized by the REC lobe, together with the PI domain.
- the backbone phosphate groups of stem loop 1 interact with the REC1 domain (Leu455, Ser460, Arg467, Thr472, and Ile473), the PI domain (Lys1123 and Lys1124), and the bridge helix (Arg70 and Arg74), with the 20-hydroxyl group of G58 hydrogen bonded with Leu455.
- A52 interacts with Phe1105 through a face-to-edge p-p stacking interaction, and the flipped U59 nucleobase hydrogen bonds with Asn77.
- the single-stranded linker and stem loops 2 and 3 are primarily recognized by the NUC lobe.
- the backbone phosphate groups of the linker interact with the RuvC domain (Glu57, Lys742, and Lys1097), the PI domain (Thr1102), and the bridge helix (Arg69), with the 20-hydroxyl groups of U64 and A65 hydrogen bonded with Glu57 and His721, respectively.
- the C67 nucleobase forms two hydrogen bonds with Val1100.
- Stem loop 2 is recognized by Cas9 via the interactions between the NUC lobe and the non-Watson-Crick A68:G81 pair, which is formed by direct (between the A68 N6 and G81 O6 atoms) and water-mediated (between the A68 N1 and G81 N1 atoms) hydrogen-bonding interactions.
- the A68 and G81 nucleobases contact Ser1351 and Tyr1356, respectively, whereas the A68:G81 pair interacts with Thr1358 via a water-mediated hydrogen bond.
- Stem loop 3 interacts with the NUC lobe more extensively, as compared to stem loop 2.
- the backbone phosphate group of G92 interacts with the RuvC domain (Arg40 and Lys44), whereas the G89 and U90 nucleobases hydrogen bond with Gln1272 and Glu1225/Ala1227, respectively.
- the A88 and C91 nucleobases are recognized by Asn46 via multiple hydrogen- bonding interactions.
- Cas9 proteins smaller than SpCas9 allow more efficient packaging of nucleic acids encoding CRISPR systems, e.g., Cas9 and sgRNA into one rAAV (“all-in-one-AAV”) particle.
- Small Cas9 proteins can be advantageous for multidomain-Cas-nuclease-based systems for prime editing.
- Well characterized smaller Cas9 proteins include Staphylococcus Attorney Ref: 38909-52322/WO Client Ref: 005WO aureus (SauCas9, 1053 amino acid residues) and Campylobacter jejuni (CjCas9, 984 amino residues).
- Staphylococcus lugdunensis (Slu) Cas9 as having genome-editing activity and provided homology mapping to SpCas9 and SauCas9 to facilitate generation of nickases and inactive (“dead”) enzymes (Schmidt et al., 2021, Improved CRISPR genome editing using small highly active and specific engineered RNA-guided nucleases. Nat Commun 12, 4219. doi.org/10.1038/s41467-021-24454-5) and engineered nucleases with higher cleavage activity by fragmenting and shuffling Cas9 DNAs.
- the small Cas9s and nickases are useful in the instant disclosure.
- the Cas9 proteins used herein may also include other “Cas9 variants” having at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% identical to any reference Cas9 protein, including any wild type Cas9, or mutant Cas9 (e.g., a dead Cas9 or Cas9 nickase), or fragment Cas9, or circular permutant Cas9, or other variant of Cas9 disclosed herein or known in the art.
- Cas9 variants having at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% identical to any reference Cas9 protein, including any wild
- a Cas9 variant may have 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 21, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more amino acid changes compared to a reference Cas9.
- the Cas9 variant comprises a fragment of a reference Cas9 (e.g., a gRNA binding domain or a DNA-cleavage domain), such that the fragment is at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% identical to the corresponding fragment of wild type Cas9.
- a reference Cas9 e.g., a gRNA binding domain or a DNA-cleavage domain
- the fragment is at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% identical, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% of the amino acid length of a corresponding wild type Cas9 (SEQ ID NO: 18).
- the disclosure also may utilize Cas9 fragments that retain their functionality and that are fragments of any herein disclosed Cas9 protein.
- the Cas9 fragment is at least 100 amino acids in length.
- the fragment is at Attorney Ref: 38909-52322/WO Client Ref: 005WO least 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 1100, 1150, 1200, 1250, or at least 1300 amino acids in length.
- the prime editors disclosed herein may comprise one of the Cas9 variants described as follows, or a Cas9 variant thereof having at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% identical to any reference Cas9 variants.
- prime editors comprise type V or type VI CRISPR-Cas system enzymes. It will be appreciated that certain CRISPR enzymes exhibit promiscuous ssDNA cleavage activity and appropriate precautions should be considered. In certain embodiments, prime editors comprise a nickase or a dead CRISPR with nuclease function comprised in a different component.
- the nucleic acid programmable DNA binding proteins utilized herein include, without limitation, Cas9 (e.g., dCas9 and nCas9), Cas12a (Cpf1), Cas12b1 (C2c1), Cas12b2, Cas12c (C2c3), Cas12d (CasY), Cas12e (CasX), C2c4, C2c5, C2c8, C2c9, C2c10, Cas13a (C2c2), Cas13b (C2c6), Cas13c (C2c7), Cas13d, and Argonaute.
- Cas9 e.g., dCas9 and nCas9
- Cas9 e.g., dCas9 and nCas9
- Cas9 e.g., dCas9 and nCas9
- Cas9 e.g., dCas9 and nC
- Cas-equivalents further include those described in Makarova et al., “C2c2 is a single-component programmable RNA- guided RNA-targeting CRISPR effector,” Science 2016; 353(6299) and Makarova et al., “Classification and Nomenclature of CRISPR-Cas Systems: Where from Here?,” The CRISPR Journal, Vol.1. No.5, 2018, the contents of which are incorporated herein by reference.
- Cas12a Clustered Regularly Interspaced Short Palindromic Repeats from Prevotella and Francisella 1 (i.e, Cas12a (Cpf1)). Similar to Cas9, Cas12a (Cpf1) is also a Class 2 CRISPR effector, but it is a member of type V subgroup of enzymes, rather than the type II subgroup. It has been shown that Cas12a (Cpf1) mediates robust DNA interference with features distinct from Cas9.
- Cas12a is a single RNA-guided endonuclease lacking tracrRNA, and it utilizes a T- rich protospacer-adjacent motif (TTN, TTTN, or YTN). Moreover, Cpf1 cleaves DNA via a staggered DNA double-stranded break. Out of 16 Cpf1-family proteins, two enzymes from Acidaminococcus and Lachnospiraceae are shown to have efficient genome-editing activity in human cells.
- Cpf1 proteins are known in the art and have been described previously, for example Yamano et al., “Crystal structure of Cpf1 in complex with guide RNA and target DNA.” Cell (165) 2016, p.949-962; the entire contents of which is hereby incorporated by reference. 6.3.
- Type V CRISPR proteins [0078]
- prime editors used herein comprise the type V CRISPR family includes Francisella novicida U112 Cpf1 (FnCpf1) also known as FnCas12a.
- FnCpf1 adopts a bilobed architecture with the two lobes connected by the wedge (WED) domain.
- the N-terminal REC lobe consists of two a-helical domains (REC1 and REC2) that have been shown to coordinate the crRNA-target DNA heteroduplex.
- the C-terminal NUC lobe consists of the C-terminal RuvC and Nuc domains involved in target cleavage, the arginine-rich bridge helix (BH), and the PAM- interacting (PI) domain.
- the repeat-derived segment of the crRNA forms a pseudoknot stabilized by intra-molecular base-pairing and hydrogen-bonding interactions. The pseudoknot is coordinated by residues from the WED, RuvC, and REC2 domains, as well as by two hydrated magnesium cations.
- nucleotides 1–5 of the crRNA are ordered in the central cavity of FnCas12a and adopt an A-form-like helical conformation. Conformational ordering of the seed sequence is facilitated by multiple interactions between the ribose and phosphate moieties of the crRNA backbone and FnCpf1 residues in the WED and REC1 domains. These include residues Thr16, Lys595, His804, and His881 from the WED domain and residues Tyr47, Lys51, Phe182, and Arg186 from the REC1 domain.
- the structure of the FnCas12a-crRNA complex further reveals that the bases of the seed sequence are solvent exposed and poised for hybridization with target DNA.
- the PAM-containing DNA duplex comprises target strand nucleotides dT0–dT8 and non-target strand nucleotides dA(8)*–dA0* and is contacted by the PI, WED, and REC1 domains.
- the 5’-TTN-3’ PAM is recognized in FnCas12a by a mechanism combining the shape- specific recognition of a narrowed minor groove, with base-specific recognition of the PAM bases by two invariant residues, Lys671 and Lys613.
- the duplex of the target DNA is disrupted by the side chain of residue Lys667, which is inserted between the DNA strands and forms a cation- ⁇ stacking interaction with the dA0–dT0* base pair.
- the phosphate group linking target strand residues dT(-1) and dT0 is coordinated by hydrogen-bonding interactions with the side chain of Lys823 and the backbone amide of Gly826.
- Target strand residue dT(-1) bends away from residue T0, allowing the target strand to interact with the seed sequence of the crRNA.
- the non-target strand nucleotides dT1*–dT5* interact with the Arg692- Ser702 loop in FnCas12a through hydrogen-bonding and ionic interactions between backbone phosphate groups and side chains of Arg692, Asn700, Ser702, and Gln704, as well as main-chain amide groups of Lys699, Asn700, and Ser702.
- Alanine substitution of Q704 or replacement of residues Thr698–Ser702 in FnCas12a with the sequence Ala-Gly3 (SEQ ID NO: 115) substantially reduced DNA cleavage activity, suggesting that these residues contribute to R-loop formation by stabilizing the displaced conformation of the nontarget DNA strand.
- the crRNA-target strand heteroduplex is terminated by a stacking interaction with a conserved aromatic residue (Tyr410). This prevents base pairing between the crRNA and the target strand beyond nucleotides U20 and dA(-20), respectively. Beyond this point, the target DNA strand nucleotides re-engage the non-target DNA strand, forming a PAM-distal DNA duplex comprising nucleotides dC(-21)–dA(-27) and dG21*–dT27*, respectively. The duplex is confined between the REC2 and Nuc domains at the end of the central channel formed by the REC and NUC lobes.
- Target DNA cleavage FnCpf1 can independently accommodate both the target and non-target DNA strands in the catalytic pocket of the RuvC domain.
- the RuvC active site contains three catalytic residues (D917, E1006, and D1255). Structural observations suggest that both the target and non-target DNA strands are cleaved by the same catalytic mechanism in a single active site in Cpf1/Cas12a enzymes.
- nuclease comprises a Cas12f effector.
- Small CRISPR- associated effector proteins belonging to the type V-F subtype have been identified through the mining of sequence databases and members classified into Cas12f1 (Cas14a and type V-U3), Cas12f2 (Cas14b) and Cas12f3 (Cas14c, type V-U2 and U4).
- Exemplary CRISPR-Cas proteins and enzymes used in the Prime Editors herein include the following without limitation.
- Protospacer Adjacent Motif refers to an approximately 2-6 base pair DNA sequence (or a 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, 10-, 11-, 12-long nucleotide sequence) that is an important targeting component of a Cas9 nuclease.
- the PAM sequence is on either strand, and is downstream in the 5' to 3' direction of Cas9 cut site.
- the canonical PAM sequence (i.e., the PAM sequence that is associated with the Cas9 nuclease of Streptococcus pyogenes or SpCas9) is 5'-NGG-3' wherein “N” is any nucleobase followed by two guanine (“G”) nucleobases.
- N is any nucleobase followed by two guanine (“G”) nucleobases.
- G guanine
- Different PAM sequences can be associated with different Cas9 nucleases or equivalent proteins from different organisms.
- any given Cas9 nuclease may be modified to alter the PAM specificity of the nuclease such that the nuclease recognizes alternative PAM sequence.
- the PAM specificity can be modified by introducing one or more mutations, including (a) D1135V, R1335Q, and T1337R “the VQR variant”, which alters the PAM specificity to NGAN or NGNG, (b) D1135E, R1335Q, and T1337R “the EQR variant”, which alters the PAM specificity to NGAG, and (c) D1135V, G1218R, R1335E, and T1337R “the VRER variant”, which alters the PAM specificity to NGCG.
- Cas9 enzymes from different bacterial species can have varying PAM specificities and in some embodiments are therefore chosen based on the desired PAM recognition.
- Cas9 from Staphylococcus aureus (SaCas9) recognizes NGRRT or NGRRN.
- Cas9 from Neisseria meningitis (NmCas) recognizes NNNNGATT.
- Cas9 from Streptococcus thermophilis recognizes NNAGAAW.
- Cas9 from Treponema denticola recognizes NAAAAC.
- non- SpCas9s bind a variety of PAM sequences, which makes them useful to expand the range of sequences that can be targeted according to the disclosure.
- non-SpCas9s may have other characteristics that make them more useful than SpCas9.
- Cas9 from Staphylococcus aureus is about 1 kilobase smaller than SpCas9, so it can be packaged into adeno-associated virus (AAV).
- AAV adeno-associated virus
- Prime editing uses CRISPR enzyme that nicks or cuts only single strand of double stranded DNA, i.e., a nickase; and a nickase can occur either naturally or by mutation or modification of a nuclease that makes double stranded cuts.
- a nickase can occur either naturally or by mutation or modification of a nuclease that makes double stranded cuts.
- Such an enzyme can be a catalytically-impaired Cas9 endonuclease (a nickase).
- a nickase Such an enzyme can be a Casl2a/b, MAD7, or variant thereof.
- the nickase is fused to an engineered reverse transcriptase (RT).
- the nickase is programmed (directed) with a prime- editing guide RNA (pegRNA).
- pegRNA prime- editing guide RNA
- the pegRNA both specifies the target site and encodes the desired edit.
- the nickase is a catalytically-impaired Cas9 endonuclease, a Cas9 nickase, that is fused to the reverse transcriptase.
- the Cas9 nickase part of the protein is guided to the DNA target site by the pegRNA, whereby a nick or single stranded cut occurs.
- the reverse transcriptase domain then uses the pegRNA to template reverse transcription of the desired edit, directly polymerizing DNA onto the nicked target DNA strand.
- PE1 refers to a PE complex comprising a fusion protein comprising Cas9(H840A) and a wild type MMLV RT having the following N-terminus to C-terminus structure: [NLS]-[Cas9(H840A)]- [linker]-[MMLV_RT(wt)] + a desired PEgRNA.
- the prime editors disclosed herein is comprised of PE1.
- PE2 refers to a PE complex comprising a fusion protein comprising Cas9(H840A) and a variant MMLV RT having the following N-terminus to C-terminus structure: [NLS]-[Cas9(H840A)]- [linker]-[MMLV_RT(D200N)(T330P)(L603W)(T306K)(W313F)] + a desired atgRNA (PEgRNA).
- the prime editors disclosed herein are comprised of PE2.
- the prime editors disclosed herein are comprised of PE2 and co-expression of MMR protein MLH1dn, that is PE4.
- PE3 refers to PE2 plus a second-strand nicking guide RNA that complexes with the PE2 and introduces a nick in the non-edited DNA strand. The induction of the second nick increases the chances of the unedited strand, rather than the edited strand, to be repaired.
- the prime editors disclosed herein are comprised of PE3.
- the prime editors disclosed herein is comprised of PE3 and co-expression of MMR protein MLH1dn, that is PE5.
- PE3b refers to PE3 but wherein the second-strand nicking guide RNA is designed for temporal control such that the second strand nick is not introduced until after the installation of the desired edit. This is achieved by designing a gRNA with a spacer sequence with mismatches to the unedited original allele that matches only the edited strand.
- a prime editing complex consists of a type II CRISPR PE protein containing an RNA-guided DNA-nicking domain fused to a reverse transcriptase (RT) domain and complexed with a pegRNA.
- the pegRNA comprises (5’ to 3’) a spacer that is complementary to the target sequence of a genomic DNA, a nickase (e.g. Cas9) binding site, a reverse transcriptase template including editing positions, and primer binding site (PBS).
- the PE–pegRNA complex binds the target DNA and the CRISPR protein nicks the PAM- containing strand.
- the resulting 3′ end of the nicked target hybridizes to the primer-binding site (PBS) of the pegRNA, then primes reverse transcription of new DNA containing the desired edit using the RT template of the pegRNA.
- the overall structure of the pegRNA is like that of a typical type II sgRNA with a reverse transcriptase template/primer binding site appended to the 3’ end.
- the structure leaves the PBS at the 3’ end of the pegRNA free to bind to the nicked strand complementary to the target which forms the primer for reverse transcription.
- Guide RNAs of CRISPRs differ in overall structure. For example, while the spacer of a type II gRNA is located at the 5’ end, the spacer of a type V gRNA is located towards the 3’ end, with the CRISPR protein (e.g. Cas12a) binding region located toward the 5’ end.
- the regions of a type V pegRNA are rearranged compared to a type II pegRNA.
- the overall structure of the pegRNA is like that of a typical type II sgRNA with a reverse transcriptase template/primer binding site appended to the 3’ end.
- the pegRNA comprises (5’ to 3’) a CRISPR protein-binding region, a spacer which is complementary to the target sequence of a genomic DNA, a reverse transcriptase template including editing positions, and primer binding site (PBS).
- PBS primer binding site
- Attachment Site-Containing Guide RNA [0099]
- the term “attachment site-containing guide RNA” (atgRNA) and the like refer to an extended single guide RNA (sgRNA) comprising a primer binding site (PBS), a reverse transcriptase (RT) template sequence, and wherein the RT template encodes for an integration recognition site or a recombinase recognition site that can be recognized by a recombinase, integrase, or transposase.
- the RT template comprises a clamp sequence and an integration recognition site.
- an atgRNA may be referred to as a guide RNA.
- an integration recognition site or recombinase target recognition site incorporated into the pegRNA is referred to as an attachment site containing guide RNA (atgRNA).
- atgRNA guide RNA
- the term “cognate integration recognition site” or “integration cognate” or “cognate pair” refers to a first integration recognition site (e.g., any of the integration recognition sites described herein) and a second integration recognition site (e.g., any of the integration recognition sites described herein) that can be recombined.
- Recombination between a first integration recognition site (e.g., any of the integration recognition sites described herein) and a second recognition site (e.g., any of the integration recognition sites described herein) is mediated by functional symmetry between the two integration recognition sites and the central dinucleotide of each of the two integration recognition sites.
- a first integration recognition site e.g., any of the integration recognition sites described herein
- a second integration recognition site e.g., any of the integration recognition sites described herein
- a non-limiting example of a cognate pair include an attB site and an attP site, whereby a serine integrase mediates recombination between the attB site and the attP site.
- FIGs.1A-1E show optimization of the integration recognition site.
- an atgRNA comprises a reverse transcriptase template that encodes, partially or in its entirety, an integration recognition site (also referred to as an integration target recognition site) or a recombinase recognition site (also referred to as a recombinase target recognition site).
- the integration target recognition site which is to be placed at a desired location in the genome or intracellular nucleic acid, is referred to as a “beacon,” a “beacon” site or an “attachment site” or a “landing pad” or “landing site.”
- An integration target recognition site or recombinase target recognition site incorporated into the pegRNA is referred to as an attachment site containing guide RNA (atgRNA).
- the primer binding site allows the 3’ end of the nicked DNA strand to hybridize to the atgRNA, while the RT template serves as a template for the Attorney Ref: 38909-52322/WO Client Ref: 005WO synthesis of edited genetic information.
- the atgRNA is capable for instance, without limitation, of (i) identifying the target nucleotide sequence to be edited and (ii) encoding new genetic information that replaces (or in some cases adds) the targeted sequence.
- the atgRNA is capable of (i) identifying the target nucleotide sequence to be edited and (ii) encoding an integration site that replaces (or inserts/deletes within) the targeted sequences.
- the first atgRNA and the second atgRNA are an at least first pair of atgRNAs, where the at least first pair of atgRNAs have domains that are capable of guiding the gene editor protein or prime editor fusion protein to a target sequence
- the first atgRNA further includes a first RT template that comprises at least a portion of the first integration recognition site
- the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site
- the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site.
- the first atgRNA’s reverse transcriptase template encodes for a first single-stranded DNA sequence (i.e., a first DNA flap) that contains a complementary region to a second single-stranded DNA sequence (i.e., a second DNA flap) encoded by a second atgRNA comprising a second reverse transcriptase template.
- the complementary region between the first and second single-stranded DNA sequences is comprised of more than 5 consecutive bases of an integrase target recognition site.
- the complementary region between the first and second single-stranded DNA sequences is comprised of more than 10 consecutive bases of an integrase target recognition site.
- the complementary region between the first and second single-stranded DNA sequences is comprised of more than 20 consecutive bases of an integrase target recognition site. In certain embodiments, the complementary region between the first and second single-stranded DNA sequences is comprised of more than 30 consecutive bases of an integrase target recognition site.
- Use of two guide RNAs that are (or encode DNA that is) partially complementarity to each other and comprised of consecutive bases of an integrase target recognition site are referred to as dual, paired, annealing, complementary, or twin attachment site-containing guide RNAs (atgRNAs).
- RNAs that are (or encode DNA that is) full complementarity to each other and comprised of consecutive bases of an integrase target recognition site are referred to as dual, paired, annealing, complementary, or twin attachment site- containing guide RNAs (atgRNAs).
- atgRNAs paired, annealing, complementary, or twin attachment site- containing guide RNAs
- the first atgRNA upon introducing the nucleic acid construct into a cell, incorporates the first integration recognition site into the cell’s genome at the target sequence. 6.7.1.
- AtgRNAs Comprising a Clamp Segment
- This disclosure features an attachment site containing guide RNA (atgRNA) comprising a clamp segment (see, e.g., FIGs.3A-3C and FIGs.4A-4C).
- the clamp segment is designed, at least in part, to enhance integration efficiency of the integration recognition site at the double-stranded target DNA sequence (e.g., as compared to integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment).
- this disclosure features an attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase binding domain to the target DNA sequence; a reverse transcriptase template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed the sequence is at least partially complementary (e.g., the reverse transcribed clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity) to a second genomic site of the first strand of the double-stranded target DNA sequence; and a primer binding site complementary to a third genomic site of the double-stranded target DNA sequence, wherein the third genomic
- the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site. In some embodiments, the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site and the clamp segment. In some embodiments, upon introducing into a cell with a gene editor polypeptide, all or a portion of the single-stranded DNA sequence comprising the integration recognition site is site-specifically integrated into the double-stranded target DNA sequence.
- the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment.
- this disclosure features an attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase binding domain to the target DNA sequence; a reverse transcriptase template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed the sequence is at least partially complementary (e.g., the reverse transcribed clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity) to a second genomic site of the first strand of the double-stranded target DNA sequence; and a primer binding site comprising:
- the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site. In some embodiments, the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site and the clamp segment. In some embodiments, upon introducing into a cell with a gene editor polypeptide, all or a portion of the single-stranded DNA sequence comprising the integration recognition site is site-specifically integrated into the double-stranded target DNA sequence.
- the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment.
- this disclosure features an attachment site containing guide RNA (atgRNA), comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase binding domain to the target DNA sequence; a clamp segment at least partially complementary (e.g., the clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity) to a second genomic site of the first strand of the double- stranded target DNA sequence; a reverse transcriptase (RT) template comprising an integration Attorney Ref: 38909-52322/WO Client Ref: 005WO recognition site; and a primer binding site complementary to a third genomic site of the double- stranded target DNA sequence, wherein
- the atgRNA upon introducing into a cell with a gene editor polypeptide, all or a portion of the single-stranded DNA sequence comprising the integration recognition site is site-specifically integrated into the double-stranded target DNA sequence.
- the atgRNA comprising the clamp upon introducing the atgRNA into a cell with a gene editor polypeptide, the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment.
- the atgRNA includes a transcription termination signal that prevents reverse transcription of the clamp segment.
- the internucleoside linkage connecting the RT template to the clamp signal comprises a moiety that prevents reverse transcription of the clamp segment.
- this disclosure features an attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase binding domain to the target DNA sequence; a clamp segment at least partially complementary (e.g., the clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity) to a second genomic site of the first strand of the double- stranded target DNA sequence; a reverse transcriptase (RT) template comprising an integration recognition site; and a primer binding site comprising:
- the atgRNA upon introducing into a cell with a gene editor polypeptide, all or a portion of the single-stranded DNA sequence comprising the integration recognition site is site-specifically integrated into the double-stranded target DNA sequence.
- the atgRNA comprising the clamp upon introducing the atgRNA into a cell with a gene editor polypeptide, the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site Attorney Ref: 38909-52322/WO Client Ref: 005WO at the double-stranded target DNA sequence as compared to integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment.
- the atgRNA includes a transcription termination signal that prevents reverse transcription of the clamp segment.
- the internucleoside linkage connecting the RT template to the clamp signal comprises a moiety that prevents reverse transcription of the clamp segment.
- the atgRNA has a length of 150 to 210 nucleotides (e.g., a length of 150 to 200 nucleotides, a length of 150 to 190 nucleotides, a length of 150 to 180 nucleotides, a length of 150 to 170 nucleotides, a length of 150 to 160 nucleotides, a length of 160 to 210 nucleotides, a length of 160 to 200 nucleotides, a length of 160 to 190 nucleotides, a length of 160 to 180 nucleotides, a length of 160 to 170 nucleotides, a length of 170 to 210 nucleotides, a length of 170 to 200 nucleotides, a length of 170 to 190 nucleotides, a length of 170 to a
- the RT template has a length of 10 nucleotides to 200 nucleotides (e.g., a length 10 to 180 nucleotides, a length of 10 to 160 nucleotides, a length of 10 to 140 nucleotides, a length of 10 to 120 nucleotides, a length of 10 to 100 nucleotides, a length of 10 to 80 nucleotides, a length of 10 to 60 nucleotides, a length of 10 to 40 nucleotides, a length of 10 to 20 nucleotides, a length of 20 to 200 nucleotides, a length of 20 to 180 nucleotides, a length of 20 to 160 nucleotides, a length of 20 to 140 nucleotides, a length of 20 to 120 nucleotides, a length of 20 to 100 nucleotides, a length of 20 to 80 nucleotides, a length of 20 to 60 nucleot
- the RT template has a length of 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60 or more nucleotides.
- the atgRNA binds at least a first target genomic site (via spacer sequence), a second genomic site (via clamp sequence), and a third genomic site (via primer binding site). In some embodiments, the atgRNA is designed to optimize the distance between the second genomic site and the third genomic site.
- the distance between the second genomic site and the third genomic site can be based on the length of the RT template to be integrated into the target sequence, the sequence of the target DNA sequence, among other factors. In some cases, the distance between the second genomic site and the third genomic site can be referred to as the “separation” between the second genomic site and the third genomic site. [0114] In some embodiments, the atgRNA is designed such that there is partial genomic replacement of the target DNA sequence upon incorporation of the integration recognition sequence into the target DNA sequence.
- the atgRNA is designed so that the second genomic site and the third genomic site are separated by one or more genomic bases (e.g., one or more base pairs) in the double-stranded target DNA sequence, whereby the one or more genomic bases are deleted upon incorporation of the integration recognition site.
- the atgRNA is designed such that partial genomic replacement comprises deletion of 1 to 10 genomic bases (e.g., base pairs), 11 to 20 genomic bases, 21 to 30 genomic bases, 31 to 40 genomic bases 41 to 50 genomic bases, 51 to 60 genomic bases, 61 to 70 genomic bases, 71 to 80 genomic bases, 81 to 90 genomic bases, or 91-100 genomic bases.
- the atgRNA is designed such that partial genomic replacement comprises deletion of 100 to 1000 genomic bases (e.g., base pairs). In some embodiments, the atgRNA is designed such that partial genomic replacement comprises deletion of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, or 45 genomic bases (e.g., base pairs). [0116] In some embodiments, the second genomic site and the third genomic site are separated by one or more genomic bases (e.g., one or more base pairs) in the double-stranded target DNA sequence.
- this disclosure provides for guide RNA compositions which are capable of being reduced in total Attorney Ref: 38909-52322/WO Client Ref: 005WO sequence length as compared to standard prime editing guide RNAs or gene editing RNAs capable of integrase target site recognition site insertion (i.e., standard atgRNAs).
- the clamp segment binding site i.e., second genomic site
- the atgRNA clamp segment is less than 50 nucleotides in length. In some embodiments, the atgRNA clamp segment is less than 25 nucleotides in length. In some embodiments, the atgRNA clamp segment is less than 15 nucleotides in length. In some embodiments, the atgRNA clamp segment is less than 10 nucleotides in length.
- this disclosure provides for atgRNA compositions which are capable of increasing efficiency of the integration recognition site genomic incorporation (i.e., beacon placement). In some embodiments, increased efficiency is due to favored repair by endogenous cellular machinery.
- the second genomic site and the third genomic site are separated by 1 to 10 genomic bases, 11 to 20 genomic bases, 21 to 30 genomic bases, 31 to 40 genomic bases, or 41 to 45 genomic bases. In some embodiments, the second genomic site and the third genomic site are separated by 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, or 45 genomic bases (e.g., base pairs).
- the second genomic site and the third genomic site are separated by at least 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, or 1000 (1 kilobase distance) genomic bases. In some embodiments, the second genomic site and the third genomic site are separated by more than 1 kilobase. In some embodiments, second genomic site and the third genomic site are located on different chromosomes. [0118] In some embodiments, the second genomic site and the third genomic site are adjacent in the double-stranded target DNA sequence.
- the genomic bases e.g., base pairs
- the genomic bases are not incorporated into the target DNA sequence, thereby removing (e.g., deleting) these genomic bases from the target DNA sequence.
- the guide RNA is circularized.
- guide RNA circularization is mediated by 5’ and 3’ terminus dimerization motifs.
- the Attorney Ref: 38909-52322/WO Client Ref: 005WO dimerization motifs are kissing loops.
- guide RNA circularization is mediated by one or more of a ribozyme or a ligase.
- guide RNA circularization is mediated by covalently or non-covalently linking the 5’ and 3’ termini.
- one or more guide RNA is comprised in one or more RNA polynucleotides or DNA polynucleotides. 6.7.2. Clamp Segment [0121]
- This disclosures features an attachment site containing guide RNA (atgRNA) comprising a clamp segment that is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence.
- the clamp is designed to enhance efficiency of integrating the integration recognition site into the target DNA Sequence.
- the atgRNA comprising the clamp upon introducing the atgRNA comprising the clamp into a cell with a gene editor polypeptide, the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment.
- inclusion of a clamp segment in the atgRNA allows for a reduction in the number of nucleotides in the primer binding site (PBS) need to bind the third genomic site and facilitate reverse transcription. This in turn enables design of atgRNA that are shorter than guide RNAs used for other gene editing methods, for example, prime editing.
- the clamp segment has a length of 1 to 10 nucleotides, 11 to 20 nucleotides, 21 to 30 nucleotides, or 31 to 40 nucleotides. In some embodiments, the clamp segment has a length of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, or more nucleotides. In some embodiments, the clamp segment has a length of 10 nucleotides.
- the clamp segment comprises a sequence that is partially complementary to the second genomic site of the first strand of the double-stranded target DNA sequence.
- a clamp segment that is partially complementary o the second genomic site of the first strand of the double-stranded target DNA sequence includes a clamp segment having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity to the second genomic site.
- the clamp segment comprises a sequence that is complementary to the second genomic site of the first strand of the double-stranded target DNA sequence.
- the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with at least a first nucleotide mismatch. In some embodiments, the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with a first nucleotide mismatch and a second nucleotide mismatch. In some embodiments, the first nucleotide mismatch and the second nucleotide mismatch are consecutive. In some embodiments, the first nucleotide mismatch and the second nucleotide mismatch are non- consecutive.
- the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with the first nucleotide mismatch, the second nucleotide mismatch, and a third nucleotide mismatch. In some embodiments, the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with the first nucleotide mismatch, the second nucleotide mismatch, the third nucleotide mismatch, and a fourth nucleotide mismatch.
- the reverse transcriptase template comprises an integration recognition site and a clamp segment.
- the clamp segment includes a sequence where upon being reverse transcribed comprises a sequence at least partially complementary (e.g., the reverse transcribed clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity) to a second genomic site of the first strand of the double-stranded target DNA sequence.
- the clamp segment comprises a sequence (e.g., a reverse complement of the clamp “rcClamp”) that upon being reverse transcribed is complementary to the second genomic site of the first strand of the double-stranded target DNA sequence. 6.7.3.
- an atgRNA comprises an anti-reverse transcriptase template (anti- RTT) sequence (see, e.g., FIGs.6A-6B).
- anti- RTT anti-reverse transcriptase template
- the anti-RTT sequence is capable of binding to all or a portion of the RT template, thereby blocking the 3’ end of the atgRNA from degradation (e.g., degradation by exonucleases).
- the reverse transcriptase upon introducing the atgRNA into a cell along with a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain, and after binding of the PBS to the third genomic site and initiation of a reverse transcription reaction, the reverse transcriptase can displace the anti- RTT sequence, thereby making the RT template sequence accessible for reverse transcription.
- the binding of the anti-RTT sequence to all or a portion of the RT template increases stability and half-life of the atgRNA by protecting the atgRNA from exonuclease activity.
- the anti-RTT sequence upon introducing the atgRNA into a cell, increases stability and half-life of the atgRNA as compared to an atgRNA that does not comprise the anti-RTT sequence.
- the anti-RTT sequence comprises a sequence that is at least partially complementary to the RT template.
- the anti-RTT sequence comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity to the RT template.
- the anti-RTT sequence comprises a sequence that is complementary to all or a portion of the RT template.
- the anti-RTT sequence has a length of 1 to 20 nucleotides, 5 to 15 nucleotides, or 10 to 20 nucleotides. In some embodiments, the anti-RTT sequence has a length of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or more nucleotides. In one embodiment, the anti-RTT sequence has a length of 10 nucleotides.
- the atgRNA further comprises a 3’ extension (also referred to as anti-RTT sequence) capable of binding one or more consecutive nucleotides of the reverse transcriptase template first segment.
- the atgRNA further comprises a 3’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template second segment. In some embodiments, the atgRNA further comprises a 3’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template first and second segment. In some embodiments, the atgRNA further comprises a 5’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template first segment. In some embodiments, the atgRNA further comprises a 5’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template second segment.
- the atgRNA further comprises a 5’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template first and second segment.
- a 5’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template first and second segment.
- Such 5’ and/or 3’ extensions may increase the stability of the guide RNA and/or prevent spurious guide RNA genomic binding.
- the stability is increased from cellular exonucleases, endonucleases, or hydrolysis. 6.7.4.
- the present disclosure contemplates a split atgRNA comprised of two or more polynucleotides that are capable of forming a atgRNA complex (see Liu et al; doi: Attorney Ref: 38909-52322/WO Client Ref: 005WO 10.1038/s41587-022-01255-9).
- the atgRNA may be a atgRNA complex, wherein the atgRNA complex is comprised of at least two polynucleotide components.
- the atgRNA complex is comprised of a first polynucleotide component and a second polynucleotide component.
- the first polynucleotide component comprises a spacer sequence complementary to a target first genomic and a scaffold capable of binding to a DNA binding nickase
- the second polynucleotide component comprises a reverse transcriptase template comprised of at least a first segment and a second segment, wherein said first segment encodes a clamp segment complementary to a second genomic site and said second segment encodes an integrase target recognition site, a primer binding site complementary to a third genomic site, and a RNA-protein recruitment domain, and wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence.
- the RNA- protein recruitment domain is a MS2 hairpin.
- At least one of the first polynucleotide component and the second polynucleotide component are circularized.
- circularization is mediated by a ribozyme or ligase.
- circularization is mediated by covalently or non- covalently linking the 5’ and 3’ termini.
- the atgRNA or the atgRNA complex further comprises one or more of an RNA-protein recruitment domain, RNA-RNA recruitment domain, a transcriptional termination signal, a reverse transcription termination signal, an RNA ribozyme, or a chemical linker.
- one or more atgRNA complex is comprised in one or more RNA polynucleotides or DNA polynucleotides.
- the atgRNA or the atgRNA complex reverse transcriptase template encodes for a first single-stranded DNA sequence (i.e., a first DNA flap) that contains a complementary region to a second single-stranded DNA sequence (i.e., a second DNA flap) encoded by a second atgRNA comprised of a reverse transcriptase template.
- the complementary region between the first and second single-stranded DNA sequences is comprised of more than 10 consecutive bases of an integrase target recognition site.
- the complementary region between the first and second single-stranded DNA sequences is comprised of more than 20 consecutive bases of an integrase target recognition site. In certain embodiments, the complementary region between the first and second single- Attorney Ref: 38909-52322/WO Client Ref: 005WO stranded DNA sequences is comprised of more than 30 consecutive bases of an integrase target recognition site.
- the integrase target recognition site is an Bxb1 attB or attP sequence. In some embodiments, the att-site central dinucleotide is not GT or CA.
- one or more atgRNA or the atgRNA complex is comprised in one or more RNA polynucleotides or DNA polynucleotides.
- the atgRNA or the atgRNA complex reverse transcriptase template encodes for a first single-stranded DNA sequence (i.e., a first DNA flap) that contains a complementary region to a second single-stranded DNA sequence (i.e., a second DNA flap) encoded by a second atgRNA comprised of a reverse transcriptase template.
- the complementary region between the first and second single-stranded DNA sequences is comprised of more than 10 consecutive bases of an integrase target recognition site.
- the complementary region between the first and second single-stranded DNA sequences is comprised of more than 20 consecutive bases of an integrase target recognition site. In certain embodiments, the complementary region between the first and second single- stranded DNA sequences is comprised of more than 30 consecutive bases of an integrase target recognition site.
- the integrase target recognition site is an Bxb1 attB or attP sequence. In some embodiments, the att-site central dinucleotide is not GT or CA.
- one or more atgRNA or the atgRNA complex is comprised in one or more RNA polynucleotides or DNA polynucleotides. 6.7.5.
- AtgRNAs [0141] Table 9 includes atgRNAs, sgRNAs and nicking guides that can be used herein. Spacers are labeled in capital font (SPACER), RT regions in bold capital (RT REGION), AttB sites in bold lower case (attb site), and PBS in capital italics (PBS). Unless otherwise denoted, the AttB is for Bxb1.
- This disclosure also features systems for site-specifically incorporating an integration recognition site into a mammalian cell genome at a desired target site (e.g., a double stranded target DNA sequence), comprising: any of the attachment site containing guide RNAs (atgRNAs) described in Section 6.7; any of the gene editor polypeptides described herein, for example, gene editor polypeptides comprising a DNA binding nickase domain linked to a reverse transcriptase domain that is capable of incorporating the integration recognition site into the target DNA sequence, wherein, upon introducing into the cell, the system integrates the integration recognition site into the target DNA sequence.
- a desired target site e.g., a double stranded target DNA sequence
- the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment.
- This disclosure also features methods for site-specifically incorporating an integration recognition site into a mammalian cell genome at a desired target site (e.g., a double stranded target DNA sequence), comprising: incorporating an integration target recognition site into the genome by delivering into the cell any of the attachment site containing guide RNAs (atgRNAs) described in Section 6.7; and any of the gene editor polypeptides described herein or any of the polynucleotides encoding a gene editor polypeptide, for example, a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain, wherein the integration recognition site is incorporated into the target DNA sequence by the gene editor polypeptide; and optionally, a nicking gRNA.
- a desired target site e.g., a double stranded target DNA sequence
- the method using the atgRNA comprising the clamp segment enhances integration efficiency of the integration recognition site at the target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment.
- This disclosure also features a system for site-specifically integrating a donor polynucleotide template into a mammalian cell genome at a desired target site (e.g., a double stranded target DNA sequence), comprising: any of the attachment site containing guide RNAs (atgRNAs) described in Section 6.7; any of the gene editor polypeptides described herein, for example, a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain that is capable of incorporating the integration recognition site into the target DNA sequence; an integration enzyme; and an donor polynucleotide template linked to a sequence that is an integration cognate of the integration recognition site present in the atgRNA, whereby the integration enzyme integrates
- the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment.
- This disclosure also features a method for site-specifically integrating a donor polynucleotide template into a genome of a cell, comprising: incorporating an integration target Attorney Ref: 38909-52322/WO Client Ref: 005WO recognition site into the genome by delivering into the cell: any of the attachment site containing guide RNAs (atgRNAs) described in Section 6.7; and any of the gene editor polypeptides described herein or any of the polynucleotides encoding a gene editor polypeptide, for example, a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain that is capable of incorporating the integration recognition site into the target DNA sequence; and optionally, a nicking gRNA; and integrating the donor polynucleotide template into the genome by delivering into the cell: an integration enzyme; and a donor polynucleotide template, wherein the donor polynucleotide template is linked to a sequence that
- the method using the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment. In some embodiments, the method using the atgRNA comprising the clamp enhances integration efficiency of the donor polynucleotide template site at the target DNA sequence as compared to the integration efficiency of the donor polynucleotide template at the target DNA sequence when using an atgRNA that does not comprise the clamp segment.
- the atgRNA, the gene editor polypeptide (or the polynucleotide encoding the gene editor polypeptide), the integration enzyme, and the donor polynucleotide template are introduced into the cell concurrently. 6.9. Integrases/Recombinases and Integration/Recombination Sites [0146]
- a gene editor polypeptide described herein contains an integrase or recombinase.
- the integrase is delivered as a protein or the integrase is encoded in a delivered polynucleotide.
- the integration enzyme is selected from the group consisting of Dre, Vika, Bxb1, ⁇ C31, RDF, ⁇ BTl, R1, R2, R3, R4, R5, TP901-1, A118, ⁇ FCl, ⁇ C1, MR11, TG1, ⁇ 370.l, W ⁇ , BL3, SPBc, K38, Peaches, Veracruz, Rebeuca, Theia, Benedict, KSSJEB, PattyP, Doom, Scowl, Lockley, Switzer, Bob3, Troube, Abrogate, Anglerfish, Sarfire, SkiPole, ConceptII, Museum, Severus, Airmid, Benedict, Hinder, ICleared, Sheen, Mundrea, BxZ2, ⁇ RV, retrotransposases encoded by a Tc1/mariner family member including but not limited to retrotransposases encoded by LI, Tol2, Tel, Tc3, Himar 1 (isolated from the horn fly, Haematobia irritans
- Xu et al describes Attorney Ref: 38909-52322/WO Client Ref: 005WO methods for evaluating integrase activity in E. coli and mammalian cells and confirmed at least R4, ⁇ C31, ⁇ BT1, Bxb1, SPBc, TP901-1 and W ⁇ integrases to be active on substrates integrated into the genome of HT1080 cells (Xu et al., 2013, Accuracy and efficiency define Bxb1 integrase as the best of fifteen candidate serine recombinases for the integration of DNA into the human genome. BMC Biotechnol.2013 Oct 20;13:87. doi: 10.1186/1472-6750-13-87).
- Durrant describes new large serine recombinases (LSRs) divided into three classes distinguished from one another by efficiency and specificity, including landing pad LSRs which outperform wild-type Bxb1 in episomal and chromosomal integration efficiency, LSRs that achieve both efficient and site- specific integration without a landing pad, and multi-targeting LSRs with minimal site-specificity. Additionally, embodiments can include any serine recombinase such as BceINT, SSCINT, SACINT, and INT10 (see Ionnidi et al., 2021; Drag-and-drop genome insertion without DNA cleavage with CRISPR directed integrases.
- LSRs serine recombinases
- the integration site can be selected from an attB site, an attP site, an attL site, an attR site, a lox71 site a Vox site, or a FRT site.
- desired activity of integrases, transposases and the like can depend on nuclear localization.
- prokaryotic enzymes are adapted to modulate nuclear localization.
- eukaryotic or vertebrate enzymes are adapted to modulate nuclear localization.
- the disclosure provides fusion or hybrid proteins.
- Such modulation can comprise addition or removal of one or more nuclear localization signal (NLS) and/or addition or removal of one or more nuclear export signal (NES).
- NLS nuclear localization signal
- NES nuclear export signal
- FIGs. 1A-1E show analysis of effect of variant AttP sites on integration efficiency.
- This disclosure features a gene editor polypeptide useful for site-specifically integrating an integration recognition site into a target nucleic acid sequence (i.e., beacon placement), for example by using a polypeptide comprising a DNA binding nuclease linked to a reverse transcriptase, and integrating an exogenous nucleic acid sequence at the integrated integration recognition site (i.e., PASTE) by introducing into the cell the exogenous nucleic acid and an integration enzyme.
- a target nucleic acid sequence i.e., beacon placement
- the gene editor polypeptide includes a polypeptide comprised of a (i) DNA binding nickase (optionally with nickase activity), (ii) a reverse transcriptase, and (iii) an integration enzyme (e.g., an integrase or a recombinase), wherein at least any two elements of (i), Attorney Ref: 38909-52322/WO Client Ref: 005WO (ii), or (iii) are linked via at least a first C-terminal linker.
- the gene editor polypeptide includes a second linker.
- a gene editor polypeptide includes a first and a second linker
- the first linker links a first domain (e.g., a DNA binding) to a second domain (e.g., a RT) and the second linker links the second domain to a third domain (e.g., an integration enzyme).
- a first domain e.g., a DNA binding
- a second domain e.g., a RT
- a third domain e.g., an integration enzyme
- the orientation of the DNA binding nuclease, the RT, and the integration enzyme are configured for example from N-terminus to C-terminus: a DNA binding nuclease domain, a RT, and an integration enzyme; a DNA binding nuclease, an integration enzyme, and a RT; an integration enzyme, a DNA binding nuclease, and a RT; an integration enzyme, a RT, and a DNA binding nuclease; a RT, a DNA binding nuclease, and an integration enzyme; and a RT, an integration enzyme, and a DNA binding nuclease, where each of the domains are linked via a linker.
- the C-terminus of the DNA binding nuclease is linked to the reverse transcriptase, wherein the C-terminus of the reverse transcriptase is linked to the integration enzyme.
- the C-terminus of the integration enzyme is linked to the DNA binding nuclease, wherein the C-terminus of the DNA binding nuclease is linked to a reverse transcriptase.
- the gene editor polypeptide is comprised of at least two of any one of (i) a DNA binding nickase, (ii) a reverse transcriptase, and (iii) an integration enzyme wherein at least any two elements of (i), (ii), or (iii) are linked via at least a first C-terminal linker.
- the gene editor polypeptide can be comprised of (i) a DNA binding nickase, (ii) a reverse transcriptase, and (iii) an integration enzyme (e.g., an integrase or a recombinase), wherein at least any two elements of (i), (ii), or (iii) may be linked via two or more C-terminal linkers at the same linking position.
- a gene editor polypeptide includes a DNA binding nickase linked via two linkers to (a reverse transcriptase, and the reverse transcriptase domain fused or linked to an integration enzyme.
- a linker (e.g., the first linker, the second linker, or both) is selected from a flexible linker, a semi-flexible linker, a rigid linker, a flexible-rigid linker.
- the gene editor polypeptide includes at least one flexible linker.
- flexible linkers include L1 (SEQ ID NO: 15), L2 (SEQ ID NO: 551), L5 (SEQ ID NO: 19), L6 (SEQ ID NO: 18), and L13 (SEQ ID NO: 558).
- the gene editor polypeptide includes at least one flexible linker selected from: L1, L2, L5, L6, and L13.
- the gene editor polypeptide includes at least a L1 linker (SEQ ID NO: 15). In some embodiments, the gene editor polypeptide includes at least a L2 linker (SEQ ID NO: Attorney Ref: 38909-52322/WO Client Ref: 005WO 551). In some embodiments, the gene editor polypeptide includes at least a L5 linker (SEQ ID NO: 19) linker. In some embodiments, the gene editor polypeptide includes at least a L6 linker (SEQ ID NO: 18). In some embodiments, the gene editor polypeptide includes at least a L13 linker (SEQ ID NO: 558). [0155] In some embodiments, the gene editor polypeptide includes at least one semi-flexible linker.
- Non-limiting examples of semi-flexible linkers include L7 (SEQ ID NO: 552) and L8 (SEQ ID NO: 553).
- the gene editor polypeptide includes at least a L7 linker (SEQ ID NO: 552).
- the gene editor polypeptide includes at least a L7 linker (SEQ ID NO: 553).
- the gene editor polypeptide includes at least one rigid linker.
- rigid linkers include L3 (SEQ ID NO: 16), L4 (SEQ ID NO: 17), and L9 (SEQ ID NO: 554).
- the gene editor polypeptide includes at least a L3 linker (SEQ ID NO: 16).
- the gene editor polypeptide includes at least a L4 linker (SEQ ID NO: 17). In some embodiments, the gene editor polypeptide includes at least a L9 linker (SEQ ID NO: 554). [0157] In some embodiments, the gene editor polypeptide includes at least one flexible-rigid linker. Non-limiting examples of flexible-rigid linkers include L11 (SEQ ID NO: 557) and L12 (SEQ ID NO: 558). In some embodiments, the gene editor polypeptide includes at least a L11 linker (SEQ ID NO: 557). In some embodiments, the gene editor polypeptide includes at least a L12 linker (SEQ ID NO: 558).
- the gene editor polypeptide includes a linker (e.g., a first linker, a second linker, or both) where the length of the linker is selected from short, medium, and long.
- linker e.g., a first linker, a second linker, or both
- short linkers include: L1 (SEQ ID NO: 15), L3 (SEQ ID NO: 16), L7 (SEQ ID NO: 552), and L8 (SEQ ID NO: 553).
- Non-limiting examples of a medium linker include: L2 (SEQ ID NO: 551), L4 (SEQ ID NO: 17), L5 (SEQ ID NO: 19), L6 (SEQ ID NO:18), L10 (SEQ ID NO: 555), L11 (SEQ ID NO: 556), L12 (SEQ ID NO: 557), and L13 (SEQ ID NO: 558).
- a non-limiting example of a long linker is L9 (SEQ ID NO: 554).
- the gene editor polypeptide includes a linker (e.g., the first linker, the second linker, or both) that is a non-cleavable linker.
- Non-limiting examples of non-cleavable linkers include: L1 (SEQ ID NO: 15), L2 (SEQ ID NO: 551), L3 (SEQ ID NO:16), L4 (SEQ ID NO: 17), L5 (SEQ ID NO: 19), L6 (SEQ ID NO: 18), L7 (SEQ ID NO: 552), L8 (SEQ ID NO: Attorney Ref: 38909-52322/WO Client Ref: 005WO 553), L9 (SEQ ID NO: 554), L10 (SEQ ID NO: (555), L11 (SEQ ID NO: 556), L12 (SEQ ID NO: 557), and L13 (SEQ ID NO: 558).
- the gene editor polypeptide include a linker (e.g., the first linker, the second linker, or both) that is a cleavable linker.
- Cleavable linkers can facilitate the co- expression of multiple proteins within a single open reading frame, effectively separating them post-translationally.
- Non-limiting examples of self-cleaving peptides include 2A peptides (derived from the foot-and-mouth disease virus), such as F2A, T2A, E2A, and P2A; the “Inteins” that facilitate protein splicing; and the tobacco etch virus (TEV) protease recognition site.
- the cleavable linker is 2A peptide.
- the cleavable linker is a P2A peptide.
- the gene editor polypeptide includes at least a first linker and a second linker where: the at least first linker is a non-cleavable linker and the second linker is a cleavable linker; the at least first linker is a cleavable linker and the second linker is a non- cleavable linker; the at least first linker is a cleavable linker and the second linker is a cleavable linker; or the at least first linker is a non-cleavable linker and the second linker is a non-cleavable linker.
- the gene editor polypeptide includes the DNA binding nuclease, the RT, and the integration enzyme, where the C-terminus of the DNA binding nuclease is linked to the reverse transcriptase, wherein the C-terminus of the reverse transcriptase is linked to the integration enzyme, and where any one of L7 (SEQ ID NO: 552), L10 (SEQ ID NO: 555), or L12 (SEQ ID NO: 557) links the DNA binding nuclease to the reverse transcriptase.
- the gene editor polypeptide includes the DNA binding nuclease, the RT, and the integration enzyme, where the C-terminus of the DNA binding nuclease is linked to the reverse transcriptase, wherein the C-terminus of the reverse transcriptase is linked to the integration enzyme, wherein any one of L8 (SEQ ID NO: 553), L9 (SEQ ID NO: 554), L10 (SEQ ID NO: 555), L12 (SEQ ID NO: 557), or L13 (SEQ ID NO: 558) links the reverse transcriptase to the integration enzyme.
- the gene editor polypeptide includes the DNA binding nuclease, the RT, and the integration enzyme, where the C-terminus of the integration enzyme is linked to the DNA binding nuclease by L13 (SEQ NO: 558), and where the C-terminus of the DNA binding nuclease is linked to the reverse transcriptase by L13 (SEQ ID NO: 558).
- the gene editor polypeptide is not comprised of a N-most N-terminal reverse transcriptase.
- the gene editor polypeptide includes the DNA binding nuclease, the RT, and the integration enzyme, where the C-terminus of the DNA binding nuclease is linked to the reverse transcriptase by L13 (SEQ ID NO: 558), and where the C-terminus of the reverse transcriptase is linked to the integration enzyme by a cleavable linker (e.g., a P2A (SEQ ID NO: 13).
- the gene editor polypeptide includes a nuclear localization signal (NLS).
- the gene editor polypeptide comprises a mitochondrial localization signal.
- the gene editor polypeptide comprises: from N-terminus to C- terminus: a DNA binding nickase domain; a P2A linker (SEQ ID NO: 13), a reverse transcriptase; a mXTEN linker, and an integration enzyme.
- the gene editor polypeptide comprises: from N-terminus to C- terminus: an integration enzyme, a mXTEN linker (SEQ ID NO: 558), a DNA binding nuclease domain; a mXTEN linker (SEQ ID NO: 558), and a reverse transcriptase.
- This disclosure also features systems capable of site-specifically integrating at least a first integration recognition site into the genome of a cell where the system includes any of the gene editor polypeptides described herein. [0171] This disclosure also features systems capable of site-specifically integrating an exogenous nucleic acid sequence into the genome of a cell where the system includes any of the gene editor polypeptides described herein. 6.11. Gene Editor Polynucleotide Compositions [0172] In certain embodiments, a polynucleotide encodes the gene editor polypeptide. In some embodiments, the polynucleotide is an RNA or DNA. In some embodiments, the polynucleotide is a mRNA.
- an expression vector comprises the polynucleotide encoding the gene editor polypeptide.
- the expression vector further comprises any one or more of a one or more of a promoter sequence, atgRNA, nicking gRNA (ngRNA), donor Attorney Ref: 38909-52322/WO Client Ref: 005WO polynucleotide template, or donor polynucleotide template complement.
- two or more expression vectors are delivered.
- the expression vector is recombinant adenovirus, helper dependent adenovirus, AAV, lentivirus, HSV, annelovirus, retrovirus, Doggybone DNA, minicircle, plasmid, miniDNA, nanoplasmid, exosome, fusosome, or lipid nanoparticle. 6.12. Programmable Gene Insertion [0175] In some embodiments, the polypeptides described herein are used for programmable polynucleotide (e.g., genes or gene fragments) insertion.
- Programmable polynucleotide/gene insertion includes site-specifically integrating an exogenous nucleic acid sequence (e.g., a donor polynucleotide template) into a mammalian cell genome at a desired target site (e.g., a target DNA sequence).
- an exogenous nucleic acid sequence e.g., a donor polynucleotide template
- a desired target site e.g., a target DNA sequence
- the method of site-specifically integrating an exogenous nucleic acid sequence into a target nucleic acid comprises incorporating an integration recognition site into a cell by delivering: (i) a polynucleotide encoding a gene editor polypeptide (e.g., any of the gene editor polypeptides described herein), (ii) at least a first target specific atgRNAs comprising a first integration recognition site, for example, as described in Section 6.7, and (iii) optionally, a nicking guide RNA.
- a gene editor polypeptide e.g., any of the gene editor polypeptides described herein
- at least a first target specific atgRNAs comprising a first integration recognition site, for example, as described in Section 6.7
- optionally, a nicking guide RNA optionally, a nicking guide RNA.
- the first atgRNA comprises: (i) a domain that is capable of guiding the prime editor system to a target sequence; and (ii) a reverse transcriptase (RT) template that comprises at least a portion of an at least first integration recognition site, whereby the at least portion of the at least first integration recognition site is integrated into the genome of the cell at the target sequence.
- RT reverse transcriptase
- the method of site-specifically integrating an exogenous nucleic acid sequence into a target nucleic acid comprises incorporating an integration recognition site into a cell by delivering: (i) a polynucleotide encoding a gene editor polypeptide (e.g., any of the gene editor polypeptides described herein), (ii) a first target specific atgRNA and a second target specific atgRNA.
- a gene editor polypeptide e.g., any of the gene editor polypeptides described herein
- the first atgRNA and the second atgRNA are an at least first pair of atgRNAs, wherein the at least first pair of atgRNAs have domains that are capable of guiding the prime editor system to a target sequence;
- the first atgRNA further includes a first RT template that comprises at least a portion of an at least first integration recognition site;
- the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site, and the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site, whereby the first integration recognition site is integrated into the genome of the cell at the target sequence.
- the method includes integrating the polynucleotide in the cellular genome by delivering a donor polynucleotide template, wherein the donor polynucleotide template is comprised of a second integration recognition site that is a cognate pair of the first integration recognition site, wherein the donor polynucleotide is integrated into the cellular genome at the first integration target recognition site by the gene editor polynucleotide.
- the method of site-specifically integrating an exogenous nucleic acid sequence into a mammalian cell genome at a desired target site comprises incorporating an integration target site into a cell by delivering: an expression vector of comprising a polynucleotide encoding a gene editor polypeptide (e.g., any of the gene editor polypeptides described herein), further comprising a promoter, one or more cell genome target specific atgRNA comprising an integration target recognition site, for example, as described in Section 6.7, a donor polynucleotide comprising an integration target site, and optionally, a ngRNA.
- a gene editor polypeptide e.g., any of the gene editor polypeptides described herein
- the method of polynucleotide integration into a cell genome comprises incorporating an integration target site into a cell by delivering: an expression vector of comprising a polynucleotide encoding a gene editor polypeptide (e.g., any of the gene editor polypeptides described herein), further comprising a promoter, two cell genome target specific atgRNA where the two atgRNA collectively encode the entirety of the an integration recognition site, and a donor polynucleotide comprising an integration target site.
- a gene editor polypeptide e.g., any of the gene editor polypeptides described herein
- the method of site-specifically integrating an exogenous nucleic acid sequence into a genome of a cell includes incorporating an integration recognition site into a cell by delivering: any of the polynucleotides described herein; at least a first genome target specific atgRNA comprising a first integration recognition site, for example, as described in Section 6.7; optionally, a nicking gRNA; and integrating an exogenous nucleic acid sequence into the cellular genome by delivering into the cell: a donor polynucleotide template, wherein the donor polynucleotide template is comprised of the exogenous nucleic acid sequence positioned relative to a second integration recognition site that is a cognate pair of the first integration recognition site such that the exogenous nucleic acid sequence is integrated into the cellular genome at the first integration target recognition site when contacted with the integration enzyme and the first integration recognition site.
- the polynucleotides and the at least first atgRNA are delivered concurrently with the donor polynucleotide template.
- the method of site-specifically integrating an exogenous nucleic acid sequence into a target nucleic acid includes: incorporating an integration recognition site into a cell by delivering: any of the expression vectors described herein; and integrating an exogenous Attorney Ref: 38909-52322/WO Client Ref: 005WO nucleic acid sequence into the cellular genome by integrating a donor polynucleotide template, wherein the donor polynucleotide template is comprised of the exogenous nucleic acid sequence positioned relative to a second integration recognition site that is a cognate pair of the first integration recognition site such that the exogenous nucleic acid sequence is integrated into the cellular genome at the first integration target recognition site when contacted with the integration enzyme and the first integration recognition site.
- the expression vector and the donor polynucleotide template are delivered to the cell concurrently.
- a method comprises recombination or integration into a safe harbor site (SHS).
- SHS safe harbor site
- a frequently used human SHS is the AAVS1 site on chromosome 19q, initially identified as a site for recurrent adeno-associated virus insertion.
- Another locus comprises the human homolog of the murine Rosa26 locus.
- Yet another SHS comprises the human H11 locus on chromosome 22.
- a method of the disclosure comprises recombining corrective gene fragments into a defective locus.
- the methods and compositions can be used to target, without limitation, stem cells for example induced pluripotent stem cells (iPSCs), HSCs, HSPCs, mesenchymal stem cells, or neuronal stem cells and cells at various stages of differentiation.
- methods and compositions of the disclosure are adapted to target organoids, including patient derived organoids.
- methods and compositions of the disclosure are adapted to treat muscle cells, not limited to cardiomyocytes for Duchene Muscular Dystrophy (DMD).
- the dystrophin gene is the largest gene in the human genome, spanning ⁇ 2.3 Mb of DNA. DMD is composed of 79 exons resulting in a 14-kb full-length mRNA. Common mutations include mutations that disrupt the reading frame of generate a premature stop codon. An aspect of DMD that lends it to gene editing as a therapeutic approach is the modular structure of the dystrophin protein. Redundancy in the central rod domain permits the deletion of internal segments of the gene that may harbor loss-of-function mutations, thereby restoring the open reading frame (ORFs).
- the methods and systems described herein are used to treat DMD by site- Attorney Ref: 38909-52322/WO Client Ref: 005WO specifically integrating in the genome a polynucleotide template that repairs or replaces all or a portion of the defective DMD gene.
- Inherited Retinal Diseases ⁇ Stargardt Disease (ABCA4) ⁇ Leber congenital amaurosis 10 (CEP290) ⁇ X linked Retinitis Pigmentosa (RPGR) ⁇ Autosomal Dominant Retinitis Pigmentosa (RHO) Liver Diseases: ⁇ Wilson’s disease (ATP7B) ⁇ Alpha-1 antitrypsin (SERPINA1) Intellectual Disabilities: ⁇ Rett Syndrome (MECP2) ⁇ SYNGAP1-ID (SYNGAP1) ⁇ CDKL5 deficiency disorder (CDKL5) Peripheral Neuropathies: ⁇ Charcot-Marie-Tooth 2A (MFN2) Lung Diseases: ⁇ Cystic Fibrosis (CFTR) ⁇ Alpha-1 Antitrypsin (SERPINA1) Autoimmune diseases: ⁇ IgA Nephropathy (Berger’
- the most common cystic fibrosis (CF) mutation F508del removes a single amino acid.
- recombining human CFTR into an SHS of a cell that expresses CFTR F508del is a corrective treatment path.
- the methods and systems described herein are used to CF by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing CF. Proposed validation is detection of persistent CFTR mRNA and protein expression in transduced cells.
- Sickle cell disease (SCD) is caused by mutation of a specific amino acid — valine to glutamic acid at amino acid position 6.
- SCD is corrected by recombination of the HBB gene into a safe harbor site (SHS) and by demonstrating correction in a proportion of target cells that is high enough to produce a substantial benefit.
- the methods and systems described herein are used to sickle cell disease by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the disease.
- validation is detection of persistent HBB mRNA and protein expression in transduced cells.
- DMD Duchenne Muscular Dystrophy. The dystrophin gene is the largest gene in the human genome, spanning ⁇ 2.3 Mb of DNA.
- DMD is composed of 79 exons resulting in a 14- kb full-length mRNA. Common mutations include mutations that disrupt the reading frame of generate a premature stop codon.
- An aspect of DMD that lends it to gene editing as a therapeutic approach is the modular structure of the dystrophin protein. Redundancy in the central rod domain permits the deletion of internal segments of the gene that may harbor loss-of-function mutations, thereby restoring the open reading frame (ORFs).
- ORFs open reading frame
- Attorney Ref: 38909-52322/WO Client Ref: 005WO [0188] In some embodiments, recombination will be into safe harbor sites (SHS).
- a frequently used human SHS is the AAVS1 site on chromosome 19q, initially identified as a site for recurrent adeno-associated virus insertion.
- the site is the human homolog of the e murine Rosa26 locus (pubmed.ncbi.nlm.nih.gov/18037879).
- the site is the human H11 locus on chromosome 22.
- Proposed target cells for recombination include stem cells for example induced pluripotent stem cells (iPSCs) and cells at various stages of differentiation. In some cases, a complete gene may be prohibitively large and replacement of an entire gene impractical.
- rescuing mutants by recombining in corrected gene fragments with the methods and systems described herein is a corrective option.
- correcting mutations in exon 44 (or 51) by recombining in a corrective coding sequence downstream of exon 43 (or 50), using the methods and systems described herein is a corrective option.
- Proposed validation is detection of persistent DMD mRNA and protein expression in transduced cells.
- correcting factor VIII deficiency by recombining the FVIII gene into an SHS is a corrective path.
- the methods and systems described herein are used to correct factor VIII deficiency by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the FIX deficiency. Proposed validation is detection of persistent FVIII mRNA and protein expression in transduced cells.
- Factor 9 Hemophilia B, also called factor IX (FIX) deficiency is a genetic disorder caused by missing or defective factor IX, a clotting protein.
- the methods and systems described herein are used to correct factor IX deficiency by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the FIX deficiency. Proposed validation is detection of persistent FiX mRNA and protein expression in transduced cells.
- Ornithine transcarbamylase deficiency OTCD
- Ornithine transcarbamylase deficiency is a rare genetic condition that causes ammonia to build up in the blood.
- the condition Attorney Ref: 38909-52322/WO Client Ref: 005WO – more commonly called OTC deficiency – is more common in boys than girls and tends to be more severe when symptoms emerge shortly after birth.
- the methods and systems described herein are used to correct OTC deficiency by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the OTC deficiency or integrates a polynucleotide encoding a functional ornithine transcarbamylase enzyme.
- Phenylketonuria also called PKU
- PKU Phenylketonuria
- PKU Phenylketonuria
- PKU is a rare inherited disorder that causes an amino acid called phenylalanine to build up in the body.
- PKU is caused by a change in the phenylalanine hydroxylase (PAH) gene. This gene helps create the enzyme needed to break down phenylalanine.
- PAH phenylalanine hydroxylase
- the methods and systems described herein are used to correct PKU by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the PKU deficiency or integrates a polynucleotide encoding a functional phenylalanine hydroxylase (PAH) gene.
- Proposed validation is detection of persistent PAH mRNA and protein expression in transduced cells.
- Homocystinuria HCU.
- Homocystinuria is elevation of the amino acid, homocysteine (protein building block coming from our diet) in the urine or blood.
- HCU Common causes of HCU include: problems with the enzyme cystathionine beta synthase (CBS), which converts homocysteine to the amino acid cystathionine (which then becomes cysteine) and needs the vitamin B6 (pyridoxine); and roblems with converting homocysteine to the amino acid methionine.
- CBS cystathionine beta synthase
- pyridoxine pyridoxine
- the methods and systems described herein are used to correct HCU by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the HCU or integrates a polynucleotide encoding a functional copy of a gene (e.g., CBS) able to reduce or prevent buildup of homocysteine in the urine.
- a functional copy of a gene e.g., CBS
- IgA Nephropathy (Berger’s disease). IgA nephropathy, also known as Berger's disease, is a kidney/autoimmune disease that occurs when an antibody called immunoglobulin A (IgA) builds up in the kidneys. Attorney Ref: 38909-52322/WO Client Ref: 005WO [0201]
- the methods and systems described herein are used to treat Berger’s disease by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein.
- the iPSC-NK cell Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of Berger’s disease.
- native cells e.g., B cells
- the iPSC-NK cell Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of Berger’s disease.
- ANCA vasculitis is an autoimmune disease affecting small blood vessels in the body. It is caused by autoantibodies called ANCAs, or Anti-Neutrophilic Cytoplasmic Autoantibodies. ANCAs target and attack a certain kind of white blood cells called neutrophils.
- the methods and systems described herein are used to treat ANCA vasculitis by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein.
- the iPSC-NK cell Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of ANCA vasculitis.
- native cells e.g., B cells
- SLE Systemic Lupus Erythematosus
- LN Lupus Nephritis
- Lupus is an autoimmune—a disorder in which the body’s immune system attacks the body’s own cells and organs.
- the methods and systems described herein are used to treat SLE/LN by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein.
- the iPSC-NK cell Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of SLE/LN.
- MN Membranous Nephropathy
- MN is a kidney disease that affects the filters (glomeruli) of the kidney and can cause protein in the urine, as well as decreased kidney function and swelling. It can sometimes be called membranous glomerulopathy as well (these terms can be used interchangeably and mean the same thing).
- the methods and systems described herein are used to treat MN by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide Attorney Ref: 38909-52322/WO Client Ref: 005WO site-specifically integrated in the genome of the cell using the methods described herein.
- C3 glomerulonephritis C3GN.
- C3 glomerulopathy is a group of related conditions that cause the kidneys to malfunction. The major features of C3 glomerulopathy include high levels of protein in the urine (proteinuria), blood in the urine (hematuria), reduced amounts of urine, low levels of protein in the blood, and swelling in many areas of the body. Affected individuals may have particularly low levels of a protein called complement component 3 (or C3) in the blood.
- the methods and systems described herein are used to treat C3 glomerulopathy by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein.
- the iPSC-NK cell Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of C3 glomerulopathy. 6.14.
- Methods of treatment [0210] In another aspect, methods of treatment are presented. The method comprises administering an effective amount of the pharmaceutical composition comprising the nucleic acid construct or vectorized nucleic acid construct described above to a patient in need thereof.
- the polynucleotide encoding a gene editor polypeptide are delivered by a vector, wherein the vector is selected from the group consisting of recombinant adenovirus, helper dependent adenovirus, AAV, lentivirus, HSV, annelovirus, retrovirus, Doggybone TM DNA (dbDNA), minicircle, plasmid, miniDNA, exosome, fusosome, lipid nanoparticle, or nanoplasmid.
- DNA or RNA viral vectors can be administered directly to patients (in vivo), or they can be used to treat cells in vitro, and the modified cells may optionally be administered to patients (ex vivo).
- Conventional viral based systems to be used herein could include retroviral, lentivirus, adenoviral, adeno-associated and herpes simplex virus vectors for gene transfer. Integration in the host genome is possible with the retrovirus, lentivirus, and adeno-associated virus gene transfer methods, often resulting in long term expression of the inserted transgene. Additionally, high transduction efficiencies have been observed in many different cell types and target tissues.
- Methods of non-viral delivery of the polynucleotide encoding a gene editor polypeptide or the gene editor polypeptide described herein include lipofection, nucleofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA. Lipofection is described in e.g., U.S. Pat. Nos.
- lipofection reagents are sold commercially (e.g., TransfectamTM and LipofectinTM).
- Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Felgner, WO 91/17424; WO 91/16024. Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration).
- Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration).
- 6.14.1. Lipid Nanoparticle Delivery [0213]
- the gene editor polypeptide or polynucleotide encoding the gene editor polypeptide is packaged in a LNP and administered intravenously.
- the gene editor polypeptide or polynucleotide encoding the gene editor polypeptide is packaged in a LNP and administered intrathecally. In some embodiments, the gene editor polypeptide or polynucleotide encoding the gene editor polypeptide is packaged in a LNP and administered by intracerebral ventricular injection. In some embodiments, the gene editor polypeptide or polynucleotide encoding the gene editor polypeptide is packaged in a LNP and administered by intracisternal magna administration. In some embodiments, the gene editor polypeptide or polynucleotide encoding the gene editor polypeptide is packaged in a LNP and administered by intravitreal injection.
- lipid:nucleic acid complexes including targeted liposomes such as immunolipid complexes
- crystal Science 270:404-410 (1995); Blaese et al., Cancer Gene Ther.2:291-297 (1995); Behr et al., Bioconjugate Chem. 5:382-389 (1994); Remy et al., Bioconjugate Chem. 5:647-654 (1994); Gao et al., Gene Therapy 2:710-722 (1995); Ahmad et al., Cancer Res. 52:4817-4820 (1992); U.S. Pat. Nos.
- LNP doses of about 0.01 to about 1 mg per kg of body weight administered intravenously are contemplated.
- Medications to reduce the risk of infusion-related reactions are contemplated, such as dexamethasone, acetampinophen, diphenhydramine or cetirizine, and ranitidine are contemplated.
- Multiple doses of about 0.3 mg per kilogram every 4 weeks for five doses are also contemplated.
- LNPs exhibit a low surface charge compatible with longer circulation times.
- ionizable cationic lipids have been focused upon, namely 1,2-dilineoyl-3-dimethylammonium-propane (DLinDAP), 1,2- dilinoleyloxy-3-N,N-dimethylaminopropane (DLinDMA), 1,2-dilinoleyloxy-keto-N,N-dimethyl- 3-aminopropane (DLinKDMA), and 1,2-dilinoleyl-4-(2-dimethylaminoethyl)-[1,3]-dioxolane (DlinKC2-DMA).
- DLinDAP 1,2-dilineoyl-3-dimethylammonium-propane
- DLinDMA 1,2- dilinoleyloxy-3-N,N-dimethylaminopropane
- DLinKDMA 1,2-dilinoleyloxy-keto-N,
- the LNP siRNA systems containing these lipids exhibit remarkably different gene silencing properties in hepatocytes in vivo, with potencies varying according to the series DlinKC2-DMA>DLinKDMA>DLinDMA>>DLinDAP employing a Factor VII gene silencing model (see, e.g., Rosin et al, Molecular Therapy, vol.19, no.12, pages 1286-2200, December 2011).
- a dosage of 1 ⁇ g/ml of LNP in or associated with the LNP may be contemplated, especially for a formulation containing DlinKC2-DMA.
- the LNP composition comprises one or more one or more ionizable lipids.
- ionizable lipid has its ordinary meaning in the art and may refer to a lipid comprising one or more charged moieties. In some embodiments, an ionizable lipid may be positively charged or negatively charged. In principle, there are no specific limitations concerning the ionizable lipids of the LNP compositions disclosed herein.
- the one or more ionizable lipids are selected from the group consisting of 3-(didodecylamino)- N1,N1,4-tridodecyl-1-piperazineethanamine (KL10), N1-[2-(didodecylamino)ethyl]-N1,N4,N4- tridodecyl-1,4-piperazinediethanami- ne (KL22), 14,25-ditridecyl-15,18,21,24-tetraaza- octatriacontane (KL25), 1,2-dilinoleyloxy-N,N-dimethylaminopropane (Dlin-DMA), 2,2- dilinoleyl-4-dimethylaminomethyl-[1,3]-dioxolane (Dlin-K-DMA), heptatriaconta-6,9,28,31- tetraen-19-yl 4-(
- the ionizable lipid may be selected from, but not limited to, an ionizable lipid described in International Publication Nos. WO2013086354 and WO2013116126.
- the lipid nanoparticle may include one or more (e.g., 1, 2, 3, 4, 5, 6, 7, or 8) cationic and/or ionizable lipids.
- Such cationic and/or ionizable lipids include, but are not limited to, 3-(didodecylamino)-N1,N1,4-tridodecyl-1-piperazineethanamine (KL10), N1-[2- (didodecylamino)ethyl]-N1,N4,N4-tridodecyl-1,4-piperazinediethanami- ne (KL22), 14,25- ditridecyl-15,18,21,24-tetraaza-octatriacontane (KL25), 1,2-dilinoleyloxy-N,N- dimethylaminopropane (Dlin-DMA), 2,2-dilinoleyl-4-dimethylaminomethyl-[1,3]-dioxolane (Dlin-K-DMA), heptatriaconta-6,9,28,31-tetraen-19-yl 4-(dimethylamino
- LIPOFECTIN.RTM including DOTMA and DOPE, available from GIBCO/BRL
- LIPOFECTAMINE.RTM including DOSPA and DOPE, available from GIBCO/BRL
- KL10, KL22, and KL25 are described, for example, in U.S. Pat. No.8,691,750.
- the LNP composition comprises one or more amino lipids.
- amino lipid and “cationic lipid” are used interchangeably herein to include those lipids and salts thereof having one, two, three, or more fatty acid or fatty alkyl chains and a pH-titratable amino head group (e.g., an alkylamino or dialkylamino head group).
- a pH-titratable amino head group e.g., an alkylamino or dialkylamino head group.
- the cationic lipid is typically protonated (i.e., positively charged) at a pH below the pKa of the cationic lipid and is substantially neutral at a pH above the pKa.
- the cationic lipids can also be termed titratable cationic lipids.
- the one or more cationic lipids include: a protonatable tertiary amine (e.g., pH-titratable) head group; alkyl chains, wherein each alkyl chain independently has 0 to 3 (e.g., 0, 1, 2, or 3) double bonds; and ether, ester, or ketal linkages between the head group and alkyl chains.
- Such cationic lipids include, but are not limited to, DSDMA, DODMA, DOTMA, DLinDMA, DLenDMA, .gamma.-DLenDMA, Dlin-K-DMA, Dlin-K-C2- DMA (also known as Dlin-C2K-DMA, XTC2, and C2K), Dlin-K-C3-DMA, Dlin-K-C4-DMA, Dlen-C2K-DMA, y-Dlen-C2-DMA, C12-200, cKK-E12, cKK-A12, cKK-O12, Dlin-MC2-DMA (also known as MC2), and Dlin-MC3-DMA (also known as MC3).
- Anionic lipids suitable for use in lipid nanoparticles include, but are not limited to, phosphatidylglycerol, cardiolipin, diacylphosphatidylserine, diacylphosphatidic acid, N- dodecanoyl phosphatidylethanoloamine, N-succinyl phosphatidylethanolamine, N-glutaryl phosphatidylethanolamine, lysylphosphatidylglycerol, and other anionic modifying groups joined to neutral lipids.
- Neutral lipids suitable for use in lipid nanoparticles include, but are not limited to, diacylphosphatidylcholine, diacylphosphatidylethanolamine, ceramide, sphingomyelin, dihydrosphingomyelin, cephalin, sterols (e.g., cholesterol) and cerebrosides.
- the lipid nanoparticle comprises cholesterol.
- Lipids having a variety of acyl chain groups of varying chain length and degree of saturation are available or may be isolated or synthesized by well-known techniques. Additionally, lipids having mixtures of saturated and unsaturated fatty acid chains and cyclic regions can be used.
- the neutral lipids used in the disclosure are DOPE, DSPC, DPPC, POPC, or any related phosphatidylcholine.
- the neutral lipid may be composed of sphingomyelin, dihydrosphingomyeline, or phospholipids with other head groups, such as serine and inositol.
- amphipathic lipids are included in nanoparticles. Exemplary amphipathic lipids suitable for use in nanoparticles include, but are not limited to, sphingolipids, phospholipids, fatty acids, and amino lipids.
- the lipid composition of the pharmaceutical composition may comprise one or more phospholipids, for example, one or more saturated or (poly)unsaturated phospholipids or a Attorney Ref: 38909-52322/WO Client Ref: 005WO combination thereof.
- phospholipids comprise a phospholipid moiety and one or more fatty acid moieties.
- a phospholipid moiety can be selected, for example, from the non-limiting group consisting of phosphatidyl choline, phosphatidyl ethanolamine, phosphatidyl glycerol, phosphatidyl serine, phosphatidic acid, 2-lysophosphatidyl choline, and a sphingomyelin.
- a fatty acid moiety can be selected, for example, from the non-limiting group consisting of lauric acid, myristic acid, myristoleic acid, palmitic acid, palmitoleic acid, stearic acid, oleic acid, linoleic acid, alpha-linolenic acid, erucic acid, phytanoic acid, arachidic acid, arachidonic acid, eicosapentaenoic acid, behenic acid, docosapentaenoic acid, and docosahexaenoic acid.
- Particular amphipathic lipids can facilitate fusion to a membrane.
- a cationic phospholipid can interact with one or more negatively charged phospholipids of a membrane (e.g., a cellular or intracellular membrane). Fusion of a phospholipid to a membrane can allow one or more elements (e.g., a therapeutic agent) of a lipid-containing composition (e.g., LNPs) to pass through the membrane permitting, e.g., delivery of the one or more elements to a target tissue.
- a lipid-containing composition e.g., LNPs
- Non-natural amphipathic lipid species including natural species with modifications and substitutions including branching, oxidation, cyclization, and alkynes are also contemplated.
- a phospholipid can be functionalized with or cross-linked to one or more alkynes (e.g., an alkenyl group in which one or more double bonds is replaced with a triple bond).
- alkynes e.g., an alkenyl group in which one or more double bonds is replaced with a triple bond.
- an alkyne group can undergo a copper-catalyzed cycloaddition upon exposure to an azide.
- Such reactions can be useful in functionalizing a lipid bilayer of a nanoparticle composition to facilitate membrane permeation or cellular recognition or in conjugating a nanoparticle composition to a useful component such as a targeting or imaging moiety (e.g., a dye).
- Phospholipids include, but are not limited to, glycerophospholipids such as phosphatidylcholines, phosphatidylethanolamines, phosphatidylserines, phosphatidylinositols, phosphatidy glycerols, and phosphatidic acids. Phospholipids also include phosphosphingolipid, such as sphingomyelin. [0229] In some embodiments, the LNP composition comprises one or more phospholipids.
- the phospholipid is selected from the group consisting of 1,2-dilinoleoyl-sn- glycero-3-phosphocholine (DLPC), 1,2-dimyristoyl-sn-glycero-phosphocholine (DMPC), 1,2- dioleoyl-sn-glycero-3-phosphocholine (DOPC), 1,2-dipalmitoyl-sn-glycero-3-phosphocholine Attorney Ref: 38909-52322/WO Client Ref: 005WO (DPPC), 1,2-distearoyl-sn-glycero-3-phosphocholine (DSPC), 1,2-diundecanoyl-sn-glycero- phosphocholine (DUPC), 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC), 1,2-di-O- octadecenyl-sn-glycero-3-phosphocholine (18:0 Diether
- the LNP composition comprises one or more helper lipids.
- helper lipid refers to lipids that enhance transfection (e.g., transfection of an LNP comprising an mRNA that encodes a site-directed endonuclease, such as a SpCas9 polypeptide).
- helper lipids of the LNP compositions there are no specific limitations concerning the helper lipids of the LNP compositions disclosed herein. Without being bound to any particular theory, it is believed that the mechanism by which the helper lipid enhances transfection includes enhancing particle stability. In some embodiments, the helper lipid enhances membrane fusogenicity.
- the helper lipid of the LNP compositions disclosure herein can be any helper lipid known in the art. Non-limiting examples of helper lipids suitable for the compositions and methods include steroids, sterols, and alkyl resorcinols.
- helper lipids suitable for use in the present disclosure include, but are not limited to, saturated phosphatidylcholine (PC) such as distearoyl-PC (DSPC) and dipalymitoyl-PC (DPPC), dioleoylphosphatidylethanolamine (DOPE), 1,2-dilinoleoyl-sn- glycero-3-phosphocholine (DLPC), cholesterol, 5-heptadecylresorcinol, and cholesterol hemisuccinate.
- the helper lipid of the LNP composition includes cholesterol.
- the LNP composition comprises one or more structural lipids.
- structural lipid refers to sterols and also to lipids containing sterol moieties. Without being bound to any particular theory, it is believed that the incorporation of structural Attorney Ref: 38909-52322/WO Client Ref: 005WO lipids into the LNPs mitigates aggregation of other lipids in the particle.
- Structural lipids can be selected from the group including but not limited to, cholesterol, fecosterol, sitosterol, ergosterol, campesterol, stigmasterol, brassicasterol, tomatidine, tomatine, ursolic acid, alpha-tocopherol, hopanoids, phytosterols, steroids, and mixtures thereof.
- the structural lipid is a sterol.
- sterols are a subgroup of steroids consisting of steroid alcohols.
- the structural lipid is a steroid.
- the structural lipid is cholesterol.
- the structural lipid is an analog of cholesterol.
- the lipid component of a lipid nanoparticle composition may include one or more molecules comprising polyethylene glycol, such as PEG or PEG-modified lipids.
- the LNP composition disclosed herein comprise one or more polyethylene glycol (PEG) lipid.
- PEG-lipid refers to polyethylene glycol (PEG)-modified lipids.
- PEGylated lipids are also referred to as PEGylated lipids.
- PEG-lipids include PEG- modified phosphatidylethanolamine and phosphatidic acid, PEG-ceramide conjugates (e.g., PEG- CerC14 or PEG-CerC20), PEG-modified dialkylamines and PEG-modified 1,2-diacyloxypropan- 3-amines
- a PEG lipid can be PEG-c-DOMG, PEG-DMG, PEG-DLPE, PEG-DMPE, PEG-DPPC, or a PEG-DSPE lipid.
- the PEG-lipid includes, but not limited to 1,2-dimyristoyl-sn-glycerol methoxypolyethylene glycol (PEG-DMG), 1,2-distearoyl-sn- glycero-3-phosphoethanolamine-N-[amino(polyethylene glycol)] (PEG-DSPE), PEG-disteryl glycerol (PEG-DSG), PEG-dipalmetoleyl, PEG-dioleyl, PEG-distearyl, PEG-diacylglycamide (PEG-DAG), PEG-dipalmitoyl phosphatidylethanolamine (PEG-DPPE), or PEG-1,2- dimyristyloxlpropyl-3-amine (PEG-c-DMA).
- PEG-DMG 1,2-dimyristoyl-sn-glycerol methoxypolyethylene glycol
- PEG-DSPE 1,2-distearoyl-sn-
- the PEG-lipid is selected from the group consisting of a PEG-modified phosphatidylethanolamine, a PEG-modified phosphatidic acid, a PEG-modified ceramide, a PEG-modified dialkylamine, a PEG-modified diacylglycerol, a PEG-modified dialkylglycerol, and mixtures thereof.
- the lipid moiety of the PEG-lipids includes those having lengths of from about C.sub.14 to about C.sub.22, preferably from about C.sub.14 to about C.sub.16.
- a PEG moiety for example a mPEG-NH.sub.2, has a size of about 1000, 2000, 5000, 10,000, 15,000 or 20,000 daltons.
- the PEG-lipid is PEG2k-DMG.
- the one or more PEG lipids of the LNP composition comprises PEG-DMPE.
- the one or more PEG lipids of the LNP composition comprises PEG-DMG.
- the ratio between the lipid components and the nucleic acid molecules of the LNP composition is sufficient for (i) formation of LNPs with desired characteristics, e.g., size, charge, and (ii) delivery of a sufficient dose of nucleic acid Attorney Ref: 38909-52322/WO Client Ref: 005WO at a dose of the lipid component(s) that is tolerable for in vivo administration as readily ascertained by one of skill in the art.
- a nanoparticle may be targeted to a particular cell, tissue, and/or organ using a targeting moiety.
- a nanoparticle comprises a targeting moiety.
- targeting moieties include ligands, cell surface receptors, glycoproteins, vitamins (e.g., riboflavin) and antibodies (e.g., full-length antibodies, antibody fragments (e.g., Fv fragments, single chain Fv (scFv) fragments, Fab’ fragments, or F(ab’)2 fragments), single domain antibodies, camelid antibodies and fragments thereof, human antibodies and fragments thereof, monoclonal antibodies, and multispecific antibodies (e.g., bispecific antibodies)).
- the targeting moiety may be a polypeptide.
- the targeting moiety may include the entire polypeptide (e.g., peptide or protein) or fragments thereof.
- a targeting moiety is typically positioned on the outer surface of the nanoparticle in such a manner that the targeting moiety is available for interaction with the target, for example, a cell surface receptor.
- a variety of different targeting moieties and methods are known and available in the art, including those described, e.g., in Sapra et al., Prog. Lipid Res.42(5):439-62, 2003 and Abra et al., J. Liposome Res.12:1-3, 2002.
- a lipid nanoparticle may include a surface coating of hydrophilic polymer chains, such as polyethylene glycol (PEG) chains (see, e.g., Allen et al., Biochimica et Biophysica Acta 1237: 99-108, 1995; DeFrees et al., Journal of the American Chemistry Society 118: 6101-6104, 1996; Blume et al., Biochimica et Biophysica Acta 1149: 180- 184,1993; Klibanov et al., Journal of Liposome Research 2: 321-334, 1992; U.S. Pat. No.
- PEG polyethylene glycol
- a targeting moiety for targeting the lipid nanoparticle is linked to the polar head group of lipids forming the nanoparticle.
- the targeting moiety is attached to the distal ends of the PEG chains forming the hydrophilic polymer coating (see, e.g., Klibanov et al., Journal of Liposome Research 2: 321-334, 1992; Kirpotin et al., FEBS Letters 388: 115-118, 1996).
- Standard methods for coupling the targeting moiety or moieties may be used.
- phosphatidylethanolamine which can be activated for attachment of targeting moieties, or derivatized lipophilic compounds, such as lipid-derivatized bleomycin, can be used.
- Antibody- targeted liposomes can be constructed using, for instance, liposomes that incorporate protein A Attorney Ref: 38909-52322/WO Client Ref: 005WO (see, e.g., Renneisen et al., J. Bio. Chem., 265:16337-16342, 1990 and Leonetti et al., Proc. Natl. Acad. Sci. (USA), 87:2448-2451, 1990).
- Other examples of antibody conjugation are disclosed in U.S. Pat. No.6,027,726.
- targeting moieties can also include other polypeptides that are specific to cellular components, including antigens associated with neoplasms or tumors.
- Polypeptides used as targeting moieties can be attached to the liposomes via covalent bonds (see, for example Heath, Covalent Attachment of Proteins to Liposomes, 149 Methods in Enzymology 111-119 (Academic Press, Inc.1987)).
- Other targeting methods include the biotin-avidin system.
- a lipid nanoparticle includes a targeting moiety that targets the lipid nanoparticle to a cell including, but not limited to, hepatocytes, colon cells, epithelial cells, hematopoietic cells, epithelial cells, endothelial cells, lung cells, bone cells, stem cells, mesenchymal cells, neural cells, cardiac cells, adipocytes, vascular smooth muscle cells, cardiomyocytes, skeletal muscle cells, beta cells, pituitary cells, synovial lining cells, ovarian cells, testicular cells, fibroblasts, B cells, T cells, reticulocytes, leukocytes, granulocytes, and tumor cells (including primary tumor cells and metastatic tumor cells).
- a targeting moiety that targets the lipid nanoparticle to a cell including, but not limited to, hepatocytes, colon cells, epithelial cells, hematopoietic cells, epithelial cells, endothelial cells, lung cells, bone cells, stem cells, me
- the targeting moiety targets the lipid nanoparticle to a hepatocyte.
- the lipid nanoparticles described herein may be lipidoid-based. The synthesis of lipidoids has been extensively described and formulations containing these compounds are particularly suited for delivery of polynucleotides (see Mahon et al., Bioconjug Chem. 201021:1448-1454; Schroeder et al., J Intern Med. 2010267:9-21; Akinc et al., Nat. Biotechnol. 200826:561-569; Love et al., Proc Natl Acad Sci USA.
- lipidoid formulations for intramuscular or subcutaneous routes may vary significantly depending on the target cell type and the ability of formulations to diffuse through the extracellular matrix into the blood stream.
- lipidoid oligonucleotides to deliver the formulation to other cells types including, but not limited to, endothelial cells, myeloid cells, and muscle cells may not be similarly size-limited.
- effective delivery to myeloid cells, such as monocytes, lipidoid formulations may have a similar component molar ratio.
- lipidoids and other components including, but not limited to, a neutral lipid (e.g., diacylphosphatidylcholine), cholesterol, a PEGylated lipid (e.g., PEG-DMPE), and a fatty acid (e.g., an omega-3 fatty acid) may be used to Attorney Ref: 38909-52322/WO Client Ref: 005WO optimize the formulation of the mRNA or system for delivery to different cell types including, but not limited to, hepatocytes, myeloid cells, muscle cells, etc.
- a neutral lipid e.g., diacylphosphatidylcholine
- cholesterol e.g., a PEGylated lipid
- PEG-DMPE PEGylated lipid
- a fatty acid e.g., an omega-3 fatty acid
- Exemplary lipidoids include, but are not limited to, Dlin-DMA, Dlin-K-DMA, Dlin-KC2-DMA, 98N12-5, C12-200 (including variants and derivatives), Dlin-MC3-DMA and analogs thereof.
- the use of lipidoid formulations for the localized delivery of nucleic acids to cells (such as, but not limited to, adipose cells and muscle cells) via either subcutaneous or intramuscular delivery may also not require all of the formulation components which may be required for systemic delivery, and as such may comprise the lipidoid and the mRNA or system.
- a system described herein may be formulated by mixing the mRNA or system, or individual components of the system, with the lipidoid at a set ratio prior to addition to cells.
- In vivo formulations may require the addition of extra ingredients to facilitate circulation throughout the body.
- a system or individual components of a system is added and allowed to integrate with the complex. The encapsulation efficiency is determined using a standard dye exclusion assays.
- In vivo delivery of systems may be affected by many parameters, including, but not limited to, the formulation composition, nature of particle PEGylation, degree of loading, oligonucleotide to lipid ratio, and biophysical parameters such as particle size (Akinc et al., Mol Ther. 2009 17:872-879; herein incorporated by reference in its entirety).
- particle size Akinc et al., Mol Ther. 2009 17:872-879; herein incorporated by reference in its entirety.
- small changes in the anchor chain length of poly(ethylene glycol) (PEG) lipids may result in significant effects on in vivo efficacy.
- Formulations with the different lipidoids including, but not limited to penta[3-(1- laurylaminopropionyl)]-triethylenetetramine hydrochloride (TETA-5LAP; aka 98N12-5, see Murugaiah et al., Analytical Biochemistry, 401:61 (2010)), C12-200 (including derivatives and variants), MD1, Dlin-DMA, Dlin-K-DMA, Dlin-KC2-DMA and Dlin-MC3-DMA can be tested for in vivo activity.
- the lipidoid referred to herein as “98N12-5” is disclosed by Akinc et al., Mol Ther.200917:872-879).
- LNPs in which a nucleic acid is entrapped within the lipid portion of the particle and is protected from degradation can be formed by any method known in the art including, but not limited to, a continuous mixing method, a direct dilution process, and an in-line dilution process. Additional techniques and methods suitable for the preparation of the LNPs described herein include coacervation, microemulsions, supercritical fluid technologies, phase-inversion temperature (PIT) techniques.
- PIT phase-inversion temperature
- the LNPs used herein are produced via a continuous mixing method, e.g., a process that includes providing an aqueous solution a nucleic acid described herein in a first reservoir, providing an organic lipid solution in a second reservoir (wherein the lipids present in the organic lipid solution are solubilized in an organic solvent, e.g., a lower alkanol such as ethanol), and mixing the aqueous solution with the organic lipid solution such that the organic lipid solution mixes with the aqueous solution so as to substantially instantaneously produce a lipid vesicle (e.g., liposome) encapsulating the nucleic acid molecule within the lipid vesicle.
- a continuous mixing method e.g., a process that includes providing an aqueous solution a nucleic acid described herein in a first reservoir, providing an organic lipid solution in a second reservoir (wherein the lipids present in the organic lipid solution are solubilized in an organic solvent
- the LNPs used herein are produced via a direct dilution process that includes forming a lipid vesicle (e.g., liposome) solution and immediately and directly introducing the lipid vesicle solution into a collection vessel containing a controlled amount of dilution buffer.
- the collection vessel includes one or more elements configured to stir the contents of the collection vessel to facilitate dilution.
- the amount of dilution buffer present in the collection vessel is substantially equal to the volume of lipid vesicle solution introduced thereto.
- the LNPs are produced via an in-line dilution process in which a third reservoir containing dilution buffer is fluidly coupled to a second mixing region.
- the lipid vesicle (e.g., liposome) solution formed in a first mixing region is immediately and directly mixed with dilution buffer in the second mixing region.
- a “vector” is a tool that allows or facilitates the transfer of an entity from one environment to another. It is a replicon, such as a plasmid, phage, or cosmid, into which another DNA segment may be inserted so as to bring about the replication of the inserted segment. Generally, a vector is capable of replication when associated with the proper control elements. In general, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked.
- Vectors include, but are not limited to, nucleic acid molecules that are single- stranded, double-stranded, or partially double-stranded; nucleic acid molecules that comprise one or more free ends, no free ends (e.g. circular); nucleic acid molecules that comprise DNA, RNA, or both; and other varieties of polynucleotides known in the art.
- plasmid refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques.
- viral vector wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g.
- Viral vectors also include polynucleotides carried by a virus for transfection into a host cell.
- Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g. bacterial vectors having a bacterial origin of replication and episomal mammalian vectors).
- Other vectors e.g., non-episomal mammalian vectors are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome.
- vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as “expression vectors.” Vectors for and that result in expression in a eukaryotic cell can be referred to herein as “eukaryotic expression vectors.” Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids.
- Recombinant expression vectors can comprise a nucleic acid of the disclosure in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence to be expressed.
- “operably linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g. in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell).
- Vector delivery e.g., plasmid, viral delivery:
- the CRISPR enzyme for instance a Type V protein such as C2c1 or C2c3, and/or any of the present RNAs, for instance a guide RNA, can be delivered using any suitable vector, e.g., plasmid or viral vectors, such as adeno associated virus (AAV), lentivirus, adenovirus or other viral vector types, or combinations thereof.
- Effector proteins and one or more guide RNAs can be packaged into one or more vectors, e.g., plasmid or viral vectors.
- the vector e.g., plasmid or viral vector is delivered to the tissue of interest by, for example, an intramuscular injection, while other times the delivery is via intravenous, transdermal, intranasal, oral, mucosal, or other delivery methods. Such delivery may be either via a single dose, or multiple doses.
- the actual dosage to be delivered herein may vary greatly depending upon a variety of factors, such as the vector choice, the target cell, organism, or tissue, the general condition of the subject to be treated, the degree of transformation/modification sought, the administration route, the administration mode, the type of transformation/modification sought, etc.
- Such a dosage may further contain, for example, a carrier (water, saline, ethanol, glycerol, lactose, sucrose, calcium phosphate, gelatin, dextran, agar, pectin, peanut oil, sesame oil, etc.), a diluent, a pharmaceutically-acceptable carrier (e.g., phosphate-buffered saline), a pharmaceutically-acceptable excipient, and/or other compounds known in the art.
- a carrier water, saline, ethanol, glycerol, lactose, sucrose, calcium phosphate, gelatin, dextran, agar, pectin, peanut oil, sesame oil, etc.
- a pharmaceutically-acceptable carrier e.g., phosphate-buffered saline
- a pharmaceutically-acceptable excipient e.g., phosphate-buffered saline
- the dosage may further contain one or more pharmaceutically acceptable salts such as, for example, a mineral acid salt such as a hydrochloride, a hydrobromide, a phosphate, a sulfate, etc.; and the salts of organic acids such as acetates, propionates, malonates, benzoates, etc.
- auxiliary substances such as wetting or emulsifying agents, pH buffering substances, gels or gelling materials, flavorings, colorants, microspheres, polymers, suspension agents, etc. may also be present herein.
- one or more other conventional pharmaceutical ingredients such as preservatives, humectants, suspending agents, surfactants, antioxidants, anticaking agents, fillers, chelating agents, coating agents, chemical stabilizers, etc. may also be present, especially if the dosage form is a reconstitutable form.
- suitable exemplary ingredients include microcrystalline cellulose, carboxymethylcellulose sodium, polysorbate 80, phenylethyl alcohol, chlorobutanol, potassium sorbate, sorbic acid, sulfur dioxide, propyl gallate, the parabens, ethyl vanillin, glycerin, phenol, parachlorophenol, gelatin, albumin and a combination thereof.
- the delivery is via an adenovirus, which may be at a single booster dose containing at least 1 x 10 5 particles (also referred to as particle units, pu) of adenoviral vector.
- the dose preferably is at least about 1 x 10 6 particles (for example, about 1 x 10 6 -1 x 10 11 particles), more preferably at least about 1 x 10 7 particles, more preferably at least about 1 x 10 8 particles (e.g., about 1 x 10 8 -1 x 10 11 particles or about 1 x 10 9 -1 x 10 12 particles), and most preferably at least about 1 x 10 10 particles (e.g., about 1 x 10 9 -1 x 10 10 particles or about 1 x 10 9 -1 x 10 12 particles), or even at least about 1 x 10 10 particles (e.g., about 1 x 10 10 -1 x 10 12 particles) of the adenoviral vector.
- 1 x 10 6 particles for example, about 1 x 10 6 -1 x 10 11 particles
- 1 x 10 8 particles e.g., about 1 x 10 8 -1 x 10 11 particles or about 1 x 10 9 -1 x 10 12 particles
- the dose comprises no more than about 1 x 10 14 particles, preferably no more than about 1 x 10 13 particles, even more preferably no more than about 1 x 10 12 particles, even more preferably no more than about 1 x 10 11 particles, and most preferably no more than about 1 x 10 10 particles (e.g., no more than about 1 x 10 9 particles).
- the dose may contain a single dose of adenoviral vector with, for example, about 1 x 10 6 particle units (pu), about 2 x 10 6 pu, about 4 x 10 6 pu, about 1 x 10 7 pu, about 2 x 10 7 pu, about 4 x 10 7 pu, about 1 x 10 8 pu, about 2 x 10 8 pu, about 4 x 10 8 pu, about 1 x 10 9 pu, about 2 x 10 9 pu, about 4 x 10 9 pu, about 1 x 10 10 pu, about 2 x 10 10 pu, about 4 x 10 10 pu, about 1 x 10 11 pu, about 2 x 10 11 pu, about 4 x 10 11 pu, about 1 x 10 12 pu, about 2 x 10 12 pu, or about 4 x 10 12 pu of adenoviral vector.
- adenoviral vector with, for example, about 1 x 10 6 particle units (pu), about 2 x 10 6 pu, about 4 x 10 6 pu, about 1 x 10 7 pu, about 2 x 10 7 pu
- the adenovirus is delivered via multiple doses.
- the delivery is via an AAV.
- a therapeutically effective dosage for in vivo delivery of the AAV to a human is believed to be in the range of from about 20 to about 50 ml of saline solution containing from about 1 x 10 10 to about 1 x 10 10 functional AAV/ml solution. The dosage may be adjusted to balance the therapeutic benefit against any side effects.
- the AAV dose is generally in the range of concentrations of from about 1 x 10 5 to 1 x 10 50 genomes AAV, from about 1 x 10 8 to 1 x 10 20 genomes AAV, from about 1 x 10 10 to about 1 x 10 16 genomes, or about 1 x 10 11 to about 1 x 10 16 genomes AAV.
- a human dosage may be about 1 x 10 13 genomes AAV.
- concentrations may be delivered in from about 0.001 ml to about 100 ml, about 0.05 to about 50 ml, or about 10 to about 25 ml of a carrier solution.
- Other effective dosages can be readily established by one of ordinary skill in the art through routine trials establishing dose response curves. See, for example, U.S. Pat.
- the promoter used to drive nucleic acid-targeting effector protein coding nucleic acid molecule expression can include: AAV ITR can serve as a promoter: this is advantageous for eliminating the need for an additional promoter element (which can take up space in the vector). The additional space freed up can be used to drive the expression of additional elements (gRNA, Attorney Ref: 38909-52322/WO Client Ref: 005WO etc.). Also, ITR activity is relatively weaker, so can be used to reduce potential toxicity due to over expression of nucleic acid-targeting effector protein.
- promoters CMV, CAG, CBh, PGK, SV40, Ferritin heavy or light chains, etc.
- promoters For brain or other CNS expression, can use promoters: SynapsinI for all neurons, CaMKIIalpha for excitatory neurons, GAD67 or GAD65 or VGAT for GABAergic neurons, etc.
- For liver expression can use Albumin promoter.
- For lung expression can use SP-B.
- endothelial cells can use ICAM.
- hematopoietic cells can use IFNbeta or CD45.
- Osteoblasts can use OG-2.
- the promoter used to drive guide RNA can include: Pol III promoters such as U6 or H1 Use of Pol II promoter and intronic cassettes to express guide RNA Adeno Associated Virus (AAV) [0256] Nucleic acid-targeting effector protein and one or more guide RNA can be delivered using adeno associated virus (AAV), lentivirus, adenovirus or other plasmid or viral vector types, in particular, using formulations and doses from, for example, U.S. Pat. No.8,454,972 (formulations, doses for adenovirus), U.S. Pat. No.8,404,658 (formulations, doses for AAV) and U.S. Pat. No.
- 5,846,946 formulations, doses for DNA plasmids
- the route of administration, formulation and dose can be as in U.S. Pat. No. 8,454,972 and as in clinical trials involving AAV.
- the route of administration, formulation and dose can be as in U.S. Pat. No. 8,404,658 and as in clinical trials involving adenovirus.
- the route of administration, formulation and dose can be as in U.S. Pat. No. 5,846,946 and as in clinical studies involving plasmids.
- Doses may be based on or extrapolated to an average 70 kg individual (e.g., a male adult human), and can be adjusted for patients, subjects, mammals of different weight and species. Frequency of administration is within the ambit of the medical or veterinary practitioner (e.g., physician, veterinarian), depending on usual factors including the age, sex, general health, other conditions of the patient or subject and the particular condition or symptoms being addressed.
- the viral vectors can be injected into the tissue of interest.
- the expression of nucleic acid-targeting effector can be driven by a cell-type specific promoter.
- liver-specific expression might use the Albumin promoter and neuron-specific expression (e.g., for targeting CNS disorders) might use the Synapsin I promoter.
- AAV In terms of in vivo delivery, AAV is advantageous over other viral vectors for a couple of reasons: Low toxicity (this may be due to the purification method not requiring ultra centrifugation Attorney Ref: 38909-52322/WO Client Ref: 005WO of cell particles that can activate the immune response) and Low probability of causing insertional mutagenesis because it doesn’t integrate into the host genome.
- AAV has a packaging limit of 4.5 or 4.75 Kb. This means that nucleic acid-targeting effector protein (such as a Type V protein such as C2c1 or C2c3) as well as a promoter and transcription terminator have to be all fit into the same viral vector.
- embodiments of the disclosure include utilizing homologs of nucleic acid-targeting effector protein (such as a Type V protein such as C2c1 or C2c3) that are shorter.
- the AAV can be AAV1, AAV2, AAV5 or any combination thereof.
- AAV8 is useful for delivery to the liver. The herein promoters and vectors are preferred individually.
- Packaging cells are typically used to form virus particles that are capable of infecting a host cell. Such cells include 293 cells, which package adenovirus, and psi2 cells or PA317 cells, which package retrovirus.
- Viral vectors used in gene therapy are usually generated by producing a cell line that packages a nucleic acid vector into a viral particle. The vectors typically contain the minimal viral sequences required for packaging and subsequent integration into a host, other viral sequences being replaced by an expression cassette for the polynucleotide(s) to be expressed. The missing viral functions are typically supplied in trans by the packaging cell line. For example, AAV vectors used in gene therapy typically only possess ITR sequences from the AAV genome which are required for packaging and integration into the host genome.
- Viral DNA is packaged in a cell line, which contains a helper plasmid encoding the other AAV genes, namely rep and cap, but lacking ITR sequences.
- the cell line may also be infected with adenovirus as a helper.
- the helper virus promotes replication of the AAV vector and expression of AAV genes from the helper plasmid.
- the helper plasmid is not packaged in significant amounts due to a lack of ITR sequences. Contamination with adenovirus can be reduced by, e.g., heat treatment to which adenovirus is more sensitive than AAV. Additional methods for the delivery of nucleic acids to cells are known to those skilled in the art. See, for example, US20030087817, incorporated herein by reference.
- Millington-Ward et al. (Molecular Therapy, vol.19 no.4, 642-649 April 2011) describes adeno-associated virus (AAV) vectors to deliver an RNA interference (RNAi)-based rhodopsin suppressor and a codon-modified rhodopsin replacement gene resistant to suppression due to nucleotide alterations at degenerate positions over the RNAi target site.
- RNAi RNA interference
- An injection of either 6.0 x 10 8 vp or 1.8 x 10 10 vp AAV were subretinally injected into the eyes by Millington-Ward et al.
- Dalkara et al. (Sci Transl Med 5, 189ra76 (2013)) also relates to in vivo directed evolution to fashion an AAV vector that delivers wild-type versions of defective genes throughout the retina after noninjurious injection into the eyes’ vitreous humor. Dalkara describes a 7 mer peptide display library and an AAV library constructed by DNA shuffling of cap genes from AAV1, 2, 4, 5, 6, 8, and 9.
- the rcAAV libraries and rAAV vectors expressing GFP under a CAG or Rho promoter were packaged and deoxyribonuclease-resistant genomic titers were obtained through quantitative PCR.
- the libraries were pooled, and two rounds of evolution were performed, each consisting of initial library diversification followed by three in vivo selection steps.
- P30 rho-GFP mice were intravitreally injected with 2 ml of iodixanol-purified, phosphate- buffered saline (PBS)-dialyzed library with a genomic titer of about 1.times.10.sup.12 vg/ml.
- PBS phosphate- buffered saline
- Retroviral vectors are retroviral vectors that are able to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue. Retroviral vectors are comprised of cis-acting long terminal repeats with packaging capacity for up to 6-10 kb of foreign sequence.
- retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), Simian Immuno deficiency virus (SW), human immune deficiency virus (HIV), and combinations thereof (see, e.g., Buchscher et al., J. Virol. 66 :2731-2739 (1992) ; Johann et al., J.
- adenoviral based systems may be used.
- Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system.
- Adeno-associated virus (“AAV”) vectors may also be used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (see, e.g., West Attorney Ref: 38909-52322/WO Client Ref: 005WO et al., Virology 160:38-47 (1987); U.S. Pat. No. 4,797,368; WO 93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); Muzyczka, J. Clin. Invest. 94:1351 (1994).
- AAV Adeno-associated virus
- Packaging cells are typically used to form virus particles that are capable of infecting a host cell. Such cells include 293 cells, which package adenovirus, and yr2 cells or PA317 cells, which package retrovirus.
- Viral vectors used in gene therapy are usually generated by producing a cell line that packages a nucleic acid vector into a viral particle.
- the vectors typically contain the minimal viral sequences required for packaging and subsequent integration into a host, other viral sequences being replaced by an expression cassette for the polynucleotide(s) to be expressed.
- the missing viral functions are typically supplied in trans by the packaging cell line.
- AAV vectors used in gene therapy typically only possess ITR sequences from the AAV genome which are required for packaging and integration into the host genome.
- Viral DNA is packaged in a cell line, which contains a helper plasmid encoding the other AAV genes, namely rep and cap, but lacking ITR sequences.
- the cell line may also be infected with adenovirus as a helper.
- the helper virus promotes replication of the AAV vector and expression of AAV genes from the helper plasmid.
- the helper plasmid is not packaged in significant amounts due to a lack of ITR sequences. Contamination with adenovirus can be reduced by, e.g., heat treatment to which adenovirus is more sensitive than AAV. Additional methods for the delivery of nucleic acids to cells are known to those skilled in the art. See, for example, US20030087817, incorporated herein by reference. [0265]
- a host cell is transiently or non-transiently transfected with one or more vectors described herein.
- a cell is transfected as it naturally occurs in a subject.
- a cell that is transfected is taken from a subject.
- Cells taken from a subject include, but are not limited to, hepatocytes or cells isolated from muscle, the CNS, eye or lung.
- Immunological cells are also contemplated, such as but not limited to T cells, HSCs, B-cells and NK cells.
- Another useful method to deliver proteins, enzymes, and guides comprises transfection of messenger RNA (mRNA).
- mRNA delivery methods and compositions that may be utilized in the present disclosure including, for example, PCT/US2014/028330, US8822663B2, NZ700688A, ES2740248T3, EP2755693A4, EP2755986A4, WO2014152940A1, EP3450553B1, BR112016030852A2, and EP3362461A1.
- Expression of CRISPR systems in particular is Attorney Ref: 38909-52322/WO Client Ref: 005WO described by WO2020014577.
- Each of these publications are incorporated herein by reference in their entireties.
- the cell is derived from cells taken from a subject, such as a cell line.
- a cell line A wide variety of cell lines for tissue culture are known in the art.
- cell lines include, but are not limited to, C8161, CCRF-CEM, MOLT, mIMCD-3, NHDF, HeLa-S3, Huh1, Huh4, Huh7, HUVEC, HASMC, HEKn, HEKa, MiaPaCell, Panc1, PC-3, TF1, CTLL-2, CIR, Rat6, CV1, RPTE, A10, T24, J82, A375, ARH-77, Calu1, SW480, SW620, SKOV3, SK-UT, CaCo2, P388D1, SEM-K2, WEHI-231, HB56, TIB55, Jurkat, J45.01, LRMB, Bcl-1, BC-3, IC21, DLD2, Raw264.7, NRK, NRK-52E, MRC5, MEF, Hep G2, HeLa B, HeLa T4, COS, COS-1, COS-6, COS-M6A, BS-C-1 monkey kidney epithelial, BALB/3
- a cell transfected with one or more vectors described herein is used to establish a new cell line comprising one or more vector-derived sequences.
- one or more vectors described herein are used to produce a non- human transgenic animal or transgenic plant.
- the transgenic animal is a mammal, such as a mouse, rat, or rabbit.
- the organism or subject is a plant.
- the organism or subject or plant is algae.
- the disclosure provides for methods of modifying a target polynucleotide in a prokaryotic or eukaryotic cell, which may be in vivo, ex vivo or in vitro.
- the method comprises sampling a cell or population of cells from a human or non-human animal or plant (including micro-algae) and modifying the cell or cells. Culturing may occur at any stage ex vivo.
- the cell or cells may even be re-introduced into the non-human animal or plant (including micro-algae).
- pathogens are often host-specific. For example, Fusariumn oxysporum f. sp. Lycopersici causes tomato wilt but attacks only tomato, and F. oxysporum f. dianthii Puccinia graminis f. sp. Tritici attacks only wheat. Plants have existing and induced defenses to resist most pathogens. Mutations and recombination events across plant generations lead to genetic variability that gives rise to susceptibility, especially as pathogens reproduce with more frequency than plants.
- non-host resistance e.g., the host and pathogen are incompatible.
- Horizontal Resistance e.g., partial resistance against all races of a pathogen, typically controlled by many genes
- Vertical Resistance e.g., complete resistance to some races of a pathogen but not to other races, typically controlled by a few genes.
- plants and pathogens evolve together, and the genetic changes in one balance changes in other. Accordingly, using Natural Variability, breeders combine most useful genes for Yield. Quality, Uniformity, Hardiness, Resistance.
- the sources of resistance genes include native or foreign Varieties, Heirloom Varieties, Wild Plant Relatives, and Induced Mutations, e.g., treating plant material with mutagenic agents.
- plant breeders are provided with a new tool to induce mutations. Accordingly, one skilled in the art can analyze the genome of sources of resistance genes, and in Varieties having desired characteristics or traits employ the present disclosure to induce the rise of resistance genes, with more precision than previous mutagenic agents and hence accelerate and improve plant breeding programs.
- target polynucleotides include a sequence associated with a signaling biochemical pathway, e.g., a signaling biochemical pathway-associated gene or polynucleotide.
- target polynucleotides include a disease associated gene or polynucleotide.
- a “disease-associated” gene or polynucleotide refers to any gene or polynucleotide which is yielding transcription or translation products at an abnormal level or in an abnormal form in cells derived from a disease-affected tissues compared with tissues or cells of a non disease control. It may be a gene that becomes expressed at an abnormally high level; it may be a gene that becomes Attorney Ref: 38909-52322/WO Client Ref: 005WO expressed at an abnormally low level, where the altered expression correlates with the occurrence and/or progression of the disease.
- a disease-associated gene also refers to a gene possessing mutation(s) or genetic variation that is directly responsible or is in linkage disequilibrium with a gene(s) that is responsible for the etiology of a disease.
- the transcribed or translated products may be known or unknown and may be at a normal or abnormal level.
- a guide RNA comprising from 5’ to 3’: a spacer complementary to a first target genomic site; a scaffold capable of binding to a DNA binding nickase; a reverse transcriptase template comprised of at least a first segment and a second segment, wherein the first segment encodes a clamp segment complementary to a second genomic site and the second segment encodes an integrase target recognition site; and a primer binding site complementary to a third genomic site; wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence.
- Embodiment 2 The guide RNA of any one of Embodiment 1, further comprising a 3’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template first segment.
- Embodiment 3 The guide RNA of any one of Embodiments 1-2, wherein the second genomic site and the third genomic site are separated by one or more genomic bases.
- Embodiment 4. The guide RNA of Embodiment 3, wherein the second genomic site and the third genomic site are separated by 15-21, 33-39, or 35-41 genomic bases.
- Embodiment 5. The guide RNA of any one of Embodiments 1-4, wherein the clamp segment binds to the second genomic site with one or more nucleotide mismatches.
- Embodiment 6. The guide RNA of Embodiment 5, wherein the mismatches are consecutive or non-consecutive.
- Embodiment 8 A guide RNA complex, comprising: a spacer complementary to a first target genomic site; a scaffold capable of binding to a DNA binding nickase; a reverse transcriptase template comprised of at least a first segment and a second segment, wherein the first segment Attorney Ref: 38909-52322/WO Client Ref: 005WO encodes a clamp segment complementary to a second genomic site and the second segment encodes an integrase target recognition site; a primer binding site complementary to a third genomic site; wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence.
- Embodiment 9 The guide RNA complex of Embodiment 8, wherein the guide RNA complex is comprised of a first polynucleotide component and a second polynucleotide component.
- Embodiment 10 The guide RNA complex of Embodiment 8, wherein the guide RNA complex is comprised of a first polynucleotide component and a second polynucleotide component.
- Embodiment 11 Embodiment 11.
- Embodiment 12 The guide RNA or the guide RNA complex of any one of Embodiments 1-11, further comprises one or more of an RNA-protein recruitment domain, a transcriptional termination signal, a reverse transcription termination signal, a nucleobase analog, an RNA ribozyme, or a chemical linker.
- Embodiment 13 The guide RNA or the guide RNA complex of any one of Embodiments 1-11, further comprises one or more of an RNA-protein recruitment domain, a transcriptional termination signal, a reverse transcription termination signal, a nucleobase analog, an RNA ribozyme, or a chemical linker.
- Embodiment 14 The guide RNA or the guide RNA complex of any one of Embodiments 1-12, wherein the guide RNA reverse transcriptase template encodes for a first single-stranded DNA sequence that contains a complementary region to a second single-stranded DNA sequence encoded by a second guide RNA comprised of a reverse transcriptase template.
- Embodiment 14 The guide RNA or the guide RNA complex of any one of Embodiments 1-13, wherein an RNA polynucleotide comprises the guide RNA or a DNA polynucleotide encodes the guide RNA.
- adenovirus helper dependent adenovirus
- AAV lentivirus
- HSV lentivirus
- annelovirus retrovirus
- Doggybone DNA minicircle
- plasmid miniDNA
- nanoplasmid nanoplasmid
- exosome exosome
- fusosome mRNA
- RNP lipid nanoparticle
- a method of polynucleotide integration into a cellular genome comprising: incorporating an integration target recognition site into a cell genome by delivering: a gene editor protein or polynucleotide encoding a gene editor protein, wherein the gene editor protein is capable of polynucleotide genomic integration; one or more guide RNA or guide RNA complex selected from any one of Embodiments 1-16; optionally, a nicking gRNA; and integrating the polynucleotide into the cellular genome by delivering: a donor polynucleotide template, wherein the donor polynucleotide template is comprised of an integration target recognition site, wherein the donor polynucleotide is integrated into the cellular genome at the incorporated genomic integration target recognition site by the gene editor protein.
- Attachment Site-Containing guide RNA capable of partial genomic replacement demonstrate improved attachment site genomic placement
- attachment site-containing guide RNAs that are capable of site-specifically incorporating an integrase recognition site into a genomic locus were assessed for percent beacon placement (i.e., percent integration efficiency of integration recognition site).
- an atgRNA targeting the LMBN1 locus and capable of site-specifically incorporating an integrase recognition site into the LMBN1 locus were synthesized to allow for a 0 nucleotide separation (see atgRNA (v1) in FIGs.3A-3C), 18 nucleotide separation (see, e.g., atgRNA (v2) in FIGs. 4A-4C where the separation is 18 nucleotides), or 36 nucleotide separation (see, e.g., atgRNA (v2) in FIGs. 4A-4C where the separation is 36 nucleotides), respectively, between the third genomic site (i.e., PBS binding site) and the second genomic site (i.e., clamp binding site).
- the third genomic site i.e., PBS binding site
- the second genomic site i.e., clamp binding site
- AtgRNAs incorporated a clamp segment 10 nucleotides long and a 38 nucleotide integration recognition site (e.g., AttB or AttP).
- AtgRNAs were electroporated along with mRNA encoding a gene editor polypeptide (i.e., a Cas9-nickase-reverse transcriptase fusion) and a nicking guide RNA (ngRNA) into HEK 293 cells.
- ngRNA nicking guide RNA
- Neon electroporation system was used.
- AtgRNA and nicking gRNAs were ordered from Integrated DNA Technologies as Alt-R gRNA.
- HEK293T cells As shown in Table 11, 1 ⁇ g of mRNA encoding the gene editor polypeptide, 120 pmol of atgRNA, 40 pmol of nicking sgRNA and 50,000 HEK293T cells were mixed in Buffer R and electroporated using 10- ⁇ l Neon tips using the following electroporation parameters: 1,150 V, 20 ms, two pulses. After electroporation, cells were plated in prewarmed 48-well plates with DMEM containing 10% FBS and incubated for 72 h before analysis. l *Cells are either HEK293 or Jurkat cells [0291] As shown in FIG.
- an atgRNA having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site and the third genomic site are separated by 18 base pairs demonstrated an 82% beacon placement efficiency as measured by droplet digital PCR (ddPCR).
- an atgRNA having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site are adjacent demonstrated a 74% beacon placement efficiency.
- Example 2 Attachment Site-Containing guide RNA (atgRNAs) capable of partial genomic replacement demonstrate improved attachment site genomic placement
- attachment site-containing guide RNAs atgRNAs
- the AtgRNA were designed to target the LMNB1 locus, the hF9 locus, and the ACTB locus and with each atgRNA capable of site-specifically incorporating an integrase recognition site into the LMBN1 locus, the hF9 locus, and the ACTB locus, respectively.
- the atgRNAs targeting the LMNB1 locus were synthesized to allow for a 0-36 base pair separation between the second genomic site and the third genomic site.
- the atgRNAs targeting the hF9 locus were synthesized to allow for a 0-42 base pair separation between the second genomic site and the third genomic site.
- the atgRNAs targeting the ACTB locus were synthesized to allow for a 0-33 base pair separation between the second genomic site and the third genomic site.
- the synthesized atgRNAs incorporate the use of a 10 nucleotide clamp segment and a 38 nucleotide integration recognition site (e.g., AttB or AttP).
- atgRNA and nicking gRNAs were ordered from Integrated DNA Technologies as Alt-R gRNA.
- the atgRNAs were transfected along with mRNA encoding a gene editor polypeptide (i.e., a Cas9-nickase-reverse transcriptase fusion) and a nicking guide RNA (ngRNA) into HEK 293 cells using Lipofectamine MessengerMax (Thermo Fisher) according to the manufacturer’s instructions.
- a gene editor polypeptide i.e., a Cas9-nickase-reverse transcriptase fusion
- ngRNA nicking guide RNA
- 1 ⁇ g of a gene editor polypeptide (Cas9-nickase linked to a reverse transcriptase) encoding mRNA 120 pmol of atgRNA, 40 pmol of nicking sgRNA were transfected into 50,000 HEK293T cells.
- FIG. 7A shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNAs targeting the LMNB1 locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site and the third genomic site are separated by 0 to 36 base pairs (see FIG. 7A “clamp+0” (i.e., 0 base pair separation) to “clamp+36” (i.e., 36 base pair separation). Percent beacon placement was highest for atgRNA having a 10 base pair separation between the second and third genomic sites (see “clamp+10” in FIG.7A).
- FIG. 7B shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNAs targeting the hF9 locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a Attorney Ref: 38909-52322/WO Client Ref: 005WO third genomic site, where the second genomic site and the third genomic site are separated by 0 to 42 base pairs (see FIG.7B “clamp+0” (i.e., 0 base pair separation) to “clamp+42” (i.e., 42 base pair separation). Percent beacon placement was highest for atgRNA having a 31 base pair separation between the second and third genomic sites (see “clamp+31” in FIG.7B).
- FIG. 7C shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNAs targeting the ACTB locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site and the third genomic site are separated by 0 to 33 base pairs (see FIG. 7C “clamp+0” (i.e., 0 base pair separation) to “clamp+33” (i.e., 33 base pair separation).
- Percent beacon placement is highest for atgRNA having a 16 base pair separation between the second and third genomic sites (see “clamp+16” in FIG.7C).
- PASTE with atgRNAs capable of partial genomic replacement A PASTE readout in HEK293 cells is used to identify functionally active atgRNAs capable of inserting an integration recognition site where insertion includes partial genomic replacement. Droplet digital PCR (ddPCR) in a 96-well plate format is applied to verify gene insertion into the ACTB locus. Using Lipofectamine 3000 transfection reagent, 100ng total DNA at a plasmid ratio of 1:1:4 of Cas9(nickase)-RT-BxB1 : nicking guide : atgRNA is delivered to 30,000 HEK293 cells.
- ddPCR Droplet digital PCR
Abstract
Provided herein are attachment site containing guide RNAs (atgRNAs) and compositions useful for site-specifically incorporating an integration recognition site into a cellular genome or intracellular nucleic acid sequence. For example, provided herein is an atgRNA comprising a spacer, a scaffold, a clamp, a reverse transcriptase template, and a primer binding site.
Description
Attorney Ref: 38909-52322/WO Client Ref: 005WO GUIDE RNA COMPOSITIONS FOR PROGRAMMABLE GENE INSERTION 1. CROSS-REFERENCE TO RELATED APPLICATIONS [0001] This application claims the benefit of U.S. Provisional Application No.63/338,077, filed May 4, 2022; which is hereby incorporated in its entirety by reference. 2. SEQUENCE LISTING [0002] The instant application contains a Sequence Listing with 558 sequences, which has been submitted via Patent Center and is hereby incorporated by reference in its entirety. Said XML copy, created on May 4, 2023, is named 52322WO-SequenceListing.xml, and is 757,295 bytes in size. 3. BACKGROUND [0003] Programmable, efficient, and multiplexed genome integration of large, diverse DNA cargo independent of DNA repair remains an unsolved challenge of genome editing. Current gene integration approaches require double strand breaks that evoke DNA damage responses and rely on repair pathways that are inactive in terminally differentiated cells. Furthermore, CRISPR-based approaches that bypass double stranded breaks, such as Prime editing, are limited to modification or insertion of short sequences. [0004] There is a need in the art for techniques and gene editor compositions which address and overcome these shortcomings and enable the programmable genomic integration of a large gene(s). 4. SUMMARY [0005] This disclosure features attachment site containing guide RNA (atgRNA) and guide RNA complex compositions capable of binding to a gene editor protein composition and mediating incorporation of an integration recognition site into a target DNA sequence. In some embodiments, the integration recognition stie (also referred to as a beacon) is placed in the genome by partial genomic replacement using single guide RNA or dual guide RNAs. In typical embodiments, the atgRNA comprises, from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a reverse transcriptase (RT) template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed the sequence (reverse
Attorney Ref: 38909-52322/WO Client Ref: 005WO complement clamp “rcClamp”) is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; and a primer binding site complementary to a third genomic site of the double-stranded target DNA sequence, wherein the third genomic site is on a second strand of the target DNA sequence. [0006] Inclusion of a clamp segment in the atgRNA allows for a reduction in the number of nucleotides in the primer binding site needed to bind the third genomic site and facilitate reverse transcription. By relaxing the requirement that the clamp binding site (i.e., second genomic site) be immediately adjacent to the primer binding site (i.e., third genomic site) or by allowing reduction in the number of nucleotides in the PBS, this disclosure provides for atgRNA compositions which are capable of being reduced in total sequence length as compared to standard prime editing guide RNAs or gene editing RNAs capable of integration recognition site insertion. The clamp segment binding site (i.e., second genomic site) can be selected from within a large genomic window, wherein for example, the genomic window frame can be shifted to screen for variable sequence distance from the third genomic site and/or constrained to select for efficient (e.g., melting temperature, rate of binding, low off target, etc.) hybridization to a defined length clamp segment. The present disclosure provides for single guide RNA, split guide RNA, and dual guide RNA methods for programmable gene insertion by partial genomic replacement. [0007] The present disclosure provides nucleic acid compositions, methods, and an overall platform for site-specific genetic engineering using Programmable Addition via Site-Specific Targeting Elements (PASTE) (see Ionnidi et al.; doi: 10.1101/2021.11.01.466786; the entirety of which is incorporated herein by reference), transposon-mediated gene editing, or other suitable gene editing, or gene incorporation technology. Non-limiting examples of PASTE are also described in U.S. Pat.11,572,556, which is hereby incorporated by reference in its entirety. [0008] In one aspect, this disclosure features an attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a reverse transcriptase template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed the sequence is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; and a primer binding site complementary to a third genomic site of the double-stranded target DNA sequence, wherein the third genomic site is on a second strand of the target DNA
Attorney Ref: 38909-52322/WO Client Ref: 005WO sequence, wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site and the clamp segment. [0009] In another aspect, this disclosure features an attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a reverse transcriptase template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed comprises a sequence at least partially complementary to a second genomic site of first strand of the double-stranded target DNA sequence; and a primer binding site comprising: a sequence complementary to a third genomic site of the double-stranded DNA target sequence, wherein the third genomic site is on a second strand of the target DNA sequence; and an anti-reverse transcriptase template (anti-RTT) sequence comprising a sequence at least partially complementary to all or a portion of the RT template, wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site and the clamp segment. [0010] In another aspect, this disclosure features an attachment site containing guide RNA (atgRNA), comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a clamp segment at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; a reverse transcriptase (RT) template comprising an integration recognition site; and a primer binding site complementary to a third genomic site of the double-stranded target DNA sequence, wherein the third genomic site is on a second strand of the target DNA sequence; wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site. [0011] In another aspect, this disclosure features an attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a clamp segment at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; a reverse transcriptase (RT) template comprising an integration recognition site; and a primer binding site comprising: a sequence complementary to a third genomic site of the double-stranded DNA target sequence, wherein the
Attorney Ref: 38909-52322/WO Client Ref: 005WO third genomic site is on a second strand of the target DNA sequence; and an anti-reverse transcriptase template (anti-RTT) sequence comprising a sequence at least partially complementary to all or a portion of the RT template, wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site. [0012] In some embodiments, upon introducing the atgRNA into a cell with a gene editor polypeptide, all or a portion of the single-stranded DNA sequence comprising the integration recognition site is site-specifically integrated into the double-stranded target DNA sequence. [0013] In some embodiments, upon introducing the atgRNA into a cell with a gene editor polypeptide, the atgRNA comprising the clamp segment enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment. [0014] In some embodiments, the atgRNA has a length of 150 to 190 nucleotides. In some embodiments, the atgRNA has a length of 160 to 180 nucleotides. [0015] In some embodiments, the clamp segment has a length of 1 nucleotide to 40 nucleotides. In some embodiments, the clamp segment has a length of 1 to 10 nucleotides, 11 to 20 nucleotides, 21 to 30 nucleotides, or 31 to 40 nucleotides. In some embodiments, the clamp segment has a length of 10 nucleotides. [0016] In some embodiments, the RT template has a length of 10 nucleotides to 200 nucleotides. In some embodiments, the RT template has a length of 30, 32, 34, 36, 38, 40, 42, 44, 46, 48 or 50 nucleotides. [0017] In some embodiments, the second genomic site and the third genomic site are separated by one or more genomic bases in the double-stranded target DNA sequence. [0018] In some embodiments, the second genomic site and the third genomic site are separated by 1 to 10 genomic bases, 11 to 20 genomic bases, 21 to30 genomic bases, 31 to 40 genomic bases, or 41 to 45 genomic bases in the double-stranded target DNA sequence. [0019] In some embodiments, the second genomic site and the third genomic site are separated by 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, or 45 genomic bases in the double-stranded target DNA sequence.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0020] In some embodiments, the second genomic site and the third genomic site are adjacent in the double-stranded target DNA sequence. [0021] In some embodiments, the clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity to the second genomic site. [0022] In some embodiments, the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with at least a first nucleotide mismatch. [0023] In some embodiments, the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with a first nucleotide mismatch and a second nucleotide mismatch. In some embodiments, the first nucleotide mismatch and the second nucleotide mismatch are consecutive. In some embodiments, the first nucleotide mismatch and the second nucleotide mismatch are non-consecutive. [0024] In some embodiments, the anti-RTT sequence comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity to all or a portion the RT template. In some embodiments, the anti-RTT sequence has a length between 1 nucleotide to 20 nucleotides, 5 nucleotides to 15 nucleotides, or 10 nucleotides to 20 nucleotides. In some embodiments, the anti-RTT sequence has a length of 10 nucleotides. [0025] In some embodiments, upon introducing the atgRNA into a cell, the anti-RTT sequence increases stability of the atgRNA as compared to an atgRNA that does not comprise the anti-RTT sequence. [0026] In another aspect, this disclosure features a system for site-specifically incorporating an integration recognition site into a mammalian cell genome at a target DNA sequence, comprising: any of the attachment site containing gRNAs (atgRNAs) described herein; and a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain capable of incorporating the integration recognition site into the target DNA sequence, wherein, upon introducing into the cell, the system integrates the integration recognition site into the target DNA sequence. [0027] In some embodiments, upon introducing the system into the cell, the atgRNA comprising the clamp segment enhances integration efficiency of the integration recognition site at the target
Attorney Ref: 38909-52322/WO Client Ref: 005WO DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment. [0028] In another aspect, this disclosure features a method for site-specifically incorporating an integration recognition site into a mammalian cell genome at a target DNA sequence, comprising: incorporating an integration target recognition site into the genome by delivering into the cell: any of the attachment site containing gRNAs (atgRNAs) described herein; and a gene editor polypeptide or polynucleotide encoding a gene editor polypeptide, wherein the gene editor polypeptide comprises a DNA binding nickase domain linked to a reverse transcriptase domain, wherein the integration recognition site is incorporated into the target DNA sequence by the gene editor polypeptide; and optionally, a nicking gRNA. [0029] In some embodiments, the method using the atgRNA comprising the clamp segment enhances integration efficiency of the integration recognition site at the target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment. [0030] In another aspect, this disclosure features a system for site-specifically integrating a donor polynucleotide template into a mammalian cell genome at a target DNA sequence, comprising: any of the attachment site containing gRNAs (atgRNAs) described herein; a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain capable of incorporating the integration recognition site into the target DNA sequence, an integration enzyme; and a donor polynucleotide template linked to a sequence that is an integration cognate of the integration recognition site present in the atgRNA, whereby the integration enzyme integrates the donor polynucleotide template into the target DNA sequence. [0031] In some embodiments, upon introducing the system into the cell, the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment. [0032] In another aspect, this disclosure features a method for site-specifically integrating a donor polynucleotide template into a mammalian cell genome at a target DNA sequence, comprising: incorporating an integration recognition site into the genome by delivering into the cell: any of the attachment site containing gRNAs (atgRNAs) described herein; and a gene editor polypeptide or polynucleotide encoding a gene editor polypeptide, wherein the gene editor polypeptide comprises
Attorney Ref: 38909-52322/WO Client Ref: 005WO a DNA binding nickase domain linked to a reverse transcriptase domain and is capable of incorporating the integration recognition site into the target DNA sequence; and optionally, a nicking gRNA; and integrating the donor polynucleotide template into the genome by delivering into the cell: an integration enzyme; and a donor polynucleotide template, wherein the donor polynucleotide template is linked to a sequence that is an integration cognate of the integration recognition site present in the atgRNA, and wherein the donor polynucleotide template is integrated into the cellular genome at the incorporated genomic integration recognition site by the integration enzyme. [0033] In some embodiments, the method enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment. 5. BRIEF DESCRIPTION OF THE DRAWINGS [0034] These and other features, aspects, and advantages of the present disclosure will become better understood with regard to the following description, and accompanying drawings, where: [0035] FIG.1A-1E shows analysis of AttP variants. FIG.1A shows a non-limiting schematic of AttP mutations tested for improving integration efficiency (SEQ ID NOS: 394 and 540-542, respectively, in order of appearance). FIG. 1B shows integration efficiencies of wildtype and mutant AttP sites across a panel of AttB lengths. FIG. 1C shows a non-limiting schematic of multiplexed integration of different cargo sets at specific genomic loci. Three fluorescent cargos (GFP, mCherry, and YFP) are inserted orthogonally at three different loci (ACTB, LMNB1, NOLC1) for in-frame gene tagging. FIG.1D shows orthogonality of top 4 AttB/AttP dinucleotide pairs evaluated for GFP integration with PASTE at the ACTB locus. FIG.1E shows efficiency of multiplexed PASTE insertion of combinations of fluorophores at ACTB, LMNB1, and NOLC1 loci. Data are mean (n= 3) ± s.e.m. [0036] FIG. 2 illustrates a schematic of single atgRNA and dual atgRNA approaches for step 1: beacon placement (“integration recognition site”) and step 2: gene insertion, thereby illustrating PASTE. [0037] FIGs. 3A-3C illustrates a version 1 attachment site-containing guide RNA (atgRNAv1) (FIG.3A) and a mechanism by which the atRNAv1 can be used for site-specifically integrating an integration recognition site into a mammalian cell genome at a desired target site (e.g., a target
Attorney Ref: 38909-52322/WO Client Ref: 005WO DNA sequence) (FIGs.3B-3C). FIG.3A shows an atgRNA-v1 comprising from 5’ to 3’: a spacer sequence that binds to a first genomic site, a scaffold sequence, a clamp segment comprising a sequence that is complementary (binds) to a second genomic site or comprising a sequence (see rcClamp) where upon being reverse transcribed the sequence is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence, a sequence encoding an integration recognition site (“AttB”), and a primer binding stie (PBS) that binds to a third genomic site. FIG.3B shows atgRNA v1 with the spacer hybridized to the first genomic site and the PBS hybridized to the third genomic site, where the nick is indicated by the inverted triangle. FIG.3C shows a mechanism by which the atgRNA-v1 as shown in FIGs.3A-3B can be used for site-specifically integrating an integration recognition site into a mammalian cell genome at a desired target site (e.g., a target DNA sequence). For example, a nick (as indicated by the inverted triangle) on the non-targeting strand induced by a nickase (e.g., nCas9) facilitates binding of the PBS to a sequence 5’ to the nick site on the non-targeting strand. Following “3’ flap synthesis by a reverse transcriptase, the clamp (i.e., reverse complement of the clamp) binds the targeting strand immediately 3’ to the site of the nick on the non-targeting strand. Following favored repair, the integration recognition site (e.g., AttB) is integrated into the genome at the target sequence. [0038] FIGs. 4A-4C illustrate a version 2 attachment site-containing guide RNA (atgRNAv2) (FIG.4A) and a mechanism by which the atRNAv2 can be used for site-specifically integrating an integration recognition site into a mammalian cell genome at a desired target site (e.g., a target DNA sequence) (FIGs. 4B-4C). FIG. 4A shows an atgRNA-v2 comprising 5’ to 3’: a spacer sequence that binds a first genomic site, a scaffold sequence, a clamp comprising a sequence that has complementarity (binds) to a second genomic site which is 3’ to the nick on the targeting strand (i.e., a one or more separation between the second genomic site and a third genomic site) or comprising a sequence (see rcClamp) where upon being reverse transcribed the sequence is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence, a sequence encoding an integration recognition site (e.g., AttB), and a primer binding site (PBS) that binds to a third genomic site. FIG.4B shows atgRNA v2 with the spacer hybridized to the first genomic site and the PBS hybridized to the third genomic site, where the nick is indicated by the inverted triangle. FIG.4C shows the mechanism by which the atgRNA-v2 as shown in FIGs. 4A-4B can be used for site-specifically integrating an integration recognition site into a genome of a cell. For example, a nick (as indicated by the inverted triangle) on the non- targeting strand induced by a nickase (e.g., nCas9) facilitates binding of the PBS to a sequence 5’ to the nick site on the non-targeting strand. Following “3’ flap synthesis by a RT,” the clamp (i.e., reverse complement of the clamp) binds the targeting strand at a sequence that is 5’ to the nick on
Attorney Ref: 38909-52322/WO Client Ref: 005WO the corresponding non-targeting strand. In this case, the sequence to which the clamp (i.e., the reverse complement of the clamp) binds on the targeting strand is greater than 1 nucleotide 5’ to the site of the nick on the non-targeting strand. Following favored repair, the integration recognition site (e.g., AttB) is integrated into the genome at the target sequence. [0039] FIG. 5 shows ddPCR data for beacon placement (i.e., incorporation of integration recognition site into a genome of a cell) at the LMNB1 locus using atgRNAv1 or atgRNAv2. [0040] FIGs.6A-6B illustrate the use of an atgRNA comprising an anti-reverse transcription (RT) sequence that is at least partially complementary to all or a portion of the RT template sequence. FIG.6A shows exonuclease-mediated degradation of the 3’ end of an atgRNA with the result being an atgRNA where the 3’ end including the PBS, AttB, and clamp has been degraded. FIG. 6B shows an atgRNA comprising the anti-RT sequence that binds (e.g., intramolecular binding) to the RT template, thereby preventing exonuclease-mediated degradation of the 3’ end of the atgRNA. “X” through the exonuclease indicates blocking of the exonucleases ability to degrade the atgRNA. [0041] FIGs.7A-7C shows percent beacon placement data for atgRNA capable of site-specifically incorporating an integrase recognition site into a genomic locus. The AtgRNA were designed to target the LMNB1 locus (FIG.7A), the hF9 locus (FIG.7B), and the ACTB locus (FIG.7C) and each capable of site-specifically incorporating an integrase recognition site into the LMBN1 locus hF9 locus, and ACTB locus, respectively. Each atgRNA included a 10 nucleotide clamp and the indicated separation between the second and third genomic sites. FIG.7A shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNA targeting the LMNB1 locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site and the third genomic site are separated by 0 to 36 base pairs (see FIG.7A “clamp+0” (i.e., 0 base pair separation) to “clamp+36” (i.e., 36 base pair separation). FIG.7B shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNA targeting the hF9 locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site and the third genomic site are separated by 0 to 42 base pairs (see FIG.7B “clamp+0” (i.e., 0 base pair separation) to “clamp+42” (i.e., 42 base pair separation). FIG. 7C shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNA targeting the ACTB locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site
Attorney Ref: 38909-52322/WO Client Ref: 005WO and the third genomic site are separated by 0 to 33 base pairs (see FIG.7C “clamp+0” (i.e., 0 base pair separation) to “clamp+33” (i.e., 33 base pair separation). 6. DETAILED DESCRIPTION [0042] Described herein are attachment site containing guide RNA (atgRNA) and guide RNA complex compositions capable of binding to a gene editor protein composition and mediating incorporation of an integration recognition site into a target DNA sequence. In some embodiments, the integration recognition stie (also referred to as a beacon) is placed in the genome by partial genomic replacement using a single guide RNA or dual guide RNAs. In typical embodiments, the guide RNA comprises, in 5’ to 3’ order, comprises a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a reverse transcriptase template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed the sequence (reverse complement clamp “rcClamp”) is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; and a primer binding site complementary to a third genomic site of the double-stranded target DNA sequence, wherein the third genomic site is on a second strand of the target DNA sequence. [0043] Overall, this disclosure is based in part on the finding that by relaxing the requirement that the clamp binding site (i.e., second genomic site) be immediately adjacent to the primer binding site (i.e., third genomic site) the size limitations associated with atgRNAs can be reduced as compared to standard prime editing guide RNAs or gene editing RNAs capable of integration recognition site insertion. This finding enables the clamp segment binding site (i.e., second genomic site) to be selected from within a large genomic window, wherein for example, the genomic window frame can be shifted to screen for variable sequence distance from the third genomic site. 6.1. Terminology [0044] Unless defined otherwise, all technical and scientific terms used herein have the meaning commonly understood by a person skilled in the art to which this disclosure belongs. As used herein, the following terms have the meanings ascribed to them below. [0045] “Gene editor” as used herein, is a protein that that can be used to perform gene editing, gene modification, gene insertion, gene deletion, or gene inversion. As used herein, the terms
Attorney Ref: 38909-52322/WO Client Ref: 005WO “gene editor polynucleotide” refers to polynucleotide sequence encoding the gene editor protein. Such an enzyme or enzyme fusion may contain DNA or RNA targetable nuclease protein (i.e., Cas protein, ADAR, or ADAT), wherein target specificity is mediated by a complexed nucleic acid (i.e., guide RNA). Such an enzyme or enzyme fusion may be a DNA/RNA targetable protein, wherein target specificity is mediated by internal, conjugated, fused, or linked amino acids, such as within TALENs, ZFNs, or meganucleases. The skilled person in the art would appreciate that the gene editor can demonstrate targeted nuclease activity, targeted binding with no nuclease activity, or targeted nickase activity (or cleaves activity). A gene editor comprising a targetable protein may be fused, linked, complexed, operate in cis or trans to one or more proteins or protein fragment motifs. Gene editors may be fused or linked to one or more integrase, recombinase, polymerase, telomerase, reverse transcriptase, or invertase. A gene editor can be a prime editor fusion protein or a gene writer fusion protein. [0046] “Prime editor fusion protein” as used herein, describes a protein that is used in prime editing. “Prime editor system” as used herein describes the components used in prime editing. Prime editing uses CRISPR enzyme that nicks or cuts only single strand of double stranded DNA, i.e., a nickase; the nickase can occur either naturally or by mutation or modification of a nuclease that makes double stranded cuts. The nickase is programmed (directed) with a prime-editing guide RNA (pegRNA). The skilled person in the art would appreciate that the pegRNA both specifies the target site and encodes the desired edit. Described herein are attachment site containing guide RNA (atgRNA) that both specifies the target and encodes for the desired integrase target recognition site. The nickase may be programmed (directed) with an atgRNA. Advantageously the nickase is a catalytically-impaired Cas9 endonuclease, a Cas9 nickase, that is fused to the reverse transcriptase. During genetic editing, the Cas9 nickase part of the protein is guided to the DNA target site by the atgRNA (pegRNA), whereby a nick or single stranded cut occurs. The reverse transcriptase domain then uses the atgRNA (pegRNA) to template reverse transcription of the desired edit, directly polymerizing DNA onto the nicked target DNA strand. The edited DNA strand replaces the original DNA strand, creating a heteroduplex containing one edited strand and one unedited strand. Afterward, optionally, the prime editor (PE) guides resolution of the heteroduplex to favor copying the edit onto the unedited strand, completing the process (typically achieved with a nickase gRNA). Other enzymes that can be used to nick or cut only a single strand of double stranded DNA includes a cleavase (e.g., cleavase I enzyme). [0047] In some embodiments, an additional agent or agents may be added that improve the efficiency and outcome purity of the prime edit. In some embodiments, the agent may be chemical
Attorney Ref: 38909-52322/WO Client Ref: 005WO or biological and disrupt DNA mismatch repair (MMR) processes at or near the edit site (i.e., PE4 and PE5 and PEmax architecture by Chen et al. Cell, 184, 1-18, October 28, 2021; Chen et al. is incorporated herein by reference). In typical embodiments, the agent is a MMR-inhibiting protein. In certain embodiments, the MMR-inhibiting protein is dominant negative MMR protein. In certain embodiments, the dominant negative MMR protein is MLH1dn. In particular embodiments, the MMR-inhibiting agent is incorporated into the single nucleic acid construct design described herein that encodes any of the gene editor polypeptides described herein. In some embodiments, the MMR-inhibiting agent is linked or fused to the prime editor protein fusion, which may or may not have a linked or fused integrase. In some embodiments, the MMR-inhibiting agent is linked or fused to the Gene Writer™ protein, which may or may not have a linked or fused integrase. [0048] The prime editor or gene editor system can be used to achieve DNA deletion and replacement. In some embodiments, the DNA deletion replacement is induced using a pair of atgRNA or pegRNA that target opposite DNA strands, programming not only the sites that are nicked but also the outcome of the repair (i.e., PrimeDel by Choi et al. Nat. Biotechnology, October 14, 2021; Choi et al. is incorporated herein by reference and TwinPE by Anzalone et al.BioRxiv, November 2, 2021; Anzalone et al. is incorporated herein by reference). In some embodiments described herein, the DNA deletion is induced using a single atgRNA. In some embodiments, the DNA deletion and replacement is induced using a wild type Cas9 prime editor (PE-Cas9) system (i.e., PEDAR by Jiang et al. Nat. Biotechnology, October 14, 2021; Jiang et al. is incorporated herein by reference in its entirety) In some embodiments, the DNA replacement is an integrase target recognition site or recombinase target recognition site. In certain embodiments, the constructs and methods described herein may be utilized to incorporate the pair of pegRNAs used in PrimeDel, TwinPE (WO2021226558 incorporated by reference herein in its entirety), or PEDAR, the prime editor fusion protein or Gene Writer protein, optionally a nickase guide RNA (ngRNA), an integrase, a nucleic acid cargo, and optionally a recombinase into a single nucleic acid construct described herein. The integrase may be directly linked, for example by a peptide linker, to the prime editor fusion or gene writer protein. [0049] In some embodiments, the prime editors can refer to a retrovirus or lentivirus reverse transcriptase such as a Moloney Murine Leukemia Virus (M-MLV) reverse transcriptase (RT) fused to a CRISPR enzyme nickase such as a Cas9 H840A nickase, a Cas9nickase. In some embodiments, the prime editors can refer to a retrovirus or lentivirus reverse transcriptase such as a Moloney Murine Leukemia Virus (M-MLV) reverse transcriptase (RT) fused to a cleavase. In
Attorney Ref: 38909-52322/WO Client Ref: 005WO some embodiments the RT can be fused at, near or to the C-terminus of a Cas9nickase, e.g., Cas9 H840A. Fusing the RT to the C-terminus region, e.g., to the C-terminus, of the Cas9 nickase may result in higher editing efficiency. Such a complex is called PEI. In some embodiments, the CRISPR enzyme nickase, e.g., Cas9(H840A), i.e., a Cas9nickase, can be linked to a non-M-MLV reverse transcriptase such as an AMV-RT or XRT (Cas9(H840A)-AMV-RT or XRT). In some embodiments, instead of the CRISPR enzyme nickase being a Cas9 (H840A), i.e., instead of being a Cas9 nickase, the CRISPR enzyme nickase instead can be a CRISPR enzyme that naturally is a nickase or cuts a single strand of double stranded DNA; for instance, the CRISPR enzyme nickase can be Casl2a/b. Alternatively, the CRISPR enzyme nickase can be another mutation of Cas9, such as Cas9(Dl0A). A CRISPR enzyme, such as a CRISPR enzyme nickase, such as Cas9 (wild type), Cas9(H840A), Cas9(Dl0A) or Cas 12a/b nickase can be fused in some embodiments to a pentamutant of M-MLV RT (D200N/ L603W/ T330P/ T306K/ W313F), whereby there can be up to about 45-fold higher efficiency, and this is called PE2. In some embodiments, the M-MLV RT comprise one or more of the mutations Y8H, P51L, S56A, S67R, E69K, Vl29P, L139P, Tl97A, H204R, V223H, T246E, N249D, E286R, Q2911, E302K, E302R, F309N, M320L, P330E, L435G, L435R, N454K, D524A, D524G, D524N, E562Q, D583N, H594Q, E607K, D653N, and L671P. Specific M-MLV RT mutations are shown in Table 1.
[0050] In some embodiments, the reverse transcriptase can also be a wild-type or modified transcription xenopolymerase (RTX), avian myeloblastosis virus reverse transcriptase (AMV RT), Feline Immunodeficiency Virus reverse transcriptase (FIV-RT), FeLV-RT (Feline leukemia virus reverse transcriptase), HIV-RT (Human Immunodeficiency Virus reverse transcriptase). In some embodiments, the reverse transcriptase can be a fusion of MMuLV to the Sto7d DNA binding domain (see Ionnidi et al.; https://doi.org/10.1101/2021.11.01.466786). The fusion of MMuLV to the Sto7d DNA binding domain sequence is given in Table 2.
Attorney Ref: 38909-52322/WO Client Ref: 005WO
[0051] PE3, PE3b, PE4, PE5, and/or PEmax, which a skilled person can incorporate into a single nucleic acid construct described herein, involves nicking the non-edited strand, potentially causing the cell to remake that strand using the edited strand as the template to induce HR. The nicking of the non-edited strand can involve the use of a nicking guide RNA (ngRNA). [0052] The skilled person can readily incorporate into a gene editor composition described herein a prime editing or CRISPR system. Examples of prime editors or gene editors can be found in the following: WO2020/191153, WO2020/191171, WO2020/191233, WO2020/191234, WO2020/191239, WO2020/191241, WO2020/191242, WO2020/191243, WO2020/191245, WO2020/191246, WO2020/191248, WO2020/191249, each of which is incorporated by reference herein in its entirety. In addition, mention is made, and can be used herein, of CRISPR Patent Applications and Patents of the Zhang laboratory and/or Broad Institute, Inc. and Massachusetts Institute of Technology and/or Broad Institute, Inc., Massachusetts Institute of Technology and President and Fellows of Harvard College and/or Editas Medicine, Inc. Broad Institute, Inc., The University of Iowa Research Foundation and Massachusetts Institute of Technology, including those claiming priority to US Application 61/736,527, filed December 12, 2012, including US Patents 11,104,937, 11,091,798, 11,060,115, 11,041,173, 11,021,740, 11,008,588, 11,001,829, 10,968,257, 10,954,514, 10,946,108, 10,930,367, 10,876,100, 10,851,357, 10,781,444, 10,711,285, 10,689,691, 10,648,020, 10,640,788, 10,577,630, 10,550,372, 10,494,621, 10,377,998, 10,266,887, 10,266,886, 10,190,137, 9,840,713, 9,822,372, 9,790,490, 8,999,641, 8,993,233, 8,945,839, 8,932,814, 8,906,616, 8,895,308, 8,889,418, 8,889,356, 8,871,445, 8,865,406, 8,795,965, 8,771,945, and 8,697,359; CRISPR Patent Applications and Patents of the Doudna laboratory and/or of Regents of the University of California, the University of Vienna and Emmanuelle Charpentier, including those claiming priority to US application 61/652,086, filed May 25,
Attorney Ref: 38909-52322/WO Client Ref: 005WO 2012, and/or 61/716,256, filed October 19, 2012, and/or 61/757,640, filed January 28, 2013, and/or 61/765,576, filed February 15, 2013 and/or 13/842,859, including US Patents 11,028,412, 11,008,590, 11,008,589, 11,001,863, 10,988,782, 10,988,780, 10,982,231, 10,982,230, 10,900,054, 10,793,878, 10,774,344, 10,752,920, 10,676,759, 10,669,560, 10,640,791, 10,626,419, 10,612,045, 10,597,680, 10,577,631, 10,570,419, 10,563,227, 10,550,407, 10,533,190, 10,526,619, 10,519,467, 10,513,712, 10,487,341, 10,443,076, 10,428,352, 10,421,980, 10,415,061, 10,407,697, 10,400,253, 10,385,360, 10,358,659, 10,358,658, 10,351,878, 10,337,029, 10,308,961, 10,301,651, 10,266,850, 10,227,611, 10,113,167, and 10,000,772; CRISPR Patent Applications and Patents of Vilnius University and/or the Siksnys laboratory, including those claiming priority to US application 62/046384 and/or 61/625,420 and/or 61/613,373 and/or PCT/IB2015/056756, including US Patent 10,385,336; CRISPR Patent Applications and Patents of the President and Fellows of Harvard College, including those of George Church’s laboratory and/or claiming priority to US application 61/738,355, filed December 17, 2012, including 11,111,521, 11,085,072, 11,064,684, 10,959,413, 10,925,263, 10,851,369, 10,787,684, 10,767,194, 10,717,990, 10,683,490, 10,640,789, 10,563,225, 10,435,708, 10,435,679, 10,375,938, 10,329,587, 10,273,501, 10,100,291, 9,970,024, 9,914,939, 9,777,262, 9,587,252, 9,267,135, 9,260,723, 9,074,199, 9,023,649; CRISPR Patent Applications and Patents of the President and Fellows of Harvard College, including those of David Liu’s laboratory, including 11,111,472, 11,104,967, 11,078,469, 11,071,790, 11,053,481, 11,046,948, 10,954,548, 10,947,530, 10,912,833, 10,858,639, 10,745,677, 10,704,062, 10,682,410, 10,612,011, 10,597,679, 10,508,298, 10,465,176, 10,323,236, 10,227,581, 10,167,457, 10,113,163, 10,077,453, 9,999,671, 9,840,699, 9,737,604, 9,526,784, 9,388,430, 9,359,599, 9,340,800, 9,340,799, 9,322,037, 9,322,006, 9,228,207, 9,163,284, and 9,068,179; and CRISPR Patent Applications and Patents of Toolgen Incorporated and/or the Kim laboratory and/or claiming priority to US application 61/717,324, filed October 23, 2012 and/or 61/803,599, filed March 20, 2013 and/or 61/837,481, filed June 20, 2013 and/or 62/033,852, filed August 6, 2014 and/or PCT/KR2013/009488 and/or PCT/KR2015/008269, including US Patent 10,851,380, and 10,519,454; and CRISPR Patent Applications and Patents of Sigma and/or Millipore and/or the Chen laboratory and/or claiming priority to US application 61/734,256, filed December 6, 2012 and/or 61/758,624, filed January 30, 2013 and/or 61/761,046, filed February 5, 2013 and/or 61/794,422, filed March 15, 2013, including US Patent 10,731,181, each of which is hereby incorporated herein by reference, and from the disclosures of the foregoing, the skilled person can readily make and use a prime editing or CRISPR system, and can especially
Attorney Ref: 38909-52322/WO Client Ref: 005WO appreciate impaired endonucleases, such as a mutated Cas9 that only nicks a single strand of DNA and is hence a nickase, or a CRISPR enzyme that only makes a single-stranded cut that can be employed in a PASTE system of the disclosure. Further, from the disclosures of the foregoing, the skilled person can incorporate the selected CRISPR enzyme, as part of the gene editor, gene editor fusion, prime editor, or prime editor fusion compositions described herein. [0053] Prior to RT-mediated edit incorporation, the prime editor protein (1) site- specifically targets a genomic locus and (2) performs a catalytic cut or nick. These steps are typically performed by a CRISPR-Cas. However, in some embodiments the Cas protein may be substituted by other nucleic acid programmable DNA binding proteins (napDNAbp) such as zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), or meganucleases. In addition, to the extent the “targeting rules” of other napDNAbp are known or are newly determined, it becomes possible to use new napDNAbp, beyond Cas9, to site specifically target and modify genomic sites of interest. [0054] Similar to a prime editor protein, a Gene Writer can introduce novel DNA elements, such as an integration target site, into a DNA locus. A Gene Writer protein comprises: (A) a polypeptide or a nucleic acid encoding a polypeptide, wherein the polypeptide comprises (i) a reverse transcriptase domain, and either (x) an endonuclease domain that contains DNA binding functionality or (y) an endonuclease domain and separate DNA binding domain; and (B) a template RNA comprising (i) a sequence that binds the polypeptide and (ii) a heterologous insert sequence. Examples of such Gene Writer™ proteins and related systems can be found in US20200109398, which is incorporated by reference herein in its entirety. [0055] In some embodiments, the prime editor or Gene Writer protein fusion or prime editor protein linked or fused to an integrase is expressed as a split construct. In typical embodiments, the split construct in reconstituted in a cell. In some embodiments, the split construct can be fused or ligated via intein protein splicing. In some embodiments, the split construct can be reconstituted via protein-protein inter-molecular bonding and/or interactions. In some embodiments, the split construct can be reconstituted via chemical, biological, or environmental induced oligomerization. In some embodiments, the split construct can be reconstituted via nanobody binding ALFA-tagged proteins. In certain embodiments, the split construct can be adapted into one or more single nucleic acid polynucleotides. [0056] In some embodiments, an integrase or recombinase is directly linked or fused, for example by a peptide linker, which may be cleavable or non-cleavable, to the prime editor fusion
Attorney Ref: 38909-52322/WO Client Ref: 005WO protein (i.e., fused Cas9 nickase-reverse transcriptase) or Gene Writer protein. Suitable linkers, for example between the Cas9, RT, and integrase, may be selected from Table 3:
Attorney Ref: 38909-52322/WO Client Ref: 005WO
[0057] In some embodiments, the prime editor or Gene Writer protein fusion or prime editor protein linked or fused to an integrase is expressed as a split construct. In typical embodiments, the split construct in reconstituted in a cell. In some embodiments, the split construct can be fused or ligated via intein protein splicing. In some embodiments, the split construct can be reconstituted via protein-protein inter-molecular bonding and/or interactions. In some embodiments, the split construct can be reconstituted via chemical, biological, or environmental induced oligomerization. In certain embodiments, the split construct can be adapted into one or more nucleic acid constructs described herein. 6.2. Type II CRISPR proteins [0058] The skilled person can incorporate a selected CRISPR enzyme, described below, as part of the prime editor fusion, as a component of the gene editor polypeptide described herein. Streptococcus pyogenes Cas9 (SpCas9), the most common enzyme used in genome-editing applications, is a large nuclease of 1368 amino acid residues. Advantages of SpCas9 include its short, 5′-NGG-3′ PAM and very high average editing efficiency. SpCas9 consists of two lobes: a recognition (REC) lobe and a nuclease (NUC) lobe. The REC lobe can be divided into three regions, a long a helix referred to as the bridge helix (residues 60–93), the REC1 (residues 94–179 and 308–713) domain, and the REC2 (residues 180–307) domain. The NUC lobe consists of the RuvC (residues 1–59, 718–769, and 909–1098), HNH (residues 775–908), and PAM-interacting (PI) (residues 1099–1368) domains. The negatively charged sgRNA:target DNA heteroduplex is accommodated in a positively charged groove at the interface between the REC and NUC lobes.
Attorney Ref: 38909-52322/WO Client Ref: 005WO In the NUC lobe, the RuvC domain is assembled from the three split RuvC motifs (RuvC I–III) and interfaces with the PI domain to form a positively charged surface that interacts with the 30 tail of the sgRNA. The HNH domain lies between the RuvC II–III motifs and forms only a few contacts with the rest of the protein. Structural aspects of SpCas9 are described by Nishimasu et al., Crystal Structure of Cas9 in Complex with Guide RNA and Target DNA, Cell 156, 935-949, February 27, 2014. [0059] REC lobe: The REC lobe includes the REC1 and REC2 domains. The REC2 domain does not contact the bound guide:target heteroduplex, indicating that truncation of REC lobe may be tolerated by SpCas9. Further, SpCas9 mutant lacking the REC2 domain (D175–307) retained ~50% of the wild-type Cas9 activity, indicating that the REC2 domain is not critical for DNA cleavage. In striking contrast, the deletion of either the repeat-interacting region (D97–150) or the anti-repeat-interacting region (D312–409) of the REC1 domain abolished the DNA cleavage activity, indicating that the recognition of the repeat:anti-repeat duplex by the REC1 domain is critical for the Cas9 function. [0060] PAM-Interacting domain: The NUC lobe contains the PAM-interacting (PI) domain that is positioned to recognize the PAM sequence on the noncomplementary DNA strand. The PI domain of SpCas9 is required for the recognition of 5’-NGG-3’ PAM, and deletion of the PI domain (Δ1099-1368) abolished the cleavage activity, indicating that the PI domain is critical for SpCas9 function and a major determinant for the PAM specificity. [0061] RuvC domain: The RuvC nucleases of SpCas9 have an RNase H fold and four catalytic residues, Asp10 (Ala), Glu762, His983, and Asp986, that are critical for the two-metal cleavage of the noncomplementary strand of the target DNA. In addition to the conserved RNase H fold, the Cas9 RuvC domain has other structural elements involved in interactions with the guide:target heteroduplex (an end-capping loop between α42 and α43) and the PI domain/stem loop 3 (β hairpin formed by β3 and β4). [0062] HNH domain: SpCas9 HNH nucleases have three catalytic residues, Asp839, His840, and Asn863 and cleave the complementary strand of the target DNA through a single-metal mechanism. [0063] sgRNA:DNA recognition: The sgRNA guide region is primarily recognized by the REC lobe. The backbone phosphate groups of the guide region (nucleotides 2, 4–6, and 13–20) interact with the REC1 domain (Arg165, Gly166, Arg403, Asn407, Lys510, Tyr515, and Arg661) and the
Attorney Ref: 38909-52322/WO Client Ref: 005WO bridge helix (Arg63, Arg66, Arg70, Arg71, Arg74, and Arg78). The 20-hydroxyl groups of G1, C15, U16, and G19 hydrogen bond with Val1009, Tyr450, Arg447/Ile448, and Thr404, respectively. [0064] A mutational analysis demonstrated that the R66A, R70A, and R74A mutations on the bridge helix markedly reduced the DNA cleavage activities, highlighting the functional significance of the recognition of the sgRNA ‘‘seed’’ region by the bridge helix. Although Arg78 and Arg165 also interact with the ‘‘seed’’ region, the R78A and R165A mutants showed only moderately decreased activities. These results are consistent with the fact that Arg66, Arg70, and Arg74 form multiple salt bridges with the sgRNA backbone, whereas Arg78 and Arg165 form a single salt bridge with the sgRNA backbone. Moreover, the alanine mutations of the repeat:anti- repeat duplex-interacting residues (Arg75 and Lys163) and the stemloop-1-interacting residue (Arg69) resulted in decreased DNA cleavage activity, confirming the functional importance of the recognition of the repeat:anti-repeat duplex and stem loop 1 by Cas9. [0065] RNA-guided DNA targeting: SpCas9 recognizes the guide:target heteroduplex in a sequence-independent manner. The backbone phosphate groups of the target DNA (nucleotides 1, 9–11, 13, and 20) interact with the REC1 (Asn497, Trp659, Arg661, and Gln695), RuvC (Gln926), and PI (Glu1108) domains. The C2’ atoms of the target DNA (nucleotides 5, 7, 8, 11, 19, and 20) form van der Waals interactions with the REC1 domain (Leu169, Tyr450, Met495, Met694, and His698) and the RuvC domain (Ala728). The terminal base pair of the guide:target heteroduplex (G1:C20’) is recognized by the RuvC domain via end-capping interactions; the sgRNA G1 and target DNA C20’ nucleobases interact with the Tyr1013 and Val1015 side chains, respectively, whereas the 20-hydroxyl and phosphate groups of sgRNA G1 interact with Val1009 and Gln926, respectively. [0066] Repeat:Anti-Repeat duplex recognition: The nucleobases of U23/A49 and A42/G43 hydrogen bond with the side chain of Arg1122 and the main-chain carbonyl group of Phe351, respectively. The nucleobase of the flipped U44 is sandwiched between Tyr325 and His328, with its N3 atom hydrogen bonded with Tyr325, whereas the nucleobase of the unpaired G43 stacks with Tyr359 and hydrogen bonds with Asp364. [0067] The nucleobases of G21 and U50 in the G21:U50 wobble pair stack with the terminal C20:G10 pair in the guide:target heteroduplex and Tyr72 on the bridge helix, respectively, with the U50 O4 atom hydrogen bonded with Arg75. Notably, A51 adopts the syn conformation and is oriented in the direction opposite to U50. The nucleobase of A51 is sandwiched between Phe1105
Attorney Ref: 38909-52322/WO Client Ref: 005WO and U63, with its N1, N6, and N7 atoms hydrogen bonded with G62, Gly1103, and Phe1105, respectively. [0068] Stem-loop recognition: Stem loop 1 is primarily recognized by the REC lobe, together with the PI domain. The backbone phosphate groups of stem loop 1 (nucleotides 52, 53, and 59– 61) interact with the REC1 domain (Leu455, Ser460, Arg467, Thr472, and Ile473), the PI domain (Lys1123 and Lys1124), and the bridge helix (Arg70 and Arg74), with the 20-hydroxyl group of G58 hydrogen bonded with Leu455. A52 interacts with Phe1105 through a face-to-edge p-p stacking interaction, and the flipped U59 nucleobase hydrogen bonds with Asn77. [0069] The single-stranded linker and stem loops 2 and 3 are primarily recognized by the NUC lobe. The backbone phosphate groups of the linker (nucleotides 63–65 and 67) interact with the RuvC domain (Glu57, Lys742, and Lys1097), the PI domain (Thr1102), and the bridge helix (Arg69), with the 20-hydroxyl groups of U64 and A65 hydrogen bonded with Glu57 and His721, respectively. The C67 nucleobase forms two hydrogen bonds with Val1100. [0070] Stem loop 2 is recognized by Cas9 via the interactions between the NUC lobe and the non-Watson-Crick A68:G81 pair, which is formed by direct (between the A68 N6 and G81 O6 atoms) and water-mediated (between the A68 N1 and G81 N1 atoms) hydrogen-bonding interactions. The A68 and G81 nucleobases contact Ser1351 and Tyr1356, respectively, whereas the A68:G81 pair interacts with Thr1358 via a water-mediated hydrogen bond. The 20-hydroxyl group of A68 hydrogen bonds with His1349, whereas the G81 nucleobase hydrogen bonds with Lys33. [0071] Stem loop 3 interacts with the NUC lobe more extensively, as compared to stem loop 2. The backbone phosphate group of G92 interacts with the RuvC domain (Arg40 and Lys44), whereas the G89 and U90 nucleobases hydrogen bond with Gln1272 and Glu1225/Ala1227, respectively. The A88 and C91 nucleobases are recognized by Asn46 via multiple hydrogen- bonding interactions. [0072] Cas9 proteins smaller than SpCas9 allow more efficient packaging of nucleic acids encoding CRISPR systems, e.g., Cas9 and sgRNA into one rAAV (“all-in-one-AAV”) particle. In addition, efficient packaging of CRISPR systems can be achieved in other viral vector systems (i.e., lentiviral, integration deficient lentiviral, hd-AAV, etc.) and non viral vector systems (i.e., lipid nanoparticle). Small Cas9 proteins can be advantageous for multidomain-Cas-nuclease-based systems for prime editing. Well characterized smaller Cas9 proteins include Staphylococcus
Attorney Ref: 38909-52322/WO Client Ref: 005WO aureus (SauCas9, 1053 amino acid residues) and Campylobacter jejuni (CjCas9, 984 amino residues). However, both recognize longer PAMs, 5′-NNGRRT-3′ for SauCas9 (R = A or G) and 5′-NNNNRYAC-3′ for CjCas9 (Y = C or T), which reduces the number of uniquely addressable target sites in the genome, in comparison to the NGG SpCas9 PAM. Among smaller Cas9s, Schmidt et al. identified Staphylococcus lugdunensis (Slu) Cas9 as having genome-editing activity and provided homology mapping to SpCas9 and SauCas9 to facilitate generation of nickases and inactive (“dead”) enzymes (Schmidt et al., 2021, Improved CRISPR genome editing using small highly active and specific engineered RNA-guided nucleases. Nat Commun 12, 4219. doi.org/10.1038/s41467-021-24454-5) and engineered nucleases with higher cleavage activity by fragmenting and shuffling Cas9 DNAs. The small Cas9s and nickases are useful in the instant disclosure. [0073] Besides dead Cas9 and Cas9 nickase variants, the Cas9 proteins used herein may also include other “Cas9 variants” having at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% identical to any reference Cas9 protein, including any wild type Cas9, or mutant Cas9 (e.g., a dead Cas9 or Cas9 nickase), or fragment Cas9, or circular permutant Cas9, or other variant of Cas9 disclosed herein or known in the art. In some embodiments, a Cas9 variant may have 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 21, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more amino acid changes compared to a reference Cas9. In some embodiments, the Cas9 variant comprises a fragment of a reference Cas9 (e.g., a gRNA binding domain or a DNA-cleavage domain), such that the fragment is at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% identical to the corresponding fragment of wild type Cas9. In some embodiments, the fragment is at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% identical, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% of the amino acid length of a corresponding wild type Cas9 (SEQ ID NO: 18). [0074] In some embodiments, the disclosure also may utilize Cas9 fragments that retain their functionality and that are fragments of any herein disclosed Cas9 protein. In some embodiments, the Cas9 fragment is at least 100 amino acids in length. In some embodiments, the fragment is at
Attorney Ref: 38909-52322/WO Client Ref: 005WO least 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 1100, 1150, 1200, 1250, or at least 1300 amino acids in length. [0075] In various embodiments, the prime editors disclosed herein may comprise one of the Cas9 variants described as follows, or a Cas9 variant thereof having at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% identical to any reference Cas9 variants.
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
[0076] In some embodiments, prime editors utilized herein comprise CRISPR-Cas system enzymes other than type II enzymes. In certain embodiments, prime editors comprise type V or type VI CRISPR-Cas system enzymes. It will be appreciated that certain CRISPR enzymes exhibit promiscuous ssDNA cleavage activity and appropriate precautions should be considered. In certain embodiments, prime editors comprise a nickase or a dead CRISPR with nuclease function comprised in a different component. [0077] In various embodiments, the nucleic acid programmable DNA binding proteins utilized herein include, without limitation, Cas9 (e.g., dCas9 and nCas9), Cas12a (Cpf1), Cas12b1 (C2c1), Cas12b2, Cas12c (C2c3), Cas12d (CasY), Cas12e (CasX), C2c4, C2c5, C2c8, C2c9, C2c10, Cas13a (C2c2), Cas13b (C2c6), Cas13c (C2c7), Cas13d, and Argonaute. Cas-equivalents further include those described in Makarova et al., “C2c2 is a single-component programmable RNA- guided RNA-targeting CRISPR effector,” Science 2016; 353(6299) and Makarova et al., “Classification and Nomenclature of CRISPR-Cas Systems: Where from Here?,” The CRISPR Journal, Vol.1. No.5, 2018, the contents of which are incorporated herein by reference. One
Attorney Ref: 38909-52322/WO Client Ref: 005WO example of a nucleic acid programmable DNA-binding protein that has different PAM specificity than Cas9 is Clustered Regularly Interspaced Short Palindromic Repeats from Prevotella and Francisella 1 (i.e, Cas12a (Cpf1)). Similar to Cas9, Cas12a (Cpf1) is also a Class 2 CRISPR effector, but it is a member of type V subgroup of enzymes, rather than the type II subgroup. It has been shown that Cas12a (Cpf1) mediates robust DNA interference with features distinct from Cas9. Cas12a (Cpf1) is a single RNA-guided endonuclease lacking tracrRNA, and it utilizes a T- rich protospacer-adjacent motif (TTN, TTTN, or YTN). Moreover, Cpf1 cleaves DNA via a staggered DNA double-stranded break. Out of 16 Cpf1-family proteins, two enzymes from Acidaminococcus and Lachnospiraceae are shown to have efficient genome-editing activity in human cells. Cpf1 proteins are known in the art and have been described previously, for example Yamano et al., “Crystal structure of Cpf1 in complex with guide RNA and target DNA.” Cell (165) 2016, p.949-962; the entire contents of which is hereby incorporated by reference. 6.3. Type V CRISPR proteins [0078] In some embodiments, prime editors used herein comprise the type V CRISPR family includes Francisella novicida U112 Cpf1 (FnCpf1) also known as FnCas12a. FnCpf1 adopts a bilobed architecture with the two lobes connected by the wedge (WED) domain. The N-terminal REC lobe consists of two a-helical domains (REC1 and REC2) that have been shown to coordinate the crRNA-target DNA heteroduplex. The C-terminal NUC lobe consists of the C-terminal RuvC and Nuc domains involved in target cleavage, the arginine-rich bridge helix (BH), and the PAM- interacting (PI) domain. The repeat-derived segment of the crRNA forms a pseudoknot stabilized by intra-molecular base-pairing and hydrogen-bonding interactions. The pseudoknot is coordinated by residues from the WED, RuvC, and REC2 domains, as well as by two hydrated magnesium cations. Notably, nucleotides 1–5 of the crRNA are ordered in the central cavity of FnCas12a and adopt an A-form-like helical conformation. Conformational ordering of the seed sequence is facilitated by multiple interactions between the ribose and phosphate moieties of the crRNA backbone and FnCpf1 residues in the WED and REC1 domains. These include residues Thr16, Lys595, His804, and His881 from the WED domain and residues Tyr47, Lys51, Phe182, and Arg186 from the REC1 domain. The structure of the FnCas12a-crRNA complex further reveals that the bases of the seed sequence are solvent exposed and poised for hybridization with target DNA. Structural aspects of FnCpf1 are described by Swarts et al., Structural Basis for Guide RNA Processing and Seed-Dependent DNA Targeting by CRISPR-Cas12a, Molecular Cell 66, 221-233, April 20, 2017.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0079] Pre-crRNA processing: Essential residues for crRNA processing include His843, Lys852, and Lys869. Structural observations are consistent with an acid-base catalytic mechanism in which Lys869 acts as the general base catalyst to deprotonate the attacking 2’-hydroxyl group of U(-19), while His843 acts as a general acid to protonate the 5’-oxygen leaving group of A(-18). In turn, the side chain of Lys852 is involved in charge stabilization of the transition state. Collectively, these interactions facilitate the intra-molecular attack of the 20-hydroxyl group of U(-19) on the scissile phosphate and promote the formation of the 2’,3’-cyclic phosphate product. [0080] R-loop formation: The crRNA-target DNA strand heteroduplex is enclosed in the central cavity formed by the REC and NUC lobes and interacts extensively with the REC1 and REC2 domains. The PAM-containing DNA duplex comprises target strand nucleotides dT0–dT8 and non-target strand nucleotides dA(8)*–dA0* and is contacted by the PI, WED, and REC1 domains. The 5’-TTN-3’ PAM is recognized in FnCas12a by a mechanism combining the shape- specific recognition of a narrowed minor groove, with base-specific recognition of the PAM bases by two invariant residues, Lys671 and Lys613. Directly downstream of the PAM, the duplex of the target DNA is disrupted by the side chain of residue Lys667, which is inserted between the DNA strands and forms a cation-π stacking interaction with the dA0–dT0* base pair. The phosphate group linking target strand residues dT(-1) and dT0 is coordinated by hydrogen-bonding interactions with the side chain of Lys823 and the backbone amide of Gly826. Target strand residue dT(-1) bends away from residue T0, allowing the target strand to interact with the seed sequence of the crRNA. The non-target strand nucleotides dT1*–dT5* interact with the Arg692- Ser702 loop in FnCas12a through hydrogen-bonding and ionic interactions between backbone phosphate groups and side chains of Arg692, Asn700, Ser702, and Gln704, as well as main-chain amide groups of Lys699, Asn700, and Ser702. Alanine substitution of Q704 or replacement of residues Thr698–Ser702 in FnCas12a with the sequence Ala-Gly3 (SEQ ID NO: 115) substantially reduced DNA cleavage activity, suggesting that these residues contribute to R-loop formation by stabilizing the displaced conformation of the nontarget DNA strand. [0081] In the FnCas12a R-loop complex, the crRNA-target strand heteroduplex is terminated by a stacking interaction with a conserved aromatic residue (Tyr410). This prevents base pairing between the crRNA and the target strand beyond nucleotides U20 and dA(-20), respectively. Beyond this point, the target DNA strand nucleotides re-engage the non-target DNA strand, forming a PAM-distal DNA duplex comprising nucleotides dC(-21)–dA(-27) and dG21*–dT27*, respectively. The duplex is confined between the REC2 and Nuc domains at the end of the central channel formed by the REC and NUC lobes.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0082] Target DNA cleavage: FnCpf1 can independently accommodate both the target and non-target DNA strands in the catalytic pocket of the RuvC domain. The RuvC active site contains three catalytic residues (D917, E1006, and D1255). Structural observations suggest that both the target and non-target DNA strands are cleaved by the same catalytic mechanism in a single active site in Cpf1/Cas12a enzymes. [0083] Another type V CRISPR is AsCpf1 from Acidaminococcus sp BV3L6 (Yamano et al., Crystal structure of Cpf1 in complex with guide RNA and target DNA, Cell 165, 949-962, May 5, 2016) [0084] In certain embodiments, the nuclease comprises a Cas12f effector. Small CRISPR- associated effector proteins belonging to the type V-F subtype have been identified through the mining of sequence databases and members classified into Cas12f1 (Cas14a and type V-U3), Cas12f2 (Cas14b) and Cas12f3 (Cas14c, type V-U2 and U4). (See, e.g., Karvelis et al., PAM recognition by miniature CRISPR–Cas12f nucleases triggers programmable double-stranded DNA target cleavage. Nucleic Acids Research, 21 May 2020, 48(9), 5016–23 doi.org/10.1093/nar/gkaa208). Xu et al. described development of a 529 amino acid Cas12f-based system for mammalian genome engineering through multiple rounds of iterative protein engineering and screening. (Xu, X. et al., Engineered Miniature CRISPR-Cas System for Mammalian Genome Regulation and Editing. Molecular Cell, October 21, 2021, 81(20): 4333-45, doi.org/10.1016/j.molcel.2021.08.008). [0085] Exemplary CRISPR-Cas proteins and enzymes used in the Prime Editors herein include the following without limitation.
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 6 - Cas12b (C2c1) orthologs
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
6.4. Protospacer Adjacent Motif [0086] As used herein, the term “protospacer adjacent sequence” or “protospacer adjacent motif” or “PAM” refers to an approximately 2-6 base pair DNA sequence (or a 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, 10-, 11-, 12-long nucleotide sequence) that is an important targeting component of a Cas9 nuclease. Typically, the PAM sequence is on either strand, and is downstream in the 5' to 3' direction of Cas9 cut site. The canonical PAM sequence (i.e., the PAM sequence that is associated with the Cas9 nuclease of Streptococcus pyogenes or SpCas9) is 5'-NGG-3' wherein “N” is any nucleobase followed by two guanine (“G”) nucleobases. Different PAM sequences can be associated with different Cas9 nucleases or equivalent proteins from different organisms. In addition, any given Cas9 nuclease may be modified to alter the PAM specificity of the nuclease such that the nuclease recognizes alternative PAM sequence.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0087] For example, with reference to the canonical SpCas9 amino acid sequence, the PAM specificity can be modified by introducing one or more mutations, including (a) D1135V, R1335Q, and T1337R “the VQR variant”, which alters the PAM specificity to NGAN or NGNG, (b) D1135E, R1335Q, and T1337R “the EQR variant”, which alters the PAM specificity to NGAG, and (c) D1135V, G1218R, R1335E, and T1337R “the VRER variant”, which alters the PAM specificity to NGCG. In addition, the D1135E variant of canonical SpCas9 still recognizes NGG, but it is more selective compared to the wild type SpCas9 protein. [0088] It will also be appreciated that Cas9 enzymes from different bacterial species (i.e., Cas9 orthologs) can have varying PAM specificities and in some embodiments are therefore chosen based on the desired PAM recognition. For example, Cas9 from Staphylococcus aureus (SaCas9) recognizes NGRRT or NGRRN. In addition, Cas9 from Neisseria meningitis (NmCas) recognizes NNNNGATT. In another example, Cas9 from Streptococcus thermophilis (StCas9) recognizes NNAGAAW. In still another example, Cas9 from Treponema denticola (TdCas) recognizes NAAAAC. These examples are not meant to be limiting. It will be further appreciated that non- SpCas9s bind a variety of PAM sequences, which makes them useful to expand the range of sequences that can be targeted according to the disclosure. Furthermore, non-SpCas9s may have other characteristics that make them more useful than SpCas9. For example, Cas9 from Staphylococcus aureus (SaCas9) is about 1 kilobase smaller than SpCas9, so it can be packaged into adeno-associated virus (AAV). Further reference may be made to Shah et al., “Protospacer recognition motifs: mixed identities and functional diversity,” RNA Biology, 10(5): 891-899 (which is incorporated herein by reference). Gasiunas used cell-free biochemical screens to identify protospacer adjacent motif (PAM) and guide RNA requirements of 79 Cas9 proteins. (Gasiunas et al., A catalogue of biochemically diverse CRISPR-Cas9 orthologs, Nature Communications 11:5512 doi.org/10.1038/s41467-020-19344-1) The authors described 7 classes of gRNA and 50 different PAM requirement. [0089] Oh, Y. et al. describe linking reverse transcriptase to a Francisella novicida Cas9 [FnCas9(H969A)] nickase module. (Oh, Y. et al., Expansion of the prime editing modality with Cas9 from Francisella novicida, bioRxiv 2021.05.25.445577; doi.org/10.1101/2021.05.25.445577). By increasing the distance to the PAM, the FnCas9(H969A) nickase module expands the region of a reverse transcription template (RTT) following the primer binding site.
Attorney Ref: 38909-52322/WO Client Ref: 005WO 6.5. Prime Editors [0090] “Prime editor fusion protein” describes a protein that is used in prime editing. Prime editing uses CRISPR enzyme that nicks or cuts only single strand of double stranded DNA, i.e., a nickase; and a nickase can occur either naturally or by mutation or modification of a nuclease that makes double stranded cuts. Such an enzyme can be a catalytically-impaired Cas9 endonuclease (a nickase). Such an enzyme can be a Casl2a/b, MAD7, or variant thereof. The nickase is fused to an engineered reverse transcriptase (RT). The nickase is programmed (directed) with a prime- editing guide RNA (pegRNA). The skilled person in the art would appreciate that the pegRNA both specifies the target site and encodes the desired edit. Advantageously the nickase is a catalytically-impaired Cas9 endonuclease, a Cas9 nickase, that is fused to the reverse transcriptase. During genetic editing, the Cas9 nickase part of the protein is guided to the DNA target site by the pegRNA, whereby a nick or single stranded cut occurs. The reverse transcriptase domain then uses the pegRNA to template reverse transcription of the desired edit, directly polymerizing DNA onto the nicked target DNA strand. The edited DNA strand replaces the original DNA strand, creating a heteroduplex containing one edited strand and one unedited strand. Afterward, optionally, the prime editor (PE) guides resolution of the heteroduplex to favor copying the edit onto the unedited strand, completing the process (typically achieved with a nickase gRNA). [0091] As used herein, “PE1” refers to a PE complex comprising a fusion protein comprising Cas9(H840A) and a wild type MMLV RT having the following N-terminus to C-terminus structure: [NLS]-[Cas9(H840A)]- [linker]-[MMLV_RT(wt)] + a desired PEgRNA. In various embodiments, the prime editors disclosed herein is comprised of PE1. [0092] As used herein, “PE2” refers to a PE complex comprising a fusion protein comprising Cas9(H840A) and a variant MMLV RT having the following N-terminus to C-terminus structure: [NLS]-[Cas9(H840A)]- [linker]-[MMLV_RT(D200N)(T330P)(L603W)(T306K)(W313F)] + a desired atgRNA (PEgRNA). In various embodiments, the prime editors disclosed herein are comprised of PE2. [0093] In various embodiments, the prime editors disclosed herein are comprised of PE2 and co-expression of MMR protein MLH1dn, that is PE4. [0094] As used herein, “PE3” refers to PE2 plus a second-strand nicking guide RNA that complexes with the PE2 and introduces a nick in the non-edited DNA strand. The induction of the second nick increases the chances of the unedited strand, rather than the edited strand, to be repaired. In various embodiments, the prime editors disclosed herein are comprised of PE3.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0095] In various embodiments, the prime editors disclosed herein is comprised of PE3 and co-expression of MMR protein MLH1dn, that is PE5. [0096] As used herein, “PE3b” refers to PE3 but wherein the second-strand nicking guide RNA is designed for temporal control such that the second strand nick is not introduced until after the installation of the desired edit. This is achieved by designing a gRNA with a spacer sequence with mismatches to the unedited original allele that matches only the edited strand. Using this strategy, mismatches between the protospacer and the unedited allele should disfavor nicking by the sgRNA until after the editing event on the PAM strand takes place. 6.6. Guides for Prime Editing [0097] Anzalone et al., 2019 (Nature 576:149) describes prime editing and a prime editing complex using a type II CRISPR and can be used herein. A prime editing complex consists of a type II CRISPR PE protein containing an RNA-guided DNA-nicking domain fused to a reverse transcriptase (RT) domain and complexed with a pegRNA. The pegRNA comprises (5’ to 3’) a spacer that is complementary to the target sequence of a genomic DNA, a nickase (e.g. Cas9) binding site, a reverse transcriptase template including editing positions, and primer binding site (PBS). The PE–pegRNA complex binds the target DNA and the CRISPR protein nicks the PAM- containing strand. The resulting 3′ end of the nicked target hybridizes to the primer-binding site (PBS) of the pegRNA, then primes reverse transcription of new DNA containing the desired edit using the RT template of the pegRNA. The overall structure of the pegRNA is like that of a typical type II sgRNA with a reverse transcriptase template/primer binding site appended to the 3’ end. The structure leaves the PBS at the 3’ end of the pegRNA free to bind to the nicked strand complementary to the target which forms the primer for reverse transcription. [0098] Guide RNAs of CRISPRs differ in overall structure. For example, while the spacer of a type II gRNA is located at the 5’ end, the spacer of a type V gRNA is located towards the 3’ end, with the CRISPR protein (e.g. Cas12a) binding region located toward the 5’ end. Accordingly, the regions of a type V pegRNA are rearranged compared to a type II pegRNA. The overall structure of the pegRNA is like that of a typical type II sgRNA with a reverse transcriptase template/primer binding site appended to the 3’ end. The pegRNA comprises (5’ to 3’) a CRISPR protein-binding region, a spacer which is complementary to the target sequence of a genomic DNA, a reverse transcriptase template including editing positions, and primer binding site (PBS).
Attorney Ref: 38909-52322/WO Client Ref: 005WO 6.7. Attachment Site-Containing Guide RNA (atgRNA) [0099] As used herein, the term “attachment site-containing guide RNA” (atgRNA) and the like refer to an extended single guide RNA (sgRNA) comprising a primer binding site (PBS), a reverse transcriptase (RT) template sequence, and wherein the RT template encodes for an integration recognition site or a recombinase recognition site that can be recognized by a recombinase, integrase, or transposase. In some embodiments, the RT template comprises a clamp sequence and an integration recognition site. As referred to herein an atgRNA may be referred to as a guide RNA. An integration recognition site or recombinase target recognition site incorporated into the pegRNA is referred to as an attachment site containing guide RNA (atgRNA). [0100] As used herein, the term “cognate integration recognition site” or “integration cognate” or “cognate pair” refers to a first integration recognition site (e.g., any of the integration recognition sites described herein) and a second integration recognition site (e.g., any of the integration recognition sites described herein) that can be recombined. Recombination between a first integration recognition site (e.g., any of the integration recognition sites described herein) and a second recognition site (e.g., any of the integration recognition sites described herein) is mediated by functional symmetry between the two integration recognition sites and the central dinucleotide of each of the two integration recognition sites. In some cases, a first integration recognition site (e.g., any of the integration recognition sites described herein) that can be recombined with a second integration recognition site (e.g., any of the integration recognition sites described herein) are referred to as a “cognate pair.” A non-limiting example of a cognate pair include an attB site and an attP site, whereby a serine integrase mediates recombination between the attB site and the attP site. FIGs.1A-1E show optimization of the integration recognition site. [0101] In typical embodiments, an atgRNA comprises a reverse transcriptase template that encodes, partially or in its entirety, an integration recognition site (also referred to as an integration target recognition site) or a recombinase recognition site (also referred to as a recombinase target recognition site). The integration target recognition site, which is to be placed at a desired location in the genome or intracellular nucleic acid, is referred to as a “beacon,” a “beacon” site or an “attachment site” or a “landing pad” or “landing site.” An integration target recognition site or recombinase target recognition site incorporated into the pegRNA is referred to as an attachment site containing guide RNA (atgRNA). [0102] During genome editing, the primer binding site allows the 3’ end of the nicked DNA strand to hybridize to the atgRNA, while the RT template serves as a template for the
Attorney Ref: 38909-52322/WO Client Ref: 005WO synthesis of edited genetic information. The atgRNA is capable for instance, without limitation, of (i) identifying the target nucleotide sequence to be edited and (ii) encoding new genetic information that replaces (or in some cases adds) the targeted sequence. In some embodiments, the atgRNA is capable of (i) identifying the target nucleotide sequence to be edited and (ii) encoding an integration site that replaces (or inserts/deletes within) the targeted sequences. [0103] In some embodiments, where the system contains a first atgRNA and a second atgRNA (see FIG.2), the first atgRNA and the second atgRNA are an at least first pair of atgRNAs, where the at least first pair of atgRNAs have domains that are capable of guiding the gene editor protein or prime editor fusion protein to a target sequence, the first atgRNA further includes a first RT template that comprises at least a portion of the first integration recognition site; and the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site, and the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site. [0104] In some embodiments, the first atgRNA’s reverse transcriptase template encodes for a first single-stranded DNA sequence (i.e., a first DNA flap) that contains a complementary region to a second single-stranded DNA sequence (i.e., a second DNA flap) encoded by a second atgRNA comprising a second reverse transcriptase template. In certain embodiments, the complementary region between the first and second single-stranded DNA sequences is comprised of more than 5 consecutive bases of an integrase target recognition site. In certain embodiments, the complementary region between the first and second single-stranded DNA sequences is comprised of more than 10 consecutive bases of an integrase target recognition site. In certain embodiments, the complementary region between the first and second single-stranded DNA sequences is comprised of more than 20 consecutive bases of an integrase target recognition site. In certain embodiments, the complementary region between the first and second single-stranded DNA sequences is comprised of more than 30 consecutive bases of an integrase target recognition site. Use of two guide RNAs that are (or encode DNA that is) partially complementarity to each other and comprised of consecutive bases of an integrase target recognition site are referred to as dual, paired, annealing, complementary, or twin attachment site-containing guide RNAs (atgRNAs). In certain embodiments, use of two guide RNAs that are (or encode DNA that is) full complementarity to each other and comprised of consecutive bases of an integrase target recognition site are referred to as dual, paired, annealing, complementary, or twin attachment site- containing guide RNAs (atgRNAs).
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0105] In some embodiments, upon introducing the nucleic acid construct into a cell, the first atgRNA incorporates the first integration recognition site into the cell’s genome at the target sequence. 6.7.1. AtgRNAs Comprising a Clamp Segment [0106] This disclosure features an attachment site containing guide RNA (atgRNA) comprising a clamp segment (see, e.g., FIGs.3A-3C and FIGs.4A-4C). The clamp segment is designed, at least in part, to enhance integration efficiency of the integration recognition site at the double-stranded target DNA sequence (e.g., as compared to integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment). [0107] In one embodiment, this disclosure features an attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase binding domain to the target DNA sequence; a reverse transcriptase template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed the sequence is at least partially complementary (e.g., the reverse transcribed clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity) to a second genomic site of the first strand of the double-stranded target DNA sequence; and a primer binding site complementary to a third genomic site of the double-stranded target DNA sequence, wherein the third genomic site is on a second strand of the target DNA sequence. In some embodiments, the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site. In some embodiments, the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site and the clamp segment. In some embodiments, upon introducing into a cell with a gene editor polypeptide, all or a portion of the single-stranded DNA sequence comprising the integration recognition site is site-specifically integrated into the double-stranded target DNA sequence. In some embodiments, upon introducing the atgRNA into a cell with a gene editor polypeptide, the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0108] In one embodiment, this disclosure features an attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase binding domain to the target DNA sequence; a reverse transcriptase template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed the sequence is at least partially complementary (e.g., the reverse transcribed clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity) to a second genomic site of the first strand of the double-stranded target DNA sequence; and a primer binding site comprising: a sequence complementary to a third genomic site the double-stranded DNA target sequence, wherein the third genomic site is on a second strand of the target DNA sequence; and an anti-reverse transcriptase template (anti-RTT) sequence comprising a sequence at least partially complementary (e.g., the anti-RTT sequence comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity) to all or a portion of the RT template. In some embodiments, the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site. In some embodiments, the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site and the clamp segment. In some embodiments, upon introducing into a cell with a gene editor polypeptide, all or a portion of the single-stranded DNA sequence comprising the integration recognition site is site-specifically integrated into the double-stranded target DNA sequence. In some embodiments, upon introducing the atgRNA into a cell with a gene editor polypeptide, the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment. [0109] In one embodiment, this disclosure features an attachment site containing guide RNA (atgRNA), comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase binding domain to the target DNA sequence; a clamp segment at least partially complementary (e.g., the clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity) to a second genomic site of the first strand of the double- stranded target DNA sequence; a reverse transcriptase (RT) template comprising an integration
Attorney Ref: 38909-52322/WO Client Ref: 005WO recognition site; and a primer binding site complementary to a third genomic site of the double- stranded target DNA sequence, wherein the third genomic site is on a second strand of the target DNA sequence; wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site. In some embodiments, upon introducing into a cell with a gene editor polypeptide, all or a portion of the single-stranded DNA sequence comprising the integration recognition site is site-specifically integrated into the double-stranded target DNA sequence. In some embodiments, upon introducing the atgRNA into a cell with a gene editor polypeptide, the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment. In some embodiments, the atgRNA includes a transcription termination signal that prevents reverse transcription of the clamp segment. In one embodiment, the internucleoside linkage connecting the RT template to the clamp signal comprises a moiety that prevents reverse transcription of the clamp segment. [0110] In one embodiment, this disclosure features an attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double-stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase binding domain to the target DNA sequence; a clamp segment at least partially complementary (e.g., the clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity) to a second genomic site of the first strand of the double- stranded target DNA sequence; a reverse transcriptase (RT) template comprising an integration recognition site; and a primer binding site comprising: a sequence complementary to a third genomic site of the double-stranded DNA target sequence, wherein the third genomic site is on a second strand of the target DNA sequence; and an anti-reverse transcriptase template (anti-RTT) sequence comprising a sequence at least partially complementary (e.g., the anti-RTT sequence comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity) to all or a portion of the RT template, wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site. In some embodiments, upon introducing into a cell with a gene editor polypeptide, all or a portion of the single-stranded DNA sequence comprising the integration recognition site is site-specifically integrated into the double-stranded target DNA sequence. In some embodiments, upon introducing the atgRNA into a cell with a gene editor polypeptide, the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site
Attorney Ref: 38909-52322/WO Client Ref: 005WO at the double-stranded target DNA sequence as compared to integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment. In some embodiments, the atgRNA includes a transcription termination signal that prevents reverse transcription of the clamp segment. In one embodiment, the internucleoside linkage connecting the RT template to the clamp signal comprises a moiety that prevents reverse transcription of the clamp segment. [0111] In some embodiments, the atgRNA has a length of 150 to 210 nucleotides (e.g., a length of 150 to 200 nucleotides, a length of 150 to 190 nucleotides, a length of 150 to 180 nucleotides, a length of 150 to 170 nucleotides, a length of 150 to 160 nucleotides, a length of 160 to 210 nucleotides, a length of 160 to 200 nucleotides, a length of 160 to 190 nucleotides, a length of 160 to 180 nucleotides, a length of 160 to 170 nucleotides, a length of 170 to 210 nucleotides, a length of 170 to 200 nucleotides, a length of 170 to 190 nucleotides, a length of 170 to 180 nucleotides, a length of 180 to 210 nucleotides, a length of 180 to 200 nucleotides, a length of 180 to 190 nucleotides, a length of 190 to 210 nucleotides, a length of 190 to 200 nucleotides, or a length of 200 to 210 nucleotides. [0112] In some embodiments, the RT template has a length of 10 nucleotides to 200 nucleotides (e.g., a length 10 to 180 nucleotides, a length of 10 to 160 nucleotides, a length of 10 to 140 nucleotides, a length of 10 to 120 nucleotides, a length of 10 to 100 nucleotides, a length of 10 to 80 nucleotides, a length of 10 to 60 nucleotides, a length of 10 to 40 nucleotides, a length of 10 to 20 nucleotides, a length of 20 to 200 nucleotides, a length of 20 to 180 nucleotides, a length of 20 to 160 nucleotides, a length of 20 to 140 nucleotides, a length of 20 to 120 nucleotides, a length of 20 to 100 nucleotides, a length of 20 to 80 nucleotides, a length of 20 to 60 nucleotides, a length of 20 to 40 nucleotides, a length of 40 to 200 nucleotides, a length of 40 to 180 nucleotides, a length of 40 to 160 nucleotides, a length of 40 to 140 nucleotides, a length of 40 to 120 nucleotides, a length of 40 to 100 nucleotides, a length of 40 to 80 nucleotides, a length of 40 to 60 nucleotides, a length of 60 to 200 nucleotides, a length of 60 to 180 nucleotides, a length of 60 to 160 nucleotides, a length of 60 to 140 nucleotides, a length of 60 to 120 nucleotides, a length of 60 to 100 nucleotides, a length of 60 to 80 nucleotides, a length of 80 to 200 nucleotides, a length of 80 to 180 nucleotides, a length of 80 to 160 nucleotides, a length of 80 to 140 nucleotides, a length of 80 to 120 nucleotides, a length of 80 to 100 nucleotides, a length of 100 to 200 nucleotides, a length of 100 to 180 nucleotides, a length of 100 to 160 nucleotides, a length of 100 to 140 nucleotides, a length of 100 to 120 nucleotides, a length of 120 to 200 nucleotides, a length of 120 to 180 nucleotides, a length of 120 to 160 nucleotides, a length of 120 to 140 nucleotides, a length
Attorney Ref: 38909-52322/WO Client Ref: 005WO of 140 to 200 nucleotides, a length of 140 to 180 nucleotides, a length of 140 to 160 nucleotides, a length of 160 to 200 nucleotides, a length of 160 to 180 nucleotides, or a length of 180 to 200). In some embodiments, the RT template has a length of 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60 or more nucleotides. [0113] In typical embodiments, the atgRNA binds at least a first target genomic site (via spacer sequence), a second genomic site (via clamp sequence), and a third genomic site (via primer binding site). In some embodiments, the atgRNA is designed to optimize the distance between the second genomic site and the third genomic site. Optimization of the distance between the second genomic site and the third genomic site can be based on the length of the RT template to be integrated into the target sequence, the sequence of the target DNA sequence, among other factors. In some cases, the distance between the second genomic site and the third genomic site can be referred to as the “separation” between the second genomic site and the third genomic site. [0114] In some embodiments, the atgRNA is designed such that there is partial genomic replacement of the target DNA sequence upon incorporation of the integration recognition sequence into the target DNA sequence. In such cases, the atgRNA is designed so that the second genomic site and the third genomic site are separated by one or more genomic bases (e.g., one or more base pairs) in the double-stranded target DNA sequence, whereby the one or more genomic bases are deleted upon incorporation of the integration recognition site. [0115] In some embodiments, the atgRNA is designed such that partial genomic replacement comprises deletion of 1 to 10 genomic bases (e.g., base pairs), 11 to 20 genomic bases, 21 to 30 genomic bases, 31 to 40 genomic bases 41 to 50 genomic bases, 51 to 60 genomic bases, 61 to 70 genomic bases, 71 to 80 genomic bases, 81 to 90 genomic bases, or 91-100 genomic bases. In some embodiments, the atgRNA is designed such that partial genomic replacement comprises deletion of 100 to 1000 genomic bases (e.g., base pairs). In some embodiments, the atgRNA is designed such that partial genomic replacement comprises deletion of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, or 45 genomic bases (e.g., base pairs). [0116] In some embodiments, the second genomic site and the third genomic site are separated by one or more genomic bases (e.g., one or more base pairs) in the double-stranded target DNA sequence. In some embodiments, by relaxing the requirement that the clamp binding site (i.e., second genomic site) be immediately adjacent to the primer binding site (i.e., third genomic site), this disclosure provides for guide RNA compositions which are capable of being reduced in total
Attorney Ref: 38909-52322/WO Client Ref: 005WO sequence length as compared to standard prime editing guide RNAs or gene editing RNAs capable of integrase target site recognition site insertion (i.e., standard atgRNAs). The clamp segment binding site (i.e., second genomic site) can be selected from within a large genomic window, wherein for example the window frame can be shifted to screen for variable sequence distance from the third genomic site and/or constrained to select for efficient hybridization to a defined length clamp segment. In some embodiments, the atgRNA clamp segment is less than 50 nucleotides in length. In some embodiments, the atgRNA clamp segment is less than 25 nucleotides in length. In some embodiments, the atgRNA clamp segment is less than 15 nucleotides in length. In some embodiments, the atgRNA clamp segment is less than 10 nucleotides in length. In some embodiments, by relaxing the requirement that the clamp binding site (i.e., second genomic site) be immediately adjacent to the primer binding site (i.e., third genomic site), this disclosure provides for atgRNA compositions which are capable of increasing efficiency of the integration recognition site genomic incorporation (i.e., beacon placement). In some embodiments, increased efficiency is due to favored repair by endogenous cellular machinery. In some embodiments, increased efficiency is due to reduced 3’ flap thermodynamic strain during genomic binding (i.e., during “flap equilibration” processes see FIGs.4A-4C)). [0117] In some embodiments, the second genomic site and the third genomic site are separated by 1 to 10 genomic bases, 11 to 20 genomic bases, 21 to 30 genomic bases, 31 to 40 genomic bases, or 41 to 45 genomic bases. In some embodiments, the second genomic site and the third genomic site are separated by 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, or 45 genomic bases (e.g., base pairs). In some embodiments, the second genomic site and the third genomic site are separated by at least 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, or 1000 (1 kilobase distance) genomic bases. In some embodiments, the second genomic site and the third genomic site are separated by more than 1 kilobase. In some embodiments, second genomic site and the third genomic site are located on different chromosomes. [0118] In some embodiments, the second genomic site and the third genomic site are adjacent in the double-stranded target DNA sequence. In some embodiments, the genomic bases (e.g., base pairs) that are between the second genomic site and the third genomic site are not incorporated into the target DNA sequence, thereby removing (e.g., deleting) these genomic bases from the target DNA sequence. [0119] In certain embodiments, the guide RNA is circularized. In some embodiments, guide RNA circularization is mediated by 5’ and 3’ terminus dimerization motifs. In some embodiments, the
Attorney Ref: 38909-52322/WO Client Ref: 005WO dimerization motifs are kissing loops. In some embodiments, guide RNA circularization is mediated by one or more of a ribozyme or a ligase. In some embodiments, guide RNA circularization is mediated by covalently or non-covalently linking the 5’ and 3’ termini. [0120] In typical embodiments, one or more guide RNA is comprised in one or more RNA polynucleotides or DNA polynucleotides. 6.7.2. Clamp Segment [0121] This disclosures features an attachment site containing guide RNA (atgRNA) comprising a clamp segment that is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence. In some instances, the clamp is designed to enhance efficiency of integrating the integration recognition site into the target DNA Sequence. For example, upon introducing the atgRNA comprising the clamp into a cell with a gene editor polypeptide, the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment. In some embodiments, inclusion of a clamp segment in the atgRNA allows for a reduction in the number of nucleotides in the primer binding site (PBS) need to bind the third genomic site and facilitate reverse transcription. This in turn enables design of atgRNA that are shorter than guide RNAs used for other gene editing methods, for example, prime editing. [0122] In some embodiments, the clamp segment has a length of 1 to 10 nucleotides, 11 to 20 nucleotides, 21 to 30 nucleotides, or 31 to 40 nucleotides. In some embodiments, the clamp segment has a length of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, or more nucleotides. In some embodiments, the clamp segment has a length of 10 nucleotides. [0123] In some embodiments, the clamp segment comprises a sequence that is partially complementary to the second genomic site of the first strand of the double-stranded target DNA sequence. In some embodiments, a clamp segment that is partially complementary o the second genomic site of the first strand of the double-stranded target DNA sequence includes a clamp segment having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity to the second genomic site. In some embodiments, the clamp segment comprises a sequence that is complementary to the second genomic site of the first strand of the double-stranded target DNA sequence.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0124] In some embodiments, the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with at least a first nucleotide mismatch. In some embodiments, the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with a first nucleotide mismatch and a second nucleotide mismatch. In some embodiments, the first nucleotide mismatch and the second nucleotide mismatch are consecutive. In some embodiments, the first nucleotide mismatch and the second nucleotide mismatch are non- consecutive. In some embodiments, the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with the first nucleotide mismatch, the second nucleotide mismatch, and a third nucleotide mismatch. In some embodiments, the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with the first nucleotide mismatch, the second nucleotide mismatch, the third nucleotide mismatch, and a fourth nucleotide mismatch. [0125] In one embodiment, the reverse transcriptase template comprises an integration recognition site and a clamp segment. In such embodiments, the clamp segment includes a sequence where upon being reverse transcribed comprises a sequence at least partially complementary (e.g., the reverse transcribed clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity) to a second genomic site of the first strand of the double-stranded target DNA sequence. In some embodiments, the clamp segment comprises a sequence (e.g., a reverse complement of the clamp “rcClamp”) that upon being reverse transcribed is complementary to the second genomic site of the first strand of the double-stranded target DNA sequence. 6.7.3. Anti-Reverse Transcriptase Template (anti-RTT) Sequence [0126] In some embodiments, an atgRNA comprises an anti-reverse transcriptase template (anti- RTT) sequence (see, e.g., FIGs.6A-6B). In some embodiments, the anti-RTT sequence is capable of binding to all or a portion of the RT template, thereby blocking the 3’ end of the atgRNA from degradation (e.g., degradation by exonucleases). In such embodiments, upon introducing the atgRNA into a cell along with a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain, and after binding of the PBS to the third genomic site and initiation of a reverse transcription reaction, the reverse transcriptase can displace the anti- RTT sequence, thereby making the RT template sequence accessible for reverse transcription.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0127] In some embodiments, upon being introduced into a cell, the binding of the anti-RTT sequence to all or a portion of the RT template increases stability and half-life of the atgRNA by protecting the atgRNA from exonuclease activity. In some embodiments, upon introducing the atgRNA into a cell, the anti-RTT sequence increases stability and half-life of the atgRNA as compared to an atgRNA that does not comprise the anti-RTT sequence. [0128] In some embodiments, the anti-RTT sequence comprises a sequence that is at least partially complementary to the RT template. For example, the anti-RTT sequence comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity to the RT template. In some embodiments, the anti-RTT sequence comprises a sequence that is complementary to all or a portion of the RT template. [0129] In some embodiments, the anti-RTT sequence has a length of 1 to 20 nucleotides, 5 to 15 nucleotides, or 10 to 20 nucleotides. In some embodiments, the anti-RTT sequence has a length of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or more nucleotides. In one embodiment, the anti-RTT sequence has a length of 10 nucleotides. [0130] In some embodiments, the atgRNA further comprises a 3’ extension (also referred to as anti-RTT sequence) capable of binding one or more consecutive nucleotides of the reverse transcriptase template first segment. In some embodiments, the atgRNA further comprises a 3’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template second segment. In some embodiments, the atgRNA further comprises a 3’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template first and second segment. In some embodiments, the atgRNA further comprises a 5’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template first segment. In some embodiments, the atgRNA further comprises a 5’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template second segment. In some embodiments, the atgRNA further comprises a 5’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template first and second segment. Such 5’ and/or 3’ extensions may increase the stability of the guide RNA and/or prevent spurious guide RNA genomic binding. In some embodiments, the stability is increased from cellular exonucleases, endonucleases, or hydrolysis. 6.7.4. Additional Embodiments for atgRNA [0131] The present disclosure contemplates a split atgRNA comprised of two or more polynucleotides that are capable of forming a atgRNA complex (see Liu et al; doi:
Attorney Ref: 38909-52322/WO Client Ref: 005WO 10.1038/s41587-022-01255-9). In some embodiments, the atgRNA may be a atgRNA complex, wherein the atgRNA complex is comprised of at least two polynucleotide components. In certain embodiments, the atgRNA complex is comprised of a first polynucleotide component and a second polynucleotide component. In certain embodiments, the first polynucleotide component comprises a spacer sequence complementary to a target first genomic and a scaffold capable of binding to a DNA binding nickase, and wherein the second polynucleotide component comprises a reverse transcriptase template comprised of at least a first segment and a second segment, wherein said first segment encodes a clamp segment complementary to a second genomic site and said second segment encodes an integrase target recognition site, a primer binding site complementary to a third genomic site, and a RNA-protein recruitment domain, and wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence. In some embodiments, the RNA- protein recruitment domain is a MS2 hairpin. [0132] In some embodiments, at least one of the first polynucleotide component and the second polynucleotide component are circularized. In some embodiments, circularization is mediated by a ribozyme or ligase. In some embodiments, circularization is mediated by covalently or non- covalently linking the 5’ and 3’ termini. [0133] In certain embodiments, the atgRNA or the atgRNA complex further comprises one or more of an RNA-protein recruitment domain, RNA-RNA recruitment domain, a transcriptional termination signal, a reverse transcription termination signal, an RNA ribozyme, or a chemical linker. [0134] In typical embodiments, one or more atgRNA complex is comprised in one or more RNA polynucleotides or DNA polynucleotides. [0135] In some embodiments, the atgRNA or the atgRNA complex reverse transcriptase template encodes for a first single-stranded DNA sequence (i.e., a first DNA flap) that contains a complementary region to a second single-stranded DNA sequence (i.e., a second DNA flap) encoded by a second atgRNA comprised of a reverse transcriptase template. In certain embodiments, the complementary region between the first and second single-stranded DNA sequences is comprised of more than 10 consecutive bases of an integrase target recognition site. In certain embodiments, the complementary region between the first and second single-stranded DNA sequences is comprised of more than 20 consecutive bases of an integrase target recognition site. In certain embodiments, the complementary region between the first and second single-
Attorney Ref: 38909-52322/WO Client Ref: 005WO stranded DNA sequences is comprised of more than 30 consecutive bases of an integrase target recognition site. [0136] In some embodiments, the integrase target recognition site is an Bxb1 attB or attP sequence. In some embodiments, the att-site central dinucleotide is not GT or CA. [0137] In typical embodiments, one or more atgRNA or the atgRNA complex is comprised in one or more RNA polynucleotides or DNA polynucleotides. [0138] In some embodiments, the atgRNA or the atgRNA complex reverse transcriptase template encodes for a first single-stranded DNA sequence (i.e., a first DNA flap) that contains a complementary region to a second single-stranded DNA sequence (i.e., a second DNA flap) encoded by a second atgRNA comprised of a reverse transcriptase template. In certain embodiments, the complementary region between the first and second single-stranded DNA sequences is comprised of more than 10 consecutive bases of an integrase target recognition site. In certain embodiments, the complementary region between the first and second single-stranded DNA sequences is comprised of more than 20 consecutive bases of an integrase target recognition site. In certain embodiments, the complementary region between the first and second single- stranded DNA sequences is comprised of more than 30 consecutive bases of an integrase target recognition site. [0139] In some embodiments, the integrase target recognition site is an Bxb1 attB or attP sequence. In some embodiments, the att-site central dinucleotide is not GT or CA. [0140] In typical embodiments, one or more atgRNA or the atgRNA complex is comprised in one or more RNA polynucleotides or DNA polynucleotides. 6.7.5. Non-limiting Example AtgRNAs [0141] Table 9 includes atgRNAs, sgRNAs and nicking guides that can be used herein. Spacers are labeled in capital font (SPACER), RT regions in bold capital (RT REGION), AttB sites in bold lower case (attb site), and PBS in capital italics (PBS). Unless otherwise denoted, the AttB is for Bxb1.
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
_
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
_ _
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
Attorney Ref: 38909-52322/WO Client Ref: 005WO Table 9
6.8. Systems and Methods Comprising an atgRNA having a Clamp [0142] This disclosure also features systems for site-specifically incorporating an integration recognition site into a mammalian cell genome at a desired target site (e.g., a double stranded target DNA sequence), comprising: any of the attachment site containing guide RNAs (atgRNAs) described in Section 6.7; any of the gene editor polypeptides described herein, for example, gene editor polypeptides comprising a DNA binding nickase domain linked to a reverse transcriptase domain that is capable of incorporating the integration recognition site into the target DNA sequence, wherein, upon introducing into the cell, the system integrates the integration recognition site into the target DNA sequence. In some embodiments, upon introducing the system into the
Attorney Ref: 38909-52322/WO Client Ref: 005WO cell, the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment. [0143] This disclosure also features methods for site-specifically incorporating an integration recognition site into a mammalian cell genome at a desired target site (e.g., a double stranded target DNA sequence), comprising: incorporating an integration target recognition site into the genome by delivering into the cell any of the attachment site containing guide RNAs (atgRNAs) described in Section 6.7; and any of the gene editor polypeptides described herein or any of the polynucleotides encoding a gene editor polypeptide, for example, a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain, wherein the integration recognition site is incorporated into the target DNA sequence by the gene editor polypeptide; and optionally, a nicking gRNA. In some embodiments, the method using the atgRNA comprising the clamp segment enhances integration efficiency of the integration recognition site at the target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment. [0144] This disclosure also features a system for site-specifically integrating a donor polynucleotide template into a mammalian cell genome at a desired target site (e.g., a double stranded target DNA sequence), comprising: any of the attachment site containing guide RNAs (atgRNAs) described in Section 6.7; any of the gene editor polypeptides described herein, for example, a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain that is capable of incorporating the integration recognition site into the target DNA sequence; an integration enzyme; and an donor polynucleotide template linked to a sequence that is an integration cognate of the integration recognition site present in the atgRNA, whereby the integration enzyme integrates the donor polynucleotide template into the target DNA sequence. In some embodiments of the system, upon introducing the system into the cell, the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment. [0145] This disclosure also features a method for site-specifically integrating a donor polynucleotide template into a genome of a cell, comprising: incorporating an integration target
Attorney Ref: 38909-52322/WO Client Ref: 005WO recognition site into the genome by delivering into the cell: any of the attachment site containing guide RNAs (atgRNAs) described in Section 6.7; and any of the gene editor polypeptides described herein or any of the polynucleotides encoding a gene editor polypeptide, for example, a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain that is capable of incorporating the integration recognition site into the target DNA sequence; and optionally, a nicking gRNA; and integrating the donor polynucleotide template into the genome by delivering into the cell: an integration enzyme; and a donor polynucleotide template, wherein the donor polynucleotide template is linked to a sequence that is an integration cognate of the integration recognition site present in the atgRNA, wherein the donor polynucleotide is integrated into the cellular genome at the incorporated genomic integration recognition site by the integration enzyme. In some embodiments, the method using the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment. In some embodiments, the method using the atgRNA comprising the clamp enhances integration efficiency of the donor polynucleotide template site at the target DNA sequence as compared to the integration efficiency of the donor polynucleotide template at the target DNA sequence when using an atgRNA that does not comprise the clamp segment. In some embodiments, the atgRNA, the gene editor polypeptide (or the polynucleotide encoding the gene editor polypeptide), the integration enzyme, and the donor polynucleotide template are introduced into the cell concurrently. 6.9. Integrases/Recombinases and Integration/Recombination Sites [0146] In typical embodiments, a gene editor polypeptide described herein contains an integrase or recombinase. In some embodiments, the integrase is delivered as a protein or the integrase is encoded in a delivered polynucleotide. In some embodiments, the integration enzyme is selected from the group consisting of Dre, Vika, Bxb1, ^C31, RDF, ^BTl, R1, R2, R3, R4, R5, TP901-1, A118, ^FCl, ^C1, MR11, TG1, ^370.l, Wβ, BL3, SPBc, K38, Peaches, Veracruz, Rebeuca, Theia, Benedict, KSSJEB, PattyP, Doom, Scowl, Lockley, Switzer, Bob3, Troube, Abrogate, Anglerfish, Sarfire, SkiPole, ConceptII, Museum, Severus, Airmid, Benedict, Hinder, ICleared, Sheen, Mundrea, BxZ2, ^RV, retrotransposases encoded by a Tc1/mariner family member including but not limited to retrotransposases encoded by LI, Tol2, Tel, Tc3, Himar 1 (isolated from the horn fly, Haematobia irritans), Mos1 (Mosaic element of Drosophila mauritiana), and Minos, and any mutants thereof. As can be used herein, Xu et al describes
Attorney Ref: 38909-52322/WO Client Ref: 005WO methods for evaluating integrase activity in E. coli and mammalian cells and confirmed at least R4, φC31, φBT1, Bxb1, SPBc, TP901-1 and Wβ integrases to be active on substrates integrated into the genome of HT1080 cells (Xu et al., 2013, Accuracy and efficiency define Bxb1 integrase as the best of fifteen candidate serine recombinases for the integration of DNA into the human genome. BMC Biotechnol.2013 Oct 20;13:87. doi: 10.1186/1472-6750-13-87). Durrant describes new large serine recombinases (LSRs) divided into three classes distinguished from one another by efficiency and specificity, including landing pad LSRs which outperform wild-type Bxb1 in episomal and chromosomal integration efficiency, LSRs that achieve both efficient and site- specific integration without a landing pad, and multi-targeting LSRs with minimal site-specificity. Additionally, embodiments can include any serine recombinase such as BceINT, SSCINT, SACINT, and INT10 (see Ionnidi et al., 2021; Drag-and-drop genome insertion without DNA cleavage with CRISPR directed integrases. bioRxiv 2021.11.01.466786, doi.org/10.1101/2021.11.01.466786). In some embodiments, the integration site can be selected from an attB site, an attP site, an attL site, an attR site, a lox71 site a Vox site, or a FRT site. [0147] It will be appreciated that desired activity of integrases, transposases and the like can depend on nuclear localization. In certain embodiments, prokaryotic enzymes are adapted to modulate nuclear localization. In certain embodiments, eukaryotic or vertebrate enzymes are adapted to modulate nuclear localization. In certain embodiments, the disclosure provides fusion or hybrid proteins. Such modulation can comprise addition or removal of one or more nuclear localization signal (NLS) and/or addition or removal of one or more nuclear export signal (NES). Xu et al compared derivatives of fourteen serine integrases that either possess or lack a nuclear localization signal (NLS) to conclude that certain integrases benefit from addition of an NLS whereas others are transported efficiently without addition, and a major determinant of activity in yeast and vertebrate cells is avoidance of toxicity. (Xu et al., 2016, Comparison and optimization of ten phage encoded serine integrases for genome engineering in Saccharomyces cerevisiae. BMC Biotechnol. 2016 Feb 9; 16:13. doi: 10.1186/s12896-016-0241-5). Ramakrishnan et al. systematically studied the effect of different NES mutants developed from mariner-like elements (MLEs) on transposase localization and activity and concluded that nuclear export provides a means of controlling transposition activity and maintaining genome integrity. (Ramakrishnan et al. Nuclear export signal (NES) of transposases affects the transposition activity of mariner-like elements Ppmar1 and Ppmar2 of moso bamboo. Mob DNA. 2019 Aug 19;10:35. doi:10.1186/s13100-019-0179-y). The methods and constructs are used to modulate nuclear localization of system components of the disclosure.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0148] In typical embodiments, the integrase used herein is selected from below (Table 10).
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
[0149] Sequences of insertion sites (i.e., recognition target sites) suitable for use in embodiments of the disclosure are presented below (Table 11). FIGs. 1A-1E show analysis of effect of variant AttP sites on integration efficiency.
Attorney Ref: 38909-52322/WO Client Ref: 005WO
_
Attorney Ref: 38909-52322/WO Client Ref: 005WO
_
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
Attorney Ref: 38909-52322/WO Client Ref: 005WO
_
Attorney Ref: 38909-52322/WO Client Ref: 005WO
6.10. Gene Editor Polypeptide Compositions [0150] This disclosure features a gene editor polypeptide useful for site-specifically integrating an integration recognition site into a target nucleic acid sequence (i.e., beacon placement), for example by using a polypeptide comprising a DNA binding nuclease linked to a reverse transcriptase, and integrating an exogenous nucleic acid sequence at the integrated integration recognition site (i.e., PASTE) by introducing into the cell the exogenous nucleic acid and an integration enzyme. The gene editor polypeptide includes a polypeptide comprised of a (i) DNA binding nickase (optionally with nickase activity), (ii) a reverse transcriptase, and (iii) an integration enzyme (e.g., an integrase or a recombinase), wherein at least any two elements of (i),
Attorney Ref: 38909-52322/WO Client Ref: 005WO (ii), or (iii) are linked via at least a first C-terminal linker. In some embodiments, the gene editor polypeptide includes a second linker. For example, where a gene editor polypeptide includes a first and a second linker, the first linker links a first domain (e.g., a DNA binding) to a second domain (e.g., a RT) and the second linker links the second domain to a third domain (e.g., an integration enzyme). In some embodiments, the orientation of the DNA binding nuclease, the RT, and the integration enzyme are configured for example from N-terminus to C-terminus: a DNA binding nuclease domain, a RT, and an integration enzyme; a DNA binding nuclease, an integration enzyme, and a RT; an integration enzyme, a DNA binding nuclease, and a RT; an integration enzyme, a RT, and a DNA binding nuclease; a RT, a DNA binding nuclease, and an integration enzyme; and a RT, an integration enzyme, and a DNA binding nuclease, where each of the domains are linked via a linker. In one embodiment, the C-terminus of the DNA binding nuclease is linked to the reverse transcriptase, wherein the C-terminus of the reverse transcriptase is linked to the integration enzyme. In one embodiment, the C-terminus of the integration enzyme is linked to the DNA binding nuclease, wherein the C-terminus of the DNA binding nuclease is linked to a reverse transcriptase. [0151] In some embodiments, the gene editor polypeptide is comprised of at least two of any one of (i) a DNA binding nickase, (ii) a reverse transcriptase, and (iii) an integration enzyme wherein at least any two elements of (i), (ii), or (iii) are linked via at least a first C-terminal linker. [0152] In certain embodiments, the gene editor polypeptide can be comprised of (i) a DNA binding nickase, (ii) a reverse transcriptase, and (iii) an integration enzyme (e.g., an integrase or a recombinase), wherein at least any two elements of (i), (ii), or (iii) may be linked via two or more C-terminal linkers at the same linking position. For example, a gene editor polypeptide includes a DNA binding nickase linked via two linkers to (a reverse transcriptase, and the reverse transcriptase domain fused or linked to an integration enzyme. [0153] In some embodiments, a linker (e.g., the first linker, the second linker, or both) is selected from a flexible linker, a semi-flexible linker, a rigid linker, a flexible-rigid linker. [0154] In some embodiments, the gene editor polypeptide includes at least one flexible linker. Non-limiting examples of flexible linkers include L1 (SEQ ID NO: 15), L2 (SEQ ID NO: 551), L5 (SEQ ID NO: 19), L6 (SEQ ID NO: 18), and L13 (SEQ ID NO: 558). In some embodiments, the gene editor polypeptide includes at least one flexible linker selected from: L1, L2, L5, L6, and L13. In some embodiments, the gene editor polypeptide includes at least a L1 linker (SEQ ID NO: 15). In some embodiments, the gene editor polypeptide includes at least a L2 linker (SEQ ID NO:
Attorney Ref: 38909-52322/WO Client Ref: 005WO 551). In some embodiments, the gene editor polypeptide includes at least a L5 linker (SEQ ID NO: 19) linker. In some embodiments, the gene editor polypeptide includes at least a L6 linker (SEQ ID NO: 18). In some embodiments, the gene editor polypeptide includes at least a L13 linker (SEQ ID NO: 558). [0155] In some embodiments, the gene editor polypeptide includes at least one semi-flexible linker. Non-limiting examples of semi-flexible linkers include L7 (SEQ ID NO: 552) and L8 (SEQ ID NO: 553). In some embodiments, the gene editor polypeptide includes at least a L7 linker (SEQ ID NO: 552). In some embodiments, the gene editor polypeptide includes at least a L7 linker (SEQ ID NO: 553). [0156] In some embodiments, the gene editor polypeptide includes at least one rigid linker. Non- limiting examples of rigid linkers include L3 (SEQ ID NO: 16), L4 (SEQ ID NO: 17), and L9 (SEQ ID NO: 554). In some embodiments, the gene editor polypeptide includes at least a L3 linker (SEQ ID NO: 16). In some embodiments, the gene editor polypeptide includes at least a L4 linker (SEQ ID NO: 17). In some embodiments, the gene editor polypeptide includes at least a L9 linker (SEQ ID NO: 554). [0157] In some embodiments, the gene editor polypeptide includes at least one flexible-rigid linker. Non-limiting examples of flexible-rigid linkers include L11 (SEQ ID NO: 557) and L12 (SEQ ID NO: 558). In some embodiments, the gene editor polypeptide includes at least a L11 linker (SEQ ID NO: 557). In some embodiments, the gene editor polypeptide includes at least a L12 linker (SEQ ID NO: 558). [0158] In some embodiments, the gene editor polypeptide includes a linker (e.g., a first linker, a second linker, or both) where the length of the linker is selected from short, medium, and long. Non-limiting examples of short linkers include: L1 (SEQ ID NO: 15), L3 (SEQ ID NO: 16), L7 (SEQ ID NO: 552), and L8 (SEQ ID NO: 553). Non-limiting examples of a medium linker include: L2 (SEQ ID NO: 551), L4 (SEQ ID NO: 17), L5 (SEQ ID NO: 19), L6 (SEQ ID NO:18), L10 (SEQ ID NO: 555), L11 (SEQ ID NO: 556), L12 (SEQ ID NO: 557), and L13 (SEQ ID NO: 558). A non-limiting example of a long linker is L9 (SEQ ID NO: 554). [0159] In some embodiments, the gene editor polypeptide includes a linker (e.g., the first linker, the second linker, or both) that is a non-cleavable linker. Non-limiting examples of non-cleavable linkers include: L1 (SEQ ID NO: 15), L2 (SEQ ID NO: 551), L3 (SEQ ID NO:16), L4 (SEQ ID NO: 17), L5 (SEQ ID NO: 19), L6 (SEQ ID NO: 18), L7 (SEQ ID NO: 552), L8 (SEQ ID NO:
Attorney Ref: 38909-52322/WO Client Ref: 005WO 553), L9 (SEQ ID NO: 554), L10 (SEQ ID NO: (555), L11 (SEQ ID NO: 556), L12 (SEQ ID NO: 557), and L13 (SEQ ID NO: 558). [0160] In some embodiments, the gene editor polypeptide include a linker (e.g., the first linker, the second linker, or both) that is a cleavable linker. Cleavable linkers can facilitate the co- expression of multiple proteins within a single open reading frame, effectively separating them post-translationally. Non-limiting examples of self-cleaving peptides include 2A peptides (derived from the foot-and-mouth disease virus), such as F2A, T2A, E2A, and P2A; the “Inteins” that facilitate protein splicing; and the tobacco etch virus (TEV) protease recognition site. In some embodiments, the cleavable linker is 2A peptide. In some embodiments, the cleavable linker is a P2A peptide. [0161] In some embodiments, the gene editor polypeptide includes at least a first linker and a second linker where: the at least first linker is a non-cleavable linker and the second linker is a cleavable linker; the at least first linker is a cleavable linker and the second linker is a non- cleavable linker; the at least first linker is a cleavable linker and the second linker is a cleavable linker; or the at least first linker is a non-cleavable linker and the second linker is a non-cleavable linker. [0162] In some embodiments, the gene editor polypeptide includes the DNA binding nuclease, the RT, and the integration enzyme, where the C-terminus of the DNA binding nuclease is linked to the reverse transcriptase, wherein the C-terminus of the reverse transcriptase is linked to the integration enzyme, and where any one of L7 (SEQ ID NO: 552), L10 (SEQ ID NO: 555), or L12 (SEQ ID NO: 557) links the DNA binding nuclease to the reverse transcriptase. [0163] In some embodiments, the gene editor polypeptide includes the DNA binding nuclease, the RT, and the integration enzyme, where the C-terminus of the DNA binding nuclease is linked to the reverse transcriptase, wherein the C-terminus of the reverse transcriptase is linked to the integration enzyme, wherein any one of L8 (SEQ ID NO: 553), L9 (SEQ ID NO: 554), L10 (SEQ ID NO: 555), L12 (SEQ ID NO: 557), or L13 (SEQ ID NO: 558) links the reverse transcriptase to the integration enzyme. [0164] In some embodiments, the gene editor polypeptide includes the DNA binding nuclease, the RT, and the integration enzyme, where the C-terminus of the integration enzyme is linked to the DNA binding nuclease by L13 (SEQ NO: 558), and where the C-terminus of the DNA binding nuclease is linked to the reverse transcriptase by L13 (SEQ ID NO: 558).
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0165] In some embodiments, the gene editor polypeptide is not comprised of a N-most N-terminal reverse transcriptase. [0166] In some embodiments, the gene editor polypeptide includes the DNA binding nuclease, the RT, and the integration enzyme, where the C-terminus of the DNA binding nuclease is linked to the reverse transcriptase by L13 (SEQ ID NO: 558), and where the C-terminus of the reverse transcriptase is linked to the integration enzyme by a cleavable linker (e.g., a P2A (SEQ ID NO: 13). [0167] In some embodiments, the gene editor polypeptide includes a nuclear localization signal (NLS). In some embodiments, the gene editor polypeptide comprises a mitochondrial localization signal. [0168] In one embodiment, the gene editor polypeptide comprises: from N-terminus to C- terminus: a DNA binding nickase domain; a P2A linker (SEQ ID NO: 13), a reverse transcriptase; a mXTEN linker, and an integration enzyme. [0169] In one embodiment, the gene editor polypeptide comprises: from N-terminus to C- terminus: an integration enzyme, a mXTEN linker (SEQ ID NO: 558), a DNA binding nuclease domain; a mXTEN linker (SEQ ID NO: 558), and a reverse transcriptase. [0170] This disclosure also features systems capable of site-specifically integrating at least a first integration recognition site into the genome of a cell where the system includes any of the gene editor polypeptides described herein. [0171] This disclosure also features systems capable of site-specifically integrating an exogenous nucleic acid sequence into the genome of a cell where the system includes any of the gene editor polypeptides described herein. 6.11. Gene Editor Polynucleotide Compositions [0172] In certain embodiments, a polynucleotide encodes the gene editor polypeptide. In some embodiments, the polynucleotide is an RNA or DNA. In some embodiments, the polynucleotide is a mRNA. [0173] In certain embodiments, an expression vector comprises the polynucleotide encoding the gene editor polypeptide. In some embodiments, the expression vector further comprises any one or more of a one or more of a promoter sequence, atgRNA, nicking gRNA (ngRNA), donor
Attorney Ref: 38909-52322/WO Client Ref: 005WO polynucleotide template, or donor polynucleotide template complement. In some embodiments, two or more expression vectors are delivered. [0174] In typical embodiments, the expression vector is recombinant adenovirus, helper dependent adenovirus, AAV, lentivirus, HSV, annelovirus, retrovirus, Doggybone DNA, minicircle, plasmid, miniDNA, nanoplasmid, exosome, fusosome, or lipid nanoparticle. 6.12. Programmable Gene Insertion [0175] In some embodiments, the polypeptides described herein are used for programmable polynucleotide (e.g., genes or gene fragments) insertion. Programmable polynucleotide/gene insertion includes site-specifically integrating an exogenous nucleic acid sequence (e.g., a donor polynucleotide template) into a mammalian cell genome at a desired target site (e.g., a target DNA sequence). In typical embodiments, the method of site-specifically integrating an exogenous nucleic acid sequence into a target nucleic acid comprises incorporating an integration recognition site into a cell by delivering: (i) a polynucleotide encoding a gene editor polypeptide (e.g., any of the gene editor polypeptides described herein), (ii) at least a first target specific atgRNAs comprising a first integration recognition site, for example, as described in Section 6.7, and (iii) optionally, a nicking guide RNA. In some embodiments, the first atgRNA comprises: (i) a domain that is capable of guiding the prime editor system to a target sequence; and (ii) a reverse transcriptase (RT) template that comprises at least a portion of an at least first integration recognition site, whereby the at least portion of the at least first integration recognition site is integrated into the genome of the cell at the target sequence. [0176] In some embodiments, the method of site-specifically integrating an exogenous nucleic acid sequence into a target nucleic acid, the method comprises incorporating an integration recognition site into a cell by delivering: (i) a polynucleotide encoding a gene editor polypeptide (e.g., any of the gene editor polypeptides described herein), (ii) a first target specific atgRNA and a second target specific atgRNA. In such embodiments, the first atgRNA and the second atgRNA are an at least first pair of atgRNAs, wherein the at least first pair of atgRNAs have domains that are capable of guiding the prime editor system to a target sequence; the first atgRNA further includes a first RT template that comprises at least a portion of an at least first integration recognition site; the second atgRNA further includes a second RT template that comprises at least a portion of the first integration recognition site, and the first atgRNA and the second atgRNAs collectively encode the entirety of the first integration recognition site, whereby the first integration recognition site is integrated into the genome of the cell at the target sequence.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0177] In some embodiments, the method includes integrating the polynucleotide in the cellular genome by delivering a donor polynucleotide template, wherein the donor polynucleotide template is comprised of a second integration recognition site that is a cognate pair of the first integration recognition site, wherein the donor polynucleotide is integrated into the cellular genome at the first integration target recognition site by the gene editor polynucleotide.. [0178] In some embodiments, the method of site-specifically integrating an exogenous nucleic acid sequence into a mammalian cell genome at a desired target site (e.g., a target DNA sequence) comprises incorporating an integration target site into a cell by delivering: an expression vector of comprising a polynucleotide encoding a gene editor polypeptide (e.g., any of the gene editor polypeptides described herein), further comprising a promoter, one or more cell genome target specific atgRNA comprising an integration target recognition site, for example, as described in Section 6.7, a donor polynucleotide comprising an integration target site, and optionally, a ngRNA. In some embodiments, the method of polynucleotide integration into a cell genome comprises incorporating an integration target site into a cell by delivering: an expression vector of comprising a polynucleotide encoding a gene editor polypeptide (e.g., any of the gene editor polypeptides described herein), further comprising a promoter, two cell genome target specific atgRNA where the two atgRNA collectively encode the entirety of the an integration recognition site, and a donor polynucleotide comprising an integration target site. [0179] In some embodiments, the method of site-specifically integrating an exogenous nucleic acid sequence into a genome of a cell includes incorporating an integration recognition site into a cell by delivering: any of the polynucleotides described herein; at least a first genome target specific atgRNA comprising a first integration recognition site, for example, as described in Section 6.7; optionally, a nicking gRNA; and integrating an exogenous nucleic acid sequence into the cellular genome by delivering into the cell: a donor polynucleotide template, wherein the donor polynucleotide template is comprised of the exogenous nucleic acid sequence positioned relative to a second integration recognition site that is a cognate pair of the first integration recognition site such that the exogenous nucleic acid sequence is integrated into the cellular genome at the first integration target recognition site when contacted with the integration enzyme and the first integration recognition site. In some embodiments, the polynucleotides and the at least first atgRNA are delivered concurrently with the donor polynucleotide template. [0180] In some embodiments, the method of site-specifically integrating an exogenous nucleic acid sequence into a target nucleic acid includes: incorporating an integration recognition site into a cell by delivering: any of the expression vectors described herein; and integrating an exogenous
Attorney Ref: 38909-52322/WO Client Ref: 005WO nucleic acid sequence into the cellular genome by integrating a donor polynucleotide template, wherein the donor polynucleotide template is comprised of the exogenous nucleic acid sequence positioned relative to a second integration recognition site that is a cognate pair of the first integration recognition site such that the exogenous nucleic acid sequence is integrated into the cellular genome at the first integration target recognition site when contacted with the integration enzyme and the first integration recognition site. In some embodiments, the expression vector and the donor polynucleotide template are delivered to the cell concurrently. 6.13. Genes and Targets [0181] This disclosure provides compositions, systems and methods for correcting or replacing genes or gene fragments (including introns or exons) or inserting genes in new locations. In certain embodiments, such a method comprises recombination or integration into a safe harbor site (SHS). A frequently used human SHS is the AAVS1 site on chromosome 19q, initially identified as a site for recurrent adeno-associated virus insertion. Another locus comprises the human homolog of the murine Rosa26 locus. Yet another SHS comprises the human H11 locus on chromosome 22. In some cases, a complete gene may be prohibitively large and replacement of an entire gene impractical. In certain embodiments, a method of the disclosure comprises recombining corrective gene fragments into a defective locus. [0182] The methods and compositions can be used to target, without limitation, stem cells for example induced pluripotent stem cells (iPSCs), HSCs, HSPCs, mesenchymal stem cells, or neuronal stem cells and cells at various stages of differentiation. In certain embodiments, methods and compositions of the disclosure are adapted to target organoids, including patient derived organoids. [0183] In certain embodiments, methods and compositions of the disclosure are adapted to treat muscle cells, not limited to cardiomyocytes for Duchene Muscular Dystrophy (DMD). The dystrophin gene is the largest gene in the human genome, spanning ∼2.3 Mb of DNA. DMD is composed of 79 exons resulting in a 14-kb full-length mRNA. Common mutations include mutations that disrupt the reading frame of generate a premature stop codon. An aspect of DMD that lends it to gene editing as a therapeutic approach is the modular structure of the dystrophin protein. Redundancy in the central rod domain permits the deletion of internal segments of the gene that may harbor loss-of-function mutations, thereby restoring the open reading frame (ORFs). In some embodiments, the methods and systems described herein are used to treat DMD by site-
Attorney Ref: 38909-52322/WO Client Ref: 005WO specifically integrating in the genome a polynucleotide template that repairs or replaces all or a portion of the defective DMD gene. [0184] The following are non-limiting diseases that may be treated utilizing the methods and compositions of the present disclosure: Inherited Retinal Diseases: ^ Stargardt Disease (ABCA4) ^ Leber congenital amaurosis 10 (CEP290) ^ X linked Retinitis Pigmentosa (RPGR) ^ Autosomal Dominant Retinitis Pigmentosa (RHO) Liver Diseases: ^ Wilson’s disease (ATP7B) ^ Alpha-1 antitrypsin (SERPINA1) Intellectual Disabilities: ^ Rett Syndrome (MECP2) ^ SYNGAP1-ID (SYNGAP1) ^ CDKL5 deficiency disorder (CDKL5) Peripheral Neuropathies: ^ Charcot-Marie-Tooth 2A (MFN2) Lung Diseases: ^ Cystic Fibrosis (CFTR) ^ Alpha-1 Antitrypsin (SERPINA1) Autoimmune diseases: ^ IgA Nephropathy (Berger’s disease) ^ Anti-Neutrophil Cytoplasmic Antibody (ANCA) Vasculitis ^ Systemic Lupus Erythematosus (SLE) / Lupus Nephritis (LN) ^ Membranous Nephropathy (MN) ^ C3 glomerulonephritis (C3GN) Blood disorders: ^ Sickle Cell ^ Hemophilia
Attorney Ref: 38909-52322/WO Client Ref: 005WO ^ Factor VIII or ^ Factor IX ^ Ornithine transcarbamylase deficiency (OTCD) ^ Homocystinuria (HCU) ^ Phenylketonuria (PKU) Cancer ^ Prostate cancer ^ Renal cell cancer ^ Thyroid cancer [0185] CFTR (cystic fibrosis transmembrane conductance regulator). The most common cystic fibrosis (CF) mutation F508del removes a single amino acid. In some embodiments, recombining human CFTR into an SHS of a cell that expresses CFTR F508del is a corrective treatment path. In some embodiments, the methods and systems described herein are used to CF by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing CF. Proposed validation is detection of persistent CFTR mRNA and protein expression in transduced cells. [0186] Sickle cell disease (SCD) is caused by mutation of a specific amino acid — valine to glutamic acid at amino acid position 6. In some embodiments, SCD is corrected by recombination of the HBB gene into a safe harbor site (SHS) and by demonstrating correction in a proportion of target cells that is high enough to produce a substantial benefit. In some embodiments, the methods and systems described herein are used to sickle cell disease by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the disease. In some embodiments, validation is detection of persistent HBB mRNA and protein expression in transduced cells. [0187] DMD — Duchenne Muscular Dystrophy. The dystrophin gene is the largest gene in the human genome, spanning ∼2.3 Mb of DNA. DMD is composed of 79 exons resulting in a 14- kb full-length mRNA. Common mutations include mutations that disrupt the reading frame of generate a premature stop codon. An aspect of DMD that lends it to gene editing as a therapeutic approach is the modular structure of the dystrophin protein. Redundancy in the central rod domain permits the deletion of internal segments of the gene that may harbor loss-of-function mutations, thereby restoring the open reading frame (ORFs).
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0188] In some embodiments, recombination will be into safe harbor sites (SHS). A frequently used human SHS is the AAVS1 site on chromosome 19q, initially identified as a site for recurrent adeno-associated virus insertion. In some embodiments, the site is the human homolog of the e murine Rosa26 locus (pubmed.ncbi.nlm.nih.gov/18037879). In some embodiments, the site is the human H11 locus on chromosome 22. Proposed target cells for recombination include stem cells for example induced pluripotent stem cells (iPSCs) and cells at various stages of differentiation. In some cases, a complete gene may be prohibitively large and replacement of an entire gene impractical. In such instances, rescuing mutants by recombining in corrected gene fragments with the methods and systems described herein is a corrective option. [0189] In some embodiments, correcting mutations in exon 44 (or 51) by recombining in a corrective coding sequence downstream of exon 43 (or 50), using the methods and systems described herein is a corrective option. Proposed validation is detection of persistent DMD mRNA and protein expression in transduced cells. [0190] F8 (Factor VIII). A large proportion of severe hemophilia A patients harbor one of two types of chromosomal inversions in the FVIII gene. The recombinase technology and methods described herein are well suited to correcting such inversions (and other mutations) by recombining of the FVIII gene into a SHS. [0191] In some embodiments, correcting factor VIII deficiency by recombining the FVIII gene into an SHS is a corrective path. In some embodiments, the methods and systems described herein are used to correct factor VIII deficiency by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the FIX deficiency. Proposed validation is detection of persistent FVIII mRNA and protein expression in transduced cells. [0192] Factor 9 (Factor IX) Hemophilia B, also called factor IX (FIX) deficiency is a genetic disorder caused by missing or defective factor IX, a clotting protein. [0193] In some embodiments, the methods and systems described herein are used to correct factor IX deficiency by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the FIX deficiency. Proposed validation is detection of persistent FiX mRNA and protein expression in transduced cells. [0194] Ornithine transcarbamylase deficiency (OTCD). Ornithine transcarbamylase deficiency is a rare genetic condition that causes ammonia to build up in the blood. The condition
Attorney Ref: 38909-52322/WO Client Ref: 005WO – more commonly called OTC deficiency – is more common in boys than girls and tends to be more severe when symptoms emerge shortly after birth. [0195] In some embodiments, the methods and systems described herein are used to correct OTC deficiency by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the OTC deficiency or integrates a polynucleotide encoding a functional ornithine transcarbamylase enzyme. Proposed validation is detection of persistent OTC mRNA and protein expression in transduced cells. [0196] Phenylketonuria, also called PKU, is a rare inherited disorder that causes an amino acid called phenylalanine to build up in the body. PKU is caused by a change in the phenylalanine hydroxylase (PAH) gene. This gene helps create the enzyme needed to break down phenylalanine. [0197] In some embodiments, the methods and systems described herein are used to correct PKU by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the PKU deficiency or integrates a polynucleotide encoding a functional phenylalanine hydroxylase (PAH) gene. Proposed validation is detection of persistent PAH mRNA and protein expression in transduced cells. [0198] Homocystinuria (HCU). Homocystinuria is elevation of the amino acid, homocysteine (protein building block coming from our diet) in the urine or blood. Common causes of HCU include: problems with the enzyme cystathionine beta synthase (CBS), which converts homocysteine to the amino acid cystathionine (which then becomes cysteine) and needs the vitamin B6 (pyridoxine); and roblems with converting homocysteine to the amino acid methionine. [0199] In some embodiments, the methods and systems described herein are used to correct HCU by site-specifically integrating in the genome a polynucleotide template that corrects the mutation causing the HCU or integrates a polynucleotide encoding a functional copy of a gene (e.g., CBS) able to reduce or prevent buildup of homocysteine in the urine. Proposed validation is detection of persistent CBS mRNA and protein expression in transduced cells. [0200] IgA Nephropathy (Berger’s disease). IgA nephropathy, also known as Berger's disease, is a kidney/autoimmune disease that occurs when an antibody called immunoglobulin A (IgA) builds up in the kidneys.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0201] In some embodiments, the methods and systems described herein are used to treat Berger’s disease by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein. Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of Berger’s disease. [0202] Anti-Neutrophil Cytoplasmic Antibody (ANCA) Vasculitis. ANCA vasculitis is an autoimmune disease affecting small blood vessels in the body. It is caused by autoantibodies called ANCAs, or Anti-Neutrophilic Cytoplasmic Autoantibodies. ANCAs target and attack a certain kind of white blood cells called neutrophils. [0203] In some embodiments, the methods and systems described herein are used to treat ANCA vasculitis by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein. Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of ANCA vasculitis. [0204] Systemic Lupus Erythematosus (SLE) / Lupus Nephritis (LN). Lupus is an autoimmune—a disorder in which the body’s immune system attacks the body’s own cells and organs. [0205] In some embodiments, the methods and systems described herein are used to treat SLE/LN by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein. Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of SLE/LN. [0206] Membranous Nephropathy (MN). MN is a kidney disease that affects the filters (glomeruli) of the kidney and can cause protein in the urine, as well as decreased kidney function and swelling. It can sometimes be called membranous glomerulopathy as well (these terms can be used interchangeably and mean the same thing). [0207] In some embodiments, the methods and systems described herein are used to treat MN by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide
Attorney Ref: 38909-52322/WO Client Ref: 005WO site-specifically integrated in the genome of the cell using the methods described herein. Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of MN. [0208] C3 glomerulonephritis (C3GN). C3 glomerulopathy is a group of related conditions that cause the kidneys to malfunction. The major features of C3 glomerulopathy include high levels of protein in the urine (proteinuria), blood in the urine (hematuria), reduced amounts of urine, low levels of protein in the blood, and swelling in many areas of the body. Affected individuals may have particularly low levels of a protein called complement component 3 (or C3) in the blood. [0209] In some embodiments, the methods and systems described herein are used to treat C3 glomerulopathy by administering to a patient an iPSC-derived Natural Killer cell that includes a polynucleotide site-specifically integrated in the genome of the cell using the methods described herein. Upon administering the iPSC-NK cell to the patient, the iPSC-NK cell is capable of removing native cells (e.g., B cells) that are responsible, at least in part, for the symptoms of C3 glomerulopathy. 6.14. Methods of treatment [0210] In another aspect, methods of treatment are presented. The method comprises administering an effective amount of the pharmaceutical composition comprising the nucleic acid construct or vectorized nucleic acid construct described above to a patient in need thereof. In typical embodiments, the polynucleotide encoding a gene editor polypeptide are delivered by a vector, wherein the vector is selected from the group consisting of recombinant adenovirus, helper dependent adenovirus, AAV, lentivirus, HSV, annelovirus, retrovirus, DoggyboneTM DNA (dbDNA), minicircle, plasmid, miniDNA, exosome, fusosome, lipid nanoparticle, or nanoplasmid. [0211] DNA or RNA viral vectors can be administered directly to patients (in vivo), or they can be used to treat cells in vitro, and the modified cells may optionally be administered to patients (ex vivo). Conventional viral based systems to be used herein could include retroviral, lentivirus, adenoviral, adeno-associated and herpes simplex virus vectors for gene transfer. Integration in the host genome is possible with the retrovirus, lentivirus, and adeno-associated virus gene transfer methods, often resulting in long term expression of the inserted transgene. Additionally, high transduction efficiencies have been observed in many different cell types and target tissues.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0212] Methods of non-viral delivery of the polynucleotide encoding a gene editor polypeptide or the gene editor polypeptide described herein include lipofection, nucleofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA. Lipofection is described in e.g., U.S. Pat. Nos. 5,049,386, 4,946,787; and 4,897,355) and lipofection reagents are sold commercially (e.g., Transfectam™ and Lipofectin™). Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Felgner, WO 91/17424; WO 91/16024. Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration). 6.14.1. Lipid Nanoparticle Delivery [0213] In some embodiments, the gene editor polypeptide or polynucleotide encoding the gene editor polypeptide is packaged in a LNP and administered intravenously. In some embodiments, the gene editor polypeptide or polynucleotide encoding the gene editor polypeptide is packaged in a LNP and administered intrathecally. In some embodiments, the gene editor polypeptide or polynucleotide encoding the gene editor polypeptide is packaged in a LNP and administered by intracerebral ventricular injection. In some embodiments, the gene editor polypeptide or polynucleotide encoding the gene editor polypeptide is packaged in a LNP and administered by intracisternal magna administration. In some embodiments, the gene editor polypeptide or polynucleotide encoding the gene editor polypeptide is packaged in a LNP and administered by intravitreal injection. [0214] The preparation of lipid:nucleic acid complexes, including targeted liposomes such as immunolipid complexes, is well known to one of skill in the art (see, e.g., Crystal, Science 270:404-410 (1995); Blaese et al., Cancer Gene Ther.2:291-297 (1995); Behr et al., Bioconjugate Chem. 5:382-389 (1994); Remy et al., Bioconjugate Chem. 5:647-654 (1994); Gao et al., Gene Therapy 2:710-722 (1995); Ahmad et al., Cancer Res. 52:4817-4820 (1992); U.S. Pat. Nos. 4,186,183, 4,217,344, 4,235,871, 4,261,975, 4,485,054, 4,501,728, 4,774,085, 4,837,028, and 4,946,787). [0215] In another embodiment, LNP doses of about 0.01 to about 1 mg per kg of body weight administered intravenously are contemplated. Medications to reduce the risk of infusion-related reactions are contemplated, such as dexamethasone, acetampinophen, diphenhydramine or cetirizine, and ranitidine are contemplated. Multiple doses of about 0.3 mg per kilogram every 4 weeks for five doses are also contemplated.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0216] The charge of the LNP must be taken into consideration. As cationic lipids combined with negatively charged lipids to induce nonbilayer structures that facilitate intracellular delivery. Because charged LNPs are rapidly cleared from circulation following intravenous injection, ionizable cationic lipids with pKa values below 7 were developed (see, e.g., Rosin et al, Molecular Therapy, vol.19, no.12, pages 1286-2200, December 2011). Negatively charged polymers such as RNA may be loaded into LNPs at low pH values (e.g., pH 4) where the ionizable lipids display a positive charge. However, at physiological pH values, the LNPs exhibit a low surface charge compatible with longer circulation times. Four species of ionizable cationic lipids have been focused upon, namely 1,2-dilineoyl-3-dimethylammonium-propane (DLinDAP), 1,2- dilinoleyloxy-3-N,N-dimethylaminopropane (DLinDMA), 1,2-dilinoleyloxy-keto-N,N-dimethyl- 3-aminopropane (DLinKDMA), and 1,2-dilinoleyl-4-(2-dimethylaminoethyl)-[1,3]-dioxolane (DlinKC2-DMA). It has been shown that LNP siRNA systems containing these lipids exhibit remarkably different gene silencing properties in hepatocytes in vivo, with potencies varying according to the series DlinKC2-DMA>DLinKDMA>DLinDMA>>DLinDAP employing a Factor VII gene silencing model (see, e.g., Rosin et al, Molecular Therapy, vol.19, no.12, pages 1286-2200, December 2011). A dosage of 1 µg/ml of LNP in or associated with the LNP may be contemplated, especially for a formulation containing DlinKC2-DMA. [0217] In some embodiments, the LNP composition comprises one or more one or more ionizable lipids. As used herein, the term “ionizable lipid” has its ordinary meaning in the art and may refer to a lipid comprising one or more charged moieties. In some embodiments, an ionizable lipid may be positively charged or negatively charged. In principle, there are no specific limitations concerning the ionizable lipids of the LNP compositions disclosed herein. In some embodiments, the one or more ionizable lipids are selected from the group consisting of 3-(didodecylamino)- N1,N1,4-tridodecyl-1-piperazineethanamine (KL10), N1-[2-(didodecylamino)ethyl]-N1,N4,N4- tridodecyl-1,4-piperazinediethanami- ne (KL22), 14,25-ditridecyl-15,18,21,24-tetraaza- octatriacontane (KL25), 1,2-dilinoleyloxy-N,N-dimethylaminopropane (Dlin-DMA), 2,2- dilinoleyl-4-dimethylaminomethyl-[1,3]-dioxolane (Dlin-K-DMA), heptatriaconta-6,9,28,31- tetraen-19-yl 4-(dimethylamino)butanoate (Dlin-MC3-DMA), 2,2-dilinoleyl-4-(2- dimethylaminoethyl)-[1,3]-dioxolane (Dlin-KC2-DMA), 1,2-dioleyloxy-N,N- dimethylaminopropane (DODMA), 2-({8-[(3)-cholest-5-en-3-yloxy]octyl}oxy)-N,N-dimethyl-3- [(9Z,12Z)-octad- eca-9,12-dien-1-yloxy]propan-1-amine (Octyl-CLinDMA), (2R)-2-({8-[(3)- cholest-5-en-3-yloxy]octyl}oxy)-N,N-dimethyl-3-[(9Z,12Z)—octadeca-9,12-dien-1- yloxy]propan-1-amine (Octyl-CLinDMA (2R)), and (2S)-2-({8-[(3)-cholest-5-en-3- yloxy]octyl}oxy)-N,N-dimethyl-3-[(9Z,12Z)—octadeca-9,12-dien-1-y loxy]propan-1-amine
Attorney Ref: 38909-52322/WO Client Ref: 005WO (Octyl-CLinDMA (2S)). In one embodiment, the ionizable lipid may be selected from, but not limited to, an ionizable lipid described in International Publication Nos. WO2013086354 and WO2013116126. [0218] In some embodiments, the lipid nanoparticle may include one or more (e.g., 1, 2, 3, 4, 5, 6, 7, or 8) cationic and/or ionizable lipids. Such cationic and/or ionizable lipids include, but are not limited to, 3-(didodecylamino)-N1,N1,4-tridodecyl-1-piperazineethanamine (KL10), N1-[2- (didodecylamino)ethyl]-N1,N4,N4-tridodecyl-1,4-piperazinediethanami- ne (KL22), 14,25- ditridecyl-15,18,21,24-tetraaza-octatriacontane (KL25), 1,2-dilinoleyloxy-N,N- dimethylaminopropane (Dlin-DMA), 2,2-dilinoleyl-4-dimethylaminomethyl-[1,3]-dioxolane (Dlin-K-DMA), heptatriaconta-6,9,28,31-tetraen-19-yl 4-(dimethylamino)butanoate (Dlin-MC3- DMA), 2,2-dilinoleyl-4-(2-dimethylaminoethyl)-[1,3]-dioxolane (Dlin-KC2-DMA), 2-({8- [(3.beta.)-cholest-5-en-3-yloxy]octyl}oxy)-N,N-dimethyl-3-[(9Z,12Z)- -octadeca-9,12-dien-1- yloxy]propan-1-amine (Octyl-CLinDMA), (2R)-2-({8-[(3.beta.)-cholest-5-en-3- yloxy]octyl}oxy)-N,N-dimethyl-3-[(9Z- ,12Z)-octadeca-9,12-dien-1-yl oxy]propan-1-amine (Octyl-CLinDMA (2R)), (2S)-2-({8-[(3βcholest-5-en-3-yloxy]octyl}oxy)-N,N-dimethyl-3-[(9Z- ,12Z)-octadeca-9,12-dien-1-yl oxy]propan-1-amine (Octyl-CLinDMA (2S)).N,N-dioleyl-N,N- dimethylammonium chloride (“DODAC”); N-(2,3-dioleyloxy)propyl-N,N—N- triethylammonium chloride (“DOTMA”); N,N-distearyl-N,N-dimethylammonium bromide (“DDAB”); N-(2,3-dioleoyloxy)propyl)-N,N,N-trimethylammonium chloride (“DOTAP”); 1,2- Dioleyloxy-3-trimethylaminopropane chloride salt (“DOTAP.Cl”); 3-.beta.-(N—(N’,N’- dimethylaminoethane)-carbamoyl)cholesterol (“DC-Chol”), N-(1-(2,3-dioleyloxy)propyl)-N-2- (sperminecarboxamido)ethyl)-N,N-dimethyl- -ammonium trifluoracetate (“DOSPA”), dioctadecylamidoglycyl carboxyspermine (“DOGS”), 1,2-dioleoyl-3-dimethylammonium propane (“DODAP”), N,N-dimethyl-2,3-dioleyloxy)propylamine (“DODMA”), and N-(1,2- dimyristyloxyprop-3-yl)-N,N-dimethyl-N-hydroxyethyl ammonium bromide (“DMRIE”). Additionally, a number of commercial preparations of cationic and/or ionizable lipids can be used, such as, e.g., LIPOFECTIN.RTM. (including DOTMA and DOPE, available from GIBCO/BRL), and LIPOFECTAMINE.RTM. (including DOSPA and DOPE, available from GIBCO/BRL). KL10, KL22, and KL25 are described, for example, in U.S. Pat. No.8,691,750. [0219] In some embodiments, the LNP composition comprises one or more amino lipids. The terms “amino lipid” and “cationic lipid” are used interchangeably herein to include those lipids and salts thereof having one, two, three, or more fatty acid or fatty alkyl chains and a pH-titratable amino head group (e.g., an alkylamino or dialkylamino head group). In principle, there are no
Attorney Ref: 38909-52322/WO Client Ref: 005WO specific limitations concerning the amino lipids of the LNP compositions disclosed herein. The cationic lipid is typically protonated (i.e., positively charged) at a pH below the pKa of the cationic lipid and is substantially neutral at a pH above the pKa. The cationic lipids can also be termed titratable cationic lipids. In some embodiments, the one or more cationic lipids include: a protonatable tertiary amine (e.g., pH-titratable) head group; alkyl chains, wherein each alkyl chain independently has 0 to 3 (e.g., 0, 1, 2, or 3) double bonds; and ether, ester, or ketal linkages between the head group and alkyl chains. Such cationic lipids include, but are not limited to, DSDMA, DODMA, DOTMA, DLinDMA, DLenDMA, .gamma.-DLenDMA, Dlin-K-DMA, Dlin-K-C2- DMA (also known as Dlin-C2K-DMA, XTC2, and C2K), Dlin-K-C3-DMA, Dlin-K-C4-DMA, Dlen-C2K-DMA, y-Dlen-C2-DMA, C12-200, cKK-E12, cKK-A12, cKK-O12, Dlin-MC2-DMA (also known as MC2), and Dlin-MC3-DMA (also known as MC3). [0220] Anionic lipids suitable for use in lipid nanoparticles include, but are not limited to, phosphatidylglycerol, cardiolipin, diacylphosphatidylserine, diacylphosphatidic acid, N- dodecanoyl phosphatidylethanoloamine, N-succinyl phosphatidylethanolamine, N-glutaryl phosphatidylethanolamine, lysylphosphatidylglycerol, and other anionic modifying groups joined to neutral lipids. [0221] Neutral lipids (including both uncharged and zwitterionic lipids) suitable for use in lipid nanoparticles include, but are not limited to, diacylphosphatidylcholine, diacylphosphatidylethanolamine, ceramide, sphingomyelin, dihydrosphingomyelin, cephalin, sterols (e.g., cholesterol) and cerebrosides. In some embodiments, the lipid nanoparticle comprises cholesterol. Lipids having a variety of acyl chain groups of varying chain length and degree of saturation are available or may be isolated or synthesized by well-known techniques. Additionally, lipids having mixtures of saturated and unsaturated fatty acid chains and cyclic regions can be used. In some embodiments, the neutral lipids used in the disclosure are DOPE, DSPC, DPPC, POPC, or any related phosphatidylcholine. In some embodiments, the neutral lipid may be composed of sphingomyelin, dihydrosphingomyeline, or phospholipids with other head groups, such as serine and inositol. [0222] In some embodiments, amphipathic lipids are included in nanoparticles. Exemplary amphipathic lipids suitable for use in nanoparticles include, but are not limited to, sphingolipids, phospholipids, fatty acids, and amino lipids. [0223] The lipid composition of the pharmaceutical composition may comprise one or more phospholipids, for example, one or more saturated or (poly)unsaturated phospholipids or a
Attorney Ref: 38909-52322/WO Client Ref: 005WO combination thereof. In general, phospholipids comprise a phospholipid moiety and one or more fatty acid moieties. [0224] A phospholipid moiety can be selected, for example, from the non-limiting group consisting of phosphatidyl choline, phosphatidyl ethanolamine, phosphatidyl glycerol, phosphatidyl serine, phosphatidic acid, 2-lysophosphatidyl choline, and a sphingomyelin. [0225] A fatty acid moiety can be selected, for example, from the non-limiting group consisting of lauric acid, myristic acid, myristoleic acid, palmitic acid, palmitoleic acid, stearic acid, oleic acid, linoleic acid, alpha-linolenic acid, erucic acid, phytanoic acid, arachidic acid, arachidonic acid, eicosapentaenoic acid, behenic acid, docosapentaenoic acid, and docosahexaenoic acid. [0226] Particular amphipathic lipids can facilitate fusion to a membrane. For example, a cationic phospholipid can interact with one or more negatively charged phospholipids of a membrane (e.g., a cellular or intracellular membrane). Fusion of a phospholipid to a membrane can allow one or more elements (e.g., a therapeutic agent) of a lipid-containing composition (e.g., LNPs) to pass through the membrane permitting, e.g., delivery of the one or more elements to a target tissue. [0227] Non-natural amphipathic lipid species including natural species with modifications and substitutions including branching, oxidation, cyclization, and alkynes are also contemplated. For example, a phospholipid can be functionalized with or cross-linked to one or more alkynes (e.g., an alkenyl group in which one or more double bonds is replaced with a triple bond). Under appropriate reaction conditions, an alkyne group can undergo a copper-catalyzed cycloaddition upon exposure to an azide. Such reactions can be useful in functionalizing a lipid bilayer of a nanoparticle composition to facilitate membrane permeation or cellular recognition or in conjugating a nanoparticle composition to a useful component such as a targeting or imaging moiety (e.g., a dye). [0228] Phospholipids include, but are not limited to, glycerophospholipids such as phosphatidylcholines, phosphatidylethanolamines, phosphatidylserines, phosphatidylinositols, phosphatidy glycerols, and phosphatidic acids. Phospholipids also include phosphosphingolipid, such as sphingomyelin. [0229] In some embodiments, the LNP composition comprises one or more phospholipids. In some embodiments, the phospholipid is selected from the group consisting of 1,2-dilinoleoyl-sn- glycero-3-phosphocholine (DLPC), 1,2-dimyristoyl-sn-glycero-phosphocholine (DMPC), 1,2- dioleoyl-sn-glycero-3-phosphocholine (DOPC), 1,2-dipalmitoyl-sn-glycero-3-phosphocholine
Attorney Ref: 38909-52322/WO Client Ref: 005WO (DPPC), 1,2-distearoyl-sn-glycero-3-phosphocholine (DSPC), 1,2-diundecanoyl-sn-glycero- phosphocholine (DUPC), 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC), 1,2-di-O- octadecenyl-sn-glycero-3-phosphocholine (18:0 Diether PC), 1-oleoyl-2- cholesterylhemisuccinoyl-sn-glycero-3-phosphocholine (OchemsPC), 1-hexadecyl-sn-glycero-3- phosphocholine (C16 Lyso PC), 1,2-dilinolenoyl-sn-glycero-3-phosphocholine, 1,2- diarachidonoyl-sn-glycero-3-phosphocholine, 1,2-didocosahexaenoyl-sn-glycero-3- phosphocholine, 1,2-dioleoyl-sn-glycero-3-phosphoethanolamine (DOPE), 1,2-diphytanoyl-sn- glycero-3-phosphoethanolamine (ME 16:0 PE), 1,2-distearoyl-sn-glycero-3- phosphoethanolamine, 1,2-dilinoleoyl-sn-glycero-3-phosphoethanolamine, 1,2-dilinolenoyl-sn- glycero-3-phosphoethanolamine, 1,2-diarachidonoyl-sn-glycero-3-phosphoethanolamine1,2- didocosahexaenoyl—sn-glycero-3-phosphoethanolamine, 1,2-dioleoyl-sn-glycero-3-phospho- rac-(1-glycerol) sodium salt (DOPG), sphingomyelin, and any mixtures thereof. [0230] Other phosphorus-lacking compounds, such as sphingolipids, glycosphingolipid families, diacylglycerols, and .beta.-acyloxyacids, may also be used. Additionally, such amphipathic lipids can be readily mixed with other lipids, such as triglycerides and sterols. [0231] In some embodiments, the LNP composition comprises one or more helper lipids. The term “helper lipid” as used herein refers to lipids that enhance transfection (e.g., transfection of an LNP comprising an mRNA that encodes a site-directed endonuclease, such as a SpCas9 polypeptide). In principle, there are no specific limitations concerning the helper lipids of the LNP compositions disclosed herein. Without being bound to any particular theory, it is believed that the mechanism by which the helper lipid enhances transfection includes enhancing particle stability. In some embodiments, the helper lipid enhances membrane fusogenicity. Generally, the helper lipid of the LNP compositions disclosure herein can be any helper lipid known in the art. Non-limiting examples of helper lipids suitable for the compositions and methods include steroids, sterols, and alkyl resorcinols. Particularly helper lipids suitable for use in the present disclosure include, but are not limited to, saturated phosphatidylcholine (PC) such as distearoyl-PC (DSPC) and dipalymitoyl-PC (DPPC), dioleoylphosphatidylethanolamine (DOPE), 1,2-dilinoleoyl-sn- glycero-3-phosphocholine (DLPC), cholesterol, 5-heptadecylresorcinol, and cholesterol hemisuccinate. In some embodiments, the helper lipid of the LNP composition includes cholesterol. [0232] In some embodiments, the LNP composition comprises one or more structural lipids. As used herein, the term “structural lipid” refers to sterols and also to lipids containing sterol moieties. Without being bound to any particular theory, it is believed that the incorporation of structural
Attorney Ref: 38909-52322/WO Client Ref: 005WO lipids into the LNPs mitigates aggregation of other lipids in the particle. Structural lipids can be selected from the group including but not limited to, cholesterol, fecosterol, sitosterol, ergosterol, campesterol, stigmasterol, brassicasterol, tomatidine, tomatine, ursolic acid, alpha-tocopherol, hopanoids, phytosterols, steroids, and mixtures thereof. In some embodiments, the structural lipid is a sterol. As defined herein, "sterols" are a subgroup of steroids consisting of steroid alcohols. In certain embodiments, the structural lipid is a steroid. In some embodiments, the structural lipid is cholesterol. In certain embodiments, the structural lipid is an analog of cholesterol. [0233] The lipid component of a lipid nanoparticle composition may include one or more molecules comprising polyethylene glycol, such as PEG or PEG-modified lipids. In some embodiments, the LNP composition disclosed herein comprise one or more polyethylene glycol (PEG) lipid. The term “PEG-lipid” refers to polyethylene glycol (PEG)-modified lipids. Such lipids are also referred to as PEGylated lipids. Non-limiting examples of PEG-lipids include PEG- modified phosphatidylethanolamine and phosphatidic acid, PEG-ceramide conjugates (e.g., PEG- CerC14 or PEG-CerC20), PEG-modified dialkylamines and PEG-modified 1,2-diacyloxypropan- 3-amines For example, a PEG lipid can be PEG-c-DOMG, PEG-DMG, PEG-DLPE, PEG-DMPE, PEG-DPPC, or a PEG-DSPE lipid. In some embodiments, the PEG-lipid includes, but not limited to 1,2-dimyristoyl-sn-glycerol methoxypolyethylene glycol (PEG-DMG), 1,2-distearoyl-sn- glycero-3-phosphoethanolamine-N-[amino(polyethylene glycol)] (PEG-DSPE), PEG-disteryl glycerol (PEG-DSG), PEG-dipalmetoleyl, PEG-dioleyl, PEG-distearyl, PEG-diacylglycamide (PEG-DAG), PEG-dipalmitoyl phosphatidylethanolamine (PEG-DPPE), or PEG-1,2- dimyristyloxlpropyl-3-amine (PEG-c-DMA). In some embodiments, the PEG-lipid is selected from the group consisting of a PEG-modified phosphatidylethanolamine, a PEG-modified phosphatidic acid, a PEG-modified ceramide, a PEG-modified dialkylamine, a PEG-modified diacylglycerol, a PEG-modified dialkylglycerol, and mixtures thereof. In some embodiments, the lipid moiety of the PEG-lipids includes those having lengths of from about C.sub.14 to about C.sub.22, preferably from about C.sub.14 to about C.sub.16. In some embodiments, a PEG moiety, for example a mPEG-NH.sub.2, has a size of about 1000, 2000, 5000, 10,000, 15,000 or 20,000 daltons. In some embodiment, the PEG-lipid is PEG2k-DMG. In some embodiments, the one or more PEG lipids of the LNP composition comprises PEG-DMPE. In some embodiments, the one or more PEG lipids of the LNP composition comprises PEG-DMG. [0234] In some embodiments, the ratio between the lipid components and the nucleic acid molecules of the LNP composition, e.g., the weight ratio, is sufficient for (i) formation of LNPs with desired characteristics, e.g., size, charge, and (ii) delivery of a sufficient dose of nucleic acid
Attorney Ref: 38909-52322/WO Client Ref: 005WO at a dose of the lipid component(s) that is tolerable for in vivo administration as readily ascertained by one of skill in the art. [0235] In certain embodiments, it is desirable to target a nanoparticle, e.g., a lipid nanoparticle, using a targeting moiety that is specific to a cell type and/or tissue type. In some embodiments, a nanoparticle may be targeted to a particular cell, tissue, and/or organ using a targeting moiety. In particular embodiments, a nanoparticle comprises a targeting moiety. Exemplary non-limiting targeting moieties include ligands, cell surface receptors, glycoproteins, vitamins (e.g., riboflavin) and antibodies (e.g., full-length antibodies, antibody fragments (e.g., Fv fragments, single chain Fv (scFv) fragments, Fab’ fragments, or F(ab’)2 fragments), single domain antibodies, camelid antibodies and fragments thereof, human antibodies and fragments thereof, monoclonal antibodies, and multispecific antibodies (e.g., bispecific antibodies)). In some embodiments, the targeting moiety may be a polypeptide. The targeting moiety may include the entire polypeptide (e.g., peptide or protein) or fragments thereof. A targeting moiety is typically positioned on the outer surface of the nanoparticle in such a manner that the targeting moiety is available for interaction with the target, for example, a cell surface receptor. A variety of different targeting moieties and methods are known and available in the art, including those described, e.g., in Sapra et al., Prog. Lipid Res.42(5):439-62, 2003 and Abra et al., J. Liposome Res.12:1-3, 2002. [0236] In some embodiments, a lipid nanoparticle (e.g., a liposome) may include a surface coating of hydrophilic polymer chains, such as polyethylene glycol (PEG) chains (see, e.g., Allen et al., Biochimica et Biophysica Acta 1237: 99-108, 1995; DeFrees et al., Journal of the American Chemistry Society 118: 6101-6104, 1996; Blume et al., Biochimica et Biophysica Acta 1149: 180- 184,1993; Klibanov et al., Journal of Liposome Research 2: 321-334, 1992; U.S. Pat. No. 5,013,556; Zalipsky, Bioconjugate Chemistry 4: 296-299, 1993; Zalipsky, FEBS Letters 353: 71- 74, 1994; Zalipsky, in Stealth Liposomes Chapter 9 (Lasic and Martin, Eds) CRC Press, Boca Raton Fla., 1995). In one approach, a targeting moiety for targeting the lipid nanoparticle is linked to the polar head group of lipids forming the nanoparticle. In another approach, the targeting moiety is attached to the distal ends of the PEG chains forming the hydrophilic polymer coating (see, e.g., Klibanov et al., Journal of Liposome Research 2: 321-334, 1992; Kirpotin et al., FEBS Letters 388: 115-118, 1996). [0237] Standard methods for coupling the targeting moiety or moieties may be used. For example, phosphatidylethanolamine, which can be activated for attachment of targeting moieties, or derivatized lipophilic compounds, such as lipid-derivatized bleomycin, can be used. Antibody- targeted liposomes can be constructed using, for instance, liposomes that incorporate protein A
Attorney Ref: 38909-52322/WO Client Ref: 005WO (see, e.g., Renneisen et al., J. Bio. Chem., 265:16337-16342, 1990 and Leonetti et al., Proc. Natl. Acad. Sci. (USA), 87:2448-2451, 1990). Other examples of antibody conjugation are disclosed in U.S. Pat. No.6,027,726. Examples of targeting moieties can also include other polypeptides that are specific to cellular components, including antigens associated with neoplasms or tumors. Polypeptides used as targeting moieties can be attached to the liposomes via covalent bonds (see, for example Heath, Covalent Attachment of Proteins to Liposomes, 149 Methods in Enzymology 111-119 (Academic Press, Inc.1987)). Other targeting methods include the biotin-avidin system. [0238] In some embodiments, a lipid nanoparticle includes a targeting moiety that targets the lipid nanoparticle to a cell including, but not limited to, hepatocytes, colon cells, epithelial cells, hematopoietic cells, epithelial cells, endothelial cells, lung cells, bone cells, stem cells, mesenchymal cells, neural cells, cardiac cells, adipocytes, vascular smooth muscle cells, cardiomyocytes, skeletal muscle cells, beta cells, pituitary cells, synovial lining cells, ovarian cells, testicular cells, fibroblasts, B cells, T cells, reticulocytes, leukocytes, granulocytes, and tumor cells (including primary tumor cells and metastatic tumor cells). In particular embodiments, the targeting moiety targets the lipid nanoparticle to a hepatocyte. [0239] The lipid nanoparticles described herein may be lipidoid-based. The synthesis of lipidoids has been extensively described and formulations containing these compounds are particularly suited for delivery of polynucleotides (see Mahon et al., Bioconjug Chem. 201021:1448-1454; Schroeder et al., J Intern Med. 2010267:9-21; Akinc et al., Nat. Biotechnol. 200826:561-569; Love et al., Proc Natl Acad Sci USA. 2010107:1864-1869; Siegwart et al., Proc Natl Acad Sci USA.2011108:12996-3001). [0240] The characteristics of optimized lipidoid formulations for intramuscular or subcutaneous routes may vary significantly depending on the target cell type and the ability of formulations to diffuse through the extracellular matrix into the blood stream. While a particle size of less than 150 nm may be desired for effective hepatocyte delivery due to the size of the endothelial fenestrae (see e.g., Akinc et al., Mol Ther.200917:872-879), use of lipidoid oligonucleotides to deliver the formulation to other cells types including, but not limited to, endothelial cells, myeloid cells, and muscle cells may not be similarly size-limited. [0241] In one aspect, effective delivery to myeloid cells, such as monocytes, lipidoid formulations may have a similar component molar ratio. Different ratios of lipidoids and other components including, but not limited to, a neutral lipid (e.g., diacylphosphatidylcholine), cholesterol, a PEGylated lipid (e.g., PEG-DMPE), and a fatty acid (e.g., an omega-3 fatty acid) may be used to
Attorney Ref: 38909-52322/WO Client Ref: 005WO optimize the formulation of the mRNA or system for delivery to different cell types including, but not limited to, hepatocytes, myeloid cells, muscle cells, etc. Exemplary lipidoids include, but are not limited to, Dlin-DMA, Dlin-K-DMA, Dlin-KC2-DMA, 98N12-5, C12-200 (including variants and derivatives), Dlin-MC3-DMA and analogs thereof. The use of lipidoid formulations for the localized delivery of nucleic acids to cells (such as, but not limited to, adipose cells and muscle cells) via either subcutaneous or intramuscular delivery, may also not require all of the formulation components which may be required for systemic delivery, and as such may comprise the lipidoid and the mRNA or system. [0242] According to the present disclosure, a system described herein may be formulated by mixing the mRNA or system, or individual components of the system, with the lipidoid at a set ratio prior to addition to cells. In vivo formulations may require the addition of extra ingredients to facilitate circulation throughout the body. After formation of the particle, a system or individual components of a system is added and allowed to integrate with the complex. The encapsulation efficiency is determined using a standard dye exclusion assays. [0243] In vivo delivery of systems may be affected by many parameters, including, but not limited to, the formulation composition, nature of particle PEGylation, degree of loading, oligonucleotide to lipid ratio, and biophysical parameters such as particle size (Akinc et al., Mol Ther. 2009 17:872-879; herein incorporated by reference in its entirety). As an example, small changes in the anchor chain length of poly(ethylene glycol) (PEG) lipids may result in significant effects on in vivo efficacy. Formulations with the different lipidoids, including, but not limited to penta[3-(1- laurylaminopropionyl)]-triethylenetetramine hydrochloride (TETA-5LAP; aka 98N12-5, see Murugaiah et al., Analytical Biochemistry, 401:61 (2010)), C12-200 (including derivatives and variants), MD1, Dlin-DMA, Dlin-K-DMA, Dlin-KC2-DMA and Dlin-MC3-DMA can be tested for in vivo activity. The lipidoid referred to herein as “98N12-5” is disclosed by Akinc et al., Mol Ther.200917:872-879). The lipidoid referred to herein as “C12-200” is disclosed by Love et al., Proc Natl Acad Sci USA.2010107:1864-1869 and Liu and Huang, Molecular Therapy.2010669- 670. [0244] LNPs in which a nucleic acid is entrapped within the lipid portion of the particle and is protected from degradation, can be formed by any method known in the art including, but not limited to, a continuous mixing method, a direct dilution process, and an in-line dilution process. Additional techniques and methods suitable for the preparation of the LNPs described herein include coacervation, microemulsions, supercritical fluid technologies, phase-inversion temperature (PIT) techniques.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0245] In some embodiments, the LNPs used herein are produced via a continuous mixing method, e.g., a process that includes providing an aqueous solution a nucleic acid described herein in a first reservoir, providing an organic lipid solution in a second reservoir (wherein the lipids present in the organic lipid solution are solubilized in an organic solvent, e.g., a lower alkanol such as ethanol), and mixing the aqueous solution with the organic lipid solution such that the organic lipid solution mixes with the aqueous solution so as to substantially instantaneously produce a lipid vesicle (e.g., liposome) encapsulating the nucleic acid molecule within the lipid vesicle. This process and the apparatus for carrying out this process are known in the art. More information in this regard can be found in, for example, U.S. Patent Publication No.20040142025. The action of continuously introducing lipid and buffer solutions into a mixing environment, such as in a mixing chamber, causes a continuous dilution of the lipid solution with the buffer solution, thereby producing a lipid vesicle substantially instantaneously upon mixing. By mixing the aqueous solution comprising a nucleic acid molecule with the organic lipid solution, the organic lipid solution undergoes a continuous stepwise dilution in the presence of the buffer solution (e.g., aqueous solution) to produce a nucleic acid-lipid particle. [0246] In some embodiments, the LNPs used herein are produced via a direct dilution process that includes forming a lipid vesicle (e.g., liposome) solution and immediately and directly introducing the lipid vesicle solution into a collection vessel containing a controlled amount of dilution buffer. In some embodiments, the collection vessel includes one or more elements configured to stir the contents of the collection vessel to facilitate dilution. In some embodiments, the amount of dilution buffer present in the collection vessel is substantially equal to the volume of lipid vesicle solution introduced thereto. [0247] In some embodiments, the LNPs are produced via an in-line dilution process in which a third reservoir containing dilution buffer is fluidly coupled to a second mixing region. In these embodiments, the lipid vesicle (e.g., liposome) solution formed in a first mixing region is immediately and directly mixed with dilution buffer in the second mixing region. These processes and the apparatuses for carrying out direct dilution and in-line dilution processes are known in the art. More information in this regard can be found in, for example, U.S. Patent Publication No. 20070042031. 6.14.2. Viral Vector Delivery [0248] In certain aspects this disclosure includes involves vectors, e.g. for delivering or introducing in a cell, but also for propagating these components (e.g. in prokaryotic cells). A used
Attorney Ref: 38909-52322/WO Client Ref: 005WO herein, a “vector” is a tool that allows or facilitates the transfer of an entity from one environment to another. It is a replicon, such as a plasmid, phage, or cosmid, into which another DNA segment may be inserted so as to bring about the replication of the inserted segment. Generally, a vector is capable of replication when associated with the proper control elements. In general, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. Vectors include, but are not limited to, nucleic acid molecules that are single- stranded, double-stranded, or partially double-stranded; nucleic acid molecules that comprise one or more free ends, no free ends (e.g. circular); nucleic acid molecules that comprise DNA, RNA, or both; and other varieties of polynucleotides known in the art. One type of vector is a “plasmid,” which refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques. Another type of vector is a viral vector, wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g. retroviruses, replication defective retroviruses, adenoviruses, replication defective adenoviruses, and adeno-associated viruses (AAVs)). Viral vectors also include polynucleotides carried by a virus for transfection into a host cell. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g. bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as “expression vectors.” Vectors for and that result in expression in a eukaryotic cell can be referred to herein as “eukaryotic expression vectors.” Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. [0249] Recombinant expression vectors can comprise a nucleic acid of the disclosure in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, “operably linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g. in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell). With regards to recombination and cloning methods, mention is made of U.S. patent application Ser. No.10/815,730, published Sep.2, 2004 as US 2004-0171156 A1, the contents of which are herein incorporated by reference in their entirety.
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0250] Vector delivery, e.g., plasmid, viral delivery: The CRISPR enzyme, for instance a Type V protein such as C2c1 or C2c3, and/or any of the present RNAs, for instance a guide RNA, can be delivered using any suitable vector, e.g., plasmid or viral vectors, such as adeno associated virus (AAV), lentivirus, adenovirus or other viral vector types, or combinations thereof. Effector proteins and one or more guide RNAs can be packaged into one or more vectors, e.g., plasmid or viral vectors. In some embodiments, the vector, e.g., plasmid or viral vector is delivered to the tissue of interest by, for example, an intramuscular injection, while other times the delivery is via intravenous, transdermal, intranasal, oral, mucosal, or other delivery methods. Such delivery may be either via a single dose, or multiple doses. One skilled in the art understands that the actual dosage to be delivered herein may vary greatly depending upon a variety of factors, such as the vector choice, the target cell, organism, or tissue, the general condition of the subject to be treated, the degree of transformation/modification sought, the administration route, the administration mode, the type of transformation/modification sought, etc. [0251] Such a dosage may further contain, for example, a carrier (water, saline, ethanol, glycerol, lactose, sucrose, calcium phosphate, gelatin, dextran, agar, pectin, peanut oil, sesame oil, etc.), a diluent, a pharmaceutically-acceptable carrier (e.g., phosphate-buffered saline), a pharmaceutically-acceptable excipient, and/or other compounds known in the art. The dosage may further contain one or more pharmaceutically acceptable salts such as, for example, a mineral acid salt such as a hydrochloride, a hydrobromide, a phosphate, a sulfate, etc.; and the salts of organic acids such as acetates, propionates, malonates, benzoates, etc. Additionally, auxiliary substances, such as wetting or emulsifying agents, pH buffering substances, gels or gelling materials, flavorings, colorants, microspheres, polymers, suspension agents, etc. may also be present herein. In addition, one or more other conventional pharmaceutical ingredients, such as preservatives, humectants, suspending agents, surfactants, antioxidants, anticaking agents, fillers, chelating agents, coating agents, chemical stabilizers, etc. may also be present, especially if the dosage form is a reconstitutable form. Suitable exemplary ingredients include microcrystalline cellulose, carboxymethylcellulose sodium, polysorbate 80, phenylethyl alcohol, chlorobutanol, potassium sorbate, sorbic acid, sulfur dioxide, propyl gallate, the parabens, ethyl vanillin, glycerin, phenol, parachlorophenol, gelatin, albumin and a combination thereof. A thorough discussion of pharmaceutically acceptable excipients is available in REMINGTON’S PHARMACEUTICAL SCIENCES (Mack Pub. Co., N.J.1991) which is incorporated by reference herein. [0252] In an embodiment herein the delivery is via an adenovirus, which may be at a single booster dose containing at least 1 x 105 particles (also referred to as particle units, pu) of adenoviral vector.
Attorney Ref: 38909-52322/WO Client Ref: 005WO In some embodiments, the dose preferably is at least about 1 x 106 particles (for example, about 1 x 106-1 x 1011 particles), more preferably at least about 1 x 107 particles, more preferably at least about 1 x 108 particles (e.g., about 1 x 108-1 x 1011 particles or about 1 x 109-1 x 1012 particles), and most preferably at least about 1 x 1010 particles (e.g., about 1 x 109-1 x 1010 particles or about 1 x 109-1 x 1012 particles), or even at least about 1 x 1010 particles (e.g., about 1 x 1010-1 x 1012 particles) of the adenoviral vector. Alternatively, the dose comprises no more than about 1 x 1014 particles, preferably no more than about 1 x 1013 particles, even more preferably no more than about 1 x 1012 particles, even more preferably no more than about 1 x 1011 particles, and most preferably no more than about 1 x 1010 particles (e.g., no more than about 1 x 109 particles). Thus, the dose may contain a single dose of adenoviral vector with, for example, about 1 x 106 particle units (pu), about 2 x 106 pu, about 4 x 106 pu, about 1 x 107 pu, about 2 x 107 pu, about 4 x 107 pu, about 1 x 108 pu, about 2 x 108 pu, about 4 x 108 pu, about 1 x 109 pu, about 2 x 109 pu, about 4 x 109 pu, about 1 x 1010 pu, about 2 x 1010 pu, about 4 x 1010 pu, about 1 x 1011 pu, about 2 x 1011 pu, about 4 x 1011 pu, about 1 x 1012 pu, about 2 x 1012 pu, or about 4 x 1012 pu of adenoviral vector. See, for example, the adenoviral vectors in U.S. Pat. No. 8,454,972 B2 to Nabel, et. al., granted on Jun.4, 2013; incorporated by reference herein, and the dosages at col 29, lines 36-58 thereof. In some embodiments, the adenovirus is delivered via multiple doses. [0253] In some embodiments, the delivery is via an AAV. A therapeutically effective dosage for in vivo delivery of the AAV to a human is believed to be in the range of from about 20 to about 50 ml of saline solution containing from about 1 x 1010 to about 1 x 1010 functional AAV/ml solution. The dosage may be adjusted to balance the therapeutic benefit against any side effects. In some embodiments, the AAV dose is generally in the range of concentrations of from about 1 x 105 to 1 x 1050 genomes AAV, from about 1 x 108 to 1 x 1020 genomes AAV, from about 1 x 1010 to about 1 x 1016 genomes, or about 1 x 1011 to about 1 x 1016 genomes AAV. A human dosage may be about 1 x 1013 genomes AAV. Such concentrations may be delivered in from about 0.001 ml to about 100 ml, about 0.05 to about 50 ml, or about 10 to about 25 ml of a carrier solution. Other effective dosages can be readily established by one of ordinary skill in the art through routine trials establishing dose response curves. See, for example, U.S. Pat. No.8,404,658 B2 to Hajjar, et al., granted on Mar.26, 2013, at col.27, lines 45-60. [0254] The promoter used to drive nucleic acid-targeting effector protein coding nucleic acid molecule expression can include: AAV ITR can serve as a promoter: this is advantageous for eliminating the need for an additional promoter element (which can take up space in the vector). The additional space freed up can be used to drive the expression of additional elements (gRNA,
Attorney Ref: 38909-52322/WO Client Ref: 005WO etc.). Also, ITR activity is relatively weaker, so can be used to reduce potential toxicity due to over expression of nucleic acid-targeting effector protein. For ubiquitous expression, can use promoters: CMV, CAG, CBh, PGK, SV40, Ferritin heavy or light chains, etc. For brain or other CNS expression, can use promoters: SynapsinI for all neurons, CaMKIIalpha for excitatory neurons, GAD67 or GAD65 or VGAT for GABAergic neurons, etc. For liver expression, can use Albumin promoter. For lung expression, can use SP-B. For endothelial cells, can use ICAM. For hematopoietic cells can use IFNbeta or CD45. For Osteoblasts can use OG-2. [0255] The promoter used to drive guide RNA can include: Pol III promoters such as U6 or H1 Use of Pol II promoter and intronic cassettes to express guide RNA Adeno Associated Virus (AAV) [0256] Nucleic acid-targeting effector protein and one or more guide RNA can be delivered using adeno associated virus (AAV), lentivirus, adenovirus or other plasmid or viral vector types, in particular, using formulations and doses from, for example, U.S. Pat. No.8,454,972 (formulations, doses for adenovirus), U.S. Pat. No.8,404,658 (formulations, doses for AAV) and U.S. Pat. No. 5,846,946 (formulations, doses for DNA plasmids) and from clinical trials and publications regarding the clinical trials involving lentivirus, AAV and adenovirus. For examples, for AAV, the route of administration, formulation and dose can be as in U.S. Pat. No. 8,454,972 and as in clinical trials involving AAV. For Adenovirus, the route of administration, formulation and dose can be as in U.S. Pat. No. 8,404,658 and as in clinical trials involving adenovirus. For plasmid delivery, the route of administration, formulation and dose can be as in U.S. Pat. No. 5,846,946 and as in clinical studies involving plasmids. Doses may be based on or extrapolated to an average 70 kg individual (e.g., a male adult human), and can be adjusted for patients, subjects, mammals of different weight and species. Frequency of administration is within the ambit of the medical or veterinary practitioner (e.g., physician, veterinarian), depending on usual factors including the age, sex, general health, other conditions of the patient or subject and the particular condition or symptoms being addressed. The viral vectors can be injected into the tissue of interest. For cell- type specific genome modification, the expression of nucleic acid-targeting effector can be driven by a cell-type specific promoter. For example, liver-specific expression might use the Albumin promoter and neuron-specific expression (e.g., for targeting CNS disorders) might use the Synapsin I promoter. [0257] In terms of in vivo delivery, AAV is advantageous over other viral vectors for a couple of reasons: Low toxicity (this may be due to the purification method not requiring ultra centrifugation
Attorney Ref: 38909-52322/WO Client Ref: 005WO of cell particles that can activate the immune response) and Low probability of causing insertional mutagenesis because it doesn’t integrate into the host genome. [0258] AAV has a packaging limit of 4.5 or 4.75 Kb. This means that nucleic acid-targeting effector protein (such as a Type V protein such as C2c1 or C2c3) as well as a promoter and transcription terminator have to be all fit into the same viral vector. Therefore embodiments of the disclosure include utilizing homologs of nucleic acid-targeting effector protein (such as a Type V protein such as C2c1 or C2c3) that are shorter. [0259] As to AAV, the AAV can be AAV1, AAV2, AAV5 or any combination thereof. One can select the AAV of the AAV with regard to the cells to be targeted; e.g., one can select AAV serotypes 1, 2, 5 or a hybrid capsid AAV1, AAV2, AAV5 or any combination thereof for targeting brain or neuronal cells; and one can select AAV4 for targeting cardiac tissue. AAV8 is useful for delivery to the liver. The herein promoters and vectors are preferred individually. [0260] Packaging cells are typically used to form virus particles that are capable of infecting a host cell. Such cells include 293 cells, which package adenovirus, and psi2 cells or PA317 cells, which package retrovirus. Viral vectors used in gene therapy are usually generated by producing a cell line that packages a nucleic acid vector into a viral particle. The vectors typically contain the minimal viral sequences required for packaging and subsequent integration into a host, other viral sequences being replaced by an expression cassette for the polynucleotide(s) to be expressed. The missing viral functions are typically supplied in trans by the packaging cell line. For example, AAV vectors used in gene therapy typically only possess ITR sequences from the AAV genome which are required for packaging and integration into the host genome. Viral DNA is packaged in a cell line, which contains a helper plasmid encoding the other AAV genes, namely rep and cap, but lacking ITR sequences. The cell line may also be infected with adenovirus as a helper. The helper virus promotes replication of the AAV vector and expression of AAV genes from the helper plasmid. The helper plasmid is not packaged in significant amounts due to a lack of ITR sequences. Contamination with adenovirus can be reduced by, e.g., heat treatment to which adenovirus is more sensitive than AAV. Additional methods for the delivery of nucleic acids to cells are known to those skilled in the art. See, for example, US20030087817, incorporated herein by reference. [0261] Millington-Ward et al. (Molecular Therapy, vol.19 no.4, 642-649 April 2011) describes adeno-associated virus (AAV) vectors to deliver an RNA interference (RNAi)-based rhodopsin suppressor and a codon-modified rhodopsin replacement gene resistant to suppression due to nucleotide alterations at degenerate positions over the RNAi target site. An injection of either 6.0 x 108 vp or 1.8 x 1010 vp AAV were subretinally injected into the eyes by Millington-Ward et al.
Attorney Ref: 38909-52322/WO Client Ref: 005WO The AAV vectors of Millington-Ward et al. may be applied to the system of the present disclosure, contemplating a dose of about 2 x 1011 to about 6 x 1011 vp administered to a human. [0262] Dalkara et al. (Sci Transl Med 5, 189ra76 (2013)) also relates to in vivo directed evolution to fashion an AAV vector that delivers wild-type versions of defective genes throughout the retina after noninjurious injection into the eyes’ vitreous humor. Dalkara describes a 7 mer peptide display library and an AAV library constructed by DNA shuffling of cap genes from AAV1, 2, 4, 5, 6, 8, and 9. The rcAAV libraries and rAAV vectors expressing GFP under a CAG or Rho promoter were packaged and deoxyribonuclease-resistant genomic titers were obtained through quantitative PCR. The libraries were pooled, and two rounds of evolution were performed, each consisting of initial library diversification followed by three in vivo selection steps. In each such step, P30 rho-GFP mice were intravitreally injected with 2 ml of iodixanol-purified, phosphate- buffered saline (PBS)-dialyzed library with a genomic titer of about 1.times.10.sup.12 vg/ml. The AAV vectors of Dalkara et al. may be applied to the nucleic acid-targeting system of the present disclosure, contemplating a dose of about 1 x 1015 to about 1 x 1016 vg/ml administered to a human. [0263] The tropism of a retrovirus can be altered by incorporating foreign envelope proteins, expanding the potential target population of target cells. Lentiviral vectors are retroviral vectors that are able to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue. Retroviral vectors are comprised of cis-acting long terminal repeats with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis-acting LTRs are sufficient for replication and packaging of the vectors, which are then used to integrate the therapeutic gene into the target cell to provide permanent transgene expression. Widely used retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), Simian Immuno deficiency virus (SW), human immune deficiency virus (HIV), and combinations thereof (see, e.g., Buchscher et al., J. Virol. 66 :2731-2739 (1992) ; Johann et al., J. Virol.66 :1635-1640 (1992) ; Sommnerfelt et al., Virol.176 :58-59 (1990) ; Wilson et al., J. Virol.63:2374-2378 (1989); Miller et al., J. Virol. 65:2220-2224 (1991); PCT/US94/05700). In applications where transient expression is preferred, adenoviral based systems may be used. Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system. Adeno-associated virus (“AAV”) vectors may also be used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (see, e.g., West
Attorney Ref: 38909-52322/WO Client Ref: 005WO et al., Virology 160:38-47 (1987); U.S. Pat. No. 4,797,368; WO 93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); Muzyczka, J. Clin. Invest. 94:1351 (1994). Construction of recombinant AAV vectors are described in a number of publications, including U.S. Pat. No. 5,173,414; Tratschin et al., Mol. Cell. Biol.5:3251-3260 (1985); Tratschin, et al., Mol. Cell. Biol. 4:2072-2081 (1984); Hermonat & Muzyczka, PNAS 81:6466-6470 (1984); and Samulski et al., J. Virol.63:03822-3828 (1989). [0264] Packaging cells are typically used to form virus particles that are capable of infecting a host cell. Such cells include 293 cells, which package adenovirus, and yr2 cells or PA317 cells, which package retrovirus. Viral vectors used in gene therapy are usually generated by producing a cell line that packages a nucleic acid vector into a viral particle. The vectors typically contain the minimal viral sequences required for packaging and subsequent integration into a host, other viral sequences being replaced by an expression cassette for the polynucleotide(s) to be expressed. The missing viral functions are typically supplied in trans by the packaging cell line. For example, AAV vectors used in gene therapy typically only possess ITR sequences from the AAV genome which are required for packaging and integration into the host genome. Viral DNA is packaged in a cell line, which contains a helper plasmid encoding the other AAV genes, namely rep and cap, but lacking ITR sequences. The cell line may also be infected with adenovirus as a helper. The helper virus promotes replication of the AAV vector and expression of AAV genes from the helper plasmid. The helper plasmid is not packaged in significant amounts due to a lack of ITR sequences. Contamination with adenovirus can be reduced by, e.g., heat treatment to which adenovirus is more sensitive than AAV. Additional methods for the delivery of nucleic acids to cells are known to those skilled in the art. See, for example, US20030087817, incorporated herein by reference. [0265] In some embodiments, a host cell is transiently or non-transiently transfected with one or more vectors described herein. In some embodiments, a cell is transfected as it naturally occurs in a subject. In some embodiments, a cell that is transfected is taken from a subject. Cells taken from a subject include, but are not limited to, hepatocytes or cells isolated from muscle, the CNS, eye or lung. Immunological cells are also contemplated, such as but not limited to T cells, HSCs, B-cells and NK cells. [0266] Another useful method to deliver proteins, enzymes, and guides comprises transfection of messenger RNA (mRNA). Examples of mRNA delivery methods and compositions that may be utilized in the present disclosure including, for example, PCT/US2014/028330, US8822663B2, NZ700688A, ES2740248T3, EP2755693A4, EP2755986A4, WO2014152940A1, EP3450553B1, BR112016030852A2, and EP3362461A1. Expression of CRISPR systems in particular is
Attorney Ref: 38909-52322/WO Client Ref: 005WO described by WO2020014577. Each of these publications are incorporated herein by reference in their entireties. Additional disclosure hereby incorporated by reference can be found in Kowalski et al., “Delivering the Messenger: Advances in Technologies for Therapeutic mRNA Delivery,” Mol Therap., 2019; 27(4): 710-728. [0267] In some embodiments, the cell is derived from cells taken from a subject, such as a cell line. A wide variety of cell lines for tissue culture are known in the art. Examples of cell lines include, but are not limited to, C8161, CCRF-CEM, MOLT, mIMCD-3, NHDF, HeLa-S3, Huh1, Huh4, Huh7, HUVEC, HASMC, HEKn, HEKa, MiaPaCell, Panc1, PC-3, TF1, CTLL-2, CIR, Rat6, CV1, RPTE, A10, T24, J82, A375, ARH-77, Calu1, SW480, SW620, SKOV3, SK-UT, CaCo2, P388D1, SEM-K2, WEHI-231, HB56, TIB55, Jurkat, J45.01, LRMB, Bcl-1, BC-3, IC21, DLD2, Raw264.7, NRK, NRK-52E, MRC5, MEF, Hep G2, HeLa B, HeLa T4, COS, COS-1, COS-6, COS-M6A, BS-C-1 monkey kidney epithelial, BALB/3T3 mouse embryo fibroblast, 3T3 Swiss, 3T3-L1, 132-d5 human fetal fibroblasts; 10.1 mouse fibroblasts, 293-T, 3T3, 721, 9L, A2780, A2780ADR, A2780cis, A172, A20, A253, A431, A-549, ALC, B16, B35, BCP-1 cells, BEAS-2B, bEnd.3, BHK-21, BR 293, BxPC3, C3H-10T1/2, C6/36, Cal-27, CHO, CHO-7, CHO- IR, CHO-K1, CHO-K2, CHO-T, CHO Dhfr-/-, COR-L23, COR-L23/CPR, COR-L23/5010, COR- L23/R23, COS-7, COV-434, CML T1, CMT, CT26, D17, DH82, DU145, DuCaP, EL4, EM2, EM3, EMT6/AR1, EMT6/AR10.0, FM3, H1299, H69, HB54, HB55, HCA2, HEK-293, HeLa, Hepa1c1c7, HL-60, HMEC, HT-29, Jurkat, JY cells, K562 cells, Ku812, KCL22, KG1, KYO1, LNCap, Ma-Mel 1-48, MC-38, MCF-7, MCF-10A, MDA-MB-231, MDA-MB-468, MDA-MB- 435, MDCK II, MDCK II, MOR/0.2R, MONO-MAC 6, MTD-1A, MyEnd, NCI-H69/CPR, NCI- H69/LX10, NCI-H69/LX20, NCI-H69/LX4, NIH-3T3, NALM-1, NW-145, OPCN/OPCT cell lines, Peer, PNT-1A/PNT 2, RenCa, RIN-5F, RMA/RMAS, Saos-2 cells, Sf-9, SkBr3, T2, T-47D, T84, THP1 cell line, U373, U87, U937, VcaP, Vero cells, WM39, WT-49, X63, YAC-1, YAR, and transgenic varieties thereof. Cell lines are available from a variety of sources known to those with skill in the art (see, e.g., the American Type Culture Collection (ATCC) (Manassas, Va.)). In some embodiments, a cell transfected with one or more vectors described herein is used to establish a new cell line comprising one or more vector-derived sequences. [0268] In some embodiments, one or more vectors described herein are used to produce a non- human transgenic animal or transgenic plant. In some embodiments, the transgenic animal is a mammal, such as a mouse, rat, or rabbit. In certain embodiments, the organism or subject is a plant. In certain embodiments, the organism or subject or plant is algae. Methods for producing
Attorney Ref: 38909-52322/WO Client Ref: 005WO transgenic plants and animals are known in the art, and generally begin with a method of cell transfection, such as described herein. [0269] In one aspect, the disclosure provides for methods of modifying a target polynucleotide in a prokaryotic or eukaryotic cell, which may be in vivo, ex vivo or in vitro. In some embodiments, the method comprises sampling a cell or population of cells from a human or non-human animal or plant (including micro-algae) and modifying the cell or cells. Culturing may occur at any stage ex vivo. The cell or cells may even be re-introduced into the non-human animal or plant (including micro-algae). [0270] In plants, pathogens are often host-specific. For example, Fusariumn oxysporum f. sp. Lycopersici causes tomato wilt but attacks only tomato, and F. oxysporum f. dianthii Puccinia graminis f. sp. Tritici attacks only wheat. Plants have existing and induced defenses to resist most pathogens. Mutations and recombination events across plant generations lead to genetic variability that gives rise to susceptibility, especially as pathogens reproduce with more frequency than plants. In plants there can be non-host resistance, e.g., the host and pathogen are incompatible. There can also be Horizontal Resistance, e.g., partial resistance against all races of a pathogen, typically controlled by many genes and Vertical Resistance, e.g., complete resistance to some races of a pathogen but not to other races, typically controlled by a few genes. In a Gene-for-Gene level, plants and pathogens evolve together, and the genetic changes in one balance changes in other. Accordingly, using Natural Variability, breeders combine most useful genes for Yield. Quality, Uniformity, Hardiness, Resistance. The sources of resistance genes include native or foreign Varieties, Heirloom Varieties, Wild Plant Relatives, and Induced Mutations, e.g., treating plant material with mutagenic agents. Using the present disclosure, plant breeders are provided with a new tool to induce mutations. Accordingly, one skilled in the art can analyze the genome of sources of resistance genes, and in Varieties having desired characteristics or traits employ the present disclosure to induce the rise of resistance genes, with more precision than previous mutagenic agents and hence accelerate and improve plant breeding programs. [0271] Examples of target polynucleotides include a sequence associated with a signaling biochemical pathway, e.g., a signaling biochemical pathway-associated gene or polynucleotide. Examples of target polynucleotides include a disease associated gene or polynucleotide. A “disease-associated” gene or polynucleotide refers to any gene or polynucleotide which is yielding transcription or translation products at an abnormal level or in an abnormal form in cells derived from a disease-affected tissues compared with tissues or cells of a non disease control. It may be a gene that becomes expressed at an abnormally high level; it may be a gene that becomes
Attorney Ref: 38909-52322/WO Client Ref: 005WO expressed at an abnormally low level, where the altered expression correlates with the occurrence and/or progression of the disease. A disease-associated gene also refers to a gene possessing mutation(s) or genetic variation that is directly responsible or is in linkage disequilibrium with a gene(s) that is responsible for the etiology of a disease. The transcribed or translated products may be known or unknown and may be at a normal or abnormal level. 7. ADDITIONAL EMBODIMENTS [0272] Embodiment 1. A guide RNA, comprising from 5’ to 3’: a spacer complementary to a first target genomic site; a scaffold capable of binding to a DNA binding nickase; a reverse transcriptase template comprised of at least a first segment and a second segment, wherein the first segment encodes a clamp segment complementary to a second genomic site and the second segment encodes an integrase target recognition site; and a primer binding site complementary to a third genomic site; wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence. [0273] Embodiment 2. The guide RNA of any one of Embodiment 1, further comprising a 3’ extension capable of binding one or more consecutive nucleotides of the reverse transcriptase template first segment. [0274] Embodiment 3. The guide RNA of any one of Embodiments 1-2, wherein the second genomic site and the third genomic site are separated by one or more genomic bases. [0275] Embodiment 4. The guide RNA of Embodiment 3, wherein the second genomic site and the third genomic site are separated by 15-21, 33-39, or 35-41 genomic bases. [0276] Embodiment 5. The guide RNA of any one of Embodiments 1-4, wherein the clamp segment binds to the second genomic site with one or more nucleotide mismatches. [0277] Embodiment 6. The guide RNA of Embodiment 5, wherein the mismatches are consecutive or non-consecutive. [0278] Embodiment 7. The guide RNA of any one of Embodiments 1-6, wherein the guide RNA is circularized. [0279] Embodiment 8. A guide RNA complex, comprising: a spacer complementary to a first target genomic site; a scaffold capable of binding to a DNA binding nickase; a reverse transcriptase template comprised of at least a first segment and a second segment, wherein the first segment
Attorney Ref: 38909-52322/WO Client Ref: 005WO encodes a clamp segment complementary to a second genomic site and the second segment encodes an integrase target recognition site; a primer binding site complementary to a third genomic site; wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence. [0280] Embodiment 9. The guide RNA complex of Embodiment 8, wherein the guide RNA complex is comprised of a first polynucleotide component and a second polynucleotide component. [0281] Embodiment 10. The guide RNA complex of Embodiment 9, wherein the first polynucleotide component comprises a spacer complementary to a first target genomic site and a scaffold capable of binding to a DNA binding nickase, and wherein the second polynucleotide component comprises a reverse transcriptase template comprised of at least a first segment and a second segment, wherein said first segment encodes a clamp segment complementary to a second genomic site and said second segment encodes an integrase target recognition site, a primer binding site complementary to a third genomic site, and a RNA-protein recruitment domain, and wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence. [0282] Embodiment 11. The guide RNA complex of Embodiment 10, wherein one or more of the first polynucleotide component and the second polynucleotide component are circularized. [0283] Embodiment 12. The guide RNA or the guide RNA complex of any one of Embodiments 1-11, further comprises one or more of an RNA-protein recruitment domain, a transcriptional termination signal, a reverse transcription termination signal, a nucleobase analog, an RNA ribozyme, or a chemical linker. [0284] Embodiment 13. The guide RNA or the guide RNA complex of any one of Embodiments 1-12, wherein the guide RNA reverse transcriptase template encodes for a first single-stranded DNA sequence that contains a complementary region to a second single-stranded DNA sequence encoded by a second guide RNA comprised of a reverse transcriptase template. [0285] Embodiment 14. The guide RNA or the guide RNA complex of any one of Embodiments 1-13, wherein an RNA polynucleotide comprises the guide RNA or a DNA polynucleotide encodes the guide RNA. [0286] Embodiment 15. The guide RNA of Embodiment 1 or the guide RNA complex of Embodiment 2, wherein the guide RNA or guide RNA complex is capable of binding a DNA
Attorney Ref: 38909-52322/WO Client Ref: 005WO binding nickase selected from the group consisting of: Cas9-D10A, Cas9-H840A, Cas12a/b/c/d/e nickase, CasX nickase, SaCas9 nickase, and CasY nickase. [0287] Embodiment 16. The guide RNA or the guide RNA complex of Embodiment 14, wherein the guide RNA or the guide RNA complex is introduced into a cell using a recombinant adenovirus, helper dependent adenovirus, AAV, lentivirus, HSV, annelovirus, retrovirus, Doggybone DNA, minicircle, plasmid, miniDNA, nanoplasmid, exosome, fusosome, mRNA, RNP, or lipid nanoparticle. [0288] Embodiment 17. A method of polynucleotide integration into a cellular genome, the method comprising: incorporating an integration target recognition site into a cell genome by delivering: a gene editor protein or polynucleotide encoding a gene editor protein, wherein the gene editor protein is capable of polynucleotide genomic integration; one or more guide RNA or guide RNA complex selected from any one of Embodiments 1-16; optionally, a nicking gRNA; and integrating the polynucleotide into the cellular genome by delivering: a donor polynucleotide template, wherein the donor polynucleotide template is comprised of an integration target recognition site, wherein the donor polynucleotide is integrated into the cellular genome at the incorporated genomic integration target recognition site by the gene editor protein. 8. EXAMPLES 8.1. Example 1: Attachment Site-Containing guide RNA (atgRNAs) capable of partial genomic replacement demonstrate improved attachment site genomic placement [0289] In this example, attachment site-containing guide RNAs (atgRNAs) that are capable of site-specifically incorporating an integrase recognition site into a genomic locus were assessed for percent beacon placement (i.e., percent integration efficiency of integration recognition site). In particular, an atgRNA targeting the LMBN1 locus and capable of site-specifically incorporating an integrase recognition site into the LMBN1 locus were synthesized to allow for a 0 nucleotide separation (see atgRNA (v1) in FIGs.3A-3C), 18 nucleotide separation (see, e.g., atgRNA (v2) in FIGs. 4A-4C where the separation is 18 nucleotides), or 36 nucleotide separation (see, e.g., atgRNA (v2) in FIGs. 4A-4C where the separation is 36 nucleotides), respectively, between the third genomic site (i.e., PBS binding site) and the second genomic site (i.e., clamp binding site). The synthesized atgRNAs incorporated a clamp segment 10 nucleotides long and a 38 nucleotide integration recognition site (e.g., AttB or AttP).
Attorney Ref: 38909-52322/WO Client Ref: 005WO [0290] AtgRNAs were electroporated along with mRNA encoding a gene editor polypeptide (i.e., a Cas9-nickase-reverse transcriptase fusion) and a nicking guide RNA (ngRNA) into HEK 293 cells. In particular, the Neon electroporation system was used. AtgRNA and nicking gRNAs were ordered from Integrated DNA Technologies as Alt-R gRNA. As shown in Table 11, 1 μg of mRNA encoding the gene editor polypeptide, 120 pmol of atgRNA, 40 pmol of nicking sgRNA and 50,000 HEK293T cells were mixed in Buffer R and electroporated using 10-μl Neon tips using the following electroporation parameters: 1,150 V, 20 ms, two pulses. After electroporation, cells were plated in prewarmed 48-well plates with DMEM containing 10% FBS and incubated for 72 h before analysis. l
*Cells are either HEK293 or Jurkat cells [0291] As shown in FIG. 5, an atgRNA having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site and the third genomic site are separated by 18 base pairs demonstrated an 82% beacon placement efficiency as measured by droplet digital PCR (ddPCR). In contrast, an atgRNA having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site are adjacent (i.e., a 0-base separation) demonstrated a 74% beacon placement efficiency.
Attorney Ref: 38909-52322/WO Client Ref: 005WO 8.2. Example 2: Attachment Site-Containing guide RNA (atgRNAs) capable of partial genomic replacement demonstrate improved attachment site genomic placement [0292] In this example, attachment site-containing guide RNAs (atgRNAs) that are capable of site-specifically incorporating an integrase recognition site into a genomic locus were assessed for percent beacon placement. The AtgRNA were designed to target the LMNB1 locus, the hF9 locus, and the ACTB locus and with each atgRNA capable of site-specifically incorporating an integrase recognition site into the LMBN1 locus, the hF9 locus, and the ACTB locus, respectively. The atgRNAs targeting the LMNB1 locus were synthesized to allow for a 0-36 base pair separation between the second genomic site and the third genomic site. The atgRNAs targeting the hF9 locus were synthesized to allow for a 0-42 base pair separation between the second genomic site and the third genomic site. The atgRNAs targeting the ACTB locus were synthesized to allow for a 0-33 base pair separation between the second genomic site and the third genomic site. The synthesized atgRNAs incorporate the use of a 10 nucleotide clamp segment and a 38 nucleotide integration recognition site (e.g., AttB or AttP). atgRNA and nicking gRNAs were ordered from Integrated DNA Technologies as Alt-R gRNA. [0293] The atgRNAs were transfected along with mRNA encoding a gene editor polypeptide (i.e., a Cas9-nickase-reverse transcriptase fusion) and a nicking guide RNA (ngRNA) into HEK 293 cells using Lipofectamine MessengerMax (Thermo Fisher) according to the manufacturer’s instructions. In particular, 1 μg of a gene editor polypeptide (Cas9-nickase linked to a reverse transcriptase) encoding mRNA, 120 pmol of atgRNA, 40 pmol of nicking sgRNA were transfected into 50,000 HEK293T cells. After transfection, cells were allowed to recover for 72 hours in DMEM containing 10% FBS before analysis. [0294] FIG. 7A shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNAs targeting the LMNB1 locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site and the third genomic site are separated by 0 to 36 base pairs (see FIG. 7A “clamp+0” (i.e., 0 base pair separation) to “clamp+36” (i.e., 36 base pair separation). Percent beacon placement was highest for atgRNA having a 10 base pair separation between the second and third genomic sites (see “clamp+10” in FIG.7A). [0295] FIG. 7B shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNAs targeting the hF9 locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a
Attorney Ref: 38909-52322/WO Client Ref: 005WO third genomic site, where the second genomic site and the third genomic site are separated by 0 to 42 base pairs (see FIG.7B “clamp+0” (i.e., 0 base pair separation) to “clamp+42” (i.e., 42 base pair separation). Percent beacon placement was highest for atgRNA having a 31 base pair separation between the second and third genomic sites (see “clamp+31” in FIG.7B). [0296] FIG. 7C shows percent beacon placement (i.e., efficiency of integration of the integration recognition site) for atgRNAs targeting the ACTB locus having a clamp segment at least partially complementary to a second genomic site and a primer binding site complementary to a third genomic site, where the second genomic site and the third genomic site are separated by 0 to 33 base pairs (see FIG. 7C “clamp+0” (i.e., 0 base pair separation) to “clamp+33” (i.e., 33 base pair separation). Percent beacon placement is highest for atgRNA having a 16 base pair separation between the second and third genomic sites (see “clamp+16” in FIG.7C). [0297] Overall, this data showed the distance between the second genomic site and the third genomic site can impact percent beacon placement. 8.3. Example 3: PASTE with atgRNAs capable of partial genomic replacement [0298] A PASTE readout in HEK293 cells is used to identify functionally active atgRNAs capable of inserting an integration recognition site where insertion includes partial genomic replacement. Droplet digital PCR (ddPCR) in a 96-well plate format is applied to verify gene insertion into the ACTB locus. Using Lipofectamine 3000 transfection reagent, 100ng total DNA at a plasmid ratio of 1:1:4 of Cas9(nickase)-RT-BxB1 : nicking guide : atgRNA is delivered to 30,000 HEK293 cells. The percent integration across tested atgRNAs capable of partial genomic replacement is verified via ddPCR. 9. EQUIVALENTS AND INCORPORATION BY REFERENCE [0299] All references cited herein are incorporated by reference to the same extent as if each individual publication, database entry (e.g., Genbank sequences or GeneID entries), patent application, or patent, was specifically and individually indicated incorporated by reference in its entirety, for all purposes. This statement of incorporation by reference is intended by Applicants, pursuant to 37 C.F.R. §1.57(b)(1), to relate to each and every individual publication, database entry (e.g., Genbank sequences or GeneID entries), patent application, or patent, each of which is clearly identified in compliance with 37 C.F.R. §1.57(b)(2), even if such citation is not immediately adjacent to a dedicated statement of incorporation by reference. The inclusion of dedicated statements of incorporation by reference, if any, within the specification does not in any way
Attorney Ref: 38909-52322/WO Client Ref: 005WO weaken this general statement of incorporation by reference. Citation of the references herein is not intended as an admission that the reference is pertinent prior art, nor does it constitute any admission as to the contents or date of these publications or documents. [0300] While the disclosure has been particularly shown and described with reference to a preferred embodiment and various alternate embodiments, it is understood by persons skilled in the relevant art that various changes in form and details can be made therein without departing from the spirit and scope of the disclosure.
Claims
Attorney Ref: 38909-52322/WO Client Ref: 005WO WHAT IS CLAIMED IS: 1. An attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double- stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a reverse transcriptase (RT) template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed the sequence is at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; and a primer binding site complementary to a third genomic site of the double-stranded target DNA sequence, wherein the third genomic site is on a second strand of the target DNA sequence, wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site and the clamp segment. 2. An attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double- stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a reverse transcriptase (RT) template comprising: an integration recognition site; and a clamp segment comprising a sequence where upon being reverse transcribed comprises a sequence at least partially complementary to a second genomic site of first strand of the double-stranded target DNA sequence; and a primer binding site comprising: a sequence complementary to a third genomic site of the double-stranded DNA target sequence, wherein the third genomic site is on a second strand of the target DNA sequence; and
Attorney Ref: 38909-52322/WO Client Ref: 005WO an anti-reverse transcriptase template (anti-RTT) sequence comprising a sequence at least partially complementary to all or a portion of the RT template, wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site and the clamp segment. 3. An attachment site containing guide RNA (atgRNA), comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double- stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a clamp segment at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; a reverse transcriptase (RT) template comprising an integration recognition site; and a primer binding site complementary to a third genomic site of the double-stranded target DNA sequence, wherein the third genomic site is on a second strand of the target DNA sequence; wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site. 4. An attachment site containing guide RNA (atgRNA) comprising from 5’ to 3’: a spacer sequence complementary to a first genomic site of a first strand of a double- stranded target DNA sequence; a scaffold capable of associating with a DNA binding nickase domain and guiding the DNA binding nickase domain to the target DNA sequence; a clamp segment at least partially complementary to a second genomic site of the first strand of the double-stranded target DNA sequence; a reverse transcriptase (RT) template comprising an integration recognition site; and a primer binding site comprising: a sequence complementary to a third genomic site of the double-stranded DNA target sequence, wherein the third genomic site is on a second strand of the target DNA sequence; and
Attorney Ref: 38909-52322/WO Client Ref: 005WO an anti-reverse transcriptase template (anti-RTT) sequence comprising a sequence at least partially complementary to all or a portion of the RT template, wherein the reverse transcriptase template encodes for a first single-stranded DNA sequence comprising the integration recognition site. 5. The atgRNA of any one of claims 1-4, wherein, upon introducing the atgRNA into a cell with a gene editor polypeptide, all or a portion of the single-stranded DNA sequence comprising the integration recognition site is site-specifically integrated into the double-stranded target DNA sequence. 6. The atgRNA of any one of claims 1-5, wherein, upon introducing the atgRNA into a cell with a gene editor polypeptide, the atgRNA comprising the clamp segment enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the double-stranded target DNA sequence when using an atgRNA that does not comprise the clamp segment. 7. The atgRNA of any one of claims 1-6, wherein the atgRNA has a length of 150 to 190 nucleotides. 8. The atgRNA of claim 7, wherein the atgRNA has a length of 160 to 180 nucleotides. 9. The atgRNA of any one of claims 1-8, wherein the clamp segment has a length of 1 nucleotide to 40 nucleotides. 10. The atgRNA of claim 9, wherein the clamp segment has a length of 1 to 10 nucleotides, 11 to 20 nucleotides, 21 to 30 nucleotides, or 31 to 40 nucleotides. 11. The atgRNA of claim 10, wherein the clamp segment has a length of 10 nucleotides. 12. The atgRNA of any one of claims 1-11, wherein the RT template has a length of 10 nucleotides to 200 nucleotides. 13. The atgRNA of claim 12, wherein the RT template has a length of 30, 32, 34, 36, 38, 40, 42, 44, 46, 48 or 50 nucleotides.
Attorney Ref: 38909-52322/WO Client Ref: 005WO 14. The atgRNA of any one of claims 1-13, wherein the second genomic site and the third genomic site are separated by one or more genomic bases in the double-stranded target DNA sequence. 15. The atgRNA of any one of claims 1-14, wherein the second genomic site and the third genomic site are separated by 1 to 10 genomic bases, 11 to 20 genomic bases, 21 to30 genomic bases, 31 to 40 genomic bases, or 41 to 45 genomic bases in the double-stranded target DNA sequence. 16. The atgRNA of any one of claims 1-14, wherein the second genomic site and the third genomic site are separated by 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, or 45 genomic bases in the double-stranded target DNA sequence. 17. The atgRNA of any one of claims 1-13, wherein the second genomic site and the third genomic site are adjacent in the double-stranded target DNA sequence. 18. The atgRNA of any one of claims 1-17, wherein the clamp segment comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity to the second genomic site. 19. The atgRNA of any one of claims 1-17, wherein the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with at least a first nucleotide mismatch. 20. The atgRNA of claim 19, wherein the clamp segment comprises a sequence that is capable of binding to the second genomic site of the first strand of the double-stranded target DNA sequence with a first nucleotide mismatch and a second nucleotide mismatch. 21. The atgRNA of claim 20, wherein the first nucleotide mismatch and the second nucleotide mismatch are consecutive. 22. The atgRNA of claim 20, wherein the first nucleotide mismatch and the second nucleotide mismatch are non-consecutive.
Attorney Ref: 38909-52322/WO Client Ref: 005WO 23. The atgRNA of any one of claims 2 or 4-22, wherein the anti-RTT sequence comprises a sequence having 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity to all or a portion the RT template. 24. The atgRNA of claim 23, wherein the anti-RTT sequence has a length between 1 nucleotide to 20 nucleotides, 5 nucleotides to 15 nucleotides, or 10 nucleotides to 20 nucleotides. 25. The atgRNA of claim 24, wherein the anti-RTT sequence has a length of 10 nucleotides. 26. The atgRNA of any one of claims 2 or 4-25, wherein, upon introducing the atgRNA into a cell, the anti-RTT sequence increases stability of the atgRNA as compared to an atgRNA that does not comprise the anti-RTT sequence. 27. A system for site-specifically incorporating an integration recognition site into a mammalian cell genome at a target DNA sequence, comprising: the attachment site containing guide RNA (atgRNA) of any one of claims 1-26; and a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain capable of incorporating the integration recognition site into the target DNA sequence, wherein, upon introducing into the cell, the system integrates the integration recognition site into the target DNA sequence. 28. The system of claim 27, wherein, upon introducing the system into the cell, the atgRNA comprising the clamp segment enhances integration efficiency of the integration recognition site at the target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment. 29. A method for site-specifically incorporating an integration recognition site into a mammalian cell genome at a target DNA sequence, comprising: incorporating an integration target recognition site into the genome by delivering into the cell: the attachment site containing guide RNA (atgRNA) of any one of claims 1-26; and
Attorney Ref: 38909-52322/WO Client Ref: 005WO a gene editor polypeptide or polynucleotide encoding a gene editor polypeptide, wherein the gene editor polypeptide comprises a DNA binding nickase domain linked to a reverse transcriptase domain, wherein the integration recognition site is incorporated into the target DNA sequence by the gene editor polypeptide; and optionally, a nicking gRNA. 30. The method of claim 29, wherein the method using the atgRNA comprising the clamp segment enhances integration efficiency of the integration recognition site at the target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment. 31. A system for site-specifically integrating a donor polynucleotide template into a mammalian cell genome at a target DNA sequence, comprising: the attachment site containing gRNA (atgRNA) of any one of claims 1-26; a gene editor polypeptide comprising a DNA binding nickase domain linked to a reverse transcriptase domain capable of incorporating the integration recognition site into the target DNA sequence, an integration enzyme; and a donor polynucleotide template linked to a sequence that is an integration cognate of the integration recognition site present in the atgRNA, whereby the integration enzyme integrates the donor polynucleotide template into the target DNA sequence. 32. The system of claim 31, wherein, upon introducing the system into the cell, the atgRNA comprising the clamp enhances integration efficiency of the integration recognition site at the target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment. 33. A method for site-specifically integrating a donor polynucleotide template into a mammalian cell genome at a target DNA sequence, comprising: incorporating an integration recognition site into the genome by delivering into the cell:
Attorney Ref: 38909-52322/WO Client Ref: 005WO the attachment site containing guide RNA (atgRNA) of any one of claims 1-26; and a gene editor polypeptide or polynucleotide encoding a gene editor polypeptide, wherein the gene editor polypeptide comprises a DNA binding nickase domain linked to a reverse transcriptase domain and is capable of incorporating the integration recognition site into the target DNA sequence; and optionally, a nicking gRNA; and integrating the donor polynucleotide template into the genome by delivering into the cell: an integration enzyme; and a donor polynucleotide template, wherein the donor polynucleotide template is linked to a sequence that is an integration cognate of the integration recognition site present in the atgRNA, and wherein the donor polynucleotide template is integrated into the genome at the incorporated genomic integration recognition site by the integration enzyme. 34. The method of claim 33, wherein the method enhances integration efficiency of the integration recognition site at the double-stranded target DNA sequence as compared to the integration efficiency of the integration recognition site at the target DNA sequence when using an atgRNA that does not comprise the clamp segment.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202263338077P | 2022-05-04 | 2022-05-04 | |
US63/338,077 | 2022-05-04 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2023215831A1 true WO2023215831A1 (en) | 2023-11-09 |
Family
ID=86604146
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2023/066609 WO2023215831A1 (en) | 2022-05-04 | 2023-05-04 | Guide rna compositions for programmable gene insertion |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2023215831A1 (en) |
Citations (78)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4186183A (en) | 1978-03-29 | 1980-01-29 | The United States Of America As Represented By The Secretary Of The Army | Liposome carriers in chemotherapy of leishmaniasis |
US4217344A (en) | 1976-06-23 | 1980-08-12 | L'oreal | Compositions containing aqueous dispersions of lipid spheres |
US4235871A (en) | 1978-02-24 | 1980-11-25 | Papahadjopoulos Demetrios P | Method of encapsulating biologically active materials in lipid vesicles |
US4261975A (en) | 1979-09-19 | 1981-04-14 | Merck & Co., Inc. | Viral liposome particle |
US4485054A (en) | 1982-10-04 | 1984-11-27 | Lipoderm Pharmaceuticals Limited | Method of encapsulating biologically active materials in multilamellar lipid vesicles (MLV) |
US4501728A (en) | 1983-01-06 | 1985-02-26 | Technology Unlimited, Inc. | Masking of liposomes from RES recognition |
US4774085A (en) | 1985-07-09 | 1988-09-27 | 501 Board of Regents, Univ. of Texas | Pharmaceutical administration systems containing a mixture of immunomodulators |
US4797368A (en) | 1985-03-15 | 1989-01-10 | The United States Of America As Represented By The Department Of Health And Human Services | Adeno-associated virus as eukaryotic expression vector |
US4837028A (en) | 1986-12-24 | 1989-06-06 | Liposome Technology, Inc. | Liposomes with enhanced circulation time |
US4897355A (en) | 1985-01-07 | 1990-01-30 | Syntex (U.S.A.) Inc. | N[ω,(ω-1)-dialkyloxy]- and N-[ω,(ω-1)-dialkenyloxy]-alk-1-yl-N,N,N-tetrasubstituted ammonium lipids and uses therefor |
US4946787A (en) | 1985-01-07 | 1990-08-07 | Syntex (U.S.A.) Inc. | N-(ω,(ω-1)-dialkyloxy)- and N-(ω,(ω-1)-dialkenyloxy)-alk-1-yl-N,N,N-tetrasubstituted ammonium lipids and uses therefor |
US5013556A (en) | 1989-10-20 | 1991-05-07 | Liposome Technology, Inc. | Liposomes with enhanced circulation time |
US5049386A (en) | 1985-01-07 | 1991-09-17 | Syntex (U.S.A.) Inc. | N-ω,(ω-1)-dialkyloxy)- and N-(ω,(ω-1)-dialkenyloxy)Alk-1-YL-N,N,N-tetrasubstituted ammonium lipids and uses therefor |
WO1991016024A1 (en) | 1990-04-19 | 1991-10-31 | Vical, Inc. | Cationic lipids for intracellular delivery of biologically active molecules |
WO1991017424A1 (en) | 1990-05-03 | 1991-11-14 | Vical, Inc. | Intracellular delivery of biologically active substances by means of self-assembling lipid complexes |
US5173414A (en) | 1990-10-30 | 1992-12-22 | Applied Immune Sciences, Inc. | Production of recombinant adeno-associated virus vectors |
WO1993024641A2 (en) | 1992-06-02 | 1993-12-09 | The United States Of America, As Represented By The Secretary, Department Of Health & Human Services | Adeno-associated virus with inverted terminal repeat sequences as promoter |
US5846946A (en) | 1996-06-14 | 1998-12-08 | Pasteur Merieux Serums Et Vaccins | Compositions and methods for administering Borrelia DNA |
US6027726A (en) | 1994-09-30 | 2000-02-22 | Inex Phamaceuticals Corp. | Glycosylated protein-liposome conjugates and methods for their preparation |
US20030087817A1 (en) | 1999-01-12 | 2003-05-08 | Sangamo Biosciences, Inc. | Regulation of endogenous gene expression in cells using zinc finger proteins |
US20040142025A1 (en) | 2002-06-28 | 2004-07-22 | Protiva Biotherapeutics Ltd. | Liposomal apparatus and manufacturing methods |
US20040171156A1 (en) | 1995-06-07 | 2004-09-02 | Invitrogen Corporation | Recombinational cloning using nucleic acids having recombination sites |
US20070042031A1 (en) | 2005-07-27 | 2007-02-22 | Protiva Biotherapeutics, Inc. | Systems and methods for manufacturing liposomes |
US8404658B2 (en) | 2007-12-31 | 2013-03-26 | Nanocor Therapeutics, Inc. | RNA interference for the treatment of heart failure |
US8454972B2 (en) | 2004-07-16 | 2013-06-04 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Method for inducing a multiclade immune response against HIV utilizing a multigene and multiclade immunogen |
WO2013086354A1 (en) | 2011-12-07 | 2013-06-13 | Alnylam Pharmaceuticals, Inc. | Biodegradable lipids for the delivery of active agents |
WO2013116126A1 (en) | 2012-02-01 | 2013-08-08 | Merck Sharp & Dohme Corp. | Novel low molecular weight, biodegradable cationic lipids for oligonucleotide delivery |
US8691750B2 (en) | 2011-05-17 | 2014-04-08 | Axolabs Gmbh | Lipids and compositions for intracellular delivery of biologically active compounds |
US8697359B1 (en) | 2012-12-12 | 2014-04-15 | The Broad Institute, Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
EP2755693A2 (en) | 2011-09-12 | 2014-07-23 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
EP2755986A1 (en) | 2011-09-12 | 2014-07-23 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
US8795965B2 (en) | 2012-12-12 | 2014-08-05 | The Broad Institute, Inc. | CRISPR-Cas component systems, methods and compositions for sequence manipulation |
US8822663B2 (en) | 2010-08-06 | 2014-09-02 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
WO2014152940A1 (en) | 2013-03-14 | 2014-09-25 | Shire Human Genetic Therapies, Inc. | Mrna therapeutic compositions and use to treat diseases and disorders |
US8865406B2 (en) | 2012-12-12 | 2014-10-21 | The Broad Institute Inc. | Engineering and optimization of improved systems, methods and enzyme compositions for sequence manipulation |
US8889356B2 (en) | 2012-12-12 | 2014-11-18 | The Broad Institute Inc. | CRISPR-Cas nickase systems, methods and compositions for sequence manipulation in eukaryotes |
US8906616B2 (en) | 2012-12-12 | 2014-12-09 | The Broad Institute Inc. | Engineering of systems, methods and optimized guide compositions for sequence manipulation |
US8993233B2 (en) | 2012-12-12 | 2015-03-31 | The Broad Institute Inc. | Engineering and optimization of systems, methods and compositions for sequence manipulation with functional domains |
NZ700688A (en) | 2009-12-01 | 2016-02-26 | Shire Human Genetic Therapies | Delivery of mrna for the augmentation of proteins and enzymes in human genetic diseases |
US9405700B2 (en) | 2010-11-04 | 2016-08-02 | Sonics, Inc. | Methods and apparatus for virtualization in an integrated circuit |
US9790490B2 (en) | 2015-06-18 | 2017-10-17 | The Broad Institute Inc. | CRISPR enzymes and systems |
BR112016030852A2 (en) | 2014-07-02 | 2018-01-16 | Shire Human Genetic Therapies | rna messenger encapsulation |
US10000772B2 (en) | 2012-05-25 | 2018-06-19 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
EP3362461A1 (en) | 2015-10-16 | 2018-08-22 | Modernatx, Inc. | Mrna cap analogs with modified phosphate linkage |
US10190137B2 (en) | 2013-11-07 | 2019-01-29 | Editas Medicine, Inc. | CRISPR-related methods and compositions with governing gRNAS |
US10266887B2 (en) | 2016-12-09 | 2019-04-23 | The Broad Institute, Inc. | CRISPR effector system based diagnostics |
US10377998B2 (en) | 2013-12-12 | 2019-08-13 | The Broad Institute, Inc. | CRISPR-CAS systems and methods for altering expression of gene products, structural information and inducible modular CAS enzymes |
US10385336B2 (en) | 2014-09-05 | 2019-08-20 | Vilnius University | Programmable RNA shredding by the type III-A CRISPR-Cas system of Streptococcus thermophilus |
US10494621B2 (en) | 2015-06-18 | 2019-12-03 | The Broad Institute, Inc. | Crispr enzyme mutations reducing off-target effects |
EP3450553B1 (en) | 2014-03-24 | 2019-12-25 | Translate Bio, Inc. | Mrna therapy for treatment of ocular diseases |
US10519454B2 (en) | 2014-08-06 | 2019-12-31 | Toolgen Incorporated | Genome editing using Campylobacter jejuni CRISPR/CAS system-derived RGEN |
WO2020014577A1 (en) | 2018-07-13 | 2020-01-16 | Allele Biotechnology And Pharmaceuticals, Inc. | Methods of achieving high specificity of genome editing |
US10550372B2 (en) | 2013-12-12 | 2020-02-04 | The Broad Institute, Inc. | Systems, methods and compositions for sequence manipulation with optimized functional CRISPR-Cas systems |
ES2740248T3 (en) | 2011-06-08 | 2020-02-05 | Translate Bio Inc | Lipid nanoparticle compositions and methods for mRNA administration |
US10577630B2 (en) | 2013-06-17 | 2020-03-03 | The Broad Institute, Inc. | Delivery and use of the CRISPR-Cas systems, vectors and compositions for hepatic targeting and therapy |
US20200109398A1 (en) | 2018-08-28 | 2020-04-09 | Flagship Pioneering, Inc. | Methods and compositions for modulating a genome |
US10648020B2 (en) | 2015-06-18 | 2020-05-12 | The Broad Institute, Inc. | CRISPR enzymes and systems |
US10689691B2 (en) | 2014-12-19 | 2020-06-23 | The Broad Institute, Inc. | Unbiased identification of double-strand breaks and genomic rearrangement by genome-wide insert capture sequencing |
US10711285B2 (en) | 2013-06-17 | 2020-07-14 | The Broad Institute, Inc. | Optimized CRISPR-Cas double nickase systems, methods and compositions for sequence manipulation |
US10731181B2 (en) | 2012-12-06 | 2020-08-04 | Sigma, Aldrich Co. LLC | CRISPR-based genome modification and regulation |
US10781444B2 (en) | 2013-06-17 | 2020-09-22 | The Broad Institute, Inc. | Functional genomics using CRISPR-Cas systems, compositions, methods, screens and applications thereof |
WO2020191239A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
US10851380B2 (en) | 2012-10-23 | 2020-12-01 | Toolgen Incorporated | Methods for cleaving a target DNA using a guide RNA specific for the target DNA and Cas protein-encoding nucleic acid or Cas protein |
US10851357B2 (en) | 2013-12-12 | 2020-12-01 | The Broad Institute, Inc. | Compositions and methods of use of CRISPR-Cas systems in nucleotide repeat disorders |
US10930367B2 (en) | 2012-12-12 | 2021-02-23 | The Broad Institute, Inc. | Methods, models, systems, and apparatus for identifying target sequences for Cas enzymes or CRISPR-Cas systems for target sequences and conveying results thereof |
US10946108B2 (en) | 2013-06-17 | 2021-03-16 | The Broad Institute, Inc. | Delivery, use and therapeutic applications of the CRISPR-Cas systems and compositions for targeting disorders and diseases using viral components |
US10954514B2 (en) | 2014-12-12 | 2021-03-23 | The Broad Institute, Inc. | Escorted and functionalized guides for CRISPR-Cas systems |
US10968257B2 (en) | 2018-04-03 | 2021-04-06 | The Broad Institute, Inc. | Target recognition motifs and uses thereof |
US11001829B2 (en) | 2014-09-25 | 2021-05-11 | The Broad Institute, Inc. | Functional screening with optimized functional CRISPR-Cas systems |
US11008588B2 (en) | 2013-06-17 | 2021-05-18 | The Broad Institute, Inc. | Delivery, engineering and optimization of tandem guide systems, methods and compositions for sequence manipulation |
US11021740B2 (en) | 2017-03-15 | 2021-06-01 | The Broad Institute, Inc. | Devices for CRISPR effector system based diagnostics |
US11041173B2 (en) | 2012-12-12 | 2021-06-22 | The Broad Institute, Inc. | Delivery, engineering and optimization of systems, methods and compositions for sequence manipulation and therapeutic applications |
US11060115B2 (en) | 2015-06-18 | 2021-07-13 | The Broad Institute, Inc. | CRISPR enzymes and systems |
US11104937B2 (en) | 2017-03-15 | 2021-08-31 | The Broad Institute, Inc. | CRISPR effector system based diagnostics |
WO2021226558A1 (en) | 2020-05-08 | 2021-11-11 | The Broad Institute, Inc. | Methods and compositions for simultaneous editing of both strands of a target double-stranded nucleotide sequence |
WO2022032085A1 (en) * | 2020-08-07 | 2022-02-10 | The Jackson Laboratory | Targeted sequence insertion compositions and methods |
WO2022067130A2 (en) * | 2020-09-24 | 2022-03-31 | The Broad Institute, Inc. | Prime editing guide rnas, compositions thereof, and methods of using the same |
WO2022087235A1 (en) * | 2020-10-21 | 2022-04-28 | Massachusetts Institute Of Technology | Systems, methods, and compositions for site-specific genetic engineering using programmable addition via site-specific targeting elements (paste) |
-
2023
- 2023-05-04 WO PCT/US2023/066609 patent/WO2023215831A1/en unknown
Patent Citations (146)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4217344A (en) | 1976-06-23 | 1980-08-12 | L'oreal | Compositions containing aqueous dispersions of lipid spheres |
US4235871A (en) | 1978-02-24 | 1980-11-25 | Papahadjopoulos Demetrios P | Method of encapsulating biologically active materials in lipid vesicles |
US4186183A (en) | 1978-03-29 | 1980-01-29 | The United States Of America As Represented By The Secretary Of The Army | Liposome carriers in chemotherapy of leishmaniasis |
US4261975A (en) | 1979-09-19 | 1981-04-14 | Merck & Co., Inc. | Viral liposome particle |
US4485054A (en) | 1982-10-04 | 1984-11-27 | Lipoderm Pharmaceuticals Limited | Method of encapsulating biologically active materials in multilamellar lipid vesicles (MLV) |
US4501728A (en) | 1983-01-06 | 1985-02-26 | Technology Unlimited, Inc. | Masking of liposomes from RES recognition |
US5049386A (en) | 1985-01-07 | 1991-09-17 | Syntex (U.S.A.) Inc. | N-ω,(ω-1)-dialkyloxy)- and N-(ω,(ω-1)-dialkenyloxy)Alk-1-YL-N,N,N-tetrasubstituted ammonium lipids and uses therefor |
US4897355A (en) | 1985-01-07 | 1990-01-30 | Syntex (U.S.A.) Inc. | N[ω,(ω-1)-dialkyloxy]- and N-[ω,(ω-1)-dialkenyloxy]-alk-1-yl-N,N,N-tetrasubstituted ammonium lipids and uses therefor |
US4946787A (en) | 1985-01-07 | 1990-08-07 | Syntex (U.S.A.) Inc. | N-(ω,(ω-1)-dialkyloxy)- and N-(ω,(ω-1)-dialkenyloxy)-alk-1-yl-N,N,N-tetrasubstituted ammonium lipids and uses therefor |
US4797368A (en) | 1985-03-15 | 1989-01-10 | The United States Of America As Represented By The Department Of Health And Human Services | Adeno-associated virus as eukaryotic expression vector |
US4774085A (en) | 1985-07-09 | 1988-09-27 | 501 Board of Regents, Univ. of Texas | Pharmaceutical administration systems containing a mixture of immunomodulators |
US4837028A (en) | 1986-12-24 | 1989-06-06 | Liposome Technology, Inc. | Liposomes with enhanced circulation time |
US5013556A (en) | 1989-10-20 | 1991-05-07 | Liposome Technology, Inc. | Liposomes with enhanced circulation time |
WO1991016024A1 (en) | 1990-04-19 | 1991-10-31 | Vical, Inc. | Cationic lipids for intracellular delivery of biologically active molecules |
WO1991017424A1 (en) | 1990-05-03 | 1991-11-14 | Vical, Inc. | Intracellular delivery of biologically active substances by means of self-assembling lipid complexes |
US5173414A (en) | 1990-10-30 | 1992-12-22 | Applied Immune Sciences, Inc. | Production of recombinant adeno-associated virus vectors |
WO1993024641A2 (en) | 1992-06-02 | 1993-12-09 | The United States Of America, As Represented By The Secretary, Department Of Health & Human Services | Adeno-associated virus with inverted terminal repeat sequences as promoter |
US6027726A (en) | 1994-09-30 | 2000-02-22 | Inex Phamaceuticals Corp. | Glycosylated protein-liposome conjugates and methods for their preparation |
US20040171156A1 (en) | 1995-06-07 | 2004-09-02 | Invitrogen Corporation | Recombinational cloning using nucleic acids having recombination sites |
US5846946A (en) | 1996-06-14 | 1998-12-08 | Pasteur Merieux Serums Et Vaccins | Compositions and methods for administering Borrelia DNA |
US20030087817A1 (en) | 1999-01-12 | 2003-05-08 | Sangamo Biosciences, Inc. | Regulation of endogenous gene expression in cells using zinc finger proteins |
US20040142025A1 (en) | 2002-06-28 | 2004-07-22 | Protiva Biotherapeutics Ltd. | Liposomal apparatus and manufacturing methods |
US8454972B2 (en) | 2004-07-16 | 2013-06-04 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Method for inducing a multiclade immune response against HIV utilizing a multigene and multiclade immunogen |
US20070042031A1 (en) | 2005-07-27 | 2007-02-22 | Protiva Biotherapeutics, Inc. | Systems and methods for manufacturing liposomes |
US8404658B2 (en) | 2007-12-31 | 2013-03-26 | Nanocor Therapeutics, Inc. | RNA interference for the treatment of heart failure |
NZ700688A (en) | 2009-12-01 | 2016-02-26 | Shire Human Genetic Therapies | Delivery of mrna for the augmentation of proteins and enzymes in human genetic diseases |
US8822663B2 (en) | 2010-08-06 | 2014-09-02 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
US9405700B2 (en) | 2010-11-04 | 2016-08-02 | Sonics, Inc. | Methods and apparatus for virtualization in an integrated circuit |
US8691750B2 (en) | 2011-05-17 | 2014-04-08 | Axolabs Gmbh | Lipids and compositions for intracellular delivery of biologically active compounds |
ES2740248T3 (en) | 2011-06-08 | 2020-02-05 | Translate Bio Inc | Lipid nanoparticle compositions and methods for mRNA administration |
EP2755693A2 (en) | 2011-09-12 | 2014-07-23 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
EP2755986A1 (en) | 2011-09-12 | 2014-07-23 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
WO2013086354A1 (en) | 2011-12-07 | 2013-06-13 | Alnylam Pharmaceuticals, Inc. | Biodegradable lipids for the delivery of active agents |
WO2013116126A1 (en) | 2012-02-01 | 2013-08-08 | Merck Sharp & Dohme Corp. | Novel low molecular weight, biodegradable cationic lipids for oligonucleotide delivery |
US10900054B2 (en) | 2012-05-25 | 2021-01-26 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10577631B2 (en) | 2012-05-25 | 2020-03-03 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US11001863B2 (en) | 2012-05-25 | 2021-05-11 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10988780B2 (en) | 2012-05-25 | 2021-04-27 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10982230B2 (en) | 2012-05-25 | 2021-04-20 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10982231B2 (en) | 2012-05-25 | 2021-04-20 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10526619B2 (en) | 2012-05-25 | 2020-01-07 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10519467B2 (en) | 2012-05-25 | 2019-12-31 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10793878B1 (en) | 2012-05-25 | 2020-10-06 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10774344B1 (en) | 2012-05-25 | 2020-09-15 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10752920B2 (en) | 2012-05-25 | 2020-08-25 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US11008589B2 (en) | 2012-05-25 | 2021-05-18 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US11008590B2 (en) | 2012-05-25 | 2021-05-18 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10533190B2 (en) | 2012-05-25 | 2020-01-14 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10676759B2 (en) | 2012-05-25 | 2020-06-09 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10669560B2 (en) | 2012-05-25 | 2020-06-02 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10640791B2 (en) | 2012-05-25 | 2020-05-05 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10000772B2 (en) | 2012-05-25 | 2018-06-19 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10626419B2 (en) | 2012-05-25 | 2020-04-21 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10113167B2 (en) | 2012-05-25 | 2018-10-30 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10612045B2 (en) | 2012-05-25 | 2020-04-07 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10227611B2 (en) | 2012-05-25 | 2019-03-12 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10597680B2 (en) | 2012-05-25 | 2020-03-24 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10988782B2 (en) | 2012-05-25 | 2021-04-27 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10266850B2 (en) | 2012-05-25 | 2019-04-23 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10301651B2 (en) | 2012-05-25 | 2019-05-28 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10308961B2 (en) | 2012-05-25 | 2019-06-04 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10337029B2 (en) | 2012-05-25 | 2019-07-02 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10351878B2 (en) | 2012-05-25 | 2019-07-16 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10358658B2 (en) | 2012-05-25 | 2019-07-23 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10358659B2 (en) | 2012-05-25 | 2019-07-23 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10570419B2 (en) | 2012-05-25 | 2020-02-25 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10563227B2 (en) | 2012-05-25 | 2020-02-18 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10385360B2 (en) | 2012-05-25 | 2019-08-20 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10400253B2 (en) | 2012-05-25 | 2019-09-03 | The Regents Of The University Of California | Methods and compositions or RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10407697B2 (en) | 2012-05-25 | 2019-09-10 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10415061B2 (en) | 2012-05-25 | 2019-09-17 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10421980B2 (en) | 2012-05-25 | 2019-09-24 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10428352B2 (en) | 2012-05-25 | 2019-10-01 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10443076B2 (en) | 2012-05-25 | 2019-10-15 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10487341B2 (en) | 2012-05-25 | 2019-11-26 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US11028412B2 (en) | 2012-05-25 | 2021-06-08 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10513712B2 (en) | 2012-05-25 | 2019-12-24 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10550407B2 (en) | 2012-05-25 | 2020-02-04 | The Regents Of The University Of California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US10851380B2 (en) | 2012-10-23 | 2020-12-01 | Toolgen Incorporated | Methods for cleaving a target DNA using a guide RNA specific for the target DNA and Cas protein-encoding nucleic acid or Cas protein |
US10731181B2 (en) | 2012-12-06 | 2020-08-04 | Sigma, Aldrich Co. LLC | CRISPR-based genome modification and regulation |
US8871445B2 (en) | 2012-12-12 | 2014-10-28 | The Broad Institute Inc. | CRISPR-Cas component systems, methods and compositions for sequence manipulation |
US8865406B2 (en) | 2012-12-12 | 2014-10-21 | The Broad Institute Inc. | Engineering and optimization of improved systems, methods and enzyme compositions for sequence manipulation |
US11041173B2 (en) | 2012-12-12 | 2021-06-22 | The Broad Institute, Inc. | Delivery, engineering and optimization of systems, methods and compositions for sequence manipulation and therapeutic applications |
US8697359B1 (en) | 2012-12-12 | 2014-04-15 | The Broad Institute, Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
US8771945B1 (en) | 2012-12-12 | 2014-07-08 | The Broad Institute, Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
US8795965B2 (en) | 2012-12-12 | 2014-08-05 | The Broad Institute, Inc. | CRISPR-Cas component systems, methods and compositions for sequence manipulation |
US8889418B2 (en) | 2012-12-12 | 2014-11-18 | The Broad Institute Inc. | Engineering and optimization of improved systems, methods and enzyme compositions for sequence manipulation |
US8889356B2 (en) | 2012-12-12 | 2014-11-18 | The Broad Institute Inc. | CRISPR-Cas nickase systems, methods and compositions for sequence manipulation in eukaryotes |
US8895308B1 (en) | 2012-12-12 | 2014-11-25 | The Broad Institute Inc. | Engineering and optimization of improved systems, methods and enzyme compositions for sequence manipulation |
US10930367B2 (en) | 2012-12-12 | 2021-02-23 | The Broad Institute, Inc. | Methods, models, systems, and apparatus for identifying target sequences for Cas enzymes or CRISPR-Cas systems for target sequences and conveying results thereof |
US8906616B2 (en) | 2012-12-12 | 2014-12-09 | The Broad Institute Inc. | Engineering of systems, methods and optimized guide compositions for sequence manipulation |
US8932814B2 (en) | 2012-12-12 | 2015-01-13 | The Broad Institute Inc. | CRISPR-Cas nickase systems, methods and compositions for sequence manipulation in eukaryotes |
US8945839B2 (en) | 2012-12-12 | 2015-02-03 | The Broad Institute Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
US8993233B2 (en) | 2012-12-12 | 2015-03-31 | The Broad Institute Inc. | Engineering and optimization of systems, methods and compositions for sequence manipulation with functional domains |
US8999641B2 (en) | 2012-12-12 | 2015-04-07 | The Broad Institute Inc. | Engineering and optimization of systems, methods and compositions for sequence manipulation with functional domains |
US9822372B2 (en) | 2012-12-12 | 2017-11-21 | The Broad Institute Inc. | CRISPR-Cas component systems, methods and compositions for sequence manipulation |
US9840713B2 (en) | 2012-12-12 | 2017-12-12 | The Broad Institute Inc. | CRISPR-Cas component systems, methods and compositions for sequence manipulation |
WO2014152940A1 (en) | 2013-03-14 | 2014-09-25 | Shire Human Genetic Therapies, Inc. | Mrna therapeutic compositions and use to treat diseases and disorders |
US10577630B2 (en) | 2013-06-17 | 2020-03-03 | The Broad Institute, Inc. | Delivery and use of the CRISPR-Cas systems, vectors and compositions for hepatic targeting and therapy |
US11008588B2 (en) | 2013-06-17 | 2021-05-18 | The Broad Institute, Inc. | Delivery, engineering and optimization of tandem guide systems, methods and compositions for sequence manipulation |
US10711285B2 (en) | 2013-06-17 | 2020-07-14 | The Broad Institute, Inc. | Optimized CRISPR-Cas double nickase systems, methods and compositions for sequence manipulation |
US10946108B2 (en) | 2013-06-17 | 2021-03-16 | The Broad Institute, Inc. | Delivery, use and therapeutic applications of the CRISPR-Cas systems and compositions for targeting disorders and diseases using viral components |
US10781444B2 (en) | 2013-06-17 | 2020-09-22 | The Broad Institute, Inc. | Functional genomics using CRISPR-Cas systems, compositions, methods, screens and applications thereof |
US10640788B2 (en) | 2013-11-07 | 2020-05-05 | Editas Medicine, Inc. | CRISPR-related methods and compositions with governing gRNAs |
US10190137B2 (en) | 2013-11-07 | 2019-01-29 | Editas Medicine, Inc. | CRISPR-related methods and compositions with governing gRNAS |
US10851357B2 (en) | 2013-12-12 | 2020-12-01 | The Broad Institute, Inc. | Compositions and methods of use of CRISPR-Cas systems in nucleotide repeat disorders |
US10550372B2 (en) | 2013-12-12 | 2020-02-04 | The Broad Institute, Inc. | Systems, methods and compositions for sequence manipulation with optimized functional CRISPR-Cas systems |
US10377998B2 (en) | 2013-12-12 | 2019-08-13 | The Broad Institute, Inc. | CRISPR-CAS systems and methods for altering expression of gene products, structural information and inducible modular CAS enzymes |
EP3450553B1 (en) | 2014-03-24 | 2019-12-25 | Translate Bio, Inc. | Mrna therapy for treatment of ocular diseases |
BR112016030852A2 (en) | 2014-07-02 | 2018-01-16 | Shire Human Genetic Therapies | rna messenger encapsulation |
US10519454B2 (en) | 2014-08-06 | 2019-12-31 | Toolgen Incorporated | Genome editing using Campylobacter jejuni CRISPR/CAS system-derived RGEN |
US10385336B2 (en) | 2014-09-05 | 2019-08-20 | Vilnius University | Programmable RNA shredding by the type III-A CRISPR-Cas system of Streptococcus thermophilus |
US11001829B2 (en) | 2014-09-25 | 2021-05-11 | The Broad Institute, Inc. | Functional screening with optimized functional CRISPR-Cas systems |
US10954514B2 (en) | 2014-12-12 | 2021-03-23 | The Broad Institute, Inc. | Escorted and functionalized guides for CRISPR-Cas systems |
US10689691B2 (en) | 2014-12-19 | 2020-06-23 | The Broad Institute, Inc. | Unbiased identification of double-strand breaks and genomic rearrangement by genome-wide insert capture sequencing |
US10494621B2 (en) | 2015-06-18 | 2019-12-03 | The Broad Institute, Inc. | Crispr enzyme mutations reducing off-target effects |
US11060115B2 (en) | 2015-06-18 | 2021-07-13 | The Broad Institute, Inc. | CRISPR enzymes and systems |
US11091798B2 (en) | 2015-06-18 | 2021-08-17 | The Broad Institute, Inc. | CRISPR enzymes and systems |
US9790490B2 (en) | 2015-06-18 | 2017-10-17 | The Broad Institute Inc. | CRISPR enzymes and systems |
US10876100B2 (en) | 2015-06-18 | 2020-12-29 | The Broad Institute, Inc. | Crispr enzyme mutations reducing off-target effects |
US10648020B2 (en) | 2015-06-18 | 2020-05-12 | The Broad Institute, Inc. | CRISPR enzymes and systems |
EP3362461A1 (en) | 2015-10-16 | 2018-08-22 | Modernatx, Inc. | Mrna cap analogs with modified phosphate linkage |
US10266887B2 (en) | 2016-12-09 | 2019-04-23 | The Broad Institute, Inc. | CRISPR effector system based diagnostics |
US10266886B2 (en) | 2016-12-09 | 2019-04-23 | The Broad Institute, Inc. | CRISPR effector system based diagnostics |
US11104937B2 (en) | 2017-03-15 | 2021-08-31 | The Broad Institute, Inc. | CRISPR effector system based diagnostics |
US11021740B2 (en) | 2017-03-15 | 2021-06-01 | The Broad Institute, Inc. | Devices for CRISPR effector system based diagnostics |
US10968257B2 (en) | 2018-04-03 | 2021-04-06 | The Broad Institute, Inc. | Target recognition motifs and uses thereof |
WO2020014577A1 (en) | 2018-07-13 | 2020-01-16 | Allele Biotechnology And Pharmaceuticals, Inc. | Methods of achieving high specificity of genome editing |
US20200109398A1 (en) | 2018-08-28 | 2020-04-09 | Flagship Pioneering, Inc. | Methods and compositions for modulating a genome |
WO2020191153A2 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
WO2020191241A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
WO2020191242A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
WO2020191233A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
WO2020191234A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
WO2020191246A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
WO2020191243A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
WO2020191248A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Method and compositions for editing nucleotide sequences |
WO2020191249A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
WO2020191171A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
WO2020191239A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
WO2020191245A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Methods and compositions for editing nucleotide sequences |
WO2021226558A1 (en) | 2020-05-08 | 2021-11-11 | The Broad Institute, Inc. | Methods and compositions for simultaneous editing of both strands of a target double-stranded nucleotide sequence |
WO2022032085A1 (en) * | 2020-08-07 | 2022-02-10 | The Jackson Laboratory | Targeted sequence insertion compositions and methods |
WO2022067130A2 (en) * | 2020-09-24 | 2022-03-31 | The Broad Institute, Inc. | Prime editing guide rnas, compositions thereof, and methods of using the same |
WO2022087235A1 (en) * | 2020-10-21 | 2022-04-28 | Massachusetts Institute Of Technology | Systems, methods, and compositions for site-specific genetic engineering using programmable addition via site-specific targeting elements (paste) |
US11572556B2 (en) | 2020-10-21 | 2023-02-07 | Massachusetts Institute Of Technology | Systems, methods, and compositions for site-specific genetic engineering using programmable addition via site-specific targeting elements (paste) |
Non-Patent Citations (65)
Title |
---|
ABRA ET AL., J. LIPOSOME RES., vol. 12, 2002, pages 1 - 3 |
AHMAD ET AL., CANCER RES., vol. 52, 1992, pages 4817 - 4820 |
AKINC ET AL., MOL THER, vol. 17, 2009, pages 872 - 879 |
AKINC ET AL., NAT. BIOTECHNOL., vol. 26, 2008, pages 561 - 569 |
ALLEN ET AL., BIOCHIMICA ET BIOPHYSICA ACTA, vol. 1237, 1995, pages 99 - 108 |
ANZALONE ANDREW V. ET AL: "Search-and-replace genome editing without double-strand breaks or donor DNA - SUPPLEMENTARY INFORMATION", NATURE, vol. 576, no. 7785, 21 October 2019 (2019-10-21), London, pages 149 - 157, XP055899878, ISSN: 0028-0836, DOI: 10.1038/s41586-019-1711-4 * |
ANZALONE ANDREW V. ET AL: "Search-and-replace genome editing without double-strand breaks or donor DNA", NATURE, vol. 576, no. 7785, 21 October 2019 (2019-10-21), London, pages 149 - 157, XP055980447, ISSN: 0028-0836, DOI: 10.1038/s41586-019-1711-4 * |
ANZALONE ET AL., BIORXIV, 2 November 2021 (2021-11-02) |
ANZALONE ET AL., NATURE, vol. 576, 2019, pages 149 |
BEHR ET AL., BIOCONJUGATE CHEM, vol. 5, 1994, pages 382 - 389 |
BLAESE ET AL., CANCER GENE THER, vol. 2, 1995, pages 291 - 297 |
BLUME ET AL., BIOCHIMICA ETBIOPHYSICA ACTA, vol. 1149, pages 180 - 184,1993 |
BUCHSCHER ET AL., J. VIROL., vol. 66, 1992, pages 1635 - 1640 |
CHEN ET AL., CELL, vol. 184, 28 October 2021 (2021-10-28), pages 1 - 18 |
CRYSTAL, SCIENCE, vol. 270, 1995, pages 404 - 410 |
DALKARA ET AL., SCI TRANSL MED, vol. 5, 2013 |
DEFREES ET AL., JOURNAL OF THE AMERICAN CHEMISTRY SOCIETY, vol. 118, 1996, pages 6101 - 6104 |
GAO ET AL., GENE THERAPY, vol. 2, 1995, pages 710 - 722 |
GASIUNAS ET AL.: "A catalogue of biochemically diverse CRISPR-Cas9 orthologs", NATURE COMMUNICATIONS, vol. 11, pages 5512 |
HEATH: "Methods in Enzymology", vol. 149, 1987, ACADEMIC PRESS, INC, article "Covalent Attachment of Proteins to Liposomes", pages: 111 - 119 |
HERMONATMUZYCZKA, PNAS, vol. 81, 1984, pages 6466 - 6470 |
IOANNIDI ELEONORA I ET AL: "Drag-and-drop genome insertion without DNA cleavage with CRISPR-directed integrases - Supplementary Information", BIORXIV, 1 November 2021 (2021-11-01), pages 1 - 37, XP093079476, Retrieved from the Internet <URL:https://www.biorxiv.org/content/biorxiv/early/2021/11/01/2021.11.01.466786/DC1/embed/media-1.pdf?download=true> [retrieved on 20230906] * |
IOANNIDI ELEONORA I. ET AL: "Drag-and-drop genome insertion without DNA cleavage with CRISPR-directed integrases", BIORXIV, 1 November 2021 (2021-11-01), pages 1 - 61, XP093015571, Retrieved from the Internet <URL:https://www.biorxiv.org/content/10.1101/2021.11.01.466786v1.full.pdf> [retrieved on 20230119], DOI: 10.1101/2021.11.01.466786 * |
IONNIDI ET AL.: "Drag-and drop genome insertion without DNA cleavage with CRISPR directed integrases", BIORXIV 2021.11.01.466786, 2021 |
JIANG ET AL., NAT. BIOTECHNOLOGY, 14 October 2021 (2021-10-14) |
KARVELIS ET AL.: "PAM recognition by miniature CRISPR-Casl2f nucleases triggers programmable double-stranded DNA target cleavage", NUCLEIC ACIDS RESEARCH, vol. 48, no. 9, 21 May 2020 (2020-05-21), pages 5016 - 23, XP055920188, DOI: 10.1093/nar/gkaa208 |
KIRPOTIN ET AL., FEBS LETTERS, vol. 388, 1996, pages 115 - 118 |
KLIBANOV ET AL., JOURNAL OF LIPOSOME RESEARCH, vol. 2, 1992, pages 321 - 334 |
KOTIN, HUMAN GENE THERAPY, vol. 5, 1994, pages 793 - 801 |
KOWALSKI ET AL.: "Delivering the Messenger: Advances in Technologies for Therapeutic mRNA Delivery", MOL THERAP., vol. 27, no. 4, 2019, pages 710 - 728, XP055634628, DOI: 10.1016/j.ymthe.2019.02.012 |
LEONETTI ET AL., PROC. NATL. ACAD. SCI. (USA, vol. 87, 1990, pages 2448 - 2451 |
LIUHUANG, MOLECULAR THERAPY., 2010, pages 669 - 670 |
LOVE ET AL., PROC NATL ACAD SCI USA., vol. 107, 2010, pages 1864 - 1869 |
MAHON ET AL., BIOCONJUG CHEM, vol. 21, 2010, pages 1448 - 1454 |
MAKAROVA ET AL.: "C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector", SCIENCE, vol. 353, no. 6299, 2016, XP055407082, DOI: 10.1126/science.aaf5573 |
MAKAROVA ET AL.: "Classification and Nomenclature of CRISPR-Cas Systems: Where from Here?", THE CRISPR JOURNAL, vol. 1, no. 5, 2018, XP055619311, DOI: 10.1089/crispr.2018.0033 |
MILLER ET AL., J. VIROL., vol. 65, 1991, pages 2220 - 2224 |
MILLINGTON-WARD ET AL., MOLECULAR THERAPY, vol. 19, no. 4, April 2011 (2011-04-01), pages 642 - 649 |
MURUGAIAH ET AL., ANALYTICAL BIOCHEMISTRY, vol. 401, 2010, pages 61 |
MUZYCZKA, J. CLIN. INVEST, vol. 94, 1994, pages 1351 |
NELSON JAMES W ET AL: "Engineered pegRNAs improve prime editing efficiency", NATURE BIOTECHNOLOGY, NATURE PUBLISHING GROUP US, NEW YORK, vol. 40, no. 3, 4 October 2021 (2021-10-04), pages 402 - 410, XP037720612, ISSN: 1087-0156, [retrieved on 20211004], DOI: 10.1038/S41587-021-01039-7 * |
NISHIMASU ET AL.: "Crystal Structure of Cas9 in Complex with Guide RNA and Target DNA", CELL, vol. 156, 27 February 2014 (2014-02-27), pages 935 - 949, XP028667665, DOI: 10.1016/j.cell.2014.02.001 |
OH, Y. ET AL.: "Expansion of the prime editing modality with Cas9 from Francisella novicida", BIORXIV 2021.05.25.445577 |
RAMAKRISHNAN ET AL.: "Nuclear export signal (NES) of transposases affects the transposition activity of mariner-like elements Ppmarl and Ppmar2 of moso bamboo", MOB DNA, vol. 10, 19 August 2019 (2019-08-19), pages 35 |
REMY ET AL., BIOCONJUGATE CHEM., vol. 5, 1994, pages 647 - 654 |
RENNEISEN ET AL., J. BIO. CHEM., vol. 265, 1990, pages 16337 - 16342 |
ROSIN ET AL., MOLECULAR THERAPY, vol. 19, no. 12, December 2011 (2011-12-01), pages 1286 - 2200 |
SAMULSKI ET AL., J. VIROL., vol. 63, 1989, pages 03822 - 3828 |
SAPRA ET AL., PROG. LIPID RES, vol. 42, no. 5, 2003, pages 439 - 62 |
SCHMIDT ET AL.: "Improved CRISPR genome editing using small highly active and specific engineered RNA-guided nucleases", NAT COMMUN, vol. 12, 2021, pages 4219 |
SCHROEDER ET AL., J INTERN MED, vol. 267, 2010, pages 9 - 21 |
SHAH ET AL.: "Protospacer recognition motifs: mixed identities and functional diversity", RNA BIOLOGY, vol. 10, no. 5, pages 891 - 899 |
SIEGWART ET AL., PROC NATL ACAD SCI USA., vol. 108, 2011, pages 12996 - 3001 |
SOMMNERFELT ET AL., VIROL., vol. 176, 1990, pages 58 - 59 |
SWARTS ET AL.: "Structural Basis for Guide RNA Processing and Seed-Dependent DNA Targeting by CRISPR-Cas12a", MOLECULAR CELL, vol. 66, 20 April 2017 (2017-04-20), pages 221 - 233, XP055569665, DOI: 10.1016/j.molcel.2017.03.016 |
TRATSCHIN ET AL., MOL. CELL. BIOL., vol. 4, 1984, pages 2072 - 2081 |
TRATSCHIN ET AL., MOL. CELL. BIOL., vol. 5, 1985, pages 3251 - 3260 |
WEST ET AL., VIROLOGY, vol. 160, 1987, pages 38 - 47 |
XU ET AL.: "Comparison and optimization of ten phage encoded serine integrases for genome engineering in Saccharomyces cerevisiae", BMC BIOTECHNOL, vol. 16, 9 February 2016 (2016-02-09), pages 13 |
XU, X. ET AL.: "Engineered Miniature CRISPR-Cas System for Mammalian Genome Regulation and Editing", MOLECULAR CELL, vol. 81, no. 20, 21 October 2021 (2021-10-21), pages 4333 - 45 |
YAMANO ET AL.: "Crystal structure of Cpfl in complex with guide RNA and target DNA", CELL, vol. 165, 2016, pages 949 - 962 |
YAMANO ET AL.: "Crystal structure of Cpfl in complex with guide RNA and target DNA", CELL, vol. 165, 5 May 2016 (2016-05-05), pages 949 - 962 |
ZALIPSKY, BIOCONJUGATE CHEMISTRY, vol. 4, 1993, pages 296 - 299 |
ZALIPSKY, FEBS LETTERS, vol. 353, 1994, pages 71 - 74 |
ZALIPSKY: "Stealth Liposomes", 1995, CRC PRESS |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20240093193A1 (en) | Dead guides for crispr transcription factors | |
US20220290186A1 (en) | Gene editing using a modified closed-ended dna (cedna) | |
US20170349894A1 (en) | Escorted and functionalized guides for crispr-cas systems | |
CA2915842A1 (en) | Delivery and use of the crispr-cas systems, vectors and compositions for hepatic targeting and therapy | |
WO2023077148A1 (en) | Single construct platform for simultaneous delivery of gene editing machinery and nucleic acid cargo | |
WO2023122764A1 (en) | Co-delivery of a gene editor construct and a donor template | |
EP3704238B1 (en) | Engineered nucleases that target human and canine factor viii genes as a treatment for hemophilia a | |
US20220047723A1 (en) | OPTIMIZED mRNA ENCODING CAS9 FOR USE IN LNPs | |
US20230131847A1 (en) | Recombinase compositions and methods of use | |
US20240110201A1 (en) | Compositions and Methods for Treating Hereditary Angioedema | |
WO2023039440A9 (en) | Hbb-modulating compositions and methods | |
WO2023215831A1 (en) | Guide rna compositions for programmable gene insertion | |
US20210355463A1 (en) | Messenger rna encoding cas9 for use in genome-editing systems | |
WO2023205744A1 (en) | Programmable gene insertion compositions | |
WO2023039447A2 (en) | Serpina-modulating compositions and methods | |
CA3221566A1 (en) | Integrase compositions and methods | |
WO2023225670A2 (en) | Ex vivo programmable gene insertion | |
JP2024504611A (en) | Compositions and methods for treating Fabry disease | |
WO2023225471A2 (en) | Helitron compositions and methods | |
WO2022241040A1 (en) | Gene editing for expression of functional factor viii for the treatment of hemophilia | |
WO2024040222A1 (en) | Cleavable closed-ended dna (cedna) and methods of use thereof | |
WO2024081888A1 (en) | Gene editing for controlled expression of episomal genes | |
WO2023108153A2 (en) | Cftr-modulating compositions and methods | |
WO2023039441A1 (en) | Recruitment in trans of gene editing system components | |
WO2023039424A2 (en) | Methods and compositions for modulating a genome |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23726831 Country of ref document: EP Kind code of ref document: A1 |