WO2022238958A1 - Multiplex gene editing - Google Patents
Multiplex gene editing Download PDFInfo
- Publication number
- WO2022238958A1 WO2022238958A1 PCT/IB2022/054436 IB2022054436W WO2022238958A1 WO 2022238958 A1 WO2022238958 A1 WO 2022238958A1 IB 2022054436 W IB2022054436 W IB 2022054436W WO 2022238958 A1 WO2022238958 A1 WO 2022238958A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- crispr
- cas nuclease
- cell
- cells
- cas
- Prior art date
Links
- 238000010362 genome editing Methods 0.000 title claims abstract description 58
- 238000000034 method Methods 0.000 claims abstract description 71
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 claims abstract description 59
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 claims abstract description 59
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 55
- 239000002773 nucleotide Substances 0.000 claims abstract description 50
- 208000034951 Genetic Translocation Diseases 0.000 claims abstract description 17
- 230000002829 reductive effect Effects 0.000 claims abstract description 16
- 210000004027 cell Anatomy 0.000 claims description 151
- 101710163270 Nuclease Proteins 0.000 claims description 122
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 claims description 107
- 108020005004 Guide RNA Proteins 0.000 claims description 75
- 108020004999 messenger RNA Proteins 0.000 claims description 63
- 150000007523 nucleic acids Chemical class 0.000 claims description 34
- 102000039446 nucleic acids Human genes 0.000 claims description 32
- 108020004707 nucleic acids Proteins 0.000 claims description 32
- 108090000623 proteins and genes Proteins 0.000 claims description 29
- 108010029485 Protein Isoforms Proteins 0.000 claims description 26
- 102000001708 Protein Isoforms Human genes 0.000 claims description 26
- 238000003780 insertion Methods 0.000 claims description 25
- 230000037431 insertion Effects 0.000 claims description 25
- 102000004389 Ribonucleoproteins Human genes 0.000 claims description 24
- 108010081734 Ribonucleoproteins Proteins 0.000 claims description 24
- 230000001965 increasing effect Effects 0.000 claims description 22
- 102000004169 proteins and genes Human genes 0.000 claims description 20
- 238000012217 deletion Methods 0.000 claims description 10
- 230000037430 deletion Effects 0.000 claims description 10
- 210000004986 primary T-cell Anatomy 0.000 claims description 10
- 238000004520 electroporation Methods 0.000 claims description 7
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 6
- 239000003153 chemical reaction reagent Substances 0.000 claims description 6
- 238000000338 in vitro Methods 0.000 claims description 6
- 210000000822 natural killer cell Anatomy 0.000 claims description 6
- 230000007423 decrease Effects 0.000 claims description 4
- 210000004698 lymphocyte Anatomy 0.000 claims description 4
- 206010025323 Lymphomas Diseases 0.000 claims description 3
- -1 SluCas9 Proteins 0.000 claims description 3
- 238000001727 in vivo Methods 0.000 claims description 3
- 238000011065 in-situ storage Methods 0.000 claims description 3
- 208000032839 leukemia Diseases 0.000 claims description 3
- 210000004962 mammalian cell Anatomy 0.000 claims description 3
- 101710172824 CRISPR-associated endonuclease Cas9 Proteins 0.000 claims 1
- 101000910035 Streptococcus pyogenes serotype M1 CRISPR-associated endonuclease Cas9/Csn1 Proteins 0.000 claims 1
- 239000000203 mixture Substances 0.000 abstract description 8
- 108020004414 DNA Proteins 0.000 description 63
- 210000001744 T-lymphocyte Anatomy 0.000 description 63
- 108091033409 CRISPR Proteins 0.000 description 42
- 102100025221 CD70 antigen Human genes 0.000 description 25
- 101000934356 Homo sapiens CD70 antigen Proteins 0.000 description 25
- 125000006850 spacer group Chemical group 0.000 description 20
- 230000005945 translocation Effects 0.000 description 20
- 239000013598 vector Substances 0.000 description 18
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 16
- 230000014509 gene expression Effects 0.000 description 15
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 12
- 108091028113 Trans-activating crRNA Proteins 0.000 description 12
- 230000001105 regulatory effect Effects 0.000 description 11
- 102000053602 DNA Human genes 0.000 description 10
- 230000004568 DNA-binding Effects 0.000 description 10
- 238000010459 TALEN Methods 0.000 description 10
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 10
- 238000001228 spectrum Methods 0.000 description 10
- 108010042407 Endonucleases Proteins 0.000 description 9
- 241000191967 Staphylococcus aureus Species 0.000 description 8
- 102000004243 Tubulin Human genes 0.000 description 8
- 108090000704 Tubulin Proteins 0.000 description 8
- 150000001413 amino acids Chemical class 0.000 description 8
- 230000000295 complement effect Effects 0.000 description 8
- 238000011304 droplet digital PCR Methods 0.000 description 8
- 241000193996 Streptococcus pyogenes Species 0.000 description 7
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 7
- 230000035772 mutation Effects 0.000 description 7
- 239000000523 sample Substances 0.000 description 7
- 210000000130 stem cell Anatomy 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 6
- 230000005782 double-strand break Effects 0.000 description 6
- 230000004927 fusion Effects 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 108091093088 Amplicon Proteins 0.000 description 5
- 102100031780 Endonuclease Human genes 0.000 description 5
- 210000001185 bone marrow Anatomy 0.000 description 5
- 238000003776 cleavage reaction Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 239000013604 expression vector Substances 0.000 description 5
- 210000002901 mesenchymal stem cell Anatomy 0.000 description 5
- 230000007017 scission Effects 0.000 description 5
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 4
- 102000004533 Endonucleases Human genes 0.000 description 4
- 108700024394 Exon Proteins 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- 241001134656 Staphylococcus lugdunensis Species 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 230000003247 decreasing effect Effects 0.000 description 4
- 101150007302 dntt gene Proteins 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 239000000833 heterodimer Substances 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 230000008685 targeting Effects 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 241000701022 Cytomegalovirus Species 0.000 description 3
- 102100033215 DNA nucleotidylexotransferase Human genes 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 3
- 241000700584 Simplexvirus Species 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 210000001789 adipocyte Anatomy 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 230000027455 binding Effects 0.000 description 3
- 210000000601 blood cell Anatomy 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 210000003743 erythrocyte Anatomy 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 229910052739 hydrogen Inorganic materials 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 238000005304 joining Methods 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 230000009437 off-target effect Effects 0.000 description 3
- 210000005259 peripheral blood Anatomy 0.000 description 3
- 239000011886 peripheral blood Substances 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 230000014616 translation Effects 0.000 description 3
- 241001430294 unidentified retrovirus Species 0.000 description 3
- 239000013603 viral vector Substances 0.000 description 3
- 241000203069 Archaea Species 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 108010040467 CRISPR-Associated Proteins Proteins 0.000 description 2
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 230000007018 DNA scission Effects 0.000 description 2
- 101000834253 Gallus gallus Actin, cytoplasmic 1 Proteins 0.000 description 2
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 2
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- 102100031573 Hematopoietic progenitor cell antigen CD34 Human genes 0.000 description 2
- 101000777663 Homo sapiens Hematopoietic progenitor cell antigen CD34 Proteins 0.000 description 2
- 241000725303 Human immunodeficiency virus Species 0.000 description 2
- 108091093037 Peptide nucleic acid Proteins 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 102100037935 Polyubiquitin-C Human genes 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- 241000187191 Streptomyces viridochromogenes Species 0.000 description 2
- 241000203587 Streptosporangium roseum Species 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 210000004504 adult stem cell Anatomy 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 210000003651 basophil Anatomy 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 230000010261 cell growth Effects 0.000 description 2
- 210000001612 chondrocyte Anatomy 0.000 description 2
- 230000008711 chromosomal rearrangement Effects 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 210000004443 dendritic cell Anatomy 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 238000006471 dimerization reaction Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 210000003979 eosinophil Anatomy 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 239000000710 homodimer Substances 0.000 description 2
- 239000001257 hydrogen Substances 0.000 description 2
- 210000002540 macrophage Anatomy 0.000 description 2
- 210000001616 monocyte Anatomy 0.000 description 2
- 210000000440 neutrophil Anatomy 0.000 description 2
- 230000006780 non-homologous end joining Effects 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 239000003531 protein hydrolysate Substances 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 150000003212 purines Chemical class 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000008439 repair process Effects 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 238000007480 sanger sequencing Methods 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 230000003007 single stranded DNA break Effects 0.000 description 2
- 210000002536 stromal cell Anatomy 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 229910052725 zinc Inorganic materials 0.000 description 2
- 239000011701 zinc Substances 0.000 description 2
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- 108091071338 17 family Proteins 0.000 description 1
- 101150101112 7 gene Proteins 0.000 description 1
- 101150023956 ALK gene Proteins 0.000 description 1
- 102100033793 ALK tyrosine kinase receptor Human genes 0.000 description 1
- 241000007910 Acaryochloris marina Species 0.000 description 1
- 241001135192 Acetohalobium arabaticum Species 0.000 description 1
- 241001464929 Acidithiobacillus caldus Species 0.000 description 1
- 241000605222 Acidithiobacillus ferrooxidans Species 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- HJCMDXDYPOUFDY-WHFBIAKZSA-N Ala-Gln Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O HJCMDXDYPOUFDY-WHFBIAKZSA-N 0.000 description 1
- 241000640374 Alicyclobacillus acidocaldarius Species 0.000 description 1
- 241000190857 Allochromatium vinosum Species 0.000 description 1
- 241000147155 Ammonifex degensii Species 0.000 description 1
- 241000620196 Arthrospira maxima Species 0.000 description 1
- 240000002900 Arthrospira platensis Species 0.000 description 1
- 235000016425 Arthrospira platensis Nutrition 0.000 description 1
- 241001495183 Arthrospira sp. Species 0.000 description 1
- RJUHZPRQRQLCFL-IMJSIDKUSA-N Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O RJUHZPRQRQLCFL-IMJSIDKUSA-N 0.000 description 1
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 1
- 241000713826 Avian leukosis virus Species 0.000 description 1
- 238000009020 BCA Protein Assay Kit Methods 0.000 description 1
- 241000906059 Bacillus pseudomycoides Species 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 241000823281 Burkholderiales bacterium Species 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 238000010453 CRISPR/Cas method Methods 0.000 description 1
- 241000589875 Campylobacter jejuni Species 0.000 description 1
- 241000589986 Campylobacter lari Species 0.000 description 1
- 241001496650 Candidatus Desulforudis Species 0.000 description 1
- 108700004991 Cas12a Proteins 0.000 description 1
- 241000193163 Clostridioides difficile Species 0.000 description 1
- 241000193155 Clostridium botulinum Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 241000907165 Coleofasciculus chthonoplastes Species 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241000065716 Crocosphaera watsonii Species 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- 241000192700 Cyanobacteria Species 0.000 description 1
- 241000159506 Cyanothece Species 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 230000033616 DNA repair Effects 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- AHCYMLUZIRLXAA-SHYZEUOFSA-N Deoxyuridine 5'-triphosphate Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C=C1 AHCYMLUZIRLXAA-SHYZEUOFSA-N 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241000991587 Enterovirus C Species 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 241000326311 Exiguobacterium sibiricum Species 0.000 description 1
- 241000605896 Fibrobacter succinogenes Species 0.000 description 1
- 241000192016 Finegoldia magna Species 0.000 description 1
- 241000589599 Francisella tularensis subsp. novicida Species 0.000 description 1
- 241000968725 Gammaproteobacteria bacterium Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 101710088172 HTH-type transcriptional regulator RipA Proteins 0.000 description 1
- MDCTVRUPVLZSPG-BQBZGAKWSA-N His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 MDCTVRUPVLZSPG-BQBZGAKWSA-N 0.000 description 1
- 101000800646 Homo sapiens DNA nucleotidylexotransferase Proteins 0.000 description 1
- 101001109719 Homo sapiens Nucleophosmin Proteins 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108010002350 Interleukin-2 Proteins 0.000 description 1
- 108010002586 Interleukin-7 Proteins 0.000 description 1
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 241001430082 Ktedonobacter Species 0.000 description 1
- 241000186679 Lactobacillus buchneri Species 0.000 description 1
- 241000186673 Lactobacillus delbrueckii Species 0.000 description 1
- 241000186606 Lactobacillus gasseri Species 0.000 description 1
- 241000186869 Lactobacillus salivarius Species 0.000 description 1
- 101710128836 Large T antigen Proteins 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- 241000186805 Listeria innocua Species 0.000 description 1
- 241001134698 Lyngbya Species 0.000 description 1
- 241000501784 Marinobacter sp. Species 0.000 description 1
- 108010090054 Membrane Glycoproteins Proteins 0.000 description 1
- 102000012750 Membrane Glycoproteins Human genes 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241000204637 Methanohalobium evestigatum Species 0.000 description 1
- 241000192710 Microcystis aeruginosa Species 0.000 description 1
- 241000190928 Microscilla marina Species 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 241000714177 Murine leukemia virus Species 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 241000713883 Myeloproliferative sarcoma virus Species 0.000 description 1
- 108091061960 Naked DNA Proteins 0.000 description 1
- 241000167285 Natranaerobius thermophilus Species 0.000 description 1
- 241000588654 Neisseria cinerea Species 0.000 description 1
- 241000588650 Neisseria meningitidis Species 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 241000919925 Nitrosococcus halophilus Species 0.000 description 1
- 241001515112 Nitrosococcus watsonii Species 0.000 description 1
- 241000203619 Nocardiopsis dassonvillei Species 0.000 description 1
- 241001223105 Nodularia spumigena Species 0.000 description 1
- 241000192673 Nostoc sp. Species 0.000 description 1
- 102100022678 Nucleophosmin Human genes 0.000 description 1
- 102000002488 Nucleoplasmin Human genes 0.000 description 1
- 241000192520 Oscillatoria sp. Species 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 241001386755 Parvibaculum lavamentivorans Species 0.000 description 1
- 241000606856 Pasteurella multocida Species 0.000 description 1
- 241000142651 Pelotomaculum thermopropionicum Species 0.000 description 1
- 108010077524 Peptide Elongation Factor 1 Proteins 0.000 description 1
- 241000983938 Petrotoga mobilis Species 0.000 description 1
- 102100028251 Phosphoglycerate kinase 1 Human genes 0.000 description 1
- 101710139464 Phosphoglycerate kinase 1 Proteins 0.000 description 1
- 241001599925 Polaromonas naphthalenivorans Species 0.000 description 1
- 241001472610 Polaromonas sp. Species 0.000 description 1
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 1
- 241000590028 Pseudoalteromonas haloplanktis Species 0.000 description 1
- 229930185560 Pseudouridine Natural products 0.000 description 1
- PTJWIQPHWPFNBW-UHFFFAOYSA-N Pseudouridine C Natural products OC1C(O)C(CO)OC1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-UHFFFAOYSA-N 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 102000001183 RAG-1 Human genes 0.000 description 1
- 108060006897 RAG1 Proteins 0.000 description 1
- 102000014450 RNA Polymerase III Human genes 0.000 description 1
- 108010078067 RNA Polymerase III Proteins 0.000 description 1
- 230000026279 RNA modification Effects 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 241000190984 Rhodospirillum rubrum Species 0.000 description 1
- 239000006146 Roswell Park Memorial Institute medium Substances 0.000 description 1
- 241000714474 Rous sarcoma virus Species 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 108091061750 Signal recognition particle RNA Proteins 0.000 description 1
- 241000713896 Spleen necrosis virus Species 0.000 description 1
- 241001501869 Streptococcus pasteurianus Species 0.000 description 1
- 241000194022 Streptococcus sp. Species 0.000 description 1
- 241000194020 Streptococcus thermophilus Species 0.000 description 1
- 241001518258 Streptomyces pristinaespiralis Species 0.000 description 1
- 241000123713 Sutterella wadsworthensis Species 0.000 description 1
- 241001661355 Synapsis Species 0.000 description 1
- 241000192560 Synechococcus sp. Species 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 241000206213 Thermosipho africanus Species 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 108010073062 Transcription Activator-Like Effectors Proteins 0.000 description 1
- 102100037116 Transcription elongation factor 1 homolog Human genes 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- 241000589892 Treponema denticola Species 0.000 description 1
- 241000078013 Trichormus variabilis Species 0.000 description 1
- 108010056354 Ubiquitin C Proteins 0.000 description 1
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical group O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- 241000605939 Wolinella succinogenes Species 0.000 description 1
- 241001148118 Xanthomonas sp. Species 0.000 description 1
- 101710185494 Zinc finger protein Proteins 0.000 description 1
- 102100023597 Zinc finger protein 816 Human genes 0.000 description 1
- 241001673106 [Bacillus] selenitireducens Species 0.000 description 1
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 210000000577 adipose tissue Anatomy 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 210000003663 amniotic stem cell Anatomy 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 229940011019 arthrospira platensis Drugs 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 210000001130 astrocyte Anatomy 0.000 description 1
- 125000004429 atom Chemical group 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- WGDUUQDYDIIBKT-UHFFFAOYSA-N beta-Pseudouridine Natural products OC1OC(CN2C=CC(=O)NC2=O)C(O)C1O WGDUUQDYDIIBKT-UHFFFAOYSA-N 0.000 description 1
- 210000001772 blood platelet Anatomy 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 125000002057 carboxymethyl group Chemical group [H]OC(=O)C([H])([H])[*] 0.000 description 1
- 210000004413 cardiac myocyte Anatomy 0.000 description 1
- 108091092356 cellular DNA Proteins 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000001728 clone cell Anatomy 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 210000002808 connective tissue Anatomy 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000008260 defense mechanism Effects 0.000 description 1
- 238000012938 design process Methods 0.000 description 1
- 239000005546 dideoxynucleotide Substances 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- CZWHMRTTWFJMBC-UHFFFAOYSA-N dinaphtho[2,3-b:2',3'-f]thieno[3,2-b]thiophene Chemical compound C1=CC=C2C=C(SC=3C4=CC5=CC=CC=C5C=C4SC=33)C3=CC2=C1 CZWHMRTTWFJMBC-UHFFFAOYSA-N 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 210000001671 embryonic stem cell Anatomy 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 230000000925 erythroid effect Effects 0.000 description 1
- 102000015694 estrogen receptors Human genes 0.000 description 1
- 108010038795 estrogen receptors Proteins 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000011536 extraction buffer Substances 0.000 description 1
- 210000004700 fetal blood Anatomy 0.000 description 1
- 239000012091 fetal bovine serum Substances 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 210000000604 fetal stem cell Anatomy 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 230000003394 haemopoietic effect Effects 0.000 description 1
- 210000002768 hair cell Anatomy 0.000 description 1
- 210000002443 helper t lymphocyte Anatomy 0.000 description 1
- 230000011132 hemopoiesis Effects 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 210000004263 induced pluripotent stem cell Anatomy 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 239000012212 insulator Substances 0.000 description 1
- 210000002510 keratinocyte Anatomy 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000001785 maturational effect Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 210000003593 megakaryocyte Anatomy 0.000 description 1
- 210000002752 melanocyte Anatomy 0.000 description 1
- 210000003716 mesoderm Anatomy 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 230000002025 microglial effect Effects 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 125000004573 morpholin-4-yl group Chemical group N1(CCOCC1)* 0.000 description 1
- 210000002894 multi-fate stem cell Anatomy 0.000 description 1
- 210000001665 muscle stem cell Anatomy 0.000 description 1
- 231100000219 mutagenic Toxicity 0.000 description 1
- 230000003505 mutagenic effect Effects 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 230000003039 myelosuppressive effect Effects 0.000 description 1
- 210000000107 myocyte Anatomy 0.000 description 1
- 230000032965 negative regulation of cell volume Effects 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 125000004433 nitrogen atom Chemical group N* 0.000 description 1
- 101150083701 npm1 gene Proteins 0.000 description 1
- 108060005597 nucleoplasmin Proteins 0.000 description 1
- 230000005257 nucleotidylation Effects 0.000 description 1
- 230000009438 off-target cleavage Effects 0.000 description 1
- 210000000963 osteoblast Anatomy 0.000 description 1
- 210000004409 osteocyte Anatomy 0.000 description 1
- 210000003101 oviduct Anatomy 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 229940051027 pasteurella multocida Drugs 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 230000000379 polymerizing effect Effects 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- PTJWIQPHWPFNBW-GBNDHIKLSA-N pseudouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-GBNDHIKLSA-N 0.000 description 1
- 238000010814 radioimmunoprecipitation assay Methods 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 125000000548 ribosyl group Chemical group C1([C@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000037390 scarring Effects 0.000 description 1
- 239000004017 serum-free culture medium Substances 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 210000002363 skeletal muscle cell Anatomy 0.000 description 1
- 210000000329 smooth muscle myocyte Anatomy 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Natural products CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000009469 supplementation Effects 0.000 description 1
- 210000002437 synoviocyte Anatomy 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical group [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 210000003954 umbilical cord Anatomy 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 239000000277 virosome Substances 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/111—General methods applicable to biologically active non-coding nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
- C12N9/1252—DNA-directed DNA polymerase (2.7.7.7), i.e. DNA replicase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
- C12N9/1258—Polyribonucleotide nucleotidyltransferase (2.7.7.8), i.e. polynucleotide phosphorylase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y207/00—Transferases transferring phosphorus-containing groups (2.7)
- C12Y207/07—Nucleotidyltransferases (2.7.7)
- C12Y207/07007—DNA-directed DNA polymerase (2.7.7.7), i.e. DNA replicase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y207/00—Transferases transferring phosphorus-containing groups (2.7)
- C12Y207/07—Nucleotidyltransferases (2.7.7)
- C12Y207/07031—DNA nucleotidylexotransferase (2.7.7.31), i.e. terminal deoxynucleotidyl transferase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
Definitions
- PCT1_Sequence_Listing_ST25.txt is about 18,000 bytes in size.
- the present disclosure relates to the field of CRISPR-based multiplex gene editing.
- the methods herein comprise delivering to a cell: (a) two or more CRISPR-Cas nuclease systems or nucleic acids encoding two or more CRISPR-Cas nuclease systems, wherein each CRISPR-Cas nuclease system is targeted to a different genomic locus; and (b) an X-family DNA polymerase or a nucleic acid 2 encoding an X family DNA polymerase; wherein the two or more CRISPR-Cas nuclease systems generate cleaved genomic loci and the X-family DNA polymerase catalyzes addition of nucleotides to 3’ termini of the cleaved genomic loci, such that chromosomal translocations are reduced between the cleaved genomic loci, thereby facilitating editing at multiple genomic loci.
- nucleotides may increase nucleotide insertion frequencies, increase insertion lengths, and/or decrease nucleotide deletion frequencies as compared to a comparable cell in which the X-family DNA polymerase is not delivered.
- chromosomal translocations may be reduced by at least 5% when two genomic loci are edited as compared to a comparable cell in which the X-family DNA polymerase is not delivered.
- each CRISPR-Cas nuclease system may comprise a CRISPR-Cas nuclease and a gRNA.
- the CRISPR- Cas nuclease may be a type II CRISPR-Cas nuclease ora type V CRISPR-Cas nuclease.
- the CRISPR-Cas nuclease may be SpCas9, SluCas9, SaCas9, ora variant thereof with increased fidelity.
- the gRNA may be a single molecule gRNA (sgRNA).
- each of the CRISPR-Cas nuclease systems may be delivered as a ribonucleoprotein (RNP) complex.
- each of the CRISPR-Cas nuclease systems may be delivered as a CRISPR-Cas nuclease and a gRNA.
- each of the CRISPR-Cas nuclease systems may be delivered as mRNA encoding the CRISPR-Cas nuclease and a gRNA.
- each of the CRISPR-Cas nuclease systems may be delivered as DNA encoding the CRISPR-Cas nuclease and DNA encoding the gRNA.
- the X-family DNA polymerase may be a terminal deoxyribonucleotidyl transferase (TdT), polymerase lambda (Pol l), or polymerase mu (Pol m).
- TdT terminal deoxyribonucleotidyl transferase
- Poly l polymerase lambda
- Poly m polymerase mu
- the X-family DNA polymerase is a short isoform of TdT (TdtS) (e.g., having an amino acid sequence of SEQ ID NO: 34).
- the X-family DNA polymerase may be delivered as mRNA, protein, or DNA.
- two CRISPR-Cas nuclease systems are delivered to the cell.
- three CRISPR-Cas nuclease systems are delivered to the cell.
- four CRISPR-Cas nuclease systems are delivered to the cell.
- five CRISPR-Cas nuclease systems may be delivered to the cell.
- delivering may comprise electroporation.
- the cell may be a mammalian cell.
- the cell is other than an immature, pre-B or pre-T lymphoid cell, a leukemia cell, a lymphoma cell, or an immortalized cell derived therefrom.
- the cell may be a primary T cell or an NK cell.
- the cell may be in vitro, ex vivo, in situ, or in vivo.
- FIG. 1 Further aspects of the present disclosure are directed to a cell comprising (a) two or more CRISPR-Cas nuclease systems and (b) a short isoform of terminal deoxyribonucleotidyl transferase (TdTS), wherein the CRISPR-Cas nuclease systems and the TdTS are exogenous to the cell.
- TdTS terminal deoxyribonucleotidyl transferase
- each CRISPR-Cas nuclease systems of (a) may be a ribonucleoprotein (RNP) complex.
- each CRISPR-Cas nuclease system of (a) may comprise CRISPR-Cas mRNA and gRNA.
- (b) may comprise mRNA encoding TdTs or may comprise TdTS protein.
- each CRISPR-Cas nuclease system of (a) comprises DNA encoding CRISPR-Cas nuclease and DNA encoding gRNA and (b) comprises DNA encoding TdTS.
- kits for multiplex gene editing comprising (a) two or more CRISPR-Cas nuclease systems, and (b) a short isoform of terminal deoxyribonucleotidyl transferase (TdTS).
- TdTS terminal deoxyribonucleotidyl transferase
- each CRISPR-Cas nuclease systems of (a) in the kits provided herein may be a RNP complex. 4
- each CRISPR-Cas nuclease system of (a) may comprise CRISPR-Cas mRNA and gRNA.
- (b) may comprise mRNA encoding TdTS or may comprise TdTS protein.
- the kits provided are as described herein each CRISPR-Cas nuclease system of (a) comprises DNA encoding CRISPR-Cas nuclease and DNA encoding gRNA and (b) comprises DNA encoding TdTS.
- any of the kits provided herein may further comprise at least one reagent for gene editing.
- FIG. 1A presents editing efficiencies with NPM1 or ALK gRNAs in Jurkat or T cells. Plotted is the percent of indels in Jurkat or T cells for each gRNA.
- FIG. 1B presents indel spectra for T cells (upper) and Jurkat cells (lower). Plotted are the frequencies of deletions or insertions, with darker shading indicating higher frequencies.
- FIG. 2A presents editing efficiencies after multiplex editing at PD1 and CD70 loci in Jurkat and T cells.
- FIG. 2B presents indel spectra at PD1 and CD70 loci in T cells (left) and Jurkat cells (right). Plotted are the frequencies of insertions or deletions, with darker shading indicating higher frequencies.
- FIG. 3A diagrams translocation variants of PD1/CD70.
- FIG. 3B presents the percentage of T1-T4 translocation variants of PD1/CD70 in Jurkat and T cells.
- FIG. 4 presents Western images showing the levels of TdT and alpha-tubulin in Jurkat and T cells.
- FIG. 5 presents Western images showing the levels of TdT and alpha-tubulin in Jurkat cells edited to knock out TdT (labeled T1 , T11 , T14), and wild type Jurkat and T cells.
- FIG. 6A presents indel spectra at PD1 and CD70 loci in wild type Jurkat (TdT+), edited Jurkat (TdT-), and T cells. 5
- FIG. 6B presents editing efficiencies at PD1 and CD70 loci in Jurkat (TdT+), Jurkat (TdT-), and T cells.
- FIG. 6C presents the percentage of each translocation variant of PD1/CD70 in Jurkat (TdT+), Jurkat (TdT-), and T cells.
- FIG. 7 shows the kinetics of expression of TdT short isoform (TdTS), TdT long isoforms (TdTL1 , TdTL2, TdTL3), and Cas9 after delivery of the corresponding mRNA to T cells.
- FIG. 8A shows the levels of TdTS after delivery of increasing amounts of TdTS mRNA to T cells and the level in Jurkat (TdT+) cells.
- FIG. 8B presents the relative levels of TdTS to alpha-tubulin in under the conditions described in FIG. 8A.
- FIG. 9A presents indel spectra at PD1 and CD70 loci in T cells provided with TdTS mRNA or control mRNA.
- FIG. 9B presents the percentage of each translocation variant of PD1/CD70 in T cells provided with TdTS mRNA or control mRNA.
- FIG. 9C presents editing efficiencies at PD1 and CD70 loci in T cells provided with TdTS mRNA or control mRNA.
- FIG. 10A presents indel spectra at PD1 and CD70 loci in T cells provided with short or long isoforms of TdT.
- FIG. 10B shows the percentage of each translocation variant of PD1/CD70 in T cells provided with short or long isoforms of TdT.
- FIG. 11 A presents editing efficiencies at PD1 and CD70 loci in T cells provided with TdTS mRNA or control mRNA and Cas9 RNP.
- FIG. 11B presents editing efficiencies at PD1 and CD70 loci in T cells provided with TdTS mRNA and Cas9 mRNA + sgRNA.
- FIG. 12A shows indel spectra at the CD70 locus in T cells provided with TdTS mRNA and Cas9 RNP.
- FIG. 12B shows indel spectra at PD1 locus in T cells provided with TdTS mRNA and Cas9 RNP. 6
- FIG. 12C present indel spectra at the CD70 locus in T cells provided with TdTS mRNA and Cas9 mRNA + sgRNA.
- FIG. 12D present indel spectra at the PD1 locus in T cells provided with TdTS mRNA and Cas9 mRNA + sgRNA.
- FIG. 13A shows the percentage of each translocation variant of PD1/CD70 in T cells provided with TdTS mRNA and Cas9 RNP.
- FIG. 13B shows the percentage of each translocation variant of PD1/CD70 in T cells provided with TdTS mRNA and Cas9 mRNA + sgRNA.
- the present disclosure provides means for multiplex gene editing in which two or more programmable gene editing systems targeted to different genomic loci and an X- family DNA polymerase are delivered to cells.
- the endonucleases of the two or more programmable gene editing systems generate double stranded DNA breaks at the targeted genomic loci and the X-family DNA polymerase adds nucleotides to the 3’ termini of the double strand DNA breaks, leading to insertion-rich indel patterns.
- the insertion- rich indel patterns may reduce chromosomal translocations by a) disrupting the ability of the endonuclease to recognize and recut DNA after mutagenic scarring, b) facilitating preferential end-joining to its intrachromosomal partner through synapsis of the double strand DNA break, and c) shifting end joining pathway selection from microhomology- mediate end joining (MMEJ) to non-homologous end joining (NHEJ).
- MMEJ microhomology- mediate end joining
- NHEJ non-homologous end joining
- the addition of nucleotides by the X-family DNA polymerase at the nuclease-induced double strand DNA breaks reduces the iterative rounds of nuclease recutting by subverting error-free repair and restoration of the DNA’s original configuration. As such, ectopic supplementation the X-family DNA polymerase also serves as an adjuvant for editing efficiency.
- One aspect of the present disclosure encompasses methods for gene editing at multiple genomic loci, the method comprising delivering to a cell (a) two or more programmable gene editing systems or nucleic acid encoding two or more programmable 7 gene editing systems, wherein each programmable gene editing system is targeted to a different genomic locus, and (b) an X-family DNA polymerase or nucleic acid encoding an X family DNA polymerase, wherein the two or more programmable gene editing systems generate cleaved genomic loci and the X-family DNA polymerase catalyzes addition of nucleotides to 3’ termini of the cleaved genomic loci, such that chromosomal translocations are reduced between the cleaved genomic loci, thereby facilitating editing at multiple genomic loci.
- the method comprises delivery of two or more programmable editing systems such that two or more genomic loci can be edited simultaneously.
- Programmable gene editing systems are proteins or ribonucleoprotein complexes that can be programmed or engineered to target precise sequences in the genome and introduce double-stranded breaks at the targeted DNA loci.
- Suitable programmable gene editing systems include RNA-guided CRISPR-Cas systems, zinc finger nucleases, transcription activator-like effector-based nucleases, homing endonucleases, and hybrid nucleases.
- CRISPR-Cas Systems The RNA-guided clustered regularly interspaced short palindromic repeats (CRISPR)-Cas (CRISPR-associated protein) system is a naturally occurring defense mechanism in prokaryotes and archaea that has been repurposed as a RNA-guided DNA-targeting platform used for gene editing.
- a CRISPR-Cas system comprises a noncoding guide RNA (gRNA) to target the DNA and a CRISPR-Cas nuclease that cleaves the DNA.
- gRNA noncoding guide RNA
- the gRNA drives sequence recognition and specificity of the CRISPR-Cas system through Watson-Crick base pairing typically with a ⁇ 20 nucleotide (nt) sequence in the target locus, wherein the target sequence is adjacent to a specific short DNA motif referred to as a protospacer adjacent motif (PAM).
- the gRNA forms an RNA-duplex structure that is bound by the CRISPR-Cas nuclease to form the catalytically active CRISPR-Cas complex, which can then cleave the target locus.
- the CRISPR-Cas complex is bound to DNA at a target site, two independent nuclease domains within the 8
- CRISPR-Cas nuclease cleave opposite strands of the DNA leaving a double-strand break (DSB).
- CRISPR-Cas systems include Types I, II, III, IV, V, and VI systems.
- the CRISPR-Cas system can be a Type II CRISPR-Cas system (e.g., Cas9).
- the CRISPR-Cas system can be a Type V CRISPR-Cas system (e.g., Cas12a, previously termed Cpf1).
- the CRISPR-Cas9 nuclease or the CRISPR-Cas12a nuclease used in the methods described herein can have at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 97%, at least about 99% sequence identity to a wild-type CRISPR-Cas9 nuclease or CRISPR-Cas12a nuclease.
- the Type-ll CRISPR-Cas system component can be from a Type-IIA, Type- IIB, or Type-IIC system.
- Cas9 and its orthologs are encompassed.
- Non-limiting exemplary species that the Cas9 nuclease or other components are from include Streptococcus pyogenes, Streptoccoccus lugdunensis, Streptococcus thermophilus, Streptococcus sp., Staphylococcus aureus, Listeria innocua, Lactobacillus gasseri, Francisella novicida, Wolinella succinogenes, Sutterella wadsworthensis, Gamma proteobacterium, Neisseria meningitidis, Campylobacter jejuni, Pasteurella multocida, Fibrobacter succinogene, Rhodospirillum rubrum, Nocardiopsis rougevillei, Streptomyces pristinaespiralis
- the Cas9 protein can be from Streptococcus pyogenes (SpCas9). In some embodiments, the Cas9 protein can be from S. lugdunensis (SluCas9). In some embodiments, the Cas9 protein can be from Staphylococcus aureus (SaCas9). In some embodiments, a suitable Cas9 protein for use in the present disclosure is any disclosed in W02019/183150 and WO2019/118935, each of which is incorporate herein by reference.
- the CRISPR-Cas nuclease can be engineered for increased fidelity.
- fidelity when used in reference to a CRISPR/Cas system comprising a Cas nuclease and gRNA refers to the specificity of the system for a target site in a DNA molecule (e.g., genomic DNA molecule) that is homologous (e.g., perfect match) to the gRNA spacer sequence.
- a CRISPR-Cas system with increased fidelity has reduced activity at off-target sites in the DNA molecule, i.e. , sites that are an imperfect match to the gRNA spacer sequence.
- the CRISPR-Cas system can comprise a Cas variant (e.g., a SpCas9 functional derivative, a SluCas9 functional derivative, a SaCas9 functional derivative) comprising one or more mutations for increased fidelity.
- the one or more mutations result in reduced activity of the CRISPR-Cas system at off-target sites in the DNA molecule, for example, compared to a system comprising an unmodified version of the Cas nuclease (e.g., wild-type SpCas9 nuclease, wild-type SluCas9 nuclease, wild-type SaCas9 nuclease).
- the CRISPR-Cas system has substantially equivalent activity for inducing cleavage at an on- target site in the DNA molecule, for example, as compared to the system comprising an unmodified version of the Cas nuclease.
- Methods of making Cas variants with increased fidelity are known in the art. For example, in some embodiments, a method of structure- guided engineering is used to make a Cas variant with increased fidelity. 10
- the CRISPR-Cas system described herein comprises a Cas9 nuclease comprising one or more mutations for increased fidelity.
- the Cas9 nuclease is derived from S. pyogenes, wherein the Cas nuclease comprises one or more mutations relative to wild-type SpCas9 for increased fidelity.
- the Cas9 nuclease is derived from S. aureus, wherein the Cas nuclease comprises one or more mutations relative to wild-type SaCas9 for increased fidelity.
- the Cas9 nuclease is derived from S. lugdunensis, wherein the Cas nuclease comprises one or more mutations relative to wild-type SluCas9 for increased fidelity.
- a suitable Cas9 nuclease with increased fidelity for use in the present disclosure includes anyone described in US2019/0010471; US2018/0142222; US 9,944,912; W02020/057481 ; US2019/0177710; US2018/0100148; US 10,526,591 ; and US20200149020; each of which is incorporated herein by reference in their entirety.
- the CRISPR-Cas nuclease can be engineered by one or more amino acid substitutions, deletions, and/or insertions to have improved targeting specificity, improved fidelity, altered PAM specificity, decreased off-target effects, and/or increased stability.
- Non-limiting examples of one or more mutations that improve targeting specificity, improve fidelity, and/or decrease off-target effects include N497A, R661A, Q695A, K810A, K848A, K855A, Q926A, K1003A, R1060A, and/or D1135E (with reference to the numbering system of SpCas9).
- the CRISPR-Cas nuclease can be linked to one or more nuclear localization signal (NLS) at or within 50 amino acids from the N-terminal end and/or the C-terminal end.
- NLSs are well known in the art.
- the NLS can be the SV40 Large T- antigen NLS, nucleoplasmin NLS, c-Myc NLS, or derivatives thereof.
- the linkage between the CRISPR nuclease and the NLS can be a direct or it can be indirect via an intervening linker sequence. Suitable linker sequences are well known in the art.
- Exemplary CRISPR-Cas nucleases include the Cas9 proteins as published in Fonfara et al. , “Phylogeny of Cas9 determines functional exchangeability of dual-RNA and Cas9 among orthologous type II CRISPR-Cas systems,” Nucleic Acids Research, 2014, 42: 2577-2590.
- the CRISPR-Cas gene naming system has undergone extensive 11 rewriting since the Cas genes were discovered.
- Fonfara et al. also provides PAM sequences for the Cas9 polypeptides from various species.
- a gRNA comprises at least (i) a spacer sequence that hybridizes to a target sequence and (ii) a CRISPR repeat sequence (crRNA).
- the gRNA also comprises a tracrRNA (trans-activating RNA) sequence.
- the CRISPR repeat sequence and tracrRNA sequence hybridize to each other to form a duplex.
- a type II system gRNA can be a single molecule (i.e. , single molecule gRNA or sgRNA) or can comprise two separate molecules (e.g., crRNA and tracrRNA).
- a crRNA sequence forms a duplex.
- the duplex binds the CRISPR-Cas nuclease such that the gRNA and the CRISPR-Cas nuclease form a complex.
- the gRNA thus, directs the activity of the CRISPR-CAS nuclease and provides specificity in targeting the CRISPR-Cas nuclease to the targeted genomic location.
- Each gRNA comprises a spacer sequence that defines the target sequence of a target nucleic acid (or target locus).
- the “target sequence” is adjacent to a PAM sequence in the target DNA and is cleaved by the CRISPR-Cas nuclease.
- the “target nucleic acid” is a double-stranded molecule; one strand comprises the target sequence and is referred to as the “PAM strand,” and the other complementary strand is referred to as the “non-PAM strand.”
- PAM strand the target sequence
- the gRNA spacer sequence hybridizes to the reverse complement of the target sequence, which is located in the non-PAM strand of the target nucleic acid.
- the spacer of a gRNA interacts with a target nucleic acid in a sequence-specific manner via hybridization (i.e., base pairing). The nucleotide sequence of the spacer thus varies depending on the target sequence of the target locus.
- the spacer sequence of the gRNA may be from about 15 to about 30 nucleotides long, or about 18 to 22 nucleotides long. In some embodiments, the spacer sequence is 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides. In some embodiments, a spacer sequence is 18, 19, 20, or 21 nucleotides long. In some embodiments, the spacer sequence is 20 nucleotide long.
- the spacer sequence has at least about 90%, at least about 95%, or at least about 99% sequence identity to the target sequence in the target nucleic acid. In certain embodiments, the spacer sequence has 100% sequence identity to the target sequence.
- a single molecule gRNA in a Type II CRISPR-Cas system can comprise, in the 5' to 3' direction, an optional spacer extension sequence, a spacer sequence, a minimum CRISPR repeat sequence, a single-molecule guide linker, a minimum tracrRNA sequence, a 3’ tracrRNA sequence and an optional tracrRNA extension sequence.
- the optional tracrRNA extension may comprise elements that contribute additional functionality (e.g., stability) to the guide RNA.
- the guide linker links the minimum CRISPR repeat and the minimum tracrRNA sequence to form a hairpin structure.
- the optional tracrRNA extension may comprise one or more hairpins.
- a double molecule gRNA in a Type II CRISPR-Cas system can comprise a first strand comprising, in the 5' to 3' direction, an optional spacer extension sequence, a spacer sequence and a minimum CRISPR repeat sequence (crRNA), and a second strand comprising a minimum tracrRNA sequence (complementary to the minimum CRISPR repeat sequence), a 3’ tracrRNA sequence and an optional tracrRNA extension sequence.
- crRNA minimum CRISPR repeat sequence
- a gRNA in a Type V CRISPR-Cas system can comprise, in the 5' to 3' direction, a minimum CRISPR repeat sequence (crRNA) and a spacer sequence.
- crRNA minimum CRISPR repeat sequence
- the gRNA in some embodiments, can comprise one or more uracil residues at the 3’ end of the gRNA sequence.
- the gRNA may comprise one (U), two (UU), three (UUU), four (UUUU) or more uracils at the 3’ end of the gRNA sequence.
- the gRNA comprises 5, 6, 7, or 8 uracils at the 3’ end of the gRNA sequence.
- the gRNA comprises 1 to 8, 2 to 8, 3 to 8, or 4 to 8 uracils at the 3’ end of the gRNA sequence.
- modified gRNAs may comprise one or more 2'-0-methyl phosphorothioate nucleotides.
- the gRNA may comprise 2'-0-methyl-phosphorothioate residues at the 5' end/or the 3' end. 13
- the gRNA comprises three 2'-0-methyl-phosphorothioate residues at the 5' end and 2'-0-methyl-phosphorothioate residues at the 3' end.
- ZFNs Zinc finger nucleases
- Fokl functions only as a dimer
- a pair of ZFNs are engineered to bind to cognate target “half-site” sequences on opposite DNA strands and with precise spacing between them to enable the catalytically active Fokl dimer to form.
- dimerization of the Fokl domain which itself has no sequence specificity per se, a DNA double-strand break is generated between the ZFN half-sites as the initiating step in genome editing.
- each ZFN is typically comprised of 3-6 zinc fingers of the abundant Cys2-His2 architecture, with each finger primarily recognizing a triplet of nucleotides on one strand of the target DNA sequence, although cross-strand interaction with a fourth nucleotide also can be important. Alteration of the amino acids of a finger in positions that make key contacts with the DNA alters the sequence specificity of a given finger. Thus, a four-finger zinc finger protein will selectively recognize a 12 bp target sequence, where the target sequence is a composite of the triplet preferences contributed by each finger, although triplet preference can be influenced to varying degrees by neighboring fingers.
- ZFNs can be readily re-targeted to almost any genomic address simply by modifying individual fingers.
- proteins of 4-6 fingers are used, recognizing 12-18 bp respectively.
- a pair of ZFNs will typically recognize a combined target sequence of 24-36 bp, not including the typical 5-7 bp spacer between half-sites.
- the binding sites can be separated further with larger spacers, including 15-17 bp.
- a target sequence of this length is likely to be unique in the human genome, assuming repetitive sequences or gene homologs are excluded during the design process.
- the ZFN protein-DNA interactions are not absolute in their specificity so off-target binding and cleavage events do occur, either as a heterodimer between the two ZFNs, or as a homodimer of one or the other of the ZFNs.
- the latter possibility has been effectively eliminated by engineering the dimerization interface of the Fokl domain to create “plus” and “minus” variants, also 14 known as obligate heterodimer variants, which can only dimerize with each other, and not with themselves. Forcing the obligate heterodimer prevents formation of the homodimer. This has greatly enhanced specificity of ZFNs, as well as any other nuclease that adopts these Fokl variants.
- TALENS Transcription activator-like effector nucleases
- ZFNs TALENs
- TALENs are TALENs, which like ZFNs comprise an engineered DNA binding domain linked to the Fokl nuclease domain.
- a pair of TALENs operate in tandem to achieve targeted DNA cleavage.
- the major difference from ZFNs is the nature of the DNA binding domain and the associated target DNA sequence recognition properties.
- the TALEN DNA binding domain derives from TALE proteins, which were originally described in the plant bacterial pathogen Xanthomonas sp.
- TALES are comprised of tandem arrays of 33-35 amino acid repeats, with each repeat recognizing a single base pair in the target DNA sequence that is typically up to 20 bp in length, giving a total target sequence length of up to 40 bp.
- Nucleotide specificity of each repeat is determined by the repeat variable diresidue (RVD), which includes just two amino acids at positions 12 and 13.
- RVD repeat variable diresidue
- the bases guanine, adenine, cytosine and thymine are predominantly recognized by the four RVDs: Asn-Asn, Asn-lle, His-Asp and Asn-Gly, respectively.
- RVD repeat variable diresidue
- Fokl domain then only single-strand DNA cleavage (nicking) will occur at the target site, rather than a DSB.
- the outcome is comparable to the use of CRISPR-Cas nickase mutants in which one of the Cas9 cleavage domains has been deactivated.
- HEs Homing endonucleases
- HEs are sequence-specific endonucleases that have long recognition sequences (14-44 base pairs) and cleave DNA with high specificity - often at sites unique in the genome.
- HEs can be used to create a DSB at a target locus as the initial step in genome editing.
- Nucleases can be engineered to comprise a fusion of a meganuclease (Mega) domain, a TALE DNA binding domain, and/or a homing endonuclease domain.
- Mega meganuclease
- TALE Transcription activator-like effector
- the MegaTAL platform and Tev-mTALEN platforms utilize fusion of TALE DNA binding domains and catalytically active HEs, taking advantage of both the tunable DNA binding and specificity of the TALE, as well as the cleavage sequence specificity of the HE; see, e.g., Boissel et al., Nucleic Acids Res., 2014, 42: 2591-2601 ; Kleinstiveret al., G3, 2014, 4:1155-65; and Boissel and Scharenberg, Methods Mol. Biol., 2015, 1239: 171-96, which are each incorporated by reference in their entirety.
- the MegaTev architecture is the fusion of a meganuclease with the nuclease domain derived 16 from the GIY-YIG homing endonuclease l-Tevl (Tev).
- the two active sites are positioned ⁇ 30 bp apart on a DNA substrate and generate two DSBs with non-compatible cohesive ends; see, e.g., Wolfs et al. , Nucleic Acids Res., 2014, 42, 8816-29, incorporated herein by reference in its entirety. It is anticipated that other combinations of existing nuclease- based approaches will evolve and be useful in achieving the targeted genome modifications described herein.
- fusion of the TALE DNA binding domain to a catalytically active HE takes advantage of both the tunable DNA binding and specificity of the TALE, as well as the cleavage sequence specificity of l-Tevl, with the expectation that off-target cleavage can be further reduced.
- the methods disclosed herein further comprises delivering to the cells an X- family DNA polymerase or functional fragment thereof or a nucleic acid encoding an X family polymerase or functional fragment thereof.
- the X-family of DNA polymerases have similar three-dimensional structures and include terminal deoxyribonucleotidyl transferase (TdT), polymerase lambda (Pol l), and polymerase mu (Pol m).
- TdT terminal deoxyribonucleotidyl transferase
- Poly l polymerase lambda
- Poly m polymerase mu
- Pol l and Pol m are responsible for polymerizing nucleotide addition at the site of a dsDNA break as part of the normal cellular DNA repair processes.
- TdT uniquely provides junctional diversity during VJD recombination in T- and B- cell precursors by adding nucleotides to the 3’ termini of dsDNA breaks in a template-free manner.
- terminal transferase is encoded by the DNTT gene.
- TdTS The short form (TdTS) consists of 509 amino acids; long form 1 (TdTL1) contains exon 12 and consists of 518 amino acids; long form 2 (TdTL2) contains exon 7 and consists of 527 amino acids; and long form 3 (TdTL3) contains both exons 7 and 13 and consists of 536 amino acids.
- the X-family DNA polymerase is Pol l. In other embodiments, the X-family DNA polymerase is Pol m. In certain embodiments, the X- family DNA polymerase is TdT, for example human TdT. In some embodiment, the X- 17 family DNA polymer is a long isoform of human TdT. In other embodiments, the X-family
- DNA polymerase is a combination of TdTS and either of TdTL1 , TdTL2, or TdTL3.
- the X-family DNA polymerase is the short isoform (TdTS) of human TdT.
- TdTS short isoform
- the two or more programmable gene editing systems and/or the X-family DNA polymerase can be provided to the cells as encoding nucleic acids.
- proteins of the gene editing systems and/or the X-family DNA polymerase can be coded by mRNA.
- proteins of the gene editing systems and/or the X-family DNA polymerase can be coded by DNA.
- the two or more gRNAs can be encoded by DNA.
- the encoding nucleic acid is mRNA
- the mRNA can be 5’ capped and/or 3’ polyadenylated.
- the mRNA can be single stranded and linear.
- the encoding nucleic acid is DNA
- the DNA can be single stranded, double stranded, linear, or circular.
- the gene editing system is a CRISPR-Cas system
- the DNA encoding the CRISPR-Cas nuclease generally is codon optimized for expression in the cells of interest.
- the mRNA can be synthesized by chemical means, as described in the art. While chemical synthetic procedures are continually expanding, purification of such RNAs by procedures such as 18 high-performance liquid chromatography (HPLC, which avoids the use of gels such as PAGE) tends to become more challenging as polynucleotide lengths increase significantly beyond a hundred or so nucleotides.
- HPLC high-performance liquid chromatography
- One approach used for generating RNAs of greater length is to produce two or more molecules that are ligated together. Much longer RNAs are more readily generated enzymatically in vitro.
- Various types of RNA modifications can be introduced during or after chemical synthesis and/or enzymatic generation of RNAs, e.g., modifications that enhance stability and the like.
- the encoding nucleic acid can be provided as part of vector system.
- vector refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked.
- plasmid refers to a circular double-stranded DNA loop into which additional nucleic acid segments can be ligated.
- viral vector Another type of vector is a viral vector, wherein additional nucleic acid segments can be ligated into the viral genome.
- Viral vectors can be RNA or DNA.
- Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome.
- vectors can be capable of directing the expression of nucleic acids to which they are operatively linked. Such vectors are referred to as “recombinant expression vectors”, or more simply “expression vectors”, which serve equivalent functions.
- operably linked means that the nucleotide sequence of interest is linked to regulatory sequence(s) in a manner that allows for expression of the nucleotide sequence.
- regulatory sequence is intended to include, for example, promoters, enhancers and other expression control elements (e.g., polyadenylation signals). Such regulatory sequences are well known in the art and are described, for example, in Goeddel; Gene Expression Technology: Methods in Enzymology, 1990, 185, Academic Press, San Diego, CA.
- Regulatory sequences include those that direct constitutive expression of a nucleotide sequence in many types of host cells, and those 19 that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue- specific regulatory sequences). It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the target cell, the level of expression desired, and the like.
- Suitable expression vectors include, but are not limited to, viral vectors based on vaccinia virus, poliovirus, adenovirus, adeno-associated virus (AAV), SV40, herpes simplex virus, human immunodeficiency virus, retrovirus (e.g., Murine Leukemia Virus, spleen necrosis virus, and vectors derived from retroviruses such as Rous Sarcoma Virus, Harvey Sarcoma Virus, avian leukosis virus, a lentivirus, human immunodeficiency virus, myeloproliferative sarcoma virus, and mammary tumor virus) and other recombinant vectors.
- retrovirus e.g., Murine Leukemia Virus, spleen necrosis virus, and vectors derived from retroviruses such as Rous Sarcoma Virus, Harvey Sarcoma Virus, avian leukosis virus, a lentivirus, human immunodeficiency virus, myelop
- vectors contemplated for eukaryotic target cells include, but are not limited to, the vectors pXT1 , pSG5, pSVK3, pBPV, pMSG, and pSVLSV40 (Pharmacia). Other vectors can be used so long as they are compatible with the host cell.
- a vector can comprise one or more transcription and/or translation control elements.
- any of a number of suitable transcription and translation control elements including constitutive and inducible promoters, transcription enhancer elements, transcription terminators, etc. can be used in the expression vector.
- suitable eukaryotic promoters i.e.
- promoters functional in eukaryotic cells include those from cytomegalovirus (CMV) immediate early, herpes simplex virus (HSV) thymidine kinase, early and late SV40, long terminal repeats (LTRs) from retrovirus, human elongation factor-1 a promoter (EF1a), chicken beta-actin promoter (CBA), ubiquitin C promoter (UBC), a hybrid construct comprising the cytomegalovirus enhancer fused to the chicken beta-actin promoter (CAG), murine stem cell virus promoter (MSCV), phosphoglycerate kinase-1 locus promoter (PGK), and mouse metallothionein-l promoter.
- CMV cytomegalovirus
- HSV herpes simplex virus
- LTRs long terminal repeats
- EF1a human elongation factor-1 a promoter
- CBA chicken beta-actin promoter
- UBC ubiquitin C promoter
- PGK
- the promoter can be an inducible promoter (e.g., a heat shock promoter, tetracycline-regulated promoter, steroid-regulated promoter, metal-regulated promoter, estrogen receptor-regulated promoter, etc.), a tissue specific promoter, or a cell type specific promoter.
- DNA encoding gRNAs for two or more CRISPR-Cas systems can be operably linked to a promoter sequence that is recognized by RNA polymerase III (Pol III).
- Pol III promoters include, but are not limited to, mammalian U6, U3, H1 , and 7SL RNA promoters.
- DNA encoding an array of two or more gRNAs can be operably linked to a Pol III promoter.
- the two or more programmable gene editing systems can be delivered to the cells as proteins or RNP complexes, encoding mRNA, or encoding DNA and the X-family DNA polymerase can be delivered to the cells as protein, encoding RNA, or encoding DNA.
- two programmable gene editing systems are delivered to the cells. In other embodiments, three programmable gene editing systems are delivered to the cells. In further embodiments, four programmable gene editing systems are delivered to the cells. In still other embodiments, five programmable gene editing systems are delivered to the cells. In additional some embodiments, six or more programmable gene editing systems are delivered to the cells.
- the two or more programmable gene editing systems and the X-family DNA polymerase can be delivered to the cells by a variety of methods. Suitable methods include electroporation (e.g., nucleofection), lipofection, sonoporation, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation conjugates, lipidmucleic acid conjugates, naked DNA, naked RNA, capped RNA, artificial virions, and agent- enhanced uptake of DNA.
- the gene editing system can be delivered to the cells of interest via electroporation.
- the two or more CRISPR-Cas systems can comprise the same CRISPR-Cas nuclease (e.g., SpCas9) or different CRISPR-Cas nucleases (e.g., SpCas9 and SaCas9, etc.), wherein each gRNA is engineered to complex with a specific Cas9 protein.
- the number of CRISPR-Cas systems is determined by the number of different gRNAs. In some embodiments, two different gRNAs are provided to the cells. In other 21 embodiments, three different gRNAs are provided to the cells. In still other embodiments, four different gRNAs are provided to the cells. In additional embodiments, five different gRNAs are provided to the cells. In further embodiments, six or more different gRNAs are provided to the cells.
- the two or more CRISPR-Cas systems can be delivered to the cells as (i) RNP complexes, each comprising a CRISPR-Cas nuclease and a gRNA; (ii) mRNA encoding the CRISPR-Cas nuclease and two or more gRNAs; (iii) mRNA encoding the CRISPR-Cas nuclease and DNA encoding the two or more gRNAs; or (iv) DNA encoding the CRISPR-Cas nuclease and DNA encoding the two or more gRNAs.
- the weight ratio of each gRNA to the CRISPR-Cas nuclease can range from about 1 :1 to about 10:1. In various embodiments, the weight ratio of each gRNA to the CRISPR-Cas nuclease can be about 1 :1 , about 1.5:1 , abut 2:1 , about 2.5:1 , about 3:1 , about 3.5:1 , about 4:1 , about 4.5:1, about 5:1 , about 5.5:1 , about 6:1 , about 6.5:1 , about 7:1 , about 7.5:1 , about 8:1 , about 8.5:1 , about 9:1 , about 9.5:1 , or about 10:1.
- the X-family DNA polymerase generally is delivered as encoding mRNA. In some embodiments, however, the X-family DNA polymerase can be delivered as encoding DNA or purified protein.
- the X-family DNA polymerase can be fused to the CRISPR-Cas nuclease.
- the X-family DNA polymerase or a functional fragment thereof can be fused to the amino terminus, carboxy terminus, or both of the CRISPR-Cas nuclease.
- the fusion can be direct or via a linker, which are well known in the art.
- the X-family DNA polymerase/CRISPR-Cas fusion can be delivered to the cells as a protein, encoding mRNA, or encoding DNA.
- the cells are mammalian cells.
- the cells are human 22 cells.
- the cells are other than immature, pre-B or pre-T lymphoid cells, leukemia cells, lymphoma cells, or immortalized cells derived from any of the foregoing.
- the cells are primary cells isolated directly from human (or animal) tissue.
- the cells used in the methods or compositions disclosed herein can be CD34+ HSPCs, T cells, NK cells, monocytes, macrophages, microglial cells, neutrophils, eosinophils, basophils, dendritic cells, or adipocytes.
- the cells can be T cells, e.g., human T cells.
- the T cells can be helper T cells or cytotoxic T cells.
- the cells can be NK cells, e.g., human NK cells.
- the primary cells can be, without limit, adipocytes, astrocytes, blood cells, chondrocytes, endothelial cells, epithelial cells, fibroblasts, hair cells, hepatocytes, keratinocytes, melanocyte, myocytes, neurons, osteoblasts, skeletal muscle cells, smooth muscle cells, stem cells, or synoviocytes.
- the cells can be stem cells (e.g., embryonic stem cells, fetal stem cells, amniotic stem cells, or umbilical cord stem cells).
- the stem cells can be adult stem cells isolated from bone marrow, adipose tissue, or blood.
- the stem cells can be induced pluripotent stem cells (e.g., human iPSCs).
- the cells may be hematopoietic stem and progenitor cells (HSPCs) or hematopoietic stem cells (HSCs).
- HSPCs give rise to all blood cell types, including erythroid (erythrocytes or red blood cells (RBCs)), myeloid (monocytes and macrophages, neutrophils, basophils, eosinophils, megakaryocytes / platelets, and dendritic cells), and lymphoid (T-cells, B-cells, NK-cells).
- Blood cells are produced by the proliferation and differentiation of a very small population of pluripotent HSCs that also have the ability to replenish themselves by self-renewal.
- Bone marrow is the major site of hematopoiesis in humans and, under normal conditions, only small numbers of HSPCs can be found in the peripheral blood (PB).
- cytokines in particular granulocyte colony-stimulating factor; G-CSF
- G-CSF granulocyte colony-stimulating factor
- myelosuppressive drugs used in cancer treatment, and compounds that disrupt the interaction between hematopoietic and BM stromal cells can rapidly mobilize large numbers of stem and progenitors into the circulation.
- the cell surface glycoprotein CD34 is routinely used to identify and isolate HSPCs.
- the cells can be mesenchymal stem cells (e.g., multipotent stromal cells that can differentiate into a variety of cell types).
- Mesenchymal stem cells are adult stem cells found in the bone marrow, or isolated from other tissues such as cord blood, peripheral blood, fallopian tube, and fetal liver and lung.
- MSCs differentiate into multiple cell types including adipocytes, chondrocytes, osteocytes, and cardiomyocytes.
- Mesenchymal stem cells are a distinct entity to the mesenchyme, embryonic connective tissue, which is derived from the mesoderm and differentiates to form hematopoietic stem cells (HPCs).
- HPCs hematopoietic stem cells
- the cells used in the methods disclosed herein can be in vitro. In other embodiments, the cells used in the methods can be ex vivo. In still other embodiments, the cells used in the methods can be in situ. In yet another embodiment, the cells used in the methods can be in vivo.
- the gene editing systems Upon delivery of the two or more programmable gene editing systems and the X-family DNA polymerase, the gene editing systems introduce double stranded DNA breaks at the targeted genomic loci, thereby generating cleaved genomic loci and the X- family DNA polymerase catalyzes addition of nucleotides to 3’ termini of the cleaved genomic loci, which we have found to lead to a reduction of chromosomal translocations between the two or more cleaved genomic loci.
- chromosomal translocations when two or more genomic loci are edited simultaneously in the presence of the X-family DNA polymerase, chromosomal translocations can be reduced by at least about 5%.
- the percent of chromosomal translocations can be reduced by at least about 5%, by at least about 10%, by at least about 15%, by at least about 20%, by at least about 35% ,by at least about 50% (2-fold 24 reduction), by at least about 2.2-fold, by at least about 2.5-fold, by at least about 3-fold, by at least about 3.5-fold, by at least about 4-fold, or at least more than 4-fold.
- each type of translocation variant e.g., monocentric, dicentric, acentric
- at least one type of variant can be reduced more than the other variants.
- nucleotides by the X-family DNA polymerase produces insertion-rich indel profiles.
- cells comprising an X-family DNA polymerase, such as TdTS exhibit increased percentages of nucleotide insertions, increased percentages of longer insertions, and decreased percentages of deletions at the targeted chromosomal loci as compared to cells lacking the X-family DNA polymerase.
- the percentage of nucleotide insertions of more than one nucleotide can be increased by at least about 40% as compared to cells lacking the X-family DNA polymerase. In some embodiments, the percentage of insertions can be increased about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, or about 40%.
- the length of insertions can be increased from about 1 to about 8 to 10 nucleotides, whereas cells lacking the X-family DNA polymerase rarely exhibit insertions longer than one nucleotide.
- the percentage of nucleotide deletions can be decreased by at least about 30% as compared to cells lacking the X-family DNA polymerase. In some embodiments, the percentage of deletions can be decreased about 2%, about 5%, about 10%, about 15%, about 20%, about 25%, or about 30%.
- the efficiency of editing varies between different genomic loci, the efficiency does not differ significantly between cells with or without the X-family DNA polymerase. In general, the efficiency of editing is at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, or at least 95%.
- compositions and kits for multiplex gene editing encompasses compositions and kits for multiplex gene editing.
- the present disclosure provides cells comprising two or more programmable gene editing systems and an X-family DNA polymerase as described above in section (I).
- the two or more programmable gene editing systems and 25 an X-family DNA polymerase can exist as DNA, RNA, protein, or combinations thereof.
- the cells comprise two or more CRISPR-Cas systems and TdT.
- the cells comprise two or more CRISPR-Cas systems and the short isoform of TdT (TdTS). Suitable cells are detailed above in section (l)(e).
- kits for multiplex gene editing comprise two or more programmable gene editing systems and an X-family DNA polymerase as described above in section (I), as well as reagents for gene editing.
- Reagents for gene editing include appropriate buffers, dNTPs, divalent salts, and the like.
- the kits can also include reagents and/or primers/probes for analyzing indel patterns and/or chromosomal translocations.
- the kits comprise two or more CRISPR-Cas systems and TdT.
- the kits comprise two or more CRISPR-Cas systems and the short isoform of TdT (TdTS).
- the kits can provide the two or more CRISPR-Cas systems as RNPs, RNA, and/or DNA, and can provide the TdT or TdTS as RNA, DNA, or protein.
- the terms “complementary” or “complementarity” refer to the association of double-stranded nucleic acids by base pairing through specific hydrogen bonds.
- the base paring may be standard Watson-Crick base pairing (e.g., 5’-A G T C-3’ pairs with the complementary sequence 3’-T C A G-5’).
- the base pairing also may be Hoogsteen or reversed Hoogsteen hydrogen bonding.
- Complementarity is typically measured with respect to a duplex region and thus, excludes overhangs, for example.
- Complementarity between two strands of the duplex region may be partial and expressed as a percentage (e.g., 70%), if only some (e.g., 70%) of the bases are complementary.
- the bases that are not complementary are “mismatched.”
- Complementarity may also be complete (i.e. , 100%), if all the bases in the duplex region are complementary.
- a “gene,” as used herein, refers to a DNA region (including exons and introns) encoding a gene product, as well as all DNA regions which regulate the production of the gene product, whether or not such regulatory sequences are adjacent to coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites, and locus control regions.
- nuclease and “endonuclease” are used interchangeably herein, and refer to an enzyme that cleaves both strands of a double-stranded nucleic acid sequence.
- nucleic acid and “polynucleotide” refer to a deoxyribonucleotide or ribonucleotide polymer, in linear or circular conformation, and in either single- or double- stranded form.
- these terms are not to be construed as limiting with respect to the length of a polymer.
- the terms can encompass known analogs of natural nucleotides, as well as nucleotides that are modified in the base, sugar and/or phosphate moieties (e.g., phosphorothioate backbones).
- an 27 analog of a particular nucleotide has the same base-pairing specificity; i.e., an analog of A will base-pair with T.
- nucleotide refers to deoxyribonucleotides or ribonucleotides.
- the nucleotides may be standard nucleotides (i.e., adenosine, guanosine, cytidine, thymidine, and uridine), nucleotide isomers, or nucleotide analogs.
- a nucleotide analog refers to a nucleotide having a modified purine or pyrimidine base or a modified ribose moiety.
- a nucleotide analog may be a naturally occurring nucleotide (e.g., inosine, pseudo uridine, etc.) or a non-naturally occurring nucleotide.
- Non-limiting examples of modifications on the sugar or base moieties of a nucleotide include the addition (or removal) of acetyl groups, amino groups, carboxyl groups, carboxymethyl groups, hydroxyl groups, methyl groups, phosphoryl groups, and thiol groups, as well as the substitution of the carbon and nitrogen atoms of the bases with other atoms (e.g., 7-deaza purines).
- Nucleotide analogs also include dideoxy nucleotides, 2’-0-methyl nucleotides, locked nucleic acids (LNA), peptide nucleic acids (PNA), and morpholinos.
- sequence identity indicates a quantitative measure of the degree of identity between two sequences of substantially equal length.
- the percent identity of two sequences, whether nucleic acid or amino acid sequences is the number of exact matches between two aligned sequences divided by the length of the shorter sequence and multiplied by 100.
- An approximate alignment for nucleic acid sequences is provided by the local homology algorithm of Smith and Waterman, Advances in Applied Mathematics 2:482-489 (1981). This algorithm can be applied to amino acid sequences by using the scoring matrix developed by Dayhoff, Atlas of Protein Sequences and Structure, M. O. Dayhoff ed., 5 suppl.
- PBMCs AllCells
- T cell CTSTM OpTmizerTM T Cell Expansion Media (Thermo Fisher) supplemented with 5% Human Serum AB (Vally Biomedical), 50 ng/ml IL-2 (Miltenyi Biotec) and 10 ng/ml IL-7 (Cellgenix).
- T cells were activated with T Cell TransActTM (Miltenyi) for 3 days.
- T Cell TransActTM was subsequently removed from culture and the cells were seeded at a density of 5 x 105 cells/ml.
- Jurkat cells, Clone E6-1 (ATCC TIB-152) were grown in RPMI-1640 + GlutaMax (Thermo Fisher) supplemented with 10% Fetal Bovine Serum (Thermo Fisher) and 1% Penicillin- Streptomycin (Thermo Fisher). Cells were seeded at a density of 1 x 105 cells/ml and passaged before exceeding 1 x 106 cells/ml. Preculturing of cells prior to nucleofection was followed according to recommendations by Lonza.
- Staphylococcus lugdunensis (Slu) combined with 28 pmol of guide “CD70_SluCas9_gRNA_3”.
- Gene-editing experiments that included mRNA-encoding TdT isoforms were performed by including the relevant mRNA directly in the RNP mixture.
- T cells and Jurkat cells were nucleofected using the Lonza4D nucleofector and the P3 primary cell and SE Cell line kits, respectively. Cells were counted using a Cellometer K2 or Cellaca MX HighThroughput Cell Counter (Nexcelom). T cells and Jurkat cells were resuspended in P3/SE buffer with supplement at a concentration of 5x104/pl and 1x104/mI, respectively. A total of 1x106 T cells or and 2x105 Jurkat cells were combined with RNP complex and transferred to a Lonza nucleofection cuvette for all editing experiments. T cells were nucleofected using program EO-115 and Jurkat cells nucleofected using program CL-120.
- Jurkat cells were immediately recovered in RPMI media. T cells were recovered in serum-free media for 1 hour post-nucleofection before being transferred to media supplemented with serum. After 48 hours, cells were recovered, and genomic DNA was harvested using the QIAcube HT DNA extraction system and QIAamp 96 DNA QIAcube HT Kit or DNeasy Blood and Tissue Kit (Qiagen).
- TdT mRNA for Kinetics of TdT Expression in Primary T cells was 30 performed as described above by adding mRNA directly to the P3 cellular mixture. In these experiments, a total of 5x106 T cells were transferred to a large Lonza nucleofection cuvette. Cells were recovered directly in T cell expansion media and 1 million T cells were harvested at the indicated time points for preparation of protein lysates.
- PCR and Tracking of Indels by Decomposition TIDE. Isolated genomic DNA was subjected to PCR to determine indel frequency by TIDE analysis (Brinkman et al. 2014, Nucleic Acids Research 42 (22): e168-e168). PCR for relevant regions was performed (1 cycle at 95°C for 3 min; 34 cycles at 98°C for 20 sec, 69°C for 15 sec, 72°C for 1 min; 1 cycle at 72°C for 3 min) according to the manufacturer’s recommendations for each commercial PCR polymerase. Resulting amplicons were examined on cast 1.2% or 2% pre-cast agarose gels.
- PCR clean-up of the resulting amplicons was performed using SPRIselect magnetic beads (Beckman Coulter). Sanger sequencing results were input into Tsunami along with guide sequences and resulting indel frequencies and identities were calculated by the software. Indel frequencies and editing efficiency was plotted using Prism (GraphPad). Indels were plotted as heatmaps with the indel identities on one axis and the corresponding frequency indicated by color intensity. Total editing frequencies were plotted as bar graphs.
- ddPCR Droplet Digital PCR
- ddPCR assays were designed to detect four translocations denoted (T1 , T2, T3, and T4) from simultaneous editing of the PD1 and CD70 loci. All primers and probes were ordered from IDT. Primer pairs and a HEX- labeled reference probe were designed to count all copies of the reference amplicon within the RAG1 locus. Paired primers and a FAM-labeled probe (Table 2) were designed to polymerize amplicon across the breakpoint of the predicted translocation. Amplification reactions were formulated in 2x ddPCR Supermix for Probes (No dUTP) (BioRad).
- Primers and probes were introduced at a final concentration of 1.8 mM and 0.5 pM, respectively.
- 100-200 ng of genomic DNA from cells harvested 48-hours post- nucleofection was introduced to each ddPCR reaction along with 0.1 Unit of BamH1-HF to reduce viscosity generated by high DNA concentration.
- Individual reactions comprising 25 pi of ddPCR amplification mixture, were placed into an Automated Droplet Generator (BioRad). For each sample, roughly 15,000-20,000 nanoliter-sized droplets were 31 generated.
- droplets were transferred to a 96-well plate for PCR in a thermal cycler (1 cycle at 95°C for 10 min; 40 cycles at 90°C for 30 sec, 57°C for 1 min, 72°C for 1 min; 1 cycle at 98°C for 10 min; 4°C hold).
- droplets were read by QX200 Droplet Reader and analyzed using the QuantasoftTM software (BioRad), which analyzes each droplet individually using a two-color detection system (set to detect FAM and HEX). The ratio between translocation specific events (FAM) over events of reference amplicon (HEX) (x100) is the percentage of translocation.
- No template controls (NTCs) were included for each translocation assessed to determine background fluorescence. Data was plotted in the form of bar graphs on Prism (GraphPad). 32
- the Anti-Rabbit Detection Module (ProteinSimpleTM) was used with the Anti-hTdT RabMab antibody [EPR2976Y] (ab76544) (Abeam) at 1.45 pg/ml.
- Tubulin was detected with the Anti-Mouse Detection Module (ProteinSimpleTM) and the monoclonal Anti-a-Tubulin clone B-5-1-2 (Millipore Sigma) at a 1 :10,000 dilution.
- Capillary westerns were run on the JESS device with the 12-230 kDa kits (ProteinSimpleTM) and all procedures were performed according to manufacturer’s protocol.
- the JESS device was associated with Compass software for device settings and raw data recording (ProteinSimple/Biotechne).
- TdT-/- Jurkat cells were created by CRISPR editing of the DNTT locus. Briefly, the DNTT gene was singly edited with each 33 guide in Table 3 according to the same method detailed in “CRISPR Editing of Genomic Loci in Jurkat and Primary T cells” using S. pyogenes Cas9 for RNP formation. Editing efficiencies were established by TIDE analysis of Sanger sequencing using the primers (Table 4) and amplification protocol (1 cycle at 95°C for 3 min; 34 cycles at 98°C for 20 sec, 68°C for 45 sec, 72°C for 1 min; 1 cycle at 72°C for 3 min).
- TdT expression Diminution of TdT expression was confirmed using “Immunodetection by Capillary Western” described above. Bulk-edited cultures displayed near-complete loss of TdT expression and were therefore used for downstream editing and translocation analysis following editing of the PD1 and CD70 loci.
- Plasmids encoding TdT isoforms and Cas9 nucleases from S. pyogenes and S. lugdunensis were synthesized by ATUM to contain an upstream T7 promoter, a downstream polyA tail, and a single Sap I restriction site adjacent to the polyA tail. Linear templates were created by digesting plasmid with Sap I 34
- gRNAs were designed to target the NPM1 gene locus or the ALK gene locus in human cells (Table 1).
- RNPs comprising SpCas9 protein and gRNA were delivered by electroporation to Jurkat and T cells as described above in Example 1.
- the edited loci were analysed by PCR and TIDE (as described above) to determine the efficiency of editing and the mutational profile (or indel spectrum) of the types and distribution of indels after repair of the dsDNA break introduced by Cas9.
- the editing efficiencies were similar between the two cell types at each of the loci.
- the indel profiles of T cells mainly comprised single nucleotide insertions (+1) and deletions
- those of Jurkat cells mainly comprised nucleotide insertions of +1 or greater (FIG. 1B).
- TdT is not active in mature lymphocytes (primary T cells) but is reactivated in some stable cell lines originating from T cells (e.g., Jurkat cells).
- FIG. 4 presents a Western blot confirming that Jurkat, but not T cells, express TdT.
- TdT To determine whether TdT affects the indel profiles and chromosomal rearrangements in Jurkat cells, the DNTT gene locus was disrupted such that no TdT was expressed. For this, several gRNAs that target various exons were designed, as shown above in Table 3. After editing, TdT knock out was confirmed at the level of DNA and protein. TdT levels were very low in the bulk edited pool of cells (FIG. 5). Clonal outgrowth identified Jurkat (TdT-) cells.
- TdT- edited Jurkat
- TdT+ wild type Jurkat
- primary T cells essentially as described above in Example 3.
- FIG. 6A and 6B the indel profile of the Jurkat (TdT-) cells mirrored that of T cells in that the profiles comprised mainly single nucleotide insertions and deletions.
- the Jurkat (TdT-) cells also exhibited translocation frequencies similar to those of T cells (FIG. 6C).
- TdT exists in several isoforms, e.g., short isoform, long isoform 1 (which includes exon 12), long isoform 2 (which included exon 7), and long isoform 3 (which included exons 7 and 12).
- T cells were electroporated with 6 pmol of mRNA encoding the various TdT isoforms, as well as Cas9 mRNA.
- the kinetics of TdT expression relative to alpha-tubulin expression are presented in FIG. 7.
- TdTS short isoform
- Lower levels of TdTS mRNA were introduced into T cells (FIG. 8A). Even at the lowest dose of 1.5 pmol, the relative abundance of TdTS to alpha-tubulin in T cells was higher than that in wild type Jurkat cells at 12 hours post electroporation (FIG. 8B).
- T cells were electroporated with Cas9 editing reagents (RNPs) targeting PD1 and CD70, as described above, and 3 pmol of TdTS mRNA (with no mRNA or GFP mRNA serving as controls).
- the T cells provided with TdTS mRNA exhibited indel profiles with increased frequency of insertions of +1 and greater (FIG. 9A) and exhibited a three-fold decrease in translocations as compared to control cells (FIG. 9B).
- the editing efficiency was high at both loci under the various conditions (FIG. 9C).
- it was found that providing the long isoforms of TdT did not change the indel profile (FIG. 10A) or the translocate rate in T cells (FIG. 10B).
- Example 7 Indel Patterns in T Cells Edited with Cas9 RNP and TdTS mRNA versus Cas9 mRNA +sgRNA and TdTS mRNA
- Cas9 was delivered as RNP complex (Cas9protein + sgRNA) or as nucleic acids (Cas9mRNA + sgRNA) to determine whether the means of Cas9 delivery could affect editing efficiency, indel profile, and/or translocation rate.
- T cells were multiplex edited at PD1 and CD70 in the absence or presence of TdTSmRNA. The editing efficiency did not differ between the two types of Cas9 delivery (FIGs. 11 A, 11 B). Delivery of TdTS resulted in insertion heavy indels (FIGS. 12A-12D) and reduced frequency of translocations (FIGS. 13A, 13B).
- T cells will be electroporated with TdTSmRNA and three Cas9 systems (either RNPs or Cas9mRNA + sgRNA) targeted three different genomic loci.
- the percent of chromosomal translocations, the indel patterns, and the editing efficiency will be analyzed as described above. ⁇
Abstract
Provided herein are methods and compositions for gene editing at multiple sites. The methods comprise delivering to cells two or more programmable gene editing systems and an X-family DNA polymerase, wherein the programmable gene editing systems generate cleaved genomic loci and the X-family DNA polymerase catalyzes addition of nucleotides to 3' termini of the cleaved genomic loci, such that chromosomal translocations are reduced between the cleaved genomic loci, thereby facilitating multiplex gene editing.
Description
1
MULTIPLEX GENE EDITING
CROSS-REFERENCE TO RELATED APPLICATION
[0001] This application claims the benefit of U.S. Provisional Application No. 63/187,755, filed May 12, 2021 , the disclosure of which is hereby incorporated by reference in its entirety.
INCORPORATION BY REFERENCE OF SEQUENCE LISTING
[0002] This application contains a Sequence Listing that has been submitted in ASCII format via EFS-Web and is hereby incorporated by reference in its entirety. The ASCII copy, created on May 10, 2022 is named 100867-727793_CT162-
PCT1_Sequence_Listing_ST25.txt, and is about 18,000 bytes in size.
FIELD
[0003] The present disclosure relates to the field of CRISPR-based multiplex gene editing.
BACKGROUND
[0004] Multiplex gene editing, in which multiple gene editing systems are introduced simultaneously into cells, have greatly expanded the scope and efficiency of gene editing applications. Chromosomal rearrangements, however, can result from the simultaneous generation of several double-strand breaks at multiple genomic loci. As such, there is a need for methods of reducing chromosomal translocations during multiplex gene editing.
SUMMARY
[0005] Various aspects of the present disclosure relate to methods for gene editing at multiple gene loci. In various aspects, the methods herein comprise delivering to a cell: (a) two or more CRISPR-Cas nuclease systems or nucleic acids encoding two or more CRISPR-Cas nuclease systems, wherein each CRISPR-Cas nuclease system is targeted to a different genomic locus; and (b) an X-family DNA polymerase or a nucleic acid
2 encoding an X family DNA polymerase; wherein the two or more CRISPR-Cas nuclease systems generate cleaved genomic loci and the X-family DNA polymerase catalyzes addition of nucleotides to 3’ termini of the cleaved genomic loci, such that chromosomal translocations are reduced between the cleaved genomic loci, thereby facilitating editing at multiple genomic loci.
[0006] In various aspects, the addition of nucleotides may increase nucleotide insertion frequencies, increase insertion lengths, and/or decrease nucleotide deletion frequencies as compared to a comparable cell in which the X-family DNA polymerase is not delivered.
[0007] In various aspects, chromosomal translocations may be reduced by at least 5% when two genomic loci are edited as compared to a comparable cell in which the X-family DNA polymerase is not delivered.
[0008] In any of the methods provided herein, each CRISPR-Cas nuclease system may comprise a CRISPR-Cas nuclease and a gRNA. In various aspects, the CRISPR- Cas nuclease may be a type II CRISPR-Cas nuclease ora type V CRISPR-Cas nuclease. As a non-limiting example, the CRISPR-Cas nuclease may be SpCas9, SluCas9, SaCas9, ora variant thereof with increased fidelity. In any of the methods provided herein, the gRNA may be a single molecule gRNA (sgRNA).
[0009] In any of the methods provided herein, each of the CRISPR-Cas nuclease systems may be delivered as a ribonucleoprotein (RNP) complex. Alternatively, each of the CRISPR-Cas nuclease systems may be delivered as a CRISPR-Cas nuclease and a gRNA. Alternatively, each of the CRISPR-Cas nuclease systems may be delivered as mRNA encoding the CRISPR-Cas nuclease and a gRNA. As another option, each of the CRISPR-Cas nuclease systems may be delivered as DNA encoding the CRISPR-Cas nuclease and DNA encoding the gRNA.
[0010] In any of the methods provided herein, the X-family DNA polymerase may be a terminal deoxyribonucleotidyl transferase (TdT), polymerase lambda (Pol l), or polymerase mu (Pol m). In some aspects, the X-family DNA polymerase is a short isoform of TdT (TdtS) (e.g., having an amino acid sequence of SEQ ID NO: 34).
3
[0011] In any of the methods provided herein, the X-family DNA polymerase may be delivered as mRNA, protein, or DNA.
[0012] In any of the methods provided herein, two CRISPR-Cas nuclease systems are delivered to the cell. In other methods, three CRISPR-Cas nuclease systems are delivered to the cell. In still other methods, four CRISPR-Cas nuclease systems are delivered to the cell. In various aspects, five CRISPR-Cas nuclease systems may be delivered to the cell.
[0013] In any of the methods herein, delivering may comprise electroporation.
[0014] In any of the methods herein, the cell may be a mammalian cell. In some aspects, the cell is other than an immature, pre-B or pre-T lymphoid cell, a leukemia cell, a lymphoma cell, or an immortalized cell derived therefrom. In some aspects, the cell may be a primary T cell or an NK cell.
[0015] In any of the methods provided, the cell may be in vitro, ex vivo, in situ, or in vivo.
[0016] Further aspects of the present disclosure are directed to a cell comprising (a) two or more CRISPR-Cas nuclease systems and (b) a short isoform of terminal deoxyribonucleotidyl transferase (TdTS), wherein the CRISPR-Cas nuclease systems and the TdTS are exogenous to the cell.
[0017] In various aspects, each CRISPR-Cas nuclease systems of (a) may be a ribonucleoprotein (RNP) complex. In other aspects, each CRISPR-Cas nuclease system of (a) may comprise CRISPR-Cas mRNA and gRNA.
[0018] In various aspects, (b) may comprise mRNA encoding TdTs or may comprise TdTS protein.
[0019] In other aspects, each CRISPR-Cas nuclease system of (a) comprises DNA encoding CRISPR-Cas nuclease and DNA encoding gRNA and (b) comprises DNA encoding TdTS.
[0020] Further aspects of the present disclosure are directed to a kit for multiplex gene editing, the kit comprising (a) two or more CRISPR-Cas nuclease systems, and (b) a short isoform of terminal deoxyribonucleotidyl transferase (TdTS). In various aspects, each CRISPR-Cas nuclease systems of (a) in the kits provided herein may be a RNP complex.
4
In other aspects, each CRISPR-Cas nuclease system of (a) may comprise CRISPR-Cas mRNA and gRNA. In various aspects, (b) may comprise mRNA encoding TdTS or may comprise TdTS protein. In various aspects, the kits provided are as described herein each CRISPR-Cas nuclease system of (a) comprises DNA encoding CRISPR-Cas nuclease and DNA encoding gRNA and (b) comprises DNA encoding TdTS.
[0021 ] In further aspects, any of the kits provided herein may further comprise at least one reagent for gene editing.
BRIEF DESCRIPTION OF THE DRAWINGS
[0022] FIG. 1A presents editing efficiencies with NPM1 or ALK gRNAs in Jurkat or T cells. Plotted is the percent of indels in Jurkat or T cells for each gRNA.
[0023] FIG. 1B presents indel spectra for T cells (upper) and Jurkat cells (lower). Plotted are the frequencies of deletions or insertions, with darker shading indicating higher frequencies.
[0024] FIG. 2A presents editing efficiencies after multiplex editing at PD1 and CD70 loci in Jurkat and T cells.
[0025] FIG. 2B presents indel spectra at PD1 and CD70 loci in T cells (left) and Jurkat cells (right). Plotted are the frequencies of insertions or deletions, with darker shading indicating higher frequencies.
[0026] FIG. 3A diagrams translocation variants of PD1/CD70.
[0027] FIG. 3B presents the percentage of T1-T4 translocation variants of PD1/CD70 in Jurkat and T cells.
[0028] FIG. 4 presents Western images showing the levels of TdT and alpha-tubulin in Jurkat and T cells.
[0029] FIG. 5 presents Western images showing the levels of TdT and alpha-tubulin in Jurkat cells edited to knock out TdT (labeled T1 , T11 , T14), and wild type Jurkat and T cells.
[0030] FIG. 6A presents indel spectra at PD1 and CD70 loci in wild type Jurkat (TdT+), edited Jurkat (TdT-), and T cells.
5
[0031] FIG. 6B presents editing efficiencies at PD1 and CD70 loci in Jurkat (TdT+), Jurkat (TdT-), and T cells.
[0032] FIG. 6C presents the percentage of each translocation variant of PD1/CD70 in Jurkat (TdT+), Jurkat (TdT-), and T cells.
[0033] FIG. 7 shows the kinetics of expression of TdT short isoform (TdTS), TdT long isoforms (TdTL1 , TdTL2, TdTL3), and Cas9 after delivery of the corresponding mRNA to T cells.
[0034] FIG. 8A shows the levels of TdTS after delivery of increasing amounts of TdTS mRNA to T cells and the level in Jurkat (TdT+) cells.
[0035] FIG. 8B presents the relative levels of TdTS to alpha-tubulin in under the conditions described in FIG. 8A.
[0036] FIG. 9A presents indel spectra at PD1 and CD70 loci in T cells provided with TdTS mRNA or control mRNA.
[0037] FIG. 9B presents the percentage of each translocation variant of PD1/CD70 in T cells provided with TdTS mRNA or control mRNA.
[0038] FIG. 9C presents editing efficiencies at PD1 and CD70 loci in T cells provided with TdTS mRNA or control mRNA.
[0039] FIG. 10A presents indel spectra at PD1 and CD70 loci in T cells provided with short or long isoforms of TdT.
[0040] FIG. 10B shows the percentage of each translocation variant of PD1/CD70 in T cells provided with short or long isoforms of TdT.
[0041 ] FIG. 11 A presents editing efficiencies at PD1 and CD70 loci in T cells provided with TdTS mRNA or control mRNA and Cas9 RNP.
[0042] FIG. 11B presents editing efficiencies at PD1 and CD70 loci in T cells provided with TdTS mRNA and Cas9 mRNA + sgRNA.
[0043] FIG. 12A shows indel spectra at the CD70 locus in T cells provided with TdTS mRNA and Cas9 RNP.
[0044] FIG. 12B shows indel spectra at PD1 locus in T cells provided with TdTS mRNA and Cas9 RNP.
6
[0045] FIG. 12C present indel spectra at the CD70 locus in T cells provided with TdTS mRNA and Cas9 mRNA + sgRNA.
[0046] FIG. 12D present indel spectra at the PD1 locus in T cells provided with TdTS mRNA and Cas9 mRNA + sgRNA.
[0047] FIG. 13A shows the percentage of each translocation variant of PD1/CD70 in T cells provided with TdTS mRNA and Cas9 RNP.
[0048] FIG. 13B shows the percentage of each translocation variant of PD1/CD70 in T cells provided with TdTS mRNA and Cas9 mRNA + sgRNA.
DETAILED DESCRIPTION
[0049] The present disclosure provides means for multiplex gene editing in which two or more programmable gene editing systems targeted to different genomic loci and an X- family DNA polymerase are delivered to cells. The endonucleases of the two or more programmable gene editing systems generate double stranded DNA breaks at the targeted genomic loci and the X-family DNA polymerase adds nucleotides to the 3’ termini of the double strand DNA breaks, leading to insertion-rich indel patterns. The insertion- rich indel patterns may reduce chromosomal translocations by a) disrupting the ability of the endonuclease to recognize and recut DNA after mutagenic scarring, b) facilitating preferential end-joining to its intrachromosomal partner through synapsis of the double strand DNA break, and c) shifting end joining pathway selection from microhomology- mediate end joining (MMEJ) to non-homologous end joining (NHEJ). The addition of nucleotides by the X-family DNA polymerase at the nuclease-induced double strand DNA breaks reduces the iterative rounds of nuclease recutting by subverting error-free repair and restoration of the DNA’s original configuration. As such, ectopic supplementation the X-family DNA polymerase also serves as an adjuvant for editing efficiency.
(I) Methods for Multiplex Gene Editing
[0050] One aspect of the present disclosure encompasses methods for gene editing at multiple genomic loci, the method comprising delivering to a cell (a) two or more programmable gene editing systems or nucleic acid encoding two or more programmable
7 gene editing systems, wherein each programmable gene editing system is targeted to a different genomic locus, and (b) an X-family DNA polymerase or nucleic acid encoding an X family DNA polymerase, wherein the two or more programmable gene editing systems generate cleaved genomic loci and the X-family DNA polymerase catalyzes addition of nucleotides to 3’ termini of the cleaved genomic loci, such that chromosomal translocations are reduced between the cleaved genomic loci, thereby facilitating editing at multiple genomic loci.
Programmable Gene Editing Systems
[0051 ] The method comprises delivery of two or more programmable editing systems such that two or more genomic loci can be edited simultaneously. Programmable gene editing systems are proteins or ribonucleoprotein complexes that can be programmed or engineered to target precise sequences in the genome and introduce double-stranded breaks at the targeted DNA loci. Suitable programmable gene editing systems include RNA-guided CRISPR-Cas systems, zinc finger nucleases, transcription activator-like effector-based nucleases, homing endonucleases, and hybrid nucleases.
[0052] CRISPR-Cas Systems. The RNA-guided clustered regularly interspaced short palindromic repeats (CRISPR)-Cas (CRISPR-associated protein) system is a naturally occurring defense mechanism in prokaryotes and archaea that has been repurposed as a RNA-guided DNA-targeting platform used for gene editing. A CRISPR-Cas system comprises a noncoding guide RNA (gRNA) to target the DNA and a CRISPR-Cas nuclease that cleaves the DNA.
[0053] The gRNA drives sequence recognition and specificity of the CRISPR-Cas system through Watson-Crick base pairing typically with a ~20 nucleotide (nt) sequence in the target locus, wherein the target sequence is adjacent to a specific short DNA motif referred to as a protospacer adjacent motif (PAM). The gRNA forms an RNA-duplex structure that is bound by the CRISPR-Cas nuclease to form the catalytically active CRISPR-Cas complex, which can then cleave the target locus. Once the CRISPR-Cas complex is bound to DNA at a target site, two independent nuclease domains within the
8
CRISPR-Cas nuclease cleave opposite strands of the DNA leaving a double-strand break (DSB).
[0054] CRISPR-Cas systems include Types I, II, III, IV, V, and VI systems. In some embodiments, the CRISPR-Cas system can be a Type II CRISPR-Cas system (e.g., Cas9). In other embodiments, the CRISPR-Cas system can be a Type V CRISPR-Cas system (e.g., Cas12a, previously termed Cpf1). The CRISPR-Cas9 nuclease or the CRISPR-Cas12a nuclease used in the methods described herein can have at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 97%, at least about 99% sequence identity to a wild-type CRISPR-Cas9 nuclease or CRISPR-Cas12a nuclease.
[0055] The Type-ll CRISPR-Cas system component can be from a Type-IIA, Type- IIB, or Type-IIC system. Cas9 and its orthologs are encompassed. Non-limiting exemplary species that the Cas9 nuclease or other components are from include Streptococcus pyogenes, Streptoccoccus lugdunensis, Streptococcus thermophilus, Streptococcus sp., Staphylococcus aureus, Listeria innocua, Lactobacillus gasseri, Francisella novicida, Wolinella succinogenes, Sutterella wadsworthensis, Gamma proteobacterium, Neisseria meningitidis, Campylobacter jejuni, Pasteurella multocida, Fibrobacter succinogene, Rhodospirillum rubrum, Nocardiopsis dassonvillei, Streptomyces pristinaespiralis, Streptomyces viridochromogenes, Streptomyces viridochromogenes, Streptosporangium roseum, Streptosporangium roseum, Alicyclobacillus acidocaldarius, Bacillus pseudomycoides, Bacillus selenitireducens, Exiguobacterium sibiricum, Lactobacillus delbrueckii, Lactobacillus salivarius, Lactobacillus buchneri, Treponema denticola, Microscilla marina, Burkholderiales bacterium, Polaromonas naphthalenivorans, Polaromonas sp., Crocosphaera watsonii, Cyanothece sp., Microcystis aeruginosa, Synechococcus sp., Acetohalobium arabaticum, Ammonifex degensii, Caldicelulosiruptor becscii, Candidatus Desulforudis, Clostridium botulinum, Clostridium difficile, Finegoldia magna, Natranaerobius thermophilus, Pelotomaculum thermopropionicum, Acidithiobacillus caldus, Acidithiobacillus ferrooxidans, Allochromatium vinosum, Marinobacter sp., Nitrosococcus halophilus, Nitrosococcus watsoni, Pseudoalteromonas haloplanktis, Ktedonobacter
9 racemifer, Methanohalobium evestigatum, Anabaena variabilis, Nodularia spumigena, Nostoc sp., Arthrospira maxima, Arthrospira platensis, Arthrospira sp., Lyngbya sp., Microcoleus chthonoplastes, Oscillatoria sp., Petrotoga mobilis, Thermosipho africanus, Streptococcus pasteurianus, Neisseria cinerea, Campylobacter lari, Parvibaculum lavamentivorans, Corynebacterium diphtheria, or Acaryochloris marina. In some embodiments, the Cas9 protein can be from Streptococcus pyogenes (SpCas9). In some embodiments, the Cas9 protein can be from S. lugdunensis (SluCas9). In some embodiments, the Cas9 protein can be from Staphylococcus aureus (SaCas9). In some embodiments, a suitable Cas9 protein for use in the present disclosure is any disclosed in W02019/183150 and WO2019/118935, each of which is incorporate herein by reference.
[0056] In some embodiments, the CRISPR-Cas nuclease can be engineered for increased fidelity. As used herein, the term “fidelity” when used in reference to a CRISPR/Cas system comprising a Cas nuclease and gRNA refers to the specificity of the system for a target site in a DNA molecule (e.g., genomic DNA molecule) that is homologous (e.g., perfect match) to the gRNA spacer sequence. In some embodiments, a CRISPR-Cas system with increased fidelity has reduced activity at off-target sites in the DNA molecule, i.e. , sites that are an imperfect match to the gRNA spacer sequence. [0057] In some embodiments, the CRISPR-Cas system can comprise a Cas variant (e.g., a SpCas9 functional derivative, a SluCas9 functional derivative, a SaCas9 functional derivative) comprising one or more mutations for increased fidelity. In some embodiments, the one or more mutations result in reduced activity of the CRISPR-Cas system at off-target sites in the DNA molecule, for example, compared to a system comprising an unmodified version of the Cas nuclease (e.g., wild-type SpCas9 nuclease, wild-type SluCas9 nuclease, wild-type SaCas9 nuclease). In some embodiments, the CRISPR-Cas system has substantially equivalent activity for inducing cleavage at an on- target site in the DNA molecule, for example, as compared to the system comprising an unmodified version of the Cas nuclease. Methods of making Cas variants with increased fidelity are known in the art. For example, in some embodiments, a method of structure- guided engineering is used to make a Cas variant with increased fidelity.
10
[0058] In some embodiments, the CRISPR-Cas system described herein comprises a Cas9 nuclease comprising one or more mutations for increased fidelity. In some embodiments, the Cas9 nuclease is derived from S. pyogenes, wherein the Cas nuclease comprises one or more mutations relative to wild-type SpCas9 for increased fidelity. In some embodiments, the Cas9 nuclease is derived from S. aureus, wherein the Cas nuclease comprises one or more mutations relative to wild-type SaCas9 for increased fidelity. In some embodiments, the Cas9 nuclease is derived from S. lugdunensis, wherein the Cas nuclease comprises one or more mutations relative to wild-type SluCas9 for increased fidelity.
[0059] A suitable Cas9 nuclease with increased fidelity for use in the present disclosure includes anyone described in US2019/0010471; US2018/0142222; US 9,944,912; W02020/057481 ; US2019/0177710; US2018/0100148; US 10,526,591 ; and US20200149020; each of which is incorporated herein by reference in their entirety. [0060] In some embodiments, the CRISPR-Cas nuclease can be engineered by one or more amino acid substitutions, deletions, and/or insertions to have improved targeting specificity, improved fidelity, altered PAM specificity, decreased off-target effects, and/or increased stability. Non-limiting examples of one or more mutations that improve targeting specificity, improve fidelity, and/or decrease off-target effects include N497A, R661A, Q695A, K810A, K848A, K855A, Q926A, K1003A, R1060A, and/or D1135E (with reference to the numbering system of SpCas9).
[0061] The CRISPR-Cas nuclease can be linked to one or more nuclear localization signal (NLS) at or within 50 amino acids from the N-terminal end and/or the C-terminal end. NLSs are well known in the art. For example, the NLS can be the SV40 Large T- antigen NLS, nucleoplasmin NLS, c-Myc NLS, or derivatives thereof. The linkage between the CRISPR nuclease and the NLS can be a direct or it can be indirect via an intervening linker sequence. Suitable linker sequences are well known in the art.
[0062] Exemplary CRISPR-Cas nucleases include the Cas9 proteins as published in Fonfara et al. , “Phylogeny of Cas9 determines functional exchangeability of dual-RNA and Cas9 among orthologous type II CRISPR-Cas systems,” Nucleic Acids Research, 2014, 42: 2577-2590. The CRISPR-Cas gene naming system has undergone extensive
11 rewriting since the Cas genes were discovered. Fonfara et al. also provides PAM sequences for the Cas9 polypeptides from various species.
[0063] A gRNA comprises at least (i) a spacer sequence that hybridizes to a target sequence and (ii) a CRISPR repeat sequence (crRNA). In Type II CRISPR-Cas systems, the gRNA also comprises a tracrRNA (trans-activating RNA) sequence. In Type II CRISPR-Cas systems, the CRISPR repeat sequence and tracrRNA sequence hybridize to each other to form a duplex. A type II system gRNA can be a single molecule (i.e. , single molecule gRNA or sgRNA) or can comprise two separate molecules (e.g., crRNA and tracrRNA). In Type V CRISPR-Cas systems, a crRNA sequence forms a duplex. In both systems, the duplex binds the CRISPR-Cas nuclease such that the gRNA and the CRISPR-Cas nuclease form a complex. The gRNA, thus, directs the activity of the CRISPR-CAS nuclease and provides specificity in targeting the CRISPR-Cas nuclease to the targeted genomic location.
[0064] Each gRNA comprises a spacer sequence that defines the target sequence of a target nucleic acid (or target locus). The “target sequence” is adjacent to a PAM sequence in the target DNA and is cleaved by the CRISPR-Cas nuclease. The “target nucleic acid” is a double-stranded molecule; one strand comprises the target sequence and is referred to as the “PAM strand,” and the other complementary strand is referred to as the “non-PAM strand.” One of skill in the art recognizes that the gRNA spacer sequence hybridizes to the reverse complement of the target sequence, which is located in the non-PAM strand of the target nucleic acid. The spacer of a gRNA interacts with a target nucleic acid in a sequence-specific manner via hybridization (i.e., base pairing). The nucleotide sequence of the spacer thus varies depending on the target sequence of the target locus.
[0065] In some embodiments, the spacer sequence of the gRNA may be from about 15 to about 30 nucleotides long, or about 18 to 22 nucleotides long. In some embodiments, the spacer sequence is 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides. In some embodiments, a spacer sequence is 18, 19, 20, or 21 nucleotides long. In some embodiments, the spacer sequence is 20 nucleotide long. (See, e.g., Jinek et al., Science, 337, 816-821 (2012) and Deltcheva et al., Nature, 471 ,
12
602-607 (2011 )). In general, the spacer sequence has at least about 90%, at least about 95%, or at least about 99% sequence identity to the target sequence in the target nucleic acid. In certain embodiments, the spacer sequence has 100% sequence identity to the target sequence.
[0066] In some embodiments, a single molecule gRNA in a Type II CRISPR-Cas system can comprise, in the 5' to 3' direction, an optional spacer extension sequence, a spacer sequence, a minimum CRISPR repeat sequence, a single-molecule guide linker, a minimum tracrRNA sequence, a 3’ tracrRNA sequence and an optional tracrRNA extension sequence. The optional tracrRNA extension may comprise elements that contribute additional functionality (e.g., stability) to the guide RNA. The guide linker links the minimum CRISPR repeat and the minimum tracrRNA sequence to form a hairpin structure. The optional tracrRNA extension may comprise one or more hairpins.
[0067] A double molecule gRNA in a Type II CRISPR-Cas system can comprise a first strand comprising, in the 5' to 3' direction, an optional spacer extension sequence, a spacer sequence and a minimum CRISPR repeat sequence (crRNA), and a second strand comprising a minimum tracrRNA sequence (complementary to the minimum CRISPR repeat sequence), a 3’ tracrRNA sequence and an optional tracrRNA extension sequence.
[0068] A gRNA in a Type V CRISPR-Cas system can comprise, in the 5' to 3' direction, a minimum CRISPR repeat sequence (crRNA) and a spacer sequence.
[0069] The gRNA, in some embodiments, can comprise one or more uracil residues at the 3’ end of the gRNA sequence. For example, the gRNA may comprise one (U), two (UU), three (UUU), four (UUUU) or more uracils at the 3’ end of the gRNA sequence. In some embodiments, the gRNA comprises 5, 6, 7, or 8 uracils at the 3’ end of the gRNA sequence. In some embodiments, the gRNA comprises 1 to 8, 2 to 8, 3 to 8, or 4 to 8 uracils at the 3’ end of the gRNA sequence.
[0070] The gRNA can be unmodified or modified. For example, modified gRNAs may comprise one or more 2'-0-methyl phosphorothioate nucleotides. For example, the gRNA may comprise 2'-0-methyl-phosphorothioate residues at the 5' end/or the 3' end.
13
In some embodiments, the gRNA comprises three 2'-0-methyl-phosphorothioate residues at the 5' end and 2'-0-methyl-phosphorothioate residues at the 3' end.
[0071 ] Zinc finger nucleases (ZFNs). Among the various types of modular nucleases are ZFNs, which are proteins comprised of an engineered zincfinger DNA binding domain linked to the catalytic domain of the type II endonuclease Fokl. Because Fokl functions only as a dimer, a pair of ZFNs are engineered to bind to cognate target “half-site” sequences on opposite DNA strands and with precise spacing between them to enable the catalytically active Fokl dimer to form. Upon dimerization of the Fokl domain, which itself has no sequence specificity per se, a DNA double-strand break is generated between the ZFN half-sites as the initiating step in genome editing.
[0072] The DNA binding domain of each ZFN is typically comprised of 3-6 zinc fingers of the abundant Cys2-His2 architecture, with each finger primarily recognizing a triplet of nucleotides on one strand of the target DNA sequence, although cross-strand interaction with a fourth nucleotide also can be important. Alteration of the amino acids of a finger in positions that make key contacts with the DNA alters the sequence specificity of a given finger. Thus, a four-finger zinc finger protein will selectively recognize a 12 bp target sequence, where the target sequence is a composite of the triplet preferences contributed by each finger, although triplet preference can be influenced to varying degrees by neighboring fingers. An important aspect of ZFNs is that they can be readily re-targeted to almost any genomic address simply by modifying individual fingers. In most applications of ZFNs, proteins of 4-6 fingers are used, recognizing 12-18 bp respectively. Hence, a pair of ZFNs will typically recognize a combined target sequence of 24-36 bp, not including the typical 5-7 bp spacer between half-sites. The binding sites can be separated further with larger spacers, including 15-17 bp. A target sequence of this length is likely to be unique in the human genome, assuming repetitive sequences or gene homologs are excluded during the design process. Nevertheless, the ZFN protein-DNA interactions are not absolute in their specificity so off-target binding and cleavage events do occur, either as a heterodimer between the two ZFNs, or as a homodimer of one or the other of the ZFNs. The latter possibility has been effectively eliminated by engineering the dimerization interface of the Fokl domain to create “plus” and “minus” variants, also
14 known as obligate heterodimer variants, which can only dimerize with each other, and not with themselves. Forcing the obligate heterodimer prevents formation of the homodimer. This has greatly enhanced specificity of ZFNs, as well as any other nuclease that adopts these Fokl variants.
[0073] A variety of ZFN-based systems have been described in the art, modifications thereof are regularly reported, and numerous references describe rules and parameters that are used to guide the design of ZFNs; see, e.g., Segal et al. , Proc Natl Acad Sci, 1999 96(6):2758-63; Dreier et al., J Mol Biol., 2000, 303(4):489-502; Liu et al., J Biol Chem., 2002, 277(6):3850-6; Dreier et al., J Biol Chem., 2005, 280(42):35588-97; and Dreier et al., J Biol Chem. 2001 , 276(31 ):29466-78.
[0074] Transcription activator-like effector nucleases (TALENS). Another type of modular nucleases are TALENs, which like ZFNs comprise an engineered DNA binding domain linked to the Fokl nuclease domain. As such, a pair of TALENs operate in tandem to achieve targeted DNA cleavage. The major difference from ZFNs is the nature of the DNA binding domain and the associated target DNA sequence recognition properties. The TALEN DNA binding domain derives from TALE proteins, which were originally described in the plant bacterial pathogen Xanthomonas sp. TALES are comprised of tandem arrays of 33-35 amino acid repeats, with each repeat recognizing a single base pair in the target DNA sequence that is typically up to 20 bp in length, giving a total target sequence length of up to 40 bp. Nucleotide specificity of each repeat is determined by the repeat variable diresidue (RVD), which includes just two amino acids at positions 12 and 13. The bases guanine, adenine, cytosine and thymine are predominantly recognized by the four RVDs: Asn-Asn, Asn-lle, His-Asp and Asn-Gly, respectively. This constitutes a much simpler recognition code than for zinc fingers, and thus represents an advantage over the latter for nuclease design. Nevertheless, as with ZFNs, the protein- DNA interactions of TALENs are not absolute in their specificity, and TALENs have also benefitted from the use of obligate heterodimer variants of the Fokl domain to reduce off- target activity.
[0075] Additional variants of the Fokl domain have been created that are deactivated in their catalytic function. If one half of either a TALEN or a ZFN pair contains an inactive
15
Fokl domain, then only single-strand DNA cleavage (nicking) will occur at the target site, rather than a DSB. The outcome is comparable to the use of CRISPR-Cas nickase mutants in which one of the Cas9 cleavage domains has been deactivated.
[0076] A variety of TALEN-based systems have been described in the art, and modifications thereof are regularly reported; see, e.g., Boch, Science, 2009 326(5959): 1509-12; Mak et al., Science, 2012, 335(6069)716-9; and Moscou et al., Science, 2009, 326(5959): 1501.
[0077] Homing endonucleases (HEs). HEs are sequence-specific endonucleases that have long recognition sequences (14-44 base pairs) and cleave DNA with high specificity - often at sites unique in the genome. There are at least six known families of HEs as classified by their structure, including GIY-YIG, His-Cis box, H-N-H, PD-(D/E)xK, and Vsr- like that are derived from a broad range of hosts, including eukarya, protists, bacteria, archaea, cyanobacteria and phage. As with ZFNs and TALENs, HEs can be used to create a DSB at a target locus as the initial step in genome editing. The large target sequence of HEs and the specificity that they offer have made them attractive candidates to create site-specific DSBs.
[0078] A variety of HE-based systems have been described in the art, and modifications thereof are regularly reported; see, e.g., the reviews by Steentoft et al., Glycobiology, 2014, 24(8):663-80; Belfort and Bonocora, Methods Mol Biol., 2014, 1123:1-26; and Hafez and Hausner, Genome, 2012, 55(8):553-69, which are each incorporated by reference in their entirety.
[0079] Hybrid nucleases. Nucleases can be engineered to comprise a fusion of a meganuclease (Mega) domain, a TALE DNA binding domain, and/or a homing endonuclease domain. The MegaTAL platform and Tev-mTALEN platforms utilize fusion of TALE DNA binding domains and catalytically active HEs, taking advantage of both the tunable DNA binding and specificity of the TALE, as well as the cleavage sequence specificity of the HE; see, e.g., Boissel et al., Nucleic Acids Res., 2014, 42: 2591-2601 ; Kleinstiveret al., G3, 2014, 4:1155-65; and Boissel and Scharenberg, Methods Mol. Biol., 2015, 1239: 171-96, which are each incorporated by reference in their entirety. The MegaTev architecture is the fusion of a meganuclease with the nuclease domain derived
16 from the GIY-YIG homing endonuclease l-Tevl (Tev). The two active sites are positioned ~30 bp apart on a DNA substrate and generate two DSBs with non-compatible cohesive ends; see, e.g., Wolfs et al. , Nucleic Acids Res., 2014, 42, 8816-29, incorporated herein by reference in its entirety. It is anticipated that other combinations of existing nuclease- based approaches will evolve and be useful in achieving the targeted genome modifications described herein.
[0080] As further example, fusion of the TALE DNA binding domain to a catalytically active HE, such as l-Tevl, takes advantage of both the tunable DNA binding and specificity of the TALE, as well as the cleavage sequence specificity of l-Tevl, with the expectation that off-target cleavage can be further reduced.
X-family DNA Polymerase
[0081] The methods disclosed herein further comprises delivering to the cells an X- family DNA polymerase or functional fragment thereof or a nucleic acid encoding an X family polymerase or functional fragment thereof. The X-family of DNA polymerases have similar three-dimensional structures and include terminal deoxyribonucleotidyl transferase (TdT), polymerase lambda (Pol l), and polymerase mu (Pol m). Pol l and Pol m are responsible for polymerizing nucleotide addition at the site of a dsDNA break as part of the normal cellular DNA repair processes. TdT uniquely provides junctional diversity during VJD recombination in T- and B- cell precursors by adding nucleotides to the 3’ termini of dsDNA breaks in a template-free manner. In humans, terminal transferase is encoded by the DNTT gene. In several species including humans, there are multiple isoforms of TdT resulting from alternative mRNA splicing. The short form (TdTS) consists of 509 amino acids; long form 1 (TdTL1) contains exon 12 and consists of 518 amino acids; long form 2 (TdTL2) contains exon 7 and consists of 527 amino acids; and long form 3 (TdTL3) contains both exons 7 and 13 and consists of 536 amino acids. [0082] In some embodiments, the X-family DNA polymerase is Pol l. In other embodiments, the X-family DNA polymerase is Pol m. In certain embodiments, the X- family DNA polymerase is TdT, for example human TdT. In some embodiment, the X-
17 family DNA polymer is a long isoform of human TdT. In other embodiments, the X-family
DNA polymerase is a combination of TdTS and either of TdTL1 , TdTL2, or TdTL3. In specific embodiments, the X-family DNA polymerase is the short isoform (TdTS) of human TdT. The sequence of the 509 amino acid hTdTS protein is presented below.
MDPPRASHLSPRKKRPRQTGALMASSPQDIKFQDLVVFILEKKMGTTRRAFLMELARR
KGFRVENELSDSVTHIVAENNSGSDVLEWLQAQKVQVSSQPELLDVSWLIECIRAGKP
VEMTGKHQLVVRRDYSDSTNPGPPKTPPIAVQKISQYACQRRTTLNNCNQIFTDAFDIL
AENCEFRENEDSCVTFMRAASVLKSLPFTIISMKDTEGIPCLGSKVKGIIEEIIEDGESSE
VKAVLNDERYQSFKLFTSVFGVGLKTSEKWFRMGFRTLSKVRSDKSLKFTRMQKAGF
LYYEDLVSCVTRAEAEAVSVLVKEAVWAFLPDAFVTMTGGFRRGKKMGHDVDFLITSP
GSTEDEEQLLQKVMNLWEKKGLLLYYDLVESTFEKLRLPSRKVDALDHFQKCFLIFKLP
RQRVDSDQSSWQEGKTWKAIRVDLVLCPYERRAFALLGWTGSRQFERDLRRYATHE
RKMILDNHALYDKTKRIFLKAESEEEIFAHLGLDYIEPWERNA (SEQ ID NO: 34).
Encoding Nucleic Acids
[0083] The two or more programmable gene editing systems and/or the X-family DNA polymerase can be provided to the cells as encoding nucleic acids. In some embodiments, proteins of the gene editing systems and/or the X-family DNA polymerase can be coded by mRNA. In some embodiments, proteins of the gene editing systems and/or the X-family DNA polymerase can be coded by DNA. In embodiments in which the programmable gene editing systems are CRISPR-Cas systems, the two or more gRNAs can be encoded by DNA.
[0084] In embodiments in which the encoding nucleic acid is mRNA, the mRNA can be 5’ capped and/or 3’ polyadenylated. The mRNA can be single stranded and linear. In embodiments in which the encoding nucleic acid is DNA, the DNA can be single stranded, double stranded, linear, or circular. In embodiments in which the gene editing system is a CRISPR-Cas system, the DNA encoding the CRISPR-Cas nuclease generally is codon optimized for expression in the cells of interest.
[0085] In embodiments in which the encoding nucleic acid is mRNA, the mRNA can be synthesized by chemical means, as described in the art. While chemical synthetic procedures are continually expanding, purification of such RNAs by procedures such as
18 high-performance liquid chromatography (HPLC, which avoids the use of gels such as PAGE) tends to become more challenging as polynucleotide lengths increase significantly beyond a hundred or so nucleotides. One approach used for generating RNAs of greater length is to produce two or more molecules that are ligated together. Much longer RNAs are more readily generated enzymatically in vitro. Various types of RNA modifications can be introduced during or after chemical synthesis and/or enzymatic generation of RNAs, e.g., modifications that enhance stability and the like.
[0086] In other embodiments, the encoding nucleic acid can be provided as part of vector system. The term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid", which refers to a circular double-stranded DNA loop into which additional nucleic acid segments can be ligated. Another type of vector is a viral vector, wherein additional nucleic acid segments can be ligated into the viral genome. Viral vectors can be RNA or DNA. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome.
[0087] In some instances, vectors can be capable of directing the expression of nucleic acids to which they are operatively linked. Such vectors are referred to as "recombinant expression vectors", or more simply "expression vectors", which serve equivalent functions.
[0088] The term "operably linked" means that the nucleotide sequence of interest is linked to regulatory sequence(s) in a manner that allows for expression of the nucleotide sequence. The term "regulatory sequence" is intended to include, for example, promoters, enhancers and other expression control elements (e.g., polyadenylation signals). Such regulatory sequences are well known in the art and are described, for example, in Goeddel; Gene Expression Technology: Methods in Enzymology, 1990, 185, Academic Press, San Diego, CA. Regulatory sequences include those that direct constitutive expression of a nucleotide sequence in many types of host cells, and those
19 that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue- specific regulatory sequences). It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the target cell, the level of expression desired, and the like.
[0089] Suitable expression vectors include, but are not limited to, viral vectors based on vaccinia virus, poliovirus, adenovirus, adeno-associated virus (AAV), SV40, herpes simplex virus, human immunodeficiency virus, retrovirus (e.g., Murine Leukemia Virus, spleen necrosis virus, and vectors derived from retroviruses such as Rous Sarcoma Virus, Harvey Sarcoma Virus, avian leukosis virus, a lentivirus, human immunodeficiency virus, myeloproliferative sarcoma virus, and mammary tumor virus) and other recombinant vectors. Other vectors contemplated for eukaryotic target cells include, but are not limited to, the vectors pXT1 , pSG5, pSVK3, pBPV, pMSG, and pSVLSV40 (Pharmacia). Other vectors can be used so long as they are compatible with the host cell.
[0090] In some examples, a vector can comprise one or more transcription and/or translation control elements. Depending on the host/vector system utilized, any of a number of suitable transcription and translation control elements, including constitutive and inducible promoters, transcription enhancer elements, transcription terminators, etc. can be used in the expression vector. Non-limiting examples of suitable eukaryotic promoters (i.e. , promoters functional in eukaryotic cells) include those from cytomegalovirus (CMV) immediate early, herpes simplex virus (HSV) thymidine kinase, early and late SV40, long terminal repeats (LTRs) from retrovirus, human elongation factor-1 a promoter (EF1a), chicken beta-actin promoter (CBA), ubiquitin C promoter (UBC), a hybrid construct comprising the cytomegalovirus enhancer fused to the chicken beta-actin promoter (CAG), murine stem cell virus promoter (MSCV), phosphoglycerate kinase-1 locus promoter (PGK), and mouse metallothionein-l promoter. In other embodiments, the promoter can be an inducible promoter (e.g., a heat shock promoter, tetracycline-regulated promoter, steroid-regulated promoter, metal-regulated promoter, estrogen receptor-regulated promoter, etc.), a tissue specific promoter, or a cell type specific promoter.
20
[0091] In other embodiments, DNA encoding gRNAs for two or more CRISPR-Cas systems can be operably linked to a promoter sequence that is recognized by RNA polymerase III (Pol III). Examples of suitable Pol III promoters include, but are not limited to, mammalian U6, U3, H1 , and 7SL RNA promoters. In some embodiments, DNA encoding an array of two or more gRNAs can be operably linked to a Pol III promoter.
Delivering to the Cells
[0092] As detailed above, the two or more programmable gene editing systems can be delivered to the cells as proteins or RNP complexes, encoding mRNA, or encoding DNA and the X-family DNA polymerase can be delivered to the cells as protein, encoding RNA, or encoding DNA.
[0093] In some embodiments, two programmable gene editing systems are delivered to the cells. In other embodiments, three programmable gene editing systems are delivered to the cells. In further embodiments, four programmable gene editing systems are delivered to the cells. In still other embodiments, five programmable gene editing systems are delivered to the cells. In additional some embodiments, six or more programmable gene editing systems are delivered to the cells.
[0094] The two or more programmable gene editing systems and the X-family DNA polymerase can be delivered to the cells by a variety of methods. Suitable methods include electroporation (e.g., nucleofection), lipofection, sonoporation, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation conjugates, lipidmucleic acid conjugates, naked DNA, naked RNA, capped RNA, artificial virions, and agent- enhanced uptake of DNA. In specific embodiments, the gene editing system can be delivered to the cells of interest via electroporation.
[0095] In embodiments in which the two or more programmable gene editing systems are CRISPR-Cas systems, the two or more CRISPR-Cas systems can comprise the same CRISPR-Cas nuclease (e.g., SpCas9) or different CRISPR-Cas nucleases (e.g., SpCas9 and SaCas9, etc.), wherein each gRNA is engineered to complex with a specific Cas9 protein. The number of CRISPR-Cas systems is determined by the number of different gRNAs. In some embodiments, two different gRNAs are provided to the cells. In other
21 embodiments, three different gRNAs are provided to the cells. In still other embodiments, four different gRNAs are provided to the cells. In additional embodiments, five different gRNAs are provided to the cells. In further embodiments, six or more different gRNAs are provided to the cells.
[0096] In embodiments in which the programmable gene editing systems are CRISPR- Cas systems, the two or more CRISPR-Cas systems can be delivered to the cells as (i) RNP complexes, each comprising a CRISPR-Cas nuclease and a gRNA; (ii) mRNA encoding the CRISPR-Cas nuclease and two or more gRNAs; (iii) mRNA encoding the CRISPR-Cas nuclease and DNA encoding the two or more gRNAs; or (iv) DNA encoding the CRISPR-Cas nuclease and DNA encoding the two or more gRNAs.
[0097] In embodiments in which the CRISPR-Cas system is delivered as RNP, the weight ratio of each gRNA to the CRISPR-Cas nuclease can range from about 1 :1 to about 10:1. In various embodiments, the weight ratio of each gRNA to the CRISPR-Cas nuclease can be about 1 :1 , about 1.5:1 , abut 2:1 , about 2.5:1 , about 3:1 , about 3.5:1 , about 4:1 , about 4.5:1, about 5:1 , about 5.5:1 , about 6:1 , about 6.5:1 , about 7:1 , about 7.5:1 , about 8:1 , about 8.5:1 , about 9:1 , about 9.5:1 , or about 10:1.
[0098] In the above-mentioned embodiments in which the programmable gene editing systems are CRISPR-Cas systems, the X-family DNA polymerase generally is delivered as encoding mRNA. In some embodiments, however, the X-family DNA polymerase can be delivered as encoding DNA or purified protein.
[0099] In still further embodiments, the X-family DNA polymerase can be fused to the CRISPR-Cas nuclease. The X-family DNA polymerase or a functional fragment thereof can be fused to the amino terminus, carboxy terminus, or both of the CRISPR-Cas nuclease. The fusion can be direct or via a linker, which are well known in the art. The X-family DNA polymerase/CRISPR-Cas fusion can be delivered to the cells as a protein, encoding mRNA, or encoding DNA.
Cells
[00100] A variety of cells are suitable for use in the methods disclosed herein. In general, the cells are mammalian cells. In specific embodiments, the cells are human
22 cells. Generally, the cells are other than immature, pre-B or pre-T lymphoid cells, leukemia cells, lymphoma cells, or immortalized cells derived from any of the foregoing. [00101] In some embodiments, the cells are primary cells isolated directly from human (or animal) tissue. In various embodiments, the cells used in the methods or compositions disclosed herein can be CD34+ HSPCs, T cells, NK cells, monocytes, macrophages, microglial cells, neutrophils, eosinophils, basophils, dendritic cells, or adipocytes. In certain embodiments, the cells can be T cells, e.g., human T cells. The T cells can be helper T cells or cytotoxic T cells. In other embodiments, the cells can be NK cells, e.g., human NK cells.
[00102] In further embodiments, the primary cells can be, without limit, adipocytes, astrocytes, blood cells, chondrocytes, endothelial cells, epithelial cells, fibroblasts, hair cells, hepatocytes, keratinocytes, melanocyte, myocytes, neurons, osteoblasts, skeletal muscle cells, smooth muscle cells, stem cells, or synoviocytes.
[00103] In still other embodiments, the cells can be stem cells (e.g., embryonic stem cells, fetal stem cells, amniotic stem cells, or umbilical cord stem cells). In certain embodiments, the stem cells can be adult stem cells isolated from bone marrow, adipose tissue, or blood. In still other embodiments, the stem cells can be induced pluripotent stem cells (e.g., human iPSCs).
[00104] In other embodiments, the cells may be hematopoietic stem and progenitor cells (HSPCs) or hematopoietic stem cells (HSCs). HSPCs give rise to all blood cell types, including erythroid (erythrocytes or red blood cells (RBCs)), myeloid (monocytes and macrophages, neutrophils, basophils, eosinophils, megakaryocytes / platelets, and dendritic cells), and lymphoid (T-cells, B-cells, NK-cells). Blood cells are produced by the proliferation and differentiation of a very small population of pluripotent HSCs that also have the ability to replenish themselves by self-renewal. During differentiation, the progeny of HSCs progress through various intermediate maturational stages, generating multi-potential and lineage-committed progenitor cells prior to reaching maturity. Bone marrow (BM) is the major site of hematopoiesis in humans and, under normal conditions, only small numbers of HSPCs can be found in the peripheral blood (PB). Treatment with cytokines (in particular granulocyte colony-stimulating factor; G-CSF), some
23 myelosuppressive drugs used in cancer treatment, and compounds that disrupt the interaction between hematopoietic and BM stromal cells can rapidly mobilize large numbers of stem and progenitors into the circulation. The cell surface glycoprotein CD34 is routinely used to identify and isolate HSPCs.
[00105] In other embodiments, the cells can be mesenchymal stem cells (e.g., multipotent stromal cells that can differentiate into a variety of cell types). Mesenchymal stem cells (MSCs) are adult stem cells found in the bone marrow, or isolated from other tissues such as cord blood, peripheral blood, fallopian tube, and fetal liver and lung. As multipotent stem cells, MSCs differentiate into multiple cell types including adipocytes, chondrocytes, osteocytes, and cardiomyocytes. Mesenchymal stem cells are a distinct entity to the mesenchyme, embryonic connective tissue, which is derived from the mesoderm and differentiates to form hematopoietic stem cells (HPCs).
[00106] In some embodiments, the cells used in the methods disclosed herein can be in vitro. In other embodiments, the cells used in the methods can be ex vivo. In still other embodiments, the cells used in the methods can be in situ. In yet another embodiment, the cells used in the methods can be in vivo.
Indel Profiles and Chromosomal Translocations
[00107] Upon delivery of the two or more programmable gene editing systems and the X-family DNA polymerase, the gene editing systems introduce double stranded DNA breaks at the targeted genomic loci, thereby generating cleaved genomic loci and the X- family DNA polymerase catalyzes addition of nucleotides to 3’ termini of the cleaved genomic loci, which we have found to lead to a reduction of chromosomal translocations between the two or more cleaved genomic loci.
[00108] In general, when two or more genomic loci are edited simultaneously in the presence of the X-family DNA polymerase, chromosomal translocations can be reduced by at least about 5%. In embodiments in which two loci are edited simultaneously in the presence of the X-family DNA polymerase, such as TdTS, the percent of chromosomal translocations can be reduced by at least about 5%, by at least about 10%, by at least about 15%, by at least about 20%, by at least about 35% ,by at least about 50% (2-fold
24 reduction), by at least about 2.2-fold, by at least about 2.5-fold, by at least about 3-fold, by at least about 3.5-fold, by at least about 4-fold, or at least more than 4-fold. In some embodiments, each type of translocation variant (e.g., monocentric, dicentric, acentric) can be reduced a comparable amount. In other embodiments, at least one type of variant can be reduced more than the other variants.
[00109] The addition of nucleotides by the X-family DNA polymerase produces insertion-rich indel profiles. In particular, cells comprising an X-family DNA polymerase, such as TdTS, exhibit increased percentages of nucleotide insertions, increased percentages of longer insertions, and decreased percentages of deletions at the targeted chromosomal loci as compared to cells lacking the X-family DNA polymerase.
[00110] The percentage of nucleotide insertions of more than one nucleotide can be increased by at least about 40% as compared to cells lacking the X-family DNA polymerase. In some embodiments, the percentage of insertions can be increased about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, or about 40%. The length of insertions can be increased from about 1 to about 8 to 10 nucleotides, whereas cells lacking the X-family DNA polymerase rarely exhibit insertions longer than one nucleotide. The percentage of nucleotide deletions can be decreased by at least about 30% as compared to cells lacking the X-family DNA polymerase. In some embodiments, the percentage of deletions can be decreased about 2%, about 5%, about 10%, about 15%, about 20%, about 25%, or about 30%.
[00111] While the efficiency of editing varies between different genomic loci, the efficiency does not differ significantly between cells with or without the X-family DNA polymerase. In general, the efficiency of editing is at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, or at least 95%.
(II) Compositions and Kits
[00112] Another aspect of the present disclosure encompasses compositions and kits for multiplex gene editing. For example, the present disclosure provides cells comprising two or more programmable gene editing systems and an X-family DNA polymerase as described above in section (I). The two or more programmable gene editing systems and
25 an X-family DNA polymerase can exist as DNA, RNA, protein, or combinations thereof. In some embodiments, the cells comprise two or more CRISPR-Cas systems and TdT. In other embodiments, the cells comprise two or more CRISPR-Cas systems and the short isoform of TdT (TdTS). Suitable cells are detailed above in section (l)(e).
[00113] Also provided herein are kits for multiplex gene editing. In general, the kits comprise two or more programmable gene editing systems and an X-family DNA polymerase as described above in section (I), as well as reagents for gene editing. Reagents for gene editing include appropriate buffers, dNTPs, divalent salts, and the like. The kits can also include reagents and/or primers/probes for analyzing indel patterns and/or chromosomal translocations. In some embodiments, the kits comprise two or more CRISPR-Cas systems and TdT. In other embodiments, the kits comprise two or more CRISPR-Cas systems and the short isoform of TdT (TdTS). The kits can provide the two or more CRISPR-Cas systems as RNPs, RNA, and/or DNA, and can provide the TdT or TdTS as RNA, DNA, or protein.
[00114]
[00115] DEFINITIONS
[00116] It should also be understood that, unless clearly indicated to the contrary, in any methods claimed herein that include more than one step or act, the order of the steps or acts of the method is not necessarily limited to the order in which the steps or acts of the method are recited.
[00117] The indefinite articles “a” and “an,” as used herein in the specification and in the claims, unless clearly indicated to the contrary, should be understood to mean “at least one.”
[00118] In the claims, as well as in the specification above, all transitional phrases such as “comprising,” “including,” “carrying,” “having,” “containing,” “involving,” “holding,” “composed of,” and the like are to be understood to be open-ended, i.e. , to mean including but not limited to. Only the transitional phrases “consisting of” and “consisting essentially of shall be closed or semi-closed transitional phrases, respectively, as set forth in the United States Patent Office Manual of Patent Examining Procedures, Section 2111 .03.
26
[00119] The terms “about” and “substantially” preceding a numerical value mean ±10% of the recited numerical value.
[00120] Where a range of values is provided, each value between the upper and lower ends of the range are specifically contemplated and described herein.
[00121] As used herein, the terms “complementary” or “complementarity” refer to the association of double-stranded nucleic acids by base pairing through specific hydrogen bonds. The base paring may be standard Watson-Crick base pairing (e.g., 5’-A G T C-3’ pairs with the complementary sequence 3’-T C A G-5’). The base pairing also may be Hoogsteen or reversed Hoogsteen hydrogen bonding. Complementarity is typically measured with respect to a duplex region and thus, excludes overhangs, for example. Complementarity between two strands of the duplex region may be partial and expressed as a percentage (e.g., 70%), if only some (e.g., 70%) of the bases are complementary. The bases that are not complementary are “mismatched.” Complementarity may also be complete (i.e. , 100%), if all the bases in the duplex region are complementary.
[00122] A “gene,” as used herein, refers to a DNA region (including exons and introns) encoding a gene product, as well as all DNA regions which regulate the production of the gene product, whether or not such regulatory sequences are adjacent to coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites, and locus control regions.
[00123] The terms “nuclease” and “endonuclease” are used interchangeably herein, and refer to an enzyme that cleaves both strands of a double-stranded nucleic acid sequence.
[00124] The terms “nucleic acid” and “polynucleotide” refer to a deoxyribonucleotide or ribonucleotide polymer, in linear or circular conformation, and in either single- or double- stranded form. For the purposes of the present disclosure, these terms are not to be construed as limiting with respect to the length of a polymer. The terms can encompass known analogs of natural nucleotides, as well as nucleotides that are modified in the base, sugar and/or phosphate moieties (e.g., phosphorothioate backbones). In general, an
27 analog of a particular nucleotide has the same base-pairing specificity; i.e., an analog of A will base-pair with T.
[00125] The term “nucleotide” refers to deoxyribonucleotides or ribonucleotides. The nucleotides may be standard nucleotides (i.e., adenosine, guanosine, cytidine, thymidine, and uridine), nucleotide isomers, or nucleotide analogs. A nucleotide analog refers to a nucleotide having a modified purine or pyrimidine base or a modified ribose moiety. A nucleotide analog may be a naturally occurring nucleotide (e.g., inosine, pseudo uridine, etc.) or a non-naturally occurring nucleotide. Non-limiting examples of modifications on the sugar or base moieties of a nucleotide include the addition (or removal) of acetyl groups, amino groups, carboxyl groups, carboxymethyl groups, hydroxyl groups, methyl groups, phosphoryl groups, and thiol groups, as well as the substitution of the carbon and nitrogen atoms of the bases with other atoms (e.g., 7-deaza purines). Nucleotide analogs also include dideoxy nucleotides, 2’-0-methyl nucleotides, locked nucleic acids (LNA), peptide nucleic acids (PNA), and morpholinos.
[00126] The term "sequence identity" as used herein, indicates a quantitative measure of the degree of identity between two sequences of substantially equal length. The percent identity of two sequences, whether nucleic acid or amino acid sequences, is the number of exact matches between two aligned sequences divided by the length of the shorter sequence and multiplied by 100. An approximate alignment for nucleic acid sequences is provided by the local homology algorithm of Smith and Waterman, Advances in Applied Mathematics 2:482-489 (1981). This algorithm can be applied to amino acid sequences by using the scoring matrix developed by Dayhoff, Atlas of Protein Sequences and Structure, M. O. Dayhoff ed., 5 suppl. 3:353-358, National Biomedical Research Foundation, Washington, D.C., USA, and normalized by Gribskov, Nucl. Acids Res. 14(6):6745-6763 (1986). An exemplary implementation of this algorithm to determine percent identity of a sequence is provided by the Genetics Computer Group (Madison, Wis.) in the “BestFit” utility application. Other suitable programs for calculating the percent identity or similarity between sequences are generally known in the art, for example, another alignment program is BLAST, used with default parameters. For example, BLASTN and BLASTP can be used using the following default parameters:
28 genetic code=standard; filter=none; strand=both; cutoff=60; expect=10; Matrix=BLOSUM62; Descriptions=50 sequences; sort by=HIGH SCORE; Databases=non-redundant, GenBank+EMBL+DDBJ+PDB+GenBank CDS translations+Swiss protein+Spupdate+PIR. Details of these programs can be found on the GenBank website.
EXAMPLES
[00127] The following examples illustrate various non-limiting embodiments of the present disclosure.
Example 1. General Methods
[00128] Media/Culturing. For primary T cells, PBMCs (AllCells) were thawed and cultured in Expansion Media [T cell CTS™ OpTmizer™ T Cell Expansion Media (Thermo Fisher) supplemented with 5% Human Serum AB (Vally Biomedical), 50 ng/ml IL-2 (Miltenyi Biotec) and 10 ng/ml IL-7 (Cellgenix). Upon thawing, T cells were activated with T Cell TransAct™ (Miltenyi) for 3 days. T Cell TransAct™ was subsequently removed from culture and the cells were seeded at a density of 5 x 105 cells/ml. Jurkat cells, Clone E6-1 (ATCC TIB-152) were grown in RPMI-1640 + GlutaMax (Thermo Fisher) supplemented with 10% Fetal Bovine Serum (Thermo Fisher) and 1% Penicillin- Streptomycin (Thermo Fisher). Cells were seeded at a density of 1 x 105 cells/ml and passaged before exceeding 1 x 106 cells/ml. Preculturing of cells prior to nucleofection was followed according to recommendations by Lonza.
[00129] CRISPR Editing of Genomic Loci in Jurkat and Primary T cells. Chemically modified (three 2'-0-methyl-phosphorothioate residues at each 5’ and 3’ end) guides (Table 1) were used to edit primary T cells and Jurkat cells (ATCC® TIB-152™). For Cas9 editing by RNP, an RNP mixture of S. pyogenes Cas9 (Biomay) and sgRNA (Agilent or Biospring) was prepared at a molar ratio of 1 :5 (Cas9:sgRNA) with final concentrations of 20 pmol Cas9 and 100 pmol sgRNA. Where indicated, cells were also edited using S. pyogenes Cas9 mRNA (0.28 pmol) combined with 28 pmol of sgRNA (molar ratio of 1 :100). In one instance, the CD70 locus was edited with 0.28 pmol of mRNA encoding
29
Staphylococcus lugdunensis (Slu) combined with 28 pmol of guide “CD70_SluCas9_gRNA_3”. Gene-editing experiments that included mRNA-encoding TdT isoforms were performed by including the relevant mRNA directly in the RNP mixture.
[00130] Primary T cells and Jurkat cells were nucleofected using the Lonza4D nucleofector and the P3 primary cell and SE Cell line kits, respectively. Cells were counted using a Cellometer K2 or Cellaca MX HighThroughput Cell Counter (Nexcelom). T cells and Jurkat cells were resuspended in P3/SE buffer with supplement at a concentration of 5x104/pl and 1x104/mI, respectively. A total of 1x106 T cells or and 2x105 Jurkat cells were combined with RNP complex and transferred to a Lonza nucleofection cuvette for all editing experiments. T cells were nucleofected using program EO-115 and Jurkat cells nucleofected using program CL-120. Following electroporation, Jurkat cells were immediately recovered in RPMI media. T cells were recovered in serum-free media for 1 hour post-nucleofection before being transferred to media supplemented with serum. After 48 hours, cells were recovered, and genomic DNA was harvested using the QIAcube HT DNA extraction system and QIAamp 96 DNA QIAcube HT Kit or DNeasy Blood and Tissue Kit (Qiagen).
[00131] Nucleofection of TdT mRNA for Kinetics of TdT Expression in Primary T cells. For kinetics of TdT expression experiments, nucleofection of TdT-encoding mRNA was
30 performed as described above by adding mRNA directly to the P3 cellular mixture. In these experiments, a total of 5x106 T cells were transferred to a large Lonza nucleofection cuvette. Cells were recovered directly in T cell expansion media and 1 million T cells were harvested at the indicated time points for preparation of protein lysates.
[00132] PCR and Tracking of Indels by Decomposition (TIDE). Isolated genomic DNA was subjected to PCR to determine indel frequency by TIDE analysis (Brinkman et al. 2014, Nucleic Acids Research 42 (22): e168-e168). PCR for relevant regions was performed (1 cycle at 95°C for 3 min; 34 cycles at 98°C for 20 sec, 69°C for 15 sec, 72°C for 1 min; 1 cycle at 72°C for 3 min) according to the manufacturer’s recommendations for each commercial PCR polymerase. Resulting amplicons were examined on cast 1.2% or 2% pre-cast agarose gels. PCR clean-up of the resulting amplicons was performed using SPRIselect magnetic beads (Beckman Coulter). Sanger sequencing results were input into Tsunami along with guide sequences and resulting indel frequencies and identities were calculated by the software. Indel frequencies and editing efficiency was plotted using Prism (GraphPad). Indels were plotted as heatmaps with the indel identities on one axis and the corresponding frequency indicated by color intensity. Total editing frequencies were plotted as bar graphs.
[00133] Droplet Digital PCR ( ddPCR ). ddPCR assays were designed to detect four translocations denoted (T1 , T2, T3, and T4) from simultaneous editing of the PD1 and CD70 loci. All primers and probes were ordered from IDT. Primer pairs and a HEX- labeled reference probe were designed to count all copies of the reference amplicon within the RAG1 locus. Paired primers and a FAM-labeled probe (Table 2) were designed to polymerize amplicon across the breakpoint of the predicted translocation. Amplification reactions were formulated in 2x ddPCR Supermix for Probes (No dUTP) (BioRad). Primers and probes were introduced at a final concentration of 1.8 mM and 0.5 pM, respectively. 100-200 ng of genomic DNA from cells harvested 48-hours post- nucleofection was introduced to each ddPCR reaction along with 0.1 Unit of BamH1-HF to reduce viscosity generated by high DNA concentration. Individual reactions comprising 25 pi of ddPCR amplification mixture, were placed into an Automated Droplet Generator (BioRad). For each sample, roughly 15,000-20,000 nanoliter-sized droplets were
31 generated. Subsequently, the droplets were transferred to a 96-well plate for PCR in a thermal cycler (1 cycle at 95°C for 10 min; 40 cycles at 90°C for 30 sec, 57°C for 1 min, 72°C for 1 min; 1 cycle at 98°C for 10 min; 4°C hold). Following PCR amplification, droplets were read by QX200 Droplet Reader and analyzed using the Quantasoft™ software (BioRad), which analyzes each droplet individually using a two-color detection system (set to detect FAM and HEX). The ratio between translocation specific events (FAM) over events of reference amplicon (HEX) (x100) is the percentage of translocation. No template controls (NTCs) were included for each translocation assessed to determine background fluorescence. Data was plotted in the form of bar graphs on Prism (GraphPad).
32
[00134] Capillary Western for Immunodetection of hTdT and Tubulin. Protein lysates were prepared from cell pellets using RIPA Lysis and Extraction Buffer (Thermo Fisher) supplemented with Halt™ Protease Inhibitor Cocktail (100X) (Thermo Fisher). Protein levels were quantitated with the Pierce™ BCA Protein Assay Kit (Thermo Fisher) on the Epoch2 microplate reader (BioTek). Immunodetection of TdT and tubulin levels was determined via Simple Western technology, an automated capillary-based size sorting and immunolabeling system (ProteinSimpleTM). For TdT, the Anti-Rabbit Detection Module (ProteinSimpleTM) was used with the Anti-hTdT RabMab antibody [EPR2976Y] (ab76544) (Abeam) at 1.45 pg/ml. Tubulin was detected with the Anti-Mouse Detection Module (ProteinSimpleTM) and the monoclonal Anti-a-Tubulin clone B-5-1-2 (Millipore Sigma) at a 1 :10,000 dilution. Capillary westerns were run on the JESS device with the 12-230 kDa kits (ProteinSimpleTM) and all procedures were performed according to manufacturer’s protocol. The JESS device was associated with Compass software for device settings and raw data recording (ProteinSimple/Biotechne).
[00135] Creation of TdT Knockout Jurkat Cells. TdT-/- Jurkat cells were created by CRISPR editing of the DNTT locus. Briefly, the DNTT gene was singly edited with each
33 guide in Table 3 according to the same method detailed in “CRISPR Editing of Genomic Loci in Jurkat and Primary T cells” using S. pyogenes Cas9 for RNP formation. Editing efficiencies were established by TIDE analysis of Sanger sequencing using the primers (Table 4) and amplification protocol (1 cycle at 95°C for 3 min; 34 cycles at 98°C for 20 sec, 68°C for 45 sec, 72°C for 1 min; 1 cycle at 72°C for 3 min). Diminution of TdT expression was confirmed using “Immunodetection by Capillary Western” described above. Bulk-edited cultures displayed near-complete loss of TdT expression and were therefore used for downstream editing and translocation analysis following editing of the PD1 and CD70 loci.
[00136] In vitro transcription of mRNA. Plasmids encoding TdT isoforms and Cas9 nucleases from S. pyogenes and S. lugdunensis were synthesized by ATUM to contain an upstream T7 promoter, a downstream polyA tail, and a single Sap I restriction site adjacent to the polyA tail. Linear templates were created by digesting plasmid with Sap I
34
(New England Biolabs) and purified by phenol/chloroform extraction. In vitro transcription and purification of mRNA was performed with the MegaScript T7 Transcription Kit (Thermo Fisher).
Example 2. Jurkat and T cells Exhibit Similar Editing Efficiencies but Different Indel Profiles
[00137] Several gRNAs were designed to target the NPM1 gene locus or the ALK gene locus in human cells (Table 1). RNPs comprising SpCas9 protein and gRNA were delivered by electroporation to Jurkat and T cells as described above in Example 1. After an appropriate period of time, the edited loci were analysed by PCR and TIDE (as described above) to determine the efficiency of editing and the mutational profile (or indel spectrum) of the types and distribution of indels after repair of the dsDNA break introduced by Cas9. As shown in FIG. 1A, the editing efficiencies were similar between the two cell types at each of the loci. However, the indel profiles of T cells mainly comprised single nucleotide insertions (+1) and deletions, whereas those of Jurkat cells mainly comprised nucleotide insertions of +1 or greater (FIG. 1B).
Example 3. Jurkat Cells Exhibit Reduced Frequency of Chromosomal Translocations During Multiplex Editing at Two Loci
[00138] Simultaneous editing at two gene loci was then examined in Jurkat cells and T cells to determine whether increased insertions during editing could reduce chromosomal translocations during multiplex editing. Guide RNAs were designed to target PD1 and CD70 loci (Table 1). Multiplex editing was performed by delivering RNPs comprising SpCas9:PD1_4gRNA and SluCas9:CD70_3gRNA to Jurkat and T cells. Similar to the results presented in Example 2, the editing efficiencies were similar between the two cell types (FIG. 2A), and the Jurkat cells exhibited increased frequencies of insertions of +1 or more nucleotides at these two loci (FIG. 2B). Moreover, the Jurkat cells exhibited reduced frequencies of chromosomal translocations (FIG. 3B) as detected by droplet digital PCR (ddPCR) as described in Example 1. FIG. 3A diagrams the various translocations variants.
35
[00139] It has been reported that TdT is not active in mature lymphocytes (primary T cells) but is reactivated in some stable cell lines originating from T cells (e.g., Jurkat cells). FIG. 4 presents a Western blot confirming that Jurkat, but not T cells, express TdT.
Example 4. Editing of Jurkat Cells to Knock Out TdT
[00140] To determine whether TdT affects the indel profiles and chromosomal rearrangements in Jurkat cells, the DNTT gene locus was disrupted such that no TdT was expressed. For this, several gRNAs that target various exons were designed, as shown above in Table 3. After editing, TdT knock out was confirmed at the level of DNA and protein. TdT levels were very low in the bulk edited pool of cells (FIG. 5). Clonal outgrowth identified Jurkat (TdT-) cells.
[00141] Multiplex editing at PD1 and CD70 loci was performed in the edited Jurkat (TdT-) cells, wild type Jurkat (TdT+) cells, and primary T cells essentially as described above in Example 3. As shown in FIG. 6A and 6B, the indel profile of the Jurkat (TdT-) cells mirrored that of T cells in that the profiles comprised mainly single nucleotide insertions and deletions. The Jurkat (TdT-) cells also exhibited translocation frequencies similar to those of T cells (FIG. 6C).
Example 5. Delivery of TdT mRNA to T Cells
[00142] TdT exists in several isoforms, e.g., short isoform, long isoform 1 (which includes exon 12), long isoform 2 (which included exon 7), and long isoform 3 (which included exons 7 and 12). T cells were electroporated with 6 pmol of mRNA encoding the various TdT isoforms, as well as Cas9 mRNA. The kinetics of TdT expression relative to alpha-tubulin expression are presented in FIG. 7. Surprisingly, it was discovered that the short isoform (TdTS) persisted at higher levels for longer periods of time than the long isoforms. Lower levels of TdTS mRNA were introduced into T cells (FIG. 8A). Even at the lowest dose of 1.5 pmol, the relative abundance of TdTS to alpha-tubulin in T cells was higher than that in wild type Jurkat cells at 12 hours post electroporation (FIG. 8B).
36
Example 6. Providing T Cells with TdTS mRNA Creates Insertion Heavy Indel Profiles and Suppresses Translocations
[00143] T cells were electroporated with Cas9 editing reagents (RNPs) targeting PD1 and CD70, as described above, and 3 pmol of TdTS mRNA (with no mRNA or GFP mRNA serving as controls). The T cells provided with TdTS mRNA exhibited indel profiles with increased frequency of insertions of +1 and greater (FIG. 9A) and exhibited a three-fold decrease in translocations as compared to control cells (FIG. 9B). The editing efficiency was high at both loci under the various conditions (FIG. 9C). Unexpectedly, it was found that providing the long isoforms of TdT did not change the indel profile (FIG. 10A) or the translocate rate in T cells (FIG. 10B).
Example 7. Indel Patterns in T Cells Edited with Cas9RNP and TdTSmRNA versus Cas9mRNA+sgRNA and TdTSmRNA
[00144] Cas9 was delivered as RNP complex (Cas9protein + sgRNA) or as nucleic acids (Cas9mRNA + sgRNA) to determine whether the means of Cas9 delivery could affect editing efficiency, indel profile, and/or translocation rate. T cells were multiplex edited at PD1 and CD70 in the absence or presence of TdTSmRNA. The editing efficiency did not differ between the two types of Cas9 delivery (FIGs. 11 A, 11 B). Delivery of TdTS resulted in insertion heavy indels (FIGS. 12A-12D) and reduced frequency of translocations (FIGS. 13A, 13B). Delivery of Cas9mRNA + sgRNA appeared to increase the frequency of insertions and created longer insertions (FIGS. 12C, 12D). It was speculated that this effect may be due to co-expression of Cas9 and TdTS at about the same time, suggesting that mRNA delivery of both may be the preferred delivery mechanism.
Example 8. Multiplex Editing at Three Loci
[00145] T cells will be electroporated with TdTSmRNA and three Cas9 systems (either RNPs or Cas9mRNA + sgRNA) targeted three different genomic loci. The percent of chromosomal translocations, the indel patterns, and the editing efficiency will be analyzed as described above.
Claims
1. A method for gene editing at multiple genomic loci, the method comprising delivering to a cell:
(a) two or more CRISPR-Cas nuclease systems or two or more nucleic acids encoding two or more CRISPR-Cas nuclease systems, wherein each CRISPR-Cas nuclease system is targeted to a different genomic locus; and
(b) an X-family DNA polymerase or a nucleic acid encoding an X family DNA polymerase; wherein the two or more CRISPR-Cas nuclease systems generate cleaved genomic loci and the X-family DNA polymerase catalyzes addition of nucleotides to 3’ termini of the cleaved genomic loci, such that chromosomal translocations are reduced between the cleaved genomic loci, thereby facilitating editing at multiple genomic loci.
2. The method of claim 1 , wherein addition of nucleotides increases nucleotide insertion frequencies, increases insertion lengths, and decreases nucleotide deletion frequencies as compared to a comparable cell in which the X-family DNA polymerase is not delivered.
3. The method of claims 1 or 2, wherein chromosomal translocations are reduced by at least 5% when two genomic loci are edited as compared to a comparable cell in which the X-family DNA polymerase is not delivered.
4. The method of any one of claims 1 to 3, wherein each CRISPR-Cas nuclease system comprises a CRISPR-Cas nuclease and a gRNA.
5. The method of claim 4, wherein the CRISPR-Cas nuclease is a type II CRISPR-Cas nuclease or a type V CRISPR-Cas nuclease.
39 The method of claims 4 or 5, wherein the CRISPR-Cas nuclease is SpCas9, SluCas9, SaCas9, or a variant thereof with increased fidelity. The method of any one of claims 4 to 6, wherein the gRNA is a single molecule gRNA (sgRNA). The method of any one of claims 1 to 7, wherein each of the CRISPR-Cas nuclease systems is delivered as a ribonucleoprotein (RNP) complex. The method of any one of claims 1 to 7, wherein each of the CRISPR-Cas nuclease systems is delivered as a CRISPR-Cas nuclease and a gRNA. The method of any one of claims 1 to 7, wherein each of the CRISPR-Cas nuclease systems is delivered as mRNA encoding the CRISPR-Cas nuclease and a gRNA. The method of any one of claims 1 to 7, wherein each of the CRISPR-Cas nuclease systems is delivered as DNA encoding the CRISPR-Cas nuclease and DNA encoding the gRNA. The method of any one of claims 1 to 11 , wherein the X-family DNA polymerase is terminal deoxyribonucleotidyl transferase (TdT), polymerase lambda (Pol l), or polymerase mu (Pol m). The method of any one of claims 1 to 12, wherein the X-family DNA polymerase is a short isoform of TdT (TdtS). The method of claim 13, wherein TdTS has an amino acid sequence of SEQ ID NO: 34. The method of any one of claims 1 to 14, wherein the X-family DNA polymerase is delivered as mRNA.
40 The method of any one of claims 1 to 14, wherein the X-family DNA polymerase is delivered as protein. The method of any one of claims 1 to 14, wherein the X-family DNA polymerase is delivered as DNA. The method of any one of claims 1 to 17, wherein two CRISPR-Cas nuclease systems are delivered to the cell. The method of any one of claims 1 to 17, wherein three CRISPR-Cas nuclease systems are delivered to the cell. The method of any one of claims 1 to 17, wherein four CRISPR-Cas nuclease systems are delivered to the cell. The method of any one of claims 1 to 17, wherein five CRISPR-Cas nuclease systems are delivered to the cell. The method of any one of claims 1 to 21 , wherein delivering comprises electroporation. The method of any one of claims 1 to 22, wherein the cell is a mammalian cell. The method of any one of claims 1 to 23, wherein the cell is other than an immature, pre-B or pre-T lymphoid cell, a leukemia cell, a lymphoma cell, or an immortalized cell derived therefrom. The method of any one of claims 1 to 24, wherein the cell is a primary T cell or a NK cell. The method of any one of claims 1 to 25, wherein the cell is in vitro, ex vivo, in situ, or in vivo.
41 A cell comprising (a) two or more CRISPR-Cas nuclease systems and (b) a short isoform of terminal deoxyribonucleotidyl transferase (TdTS), wherein the CRISPR-Cas nuclease systems and the TdTS are exogenous to the cell. The cell of claim 27, wherein each CRISPR-Cas nuclease systems of (a) is a ribonucleoprotein (RNP) complex. The cell of claim 27, wherein each CRISPR-Cas nuclease system of (a) comprises CRISPR-Cas mRNA and gRNA. The cell of claim 28 or 29, wherein (b) comprises mRNA encoding TdTS The cell of claim 28 or 29, wherein (b) comprises TdTS protein. The cell of claim 27, wherein each CRISPR-Cas nuclease system of (a) comprises DNA encoding CRISPR-Cas nuclease and DNA encoding gRNA and (b) comprises DNA encoding TdTS. A kit for multiplex gene editing, the kit comprising (a) two or more CRISPR- Cas nuclease systems, and (b) a short isoform of terminal deoxyribonucleotidyl transferase (TdTS). The kit of claim 33, wherein each CRISPR-Cas nuclease systems of (a) is a RNP complex. The kit of claim 33, wherein each CRISPR-Cas nuclease system of (a) comprises CRISPR-Cas mRNA and gRNA. The kit of claim 34 or 35, wherein (b) comprises mRNA encoding TdTS
The kit of claim 34 or 35, wherein (b) comprises TdTS protein.
42 The kit of claim 33, wherein each CRISPR-Cas nuclease system of (a) comprises DNA encoding CRISPR-Cas nuclease and DNA encoding gRNA and (b) comprises DNA encoding TdTS. The kit of any one of claims 34 to 40, further comprising at least one reagent for gene editing.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163187755P | 2021-05-12 | 2021-05-12 | |
US63/187,755 | 2021-05-12 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022238958A1 true WO2022238958A1 (en) | 2022-11-17 |
Family
ID=81748436
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IB2022/054436 WO2022238958A1 (en) | 2021-05-12 | 2022-05-12 | Multiplex gene editing |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2022238958A1 (en) |
Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017151719A1 (en) * | 2016-03-01 | 2017-09-08 | University Of Florida Research Foundation, Incorporated | Molecular cell diary system |
WO2017165826A1 (en) * | 2016-03-25 | 2017-09-28 | Editas Medicine, Inc. | Genome editing systems comprising repair-modulating enzyme molecules and methods of their use |
US20180100148A1 (en) | 2016-10-07 | 2018-04-12 | Integrated Dna Technologies, Inc. | S. pyogenes cas9 mutant genes and polypeptides encoded by same |
US9944912B2 (en) | 2015-03-03 | 2018-04-17 | The General Hospital Corporation | Engineered CRISPR-Cas9 nucleases with altered PAM specificity |
US20180142222A1 (en) | 2015-06-12 | 2018-05-24 | The Regents Of The University Of California | Reporter cas9 variants and methods of use thereof |
US20190010471A1 (en) | 2015-06-18 | 2019-01-10 | The Broad Institute Inc. | Crispr enzyme mutations reducing off-target effects |
US20190177710A1 (en) | 2016-06-15 | 2019-06-13 | Toolgen Incorporated | Method For Screening Target Specific Nuclease Using Multi-Target System Of On-Target And Off-Target And Use Thereof |
WO2019118935A1 (en) | 2017-12-14 | 2019-06-20 | Casebia Therapeutics Limited Liability Partnership | Novel rna-programmable endonuclease systems and their use in genome editing and other applications |
WO2019123014A1 (en) * | 2017-12-22 | 2019-06-27 | G+Flas Life Sciences | Chimeric genome engineering molecules and methods |
WO2019152519A1 (en) * | 2018-01-30 | 2019-08-08 | Editas Medicine, Inc. | Systems and methods for modulating chromosomal rearrangements |
WO2019183150A1 (en) | 2018-03-19 | 2019-09-26 | Casebia Therapeutics Limited Liability Partnership | Novel rna-programmable endonuclease systems and uses thereof |
US10526591B2 (en) | 2015-08-28 | 2020-01-07 | The General Hospital Corporation | Engineered CRISPR-Cas9 nucleases |
WO2020057481A1 (en) | 2018-09-19 | 2020-03-26 | The University Of Hong Kong | Improved high-throughput combinatorial genetic modification system and optimized cas9 enzyme variants |
US20200149020A1 (en) | 2017-02-14 | 2020-05-14 | Universita' Degli Studi Di Trento | High-fidelity cas9 variants and applications thereof |
-
2022
- 2022-05-12 WO PCT/IB2022/054436 patent/WO2022238958A1/en unknown
Patent Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9944912B2 (en) | 2015-03-03 | 2018-04-17 | The General Hospital Corporation | Engineered CRISPR-Cas9 nucleases with altered PAM specificity |
US20180142222A1 (en) | 2015-06-12 | 2018-05-24 | The Regents Of The University Of California | Reporter cas9 variants and methods of use thereof |
US20190010471A1 (en) | 2015-06-18 | 2019-01-10 | The Broad Institute Inc. | Crispr enzyme mutations reducing off-target effects |
US10526591B2 (en) | 2015-08-28 | 2020-01-07 | The General Hospital Corporation | Engineered CRISPR-Cas9 nucleases |
WO2017151719A1 (en) * | 2016-03-01 | 2017-09-08 | University Of Florida Research Foundation, Incorporated | Molecular cell diary system |
WO2017165826A1 (en) * | 2016-03-25 | 2017-09-28 | Editas Medicine, Inc. | Genome editing systems comprising repair-modulating enzyme molecules and methods of their use |
US20190177710A1 (en) | 2016-06-15 | 2019-06-13 | Toolgen Incorporated | Method For Screening Target Specific Nuclease Using Multi-Target System Of On-Target And Off-Target And Use Thereof |
US20180100148A1 (en) | 2016-10-07 | 2018-04-12 | Integrated Dna Technologies, Inc. | S. pyogenes cas9 mutant genes and polypeptides encoded by same |
US20200149020A1 (en) | 2017-02-14 | 2020-05-14 | Universita' Degli Studi Di Trento | High-fidelity cas9 variants and applications thereof |
WO2019118935A1 (en) | 2017-12-14 | 2019-06-20 | Casebia Therapeutics Limited Liability Partnership | Novel rna-programmable endonuclease systems and their use in genome editing and other applications |
WO2019123014A1 (en) * | 2017-12-22 | 2019-06-27 | G+Flas Life Sciences | Chimeric genome engineering molecules and methods |
WO2019152519A1 (en) * | 2018-01-30 | 2019-08-08 | Editas Medicine, Inc. | Systems and methods for modulating chromosomal rearrangements |
WO2019183150A1 (en) | 2018-03-19 | 2019-09-26 | Casebia Therapeutics Limited Liability Partnership | Novel rna-programmable endonuclease systems and uses thereof |
WO2020057481A1 (en) | 2018-09-19 | 2020-03-26 | The University Of Hong Kong | Improved high-throughput combinatorial genetic modification system and optimized cas9 enzyme variants |
Non-Patent Citations (20)
Title |
---|
BELFORTBONOCORA, METHODS MOL BIOL., vol. 1123, 2014, pages 1 - 26 |
BOISSEL ET AL., NUCLEIC ACIDS RES., vol. 42, 2014, pages 8816 - 2601 |
BOISSELSCHARENBERG, METHODS MOL. BIOL., vol. 1239, 2015, pages 171 - 96 |
BRINKMAN ET AL., NUCLEIC ACIDS RESEARCH, vol. 42, no. 22, 2014, pages e168 - e168 |
DELTCHEVA ET AL., NATURE, vol. 471, 2011, pages 602 - 607 |
DREIER ET AL., J BIOL CHEM., vol. 276, no. 31, 2001, pages 29466 - 78 |
DREIER ET AL., J BIOL CHEM., vol. 280, no. 42, 2005, pages 35588 - 97 |
DREIER ET AL., J MOL BIOL., vol. 303, no. 4, 2000, pages 489 - 502 |
FONFARA ET AL.: "Phylogeny of Cas9 determines functional exchangeability of dual-RNA and Cas9 among orthologous type II CRISPR-Cas systems", NUCLEIC ACIDS RESEARCH, vol. 42, 2014, pages 2577 - 2590, XP055399937, DOI: 10.1093/nar/gkt1074 |
GOEDDEL: "Gene Expression Technology: Methods in Enzymology", vol. 185, 1990, ACADEMIC PRESS |
GRIBSKOV, NUCL. ACIDS RES., vol. 14, no. 6, 1986, pages 6745 - 6763 |
HAFEZHAUSNER, GENOME, vol. 55, no. 8, 2012, pages 553 - 69 |
JINEK ET AL., SCIENCE, vol. 335, no. 6069, 2012, pages 716 - 821 |
KLEINSTIVER ET AL., G3, vol. 4, 2014, pages 1155 - 65 |
LIU ET AL., J BIOL CHEM., vol. 277, no. 6, 2002, pages 3850 - 6 |
LOVELESS THERESA B ET AL: "Lineage tracing and analog recording in mammalian cells by single-site DNA writing", NATURE CHEMICAL BIOLOGY, NATURE PUBLISHING GROUP US, NEW YORK, vol. 17, no. 6, 22 March 2021 (2021-03-22), pages 739 - 747, XP037465311, ISSN: 1552-4450, [retrieved on 20210322], DOI: 10.1038/S41589-021-00769-8 * |
MOSCOU ET AL., SCIENCE, vol. 326, no. 5959, 2009, pages 1501 - 12 |
SEGAL ET AL., PROC NATL ACAD SCI, vol. 96, no. 6, 1999, pages 2758 - 63 |
SMITHWATERMAN, ADVANCES IN APPLIED MATHEMATICS, vol. 2, 1981, pages 482 - 489 |
STEENTOFT ET AL., GLYCOBIOLOGY, vol. 24, no. 8, 2014, pages 663 - 80 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20220033858A1 (en) | Crispr oligoncleotides and gene editing | |
Liang et al. | Rapid and highly efficient mammalian cell engineering via Cas9 protein transfection | |
AU2021231074B2 (en) | Class II, type V CRISPR systems | |
JP2021166514A5 (en) | ||
JP2023159185A (en) | Composition and method for treating hemoglobinopathy | |
EP3744844A1 (en) | Extended single guide rna and use thereof | |
JP2021517815A (en) | Lymphopoiesis manipulation using the CAS9 base editor | |
WO2020014577A1 (en) | Methods of achieving high specificity of genome editing | |
EP3983545A1 (en) | Compositions and methods for editing beta-globin for treatment of hemaglobinopathies | |
Huang et al. | Developing ABEmax-NG with precise targeting and expanded editing scope to model pathogenic splice site mutations in vivo | |
US20240016934A1 (en) | Compositions and Methods for Reducing MHC Class II in a Cell | |
US20210389303A1 (en) | Transient reporters and methods for base editing enrichment | |
US20210062248A1 (en) | Methods of performing guide-seq on primary human t cells | |
WO2022238958A1 (en) | Multiplex gene editing | |
JP2024501892A (en) | Novel nucleic acid-guided nuclease | |
US20220228142A1 (en) | Compositions and methods for editing beta-globin for treatment of hemaglobinopathies | |
CN116716298A (en) | Guide editing system and fixed-point modification method of target gene sequence | |
KR20240031238A (en) | Gene editing system comprising CRISPR nuclease and uses thereof | |
EP4251749A1 (en) | Cytidine deaminase variants for base editing | |
WO2023245108A2 (en) | Compositions and methods for reducing mhc class i in a cell |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22724143 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |