WO2022050377A1 - 標的dnaの編集方法、標的dnaが編集された細胞の製造方法、及びそれらに用いるdna編集システム - Google Patents
標的dnaの編集方法、標的dnaが編集された細胞の製造方法、及びそれらに用いるdna編集システム Download PDFInfo
- Publication number
- WO2022050377A1 WO2022050377A1 PCT/JP2021/032436 JP2021032436W WO2022050377A1 WO 2022050377 A1 WO2022050377 A1 WO 2022050377A1 JP 2021032436 W JP2021032436 W JP 2021032436W WO 2022050377 A1 WO2022050377 A1 WO 2022050377A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- tale
- sequence
- target
- dna
- deaminase
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 103
- 238000004519 manufacturing process Methods 0.000 title description 21
- 230000000694 effects Effects 0.000 claims abstract description 150
- 108020005004 Guide RNA Proteins 0.000 claims abstract description 143
- 108020004414 DNA Proteins 0.000 claims abstract description 132
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 104
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 104
- 102000004190 Enzymes Human genes 0.000 claims abstract description 76
- 108090000790 Enzymes Proteins 0.000 claims abstract description 76
- 230000000295 complement effect Effects 0.000 claims abstract description 69
- 108091033409 CRISPR Proteins 0.000 claims abstract description 66
- 125000006850 spacer group Chemical group 0.000 claims abstract description 54
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 claims abstract description 48
- 101710163270 Nuclease Proteins 0.000 claims abstract description 15
- 210000004027 cell Anatomy 0.000 claims description 95
- 210000004899 c-terminal region Anatomy 0.000 claims description 70
- 238000010442 DNA editing Methods 0.000 claims description 32
- 230000033590 base-excision repair Effects 0.000 claims description 18
- 239000003112 inhibitor Substances 0.000 claims description 16
- 230000002255 enzymatic effect Effects 0.000 claims description 3
- 102000039446 nucleic acids Human genes 0.000 abstract description 8
- 108020004707 nucleic acids Proteins 0.000 abstract description 8
- 150000007523 nucleic acids Chemical class 0.000 abstract description 8
- 238000006243 chemical reaction Methods 0.000 abstract description 5
- 108010073062 Transcription Activator-Like Effectors Proteins 0.000 description 161
- 235000001014 amino acid Nutrition 0.000 description 114
- 239000013613 expression plasmid Substances 0.000 description 110
- 150000001413 amino acids Chemical class 0.000 description 76
- 239000013612 plasmid Substances 0.000 description 60
- 229920002401 polyacrylamide Polymers 0.000 description 52
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 description 47
- 108090000623 proteins and genes Proteins 0.000 description 32
- 239000013598 vector Substances 0.000 description 29
- 125000003275 alpha amino acid group Chemical group 0.000 description 28
- 238000012360 testing method Methods 0.000 description 27
- 125000000539 amino acid group Chemical group 0.000 description 26
- 102000040430 polynucleotide Human genes 0.000 description 21
- 108091033319 polynucleotide Proteins 0.000 description 21
- 239000002157 polynucleotide Substances 0.000 description 21
- 230000035772 mutation Effects 0.000 description 19
- 235000018102 proteins Nutrition 0.000 description 17
- 102000004169 proteins and genes Human genes 0.000 description 17
- 101150063416 add gene Proteins 0.000 description 15
- 241000196324 Embryophyta Species 0.000 description 14
- 108020004705 Codon Proteins 0.000 description 13
- 238000010586 diagram Methods 0.000 description 13
- 239000013642 negative control Substances 0.000 description 13
- 101710172430 Uracil-DNA glycosylase inhibitor Proteins 0.000 description 12
- 230000014509 gene expression Effects 0.000 description 12
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 11
- 239000013604 expression vector Substances 0.000 description 10
- 108700043045 nanoluc Proteins 0.000 description 10
- 102000053602 DNA Human genes 0.000 description 9
- 239000012634 fragment Substances 0.000 description 9
- 238000002360 preparation method Methods 0.000 description 9
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 8
- 230000007423 decrease Effects 0.000 description 8
- 239000002773 nucleotide Substances 0.000 description 8
- 125000003729 nucleotide group Chemical group 0.000 description 8
- 238000007500 overflow downdraw method Methods 0.000 description 8
- 102220605874 Cytosolic arginine sensor for mTORC1 subunit 2_D10A_mutation Human genes 0.000 description 7
- 238000003780 insertion Methods 0.000 description 7
- 230000037431 insertion Effects 0.000 description 7
- 230000032965 negative regulation of cell volume Effects 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 101710149870 C-C chemokine receptor type 5 Proteins 0.000 description 6
- 102100035875 C-C chemokine receptor type 5 Human genes 0.000 description 6
- 102100026846 Cytidine deaminase Human genes 0.000 description 6
- 108010031325 Cytidine deaminase Proteins 0.000 description 6
- 102100021519 Hemoglobin subunit beta Human genes 0.000 description 6
- 101000899111 Homo sapiens Hemoglobin subunit beta Proteins 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- 108091028113 Trans-activating crRNA Proteins 0.000 description 6
- 238000006481 deamination reaction Methods 0.000 description 6
- 210000001161 mammalian embryo Anatomy 0.000 description 6
- 210000000287 oocyte Anatomy 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 6
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 5
- 102000055025 Adenosine deaminases Human genes 0.000 description 5
- QNAYBMKLOCPYGJ-UHFFFAOYSA-N Alanine Chemical compound CC([NH3+])C([O-])=O QNAYBMKLOCPYGJ-UHFFFAOYSA-N 0.000 description 5
- 229940113491 Glycosylase inhibitor Drugs 0.000 description 5
- 108091034117 Oligonucleotide Proteins 0.000 description 5
- 108700038250 PAM2-CSK4 Proteins 0.000 description 5
- 241000700159 Rattus Species 0.000 description 5
- 101100206155 Schizosaccharomyces pombe (strain 972 / ATCC 24843) tbp1 gene Proteins 0.000 description 5
- 108020004682 Single-Stranded DNA Proteins 0.000 description 5
- 239000003153 chemical reaction reagent Substances 0.000 description 5
- 230000009615 deamination Effects 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 210000003527 eukaryotic cell Anatomy 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- 101100108891 Arabidopsis thaliana PRMT11 gene Proteins 0.000 description 4
- 108020001738 DNA Glycosylase Proteins 0.000 description 4
- 102000028381 DNA glycosylase Human genes 0.000 description 4
- 108010005440 Guanosine deaminase Proteins 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- 108060004795 Methyltransferase Proteins 0.000 description 4
- 101100484946 Petunia hybrida VPY gene Proteins 0.000 description 4
- 210000004671 cell-free system Anatomy 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 101150091418 pam1 gene Proteins 0.000 description 4
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 4
- 239000013641 positive control Substances 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 229930024421 Adenine Natural products 0.000 description 3
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 3
- 244000063299 Bacillus subtilis Species 0.000 description 3
- 235000014469 Bacillus subtilis Nutrition 0.000 description 3
- 241000283690 Bos taurus Species 0.000 description 3
- 102100031151 C-C chemokine receptor type 2 Human genes 0.000 description 3
- 101710149815 C-C chemokine receptor type 2 Proteins 0.000 description 3
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 3
- 101150083327 CCR2 gene Proteins 0.000 description 3
- 101150017501 CCR5 gene Proteins 0.000 description 3
- 238000010354 CRISPR gene editing Methods 0.000 description 3
- 241000282693 Cercopithecidae Species 0.000 description 3
- 230000006820 DNA synthesis Effects 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- 108090000331 Firefly luciferases Proteins 0.000 description 3
- 101150013707 HBB gene Proteins 0.000 description 3
- 101150019065 HBD gene Proteins 0.000 description 3
- 102100039894 Hemoglobin subunit delta Human genes 0.000 description 3
- 241000699666 Mus <mouse, genus> Species 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 101150025342 UGI gene Proteins 0.000 description 3
- 229960000643 adenine Drugs 0.000 description 3
- 230000000692 anti-sense effect Effects 0.000 description 3
- 230000037429 base substitution Effects 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 239000007853 buffer solution Substances 0.000 description 3
- 210000003855 cell nucleus Anatomy 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 210000002257 embryonic structure Anatomy 0.000 description 3
- 238000010362 genome editing Methods 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 238000007852 inverse PCR Methods 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 230000008439 repair process Effects 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 229940113082 thymine Drugs 0.000 description 3
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- LRFVTYWOQMYALW-UHFFFAOYSA-N 9H-xanthine Chemical compound O=C1NC(=O)NC2=C1NC=N2 LRFVTYWOQMYALW-UHFFFAOYSA-N 0.000 description 2
- 241000251468 Actinopterygii Species 0.000 description 2
- 241000271566 Aves Species 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 241000700199 Cavia porcellus Species 0.000 description 2
- 241000699800 Cricetinae Species 0.000 description 2
- 241000938605 Crocodylia Species 0.000 description 2
- 102000000311 Cytosine Deaminase Human genes 0.000 description 2
- 108010080611 Cytosine Deaminase Proteins 0.000 description 2
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 2
- 241000283086 Equidae Species 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 241000235789 Hyperoartia Species 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- 108060001084 Luciferase Proteins 0.000 description 2
- 239000005089 Luciferase Substances 0.000 description 2
- 229920002873 Polyethylenimine Polymers 0.000 description 2
- 102100022433 Single-stranded DNA cytosine deaminase Human genes 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 241000193996 Streptococcus pyogenes Species 0.000 description 2
- 241000282887 Suidae Species 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 210000004748 cultured cell Anatomy 0.000 description 2
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 210000004602 germ cell Anatomy 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 229910052697 platinum Inorganic materials 0.000 description 2
- 239000003755 preservative agent Substances 0.000 description 2
- 230000002335 preservative effect Effects 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- FGDZQCVHDSGLHJ-UHFFFAOYSA-M rubidium chloride Chemical compound [Cl-].[Rb+] FGDZQCVHDSGLHJ-UHFFFAOYSA-M 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 210000001082 somatic cell Anatomy 0.000 description 2
- 210000000130 stem cell Anatomy 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- 101150072531 10 gene Proteins 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- MZZYGYNZAOVRTG-UHFFFAOYSA-N 2-hydroxy-n-(1h-1,2,4-triazol-5-yl)benzamide Chemical compound OC1=CC=CC=C1C(=O)NC1=NC=NN1 MZZYGYNZAOVRTG-UHFFFAOYSA-N 0.000 description 1
- WMHLZRDNWFNTCU-UHFFFAOYSA-N 2-nitroso-3,7-dihydropurin-6-one Chemical compound O=C1NC(N=O)=NC2=C1N=CN2 WMHLZRDNWFNTCU-UHFFFAOYSA-N 0.000 description 1
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 description 1
- 108010079649 APOBEC-1 Deaminase Proteins 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 101100028789 Arabidopsis thaliana PBS1 gene Proteins 0.000 description 1
- 101100139907 Arabidopsis thaliana RAR1 gene Proteins 0.000 description 1
- 235000017060 Arachis glabrata Nutrition 0.000 description 1
- 244000105624 Arachis hypogaea Species 0.000 description 1
- 235000010777 Arachis hypogaea Nutrition 0.000 description 1
- 235000018262 Arachis monticola Nutrition 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Chemical compound OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- BTBUEUYNUDRHOZ-UHFFFAOYSA-N Borate Chemical compound [O-]B([O-])[O-] BTBUEUYNUDRHOZ-UHFFFAOYSA-N 0.000 description 1
- 240000002791 Brassica napus Species 0.000 description 1
- 235000006008 Brassica napus var napus Nutrition 0.000 description 1
- 102100040397 C->U-editing enzyme APOBEC-1 Human genes 0.000 description 1
- 108091079001 CRISPR RNA Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 241000614261 Citrus hongheensis Species 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- CKLJMWTZIZZHCS-UHFFFAOYSA-N D-OH-Asp Natural products OC(=O)C(N)CC(O)=O CKLJMWTZIZZHCS-UHFFFAOYSA-N 0.000 description 1
- 102100040263 DNA dC->dU-editing enzyme APOBEC-3A Human genes 0.000 description 1
- 102100040262 DNA dC->dU-editing enzyme APOBEC-3B Human genes 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 240000006497 Dianthus caryophyllus Species 0.000 description 1
- 235000009355 Dianthus caryophyllus Nutrition 0.000 description 1
- 102100020743 Dipeptidase 1 Human genes 0.000 description 1
- 108090000204 Dipeptidase 1 Proteins 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 238000003718 Dual-Luciferase Reporter Assay System Methods 0.000 description 1
- 108091029865 Exogenous DNA Proteins 0.000 description 1
- 102100026406 G/T mismatch-specific thymine DNA glycosylase Human genes 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 239000006173 Good's buffer Substances 0.000 description 1
- 241000219146 Gossypium Species 0.000 description 1
- 244000020551 Helianthus annuus Species 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- 101000964378 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3A Proteins 0.000 description 1
- 101000964385 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3B Proteins 0.000 description 1
- 101000658622 Homo sapiens Testis-specific Y-encoded-like protein 2 Proteins 0.000 description 1
- 240000005979 Hordeum vulgare Species 0.000 description 1
- 235000007340 Hordeum vulgare Nutrition 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- 240000000249 Morus alba Species 0.000 description 1
- 235000008708 Morus alba Nutrition 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 240000008790 Musa x paradisiaca Species 0.000 description 1
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 1
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 241000251745 Petromyzon marinus Species 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 101100028790 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PBS2 gene Proteins 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 101710143275 Single-stranded DNA cytosine deaminase Proteins 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 101000910035 Streptococcus pyogenes serotype M1 CRISPR-associated endonuclease Cas9/Csn1 Proteins 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 238000010459 TALEN Methods 0.000 description 1
- 102100034917 Testis-specific Y-encoded-like protein 2 Human genes 0.000 description 1
- 108010035344 Thymine DNA Glycosylase Proteins 0.000 description 1
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 241000589634 Xanthomonas Species 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 210000002449 bone cell Anatomy 0.000 description 1
- 210000004958 brain cell Anatomy 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- -1 cationic lipid Chemical class 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 230000001461 cytolytic effect Effects 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 235000012055 fruits and vegetables Nutrition 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 210000000663 muscle cell Anatomy 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 235000020232 peanut Nutrition 0.000 description 1
- 239000008055 phosphate buffer solution Substances 0.000 description 1
- 210000001778 pluripotent stem cell Anatomy 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 208000015768 polyposis Diseases 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 125000000714 pyrimidinyl group Chemical group 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 229940102127 rubidium chloride Drugs 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- ZSDSQXJSNMTJDA-UHFFFAOYSA-N trifluralin Chemical compound CCCN(CCC)C1=C([N+]([O-])=O)C=C(C(F)(F)F)C=C1[N+]([O-])=O ZSDSQXJSNMTJDA-UHFFFAOYSA-N 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- 229940075420 xanthine Drugs 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- NWONKYPBYAMBJT-UHFFFAOYSA-L zinc sulfate Chemical compound [Zn+2].[O-]S([O-])(=O)=O NWONKYPBYAMBJT-UHFFFAOYSA-L 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/80—Fusion polypeptide containing a DNA binding domain, e.g. Lacl or Tet-repressor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/111—General methods applicable to biologically active non-coding nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
Definitions
- the present invention relates to a method for editing a target DNA, a method for producing a cell in which the target DNA has been edited, and a DNA editing system used for them. It relates to a method for producing cells and a combination used for them.
- Non-Patent Document 1 A base editing method combining CRISPR-dCas9 / nCas9 with a nucleobase converting enzyme such as deaminase was first reported in 2016 ( Komor, A. et al., Nature 533,420-424 (2016), DOI: 10). Since 1038 / nature17946 (Non-Patent Document 1)), its use has spread rapidly due to its high editing efficiency and simplicity that does not require donor DNA, and examples in various organisms and cells have been reported. At the same time, many improved systems have been developed (Rees, HA and Liu, DR, Nat Rev Genet 19,770-788 (2016), DOI: 10.1038 / s41576-018-0059- 1 (Non-Patent Document 2), etc.).
- Non-Patent Document 4 et al., Science 353, aaf8729 (2016), DOI: 10.1126 / science.aaf8729 (Non-Patent Document 4) is often used.
- APOBEC3A, APOBEC3B, etc. can be used (Gehrke, J. et al., Nat Biotechnol 36, 977-982 (2016), DOI: 10.1038 / nbt.4199 (Non-patent document). 5); Martin, AS et al., Sci Rep 9,497 (2019), DOI: 10.1038 / s41598-018-36739-9 (Non-Patent Document 6)).
- ABE Adenine Base Editor
- Non-Patent Document 8 DNA-binding proteins such as ZF and TALE do not have the property of dissociating double-stranded DNA into single strands. The efficiency of base editing by the substrate deaminase is extremely low.
- Patent Document 1 the efficiency of base editing by the system described in Patent Document 1 is about 1/3 of that of BE3 (Non-Patent Document 1), which is a conventional base editing system, even if the efficiency is the highest. It was considered that the decrease in activity was caused by.
- the present invention has been made in view of the above-mentioned problems of the prior art, and is a method capable of specifically and efficiently editing a target DNA by a nucleobase converting enzyme, and genome editing is performed using the method. It is an object of the present invention to provide a method for producing cells and a DNA editing system used for them.
- the present inventors have fused the nucleobase converting enzyme into TALE without converting it into a split, and the TALE-nucleobase converting enzyme fusion protein (preferably TALE-deaminase).
- TALE-nucleobase converting enzyme fusion protein preferably TALE-deaminase
- -UGI fusion protein dCas9 / guide RNA complex or nCas9 / guide RNA complex were both bound in the vicinity of the target site of the target DNA.
- the structure that does not split the nucleobase converting enzyme is based on the previously reported highly active system AncBE4max, for example, in the case of deaminase, but this is not directly linked to dCas9 or nCas9, but is linked to TALE.
- the present inventors have a spacer 1 between the TALE recognition sequence (or its complementary sequence) and the target site on the target DNA. And by examining the position of the RNA target sequence on the target DNA, it was demonstrated that the base editing efficiency equivalent to that of the conventional AncBE4max system could be achieved (maximum activity by the reporter assay), and the present invention was completed.
- the aspects of the present invention obtained from such findings are as follows.
- the TALE recognition sequence or its complementary sequence recognized by TALE in the fusion protein is present on the 5'side of the target site via a spacer 1 having a chain length of 7 to 31 bp, and
- the guide RNA target sequence recognized by the guide RNA in the CRISPR-Cas9 system is present to contain the complementary base of the target site.
- [2] A method of producing cells in which the target DNA has been edited.
- Fusion proteins containing TALE and nucleobase converting enzymes and (2)
- a CRISPR-Cas9 system containing a Cas9 protein having lost part or all of its nuclease activity and its guide RNA is introduced into a cell or expressed intracellularly and brought into contact with a target DNA to convert the nucleobase of the fusion protein. It comprises the step of editing the base of the target site of the target DNA by enzymatic activity.
- the TALE recognition sequence or its complementary sequence recognized by TALE in the fusion protein is present on the 5'side of the target site via a spacer 1 having a chain length of 7 to 31 bp, and
- the guide RNA target sequence recognized by the guide RNA in the CRISPR-Cas9 system is present to contain the complementary base of the target site.
- nucleobase converting enzyme in the fusion protein is deaminase.
- a DNA editing system for use in the method according to any one of [1] to [6].
- Fusion proteins containing TALE and nucleobase converting enzymes and (2) A DNA editing system comprising a CRISPR-Cas9 system comprising a Cas9 protein having lost some or all of its nuclease activity and a guide RNA thereof.
- nucleobase converting enzyme in the fusion protein is deaminase.
- a method capable of specifically and efficiently editing a target DNA by a nucleobase converting enzyme, a method for producing a genome-edited cell using the method, and DNA editing used for them It becomes possible to provide a system.
- TALE-deaminase activity value (ratio to the AncBE4max activity value) obtained in Test 4 and the length of linker 1.
- It is a schematic diagram of the structure of NanoLuc expression reporter and one aspect of base editing by TALE-deaminase. It is a conceptual diagram which shows one aspect (A, B) of the structure of TALE-deaminase respectively.
- It is a graph which shows the TALE-deaminase activity value (ratio to the AncBE4max activity value) obtained in test 5 ((A) BE4-TALE-WT: BE4-32-WT, BE4-135-WT, BE4-234-WT.
- BE4-TALE-63 BE4-32-63, BE4-135-63, BE4-234-63). It is a graph which shows the TALE-deaminase activity value (the ratio to the AncBE4max activity value or the ABE8e activity value) obtained in the test 5 ((C) BE4-TALE-47: BE4-32-47, BE4-135-47, BE4 -234-47, (D) ABE8e-TALE-47: ABE8e-32-47, ABE8e-135-47, ABE8e-234-47).
- Target region (AS5 (d), AS14 (e), AS6 (f), AS7 (g)) on endogenous DNA (CCR5, HBB), position of TALE recognition sequence and its complementary sequence, target sequence of guide RNA
- A 1st base
- B 10th base
- sgRNA-AS3 11 of the complementary sequence of the target sequence of the guide RNA
- the target DNA editing method of the present invention is It is a method of editing the target DNA, (1) Fusion proteins containing TALE and nucleobase converting enzymes, and (2) A CRISPR-Cas9 system containing a Cas9 protein having lost part or all of its nuclease activity and its guide RNA is brought into contact with the target DNA, and the nucleobase converting enzyme activity of the fusion protein causes the target site of the target DNA to be contacted.
- the TALE recognition sequence or its complementary sequence recognized by TALE in the fusion protein is present on the 5'side of the target site via a spacer 1 having a chain length of 7 to 31 bp, and
- the guide RNA target sequence recognized by the guide RNA in the CRISPR-Cas9 system is present to contain the complementary base of the target site.
- the fusion protein according to the present invention is a fusion protein containing TALE and a nucleobase converting enzyme.
- the fusion protein according to the present invention is, in some cases, referred to as "TALE-nucleobase converting enzyme fusion protein" or "TALE-nucleobase converting enzyme”.
- the fusion protein may be bound in the order of TALE and nucleobase converting enzyme from the N-terminal, or may be bound in the order of nucleobase converting enzyme and TALE.
- the fusion protein according to the present invention may contain two or more (preferably two) nucleobase converting enzymes, and in this case, the nucleobase converting enzyme may contain only one type or two types.
- the fusion protein according to the present invention contains two nucleobase converting enzymes (first nucleobase converting enzyme, second nucleobase converting enzyme), these are the first nucleobase converting enzyme, TALE, the first. It may be bound in the order of 2 nucleobase converting enzymes. Among these, the fusion protein according to the present invention is preferably bound in the order of TALE and nucleobase converting enzyme from the N-terminal, or in the order of nucleic acid-based converting enzyme and TALE from the N-terminal.
- TALE Transcription Activator-Like Effector
- the "TALE” according to the present invention includes at least an N-terminal domain and a TALE repeat domain, and may further include a C-terminal domain.
- the TALE repeat domain is composed of a plurality of TALE sequences forming a right-handed supercoil, for example, 10 to 30, preferably 13 to 25, and more preferably 15 to 20 tandem repeats (TALE repeats).
- a typical TALE repeat unit (one TALE sequence) consists of 33 to 35 amino acids, and the DNA is identified by a variable residue (repeat variable awareness: RVD) consisting of the 12th and 13th amino acid residues. Recognize the base of. Examples of RVDs that specifically recognize bases include HD that recognizes C, NG that recognizes T, NI that recognizes A, G, or NN that recognizes A, and recognizes A, C, G, or T. NS and the like can be mentioned.
- RVD receat variable awareness
- the TALE sequence of TALE according to the present invention may be appropriately modified with respect to the natural amino acid sequence as long as the TALE repeat domain can recognize and bind to the TALE recognition sequence. ..
- one or more amino acid residues other than RVD for example, 10 or less, 9 or less, 8 or less, 7) 4 or less, 6 or less, preferably 5 or less or 4 or less, more preferably 3 or less or 2 or less
- Substitution, insertion, and / or deletion may be performed, and the 12th RVD.
- the 13th 2 amino acid residue may be replaced with another amino acid residue in order to enhance the specificity for A, T, C and G.
- the TALE repeat domain of TALE recognizes a base sequence or a complementary sequence thereof existing on the 5'side of the target site of the target DNA via a spacer 1 having a chain length of 7 to 31 bp.
- the TALE recognition sequence recognized by TALE or its complementary sequence is designed to be a base sequence existing on the 5'side of the target site via a spacer 1 having a chain length of 7 to 31 bp.
- Techniques for designing and producing such desired TALE are known (eg, Miller et al., Nat. Biotechnol., 29, 143-148 (2011); Sakuma et al., Scientific Rep., 3: 3379 (2013)), for example, Sakuma et al.
- TALE having high binding activity to the base sequence can be produced.
- a natural amino acid sequence for example, the amino acid sequence of the N-terminal domain contained in pTALETF_v2 (ID: 32185 to 32188) of Addgene
- the present invention can be used.
- the natural amino acid sequence may be appropriately modified. ..
- one or more for example, 50 or less, 30 or less, 20 or less, 10 or less, 9 or less, 8 or less, 7 or less, 6 or less, preferably 5 or less or 4).
- 3 or less or 2 or less amino acid residues may be substituted, inserted, and / or deleted. It may also contain a flag tag for purification or detection, a nuclear localization signal (NLS) for translocating a TALE-nucleobase converting enzyme to the cell nucleus, and the like.
- NLS nuclear localization signal
- the chain length of the N-terminal domain is preferably 49 to 287 amino acid residues, more preferably 80 to 200 amino acid residues, and 120 to 180 amino acid residues. Is even more preferable. If the chain length of the N-terminal domain is less than the lower limit, it tends to be unable to bind to the target DNA, while if it exceeds the upper limit, the expression efficiency of the fusion protein tends to decrease.
- N-terminal domain of TALE include, for example, the amino acid sequence of the N-terminal domain contained in Addgene's ptCMV-136 / 63-VR-HD (ID: 50699) and Addgene's ptCMV-153 /. Examples thereof include, but are not limited to, the amino acid sequence of the N-terminal domain contained in 47-VR-HD (ID: 50703).
- the amino acid sequence of the C-terminal domain contained in the natural amino acid sequence is also used as the C-terminal domain of the TALE.
- the amino acid sequence of the C-terminal domain contained in the natural amino acid sequence is also used as the C-terminal domain of the TALE.
- the above-mentioned natural amino acid sequence can be used. , May be appropriately modified.
- one or more for example, 50 or less, 30 or less, 20 or less, 10 or less, 9 or less, 8 or less, 7 or less, 6 or less, preferably 5 or less or 4).
- more preferably, 3 or less or 2 or less amino acid residues may be substituted, inserted, and / or deleted.
- the C-terminal domain of the TALE may be a natural amino acid sequence in which a part of the amino acid sequence on the C-terminal side is removed.
- the chain length of the C-terminal domain is preferably 1 to 180 amino acid residues, more preferably 20 to 100 amino acid residues, and 40 to 70 amino acid residues. Is even more preferable.
- the chain length of the C-terminal domain exceeds the upper limit, the expression efficiency of the fusion protein tends to decrease when the fusion protein is expressed.
- pTALETF_v2 32185 to 32188
- pTALEN_v2 ID: 32189 to 32192
- the nucleic acid base converting enzyme is a target without breaking the DNA strand by catalyzing a reaction of converting or dropping a substituent on the purine or pyrimidine ring of a DNA base to another group or atom.
- the nucleobase converting enzyme is not particularly limited as long as it can catalyze the above reaction, and examples thereof include deaminase and glycosylase, and even if only one of these is used, two or more of them may be used. It may be a combination. Among these, deaminase is preferable as the nucleobase converting enzyme according to the present invention.
- the deaminase is an enzyme that catalyzes a deamination reaction that converts an amino group of a base into a carbonyl group, and belongs to the nucleic acid / nucleotide deaminase superfamily.
- Examples of such deaminase include cytosine deaminase capable of converting cytosine or 5-methylcytosine to uracil or thymine, adenosine deaminase capable of converting adenine to hypoxanthine, and guanosine deaminase capable of converting guanine to xanthine. It can be used properly according to the base substitution of.
- the origin of the deaminase is not particularly limited, and examples thereof include lamprey; mammals such as humans, monkeys, pigs, cows, horses, rats, and mice.
- cytidine deaminase for example, APOBEC (rAPOBEC1 derived from rat, hAPOBEC1, hAPOBEC2, hAPOBEC3 (hAPOBEC3A, 3B, 3C, 3D (3E), 3F, 3G, 3H), hAPOBEC4, etc. derived from human).
- Anc689 which is an ancestral amino acid sequence of APOBEC; AID (Activation-induced cytidine deaminase (AICDA)) derived from mammals (eg, humans, pigs, cows, horses, monkeys, etc.); PmCDA1 derived from AID and lamprey.
- AICDA Activity-induced cytidine deaminase
- adenosine deaminase 1 (Petromyzon marinus cytosine deaminase 1) can be mentioned.
- TadA derived from Escherichia coli
- Each of the above deaminase contains a variant thereof (for example, TadA-8e, TadA7.10, and variants thereof, etc.).
- the deaminase is preferably at least one selected from the group consisting of APOBEC, PmCDA1, Anc689, and TadA (each containing a variant thereof).
- the fusion protein according to the present invention contains two or more deaminase
- these combinations can be appropriately selected depending on the desired base substitution, for example, the same or different combination of cytidine deaminase, the same or different adenosine deaminase.
- the combination of TadA and PmCDA1 is preferable as the combination of deaminase.
- the base sequence and amino acid sequence of these deaminase are known and can be obtained from a public database (Genbank or the like).
- deaminase activity As long as the deaminase has deamination activity (deaminase activity), it is modified (for example, substitution, insertion, and / or lack of amino acid residues) based on the base sequence and amino acid sequence of these known deaminase. It may be the one that has been introduced).
- the presence or absence of deaminase activity can be confirmed by appropriately using a known method.
- cytidine deaminase can be confirmed by performing an enzymatic reaction using cytidine as a substrate and detecting and quantifying uridine, which is a metabolite of cytidine.
- the fusion protein according to the present invention preferably further contains linker 1.
- linker 1 in the fusion protein according to the present invention, TALE and the nucleobase converting enzyme are linked by linker 1.
- the number of linkers 1 is plural.
- the fusion protein according to the present invention contains a plurality of linkers 1, these may be only one kind or a combination of two or more kinds.
- the chain length of the linker 1 is not particularly limited, but is preferably 1 to 450 amino acid residues, and more preferably 5 to 250 amino acid residues. It is preferably 5 to 200 amino acid residues, more preferably 12 to 150 amino acid residues, and even more preferably 12 to 104 amino acid residues.
- the chain length of the linker 1 exceeds the upper limit, the expression efficiency of the fusion protein tends to decrease when the fusion protein is expressed.
- linker 1 for example, Yang L et al. , Nat Commun 7 13330 (2016), and Nishida, K. et al. et al. , Science 353, aaf8729 (2016), 104 amino acids, Kobran, L. et al. et al. , Nat Biotechnology 36, p. 843-846 (2016), 32 amino acids, Sun, N. et al. And Zhao, H. , Mol BioSystem 10, p. 446-453 (2014), Doi: 10.1039 / c3mb70412b), and examples thereof include, but are not limited to, the HTS95 amino acid sequence (95 amino acids) and its repetition.
- the fusion protein of the present invention when the nucleobase converting enzyme is located on the C-terminal side of TALE, the TALE repeat domain and the nucleobase converting enzyme are separated from each other by 1 to 630 amino acid residues. Is preferable. Therefore, the fusion protein of the present invention preferably contains at least one of the C-terminal domain of TALE and linker 1.
- the chain length between the TALE repeat domain and the nucleic acid-based converting enzyme (the amino acid residue adjacent to the C-terminal side of the C-terminal of the TALE repeat domain is set as the first residue, and the N-terminal of the nucleic acid-based converting enzyme is used as the first residue.
- the number of amino acid residues up to the residue adjacent to the N-terminal side is more preferably 25 to 300 amino acid residues, and even more preferably 52 to 174 amino acid residues. If the chain length is less than the lower limit, the efficiency of base editing tends to decrease, while if the chain length exceeds the upper limit, the expression efficiency tends to decrease when the fusion protein is expressed.
- the C-terminal residue of the TALE repeat domain is a TALE sequence (preferably N-terminal) located immediately before (N-terminal) the truncated last half-repeat (LHR) located on the C-terminal side of the TALE repeat domain.
- LHR truncated last half-repeat
- the N-terminal residue of the nucleobase converting enzyme is a typical amino acid sequence of a nucleobase converting enzyme that can be obtained from a known database (for example, NCBI) and a known method (for example, BLAST (NCBI)). It can be the starting residue (usually methionine) of an amino acid sequence having an identity of 85% or more (preferably 90% or more) in the homology search used.
- the fusion protein according to the present invention further includes the C-terminal side, that is, the C-terminal of the nucleobase-converting enzyme (when the nucleobase-converting enzyme is located on the C-terminal side of TALE) or the C-terminal of TALE (the nucleobase).
- the converting enzyme is located only on the N-terminal side of TALE
- the number of the base excision repair inhibitor may be one per the fusion protein, or may be two or more (preferably two) via the linker 2. Further, in the case of a plurality of types, only one type or a combination of two or more types may be used, but one type is preferable.
- the base excision repair inhibitor according to the present invention is not particularly limited as long as it inhibits base excision repair, but a DNA glycosylase inhibitor is preferable.
- the DNA glycosylase inhibitor include a thymine DNA glycosylase inhibitor, a uracil DNA glycosylase inhibitor, an oxoguanine DNA glycosylase inhibitor, and an alkylguanine DNA glycosylase inhibitor.
- a uracil DNA glycosylase inhibitor UGI
- the presence of deaminase and UGI on one polypeptide further improves the efficiency of base editing.
- uracil DNA glycosylase inhibitor examples include a uracil DNA glycosylase inhibitor (UGI) derived from PBS1 which is a Bacillus subtilis bacteriophage or a uracil DNA glycosylase inhibitor derived from PBS2 which is a Bacillus subtilis bacteriophage (Bacillus subtilis). UGI), but is not limited to these.
- UGI uracil DNA glycosylase inhibitor
- base sequences and amino acid sequences are known and can be obtained from public databases (Genbank, etc.).
- the base excision repair inhibitor according to the present invention is modified (for example, substitution of amino acid residues) based on the base sequence and amino acid sequence of a known base excision repair inhibitor as long as it has the repair inhibitory activity of the above DNA mismatch. , Insertion, and / or introduction of deletion).
- the fusion protein according to the present invention further contains the base excision repair inhibitor and there are a plurality of the base excision repair inhibitors
- the fusion protein-linker 2-first base excision repair inhibitor-linker 2- can be linked via the linker 2 as in the second base excision repair inhibitor.
- the plurality of linkers 2 at this time may be only one type or a combination of two or more types, but one type is preferable.
- the linker 2 is not particularly limited, but the chain length is preferably 1 to 50 amino acid residues, more preferably 5 to 30 amino acid residues, and 8 to 15 amino acid residues. Is even more preferable. If the chain length of the linker 2 is less than the lower limit, the functionality of the base excision repair inhibitor tends to decrease, while if it exceeds the upper limit, the expression efficiency of the fusion protein decreases. There is a tendency.
- linker 2 examples include, but are not limited to, the 10 amino acids described in pCMV_AncBE4max_P2A_GFP (ID: 112100) of Addgene.
- the fusion protein according to the present invention includes a flag tag for purification and detection at the N-terminal and / or C-terminal, a nuclear localization signal (NLS) for transferring TALE-nucleobase converting enzyme to the cell nucleus, and the like. These may be contained in the N-terminal domain of TALE as described above, or may be added to the N-terminal and / or C-terminal of the fusion protein.
- NLS nuclear localization signal
- the fusion protein according to the present invention is, for example, transcribed and expressed as a whole with the above-mentioned TALE and nucleobase converting enzyme, and, if necessary, the linker 1, the base excision repair inhibitor, the linker 2, and other amino acid-encoding genes. It can be obtained by letting it. Further, the fusion protein according to the present invention can be produced by appropriately adopting and improving a conventionally known method, for example, a method of chemically synthesizing based on the amino acid sequence by a commercially available synthesizer, or the following.
- the method for producing a cell in which the target DNA of the above is edited it can be obtained by a method of introducing a polynucleotide encoding a fusion protein or a vector expressing the fusion protein into the cell and expressing it.
- the CRISPR-Cas9 system contains a Cas9 protein in which some or all of the nuclease activity is lost and a guide RNA thereof.
- the Cas9 protein is a Cas9 protein that has lost a part of nuclease activity (referred to as “nCas9” in the present specification) or a Cas9 protein that has lost all of the nuclease activity. (In this specification, it is referred to as "dCas9”). That is, the Cas9 protein typically contains a domain involved in the cleavage of the target chain (RuvC domain) and a domain involved in the cleavage of the non-target chain (HNH domain), but the Cas9 protein according to the present invention includes. It is necessary that the nuclease activity of at least one domain is lost due to the introduction of a mutation or the like.
- SpCas9 protein (Cas9 protein derived from S. pyogenes)
- a mutation of the 10th amino acid (aspartic acid) from the N-terminal to alanin (D10A: mutation in the RuvC domain) for example, a mutation of the 10th amino acid (aspartic acid) from the N-terminal to alanin (D10A: mutation in the RuvC domain).
- the amino acid sequence and base sequence of the Cas9 protein are registered in a public database, for example, Genbank (http://www.ncbi.nlm.nih.gov) (for example, accession number: KX151730.1). Etc.), these can be used in the present invention.
- the Cas9 protein may be introduced with further mutations, for example, mutations for modifying the recognition of PAM sequences (Benjamin, P. et al., Nature 523, 481-485 (2015); Hirano). , S. et al., Molecular Cell 61, 886-894 (2016)), and a nuclear localization signal (NLS) for translocating to the cell nucleus may be added.
- such a Cas9 protein may be one encoded by a sequence contained in a commercially available plasmid.
- the following guide RNA is designed so that the guide RNA target sequence contains a complementary base at the target site of the target DNA.
- the complementary base of the target site of the target DNA refers to the complementary base of the target site 1 base on the complementary strand of the strand in which the target site is present.
- the guide RNA is a crRNA containing a base sequence (hereinafter, sometimes referred to as "targeted base sequence") complementary to a sequence on the target DNA (guide RNA target sequence). It is a combination of CRISPR RNA) and tracrRNA (trans-activating crRNA).
- the crRNA further contains a base sequence capable of interacting (hybridizing) with tracrRNA on the 3'side.
- tracrRNA contains a base sequence capable of interacting (hybridizing) with a part of the base sequence of crRNA on the 5'side, the guide RNA interacts with Cas9 protein by the interaction of these base sequences.
- the guide RNA recognizes and binds to the guide RNA target sequence of the target DNA, induces the Cas9 protein forming a complex with the guide RNA to the target DNA, and the induced Cas9 protein is the induced Cas9 protein.
- the helicase activity exposes the single-stranded DNA at the target site of the target DNA.
- the guide RNA of the CRISPR-Cas9 system may be a single-molecule guide RNA (sgRNA) containing crRNA and tracrRNA, or a double-molecule guide RNA composed of a crRNA fragment and a tracrRNA fragment.
- sgRNA single-molecule guide RNA
- the target DNA editing of the base at the target site is performed on the complementarity of base pairing between the target base sequence of the guide RNA and the guide RNA target sequence and on the 3'side of the complementary strand of the guide RNA target sequence. It occurs at a position determined by both the existing PAM sequence and.
- the guide RNA of the CRISPR-Cas9 system the guide RNA target sequence becomes a sequence containing a complementary base of the target site, and the Cas9 protein can recognize the PAM sequence on the 3'side of the target site. Designed to.
- the target site is specifically exposed by the helicase activity that exposes the single-stranded DNA, and the editing efficiency by the nucleobase converting enzyme is improved.
- E-CRISPR http://www.e-crisp.org/E-CRISPR.
- Zifit Targeter http://zifit.partners.org/ZiFiT/) (Zing Finger Consortium)
- CRISPRdirect http://crispr.dbcls.jp/) (Tokyo University)
- CRISPR-P /Cbi.hzau.edu.cn/crispr/) China Agricultural University
- Guide RNA Target Design Tool It can be determined by using (https://wwws.blueheronbio.com/external/tools/gRNASrc.jsp) (BlueHeronBiotech) and the like.
- the CRISPR-Cas9 system according to the present invention is, for example, by simultaneously transcribing and expressing the above-mentioned Cas9 protein and the gene encoding the guide RNA, respectively, as described in the following method for producing a cell in which the target DNA is edited.
- the Cas9 protein and guide RNA according to the present invention can be produced by appropriately adopting and improving conventionally known methods. For example, a polynucleotide encoding them or an expression vector containing the polynucleotide is applied to the cell. It can be obtained by the method of introduction and expression.
- target DNA the DNA containing the target site to be the target of DNA editing
- target DNA the DNA containing the target site to be the target of DNA editing
- the target DNA according to the present invention is a double-stranded DNA, and for convenience, at least one of the strands has a TALE recognition sequence in order from the 5'side in order to show the correspondence with the above fusion protein and the CRISPR-Cas9 system. Or, it is assumed that the structure includes the complementary sequence thereof, the spacer 1, the target site, and the PAM sequence.
- the inclusion of the complementary sequence of the TALE recognition sequence on the 5'side indicates that the complementary strand of the strand contains the TALE recognition sequence. That is, the TALE recognition sequence may be set on the same strand as the target site or on the complementary strand thereof.
- the complementary strand of the strand contains a guide RNA target sequence so as to contain the complementary base of the target site.
- An example of the configuration of the target DNA according to the present invention is shown in FIG. 2 below, but the present invention is not limited thereto.
- the target site according to the present invention refers to one base targeted for editing by the nucleobase converting enzyme (preferably deamination with deaminase).
- bases near both sides of the 1 base are further edited (deamination in the case of deaminase).
- the TALE recognition sequence is a sequence recognized by the TALE.
- the number of bases in the TALE recognition sequence is 10 to 30 bases, preferably 13 to 25 bases, and more preferably 15 to 20 bases.
- the TALE recognition sequence is selected to be present on the 5'side of the target site via a spacer 1 having a chain length of 7 to 31 bp. When the chain length of the spacer 1 is within such a range, one base of the target site can be deaminated specifically and with high efficiency.
- the chain length of the spacer 1 is the length from the base adjacent to the 5'side of the base of the target site to the base adjacent to the 3'side of the TALE recognition sequence as the first base.
- the chain length of such a spacer 1 is more preferably 10 to 31 bp, further preferably 10 to 28 bp, and even more preferably 13 to 25 bp.
- the PAM sequence is a sequence recognized by the Cas9 protein.
- the PAM sequence depends on the type of Cas9 protein used. Typical PAM sequences include, for example, S. cerevisiae.
- the PAM sequence corresponding to the Cas9 protein (type II) derived from pyogenes is 5'-NGG, and S. streptococcus pyogenes.
- the PAM sequence corresponding to the Cas9 protein (IA1 type) derived from solfatalicus is 5'-CCN, and S.I.
- the PAM sequence corresponding to the Cas9 protein (type I-A2) derived from solfatalicus is 5'-TCN, and H.I.
- the PAM sequence corresponding to the Cas9 protein (type IB) derived from walsbyl is 5'-TTC, and E.I.
- the PAM sequence corresponding to the Cas9 protein (type IE) derived from colli is 5'-AWG, and E.I.
- the PAM sequence corresponding to the Cas9 protein (type IF) derived from coli is 5'-CC, and P.I.
- the PAM sequence corresponding to the Cas9 protein (type IF) derived from aeruginosa is 5'-CC, and S. a.
- the PAM sequence corresponding to the Cas9 protein (II-A type) derived from Thermophilus is 5'-NNAGAA.
- the PAM sequence corresponding to the Cas9 protein (type II-A) derived from agalactiae is 5'-NGG, and S. a.
- the PAM sequence corresponding to the Cas9 protein from aureus is 5'-NGRRT or 5'-NGRRN, N.I.
- the PAM sequence corresponding to the Cas9 protein from meningitidis is 5'-NNNGATT, T.I.
- the PAM sequence corresponding to the Cas9 protein from denticola is 5'-NAAAAC.
- the PAM sequence exists on the 3'side of the target site, and the guide RNA target sequence recognized by the guide RNA is determined according to the position of the PAM sequence.
- the position of the complementary base of the target site in the guide RNA target sequence is not particularly limited, but is preferably, for example, between the 1st and 50th positions, and the 1st to 30th positions. It is more preferably between 1st and 25th positions.
- the position of the base in the guide RNA target sequence is the position from the base at the 3'end of the guide RNA target sequence to the complementary base of the base adjacent to the 5'side of the PAM sequence. ..
- the upper and lower limits of the base position depend on the length of the guide RNA target sequence and can be adjusted by adjusting the length of the guide RNA.
- the guide RNA target sequence recognized by the guide RNA is selected so as to be a sequence containing a complementary base of the target site. By selecting the position of the guide RNA target sequence in this way, one base of the target site can be edited specifically and with high efficiency.
- the number of bases in the guide RNA target sequence according to the present invention is preferably 12 to 50 bases, more preferably 17 to 30 bases, and even more preferably 17 to 25 bases.
- the base sequence of such a target DNA is not particularly limited, and the TALE recognition sequence or its complementary sequence, spacer 1, the complementary sequence of the guide RNA target sequence, the target site contained in the complementary sequence, and the PAM sequence are described above.
- the targeting base sequences of TALE and guide RNA so as to satisfy the conditions, and modifying the recognition specificity of PAM as necessary, it can be targeted by the DNA editing method of the present invention.
- the target DNA according to the present invention may be intracellular DNA (endogenous DNA) or extracellular DNA, depending on the purpose.
- the DNA existing in the cell may be endogenous DNA or extrinsic DNA. Examples of the endogenous DNA include genomic DNA, and examples of the exogenous DNA include DNA introduced into cells.
- the DNA existing outside the cell may be a DNA derived from a cell or a DNA amplified and synthesized extracellularly.
- Target DNA editing method In the target DNA editing method of the present invention, the fusion protein and the CRISPR-Cas9 system are brought into contact with the target DNA, and the base of the target site of the target DNA is edited by the nucleobase converting enzyme activity of the fusion protein.
- the TALE of the fusion protein recognizes and binds to the TALE recognition sequence on the target DNA, and the nucleobase converting enzyme linked to the TALE is attached to the target DNA.
- the guide RNA of the CRISPR-Cas9 system recognizes and binds to the guide RNA target sequence on the target DNA, and induces the Cas9 protein that forms a complex with the guide RNA to the target DNA.
- the Cas9 protein exposes the single-stranded DNA at the target site by helicase activity, so that the nucleobase converting enzyme can efficiently cause the substitution of the target base (for example, deamination of the base).
- This causes a single base substitution at the target site (eg, C ⁇ U), and in cells, for example, due to a double-stranded DNA mismatch, the base on the opposite strand of the substituted strand is paired with the substituted base. It may be repaired to form (eg, G ⁇ A), replaced with another nucleotide during repair (eg, U ⁇ A, G), or result in a deletion or insertion of one or a dozen bases. Thereby, various mutations can be introduced.
- deletion of one or more nucleotides, substitution with another one or more nucleotides, or 1 in the target site converted by the nucleobase converting enzyme and the vicinity containing the same. Includes the insertion of the above nucleotides or a combination of these mutations.
- the target DNA editing method of the present invention may be performed intracellularly or in a cell-free system.
- the "intracellular" field of the target DNA editing method of the present invention may be in eukaryotic cells or prokaryotic cells, preferably in eukaryotic cells.
- the eukaryotic cell include animal cells (mammalian, fish, birds, reptiles, amphibians, insect cells, etc.), plant cells, algae cells, yeast, and examples of the prokaryotic cells include Escherichia coli. Examples include salmonella, bacillus, lactic acid, and highly thermophilic bacteria.
- Animal cells include, for example, cells constituting an individual animal, cells constituting an organ / tissue excised from an animal, cultured cells derived from an animal tissue, and the like. Specifically, for example, germ cells such as egg mother cells and sperm; embryo cells of embryos at each stage (for example, 1-cell stage embryo, 2-cell stage embryo, 4-cell stage embryo, 8-cell stage embryo, 16-cell stage). Embryos, mulberry stage embryos, etc.); Stem cells such as induced pluripotent stem (iPS) cells and embryonic stem (ES) cells; fibroblasts, hematopoietic cells, neurons, muscle cells, bone cells, hepatocytes, pancreatic cells , Brain cells, somatic cells such as kidney cells and the like.
- germ cells such as egg mother cells and sperm
- embryo cells of embryos at each stage for example, 1-cell stage embryo, 2-cell stage embryo, 4-cell stage embryo, 8-cell stage embryo, 16-cell stage.
- pre-fertilized and post-fertilized oocytes can be used, but a post-fertilized oocyte, that is, a fertilized egg is preferable.
- the fertilized egg is from a pronuclear stage embryo. Oocytes can be thawed and used after being cryopreserved.
- the "plant cell” includes, for example, cells constituting an individual plant, cells constituting an organ or tissue separated from a plant, cultured cells derived from a plant tissue, and the like.
- plant organs and tissues include leaves, stems, shoot apex (growth point), roots, tubers, callus and the like.
- the "cell-free system" which is the place of the target DNA editing method of the present invention refers to a system without living cells (the eukaryotic cells and prokaryotic cells).
- the cell-free system according to the present invention is not particularly limited as long as it is a system in which the fusion protein and the CRISPR-Cas9 system can contact the target DNA, for example, in a buffer; the eukaryotic cell or prokaryote. Examples thereof include the inside of a cell crushed solution and the inside of a cell extract.
- the method of contacting the fusion protein and the CRISPR-Cas9 system with the target DNA is not particularly limited.
- the fusion protein and the CRISPR-Cas9 system are introduced into the cell containing the target DNA or a vector encoding the same, as in the method for producing a cell in which the target DNA is edited below. Examples thereof include a method of introducing the above into cells and expressing them.
- a solution of the target DNA may be mixed with the fusion protein and the solution of the CRISPR-Cas9 system.
- the solvent of these solutions is not particularly limited, but for example, a buffer solution such as a phosphate buffer solution, a Tris buffer solution, a Good's buffer solution, or a borate buffer solution is preferable.
- the method for producing a cell in which the target DNA of the present invention has been edited is A method of producing cells in which the target DNA has been edited.
- Fusion proteins containing TALE and nucleobase converting enzymes and (2) A CRISPR-Cas9 system containing a Cas9 protein having lost part or all of its nuclease activity and its guide RNA is introduced into a cell or expressed intracellularly and brought into contact with a target DNA to convert the nucleobase of the fusion protein. It comprises the step of editing the base of the target site of the target DNA by enzymatic activity.
- the TALE recognition sequence or its complementary sequence recognized by TALE in the fusion protein is present on the 5'side of the target site via a spacer 1 having a chain length of 7 to 31 bp, and
- the guide RNA target sequence recognized by the guide RNA in the CRISPR-Cas9 system is present to contain the complementary base of the target site. The method.
- the fusion protein, the CRISPR-Cas9 system, and the target DNA are the above-mentioned target DNA editing methods of the present invention.
- the target DNA in the production method of the present invention is genomic DNA, and the fusion protein and the CRISPR-Cas9 system can be designed according to the editing purpose of the genomic DNA.
- the contact between the fusion protein and the CRISPR-Cas9 system and the target DNA in the cell is the introduction of the fusion protein and the CRISPR-Cas9 system into the cell in the form of a protein, into the cell. It is carried out by introduction in the form of a polynucleotide and / or introduction into a cell in the form of a polynucleotide or introduction in an expression vector to express the protein intracellularly.
- the form of RNA or DNA (polynucleotide) encoding the protein is obtained. It may be introduced into cells and expressed intracellularly, or it may be introduced into cells in the form of a vector (expression vector) expressing the protein and expressed intracellularly.
- the guide RNA may be introduced into cells in the form of RNA, or introduced into cells in the form of DNA (polynucleotide) encoding the RNA and expressed intracellularly. It may be introduced into a cell in the form of a vector (expression vector) expressing the RNA and expressed intracellularly.
- fusion protein and the CRISPR-Cas9 system are introduced into cells in the form of an expression vector and expressed intracellularly, for example, a vector expressing the fusion protein, a vector expressing the Cas9 protein, and the guide RNA.
- a vector expressing the above may be introduced into the cell, or a vector expressing a combination of two or more of these may be introduced into the cell.
- the fusion protein and the CRISPR-Cas9 system are introduced into cells in the form of an expression vector and expressed intracellularly, the fusion protein and the polynucleotide encoding the Cas9 protein are independently introduced. It may be appropriately codon-optimized according to the cell.
- the expression vector preferably contains a promoter and / or other control sequence operably linked to the polynucleotide to be expressed. Further, it is preferable that the expression vector is one that can stably express the encoding protein without being integrated into the host genome.
- Such an expression vector can be appropriately prepared according to a conventionally known method.
- a method for introducing a protein of the fusion protein and the CRISPR-Cas9 system, a polynucleotide encoding the protein, and a vector expressing the protein into cells a known method for introducing a protein, DNA, or RNA fragment into cells.
- the method can be appropriately adopted depending on the type of cell, and examples of such a method include an electroporation method, a microinfection method, a particle gun method, a calcium phosphate method, and polyethyleneimine (PEI).
- the fusion protein and the CRISPR-Cas9 system When the fusion protein and the CRISPR-Cas9 system are introduced into a cell or expressed intracellularly, the fusion protein and the CRISPR-Cas9 system come into contact with the target DNA in the cell, and the target DNA editing method of the present invention described above.
- target DNA editing substitution of a target one-base base occurs at the target site, and as a result, it becomes possible to obtain a cell in which the target DNA has been edited.
- the present invention also provides a method for producing a non-human individual containing a cell in which the target DNA has been edited.
- This method comprises the step of producing a non-human individual from the cells obtained by the above-mentioned production method.
- the non-human individual include non-human animals and plants.
- the non-human animal include mammals (mouse, rat, guinea pig, hamster, rabbit, monkey, pig, cow, goat, sheep, etc.), fish, birds, reptiles, amphibians, and insects.
- the mammal is preferably a rodent such as a mouse, rat, guinea pig, hamster, and particularly preferably a mouse.
- the plant include cereals, oil crops, feed crops, fruits and vegetables. Specific crops include, for example, rice, corn, banana, peanut, sunflower, tomato, oilseed rape, tobacco, wheat, barley, potato, soybean, cotton, and carnation.
- a known method can be used.
- germ cells or pluripotent stem cells are usually utilized.
- the fusion protein and the CRISPR-Cas9 system can be microinjected into oocytes and the resulting oocytes can be transplanted into the uterus of a pseudopregnant female non-human mammal to obtain offspring. can.
- the fusion protein and the CRISPR-Cas9 system were microinjected into plant cells to obtain the same. By regenerating a plant from a plant cell, a plant with the desired DNA edited can be obtained.
- progeny or clones in which the desired DNA is edited from the obtained non-human individuals.
- Confirmation of the presence or absence of target DNA editing and determination of genotype can be performed based on conventionally known methods, and for example, PCR method, sequencing method, Southern blotting method and the like can be used.
- the present invention is used in the above-mentioned method for editing a target DNA of the present invention, the method for producing a cell in which the target DNA of the present invention has been edited, or the above-mentioned method for producing a non-human individual of the present invention.
- Fusion proteins containing TALE and nucleobase converting enzymes and (2) Provided is a DNA editing system including a CRISPR-Cas9 system containing a Cas9 protein having lost part or all of nuclease activity and a guide RNA thereof.
- the fusion protein and the CRISPR-Cas9 system are as described in the above-mentioned target DNA editing method and production method of the present invention. These are independently in the form of a protein or RNA, in the form of a polynucleotide encoding the protein or RNA, or in the form of a vector (expression vector) expressing the protein or RNA. May be good.
- the invention allows the user to design the fusion protein and the CRISPR-Cas9 system depending on the target site of the target DNA.
- a vector containing a polynucleotide encoding a polynucleotide encoding TALE and a polynucleotide encoding a polynucleotide other than TALE of the fusion protein and
- B A polynucleotide encoding a Cas9 protein that has lost part or all of its nuclease activity, or a vector containing the polynucleotide.
- Each of the vectors (a) to (c) contains an expression unit that enables expression of each polynucleotide.
- the DNA editing system of the present invention may be a combination including the fusion protein and the CRISPR-Cas9 system, respectively, or may be a kit containing the combination.
- the kit may further include one or more additional reagents.
- additional reagents include, but are limited to, for example, dilution buffers, reconstruction buffers, wash buffers, nucleic acid transfer reagents, protein transfer reagents, control reagents (eg, control deaminase). It's not a thing.
- the kit may further include an instruction manual for carrying out the method of the present invention.
- Each element included in the kit may be contained in a separate container or may be contained in the same container. Each element may be contained in a container for each single use, or a plurality of doses may be contained in one container. Each element may be contained in a container in a dry form, or may be contained in a container in a form dissolved in a suitable solvent (solvent containing a buffer solution, a stabilizer, a preservative, a preservative, etc.). ..
- the BEtag sequence is a TALE recognition sequence (underlined in Table 1 (* 1): 5'-tACAGAAGCGGGCAAAGG-3'; lower letters indicate thymine recognized by the N-terminal domain of TALE), a target codon (underlined in Table 1 (underlined in Table 1). * 2): 5'-ACG-3') and the spacer 1 sequence between them are included, and the length of the spacer 1 sequence is 7 bp, 13 bp including the base (A) at 5'of the target codon. , 19bp, 25bp, or 31bp.
- Each BEtag sequence (Betag-7 to 31bp: SEQ ID NOs: 1 to 5) was annealed with an oligonucleotide designed to allow the addition of NheI recognition sites and SbfI recognition sites at both ends, and treated with NheI and SbfI-. It was inserted into the M / A and used as pNLF-BEtag (pNLF-BEtag-7bp to 31bp).
- the TALE recognition sequence is a Right TALE sequence of TALEN for the human adenomatous polyposis colli gene whose intracellular activity has been confirmed (Sakuma et al., "Repeating pattern of non-RVD binding DNA binding”. ", Sci. Rep., 3: 3379 (2013)).
- the oligonucleotide of the sequence (PAM: SEQ ID NO: 6) containing the PAM portion shown in Table 1 below and its antisense sequence.
- the oligonucleotide with 5'-TCGA-3'added to the 5'end of the above was annealed and inserted into each pNLF-BEtag treated with EcoRV and XhoI, and the reporter plasmids (pNLF-BEtag-7bp-PAM, pNLF-) were inserted.
- Betag-13bp-PAM, pNLF-BEtag-19bp-PAM, pNLF-BEtag-25bp-PAM, pNLF-BEtag-31bp-PAM were prepared.
- the complementary sequence of the guide RNA target sequence located on the 5'side of the PAM part is 3 bases at a time.
- the position of the PAM sequence can be shifted downstream by one base, thereby setting eight PAM sequences (PAMs 1 to 8 in Table 7 below). be able to.
- Table 1 below shows each sequence inserted into the pNLF-M / A plasmid.
- FIG. 1 shows a conceptual diagram showing the structure of the completed reporter plasmid.
- the sequence from the NheI recognition site to the XhoI recognition site containing the TALE recognition sequence, spacer 1, target site, PAM part, and guide RNA target sequence is shown in FIG.
- the target codon is located at the site corresponding to the start codon of NanoLuc luciferase, and according to such reporter plasmid, ACG, which is the target codon, becomes ATG by TALE-deaminase bound to the TALE recognition sequence.
- ACG which is the target codon
- TALE vector set and TALE expression plasmid The sequence and composition of the TALE vector set are as described in 1. above for the TALE repeat unit. To correspond to the TALE recognition sequence of the reporter plasmid prepared in Sakuma et al. It was constructed according to (2013). First, a module sequence (non-repeat-variable di-reside (non) other than the sequence encoding four types of variable residues (RVD: HD, NG, NI, NN) composed of the 12th and 13th amino acids.
- RVD non-repeat-variable di-reside
- the number of amino acids in the C-terminal domain is 63 (63)
- the number of amino acids in the C-terminal domain is 47 (47).
- the sequence encoding "WT” is the sequence of the C-terminal domain contained in addgene's pTALETF_v2 (ID: 32185 to 32188), and the sequence encoding "63” is the sequence of Addgene's pTALEN_v2 (ID: 32189).
- N-terminal domain of TALE the sequence of the N-terminal domain included in pTALETF_v2 (ID: 32185 to 32188) of WT: Addgene corresponding to these C-terminal domains; 63: ptCMV-136 / 63- of Addgene. Sequence of N-terminal domain contained in VR-HD (ID: 50699); 47: Sequence of N-terminal domain contained in Addgene's ptCMV-153 / 47-VR-HD (ID: 50703) is artificially referred to. DNA was synthesized. Using the array plasmid and destination vector prepared above, Sakuma et al. A TALE expression plasmid was prepared by the Golden Gate method according to the method described in (2013).
- TALE-deaminase expression plasmid (1) 104 Amino Acid Linker Series First, the AncBE4max gene (Kobran et al., "Improving cytodine and adenine base editors by expression optimilysis, plasmid, plasmid, plasmid. 2018)) and the Target-AID gene (Nucleotide et al., "Targeted nucleotide editing using hybrid plasmid and vertebrate digital and vertebrate adaptibe imemuneSite3, s. AncBE4max expression plasmid and Genet-AID expression plasmid were prepared by inserting them between the recognition site and the site, respectively.
- a TALE-deaminase expression plasmid having 104 amino acid linker 1 was prepared by the following method. That is, using the AncBE4max gene of the above AncBE4max expression plasmid as a template and the primers shown in Table 2 below, the Anc689 deaminase gene (Anc689 departure: primer SEQ ID NOs: 7 to 8), 10 amino acid linkers 2 and the UGI gene twice. Repeated sequences (10aa linker- ⁇ -plasmase: primer SEQ ID NOs: 11-12) were amplified by PCR, respectively.
- the PmCDA1 deaminase gene (PmCDA1: primer SEQ ID NOs: 9 to 10) and 104 amino acid linker 1 are encoded.
- the sequence (104aa linker: primer SEQ ID NOs: 13 to 17) was amplified by PCR.
- a sequence (primer SEQ ID NOs: 22-25) containing ⁇ -lactamase to the sequence encoding the N-terminal domain (WT, 63, or 47) of TALE was amplified by PCR.
- Table 2 below shows each primer used to construct the TALE-deaminase expression plasmid and its SEQ ID NO:.
- Table 2 shows each primer used to construct the TALE-deaminase expression plasmid and its SEQ ID NO:.
- the C-terminal domain of TALE is WT
- the 104 amino acid linker 1 and Anc689 deaminase (BE4) are used in the "WT-104-BE4" below the for of 104aa linker. It is shown that it is a sequence encoding the linker 1 for TALE-deaminase containing.
- "AID" does not indicate that deaminase is AID, but indicates that it is PmCDA1 derived from "Taget-AID”.
- a sequence containing from ⁇ -plasmase prepared in 1 to a sequence encoding the N-terminal domain of TALE (WT, 63, or 47) and a downstream (3') encoding the C-terminal domain of TALE (WT, 63, or 47). Inserted between the side) sequence and the In-Fusion method (TaKaRa Bio Inc, Shiga, Japan), and in order from the N-terminal, TALE (N-terminal domain (WT, 63, or 47) -TALE repeat domain-C-terminal domain.
- TALE-deaminase expression plasmid A conceptual diagram showing the structure of the TALE-deaminase is shown in FIG.
- TALE-deaminase expression plasmid having a 12 amino acid linker 1 (12aa) instead of the 104 amino acid linker 1 was prepared by the following method. That is, first, the oligonucleotides shown in Table 3 below were annealed to prepare a sequence encoding the 12 amino acid linker 1. In Table 3, below the for of 12aa linker (SEQ ID NOs: 18 to 21), for example, in "TALE_WT-12-AID", the C-terminal domain of TALE is WT and the 12 amino acid linker 1 is described above.
- PmCDA1 deaminase are shown to be sequences encoding linker 1 for TALE-deaminase.
- a 12 amino acid linker 1 is described in Yang L et al. , "Engineering and optimizing deaminase fusions for genome editing.”, Nat Commun 7 13330 (2016).
- the sequences other than the sequence encoding 104 amino acid linker 1 of the TALE-deaminase expression plasmid were amplified by inverse PCR using the primers (SEQ ID NOs: 9, 22 to 24) shown in Table 4 below.
- the sequence encoding the 12 amino acid linker 1 was inserted by the In-Fusion method.
- Table 3 below shows the sequence encoding the 12 amino acid linker 1 and its SEQ ID NO:
- Table 4 shows the primers used for inverse PCR and their SEQ ID NOs.
- sequences other than the sequences encoding the amino acids to be deleted in the above TALE-deaminase expression plasmid were amplified by inverse PCR using the primers (SEQ ID NOs: 9, 26 to 31) shown in Table 5 below, and In- It was produced by self-ligation by the Fusion method.
- Table 5 below shows the primers used to prepare the amino acid linker 1 deletion mutant of TALE-47-104-AID and their SEQ ID NOs.
- TALE-deaminase-NC expression plasmid As a negative control (NC) for TALE-deaminase, the above 1. Instead of the TALE recognition sequence of the reporter plasmid prepared in 1. In the same manner as described above, plasmids expressing TALE-deaminase-NC corresponding to each TALE-deaminase (TALE-deaminase-NC expression plasmid) were prepared.
- TALE-deaminase and its negative control TALE-deaminase-NC
- sequence numbers SEQ ID NOs of the base sequence and amino acid sequence
- Cas9 expression plasmid pX330_BS was prepared by inserting it between the EcoRV recognition site and the BamHI recognition site by the In-Fusion method.
- a site-specific mutagenesis method by PCR was performed on the Cas9 gene of pX330_BS to prepare a nickase-type nCas9 (D10A) expression plasmid and a dCas9 (D10A + H840A) expression plasmid having no cleavage activity.
- oligonucleotide designed as described above was annealed and inserted into pX330_BS- ⁇ Cas9 by BpiI treatment and ligation to prepare a plasmid (guide RNA expression plasmids 1 to 8) expressing each guide RNA (sgRNA-1 to 8). .. Table 7 below shows the complementary sequence (underlined) of the target sequence of the guide RNA and the PAM sequence (underlined) of Cas9.
- Transformation into HEK cells (1) HEK293T cells grown in DMEM medium containing 10% FBS were seeded in each well of a 96-well plate with 5 ⁇ 10 4 cells each. 75 ng of any of the TALE-deaminase expression plasmid and the TALE-deaminase-NC expression plasmid shown in Table 6; NCas9 expression plasmid or dCas9 expression plasmid 50 ng prepared in 1. above; 10 ng of any of the guide RNA expression plasmids 1 to 8 and pX330_BS- ⁇ Cas9 (without guide RNA) prepared in 1.
- the reference plasmid is a plasmid that expresses firefly luciferase (Fluc).
- the NanoLuc luciferase activity score for cells into which the TALE-deaminase-NC expression plasmid was introduced was measured.
- the positive control cells into which the AncBE4max expression plasmid was introduced
- the activity score of the cells into which the pX330_BS- ⁇ Cas9 expression plasmid was introduced was measured and the negative control (AncBE4max-). NC).
- each NanoLuc luciferase activity score was standardized using the firefly luciferase activity score from the reference plasmid. Then, using the standardized activity score, the activity score of the cell into which the TALE-deaminase expression plasmid was introduced was subtracted from the activity score of the cell into which the TALE-deaminase expression plasmid was introduced, and the activity value of the TALE-deaminase activity was obtained. In addition, AncBE4max-NC was subtracted from the activity score of the cells into which the AncBE4max expression plasmid was introduced to obtain the AncBE4max activity value.
- the TALE-deaminase activity value is shown as a relative activity when the AncBE4max activity value under the condition of PAM2 described later is set to 1. The test was repeated at least 3 times and graphed with a standard deviation added to the mean.
- the position of the PAM sequence is set one base at a time. It was moved to the 3'side (downstream) (PAM1 to PAM8), and the base editing activity was examined.
- the AncBE4max expression plasmid prepared in 1. above and 5.
- the reporter plasmid (spacer 1: 19bp) prepared in 1. and the reference plasmid pGL4.54 were combined and introduced into HEK293T cells, and the above 7.
- the AncBE4max activity value was obtained from each of the above.
- pX330_BS- ⁇ Cas9 was used instead of the guide RNA expression plasmid.
- Fig. 5 The results are shown in Fig. 5.
- the guide RNA expression plasmid 2 that is, when the AncBE4max activity value at the time of PAM2 is 1.0
- the guide RNA expression plasmids 1 to 8 that is, when the guide RNA expression plasmids 1 to 8 are used, that is, when PAM1 to 8 are used.
- the AncBE4max activity value is shown. As shown in FIG. 5, it was found that the AncBE4max activity value showed the maximum activity at the time of PAM2. Therefore, the AncBE4max activity value at the time of PAM2 was defined as the following positive control for activity evaluation.
- TALE-deaminase (BE4 series)
- WT-104-BE4, 47-104-BE4 the C-terminal domain of TALE is WT or 47
- the amino acid linker 1 is 104 amino acids
- the deaminase is BE4 (Anc689).
- the position of the PAM sequence was fixed at PAM2 and the effect of the length of the reporter spacer 1 on activity was investigated.
- the length of the spacer 1 was 7 bp, 13 bp, 19 bp, 25 bp, and 31 bp.
- nCas9 As Cas9 / guide RNA, nCas9 (D10A) and guide RNA are expressed (nCas9), dCas9 (D10A + H840A) and guide RNA are expressed (dCas9), nCas9 is expressed and guide RNA is not expressed (nCas9 /).
- nCas9 As Cas9 / guide RNA, nCas9 (D10A) and guide RNA are expressed (nCas9), dCas9 (D10A + H840A) and guide RNA are expressed (dCas9), nCas9 is expressed and guide RNA is not expressed (nCas9 /).
- the TALE-deaminase expression plasmid (WT-104-BE4 or 47-104-BE4) prepared in (1) or the corresponding TALE-deaminase-NC expression plasmid and the above 4.
- the nCas9 expression plasmid or dCas9 expression plasmid prepared in the above 5.
- Guide RNA expression plasmid 2 or pX330_BS- ⁇ Cas9 prepared in 1. above.
- the reporter plasmid (spacer 1: 7 to 31 bp) prepared in 1. was combined with pGL4.54, which is a reference plasmid, and introduced into HEK293T cells.
- the TALE-deaminase activity value (ratio to the AncBE4max activity value) was obtained.
- Fig. 6 As shown in FIG. 6, no activity was observed in any of the TALE-deaminase (BE4) in the absence of guide RNA, that is, when nCas9 was not bound near the target base. In addition, the activity of using nCas9 to put a nick in the bottom strand was higher than that of using dCas9 to put a nick. Regarding the length of the spacer 1, particularly high activity was shown in the case of 13 bp and 25 bp, and even higher activity was shown in the case of 25 bp. Regarding the C-terminal domain of TALE, 47 (0.29 ⁇ 0.03) was more active than WT (for example, nCas9, spacer 1 length: 25 bp, 0.19 ⁇ 0.01).
- TALE-deaminase Six types of TALE-deaminase (WT-104 / 12-AID,) in which the C-terminal domain of TALE is WT, 63, or 47, the amino acid linker 1 is 104 amino acids or 12 amino acids, and the deaminase is AID (PmCDA1). For 63-104 / 12-AID, 47-104 / 12-AID), the effect of the length of the spacer 1 of the reporter on the activity was investigated in the same manner as in BE4 above.
- Fig. 7 As shown in FIG. 7, no activity was observed in any of the TALE-deaminases (AIDs) in the absence of guide RNA, that is, when nCas9 was not bound near the target base. Further, in WT-104-AID, 63-104-AID, WT-12-AID, 63-12-AID, it is better to nick the bottom strand using nCas9 as in the case where the deaminase is BE4. , The activity was higher than when nick did not enter with dCas9. On the other hand, in 47-104-AID and 47-12-AID, the effect of Nick was not so remarkable.
- AIDs TALE-deaminases
- TALE-deaminase (AID) showed high activity at 25 bp, and 63-104-AID and 47-12 AID showed high activity at 19 bp.
- the activity is WT (eg, nCas9, spacer 1 length: 25bp, 0.09 ⁇ 0.00), 63 (0.16 ⁇ 0.01). 47 (0.4 ⁇ 0.03) (FIGS. 7-A, B, C); for 12 amino acid linker 1, the activity is WT (0.32 ⁇ 0.02), 63 (0.38 ⁇ 0.03). , 47 (0.57 ⁇ 0.02) (FIGS.
- the length of the spacer 1 of the reporter is fixed, and the position of the PAM sequence is moved to the 3'side (downstream) one base at a time to obtain the PAM sequence.
- the effect of position (ie, corresponding to the position of Cas9) on the activity of TALE-deaminase was investigated.
- the TALE-deaminase expression plasmid (AID series: WT-104 / 12-AID, 63-104 / 12-AID, 47-104 / 12-AID) prepared in (2) or the corresponding TALE-deaminase. -NC expression plasmid and 4. above.
- the nCas9 expression plasmid prepared in 1. above and 5.
- Guide RNA expression plasmids 1 to 8 prepared in 1 above and 1.
- the reporter plasmid spacer 1:13, 19, or 25bp
- prepared in 1. was combined with pGL4.54, which is a reference plasmid, and introduced into HEK293T cells.
- the TALE-deaminase activity value (ratio to the AncBE4max activity value) was obtained.
- TALE-deaminase (AID series) other than 47-12-AID showed high activity when the spacer 1 length was 19 bp.
- the positions of the PAM sequences include PAM6 (eg, 0.42 ⁇ 0.08 for spacer 1 length: 19bp) or PAM7 (0.38 ⁇ 0.05) (FIG. 8-A) for WT-104-AID.
- PAM7 (0.39 ⁇ 0.00)
- Fig. 8-B for 63-104-AID
- PAM7 (0.90 ⁇ 0.10
- Fig. 8-C for 47-104-AID, WT-12.
- AID also showed high activity with PAM7 (0.91 ⁇ 0.04) (Fig.
- the ones with the highest activity were the spacer 1 length of 47-104-AID: 19bp / PAM7 (0.90 ⁇ 0.10) and the spacer 1 length of WT-12-AID: 19bp / PAM7 (0.91). ⁇ 0.04), 47-12 AID spacer 1 length 13 bp / PAM7 (0.96 ⁇ 0.03).
- the reporter plasmid (spacer 1: 25bp) prepared in 1. and the reference plasmid pGL4.54 were combined and introduced into HEK293T cells, and the above 7.
- the TALE-deaminase activity value (ratio to the AncBE4max activity value) was obtained.
- Fig. 9 The results are shown in Fig. 9.
- the TALE-deaminase expression plasmid the above 3.
- the results when the 47-104-AID prepared in (1), the 47-12-AID prepared in (2), or the corresponding TALE-deaminase-NC expression plasmid are also shown.
- the activity of TALE-deaminase (AID) was slightly lower when the length of amino acid linker 1 was shortened to 12 amino acids, but the effect of the length of amino acid linker 1 on the activity was not so great. There was found.
- TALE-deaminase expression plasmid (2) (1) BE4 series (BE4-TALE) (A) 32 amino acid linker series (BE4-32-TALE) TALE-deaminase (hereinafter, "BE4-32-TALE” in some cases) arranged in the order of Anc689 deaminase (BE4), linker 1 (32 amino acids), and TALE (C-terminal domain: WT, 63, or 47) from the N-terminal.
- BE4 series A
- 32 amino acid linker series (BE4-32-TALE) TALE-deaminase (hereinafter, "BE4-32-TALE” in some cases) arranged in the order of Anc689 deaminase (BE4), linker 1 (32 amino acids), and TALE (C-terminal domain: WT, 63, or 47) from the N-terminal.
- the expression plasmid of) was prepared by the following method. That is, first, 3.
- deaminase (Anc689 deaminase (BE4)), linker 1 (32 amino acids), and TALE (C-terminal domain: WT, 63, or 47) are arranged in this order from the N-terminal.
- Destination vectors (BE4-32-WT, BE4-32-63, BE4-32-47) for expressing the BE4-32-TALE expression plasmid to be expressed were prepared. With these, 1. Array plasmid to which the TALE sequence prepared in 1. Using the array plasmid to which the NC sequence prepared in 1.
- each TALE-deaminase expression plasmid or TALE-deaminase-NC expression plasmid was prepared by the Golden Gate method.
- a conceptual diagram showing the structure of the TALE-deaminase is shown in FIG. 11 (A).
- SEQ ID NOs base sequence sequence numbers, amino acid sequence SEQ ID NOs showing the composition of TALE-deaminase and its negative control (TALE-deaminase-NC) and their sequences in each prepared expression plasmid. It is shown in Table 12 of.
- An expression plasmid (linker 1 length: 135 or 234 amino acids including the linkage sequence added before and after the HTS95 amino acid sequence) was prepared by the following method. That is, 8. (1) A 32-amino acid linker using the destination vectors of BE4-32-WT, BE4-32-63, and BE4-32-47 prepared in (a) as templates and the primers shown in Table 9 below. The 5'side (primer SEQ ID NOs: 25 and 80) including the sequence encoding 1 and the 3'side (primer SEQ ID NOs: 12, 81 to 82) after the sequence encoding each TALE were amplified by PCR, respectively. Table 9 below shows each primer used and its SEQ ID NO:.
- HTS95 nucleotide sequence SEQ ID NO: 105, amino acid sequence SEQ ID NO: 106
- deaminase (Anc689 deaminase (BE4)), linker 1 (135 amino acids or 234 amino acids), TALE (C-terminal domain: WT, 63, or 47) in order from the N-terminal.
- BE4-135-TALE or destination vector for expressing BE4-234-TALE (BE4-135-WT, BE4-135-63, BE4-135-47, BE4-234-WT, etc. BE4-234-63, BE4-234-47) were prepared, respectively. With these, 1. Array plasmid to which the TALE sequence prepared in 1. Using the array plasmid to which the NC sequence prepared in 1.
- each TALE-deaminase expression plasmid or TALE-deaminase-NC expression plasmid was prepared by the Golden Gate method.
- the conceptual diagram showing the structure of the TALE-deaminase is the same as in FIG. 11 (a).
- the following is a list of SEQ ID NOs (SEQ ID NOs of base sequence, sequence number of amino acid sequence) showing the composition and sequence of TALE-deaminase and its negative control (TALE-deaminase-NC) in each prepared expression plasmid. It is shown in Table 12 of.
- ABE8e series (ABE8e-TALE) First, the ABEmax gene (Koblan et al., Nat Biotechnol, 36, p.843-846 (2016)) prepared by artificial DNA synthesis is inserted between the BamHI recognition site and the EcoRV recognition site of cDNA 3.1s, respectively. As a result, ABEmax expression plasmid 2 was prepared.
- ABE8e V106W
- An expression plasmid was prepared. That is, using the ABEmax expression plasmid 2 as a template and the primers shown in Table 10 below, the sequence encoding the N-terminal NLS of the 3'-ABEmax gene to the sequence encoding the C-terminal of the 32 amino acid linker 1-5. 'Was amplified by PCR (primer SEQ ID NOs: 83-84) to form an acceptor.
- a primer was designed to contain mutations such as V106W, A109S, T111R, D119N, H122N, Y147D, F149Y, T166I, and D167N on the ABE7.10 gene, and two fragments using the ABEmax expression plasmid 2 as a template.
- A primer SEQ ID NOs: 85-86
- fragment B primer SEQ ID NOs: 87-88
- Table 10 shows each primer used and its SEQ ID NO:. In each table below, "ABE8e” indicates that the deaminase is TadA-8e (V106W).
- ABE deaminase ABE8e
- linker 1 32 amino acids, 135 amino acids, or 234 amino acids
- TALE C-terminal domain: 47
- the 5'side including the sequence encoding the N-terminal NLS and the 3'side including the sequence encoding the C-terminal side from the 32 amino acid linker were amplified by PCR including the vector (primer SEQ ID NO: 89). ⁇ 90) It was used as an acceptor. Further, the ABE8e (TadA-8e (V106W)) gene portion amplified by PCR using the ABE8e (TadA-8e (V106W)) expression plasmid as a template (primer SEQ ID NO: Addition 91 to 92) was used as the acceptor by the In-Fusion method. To create a destination vector for expressing ABE8e-32-47, ABE8e-135-47, or ABE8e-234-47. Table 11 below shows each primer used and its SEQ ID NO:.
- each destination vector and 1. Array plasmid to which the TALE sequence prepared in 1. Using the array plasmid to which the NC sequence prepared in 1. Similarly, each TALE-deaminase expression plasmid or TALE-deaminase-NC expression plasmid was prepared by the Golden Gate method.
- a conceptual diagram showing the structure of the TALE-deaminase is shown in FIG. 11 (B).
- SEQ ID NOs SEQ ID NOs of base sequence, sequence number of amino acid sequence
- TALE-deaminase-NC negative control
- TALE-deaminase expression plasmid targeting endogenous DNA (genomic DNA)
- TALE recognition sequence the sequences on the eight types of endogenous DNA shown in Table 13 below are targeted.
- An array plasmid in which the TALE sequence was ligated was prepared in the same manner as in the above.
- endogenous DNA CCR5 and HBB in which highly homologous family genes (CCR2 and HBD) are present were set.
- TALE-deaminase (AID) is arranged in the order of TALE (C-terminal domain: WT or 47), linker 1 (12 amino acids or 104 amino acids), and deaminase (AID) from the N-terminal by the Golden Gate method.
- Expression plasmids (TALE-AID expression plasmids) of WT (*) -12-AID, 47 (*) -12-AID, 47 (*) -104-AID: * are targets) were prepared. In addition, except that each array plasmid was used, 8.
- TALE is arranged in the order of deaminase (BE4 or ABE8e), linker 1 (32 amino acids or 135 amino acids), and TALE (C-terminal domain: 63 or 47) from the N-terminal by the Golden Gate method.
- -Expression plasmids (BE4-TALE expression plasmid, ABE8e-TALE expression plasmid) of deaminase (BE4-32-47 (*), BE4-135-63 (*), ABE8e-135-47 (*): * are targets, respectively) ) was produced.
- Table 13 below shows the target TALE recognition sequence
- Table 14 shows the target, the composition of TALE-deaminase, and the sequence number (sequence number of the base sequence, amino acid sequence) in each prepared expression plasmid. A list of each of the sequence numbers) is shown.
- reporter plasmids for BE4-TALE and ABE8e-TALE As reporters, 1. Insert the sequence shown in Table 15 below (N-CBE (for BE4-TALE) or N-ABE (for ABE8e-TALE)) into the pNLF-M / A prepared in 1 above, and insert it for BE4-TALE and ABE8e. -Reporter plasmids for TALE were prepared respectively.
- the insertion sequence is a complementary sequence of the TALE recognition sequence (underlined (* 1) in Table 15: 5'-CCTTTGCCCGCTTCTGTa-3'; lower letters indicate the complementary base of thymine recognized by the N-terminal domain of TALE), target codon ( Underlined (* 2) in Table 15: 5'-ACG-3'or 5'-TAG-3'), and the spacer 1 sequence between them, the length of the spacer 1 is 5'of the target codon. 7 bp, 13 bp, 19 bp, 25 bp, or 31 bp including the bases in. Table 15 below shows each sequence inserted into the pNLF-M / A plasmid.
- the target codon is located at the site corresponding to the start codon of Nano luciferase, and according to the reporter plasmid, the TALE-deaminase bound to the TALE recognition sequence at the target codon.
- C which is a target base (target site)
- TALE-deaminase bound to the TALE recognition sequence at the target codon.
- the reporter plasmid for ABE8e-TALE the target codon is located at the position corresponding to the stop codon inserted upstream of NanoLuc luciferase, and according to the reporter plasmid, TALE- bound to the TALE recognition sequence.
- TAG the target base (target site) A is replaced with G
- NanoLuc luciferase is expressed.
- Transformation into HEK cells (2) 10. When each reporter plasmid prepared in 1 is used as the target DNA, 6. It was performed in the same manner as shown in. That is, 8. 75 ng of either the BE4-TALE expression plasmid or the ABE8e-TALE expression plasmid prepared in 1. NCas9 expression plasmid or dCas9 expression plasmid 50 ng prepared in 1. Guide RNA expression plasmid 10 ng prepared in 10. 20 ng of any of the reporter plasmids prepared in 1; and pGL4.54 5 ng, which is a reference plasmid, were introduced into 5 ⁇ 10 4 HEK293T cells per well using Lipofectamine LTX and cultured for 24 hours.
- 10 ng of the guide RNA expression plasmid prepared in the above was introduced into 3 ⁇ 10 4 HEK293T cells per well, and cultured for 48 hours. Further, as a control in this case, 30 ng of any of the Target-AID expression plasmid, the AncBE4max expression plasmid, and the ABE8e expression plasmid, and 11.
- One of the combinations with the guide RNA expression plasmid 20 ng prepared in the above was introduced into HEK293T cells.
- TALE-deaminase activity value (ratio to the AncBE4max activity value or the ABE8e activity value) was obtained.
- BE4 series From N-terminal, Anc689 deaminase (BE4), linker 1 (32 amino acids, 135 amino acids, or 234 amino acids), TALE (C-terminal domain: WT, 63, or 47) are arranged in this order.
- WT, BE4-TALE-63, BE4-TALE-47 the effect of the length of the spacer 1 of the reporter and the length of the linker 1 on the activity was investigated.
- the length of the spacer 1 was 7 bp, 13 bp, 19 bp, 25 bp, and 31 bp.
- nCas9 As Cas9 / guide RNA, nCas9 (D10A) and guide RNA are expressed (nCas9), dCas9 (D10A + H840A) and guide RNA are expressed (dCas9), nCas9 is expressed and guide RNA is not expressed (nCas9 /).
- nCas9 As Cas9 / guide RNA, nCas9 (D10A) and guide RNA are expressed (nCas9), dCas9 (D10A + H840A) and guide RNA are expressed (dCas9), nCas9 is expressed and guide RNA is not expressed (nCas9 /).
- the length of the spacer 1 particularly high activity was shown when the C-terminal domain of TALE was 25 bp in WT and 63, and when the C-terminal domain of TALE was 19 bp and 25 bp in 47, respectively.
- the activity was particularly high when the C-terminal domain of TALE was 32 amino acids in WT and 47, and when the C-terminal domain of TALE was 135 amino acids in 63.
- the combination with particularly high activity was when the C-terminal domain of TALE was WT, when the linker 1 length was 32 amino acids and the spacer 1 length was 25 bp (0.75 ⁇ 0.10), and when the C-terminal domain of TALE was 63.
- the linker 1 length is 135 amino acids and the spacer 1 length is 25 bp (0.86 ⁇ 0.11)
- the linker 1 length is 32 amino acids and the spacer 1 length is 19 bp and 25 bp (when the linker 1 length is 32 amino acids and the spacer 1 length is 19 bp and 25 bp). It was 0.80 ⁇ 0.14 and 0.85 ⁇ 0.13).
- ABE8e series Also for TALE-deaminase (ABE8e-TALE-47) arranged in the order of ABE8e deaminase (ABE8e), linker 1 (32 amino acids, 135 amino acids, or 234 amino acids), and TALE (C-terminal domain: 47) from the N-terminal. Similar to the BE4 series, the effect of the length of the reporter spacer 1 and the length of the linker 1 on the activity was investigated.
- the length of linker 1 is 135 amino acids (0.54 ⁇ 0.03) for nCas9, 32 amino acids (0.55 ⁇ 0.13) and 135 amino acids (0.57 ⁇ 0.09) for dCas9. Each showed particularly high activity.
- TALE-AID (WT (*) -12-AID, 47) arranged in the order of TALE (C-terminal domain: WT or 47), linker 1 (12 amino acids or 104 amino acids), and deaminase (AID) from the N-terminal. 14 to 16 show the results when (*) -12-AID and 47 (*) -104-AID: * are targets).
- the TALE recognition sequence is on AS3 (a), AS14 (b), S2 (c) shown in FIG. 13A, respectively.
- FIG. 14 shows the ratio of T in the first base (A), the tenth base (B), and the eleventh base (C) of the complementary sequence of the target sequence of the guide RNA (sgRNA-AS3) on the target AS3 (that is,).
- FIG. 15 shows the first base (A) and the fifth base (A) of the complementary sequence of the target sequence of the guide RNA (sgRNA-AS14) on the target AS14.
- B) shows the proportion of T in the 6th base (C)
- FIG. 16 shows the 1st base (A) and the 8th base of the complementary sequence of the target sequence of the guide RNA (sgRNA-S2) on the target S2.
- B) shows the ratio of T in the 9th base (C).
- TALE-AID showed the same or higher activity against the target CCR5 gene or HBB gene as compared with the existing Base editors Target-AID and AncBE4max.
- Target-AID and AncBE4max showed base-editing activity against the non-target CCR2 gene or HBD gene, but TALE-AID showed almost no base-editing activity. Therefore, it was confirmed that TALE-AID has higher target specificity than the existing Target-AID and AncBE4max.
- BE4 series BE4-TALE (BE4-32-47 (*), BE4) arranged in the order of deaminase (BE4), linker 1 (32 amino acids or 135 amino acids), and TALE (C-terminal domain: 47 or 63) from the N-terminal. -135-63 (*): * is the target), and the results are shown in FIGS. 17 to 20.
- the TALE recognition sequence is on the complementary sequence of AS5 (d), AS14 (e), AS6 (f), AS7 (g) shown in FIG. 13B, respectively.
- FIG. 17 shows the ratio of T in the 7th base (A) and the 8th base (B) of the complementary sequence of the target sequence of the guide RNA (sgRNA-AS5) on the target AS5, and FIG. 18 shows the ratio of T in the target.
- the ratio of T in the first base (A), the fifth base (B), and the sixth base (C) of the complementary sequence of the target sequence of the guide RNA (sgRNA-AS14) on AS14 is shown, and FIG. 19 shows.
- the ratio of T in the 5th base (A) and 6th base (B) of the complementary sequence of the target sequence of the guide RNA (sgRNA-AS6) on the target AS6 is shown
- FIG. 20 shows the guide on the target AS7.
- the ratio of T in the 6th base (A) and the 7th base (B) of the complementary sequence of the target sequence of RNA (sgRNA-AS7) is shown.
- BE4-TALE showed approximately the same level of activity against the target CCR5 gene or HBB gene as compared with the existing Base editors Target-AID and AncBE4max.
- Target-AID and AncBE4max showed base-editing activity against the non-target CCR2 gene or HBD gene, but BE4-TALE showed almost no base-editing activity. Therefore, it was confirmed that BE4-TALE has higher target specificity than the existing Target-AID and AncBE4max.
- ABE8e series The results of using ABE8e-TALE (ABE8e-135-47) arranged in the order of ABE8e deaminase (ABE8e), linker 1 (135 amino acids), and TALE (C-terminal domain: 47) from the N-terminal are shown in FIGS. 21 to 21. It is shown in FIG. In each ABE8e-TALE, the TALE recognition sequence is on the complementary sequence of AS5 (d), AS6 (f), AS7 (g) shown in FIG. 13B, respectively. In FIG.
- FIG. 21 shows the ratio of G in the third base (A) and the fourth base (B) of the complementary sequence of the target sequence of the guide RNA (sgRNA-AS5) on the target AS5 (that is, the base changes from A to G).
- the edited ratio is shown, and FIG. 22 shows the ratio of G in the 4th base (A) and 7th base (B) of the complementary sequence of the target sequence of the guide RNA (sgRNA-AS6) on the target AS6.
- FIG. 23 shows the ratio of G in the 4th base (A) and 8th base (B) of the complementary sequence of the target sequence of the guide RNA (sgRNA-AS7) on the target AS7.
- ABE8e-TALE showed the same level of activity against the target CCR5 gene or HBB gene as compared with the existing Base editor ABE8e (TadA-8e (V106W)).
- ABE8e (TadA-8e (V106W)) showed base editing activity against the non-target CCR2 gene or HBD gene, but ABE8e-TALE showed almost no base editing activity. Therefore, it was confirmed that ABE8e-TALE has higher target specificity than the existing ABE8e (TadA-8e (V106W)).
- the present invention a method capable of specifically and efficiently editing a target DNA by a nucleobase converting enzyme, a method for producing a genome-edited cell using the method, And the DNA editing system used for them can be provided. Therefore, the present invention is expected to be utilized in fields such as gene editing therapy, which requires high editing efficiency and high safety.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Crystallography & Structural Chemistry (AREA)
- Enzymes And Modification Thereof (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
Description
標的DNAを編集する方法であり、
(1)TALE及び核酸塩基変換酵素を含む融合タンパク質、並びに、
(2)ヌクレアーゼ活性の一部又は全部を喪失したCas9タンパク質及びそのガイドRNAを含むCRISPR-Cas9システム
を、標的DNAに接触させ、前記融合タンパク質の核酸塩基変換酵素活性により前記標的DNAの標的部位の塩基を編集する工程を含み、
前記融合タンパク質におけるTALEが認識するTALE認識配列又はその相補配列は、前記標的部位の5’側に7~31bpの鎖長のスペーサー1を介して存在し、かつ、
前記CRISPR-Cas9システムにおけるガイドRNAが認識するガイドRNA標的配列は、前記標的部位の相補塩基を含むように存在する、
方法。
標的DNAが編集された細胞を製造する方法であり、
(1)TALE及び核酸塩基変換酵素を含む融合タンパク質、並びに、
(2)ヌクレアーゼ活性の一部又は全部を喪失したCas9タンパク質及びそのガイドRNAを含むCRISPR-Cas9システム
を、細胞に導入又は細胞内で発現させて標的DNAに接触させ、前記融合タンパク質の核酸塩基変換酵素活性により前記標的DNAの標的部位の塩基を編集する工程を含み、
前記融合タンパク質におけるTALEが認識するTALE認識配列又はその相補配列は、前記標的部位の5’側に7~31bpの鎖長のスペーサー1を介して存在し、かつ、
前記CRISPR-Cas9システムにおけるガイドRNAが認識するガイドRNA標的配列は、前記標的部位の相補塩基を含むように存在する、
方法。
前記融合タンパク質がさらに、TALEと前記核酸塩基変換酵素とを結合するリンカー1を含む、[1]又は[2]に記載の方法。
前記融合タンパク質がさらに、C末側にリンカー2を介して結合された塩基除去修復インヒビターを含む、[1]~[3]のうちのいずれか一項に記載の方法。
前記融合タンパク質における核酸塩基変換酵素がデアミナーゼである、[1]~[4]のうちのいずれか一項に記載の方法。
前記デアミナーゼが、APOBEC、PmCDA1、Anc689、及びTadAからなる群から選択される少なくとも1種である、[5]に記載の方法。
[1]~[6]のうちのいずれか一項に記載の方法に用いるためのDNA編集システムであり、
(1)TALE及び核酸塩基変換酵素を含む融合タンパク質、並びに、
(2)ヌクレアーゼ活性の一部又は全部を喪失したCas9タンパク質及びそのガイドRNAを含むCRISPR-Cas9システム
を含む、DNA編集システム。
前記融合タンパク質がさらに、TALEと前記核酸塩基変換酵素とを結合するリンカー1を含む、[7]に記載のDNA編集システム。
前記融合タンパク質がさらに、C末側にリンカー2を介して結合された塩基除去修復インヒビターを含む、[7]又は[8]に記載のDNA編集システム。
前記融合タンパク質における核酸塩基変換酵素がデアミナーゼである、[7]~[9]のうちのいずれか一項に記載の方法。
前記デアミナーゼが、APOBEC、PmCDA1、Anc689、及びTadAからなる群から選択される少なくとも1種である、[10]に記載のDNA編集システム。
本発明の標的DNA編集方法は、
標的DNAを編集する方法であり、
(1)TALE及び核酸塩基変換酵素を含む融合タンパク質、並びに、
(2)ヌクレアーゼ活性の一部又は全部を喪失したCas9タンパク質及びそのガイドRNAを含むCRISPR-Cas9システム
を、標的DNAに接触させ、前記融合タンパク質の核酸塩基変換酵素活性により前記標的DNAの標的部位の塩基を編集する工程を含み、
前記融合タンパク質におけるTALEが認識するTALE認識配列又はその相補配列は、前記標的部位の5’側に7~31bpの鎖長のスペーサー1を介して存在し、かつ、
前記CRISPR-Cas9システムにおけるガイドRNAが認識するガイドRNA標的配列は、前記標的部位の相補塩基を含むように存在する、
方法である。
本発明に係る融合タンパク質は、TALE及び核酸塩基変換酵素を含む融合タンパク質である。本明細書においては、前記本発明に係る融合タンパク質を、場合により、「TALE-核酸塩基変換酵素融合タンパク質」又は「TALE-核酸塩基変換酵素」という。前記融合タンパク質としては、N末から、TALE、核酸塩基変換酵素の順に結合されていても、核酸塩基変換酵素、TALEの順に結合されていてもよい。また、本発明に係る融合タンパク質としては、2以上(好ましくは2つ)の核酸塩基変換酵素を含んでいてもよく、この場合の核酸塩基変換酵素としては、1種のみであっても2種以上の組み合わせであってもよい。例えば、本発明に係る融合タンパク質が2つの核酸塩基変換酵素(第1の核酸塩基変換酵素、第2の核酸塩基変換酵素)を含む場合、これらは、第1の核酸塩基変換酵素、TALE、第2の核酸塩基変換酵素の順に結合されていてもよい。これらの中でも、本発明に係る融合タンパク質としては、N末から、TALE、核酸塩基変換酵素の順に結合されているか、N末から、核酸塩基変換酵素、TALEの順に結合されていることが好ましい。
TALE(Transcription Activator-Like Effector:転写活性化因子様エフェクター)は、Xanthomonas種プロテオバクテリアが分泌し宿主植物の遺伝子転写を活性化するタンパク質である。本発明に係る「TALE」は、少なくとも、N末ドメイン及びTALEリピートドメインを含み、C末ドメインをさらに含んでいてもよい。
本発明において、核酸塩基変換酵素とは、DNA塩基のプリン又はピリミジン環上の置換基を他の基若しくは原子に変換、又は脱落させる反応を触媒することにより、DNA鎖を切断することなく、標的のヌクレオチドを他のヌクレオチドに変換又は脱落させ得る酵素を意味する。
前記デアミナーゼは、塩基のアミノ基をカルボニル基に変換する脱アミノ化反応を触媒する酵素であり、核酸/ヌクレオチドデアミナーゼスーパーファミリーに属する。このようなデアミナーゼとしては、シトシン又は5-メチルシトシンをそれぞれウラシル又はチミンに変換し得るシチジンデアミナーゼ、アデニンをヒポキサンチンに変換し得るアデノシンデアミナーゼ、グアニンをキサンチンに変換し得るグアノシンデアミナーゼが挙げられ、目的の塩基置換に応じて、使い分けることができる。
本発明に係る融合タンパク質としては、リンカー1をさらに含むことが好ましい。この場合、本発明に係る融合タンパク質において、TALE及び前記核酸塩基変換酵素は、リンカー1により連結される。また、本発明に係る融合タンパク質が前記核酸塩基変換酵素を複数含む場合には、それに対応してリンカー1は複数であることが好ましい。本発明に係る融合タンパク質がリンカー1を複数含む場合、これらは、1種のみであっても2種以上の組み合わせであってもよい。
本発明に係る融合タンパク質としては、さらにC末側、すなわち、前記核酸塩基変換酵素のC末(前記核酸塩基変換酵素がTALEのC末側に位置する場合)又はTALEのC末(前記核酸塩基変換酵素がTALEのN末側のみに位置する場合)に、リンカー2を介して結合された塩基除去修復インヒビターを含むことが好ましい。塩基除去修復インヒビターをさらに含む場合、当該塩基除去修復インヒビターは、前記融合タンパク質あたり、1つであっても、さらにリンカー2を介して2以上(好ましくは2つ)であってもよい。また、複数である場合には、1種のみであっても2種以上の組み合わせであってもよいが、1種であることが好ましい。
本発明に係る融合タンパク質としては、さらに、N末及び/又はC末に、精製や検出のためのフラグタグ、TALE-核酸塩基変換酵素を細胞核へ移行するための核局在化シグナル(NLS)等を含んでいてもよく、これらは、上記のようにTALEのN末ドメインに含まれていても、融合タンパク質のN末及び/又はC末に付加されていてもよい。
本発明に係るCRISPR-Cas9システムは、ヌクレアーゼ活性の一部又は全部を喪失したCas9タンパク質及びそのガイドRNAを含む。
本発明に係るCRISPR-Cas9システムにおいて、Cas9タンパク質としては、ヌクレアーゼ活性の一部を喪失しているCas9タンパク質(本明細書では「nCas9」という)、又はヌクレアーゼ活性の全部を喪失しているCas9タンパク質(本明細書では「dCas9」という)であることが必要である。すなわち、Cas9タンパク質は、典型的には、標的鎖の切断に関与するドメイン(RuvCドメイン)及び非標的鎖の切断に関与するドメイン(HNHドメイン)を含むが、本発明に係るCas9タンパク質としては、変異の導入等により、少なくとも一方のドメインのヌクレアーゼ活性が喪失していることが必要である。
本発明に係るCRISPR-Cas9システムにおいて、ガイドRNAは、標的となるDNA上の配列(ガイドRNA標的配列)に相補的な塩基配列(以下、場合により「標的化塩基配列」という)を含むcrRNA(CRISPR RNA)と、tracrRNA(trans-activating crRNA)と、の組み合わせである。前記crRNAは、さらに、tracrRNAと相互作用(ハイブリダイズ)が可能な塩基配列を3’側に含む。一方、tracrRNAは、crRNAの一部の塩基配列と相互作用(ハイブリダイズ)が可能な塩基配列を5’側に含むため、前記ガイドRNAは、これら塩基配列の相互作用により、Cas9タンパク質と相互作用する二重鎖RNAを形成する。このため、前記ガイドRNAが、前記標的DNAのガイドRNA標的配列を認識して結合し、当該ガイドRNAと複合体を形成するCas9タンパク質を当該標的DNAに誘導し、誘導されたCas9タンパク質が、そのヘリカーゼ活性によって、当該標的DNAの標的部位において一本鎖DNAを露出する。
本発明においては、目的とするDNA編集の対象となる標的部位を含むDNAを「標的DNA」という。本発明に係る標的DNAは二本鎖DNAであり、上記の融合タンパク質及びCRISPR-Cas9システムとの対応を示すために、便宜的に、少なくとも一方の鎖が、5’側から順に、TALE認識配列又はその相補配列、スペーサー1、前記標的部位、PAM配列、を含む構造であるものとする。5’側にTALE認識配列の相補配列を含むとは、当該鎖の相補鎖が、TALE認識配列を含むことを示す。すなわち、TALE認識配列は、前記標的部位と同じ鎖上に設定しても、その相補鎖上に設定してもよい。また、当該鎖の相補鎖が、前記標的部位の相補塩基を含むように、ガイドRNA標的配列を含むものとする。本発明に係る標的DNAの構成の一例を下記の図2に示すが、これに限定されるものではない。
本発明の標的DNA編集方法では、前記融合タンパク質及び前記CRISPR-Cas9システムを、前記標的DNAに接触させ、前記融合タンパク質の核酸塩基変換酵素活性により、前記標的DNAの標的部位の塩基を編集する。
本発明の標的DNAが編集された細胞の製造方法は、
標的DNAが編集された細胞を製造する方法であり、
(1)TALE及び核酸塩基変換酵素を含む融合タンパク質、並びに、
(2)ヌクレアーゼ活性の一部又は全部を喪失したCas9タンパク質及びそのガイドRNAを含むCRISPR-Cas9システム
を、細胞に導入又は細胞内で発現させて標的DNAに接触させ、前記融合タンパク質の核酸塩基変換酵素活性により前記標的DNAの標的部位の塩基を編集する工程を含み、
前記融合タンパク質におけるTALEが認識するTALE認識配列又はその相補配列は、前記標的部位の5’側に7~31bpの鎖長のスペーサー1を介して存在し、かつ、
前記CRISPR-Cas9システムにおけるガイドRNAが認識するガイドRNA標的配列は、前記標的部位の相補塩基を含むように存在する、
方法である。
また、本発明は、上記本発明の標的DNA編集方法、本発明の標的DNAが編集された細胞の製造方法、又は上記本発明の非ヒト個体の作製方法に用いるための、
(1)TALE及び核酸塩基変換酵素を含む融合タンパク質、並びに、
(2)ヌクレアーゼ活性の一部又は全部を喪失したCas9タンパク質及びそのガイドRNAを含むCRISPR-Cas9システム
を含む、DNA編集システムを提供する。
(a)TALEをコードするポリヌクレオチドの挿入部位と前記融合タンパク質のTALE以外をコードするポリヌクレオチドを含むベクター、並びに、
(b)ヌクレアーゼ活性の一部又は全部を喪失したCas9タンパク質をコードするポリヌクレオチド、又は、該ポリヌクレオチドを含むベクター、
(c)前記Cas9タンパク質のガイドRNAをコードするポリヌクレオチド、又は、該ポリヌクレオチドを含むベクター若しくは該ポリヌクレオチドの挿入部位を含むベクター、
を含む、DNA編集システムを提供する。なお、(a)~(c)の各ベクターは、それぞれ、各ポリヌクレオチドを発現可能にする発現ユニットを含むものである。
レポーターとして、pNLF1-C[CMV Hygro](PROMEGA社製,WI,USA)を用い、部位特異的変異導入法によって71番目と107番目のメチオニンコドン(ATG)をアラニンコドン(ATA)に変更してpNLF-M/Aとした。次いで、pNLF-M/Aに、先ず、下記の表1に示した配列(BEtag)を挿入した。BEtag配列は、TALE認識配列(表1中の下線(*1):5’-tACAGAAGCGGGCAAAGG-3’;小文字はTALEのN末ドメインが認識するチミンを示す)、ターゲットコドン(表1中の下線(*2):5’-ACG-3’)、及びこれらの間のスペーサー1配列を含み、スペーサー1配列の長さは、ターゲットコドンの5’にある塩基(A)を含めて、7bp、13bp、19bp、25bp、又は31bpとした。各BEtag配列(BEtag-7~31bp:配列番号1~5)には、両端にNheI認識サイト及びSbfI認識サイトを付加できるように設計したオリゴヌクレオチドをアニールして、NheI及びSbfIで処理したpNLF-M/Aに挿入し、pNLF-BEtag(pNLF-BEtag-7bp~31bp)とした。
TALEベクターセットの配列及び構成は、TALEリピートユニットが上記1.で作製したレポータープラスミドのTALE認識配列に対応するように、Sakuma et al.(2013)にしたがって構築した。先ず、12番目と13番目の2アミノ酸で構成される可変残基の4種(RVD:HD、NG、NI、NN)をコードする配列以外のモジュール配列(non-repeat-variable di-residue(non-RVD))に改変(特に、4番目と32番目)を加え、さらにその両端に制限酵素BsAI認識サイトを付加した配列(モジュール配列)を人工DNA合成した(計16種:1HD~4HD、1NG~4NG、1NI~4NI、1NN~4NN)。次いで、これらモジュール配列をpEX-A2J2(Eurofins Genomics社製,Tokyo,Japan)に挿入して、モジュールプラスミドセット(計16種:pEX1HD~pEX4HD、pEX1NG~pEX4NG、pEX1NI~pEX4NI、及びpEX1NN~pEX4NN)を作製した。
(1)104アミノ酸リンカーシリーズ
先ず、人工DNA合成により作製したAncBE4max遺伝子(Koblan et al.,”Improving cytidine and adenine base editors by expression optimization and ancestral reconstruction.”,Nat Biotechnol,36,p.843-846(2018))と、Target-AID遺伝子(Nishida et al.,”Targeted nucleotide editing using hybrid prokaryotic and vertebrate adaptive immune systems.”,Science 353,aaf8729(2016))と、をpcDNA3.1sのBamHI認識サイトとEcoRV認識サイトとの間にそれぞれ挿入することにより、AncBE4max発現プラスミド及びTarget-AID発現プラスミドを作製した。
また、前記104アミノ酸リンカー1に代えて、12アミノ酸リンカー1(12aa)を有するTALE-デアミナーゼ発現プラスミドは以下の方法で作製した。すなわち、先ず、下記の表3に記載のオリゴヌクレオチドをそれぞれアニーリングさせて12アミノ酸リンカー1をコードする配列を作製した。表3中、12aa linker(配列番号18~21)のfor以下は、例えば、「TALE_WT-12-AID」は、当該配列が、TALEのC末ドメインがWTであり、かつ、前記12アミノ酸リンカー1及びPmCDA1デアミナーゼ(AID)を含むTALE-デアミナーゼ用のリンカー1をコードする配列であることを示す。かかる12アミノ酸リンカー1は、Yang L et al.,”Engineering and optimising deaminase fusions for genome editing.”,Nat Commun 7 13330(2016)に記載の12アミノ酸からなる。次いで、上記TALE-デアミナーゼ発現プラスミドの104アミノ酸リンカー1をコードする配列以外を、下記の表4に示したプライマー(配列番号9、22~24)を用いて、それぞれ、インバースPCRにて増幅し、前記12アミノ酸リンカー1をコードする配列をIn-Fusion法により挿入した。下記の表3に、12アミノ酸リンカー1をコードする配列及びその配列番号を示し、表4に、インバースPCRに使用したプライマー及びその配列番号を示す。
さらに、TALE-47-104-AIDにおいて、104アミノ酸リンカー1に代えて、同104アミノ酸リンカー1のC末側から順にアミノ酸を欠失させ、アミノ酸長を12、24、36、48、60、又は84となるようにした各アミノ酸リンカー1をコードするTALE-デアミナーゼ発現プラスミドを以下の方法で作製した。すなわち、上記TALE-デアミナーゼ発現プラスミドの、欠失させるアミノ酸をコードする配列以外を、下記の表5に記載のプライマー(配列番号9、26~31)を用いてインバースPCRにて増幅し、In-Fusion法によりセルフライゲーションさせることで作製した。下記の表5に、TALE-47-104-AIDのアミノ酸リンカー1欠失変異体作製に使用したプライマー及びその配列番号を示す。
先ず、pX330(Addgene,Cambridge,MA;Plasmid 42230)のU6プロモーターからBGHポリA付加配列までの領域を人工DNA合成し、pBlueScriptII(SK+)(Stratagene,La Jolla,CA,USA)のEcoRV認識サイトとBamHI認識サイトとの間にIn-Fusion法により挿入することでCas9発現プラスミドpX330_BSを作製した。次いで、pX330_BSのCas9遺伝子に対してPCRによる部位特異的変異導入法を行うことにより、ニッカーゼタイプのnCas9(D10A)発現プラスミド、及び切断活性のないdCas9(D10A+H840A)発現プラスミドをそれぞれ作製した。
先ず、上記4.で作製したpX330_BSをXbaI及びNotIで処理してCas9遺伝子を除去し、フィルイン反応後、セルフライゲーションによりプラスミド(pX330_BS-ΔCas9)を作製した。次いで、下記の表7に示した、ガイドRNAの標的配列(表7には、ガイドRNAの標的配列(アンチセンス鎖上)の相補配列を示す)(表7の下線以外の配列)に対応するように設計したオリゴヌクレオチドをアニールさせ、BpiI処理とライゲーションとによってpX330_BS-ΔCas9に挿入して、各ガイドRNA(sgRNA-1~8)を発現するプラスミド(ガイドRNA発現プラスミド1~8)を作製した。下記の表7に、ガイドRNAの標的配列の相補配列(下線以外)及びCas9のPAM配列(下線)を示す。
10%FBSを含むDMEM培地で生育させたHEK293T細胞を5×104細胞ずつ96ウェルプレートの各ウェルに播種した。表6に記載のTALE-デアミナーゼ発現プラスミド、及びTALE-デアミナーゼ-NC発現プラスミドのうちのいずれか75ng;上記4.で作製したnCas9発現プラスミド又はdCas9発現プラスミド50ng;上記5.で作製したガイドRNA発現プラスミド1~8及びpX330_BS-ΔCas9(ガイドRNAなし)のうちのいずれか10ng;上記1.で作製したレポータープラスミドのうちのいずれか20ng;並びにリファレンスプラスミドであるpGL4.54(PROMEGA社製)5ng;をLipofectamine LTX(Thermo Fisher Scientific社製)を用いてHEK293T細胞に導入した。リファレンスプラスミドは、ホタルルシフェラーゼ(Fluc)を発現するプラスミドである。
形質転換してから24時間培養後に培地を除去し、PBS(-)で洗浄後、Passive Lysis Buffer(PROMEGA社製)で処理して細胞を溶解し、細胞溶解液とした。前記細胞溶解液をDMEM培地で100倍に希釈し、Nano-Glo Dual-Luciferase Reporter Assay System(PROMEGA社製)を用いて、TriStar S LB942 プレートリーダー(Berthold Technologies社製,Bad Wildbad,Germany)でNanoLucルシフェラーゼ活性スコア及びホタルルシフェラーゼ活性スコアを測定した。
ポジティブコントロールとして、既存のBase editorであるAncBE4maxが最も高い活性を示すPAM配列の位置を調べるために、PAM配列の位置を1塩基ずつ3’側(下流)に移動させ(PAM1~PAM8)、塩基編集活性を調べた。
(BE4シリーズ)
TALEのC末ドメインがWT又は47であり、アミノ酸リンカー1が104アミノ酸であり、デアミナーゼがBE4(Anc689)である2種類のTALE-デアミナーゼ(WT-104-BE4、47-104-BE4)について、PAM配列の位置をPAM2に固定して、レポーターのスペーサー1の長さが活性に与える影響を調べた。スペーサー1の長さは、7bp、13bp、19bp、25bp、31bpとした。また、Cas9/ガイドRNAとしては、nCas9(D10A)及びガイドRNAの発現あり(nCas9)、dCas9(D10A+H840A)及びガイドRNAの発現あり(dCas9)、nCas9の発現あり及びガイドRNAの発現なし(nCas9/ガイドなし)の3条件について検討した。
TALEのC末ドメインがWT、63、又は47であり、アミノ酸リンカー1が104アミノ酸又は12アミノ酸であり、デアミナーゼがAID(PmCDA1)である6種類のTALE-デアミナーゼ(WT-104/12-AID、63-104/12-AID、47-104/12-AID)について、上記のBE4と同様にして、レポーターのスペーサー1の長さが活性に与える影響を調べた。
レポーターのスペーサー1の長さを固定し、PAM配列の位置を1塩基ずつ3’側(下流)に移動させて、PAM配列の位置(つまり、Cas9の位置に対応する)がTALE-デアミナーゼの活性に与える影響を調べた。スペーサー1長としては、試験2で活性評価を行った際に比較的活性が高かった13bp、19bp、25bpについて検討を行った。
上記試験3から、アミノ酸リンカー1として、104アミノ酸よりも12アミノ酸の方が活性が高いことが判明したが、そのアミノ酸長自体については検討していない。そこで、上記3.(3)で作製したTALE-デアミナーゼ発現プラスミド(47-12’~84’-AID:アミノ酸リンカー1長:84アミノ酸、60アミノ酸、36アミノ酸、24アミノ酸、12アミノ酸)を用いて、アミノ酸リンカー1の長さの影響を調べた。
(1)BE4シリーズ(BE4-TALE)
(a)32アミノ酸リンカーシリーズ(BE4-32-TALE)
N末から、Anc689デアミナーゼ(BE4)、リンカー1(32アミノ酸)、TALE(C末ドメイン:WT、63、又は47)の順に配置されるTALE-デアミナーゼ(以下、場合により「BE4-32-TALE」という)の発現プラスミドを以下の方法により作製した。すなわち、先ず、3.(1)において作製したAncBE4max発現プラスミドを鋳型として、下記の表8に記載のプライマーを用いて、32アミノ酸リンカー1をコードする配列を含めて5’側(プライマー配列番号25、73~74)、10アミノ酸リンカー2及びUGIの2回繰り返しをコードする配列を含めて3’側(プライマー配列番号12、75~77)をそれぞれPCRにより増幅した。また、2.で作製したデスティネーションベクター(TALE-WT、TALE-63、TALE-47)をそれぞれ鋳型として、各TALEをコードする配列(プライマー配列番号22~24、78~79)をPCRにより増幅した。下記の表8に、BE4-32-TALE発現プラスミドを構築するために使用した各プライマー及びその配列番号を示す。
次いで、デアミナーゼとTALEとの間のリンカー1長(32アミノ酸)をコードする配列の下流側に、既知のリンカー配列であるHTS95アミノ酸配列(Sun,N. and Zhao,H.,Mol BioSyst 10,p.446-453(2014),Doi:10.1039/c3mb70412b)をコードする配列又はその2回繰り返しをコードする配列を挿入したTALE-デアミナーゼ(以下、場合により「BE4-135-TALE」又は「BE4-234-TALE」という)の発現プラスミド(リンカー1長:HTS95アミノ酸配列の前後に付加した繋ぎ配列を含めて、135又は234アミノ酸)を以下の方法により作製した。すなわち、8.(1)(a)で作製したBE4-32-WT、BE4-32-63、BE4-32-47の各デスティネーションベクターを鋳型として、下記の表9に記載のプライマーを用いて、32アミノ酸リンカー1をコードする配列を含めて5’側(プライマー配列番号25、80)、各TALEをコードする配列以降の3’側(プライマー配列番号12、81~82)をそれぞれPCRにより増幅した。下記の表9に、使用した各プライマー及びその配列番号を示す。
先ず、人工DNA合成により作製したABEmax遺伝子(Koblan et al.,Nat Biotechnol,36,p.843-846(2018))を、pcDNA3.1sのBamHI認識サイトとEcoRV認識サイトとの間にそれぞれ挿入することにより、ABEmax発現プラスミド2を作製した。
TALE認識配列として、下記の表13に記載の8種類の内因性DNA上の配列がターゲットとなるように、2.と同様の方法でTALE配列を連結したアレイプラスミドを作製した。前記内因性DNAとしては、相同性の高いファミリー遺伝子(CCR2及びHBD)が存在するCCR5とHBBを設定した。
レポーターとして、1.で作製したpNLF-M/Aに、下記の表15に示した配列(N-CBE(BE4-TALE用)又はN-ABE(ABE8e-TALE用))を挿入し、BE4-TALE用、及びABE8e-TALE用のレポータープラスミドをそれぞれ作製した。挿入配列は、TALE認識配列の相補配列(表15中の下線(*1):5’-CCTTTGCCCGCTTCTGTa-3’;小文字はTALEのN末ドメインが認識するチミンの相補塩基を示す)、ターゲットコドン(表15中の下線(*2):5’-ACG-3’又は5’-TAG-3’)、及びこれらの間のスペーサー1配列を含み、スペーサー1の長さは、ターゲットコドンの5’にある塩基を含めて、7bp、13bp、19bp、25bp、又は31bpとした。下記の表15に、pNLF-M/Aプラスミドに挿入した各配列を示す。
ガイドRNAを発現するプラスミドを、ガイドRNAの標的配列(表16には、ガイドRNAの標的配列(アンチセンス鎖上)の相補配列を示す)に対応するように設計したこと以外は、5.と同様にして、各ガイドRNAを発現するプラスミド(ガイドRNA発現プラスミド)を作製した。下記の表16に、ガイドRNAの標的配列の相補配列を示す。
上記10.で作製した各レポータープラスミドを標的DNAとした場合は、6.に示した方法と同様に行った。すなわち、8.で作製したBE4-TALE発現プラスミド又はABE8e-TALE発現プラスミドのうちのいずれか75ng;4.で作製したnCas9発現プラスミド又はdCas9発現プラスミド50ng;11.で作製したガイドRNA発現プラスミド10ng、10.で作製したレポータープラスミドのうちのいずれか20ng;並びにリファレンスプラスミドであるpGL4.54 5ngを、Lipofectamine LTXを用いて、各ウェルあたり5×104個のHEK293T細胞に導入し、24時間培養した。
形質転換してから48時間後に培地を除去し、PBS(-)で洗浄後、DNAzol(Molecular Research Center,INC,Ohio,USA)50μLを添加して細胞を溶解した。細胞溶解液をテンプレートにして、下記の表17に記載のプライマーを用いて、PrimeSTAR Max(Takara Bio Inc.,Siga,Japan)により、前記内因性DNAのターゲットサイトを含む領域を増幅した。得られたPCR産物を精製後、株式会社ファスマックにシークエンスを依頼した。EditR(Kluesner et al.,The CRISPR J 1,p.239-250(2018),DOI:10.1089/crispr.2018.0014)を用いてシークエンスデータを解析し、ターゲットサイトの塩基編集効率(T又はGの割合)を算出した。
上記12.によりHEK細胞への形質転換を行い、上記7.によりTALE-デアミナーゼ活性値(AncBE4max活性値又はABE8e活性値に対する割合)をそれぞれ取得した。
N末から、Anc689デアミナーゼ(BE4)、リンカー1(32アミノ酸、135アミノ酸、又は234アミノ酸)、TALE(C末ドメイン:WT、63、又は47)の順に配置されるTALE-デアミナーゼ(BE4-TALE-WT、BE4-TALE-63、BE4-TALE-47)について、レポーターのスペーサー1の長さ及びリンカー1の長さが活性に与える影響を調べた。スペーサー1の長さは、7bp、13bp、19bp、25bp、31bpとした。また、Cas9/ガイドRNAとしては、nCas9(D10A)及びガイドRNAの発現あり(nCas9)、dCas9(D10A+H840A)及びガイドRNAの発現あり(dCas9)、nCas9の発現あり及びガイドRNAの発現なし(nCas9/ガイドなし)の3条件について検討した。
N末から、ABE8eデアミナーゼ(ABE8e)、リンカー1(32アミノ酸、135アミノ酸、又は234アミノ酸)、TALE(C末ドメイン:47)の順に配置されるTALE-デアミナーゼ(ABE8e-TALE-47)についても、上記BE4シリーズと同様にして、レポーターのスペーサー1の長さ及びリンカー1の長さが活性に与える影響を調べた。
上記で特に高い活性が得られたTALE-デアミナーゼについて、上記12.及び13.に記載の方法により、内因性DNAに対する塩基編集活性を調べた。前記内因性DNAとしては、相同性の高いファミリー遺伝子(CCR2及びHBD)が存在するCCR5とHBBを設定した。図13A及び図13Bに、前記内因性DNA(CCR5、HBB)上のターゲット領域(AS3(a:配列番号221)、AS14(b:配列番号222)(e:配列番号225)、S2(c:配列番号223)、AS5(d:配列番号224)、AS6(f:配列番号226)、AS7(g:配列番号227))、TALE認識配列又はその相補配列の位置、ガイドRNAの標的配列の相補配列の位置をそれぞれ示す。また、図13A及び図13Bには、(a)、(b)、(d)、(e)に示すCCR5と相同性の高いCCR2、(c)、(f)、(g)に示すHBBと相同性の高いHBDとのアラインメントとして、各領域の塩基と異なる塩基もそれぞれ示す。
N末から、TALE(C末ドメイン:WT、又は47)、リンカー1(12アミノ酸、又は104アミノ酸)、デアミナーゼ(AID)の順に配置されるTALE-AID(WT(*)-12-AID、47(*)-12-AID、47(*)-104-AID:*はそれぞれターゲット)を用いたときの結果を図14~図16に示す。各TALE-AIDにおいて、TALE認識配列は、それぞれ、図13Aに示すAS3(a)、AS14(b)、S2(c)上にある。図14には、ターゲットAS3上の、ガイドRNA(sgRNA-AS3)の標的配列の相補配列の1塩基目(A)、10塩基目(B)、11塩基目(C)におけるTの割合(すなわち、塩基がCからTに編集された割合)を示し、図15には、ターゲットAS14上の、ガイドRNA(sgRNA-AS14)の標的配列の相補配列の1塩基目(A)、5塩基目(B)、6塩基目(C)におけるTの割合を示し、図16には、ターゲットS2上の、ガイドRNA(sgRNA-S2)の標的配列の相補配列の1塩基目(A)、8塩基目(B)、9塩基目(C)におけるTの割合を示す。
N末から、デアミナーゼ(BE4)、リンカー1(32アミノ酸、又は135アミノ酸)、TALE(C末ドメイン:47、又は63)の順に配置されるBE4-TALE(BE4-32-47(*)、BE4-135-63(*):*はそれぞれターゲット)を用いたときの結果を図17~図20に示す。各BE4-TALEにおいて、TALE認識配列は、それぞれ、図13Bに示すAS5(d)、AS14(e)、AS6(f)、AS7(g)の相補配列上にある。図17には、ターゲットAS5上の、ガイドRNA(sgRNA-AS5)の標的配列の相補配列の7塩基目(A)、8塩基目(B)におけるTの割合を示し、図18には、ターゲットAS14上の、ガイドRNA(sgRNA-AS14)の標的配列の相補配列の1塩基目(A)、5塩基目(B)、6塩基目(C)におけるTの割合を示し、図19には、ターゲットAS6上の、ガイドRNA(sgRNA-AS6)の標的配列の相補配列の5塩基目(A)、6塩基目(B)におけるTの割合を示し、図20には、ターゲットAS7上の、ガイドRNA(sgRNA-AS7)の標的配列の相補配列の6塩基目(A)、7塩基目(B)におけるTの割合を示す。
N末から、ABE8eデアミナーゼ(ABE8e)、リンカー1(135アミノ酸)、TALE(C末ドメイン:47)の順に配置されるABE8e-TALE(ABE8e-135-47)を用いたときの結果を図21~図23に示す。各ABE8e-TALEにおいて、TALE認識配列は、それぞれ、図13Bに示すAS5(d)、AS6(f)、AS7(g)の相補配列上にある。図21には、ターゲットAS5上の、ガイドRNA(sgRNA-AS5)の標的配列の相補配列の3塩基目(A)、4塩基目(B)におけるGの割合(すなわち、塩基がAからGに編集された割合)を示し、図22には、ターゲットAS6上の、ガイドRNA(sgRNA-AS6)の標的配列の相補配列の4塩基目(A)、7塩基目(B)におけるGの割合を示し、図23には、ターゲットAS7上の、ガイドRNA(sgRNA-AS7)の標的配列の相補配列の4塩基目(A)、8塩基目(B)におけるGの割合を示す。
Claims (11)
- 標的DNAを編集する方法であり、
(1)TALE及び核酸塩基変換酵素を含む融合タンパク質、並びに、
(2)ヌクレアーゼ活性の一部又は全部を喪失したCas9タンパク質及びそのガイドRNAを含むCRISPR-Cas9システム
を、標的DNAに接触させ、前記融合タンパク質の核酸塩基変換酵素活性により前記標的DNAの標的部位の塩基を編集する工程を含み、
前記融合タンパク質におけるTALEが認識するTALE認識配列又はその相補配列は、前記標的部位の5’側に7~31bpの鎖長のスペーサー1を介して存在し、かつ、
前記CRISPR-Cas9システムにおけるガイドRNAが認識するガイドRNA標的配列は、前記標的部位の相補塩基を含むように存在する、
方法。 - 標的DNAが編集された細胞を製造する方法であり、
(1)TALE及び核酸塩基変換酵素を含む融合タンパク質、並びに、
(2)ヌクレアーゼ活性の一部又は全部を喪失したCas9タンパク質及びそのガイドRNAを含むCRISPR-Cas9システム
を、細胞に導入又は細胞内で発現させて標的DNAに接触させ、前記融合タンパク質の核酸塩基変換酵素活性により前記標的DNAの標的部位の塩基を編集する工程を含み、
前記融合タンパク質におけるTALEが認識するTALE認識配列又はその相補配列は、前記標的部位の5’側に7~31bpの鎖長のスペーサー1を介して存在し、かつ、
前記CRISPR-Cas9システムにおけるガイドRNAが認識するガイドRNA標的配列は、前記標的部位の相補塩基を含むように存在する、
方法。 - 前記融合タンパク質がさらに、TALEと前記核酸塩基変換酵素とを結合するリンカー1を含む、請求項1又は2に記載の方法。
- 前記融合タンパク質がさらに、C末側にリンカー2を介して結合された塩基除去修復インヒビターを含む、請求項1~3のうちのいずれか一項に記載の方法。
- 前記融合タンパク質における核酸塩基変換酵素がデアミナーゼである、請求項1~4のうちのいずれか一項に記載の方法。
- 前記デアミナーゼが、APOBEC、PmCDA1、Anc689、及びTadAからなる群から選択される少なくとも1種である、請求項5に記載の方法。
- 請求項1~6のうちのいずれか一項に記載の方法に用いるためのDNA編集システムであり、
(1)TALE及び核酸塩基変換酵素を含む融合タンパク質、並びに、
(2)ヌクレアーゼ活性の一部又は全部を喪失したCas9タンパク質及びそのガイドRNAを含むCRISPR-Cas9システム
を含む、DNA編集システム。 - 前記融合タンパク質がさらに、TALEと前記核酸塩基変換酵素とを結合するリンカー1を含む、請求項7に記載のDNA編集システム。
- 前記融合タンパク質がさらに、C末側にリンカー2を介して結合された塩基除去修復インヒビターを含む、請求項7又は8に記載のDNA編集システム。
- 前記融合タンパク質における核酸塩基変換酵素がデアミナーゼである、請求項7~9のうちのいずれか一項に記載の方法。
- 前記デアミナーゼが、APOBEC、PmCDA1、Anc689、及びTadAからなる群から選択される少なくとも1種である、請求項10に記載のDNA編集システム。
Priority Applications (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP21864429.2A EP4209588A1 (en) | 2020-09-04 | 2021-09-03 | Method for editing target dna, method for producing cell having edited target dna, and dna edition system for use in said methods |
JP2022546983A JPWO2022050377A1 (ja) | 2020-09-04 | 2021-09-03 | |
US18/043,814 US20230340455A1 (en) | 2020-09-04 | 2021-09-03 | Method for editing target dna, method for producing cell in which target dna is edited, and dna editing system for use in these |
AU2021338062A AU2021338062A1 (en) | 2020-09-04 | 2021-09-03 | Method for editing target dna, method for producing cell having edited target dna, and dna edition system for use in said methods |
KR1020237010913A KR20230058482A (ko) | 2020-09-04 | 2021-09-03 | 표적 dna의 편집 방법, 표적 dna가 편집된 세포의 제조 방법, 및 그것들에 사용하는 dna 편집 시스템 |
CN202180054310.XA CN116157517A (zh) | 2020-09-04 | 2021-09-03 | 目标dna的编辑方法、编辑了目标dna的细胞的制备方法及用于它们的dna编辑系统 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2020-148944 | 2020-09-04 | ||
JP2020148944 | 2020-09-04 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022050377A1 true WO2022050377A1 (ja) | 2022-03-10 |
Family
ID=80491073
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2021/032436 WO2022050377A1 (ja) | 2020-09-04 | 2021-09-03 | 標的dnaの編集方法、標的dnaが編集された細胞の製造方法、及びそれらに用いるdna編集システム |
Country Status (7)
Country | Link |
---|---|
US (1) | US20230340455A1 (ja) |
EP (1) | EP4209588A1 (ja) |
JP (1) | JPWO2022050377A1 (ja) |
KR (1) | KR20230058482A (ja) |
CN (1) | CN116157517A (ja) |
AU (1) | AU2021338062A1 (ja) |
WO (1) | WO2022050377A1 (ja) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014131833A1 (en) | 2013-02-27 | 2014-09-04 | Helmholtz Zentrum München Deutsches Forschungszentrum Für Gesundheit Und Umwelt (Gmbh) | Gene editing in the oocyte by cas9 nucleases |
WO2018218166A1 (en) | 2017-05-25 | 2018-11-29 | The General Hospital Corporation | Using split deaminases to limit unwanted off-target base editor deamination |
JP2018537963A (ja) * | 2015-10-23 | 2018-12-27 | プレジデント アンド フェローズ オブ ハーバード カレッジ | 遺伝子編集のための進化したCas9蛋白質 |
WO2020051562A2 (en) * | 2018-09-07 | 2020-03-12 | Beam Therapeutics Inc. | Compositions and methods for improving base editing |
-
2021
- 2021-09-03 JP JP2022546983A patent/JPWO2022050377A1/ja active Pending
- 2021-09-03 AU AU2021338062A patent/AU2021338062A1/en active Pending
- 2021-09-03 US US18/043,814 patent/US20230340455A1/en active Pending
- 2021-09-03 WO PCT/JP2021/032436 patent/WO2022050377A1/ja unknown
- 2021-09-03 EP EP21864429.2A patent/EP4209588A1/en active Pending
- 2021-09-03 CN CN202180054310.XA patent/CN116157517A/zh active Pending
- 2021-09-03 KR KR1020237010913A patent/KR20230058482A/ko unknown
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014131833A1 (en) | 2013-02-27 | 2014-09-04 | Helmholtz Zentrum München Deutsches Forschungszentrum Für Gesundheit Und Umwelt (Gmbh) | Gene editing in the oocyte by cas9 nucleases |
JP2018537963A (ja) * | 2015-10-23 | 2018-12-27 | プレジデント アンド フェローズ オブ ハーバード カレッジ | 遺伝子編集のための進化したCas9蛋白質 |
WO2018218166A1 (en) | 2017-05-25 | 2018-11-29 | The General Hospital Corporation | Using split deaminases to limit unwanted off-target base editor deamination |
JP2020521446A (ja) * | 2017-05-25 | 2020-07-27 | ザ ジェネラル ホスピタル コーポレイション | 二部塩基エディター(bbe)構造およびii型−c−cas9ジンクフィンガー編集 |
WO2020051562A2 (en) * | 2018-09-07 | 2020-03-12 | Beam Therapeutics Inc. | Compositions and methods for improving base editing |
Non-Patent Citations (17)
Title |
---|
"Genbank", Database accession no. KX151730.1 |
BENJAMIN, P. ET AL., NATURE, vol. 523, 2015, pages 481 - 485 |
GAUDELLI, N. ET AL., NATURE, vol. 551, 2017, pages 464 - 471 |
GEHRKE, J. ET AL.: "Improving cytidine and adenine base editors by expression optimization and ancestral reconstruction.", NAT BIOTECHNOL, vol. 36, 2018, pages 843 - 846, XP036929657, DOI: 10.1038/nbt.4172 |
HIRANO, S. ET AL., MOLECULAR CELL, vol. 61, 2016, pages 886 - 894 |
KLUESNER ET AL., THE CRISPR J, vol. 1, 2018, pages 239 - 250 |
KOBLAN ET AL., NAT BIOTECHNOL, vol. 36, 2018, pages 843 - 846 |
KOMOR, A. ET AL., NATURE, vol. 533, 2016, pages 420 - 424 |
LEONARD G. DAVIS ET AL.: "Basic methods in molecular biology", 1986, ELSEVIER |
MARTIN, A. S. ET AL., SCI REP, vol. 9, 2019, pages 497 |
MILLER ET AL., NAT. BIOTECHNOL., vol. 29, 2011, pages 143 - 148 |
NISHIDA, K. ET AL.: "Targeted nucleotide editing using hybrid prokaryotic and vertebrate adaptive immune systems.", SCIENCE, vol. 353, 2016, pages aaf8729, XP055482712, DOI: 10.1126/science.aaf8729 |
REES, H. A.LIU, D. R., NAT REV GENET, vol. 19, 2018, pages 770 - 788 |
RICHTER, M. F. ET AL., NAT BIOTECHNOL, vol. 38, 2020, pages 883 - 891 |
SAKUMA ET AL.: "Repeating pattern of non-RVD variations in DNA-binding modules enhances TALEN activity.", SCI. REP., vol. 3, 2013, pages 3379 |
SUN, N.ZHAO, H., MOL BIOSYST, vol. 10, 2014, pages 446 - 453 |
YANG, L. ET AL.: "Engineering and optimising deaminase fusions for genome editing.", NAT COMMUN, vol. 7, 2016, pages 13330, XP055415680, DOI: 10.1038/ncomms13330 |
Also Published As
Publication number | Publication date |
---|---|
US20230340455A1 (en) | 2023-10-26 |
JPWO2022050377A1 (ja) | 2022-03-10 |
AU2021338062A1 (en) | 2023-05-04 |
CN116157517A (zh) | 2023-05-23 |
EP4209588A1 (en) | 2023-07-12 |
KR20230058482A (ko) | 2023-05-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6480647B1 (ja) | Dnaが編集された真核細胞を製造する方法、および当該方法に用いられるキット | |
JP6958917B2 (ja) | 遺伝子ノックイン細胞の作製方法 | |
US20190223417A1 (en) | Genetically modified animals having increased heat tolerance | |
Ma et al. | Building Cre knockin rat lines using CRISPR/Cas9 | |
JP7426101B2 (ja) | ゲノム編集された細胞を製造する方法 | |
CN113646429A (zh) | 敲入细胞的制作方法 | |
WO2022050377A1 (ja) | 標的dnaの編集方法、標的dnaが編集された細胞の製造方法、及びそれらに用いるdna編集システム | |
EP4242237A1 (en) | Foki nuclease domain variant | |
JP2023131616A (ja) | Dna編集システム、並びに、それを用いた標的dnaの編集方法及び標的dnaが編集された細胞の製造方法 | |
WO2021171688A1 (ja) | 遺伝子ノックイン方法、遺伝子ノックイン細胞作製方法、遺伝子ノックイン細胞、がん化リスク評価方法、がん細胞製造方法、及びこれらに用いるためのキット | |
EA040859B1 (ru) | Способ получения эукариотических клеток с отредактированной днк и набор, используемый в этом способе |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21864429 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2022546983 Country of ref document: JP Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 20237010913 Country of ref document: KR Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2021864429 Country of ref document: EP Effective date: 20230404 |
|
ENP | Entry into the national phase |
Ref document number: 2021338062 Country of ref document: AU Date of ref document: 20210903 Kind code of ref document: A |