CN116790566A - Nuclease with novel base editing function and editing system - Google Patents
Nuclease with novel base editing function and editing system Download PDFInfo
- Publication number
- CN116790566A CN116790566A CN202310149826.3A CN202310149826A CN116790566A CN 116790566 A CN116790566 A CN 116790566A CN 202310149826 A CN202310149826 A CN 202310149826A CN 116790566 A CN116790566 A CN 116790566A
- Authority
- CN
- China
- Prior art keywords
- ddd
- editing
- ddbe
- site
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 101710163270 Nuclease Proteins 0.000 title description 2
- 230000035772 mutation Effects 0.000 claims abstract description 12
- 102000004190 Enzymes Human genes 0.000 claims abstract description 10
- 108090000790 Enzymes Proteins 0.000 claims abstract description 10
- 108090000623 proteins and genes Proteins 0.000 claims description 32
- 108020004414 DNA Proteins 0.000 claims description 27
- 102000004169 proteins and genes Human genes 0.000 claims description 26
- 230000000694 effects Effects 0.000 claims description 23
- 102000053602 DNA Human genes 0.000 claims description 21
- 230000009615 deamination Effects 0.000 claims description 18
- 238000006481 deamination reaction Methods 0.000 claims description 17
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 claims description 10
- 102220272280 rs777826765 Human genes 0.000 claims description 7
- 229940104302 cytosine Drugs 0.000 claims description 5
- 108010073062 Transcription Activator-Like Effectors Proteins 0.000 claims description 4
- 210000004899 c-terminal region Anatomy 0.000 claims description 4
- 101710172430 Uracil-DNA glycosylase inhibitor Proteins 0.000 claims 1
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 108020005196 Mitochondrial DNA Proteins 0.000 abstract description 22
- 201000010099 disease Diseases 0.000 abstract description 8
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 8
- 238000010362 genome editing Methods 0.000 abstract description 2
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 abstract 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 abstract 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 abstract 1
- PAHHYDSPOXDASW-VGWMRTNUSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-1-[(2s)-2-amino-3-hydroxypropanoyl]pyrrolidine-2-carbonyl]amino]hexanoyl]amino]hexanoic acid Chemical group NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO PAHHYDSPOXDASW-VGWMRTNUSA-N 0.000 description 15
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 12
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 12
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 12
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 12
- 210000004027 cell Anatomy 0.000 description 11
- 238000003776 cleavage reaction Methods 0.000 description 10
- 230000007017 scission Effects 0.000 description 10
- 239000000758 substrate Substances 0.000 description 9
- 238000010442 DNA editing Methods 0.000 description 8
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical group O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 7
- 238000012163 sequencing technique Methods 0.000 description 7
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 239000011780 sodium chloride Substances 0.000 description 6
- 230000006463 DNA deamination Effects 0.000 description 5
- 102100037111 Uracil-DNA glycosylase Human genes 0.000 description 5
- 230000002438 mitochondrial effect Effects 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 102200143001 rs796052126 Human genes 0.000 description 5
- 238000001890 transfection Methods 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 239000011324 bead Substances 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 238000004925 denaturation Methods 0.000 description 4
- 230000036425 denaturation Effects 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 238000000034 method Methods 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- GUAHPAJOXVYFON-ZETCQYMHSA-N (8S)-8-amino-7-oxononanoic acid zwitterion Chemical group C[C@H](N)C(=O)CCCCCC(O)=O GUAHPAJOXVYFON-ZETCQYMHSA-N 0.000 description 3
- BZTDTCNHAFUJOG-UHFFFAOYSA-N 6-carboxyfluorescein Chemical compound C12=CC=C(O)C=C2OC2=CC(O)=CC=C2C11OC(=O)C2=CC=C(C(=O)O)C=C21 BZTDTCNHAFUJOG-UHFFFAOYSA-N 0.000 description 3
- 229920000936 Agarose Polymers 0.000 description 3
- 108091093088 Amplicon Proteins 0.000 description 3
- 101001109052 Homo sapiens NADH-ubiquinone oxidoreductase chain 4 Proteins 0.000 description 3
- 101000598279 Homo sapiens NADH-ubiquinone oxidoreductase chain 5 Proteins 0.000 description 3
- 101000632623 Homo sapiens NADH-ubiquinone oxidoreductase chain 6 Proteins 0.000 description 3
- 102100021506 NADH-ubiquinone oxidoreductase chain 4 Human genes 0.000 description 3
- 102100036971 NADH-ubiquinone oxidoreductase chain 5 Human genes 0.000 description 3
- 102100028386 NADH-ubiquinone oxidoreductase chain 6 Human genes 0.000 description 3
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 229960000789 guanidine hydrochloride Drugs 0.000 description 3
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical compound [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 229940035893 uracil Drugs 0.000 description 3
- 241000371430 Burkholderia cenocepacia Species 0.000 description 2
- 101100062880 Burkholderia cenocepacia (strain H111) dddA gene Proteins 0.000 description 2
- 241000589513 Burkholderia cepacia Species 0.000 description 2
- 108091033409 CRISPR Proteins 0.000 description 2
- 238000010354 CRISPR gene editing Methods 0.000 description 2
- 102000000311 Cytosine Deaminase Human genes 0.000 description 2
- 108010080611 Cytosine Deaminase Proteins 0.000 description 2
- 208000032087 Hereditary Leber Optic Atrophy Diseases 0.000 description 2
- 108010036650 Immunoproteins Proteins 0.000 description 2
- SRBFZHDQGSBBOR-HWQSCIPKSA-N L-arabinopyranose Chemical compound O[C@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-HWQSCIPKSA-N 0.000 description 2
- 201000000639 Leber hereditary optic neuropathy Diseases 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- 230000005484 gravity Effects 0.000 description 2
- 239000008062 guanidine hydrochloride buffer Substances 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 239000002096 quantum dot Substances 0.000 description 2
- 238000004153 renaturation Methods 0.000 description 2
- 102220005146 rs35262412 Human genes 0.000 description 2
- 238000007480 sanger sequencing Methods 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 1
- 241000726118 Acidovorax facilis Species 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 108020000946 Bacterial DNA Proteins 0.000 description 1
- 101000884048 Burkholderia cenocepacia (strain H111) Double-stranded DNA deaminase toxin A Proteins 0.000 description 1
- 241000862994 Chondromyces apiculatus Species 0.000 description 1
- KDXKERNSBIXSRK-RXMQYKEDSA-N D-lysine Chemical compound NCCCC[C@@H](N)C(O)=O KDXKERNSBIXSRK-RXMQYKEDSA-N 0.000 description 1
- 238000007399 DNA isolation Methods 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 241000257465 Echinoidea Species 0.000 description 1
- 241000791290 Falcatimonas Species 0.000 description 1
- 241000223218 Fusarium Species 0.000 description 1
- 108020005004 Guide RNA Proteins 0.000 description 1
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- 102100030649 Histone H2B type 1-J Human genes 0.000 description 1
- 101710086682 Histone H2B.1 Proteins 0.000 description 1
- 101000604411 Homo sapiens NADH-ubiquinone oxidoreductase chain 1 Proteins 0.000 description 1
- 101710116034 Immunity protein Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 208000006136 Leigh Disease Diseases 0.000 description 1
- 208000017507 Leigh syndrome Diseases 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- 206010058799 Mitochondrial encephalomyopathy Diseases 0.000 description 1
- 101710116852 Molybdenum cofactor sulfurase 1 Proteins 0.000 description 1
- 101710116850 Molybdenum cofactor sulfurase 2 Proteins 0.000 description 1
- 102100038625 NADH-ubiquinone oxidoreductase chain 1 Human genes 0.000 description 1
- 108091093105 Nuclear DNA Proteins 0.000 description 1
- 241000258130 Psammechinus miliaris Species 0.000 description 1
- 238000010357 RNA editing Methods 0.000 description 1
- 230000026279 RNA modification Effects 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 241000192031 Ruminococcus Species 0.000 description 1
- 241000134861 Ruminococcus sp. Species 0.000 description 1
- 101150081767 TXN2 gene Proteins 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 238000003277 amino acid sequence analysis Methods 0.000 description 1
- 150000001413 amino acids Chemical group 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 230000027455 binding Effects 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 210000000845 cartilage Anatomy 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000000799 fluorescence microscopy Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 230000025608 mitochondrion localization Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 238000007481 next generation sequencing Methods 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 102220236765 rs80358547 Human genes 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 238000004513 sizing Methods 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 238000012070 whole genome sequencing analysis Methods 0.000 description 1
- 239000011592 zinc chloride Substances 0.000 description 1
- 235000005074 zinc chloride Nutrition 0.000 description 1
- JIAARYAFYJHUJI-UHFFFAOYSA-L zinc dichloride Chemical compound [Cl-].[Cl-].[Zn+2] JIAARYAFYJHUJI-UHFFFAOYSA-L 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y305/00—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5)
- C12Y305/04—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5) in cyclic amidines (3.5.4)
- C12Y305/04001—Cytosine deaminase (3.5.4.1)
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Crystallography & Structural Chemistry (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The present application provides an enzyme Ddd _SS from Simiaoa sunii (Simiaoa) which can effectively deaminate cytidine in a GC environment. The application further provides an optimized derivative DdBE of Ddd _SS. Gene editing can be performed in a GC environment by using the enzymes provided by the application, so that mutation of disease-related mitochondrial DNA (mtDNA) is introduced, which cannot be realized in the prior art.
Description
Background
The base editing can realize the accurate mutation of target DNA sequence, and is helpful to analyze the functions of genes and regulatory elements, model diseases and develop medicines for treating diseases. Base editing in nuclear DNA is achieved by a base editing system based on aggregation of regularly spaced short palindromic repeats (clustered regularly interspaced short palindromic repeats, CRISPR). Unfortunately, CRISPR base editing systems are currently not suitable for mitochondrial DNA editing due to the lack of methods to deliver guide RNAs to mitochondria. Recently, base editors (dda-derived Cytosine Base Editor, ddCBE) derived from transcription activator-like effectors (Transcription Activator Like Effector, TALE) have been developed to catalyze the editing of c→t in mitochondrial DNA. These methods rely on ddatox, a double-stranded DNA editing enzyme from new burkholderia cepacia (Burkholderia cenocepacia). The original ddatox requires a stringent TC sequence site. Through phage-assisted evolution, mok et al further obtained a DddAtox variant that could catalyze the editing of C.fwdarw.T at the TC/AC/CC sequence site. However, there is currently no DdBE suitable for the GC sequence site. Furthermore, it is unclear whether double-stranded DNA deaminase is found in other species. In this study, the present inventors identified and designed novel double-stranded DNA deaminase homologs to solve the above-described problems.
Disclosure of Invention
Drawings
FIG. 1 shows that the C-terminal sequence of DddAtox protein has an important role in its dsDNA deamination activity. Fig. 1 (a): boxes indicate the sequence similar to the sequence of the SPKK motif in the reported sea urchin (Psammechinus miliaris) histone h2b.1 protein sequence and the sequence of the new burkholderia cepacia (Burkholderia cenocepacia) dda protein. Fig. 1 (b): schematic of the protein structure of wild-type ddatox (WT), truncations (Delta) of the similar sequence of the ddatox deletion sphk motif and fusion of the AT-Hook sequence (AT-Hook) on the basis of truncations. Fig. 1 (c): the proteins shown in FIG. 1 (b) were mixed with an equal amount of substrate dsDNA at different concentrations to react with protein deamination activity by identifying the relative proportion of dsDNA to the total amount of substrate that had undergone deamination, the data shown in the figures being mean.+ -. Standard deviation, and 3 independent replicates. FIG. 2 shows a comparison of various dsDNA cytosine deaminase enzymes homologous to DddAtox. Fig. 2 (a): summary of the sequence characteristics and activity of various candidate dsDNA cytosine deaminase enzymes homologous to ddataox. #1 ranking from high to low according to PSI-BLAST scoring. #2 + the C-terminus of the marker protein contains a sequence similar to the SPKK motif; ++, the C-terminal of the protein contains two similar sequences of the SPKK motif. # 3. It is noted that in vitro deamination activity is comparable to ddatox; * Indicating that in vitro deamination activity is higher than ddatox; weak, deaminated product can be identified, but at a protein concentration of 10. Mu.M the average deamination ratio is <5%. Fig. 2 (b): probability map of the two sides of the mutant cytosine on the genome by overexpressing Ddd _SS, ddd _Fa and DddAtox in UNG knocked-out E.coli.
FIG. 3 shows the results of constructing a mitochondrial base editor using Ddd _SS. Fig. 3 (a): ddBE construction schematic. Fig. 3 (b): HEK293T cells transfected ND 5.1-DdBE, mitochondrial DNA editing ratio after 3 days: A_G1397, dddAtox-G1397; ss_n94: ddd _ss-N94; A_G1333, dddAtox-G1333; ss_n29: ddd _ss-N29, data shown in the figure are mean ± standard deviation, 3 independent replicates.
FIG. 4 shows the editing function of Ddd _SS mutant on pathogenic genes. Fig. 4 (a): related diseases caused by mutations in two mitochondrial genes. Fig. 4 (b): HEK293T cells transfected ND 4-DdBE, mitochondrial DNA editing ratio after 3 days; fig. 4 (c): mitochondrial DNA editing ratio 3 days after HEK293T cells are transfected with ND 6-DdBE; in FIGS. 4 (b) and 4 (c), the gray background indicates the disease-associated site in (a), the gray base is the TALE binding moiety, ddd _SS1: ddd _SS (T26I+T7I+T110I); ddd _SS2: ddd _SS (T26I); ddd _SS3: ddd _SS (T77I); ddd _SS4: ddd _SS (T110I); ddd _SS5: ddd _SS (T7I+T110I); the data shown in the figure are mean.+ -. Standard deviation, 3 independent replicates of the experiment, with the Dead-Ddd _SS5: ddd _SS (E44A+T7I+T110I), ddd _SS and its mutant using the N94 cleavage site, dddA and its mutant using the G1397 cleavage site.
FIG. 5 shows the expansion of editable sites by DddAtox mutants. Fig. 5 (a): ND 5.1-DdBE was transfected with HEK293T cells, mitochondrial DNA editing ratio after 3 days; fig. 5 (b): mitochondrial DNA editing ratio 3 days after transfection of ND 1-DdBE with HEK293T cells; ddd-SS and its mutant used the N94 cleavage site, dddA and its mutant used the G1397 cleavage site, and the data shown in the figures are mean.+ -. Standard deviation, 3 independent replicates.
Detailed Description
The molecular biological methods used in the present application, unless otherwise indicated, are all experimental methods well known to those skilled in the art. The reagents used, unless otherwise indicated, are commercially available and common. Thus, the experiments of the present application can be repeated entirely and the same results can be obtained by those skilled in the art upon reading the following description.
Test method
Strains and culture conditions
All strains were grown at 37℃in Luria-Bertani (LB) medium or agar solid LB medium. Kanamycin (50 mg/L), ampicillin (100 mg/L), L-arabinose (2 g/L) and IPTG (0.5 mM) may be added to the medium as required. Coli dh5α, BL21 (DE 3), BW25113 a ung were used to construct and generate plasmids, protein expression and heterologous expression of candidate deaminase, respectively, to determine substrate preference.
Plasmid construction
To construct deaminase expression plasmids, the Azenta Life Sciences (su state) synthetic deaminase gene and corresponding immunity proteins were cloned into MCS-1 (BamHI and NotI sites, N-terminal hexahistidine tag introduced) and MCS-2 (NdeI and XhoI sites) of pcoladat 1. For expression of deaminase in E.coli BW 25113. DELTA. ung, dddAtox, ddd_SS and Ddd _Fa were cloned downstream of the araBAD promoter in pBAD and the corresponding immunoprotein gene driven by the T7 promoter was cloned downstream of the deaminase gene of the same plasmid. To assemble the DdBE, the TALE array was assembled from the tetrameric template using the gold gate cloning method, and then the TALE array was digested with Ndel and BamH1 enzymes and ligated with the cleavage deaminase, UGI and other DdBE sequences. The DdBE construct in this study contained a mitochondrial targeting sequence from the TXN2 gene, a Flag/HA tag, a TALE array, a split deaminase, and UGI. Except for L-A1397N and L-SS94N in FIGS. 2b and 2c, all DdBE was fused to GFP by self-cleaving T2A sequences. DdBE was then cloned into the piggyBac vector under the control of the CAGGS promoter.
Protein purification in vitro DNA deamination assays
For the expression of deaminase, pCOLADuet-1 containing deaminase and its corresponding immunoprotein gene was transformed into BL21 (DE 3). Individual colonies were selected for characterization and amplification. Bacterial liquid with OD600 of 0.6-0.8 and 0.5mM IPTG are selected to induce protein expression at 18 deg.C overnight. Cells were harvested and resuspended in lysis buffer (50 mM Tris-HCl, pH7.5, 500mM NaCl,5% glycerol, 20mM imidazole, 5mM 2-mercaptoethanol and 1mM PMSF) and the supernatant was separated by sonication by centrifugation in a JA-25.50 centrifuge (Beckman) at 18000r.p.m. for 30 minutes. Deaminase immune protein complex was purified from cell lysates by nickel affinity chromatography using 1mL Ni-sepharose 6 fast flow agarose beads, loaded onto a gravity flow column (GE healthcare). The supernatant was loaded onto the column and the resin was washed with 10ml of wash buffer (50 mM Tris-HCl, pH7.5, 500mM NaCl,20mM imidazole and 5mM 2-mercaptoethanol). Deaminase immune protein complex was eluted with 3mL elution buffer (50 mM Tris-HCl pH7.5, 300mM imidazole, 500mM NaCl and 5mM 2-mercaptoethanol) and then deaminase was separated from the complex by denaturation and renaturation steps. For denaturation, the eluted protein samples were added to 25ml of 6M guanidine hydrochloride denaturation buffer (50 mM Tris-HCl pH7.5, 20mM imidazole, 500mM NaCl and 5mM 2-mercaptoethanol) and incubated at 4℃for 1 hour. The eluted protein containing 6M guanidine hydrochloride buffer was loaded on a gravity flow column with 1mL Ni-Sepharose 6 fast flow agarose beads. The column was washed with 10ml of 6m guanidine hydrochloride buffer to remove any remaining immune proteins. While deaminase is still bound to the Ni agarose beads, it is washed sequentially with 8ml of denaturation buffer containing 10. Mu.M ZnCl2 and decreasing concentrations of guanidine hydrochloride (5M, 4M, 3M, 2M, 1M) for protein renaturation, and finally with washing buffer to remove residual guanidine hydrochloride. The column bound protein was then eluted with 3ml elution buffer. Eluted deaminase was again purified by size exclusion chromatography using a Superdex75 column (GE Healthcare) in sizing buffer (20 mM Tris-HCl pH7.5, 200mM NaCl, 5mM 2-mercaptoethanol, and 5% glycerol). Fractions were assessed for purity by SDS-PAGE staining with Coomassie blue, and then the highest quality fractions were stored at-80 ℃.
DNA deamination assay
The DNA deamination assay was essentially performed as described previously, with some modifications. DNA substrates were purchased from Sangon Biotech (Shanghai, china) containing a 6-FAM fluorophore at its 5' end for visualization. To generate a double-stranded DNA substrate, the unmodified reverse complement oligonucleotide was annealed at equimolar concentrations to the substrate modified with the 6-FAM fluorophore. The reaction was carried out in 10. Mu.L of deamination buffer containing 20mM Tris-HCl pH7.5, 200mM NaCl, 5mM 2-mercaptoethanol and 1. Mu.M substrate. The reaction was incubated at 37℃for 1 hour, then 5. Mu.L of UDG reaction solution (New England Biolabs, 0.02U/. Mu.L of UDG in 1 XUDG buffer) was added, and incubated at 37℃for another 30 minutes. Cleavage of the base site resulting from cleavage of uracil residues in the UDG-mediated substrate was induced by addition of 100mM NaOH and incubation at 95℃for 2 min. The reaction was analyzed by 20% acrylamide 8M urea gel electrophoresis in 1 XTBE buffer and the 6-FAM fluorophore signal was detected by fluorescence imaging by the ChemiDoc MP imaging system (Bio-Rad). Quantification of the percent deamination was performed by ImageJ.
Single Nucleotide Variation (SNV) analysis
To obtain genomic DNA, E.coli BW25113ung strain expressing candidate deaminase was inoculated into 20mL of LB broth at a 1:100 dilution, the culture was grown to about OD600 of 0.6, and then treated with 2g/L L-arabinose for 1 hour to induce deaminase expression. Bacterial genomes were extracted using 3mL bacterial cultures using FastPure blood/cells/tissue/bacterial DNA isolation mini-kit (Vazyme, DC 112). The extraction yield was quantified using Qubit (ThermoFisher Scientific). Sequencing libraries were constructed using the VAHTS Universal Plus DNA library preparation kit for Illumina V2 (Vazyme, ND 627) according to the manufacturer's instructions except that VAHTS HiFi Amplification Mix component was replaced with a KAPA HiFi HotStart Uracil + ReadyMix (KAPA Biosystems, KK 2801) component to achieve efficient amplification of uracil encountered in DNA templates. Library concentration and quality were assessed by Qubit and 1% agarose gel electrophoresis. Sequencing was performed using the Illumina Nova-seq 6000 sequencing system (novogen) and read-out of the reference genome (nc_ 000913.3) was mapped using BWA software (version 0.7.17). The duplicate items were deleted using the Picard tool (version 2.18.29). Pile-up data from the alignment was generated using SAMtools (version 1.14) and variable calls were performed using VarScan (version 2.4.4). The SNV verification threshold is set to: variable frequency >0.01, coverage > 50 reads per base and p-value <0.01. A probability logo that modifies the consensus region on both sides of the base was generated using the Weblogo on-line tool (https:// weblog. Berkeley. Edu).
Cell culture and transfection
HEK293T cells were cultured in high-sugar DMEM medium (Hyclone, D6429) supplemented with 10% fbs (PANSera, 2602-P130707) at 37 ℃ under 5% co 2. To edit the mitochondrial genome, 8×10 was used approximately 20 hours prior to transfection 4 The HEK293T cells were seeded on 24-well plates coated with poly D-lysine 466 (PDL). JetTime transfection reagent (Polymer-transfection,468 PT-114-75), the amount of each DdBE monomer was 250ng, and the total amount of transfected total plasmid DNA was 500ng. Cells were harvested 72h after transfection and genomic DNA was extracted, and then PCR amplified for regions containing the editing site of interest for Sanger sequencing (Azenta Life Sciences) or library preparation for next generation sequencing. EditR was used to evaluate the basal editing efficiency of Sanger sequencing data.
Target amplicon sequencing and analysis
For target amplicon sequencing, the region of interest was first amplified by round 1 PCR using KAPAHiFi HotStart Uracil +readymix (KAPA Biosystems, KK 2801), then PCR product 1 was amplified again in round 2 PCR using N323 VAHTS RNAMultiplex Oligos Set for lolumina (Vazyme, N323) to increase Illumina index. qPCR was used to optimize the PCR cycle number to the top of the linear range to minimize the bias of amplification. For example, the inventors used 100ng whole cell genome as the initial template, 12 cycles for round 1 PCR and 10 cycles for round 2 PCR. The product was further purified by 1× VAHTSDNA Clean Beads (Vazyme, N411) and the library was then sequenced by Illumina Nova seq 6000 sequencing system. Target amplicon sequencing analysis was performed with CRISPResso 2. The output file "nucleic_permanent_table.txt" is used for quantization of the editing frequency.
Statistics of
Data are expressed as mean ± standard deviation, statistical analysis using GraphPad Prism 9, unless otherwise indicated.
Example one modification to ddatox
From the amino acid sequence analysis of ddatox, the inventors noted that its C-terminus comprises two peptide motifs associated with SPKK (fig. 1 (a)), which are known to be more preferred to bind to a/T-rich DNA sequences located in minor double-stranded DNA grooves. Deletion of these two SPKK-related motifs completely abrogated the double-stranded DNA deaminase activity of DddAtox, while addition of AT-hook with similar DNA binding properties to the SPKK-related motif restored the deaminase activity of truncated DddAtox (FIGS. 1 (b) and 1 (c)). These data indicate that the SPKK-related motif at the C-terminus of DddAtox is important for its double-stranded DNA deamination activity.
Next, the inventors searched for homologs of ddatox using PSI-BLAST in MPI Bioinformatics Toolkit. The inventors run a non-redundant (NR) protein database (2021, nr50_1_nov), iterating until no new sequences appear, and finally determining 555 homologous candidate sequences. Then, the present inventors selected 8 candidate proteins from among 4 proteins having the SPKK-related motif, 4 proteins having no SPKK-related motif, and tested their deaminase activity on a double-stranded DNA substrate (FIG. 2 (a)). As can be seen from FIG. 2 (a), all 4 proteins containing the SPKK-related motif show deamination activity comparable to or higher than DddAtox. In contrast, all 4 proteins without the SPKK-associated motif showed little or no deamination activity. These results indicate that the SPKK-associated motif can be used to recognize highly active double-stranded DNA deaminase.
The four protein sequences with the SPKK-associated motifs are as follows:
Ddd_SS, from Acidovorax facilis (Simiaoa Sunii):
MSLPEYDGTTTHGVLVLDDGTQIGFTSGNGDPRYTNYRNNGHVEQKSALYMRENNISNATVYHNNTNGTCGYCNTMTATFLPEGATLTVVPPENAVANNSRAIDYVKTYTGTSNDPKISPRYKGN(SEQ ID NO:1)
Ddd_Ru from Ruminococcus MSJ-25 (Ruminococcus sp. MSJ-25):
VLPKYDGKTTEGVMVTPDGKQISFKSGNSSTPSYPQYKAQSASHVEGKAALYMRENGINEATVFHNNPNGTCGFCDRQVPALLPKGAKLTVVPPSNSVANNVRAIPVPKTYIGNSTVPKIK(SEQ ID NO:2)
Ddd_Fa, derived from Fusarium species MSJ-15 (Falcatimonas sp. MSJ-15):
SINLPEYDGKTTHGVLVLDDGTQVPFSSGNANPNYKNYIPASHVEGKSAIYMRENGINNGTVFHNNTDGTCPYCDKMLPTLLEEGSTLTVVPPANANAPKPSWVDTVKTYIGNDKIPKKPK(SEQ ID NO:3)
ddd_ca, derived from cartilage needle mould (Chondromyces Apiculatus):
MGNTLPGWDGGKTQGWFVYPDGTERHLISGYDGPSKFTQGIPGMNGNIKSHVEAHAAALMRQYELSKATLYINRVPCPGVRGCDALLARMLPEGVQLEIIGPNGFKKTYTGLPDPKLKPKGCS(SEQ ID NO:4)
in the deamination assay, the inventors noted multiple bands formed by double stranded DNA deaminase (Ddd _ss) from siella far away (Simiaoa Sunii), indicating that it has extensive deaminase activity in non-TC environments. In fact, whole genome sequencing of UNG-deficient e.coli strain expressing Ddd _ss showed that it prefers deamination activity at AC/GC/TC sequence sites, whereas the other double stranded DNA deaminase Ddd _fa had deamination activity at AC/CC/TC sequence sites, slightly higher preference for TC, whereas the original ddatox had very high preference for TC sequence sites (fig. 2 (b)). Since Ddd _ss possesses deamination at GC sequence sites that ddatox and its derived variants do not, and shows the highest deamination activity among all deaminases tested (including ddatox) (compare product yields of deaminase at 0.5 μm), the inventors focused on Ddd _ss during subsequent studies.
Example two base editing effect of Ddd _ss construct on mitochondrial DNA
In this example, the inventors tested Ddd _ss for gene editing in target mitochondrial DNA by fusing half of Ddd _ss into a TALE array protein comprising a mitochondrial localization sequence (MTS). Alignment of Ddd _ss structure and ddatox crystal structure based on ColabFold prediction, N29 and N94 in Ddd _ss correspond to the optimal cleavage sites G1333 and G1397 reported by ddatox 6. From the initial dcbe study, the inventors devised the following mitochondrial dcbe (dcbe_ss) containing Ddd _ss: it is the product of a pair of mitochondrial TALEs linked to a cleaved Ddd _ss, containing an MTS, TALE array, half Ddd _ss cleaved from the N29 or N94 site, and UGI protein. The C-terminal half and the N-terminal half of Ddd _ss are connected to the right and left sides of TALE, respectively (fig. 3 (a)).
The inventors first tested DdBE_SS targeting MT-ND5, MT-ND5 encoding the NADH dehydrogenase 5 subunit of Complex I. The inventors found that DdBE_SS can achieve C.fwdarw.T editing at the poly-C site and has different sequence preferences at the same level as compared to DdBEs containing DddAtox. Importantly, ddBE_SS (DdBE_SS_N94) with N94 cleavage site was at GCThe editing efficiency for C6 at the site was about 40%, while DdBE_A with the G1397 cleavage site (DdBE_A_G 1397) was only about 8% for C6. In addition, in the case of the optical fiber,DdBE_SS_N 94 at GCThe editing efficiency for C7 at the site was about 33 times higher than that of DdBE_A_G 1397. For this MT-ND5.1 site, the efficiency of editing of the two GC sites by the DdBE_SS_N 29 construct was lower than that of the DdBE_SS_N 94 construct (6.1% for C6 and 5.2% for C7), but this was still significantly higher than that of the DdBE_A_G 1333 construct (both sites less than 0.3%) (FIG. 3 (b)). These results indicate that the inventors have successfully engineered DdBE using Ddd _SS to obtain a modified gene sequence in GCSite specific DddA tox A much more efficient means of mtDNA editing. It should be noted that: since the base editor can edit only one strand of dsDNA, the efficiency of editing measured by the above method is theoretically up to 50%.
To test the versatility of dcbe_ss in mitochondrial DNA, the inventors subsequently constructed dcbe_ss targeted to MT-ATP 6. The inventors observed that the editing efficiency of dcbe_ss_n94 was up to 38% and that of dcbe_ss_n29 was up to 24% (fig. 2c and 3). Inspired by these results, the inventors subsequently tested the editing of 8 additional sites in the 6 mitochondrial genes (table 2), with an editing efficiency of about 7-42% for dcbe_ss_n94 at these sites (fig. 4a-4 c). Taken together, these results indicate that dcbe_ss is a versatile and efficient mitochondrial DNA editing tool.
Example TridCBE_SS edit function on pathogenic mutant Gene
In this example, the inventors introduced two mutations at the GC sites of MT-ND4 and MT-ND6, which are associated with the human diseases Leber Hereditary Optic Neuropathy (LHON) and Leigh syndrome (FIG. 2 d), both of which are devastating genetic diseases, for which no effective treatment is currently available. DdBE_SS_N 94 produced 10% edit efficiency for MT-ND4 (C8) (FIG. 4 (b)), and about 20% edit efficiency for MT-ND6 (C8) (FIG. 4 (C)). To further increase the efficiency of DdBE_SS_N 94, the present inventors have mutated it again to mutate three sites of T26I, T I and T110I (corresponding to S1330I, T1380I and T1413I of DddAtox, respectively). It was exciting that the DdBE_SS_N94 mutant (DdBE_SS5) having the T77I and T110I mutations had significantly improved editing efficiency for the disease-associated target site, about 25% for MT-ND4 (C8) (FIG. 4 (b)),the editing efficiency for MT-ND6 (C8) was about 30% (FIG. 4 (C)). These results indicate that the optimized DdBE obtained by the present inventors from Ddd _SS can be found in GCThe site implements disease-related mtDNA mutations, which have not been previously possible.
The amino acid sequence of Ddd _ss containing each mutation is shown below:
Ddd_SS1:Ddd_SS(T26I+T77I+T110I);
SLPEYDGTTTHGVLVLDDGTQIGFISGNGDPRYTNYRNNGHVEQKSALYMRENNISNATVYHNNTNGTCGYCNTMIATFLPEGATLTVVPPENAVANNSRAIDYVKTYIGTSNDPKISPRYKGN(SEQ ID NO:5)
Ddd_SS2:Ddd_SS(T26I);
SLPEYDGTTTHGVLVLDDGTQIGFISGNGDPRYTNYRNNGHVEQKSALYMRENNISNATVYHNNTNGTCGYCNTMTATFLPEGATLTVVPPENAVANNSRAIDYVKTYTGTSNDPKISPRYKGN(SEQ ID NO:6)
Ddd_SS3:Ddd_SS(T77I);
SLPEYDGTTTHGVLVLDDGTQIGFTSGNGDPRYTNYRNNGHVEQKSALYMRENNISNATVYHNNTNGTCGYCNTMIATFLPEGATLTVVPPENAVANNSRAIDYVKTYTGTSNDPKISPRYKGN(SEQ ID NO:7)
Ddd_SS4:Ddd_SS(T110I);
SLPEYDGTTTHGVLVLDDGTQIGFTSGNGDPRYTNYRNNGHVEQKSALYMRENNISNATVYHNNTNGTCGYCNTMTATFLPEGATLTVVPPENAVANNSRAIDYVKTYIGTSNDPKISPRYKGN(SEQ ID NO:8)
Ddd_SS5:Ddd_SS(T77I+T110I);
SLPEYDGTTTHGVLVLDDGTQIGFTSGNGDPRYTNYRNNGHVEQKSALYMRENNISNATVYHNNTNGTCGYCNTMIATFLPEGATLTVVPPENAVANNSRAIDYVKTYIGTSNDPKISPRYKGN(SEQ ID NO:9)
Dead-Ddd_SS5:Ddd_SS(E44A+T77I+T110I),
SLPEYDGTTTHGVLVLDDGTQIGFTSGNGDPRYTNYRNNGHVAQKSALYMRENNISNATVYHNNTNGTCGYCNTMIATFLPEGATLTVVPPENAVANNSRAIDYVKTYIGTSNDPKISPRYKGN(SEQ ID NO:10)
example four ddatox mutant expansion of editable sites
DdBE_SS has a different sequence preference than DdBE_A. Previous studies on the C.fwdarw.U RNA editing enzyme APOBEC have shown that the sequence preference of the loop sequence for enzyme activity is important. To find out the different sequences that determine the different sequence preferences of Ddd _ss and ddatox, the inventors mutated three loop sequences around the ddatox active site, respectively, based on the Ddd _ss loop sequence. The inventors still used the MT-ND5.1 site as a target, with the dcbe_a mutant with the loop 2 mutation (E1370N) having about 3.2-fold improvement in editing efficiency at C6 (GC site), about 2.0-fold improvement in editing efficiency at C13 (AC site), and about 2.3-fold improvement in editing efficiency at C14 (CC site) (fig. 5 b). In contrast, the loop 3 mutation had little effect on the activity of DdBE_A, and the loop 1 mutation also caused the activity of DdBE_A to be slightly decreased (FIG. 5 (a)). Furthermore, the DdBE_A mutant having the loop 2 mutation (E1370N) also obtained higher editing efficiency for MT-ND1 and MT-ND5.2 sites as compared with the original DdBE_A (FIG. 5 (b)). Taken together, these results demonstrate that by exchanging sequences from different homologues, the sequence compatibility and editing efficiency of DDCBE can be reasonably optimized.
From the above examples, the inventors of the present application found that: the SPKK-related motif at the C-terminus of the protein is important for DNA deamination efficiency of DddAtox. Furthermore, the present inventors identified many ddatox homolog candidates by PSI-BLAST and identified four homologs with double stranded DNA deaminase activity, all of which have the SPKK-related motif. Then, the present inventors constructed a plurality of DDCBEs from Ddd _ss, capable of efficiently editing 14 mitochondrial DNA sites of 10 mitochondrial genes. Importantly, the constructed variants of dcbe_ss successfully achieved c→t efficient editing at GC sites of mitochondrial DNA that could not previously be achieved. Finally, by introducing the Ddd _SS mutant into DddAtox, the inventor successfully constructs a DdBE_A with wider sequence compatibility and higher editing efficiency. In summary, the present application provides previously unrealizable C.fwdarw.T editing at the GC site of mitochondrial DNA and provides the potential for further screening and design of mtDNA base editors with potentially higher efficiency and wider sequence compatibility.
Claims (5)
1. An enzyme having deamination activity to cytosine at the GC sequence site of double stranded DNA within a strand. It is a protein as described in the following (a) or (b):
(a) Consists of SEQ ID NO:1, a protein comprising the amino acid sequence shown in the specification,
(b) Consists of SEQ ID NO:1, and has deamination activity of cytosine at a GC sequence site of the intragranular double-stranded DNA.
2. A base editing system for deamination of cytosine at a GC sequence site of a double stranded DNA within a wire, comprising an N-terminal half and a C-terminal half of the enzyme of claim 1 linked to a pair of transcription activator-like effectors (TALEs), respectively.
3. The base editing system of claim 2, further comprising a uracil DNA glycosylase inhibitor UGI protein.
4. The base editing system according to claim 2, wherein the enzyme according to claim 1 is split in two from its N94 site or N29 site.
5. The base editing system of claim 2, wherein the enzyme of claim 1 further comprises a point mutation of T77I and/or T110I.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310149826.3A CN116790566A (en) | 2023-02-13 | 2023-02-13 | Nuclease with novel base editing function and editing system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310149826.3A CN116790566A (en) | 2023-02-13 | 2023-02-13 | Nuclease with novel base editing function and editing system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116790566A true CN116790566A (en) | 2023-09-22 |
Family
ID=88046813
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310149826.3A Pending CN116790566A (en) | 2023-02-13 | 2023-02-13 | Nuclease with novel base editing function and editing system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116790566A (en) |
-
2023
- 2023-02-13 CN CN202310149826.3A patent/CN116790566A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10392616B2 (en) | CRISPR RNA targeting enzymes and systems and uses thereof | |
US11168324B2 (en) | Crispr DNA targeting enzymes and systems | |
JP7324713B2 (en) | Base editor with improved accuracy and specificity | |
CN102796728B (en) | Methods and compositions for DNA fragmentation and tagging by transposases | |
US20180258418A1 (en) | Method of identifying genome-wide off-target sites of base editors by detecting single strand breaks in genomic dna | |
WO2022121301A1 (en) | Prokaryote-derived argonaute protein and application thereof | |
US11795442B2 (en) | CRISPR DNA targeting enzymes and systems | |
KR20220150329A (en) | Class II, Type V CRISPR System | |
JP2022526695A (en) | Inhibition of unintentional mutations in gene editing | |
KR102210700B1 (en) | Method of identifying base editing using adenosine deaminase | |
Mi et al. | DddA homolog search and engineering expand sequence compatibility of mitochondrial base editing | |
AU2019319230B2 (en) | Novel mutations that enhance the DNA cleavage activity of acidaminococcus sp. Cpf1 | |
US20220389425A1 (en) | Mbp_argonaute proteins from prokaryotes and applications thereof | |
Shi et al. | DNA topology regulates PAM-Cas9 interaction and DNA unwinding to enable near-PAMless cleavage by thermophilic Cas9 | |
Walker et al. | Novel molecular requirements for CRISPR RNA-guided transposition | |
Stoletzki | The surprising negative correlation of gene length and optimal codon use-disentangling translational selection from GC-biased gene conversion in yeast | |
Chen et al. | Cas12n nucleases, early evolutionary intermediates of type V CRISPR, comprise a distinct family of miniature genome editors | |
US20240110178A1 (en) | Application of prokaryotic argonaute protein with only rna target cleavage activity in rna editing | |
Daggett et al. | A general method for scanning unnatural amino acid mutagenesis | |
CA3153005A1 (en) | Novel crispr dna targeting enzymes and systems | |
CN116790566A (en) | Nuclease with novel base editing function and editing system | |
US20210139890A1 (en) | Novel crispr rna targeting enzymes and systems and uses thereof | |
KR102567576B1 (en) | Novel Cas9 protein variants with improved target specificity and use thereof | |
Esquerra‐Ruvira et al. | Identification of the EH CRISPR‐Cas9 system on a metagenome and its application to genome engineering | |
US20230279442A1 (en) | Engineered cas9-nucleases and method of use thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |