WO2021175289A1 - 多重基因组编辑方法和系统 - Google Patents
多重基因组编辑方法和系统 Download PDFInfo
- Publication number
- WO2021175289A1 WO2021175289A1 PCT/CN2021/079087 CN2021079087W WO2021175289A1 WO 2021175289 A1 WO2021175289 A1 WO 2021175289A1 CN 2021079087 W CN2021079087 W CN 2021079087W WO 2021175289 A1 WO2021175289 A1 WO 2021175289A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- plant
- editing
- scrna
- seq
- deaminase
- Prior art date
Links
- 238000010362 genome editing Methods 0.000 title claims abstract description 44
- 238000000034 method Methods 0.000 title claims abstract description 35
- 241000196324 Embryophyta Species 0.000 claims description 107
- 108091033409 CRISPR Proteins 0.000 claims description 63
- 240000007594 Oryza sativa Species 0.000 claims description 59
- 235000007164 Oryza sativa Nutrition 0.000 claims description 55
- 235000009566 rice Nutrition 0.000 claims description 55
- 102000037865 fusion proteins Human genes 0.000 claims description 52
- 108020001507 fusion proteins Proteins 0.000 claims description 52
- 108091008103 RNA aptamers Proteins 0.000 claims description 47
- 108020004414 DNA Proteins 0.000 claims description 40
- 238000010354 CRISPR gene editing Methods 0.000 claims description 35
- 108010008532 Deoxyribonuclease I Proteins 0.000 claims description 34
- 102000007260 Deoxyribonuclease I Human genes 0.000 claims description 34
- 239000002773 nucleotide Substances 0.000 claims description 33
- 125000003729 nucleotide group Chemical group 0.000 claims description 33
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 claims description 32
- 239000013598 vector Substances 0.000 claims description 32
- 108091008324 binding proteins Proteins 0.000 claims description 31
- 230000014509 gene expression Effects 0.000 claims description 29
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 29
- 108010052875 Adenine deaminase Proteins 0.000 claims description 27
- 101100272670 Aromatoleum evansii boxB gene Proteins 0.000 claims description 27
- 230000009466 transformation Effects 0.000 claims description 23
- 230000001419 dependent effect Effects 0.000 claims description 21
- 230000009870 specific binding Effects 0.000 claims description 19
- 241000588724 Escherichia coli Species 0.000 claims description 18
- 230000009615 deamination Effects 0.000 claims description 18
- 238000006481 deamination reaction Methods 0.000 claims description 18
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 18
- 108020005004 Guide RNA Proteins 0.000 claims description 17
- 229940104302 cytosine Drugs 0.000 claims description 16
- 229920001184 polypeptide Polymers 0.000 claims description 16
- 229930024421 Adenine Natural products 0.000 claims description 15
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 claims description 15
- 229960000643 adenine Drugs 0.000 claims description 15
- 108091035705 tRNA adenine Proteins 0.000 claims description 15
- 101710172430 Uracil-DNA glycosylase inhibitor Proteins 0.000 claims description 14
- 230000008685 targeting Effects 0.000 claims description 14
- 102000012758 APOBEC-1 Deaminase Human genes 0.000 claims description 13
- 108010079649 APOBEC-1 Deaminase Proteins 0.000 claims description 13
- 102000000311 Cytosine Deaminase Human genes 0.000 claims description 11
- 108010080611 Cytosine Deaminase Proteins 0.000 claims description 11
- 230000035772 mutation Effects 0.000 claims description 11
- 238000012217 deletion Methods 0.000 claims description 10
- 230000037430 deletion Effects 0.000 claims description 10
- 239000003795 chemical substances by application Substances 0.000 claims description 7
- 230000009261 transgenic effect Effects 0.000 claims description 6
- 244000038559 crop plants Species 0.000 claims description 5
- 102100022433 Single-stranded DNA cytosine deaminase Human genes 0.000 claims description 4
- 101710143275 Single-stranded DNA cytosine deaminase Proteins 0.000 claims description 4
- 244000062793 Sorghum vulgare Species 0.000 claims description 4
- 240000008042 Zea mays Species 0.000 claims description 4
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 4
- 235000021307 Triticum Nutrition 0.000 claims description 3
- 244000098338 Triticum aestivum Species 0.000 claims description 3
- 230000030648 nucleus localization Effects 0.000 claims description 3
- MZZYGYNZAOVRTG-UHFFFAOYSA-N 2-hydroxy-n-(1h-1,2,4-triazol-5-yl)benzamide Chemical compound OC1=CC=CC=C1C(=O)NC1=NC=NN1 MZZYGYNZAOVRTG-UHFFFAOYSA-N 0.000 claims description 2
- 108010004483 APOBEC-3G Deaminase Proteins 0.000 claims description 2
- 108091023037 Aptamer Proteins 0.000 claims description 2
- 229920000742 Cotton Polymers 0.000 claims description 2
- 102100038076 DNA dC->dU-editing enzyme APOBEC-3G Human genes 0.000 claims description 2
- 244000068988 Glycine max Species 0.000 claims description 2
- 235000010469 Glycine max Nutrition 0.000 claims description 2
- 244000020551 Helianthus annuus Species 0.000 claims description 2
- 235000003222 Helianthus annuus Nutrition 0.000 claims description 2
- 101000964378 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3A Proteins 0.000 claims description 2
- 101000658622 Homo sapiens Testis-specific Y-encoded-like protein 2 Proteins 0.000 claims description 2
- 240000005979 Hordeum vulgare Species 0.000 claims description 2
- 235000007340 Hordeum vulgare Nutrition 0.000 claims description 2
- 235000007688 Lycopersicon esculentum Nutrition 0.000 claims description 2
- 240000003183 Manihot esculenta Species 0.000 claims description 2
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 claims description 2
- 240000004658 Medicago sativa Species 0.000 claims description 2
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 claims description 2
- 244000061176 Nicotiana tabacum Species 0.000 claims description 2
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 2
- 240000000111 Saccharum officinarum Species 0.000 claims description 2
- 235000007201 Saccharum officinarum Nutrition 0.000 claims description 2
- 240000003768 Solanum lycopersicum Species 0.000 claims description 2
- 244000061456 Solanum tuberosum Species 0.000 claims description 2
- 235000002595 Solanum tuberosum Nutrition 0.000 claims description 2
- 235000011684 Sorghum saccharatum Nutrition 0.000 claims description 2
- 102100034917 Testis-specific Y-encoded-like protein 2 Human genes 0.000 claims description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 2
- 235000005822 corn Nutrition 0.000 claims description 2
- 102000048646 human APOBEC3A Human genes 0.000 claims description 2
- 235000019713 millet Nutrition 0.000 claims description 2
- 102000023732 binding proteins Human genes 0.000 claims 10
- 125000003275 alpha amino acid group Chemical group 0.000 claims 8
- 244000299507 Gossypium hirsutum Species 0.000 claims 1
- 101710163270 Nuclease Proteins 0.000 abstract description 10
- 108090000623 proteins and genes Proteins 0.000 description 52
- 230000000694 effects Effects 0.000 description 47
- 210000001938 protoplast Anatomy 0.000 description 46
- 150000001413 amino acids Chemical class 0.000 description 36
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 34
- 102000004169 proteins and genes Human genes 0.000 description 30
- 235000018102 proteins Nutrition 0.000 description 29
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 23
- 210000004027 cell Anatomy 0.000 description 22
- 102000014914 Carrier Proteins Human genes 0.000 description 21
- 238000002474 experimental method Methods 0.000 description 20
- 235000001014 amino acid Nutrition 0.000 description 19
- 150000007523 nucleic acids Chemical class 0.000 description 19
- 230000006870 function Effects 0.000 description 18
- 230000001404 mediated effect Effects 0.000 description 18
- 238000010586 diagram Methods 0.000 description 14
- 241000701489 Cauliflower mosaic virus Species 0.000 description 13
- 230000005782 double-strand break Effects 0.000 description 13
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 description 12
- 108091027544 Subgenomic mRNA Proteins 0.000 description 12
- 102000039446 nucleic acids Human genes 0.000 description 12
- 108020004707 nucleic acids Proteins 0.000 description 12
- 230000001105 regulatory effect Effects 0.000 description 12
- 210000001519 tissue Anatomy 0.000 description 11
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 10
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 10
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 10
- 238000006243 chemical reaction Methods 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- 230000007115 recruitment Effects 0.000 description 8
- 238000013518 transcription Methods 0.000 description 8
- 230000035897 transcription Effects 0.000 description 8
- 102000053602 DNA Human genes 0.000 description 7
- 238000010276 construction Methods 0.000 description 7
- 108091093088 Amplicon Proteins 0.000 description 6
- 241001426452 Oryza sativa alphaendornavirus Species 0.000 description 6
- 108020004682 Single-Stranded DNA Proteins 0.000 description 6
- 238000003776 cleavage reaction Methods 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- -1 CRISPRi) Proteins 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 238000011161 development Methods 0.000 description 5
- 230000018109 developmental process Effects 0.000 description 5
- 230000009977 dual effect Effects 0.000 description 5
- 230000010354 integration Effects 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 238000007481 next generation sequencing Methods 0.000 description 5
- 229910052757 nitrogen Inorganic materials 0.000 description 5
- 229940035893 uracil Drugs 0.000 description 5
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 4
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 4
- 229930010555 Inosine Natural products 0.000 description 4
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 4
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 4
- 230000004913 activation Effects 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 210000004899 c-terminal region Anatomy 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 4
- 239000000539 dimer Substances 0.000 description 4
- 238000012239 gene modification Methods 0.000 description 4
- 230000005017 genetic modification Effects 0.000 description 4
- 235000013617 genetically modified food Nutrition 0.000 description 4
- 229960003786 inosine Drugs 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 229930182817 methionine Natural products 0.000 description 4
- 230000033607 mismatch repair Effects 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 108091033319 polynucleotide Proteins 0.000 description 4
- 102000040430 polynucleotide Human genes 0.000 description 4
- 239000002157 polynucleotide Substances 0.000 description 4
- 230000001052 transient effect Effects 0.000 description 4
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 3
- 238000010446 CRISPR interference Methods 0.000 description 3
- 238000010453 CRISPR/Cas method Methods 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- 241000193996 Streptococcus pyogenes Species 0.000 description 3
- 108091028113 Trans-activating crRNA Proteins 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 229960005305 adenosine Drugs 0.000 description 3
- 230000009418 agronomic effect Effects 0.000 description 3
- 230000001488 breeding effect Effects 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 210000000805 cytoplasm Anatomy 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 235000013399 edible fruits Nutrition 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 239000000833 heterodimer Substances 0.000 description 3
- 210000005260 human cell Anatomy 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 235000015097 nutrients Nutrition 0.000 description 3
- 230000008929 regeneration Effects 0.000 description 3
- 238000011069 regeneration method Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- 101100285688 Caenorhabditis elegans hrg-7 gene Proteins 0.000 description 2
- 101000909256 Caldicellulosiruptor bescii (strain ATCC BAA-1888 / DSM 6725 / Z-1320) DNA polymerase I Proteins 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 101710177611 DNA polymerase II large subunit Proteins 0.000 description 2
- 101710184669 DNA polymerase II small subunit Proteins 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- AEMRFAOFKBGASW-UHFFFAOYSA-N Glycolic acid Chemical compound OCC(O)=O AEMRFAOFKBGASW-UHFFFAOYSA-N 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 2
- 108091005461 Nucleic proteins Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 101000902592 Pyrococcus furiosus (strain ATCC 43587 / DSM 3638 / JCM 8422 / Vc1) DNA polymerase Proteins 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 230000033590 base-excision repair Effects 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 238000009395 breeding Methods 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 238000000684 flow cytometry Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 235000009973 maize Nutrition 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000007480 sanger sequencing Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 125000006850 spacer group Chemical group 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N 1-(2-azaniumylacetyl)pyrrolidine-2-carboxylate Chemical compound NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- YKBGVTZYEHREMT-KVQBGUIXSA-N 2'-deoxyguanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](CO)O1 YKBGVTZYEHREMT-KVQBGUIXSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-SHYZEUOFSA-N 2'‐deoxycytidine Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-SHYZEUOFSA-N 0.000 description 1
- JTBBWRKSUYCPFY-UHFFFAOYSA-N 2,3-dihydro-1h-pyrimidin-4-one Chemical compound O=C1NCNC=C1 JTBBWRKSUYCPFY-UHFFFAOYSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-ULQXZJNLSA-N 4-amino-1-[(2r,4s,5r)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-tritiopyrimidin-2-one Chemical compound O=C1N=C(N)C([3H])=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-ULQXZJNLSA-N 0.000 description 1
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 1
- 230000005730 ADP ribosylation Effects 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 102100036664 Adenosine deaminase Human genes 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 239000002028 Biomass Substances 0.000 description 1
- 101100338278 Caenorhabditis elegans his-66 gene Proteins 0.000 description 1
- 108700004991 Cas12a Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- CKTSBUTUHBMZGZ-UHFFFAOYSA-N Deoxycytidine Natural products O=C1N=C(N)C=CN1C1OC(CO)C(O)C1 CKTSBUTUHBMZGZ-UHFFFAOYSA-N 0.000 description 1
- 208000035240 Disease Resistance Diseases 0.000 description 1
- 241000214054 Equine rhinitis A virus Species 0.000 description 1
- 241000710198 Foot-and-mouth disease virus Species 0.000 description 1
- 108010002537 Fruit Proteins Proteins 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 241000219146 Gossypium Species 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 101000971112 Oryza sativa subsp. japonica Betaine aldehyde dehydrogenase 2 Proteins 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 102000014450 RNA Polymerase III Human genes 0.000 description 1
- 108010078067 RNA Polymerase III Proteins 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 241000714474 Rous sarcoma virus Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 241001420369 Thosea Species 0.000 description 1
- 102000006943 Uracil-DNA Glycosidase Human genes 0.000 description 1
- 108010072685 Uracil-DNA Glycosidase Proteins 0.000 description 1
- 102100037111 Uracil-DNA glycosylase Human genes 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- 108700041896 Zea mays Ubi-1 Proteins 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N aspartic acid group Chemical group N[C@@H](CC(=O)O)C(=O)O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000011088 chloroplast localization Effects 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000007711 cytoplasmic localization Effects 0.000 description 1
- 238000002716 delivery method Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- QDOXWKRWXJOMAK-UHFFFAOYSA-N dichromium trioxide Chemical compound O=[Cr]O[Cr]=O QDOXWKRWXJOMAK-UHFFFAOYSA-N 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 102000034287 fluorescent proteins Human genes 0.000 description 1
- 108091006047 fluorescent proteins Proteins 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000006251 gamma-carboxylation Effects 0.000 description 1
- 238000012214 genetic breeding Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 230000025608 mitochondrion localization Effects 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 230000009437 off-target effect Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000001568 sexual effect Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8213—Targeted insertion of genes into the plant genome by homologous recombination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
Definitions
- the invention relates to the field of plant genetic engineering. Specifically, the present invention relates to a multiple genome editing method and system suitable for plants, especially crops. More specifically, the present invention relates to a CRISPR nickase-based system and method capable of simultaneously realizing different types of genome editing.
- CRISPR/Cas Clusters of regularly spaced short palindrome repeats and their related systems (Clustered Regularly Interspaced Short Palindromic Repeats/CRISPR-associated, CRISPR/Cas) have greatly promoted molecular biology. development of. In the Class 2 system, more and more Cas proteins have been discovered and engineered, including Cas9 targeting DNA, Cas12 targeting single-stranded DNA (ssDNA) and RNA, Cas13 targeting RNA, and CAST The system is used to insert DNA. The diversity and simplicity of the CRISPR/Cas system make it a super molecular toolbox. In addition, the Cas protein can also be transformed into a variant lacking nuclease activity.
- the Cas9 (Streptococcus pyogenes Cas9, SpCas9) protein from Streptococcus pyogenes consists of two nuclease domains, RuvC and HNH, which cut the non-targeting strand and the targeting strand, respectively.
- SpCas9 can be engineered into a nickase nCas9 (Nickase Cas9); Or replace Asp10 and His840 with alanine at the same time, so that SpCas9 loses nuclease activity and becomes dCas9 (Deactive Cas9).
- Asp10 and His840 with alanine at the same time, so that SpCas9 loses nuclease activity and becomes dCas9 (Deactive Cas9).
- the development of these variants has promoted the CRISPR/Cas9 system to become a toolbox for genome editing systems ( Figure 1a).
- Cas9 is used to generate double-strand break (DSB) on the genome; paired nCas9 can also be used to generate highly specific DSB on the genome, and nCas9 (D10A) is also used in the single-base editing system CBE (Cytosine Base Editor) and ABE (Adenine Base Editor) development; dCas9 is often used to fuse various effector proteins to achieve CRISPR interference (CRISPR Interference, CRISPRi), CRISPR activation (CRISPRa), gene composition and appearance Genetic modification, etc. However, in most cases, these systems perform only one type of genome editing after one transformation.
- sgRNA RNA aptamer into the sgRNA backbone to form scaffold RNA (scRNA).
- scRNA scaffold RNA
- the dCas9/scRNA complex recruits gene activation or inhibitory factors through the hairpin structure, which can be located in different positions. The point realizes the dual functions of gene transcription activation and inhibition at the same time.
- Another strategy uses multiple homologous CRISPR systems to achieve the triple function of gene activation, suppression, and deletion at different target sites at the same time.
- most of these multiple strategies for genome engineering have been developed in bacteria, yeast and human cells. Due to the limitations of delivery methods and PAM, it is still challenging to develop a multiple genome editing system in plants using different homologous CRISPR systems.
- the efficiency of homologous recombination (HR) in plants is still relatively low. It is of great significance for breeders to superimpose multiple important agronomic traits or change the regulatory network of genes at the genetic level. Therefore, there is an urgent need in the art for methods and systems that can achieve multiple genome editing in plants such as crops.
- SWISS Simultaneous and Wide-editing Induced by Single System
- Figure 1a SWISS uses two scRNAs containing different RNA aptamers to recruit the cytosine deaminase or adenine deaminase fused with the corresponding RNA aptamer binding protein. After a single transformation, they can be used at different target sites. Two types of editing are realized: CBE and ABE. Introducing the paired sgRNA into the SWISS system can generate DSB at the third target site, making SWISS a CRISPR system with triple editing function ( Figure 1a).
- Figure 1 Optimization of the construction of plant cytosine base editors recruited by RNA aptamers.
- (a) A multiple genome editing system based on nCas9 nuclease-based CRISPR scaffold RNA programming.
- (b) The structure of pOsU3-esgRNA-2 ⁇ MS2 construct, with two MS2 hairpin structures at the 3'end of esgRNA.
- (c) The architecture of PBEc1 to PBEc5. Abbreviations: XTEN, 16-aa linker; NLS, nuclear localization signal; CaMV, cauliflower mosaic virus; Term, terminator.
- FIG. 1 A variety of scaffold RNA and binding protein orthologs can effectively mediate the conversion of C to T.
- (c) Compare the C>T conversion of the BFP to GFP reporter system induced by various scRNAs and their homologous PBEc in rice protoplasts (n 3). Values and error bars represent the average ⁇ standard error of three independent experiments.
- FIG. 3 Optimizing plant adenine base editor constructs using multiple scaffold RNA and binding protein homologues.
- ecTadA7.10 evolved E. coli TadA
- aa amino acid
- XTEN 16aa linker
- NLS nuclear localization signal
- CaMV cauliflower mosaic virus
- Term terminator.
- c The structure of PABEc5 to PABEc7.
- FIG. 4 Simultaneous multiple genome editing of CRISPR scaffold RNA programming based on nCas9 (D10A) platform in rice protoplasts.
- the picture on the left is a schematic diagram of the SWISSv1.1 strategy.
- the CBE target with esgRNA-2 ⁇ MS2 and the paired sgRNA for DSB were assembled in the same vector.
- a sample of untreated protoplasts was used as a control. Values and error bars represent the average ⁇ standard error of three independent experiments.
- PABEc6 uses esgRNA-2 ⁇ boxB and paired sgRNA to simultaneously induce ABE and DSB.
- the picture on the left is a schematic diagram of the SWISSv1.2 strategy.
- the ABE target with esgRNA-2xboxB and the paired sgRNA for DSB were assembled in the same vector.
- a sample of untreated protoplasts was used as a control. Values and error bars represent the average ⁇ standard error of three independent experiments.
- the picture on the left is a schematic diagram of the SWISSv2 strategy.
- a CBE target with esgRNA-2 ⁇ MS2 and an ABE target with esgRNA-2 ⁇ boxB were assembled in the same vector.
- a sample of untreated protoplasts was used as a control. Values and error bars represent the average ⁇ standard error of three independent experiments.
- (d) Simultaneous CBE, ABE and DSB induced by MGE with esgRNA-2 ⁇ MS2, esgRNA-2 ⁇ boxB and paired sgRNA.
- a CBE target with esgRNA-2 ⁇ MS2, an ABE target with esgRNA-2 ⁇ boxB and paired sgRNA for DSB were assembled in the same vector.
- a sample of untreated protoplasts was used as a control. Values and error bars represent the average ⁇ standard error of three independent experiments.
- a sample of untreated protoplasts was used as a control. Values and error bars represent the average ⁇ standard error of three independent experiments.
- Figure 6 Base editing efficiency of different scRNAs and homologous PBEc to endogenous genes in rice protoplasts.
- the data is displayed in the form of a box plot (center line, median, frame limit, 25th and 75th percentile of the data; upper and lower whiskers extend to the minimum or maximum value respectively).
- Figure 7 Product purity and indel frequency of esgRNA-2 ⁇ MS2, esgRNA-3 ⁇ MS2, sgRNA4.0, esgRNA-2 ⁇ com and related PBEc in rice protoplasts.
- (a) shows the product distribution between edited DNA sequencing reads of esgRNA-2 ⁇ MS2, esgRNA-3 ⁇ MS2, sgRNA4.0 and esgRNA-2 ⁇ com with homologous PBEc in rice protoplasts. Values and error bars represent the average ⁇ standard error of three independent experiments.
- FIG. 8 C to T editing frequency of APOBEC1 narrow window variant recruited by scaffold RNA in rice protoplasts.
- YE1-PBE, YE2-PBE, EE-PBE and YEE-PBE Abbreviations: XTEN, 16-aa linker; NLS, nuclear localization signal; CaMV, cauliflower mosaic virus; Term, terminator.
- FIG. 9 Activity of PABEc8 to PABEc10 with adenosine deaminase N-terminal binding protein.
- (a) The structure of PABEc8 to PABEc10. Abbreviations: ecTadA7.10, evolved E. coli TadA; aa, amino acid; XTEN, 16aa linker; NLS, nuclear localization signal; CaMV, cauliflower mosaic virus; Term, terminator.
- Figure 10 Activity, product purity and indel frequency of selected scaffold RNA and homologous PABEc in rice protoplasts.
- the OsALS-T1, OsCDC48, OsDEP1-T1, OsNRT1.1B, OsEV and OsOD targets were tested. One of three independent biological replicates is shown.
- (b) shows the DNA sequencing reads edited by esgRNA-2 ⁇ MS2, esgRNA-MS2+f6, esgRNA-1 ⁇ PP7-1, esgRNA-2 ⁇ boxB and esgRNA-2 ⁇ com and associated PABEc in rice protoplasts Product distribution. Values and error bars represent the average ⁇ standard error of three independent experiments.
- FIG. 11 Schematic diagram of multiple sgRNA assembly of SWISSv1.1 and SWISSv1.2.
- (a) Schematic diagram of paired sgRNA assembly. Paired sgRNAs (paired sgL and sgR) are designed in the outward direction of PAM, and the distance between the cut sites is 40-68 bp. Amplify the PCR product from the esgRNA-pTaU6 template, and then insert it into the BsaI site of pOsU3-esgRNA through Golden Gate Assembly.
- the ABE target was inserted into the BsaI site of pOsU3-esgRNA-2 ⁇ boxB, and then the pOsU3-ABE target-esgRNA-2 ⁇ boxB part was amplified.
- a PCR product with paired sgRNA was amplified from the paired sgRNAs plasmid. The above two PCR products were assembled into the EcoRI and HindIII digested backbone of pOsU3-esgRNA by multiple one-step cloning.
- Figure 12 shows the distribution of deletion products between the SWISSv1.1, SWISSv1.2 and SWISSv3 indel sequencing reads. Values and error bars represent the average ⁇ standard error of three independent experiments.
- FIG. 13 Schematic diagram of various sgRNA assembly of SWISSv2 and SWISSv3.
- FIG. 1 Schematic diagram of CBE target, ABE target and paired sgRNA assembly.
- a PCR product with CBE target and ABE target was amplified from the dual sgRNA plasmid of CBE target and ABE target.
- a PCR product with paired sgRNA was amplified from the paired sgRNA plasmid. The above two PCR products were assembled into the EcoRI and HindIII digested backbone of pOsU3-esgRNA by multiple one-step cloning.
- FIG. 14 Simultaneous CBE, ABE and DSB in rice.
- a PCR product with CBE target and ABE target was amplified from the dual sgRNA plasmid of CBE target and ABE target.
- a PCR product with paired sgRNA was amplified from the paired sgRNA plasmid.
- the above two PCR products were assembled into the pH-MGE binary vector digested by HindIII by multiple one-step cloning.
- the PAM sequence is shown in brown. Ten T0 seedlings (T0-1 to T0-10) were analyzed. WT/D and WT/U indicate genomic DNA amplicons with or without wild-type (WT) control digested by T7E1. A total of 55 mutants were identified. The band marked by the red arrow can be diagnosed as positive for genome editing. The sequence was confirmed by Sanger sequencing. Further analyze the sequencing spectra of indels through online tools DSDecodeM and TIDE.
- FIG. 15 Analysis of undesired edit misses caused by SWISSv2 and SWISSv3.
- T2A-mediated "self-cleavage" is achieved by the ribosome skipping the formation of a glycyl-prolyl peptide bond at the C-terminus.
- the position of T2A will affect the expression level of the polycistronic construct.
- Successful skipping can produce three independent proteins as designed. However, skipping failures may also occur, resulting in fusion protein products.
- both esgRNA-2 ⁇ MS2 and esgRNA-2 ⁇ boxB can recruit the fusion protein, which leads to undesirable performance on the ABE target.
- Cytosine editing and undesirable adenine editing on the CBE target (b) The efficiency of undesirable editing off-target caused by SWISSv2 and SWISSv3.
- the cytosine of the ABE target and the adenine of the CBE target in SWISSv2 and SWISSv3 were analyzed. Values and error bars represent the average ⁇ standard error of three independent experiments.
- Figure 16 Shows a variety of different scRNA/sgRNA structures designed.
- Figure 17 shows a variety of different scRNA/sgRNA structures designed.
- the term “and/or” encompasses all combinations of items connected by the term, and should be treated as if each combination has been individually listed herein.
- “A and/or B” encompasses “A”, “A and B”, and “B”.
- “A, B, and/or C” encompasses "A”, “B”, “C”, “A and B”, “A and C”, “B and C”, and "A and B and C”.
- the protein or nucleic acid may be composed of the sequence, or may have additional amino acids or nuclei at one or both ends of the protein or nucleic acid. Glycolic acid, but still has the activity described in the present invention.
- methionine encoded by the start codon at the N-terminus of the polypeptide will be retained under certain actual conditions (for example, when expressed in a specific expression system), but does not substantially affect the function of the polypeptide.
- Gene as used herein not only covers chromosomal DNA present in the nucleus, but also includes organelle DNA present in subcellular components of the cell (such as mitochondria, plastids).
- Genetically modified plant means a plant that contains an exogenous polynucleotide or a modified gene or expression control sequence in its genome.
- exogenous polynucleotides can be stably integrated into the genome of plants and inherited for successive generations.
- the exogenous polynucleotide can be integrated into the genome alone or as part of a recombinant DNA construct.
- the modified gene or expression control sequence includes one or more deoxynucleotide substitutions, deletions and additions in the plant genome.
- Form in terms of sequence means a sequence from a foreign species, or if from the same species, a sequence that has undergone significant changes in composition and/or locus from its natural form through deliberate human intervention.
- nucleic acid sequence is used interchangeably and are single-stranded or double-stranded RNA or DNA polymers, optionally containing synthetic, non-natural Or changed nucleotide bases.
- Nucleotides are referred to by their single letter names as follows: “A” is adenosine or deoxyadenosine (respectively RNA or DNA), “C” is cytidine or deoxycytidine, and “G” is guanosine or Deoxyguanosine, “U” means uridine, “T” means deoxythymidine, “R” means purine (A or G), “Y” means pyrimidine (C or T), “K” means G or T, “ H” means A or C or T, “D” means A, T or G, “I” means inosine, and “N” means any nucleotide.
- Polypeptide “peptide”, and “protein” are used interchangeably in the present invention and refer to a polymer of amino acid residues.
- the term applies to amino acid polymers in which one or more amino acid residues are artificial chemical analogs of the corresponding naturally-occurring amino acids, as well as to naturally-occurring amino acid polymers.
- the terms "polypeptide”, “peptide”, “amino acid sequence” and “protein” may also include modified forms, including but not limited to glycosylation, lipid linkage, sulfation, gamma carboxylation of glutamic acid residues, hydroxyl And ADP-ribosylation.
- expression construct refers to a vector suitable for expression of a nucleotide sequence of interest in an organism, such as a recombinant vector.
- “Expression” refers to the production of a functional product.
- the expression of a nucleotide sequence may refer to the transcription of the nucleotide sequence (such as transcription to generate mRNA or functional RNA) and/or the translation of RNA into a precursor or mature protein.
- the "expression construct" of the present invention can be a linear nucleic acid fragment, a circular plasmid, a viral vector, or, in some embodiments, can be RNA (such as mRNA) that can be translated, for example, RNA generated by in vitro transcription.
- RNA such as mRNA
- the "expression construct" of the present invention may contain regulatory sequences and nucleotide sequences of interest from different sources, or regulatory sequences and nucleotide sequences of interest from the same source but arranged in a manner different from those normally occurring in nature.
- regulatory sequence and “regulatory element” are used interchangeably and refer to the upstream (5' non-coding sequence), middle or downstream (3' non-coding sequence) of the coding sequence, and affect the transcription, RNA processing, or processing of the related coding sequence. Stability or translated nucleotide sequence. Regulatory sequences may include, but are not limited to, promoters, translation leader sequences, introns, and polyadenylation recognition sequences.
- Promoter refers to a nucleic acid fragment capable of controlling the transcription of another nucleic acid fragment.
- the promoter is a promoter capable of controlling gene transcription in a cell, regardless of whether it is derived from the cell.
- the promoter can be a constitutive promoter or a tissue-specific promoter or a developmentally regulated promoter or an inducible promoter.
- tissue-specific promoter and “tissue-preferred promoter” are used interchangeably, and refer to mainly but not necessarily exclusively expressed in a tissue or organ, and can also be expressed in a specific cell or cell type The promoter.
- tissue-preferred promoter refers to a promoter whose activity is determined by developmental events.
- inducible promoters selectively express operably linked DNA sequences in response to endogenous or exogenous stimuli (environment, hormones, chemical signals, etc.).
- promoters include, but are not limited to, polymerase (pol) I, pol II, or pol III promoters.
- pol I promoter examples include chicken RNA pol I promoter.
- pol II promoters include, but are not limited to, cytomegalovirus immediate early (CMV) promoter, Rous sarcoma virus long terminal repeat (RSV-LTR) promoter, and simian virus 40 (SV40) immediate early promoter.
- pol III promoters include U6 and H1 promoters.
- An inducible promoter such as a metallothionein promoter can be used.
- promoters include T7 phage promoter, T3 phage promoter, ⁇ -galactosidase promoter, and Sp6 phage promoter.
- the promoter can be cauliflower mosaic virus 35S promoter, maize Ubi-1 promoter, wheat U6 promoter, rice U3 promoter, maize U3 promoter, rice actin promoter.
- operably linked refers to the connection of regulatory elements (for example, but not limited to, promoter sequences, transcription termination sequences, etc.) to nucleic acid sequences (for example, coding sequences or open reading frames) such that the nucleotides The transcription of the sequence is controlled and regulated by the transcription control element.
- regulatory elements for example, but not limited to, promoter sequences, transcription termination sequences, etc.
- nucleic acid sequences for example, coding sequences or open reading frames
- "Introducing" a nucleic acid molecule (e.g. plasmid, linear nucleic acid fragment, RNA, etc.) or protein into an organism refers to transforming the cell of the organism with the nucleic acid or protein so that the nucleic acid or protein can function in the cell.
- the "transformation” used in the present invention includes stable transformation and transient transformation.
- “Stable transformation” refers to the introduction of an exogenous nucleotide sequence into the genome, resulting in the stable inheritance of the exogenous gene. Once stably transformed, the exogenous nucleic acid sequence is stably integrated into the genome of the organism and any successive generations thereof.
- Transient transformation refers to the introduction of nucleic acid molecules or proteins into cells to perform functions without stable inheritance of foreign genes. In transient transformation, the foreign nucleic acid sequence is not integrated into the genome.
- Proteins refer to the physiological, morphological, biochemical or physical characteristics of cells or organisms.
- “Agronomic traits” especially refer to the measurable index parameters of crop plants, including but not limited to: leaf green, grain yield, growth rate, total biomass or accumulation rate, fresh weight at maturity, dry weight at maturity, fruit Yield, seed yield, plant total nitrogen content, fruit nitrogen content, seed nitrogen content, plant nutrient tissue nitrogen content, plant total free amino acid content, fruit free amino acid content, seed free amino acid content, plant nutrient tissue free amino acid content, plant total protein Content, fruit protein content, seed protein content, plant nutrient tissue protein content, herbicide resistance, drought resistance, nitrogen absorption, root lodging, harvest index, stem lodging, plant height, ear height, ear length, disease resistance Resistance, cold resistance, salt resistance and tiller number.
- the present invention provides a genome editing system for multiple editing in plants, especially crops, which comprises:
- gene editing system refers to a combination of components required for editing the genome of a cell or organism.
- the various components of the system such as CRISPR nickase, first scRNA, first fusion protein, second scRNA, second fusion protein, paired gRNA, and their expression vectors can exist independently of each other, or can be any
- the combination is in the form of a composition.
- CRISPR nickase refers to the nickase form of CRISPR nuclease, which forms a nick in the double-stranded nucleic acid molecule, but does not completely cut the double-stranded nucleic acid, and still retains the sequence-specific DNA binding ability guided by gRNA .
- the CRISPR nickase is a Ca9 nickase, such as a Cas9 nickase derived from S. pyogenes Cas9 (SpCas9).
- the Cas9 nickase comprises the amino acid sequence shown in SEQ ID NO: 25 (nCas9(D10A)).
- the Cas9 nickase is a Cas9 variant nickase that recognizes the PAM sequence 5'-NG-3', and includes the amino acid sequence shown in SEQ ID NO: 48 (nCas9-NG(D10A)).
- guide RNA and “gRNA” are used interchangeably, and refer to the ability to form a complex with CRISPR nuclease or its derivative protein such as CRISPR nickase, and because it has a certain identity with the target sequence, the The complex targets the RNA molecule of the target sequence.
- the gRNA targets the target sequence by base pairing with the complementary strand of the target sequence.
- the gRNA used by Cas9 nuclease or its derivative protein such as Cas9 nickase is usually composed of crRNA and tracrRNA molecules that are partially complementary to form a complex, wherein the crRNA contains sufficient identity with the target sequence to hybridize with the complementary strand of the target sequence.
- sgRNA single guide RNA
- the sgRNA comprises the nucleotide sequence shown in SEQ ID NO: 3 or SEQ ID NO: 4.
- RNA aptamer refers to an RNA molecule that can specifically bind to a specific protein.
- RNA aptamers suitable for the present invention include but are not limited to MS2, PP7, boxB and com, and the corresponding RNA aptamer-specific binding proteins are MCP (SEQ ID NO: 34), PCP (SEQ ID NO: 35), N22p (SEQ ID NO: 36) and COM (SEQ ID NO: 37).
- scRNA or the interchangeably used terms "scaffold RNA” and “Scaffold RNA” refer to RNA molecules formed by incorporation of RNA aptamers on gRNA of the CRISPR system, such as sgRNA, which retain the function of gRNA, And can recruit the specific binding protein of the RNA aptamer or the fusion protein containing the binding protein.
- the scRNA comprises two or more RNA aptamers. In some embodiments, the scRNA comprises the nucleotide sequence set forth in one of SEQ ID NO: 5-24.
- the first scRNA comprises the nucleotide sequence shown in SEQ ID NO: 13 or 15.
- the first RNA aptamer specific binding protein comprises the amino acid sequence shown in SEQ ID NO:34.
- the first scRNA comprises the nucleotide sequence shown in SEQ ID NO:24.
- the first RNA aptamer specific binding protein comprises the amino acid sequence shown in SEQ ID NO: 37.
- the second scRNA comprises the nucleotide sequence shown in SEQ ID NO:22.
- the second RNA aptamer specific binding protein comprises the amino acid sequence shown in SEQ ID NO: 36.
- cytosine deamination domain refers to a domain that can accept single-stranded DNA as a substrate and catalyze the deamination of cytidine or deoxycytidine to uracil or deoxyuracil, respectively.
- the cytosine deaminase domain comprises at least one (e.g., one or two) cytosine deaminase polypeptides.
- the cytidine deaminization domain in the first fusion protein can convert the cytidine C deamination of the single-stranded DNA produced during the formation of the CRIPR nickase-first scRNA-first fusion protein-DNA complex into Uracil U, then through base mismatch repair to achieve C to T base replacement.
- cytosine deaminase examples include, but are not limited to, for example, APOBEC1 deaminase, activation-induced cytidine deaminase (AID), APOBEC3G, CDA1, human APOBEC3A deaminase, or their functional modifications body.
- the cytosine deaminase is APOBEC1 deaminase or a functional variant thereof.
- the cytosine deaminase comprises the amino acid sequence of one of SEQ ID NOs: 26-30.
- the first RNA aptamer-specific binding protein in the first fusion protein, is located at the N-terminus of the cytosine deamination domain. In some embodiments, in the first fusion protein, the first RNA aptamer-specific binding protein and the cytosine deamination domain are fused through a linker.
- the first fusion protein further comprises Uracil DNA Glycosylase Inhibitor (UGI).
- Uracil DNA Glycosylase Inhibitor In the cell, uracil DNA glycosylase catalyzes the removal of U from DNA and initiates base excision repair (BER), resulting in the repair of U:G to C:G. Therefore, without being limited by any theory, the inclusion of Uracil DNA Glycosylase Inhibitor (UGI) in the first fusion protein of the present invention will be able to increase the efficiency of C to T base editing.
- the UGI comprises the amino acid sequence shown in SEQ ID NO:31.
- adenine deamination domain refers to a domain that can accept single-stranded DNA as a substrate and catalyze the formation of inosine (I) from adenosine or deoxyadenosine (A).
- the adenine deaminase domain comprises at least one (eg, one) DNA-dependent adenine deaminase polypeptide.
- the adenine deamination domain in the fusion protein can convert the adenosine deamination of single-stranded DNA produced in the formation of the CRISPR nickase-second scRNA-second fusion protein-DNA complex into inosine ( I), because DNA polymerase treats inosine (I) as guanine (G), the substitution of A to G can be achieved through base mismatch repair.
- the DNA-dependent adenine deaminase is a variant of E. coli tRNA adenine deaminase TadA (ecTadA).
- ecTadA E. coli tRNA adenine deaminase TadA
- An exemplary wild-type ecTadA amino acid sequence is shown in SEQ ID NO: 32.
- the DNA-dependent adenine deaminase comprises an amino acid sequence as shown in SEQ ID NO: 33.
- E. coli tRNA adenine deaminase usually functions as a dimer, it is expected that two DNA-dependent adenine deaminase will form a dimer or DNA-dependent adenine deaminase and wild-type adenine The formation of dimers by deaminase can significantly increase the editing activity of fusion proteins A to G.
- the adenine deaminase domain comprises two of the DNA-dependent adenine deaminase.
- the adenine deaminase domain further comprises a corresponding DNA-dependent adenine deaminase (such as a DNA-dependent variant of E. coli tRNA adenine deaminase TadA) fused to the DNA-dependent adenine deaminase Wild-type adenine deaminase (eg E. coli tRNA adenine deaminase TadA).
- the DNA-dependent adenine deaminase e.g., a DNA-dependent variant of E.
- coli tRNA adenine deaminase TadA is fused to a corresponding wild-type adenine deaminase (e.g., E. coli The C-terminus of tRNA adenine deaminase (TadA).
- DNA-dependent adenine deaminase e.g., a DNA-dependent variant of E. coli tRNA adenine deaminase TadA
- DNA-dependent adenine deaminase e.g., A DNA-dependent variant of Escherichia coli tRNA adenine deaminase TadA
- wild-type adenine deaminase for example, Escherichia coli tRNA adenine deaminase TadA
- the second RNA aptamer-specific binding protein in the second fusion protein, is located at the C-terminus of the adenine deamination domain. In some embodiments, in the second fusion protein, the second RNA aptamer-specific binding protein and the adenine deamination domain are fused through a linker.
- linkers can be 1-50 pieces in length (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 20-25, 25-50) or more amino acids, non-functional amino acid sequences without secondary or higher structure.
- the joint may be a flexible joint or the like.
- the linker is 16 amino acids in length, for example, the linker comprises the amino acid sequence shown in SEQ ID NO:41.
- the linker is 36 amino acids in length, for example, the linker comprises the amino acid sequence shown in SEQ ID NO: 42 or 43.
- the CRISPR nickase, the first fusion protein and/or the second fusion protein of the present invention may further comprise a nuclear localization sequence (NLS).
- NLS nuclear localization sequence
- one or more of the NLS in the CRISPR nickase, the first fusion protein and/or the second fusion protein should have sufficient strength to drive the protein in the nucleus of the cell to achieve its base The amount of editing functions accumulates.
- the strength of nuclear localization activity is determined by the number and location of NLS in the protein, one or more specific NLS used, or a combination of these factors.
- the NLS of the CRISPR nickase, the first fusion protein and/or the second fusion protein of the present invention may be located at the N-terminal and/or C-terminal or in the middle.
- the CRISPR nickase, first fusion protein and/or second fusion protein comprise about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLS .
- the CRISPR nickase, the first fusion protein and/or the second fusion protein comprise about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 at or near the N-terminus Or more NLS.
- the CRISPR nickase, the first fusion protein, and/or the second fusion protein are contained at or near the C-terminal about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 Or more NLS.
- each one can be selected as not dependent on the other NLS.
- the NLS comprises the amino acid sequence shown in SEQ ID NO: 39 or 40.
- the CRISPR nickase, the first fusion protein and/or the second fusion protein of the present invention may also include other localization sequences, such as cytoplasmic localization sequences, chloroplast localization sequences, mitochondrial localization sequences, etc.
- the CRISPR nickase, the first fusion protein and/or the second fusion protein of the present invention are connected to each other by a "self-cleaving peptide".
- self-cleaving peptide means a peptide that can achieve self-cleavage within a cell.
- the self-cleaving peptide may include a protease recognition site, so that it can be recognized and specifically cleaved by the protease in the cell.
- the self-cleaving peptide may be a 2A polypeptide.
- 2A polypeptide is a type of short peptide derived from viruses, and its self-cleavage occurs during translation. When 2A polypeptide is used to connect two different target polypeptides and expressed in the same reading frame, the two target polypeptides are almost produced at a ratio of 1:1.
- 2A polypeptides can be P2A from porcine techovirus-1, T2A from Thosea asignis virus, and E2A from equine rhinitis A virus. And F2A from foot-and-mouth disease virus.
- T2A porcine techovirus-1
- T2A from Thosea asignis virus
- E2A from equine rhinitis A virus
- F2A from foot-and-mouth disease virus.
- T2A for example, comprising the amino acid sequence shown in SEQ ID NO:38.
- the CRISPR nickase, the first fusion protein and/or the second fusion protein of the present invention can be placed in the same expression vector for expression.
- the first scRNA, the second scRNA, and/or the paired gRNA may be expressed by the same expression construct.
- the genome editing system of the present invention By using the genome editing system of the present invention, different types of genome editing of different target sites can be simultaneously realized through one transformation. For example, if i) and ii-1), ii-2) and ii-3) in the system are introduced into plants together (in the same vector or in separate vectors), the first target position can be achieved by one transformation CT editing in the spot, AG editing in the second target site, and deletion mutations in the third target site.
- the present invention provides a method for producing genetically modified plants, such as crop plants, including introducing the genome editing system of the present invention into plants.
- i) and ii-1) in the system are introduced into plants together, thereby achieving C-T editing in the first target site.
- i) and ii-1) and ii-2) in the system are co-introduced into the plant, thereby achieving mid-CT editing at the first target site, and AG at the second target site edit.
- i) and ii-2) and ii-3) in the system are introduced into plants together to achieve A-G editing in the second target site and deletion mutations in the third target site.
- i) and ii-1) and ii-3) in the system are introduced into plants together to achieve C-T editing in the first target site and deletion mutations in the third target site.
- i) and ii-1), ii-2) and ii-3) in the system are co-introduced into the plant to achieve CT editing in the first target site, and in the second target site AG editing, and deletion mutations at the third target site.
- i), ii-1), ii-2), ii-3) and combinations thereof in the system are introduced into the plant at the same time, for example, into the plant in the same vector, or in one transformation Import plants.
- the method includes:
- the genome editing system can be introduced into the plant by various methods well known to those skilled in the art.
- Methods that can be used to introduce the editing system of the present invention into plants include, but are not limited to: gene bombardment, PEG-mediated transformation of protoplasts, Agrobacterium-mediated transformation, plant virus-mediated transformation, pollen tube passage method, and ovary injection Law.
- the system is introduced into the plant by transient transformation.
- the target site can be modified by introducing or producing the protein and RNA molecules into plant cells, and the modification can be inherited stably without the need to stably transform the editing system into plants. This avoids the potential off-target effects of the stably existing editing system, and also avoids the integration of exogenous nucleotide sequences in the plant genome, thereby having higher biological safety.
- the introduction is performed in the absence of selective pressure, so as to avoid the integration of foreign nucleotide sequences in the plant genome.
- the introduction includes transforming the genome editing system of the present invention into an isolated plant cell or tissue, and then regenerating the transformed plant cell or tissue into a whole plant.
- the regeneration is performed in the absence of selective pressure, that is, no selective agent for the selective gene carried on the expression vector is used during the tissue culture process. Not using selection agents can improve plant regeneration efficiency and obtain modified plants that do not contain exogenous nucleotide sequences.
- the genome editing system of the present invention can be transformed to specific parts on the whole plant, such as leaves, stem tips, pollen tubes, young ears or hypocotyls. This is particularly suitable for the transformation of plants that are difficult to undergo tissue culture regeneration.
- the protein expressed in vitro and/or the RNA molecule transcribed in vitro is directly transformed into the plant.
- the protein and/or RNA molecule can realize gene editing in plant cells and then be degraded by the cell, avoiding the integration of foreign nucleotide sequences in the plant genome.
- genetic modification and breeding of plants using the method of the present invention can obtain plants without foreign DNA integration, that is, transgene-free modified plants.
- Plants that can be gene-edited by the system and method of the present invention include monocotyledonous plants and dicotyledonous plants.
- the plant may be a crop plant such as wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugarcane, tomato, tobacco, cassava, or potato.
- the target site is related to plant traits such as agronomic traits, whereby the editing results in the plant having an altered trait relative to a wild-type plant.
- the target sequence to be modified can be located anywhere in the genome, for example, in a functional gene such as a protein-coding gene, or, for example, can be located in a gene expression regulatory region such as a promoter region or an enhancer region, so as to achieve Modification of gene function or modification of gene expression.
- the method further includes obtaining progeny of the genetically modified plant.
- the present invention also provides a genetically modified plant or its progeny or part thereof, wherein the plant is obtained by the above-mentioned method of the present invention.
- the genetically modified plant or progeny or part thereof is non-transgenic.
- the present invention also provides a plant breeding method, comprising crossing the genetically modified first plant obtained by the above-mentioned method of the present invention with a second plant not containing the genetic modification, thereby combining the The genetic modification is introduced into the second plant.
- the plant cytosine base editing system PBE is mainly composed of the following modules: 1) Cytosine deaminase is used to deaminate cytosine (C) into uracil (Uracil, U); 2) nCas9(D10A) ) For sgRNA programmable DNA base editing and promote endogenous mismatch repair (Mismatch Repair, MMR) pathway; 3) Uracil DNA glycosylase inhibitor (Uracil DNA Glycosylase Inhibitor, UGI) for inhibition in vivo The activity of uracil glycosylase UDG prevents U from becoming an AP site.
- MS2 is a commonly used RNA aptamer. Studies have shown that the scRNA formed by adding two MS2 hairpin structures to the 3'end of esgRNA can efficiently mediate CRISPRa in human cells. Therefore, the scRNA vector pOsU3-esgRNA-2 ⁇ MS2 driven by the OsU3 promoter was first constructed ( Figure 1b).
- T2A "self-cleavage" peptides were used to express multiple protein modules at the same time, where nCas9 (D10A) was fused or not fused with APOBEC1 or UGI as the RNA programmable module, and MS2 binding protein MCP fused APOBEC1 or UGI as recruited modules to construct PBEc1 to PBEc5. All PBEc vectors are codon optimized for use and are driven by the Ubi-1 promoter of maize ( Figure 1c).
- the BFP-to-GFP reporter system was used to evaluate the C>T efficiency of PBEc vectors.
- the GFP fluorescent activity of the reporter system requires that the His66 encoded by CAC on BFP be changed to Tyr66 encoded by TAC. Therefore, a scRNA plasmid esgRNA-2 ⁇ MS2-BFP targeting this site was constructed, with base C located at the 4th position far from the PAM.
- PEG induction method different combinations of PBEc and esgRNA-2 ⁇ MS2-BFP were used to transform rice protoplasts. PBE and sgRNA-BFP were used as the control group, GFP was the positive control, and the untransformed rice protoplasts were the negative control.
- nPBEc4 has high product editing purity, non-target products are not obvious ( ⁇ 0.04%); its Indel efficiency (0.04-0.29%) is consistent with that of untreated rice protoplasts (0.01-0.32%), far Lower than Cas9 (4.82-11.75%) ( Figure 5).
- the nCas9(D10A) complex programmed by scRNA simultaneously recruits APOBEC1 and UGI to enhance the base editing activity of C>T. Therefore, the carrier structure of PBEc4 is used as the object of further research.
- a MS2 hairpin structure was chimerized on sgRNA or four loops and neck loops of esgRNA with an AU or CG flip and extended hairpin structure, respectively, to construct sgRNA2.0 to sgRNA4.0 ( Figure 2b and Figure 16). , 17).
- sgRNA7-1, sgRNA7-2 and sgRNAB.0 containing two variants of PP7 hairpin structure and boxB hairpin structure were also constructed ( Figure 2b and Figures 16, 17).
- scRNAs were constructed using the strategy of connecting 1 or 2 RNA aptamers (including MS2, PP7, boxB, com) hairpin structure at the 3'end ( Figure 2b and Figures 16, 17).
- 1 or 2 RNA aptamers including MS2, PP7, boxB, com
- a combination of the above two strategies was used to construct sgRNA4.0 containing the MS2 hairpin structure ( Figure 2b and Figures 16, 17).
- the C>T editing efficiency of the transformation combination of PBEc4 and esgRNA-2 ⁇ MS2, esgRNA-3 ⁇ MS2 or sgRNA4.0 and the transformation combination of PBEc8 and esgRNA-2 ⁇ com in the reporting system is 7.47% and 8.00%, respectively , 8.83%, and 6.90%, all of which have higher efficiency than the transformation combination of PBE and esgRNA (6.03%) ( Figure 2c).
- scRNA-2 ⁇ MS2, esgRNA-3 ⁇ MS2, sgRNA4.0, esgRNA-2 ⁇ com high-efficiency scRNA-mediated rice endogenous target sites
- five endogenous targets The sites were constructed on four scRNA vectors and co-transformed rice protoplasts with PBEc4 and PBEc8, respectively.
- the sgRNA or esgRNA vector containing the target site was co-transformed with PBE and Cas9, respectively, to transform rice protoplasts as a control group. After culturing at 22°C for 60 hours, the rice protoplasm DNA was extracted and amplicon NGS sequencing was performed.
- esgRNA-2 ⁇ MS2, esgRNA-3 ⁇ MS2, esgRNA-2 ⁇ com-mediated C>T editing efficiency is 2.31 ⁇ 3.75 times higher than that of sgRNA ( Figure 2d, Figure 6), and have the same major The single base window C 3 ⁇ C 9 ( Figure 2d).
- different scRNAs in rice protoplasts also have higher product purity (>99.68%) ( Figure 7a) and lower Indel value ( ⁇ 0.56%), much lower than Cas9 ( ⁇ 21.21%) ( Figure 7b) ).
- RNA aptamers into sgRNA provides an effective solution for the multiple recruitment of nCas9 (D10A) using RNA programming in plants.
- the selected esgRNA-2 ⁇ MS2, esgRNA-3 ⁇ MS2 and esgRNA-2 ⁇ com can be used as candidates for mediating CBE function in the multiple genome editing system.
- PABE-7 mainly consists of the following modules: a heterodimer composed of wild-type adenine deaminase ecTadA and artificially evolved deoxyadenine deaminase ecTadA7.10, and PBE system Consistent nCas9 (D10A), and 3 copies of SV40NLS at the C end of nCas9 (D10A).
- PABEc1 for esgRNA-2 ⁇ MS2 recruitment was first constructed based on PBEc4 ( Figure 3a). Use mGFP reporter system to test PABEc1-mediated A>G base editing activity.
- the ecTadA-ecTadA7.10 heterodimer was fused to the C-terminus of nCas9 (D10A) on PABEc3 to construct PABEc4 ( Figure 3a).
- the A>G base editing efficiency of PABEc4 was 10.60%, which was still lower than the combination of PABE-7 and esgRNA ( Figure 3b). This result suggests that the optimization of PABEc's conformation to improve its A>G base editing activity in plants is limited.
- the PABEc3 conformation is used to further develop multiple systems and try to use other RNA aptamers to enhance the activity of this conformation.
- the C-terminal MCP of PABEc3 was replaced with PCP, N22p, and Com, and PABEc5, PABEc6, and PABEc7 were constructed to identify the RNA hairpin structures of PP7, boxB, and com, respectively ( Figure 3c).
- the A>G activity of the combination of PABEc6 and esgRNA-2 ⁇ boxB in the reporting system was 26.53%, which was slightly higher than the activity of the combination of PABE-7 and esgRNA (25.57%) (Figure 3d).
- RNA aptamers the combinations with the highest reporter system A>G activity are: PABEc3 and esgRNA-2 ⁇ MS2+f6, PABEc5 and esgRNA-1 ⁇ PP7-1, PABEc7 and esgRNA-2 ⁇ com
- PABEc3 and esgRNA-2 ⁇ MS2+f6 PABEc5 and esgRNA-1 ⁇ PP7-1
- the efficiencies were 18.07%, 21.03%, and 22.47%, respectively, which were all lower than the combination of PABE-7 and esgRNA (Figure 3d).
- vectors PABEc8, PABEc9, and PABEc10 based on the PABEc2 conformation were also constructed (Figure 9a), using PCP, N22p or Com binding protein at the N-terminus of adenine deaminase, but all tested combinations are in the mGFP reporter system.
- the efficiency ( ⁇ 21.23%) was lower than the combination of PABE-2 and sgRNA (22.87%) ( Figure 9b).
- PABEc and these scRNAs Similar to the use of scRNAs containing RNA hairpin structures on the four loops and stem loop 2 in PBEc, PABEc and these scRNAs also mediate a lower GFP fluorescence signal (Figure 3d). Therefore, the scRNA with the 3′-end RNA hairpin structure mediates C>T and A>G more efficiently than the scRNA with the four loops and the stem loop 2 RNA hairpin structure.
- the average efficiency in the main editing window A 4 to A 8 is 4.65%, which is comparable to that of PABE.
- the efficiency of the combination of -2 and sgRNA is comparable (average 4.78%) ( Figure 3e, Figure 10a). Since the length of N22p in the tested binding protein is the shortest, 33 amino acids, it is speculated that the activity of the ecTadA-ecTadA7.10 heterodimer is not only affected by the position of the binding protein, but also by the length of the binding protein.
- nCas9 D10A
- nCas9 (D10A) the function of nCas9 (D10A) to mediate multiple genome editing, firstly based on PBEc4 and simultaneously expressed 1 esgRNA-2 ⁇ MS2 and a pair of sgRNA integrated SWISSv1.1, simultaneously produces cytosine at different target sites Base editing and paired nCas9-mediated DSB ( Figure 4a).
- the Ubi-1 promoter and T2A "self-cleaving" peptide were used to simultaneously express nCas9(D10A), MCP-APOBEC1-UGI And ecTadA-ecTadA7.10-N22p, the MGE vector was constructed ( Figure 13a).
- the TaU6 promoter drives esgRNA-2 ⁇ MS2 for C>T editing
- the OsU3 promoter drives esgRNA-2 ⁇ boxB for A>G editing ( Figure 13b), which is combined with MGE to form SWISSv2 ( Figure 4c).
- SWISSv3 provides an alternative solution for gene stacking and heritable modification in plants.
- Paired sgRNAs are designed according to the following rules: (1) with the PAM in the outer direction; (2) the distance between the cut sites is the40-68bp.
- SWISSv3 can also produce Three-site single mutant mutants (Table 4).
- SWISSv3 using scRNA can be used as a triple-function comprehensive programmable genome editing system in plants, which will be beneficial to crop molecular breeding.
- T0 lines (rice) carrying the observed mutations is relative to the total number of T0 transgenic rice lines analyzed.
- b Analyze the genotype of indels through online tools DSDecodeM (reference) and TIDE (reference). .
- T2A is used to express multiple modules at the same time. It is speculated that the "self-cleavage" efficiency of T2A will affect the product purity of CBE or ABE target sites ( Figure 15a). As shown in Figure 15a, T2A-mediated "self-cleavage" is achieved by ribosome skipping the glycine-proline peptide bond formed at the C-terminus of T2A. The position of T2A will affect the level of simultaneous expression of multiple modules.
- MGE successful skipping will produce three independent target proteins, but failure of skipping will produce non-targeted fusion proteins, especially the produced MCP-APOBEC1-UGI-T2A-ecTadA-ecTadA7.10-N22p Fusion proteins can be recruited by esgRNA-2 ⁇ MS2 and esgRNA-2 ⁇ boxB, which will produce non-target C-base editing at the ABE target site or non-target A-base editing at the CBE target site, causing non-target sexual off-target. Analyze the amplicon NGS sequencing results of SWISSv2 and SWISSv3 rice protoplasts, and check the efficiency of ABE target site cytosine editing and CBE target site adenine editing.
- a PAM motif is shown in bold and underlined; b NA is not available.
- the multiple genome editing system using multiple sgRNA strategies can be divided into two aspects: one is to perform the same type of genome editing on different target sites; the other is to perform different targets on different target sites proposed in this study.
- Types of genome editing So far, it is the first time that the SWISS system developed in this research can use a programmable Cas protein to mediate multiple different types of genome editing simultaneously in plants. Although this multiple editing can be achieved using CRISPR/Cas homologous proteins, the carriers of multiple homologous proteins will be larger, which is not conducive to gene gun-mediated genetic transformation, and the requirements for PAM are also stricter.
- the SWISS system uses only one type of nCas9 (D10A), which can alleviate the problems caused by the above two shortcomings, especially the Cas9 variant using NG PAM, which can further expand the editing scope of SWISS ( Figure 4e, Table 2).
- RNA polymerase III promoters OsU3 and TaU6 were used to express multiple sgRNAs.
- Other multiple sgRNA strategies can also be used to further optimize the SWISS system, such as Csy4RNA ribonuclease or ribozymes to produce multiple sgRNAs.
- the average C>T activity of scRNA-recruited construction is higher than that of PBE, and it is not accompanied by a wider base editing window. This strategy can be used to increase the editing activity of narrow editing window cytosine variants.
- the A>G activity of the construction recruited by scRNA is only comparable to PABE-2, its activity is sufficient to mediate SWISSv3 to obtain rice A>G mutants.
- RNA aptamers does not improve the efficiency of PABEc construction. It also means that the space for optimizing PABEc is relatively limited, and it is necessary to develop more efficient adenine deaminase.
- the dual-function SWISSv1.1 and SWISSv1.2 systems can also be implemented by using PBE and PABE combined with multiple sgRNA strategies.
- the RNA aptamer recruitment strategy in this study provides another alternative method, especially nCas9(D10A) overexpressing plants are subjected to multiple genome editing. This strategy is based on advantages. Therefore, in the future, nCas9(D10A) overexpression rice can also be constructed. As a development platform, it only needs to transform multiple sgRNA and base editing recruitment modules. After a second transformation, the multiple editing functions of SWISS can be realized, and it can also reduce non-purpose The off-target phenomenon is conducive to the molecular design and breeding of crops.
- a quadruple-function CRISPR system can be realized.
- the SWISS system can also use random and multiple sgRNA strategies for directed evolution of plant endogenous genes, as well as applications beyond plants, such as changing cell fate or metabolic regulation pathways.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Medicinal Chemistry (AREA)
- Crystallography & Structural Chemistry (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Description
Claims (36)
- 一种用于在植物尤其是作物中进行多重编辑的基因组编辑系统,其包含:i)CRISPR切口酶和/或含有编码所述CRISPR切口酶的核苷酸序列的表达构建体;和ii)选自以下的一项或多项或全部:ii-1)靶向所述植物基因组内的第一靶区域的第一scRNA和/或含有编码所述第一scRNA的核苷酸序列的表达构建体,所述第一scRNA包含至少一个第一RNA适配体(aptamer);以及,第一融合蛋白和/或含有编码所述第一融合蛋白的核苷酸序列的表达构建体,所述第一融合蛋白包含第一RNA适配体特异性结合蛋白和胞嘧啶脱氨结构域;ii-2)靶向所述植物基因组内的第二靶区域的第二scRNA和/或含有编码所述第二scRNA的核苷酸序列的表达构建体,所述第二scRNA包含至少一个第二RNA适配体;以及,第二融合蛋白和/或含有编码所述第二融合蛋白的核苷酸序列的表达构建体,所述第二融合蛋白包含第二RNA适配体特异性结合蛋白和腺嘌呤脱氨结构域;ii-3)靶向所述植物基因组内的第三靶区域的成对gRNA和/或含有编码所述成对gRNA的核苷酸序列的表达构建体,所述成对gRNA分别靶向所述第三靶区域DNA的不同链。
- 权利要求1的系统,其中所述CRISPR切口酶是Ca9切口酶,例如其包含SEQ ID NO:25或48所示的氨基酸序列。
- 权利要求1或2的系统,所述成对gRNA包含SEQ ID NO:3或SEQ ID NO:4所示核苷酸序列。
- 权利要求1-3中任一项的系统,所述RNA适配体选自MS2、PP7、boxB和com。
- 权利要求1-4中任一项的系统,所述RNA适配体特异性结合蛋白选自MCP、PCP、N22p和COM。
- 权利要求1-5中任一项的系统,所述scRNA包含两个或更多个RNA适配体。
- 权利要求1-6中任一项的系统,所述scRNA包含SEQ ID NO:5-24之一所述的核苷酸序列。
- 权利要求1-7中任一项的系统,所述第一scRNA包含SEQ ID NO:13或15所示核苷酸序列。
- 权利要求8的系统,所述第一RNA适配体特异性结合蛋白包含SEQ ID NO:34所示氨基酸序列。
- 权利要求1-7中任一项的系统,所述第一scRNA包含SEQ ID NO:24所示核苷酸序列。
- 权利要求10的系统,所述第一RNA适配体特异性结合蛋白包含SEQ ID NO:37 所示氨基酸序列。
- 权利要求1-11中任一项的系统,所述第二scRNA包含SEQ ID NO:22所示核苷酸序列。
- 权利要求12的系统,所述第二RNA适配体特异性结合蛋白包含SEQ ID NO:36所示氨基酸序列。
- 权利要求1-13中任一项的系统,所述胞嘧啶脱氨酶选自APOBEC1脱氨酶、激活诱导的胞苷脱氨酶(AID)、APOBEC3G、CDA1、人APOBEC3A脱氨酶,或它们的功能性变体。
- 权利要求14的系统,所述胞嘧啶脱氨酶是APOBEC1脱氨酶或其功能性变体。
- 权利要求15的系统,所述胞嘧啶脱氨酶包含SEQ ID NO:26-30之一的氨基酸序列。
- 权利要求1-16中任一项的系统,所述第一RNA适配体特异性结合蛋白位于所述胞嘧啶脱氨结构域的N端。
- 权利要求1-17中任一项的系统,所述第一RNA适配体特异性结合蛋白与所述胞嘧啶脱氨结构域之间通过接头融合。
- 权利要求1-18中任一项的系统,所述第一融合蛋白还包含尿嘧啶DNA糖基化酶抑制剂(UGI),例如,所述UGI包含SEQ ID NO:31所示氨基酸序列。
- 权利要求1-19中任一项的系统,所述腺嘌呤脱氨结构域包含至少一个DNA依赖型腺嘌呤脱氨酶多肽。
- 权利要求20的系统,所述DNA依赖型腺嘌呤脱氨酶是大肠杆菌tRNA腺嘌呤脱氨酶TadA(ecTadA)的变体,例如,所述DNA依赖型腺嘌呤脱氨酶包含如SEQ ID NO:33所示的氨基酸序列。
- 权利要求21的系统,所述腺嘌呤脱氨结构域还包含与所述大肠杆菌tRNA腺嘌呤脱氨酶TadA的DNA依赖型变体融合的对应的野生型大肠杆菌tRNA腺嘌呤脱氨酶TadA,例如,所述野生型大肠杆菌tRNA腺嘌呤脱氨酶TadA包含SEQ ID NO:32所示的氨基酸序列。
- 权利要求22的系统,所述大肠杆菌tRNA腺嘌呤脱氨酶TadA的DNA依赖型变体融合至对应的野生型大肠杆菌tRNA腺嘌呤脱氨酶TadA的C端,优选地,通过接头融合。
- 权利要求1-23中任一项的系统,所述第二RNA适配体特异性结合蛋白位于所述腺嘌呤脱氨结构域的C端。
- 权利要求1-24中任一项的系统,所述第二RNA适配体特异性结合蛋白与所述腺嘌呤脱氨结构域之间通过接头融合。
- 权利要求1-25中任一项的系统,所述CRISPR切口酶、第一融合蛋白和/或第二融合蛋白还包含核定位序列(NLS)。
- 权利要求1-26中任一项的系统,所述CRISPR切口酶、第一融合蛋白和/或第 二融合蛋白通过“自裂解肽”相互连接。
- 一种产生经遗传修饰的植物例如作物植物的方法,包括将权利要求1-27中任一项的基因组编辑系统导入所述植物。
- 权利要求28的方法,其中将所述系统中的i)和ii-1)共同导入植物中,由此实现第一靶位点中C-T编辑。
- 权利要求28的方法,其中将所述系统中的i)和ii-1)以及ii-2共同导入植物中,由此实现第一靶位点的中C-T编辑,第二靶位点中的A-G编辑。
- 权利要求28的方法,其中将所述系统中的i)和ii-2)以及ii-3)共同导入植物中,实现第二靶位点中A-G编辑以及第三靶位点的缺失突变。
- 权利要求28的方法,其中将所述系统中的i)和ii-1)以及ii-3)共同导入植物中,实现第一靶位点中C-T编辑,以及第三靶位点的缺失突变。
- 权利要求28的方法,其中将所述系统中的i)和ii-1)、ii-2)以及ii-3)共同导入植物中,实现第一靶位点中C-T编辑,第二靶位点中A-G编辑,以及第三靶位点的缺失突变。
- 权利要求28-33中的方法,其中所述系统中的i)、ii-1)、ii-2)、ii-3)或其组合是同时导入所述植物,例如在同一载体导入植物,或在一次转化中导入植物。
- 权利要求28的方法所述方法包括:c)将所述系统的i)导入植物,获得稳定表达所述CRSPR切口酶的转基因植物;d)将所述系统的ii-1)、ii-2)、ii-3)或其任意组合导入步骤a)获得的转基因植物。
- 权利要求28-35中任一项的方法,所述植物包括单子叶植物和双子叶植物,例如,所述植物是作物植物,例如小麦、水稻、玉米、大豆、向日葵、高粱、油菜、苜蓿、棉花、大麦、粟、甘蔗、番茄、烟草、木薯或马铃薯。
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/909,309 US20240117368A1 (en) | 2020-03-04 | 2021-03-04 | Multiplex genome editing method and system |
EP21764356.8A EP4116426A4 (en) | 2020-03-04 | 2021-03-04 | MULTIPLEX GENOME EDITING METHOD AND SYSTEM |
BR112022017704A BR112022017704A2 (pt) | 2020-03-04 | 2021-03-04 | Método e sistema de edição de genoma multiplex |
CN202180019211.8A CN115667528B (zh) | 2020-03-04 | 2021-03-04 | 多重基因组编辑方法和系统 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010143643.7 | 2020-03-04 | ||
CN202010143643 | 2020-03-04 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021175289A1 true WO2021175289A1 (zh) | 2021-09-10 |
Family
ID=77612853
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/079087 WO2021175289A1 (zh) | 2020-03-04 | 2021-03-04 | 多重基因组编辑方法和系统 |
Country Status (5)
Country | Link |
---|---|
US (1) | US20240117368A1 (zh) |
EP (1) | EP4116426A4 (zh) |
CN (1) | CN115667528B (zh) |
BR (1) | BR112022017704A2 (zh) |
WO (1) | WO2021175289A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114686456A (zh) * | 2022-05-10 | 2022-07-01 | 中山大学 | 基于双分子脱氨酶互补的碱基编辑系统及其应用 |
WO2023163806A1 (en) * | 2022-02-22 | 2023-08-31 | Massachusetts Institute Of Technology | Engineered nucleases and methods of use thereof |
WO2024193739A1 (en) * | 2023-03-23 | 2024-09-26 | Ostravska Univerzita | Method for producing proteins in host cells |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116751799B (zh) * | 2023-06-14 | 2024-01-26 | 江南大学 | 一种多位点双重碱基编辑器及其应用 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016054106A1 (en) * | 2014-09-29 | 2016-04-07 | The Regents Of The University Of California | SCAFFOLD RNAs |
CN106191099A (zh) * | 2016-07-27 | 2016-12-07 | 苏州泓迅生物科技有限公司 | 一种基于CRISPR‑Cas9系统的酿酒酵母基因组并行多重编辑载体及其应用 |
CN107027313A (zh) * | 2014-10-17 | 2017-08-08 | 宾州研究基金会 | 用于多元rna引导的基因组编辑和其它rna技术的方法和组合物 |
WO2019138052A1 (en) * | 2018-01-11 | 2019-07-18 | Kws Saat Se | Optimized plant crispr/cpf1 systems |
CN110520163A (zh) * | 2017-01-05 | 2019-11-29 | 新泽西鲁特格斯州立大学 | 独立于dna双链断裂的靶向基因编辑平台及其用途 |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10287590B2 (en) * | 2014-02-12 | 2019-05-14 | Dna2.0, Inc. | Methods for generating libraries with co-varying regions of polynuleotides for genome modification |
CN108291218B (zh) * | 2015-07-15 | 2022-08-19 | 新泽西鲁特格斯州立大学 | 核酸酶非依赖性靶向基因编辑平台及其用途 |
SG11201903089RA (en) * | 2016-10-14 | 2019-05-30 | Harvard College | Aav delivery of nucleobase editors |
CN107043779B (zh) * | 2016-12-01 | 2020-05-12 | 中国农业科学院作物科学研究所 | 一种CRISPR/nCas9介导的定点碱基替换在植物中的应用 |
EP3589751A4 (en) * | 2017-03-03 | 2021-11-17 | The Regents of The University of California | RNA TARGETING OF MUTATIONS VIA SUPPRESSOR RNA AND DEAMINASES |
CA3064601A1 (en) * | 2017-06-26 | 2019-01-03 | The Broad Institute, Inc. | Crispr/cas-adenine deaminase based compositions, systems, and methods for targeted nucleic acid editing |
CN110066824B (zh) * | 2018-01-24 | 2021-06-08 | 中国农业科学院植物保护研究所 | 一套用于水稻的碱基编辑人工系统 |
US11739322B2 (en) * | 2018-02-01 | 2023-08-29 | Institute Of Genetics And Developmental Biology, Chinese Academy Of Sciences | Method for genome editing using a self-inactivating CRISPR nuclease |
WO2019153902A1 (zh) * | 2018-02-11 | 2019-08-15 | 中国科学院上海生命科学研究院 | 植物基因组定点替换的方法 |
CN110527695B (zh) * | 2019-03-07 | 2020-06-16 | 山东舜丰生物科技有限公司 | 一种用于基因定点突变的核酸构建物 |
-
2021
- 2021-03-04 BR BR112022017704A patent/BR112022017704A2/pt unknown
- 2021-03-04 WO PCT/CN2021/079087 patent/WO2021175289A1/zh unknown
- 2021-03-04 US US17/909,309 patent/US20240117368A1/en active Pending
- 2021-03-04 EP EP21764356.8A patent/EP4116426A4/en not_active Withdrawn
- 2021-03-04 CN CN202180019211.8A patent/CN115667528B/zh active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016054106A1 (en) * | 2014-09-29 | 2016-04-07 | The Regents Of The University Of California | SCAFFOLD RNAs |
CN107027313A (zh) * | 2014-10-17 | 2017-08-08 | 宾州研究基金会 | 用于多元rna引导的基因组编辑和其它rna技术的方法和组合物 |
CN106191099A (zh) * | 2016-07-27 | 2016-12-07 | 苏州泓迅生物科技有限公司 | 一种基于CRISPR‑Cas9系统的酿酒酵母基因组并行多重编辑载体及其应用 |
CN110520163A (zh) * | 2017-01-05 | 2019-11-29 | 新泽西鲁特格斯州立大学 | 独立于dna双链断裂的靶向基因编辑平台及其用途 |
WO2019138052A1 (en) * | 2018-01-11 | 2019-07-18 | Kws Saat Se | Optimized plant crispr/cpf1 systems |
Non-Patent Citations (5)
Title |
---|
CHAO LI, YUAN ZONG, SHUAI JIN, HAOCHENG ZHU, DEXING LIN, SHENGNAN LI, JIN-LONG QIU, YANPENG WANG, CAIXIA GAO: "SWISS: multiplexed orthogonal genome editing in plants with a Cas9 nickase and engineered CRISPR RNA scaffolds", GENOME BIOLOGY, vol. 21, no. 1, 1 December 2020 (2020-12-01), XP055768063, DOI: 10.1186/s13059-020-02051-x * |
JESSE ZALATAN, MICHAEL LEE, RICARDO ALMEIDA, LUKE GILBERT, EVAN WHITEHEAD, MARIE LARUSSA, JORDAN. TSAI, JONATHAN WEISSMAN, JOHN DU: "Engineering Complex Synthetic Transcriptional Programs with CRISPR RNA Scaffolds", CELL, ELSEVIER, AMSTERDAM NL, vol. 160, no. 1-2, 1 January 2015 (2015-01-01), Amsterdam NL, pages 339 - 350, XP055278878, ISSN: 0092-8674, DOI: 10.1016/j.cell.2014.11.052 * |
SAMBROOK, JFRITSCH, E.F.MANIATIS, T: "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS |
See also references of EP4116426A4 |
WANG, HONGZHEN, YU JIAXIN, LIU QIANG, YU YINGJIE, CHENG JUN: "CRISPR/Cas9 Genome Editing Technique in Maize Breeding", MOLECULAR PLANT BREEDING, vol. 17, no. 20, 1 January 2019 (2019-01-01), pages 6696 - 6704, XP055842318, ISSN: 6696-6704, DOI: 10.13271/j.mpb.017.006696 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023163806A1 (en) * | 2022-02-22 | 2023-08-31 | Massachusetts Institute Of Technology | Engineered nucleases and methods of use thereof |
CN114686456A (zh) * | 2022-05-10 | 2022-07-01 | 中山大学 | 基于双分子脱氨酶互补的碱基编辑系统及其应用 |
CN114686456B (zh) * | 2022-05-10 | 2023-02-17 | 中山大学 | 基于双分子脱氨酶互补的碱基编辑系统及其应用 |
WO2024193739A1 (en) * | 2023-03-23 | 2024-09-26 | Ostravska Univerzita | Method for producing proteins in host cells |
Also Published As
Publication number | Publication date |
---|---|
CN115667528B (zh) | 2024-10-01 |
EP4116426A1 (en) | 2023-01-11 |
US20240117368A1 (en) | 2024-04-11 |
EP4116426A4 (en) | 2024-05-22 |
CN115667528A (zh) | 2023-01-31 |
BR112022017704A2 (pt) | 2022-11-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2021175289A1 (zh) | 多重基因组编辑方法和系统 | |
WO2021032155A1 (zh) | 一种碱基编辑系统和其使用方法 | |
KR20200103769A (ko) | 연장된 단일 가이드 rna 및 그 용도 | |
CN104080462B (zh) | 用于修饰预定的靶核酸序列的组合物和方法 | |
CN109136248B (zh) | 多靶点编辑载体及其构建方法和应用 | |
CN108130342B (zh) | 基于Cpf1的植物基因组定点编辑方法 | |
WO2021185358A1 (zh) | 一种提高植物遗传转化和基因编辑效率的方法 | |
CN107027313A (zh) | 用于多元rna引导的基因组编辑和其它rna技术的方法和组合物 | |
WO2021082830A1 (zh) | 靶向性修饰植物基因组序列的方法 | |
CN111575319B (zh) | 一种高效的crispr rnp和供体dna共位介导的基因插入或替换方法及其应用 | |
WO2018117746A1 (ko) | 동물 배아의 염기 교정용 조성물 및 염기 교정 방법 | |
WO2023169454A1 (zh) | 腺嘌呤脱氨酶及其在碱基编辑中的用途 | |
WO2019205939A1 (zh) | 一种重复片段介导的植物定点重组方法 | |
WO2023092731A1 (zh) | Mad7-nls融合蛋白、用于植物基因组定点编辑的核酸构建物及其应用 | |
WO2020020193A1 (zh) | 基于人apobec3a脱氨酶的碱基编辑器及其用途 | |
WO2021175288A1 (zh) | 改进的胞嘧啶碱基编辑系统 | |
WO2022199665A1 (zh) | 一种提高植物遗传转化和基因编辑效率的方法 | |
KR102679001B1 (ko) | 신규의 개량된 염기 편집 또는 교정용 융합단백질 및 이의 용도 | |
WO2023227050A1 (zh) | 一种在基因组中定点插入外源序列的方法 | |
Curtis et al. | Recombinant DNA, vector design, and construction | |
US20200377909A1 (en) | Directed genome engineering using enhanced targeted editing technologies | |
WO2023232109A1 (zh) | 新的crispr基因编辑系统 | |
US20120070900A1 (en) | T-dna/protein nano-complexes for plant transformation | |
JP2022071820A (ja) | 遺伝子の発現を高める方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21764356 Country of ref document: EP Kind code of ref document: A1 |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112022017704 Country of ref document: BR |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2021764356 Country of ref document: EP Effective date: 20221004 |
|
ENP | Entry into the national phase |
Ref document number: 112022017704 Country of ref document: BR Kind code of ref document: A2 Effective date: 20220902 |