WO2020177751A1 - Construction d'acide nucléique pour édition de gènes - Google Patents
Construction d'acide nucléique pour édition de gènes Download PDFInfo
- Publication number
- WO2020177751A1 WO2020177751A1 PCT/CN2020/078079 CN2020078079W WO2020177751A1 WO 2020177751 A1 WO2020177751 A1 WO 2020177751A1 CN 2020078079 W CN2020078079 W CN 2020078079W WO 2020177751 A1 WO2020177751 A1 WO 2020177751A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- coding sequence
- nucleic acid
- acid construct
- sequence
- gene
- Prior art date
Links
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 61
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 58
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 58
- 238000010362 genome editing Methods 0.000 title claims abstract description 21
- 108091026890 Coding region Proteins 0.000 claims description 68
- 108091033409 CRISPR Proteins 0.000 claims description 46
- 230000035772 mutation Effects 0.000 claims description 45
- 108010052875 Adenine deaminase Proteins 0.000 claims description 44
- 210000004027 cell Anatomy 0.000 claims description 42
- 108090000623 proteins and genes Proteins 0.000 claims description 42
- 108010080611 Cytosine Deaminase Proteins 0.000 claims description 40
- 102000000311 Cytosine Deaminase Human genes 0.000 claims description 40
- 238000000034 method Methods 0.000 claims description 40
- 239000013598 vector Substances 0.000 claims description 32
- 230000000694 effects Effects 0.000 claims description 26
- 101710163270 Nuclease Proteins 0.000 claims description 25
- 102000004169 proteins and genes Human genes 0.000 claims description 23
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 claims description 22
- 235000018102 proteins Nutrition 0.000 claims description 22
- 108020005004 Guide RNA Proteins 0.000 claims description 20
- 239000002773 nucleotide Substances 0.000 claims description 20
- 125000003729 nucleotide group Chemical group 0.000 claims description 20
- 238000003776 cleavage reaction Methods 0.000 claims description 19
- 108020001507 fusion proteins Proteins 0.000 claims description 19
- 102000037865 fusion proteins Human genes 0.000 claims description 19
- 230000007017 scission Effects 0.000 claims description 19
- 108010077850 Nuclear Localization Signals Proteins 0.000 claims description 16
- 108010075254 C-Peptide Proteins 0.000 claims description 14
- 229940035893 uracil Drugs 0.000 claims description 11
- 239000003153 chemical reaction reagent Substances 0.000 claims description 10
- 229940122069 Glycosidase inhibitor Drugs 0.000 claims description 9
- 239000003316 glycosidase inhibitor Substances 0.000 claims description 9
- 230000001419 dependent effect Effects 0.000 claims description 6
- 102000009572 RNA Polymerase II Human genes 0.000 claims description 4
- 108010009460 RNA Polymerase II Proteins 0.000 claims description 4
- 210000000349 chromosome Anatomy 0.000 claims description 3
- 235000004252 protein component Nutrition 0.000 claims description 2
- 239000003795 chemical substances by application Substances 0.000 claims 1
- 150000001875 compounds Chemical class 0.000 claims 1
- 238000002741 site-directed mutagenesis Methods 0.000 abstract description 3
- 241000196324 Embryophyta Species 0.000 description 117
- 240000007594 Oryza sativa Species 0.000 description 33
- 235000007164 Oryza sativa Nutrition 0.000 description 32
- 235000009566 rice Nutrition 0.000 description 32
- 108020004705 Codon Proteins 0.000 description 22
- 150000001413 amino acids Chemical group 0.000 description 21
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 description 17
- 108091033319 polynucleotide Proteins 0.000 description 15
- 102000040430 polynucleotide Human genes 0.000 description 15
- 239000002157 polynucleotide Substances 0.000 description 15
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 14
- 238000006467 substitution reaction Methods 0.000 description 13
- 108020004414 DNA Proteins 0.000 description 12
- 230000009261 transgenic effect Effects 0.000 description 12
- 230000009466 transformation Effects 0.000 description 11
- 230000037429 base substitution Effects 0.000 description 10
- 241000589158 Agrobacterium Species 0.000 description 9
- 230000002068 genetic effect Effects 0.000 description 9
- 102220605874 Cytosolic arginine sensor for mTORC1 subunit 2_D10A_mutation Human genes 0.000 description 8
- 108091028043 Nucleic acid sequence Proteins 0.000 description 8
- 238000010276 construction Methods 0.000 description 8
- 240000008042 Zea mays Species 0.000 description 7
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 7
- 229940104302 cytosine Drugs 0.000 description 7
- 238000012217 deletion Methods 0.000 description 7
- 230000037430 deletion Effects 0.000 description 7
- 239000004009 herbicide Substances 0.000 description 7
- 238000005457 optimization Methods 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 238000012163 sequencing technique Methods 0.000 description 7
- 108010020183 3-phosphoshikimate 1-carboxyvinyltransferase Proteins 0.000 description 6
- 101150001232 ALS gene Proteins 0.000 description 6
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 6
- 239000004475 Arginine Substances 0.000 description 6
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 6
- 240000003768 Solanum lycopersicum Species 0.000 description 6
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 6
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 6
- 235000005822 corn Nutrition 0.000 description 6
- 238000003780 insertion Methods 0.000 description 6
- 230000037431 insertion Effects 0.000 description 6
- 230000010354 integration Effects 0.000 description 6
- 229930024421 Adenine Natural products 0.000 description 5
- 241000219194 Arabidopsis Species 0.000 description 5
- 244000068988 Glycine max Species 0.000 description 5
- 235000010469 Glycine max Nutrition 0.000 description 5
- 244000062793 Sorghum vulgare Species 0.000 description 5
- 241000700605 Viruses Species 0.000 description 5
- 229960000643 adenine Drugs 0.000 description 5
- 235000001014 amino acid Nutrition 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 238000010354 CRISPR gene editing Methods 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- 239000002202 Polyethylene glycol Substances 0.000 description 4
- 241000209140 Triticum Species 0.000 description 4
- 235000021307 Triticum Nutrition 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- 229920001223 polyethylene glycol Polymers 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 108010000700 Acetolactate synthase Proteins 0.000 description 3
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- 244000061176 Nicotiana tabacum Species 0.000 description 3
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 3
- 101150110992 SLR1 gene Proteins 0.000 description 3
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 3
- 108091028113 Trans-activating crRNA Proteins 0.000 description 3
- 235000004279 alanine Nutrition 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 230000002363 herbicidal effect Effects 0.000 description 3
- 230000004807 localization Effects 0.000 description 3
- 244000005700 microbiome Species 0.000 description 3
- 210000004940 nucleus Anatomy 0.000 description 3
- 238000005215 recombination Methods 0.000 description 3
- 230000008685 targeting Effects 0.000 description 3
- 101150097623 1B gene Proteins 0.000 description 2
- 108010079649 APOBEC-1 Deaminase Proteins 0.000 description 2
- 108010085238 Actins Proteins 0.000 description 2
- 102000007469 Actins Human genes 0.000 description 2
- 241000208173 Apiaceae Species 0.000 description 2
- 235000017060 Arachis glabrata Nutrition 0.000 description 2
- 244000105624 Arachis hypogaea Species 0.000 description 2
- 235000010777 Arachis hypogaea Nutrition 0.000 description 2
- 235000018262 Arachis monticola Nutrition 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 235000007319 Avena orientalis Nutrition 0.000 description 2
- 244000075850 Avena orientalis Species 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 240000007124 Brassica oleracea Species 0.000 description 2
- 235000003899 Brassica oleracea var acephala Nutrition 0.000 description 2
- 235000011301 Brassica oleracea var capitata Nutrition 0.000 description 2
- 235000001169 Brassica oleracea var oleracea Nutrition 0.000 description 2
- 238000010453 CRISPR/Cas method Methods 0.000 description 2
- 241000701489 Cauliflower mosaic virus Species 0.000 description 2
- 235000007516 Chrysanthemum Nutrition 0.000 description 2
- 244000189548 Chrysanthemum x morifolium Species 0.000 description 2
- 241000218631 Coniferophyta Species 0.000 description 2
- 241000518484 Cotton leaf curl virus Species 0.000 description 2
- 240000008067 Cucumis sativus Species 0.000 description 2
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 2
- 102100040264 DNA dC->dU-editing enzyme APOBEC-3D Human genes 0.000 description 2
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- 101000964382 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3D Proteins 0.000 description 2
- 240000005979 Hordeum vulgare Species 0.000 description 2
- 235000007340 Hordeum vulgare Nutrition 0.000 description 2
- 206010020649 Hyperkeratosis Diseases 0.000 description 2
- RAXXELZNTBOGNW-UHFFFAOYSA-O Imidazolium Chemical compound C1=C[NH+]=CN1 RAXXELZNTBOGNW-UHFFFAOYSA-O 0.000 description 2
- 229930010555 Inosine Natural products 0.000 description 2
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 2
- 240000008436 Ipomoea aquatica Species 0.000 description 2
- 235000019004 Ipomoea aquatica Nutrition 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- 240000008415 Lactuca sativa Species 0.000 description 2
- 235000003228 Lactuca sativa Nutrition 0.000 description 2
- 241000209510 Liliopsida Species 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 235000000365 Oenanthe javanica Nutrition 0.000 description 2
- 240000006243 Oenanthe sarmentosa Species 0.000 description 2
- 244000207867 Pistia stratiotes Species 0.000 description 2
- 235000006440 Pistia stratiotes Nutrition 0.000 description 2
- 108700001094 Plant Genes Proteins 0.000 description 2
- 102000014450 RNA Polymerase III Human genes 0.000 description 2
- 108010078067 RNA Polymerase III Proteins 0.000 description 2
- 241000208292 Solanaceae Species 0.000 description 2
- 244000300264 Spinacia oleracea Species 0.000 description 2
- 235000009337 Spinacia oleracea Nutrition 0.000 description 2
- 241000193996 Streptococcus pyogenes Species 0.000 description 2
- 241000187747 Streptomyces Species 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 241000723873 Tobacco mosaic virus Species 0.000 description 2
- 108090000848 Ubiquitin Proteins 0.000 description 2
- 102000044159 Ubiquitin Human genes 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 229940024606 amino acid Drugs 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000006481 deamination reaction Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000005782 double-strand break Effects 0.000 description 2
- 241001233957 eudicotyledons Species 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 229960003786 inosine Drugs 0.000 description 2
- 235000021374 legumes Nutrition 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 238000000520 microinjection Methods 0.000 description 2
- 235000019713 millet Nutrition 0.000 description 2
- 235000020232 peanut Nutrition 0.000 description 2
- 230000008635 plant growth Effects 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 108090000765 processed proteins & peptides Proteins 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000007480 sanger sequencing Methods 0.000 description 2
- 125000006850 spacer group Chemical group 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 238000011426 transformation method Methods 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- CAAMSDWKXXPUJR-UHFFFAOYSA-N 3,5-dihydro-4H-imidazol-4-one Chemical compound O=C1CNC=N1 CAAMSDWKXXPUJR-UHFFFAOYSA-N 0.000 description 1
- 101100301006 Allochromatium vinosum (strain ATCC 17899 / DSM 180 / NBRC 103801 / NCIMB 10441 / D) cbbL2 gene Proteins 0.000 description 1
- 240000002791 Brassica napus Species 0.000 description 1
- 235000011293 Brassica napus Nutrition 0.000 description 1
- 102100040397 C->U-editing enzyme APOBEC-1 Human genes 0.000 description 1
- 102100040399 C->U-editing enzyme APOBEC-2 Human genes 0.000 description 1
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 1
- 102100025053 Cell division control protein 45 homolog Human genes 0.000 description 1
- 241000747028 Cestrum yellow leaf curling virus Species 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 102100026846 Cytidine deaminase Human genes 0.000 description 1
- 108010031325 Cytidine deaminase Proteins 0.000 description 1
- 108091028709 DNA adenine Proteins 0.000 description 1
- 102100040263 DNA dC->dU-editing enzyme APOBEC-3A Human genes 0.000 description 1
- 102100040262 DNA dC->dU-editing enzyme APOBEC-3B Human genes 0.000 description 1
- 102100040261 DNA dC->dU-editing enzyme APOBEC-3C Human genes 0.000 description 1
- 102100040266 DNA dC->dU-editing enzyme APOBEC-3F Human genes 0.000 description 1
- 102100038050 DNA dC->dU-editing enzyme APOBEC-3H Human genes 0.000 description 1
- 101710082737 DNA dC->dU-editing enzyme APOBEC-3H Proteins 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 101150111720 EPSPS gene Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 229930191978 Gibberellin Natural products 0.000 description 1
- 239000005562 Glyphosate Substances 0.000 description 1
- 101000964322 Homo sapiens C->U-editing enzyme APOBEC-2 Proteins 0.000 description 1
- 101000934421 Homo sapiens Cell division control protein 45 homolog Proteins 0.000 description 1
- 101000964378 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3A Proteins 0.000 description 1
- 101000964385 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3B Proteins 0.000 description 1
- 101000964383 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3C Proteins 0.000 description 1
- 101000964377 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3F Proteins 0.000 description 1
- 101000800426 Homo sapiens Putative C->U-editing enzyme APOBEC-4 Proteins 0.000 description 1
- 241000235789 Hyperoartia Species 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- 101000598243 Nicotiana tabacum Probable aquaporin TIP-type RB7-18C Proteins 0.000 description 1
- 101000655028 Nicotiana tabacum Probable aquaporin TIP-type RB7-5A Proteins 0.000 description 1
- 241000209504 Poaceae Species 0.000 description 1
- 102100033091 Putative C->U-editing enzyme APOBEC-4 Human genes 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 101100036896 Schizosaccharomyces pombe (strain 972 / ATCC 24843) uep1 gene Proteins 0.000 description 1
- 102100022433 Single-stranded DNA cytosine deaminase Human genes 0.000 description 1
- 101710143275 Single-stranded DNA cytosine deaminase Proteins 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 241000194007 Streptococcus canis Species 0.000 description 1
- 101000910035 Streptococcus pyogenes serotype M1 CRISPR-associated endonuclease Cas9/Csn1 Proteins 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 230000006229 amino acid addition Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 101150004101 cbbL gene Proteins 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 244000038559 crop plants Species 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- IXORZMNAPKEEDV-UHFFFAOYSA-N gibberellic acid GA3 Natural products OC(=O)C1C2(C3)CC(=C)C3(O)CCC2C2(C=CC3O)C1C3(C)C(=O)O2 IXORZMNAPKEEDV-UHFFFAOYSA-N 0.000 description 1
- 239000003448 gibberellin Substances 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 1
- 229940097068 glyphosate Drugs 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 235000012015 potatoes Nutrition 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 101150074945 rbcL gene Proteins 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 108091035705 tRNA adenine Proteins 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
Images
Classifications
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H5/00—Angiosperms, i.e. flowering plants, characterised by their plant parts; Angiosperms characterised otherwise than by their botanic taxonomy
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H6/00—Angiosperms, i.e. flowering plants, characterised by their botanic taxonomy
- A01H6/20—Brassicaceae, e.g. canola, broccoli or rucola
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H6/00—Angiosperms, i.e. flowering plants, characterised by their botanic taxonomy
- A01H6/46—Gramineae or Poaceae, e.g. ryegrass, rice, wheat or maize
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H6/00—Angiosperms, i.e. flowering plants, characterised by their botanic taxonomy
- A01H6/54—Leguminosae or Fabaceae, e.g. soybean, alfalfa or peanut
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/10—Cells modified by introduction of foreign genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
Definitions
- the present invention relates to the field of biotechnology, in particular, to a nucleic acid construct for gene editing.
- CRISPR-Cas9 gene editing technology has been widely used in the research of gene editing in animals and plants. Currently, it mainly includes cytosine base editor (CBE) and adenine base editor (ABE).
- CBE cytosine base editor
- ABE adenine base editor
- the base editor can be performed in the genome Accurate base changes (such as substitutions) without causing DNA double-strand breaks (DSB).
- the early developed CBE and ABE systems consist of rat cytidine deaminase APOBEC1, which is derived from lamprey’s cytidine deaminase PmCDA1 and TadA, which is derived from tRNA adenine deaminase, and have been applied to many plant species.
- Medium such as rice, wheat, corn, tomato, Arabidopsis and Brassica napus.
- Kang et al. optimized the promoter
- Zong et al. optimized the deaminase.
- Cas9 protein variants can recognize PAM sequences that are different from the classic NGG motif.
- the efficiency of single-base editing is still very low.
- the base editors reported so far can only recognize a limited number of PAM sequences, which greatly limits the editable range of plant genomes.
- the purpose of the present invention is to provide a new nucleic acid construct for gene editing that can efficiently achieve A-G transformation in a large range and accurately in plant cells while significantly reducing the risk of insertion or deletion mutations.
- the first aspect of the present invention provides a nucleic acid construct having a 5'-3' (5' to 3') formula I structure:
- I1 is the first integrated component
- I2 is the second integrated component
- Z1 is the first expression cassette
- Z2 is the second expression cassette
- one of the expression cassettes of Z1 and Z2 has the structure of Ia or Ia', and the other expression cassette has the structure of formula Ib:
- P1, S1, X1, L1, X2, X4, L2, X3, P2, Y1 are elements used to form the construct, respectively;
- P1 is a first promoter, and the first promoter is an RNA polymerase II dependent promoter;
- S1 is the coding sequence of the first nuclear localization signal
- X1 is the coding sequence of adenine deaminase (such as wild-type and/or mutant TadA) and/or the coding sequence of cytosine deaminase;
- L1 is the coding sequence of no or first connecting peptide
- X2 is the coding sequence of Cas9 nuclease, which has no cleavage activity or single-stranded cleavage activity;
- X4 is the coding sequence of no or uracil glycosidase inhibitor UGI;
- L2 is the coding sequence of no or second connecting peptide
- X3 is the coding sequence of the second nuclear localization signal
- P2 is the second promoter
- Y1 is the coding sequence of gRNA
- each "-" is a bond or a nucleotide connection sequence
- the additional condition is that when X1 is the coding sequence of adenine deaminase, X4 is none, when X1 is the coding sequence of cytosine deaminase, and X4 is the coding sequence of the uracil glycosidase inhibitor UGI.
- the gRNA includes crRNA, tracrRNA, and sgRNA.
- the first promoter is derived from one or more plants selected from the group consisting of corn, rice, soybean, Arabidopsis or tomato.
- the first promoter is derived from one or more microorganisms selected from the group consisting of Streptomyces and Escherichia coli.
- the first promoter is derived from one or more viruses selected from the group consisting of tobacco mosaic virus, yellow leaf curl virus, cauliflower mosaic virus, and cotton leaf curl virus.
- the first promoter includes a maize ubiquitin promoter.
- the ubiquitin promoter includes UBI promoter.
- the first promoter is selected from the group consisting of UBI, UBQ, 35S, Actin, SPL, CmYLCV, YAO, CDC45, rbcS, rbcL, PsGNS2, UEP1, TobRB7, Cab, or a combination thereof.
- the length of the nucleotide sequence of L1 and L2 is independently 3-120 nt, preferably 3-96 nt, and preferably a multiple of 3.
- the lengths of the amino acid sequences encoded by L1 and L2 are independently 3-40aa, preferably 6-32aa, preferably 18-32aa, and preferably 24-32aa.
- the length of the nucleotide linking sequence is 1-300 nt, preferably 1-100 nt.
- nucleotide linking sequence does not affect the normal transcription and translation of each element.
- the Cas9 nuclease is selected from the group consisting of nCas9, Cas9NG, nCas9NG, or a combination thereof.
- the Cas9 nuclease is selected from the group consisting of nSpCas9 (D10A), nSpCas9NG, nSaCas9, nScCas9, nSqCas9, nXCas9, or a combination thereof.
- the Cas9 nuclease includes a mutant Cas9 nuclease.
- the identity of the Cas9 nuclease and the mutant Cas9 nuclease is ⁇ 80%, preferably ⁇ 90%; more preferably ⁇ 95%, more preferably, ⁇ 98% Or 99%.
- the activity of the mutant Cas9 nuclease is equivalent to or significantly better than that of the wild-type Cas9 enzyme.
- the mutant Cas9 nuclease is passed through one or more of the wild-type Cas9 nuclease, preferably 1-15, preferably 1-10, preferably 1- 7, more preferably 2-5, amino acid substitution, deletion; and/or after 1-5, preferably 1-4, more preferably 1-3, most preferably 1-2 amino acid addition Forming.
- the mutation site is at position D10A of Cas9 nuclease, and its amino acid sequence is shown in SEQ ID NO.:2.
- the X2 element is mutated at one or more sites selected from the following group in the Cas9 nuclease corresponding to SEQ ID NO.: 2:
- Threonine (T) at position 1337.
- the arginine (R) at position 1335 is mutated to alanine (A); and/or
- Leucine (L) at position 1111 is mutated to arginine (R); and/or
- Threonine (T) at position 1337 was mutated to arginine (R).
- the mutation is selected from the following group: R1335A; L1111R; D1135V; G1218R; E1219F; A1322R; T1337R.
- amino acid sequence of the X2 element is shown in SEQ ID NO.: 3.
- the X2 element is derived from bacteria.
- the source of the X2 element is selected from the group consisting of Streptococcus pyogenes, Staphylococcus aureus, Streptococcus canis, or a combination thereof.
- the coding sequence of the first connecting peptide and the coding sequence of the second connecting peptide each independently include XTEN.
- the coding sequence of the first connecting peptide and the coding sequence of the second connecting peptide are shown in SEQ ID NO.: 4 or 7.
- the nuclear localization signal is a codon-optimized nuclear localization signal.
- the nuclear localization signal is a plant codon optimized nuclear localization signal.
- the nuclear localization signal includes bpNLS.
- the nuclear localization signal is bpNLS.
- nucleotide sequences of the S1 element and the X3 element are shown in SEQ ID NO.: 5 or 20, respectively.
- the adenine deaminase includes wild type and mutant type.
- the adenine deaminase includes wild-type and/or mutant TadA.
- the adenine deaminase includes TadA.
- the mutant type of adenine deaminase includes TadA7-10.
- the adenine deaminase is a tandem adenine deaminase, and the structure of the tandem adenine deaminase is shown in formula II:
- Z8 is the amino acid sequence of wild-type adenine deaminase TadA
- L8 is an optional connecting peptide sequence
- Z9 is the amino acid sequence of the mutant adenine deaminase TadA7-10.
- the adenine deaminase is a codon-optimized adenine deaminase.
- the adenine deaminase is a plant codon optimized adenine deaminase.
- the coding sequence of the adenine deaminase is selected from the following group:
- the coding sequence of the adenine deaminase is shown in SEQ ID NO.1.
- amino acid sequence of the adenine deaminase is shown in SEQ ID NO.: 8.
- the cytosine deaminase includes wild type and mutant type.
- the cytosine deaminase includes APOBEC.
- the APOBEC is selected from the following group: APOBEC1 (A1), APOBEC2 (A2), APOBEC3A, APOBEC3B, APOBEC3C, APOBEC3D, APOBEC3E, APOBEC3F, APOBEC3H, APOBEC4 (A4), activation-induced deaminase ( activation induced cytidinedeaminase, AID), or a combination thereof.
- the mutants of the cytosine deaminase include CBE2.0, CBE2.1, CBE2.2, CBE2.3, and CBE2.4.
- the cytosine deaminase is a codon-optimized cytosine deaminase.
- the cytosine deaminase is a plant codon optimized cytosine deaminase.
- the coding sequence of the cytosine deaminase is selected from the following group:
- the coding sequence of the cytosine deaminase is shown in SEQ ID NO.19.
- amino acid sequence of the cytosine deaminase is shown in SEQ ID NO.:21.
- nucleotide sequence of the X4 element is shown in SEQ ID NO.: 22.
- the second promoter is derived from one or more plants selected from the group consisting of rice, corn, soybean, Arabidopsis or tomato.
- the second promoter is derived from one or more microorganisms selected from the group consisting of Streptomyces and Escherichia coli.
- the second promoter is derived from one or more viruses selected from the group consisting of tobacco mosaic virus, yellow leaf curl virus, cauliflower mosaic virus, and cotton leaf curl virus.
- the second promoter includes an RNA polymerase III-dependent promoter.
- the second promoter is an RNA polymerase III-dependent promoter.
- the second promoter is selected from the group consisting of U6, U3, U6a, U6b, U6c, U6-1, U3b, U3d, U6-26, U6-29, H1, or a combination thereof.
- the second promoter includes U6 promoter.
- the second promoter is selected from the group consisting of OsU6, OsU3, OsU6a, OsU6b, OsU6c, AtU6-1, AtU3b, AtU3d, AtU6-1, AtU6-26, AtU6-29, or combination.
- the "no cleavage activity or single-strand cleavage activity" refers to the Cas9 nuclease having no cleavage activity for the single-stranded target site T.
- the "no cleavage activity or single-strand cleavage activity" refers to the Cas9 nuclease has no cleavage activity for the single-stranded target site G.
- nucleotide elements of the present invention are connected in-frame to express a fusion protein with the correct amino acid sequence.
- the construct has a structure of formula IIa or IIa' or formula IIb or IIb':
- both the first expression cassette and the second expression cassette have a terminator.
- first expression cassette and the second expression cassette share the same terminator.
- the terminator includes a terminator suitable for plant gene editing.
- the terminator is selected from the group consisting of NOS, Poly A, T-UBQ, rbcS, or a combination thereof.
- nucleotide sequence of the terminator is shown in SEQ ID NO.:6.
- the first integration element includes a 5'homology arm sequence.
- the second integration element includes a 3'homology arm sequence.
- the length of the nucleic acid construct is 3000-10000 bp, preferably 4000-8500 bp, more preferably 4000-6000 bp.
- one or more additional expression cassettes are additionally inserted.
- the additional expression cassette is independent of the first expression cassette and the second expression cassette.
- the additional expression cassette expresses a substance selected from the following group:
- the marker gene includes a resistance gene (such as a hygromycin resistance gene, a herbicide resistance gene), a fluorescent gene, or a combination thereof.
- the second aspect of the present invention provides a fusion protein, the fusion protein includes (a) adenine deaminase and/or cytosine deaminase; and (b) Cas9 nuclease, and the fusion protein is composed of the present invention
- the nucleic acid construct described in the first aspect encodes.
- the third aspect of the present invention provides a vector containing the nucleic acid construct according to the first aspect of the present invention.
- the vector is a plant expression vector.
- the vector is an expression vector that can be transfected or transformed into plant cells.
- the carrier is an Agrobacterium Ti carrier.
- the construct is integrated into the T-DNA region of the vector.
- the carrier is cyclic or linear.
- the fourth aspect of the present invention provides a genetically engineered cell containing the nucleic acid construct according to the first aspect of the present invention, or its genome integrates one or more nucleic acid constructs according to the first aspect of the present invention.
- the cell is a plant cell.
- the plant is selected from the group consisting of monocots, dicots, gymnosperms, or combinations thereof.
- the plant is selected from the group consisting of gramineous plants, legumes, cruciferous plants, Solanaceae, Umbelliferae, or a combination thereof.
- the plants include: Arabidopsis, wheat, barley, oats, corn, rice, sorghum, millet, soybean, peanut, tobacco, tomato, cabbage, rape, spinach, lettuce, cucumber, chrysanthemum , Water spinach, celery, lettuce, or combinations thereof.
- the genetically engineered cell is introduced into the cell with the nucleic acid construct of claim 1 by a method selected from the group consisting of: Agrobacterium transformation method, gene gun method, microinjection method, electric shock method , Ultrasonic method and polyethylene glycol (PEG) mediation method.
- a method selected from the group consisting of: Agrobacterium transformation method, gene gun method, microinjection method, electric shock method , Ultrasonic method and polyethylene glycol (PEG) mediation method.
- the fifth aspect of the present invention provides a reagent combination for gene editing, including:
- P1 is a first promoter, and the first promoter is an RNA polymerase II dependent promoter;
- S1 is the coding sequence of the first nuclear localization signal
- X1 is the coding sequence of adenine deaminase (such as wild-type and/or mutant TadA) and/or the coding sequence of cytosine deaminase;
- L1 is the coding sequence of no or first connecting peptide
- X2 is the coding sequence of Cas9 nuclease, said Cas9 nuclease has no cleavage activity or single-stranded cleavage activity;
- X4 is the coding sequence of no or uracil glycosidase inhibitor UGI;
- L2 is the coding sequence of no or second connecting peptide
- X3 is the coding sequence of the nuclear localization signal
- the additional condition is that when X1 is the coding sequence of adenine deaminase, X4 is none, when X1 is the coding sequence of cytosine deaminase, and X4 is the coding sequence of the uracil glycosidase inhibitor UGI; and
- P2 is the second promoter
- Y1 is the coding sequence of gRNA
- the first carrier and the second carrier are different carriers.
- first nucleic acid construct and the second nucleic acid construct are located on different vectors.
- the first carrier and the second carrier are the same carrier.
- first nucleic acid construct and the second nucleic acid construct are located on the same vector.
- the sixth aspect of the present invention provides a kit containing the reagent combination according to the fifth aspect of the present invention.
- the kit further contains a label or instructions.
- the seventh aspect of the present invention provides a method for gene editing of plants, including the steps:
- nucleic acid construct according to the first aspect of the present invention, the vector according to the third aspect of the present invention, or the reagent combination according to the fifth aspect of the present invention is introduced into the plant cell of the plant to be edited, so that the Gene editing in plant cells.
- the introduction is by Agrobacterium.
- the introduction is by gene gun introduction.
- the gene editing is site-directed base substitution (or mutation).
- the site-directed substitution includes mutating A to G.
- the site-directed substitution includes mutating C to T.
- the plants include any higher plant types that can be transformed, including monocots, dicots and gymnosperms.
- the plant is selected from the group consisting of gramineous plants, legumes, cruciferous plants, Solanaceae, Umbelliferae, or a combination thereof.
- the plants include: Arabidopsis, wheat, barley, oats, corn, rice, sorghum, millet, soybean, peanut, tobacco, tomato, cabbage, rape, spinach, lettuce, cucumber, chrysanthemum , Water spinach, celery, lettuce, or combinations thereof.
- the eighth aspect of the present invention provides a method for preparing gene-edited plant cells, including the steps:
- nucleic acid construct of the first aspect of the present invention, the vector of the third aspect of the present invention, or the combination of the reagent of the fifth aspect of the present invention are transfected into plant cells, so that the chromosomes in the plant cells undergo site-directed replacement ( Or mutation), thereby preparing the gene-edited plant cell.
- the transfection adopts the Agrobacterium transformation method or the gene gun bombardment method.
- the ninth aspect of the present invention provides a nucleic acid construct according to the first aspect of the present invention, a fusion protein according to the second aspect of the present invention, a vector according to the third aspect of the present invention, and a gene according to the fourth aspect of the present invention
- the engineered cell, the reagent combination according to the fifth aspect of the present invention, and the use of the kit according to the sixth aspect of the present invention are used for gene editing of plants.
- the tenth aspect of the present invention provides a method for preparing a gene-edited plant, including the steps:
- the gene-edited plant cell prepared by the method of the eighth aspect of the present invention or the ninth aspect of the present invention is regenerated into a plant body, thereby obtaining the gene-edited plant.
- the eleventh aspect of the present invention provides a gene-edited plant prepared by the method of the tenth aspect of the present invention.
- the twelfth aspect of the present invention provides a composite comprising the following two components:
- nucleic acid component the nucleic acid is gRNA
- Figure 1 shows the editing efficiency of ABE-nCas9 and ABEmax-nCas9 on ALS genes.
- Figure 2 shows the editing efficiency of ABE-nCas9, ABEmax-nCas9 and ABEmax-nCas9NG on ALS genes at different PAM sites.
- Figure 3 shows the base editing efficiency of CBE2.0-nCas9 for replacing the target site C with T in the rice NRT1.1B gene.
- the abscissa represents the sgRNA-PAM sequence, and the ordinate represents the substitution efficiency.
- Figure 4 shows the ratio of homozygous to other non-homozygous in rice NRT1.1B gene mutant plants.
- FIGS 5 and 6 show the difference in traits between rice SLR1 gene mutant plants and wild-type plants.
- Figure 7 shows the base editing efficiency of CBE2.0-nCas9 for replacing the target site C of the rice SLR1 gene with T.
- the abscissa represents the sgRNA-PAM sequence, and the ordinate represents the substitution efficiency.
- Figure 8 shows the ratio of homozygous to other non-homozygous mutation types in rice SLR1 gene mutants.
- Figure 9 shows the base editing efficiency of CBE2.0-nCas9 for replacing the target site C with T in the rice ALS gene.
- Figure 10 shows the ratio of homozygous to other non-homozygous plants in mutant plants of the rice ALS gene.
- Figure 11 shows the base editing efficiency of CBE2.0-nCas9NG for replacing the target site C with T in rice ALS gene.
- Figure 12 shows the difference in the growth phenotype of rice ALS gene mutant plants and wild-type plants sprayed with imidazolium herbicide.
- Figure 13 shows the mutation sites of rice ALS gene mutant plants relative to wild-type plants.
- Figure 14 shows the mutation sites of the rice EPSPS gene.
- Figure 15 shows the editing efficiency of CBE2.0-nCas9 and CBE2.0-nCas9NG for different target genes.
- adenine base editor elements and/or cytosine base editor elements After extensive and in-depth research, the inventors have optimized the quality and quantity of adenine base editor elements and/or cytosine base editor elements, using binuclear localization signals, optimized adenine deaminase and/or cytosine Pyrimidine deaminase and different Cas9 nucleases construct an adenine and/or cytosine base editing tool that is more efficient and has a wider recognition range in plant gene editing.
- the present invention successfully implements sgRNA in plants for the first time Guided base site-directed mutation (such as A mutation to G or C mutation to T), and the mutation efficiency of the adenine base editor element is very high (up to ⁇ 40% or higher), which can identify more PAM positions At the same time, the indel ratio is very low.
- the inventors have also improved the mutation efficiency significantly (up to ⁇ 80% or higher) by optimizing the cytosine deaminase. By optimizing the Cas9 protein, it can recognize more Many PAM sites (including NGG, NG), and the indel ratio is very low, ⁇ 7%, which improves the accuracy of editing.
- the present invention has been completed on this basis.
- homologous arm refers to the flanking sequences that are identical to the genome sequence on both sides of the foreign sequence to be inserted on the targeting vector, and are used to identify and recombine regions.
- plant promoter refers to a nucleic acid sequence capable of initiating transcription of nucleic acid in plant cells.
- the plant promoter can be derived from plants, microorganisms (such as bacteria, viruses), animals, etc., or synthetic or engineered promoters.
- the term "gene editing” or “base mutation” or “base editing” refers to a substitution, insertion, and/or deletion of a base at a certain position in a nucleotide sequence. ).
- the "edit” or “mutation” in the present invention is preferably a single base mutation.
- base substitution refers to the mutation of a base at a certain position in the nucleotide sequence to another different base, such as the mutation of A to G.
- A.T to G.C refers to the mutation or replacement of an A-T base pair at a certain position with a G-C base pair in a double-stranded nucleic acid sequence (especially a genomic sequence).
- C.G to T.A refers to the mutation or replacement of a C-G base pair at a certain position with a T-A base pair in a double-stranded nucleic acid sequence (especially a genomic sequence).
- Cas protein refers to a nuclease.
- a preferred Cas protein is the Cas9 protein.
- Typical Cas9 proteins include (but are not limited to): Cas9 derived from Streptococcus pyogenes.
- the Cas9 protein is a mutant Cas9 protein, specifically, a mutant Cas9 protein that has no cleavage activity or only a single-stranded cleavage activity.
- the Cas9 protein of the present invention includes SpCas9n (D10A), nSpCas9NG, SaCas9n, ScCas9n, XCas9n.
- the term "coding sequence of Cas protein” refers to a nucleotide sequence encoding Cas protein.
- the skilled person will realize that because of the degeneracy of the codon, a large number of polynucleotide sequences can encode the same polypeptide .
- technicians will also realize that different species have certain preferences for codons, and may optimize the codons of Cas protein according to the needs of expression in different species. These variants are all referred to by the term "Cas protein.
- Encoding sequence specifically covers.
- the term specifically includes a full-length sequence that is substantially the same as the Cas gene sequence, and a sequence encoding a protein that retains the function of the Cas protein.
- gRNA is also called guide RNA or guide RNA, and has the meaning commonly understood by those skilled in the art.
- guide RNAs can include direct repeats and guide sequences, or consist essentially of direct repeats and guide sequences (also called spacers in the context of endogenous CRISPR systems). (spacer)) composition.
- gRNA can include crRNA and tracrRNA, or only crRNA, depending on the Cas protein it depends on.
- crRNA and tracrRNA can be artificially modified and fused to form single guide RNA (sgRNA).
- the gRNA of the present invention may be natural, or artificially modified or designed and synthesized.
- the targeting sequence is any polynucleotide sequence that has sufficient complementarity with the target sequence to hybridize with the target sequence and guide the specific binding of the CRISPR/Cas complex to the target sequence, usually having 17- Sequence length of 23nt.
- the degree of complementarity between the targeting sequence and its corresponding target sequence is at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, Or at least 99%. Determining the best alignment is within the abilities of those of ordinary skill in the art. For example, there are published and commercially available alignment algorithms and programs, such as but not limited to ClustalW, Smith-Waterman algorithm in matlab, Bowtie, Geneious, Biopython, and SeqMan.
- nucleotide sequence is from 5'to 3', unless otherwise specified.
- adenine deaminase refers to TadA adenine deaminase, derived from Escherichia coli, originally acting on tRNA and capable of deaminating specific adenines in tRNA.
- the applicable TadA includes both the wild-type form and its specific mutant form TadA7-10, or a combination of the wild-type form and the mutant form.
- TadA7-10 can perform deamination reaction with DNA as a substrate.
- the coding sequence of the adenine deaminase of the present invention is codon-optimized, so that it can be expressed in plants more efficiently.
- cytosine deaminase refers to the cytosine deaminase APOBEC, derived from Escherichia coli, originally acting on tRNA, capable of deaminating specific cytosines in tRNA.
- the applicable cytosine deaminase includes both wild-type and specific mutant forms (such as CBE2.0, CBE2.1, CBE2.2, CBE2.3, CBE2.4), or Contains a combination of wild-type and mutant forms.
- the mutant form of cytosine deaminase can perform deamination reaction with DNA as a substrate.
- the coding sequence of the cytosine deaminase of the present invention is codon-optimized, so that it can be expressed in plants more efficiently.
- the preferred cytosine deaminase is CBE2.0.
- amino acid sequence of CBE2.0 is shown in SEQ ID NO.: 23.
- the amino acid sequence of CBE2.1 is shown in SEQ ID NO.: 24.
- the amino acid sequence of CBE2.2 is shown in SEQ ID NO.: 25.
- amino acid sequence of CBE2.3 is shown in SEQ ID NO.: 26.
- the amino acid sequence of CBE2.4 is shown in SEQ ID NO.: 27.
- the present invention provides a nucleic acid construct for gene editing of plants, the nucleic acid construct has a 5'-3' formula I structure:
- I1 is the first integrated component
- I2 is the second integrated component
- Z1 is the first expression cassette
- Z2 is the second expression cassette
- one of the expression cassettes of Z1 and Z2 has the structure of Ia or Ia', and the other expression cassette has the structure of formula Ib:
- I1, P1, S1, X1, L1, X2, X4, L2, X3, P2, Y1, I2 are elements used to form the construct, respectively, and the definitions are as described in the first aspect of the present invention
- each "-" is a bond or a nucleotide connection sequence
- the additional condition is that when X1 is the coding sequence of adenine deaminase, X4 is none, when X1 is the coding sequence of cytosine deaminase, and X4 is the coding sequence of the uracil glycosidase inhibitor UGI.
- the I1 element (or the integration element on the left) and the I2 element (or the integration element on the right) can cooperate to integrate the elements between them (that is, the nucleotide sequence from P1 to Y1) into In the genome of plant cells.
- I1 and I2 are Ti elements from Agrobacterium. Of course, other elements that can play a similar integration role can also be used in the present invention.
- the various elements used in the construct of the present invention are either known in the art or can be prepared by methods known to those skilled in the art.
- the corresponding elements can be obtained by conventional methods, such as PCR methods, fully artificial chemical synthesis methods, and enzyme digestion methods, and then connected together by well-known DNA ligation techniques to form the construct of the present invention.
- Inserting the construct of the present invention into an exogenous vector constitutes the vector of the present invention.
- the vector of the present invention is transformed into plant cells so as to mediate the integration of the vector of the present invention into the chromosomes of plant cells and express in the plant body to prepare gene-edited plant cells.
- the gene-edited plant cell of the present invention is regenerated into a plant body, thereby obtaining a gene-edited plant.
- nucleic acid construct constructed by the present invention can be introduced into plant cells through conventional plant recombination technology (for example, Agrobacterium transfer technology), thereby obtaining the nucleic acid construct (or the vector carrying the nucleic acid construct) Or obtain a plant cell with the nucleic acid construct integrated in the genome.
- plant recombination technology for example, Agrobacterium transfer technology
- the plant individual integrated with the nucleic acid construct can be isolated or removed in its progeny by routine screening or other means known in the art, thereby obtaining a gene-edited plant without the nucleic acid construct body.
- the present invention is to construct an optimized adenine deaminase expression cassette and/or cytosine deaminase expression cassette in the CRISPR/Cas9 system of plants.
- Adenine deaminase can convert the target DNA Adenine (A) is deaminated and converted to inosine (I). Inosine can be paired with cytosine, and it is read and copied as guanine (G) at the DNA level to realize A to G mutation within the mutation window;
- Cytosine deaminase can deaminate the cytosine (C) in the target DNA into uracil (U), and uracil will be recognized as T during the DNA replication process, realizing the mutation from C to T.
- the basic structure of the nucleic acid construct of the adenine base editor element is as follows:
- the deaminase expression cassette is ZmUbi-ecTadA-32aa linker-ecTadA(7.10)-32aa linker-nSpCas9/nSaCas9-SV40NLS-NOS
- the basic structure of the nucleic acid construct of the cytosine base editor element is as follows:
- the deaminase expression cassette is ZmUbi-rAPOBEC1-XTEN-nCas9-UGI-SV40NLS-NOS
- the main feature of the vector is to link adenine deaminase and/or cytosine deaminase with the Cas protein in the CRISPR/Cas system and the coding sequence of the nuclear localization signal bpNLS to form the coding sequence of the fusion protein.
- the fusion protein encoded by the coding sequence is expressed in the cytoplasm, the fusion protein can be transferred to the nucleus very efficiently, and guided to the target site in the genome by the guide RNA encoded by the formula I construct, Therefore, the base substitution from AT to GC or CG to TA is performed at the target site, and the risk of insertion/deletion is basically avoided or eliminated.
- the Cas protein is a mutant Cas protein with no cleavage activity or a single-strand cleavage activity.
- the Cas protein of the present invention may be Cas9 (D10A), and its amino acid sequence is shown in SEQ ID NO.:2.
- the Cas protein of the present invention may be nCas9NG, and its amino acid sequence is shown in SEQ ID NO.:3.
- the proteins are usually connected by some flexible short peptides, namely Linker (connecting peptide sequence).
- Linker connecting peptide sequence
- the Linker can use XTEN, and its coding sequence is shown in SEQ ID NO.:4.
- suitable promoters include constitutive and/or inducible promoters.
- a strong promoter suitable for plant cells can be selected. Representative examples include (but not limited to): CaMV 35S promoter or UBI promoter or Actin promoter.
- the action area of the deaminase is fixed.
- adenine deaminase TadA or mutant adenine deaminase TadA7-10 protein for example, (a) adenine deaminase TadA or mutant adenine deaminase TadA7-10 protein; and/or (b) cytosine deaminase APOBEC or mutant cytosine deaminase (CBE2. 0)
- the experimental results obtained by the present invention show that the editing windows of the various Cas proteins of the present invention have little difference, and they are all within the first 20 bases of the PAM site Within, the preferred hot spot area is in the range of 3-10 bases.
- the method for introducing the construct of formula I of the present invention into cells or integrating into the genome there is no particular limitation on the method for introducing the construct of formula I of the present invention into cells or integrating into the genome. It can be carried out by conventional methods, for example, the construct of formula I or the corresponding vector is introduced into plant cells by a suitable method.
- Representative introduction methods include but are not limited to: Agrobacterium transfection method, gene gun method, microinjection method, electric shock method, ultrasonic method, and polyethylene glycol (PEG)-mediated method.
- the recipient plants which include various crop plants (such as grasses), forestry plants, horticultural plants (such as floral plants) and the like.
- crop plants such as grasses
- forestry plants such as floral plants
- horticultural plants such as floral plants
- Representative examples include, but are not limited to: rice, soybeans, tomatoes, corn, tobacco, wheat, sorghum, potatoes and the like.
- the DNA in the transformed plant cell is allowed to express the fusion protein and gRNA.
- the Cas protein fused with adenine deaminase under the guidance of the corresponding gRNA, mutates the A at the target position to G (thereby mutates the T of the complementary strand to C) or mutates the C at the target position to T (and thus makes the complementary
- the G mutation of the chain is A).
- codon optimization is performed on the coding sequence of adenine deaminase and/or cytosine deaminase, especially plant codon optimization.
- codon optimization There are 64 genetic codes, but most of them tend to use some of these codons. Those most frequently used are called preferred codons or optimal codons, and those that are not frequently used are called rare or low-utilized codons. In fact, every organism that uses protein expression or production shows a certain degree of difference or preference in codon usage. Use preferred codons and avoid inefficient or rare codons for gene synthesis. This kind of gene resetting is called codon optimization.
- codon optimization is performed on the amino acid sequence of adenine deaminase and/or cytosine deaminase and nuclear localization signal, and the codons preferred by eukaryotic cells are used for optimization.
- the preferred codons of animal cells are used.
- the codons are optimized, and more preferably, the codons preferred by plant cells are used for optimization.
- the invention can be used in the field of plant genetic engineering, for plant research and breeding, especially for genetic improvement of agricultural crops, forestry crops or horticultural plants with economic value.
- the present invention combines Cas9 nuclease (such as nCas9 or nCas9NG) with optimized adenine deaminase and binuclear localization signal to form a fusion protein for the first time, and successfully realizes sgRNA-guided base-directed mutation (such as A mutation to G), and the mutation efficiency is very high (up to ⁇ 40% or higher), while significantly reducing or substantially eliminating the risk of insertion and/or deletion (indel) at the target site.
- Cas9 nuclease such as nCas9 or nCas9NG
- adenine deaminase and binuclear localization signal to form a fusion protein for the first time
- sgRNA-guided base-directed mutation such as A mutation to G
- the present invention combines Cas9 nuclease (such as nCas9 or nCas9NG) with optimized cytosine deaminase, UGI, and binuclear localization signal to form a fusion protein for the first time, and successfully realizes sgRNA-guided base-directed mutation (such as C
- the mutation is T
- the mutation efficiency is very high (up to ⁇ 80% or higher), while significantly reducing or substantially eliminating the risk of insertion and/or deletion (indel) at the target site, and can reduce the indel ratio It is ⁇ 7%.
- the present invention can expand the range of targeted editing in the plant genome by using different forms of Cas9.
- the present invention finds for the first time that the nCas9NG of the present invention can recognize non-NGG (such as NG) PAM, and can obtain very efficient base editing efficiency.
- non-NGG such as NG
- ALS-sg3 CGCATTCAAGGACATGATCCTGG (SEQ ID NO.: 9)
- ALS-sg1 GCGCCCCCACTTGGGATCATAGG (SEQ ID NO.: 10)
- the nucleic acid construct (pCambia1300) containing the base editor and hygromycin resistance gene was introduced into Agrobacterium EHA105, and then transformed into rice callus.
- the transformation, tissue culture, and plant growth of rice were performed according to the procedures recorded in the literature. (Nishimura et al., 2006; Wang et al., 2015).
- the efficiency of ABE-nCas9 to generate A to G mutations at the target site is about 20%, while the efficiency of ABEmax-nCas9 is about 40%-50%.
- the specific results are shown in Figure 1.
- the editing efficiency of ABEmax-nCas9 is as high as 40%, which is double the editing efficiency of ABE-nCas9.
- the results show that the optimized ABEmax-nCas9 base editor can be efficiently applied to site-specific base replacement in plant genomes.
- Adopt Mut MultiS multiple site-directed mutagenesis kit obtained nSpCas9-NG on the basis of nSpCas9 (D10A), which contains 7 amino acid substitutions R1335A/L1111R/D1135V/G1218R/E1219F/A1322R/T1337R.
- the mutated sequence replaced the nSpCas9(D10A) fragment in the ABEmax-nCas9 editor with BamHI and SpeI restriction enzyme sites to obtain the ABEmax-nCas9NG editor.
- Example 3 The base editing effect of CBE2.0-nCas9 on rice NRT1.1B and SLR1 genes
- NRT1.1B and SLR1 genes were selected, and the previously tested sgRNA was used for editing.
- NRT1.1B controls the absorption of nitrogen as a nutrient element in rice.
- the change of amino acid at position 327 from Thr to Met can increase yield;
- SLR1 controls the synthesis of gibberellin and affects rice plant height.
- OsU6-sgRNA NRT1.1B The sequence of OsU6-sgRNA (NRT1.1B) is shown in SEQ ID NO.: 28.
- OsU6-sgRNA SLR1
- SLR1 The sequence of OsU6-sgRNA (SLR1) is shown in SEQ ID NO.: 29.
- the nucleic acid construct (pCambia1300) containing the base editor and hygromycin resistance gene was introduced into Agrobacterium EHA105, and then transformed into rice callus.
- the transformation, tissue culture, and plant growth of rice were performed according to the procedures recorded in the literature. (Nishimura et al., 2006; Wang et al., 2015).
- the base replacement efficiency of the first generation of CBE base editor at the above two sites is only 2.7% (NRT1.1B) and 13.3% (SLR1) (Lu and Zhu, 2017), compared with the present invention
- the efficiency of the optimized CBE base editor has been increased by 26 times and 6 times respectively.
- Example 4 The base editing effect of CBE2.0-nCas9 on rice acetolactate synthase gene (ALS)
- Acetolactate synthase is a key enzyme for the synthesis of valine, leucine and isoleucine in plants.
- ALS inhibitors are often used as herbicides. However, in addition to inhibiting the growth of weeds, these herbicides also Can inhibit the growth of crops. Studies have shown that the mutation of Ser at position 627 of the protein sequence of ALS to Asn (corresponding to the mutation of G at position 1880 in the DNA sequence to A) confers tolerance to imidazolinone herbicides (Piao et al., 2018) ).
- Example 5 The base editing effect of CBE2.0-nCas9NG on acetolactate synthase gene (ALS) and 5-enolpyruvylshikimate-3-phosphate synthase gene (EPSPS) in rice
- ALS acetolactate synthase gene
- EPSPS 5-enolpyruvylshikimate-3-phosphate synthase gene
- Adopt Mut MultiS multiple site-directed mutagenesis kit purchased from Nanjing Novozan obtained nSpCas9-NG on the basis of nSpCas9 (D10A), which contains 7 amino acid substitutions R1335A/L1111R/D1135V/G1218R/E1219F/A1322R/T1337R.
- the mutated sequence replaced the nSpCas9(D10A) fragment in the CBE2.0-nCas9 editor with BamHI and SpeI restriction enzyme sites to obtain the CBE2.0-nCas9NG editor.
- CBE2.0-nCas9NG base editor can recognize the PAM motif of NGN.
- the sgRNA sequence for ALS has been redesigned (ALSsg2, with AGC as the PAM motif: CCCCACTTGGGATCATAGGCAGC (SEQ ID NO.: 44))
- ALSsg2 with AGC as the PAM motif: CCCCACTTGGGATCATAGGCAGC (SEQ ID NO.: 44)
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Developmental Biology & Embryology (AREA)
- Microbiology (AREA)
- Environmental Sciences (AREA)
- Physiology (AREA)
- Botany (AREA)
- Medicinal Chemistry (AREA)
- Natural Medicines & Medicinal Plants (AREA)
- Biophysics (AREA)
- Cell Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
L'invention concerne une construction d'acide nucléique pour l'édition de gènes, la construction d'acide nucléique ayant une structure spécifique pouvant permettre la mutagenèse dirigée sur l'ARNg et dirigée sur le site de bases dans des plantes.
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910169340.X | 2019-03-06 | ||
CN201910169340 | 2019-03-06 | ||
CN201910173148.8 | 2019-03-07 | ||
CN201910173148 | 2019-03-07 | ||
CN201910838286.3A CN110526993B (zh) | 2019-03-06 | 2019-09-05 | 一种用于基因编辑的核酸构建物 |
CN201910839046.5A CN110527695B (zh) | 2019-03-07 | 2019-09-05 | 一种用于基因定点突变的核酸构建物 |
CN201910839046.5 | 2019-09-05 | ||
CN201910838286.3 | 2019-09-05 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020177751A1 true WO2020177751A1 (fr) | 2020-09-10 |
Family
ID=72337612
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/078079 WO2020177751A1 (fr) | 2019-03-06 | 2020-03-05 | Construction d'acide nucléique pour édition de gènes |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2020177751A1 (fr) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104293828A (zh) * | 2013-07-16 | 2015-01-21 | 中国科学院上海生命科学研究院 | 植物基因组定点修饰方法 |
CN106609282A (zh) * | 2016-12-02 | 2017-05-03 | 中国科学院上海生命科学研究院 | 一种用于植物基因组定点碱基替换的载体 |
CN110157726A (zh) * | 2018-02-11 | 2019-08-23 | 中国科学院上海生命科学研究院 | 植物基因组定点替换的方法 |
CN110527695A (zh) * | 2019-03-07 | 2019-12-03 | 山东舜丰生物科技有限公司 | 一种用于基因定点突变的核酸构建物 |
CN110526993A (zh) * | 2019-03-06 | 2019-12-03 | 山东舜丰生物科技有限公司 | 一种用于基因编辑的核酸构建物 |
-
2020
- 2020-03-05 WO PCT/CN2020/078079 patent/WO2020177751A1/fr active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104293828A (zh) * | 2013-07-16 | 2015-01-21 | 中国科学院上海生命科学研究院 | 植物基因组定点修饰方法 |
CN106609282A (zh) * | 2016-12-02 | 2017-05-03 | 中国科学院上海生命科学研究院 | 一种用于植物基因组定点碱基替换的载体 |
CN110157726A (zh) * | 2018-02-11 | 2019-08-23 | 中国科学院上海生命科学研究院 | 植物基因组定点替换的方法 |
CN110526993A (zh) * | 2019-03-06 | 2019-12-03 | 山东舜丰生物科技有限公司 | 一种用于基因编辑的核酸构建物 |
CN110527695A (zh) * | 2019-03-07 | 2019-12-03 | 山东舜丰生物科技有限公司 | 一种用于基因定点突变的核酸构建物 |
Non-Patent Citations (2)
Title |
---|
KAI HUA: "Precise A.T to G*C Base Editing in the Rice Genome", MOLECULAR PLANT, vol. 11, no. 4, 21 February 2018 (2018-02-21), pages 627 - 630, XP055655070, ISSN: 1752-9867 * |
ZAFRA MARIA PAZ, SCHATOFF EMMA M, KATTI ALYNA, FORONDA MIGUEL, BREINIG MARCO, SCHWEITZER ANABEL Y, SIMON AMBER, HAN TENG, GOSWAMI : "Optimized Base Editors Enable Efficient Editing in Cells, Organoids and Mice", NAT BIOTECHNOL., vol. 36, no. 9, 3 July 2018 (2018-07-03), pages 888 - 893, XP036929662, ISSN: 1546-1696 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110526993B (zh) | 一种用于基因编辑的核酸构建物 | |
EP3902911B1 (fr) | Polypeptides utiles pour l'édition de gènes et procédés d'utilisation | |
CN110527695B (zh) | 一种用于基因定点突变的核酸构建物 | |
CN110157726B (zh) | 植物基因组定点替换的方法 | |
CN111263810A (zh) | 使用多核苷酸指导的核酸内切酶的细胞器基因组修饰 | |
US20210403901A1 (en) | Targeted mutagenesis using base editors | |
JP2020519310A (ja) | 除草剤耐性遺伝子の創出およびその使用 | |
CN112266420B (zh) | 一种植物高效胞嘧啶单碱基编辑器及其构建与应用 | |
WO2018098935A1 (fr) | Vecteur pour substitution de base dirigée du génome d'une plante | |
JP2023550932A (ja) | アデノシンデアミナーゼ、並びに関連する生体材料及びその使用 | |
JP2023534693A (ja) | ウラシル安定化タンパク質並びにその活性断片及びバリアント並びに使用方法 | |
CN113151229A (zh) | 胞嘧啶脱氨酶及包含该酶的胞嘧啶编辑器 | |
US11814633B2 (en) | Plant terminator for transgene expression | |
CN113994007B (zh) | 一种核酸表达的方法 | |
TW202300649A (zh) | Dna修飾酶及活性片段及其變體及使用方法 | |
US20230332167A1 (en) | Plant promoter for transgene expression | |
WO2020177751A1 (fr) | Construction d'acide nucléique pour édition de gènes | |
EP3052633B1 (fr) | Éléments régulateurs du gène de type métallothionéine de zea mays et leurs utilisations | |
CN114686456A (zh) | 基于双分子脱氨酶互补的碱基编辑系统及其应用 | |
Karcher et al. | Faithful editing of a tomato-specific mRNA editing site in transgenic tobacco chloroplasts | |
WO2021003410A1 (fr) | Modification du génome d'organites | |
CN114196644B (zh) | 一种蛋白棕榈酰化转移酶dhhc16及其在提高水稻耐盐方面的应用 | |
CN116724119A (zh) | 除草剂抗性植物 | |
CN111019969B (zh) | 一种通过优化供体dna模板来提高基因精确替换效率的方法 | |
WO2022188816A1 (fr) | Système d'édition de base cg amélioré |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20765898 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20765898 Country of ref document: EP Kind code of ref document: A1 |