EP3475427A1 - Procédés et compositions destinés à être utilisés dans la modification du génome de plantes - Google Patents
Procédés et compositions destinés à être utilisés dans la modification du génome de plantesInfo
- Publication number
- EP3475427A1 EP3475427A1 EP17821073.8A EP17821073A EP3475427A1 EP 3475427 A1 EP3475427 A1 EP 3475427A1 EP 17821073 A EP17821073 A EP 17821073A EP 3475427 A1 EP3475427 A1 EP 3475427A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- sequence
- interest
- vector
- strand
- cell
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 91
- 239000000203 mixture Substances 0.000 title description 5
- 230000004048 modification Effects 0.000 title description 4
- 238000012986 modification Methods 0.000 title description 4
- 241000589157 Rhizobiales Species 0.000 claims abstract description 125
- 230000001965 increasing effect Effects 0.000 claims abstract description 72
- 230000010354 integration Effects 0.000 claims abstract description 56
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 644
- 210000004027 cell Anatomy 0.000 claims description 503
- 239000013598 vector Substances 0.000 claims description 459
- 241000196324 Embryophyta Species 0.000 claims description 320
- 102000004190 Enzymes Human genes 0.000 claims description 256
- 108090000790 Enzymes Proteins 0.000 claims description 256
- 102000053602 DNA Human genes 0.000 claims description 193
- 108020004414 DNA Proteins 0.000 claims description 193
- 108090000623 proteins and genes Proteins 0.000 claims description 179
- 230000014509 gene expression Effects 0.000 claims description 166
- 102000018120 Recombinases Human genes 0.000 claims description 158
- 108010091086 Recombinases Proteins 0.000 claims description 158
- 230000015572 biosynthetic process Effects 0.000 claims description 150
- 239000004055 small Interfering RNA Substances 0.000 claims description 150
- 238000003786 synthesis reaction Methods 0.000 claims description 150
- 230000035897 transcription Effects 0.000 claims description 128
- 238000013518 transcription Methods 0.000 claims description 128
- 230000000295 complement effect Effects 0.000 claims description 122
- 241000589158 Agrobacterium Species 0.000 claims description 115
- 239000013612 plasmid Substances 0.000 claims description 111
- 101710163270 Nuclease Proteins 0.000 claims description 110
- 230000000692 anti-sense effect Effects 0.000 claims description 98
- 108091030071 RNAI Proteins 0.000 claims description 95
- 230000009368 gene silencing by RNA Effects 0.000 claims description 94
- 102000004169 proteins and genes Human genes 0.000 claims description 85
- 108020004459 Small interfering RNA Proteins 0.000 claims description 78
- 150000007523 nucleic acids Chemical group 0.000 claims description 77
- 238000010459 TALEN Methods 0.000 claims description 72
- 230000000977 initiatory effect Effects 0.000 claims description 70
- 102000004533 Endonucleases Human genes 0.000 claims description 62
- 108010042407 Endonucleases Proteins 0.000 claims description 62
- 230000003247 decreasing effect Effects 0.000 claims description 60
- 230000033616 DNA repair Effects 0.000 claims description 56
- 108700011259 MicroRNAs Proteins 0.000 claims description 56
- 239000002679 microRNA Substances 0.000 claims description 56
- 108010020764 Transposases Proteins 0.000 claims description 53
- 102000008579 Transposases Human genes 0.000 claims description 53
- 239000003550 marker Substances 0.000 claims description 53
- 108091033409 CRISPR Proteins 0.000 claims description 52
- 230000006798 recombination Effects 0.000 claims description 46
- 238000005215 recombination Methods 0.000 claims description 45
- 108020005004 Guide RNA Proteins 0.000 claims description 42
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 41
- 230000008685 targeting Effects 0.000 claims description 39
- 108010088141 Argonaute Proteins Proteins 0.000 claims description 38
- 102000008682 Argonaute Proteins Human genes 0.000 claims description 38
- 101100300807 Drosophila melanogaster spn-A gene Proteins 0.000 claims description 37
- 102000053062 Rad52 DNA Repair and Recombination Human genes 0.000 claims description 37
- 108700031762 Rad52 DNA Repair and Recombination Proteins 0.000 claims description 37
- 102100036976 X-ray repair cross-complementing protein 6 Human genes 0.000 claims description 37
- 101710124907 X-ray repair cross-complementing protein 6 Proteins 0.000 claims description 37
- 108700026220 vif Genes Proteins 0.000 claims description 37
- 101150085703 vir gene Proteins 0.000 claims description 37
- 108091007412 Piwi-interacting RNA Proteins 0.000 claims description 36
- 239000002243 precursor Substances 0.000 claims description 36
- 230000001131 transforming effect Effects 0.000 claims description 35
- 230000037430 deletion Effects 0.000 claims description 34
- 238000012217 deletion Methods 0.000 claims description 34
- 230000006801 homologous recombination Effects 0.000 claims description 34
- 238000002744 homologous recombination Methods 0.000 claims description 34
- 125000006850 spacer group Chemical group 0.000 claims description 34
- 230000037431 insertion Effects 0.000 claims description 32
- 238000003780 insertion Methods 0.000 claims description 32
- 230000000694 effects Effects 0.000 claims description 31
- 230000035772 mutation Effects 0.000 claims description 31
- 240000008042 Zea mays Species 0.000 claims description 30
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 30
- 230000037361 pathway Effects 0.000 claims description 30
- 230000004952 protein activity Effects 0.000 claims description 30
- 230000004568 DNA-binding Effects 0.000 claims description 29
- 108700019146 Transgenes Proteins 0.000 claims description 29
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 28
- 108010017070 Zinc Finger Nucleases Proteins 0.000 claims description 28
- 230000006870 function Effects 0.000 claims description 26
- 210000001161 mammalian embryo Anatomy 0.000 claims description 24
- 238000010362 genome editing Methods 0.000 claims description 23
- -1 Csm2 Proteins 0.000 claims description 22
- 235000010469 Glycine max Nutrition 0.000 claims description 22
- 244000068988 Glycine max Species 0.000 claims description 22
- 239000003623 enhancer Substances 0.000 claims description 22
- 108091008146 restriction endonucleases Proteins 0.000 claims description 22
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 claims description 21
- 235000006008 Brassica napus var napus Nutrition 0.000 claims description 21
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 claims description 21
- 235000004977 Brassica sinapistrum Nutrition 0.000 claims description 21
- 101150018129 CSF2 gene Proteins 0.000 claims description 21
- 101150069031 CSN2 gene Proteins 0.000 claims description 21
- 229920000742 Cotton Polymers 0.000 claims description 21
- 241000970829 Mesorhizobium Species 0.000 claims description 21
- 101100494762 Mus musculus Nedd9 gene Proteins 0.000 claims description 21
- 101100385413 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) csm-3 gene Proteins 0.000 claims description 21
- 241000588843 Ochrobactrum Species 0.000 claims description 21
- 241000589180 Rhizobium Species 0.000 claims description 21
- 240000000111 Saccharum officinarum Species 0.000 claims description 21
- 235000007201 Saccharum officinarum Nutrition 0.000 claims description 21
- 241001135312 Sinorhizobium Species 0.000 claims description 21
- 241000209140 Triticum Species 0.000 claims description 21
- 235000021307 Triticum Nutrition 0.000 claims description 21
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 21
- 210000003763 chloroplast Anatomy 0.000 claims description 21
- 101150055601 cops2 gene Proteins 0.000 claims description 21
- 235000005822 corn Nutrition 0.000 claims description 21
- 241000589173 Bradyrhizobium Species 0.000 claims description 20
- 241001135342 Phyllobacterium Species 0.000 claims description 20
- 239000012212 insulator Substances 0.000 claims description 20
- 230000002438 mitochondrial effect Effects 0.000 claims description 20
- 230000005030 transcription termination Effects 0.000 claims description 20
- 108020005544 Antisense RNA Proteins 0.000 claims description 19
- 239000003184 complementary RNA Substances 0.000 claims description 19
- 230000006780 non-homologous end joining Effects 0.000 claims description 19
- 239000000700 radioactive tracer Substances 0.000 claims description 19
- 102000040650 (ribonucleotides)n+m Human genes 0.000 claims description 18
- 108091032955 Bacterial small RNA Proteins 0.000 claims description 18
- 239000012634 fragment Substances 0.000 claims description 9
- 239000004009 herbicide Substances 0.000 claims description 9
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 claims description 8
- 230000009261 transgenic effect Effects 0.000 claims description 8
- 239000012881 co-culture medium Substances 0.000 claims description 7
- 230000002363 herbicidal effect Effects 0.000 claims description 7
- 108700001094 Plant Genes Proteins 0.000 claims description 6
- 235000016709 nutrition Nutrition 0.000 claims description 5
- 229910052757 nitrogen Inorganic materials 0.000 claims description 4
- 230000000749 insecticidal effect Effects 0.000 claims description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 2
- 240000000385 Brassica napus var. napus Species 0.000 claims 4
- 230000001404 mediated effect Effects 0.000 abstract description 12
- 230000009466 transformation Effects 0.000 abstract description 12
- 229940088598 enzyme Drugs 0.000 description 247
- 229920002477 rna polymer Polymers 0.000 description 109
- 239000002773 nucleotide Substances 0.000 description 49
- 125000003729 nucleotide group Chemical group 0.000 description 49
- 102000012330 Integrases Human genes 0.000 description 48
- 108010061833 Integrases Proteins 0.000 description 48
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 33
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 33
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 32
- 102000039446 nucleic acids Human genes 0.000 description 31
- 108020004707 nucleic acids Proteins 0.000 description 31
- 102000040430 polynucleotide Human genes 0.000 description 18
- 108091033319 polynucleotide Proteins 0.000 description 18
- 239000002157 polynucleotide Substances 0.000 description 18
- 244000188595 Brassica sinapistrum Species 0.000 description 17
- 108010051219 Cre recombinase Proteins 0.000 description 17
- 108010046276 FLP recombinase Proteins 0.000 description 16
- 230000000670 limiting effect Effects 0.000 description 15
- 210000001519 tissue Anatomy 0.000 description 11
- 238000004519 manufacturing process Methods 0.000 description 10
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 9
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 9
- 235000009973 maize Nutrition 0.000 description 9
- 238000012546 transfer Methods 0.000 description 8
- 108010000700 Acetolactate synthase Proteins 0.000 description 7
- 108010073062 Transcription Activator-Like Effectors Proteins 0.000 description 7
- 210000000349 chromosome Anatomy 0.000 description 7
- 230000001939 inductive effect Effects 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- IMXSCCDUAFEIOE-UHFFFAOYSA-N D-Octopin Natural products OC(=O)C(C)NC(C(O)=O)CCCN=C(N)N IMXSCCDUAFEIOE-UHFFFAOYSA-N 0.000 description 6
- LMKYZBGVKHTLTN-NKWVEPMBSA-N D-nopaline Chemical compound NC(=N)NCCC[C@@H](C(O)=O)N[C@@H](C(O)=O)CCC(O)=O LMKYZBGVKHTLTN-NKWVEPMBSA-N 0.000 description 6
- IMXSCCDUAFEIOE-RITPCOANSA-N D-octopine Chemical compound [O-]C(=O)[C@@H](C)[NH2+][C@H](C([O-])=O)CCCNC(N)=[NH2+] IMXSCCDUAFEIOE-RITPCOANSA-N 0.000 description 6
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N Phosphinothricin Natural products CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 6
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 108020004999 messenger RNA Proteins 0.000 description 6
- 108010058731 nopaline synthase Proteins 0.000 description 6
- 239000011701 zinc Substances 0.000 description 6
- 229910052725 zinc Inorganic materials 0.000 description 6
- 108020003589 5' Untranslated Regions Proteins 0.000 description 5
- 241000701489 Cauliflower mosaic virus Species 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 5
- 230000011559 double-strand break repair via nonhomologous end joining Effects 0.000 description 5
- SEOVTRFCIGRIMH-UHFFFAOYSA-N indole-3-acetic acid Chemical compound C1=CC=C2C(CC(=O)O)=CNC2=C1 SEOVTRFCIGRIMH-UHFFFAOYSA-N 0.000 description 5
- 231100001160 nonlethal Toxicity 0.000 description 5
- 229960000268 spectinomycin Drugs 0.000 description 5
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 5
- 108020005345 3' Untranslated Regions Proteins 0.000 description 4
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 4
- NDUPDOJHUQKPAG-UHFFFAOYSA-N Dalapon Chemical compound CC(Cl)(Cl)C(O)=O NDUPDOJHUQKPAG-UHFFFAOYSA-N 0.000 description 4
- 102000053187 Glucuronidase Human genes 0.000 description 4
- 108010060309 Glucuronidase Proteins 0.000 description 4
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 4
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 4
- 241000169176 Natronobacterium gregoryi Species 0.000 description 4
- 241000205156 Pyrococcus furiosus Species 0.000 description 4
- 241000589596 Thermus Species 0.000 description 4
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 4
- 150000001413 amino acids Chemical class 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- IAJOBQBIJHVGMQ-BYPYZUCNSA-N glufosinate-P Chemical compound CP(O)(=O)CC[C@H](N)C(O)=O IAJOBQBIJHVGMQ-BYPYZUCNSA-N 0.000 description 4
- 239000005090 green fluorescent protein Substances 0.000 description 4
- 230000007017 scission Effects 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 4
- 230000014616 translation Effects 0.000 description 4
- 238000011144 upstream manufacturing Methods 0.000 description 4
- UPMXNNIRAGDFEH-UHFFFAOYSA-N 3,5-dibromo-4-hydroxybenzonitrile Chemical compound OC1=C(Br)C=C(C#N)C=C1Br UPMXNNIRAGDFEH-UHFFFAOYSA-N 0.000 description 3
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 3
- 239000005489 Bromoxynil Substances 0.000 description 3
- 241000701484 Figwort mosaic virus Species 0.000 description 3
- 239000005980 Gibberellic acid Substances 0.000 description 3
- 239000005562 Glyphosate Substances 0.000 description 3
- 108060001084 Luciferase Proteins 0.000 description 3
- 239000005089 Luciferase Substances 0.000 description 3
- 240000007594 Oryza sativa Species 0.000 description 3
- 235000007164 Oryza sativa Nutrition 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 230000009418 agronomic effect Effects 0.000 description 3
- 230000027455 binding Effects 0.000 description 3
- 239000003593 chromogenic compound Substances 0.000 description 3
- IXORZMNAPKEEDV-UHFFFAOYSA-N gibberellic acid GA3 Natural products OC(=O)C1C2(C3)CC(=C)C3(O)CCC2C2(C=CC3O)C1C3(C)C(=O)O2 IXORZMNAPKEEDV-UHFFFAOYSA-N 0.000 description 3
- IXORZMNAPKEEDV-OBDJNFEBSA-N gibberellin A3 Chemical compound C([C@@]1(O)C(=C)C[C@@]2(C1)[C@H]1C(O)=O)C[C@H]2[C@]2(C=C[C@@H]3O)[C@H]1[C@]3(C)C(=O)O2 IXORZMNAPKEEDV-OBDJNFEBSA-N 0.000 description 3
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 3
- 229940097068 glyphosate Drugs 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 239000000178 monomer Substances 0.000 description 3
- 230000037039 plant physiology Effects 0.000 description 3
- 102000004196 processed proteins & peptides Human genes 0.000 description 3
- 235000009566 rice Nutrition 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- JLIDBLDQVAYHNE-YKALOCIXSA-N (+)-Abscisic acid Chemical compound OC(=O)/C=C(/C)\C=C\[C@@]1(O)C(C)=CC(=O)CC1(C)C JLIDBLDQVAYHNE-YKALOCIXSA-N 0.000 description 2
- NDUPDOJHUQKPAG-UHFFFAOYSA-M 2,2-Dichloropropanoate Chemical compound CC(Cl)(Cl)C([O-])=O NDUPDOJHUQKPAG-UHFFFAOYSA-M 0.000 description 2
- WNZQDUSMALZDQF-UHFFFAOYSA-N 2-benzofuran-1(3H)-one Chemical compound C1=CC=C2C(=O)OCC2=C1 WNZQDUSMALZDQF-UHFFFAOYSA-N 0.000 description 2
- CAAMSDWKXXPUJR-UHFFFAOYSA-N 3,5-dihydro-4H-imidazol-4-one Chemical compound O=C1CNC=N1 CAAMSDWKXXPUJR-UHFFFAOYSA-N 0.000 description 2
- 108090000104 Actin-related protein 3 Proteins 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 101710132601 Capsid protein Proteins 0.000 description 2
- 241000701459 Caulimovirus Species 0.000 description 2
- 108020004635 Complementary DNA Proteins 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- 239000005504 Dicamba Substances 0.000 description 2
- 108090000204 Dipeptidase 1 Proteins 0.000 description 2
- 108091029865 Exogenous DNA Proteins 0.000 description 2
- 239000005561 Glufosinate Substances 0.000 description 2
- XUMBMVFBXHLACL-UHFFFAOYSA-N Melanin Chemical compound O=C1C(=O)C(C2=CNC3=C(C(C(=O)C4=C32)=O)C)=C2C4=CNC2=C1C XUMBMVFBXHLACL-UHFFFAOYSA-N 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- 229940100389 Sulfonylurea Drugs 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 101710185494 Zinc finger protein Proteins 0.000 description 2
- 102100023597 Zinc finger protein 816 Human genes 0.000 description 2
- 108010050516 adenylate isopentenyltransferase Proteins 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- GINJFDRNADDBIN-FXQIFTODSA-N bilanafos Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCP(C)(O)=O GINJFDRNADDBIN-FXQIFTODSA-N 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 238000003390 bioluminescence detection Methods 0.000 description 2
- 238000010804 cDNA synthesis Methods 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000002939 deleterious effect Effects 0.000 description 2
- IWEDIXLBFLAXBO-UHFFFAOYSA-N dicamba Chemical compound COC1=C(Cl)C=CC(Cl)=C1C(O)=O IWEDIXLBFLAXBO-UHFFFAOYSA-N 0.000 description 2
- 230000005782 double-strand break Effects 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 230000030279 gene silencing Effects 0.000 description 2
- 108010039239 glyphosate N-acetyltransferase Proteins 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 239000003617 indole-3-acetic acid Substances 0.000 description 2
- 238000007852 inverse PCR Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 239000003921 oil Substances 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000001172 regenerating effect Effects 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- YROXIXLRRCOBKF-UHFFFAOYSA-N sulfonylurea Chemical class OC(=N)N=S(=O)=O YROXIXLRRCOBKF-UHFFFAOYSA-N 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- 238000011426 transformation method Methods 0.000 description 2
- 150000005206 1,2-dihydroxybenzenes Chemical class 0.000 description 1
- GOCUAJYOYBLQRH-UHFFFAOYSA-N 2-(4-{[3-chloro-5-(trifluoromethyl)pyridin-2-yl]oxy}phenoxy)propanoic acid Chemical compound C1=CC(OC(C)C(O)=O)=CC=C1OC1=NC=C(C(F)(F)F)C=C1Cl GOCUAJYOYBLQRH-UHFFFAOYSA-N 0.000 description 1
- 108010020183 3-phosphoshikimate 1-carboxyvinyltransferase Proteins 0.000 description 1
- HUNCSWANZMJLPM-UHFFFAOYSA-N 5-methyltryptophan Chemical compound CC1=CC=C2NC=C(CC(N)C(O)=O)C2=C1 HUNCSWANZMJLPM-UHFFFAOYSA-N 0.000 description 1
- 102000000452 Acetyl-CoA carboxylase Human genes 0.000 description 1
- 108010016219 Acetyl-CoA carboxylase Proteins 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 108010000239 Aequorin Proteins 0.000 description 1
- 239000004382 Amylase Substances 0.000 description 1
- 108010037870 Anthranilate Synthase Proteins 0.000 description 1
- 241000238421 Arthropoda Species 0.000 description 1
- 229930192334 Auxin Natural products 0.000 description 1
- 108091079001 CRISPR RNA Proteins 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 101710197658 Capsid protein VP1 Proteins 0.000 description 1
- 108090000489 Carboxy-Lyases Proteins 0.000 description 1
- 229930186147 Cephalosporin Natural products 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- 230000007018 DNA scission Effects 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102100024746 Dihydrofolate reductase Human genes 0.000 description 1
- 102000016680 Dioxygenases Human genes 0.000 description 1
- 108010028143 Dioxygenases Proteins 0.000 description 1
- AHMIDUVKSGCHAU-UHFFFAOYSA-N Dopaquinone Natural products OC(=O)C(N)CC1=CC(=O)C(=O)C=C1 AHMIDUVKSGCHAU-UHFFFAOYSA-N 0.000 description 1
- 101150013191 E gene Proteins 0.000 description 1
- 101150048726 E9 gene Proteins 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- VGGSQFUCUMXWEO-UHFFFAOYSA-N Ethene Chemical compound C=C VGGSQFUCUMXWEO-UHFFFAOYSA-N 0.000 description 1
- 239000005977 Ethylene Substances 0.000 description 1
- 101710129170 Extensin Proteins 0.000 description 1
- 108010007508 Farnesyltranstransferase Proteins 0.000 description 1
- 102000001390 Fructose-Bisphosphate Aldolase Human genes 0.000 description 1
- 108010068561 Fructose-Bisphosphate Aldolase Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 241000237858 Gastropoda Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 102100039291 Geranylgeranyl pyrophosphate synthase Human genes 0.000 description 1
- 108010014458 Gin recombinase Proteins 0.000 description 1
- 101710102069 Glutathione S-transferase 3 Proteins 0.000 description 1
- 108030006517 Glyphosate oxidoreductases Proteins 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 102000002812 Heat-Shock Proteins Human genes 0.000 description 1
- 108010004889 Heat-Shock Proteins Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 101000928189 Homo sapiens Adhesion G protein-coupled receptor L2 Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 241000588744 Klebsiella pneumoniae subsp. ozaenae Species 0.000 description 1
- WTDRDQBEARUVNC-UHFFFAOYSA-N L-Dopa Natural products OC(=O)C(N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-UHFFFAOYSA-N 0.000 description 1
- AHMIDUVKSGCHAU-LURJTMIESA-N L-dopaquinone Chemical compound [O-]C(=O)[C@@H]([NH3+])CC1=CC(=O)C(=O)C=C1 AHMIDUVKSGCHAU-LURJTMIESA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 102100026722 Microsomal glutathione S-transferase 3 Human genes 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 108010033272 Nitrilase Proteins 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 101710089395 Oleosin Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241000588912 Pantoea agglomerans Species 0.000 description 1
- UOZODPSAJZTQNH-UHFFFAOYSA-N Paromomycin II Natural products NC1C(O)C(O)C(CN)OC1OC1C(O)C(OC2C(C(N)CC(N)C2O)OC2C(C(O)C(O)C(CO)O2)N)OC1CO UOZODPSAJZTQNH-UHFFFAOYSA-N 0.000 description 1
- 240000007377 Petunia x hybrida Species 0.000 description 1
- 101001066099 Phaseolus coccineus Gibberellin 2-beta-dioxygenase Proteins 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- 101710173432 Phytoene synthase Proteins 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- 101710193739 Protein RecA Proteins 0.000 description 1
- 101150090155 R gene Proteins 0.000 description 1
- 101710118046 RNA-directed RNA polymerase Proteins 0.000 description 1
- MUPFEKGTMRGPLJ-RMMQSMQOSA-N Raffinose Natural products O(C[C@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O[C@@]2(CO)[C@H](O)[C@@H](O)[C@@H](CO)O2)O1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 MUPFEKGTMRGPLJ-RMMQSMQOSA-N 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 1
- CSPPKDPQLUUTND-NBVRZTHBSA-N Sethoxydim Chemical compound CCO\N=C(/CCC)C1=C(O)CC(CC(C)SCC)CC1=O CSPPKDPQLUUTND-NBVRZTHBSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 101000951943 Stenotrophomonas maltophilia Dicamba O-demethylase, oxygenase component Proteins 0.000 description 1
- 108010043934 Sucrose synthase Proteins 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- 101000693961 Trachemys scripta 68 kDa serum albumin Proteins 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- 108060008724 Tyrosinase Proteins 0.000 description 1
- MUPFEKGTMRGPLJ-UHFFFAOYSA-N UNPD196149 Natural products OC1C(O)C(CO)OC1(CO)OC1C(O)C(O)C(O)C(COC2C(C(O)C(O)C(CO)O2)O)O1 MUPFEKGTMRGPLJ-UHFFFAOYSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- 101710108545 Viral protein 1 Proteins 0.000 description 1
- 241000589634 Xanthomonas Species 0.000 description 1
- 101150044453 Y gene Proteins 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 101150067314 aadA gene Proteins 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 108090000637 alpha-Amylases Proteins 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 229940126575 aminoglycoside Drugs 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 229930002877 anthocyanin Natural products 0.000 description 1
- 235000010208 anthocyanin Nutrition 0.000 description 1
- 239000004410 anthocyanin Substances 0.000 description 1
- 150000004636 anthocyanins Chemical class 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 230000009833 antibody interaction Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- MXWJVTOOROXGIU-UHFFFAOYSA-N atrazine Chemical compound CCNC1=NC(Cl)=NC(NC(C)C)=N1 MXWJVTOOROXGIU-UHFFFAOYSA-N 0.000 description 1
- 239000002363 auxin Substances 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 102000006635 beta-lactamase Human genes 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000000975 bioactive effect Effects 0.000 description 1
- 239000002551 biofuel Substances 0.000 description 1
- 229920001222 biopolymer Polymers 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 229940124587 cephalosporin Drugs 0.000 description 1
- 150000001780 cephalosporins Chemical class 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 229930002868 chlorophyll a Natural products 0.000 description 1
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 description 1
- 229930002869 chlorophyll b Natural products 0.000 description 1
- NSMUHPMZFPKNMZ-VBYMZDBQSA-M chlorophyll b Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C=O)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 NSMUHPMZFPKNMZ-VBYMZDBQSA-M 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 101150081158 crtB gene Proteins 0.000 description 1
- OILAIQUEIWYQPH-UHFFFAOYSA-N cyclohexane-1,2-dione Chemical compound O=C1CCCCC1=O OILAIQUEIWYQPH-UHFFFAOYSA-N 0.000 description 1
- UQHKFADEQIVWID-UHFFFAOYSA-N cytokinin Natural products C1=NC=2C(NCC=C(CO)C)=NC=NC=2N1C1CC(O)C(CO)O1 UQHKFADEQIVWID-UHFFFAOYSA-N 0.000 description 1
- 239000004062 cytokinin Substances 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- FCRACOPGPMPSHN-UHFFFAOYSA-N desoxyabscisic acid Natural products OC(=O)C=C(C)C=CC1C(C)=CC(=O)CC1(C)C FCRACOPGPMPSHN-UHFFFAOYSA-N 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 235000019621 digestibility Nutrition 0.000 description 1
- 108020001096 dihydrofolate reductase Proteins 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- MHUWZNTUIIFHAS-CLFAGFIQSA-N dioleoyl phosphatidic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC(COP(O)(O)=O)OC(=O)CCCCCCC\C=C/CCCCCCCC MHUWZNTUIIFHAS-CLFAGFIQSA-N 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000003028 elevating effect Effects 0.000 description 1
- 230000006353 environmental stress Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 150000004665 fatty acids Chemical class 0.000 description 1
- 238000007380 fibre production Methods 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 235000019634 flavors Nutrition 0.000 description 1
- 230000004345 fruit ripening Effects 0.000 description 1
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Natural products O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 1
- 230000003054 hormonal effect Effects 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 101150063121 iaaM gene Proteins 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 239000003262 industrial enzyme Substances 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 108010087711 leukotriene-C4 synthase Proteins 0.000 description 1
- 229960004502 levodopa Drugs 0.000 description 1
- 101150026430 manA gene Proteins 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 239000003147 molecular marker Substances 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 230000031787 nutrient reservoir activity Effects 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- YCIMNLLNPGFGHC-UHFFFAOYSA-N o-dihydroxy-benzene Natural products OC1=CC=CC=C1O YCIMNLLNPGFGHC-UHFFFAOYSA-N 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 229960001914 paromomycin Drugs 0.000 description 1
- UOZODPSAJZTQNH-LSWIJEOBSA-N paromomycin Chemical compound N[C@@H]1[C@@H](O)[C@H](O)[C@H](CN)O[C@@H]1O[C@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](N)C[C@@H](N)[C@@H]2O)O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)N)O[C@@H]1CO UOZODPSAJZTQNH-LSWIJEOBSA-N 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 108010082527 phosphinothricin N-acetyltransferase Proteins 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 239000000049 pigment Substances 0.000 description 1
- 238000004161 plant tissue culture Methods 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- MUPFEKGTMRGPLJ-ZQSKZDJDSA-N raffinose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO[C@@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)O1 MUPFEKGTMRGPLJ-ZQSKZDJDSA-N 0.000 description 1
- 210000005132 reproductive cell Anatomy 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000003007 single stranded DNA break Effects 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 125000000472 sulfonyl group Chemical group *S(*)(=O)=O 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 231100000167 toxic agent Toxicity 0.000 description 1
- 239000003440 toxic substance Substances 0.000 description 1
- YWBFPKPWMSWWEA-UHFFFAOYSA-O triazolopyrimidine Chemical compound BrC1=CC=CC(C=2N=C3N=CN[N+]3=C(NCC=3C=CN=CC=3)C=2)=C1 YWBFPKPWMSWWEA-UHFFFAOYSA-O 0.000 description 1
- 101150101900 uidA gene Proteins 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 108700026215 vpr Genes Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8202—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation by biological means, e.g. cell mediated or natural vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8202—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation by biological means, e.g. cell mediated or natural vector
- C12N15/8205—Agrobacterium mediated transformation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8213—Targeted insertion of genes into the plant genome by homologous recombination
Definitions
- This disclosure relates to the field of plant transformation.
- Methods of, and compositions for, transfer of T-strands of DNA from Agrobacterium plasmids are provided.
- methods of, and compositions for, using Agrobacterium-mediated transfer of T-strands of DNA to promote templated gene editing and site-directed integration of transgenes are also provided.
- Agrobacterium-mediated transformation utilizes Agrobacterium tumefaciens to transfer single-stranded DNA synthesized from recombinant plasmids to plant cells.
- Transformation of plant cells often requires co-culturing with tumefaciens.
- a DNA sequence of interest is incorporated into specially-constructed DNA plasmids where the DNA sequence of interest is flanked by an Agrobacterium tumor-inducing (Ti) plasmid right border DNA region and a left border DNA region.
- Ti Agrobacterium tumor-inducing
- the Agrobacterium-mediated transformation process initiates when an endonuclease, VirD2, nicks the DNA plasmid at the right border and left border regions to release a single-stranded transfer DNA (also called the T-strand).
- the T-strand transfers transgenes situated between the right and left borders into the targeted plant cells, where the T-strand can integrate into the genome (see, for example, U.S. Pat. No.
- Plant cells that have been transformed via Agrobacterium-mediated transformation can be manipulated to regenerate into a whole, fertile plant.
- Successful Agrobacterium-mediated transformation of a plant cell typically results in a random integration in the plant genome. Such random integrations can have deleterious effects to the plant cell if the transgene inserts into an essential endogenous gene. Similarly, such random integrations can have a deleterious effect on expression of the transgene due to position effects of the chromosome where the transgene inserted.
- An Agrobacterium-mediated transformation method for plant cells that promotes templated gene editing and site-directed integration of transgenes.
- the present disclosure provides a method of transforming a plant cell, comprising contacting the plant cell with a Rhizobiales cell capable of transforming the plant cell, where the Rhizobiales cell comprises at least one vector capable of forming two T- strands that are essentially complementary in at least a portion of the T-strands.
- the at least one vector comprises a first right border DNA sequence (RBI), a second right border DNA sequence (RB2), and at least one sequence of interest
- the RBI is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand
- the RB2 is positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in the anti-sense orientation relative to the sequence of interest in the first T-strand, and the two T-strands resulting from initiation at RB 1 and RB2 are essentially
- the at least one vector comprises a RBI, a RB2, a sequence of interest, a first left border DNA sequence (LB1) and a second left border DNA sequence (LB2), where the vector is configured such that the RBI is paired with the LB1 which are positioned in the vector to initiate (RBI) and terminate (LB 1) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the RB2 is paired with the LB2 which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti- sense orientation relative to the sequence of interest in the first T-strand, and where the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- the vector comprises a first sequence of interest and a second sequence of interest, where the first sequence of interest is essentially identical to the second sequence of interest, where the vector further comprises a RBI and a LBl which are positioned in the vector to initiate (RB I) and terminate (LBl) synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand, and the vector further comprises a RB2 and a LB2 which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the second sequence of interest is in an anti-sense orientation relative to the first sequence of interest in the first T-strand, and where the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the first sequence of interest and the second sequence of interest.
- the Rhizobiales cell comprises at least a first vector and a second vector, where each vector comprises essentially identical sequences of interest, and where the first vector comprises a RBI and a LB l which are positioned in the first vector to initiate (RBI) and terminate (LB 1) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and where the second vector comprises a RB2 and a LB2) which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti- sense orientation relative to the sequence of interest in the first T-strand, and where the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- the vector comprises a first sequence of interest, a second sequence of interest different from the first sequence of interest, at least two RB DNA sequences, and one or more optional LB DNA sequences, wherein the first RB DNA sequence (RBI) and a first LB DNA sequence (LBl) are positioned in the vector to initiate (RBI) and terminate (LBl) synthesis of the first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector configuration further comprises a second RB DNA sequence (RB2) and a second LB DNA sequence (LB2) which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the first sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T-strand, wherein the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the first sequence of interest, and wherein the vector configuration
- the instant disclosure provides a method of transforming a plant cell, comprising contacting the plant cell with two or more Rhizobiales cells capable of transforming the plant cell, where the two or more Rhizobiales cells each contain one of at least two vectors capable of forming two essentially complementary T-strands.
- each vector comprises an essentially identical sequence of interest
- the first vector comprises a first right border DNA sequence (RBI)
- the RBI is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand
- the second vector comprises a second right border DNA sequence (RB2) which is positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in the anti-sense orientation relative to the sequence of interest in the first T-strand, and where the two T- strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- each vector comprises an essentially identical sequence of interest
- the first vector comprises a RBI and a first left border DNA sequence (LB1) which are positioned in the first vector to initiate (RB I) and terminate (LB 1) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand
- the second vector comprises a RB2 and a second left border DNA sequence (LB2) which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T-strand, and where the sequence of interest in the two T-strands resulting from initiation at RB 1 and RB2 are essentially complementary to each other.
- RBI is 5' to LB2.
- RBI is 3' to LB2.
- RB2 is 5' to LB 1.
- RB2 is 3' to
- the instant disclosure provides a method of increasing the rate of site directed integration of a sequence of interest, comprising contacting a plant cell with at least one vector capable of forming two essentially complementary T-strands.
- the at least one vector comprises a first right border DNA sequence (RBI), a second right border DNA sequence (RB2), and at least one sequence of interest, where the RBI is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the RB2 is positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in the anti-sense orientation relative to the sequence of interest in the first T-strand, and where the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- the at least one vector comprises a RBI, a RB2, a sequence of interest, a first left border DNA sequence (LB1), a second left border DNA sequence (LB2) and where the vector is configured such that the RBI is paired with the LB1 which are positioned in the vector to initiate (RBI) and terminate (LB 1) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the RB2 is paired with the LB2 which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti- sense orientation relative to the sequence of interest in the first T-strand, and where the sequence of interest in the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- the vector comprises a first sequence of interest and a second sequence of interest, where the first sequence of interest is essentially identical to the second sequence of interest; where the vector further comprises a RBI and a LB1 which are positioned in the vector to initiate (RBI) and terminate (LB1) synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector further comprises a RB2 and a LB2 which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the second sequence of interest is in an anti-sense orientation relative to the first sequence of interest in the first T-strand, and where the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the first sequence of interest and the second sequence of interest.
- the Rhizobiales cell comprises at least a first and a second vector, where each vector comprises essentially identical sequences of interest, and where the first vector comprises a RBI and a LB1 which are positioned in the first vector to initiate (RBI) and terminate (LB1) synthesis of a first T- strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a RB2 and a LB2 which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and where the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- the vector comprises a first sequence of interest, a second sequence of interest different from the first sequence of interest, at least two RB DNA sequences, and one or more optional LB DNA sequences, wherein the first RB DNA sequence (RBI) and a first LB DNA sequence (LB1) are positioned in the vector to initiate (RBI) and terminate (LB1) synthesis of the first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector configuration further comprises a second RB DNA sequence (RB2) and a second LB DNA sequence (LB2) which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T- strand such that the first sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T-strand, wherein the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the first sequence of interest, and wherein the vector configuration further
- the instant disclosure provides a method of increasing the rate of site directed integration of a sequence of interest, comprising contacting a plant cell with two or more Rhizobiales cells, where the two or more Rhizobiales cells each contain one of at least two vectors capable of forming two essentially complementary T-strands.
- each vector comprises an essentially identical sequence of interest
- the first vector comprises a first right border DNA sequence (RBI)
- the RBI is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand
- the second vector comprises a second right border DNA sequence (RB2) which is positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in the anti-sense orientation relative to the sequence of interest in the first T-strand, and where the two T- strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- each vector comprises an essentially identical sequence of interest
- the first vector comprises a RBI and a first left border (LBl) which are positioned in the first vector to initiate (RB I) and terminate (LBl) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand
- the second vector comprises a RB2 and a second left border (LB2) which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti- sense orientation from the 5' to 3' end of the second T-strand, and where the sequence of interest in the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- RBI is 5' to LB2.
- RBI is 3' to LB2.
- RB2 is 5' to LBl .
- RB2 is 3' to LBl .
- the instant disclosure provides a method of transforming a plant cell, comprising contacting the plant cell with a Rhizobiales cell capable of transforming the plant cell, where the Rhizobiales cell comprises at least one vector comprising a right border DNA sequence (RB) and a left border DNA sequence (LB) and where the vector comprises between the RB and LB: (i) a first sequence of interest in a sense orientation relative to the RB, (ii) a spacer, and (iii) a second sequence of interest in an anti-sense orientation relative to the RB, where the first sequence of interest and second sequence of interest are essentially complementary and after synthesis of the T-strand anneal to form a double-stranded DNA.
- the Rhizobiales cell is selected from an Agrobacterium spp., a
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- the instant disclosure provides a method of increasing the rate of site directed integration of a double-stranded DNA into a genome of a plant cell, comprising contacting the plant cell with at least one vector comprising a right border DNA sequence (RB) and a left border DNA sequence (LB), where the vector comprises between the RB and LB: (i) a first sequence of interest in a sense orientation relative to the RB, (ii) a spacer, and (iii) a second sequence of interest in an anti-sense orientation relative to the RB, where the first sequence of interest and second sequence of interest are essentially complementary and after synthesis of the T-anneal, where the double-stranded DNA is integrated into a genome of a plant cell.
- RB right border DNA sequence
- LB left border DNA sequence
- the instant disclosure provides an Agrobacterium cell comprising at least one vector that is capable of forming two essentially complementary T-strands.
- the instant disclosure provides an Agrobacterium cell comprising at least one vector comprising a right border DNA sequence (RB) and a left border DNA sequence (LB) and where the vector comprises between the RB and LB: (i) a first sequence of interest in a sense orientation relative to the RB, (ii) a spacer, and (iii) a second sequence of interest in an anti-sense orientation relative to the RB, where the first sequence of interest and second sequence of interest are essentially complementary, and after synthesis of the T- strand anneal to form a double-stranded DNA.
- RB right border DNA sequence
- LB left border DNA sequence
- the instant disclosure provides a method of transforming a plant genome, comprising contacting at least one plant cell on a co-culture medium for at least 2 days, with at least one Rhizobiales cell capable of transforming the plant cell, where the Rhizobiales cell comprises at least one vector capable of forming two essentially
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- the instant disclosure provides a method of transforming a plant genome, comprising contacting at least one plant cell on a co-culture medium for at least 2 days, with at least one Rhizobiales cell capable of transforming the plant cell, where the Rhizobiales cell comprises at least one vector comprising a right border (RB) DNA sequence and a left border (LB) DNA sequence and where the vector further comprises between the RB and LB DNA sequences: (i) a first sequence of interest in a sense orientation relative to the RB DNA sequence, and (ii) a spacer, and (iii) a second sequence of interest in an anti- sense orientation relative to the RB DNA sequence, where the first sequence of interest and second sequence of interest are essentially complementary, and after synthesis of the T-strand anneal to form a double-stranded DNA.
- the Rhizobiales cell comprises at least one vector comprising a right border (RB) DNA sequence and a left border (LB) DNA sequence and where the vector further comprises between the RB and LB DNA sequences: (i
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- the instant disclosure provides a method of transforming a plant genome, comprising contacting at least one plant cell on a co-culture medium for at least 3 days, with at least one Rhizobiales cell capable of transforming the plant cell, where the Rhizobiales cell comprises at least one vector capable of forming two essentially complementary T-strands.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobactenum spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- the instant disclosure provides a method of transforming a plant genome, comprising contacting at least one plant cell on a co-culture medium for at least 3 days, with at least one Rhizobiales cell capable of transforming the plant cell, where the
- Rhizobiales cell comprises at least one vector comprising a right border (RB) DNA sequence and a left border (LB) DNA sequence and where the vector further comprises between the RB and LB DNA sequences: (i) a first sequence of interest in a sense orientation relative to the RB DNA sequence, and (ii) a spacer, and (iii) a second sequence of interest in an anti- sense orientation relative to the RB DNA sequence, where the first sequence of interest and second sequence of interest are essentially complementary, and after synthesis of the T-strand anneal to form a double-stranded DNA.
- RB right border
- LB left border
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- a method of providing a sequence of interest to a plant cell comprising contacting the plant cell with a Rhizobiales cell capable of transforming the plant cell, wherein the Rhizobiales cell comprises at least one vector capable of forming two T-strands that are essentially complementary in at least a portion of the T-strands, wherein the at least one vector comprises a first right border DNA sequence (RBI), a second right border DNA sequence (RB2), and at least one sequence of interest, and wherein the RBI is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the RB2 is positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in the anti-sense orientation relative to the sequence of interest in the first T-strand, and wherein the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of
- the RBI and the RB2 are essentially homologous. In some embodiments, the LB1 and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LB1 and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13.
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LB 1 and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LB 1 and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LB 1 and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest comprises a sequence encoding a site-specific enzyme target site 5' to an expression cassette and a sequence encoding a site-specific enzyme target site 3' to an expression cassette.
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites. In some embodiments, the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%), or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence. In some embodiments, the target sequence in the plant genome is a non-genic sequence. In some embodiments, the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a native gene of the plant cell. In some embodiments, the sequence of interest comprises a protein-coding sequence. In some embodiments, the sequence of interest comprises a non- protein-coding RNA.
- the non-protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat- siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- the site-specific enzyme is selected from a group consisting of an endonuclease, a recombinase, and a transposase.
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- Argonaute a Cas9 nuclease
- CasX nuclease a CasX nuclease
- CasY nuclease a CasY nuclease
- Cpfl nuclease a Cpfl nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine recombinase attached to a DNA recognition motif, or a serine recombinase attached to a DNA recognition motif.
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a endonuclease recognition site, and a TALEN site.
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- a method of providing a sequence of interest to a plant cell comprising contacting the plant cell with a Rhizobiales cell capable of transforming the plant cell, wherein the Rhizobiales cell comprises at least one vector capable of forming two T-strands that are essentially complementary in at least a portion of the T-strands, wherein the at least one vector comprises a RBI, a RB2, a sequence of interest, a first left border DNA sequence (LB1) and a second left border DNA sequence (LB2), wherein the vector is configured such that the RBI is paired with the LB1 which are positioned in the vector to initiate (RBI) and terminate (LB1) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the RB2 is paired with the LB2 which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence
- the RBI and the RB2 are essentially homologous. In some embodiments, the LB1 and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LB1 and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13.
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LBl and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LBl and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LBl and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest comprises a sequence encoding a site-specific enzyme target site 5' to an expression cassette and a sequence encoding a site-specific enzyme target site 3' to an expression cassette.
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites. In some embodiments, the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence.
- the target sequence in the plant genome is a non-genic sequence.
- the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or 100% identical to a native gene of the plant cell.
- the sequence of interest comprises a protein-coding sequence.
- the sequence of interest comprises a non-protein-coding RNA.
- the non- protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat-siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- miRNA microRNA
- miRNA precursor a small interfering RNA
- siRNA small interfering RNA
- small RNA 22-26 nt in length
- precursor precursor encoding same
- hc-siRNA a heterochromatic siRNA
- the site-specific enzyme is selected from a group consisting of an
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csfi, and Csf4 nuclease.
- the recombinase is a tyrosine
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a endonuclease recognition site, and a TALEN site.
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant- negative Ku70, or any combination thereof.
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- a method of providing a sequence of interest to a plant cell comprising contacting the plant cell with a Rhizobiales cell capable of transforming the plant cell, wherein the Rhizobiales cell comprises at least one vector capable of forming two T-strands that are essentially complementary in at least a portion of the T-strands, wherein the vector comprises a first sequence of interest and a second sequence of interest, wherein the first sequence of interest is essentially identical to the second sequence of interest; wherein the vector further comprises a RB 1 and a LB 1 which are positioned in the vector to initiate (RB 1) and terminate (LBl) synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector further comprises a RB2 and a LB2 which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the second sequence of interest is in an anti-sense orientation relative to the first sequence of interest in
- the RBI and the RB2 are essentially homologous. In some embodiments, the LBl and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LBl and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an
- the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22.
- at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13.
- at least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LB l and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- At least one of the LB l and LB2 comprise a sequence selected from SEQ ID NOs: 14-20. In some embodiments, at least one of the LBl and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19. In some embodiments, the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest comprises a sequence encoding a site-specific enzyme target site 5' to an expression cassette and a sequence encoding a site-specific enzyme target site 3' to an expression cassette.
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites. In some embodiments, the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence.
- the target sequence in the plant genome is a non-genic sequence.
- the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or 100% identical to a native gene of the plant cell.
- the sequence of interest comprises a protein-coding sequence.
- the sequence of interest comprises a non-protein-coding RNA.
- the non- protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat-siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- miRNA microRNA
- miRNA precursor a small interfering RNA
- siRNA small interfering RNA
- small RNA 22-26 nt in length
- precursor precursor encoding same
- hc-siRNA a heterochromatic siRNA
- the site-specific enzyme is selected from a group consisting of an
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant- negative Ku70, or any combination thereof.
- a method of providing a sequence of interest to a plant cell comprising contacting the plant cell with a Rhizobiales cell capable of transforming the plant cell, wherein the Rhizobiales cell comprises at least one vector capable of forming two T-strands that are essentially complementary in at least a portion of the T-strands, wherein the Rhizobiales cell comprises at least a first vector and a second vector, wherein each vector comprises essentially identical sequences of interest, and wherein the first vector comprises a RBI and a LBl which are positioned in the first vector to initiate (RB I) and terminate (LBl) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and wherein the second vector comprises a RB2 and a LB2) which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first
- the RBI and the RB2 are essentially homologous. In some embodiments, the LBl and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LBl and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13.
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LBl and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LBl and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LBl and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest comprises a sequence encoding a site-specific enzyme target site 5' to an expression cassette and a sequence encoding a site-specific enzyme target site 3' to an expression cassette.
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites. In some embodiments, the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence.
- the target sequence in the plant genome is a non-genic sequence.
- the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or 100% identical to a native gene of the plant cell.
- the sequence of interest comprises a protein-coding sequence.
- the sequence of interest comprises a non-protein-coding RNA.
- the non- protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat-siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- miRNA microRNA
- miRNA precursor a small interfering RNA
- siRNA small interfering RNA
- small RNA 22-26 nt in length
- precursor precursor encoding same
- hc-siRNA a heterochromatic siRNA
- the site-specific enzyme is selected from a group consisting of an
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a endonuclease recognition site, and a TALEN site.
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant- negative Ku70, or any combination thereof.
- a method of providing a sequence of interest to a plant cell comprising contacting the plant cell with a Rhizobiales cell capable of transforming the plant cell, wherein the Rhizobiales cell comprises at least one vector capable of forming two T-strands that are essentially complementary in at least a portion of the T-strands, wherein the at least one vector comprises a first sequence of interest, a second sequence of interest different from the first sequence of interest, at least two RB DNA sequences, and one or more optional LB DNA sequences, wherein the first RB DNA sequence (RBI) and a first LB DNA sequence (LBl) are positioned in the vector to initiate (RBI) and terminate (LBl) synthesis of the first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector configuration further comprises a second RB DNA sequence (RB2) and a second LB DNA sequence (LB2) which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of
- the RBI and the RB2 are essentially homologous. In some embodiments, the LBl and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LBl and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13.
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LBl and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LBl and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LBl and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest comprises a sequence encoding a site-specific enzyme target site 5' to an expression cassette and a sequence encoding a site-specific enzyme target site 3' to an expression cassette.
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites. In some embodiments, the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence.
- the target sequence in the plant genome is a non-genic sequence.
- the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a native gene of the plant cell.
- the sequence of interest comprises a protein-coding sequence.
- the sequence of interest comprises a non-protein-coding RNA.
- the non- protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA
- RNA hairpin double strand RNA
- ta-siRNA trans-acting siRNA
- nat-siRNA naturally occurring antisense siRNA
- tcRNA tracer RNA
- gRNA guide RNA
- sgRNA single-guide RNA
- the site-specific enzyme is selected from a group consisting of an
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a endonuclease recognition site, and a TALEN site.
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end j oining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant- negative Ku70, or any combination thereof.
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- Several embodiments relate to a method of transforming a plant cell, comprising contacting the plant cell with two or more Rhizobiales cells capable of transforming the plant cell, wherein the two or more Rhizobiales cells each contain one of at least two vectors capable of forming two essentially complementary T-strands, wherein each vector comprises an essentially identical sequence of interest, and where the first vector comprises a first right border DNA sequence (RBI), and wherein the RB I is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a second right border DNA sequence (RB2) which is positioned in the vector to initiate synthesis of a second T- strand such that the sequence of interest is in the anti-sense orientation relative to the sequence of interest in the first T-strand, and wherein the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- the RBI and the RB2 are essentially homologous. In some embodiments, the LBl and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LBl and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13.
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LBl and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LBl and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LBl and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest comprises a sequence encoding a site-specific enzyme target site 5' to an expression cassette and a sequence encoding a site-specific enzyme target site 3' to an expression cassette.
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites. In some embodiments, the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence.
- the target sequence in the plant genome is a non-genic sequence.
- the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a native gene of the plant cell.
- the sequence of interest comprises a protein-coding sequence.
- the sequence of interest comprises a non-protein-coding RNA.
- the non- protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat-siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- miRNA microRNA
- miRNA precursor a small interfering RNA
- siRNA small interfering RNA
- small RNA 22-26 nt in length
- precursor precursor encoding same
- hc-siRNA a heterochromatic siRNA
- the site-specific enzyme is selected from a group consisting of an
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a endonuclease recognition site, and a TALEN site.
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant- negative Ku70, or any combination thereof.
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- Several embodiments relate to a method of transforming a plant cell, comprising contacting the plant cell with two or more Rhizobiales cells capable of transforming the plant cell, wherein the two or more Rhizobiales cells each contain one of at least two vectors capable of forming two essentially complementary T-strands, wherein each vector comprises an essentially identical sequence of interest, and where the first vector comprises a RBI and a first left border DNA sequence (LBl) which are positioned in the first vector to initiate (RB I) and terminate (LB 1) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a RB2 and a second left border DNA sequence (LB2) which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T-strand, and wherein the sequence of interest in the two T-
- the RBI and the RB2 are essentially homologous. In some embodiments, the LBl and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LBl and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13. In some
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LB l and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LBl and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LBl and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%), or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence. In some embodiments, the target sequence in the plant genome is a non-genic sequence. In some embodiments, the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least
- the sequence of interest comprises a protein-coding sequence. In some embodiments, the sequence of interest comprises a non- protein-coding RNA.
- the non-protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat- siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- the site-specific enzyme is selected from a group consisting of an endonuclease, a recombinase, and a transposase.
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- Argonaute a Cas9 nuclease
- CasX nuclease a CasX nuclease
- CasY nuclease a CasY nuclease
- Cpfl nuclease a Cpfl nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine recombinase attached to a DNA recognition motif, or a serine recombinase attached to a DNA recognition motif.
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a endonuclease recognition site, and a TALEN site.
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- a method of increasing the rate of site directed integration of a sequence of interest comprising contacting a plant cell with at least one vector capable of forming two essentially complementary T-strands, wherein the at least one vector comprises a first right border DNA sequence (RBI), a second right border DNA sequence (RB2), and at least one sequence of interest, wherein the RBI is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the RB2 is positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in the anti-sense orientation relative to the sequence of interest in the first T-strand, and wherein the two T- strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- the at least one vector comprises a first right border DNA sequence (RBI), a second right border DNA sequence (RB2), and at least one sequence of interest
- the RBI and the RB2 are essentially homologous. In some embodiments, the LBl and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LBl and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13. In some
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LB 1 and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LB1 and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LB1 and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%), or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence. In some embodiments, the target sequence in the plant genome is a non-genic sequence. In some embodiments, the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a native gene of the plant cell. In some embodiments, the sequence of interest comprises a protein-coding sequence. In some embodiments, the sequence of interest comprises a non- protein-coding RNA.
- the non-protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat- siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- the site-specific enzyme is selected from a group consisting of an endonuclease, a recombinase, and a transposase.
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- Argonaute a Cas9 nuclease
- CasX nuclease a CasX nuclease
- CasY nuclease a CasY nuclease
- Cpfl nuclease a Cpfl nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine recombinase attached to a DNA recognition motif, or a serine recombinase attached to a DNA recognition motif.
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a endonuclease recognition site, and a TALEN site.
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- a method of increasing the rate of site directed integration of a sequence of interest comprising contacting a plant cell with at least one vector capable of forming two essentially complementary T-strands, wherein the at least one vector comprises a RBI, a RB2, a sequence of interest, a first left border DNA sequence (LBl), a second left border DNA sequence (LB2) and wherein the vector is configured such that the RBI is paired with the LBl which are positioned in the vector to initiate (RBI) and terminate (LBl) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the RB2 is paired with the LB2 which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and wherein the sequence of interest in the two T- strands resulting from
- the RBI and the RB2 are essentially homologous. In some embodiments, the LBl and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LBl and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13. In some
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LB l and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LBl and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LBl and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%), or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence. In some embodiments, the target sequence in the plant genome is a non-genic sequence. In some embodiments, the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a native gene of the plant cell. In some embodiments, the sequence of interest comprises a protein-coding sequence. In some embodiments, the sequence of interest comprises a non- protein-coding RNA.
- the non-protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat- siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- the site-specific enzyme is selected from a group consisting of an endonuclease, a recombinase, and a transposase.
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- Argonaute a Cas9 nuclease
- CasX nuclease a CasX nuclease
- CasY nuclease a CasY nuclease
- Cpfl nuclease a Cpfl nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a endonuclease recognition site, and a TALEN site.
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- Several embodiments relate to a method of increasing the rate of site directed integration of a sequence of interest, comprising contacting a plant cell with at least one vector capable of forming two essentially complementary T-strands, wherein the vector comprises a first sequence of interest and a second sequence of interest, wherein the first sequence of interest is essentially identical to the second sequence of interest; wherein the vector further comprises a RBI and a LBl which are positioned in the vector to initiate (RBI) and terminate (LB l) synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector further comprises a RB2 and aLB2 which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the second sequence of interest is in an anti- sense orientation relative to the first sequence of interest in the first T-strand, and wherein the two T-strands resulting from initiation at RBI and RB2 are essentially complementary
- the RBI and the RB2 are essentially homologous. In some embodiments, the LBl and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LBl and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13.
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LB l and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LB l and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LBl and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest comprises a sequence encoding a site-specific enzyme target site 5' to an expression cassette and a sequence encoding a site-specific enzyme target site 3' to an expression cassette.
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites. In some embodiments, the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence.
- the target sequence in the plant genome is a non-genic sequence.
- the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or 100% identical to a native gene of the plant cell.
- the sequence of interest comprises a protein-coding sequence.
- the sequence of interest comprises a non-protein-coding RNA.
- the non- protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat-siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- miRNA microRNA
- miRNA precursor a small interfering RNA
- siRNA small interfering RNA
- small RNA 22-26 nt in length
- precursor precursor encoding same
- hc-siRNA a heterochromatic siRNA
- the site-specific enzyme is selected from a group consisting of an
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csfi, and Csf4 nuclease.
- the recombinase is a tyrosine
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- a method of increasing the rate of site directed integration of a sequence of interest comprising contacting a plant cell with at least one vector capable of forming two essentially complementary T-strands, wherein the Rhizobiales cell comprises at least a first and a second vector, wherein each vector comprises essentially identical sequences of interest, and wherein the first vector comprises a RB 1 and a LB 1 which are positioned in the first vector to initiate (RB I) and terminate (LB 1) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a RB2 and a LB2 which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T- strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and wherein the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in
- the RBI and the RB2 are essentially homologous. In some embodiments, the LB1 and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LB1 and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13.
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LB 1 and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LB 1 and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LB1 and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest comprises a sequence encoding a site-specific enzyme target site 5' to an expression cassette and a sequence encoding a site-specific enzyme target site 3' to an expression cassette.
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites. In some embodiments, the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence.
- the target sequence in the plant genome is a non-genic sequence.
- the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or 100% identical to a native gene of the plant cell.
- the sequence of interest comprises a protein-coding sequence.
- the sequence of interest comprises a non-protein-coding RNA.
- the non- protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat-siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- miRNA microRNA
- miRNA precursor a small interfering RNA
- siRNA small interfering RNA
- small RNA 22-26 nt in length
- precursor precursor encoding same
- hc-siRNA a heterochromatic siRNA
- the site-specific enzyme is selected from a group consisting of an
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a endonuclease recognition site, and a TALEN site.
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant- negative Ku70, or any combination thereof.
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- a method of increasing the rate of site directed integration of a sequence of interest comprising contacting a plant cell with at least one vector capable of forming two essentially complementary T-strands, wherein the at least one vector comprises a first sequence of interest, a second sequence of interest different from the first sequence of interest, at least two RB DNA sequences, and one or more optional LB DNA sequences, wherein the first RB DNA sequence (RBI) and a first LB DNA sequence (LBl) are positioned in the vector to initiate (RBI) and terminate (LBl) synthesis of the first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector configuration further comprises a second RB DNA sequence (RB2) and a second LB DNA sequence (LB2) which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the first sequence of interest is in an anti-sense orientation from the 5' to 3'
- the RBI and the RB2 are essentially homologous. In some embodiments, the LBl and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LBl and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13.
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LBl and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LBl and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LBl and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest comprises a sequence encoding a site-specific enzyme target site 5' to an expression cassette and a sequence encoding a site-specific enzyme target site 3' to an expression cassette.
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites. In some embodiments, the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence.
- the target sequence in the plant genome is a non-genic sequence.
- the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or 100% identical to a native gene of the plant cell.
- the sequence of interest comprises a protein-coding sequence.
- the sequence of interest comprises a non-protein-coding RNA.
- the non- protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat-siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- miRNA microRNA
- miRNA precursor a small interfering RNA
- siRNA small interfering RNA
- small RNA 22-26 nt in length
- precursor precursor encoding same
- hc-siRNA a heterochromatic siRNA
- the site-specific enzyme is selected from a group consisting of an
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- a method of transforming a plant cell comprising contacting the plant cell with a Rhizobiales cell capable of transforming the plant cell, wherein the Rhizobiales cell comprises at least one vector comprising a right border DNA sequence (RB) and a left border DNA sequence (LB) and wherein the vector comprises between the RB and LB: (i) a first sequence of interest in a sense orientation relative to the RB, (ii) a spacer, and (iii) a second sequence of interest in an anti-sense orientation relative to the RB, wherein the first sequence of interest and second sequence of interest are essentially complementary and after synthesis of the T-strand anneal to form a double-stranded DNA.
- RB right border DNA sequence
- LB left border DNA sequence
- the first sequence of interest further comprises a first left homology arm DNA sequence and a first right homology arm DNA sequence
- the second sequence of interest further comprises a second left homology arm DNA sequence and a second right homology arm DNA sequence.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- the RBI and the RB2 are essentially
- the LBl and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LBl and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13.
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LB l and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LBl and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LBl and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest comprises a sequence encoding a site-specific enzyme target site 5' to an expression cassette and a sequence encoding a site-specific enzyme target site 3' to an expression cassette.
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites. In some embodiments, the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence.
- the target sequence in the plant genome is a non-genic sequence.
- the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or 100% identical to a native gene of the plant cell.
- the sequence of interest comprises a protein-coding sequence.
- the sequence of interest comprises a non-protein-coding RNA.
- the non- protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat-siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- miRNA microRNA
- miRNA precursor a small interfering RNA
- siRNA small interfering RNA
- small RNA 22-26 nt in length
- precursor precursor encoding same
- hc-siRNA a heterochromatic siRNA
- the site-specific enzyme is selected from a group consisting of an
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a endonuclease recognition site, and a TALEN site.
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant- negative Ku70, or any combination thereof.
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- RB right border DNA sequence
- LB left border DNA sequence
- the vector comprises between the RB and LB: (i) a first sequence of interest in a sense orientation relative to the RB, (ii) a spacer, and (iii) a second sequence of interest in an anti-sense orientation relative to the RB, wherein the first sequence of interest and second sequence of interest are essentially complementary and after synthesis of the T-strand anneal to form a double-stranded DNA.
- the first sequence of interest further comprises a first left homology arm DNA sequence and a first right homology arm DNA sequence
- the second sequence of interest further comprises a second left homology arm DNA sequence and a second right homology arm
- the RBI and the RB2 are essentially homologous. In some embodiments, the LBl and the LB2 are essentially homologous. In some embodiments, the RBI and the RB2 are not essentially homologous. In some embodiments, the LBl and LB2 are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22. In some embodiments, at least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13.
- At least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- at least one of the LBl and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- at least one of the LBl and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- at least one of the LBl and LB2 comprise a sequence at least 80% identical to SEQ ID NO: 19.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises one or more sequences selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, a sequence encoding a cell factor that functions to increase DNA repair
- the sequence of interest comprises a sequence encoding a site-specific enzyme target site 5' to an expression cassette and a sequence encoding a site-specific enzyme target site 3' to an expression cassette.
- the sequence of interest does not comprise a homology arm DNA sequence.
- the sequence of interest comprises at least one homology arm DNA sequence.
- the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, and ii) a second sequence that is not positioned between the left homology arm DNA sequence and the right homology arm DNA sequence.
- the sequence of interest comprises a sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence, where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites. In some embodiments, the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence that is not positioned between sequences encoding site-specific enzyme target sites.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology DNA arm sequence where the left and right homology arm sequences are positioned between two sequences encoding site-specific enzyme target sites, and ii) a second sequence encoding the site-specific enzyme, where the second sequence is not positioned between sequences encoding site-specific enzyme target sites.
- at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to a target sequence in the plant genome.
- the target sequence in the plant genome is a genie sequence.
- the target sequence in the plant genome is a non-genic sequence.
- the sequence of interest flanked by homology arms comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or 100% identical to a native gene of the plant cell.
- the sequence of interest comprises a protein-coding sequence.
- the sequence of interest comprises a non-protein-coding RNA.
- the non- protein-coding RNA is selected from the group consisting of: a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat-siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- miRNA microRNA
- miRNA precursor a small interfering RNA
- siRNA small interfering RNA
- small RNA 22-26 nt in length
- precursor precursor encoding same
- hc-siRNA a heterochromatic siRNA
- the site-specific enzyme is selected from a group consisting of an
- the endonuclease is selected from a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- TALEN transcription activator-like effector nuclease
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest further comprises at least one site-specific enzyme target site.
- the at least one site-specific enzyme target site is selected from a Cre/lox recombination site, a Flp/FRT recombination site, a
- the sequence of interest comprises a sequence encoding a protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via homologous recombination, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- At least part of the sequence of interest is integrated into the plant genome via non-homologous end joining, wherein the integration of at least part of the sequence of interest results in a point mutation, an insertion, a deletion, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus, altered protein activity, altered RNAi products, altered RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- the plant cell already comprises a site-specific enzyme.
- the plant cell is selected from the group consisting of a corn cell, a soybean cell, a canola cell, a cotton cell, a wheat cell, or a sugarcane cell.
- a nucleotide sequence encoding the site-specific enzyme is stably transformed into the plant cell.
- the Agrobacterium cell comprises at least one vector that is capable of forming two essentially complementary T-strands.
- the Agrobacterium cell comprises at least one vector comprises a first right border DNA sequence (RBI), a second right border DNA sequence (RB2), and at least one sequence of interest, and wherein the RBI is positioned in the vector to initiate synthesis of a first T- strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the RB2 is positioned in the vector to initiate synthesis of a second T- strand such that the sequence of interest is in the anti-sense orientation relative to the sequence of interest in the first T-strand, and wherein the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- RBI right border DNA sequence
- RB2 second right border DNA sequence
- the Agrobacterium cell comprises at least one vector comprises a first right border DNA sequence (RBI), a second right border DNA sequence (RB2), a sequence of interest, a first left border DNA sequence (LBl) and a second left border DNA sequence (LB2), wherein the vector is configured such that the RBI is paired with the LBl which are positioned in the vector to initiate (RBI) and terminate (LB l) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the RB2 is paired with the LB2 which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and wherein the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- RBI right border DNA sequence
- RB2 second right border
- the Agrobacterium cell comprises at least one vector comprises a first sequence of interest and a second sequence of interest, where the first sequence of interest is essentially identical to the second sequence of interest; wherein the vector further comprises a first right border DNA sequence (RBI) and a first left border DNA sequence (LBl) which are positioned in the vector to initiate (RBI) and terminate (LBl) synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector further comprises a second right border DNA sequence (RB2) and a second left border DNA sequence (LB2) which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the second sequence of interest is in an anti-sense orientation relative to the first sequence of interest in the first T- strand, and wherein the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the first sequence of interest and the second sequence of
- the Agrobacterium cell comprises at least a first vector and a second vector wherein each vector comprises essentially identical sequences of interest, and wherein the first vector comprises a first right border DNA sequence (RBI) and a first left border DNA sequence (LB 1) which are positioned in the first vector to initiate (RB 1) and terminate (LB 1) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and wherein the second vector comprises a second right border DNA sequence (RB2) and a second left border DNA sequence (LB2) which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and wherein the two T- strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- the first vector comprises a first right border DNA sequence (RBI) and a first left
- the Agrobacterium cell comprises a RBI and RB2 that are essentially homologous. In some embodiments, the Agrobacterium cell comprises a LB 1 and LB2that are essentially homologous. In some embodiments, the Agrobacterium cell comprises RBI and RB2 sequences that are not essentially homologous. In some embodiments, the Agrobacterium cell of LB 1 and LB2 sequences that are not essentially homologous. In some embodiments, at least one of the RBI and RB2 comprise an Agrobacterium Ti plasmid right border consensus DNA sequence. In some embodiments, the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22.
- At least one of the RBI and RB2 comprise a sequence selected from SEQ ID NOs: 1-13. In some embodiments, at least one of the RBI and RB2 comprise a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12. In some embodiments, at least one of the LB 1 and LB2 comprise an Agrobacterium Ti plasmid left border consensus DNA sequence. In some embodiments, the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24 In some embodiments, at least one of the LB 1 and LB2 comprise a sequence selected from SEQ ID NOs: 14-20.
- the sequence of interest comprises one or more expression cassettes.
- the sequence of interest comprises at least one sequence selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding
- sequence of interest does not comprise a homology arm DNA sequence. In some embodiments, the sequence of interest further comprises at least one homology arm DNA sequence. In some embodiments, the sequence of interest comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- the sequence of interest comprises i) a first sequence positioned between the left homology arm DNA sequence and the right homology arm DNA sequence, and ii) a second sequence that is not positioned between the region comprising the left homology arm DNA sequence and the right homology arm DNA sequence
- the at least one homology arm DNA sequence comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to a target sequence in the plant genome.
- the sequence of interest comprises a protein-coding sequence.
- the sequence of interest comprises a non-protein-coding RNA
- the non-protein-coding RNA is selected from a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi- interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat-siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA), or any combination thereof.
- the expression cassette comprises a nucleic acid sequence at least 80%) identical to a native plant gene. In some embodiments, the expression cassette comprises a nucleic acid sequence that is not homologous to a native plant sequence. In some embodiments, at least one expression cassette comprises at a sequence selected from: an insecticidal resistance gene, herbicide tolerance gene, nitrogen use efficiency gene, a water use efficiency gene, a nutritional quality gene, a DNA binding gene, a selectable marker gene, an RNAi construct, a site specific nuclease gene, a guide RNA, and any combination thereof.
- the site-specific enzyme is selected from an endonuclease, a recombinase, a transposase, and any combination thereof.
- the endonuclease is selected from the group consisting of: a meganuclease, a zinc finger nuclease, a transcription activator-like effector nuclease (TALEN), an Argonaute, a Cas9 nuclease, a CasX nuclease, a CasY nuclease, and a Cpfl nuclease.
- the Cas9 nuclease is selected from the group comprising Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, and Csf4 nuclease.
- the recombinase is a tyrosine recombinase attached to a DNA recognition motif, or a serine recombinase attached to a DNA recognition motif.
- the tyrosine recombinase attached to a DNA recognition motif is selected from the group consisting of a Cre recombinase, a Flp recombinase, and a Tnpl recombinase.
- the serine recombinase attached to a DNA recognition motif is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- the transposase is a DNA transposase attached to a DNA binding domain.
- the sequence of interest comprises at least one site-specific enzyme target site.
- at least one site-specific enzyme target site is selected from the group consisting of: a Cre/lox recombination site, a Flp/FRT recombination site, a endonuclease recognition site, and a
- the sequence of interest comprises a sequence encoding at least one protein involved in DNA repair, wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- an Agrobacterium cell comprising at least one vector comprising a right border DNA sequence (RB) and a left border DNA sequence (LB) and wherein the vector comprises between the RB and LB: (i) a first sequence of interest in a sense orientation relative to the RB, (ii) a spacer, and (iii) a second sequence of interest in an anti-sense orientation relative to the RB, wherein the first sequence of interest and second sequence of interest are essentially complementary, and after synthesis of the T-strand anneal to form a double-stranded DNA.
- RB right border DNA sequence
- LB left border DNA sequence
- the Agrobacterium cell comprises a first sequence of interest comprising a left homology arm DNA sequence and a right homology arm DNA sequence and a second sequence of interest comprising a left homology arm sequence DNA sequence and a right homology arm DNA sequence.
- the RB comprises an Agrobacterium Ti plasmid right border consensus DNA sequence.
- the right border consensus DNA sequence is selected from SEQ ID NO: 21 or SEQ ID NO: 22.
- the RB comprises a sequence selected from SEQ ID NOs: 1-13.
- the RB comprises a sequence at least 80% identical to a sequence selected from SEQ ID NOs:4 and SEQ ID NO: 12.
- the LB comprises an Agrobacterium Ti plasmid left border consensus DNA sequence.
- the left border consensus DNA sequence is selected from SEQ ID NO: 23 or SEQ ID NO: 24.
- the LB comprises a sequence selected from SEQ ID NOs: 14-20.
- the LB comprises a sequence at least 80%) identical to SEQ ID NO: 19.
- a sequence of interest comprises one or more expression cassettes.
- a sequence of interest comprises at least one sequence selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a sequence encoding a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA construct, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site, a landing pad, an editing template, an expression cassette, a stack of two or more expression cassettes encoding transgenes, a sequence encoding a site-specific enzyme, a sequence encoding a site-specific enzyme target site, a sequence encoding a selection marker, gene expression cassette comprising a sequence encoding a cell factor that
- Several embodiments relate to a method of transforming a plant genome, comprising contacting at least one plant cell on a co-culture medium for at least 24-48 hours, for at least 24-30 hours, for at least 30-36 hours, for at least 36-42 hours, for at least 42-48 hours, for at least 48-54 hours, for at least 54-60 hours, for at least 60-66 hours, for at least 66-72 hours, for at least 72-78 hours, for at least 78-84 hours, for at least 84-90 hours, for at least 90-96 hours, for at least 96-102 hours, for at least 102-108 hours, for at least 108-114 hours, for at least 114-120 hours, for at least 120-126 hours, or for at least 126-132 hours, with at least one Rhizobiales cell capable of transforming the plant cell,
- Rhizobiales cell comprises at least one vector capable of forming two essentially
- the plant cell is a corn immature embryo cell, a corn mature embryo cell, a corn seed cell, a soybean immature embryo cell, a soybean mature embryo cell, a soybean seed cell, a canola immature embryo cell, a canola mature embryo cell, a canola seed cell, a cotton immature embryo cell, a cotton mature embryo cell, a cotton seed cell, a wheat immature embryo cell, a wheat mature embryo cell, a wheat seed cell, a sugarcane immature embryo cell, a sugarcane mature embryo cell, or a sugarcane seed cell.
- the Rhizobiales cell comprises at least one vector comprises a first right border (RBI) DNA sequence, a second right border DNA sequence (RB2), and at least one sequence of interest
- the RBI is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand
- the RB2 is positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in the anti-sense orientation relative to the sequence of interest in the first T-strand
- the sequence of interest in the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- the Rhizobiales cell comprises at least one vector disclosed herein comprises a RBI, a RB2 , and a sequence of interest, and further comprises a first left border DNA sequence (LB 1) and a second left border DNA sequence (LB2), and wherein the vector is configured such that the RBI is paired with the LBl which are positioned in the vector to initiate (RBI) and terminate (LBl) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the RB2 is paired with the LB2 which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and wherein the sequence of interest in the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- the vector is configured such that the RBI is paired with the
- the Rhizobiales cell comprises a vector comprising a first sequence of interest and a second sequence of interest, where the first sequence of interest is essentially identical to the second sequence of interest; and the vector configuration further comprises a RB I with aLB l which are positioned in the vector to initiate (RBI) and terminate (LB l) synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector further comprises a RB2 and a LB2 which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the second sequence of interest is in an anti-sense orientation relative to the first sequence of interest in the first T-strand, and wherein the sequence of interest in the two T-strands resulting from initiation at RB 1 and RB2 are essentially complementary in at least a portion of the first sequence of interest and the second sequence of interest.
- the Rhizobiales cell comprises at least a first vector and a second vector, wherein each vector comprises essentially identical sequences of interest, and wherein the first vector configuration comprises a RBI and a LB1 which are positioned in the first vector to initiate (RB I) and terminate (LB 1) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and wherein the second vector configuration comprises a RB2 and a LB2 which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and wherein the sequence of interest in the two T-strands resulting from initiation at RB I and RB2 are essentially complementary in at least a portion of the sequence of interest.
- the first vector configuration comprises a RBI and a LB1 which are positioned in the first vector to initiate (RB I) and terminate (LB 1)
- the Rhizobiales cell comprises at least one vector comprises a first sequence of interest, a second sequence of interest different from the first sequence of interest, at least two RB DNA sequences, and one or more optional LB DNA sequences, wherein the first RB DNA sequence (RBI) and a first LB DNA sequence (LB 1) are positioned in the vector to initiate (RB 1) and terminate (LB 1) synthesis of the first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector configuration further comprises a second RB DNA sequence (RB2) and a second LB DNA sequence (LB2) which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the first sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T-strand, wherein the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the first sequence of interest, and
- the method comprises contacting the plant cell with a first Rhizobiales cell and a second Rhizobiales cell, wherein each Rhizobiales cell contains at least one of two vectors, wherein each vector comprises an essentially identical sequence of interest, and where the first vector comprises a RBI, and wherein the RBI is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a RB2 which is positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in the anti-sense orientation relative to the sequence of interest in the first T-strand, and wherein the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- the method comprises contacting the plant cell with a first Rhizobiales cell and a second Rhizobiales cell, wherein each Rhizobiales cell contains at least one of two vectors, wherein each vector comprises an essentially identical sequence of interest, and where the first vector comprises a RBI and a LB1 which are positioned in the first vector to initiate (RB 1) and terminate (LB 1) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a RB2 and a LB2 which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti- sense orientation from the 5' to 3' end of the second T-strand, and wherein the sequence of interest in the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Agrobacterium spp. cell is selected from the group consisting of an Agrobacterium
- the turn efaci ens cell and an Agrobacterium rhizogenes cell in some embodiments, the
- Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- the method results in at least a fragment of the sequence of interest being integrated into the plant genome by homologous recombination.
- the method results in at least a fragment of the sequence of interest being integrated into the plant genome by non-homologous end joining.
- the method further comprises detecting the integration of at least a fragment of the sequence of interest of the vector in the at least one plant cell.
- the method further comprises selecting the plant cell based on the presence of the at least a fragment of the sequence of interest integrated into the plant genome.
- the method further comprises regenerating a transgenic plant from the selected plant cell.
- Several embodiments relate to a method of transforming a plant genome, comprising contacting at least one plant cell on a co-culture medium for at least 24-48 hours, for at least 24-30 hours, for at least 30-36 hours, for at least 36-42 hours, for at least 42-48 hours, for at least 48-54 hours, for at least 54-60 hours, for at least 60-66 hours, for at least 66-72 hours, for at least 72-78 hours, for at least 78-84 hours, for at least 84-90 hours, for at least 90-96 hours, for at least 96-102 hours, for at least 102-108 hours, for at least 108-114 hours, for at least 114-120 hours, for at least 120-126 hours, or for at least 126-132 hours, with at least one Rhizobiales cell capable of transforming the plant cell, wherein the Rhizobiales cell comprises at least one vector comprising a right border (RB) DNA sequence and a left border (LB) DNA sequence and where the vector further comprises between the RB
- the first sequence of interest further comprises a left homology arm DNA sequence and a right homology arm DNA sequence in and the second sequence of interest further comprises a left homology arm DNA sequence and a right homology arm DNA sequence.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- the Agrobacterium spp. cell is selected from the group consisting of an Agrobacterium tumefaciens cell and an Agrobacterium rhizogenes cell. In some
- the Rhizobiales cell further contains a vector comprising at least one expression cassette, wherein the expression cassettes comprise a sequence encoding a protein involved in DNA repair, and wherein the protein is selected from the group comprising a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- the method results in at least a fragment of the sequence of interest being integrated into the plant genome by homologous recombination. In some embodiments, the method results in at least a fragment of the sequence of interest being integrated into the plant genome by non-homologous end joining.
- the method further comprises detecting the integration of at least a fragment of the sequence of interest of the vector in the at least one plant cell. In some embodiments, the method further comprises selecting the plant cell based on the presence of the at least a fragment of the sequence of interest integrated into the plant genome. In some embodiments, the method further comprises regenerating a transgenic plant from the selected plant cell.
- Figures 1A and IB illustrate two control vector configurations in which a sequence of interest is flanked by a right border (RB) DNA sequence on the 5' end, and a left border (LB) DNA sequence on the 3' end (Panel 1A); or alternatively, one RB DNA sequence on the 5' end (Panel IB).
- RB right border
- LB left border
- Figure 1C illustrates a vector configuration comprising two RB DNA sequences, where the first RB DNA sequence (RBI) is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second RB DNA sequence (RB2) is positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in the anti-sense orientation from the 5' to 3' end of the second T-strand.
- the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the sequence of interest.
- Figure ID illustrates a vector configuration comprising optional LB DNA sequences.
- the first RB DNA sequence (RBI) is paired with a first LB DNA sequence (LB 1) which are positioned in the vector to initiate (RB 1) and terminate (LBl) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second RB DNA sequence (RB2) is paired with a second left border DNA sequence (LB2) which are positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T-strand.
- the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the sequence of interest.
- Figure 2 A illustrates a vector configuration comprising a first sequence of interest and a second sequence of interest, where the first sequence of interest is essentially identical to the second sequence of interest; and the vector configuration further comprises a first RB DNA sequence (RBI) with an optional first LB DNA sequence (LBl) which are positioned in the vector to initiate (RB I) and optionally terminate (LBl) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T- strand; and the vector configuration further comprises a second RB DNA sequence (RB2) and an optional second LB DNA sequence (LB2) which are positioned in the vector to initiate (RB2) and optionally terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T- strand.
- the sequence of interest in the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other.
- Figure 2B illustrates a vector configuration employed in co-transformation in which two essentially identical sequences of interest are located on separate vectors, hosted by one or more bacterium cells, and where the first vector configuration comprises a first RB DNA sequence (RBI) and an optional first LB DNA sequence (LBl) which are positioned in the first vector to initiate (RBI) and optionally terminate (LB l) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T- strand; and the second vector configuration comprises a second RB DNA sequence (RB2) and an optional second LB DNA sequence (LB2) which are positioned in the second vector to initiate (RB2) and optionally terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T- strand.
- the sequence of interest in the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other.
- Figure 2C illustrates a vector comprising a first sequence of interest, a second sequence of interest different from the first sequence of interest, two or more RB DNA sequences, and one or more optional LB DNA sequences.
- the illustrated construct is one non-limiting example where the vector configuration comprises a first RB DNA sequence (RBI) and an optional first LB DNA sequence (LBl) which are positioned in the vector to initiate (RBI) and optionally terminate (LBl) synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector configuration further comprises a second RB DNA sequence (RB2) and an optional second LB DNA sequence (LB2) which are positioned in the vector to initiate (RB2) and optionally terminate (LB2) synthesis of a second T-strand such that the first sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T-strand.
- RBI first RB DNA sequence
- LBl optional first LB DNA sequence
- the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the first sequence of interest.
- the vector configuration further comprises a third RB DNA sequence (RB3) and an optional third LB DNA sequence (LB3) which are positioned in the vector to initiate (RB3) and optionally terminate (LB3) synthesis of a third T-strand such that the second sequence of interest is in the sense orientation from the 5' to 3' end of the third T-strand; and the vector configuration further comprises a fourth RB DNA sequence (RB4) and an optional fourth LB DNA sequence (LB4) which are positioned in the vector to initiate (RB4) and optionally terminate (LB4) synthesis of a fourth T-strand such that the second sequence of interest is in an anti-sense orientation from the 5' to 3' end of the fourth T-strand.
- the two T-strands resulting from initiation at RB3 and RB4 are essentially complementary to each other in at least a portion of the second sequence of interest.
- Figure 3A illustrates a vector configuration where the vector comprises a RB DNA sequence and a LB DNA sequence and where the vector further comprises between the RB and LB DNA sequences: (i) a first sequence of interest in a sense orientation relative to the RB DNA sequence, (ii) a spacer, and (iii) a second sequence of interest in an anti-sense orientation relative to the RB DNA sequence.
- the two sequences of interest are essentially complementary, and after synthesis of the T-strand a hairpin structure may form.
- Figures 3B and 3C illustrate possible variations of at least two cassettes (S-o-1 1 and S-o-1 2) in relation to positions of two homology arms within a sequence of interest.
- Figure 3B illustrates a first cassette positioned between two homology arms (HA) and a second cassette positioned outside of the one of the homology arms (HA).
- Figure 3C illustrates both a first cassette and a second cassette positioned between two homology arms (HA).
- Figure 4A illustrates a schematic of the control vector configuration for vector A used in Example 9.
- Figure 4B illustrates a schematic of the vector configuration for vector C used in Example 9.
- Figure 4C illustrates a schematic of the vector configuration for vector D used in Example 9.
- Figure 5 illustrates the targeted DNA sequence, comprising a left homology arm (HA-L), a selectable marker gene (CP4-EPSPS), and a right homology arm (HA-R), that is used in Example 9.
- Figure 5 further illustrates the position of PCR primers and Southern blot probes used in Example 9.
- Figure 6A illustrates a schematic of the configuration of control vector A used in Example 12.
- Vector A comprises a first sequence of interest comprising an expression cassette encoding CP4-EPSPS positioned between a first RB DNA sequence (RBI) and a first LB DNA sequence (LB1).
- the vector further comprises a second sequence of interest comprising two expression cassettes each encoding half of a TALEN pair positioned between a second RB DNA sequence (RB2) and a second LB DNA sequence (LB2).
- Figure 6B illustrates a schematic of the configuration of vector B used in Example 12.
- the vector comprises a first sequence of interest comprising an expression cassette encoding CP4-EPSPS positioned between two RB DNA sequences and two LB DNA sequences.
- the vector further comprises a second sequence of interest comprising two expression cassettes each encoding half of a TALEN pair.
- Figure 7A illustrates a schematic of the configuration of vector A used in Example 13.
- the vector comprises a first sequence of interest comprising an expression cassette encoding CP4-EPSPS with a TALEN target site (TS) positioned 5' to the CP4-EPSPS cassette. Additionally, the first sequence of interest is flanked by a first RB DNA sequence (RBI) and a second RB DNA sequence (RB2).
- the vector further comprised a second sequence of interest comprising two expression cassettes each encoding half of a TALEN pair positioned adjacent to a third RB DNA sequence (RB3).
- Figure 7B illustrates a schematic of the configuration of vector B used in Example 13.
- the vector comprises a first sequence of interest comprising an expression cassette encoding CP4-EPSPS flanked by TALEN target sites. Additionally, the first sequence of interest is flanked by a first RB DNA sequence (RBI) and a second RB DNA sequence (RB2).
- the vector further comprised a second sequence of interest comprising two expression cassettes each encoding half of a TALEN pair positioned adjacent to a third RB DNA sequence (RB3).
- Figure 7C illustrates a schematic of the configuration of vector C used in Example 13.
- the vector comprises a first sequence of interest comprising an expression cassette encoding CP4-EPSPS positioned between a left homology arm and a right homology arm. Additionally, the first sequence of interest is positioned between a first RB DNA sequence (RBI) and a second RB DNA sequence (RB2).
- the vector further comprises a second sequence of interest comprising two expression cassettes each encoding half of a TALEN pair positioned adjacent to a third RB DNA sequence (RB3).
- a cell or tissue culture derived from a plant can comprise any plant components or plant organs (e.g., leaves, stems, roots, etc.), plant tissues, seeds, plant cells, and/or progeny of the same.
- a progeny plant can be from any filial generation, e.g., Fi, F 2 , F 3 , F 4 , F 5 , F 6 , F 7; etc.
- a plant cell is a biological cell of a plant, taken from a plant or derived through culture from a cell taken from a plant.
- transgenic means a plant cell, a plant, a plant part, or a seed whose genome has been altered by the stable integration of exogenous DNA.
- a transgenic line includes a plant regenerated from an originally-transformed plant cell and progeny transgenic plants from later generations or crosses of a transformed plant.
- stably transformed is defined as a transfer of DNA into genomic DNA of a targeted cell that allows the targeted cell to pass the transferred DNA to the next generation.
- the transferred DNA is integrated into the genomic DNA of a reproductive cell.
- the transferred DNA is integrated into the genomic DNA of a somatic cell.
- transiently transformed is defined as a transfer of DNA into a cell that is not integrated into the transformed cell's genomic DNA.
- plant genome refers to a nuclear genome, a mitochondrial genome, or a plastid (e.g., chloroplast) genome of a plant cell.
- the instant disclosure provides a Rhizobiales cell comprising at least one vector that is capable of forming at least two T-strands that are essentially
- the instant disclosure provides a Rhizobiales cell comprising at least one vector that is capable of forming at least two T-strands that are essentially complementary for at least a portion of the vector backbone.
- the instant disclosure provides a Rhizobiales cell comprising at least one vector that is capable of forming two T-strands that are essentially complementary for at least 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1,000, 1, 100, 1,200, 1,300, 1,400, 1,500, 1,600, 1,700, 1,800, 1,900, 2,000, 2, 100, 2,200, 2,300, 2,400, 2,500, 2,600, 2,700, 2,800, 2,900, 3,000, 3, 100, 3,200, 3,300, 3,400, 3,500, 3,600, 3,700, 3,800, 3,900, 4,000, 4, 100, 4,200, 4,300, 4,400, 4,500, 4,600, 4,700, 4,800, 4,900, 5,000, 5, 100, 5,200, 5,300, 5,400, 5,500, 5,600, 5,700, 5,800, 5,900, 6,000, 6, 100, 6,200, 6,300, 6,400, 6,500, 6,600, 6,700, 6,800, 6,900, 6,000
- 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 or more vectors provided herein are located within a single Rhizobiales cell.
- the instant disclosure provides a Rhizobiales cell comprising at least a first vector and a second vector, where the first vector is configured to produce a first T-strand comprising a sequence of interest and the second vector is configured to produce a second T-strand comprising a sequence of interest oriented such that the first and second T- strands are essentially complementary in at least a portion of a sequence of interest.
- the instant disclosure provides a Rhizobiales cell comprising at least a first vector and a second vector that are capable of producing a first T-strand and a second T- strand that are essentially complementary for at least a portion of the vector backbone.
- the instant disclosure provides a Rhizobiales cell comprising at least a first vector and a second vector that are capable of producing a first T-strand and a second T- strand that are essentially complementary for at least 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1,000, 1,100, 1,200, 1,300, 1,400, 1,500, 1,600, 1,700, 1,800, 1,900, 2,000, 2, 100, 2,200, 2,300, 2,400, 2,500, 2,600, 2,700, 2,800, 2,900, 3,000, 3,100, 3,200, 3,300, 3,400, 3,500, 3,600, 3,700, 3,800, 3,900, 4,000, 4,100, 4,200, 4,300, 4,400, 4,500, 4,600, 4,700, 4,800, 4,900, 5,000, 5, 100, 5,200, 5,300, 5,400, 5,500, 5,600, 5,700, 5,800, 5,900, 6,000, 6,100, 6,200, 6,300, 6,400
- Rhizobiales refers to members of the bacterial Order Rhizobiales that are capable of transforming a plant cell.
- a Rhizobiales provided herein can refer to an Agrobacterium spp., a Bmdyrhizobium spp., a Mesorhizobium spp., an Ochrobactrum spp., a Phyllobacterium spp., a Rhizobium spp., and a Sinorhizobium spp.
- a Rhizobiales provided herein can refer to Agrobacterium tumefaciens or Agrobacterium rhizogenes.
- 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 or more vectors provided herein are located within a single Rhizobiales cell. In other aspects, 2, 3, 4, 5, 6, 7, 8, 9 or 10 or more different vectors provided herein are located within 2, 3, 4, 5, 6, 7, 8, 9 or 10 or more different Rhizobiales cells.
- recombination refers to the exchange of nucleotides between two nucleic acid molecules.
- the term “homologous recombination” refers to the exchange of nucleotides at a conserved region shared by two nucleic acid molecules. HR includes symmetric homologous recombination and asymmetric homologous recombination. Asymmetric homologous recombination can also mean unequal recombination.
- non-homologous end joining HEJ refers to the ligation of two ends of double- stranded DNA without the need of a homologous sequence to direct the ligation.
- microhomology refers to the presence of the same short sequence (1 to 10 bp) of bases in different nucleic acid molecules.
- at least one of the nucleic acid molecules is genomic DNA and at least one of the nucleic acid molecules comprises two T-strands that are essentially complementary in at least a portion of a sequence of interest.
- Methods for detecting HR and NHEJ include, but are not limited to, 1) phenotypic screening, 2) molecular marker technologies such as single nucleotide polymorphism (SNP) analysis by TaqMan® or Illumina/Infinium technology, 3) Southern blot, and 4) sequencing (e.g., Sanger, Illumina®, 454, Pac-Bio, Ion TorrentTM).
- SNP single nucleotide polymorphism
- iPCR inverse PCR
- restriction nuclease sites flanking a targeted gene are identified on each of the two parental chromosomes. These restriction nuclease sites can be the same or different.
- a PCR primer specific for the first parental chromosome and another PCR primer specific for the second parental chromosome are designed.
- An induced double-strand break promotes recombination between the two parental chromosomes brings both restriction endonuclease sites and primer binding sites onto the same recombinant chromosome.
- a PCR product is observed only in instances where recombination occurs.
- the occurrence of homologous recombination can be detected by PCR, with primers specifically designed for the T-strand insert paired with primers specifically designed for flanking sequences of the target sequence (outside of the homologous regions). For example, when an upstream flanking primer is paired with a downstream insert-specific primer, a PCR product is observed only when recombination occurs.
- plasmid refers to a circular, double-stranded DNA molecule that is physically separate from chromosomal DNA.
- a plasmid or vector used herein is capable of replication in vivo.
- the vector is capable of transforming a plant cell.
- a plasmid provided herein is a bacterial plasmid.
- a plasmid provided herein is an Agrobacterium Ti plasmid or derived from an Agrobacterium Ti plasmid.
- a plasmid or vector provided herein is a recombinant vector.
- the term "recombinant vector” refers to a vector formed by laboratory methods of genetic recombination, such as molecular cloning.
- a vector or plasmid provided herein is a synthetic plasmid.
- a "synthetic plasmid" is an artificially created plasmid that is capable of the same functions ⁇ e.g., replication) as a natural plasmid (e.g., Ti plasmid).
- one skilled in the art can create a synthetic plasmid de novo via synthesizing a plasmid by individual nucleotides, or by splicing together nucleic acid molecules from different pre-existing plasmids.
- sequence identity when used in relation to nucleic acids, describe the degree of similarity between two or more nucleotide sequences.
- the percentage of "sequence identity" between two sequences is determined by comparing two optimally aligned sequences over a comparison window, such that the portion of the sequence in the comparison window may comprise additions or deletions (gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences.
- the percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison, and multiplying the result by 100 to yield the percentage of sequence identity.
- a sequence that is identical at every position in comparison to a reference sequence is said to be identical to the reference sequence and vice- versa.
- An alignment of two or more sequences may be performed using any suitable computer program. For example, a widely used and accepted computer program for performing sequence alignments is CLUSTALW vl .6 (Thompson, et al. (1994) Nucl. Acids Res., 22: 4673-4680).
- nucleic acid molecule refers to pairing of nucleotide bases such that A is complementary to T (or U), and G is complementary to C. Two complementary nucleic acid molecules are capable of hybridizing with each other. As an example, the two strands of double stranded DNA are complementary to each other.
- the term "essentially homologous" or "essentially identical” means that two nucleotide sequences have at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%), at least 99%, or 100% sequence identity with each other.
- the term "essentially complementary" means that two nucleotide sequences have at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or 100%) sequence complementarity with each other.
- a "portion" of a nucleic acid molecule refers to at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of its total length, or at least 5, 10, 15, 20, 25, 30, 25, 40, 45, 50, 75, 100, 150, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900, or 1000 or more contiguous nucleotides.
- polynucleotide refers to a nucleic acid molecule containing multiple nucleotides and generally comprises at least 2, at least 5, at least 10, at least 20, at least 30, at least 40, at least 50, at least 100, at least 250, at least 500, at least 1000, at least 1500, at least 2000, at least 2500, at least 3000, at least 5000, at least 10,000 or more nucleotide bases.
- a polynucleotide provided herein can be a plasmid.
- a specific polynucleotide of 18 - 25 nucleotides in length may be referred to as an
- oligonucleotide Nucleic acid molecules provided herein include deoxyribonucleic acids (DNA) and ribonucleic acids (RNA) and functional analogues thereof, such as
- Nucleic acid molecules provided herein can be single stranded or double stranded. Nucleic acid molecules comprise the nucleotide bases adenine (A), guanine (G), thymine (T), cytosine (C). Uracil (U) replaces thymine in RNA molecules.
- the symbol “N” can be used to represent any nucleotide base (e.g., A, G, C, T, or U).
- the symbol “K” can be used to represent a G or a T/U nucleotide base.
- nucleic acid sequences are located on the same nucleic acid molecule.
- a physical linkage can be adjacent or proximal.
- a nucleic acid sequence provided herein is adjacent to another nucleic acid sequence.
- a first nucleic acid molecule provided herein is physically linked to a second nucleic acid molecule provided herein.
- “flanked” refers to a nucleic acid sequence that is linked on one or both sides to another nucleic acid sequence, including linked to a sequence of interest, or to a LB DNA sequence, or a RB DNA sequence.
- a flanking sequence precedes a sequence of interest.
- flanking sequence follows a sequence of interest.
- a sequence of interest is flanked by another sequence of interest.
- a flanking sequence is on the 5' or upstream end of a sequence of interest.
- a flanking sequence is on the 3 ' or downstream end of a sequence of interest.
- the flanking sequence is contiguous with the sequence of interest.
- operably linked means that the operably linked nucleic acid sequences exhibit their desired function.
- a provided DNA promoter sequence can initiate transcription of an operably linked DNA sequence into RNA.
- a nucleic acid sequence provided herein can be upstream or downstream of a physically or operably linked nucleic acid sequence.
- a first nucleic acid molecule provided herein is both physically linked and operably linked to a second nucleic acid molecule provided herein.
- a first nucleic acid molecule provided herein is neither physically linked nor operably linked to a second nucleic acid molecule provided herein.
- upstream means the nucleic acid sequence is positioned before the 5' end of a linked nucleic acid sequence.
- downstream means the nucleic acid sequence is positioned after the 3' end of a linked nucleic acid sequence.
- a "spacer” or a “linker” refers to any polynucleotide sequence capable of forming a loop structure that is at least 4 nucleotides in length. Without being limiting, a spacer provided herein can comprise a non-coding sequence or a coding sequence.
- T-strand refers to a single-strand copy of DNA made when transcription is initiated from a RB DNA sequence of a vector.
- synthesis of the transfer DNA is initiated at a 25-base-pair consensus DNA sequence referred to as the "right border” (RB), and T-strand synthesis termination occurs at a 25 bp consensus DNA sequence referred to as the "left border” (LB).
- RB 25-base-pair consensus DNA sequence
- LB left border
- the 25 bp RB consensus DNA sequence (SEQ ID NO: 21) and the 25 bp LB consensus DNA sequence are from the nopaline Agrobacterium tumefaciens strain C58.
- the 25 bp RB consensus DNA sequence (SEQ ID NO: 22) and the 25 bp LB consensus DNA sequence (SEQ ID NO: 24) are from the octopine Agrobacterium tumefaciens strain A6.
- a RB consensus DNA sequence is selected from SEQ ID NOs: 21 and 23.
- a LB consensus DNA sequence is selected from SEQ ID NOs: 22 and 24.
- a RB DNA sequence or LB DNA sequence comprises at least a 25 bp Ti plasmid RB or LB (respectively) consensus DNA sequence and may additionally comprise a nucleic acid sequence comprising at least 25 nucleotides, at least 50 nucleotides, at least 100 nucleotides, at least 150 nucleotides, at least 200 nucleotides, at least 250 nucleotides, at least 300 nucleotides, at least 350 nucleotides, at least 400 nucleotides, at least 450 nucleotides, at least 500 nucleotides, or at least 600 nucleotides.
- a RB DNA sequence or a LB DNA sequence provided herein can be of any length such that the DNA segment is capable of transformation of plant tissue performed by Agrobacterium or other Rhizobiales-mediated methods ⁇ See U. S. Patent Nos. 5,731, 179 and 6,265,638; and U.S. Patent Application Publications US2003/110532; US2005/0183170; and US2007/0271627).
- RB DNA sequences are presented as SEQ ID NOs: 1 - 13 and are variants of A. tumefaciens Ti plasmid sequences ranging from 162 nt to 505 nt in length and comprising either the nopaline RB consensus DNA sequence (SEQ ID NO: 21) or the octopine RB consensus DNA sequence (SEQ ID NO: 22). Similarly, as used in this application, LB DNA sequences are presented as SEQ ID NOs: 14 - 20 and are variants of A.
- tumefaciens Ti plasmid sequences ranging from 220 nt to 443 nt in length and comprising either the nopaline LB consensus DNA sequence (SEQ ID NO: 23) or the octopine LB consensus DNA sequence (SEQ ID NO: 24).
- Table 1 contains the SEQ ID NOs for each of the RB and LB DNA sequences presented herein with the position indicated within each SEQ ID NO of the nopaline or octopine 25 bp consensus sequences. Other RB and LB sequences are contemplated. Table 1.
- a vector provided herein can comprise at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 RB DNA sequences.
- the RB DNA sequence comprises an octopine Agrobacterium Ti plasmid 25 bp RB DNA consensus sequence.
- the RB DNA sequence comprises a nopaline Agrobacterium Ti plasmid 25 bp RB consensus DNA sequence.
- the RB DNA sequences may be essentially homologous or the RB DNA sequences may not be essentially homologous.
- the one or more RB DNA sequences comprise a sequence at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical, or 100% identical to a sequence selected from SEQ ID NOs: 1-13.
- the first RB (RBI) DNA sequence or the second RB (RB2) DNA sequence comprises a sequence selected from the group comprising SEQ ID NOs: 1-13.
- the RBI DNA sequence comprises a sequence at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%), at least 97%, at least 98%, at least 99% identical, or 100% identical to a sequence selected from SEQ ID NO: 4 or SEQ ID NO: 12; and the RB2 DNA sequence comprises a sequence at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical, or 100% identical to a sequence
- the at least one vector disclosed herein does not comprise a LB DNA sequence.
- the at least one vector disclosed herein comprises at least one, at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 LB DNA sequences.
- the LB DNA sequences may be essentially homologous or the LB DNA sequences may not be essentially homologous.
- the LB DNA sequence comprises an octopine Agrobacterium Ti plasmid 25 bp LB consensus DNA sequence.
- the LB DNA sequence comprises a nopaline Agrobacterium Ti plasmid 25 bp LB consensus DNA sequence.
- the one or more of the LB DNA sequences are selected from the group comprising SEQ ID NOs: 14-20.
- the two or more LB DNA sequences comprise a sequence at least 80%, at least 81%, at least 81%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a sequence selected from SEQ ID NOs: 14-20.
- the LB DNA sequence comprises a sequence at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical, or 100% identical to SEQ ID NO: 19.
- sequence of interest refers to a polynucleotide sequence in a vector that forms part of the T-strand.
- a vector provided herein comprises 0, at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10, or more sequences of interest.
- a sequence of interest in a vector disclosed herein does not include RB or a LB DNA sequences, and does not include vector backbone sequence.
- vector configurations with two sequences of interest positioned between two RB DNA sequences such that the sequence of interest in the two T- strands synthesized from the two RB DNA sequences are essentially complementary.
- the nucleotide sequence of the two sequences of interest may be identical, or the nucleotide sequence may be at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical.
- a sequence of interest provided herein comprises 0, at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 expression cassettes.
- the sequence of interest provided herein comprises one or more expression cassettes physically and/or operably linked in a cassette stack.
- a sequence of interest comprises an expression cassette adjacent to a left homology arm DNA sequence, a right homology arm DNA sequence, or a left homology arm DNA sequence and a right homology arm DNA sequence.
- a sequence of interest comprises an expression cassette flanked by homology arm DNA sequences.
- a sequence of interest comprises one ore more expression cassettes that is not flanked by homology arms.
- a sequence of interest comprises one or more expression cassettes flanked by one ore more site-specific enzyme target sites. In some aspects, a sequence of interest comprises one or more expression cassettes flanked by one ore more recombinase recognition sites. In another aspect, a sequence of interest provided herein comprises an endogenous polynucleotide sequence. In some embodiments, the endogenous polynucleotide sequence comprises an intergenic sequence, a native gene, or a mutated gene. In another aspect, a sequence of interest provided herein comprises an exogenous
- At least part of the sequence of interest in the vector disclosed herein is integrated into a plant genome via HR. In another aspect, at least part of the sequence of interest in the vector disclosed herein is integrated into a plant genome via HEJ.
- a sequence of interest is flanked by at least two homology arm DNA sequences and comprises a sequence that is at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%), at least 98%, at least 99%, or 100% identical to a native gene of a plant cell.
- the at least two homology are sequences are flanked by one or more site- specific enzyme target sites.
- the at least two homology are sequences are flanked by one or more recombinase recognition sites.
- a sequence of interest provided herein comprises 0, at least 1, or at least 2 homology arm DNA sequences.
- a sequence of interest provided herein comprises at least two homology arm DNA sequences the at least two homology arm DNA sequences can be distinguished by referring to them as a "left homology arm DNA sequence" and a "right homology arm DNA sequence.”
- a sequence of interest provided herein comprises both a left homology arm DNA sequence and a right homology arm DNA sequence.
- a right homology arm DNA sequence and a left homology arm DNA sequence provided herein are homologous to a targeted genomic DNA sequence in the plant or plant cell.
- a right homology arm DNA sequence and a left homology arm DNA sequence are not essentially homologous to each other.
- a right homology arm DNA sequence and a left homology arm DNA sequence are essentially homologous to each other.
- a sequence of interest comprises one or more expression cassettes positioned between a right homology arm DNA sequence and a left homology arm DNA sequence.
- a sequence of interest comprises a sequence for templated genome editing positioned between a right homology arm DNA sequence and a left homology arm DNA sequence.
- at least part of a sequence of interest provided herein is outside of the region comprising a left homology arm DNA sequence, a right homology arm DNA sequence, and one or more cassettes.
- at least part of a sequence of interest provided herein is within the region comprising a left homology arm DNA sequence, a right homology arm DNA sequence, and a sequence for templated genome editing.
- a sequence of interest provided herein comprises a first sequence positioned between a left homology arm DNA sequence and a right homology arm DNA sequence, and a second sequences that is not positioned between a left homology arm DNA sequence and a right homology arm DNA sequence.
- a vector provided herein comprises a first sequence of interest further comprising a first left homology arm DNA sequence and a first right homology arm DNA sequence and a second sequence of interest further comprising a second left homology arm DNA sequence and a second right homology arm DNA sequence.
- a sequence of interest provided herein is integrated into a plant genome in its entirety. In an aspect, at least a part of a sequence of interest provided herein is integrated into a plant genome. In another aspect, only the sequence of interest between a right homology arm DNA sequence and a left homology arm DNA sequence is integrated into a plant genome. In another aspect, a sequence of interest provided herein is integrated into a plant genome via HR. In an aspect, HR can occur between one or two homology arm DNA sequences provided herein and a plant genome. In another aspect, a sequence of interest provided herein is integrated into a plant genome via NHEJ. In another aspect, a sequence of interest provided herein is integrated into a plant genome via microhomology-mediated end joining. In yet another aspect, a sequence of interest provided herein is randomly integrated into a plant genome via Agrobacterium-mediated T-strand integration.
- integration of at least part of a sequence of interest provided herein results in one or more point mutations, one or more insertions, one or more deletions, an inversion, increased transcription of an endogenous locus, decreased transcription of an endogenous locus or any combination thereof.
- integration of at least part of a sequence of interest provided herein results in altered protein activity, altered production of RNAi polynucleotides, altered sequence of RNAi target sites, altered RNAi pathway activity, increased transcription of the sequence of interest, silencing of the integrated sequence of interest, or any combination thereof.
- Silencing technologies include, without limitations, antisense-, co-suppression-mediated mechanisms, and RNAi technologies, such as miRNA (e.g., U. S. Patent Application Publication 2006/0200878).
- integration of at least part of a sequence of interest causes targeted transcription, decreased transcription, enhanced transcription, or templated editing of a transgenic nucleic acid sequence present in the genome.
- homology arm or “homology arm DNA sequence” refers to a polynucleotide sequence that has at least 80%, at least 85%>, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%) or 100%> sequence identity to a target sequence in a plant or plant cell.
- a homology arm can comprise at least 5, at least 10, at least 15, at least 20, at least 25, at least 30, at least 40, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 350, at least 400, at least 450, at least 500, at least 500, at least 550, at ;east 600, at least 650, at least 700, at least, 750, at least 800, at least 850, at least 900, at least 950, at least 1000, or at least 2500 nucleotides.
- the target sequence comprises a protein-coding sequence.
- the target sequence is a genie sequence.
- a "genie" sequence is a nucleic acid sequence that encodes a protein or a non-protein-coding RNA.
- a genie sequence can include one or more introns.
- the target sequence is a non-genic sequence.
- a "non-genic" sequence is a nucleic acid sequence that is not a genie sequence.
- the target sequence comprises a non-coding sequence.
- the target sequence comprises both a protein-coding sequence and a non- coding sequence.
- the target sequence does not comprise a gene or a portion of a gene.
- the target sequence is linked to a gene of interest.
- the target sequence is linked to a transgene integrated in the genome of a plant or plant cell.
- target sequence refers to a genomic locus selected for targeted integration of a sequence of interest.
- the sequence of interest is integrated into the target sequence by HR or NHEJ.
- target sequence can refer to the full-length nucleotide sequence of the genomic locus targeted for cleavage and recombination, or the nucleotide sequence of a portion of the genomic locus targeted for cleavage and recombination.
- the target sequence can be an endogenous genomic locus or a transgene.
- a target sequence is a genie sequence.
- a target sequence is a non-genic sequence.
- a "site-specific recombination site” or “site-specific target site” are used interchangeably to refer to a nucleic acid sequence where exogenous DNA is inserted by HR or by non-homologous recombination.
- site-specific enzyme target site refers to the site that is cleaved by a nuclease introducing a double stranded break into the nucleic acid backbone. The site of the double-strand break may be a target site for introduction of a sequence of interest.
- endogenous refers to a nucleic acid sequence that exists naturally in the genome of a cell.
- a method provided herein is used to modify an endogenous locus so that the modified locus shares at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, at least 99.5%, or at least 99.9% sequence identity as compared to an unmodified endogenous locus.
- an endogenous nucleic acid sequence undergoes "template editing.” Template editing can occur via HR between a target sequence and a donor template after a double-stranded break occurs in or near the target sequence.
- a "donor template” is a nucleic acid sequence that shares at least 60% at least 60%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, at least 99.5%), or at least 99.9% sequence identity to at least a portion of a target sequence.
- a donor template provided herein comprises a sequence of interest.
- Template editing can introduce one or more point mutations, deletions, or insertions into a target sequence.
- the entire donor template, a portion of the donor template, a copy of the donor template, or a portion of a copy of the donor template integrates into the target sequence.
- template editing would be analogous to an orthologous nucleic acid sequence, a paralogous nucleic acid sequence, an isogenic nucleic acid sequence, or a cisgenic nucleic acid sequence of the endogenous genome.
- exogenous refers to a nucleic acid sequence that is not normally present in a cell, but can be introduced into a cell by one or more genetic, biochemical or other methods.
- an exogenous nucleic acid sequence can be
- cassette or "expression cassette” refer to a nucleic acid sequence which may or may not be operably linked to one or more expression elements such as an enhancer, a promoter, a leader, an intron, a 5' untranslated region (UTR), a 3' UTR, or a transcription termination sequence.
- expression elements such as an enhancer, a promoter, a leader, an intron, a 5' untranslated region (UTR), a 3' UTR, or a transcription termination sequence.
- a cassette comprises a nucleic acid sequence that is at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or 100%) identical to an endogenous nucleic acid sequence.
- the cassette comprises an exogenous nucleic acid sequence.
- a cassette in a vector disclosed herein comprises at least one sequence selected from: a gene, a portion of a gene, an intergenic sequence, an enhancer, a promoter, an intron, an exon, a transcription termination sequence, a sequence encoding a chloroplast targeting peptide, a sequence encoding a mitochondrial targeting peptide, an insulator sequence, a sequence encoding an anti-sense RNA, a sequence encoding non-protein-coding RNA (npcRNA), a sequence encoding a recombinase, a sequence encoding a recombinase recognition site (for example, a lox site or a flp site), a sequence encoding an Argonaute (non-limiting examples of Argonaute proteins include Thermus thermophilics Argonaute (TtAgo), Pyrococcus furiosus Argonaute (PfAgo), Natronobacterium gregoryi
- the cassette is positioned between a first right border (RBI) and a second right border (RB2) such that the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the cassette.
- the cassette is positioned adjacent to at least one LB DNA sequence.
- the cassette is not positioned between RB DNA sequences.
- the cassette is not positioned between a RB DNA sequence and a LB DNA sequence.
- the term "landing pad" refers to a nucleic acid locus that is designed to be the locus for site-specific recombination.
- landing pads may comprise one or more nucleic acid sequences that are not homologous to native sequences of the host organism. In some embodiments, landing pads may comprise one or more recognition sites for any of an endonuclease, a recombinase or a transposase. In some embodiments a landing pad may comprise one or more recognition sites for any of an endonuclease, a recombinase or a transposase flanking one or more nucleotide sequences substantially lacking homology with the genome of the host organism.
- the one or more recognition sites are binding sites for DNA-binding domains (e.g., zinc finger proteins (ZFPs), meganucleases, or leucine zippers).
- landing pads may comprise nucleotide sequences that have substantially no homology to regions of any sequenced target plant genome.
- the landing pad may comprises any combination of one or more Zinc Finger Nuclease recognition sites, one or more
- an editing template refers to a nucleic acid sequence that can be used for recombination with a target sequence.
- an editing template comprises a sequence that is at least 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% identical to at least a portion of a target sequence.
- the editing template comprises one or more nucleic acid changes compared to the target sequence.
- cassette stack refers to two or more expression cassettes which are physically linked in a vector. In some embodiments, two or more cassettes in a cassette stack are operably linked. In some embodiments, two or more cassettes in a cassette stack are not operably linked. In some embodiments, two or more cassettes in a cassette stack are flanked by one or more recombinase recognition sites. In some embodiments, two or more cassettes in a cassette stack are flanked by one or more a site-specific enzyme target sites.
- two or more cassettes in a cassette stack may be separated by spacer sequences, insulator sequences, multiple cloning site sequences, one or more recombinase recognition sites, a sequence encoding a site-specific enzyme target site, a landing pad, homology arms, a RB DNA sequence or a LB DNA sequence.
- the cassette stack is positioned between a first right border (RBI) and a second right border (RB2) such that the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the cassette stack.
- the cassette stack is positioned adjacent to at least one LB DNA sequence.
- the cassette stack is not positioned between RB DNA sequences.
- the cassette stack is not positioned between a RB DNA sequence and a LB DNA sequence.
- genomic locus means a locatable region of genomic sequence, corresponding to a unit of inheritance.
- a genomic locus may comprise one or more regulatory regions, such as promoters, enhancers, 5' UTRs, intron regions, 3'UTRs, transcribed regions, and other functional sequence regions that may exist as native genes or transgenes in a plant or a mammalian genome.
- gene refers to a sequence that encodes a protein, or a sequence encoding a non-protein-coding RNA.
- protein-coding refers to a
- polynucleotide encoding for the amino acids of a polypeptide.
- "encoding” refers to a polynucleotide that can produce a functional unit (without being limiting, for example, a protein, a microRNA, a transfer RNA, a ribosomal RNA, a small interfering RNA, a guide RNA, a tracer RNA, a single-guide RNA) via transcription and/or translation.
- a series of three nucleotide bases encodes one amino acid.
- “expressed,” “expression,” or “expressing” refers to transcription of RNA from a DNA molecule.
- a sequence of interest provided herein comprises a protein-coding sequence.
- npcRNA refers to non-protein-coding RNA.
- Non- limiting examples of non-protein-coding RNA include a microRNA (miRNA), a miRNA precursor, a small interfering RNA (siRNA), a small RNA (22-26 nt in length) and precursor encoding same, a heterochromatic siRNA (hc-siRNA), a Piwi-interacting RNA (piRNA), a hairpin double strand RNA (hairpin dsRNA), a trans-acting siRNA (ta-siRNA), a naturally occurring antisense siRNA (nat-siRNA), a tracer RNA (tcRNA), a guide RNA (gRNA), and a single-guide RNA (sgRNA).
- a sequence of interest provided herein comprises a non-protein-coding RNA.
- an expression cassette comprises at least one gene selected from a gene of agronomic interest, a DNA binding gene, a selectable marker gene, an RNAi construct, a site specific nuclease gene, a recombinant guide RNA of an RNA-guided nuclease
- RNA-guided nucleases include Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl
- a gene provided herein comprises a promoter. In another aspect, a gene provided herein does not comprise a promoter.
- genes for disease, insect, or pest tolerance for example, virus tolerance, bacteria tolerance, fungus tolerance, nematode tolerance, arthropod tolerance, gastropod tolerance
- herbicide tolerance genes for quality improvements such as yield, nutritional enhancements, environmental or stress tolerances, or any desirable changes in plant physiology, growth, development, morphology or plant product(s) including starch production (U.S. Pat. Nos. 6,538, 181; 6,538,179; 6,538,178; 5,750,876; 6,476,295), modified oils production (U.S. Pat. Nos.
- an expression cassette provided herein comprises a nucleic acid sequence selected from an insecticidal resistance gene, an herbicide tolerance gene, a nitrogen use efficiency gene, a water use efficiency gene, a nutritional quality gene, a DNA binding gene, a selectable marker gene, an RNAi construct, a site specific nuclease gene, a guide RNA, and any combination thereof.
- a gene can also include polynucleotide sequences that encode for other polynucleotide sequences such as a messenger RNA (mRNA).
- mRNA messenger RNA
- An mRNA produced from a nucleic acid molecule of this disclosure can contain a 5'-UTR leader sequence.
- This sequence can be derived from the promoter selected to express the gene and can be specifically modified so as to increase or decrease translation of the mRNA.
- a 5'-UTR can also be obtained from viral RNAs, from suitable eukaryotic genes, or from a synthetic gene sequence. Such "enhancer" sequences can be desirable to increase or alter the translational efficiency of the resultant mRNA. This disclosure is not limited to constructs where the non- translated region is derived from the 5'-UTR that accompanies the promoter sequence.
- the 5'-UTR sequence can be derived from unrelated promoters or genes (see, for example U.S. Pat. No. 5,362,865).
- non-translation leader sequences include maize and petunia heat shock protein leaders (U.S. Pat. No. 5,362,865), plant virus coat protein leaders, plant rubisco leaders, GmHsp (U.S. Pat. No. 5,659,122), PhDnaK (U.S. Pat. No. 5,362,865), AtAntl, TEV (Carrington and Freed, Journal of Virology, (1990) 64: 1590- 1597), and AGRtu.nos (GenBank Accession V00087; Bevan et al. , Nucleic Acids Research (1983) 11 :369-385).
- Other genetic components that serve to enhance expression or affect transcription or translational of a gene are also envisioned as genetic components.
- a gene can further comprise a 3'-UTR.
- the provided 3'-UTRs can contain a transcriptional terminator, or an element having equivalent function, and a polyadenylation signal that functions in plants to cause the addition of polyadenylated nucleotides to the 3' end of an RNA molecule.
- suitable 3' regions are (1) the 3' transcribed, non- translated regions containing the polyadenylation signal of Agrobacterium Ti plasmid genes, such as the nopaline synthase (NOS; Fraley et al., Proceedings of the National Academy of Sciences, USA (1983) 80: 4803-4807) gene, and (2) plant genes such as the soybean storage protein genes and the small subunit of the ribulose-l,5-bisphosphate carboxylase
- ssRUBISCO ssRUBISCO gene.
- An example of a 3' region is that from the ssRUBISCO E9 gene from pea (European Patent Application 0385 962).
- an expression cassette or sequence of interest provided herein may comprise a sequence encoding a cell factor that functions to increase DNA repair, where the protein is selected from the group consisting of a vir gene from the Ti plasmid, Rad51, Rad52, Rad2, a dominant-negative Ku70, or any combination thereof.
- an expression cassette provided herein comprises a nucleic acid sequence that is not essentially homologous to an endogenous plant nucleic acid sequence.
- an expression cassette provided herein comprises a nucleic acid sequence that is not essentially homologous to an endogenous plant gene.
- an expression cassette provided herein comprises a nucleic acid sequence that is at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to an endogenous plant gene.
- an expression cassette provided herein comprises a nucleic acid sequence that is at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or 100%) identical to an endogenous plant nucleic acid sequence.
- a promoter contains a sequence of nucleotide bases that signals RNA polymerase to associate with the DNA and to initiate transcription into mRNA using one of the DNA strands as a template to make a corresponding complementary strand of RNA.
- a promotor provided herein is a constitutive promoter.
- a promoter provided herein is a regulatable promoter.
- an expression cassette provided herein can comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more than 10 promoters.
- a promoter provided herein is located within a sequence of interest. In another aspect, a promoter provided herein is not located within a sequence of interest.
- promoters that are active in plant cells have been described in the literature. Such promoters would include but are not limited to the nopaline synthase (NOS) and octopine synthase (OCS) promoters that are carried on Ti plasmids of Agrobacterium tumefaciens, the caulimovirus promoters such as the cauliflower mosaic virus (CaMV) 19S and 35S promoters and the Figwort mosaic virus (FMV) 35S promoter, and the enhanced CaMV35S promoter (e35S).
- NOS nopaline synthase
- OCS octopine synthase
- CaMV cauliflower mosaic virus
- FMV Figwort mosaic virus
- e35S enhanced CaMV35S promoter
- a promoter is capable of causing sufficient expression to result in the production of an effective amount of the gene product of interest. Examples describing such promoters include without limitation U.S. Pat. No. 6,437,217 (maize RS81 promoter), U.S. Pat. No. 5,641,876 (rice actin promoter), U.S. Pat. No. 6,426,446 (maize RS324 promoter), U.S. Pat. No.
- 6,429,362 (maize PR-1 promoter), U.S. Pat. No. 6,232,526 (maize A3 promoter), U.S. Pat. No. 6, 177,611 (constitutive maize promoters), U.S. Pat. Nos. 5,322,938, 5,352,605, 5,359,142 and 5,530,196 (35S promoter), U.S. Pat. No. 6,433,252 (maize L3 oleosin promoter), U.S. Pat. No. 6,429,357 (rice actin 2 promoter as well as a rice actin 2 intron), U.S. Pat. No. 5,837,848 (root specific promoter), U.S. Pat. No.
- a nopaline synthase (NOS) promoter (Ebert et al., 1987), the octopine synthase (OCS) promoter (which is carried on tumor-inducing plasmids of Agrobacterium tumefaciens), the caulimovirus promoters such as the cauliflower mosaic virus (CaMV) 19S promoter (Lawton et al, Plant Molecular Biology (1987) 9: 315-324), the CaMV 35S promoter (Odell et al, Nature (1985) 313 : 810-812), the figwort mosaic virus 35S-promoter (U.S. Pat. Nos.
- NOS nopaline synthase
- OCS octopine synthase
- sucrose synthase promoter (Yang and Russell, Proceedings of the National Academy of Sciences, USA ( 1990) 87: 4144-4148), the R gene complex promoter (Chandler et al, Plant Cell (1989) 1 : 1175-1183), and the chlorophyll a/b binding protein gene promoter, PC1 SV (U.S. Pat. No. 5,850,019), and AGRtu.nos (GenBank Accession V00087; Depicker et al., Journal of Molecular and Applied Genetics (1982) 1 : 561-573; Bevan et al., 1983) promoters.
- promoter hybrids can be constructed to enhance transcriptional activity (U.S. Pat. No. 5,106,739), or to combine desired transcriptional activity, inducibility and tissue specificity or developmental specificity.
- Promoters that function in plants include but are not limited to promoters that are inducible, viral, synthetic, constitutive, temporally regulated, spatially regulated, and spatio-temporally regulated.
- Other promoters that are tissue-enhanced, tissue-specific, or developmentally regulated are also known in the art and envisioned to have utility in the practice of this disclosure.
- Promoters used in the provided nucleic acid molecules and vectors of this disclosure can be modified, if desired, to affect their control characteristics. Promoters can be derived by means of ligation with operator regions, random or controlled mutagenesis, etc. Furthermore, the promoters can be altered to contain multiple "enhancer sequences" to assist in elevating gene expression.
- a sequence of interest provided herein can comprise one or more selectable or screenable marker genes.
- the selectable or screenable marker gene aids in the identification of a transformed plant, or a product of agronomic utility.
- DNA that serves as a selectable or screenable marker can function in a regenerable plant tissue to produce a compound that would confer upon the plant tissue resistance to an otherwise toxic compound.
- selectable or screenable marker genes are known in the art and can be used.
- Genes for use as a selectable or screenable marker can include, but are not limited, to ⁇ -glucuronidase (GUS), green fluorescent protein (GFP), luciferase (LUC), genes conferring tolerance to antibiotics like kanamycin (Dekeyser et al., Plant Physiology (1989) 90: 217-223) or spectinomycin (e.g. spectinomycin aminoglycoside adenyltransferase (aadA); U.S. Pat. No. 5,217,902), genes that encode enzymes that give tolerance to herbicides like glyphosate (e.g.
- EPSPS 5- enolpymvylshikimate-3 -phosphate synthase
- sulfonyl herbicides e.g. acetohydroxyacid synthase or acetolactate synthase conferring tolerance to acetolactate synthase inhibitors such as sulfonylurea, imidazolinone, triazolopyrimidine, pyrimidyloxybenzoates and phthalide; (U.S. Pat. Nos. 6,225,105;
- Escherichia coli allowing growth in the presence of mannose), and dual selection (e.g.
- spectinomycin at a concentration of about 25-1000 ppm, such as at about 150 ppm, is also contemplated.
- a selectable or screenable marker provided herein is a positive selection marker.
- a positive selection marker confers an advantage to a cell comprising such marker.
- a sequence of interest provided herein comprises one or more positive selection markers.
- a positive selection marker confers antibiotic resistance or herbicide resistance.
- a selectable or screenable marker provided herein is a negative selection marker.
- a sequence of interest provided herein comprises one or more negative selection markers.
- a selectable or screenable marker provided herein is both a positive selection marker and a negative selection marker.
- a negative selectable marker provided herein can be a lethal or non-lethal negative selectable marker.
- non-lethal negative selectable markers examples include U.S. Publication No. 2004-0237142, such as GGPP synthases, GA 2-oxidase gene sequences, isopentenyl transferase (IPT), CKI1 (cytokinin-independent 1), ESR-2, ESR1-A, auxin-producing genes, such as indole-3-acetic acid (IAA), iaaM, iaah, roLABC, genes that result in over-expression of ethylene biosynthetic enzymes, VP1 genes, AB13 genes, LEC1 genes, and Basl genes for example.
- a non-lethal negative selectable marker gene can be included on any nucleic acid molecule provided herein.
- a non-lethal negative selectable marker gene provided herein is a gene resulting in the over-expression of a class of enzymes that use substrates of the gibberellic acid (GA) biosynthetic pathway, but that do not result in the production of bioactive GA.
- a nucleic acid molecule provided herein comprises a non-lethal negative selectable marker gene such as a phytoene synthase gene from Erwinia herbicola (crtB).
- the selectable or screenable marker imparts a distinct phenotype to cells expressing the marker protein and often provide a means to more efficiently distinguish such transformed cells from cells that do not have the selectable or screenable marker.
- Such genes may encode either a selectable or screenable marker, depending on whether the marker confers a trait which one can "select" for by chemical means, such as through the use of a selective agent (for example, an herbicide, or an antibiotic), or whether it is simply a trait that one can identify through observation (for example, expression of GFP) or testing or "screening".
- a selective agent for example, an herbicide, or an antibiotic
- suitable marker proteins are known to the art and can be employed in the practice of the invention.
- selectable or “screenable markers” also are genes which encode a "secretable marker” whose secretion can be detected as a means of identifying or selecting for transformed cells. Examples include markers which are secretable antigens that can be identified by antibody interaction, or even secretable enzymes which can be detected by their catalytic activity. Secretable proteins fall into a number of classes, including small, diffusible proteins detectable, by ELISA; or small active enzymes detectable in extracellular solution (for example, a-amylase, ⁇ -lactamase, phosphinothricin
- proteins that are inserted or trapped in the cell wall for example, proteins that include a leader sequence such as that found in the expression unit of extensin or tobacco PR-S.
- screenable markers examples include a
- ⁇ -glucuronidase GUS or uidA gene which encodes an enzyme for which various chromogenic substrates are known
- an R-locus gene which encodes a product that regulates the production of anthocyanin pigments (red color) in plant tissues (Dellaporta et al, 1988); a ⁇ -lactamase gene (Sutcliffe, 1978), which encodes an enzyme for which various chromogenic substrates are known (e.g., PAD AC, a chromogenic cephalosporin); a y/E gene (Zukowsky et al, 1983) which encodes a catechol dioxygenase that can convert chromogenic catechols; an ⁇ -amylase gene (Ikuta et al, 1990); a tyrosinase gene (Katz et al, 1983) which encodes an enzyme capable of oxidizing tyrosine to DOPA and dopaquinone which in turn condenses to
- a sequence of interest provided herein comprises polynucleotides encoding at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 site-specific enzymes.
- a plant cell provided herein already comprises a polynucleotide encoding a site-specific enzyme.
- a polynucleotide encoding a site-specific enzyme provided herein is stably transformed into a plant cell.
- a polynucleotide encoding a site-specific enzyme provided herein is transiently transformed into a plant cell.
- a polynucleotide encoding a site-specific enzyme is under the control of a regulatable promoter, a constitutive promoter, a tissue specific promoter, or any promoter useful for expression of the site- specific enzyme.
- a vector comprises in cis a cassette encoding a site-specific enzyme and a sequence of interest such that when contacted with the genome of a plant cell, the site-specific enzyme enables site-specific integration of the sequence of interest.
- a first vector comprises a cassette encoding a site-specific enzyme and a second vector comprises a sequence of interest such that when contacted with the genome of a plant cell, the site-specific enzyme provided in trans enables site-specific integration of the sequence of interest.
- site-specific enzyme refers to any enzyme that can cleave a nucleotide sequence in a site-specific manner.
- a site-specific enzyme provided herein is selected from the group consisting of an endonuclease (without being limiting, for example, a meganuclease, a zinc-finger nuclease (ZFN), a transcription activator-like effector nucleases (TALEN), an Argonaute (non-limiting examples of
- Argonaute proteins include Thermus thermophilics Argonaute (TtAgo), Pyrococcus furiosus Argonaute (PfAgo), Natronobacterium gregoryi Argonaute (NgAgo), an RNA-guided nuclease (non-limiting examples of RNA-guided nucleases include Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Cs
- a tyrosine recombinase attached to a DNA recognition motif provided herein is selected from the group consisting of a Cre recombinase, a Gin
- a Cre recombinase or a Gin recombinase provided herein is tethered to a zinc-finger DNA binding domain.
- a serine recombinase attached to a DNA recognition motif provided herein is selected from the group consisting of a PhiC31 integrase, an R4 integrase, and a TP-901 integrase.
- a DNA transposase attached to a DNA binding domain provided herein is selected from the group consisting of a TALE-piggyBac and TALE-Mutator.
- Site-specific nucleases such as meganucleases, ZFNs, TALENs, Argonaute proteins
- Argonaute proteins include Thermus thermophilics Argonaute (TtAgo), Pyrococcus furiosus Argonaute (PfAgo), Natronobacterium gregoryi Argonaute (NgAgo), homologs thereof, or modified versions thereof
- Cas9 nucleases Non- limiting examples of RNA-guided nucleases include Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl,
- Csf2, Csf3, Csf4, Cpfl, CasX, CasY, homologs thereof, or modified versions thereof induce a double-strand DNA break at the target site of a genomic sequence that is then repaired by the natural processes of HR or NHEJ. Sequence modifications then occur at the cleaved sites, which can include deletions or insertions that result in gene disruption in the case of NHEJ, or integration of nucleic acid sequences by HR.
- a vector or sequence of interest provided herein can comprise a nucleic acid sequence encoding a zinc finger nuclease.
- ZFNs are synthetic proteins consisting of an engineered zinc finger DNA-binding domain fused to the cleavage domain of the Fokl restriction endonuclease.
- ZFNs can be designed to cleave almost any long stretch of double- stranded DNA for modification of the zinc finger DNA-binding domain.
- ZFNs form dimers from monomers composed of a non-specific DNA cleavage domain of Fokl endonuclease fused to a zinc finger array engineered to bind a target DNA sequence.
- the DNA-binding domain of a ZFN is typically composed of 3-4 zinc-finger arrays.
- the amino acids at positions -1, +2, +3, and +6 relative to the start of the zinc finger co-helix, which contribute to site-specific binding to the target DNA, can be changed and customized to fit specific target sequences.
- the other amino acids form the consensus backbone to generate ZFNs with different sequence specificities. Rules for selecting target sequences for ZFNs are known in the art.
- the Fokl nuclease domain requires dimerization to cleave DNA and therefore two ZFNs with their C-terminal regions are needed to bind opposite DNA strands of the cleavage site (separated by 5-7 bp).
- the ZFN monomer can cut the target site if the two-ZF- binding sites are palindromic.
- ZFN as used herein, is broad and includes a monomelic ZFN that can cleave double stranded DNA without assistance from another ZFN.
- the term ZFN is also used to refer to one or both members of a pair of ZFNs that are engineered to work together to cleave DNA at the same site.
- a vector or sequence of interest provided herein can comprise a nucleic acid sequence encoding a transcription activator-like effector nuclease (TALEN).
- TALENs are artificial restriction enzymes generated by fusing the transcription activator-like effector (TALE) DNA binding domain to a nuclease domain.
- the nuclease is selected from a group consisting of PvuII, MutH, TevI and Fokl, Alwl, Mlyl, Sbfl, Sdal, Stsl, CleDORF, Clo051, PeptOVl.
- TALEN as used herein, is broad and includes a monomelic TALEN that can cleave double stranded DNA without assistance from another TALEN.
- TALEN is also used to refer to one or both members of a pair of TALENs that work together to cleave DNA at the same site.
- Transcription activator-like effectors can be engineered to bind practically any DNA sequence.
- TALE proteins are DNA-binding domains derived from various plant bacterial pathogens of the genus Xanthomonas . The X pathogens secrete TALEs into the host plant cell during infection.
- TALE moves to the nucleus, where it recognizes and binds to a specific DNA sequence in the promoter region of a specific DNA sequence in the promoter region of a specific gene in the host genome.
- TALE has a central DNA-binding domain composed of 13-28 repeat monomers of 33-34 amino acids.
- the amino acids of each monomer are highly conserved, except for hypervariable amino acid residues at positions 12 and 13.
- the two variable amino acids are called repeat-variable diresidues (RVDs).
- RVDs repeat-variable diresidues
- the amino acid pairs NI, NG, HD, and NN of RVDs preferentially recognize adenine, thymine, cytosine, and guanine/adenine, respectively, and modulation of RVDs can recognize consecutive DNA bases. This simple relationship between amino acid sequence and DNA recognition has allowed for the engineering of specific DNA binding domains by selecting a combination of repeat segments containing the appropriate RVDs.
- Fokl domains Besides the wild-type Fokl cleavage domain, variants of the Fokl cleavage domain with mutations have been designed to improve cleavage specificity and cleavage activity.
- the Fokl domain functions as a dimer, requiring two constructs with unique DNA binding domains for sites in the target genome with proper orientation and spacing. Both the number of amino acid residues between the TALEN DNA binding domain and the Fokl cleavage domain and the number of bases between the two individual TALEN binding sites are parameters for achieving high levels of activity.
- PvuII, MutH, and Tevl cleavage domains are useful alternatives to Fokl and Fokl variants for use with TALEs.
- PvuII functions as a highly specific cleavage domain when coupled to a TALE (see Yank et al. 2013. PLoS One. 8: e82539). MutH is capable of introducing strand-specific nicks in DNA (see Gabsalilow et al. 2013. Nucleic Acids Research. 41 : e83). Tevl introduces double-stranded breaks in DNA at targeted sites(see Beurdeley et al., 2013. Nature Communications. 4: 1762).
- TALE binding domain allows for designable proteins.
- Software programs such as DNA Works can be used to design TALE constructs.
- Other methods of designing TALE constructs are known to those of skill in the art. See Doyle et al, Nucleic Acids Research (2012) 40: Wl 17-122.; Cermak et al., Nucleic Acids Research (2011). 39:e82; and talent, cac. Cornell . edu/ about.
- a vector or sequence of interest provided herein can comprise a nucleic acid sequence encoding a meganuclease.
- Meganucl eases which are commonly identified in microbes, are unique enzymes with high activity and long recognition sequences (> 14 bp) resulting in site-specific digestion of target DNA.
- Engineered versions of naturally occurring meganucleases typically have extended DNA recognition sequences (for example, 14 to 40 bp).
- TALENs because the DNA recognition and cleavage functions of meganucleases are intertwined in a single domain. Specialized methods of mutagenesis and high-throughput screening have been used to create novel meganuclease variants that recognize unique sequences and possess improved nuclease activity.
- a vector provided herein can comprise any combination of a nucleic acid sequence encoding an Argonaute (non-limiting examples of Argonaute proteins include Thermus thermophilics Argonaute (TtAgo), Pyrococcus furiosus Argonaute (PfAgo), Natronobacterium gregoryi Argonaute (NgAgo), homologs thereof, or modified versions thereof), and optionally, a sequence encoding DNA guide.
- Argonaute proteins include Thermus thermophilics Argonaute (TtAgo), Pyrococcus furiosus Argonaute (PfAgo), Natronobacterium gregoryi Argonaute (NgAgo), homologs thereof, or modified versions thereof
- TtAgo Thermus thermophilics Argonaute
- PfAgo Pyrococcus furiosus Argonaute
- Natronobacterium gregoryi Argonaute Natronobacterium gregoryi Argonaute
- a vector provided herein can comprise any combination of a nucleic acid sequence encoding a RNA-guided Cas9 nuclease (non-limiting examples of RNA-guided nucleases include Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csnl and Csxl 2), CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, C
- Cas9 nucleases are part of the adaptive immune system of bacteria and archaea, protecting them against invading nucleic acids such as viruses by cleaving the foreign DNA in a sequence-dependent manner.
- the immunity is acquired by the integration of short fragments of the invading DNA known as spacers between two adjacent repeats at the proximal end of a CRISPR locus.
- the CRISPR arrays, including the spacers, are transcribed during subsequent encounters with invasive DNA and are processed into small interfering CRISPR RNAs (crRNAs) approximately 40 nt in length, which combine with the tram-activating CRISPR RNA (tracrRNA) to activate and guide the Cas9 nuclease.
- crRNAs small interfering CRISPR RNAs
- a prerequisite for cleavage is the presence of a conserved protospacer-adjacent motif (PAM) downstream of the target DNA, which usually has the sequence 5-NGG-3 but less frequently NAG.
- PAM protospacer-adjacent motif
- Specificity is provided by the so-called “seed sequence” approximately 12 bases upstream of the PAM, which must match between the RNA and target DNA.
- Cpf 1 acts in a similar manner to Cas9, but Cpfl does not require a tracrRNA.
- a nucleic acid molecule provided herein comprises at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 site-specific enzyme target sites.
- a vector or sequence of interest provided herein comprises a Cre/lox recombination site, a Flp/FRT recombination site, an endonuclease recognition site, a TALEN site, or any combination thereof.
- a vector or sequence of interest provided herein comprises a Cre recombinase or a Flp recombination system.
- a recombination system provided herein can act in cis.
- a recombination system provided herein can act in trans.
- a site-specific enzyme target site provided herein is at least 10, at least 20, at least 30, at least 40, at least 50, at least 75, at least 100, at least 125, at least 150, at least 200, at least 250, at least 300, at least 400, or at least 500 nucleotides.
- the Flp- RJ site-directed recombination system comes from the 2 ⁇ plasmid from the baker's yeast Saccharomyces cerevisiae.
- Flp recombinase flippase
- FRT sites comprise 34 nucleotides.
- Flp binds to the "arms" of the FRT sites (one arm is in reverse orientation) and cleaves the FRT site at either end of an intervening nucleic acid sequence. After cleavage, Flp recombines nucleic acid sequences between two FRT sites.
- Cre-lox is a site-directed recombination system derived from the
- Cre-lox can be used to invert a nucleic acid sequence, delete a nucleic acid sequence, or translocate a nucleic acid sequence.
- Cre recombinase recombines a pair of lox nucleic acid sequences. Lox sites comprise 34 nucleotides, with the first and last 13 nucleotides (arms) being palindromic.
- Cre recombinase protein binds to two lox sites on different nucleic acids and cleaves at the lox sites.
- a lox site provided herein is a loxP, lox 2272, loxN, lox 511, lox 5171, lox71, lox66, M2, M3, M7, or Mi l site.
- the instant disclosure provides a Rhizobiales cell comprising at least one vector that is capable of forming two essentially complementary T-strands.
- the instant disclosure provides an Agrobacterium cell comprising at least one vector that is capable of forming two essentially complementary T-strands.
- the instant disclosure provides a method of increasing the rate of site directed integration of a sequence of interest, comprising contacting a plant cell with a Rhizobiales cell capable of transforming the plant cell, where the Rhizobiales cell comprises at least one vector.
- the instant disclosure provides a method of
- transforming a plant cell comprising contacting the plant cell with a Rhizobiales cell capable of transforming the plant cell, where the Rhizobiales cell comprises at least one vector capable of forming two T-strands that are essentially complementary in at least a portion of a of the T-strands.
- a vector disclosed herein comprises a first right border DNA sequence (RBI), a second right border DNA sequence (RB2), and at least one sequence of interest, where the RBI is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3 ' end of the first T-strand; and the RB2 is positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in the anti-sense orientation from the 5' to 3' end of the second T-strand, and where the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other.
- RBI right border DNA sequence
- RB2 second right border DNA sequence
- a vector disclosed herein comprises a first right border
- a vector disclosed herein comprises a first right border
- DNA sequence (RBI), a second right border DNA sequence (RB2), at least one sequence of interest, and a left border DNA sequence (LB), where the vector is configured such that the RBI is paired with the LB, which are positioned in the vector to initiate (RBI) and terminate (LB) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the RB2 initiates synthesis of a second T- strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and where the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- a vector disclosed herein comprises a first right border
- DNA sequence (RBI), a second right border DNA sequence (RB2), at least one sequence of interest, and a left border DNA sequence (LB), where the vector is configured such that the RBI initiates synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the RB2 is paired with the LB, which are positioned in the vector to initiate (RB2) and terminate (LB) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and where the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- a vector disclosed herein comprises a first sequence of interest and a second sequence of interest, where the first sequence of interest is essentially identical to the second sequence of interest; and the vector further comprises a first right border DNA sequence (RBI) with a first left border DNA sequence (LB 1) which are positioned in the vector to initiate (RBI) and terminate (LB1) synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector further comprises a second right border DNA sequence (RB2) and a second left border DNA sequence (LB2) which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the second sequence of interest is in an anti-sense orientation relative to the first sequence of interest in the first T- strand, and where the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the first sequence of interest and the second sequence of interest.
- RBI right border DNA sequence
- a vector disclosed herein comprises a first sequence of interest and a second sequence of interest, where the first sequence of interest is essentially identical to the second sequence of interest; and the vector further comprises a first right border DNA sequence (RBI) with a left border DNA sequence (LB) which are positioned in the vector to initiate (RB I) and terminate (LB) synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector further comprises a second right border DNA sequence (RB2) which initiates synthesis of a second T-strand such that the second sequence of interest is in an anti-sense orientation relative to the first sequence of interest in the first T-strand, and where the two T- strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the first sequence of interest and the second sequence of interest.
- RBI right border DNA sequence
- LB left border DNA sequence
- RB2 second right border DNA sequence
- a vector disclosed herein comprises a first sequence of interest and a second sequence of interest, where the first sequence of interest is essentially identical to the second sequence of interest; and the vector further comprises a first right border DNA sequence (RBI) which initiates synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector further comprises a second right border DNA sequence (RB2) and a left border DNA sequence (LB) which are positioned in the vector to initiate (RB2) and terminate (LB) synthesis of a second T-strand such that the second sequence of interest is in an anti-sense orientation relative to the first sequence of interest in the first T-strand, and where the two T- strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the first sequence of interest and the second sequence of interest.
- RBI right border DNA sequence
- LB left border DNA sequence
- a vector disclosed herein comprises a first sequence of interest and a second sequence of interest, where the first sequence of interest is essentially identical to the second sequence of interest; and the vector further comprises a first right border DNA sequence (RBI) to initiate synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector further comprises a second right border DNA sequence (RB2) which initiates synthesis of a second T-strand such that the second sequence of interest is in an anti-sense orientation relative to the first sequence of interest in the first T-strand, and where the two T- strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the first sequence of interest and the second sequence of interest.
- RBI right border DNA sequence
- RB2 second right border DNA sequence
- a Rhizobiales cell disclosed herein comprises at least a first vector and a second vector, where each vector comprises essentially identical sequences of interest, and where the first vector comprises a first right border DNA sequence (RBI) and a first left border DNA sequence (LB 1) which are positioned in the first vector to initiate (RBI) and terminate (LB 1) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a second right border DNA sequence (RB2) and a second left border DNA Sequence (LB2) which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and where the two T- strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the sequence of interest.
- the first vector comprises a first right border DNA sequence (RBI)
- a Rhizobiales cell comprises at least a first vector and a second vector, where each vector comprises essentially identical sequences of interest, and where the first vector comprises a first right border DNA sequence (RBI) and a left border DNA sequence (LB) which are positioned in the first vector to initiate (RBI) and terminate (LB) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a second right border DNA sequence (RB2) which initiates synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and where the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the sequence of interest.
- RBI right border DNA sequence
- LB left border DNA sequence
- a Rhizobiales cell comprises at least a first vector and a second vector, where each vector comprises essentially identical sequences of interest, and where the first vector comprises a first right border DNA sequence (RBI) which initiates synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a second right border DNA sequence (RB2) and a left border DNA Sequence (LB) which are positioned in the second vector to initiate (RB2) and terminate (LB) synthesis of a second T- strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and where the two T-strands resulting from initiation at RB 1 and RB2 are essentially complementary to each other in at least a portion of the sequence of interest.
- RBI right border DNA sequence
- LB left border DNA Sequence
- a Rhizobiales cell comprises at least a first vector and a second vector, where each vector comprises essentially identical sequences of interest, and where the first vector comprises a first right border DNA sequence (RB 1) which initiates synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a second right border DNA sequence (RB2) which initiates synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand, and where the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the sequence of interest.
- RB 1 first right border DNA sequence
- RB2 second right border DNA sequence
- methods provided in the instant disclosure comprise two or more vectors, where the two or more vectors are in one Rhizobiales cell.
- methods provided in the instant disclosure comprise two or more vectors, where the two or more vectors are in two or more Rhizobiales cells.
- a first Rhizobiales cell comprises a first vector
- a second Rhizobiales cell comprises a second vector.
- the instant disclosure provides a method of increasing the rate of site directed integration of a sequence of interest, comprising contacting the plant cell with two or more Rhizobiales cells capable of transforming the plant cell, where the two or more Rhizobiales cells each contain one of at least two vectors capable of forming two essentially complementary T-strands.
- the instant disclosure provides a method of transforming a plant cell, comprising contacting the plant cell with two or more Rhizobiales cells capable of transforming the plant cell, where the two or more Rhizobiales cells each contain one of at least two vectors capable of forming two essentially complementary T-strands.
- the instant disclosure provides a method of increasing the rate of site directed integration of a sequence of interest, comprising contacting a plant cell with two or more Rhizobiales cells, where the two or more Rhizobiales cells each contain one of at least two vectors capable of forming two essentially complementary T-strands.
- a first Rhizobiales cell and a second Rhizobiales cell contain at least a first vector and a second vector, respectively, where each vector comprises an essentially identical sequence of interest, and where the first vector comprises a first right border DNA sequence (RBI), and where the RBI is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a second right border DNA sequence (RB2) which is positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in the anti-sense orientation relative to the sequence of interest in the first T-strand, and where the two T- strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- RBI right border DNA sequence
- a first Rhizobiales cell and a second Rhizobiales cell contain at least a first vector and a second vector, respectively, where each vector comprises an essentially identical sequence of interest, and where the first vector comprises a first right border DNA sequence (RBI) and a first left border DNA sequence (LBl) which are positioned in the first vector to initiate (RBI) and terminate (LB l) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a second right border DNA sequence (RB2) and a second left border DNA sequence (LB2) which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T- strand, and where the sequence of interest in the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other.
- the first vector comprises a first right
- the instant disclosure provides a method comprising a first Rhizobiales cell and a second Rhizobiales cell, where each Rhizobiales cell contains at least one of two vectors, where each vector comprises an essentially identical sequence of interest, and where the first vector comprises a first right border DNA sequence (RBI), and where the RBI is positioned in the vector to initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a second right border DNA sequence (RB2) which is positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in the anti-sense orientation relative to the sequence of interest in the first T-strand, and wherein the two T-strands resulting from initiation at RBI and RB2 are essentially complementary in at least a portion of the sequence of interest.
- RBI right border DNA sequence
- the instant disclosure provides a method comprising a first Rhizobiales cell and a second Rhizobiales cell, where each Rhizobiales cell contains at least one of two vectors, where each vector comprises an essentially identical sequence of interest, and where the first vector comprises a first right border DNA sequence (RBI) and a first left border DNA sequence (LB 1) which are positioned in the first vector to initiate (RB 1) and terminate (LB 1) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a second right border DNA sequence (RB2) and a second left border DNA sequence (LB2) which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T-strand, and wherein the sequence of interest in the two T-strands resulting from initiation at RBI and RB2 are essentially complementary
- the instant disclosure provides a method comprising a first Rhizobiales cell and a second Rhizobiales cell, where each Rhizobiales cell contains at least one of two vectors, where each vector comprises an essentially identical sequence of interest, and where the first vector comprises a first right border DNA sequence (RBI) and a left border DNA sequence (LB) which are positioned in the first vector to initiate (RBI) and terminate (LB) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a second right border DNA sequence (RB2) to initiate synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T- strand, and wherein the sequence of interest in the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other.
- RBI right border DNA sequence
- LB left border DNA sequence
- the instant disclosure provides a method comprising a first Rhizobiales cell and a second Rhizobiales cell, where each Rhizobiales cell contains at least one of two vectors, where each vector comprises an essentially identical sequence of interest, and where the first vector comprises a first right border DNA sequence (RBI) initiate synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector comprises a second right border DNA sequence (RB2) and a left border DNA sequence (LB) which are positioned in the second vector to initiate (RB2) and terminate (LB) synthesis of a second T- strand such that the sequence of interest is in an anti-sense orientation from the 5' to 3' end of the second T-strand, and wherein the sequence of interest in the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other.
- RBI right border DNA sequence
- LB left border DNA sequence
- At least one vector disclosed herein comprises an RB and a LB and where the vector further comprises between the RB and LB: (i) a first sequence of interest in a sense orientation relative to the RB, (ii) a spacer, and (iii) a second sequence of interest in an anti-sense orientation relative to the RB, where the first sequence of interest and the second sequence of interest are essentially complementary and after synthesis of the T-strand anneal to form a double-stranded DNA.
- the integration of at least part of the sequence of interest in a vector results in a point mutation, an insertion, a deletion, an inversion, increased
- transcription of an endogenous genomic locus decreased transcription of an endogenous genomic locus, altered protein activity, increased transcription of the sequence of interest, decreased transcription of the integrated sequence of interest, or a combination thereof.
- the sequence of interest in the vector further comprises at least one, at least two, at least three, at least four, at leave five, at least six, at least seven, at least eight, at least nine, or at least ten site-specific enzyme target sites.
- site-specific enzyme target site include, but are not limited to, a Cre/lox recombination site, a Flp/FRT recombination site, a endonuclease recognition site, and a TALEN site.
- the Rhizobiales cell is selected from an Agrobacterium spp., a Bradyrhizobium spp., & Mesorhizobium spp., an Ochrobactrum spp., & Phyllobacterium spp., a Rhizobium spp., Sinorhizobium spp, and an Ensifer spp.
- the Rhizobiales cell is an Agrobacterium cell.
- the Rhizobiales cell is an Agrobacterium tumefaciens cell.
- the Rhizobiales cell further contains a vector comprising one or more gene expression cassettes with sequence encoding a cell factor that functions to increase DNA repair, one or more Agrobacterium Ti plasmid vir genes, one or more selectable marker genes, an origin of replication, or any combination thereof.
- the instant disclosure provides a method of increasing the rate of site-directed integration of a sequence of interest in a plant genome, comprising contacting at least one plant cell on a co-culture medium for at least two days, at least 3 days, at least 4 days or at least 5 days with a Rhizobiales cell capable of transforming the plant cell, wherein the Rhizobiales cell comprises at least one vector described in the instant disclosure.
- the instant disclosure provides a method of transforming a plant genome, comprising contacting at least one plant cell on a co-culture medium for at least two days, at least 3 days, at least 4 days, or at least 5 days with at least one Rhizobiales cell capable of transforming the plant cell, where the Rhizobiales cell comprises at least one vector capable of forming two essentially complementary T-strands.
- the contacting comprises co-culturing the plant cell with a
- Rhizobiales cell for at least three days, at least four days, at least five days, at least six days, at least seven days, at least eight days, at least nine days, or at least ten days.
- the contacting comprises co-culturing the plant cell with a Rhizobiales cell for at least 48 hours, at least 49 hours, at least 50 hours, at least 51 hours, at least 52 hours, at least 53 hours, at least 54 hours, at least 55 hours, at least 56 hours, at least 57 hours, at least 58 hours, at least 59 hours, at least 60 hours, at least 61 hours, at least 62 hours, at least 63 hours, at least 64 hours, at least 65 hours, at least 66 hours, at least 67 hours, at least 68 hours, at least 69 hours, at least 70 hours, at least 71 hours, at least 72 hours, at least 73 hours, at least 74 hours, at least 75 hours, at least 76 hours, at least 77 hours, at least 78 hours, at least 79 hours, at least 80 hours,
- a Rhizobiales cell is selected from the group consisting of an Agrobacterium spp. cell, a Bradyrhizobium spp. cell, a Mesorhizobium spp. cell, an Ochrobactrum spp. cell, a Phyllobacterium spp. cell, a Rhizobium spp. cell, and a Sinorhizobium spp. cell.
- an Agrobacterium spp. cell is selected from the group consisting of an Agrobacterium tumefaciens cell and an Agrobacterium rhizogenes cell.
- the method further comprises detecting the integration of at least a fragment of the sequence of interest of the vector in the at least one plant cell.
- the at least one fragment of the sequence of interest in the vector is integrated into the plant genome by HR.
- the at least one fragment of the sequence of interest in the vector is integrated into the plant genome by HEJ.
- the method also comprises selecting the plant call based on the presence of the at least one fragment of the sequence of interest in the vector integrated into the plant genome. In a further aspect, the method further comprises regenerating a transgenic plant form the selected plant cell.
- a method or system for site-specific modification of a target nucleic acid sequence provided herein involves homologous recombination. In another aspect, a method or system for site-specific modification of a target nucleic acid sequence provided herein involves non-homologous end joining.
- a method or system for site-specific modification of a target nucleic acid sequence comprises non-homologous end joining that further comprises the introduction of an insertion and/or deletion of one or more, two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, ten or more, twenty or more, fifty or more, or one hundred or more nucleotides into the target nucleic acid sequence.
- microprojectile bombardment with particles coated with recombinant DNA are found in US Patents 5,015,580 (soybean); 5,550,318 (corn); 5,538,880 (corn); 5,914,451 (soybean);
- a method provided herein stably transforms a plant cell.
- a method provided herein transiently transforms a plant cell.
- a method of transforming a plant cell provided herein comprises a biolistic transformation or a bacteria-mediated transformation.
- a method of transforming a plant cell provided herein comprises bacteria-mediated transformation that further comprises contacting the plant cell with a Rhizobiales cell, where the Rhizobiales cell is capable of transforming the plant cell.
- Transformation methods to provide transgenic plant cells and transgenic plants containing stably integrated nucleic acid molecules provided herein are preferably practiced in tissue culture on media and in a controlled environment.
- Media refers to the numerous nutrient mixtures that are used to grow cells in vitro, that is, outside of the intact living organism.
- this disclosure provides plant cells that are not reproductive material and do not mediate the natural reproduction of the plant. In another aspect, this disclosure also provides plant cells that are reproductive material and mediate the natural reproduction of the plant. In another aspect, this disclosure provides plant cells that cannot maintain themselves via photosynthesis. In another aspect, this disclosure provides somatic plant cells. Somatic cells, contrary to germline cells, do not mediate plant reproduction.
- Recipient cell targets for transformation include, but are not limited to, a seed cell, a fruit cell, a leaf cell, a cotyledon cell, a hypocotyl cell, a meristem cell, an embryo cell, an endosperm cell, a root cell, a shoot cell, a stem cell, a pod cell, a flower cell, an
- this disclosure provides a plant chloroplast.
- this disclosure provides an epidermal cell, a stomata cell, a trichome cell, a root hair cell, a storage root cell, or a tuber cell.
- this disclosure provides a protoplast.
- this disclosure provides a plant callus cell. Any cell from which a fertile plant can be regenerated is contemplated as a useful recipient cell for practice of this disclosure.
- Callus can be initiated from various tissue sources, including, but not limited to, immature embryos or parts of embryos, seedling apical meristems, microspores, and the like. Those cells which are capable of proliferating as callus can serve as recipient cells for transformation. Practical
- transgenic plants of this disclosure e.g., various media and recipient target cells, transformation of immature embryos, and subsequent regeneration of fertile transgenic plants
- transformation methods and materials for making transgenic plants of this disclosure e.g., various media and recipient target cells, transformation of immature embryos, and subsequent regeneration of fertile transgenic plants
- a plant cell provided herein is selected from the group consisting of an Acacia cell, an alfalfa cell, an aneth cell, an apple cell, an apricot cell, an artichoke cell, an arugula cell, an asparagus cell, an avocado cell, a banana cell, a barley cell, a bean cell, a beet cell, a blackberry cell, a blueberry cell, a broccoli cell, a Brussels sprout cell, a cabbage cell, a canola cell, a cantaloupe cell, a carrot cell, a cassava cell, a cauliflower cell, a celery cell, a Chinese cabbage cell, a cherry cell, a cilantro cell, a citrus cell, a
- Clementine cell a coffee cell, a corn cell, a cotton cell, a cucumber cell, a Douglas fir cell, an eggplant cell, an endive cell, an escarole cell, an eucalyptus cell, a fennel cell, a fig cell, a forest tree cell, a gourd cell, a grape cell, a grapefruit cell, a honey dew cell, a jicama cell, kiwifruit cell, a lettuce cell, a leek cell, a lemon cell, a lime cell, a Loblolly pine cell, a mango cell, a maple tree cell, a melon cell, a mushroom cell, a nectarine cell, a nut cell, an oat cell, an okra cell, an onion cell, an orange cell, an ornamental plant cell, a papaya cell, a parsley cell, a pea cell, a peach cell, a peanut cell, a pear cell, a pepper cell,
- a plant cell provided herein is selected from the group consisting of a corn immature embryo cell, a corn mature embryo cell, a corn seed cell, a soybean immature embryo cell, a soybean mature embryo cell, a soybean seed cell, a canola immature embryo cell, a canola mature embryo cell, a canola seed cell, a cotton immature embryo cell, a cotton mature embryo cell, a cotton seed cell, a wheat immature embryo cell, a wheat mature embryo cell, a wheat seed cell, a sugarcane immature embryo cell, a sugarcane mature embryo cell, a sugarcane seed cell.
- transformation of a plant cell is performed by an Agrobacterium or other Rhizobiales-mediated method (U.S. Pat. Nos. 6,265,638, 5,731,179; U.S. Patent Application Publications US2005/0183170; 2003110532).
- the polynucleotide sequences that can be transferred into a plant cell provided herein can be present on one recombination vector in one bacterial strain being utilized for transformation.
- the polynucleotide sequences provided herein can be present on separate recombination vectors in one bacterial strain.
- the polynucleotide sequences provided herein can be found in separate bacterial cells or strains used together for transformation.
- the DNA constructs used for transformation in the methods of present disclosure generally also contain the plasmid backbone DNA segments that provide replication function and antibiotic selection in bacterial cells, for example, an Escherichia coli origin of replication such as ori322, an Agrobacterium origin of replication such as oriV or oriRi, and a coding region for a selectable marker such as Spec/Strp that encodes for Tn7 aminoglycoside adenyltransferase (aadA) conferring resistance to spectinomycin or streptomycin, or a gentamicin (Gm, Gent) selectable marker gene.
- aadA Tn7 aminoglycoside adenyltransferase
- Gm, Gent gentamicin
- the host bacterial strain is often Agrobacterium tumefaciens ABI, C58, LBA4404, AGLO, AGL1, EHA101, or EHA105 carrying a plasmid having a transfer function for the expression unit.
- Other strains known to those skilled in the art of plant transformation can function in this disclosure.
- assays include, for example, molecular biological assays (e.g., Southern and northern blotting, PCRTM); biochemical assays, such as detecting the presence of a protein product (e.g., by immunological means (ELISAs and western blots), or by enzymatic function (e.g., GUS assay)); pollen histochemistry; plant part assays, (e.g., leaf or root assays); and also, by analyzing the phenotype of the whole regenerated plant.
- molecular biological assays e.g., Southern and northern blotting, PCRTM
- biochemical assays such as detecting the presence of a protein product (e.g., by immunological means (ELISAs and western blots), or by enzymatic function (e.g., GUS assay)); pollen histochemistry; plant part assays, (e.g., leaf or root assays); and also, by analyzing the phenotype of the
- the instant disclosure also provides a transgenic plant cell comprising a sequence of interest integrated into a genome of the plant cell according to the methods disclosed herein. Also provided is a transgenic plant produced by the methods disclosed herein.
- Both control vectors contain a sequence of interest comprising three expression cassettes: an expression cassette encoding a gene (CP4-EPSPS) to confer tolerance to the herbicide glyphosate positioned between a left homology arm and a right homology arm; and two expression cassettes each encoding half of a TALEN pair.
- TALENs were obtained from Life Technologies.
- the first control vector comprised a RB DNA sequence, the sequence of interest comprising the three expression cassettes, and a LB DNA sequence, as illustrated in Figure 4 A.
- the second control vector was the same as the first control vector except that there was no LB DNA sequence. These two control vectors were used in Example 9, below.
- the vector was constructed to contain a sequence of interest comprising three expression cassettes: one expression cassette encoding a gene (CP4-EPSPS) to confer tolerance to the herbicide glyphosate positioned between a left homology arm and a right homology arm; and two expression cassettes each encoding half of a TALEN pair; and two RB DNA sequences (RBI and RB2).
- CP4-EPSPS CP4-EPSPS
- the vector configuration comprised the first RB DNA sequence (RBI) positioned in the vector to initiate synthesis of a first T-strand comprising the three expression cassettes (CP4-EPSPS and two TALENs); and the second RB DNA sequence (RB2) was positioned in the vector to initiate synthesis of a second T- strand comprising the three expression cassettes (CP4-EPSPS and two TALENs) such that the second T-strand was in an anti-sense orientation relative to the first T-strand.
- the two T- strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the sequence of interest.
- a vector comprising two RB DNA sequences, and one LB DNA sequence was created for Agrobacterium-mediated transformation ( Figure 4C).
- the vector was constructed to contain a sequence of interest comprising three expression cassettes: one expression cassette encoding a gene (CP4-EPSPS) to confer tolerance to the herbicide glyphosate positioned between a left homology arm and a right homology arm; and two expression cassettes each encoding half of a TALEN pair; and two right border DNA sequences (RBI and RB2), and one LB DNA sequence (LB 1) paired with the first RB DNA sequence (RBI).
- the vector configuration comprised the first RB DNA sequence (RBI) positioned in the vector to initiate synthesis of a first T-strand comprising the three expression cassettes (CP4-EPSPS and two TALENs) and terminate synthesis at the first LB DNA sequence (LB1) (paired with RBI); and the second RB DNA sequence (RB2) was positioned in the vector to initiate synthesis of a second T-strand comprising the three expression cassettes (CP4-EPSPS and two TALENs) such that the second T-strand was in an anti-sense orientation relative to the first T-strand.
- This vector was used in Example 9 below.
- the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the sequence of interest.
- a vector comprising at least one sequence of interest, two RB DNA
- the vector configuration comprises a first RB DNA sequence (RBI) paired with a first LB DNA sequence (LB1) which are positioned in the vector to initiate (RBI) and terminate (LB1) synthesis of a first T- strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second RB DNA sequence (RB2) is paired with a second LB DNA sequence (LB2) which are positioned in the vector to initiate synthesis of a second T-strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand.
- the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the sequence of interest.
- Example 5 Construct of a vector with two RB DNA sequences and two essentially identical sequences of interest
- a vector comprising two essentially identical sequences of interest, at least two RB DNA sequences, and optional one or more LB DNA sequences is created for
- the vector configuration comprises a first RB DNA sequence (RBI) with an optional first LB DNA sequence (LBl) which are positioned in the vector to initiate (RBI) and terminate (LBl) synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector configuration further comprises a second RB DNA sequence (RB2) and an optional second LB DNA sequence (LB2) which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the second sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand.
- the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the sequence of interest.
- the two vector configurations comprise essentially identical sequences of interest, where the first vector configuration comprises a first RB DNA sequence (RBI) and an optional first LB DNA sequence (LB 1 ) which are positioned in the first vector to initiate (RB 1 ) and terminate (LBl) synthesis of a first T-strand such that the sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the second vector configuration comprises a second RB DNA sequence (RB2) and an optional second LB DNA sequence (LB2) which are positioned in the second vector to initiate (RB2) and terminate (LB2) synthesis of a second T- strand such that the sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand.
- the first vector configuration comprises a first RB DNA sequence (RBI) and an optional first LB DNA sequence (LB 1 ) which are positioned in the first vector to initiate (RB 1 ) and terminate (LBl) synthesis of a first T-strand such that the sequence of interest is in the sense orientation
- the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the sequence of interest.
- the two vectors may be in the same Agrobacterium cell, or the two vectors may be in different Agrobacterium cells.
- Example 7. Construct of a vector with two different sequences of interest and three or more RB DNA sequences
- a vector comprising a first sequence of interest, a second sequence of interest different from the first sequence of interest, three or more RB DNA sequences, and optional one or more LB DNA sequences is created for Agrobacterium-mediated transformation.
- a vector configuration is presented in
- Figure 2C where the vector configuration comprises a first RB DNA sequence (RBI) and a first LB DNA sequence (LBl) which are positioned in the vector to initiate (RBI) and terminate (LB1) synthesis of a first T-strand such that the first sequence of interest is in the sense orientation from the 5' to 3' end of the first T-strand; and the vector configuration further comprises a second RB DNA sequence (RB2) and a second LB DNA sequence (LB2) which are positioned in the vector to initiate (RB2) and terminate (LB2) synthesis of a second T-strand such that the first sequence of interest is in an anti-sense orientation relative to the sequence of interest in the first T-strand.
- RBI RB DNA sequence
- LBl first LB DNA sequence
- the two T-strands resulting from initiation at RBI and RB2 are essentially complementary to each other in at least a portion of the sequence of interest.
- the vector configuration further comprises a third RB DNA sequence (RB3) and a third LB DNA sequence (LB3) which are positioned in the vector to initiate (RB3) and terminate (LB3) synthesis of a third T-strand such that the second sequence of interest is in the sense orientation from the 5' to 3' end of the third T-strand; and the vector configuration further comprises a fourth RB DNA sequence (RB4) and a fourth LB DNA sequence (LB4) which are positioned in the vector to initiate (RB4) and terminate (LB4) synthesis of a fourth T-strand such that the second sequence of interest is in an anti-sense orientation relative to the sequence of interest in the third T-strand.
- the two T-strands resulting from initiation at RB3 and RB4 are essentially complementary to each other in at least a portion of the sequence of interest.
- Example 8 Construct of a vector with a RB DNA sequence, a LB DNA sequence, and two sequences of interest linked by a spacer
- a vector is provided for Agrobacterium-mediated transformation where the vector configuration comprises a RB DNA sequence and a LB DNA sequence and where the vector further comprises between the RB and LB DNA sequences: (i) a first sequence of interest in a sense orientation relative to the RB DNA sequence, (ii) a spacer, and (iii) a second sequence of interest in an anti-sense orientation relative to the first sequence of interest.
- the two sequences of interest after synthesis of the T-strand, form a partially double stranded hairpin structure due to complementary base pairing.
- Figure 3 A one non-limiting example of such a vector configuration is presented in Figure 3 A.
- Example 9 Increased frequency of site-directed integration by Agrobacterium-m diat d transformation
- L7 A locus termed L7 was identified as occurring only once in a corn genome. A site was selected within the L7 locus for insertion of a transgene by site-directed
- TALENs To facilitate site-directed recombination a pair of TALENs was engineered to introduce a double-strand break at a specific site within the L7 locus.
- Vectors made according to Examples 1-3 were transformed into immature corn embryos. The vectors each comprised a sequence of interest containing three expression cassettes: two expression cassettes for expression of each one of the TALEN pair and an expression cassette containing a CP4-EPSPS transgene positioned between a left homology arm and a right homology arm.
- Agrobacterium containing one of the four vectors for 3 days then moved to callus-induction medium containing 0.1 mM glyphosate as a selection agent. Approximately 300 regenerated plants (R0 events) were selected for each vector, transferred to plugs, and grown in a greenhouse. Genomic DNA was extracted from R0 leaf tissue after one week of greenhouse growth, and individual plants were molecularly assessed with real time PCR for (a) the presence of the transgene (CP4-EPSPS) copy number, and (b) targeting sequence copy number.
- CP4-EPSPS transgene
- PCR with genomic DNA from of selected R0 plants was used to identify individual plants comprising CP4-EPSPS cassette insertions at the L7 target site.
- PCR primers were designed such that a product was only produced when the CP4-EPSPS cassette inserted into the L7 targeted region of the corn genome; one PCR primer was located in genomic DNA flanking the targeted region, and one PCR primer was located within the CP4- EPSPS cassette.
- Two sets of PCR primers were used, one positioned on the 5' end of the CP4-EPSPS cassette and one positioned on the 3' end of the CP4-EPSPS cassette.
- Figure 5 shows the positions of the primers used to identify CP4-EPSPS cassette insertions in the L7 locus.
- the 5' set of PCR primers amplifies a 2676 bp product when the CP4-EPSPS cassette is inserted in the targeted L7 locus; no product is produced in wild type genomic DNA or if the CP4-EPSPS cassette is not inserted into the L7 target site.
- the 3' set of PCR primers amplifies a 2282 bp product when the CP4-EPSPS is inserted in the targeted L7 locus; no product is produced in wild type genomic DNA or if the CP4-EPSPS cassette is not inserted into the L7 target site.
- the products were resolved on an agarose gel to identify plants with the correct sized bands for both the 5' end and the 3' end of the CP4-EPSPS cassette.
- plants with at least one positive PCR result for site directed insertion of the CP4-EPSPS cassette were selected for Southern blot analysis.
- Genomic DNA extracted from the plant was digested with the restriction endonuclease Kpnl.
- Digestion with Kpnl produces a 9.6 kb fragment when the CP4-EPSPS cassette is integrated into the L7 target site (see Figure 5).
- Digestion with Kpnl produces a 5.1 kb DNA fragment corresponding to the L7 genomic locus, for example in the wild-type non-transformed plant or a plant where the CP4-EPSPS cassette has not integrated into the L7 locus (see Figure 5).
- the Southern blot was probed with a "left flank probe" as shown in Figure 5.
- TALEN- mediated site-directed integration using Agrobacterium vectors comprising either two RB DNA sequences (vector C) or two RB DNA sequences and one LB DNA sequence (vector D) had superior efficacy compared to standard Agrobacterium vectors comprising either one RB DNA sequence and one LB DNA sequence (vector A), or one RB DNA sequence (vector B).
- the data indicate the unexpected observation of an increased frequency of site-directed integration by NHEJ or HR when the vector comprised two RB DNA sequences positioned to generate essentially complementary T-strands.
- the data further indicated an increased number of events generated by HR when the vector comprised two RB DNA sequences positioned to generate essentially complementary T-strands (see Table 2).
- Table 2 Summary of border configurations and site-directed integration (SDI) frequencies. Pos., positive; L, left; R, right; HEJ, non-homologous end-joining; HR, homologous recombination.
- Example 10 Retargeting a gene to a pre-existing lox site
- a vector with two RB DNA sequences is used to facilitate site-directed integration of a sequence of interest into a pre-existing recombination site (e.g. a lox site) in genomic DNA.
- the vector is created comprising, from 5' to 3 ', a first right RB DNA sequence (RB I); P-35S-crtB; a first lox site; a gene; a first recombination site (optional); a marker gene to select for transformants; a second recombination site (optional); a second lox site; P-DaMV-Cre; and a second RB DNA sequence (RB2).
- the RB I and RB2 are positioned in the vector to generate essentially complementary T-strands.
- P-35S-crtB is a non-lethal, constitutively expressed phytoene synthase expression cassette that inhibits shoot elongation in cells where non-targeted recombination occurs.
- P-DaMV-Cre is a constitutively expressed Cre-recombinase that promotes recombination of the intervening DNA construct with the targeted genomic lox site.
- the lox sites of the intervening DNA recombine with a pre-existing genomic DNA lox site and insert into the genomic DNA.
- a target after transformation and recombination contains only the regions of the intervening DNA construct between the two lox sites (in this example, a gene; a first recombination site; a marker gene; and a second recombination site).
- Optional recombination sites are used to remove the marker gene in a future recombination event.
- Agrobacterium containing a vector is typically incubated with the immature corn embryos for 12-16 hours.
- the unexpected observation detailed herein was that extended co-culture of the Agrobacterium containing a vector with the immature corn embryos resulted in a higher percentage of targeted integrations of the sequence of interest contained in the vector while not significantly affecting transformation frequency.
- Vector A ( Figure 4A, Example 1) was used for Agrobacterium-mediated transformation using standard protocols except that the co-culture time was varied. About 3000 immature corn embryos were co-cultured for each of 1 day, 2 days, or 3 days. After transformation, the transformants were moved to callus-induction medium containing 0.1 mM glyphosate as a selection agent. R0 plants surviving glyphosate selection were transferred to plugs and grown in a greenhouse.
- Example 9 The PCR protocol and primers described in Example 9 were used with the genomic DNA from selected R0 plants from the extended co-culture protocol to identify individual plants comprising targeted integration of the CP4-EPSPS cassette. As described in Example 9, and illustrated in Figure 5, two sets of PCR primers were used to detect targeted integration of the CP4-EPSPS cassette at the L7 locus. To further verify whether the junction sequences at the targeted sites were perfect (e.g., HR) or imperfect (e.g., NHEJ), the PCR products were sequenced. Table 5 shows that the highest number of positive PCR results for either the left flank, the right flank, or both the left and right flanks was with the 3 -day co- culture.
- HR perfect
- NHEJ imperfect
- Example 12 Increased frequency of site-directed integration of a Sequence of Interest flanked by RBs and optional LBs.
- CP4-EPSPS CP4-EPSPS
- a pair of TALENs was engineered to introduce a double-strand break at a specific site within the L7 locus.
- a control vector (Vector 6A) was created for Agrobacterium-mediated transformation (see Figure 6A).
- the control vector comprised a first sequence of interest comprising an expression cassette encoding CP4-EPSPS positioned between a first RB DNA sequence (RBI) and a first LB DNA sequence (LB1) such that T-strand synthesis initiated at RBI and terminated at LB1.
- the vector further comprised a second sequence of interest comprising two expression cassettes each encoding half of a TALEN pair positioned between a second RB DNA sequence (RB2) and a second LB DNA sequence (LB2) such that T-strand synthesis initiated at RB2 and terminated at LB2.
- TALENs were obtained from Life
- a second vector was created for Agrobacterium-mediated transformation (see Figure 6B).
- This vector comprised a first sequence of interest comprising an expression cassette encoding CP4-EPSPS positioned between two RB DNA sequences and two LB DNA sequences.
- the vector further comprised a second sequence of interest comprising two expression cassettes each encoding half of a TALEN pair.
- the vector configuration comprised a first RB DNA sequence (RBI) paired with a first LB DNA sequence (LB 1) which were positioned in the vector to initiate (RBI) and terminate (LB 1) synthesis of a first T-strand comprising CP4-EPSPS in an anti-sense orientation from the 5' to 3 ' end of the first T-strand; and the second RB DNA sequence (RB2) was paired with a second LB DNA sequence (LB2) which were positioned in the vector to initiate synthesis of a second T-strand such that CP4-EPSPS was in sense orientation from the 5' to 3' end of the second T-strand.
- RBI was also positioned such that read through of the LB1 would result in the synthesis of a T-strand comprising both the CP4-EPSPS cassette and the two TALEN cassettes.
- the T-strands resulting from initiation at RBI and RB2 were essentially complementary to each other in at least a portion of the first sequence of interest.
- callus-induction medium containing 0.1 mM glyphosate as a selection agent.
- genomic DNA was isolated from selected R0 plants and PCR assays were carried out to identify individual plants comprising CP4-EPSPS cassette insertions at the L7 target site.
- PCR primers were designed such that a product was only produced when the CP4-EPSPS cassette inserted into the L7 target region of the corn genome.
- PCR protocol and primers described in Example 9, and illustrated in Figure 5 were used to identify individual plants comprising targeted integration of the CP4-EPSPS cassette.
- the PCR products were resolved on agarose gels to identify plants with the correct sized bands for both the 5' end and the 3 ' end of the CP4- EPSPS cassette.
- Vector 6A of the 180 RO plants that were screened by PCR, none were positive for the left flank and 2 were positive for the right flank.
- vector 6B of the 371 RO plants that were screened by PCR, 2 were positive for the left flank and 10 were positive for the right flank.
- Example 13 Increased frequency of site-directed integration of a Sequence of interest flanked by RBs and no LBs.
- CP4-EPSPS CP4-EPSPS
- a pair of TALENs was engineered to introduce a double-strand break at a specific site within the L7 locus.
- a first vector was created for Agrobacterium-mediated transformation.
- the vector comprised a first sequence of interest comprising an expression cassette encoding CP4-EPSPS with a TALEN target site (TS) positioned 5' to the cassette.
- the first sequence of interest was flanked by a first RB DNA sequence (RBI) and a second RB DNA sequence (RB2) positioned such that T-strand synthesis initiated at both RBI and RB2 and the resulting T-strands were essentially complementary to each other in at least a portion of the first sequence of interest.
- RBI first RB DNA sequence
- RB2 second RB DNA sequence
- the vector further comprised a second sequence of interest comprising two expression cassettes each encoding half of a TALEN pair, where the second sequence of interest was positioned adjacent to a third RB DNA sequence (RB3) so as to initiate synthesis of a third T-strand that comprised the two TALEN cassettes.
- TALENs were obtained from Life Technologies
- a second vector was created for Agrobacterium-mediated transformation.
- the vector comprised a first sequence of interest comprising an expression cassette encoding CP4-EPSPS flanked by TALEN target sites (TS). Additionally, the first sequence of interest was positioned between a first RB DNA sequence (RB 1) and a second RB DNA sequence (RB2) such that T-strand synthesis initiated at both RBI and RB2 and the resulting T-strands were essentially complementary to each other in at least a portion of the first sequence of interest.
- RB 1 first RB DNA sequence
- RB2 second RB DNA sequence
- the vector further comprised a second sequence of interest comprising two expression cassettes each encoding half of a TALEN pair positioned adjacent to a third RB DNA sequence (RB3) so as to initiate synthesis of a third T-strand that comprised the two TALEN cassettes.
- TALENs were obtained from Life Technologies.
- a third vector (Vector 7C) was created for Agrobacterium-mediated transformation.
- the vector configuration is presented in Figure 7C.
- the vector comprised a first sequence of interest comprising an expression cassette encoding CP4-EPSPS positioned between a left homology arm and a right homology arm.
- the first sequence of interest was positioned between a first RB DNA sequence (RBI) and a second RB DNA sequence (RB2) such that T-strand synthesis initiated at both RBI and RB2 and the resulting T-strands were essentially complementary to each other in at least a portion of the first sequence of interest.
- the vector further comprised a second sequence of interest comprising two expression cassettes each encoding half of a TALEN pair positioned adjacent to a third RB DNA sequence (RB3) so as to initiate synthesis of a third T-strand that comprised the two TALEN cassettes.
- TALENs were obtained from Life Technologies.
- Agrobacterium containing one of the three vectors for 3 days then moved to callus-induction medium containing 0.1 mM glyphosate as a selection agent. Approximately 200 regenerated plants (R0 events) were selected for each vector, transferred to plugs, and grown in a greenhouse.
- genomic DNA was isolated from selected R0 plants and PCR assays were carried out to identify individual plants comprising CP4-EPSPS cassette insertions at the L7 target site.
- PCR primers were designed such that a product was only produced when the CP4-EPSPS cassette inserted into the targeted L7 region of the corn genome.
- PCR protocol and primers described in Example 9, and illustrated in Figure 5 were used to identify individual plants comprising targeted integration of the CP4-EPSPS cassette.
- the products were resolved on agarose gels to identify plants with the correct sized bands for both the 5' end and the 3' end of the CP4-EPSPS cassette.
- Vector 7A of the 226 R0 plants that were screened by PCR, one was positive for the left flank and 4 were positive for the right flank.
- Vector 7B of the 292 R0 plants that were screened by PCR, 2 were positive for the left flank and 26 were positive for the right flank.
- One plant was positive for both right and left PCR.
- Vector 7C of the 241 R0 plants that were screened by PCR, 7 were positive for the left flank and 7 were positive for the right flank. Four plants were positive for both right and left flank PCR products.
- the data indicate that TALEN-mediated site-directed integration (SDI) using an Agrobacterium vector comprising two TALEN target sites and two RB DNA sequences flanking a cargo sequence of interest (Vector 7B) had superior efficacy compared to an Agrobacterium vector comprising only 1 target site and two RB DNA sequences flanking the cargo sequence (Vector 7A).
- the data further indicate an increased number of full integration events generated by FIR when the vector comprised a cargo sequence flanked by homology arms and two RB DNA sequences (Vector 7C).
- Agrobacterium comprising a transformation vector is typically incubated with immature corn embryos for 12-16 hours.
- the unexpected observation detailed herein was that prolonging the co-culture time of the Agrobacterium comprising a vector with immature corn embryos to three days resulted in a higher number of events that had targeted integration of the sequence of interest contained in the vector.
- a vector depicted in Figure 4A and Example 1 was used for Agrobacterium- mediated transformation using standard protocols except that the co-culture time was varied. Explants were co-cultured with Agrobacterium for 1, 3, 5, or 7 days. Transformation of immature embryos was conducted on eight different experimental dates. One and three day co-culture treatments were conducted on alternating dates (a total of 4 experimental treatment dates each), and an average of 1086 and 1132 explants were treated per experiment, respectively. Five and seven day treatments were conducted on each of the eight
- PCR protocol and primers described in Example 9 were used with the genomic DNA from RO plants from the extended co-culture protocol to identify individual plants comprising targeted integration of the CP4- EPSPS cassette. As described in Example 9, and illustrated in Figure 5, two sets of PCR primers were used to detect targeted integration of the CP4-EPSPS cassette at the L7 locus.
- the number of site directed integration events (determined by positive flank PCR results for either or both flanks) detected per experimental treatment was divided by the number of explants recovered to estimate the SDI frequencies.
- the data were analyzed using a mixed linear model. Table 8 shows that the highest number of positive PCR results for either the left flank, the right flank, or both the left and right flanks was with the 3 -day co-culture.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Chemical & Material Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Cell Biology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201662355715P | 2016-06-28 | 2016-06-28 | |
PCT/US2017/039502 WO2018005491A1 (fr) | 2016-06-28 | 2017-06-27 | Procédés et compositions destinés à être utilisés dans la modification du génome de plantes |
Publications (2)
Publication Number | Publication Date |
---|---|
EP3475427A1 true EP3475427A1 (fr) | 2019-05-01 |
EP3475427A4 EP3475427A4 (fr) | 2019-11-06 |
Family
ID=60786786
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP17821073.8A Pending EP3475427A4 (fr) | 2016-06-28 | 2017-06-27 | Procédés et compositions destinés à être utilisés dans la modification du génome de plantes |
Country Status (4)
Country | Link |
---|---|
US (2) | US11732268B2 (fr) |
EP (1) | EP3475427A4 (fr) |
CN (1) | CN109642235B (fr) |
WO (1) | WO2018005491A1 (fr) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11168319B1 (en) | 2017-02-28 | 2021-11-09 | Inari Agriculture Technology, Inc. | Plant cell culture |
WO2019144124A1 (fr) | 2018-01-22 | 2019-07-25 | Inari Agriculture, Inc. | Systèmes, procédés et compositions d'édition de gènes végétaux |
AU2019275110A1 (en) | 2018-05-24 | 2020-10-22 | Monsanto Technology Llc | Genome editing in plants |
EP3802810A4 (fr) | 2018-05-24 | 2022-03-09 | Monsanto Technology LLC | Édition génomique dans des plantes |
WO2020176972A1 (fr) * | 2019-03-06 | 2020-09-10 | Plantform Corporation | Vecteurs d'adn-t ayant des séquences 5' modifiées en amont d'enzymes de modification post-traductionnelles et leurs procédés d'utilisation |
US11746354B2 (en) | 2019-07-19 | 2023-09-05 | Inari Agriculture Technology, Inc. | Homology dependent repair genome editing |
AU2021378791A1 (en) * | 2020-11-11 | 2023-06-15 | Monsanto Technology Llc | Methods to improve site-directed integration frequency |
EP4263813A1 (fr) * | 2020-12-15 | 2023-10-25 | Monsanto Technology LLC | Procédés et compositions pour plants de petite taille par manipulation du métabolisme de la gibbérelline |
Family Cites Families (138)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5094945A (en) | 1983-01-05 | 1992-03-10 | Calgene, Inc. | Inhibition resistant 5-enolpyruvyl-3-phosphoshikimate synthase, production and use |
US5352605A (en) | 1983-01-17 | 1994-10-04 | Monsanto Company | Chimeric genes for transforming plant cells using viral promoters |
US5034322A (en) | 1983-01-17 | 1991-07-23 | Monsanto Company | Chimeric genes suitable for expression in plant cells |
DE3587718T2 (de) | 1984-03-06 | 1994-08-04 | Mgi Pharma Inc | Herbizide Resistenz in Pflanzen. |
US4761373A (en) | 1984-03-06 | 1988-08-02 | Molecular Genetics, Inc. | Herbicide resistance in plants |
US6774283B1 (en) | 1985-07-29 | 2004-08-10 | Calgene Llc | Molecular farming |
JPH0795952B2 (ja) | 1986-01-08 | 1995-10-18 | カルジ−ン・インコ−ポレイテツド | ハロアリ−ルニトリル分解遺伝子、その使用および該遺伝子含有細胞 |
US4810648A (en) | 1986-01-08 | 1989-03-07 | Rhone Poulenc Agrochimie | Haloarylnitrile degrading gene, its use, and cells containing the gene |
DE3765449D1 (de) | 1986-03-11 | 1990-11-15 | Plant Genetic Systems Nv | Durch gentechnologie erhaltene und gegen glutaminsynthetase-inhibitoren resistente pflanzenzellen. |
US5188958A (en) | 1986-05-29 | 1993-02-23 | Calgene, Inc. | Transformation and foreign gene expression in brassica species |
US5750871A (en) | 1986-05-29 | 1998-05-12 | Calgene, Inc. | Transformation and foreign gene expression in Brassica species |
US5276268A (en) | 1986-08-23 | 1994-01-04 | Hoechst Aktiengesellschaft | Phosphinothricin-resistance gene, and its use |
US5637489A (en) | 1986-08-23 | 1997-06-10 | Hoechst Aktiengesellschaft | Phosphinothricin-resistance gene, and its use |
US5273894A (en) | 1986-08-23 | 1993-12-28 | Hoechst Aktiengesellschaft | Phosphinothricin-resistance gene, and its use |
US5378824A (en) | 1986-08-26 | 1995-01-03 | E. I. Du Pont De Nemours And Company | Nucleic acid fragment encoding herbicide resistant plant acetolactate synthase |
US5605011A (en) | 1986-08-26 | 1997-02-25 | E. I. Du Pont De Nemours And Company | Nucleic acid fragment encoding herbicide resistant plant acetolactate synthase |
US5013659A (en) | 1987-07-27 | 1991-05-07 | E. I. Du Pont De Nemours And Company | Nucleic acid fragment encoding herbicide resistant plant acetolactate synthase |
US5004863B2 (en) | 1986-12-03 | 2000-10-17 | Agracetus | Genetic engineering of cotton plants and lines |
US5015580A (en) | 1987-07-29 | 1991-05-14 | Agracetus | Particle-mediated transformation of soybean plants and lines |
US5359142A (en) | 1987-01-13 | 1994-10-25 | Monsanto Company | Method for enhanced expression of a protein |
US5322938A (en) | 1987-01-13 | 1994-06-21 | Monsanto Company | DNA sequence for enhancing the efficiency of transcription |
CN87100603A (zh) | 1987-01-21 | 1988-08-10 | 昂科公司 | 抗黑素瘤疫苗 |
US5229114A (en) | 1987-08-20 | 1993-07-20 | The United States Of America As Represented By The Secretary Of Agriculture | Approaches useful for the control of root nodulation of leguminous plants |
US5416011A (en) | 1988-07-22 | 1995-05-16 | Monsanto Company | Method for soybean transformation and regeneration |
US5597718A (en) | 1988-10-04 | 1997-01-28 | Agracetus | Genetically engineering cotton plants for altered fiber |
ES2164633T3 (es) | 1989-02-24 | 2002-03-01 | Monsanto Technology Llc | Genes vegetales sinteticos y procedimiento para su preparacion. |
US5106739A (en) | 1989-04-18 | 1992-04-21 | Calgene, Inc. | CaMv 355 enhanced mannopine synthase promoter and method for using same |
US5217902A (en) | 1989-05-26 | 1993-06-08 | Dna Plant Technology Corporation | Method of introducing spectinomycin resistance into plants |
US5550318A (en) | 1990-04-17 | 1996-08-27 | Dekalb Genetics Corporation | Methods and compositions for the production of stably transformed, fertile monocot plants and cells thereof |
US7705215B1 (en) | 1990-04-17 | 2010-04-27 | Dekalb Genetics Corporation | Methods and compositions for the production of stably transformed, fertile monocot plants and cells thereof |
US5689041A (en) | 1989-08-10 | 1997-11-18 | Plant Gentic Systems N.V. | Plants modified with barstar for fertility restoration |
US6051753A (en) | 1989-09-07 | 2000-04-18 | Calgene, Inc. | Figwort mosaic virus promoter and uses |
ES2150900T3 (es) | 1989-10-31 | 2000-12-16 | Monsanto Co | Promotor para plantas transgenicas. |
US5641876A (en) | 1990-01-05 | 1997-06-24 | Cornell Research Foundation, Inc. | Rice actin gene and promoter |
US5484956A (en) | 1990-01-22 | 1996-01-16 | Dekalb Genetics Corporation | Fertile transgenic Zea mays plant comprising heterologous DNA encoding Bacillus thuringiensis endotoxin |
CA2074355C (fr) | 1990-01-22 | 2008-10-28 | Ronald C. Lundquist | Plants de mais transgeniques fertiles |
US6426447B1 (en) | 1990-11-14 | 2002-07-30 | Monsanto Technology Llc | Plant seed oils |
US5837848A (en) | 1990-03-16 | 1998-11-17 | Zeneca Limited | Root-specific promoter |
US5543576A (en) | 1990-03-23 | 1996-08-06 | Mogen International | Production of enzymes in seeds and their use |
US5969214A (en) | 1990-06-11 | 1999-10-19 | Calgene, Inc. | Glycogen biosynthetic enzymes in plants |
US5498830A (en) | 1990-06-18 | 1996-03-12 | Monsanto Company | Decreased oil content in plant seeds |
JP3325022B2 (ja) | 1990-06-18 | 2002-09-17 | モンサント カンパニー | 植物中の増加された澱粉含量 |
CA2083948C (fr) | 1990-06-25 | 2001-05-15 | Ganesh M. Kishore | Plantes tolerantes au glyphosate |
US6483008B1 (en) | 1990-08-15 | 2002-11-19 | Calgene Llc | Methods for producing plants with elevated oleic acid content |
US5633435A (en) | 1990-08-31 | 1997-05-27 | Monsanto Company | Glyphosate-tolerant 5-enolpyruvylshikimate-3-phosphate synthases |
CA2096843C (fr) | 1990-11-23 | 2007-08-07 | Kathleen D'halluin | Procede pour la transformation de plantes monocotyledones |
US5512466A (en) | 1990-12-26 | 1996-04-30 | Monsanto Company | Control of fruit ripening and senescence in plants |
US5767366A (en) | 1991-02-19 | 1998-06-16 | Louisiana State University Board Of Supervisors, A Governing Body Of Louisiana State University Agricultural And Mechanical College | Mutant acetolactate synthase gene from Ararbidopsis thaliana for conferring imidazolinone resistance to crop plants |
US5591616A (en) | 1992-07-07 | 1997-01-07 | Japan Tobacco, Inc. | Method for transforming monocotyledons |
US6011199A (en) | 1992-12-15 | 2000-01-04 | Commonwealth Scientific | Method for producing fruiting plants with improved fruit flavour |
US6414222B1 (en) | 1993-02-05 | 2002-07-02 | Regents Of The University Of Minnesota | Gene combinations for herbicide tolerance in corn |
US5362865A (en) | 1993-09-02 | 1994-11-08 | Monsanto Company | Enhanced expression in plants using non-translated leader sequences |
AU7925094A (en) | 1993-09-30 | 1995-04-18 | Agracetus, Inc. | Transgenic cotton plants producing heterologous peroxidase |
JP3102888B2 (ja) | 1993-12-08 | 2000-10-23 | 日本たばこ産業株式会社 | 植物の形質転換方法及びそのためのベクター |
US6828475B1 (en) | 1994-06-23 | 2004-12-07 | Calgene Llc | Nucleic acid sequences encoding a plant cytoplasmic protein involved in fatty acyl-CoA metabolism |
US6080560A (en) | 1994-07-25 | 2000-06-27 | Monsanto Company | Method for producing antibodies in plant cells |
US6140075A (en) | 1994-07-25 | 2000-10-31 | Monsanto Company | Method for producing antibodies and protein toxins in plant cells |
US5750876A (en) | 1994-07-28 | 1998-05-12 | Monsanto Company | Isoamylase gene, compositions containing it, and methods of using isoamylases |
US5633437A (en) | 1994-10-11 | 1997-05-27 | Sandoz Ltd. | Gene exhibiting resistance to acetolactate synthase inhibitor herbicides |
US5631152A (en) | 1994-10-26 | 1997-05-20 | Monsanto Company | Rapid and efficient regeneration of transgenic plants |
US5716837A (en) | 1995-02-10 | 1998-02-10 | Monsanto Company | Expression of sucrose phosphorylase in plants |
US6051409A (en) * | 1995-09-25 | 2000-04-18 | Novartis Finance Corporation | Method for achieving integration of exogenous DNA delivered by non-biological means to plant cells |
US6946588B2 (en) | 1996-03-13 | 2005-09-20 | Monsanto Technology Llc | Nucleic acid encoding a modified threonine deaminase and methods of use |
US5958745A (en) | 1996-03-13 | 1999-09-28 | Monsanto Company | Methods of optimizing substrate pools and biosynthesis of poly-β-hydroxybutyrate-co-poly-β-hydroxyvalerate in bacteria and plants |
US6091002A (en) | 1996-03-13 | 2000-07-18 | Monsanto Company | Polyhydroxyalkanoates of narrow molecular weight distribution prepared in transgenic plants |
US6166292A (en) | 1996-04-26 | 2000-12-26 | Ajinomoto Co., Inc. | Raffinose synthetase gene, method of producing raffinose and transgenic plant |
AR006928A1 (es) | 1996-05-01 | 1999-09-29 | Pioneer Hi Bred Int | Una molecula de adn aislada que codifica una proteina fluorescente verde como marcador rastreable para la transformacion de plantas, un metodo para laproduccion de plantas transgenicas, un vector de expresion, una planta transgenica y celulas de dichas plantas. |
US5985605A (en) | 1996-06-14 | 1999-11-16 | Her Majesty The Queen In Right Of Canada, As Represented By The Dept. Of Agriculture & Agri-Food Canada | DNA sequences encoding phytases of ruminal microorganisms |
US5998700A (en) | 1996-07-02 | 1999-12-07 | The Board Of Trustees Of Southern Illinois University | Plants containing a bacterial Gdha gene and methods of use thereof |
US5850019A (en) | 1996-08-06 | 1998-12-15 | University Of Kentucky Research Foundation | Promoter (FLt) for the full-length transcript of peanut chlorotic streak caulimovirus (PCLSV) and expression of chimeric genes in plants |
US5750848A (en) | 1996-08-13 | 1998-05-12 | Monsanto Company | DNA sequence useful for the production of polyhydroxyalkanoates |
CA2236770A1 (fr) | 1996-09-05 | 1998-03-12 | Unilever Plc | Promoteur inductible en presence de sel pouvant etre derive d'une bacterie lactique et son utilisation dans une bactere lactique pour la production d'une proteine requise |
WO1998018949A2 (fr) | 1996-10-29 | 1998-05-07 | Calgene Llc | Cellulose synthase vegetale et sequences promotrices |
AU720780B2 (en) | 1997-01-20 | 2000-06-15 | Plant Genetic Systems N.V. | Pathogen-induced plant promoters |
US5922564A (en) | 1997-02-24 | 1999-07-13 | Performance Plants, Inc. | Phosphate-deficiency inducible promoter |
US6040497A (en) | 1997-04-03 | 2000-03-21 | Dekalb Genetics Corporation | Glyphosate resistant maize lines |
US7105724B2 (en) | 1997-04-04 | 2006-09-12 | Board Of Regents Of University Of Nebraska | Methods and materials for making and using transgenic dicamba-degrading organisms |
US6171640B1 (en) | 1997-04-04 | 2001-01-09 | Monsanto Company | High beta-conglycinin products and their use |
AR013633A1 (es) | 1997-04-11 | 2001-01-10 | Calgene Llc | METODO PARA LA ALTERACIoN DE LA COMPOSICIoN DE ÁCIDOS GRASOS DE CADENA MEDIA EN SEMILLAS VEGETALES QUE EXPRESAN UNA TIOESTERASA QUE PREFIERE CADENA MEDIA VEGETAL HETERoLOGA. |
US5972664A (en) | 1997-04-11 | 1999-10-26 | Abbott Laboratories | Methods and compositions for synthesis of long chain poly-unsaturated fatty acids |
US6380466B1 (en) | 1997-05-08 | 2002-04-30 | Calgene Llc | Production of improved rapeseed exhibiting yellow-seed coat |
CN1255541C (zh) | 1997-06-05 | 2006-05-10 | 卡尔金有限责任公司 | 脂酰辅酶a:脂肪醇酰基转移酶 |
WO1999001563A1 (fr) * | 1997-06-30 | 1999-01-14 | Mogen International N.V. | Plasmides servant a transformer des plantes et leur procede d'utilisation |
IL122270A0 (en) | 1997-11-20 | 1998-04-05 | Yeda Res & Dev | DNA molecules conferring to plants resistance to a herbicide and plants transformed thereby |
US6072103A (en) | 1997-11-21 | 2000-06-06 | Calgene Llc | Pathogen and stress-responsive promoter for gene expression |
US6653530B1 (en) | 1998-02-13 | 2003-11-25 | Calgene Llc | Methods for producing carotenoid compounds, tocopherol compounds, and specialty oils in plant seeds |
IL137410A0 (en) | 1998-02-19 | 2001-07-24 | Cotton Inc | A method for the production of transgenic plants using apical shoot tips |
US6051756A (en) | 1998-02-26 | 2000-04-18 | Cargill, Incorporated | Particle bombardment transformation of Brassica |
WO1999043819A1 (fr) | 1998-02-26 | 1999-09-02 | Pioneer Hi-Bred International, Inc. | Famille de genes pr-1 et de promoteurs |
WO1999043797A2 (fr) | 1998-02-26 | 1999-09-02 | Pioneer Hi-Bred International, Inc. | Promoteurs constitutifs du mais |
US5914451A (en) | 1998-04-06 | 1999-06-22 | Monsanto Company | Efficiency soybean transformation protocol |
US6635806B1 (en) | 1998-05-14 | 2003-10-21 | Dekalb Genetics Corporation | Methods and compositions for expression of transgenes in plants |
US6307123B1 (en) | 1998-05-18 | 2001-10-23 | Dekalb Genetics Corporation | Methods and compositions for transgene identification |
MXPA00012100A (es) | 1998-06-05 | 2003-04-22 | Calgene Llc | Secuencias de acidos nucleicos relacionados con acil coenzima a: colesterol aciltransferasa. |
WO1999064614A2 (fr) | 1998-06-12 | 1999-12-16 | Calgene Llc | Acides gras polyinsatures vegetaux |
JP4514952B2 (ja) | 1998-07-02 | 2010-07-28 | カルジーン エルエルシー | ジアシルグリセロールアシルトランスフェラーゼタンパク質 |
ATE360072T1 (de) | 1998-07-10 | 2007-05-15 | Calgene Llc | Expression eukarotischer peptide in pflanzenplastiden |
JP2000083680A (ja) | 1998-07-16 | 2000-03-28 | Nippon Paper Industries Co Ltd | 光誘導型プロモ―タ―の制御下に置かれた不定芽再分化遺伝子を選抜マ―カ―遺伝子とする植物への遺伝子導入方法及びこれに用いる植物への遺伝子導入用ベクタ― |
US6476294B1 (en) | 1998-07-24 | 2002-11-05 | Calgene Llc | Plant phosphatidic acid phosphatases |
WO2000007430A2 (fr) | 1998-08-04 | 2000-02-17 | Cargill, Incorporated | Promoteurs de desaturase d'acides gras vegetaux |
US6365802B2 (en) | 1998-08-14 | 2002-04-02 | Calgene Llc | Methods for increasing stearate content in soybean oil |
EP1117816B1 (fr) * | 1998-10-01 | 2005-12-21 | Pioneer Hi-Bred International, Inc. | Procede de transformation de vegetaux |
US6531648B1 (en) | 1998-12-17 | 2003-03-11 | Syngenta Participations Ag | Grain processing method and transgenic plants useful therein |
EP1141346A2 (fr) | 1999-01-14 | 2001-10-10 | Monsanto Co. | Procede de transformation de soja |
JP2002541851A (ja) | 1999-04-15 | 2002-12-10 | カルジーン エルエルシー | イソプレノイド合成に関与するタンパク質の核酸配列 |
US6232526B1 (en) | 1999-05-14 | 2001-05-15 | Dekalb Genetics Corp. | Maize A3 promoter and methods for use thereof |
US6207879B1 (en) | 1999-05-14 | 2001-03-27 | Dekalb Genetics Corporation | Maize RS81 promoter and methods for use thereof |
US6194636B1 (en) | 1999-05-14 | 2001-02-27 | Dekalb Genetics Corp. | Maize RS324 promoter and methods for use thereof |
US6429357B1 (en) | 1999-05-14 | 2002-08-06 | Dekalb Genetics Corp. | Rice actin 2 promoter and intron and methods for use thereof |
WO2000075350A2 (fr) | 1999-06-08 | 2000-12-14 | Calgene Llc | Sequences d'acide nucleique codant des proteines impliquees dans la beta-oxydation des acides gras et procedes d'utilisation |
US6770465B1 (en) | 1999-06-09 | 2004-08-03 | Calgene Llc | Engineering B-ketoacyl ACP synthase for novel substrate specificity |
US7105730B1 (en) | 1999-07-12 | 2006-09-12 | Monsanto Technology L.L.C. | Nucleic acid molecules and other molecules associated with sterol synthesis and metabolism |
US6603061B1 (en) | 1999-07-29 | 2003-08-05 | Monsanto Company | Agrobacterium-mediated plant transformation method |
US6297056B1 (en) | 1999-10-12 | 2001-10-02 | Pioneer Hi-Bred International, Inc. | Brassica transformation via microprojectile bombardment |
US6613963B1 (en) | 2000-03-10 | 2003-09-02 | Pioneer Hi-Bred International, Inc. | Herbicide tolerant Brassica juncea and method of production |
EP1303616A2 (fr) | 2000-07-25 | 2003-04-23 | Calgene LLC | Sequences d'acide nucleique codantes pour la beta-cetoacyl-acp synthase et utilisation de celles-ci |
CN102212534A (zh) | 2000-10-30 | 2011-10-12 | 弗迪亚股份有限公司 | 新的草甘膦n-乙酰转移酶(gat)基因 |
AUPR143100A0 (en) | 2000-11-10 | 2000-12-07 | Bureau Of Sugar Experiment Stations | Plant transformation |
US7151204B2 (en) | 2001-01-09 | 2006-12-19 | Monsanto Technology Llc | Maize chloroplast aldolase promoter compositions and methods for use thereof |
AU2002315526A1 (en) | 2001-07-06 | 2003-01-21 | Monsanto Technology Llc | Methods for enhancing segregation of transgenes in plants and compositions thereof |
US20050034188A1 (en) * | 2002-03-20 | 2005-02-10 | J. R. Simplot Company | Refined plant transformation |
WO2004009761A2 (fr) | 2002-07-18 | 2004-01-29 | Monsanto Technology Llc | Procedes permettant d'utiliser des polynucleotides artificiels et compositions desdits polynucleotides afin de reduire le silençage transgenique |
DE60336849D1 (de) | 2002-12-18 | 2011-06-01 | Athenix Corp | Herbizidresistenz verleihende gene |
WO2004074443A2 (fr) | 2003-02-18 | 2004-09-02 | Monsanto Technology Llc | 5-enolpyruvylshikimate-3-phosphate synthase (epsps) de classe i resistante au glyphosate |
WO2005003362A2 (fr) | 2003-03-10 | 2005-01-13 | Athenix Corporation | Methodes destinees a conferer une resistance aux herbicides |
US7575917B2 (en) | 2003-04-09 | 2009-08-18 | Monsanto Technology Llc | DNA constructs and methods to enhance the production of commercially viable transgenic plants |
EP1766032B1 (fr) * | 2004-02-13 | 2012-10-10 | Monsanto Technology, LLC | Assemblage in vivo d'unites de transcription |
US20050289672A1 (en) | 2004-06-28 | 2005-12-29 | Cambia | Biological gene transfer system for eukaryotic cells |
US7601536B2 (en) * | 2004-09-08 | 2009-10-13 | J. R. Simplot Company | Plant-specific genetic elements and transfer cassettes for plant transformation |
CN101389755B (zh) * | 2004-09-24 | 2014-06-11 | J.R.西姆普罗特公司 | 基因沉默 |
US20060200878A1 (en) | 2004-12-21 | 2006-09-07 | Linda Lutfiyya | Recombinant DNA constructs and methods for controlling gene expression |
US20060150286A1 (en) * | 2004-12-23 | 2006-07-06 | Shihshieh Huang | Gene suppression in transgenic plants using multiple constructs |
EP2016181B1 (fr) * | 2006-05-12 | 2013-04-17 | Monsanto Technology, LLC | Procédés et compositions pour l'élaboration de plantes transgéniques sans marqueur |
CN101490266B (zh) | 2006-05-16 | 2012-06-27 | 孟山都技术有限公司 | 非土壤杆菌属细菌物种用于植物转化的用途 |
CN101517083A (zh) * | 2006-07-19 | 2009-08-26 | 孟山都技术有限公司 | 提高植物转化效率的多个转化增强子序列的应用 |
BR112013015247B1 (pt) * | 2010-12-17 | 2021-11-23 | Monsanto Technology Llc | Método para melhorar a competência de células de milho para transformação mediada por bactérias |
CA2871008C (fr) * | 2012-04-23 | 2022-11-22 | Bayer Cropscience Nv | Ingenierie genomique ciblee dans des plantes |
DE102013215045B4 (de) | 2013-07-31 | 2023-05-04 | Siemens Healthcare Gmbh | Dreheinheit mit einer Vorrichtung zur drahtlosen Datenübertragung zwischen zwei relativ zueinander bewegbaren Teilen sowie Verfahren zur drahtlosen Datenübertragung zwischen zwei relativ zueinander bewegbaren Teilen |
-
2017
- 2017-06-27 US US16/311,749 patent/US11732268B2/en active Active
- 2017-06-27 WO PCT/US2017/039502 patent/WO2018005491A1/fr unknown
- 2017-06-27 CN CN201780051254.8A patent/CN109642235B/zh active Active
- 2017-06-27 EP EP17821073.8A patent/EP3475427A4/fr active Pending
-
2023
- 2023-06-06 US US18/330,297 patent/US20230383301A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
US20190211344A1 (en) | 2019-07-11 |
EP3475427A4 (fr) | 2019-11-06 |
CN109642235A (zh) | 2019-04-16 |
CN109642235B (zh) | 2023-12-29 |
US11732268B2 (en) | 2023-08-22 |
US20230383301A1 (en) | 2023-11-30 |
WO2018005491A1 (fr) | 2018-01-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230383301A1 (en) | Methods and compositions for use in genome modification in plants | |
JP6505599B2 (ja) | 遺伝子標的化および形質スタッキングのための特別設計の導入遺伝子組み込みプラットフォーム(etip) | |
CN115315516B (zh) | 一种提高植物遗传转化和基因编辑效率的方法 | |
CA3091251A1 (fr) | Compositions et procedes pour ameliorer le rendement des recoltes par empilement de caracteres | |
WO2017180180A2 (fr) | Promoteur de plante et 3'utr pour l'expression de transgènes | |
JP2022534381A (ja) | ゲノム編集を使用してドミナントアレルを生成する方法及び組成物 | |
EP3356537B1 (fr) | Séquence de chromosome b de maïs de recombinaison et utilisations de ceux-ci | |
CN111819285A (zh) | 防碎基因和突变 | |
WO2017219046A1 (fr) | Promoteur de plante et 3'utr pour l'expression de transgènes | |
EP3052633B1 (fr) | Éléments régulateurs du gène de type métallothionéine de zea mays et leurs utilisations | |
WO2017053164A1 (fr) | Promoteur et 3'utr de plante pour l'expression d'un transgène | |
CA2982927C (fr) | Promoteur vegetal pour l'expression d'un transgene | |
WO2017219047A2 (fr) | Promoteur de plante et 3'utr pour l'expression de transgènes | |
WO2022086951A1 (fr) | Éléments régulateurs de plante et utilisations associées pour l'autoexcision | |
US10570403B2 (en) | Plant promoter and 3′ UTR from Zea mays chlorophyl A/B binding protein gene | |
AU2023254082A1 (en) | Plant regulatory elements and uses thereof for autoexcision | |
US20090156793A1 (en) | Circular Chromosomes | |
CA3140132A1 (fr) | Ingenierie genomique dirigee utilisant des technologies d'edition ciblees ameliorees | |
BR112018006553B1 (pt) | Ácido nucleico recombinante e método para produzir uma célula de milho transgênica | |
BR102017008936B1 (pt) | Vetor de ácido nucleico e uso de uma planta ou semente não-zea mays c.v. b73 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
|
17P | Request for examination filed |
Effective date: 20190117 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
AX | Request for extension of the european patent |
Extension state: BA ME |
|
DAV | Request for validation of the european patent (deleted) | ||
DAX | Request for extension of the european patent (deleted) | ||
A4 | Supplementary search report drawn up and despatched |
Effective date: 20191009 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: A01H 1/00 20060101ALI20191002BHEP Ipc: C12N 15/79 20060101AFI20191002BHEP Ipc: C12N 15/84 20060101ALI20191002BHEP Ipc: C12N 15/82 20060101ALI20191002BHEP |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
17Q | First examination report despatched |
Effective date: 20201014 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |