US20230257757A1 - System and Method of Modular Cloning - Google Patents
System and Method of Modular Cloning Download PDFInfo
- Publication number
- US20230257757A1 US20230257757A1 US18/150,272 US202318150272A US2023257757A1 US 20230257757 A1 US20230257757 A1 US 20230257757A1 US 202318150272 A US202318150272 A US 202318150272A US 2023257757 A1 US2023257757 A1 US 2023257757A1
- Authority
- US
- United States
- Prior art keywords
- item
- sites
- site
- level
- recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000010367 cloning Methods 0.000 title claims description 165
- 238000000034 method Methods 0.000 title description 33
- 238000003776 cleavage reaction Methods 0.000 claims abstract description 443
- 230000007017 scission Effects 0.000 claims abstract description 443
- 239000013598 vector Substances 0.000 claims abstract description 378
- 108091008146 restriction endonucleases Proteins 0.000 claims abstract description 231
- 108020004414 DNA Proteins 0.000 claims abstract description 190
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 136
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 86
- 230000000295 complement effect Effects 0.000 claims abstract description 67
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 58
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 58
- 239000003550 marker Substances 0.000 claims abstract description 35
- 108010042407 Endonucleases Proteins 0.000 claims abstract description 28
- 102000004533 Endonucleases Human genes 0.000 claims abstract description 28
- 238000000137 annealing Methods 0.000 claims abstract description 14
- 239000002773 nucleotide Substances 0.000 claims description 53
- 125000003729 nucleotide group Chemical group 0.000 claims description 53
- 238000006243 chemical reaction Methods 0.000 description 166
- 239000012634 fragment Substances 0.000 description 96
- 238000013518 transcription Methods 0.000 description 71
- 230000035897 transcription Effects 0.000 description 71
- 239000013612 plasmid Substances 0.000 description 45
- 210000004027 cell Anatomy 0.000 description 39
- 102000004190 Enzymes Human genes 0.000 description 34
- 108090000790 Enzymes Proteins 0.000 description 34
- 230000002103 transcriptional effect Effects 0.000 description 29
- 239000007795 chemical reaction product Substances 0.000 description 28
- 108700026244 Open Reading Frames Proteins 0.000 description 24
- 230000006798 recombination Effects 0.000 description 21
- 238000005215 recombination Methods 0.000 description 21
- 239000000047 product Substances 0.000 description 20
- 238000011144 upstream manufacturing Methods 0.000 description 19
- 108010076504 Protein Sorting Signals Proteins 0.000 description 17
- 230000014509 gene expression Effects 0.000 description 15
- FDSDTBUPSURDBL-LOFNIBRQSA-N canthaxanthin Chemical compound CC=1C(=O)CCC(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)C(=O)CCC1(C)C FDSDTBUPSURDBL-LOFNIBRQSA-N 0.000 description 14
- 241000588724 Escherichia coli Species 0.000 description 13
- 239000000203 mixture Substances 0.000 description 13
- 102000003960 Ligases Human genes 0.000 description 12
- 108090000364 Ligases Proteins 0.000 description 12
- 238000013461 design Methods 0.000 description 12
- 102000012410 DNA Ligases Human genes 0.000 description 11
- 108010061982 DNA Ligases Proteins 0.000 description 11
- UPYKUZBSLRQECL-UKMVMLAPSA-N Lycopene Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1C(=C)CCCC1(C)C)C=CC=C(/C)C=CC2C(=C)CCCC2(C)C UPYKUZBSLRQECL-UKMVMLAPSA-N 0.000 description 11
- JEVVKJMRZMXFBT-XWDZUXABSA-N Lycophyll Natural products OC/C(=C/CC/C(=C\C=C\C(=C/C=C/C(=C\C=C\C=C(/C=C/C=C(\C=C\C=C(/CC/C=C(/CO)\C)\C)/C)\C)/C)\C)/C)/C JEVVKJMRZMXFBT-XWDZUXABSA-N 0.000 description 11
- 235000012661 lycopene Nutrition 0.000 description 11
- 239000001751 lycopene Substances 0.000 description 11
- OAIJSZIZWZSQBC-GYZMGTAESA-N lycopene Chemical compound CC(C)=CCC\C(C)=C\C=C\C(\C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)\C=C\C=C(/C)CCC=C(C)C OAIJSZIZWZSQBC-GYZMGTAESA-N 0.000 description 11
- 229960004999 lycopene Drugs 0.000 description 11
- 238000005580 one pot reaction Methods 0.000 description 11
- ZCIHMQAPACOQHT-ZGMPDRQDSA-N trans-isorenieratene Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/c1c(C)ccc(C)c1C)C=CC=C(/C)C=Cc2c(C)ccc(C)c2C ZCIHMQAPACOQHT-ZGMPDRQDSA-N 0.000 description 11
- 241000120506 Bluetongue virus Species 0.000 description 10
- 230000001580 bacterial effect Effects 0.000 description 10
- 230000015572 biosynthetic process Effects 0.000 description 10
- 230000002068 genetic effect Effects 0.000 description 10
- 101150066555 lacZ gene Proteins 0.000 description 10
- 235000018102 proteins Nutrition 0.000 description 10
- 102000004169 proteins and genes Human genes 0.000 description 10
- 241000196324 Embryophyta Species 0.000 description 9
- 241000588696 Pantoea ananatis Species 0.000 description 9
- 108700008625 Reporter Genes Proteins 0.000 description 9
- 230000029087 digestion Effects 0.000 description 9
- 108020003589 5' Untranslated Regions Proteins 0.000 description 8
- 235000012682 canthaxanthin Nutrition 0.000 description 8
- 238000012360 testing method Methods 0.000 description 8
- 230000009466 transformation Effects 0.000 description 8
- OOUTWVMJGMVRQF-DOYZGLONSA-N Phoenicoxanthin Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)C(=O)C(O)CC1(C)C)C=CC=C(/C)C=CC2=C(C)C(=O)CCC2(C)C OOUTWVMJGMVRQF-DOYZGLONSA-N 0.000 description 7
- 230000008901 benefit Effects 0.000 description 7
- 239000001659 canthaxanthin Substances 0.000 description 7
- 229940008033 canthaxanthin Drugs 0.000 description 7
- 238000010276 construction Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 238000004519 manufacturing process Methods 0.000 description 7
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 6
- 108091028043 Nucleic acid sequence Proteins 0.000 description 6
- 230000003115 biocidal effect Effects 0.000 description 6
- 235000021466 carotenoid Nutrition 0.000 description 6
- 150000001747 carotenoids Chemical class 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 5
- 102000053602 DNA Human genes 0.000 description 5
- 101150000046 crtE gene Proteins 0.000 description 5
- 101150056470 dxs gene Proteins 0.000 description 5
- 230000008595 infiltration Effects 0.000 description 5
- 238000001764 infiltration Methods 0.000 description 5
- 238000005304 joining Methods 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 239000012429 reaction media Substances 0.000 description 5
- 239000011541 reaction mixture Substances 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 101150093857 Ccn4 gene Proteins 0.000 description 4
- 101150081978 ELM1 gene Proteins 0.000 description 4
- 241000207746 Nicotiana benthamiana Species 0.000 description 4
- 102000018120 Recombinases Human genes 0.000 description 4
- 108010091086 Recombinases Proteins 0.000 description 4
- 101100278777 Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145) dxs1 gene Proteins 0.000 description 4
- 101100114901 Streptomyces griseus crtI gene Proteins 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 239000002131 composite material Substances 0.000 description 4
- 230000001419 dependent effect Effects 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 239000000376 reactant Substances 0.000 description 4
- 230000035939 shock Effects 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 101100165658 Alternaria brassicicola bsc5 gene Proteins 0.000 description 3
- 101100397224 Bacillus subtilis (strain 168) isp gene Proteins 0.000 description 3
- 101100032924 Bacillus subtilis (strain 168) radA gene Proteins 0.000 description 3
- 101100492392 Didymella fabae pksAC gene Proteins 0.000 description 3
- 241000206602 Eukaryota Species 0.000 description 3
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- 239000006142 Luria-Bertani Agar Substances 0.000 description 3
- 241000520272 Pantoea Species 0.000 description 3
- 101100226893 Phomopsis amygdali PaP450-2 gene Proteins 0.000 description 3
- 241000709992 Potato virus X Species 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 101100052502 Shigella flexneri yciB gene Proteins 0.000 description 3
- 101000870438 Streptococcus gordonii UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase stabilizing protein GtfB Proteins 0.000 description 3
- 101100061456 Streptomyces griseus crtB gene Proteins 0.000 description 3
- 101000645119 Vibrio campbellii (strain ATCC BAA-1116 / BB120) Nucleotide-binding protein VIBHAR_03667 Proteins 0.000 description 3
- 230000010261 cell growth Effects 0.000 description 3
- 101150081158 crtB gene Proteins 0.000 description 3
- 101150011633 crtI gene Proteins 0.000 description 3
- 101150079391 crtW gene Proteins 0.000 description 3
- 230000001086 cytosolic effect Effects 0.000 description 3
- 230000007423 decrease Effects 0.000 description 3
- 101150064873 ispA gene Proteins 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 238000013492 plasmid preparation Methods 0.000 description 3
- 238000007747 plating Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 230000000007 visual effect Effects 0.000 description 3
- JKQXZKUSFCKOGQ-JLGXGRJMSA-N (3R,3'R)-beta,beta-carotene-3,3'-diol Chemical compound C([C@H](O)CC=1C)C(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)C[C@@H](O)CC1(C)C JKQXZKUSFCKOGQ-JLGXGRJMSA-N 0.000 description 2
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 2
- 108090000104 Actin-related protein 3 Proteins 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 2
- 101100054772 Arabidopsis thaliana ACT2 gene Proteins 0.000 description 2
- 102100031102 C-C motif chemokine 4 Human genes 0.000 description 2
- 101150066002 GFP gene Proteins 0.000 description 2
- 101710180399 Glycine-rich protein Proteins 0.000 description 2
- 101000777470 Mus musculus C-C motif chemokine 4 Proteins 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 108010064978 Type II Site-Specific Deoxyribonucleases Proteins 0.000 description 2
- 102100028262 U6 snRNA-associated Sm-like protein LSm4 Human genes 0.000 description 2
- JKQXZKUSFCKOGQ-LQFQNGICSA-N Z-zeaxanthin Natural products C([C@H](O)CC=1C)C(C)(C)C=1C=CC(C)=CC=CC(C)=CC=CC=C(C)C=CC=C(C)C=CC1=C(C)C[C@@H](O)CC1(C)C JKQXZKUSFCKOGQ-LQFQNGICSA-N 0.000 description 2
- QOPRSMDTRDMBNK-RNUUUQFGSA-N Zeaxanthin Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CCC(O)C1(C)C)C=CC=C(/C)C=CC2=C(C)CC(O)CC2(C)C QOPRSMDTRDMBNK-RNUUUQFGSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- OENHQHLEOONYIE-UKMVMLAPSA-N all-trans beta-carotene Natural products CC=1CCCC(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C OENHQHLEOONYIE-UKMVMLAPSA-N 0.000 description 2
- JKQXZKUSFCKOGQ-LOFNIBRQSA-N all-trans-Zeaxanthin Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CC(O)CC1(C)C)C=CC=C(/C)C=CC2=C(C)CC(O)CC2(C)C JKQXZKUSFCKOGQ-LOFNIBRQSA-N 0.000 description 2
- 235000013734 beta-carotene Nutrition 0.000 description 2
- 239000011648 beta-carotene Substances 0.000 description 2
- TUPZEYHYWIEDIH-WAIFQNFQSA-N beta-carotene Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CCCC1(C)C)C=CC=C(/C)C=CC2=CCCCC2(C)C TUPZEYHYWIEDIH-WAIFQNFQSA-N 0.000 description 2
- 229960002747 betacarotene Drugs 0.000 description 2
- 230000008238 biochemical pathway Effects 0.000 description 2
- 230000001851 biosynthetic effect Effects 0.000 description 2
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 2
- 239000013256 coordination polymer Substances 0.000 description 2
- 101150085103 crtY gene Proteins 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical group NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 238000001976 enzyme digestion Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 230000030279 gene silencing Effects 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- -1 nucleic acid acids Chemical class 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 229960000268 spectinomycin Drugs 0.000 description 2
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- KBPHJBAIARWVSC-XQIHNALSSA-N trans-lutein Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CC(O)CC1(C)C)C=CC=C(/C)C=CC2C(=CC(O)CC2(C)C)C KBPHJBAIARWVSC-XQIHNALSSA-N 0.000 description 2
- 235000010930 zeaxanthin Nutrition 0.000 description 2
- 239000001775 zeaxanthin Substances 0.000 description 2
- 229940043269 zeaxanthin Drugs 0.000 description 2
- OENHQHLEOONYIE-JLTXGRSLSA-N β-Carotene Chemical compound CC=1CCCC(C)(C)C=1\C=C\C(\C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C OENHQHLEOONYIE-JLTXGRSLSA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- 108040000931 ATP:ADP antiporter activity proteins Proteins 0.000 description 1
- 102000023805 ATP:ADP antiporter activity proteins Human genes 0.000 description 1
- 102100034542 Acyl-CoA (8-3)-desaturase Human genes 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 241000178564 Agrobacterium aurantiacum Species 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 101100162180 Arabidopsis thaliana AAC1 gene Proteins 0.000 description 1
- 101100490660 Arabidopsis thaliana AGP18 gene Proteins 0.000 description 1
- 101000793994 Arabidopsis thaliana Chlorophyll a-b binding protein CP26, chloroplastic Proteins 0.000 description 1
- 101100228136 Arabidopsis thaliana GAE6 gene Proteins 0.000 description 1
- 101100326997 Arabidopsis thaliana LHCB5 gene Proteins 0.000 description 1
- 101100351627 Arabidopsis thaliana PER42 gene Proteins 0.000 description 1
- 101000601283 Arabidopsis thaliana Peroxidase 42 Proteins 0.000 description 1
- 101100411930 Arabidopsis thaliana RBCS-1B gene Proteins 0.000 description 1
- 101000580697 Arabidopsis thaliana Ribulose bisphosphate carboxylase small subunit 1B, chloroplastic Proteins 0.000 description 1
- 101100004361 Arabidopsis thaliana TGG1 gene Proteins 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 101100286286 Dictyostelium discoideum ipi gene Proteins 0.000 description 1
- 241000702191 Escherichia virus P1 Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241000701484 Figwort mosaic virus Species 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 101000848239 Homo sapiens Acyl-CoA (8-3)-desaturase Proteins 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 101100168661 Paracoccus sp. (strain N81106 / MBIC 01143) crtW gene Proteins 0.000 description 1
- 101710132772 Peroxidase 1 Proteins 0.000 description 1
- 101000714429 Potato virus X Coat protein Proteins 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- 102000029301 Protein S Human genes 0.000 description 1
- 108010066124 Protein S Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000187391 Streptomyces hygroscopicus Species 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- 241000710145 Tomato bushy stunt virus Species 0.000 description 1
- 101150107399 UTR1 gene Proteins 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 235000001014 amino acid Nutrition 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 108010054251 arabinogalactan proteins Proteins 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- FDSDTBUPSURDBL-DKLMTRRASA-N canthaxanthin Chemical compound CC=1C(=O)CCC(C)(C)C=1\C=C\C(\C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)C(=O)CCC1(C)C FDSDTBUPSURDBL-DKLMTRRASA-N 0.000 description 1
- 229960003669 carbenicillin Drugs 0.000 description 1
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000012707 chemical precursor Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 101150014423 fni gene Proteins 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 101150075592 idi gene Proteins 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000012269 metabolic engineering Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 108091027963 non-coding RNA Proteins 0.000 description 1
- 102000042567 non-coding RNA Human genes 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 108010082527 phosphinothricin N-acetyltransferase Proteins 0.000 description 1
- 239000000049 pigment Substances 0.000 description 1
- 210000001778 pluripotent stem cell Anatomy 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 230000037432 silent mutation Effects 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/66—General methods for inserting a gene into a vector to form a recombinant vector using cleavage and ligation; Use of non-functional linkers or adaptors, e.g. linkers containing the sequence for a restriction endonuclease
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1093—General methods of preparing gene libraries, not provided for in other subgroups
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B40/00—Libraries per se, e.g. arrays, mixtures
- C40B40/04—Libraries containing only organic compounds
- C40B40/10—Libraries containing peptides or polypeptides, or derivatives thereof
Definitions
- the present invention relates to a cloning system for producing a nucleic acid construct of interest using type IIs restriction endonucleases.
- the invention also provides a method of producing a nucleic acid construct of interest from at least three nucleic acid fragment constructs.
- Synthetic biology promises to revolutionize biotechnology through engineering of life forms with novel phenotypes not normally found in nature.
- applications include the microbial production of chemical precursors, novel antibiotics and, induction and fine-tuning of pluripotent stem cells and the engineering of a minimal free living cell.
- Such applications will require the ability to physically assemble complex DNA molecules containing large numbers of natural or artificial genes in a wide variety of arrangements.
- Recombinase-based cloning eliminates the problems coming from the multiple occurrence of restriction sites in large constructs but is limited by the fact that recombination sites are left in the final construct, preventing the seamless assembly of protein coding sequences. Moreover, recombinase-based cloning is limited by the fact that, so far, only 4 fragments can be assembled in one construct simultaneously. Ligation-independent cloning is also independent of restriction sites since restriction enzymes are not used, but is limited by the fact that it requires PCR and therefore requires sequencing of constructs made with this approach.
- Engler et al. PLoS ONE 4 (2009) e5553 describe a protocol to assemble in one step and one tube at least nine separate DNA fragments together into an acceptor vector using type IIs restriction enzymes by simply subjecting a mix of 10 undigested input plasmids to a restriction-ligation reaction and transforming the resulting mix into competent cells.
- This protocol was named “Golden Gate” cloning.
- the ability to assemble a large number of fragments in a single step is useful for making combinatorial libraries, for example for making a construct containing all the genes encoding for a biochemical pathway.
- Such library can be made by assembling in one step all the genes necessary for a biochemical pathway, with multiple variants for each of the genes of the pathway.
- the present invention provides:
- each of said m nucleic acid fragment constructs comprising in this order:
- FIG. 1 A shows elements of a system that allows re-use of the entry DNAs of the invention (level 1 destination vectors) for different inserted sequence segments.
- FIG. 1 B provides an example for cloning of three nucleic acid fragment constructs na1, na2, and na3 into a destination vector using end-linker EL3.
- FIG. 1 C shows that further nucleic acid fragment constructs na4 to na7 and a further construct na1 that is based on the same entry vector number 1 as na1 ligated in FIG. 1 B can be cloned into the product vector obtained in FIG. 1 B .
- FIG. 2 explains how type IIs restriction sites are depicted in the following figures.
- FIG. 3 shows a general embodiment wherein a eukaryotic multi-gene construct is produced as the nucleic acid fragment of interest.
- FIG. 4 depicts a set of level 1 destination vectors, level 2 destination vectors, and end-linkers.
- FIG. 5 A illustrates the structure of the modules depicted in FIG. 3 in greater detail and shows how these modules can be assembled.
- FIG. 5 B show nucleotide sequences at the junction of various modules for each step of cloning of FIG. 5 A .
- “Gene 1” is equivalent to “TU 1”.
- Sequence stretches (SS) designated SS1 to SS26 are SEQ ID NO: 31 to SEQ ID NO: 56, respectively.
- FIG. 6 is an overview of alternative cloning strategies.
- FIG. 7 shows the structure of level 0 modules.
- FIG. 8 A explains the assembly of transcriptional units from level 0 modules into level 1 destination vectors (entry DNAs).
- FIG. 8 B illustrates an example for the assembly of level 0 modules using a different set of level 1 destination vectors.
- FIG. 9 A assembly of 5 transcription units into a multigene level 2 construct in a level 2 reaction.
- FIG. 9 B Assembly is performed with an end linker containing internal divergent Bsal restriction sites for allowing a further level 2 reaction in the resulting level 2 construct.
- FIG. 9 C shows examples of further rounds of cloning.
- FIG. 10 shows the efficiency of cloning for different levels of assembly.
- FIG. 11 illustrates the general strategy used for cloning of prokaryotic operons.
- FIG. 12 Comparison of level 1 destination vectors for eukaryotes (“MoClo” and prokaryotes (“MoClo Pro”).
- FIG. 13 illustrates the cloning of 5 level 1 constructs (shown in the lower part) from level 0 modules into level 1 destination vectors (shown in the middle part) in a level 1 reaction.
- FIG. 14 shows the level 2 reaction from the level 1 constructs made as shown in FIG. 13 to produce a functional operon in a level 2 construct.
- FIG. 15 shows a similar level 2 reaction as shown in FIG. 14 , but with an end-linker containing internal Bsal sites for allowing a further level 2 reaction.
- FIG. 16 illustrates the preparation of 3 additional level 1 constructs (designated 6, 7 and 1, respectively) containing a second promoter (P2) and two further ORFs (orf4 and orf5).
- FIG. 17 Cloning of the three additional level 1 constructs obtained according to FIG. 16 into the level 2i ⁇ 1 construct made according to FIG. 15 .
- FIGS. 18 A- 18 C show the structure of transcriptional units, level 0 modules, and of destination vectors required for their cloning.
- FIG. 18 A The transcriptional units contain up of 5 basic modules separated by 4 nucleotides sequences that serve as recombination sites (shown in boxes).
- FIG. 18 B Level 0 modules shown on the first line are flanked by Bsal sites.
- FIG. 18 C Strategy for removing internal type IIS recognition sequences.
- FIGS. 19 A- 19 D show an example for cloning of constructs of level 0, 1 and 2. Antibiotic resistances are indicated.
- FIG. 19 A Illustrates cloning of level 0 promoter modules.
- FIG. 19 B Illustrates cloning of a level 1 construct containing a transcription unit.
- FIG. 19 C Illustrates cloning of a level 2i-1 construct containing 5 transcription units, TU1 (containing GFP), TU2 (containing p19), TU3 (containing VP2), TU4 (containing VP5), TU5 (containing VP7) into destination vector pL2-1.
- FIG. 19 D Illustrates cloning of a level 2-2 construct containing 11 transcription units.
- the construct contains TU6 (transcription unit with VP3), TU7 (transcription unit with BAR), TU8 (transcription unit with antibody light chain), TU9 (transcription unit with antibody heavy chain), TU10 (transcription unit with TMV MP), TU11 (transcription unit with TMV CP).
- FIG. 20 A shows the structure of the 11 level 1 transcription units used in FIG. 19 B .
- FIG. 20 B shows the structure of construct plCH51811 that is obtained by cloning of the 11 transcription units cloned in a level 2-2 construct.
- FIG. 21 shows the expression of GFP in a Nicotiana benthamiana leaf after infiltration of level-2 constructs shown in FIG. 10 .
- FIG. 22 shows prokaryotic genes cloned as level 0 entry modules.
- FIG. 23 A shows level 1 destination vectors for cloning of prokaryotic coding sequences.
- FIG. 23 B shows two sets of end linkers that contain the LacZ terminator from pUC19.
- FIG. 24 shows a list of level 1 constructs made with the level 0 modules shown in FIG. 22 .
- FIG. 25 shows level 2i-1 constructs.
- FIGS. 26 A- 26 B show addition of two or three more genes to the operons obtained in FIG. 25 .
- FIG. 27 shows that level 2 cloning can be followed by a level 3.
- FIG. 28 shows vectors required for level 3 cloning.
- FIG. 29 A shows a set of destination vectors and end-linkers for assembly of several level 1 transcription units (or more generally nucleic acids “na”) into a level M destination vector (M stands for multiplication).
- FIG. 29 B illustrates assembly of three transcription units (nucleic acid acids na1 to 3) in a level M destination vector.
- FIG. 29 C illustrates assembly of two transcription units (nucleic acids na4 and 5) in a level M destination vector.
- the linker is chosen from the set of linkers ELM1 to ELM7 such that na5 can be linked to cleavage site S8 of destination vector M.
- 29 D shows assembly of the two pre-assembled blocks of transcription units (the reaction products of the reactions shown in B and C) in a level 2 construct employing a level 2 destination vector. Details of the restriction sites of destination vectors and end-linkers are shown in FIGS. 31 and 32 (below).
- FIG. 30 shows that level M can be used as an intermediate step between level 1 and 2.
- FIG. 31 shows the structure of the entry DNAs as well as vectors required for level M cloning.
- FIGS. 32 A-C show an example where blocks of three ( FIG. 32 A ) and two ( FIG. 32 B ) transcription units are separately pre-assembled into level M destination vectors before being assembled in a level 2 vector in a subsequent step ( FIG. 32 C ).
- the destination vectors and end-linkers for level M (left side of the figure) are as shown in FIG. 31 .
- FIG. 34 A illustrates a level P reaction wherein two level M constructs are assembled into a level P destination vector using an end-linker P to give a level P construct.
- FIG. 34 B illustrates a level M reaction wherein two level P constructs are assembled into a destination vector M (“destination M”) using an end-linker M to give a Level M construct.
- FIG. 35 Cloning from level P to level M and vice-versa can be repeated indefinitely by reusing the same set of vectors and end-linkers shown in FIG. 33 .
- FIG. 36 illustrates the structure of constructs obtained in successive level M and level P cloning steps.
- the system of the invention comprises a defined set of components that have a high versatility and flexibility, whereby a given system can be easily applied to many different applications.
- a given system can be used for applications comprising different numbers of fragments to be assembled in a nucleic acid construct of interest. It is a great advantage of the invention that many different fragments can be combined with a number of acceptor vectors that is smaller than the number of fragments to be combined. Therefore, the system can be scaled to the combination of many different fragments and fragment numbers with little or no extra cloning work for the adaption of acceptor vectors to a large number of fragments.
- the cloning system allows to assemble constructs from multiple DNA fragments at each cloning step (using Golden Gate cloning), (2) the cloning procedure is automatically defined by the number of genetic elements that the user wants to assemble and does not require construct-specific cloning strategies and can therefore easily be automatized, (3) the cloning procedure can be repeated indefinitely using the same set of cloning vectors to make increasingly complex constructs (with an increasingly higher number of multigene and/or genetic elements.
- a nucleic acid construct of interest is a DNA assembled from m nucleic acid fragment constructs, m being an integer of at least 3. Each nucleic acid fragment construct provides a sequence segment to the nucleic acid construct of interest.
- the nucleic acid construct of interest is present in a vector having in its backbone a selectable marker for allowing selection of cells containing the vector.
- the nucleic acid construct of interest is produced in the invention in a process comprising at least two, typically three, steps of restriction and ligation, for example departing from standardised pre-prepared modules. Restriction is catalysed by a type IIs restriction endonuclease, ligation is catalysed by a ligase.
- a first step of restriction and ligation corresponding to step (A) of the method of the invention (also referred herein as “level 1” or “level 1 reaction”), at least one, preferably at least 2, nucleic acid modules are linked by restriction and ligation and at the same time inserted into an acceptor vector.
- Acceptor vectors are referred to herein as “destination vectors”.
- the acceptor vectors of the level 1 reaction are also referred to herein as “level 1 destination vectors” or “level 1 acceptor vector”.
- the level 1 acceptor or destination vectors are also referred to herein as “entry DNAs”.
- the reaction products of the level 1 reaction are referred to as “level 1 constructs” or “nucleic acid fragment constructs”, the latter terms being equivalent herein.
- a level 1 destination vector is a reactant of the level 1 reaction
- the nucleic acid fragment constructs are the products of the level 1 reaction.
- Multiple level 1 reactions are generally conducted separately to obtain at least two different nucleic acid fragment constructs to be assembled in the second step, referred to herein as “level 2” (see further below).
- level 1 reaction One purpose of the level 1 reaction is to provide the nucleic acid fragment constructs to be assembled in the next step with suitable cleavage sites of a type IIs restriction endonuclease to allow ligation of the constructs obtained on level 1 in the desired order on level 2.
- the level 1 reaction further serves the purpose of constructing nucleic acid fragment constructs from 2 or more modules.
- the nucleic acid construct of interest comprises several eukaryotic transcription units
- multiple individual transcription units can be assembled in separate level 1 reactions from 2 or more modules (such as promoter, 5′ UTR, signal peptide sequence etc.).
- level 2 two or more transcription units can then be combined.
- one or more further nucleic acid construct each containing a transcription unit can be combined with the reaction product of the first level 2 reaction. It is, however, not compulsory to produce the nucleic acid fragment constructs using such level 1 reaction. It could also be considered to engineer them by other means or to synthesise them artificially de novo.
- step (B) of the method of the invention at least 2 nucleic acid fragment constructs obtained in the previous level 1 step are combined by restriction and ligation and, in the same reaction, inserted into an acceptor vector.
- This acceptor vector of the level 2 reaction is referred to as “level 2 destination vector”. If the term “destination vector” is used without reference to a particular level, it refers to a level 2 destination vector.
- the reaction product of the level 2 reaction is referred to as “level 2 construct”. In some embodiments, the level 2 construct is the nucleic acid construct of interest.
- the level 2 reaction is followed by a further reaction step (step (C)) that may be referred to as “level 2-2”, indicating a second level 2 reaction.
- step (C) a further reaction step
- a nucleic acid linker also simply referred to as “linker” or “end-linker” herein
- linker also simply referred to as “linker” or “end-linker” herein
- linker or multiple linkers significantly improves the versatility and flexibility of the systems of the invention in that different nucleic acid fragment constructs can be inserted into a given destination vector, whereby a given destination vector can be used independent of the cleavage site of the nucleic acid fragment construct.
- linkers allow introduction of a type IIs restriction site for re-opening the level 2 reaction product for insertion of further nucleic acid fragment constructs in a further step of restriction and ligation (step (C)) as will be described below.
- module is used herein to refer to the starting compound of a level 1 reaction other than the level 1 destination vector.
- a module is a reactant of a restriction and ligation reaction that reacts with a level 1 destination vector.
- the modules of a level 1 reaction can be produced in a level 0 reaction (see further below).
- the modules of the level 1 reaction can be the products of a level 0 reaction.
- the system of the invention comprises a set of n entry DNAs and at least one destination vector.
- n is an integer of at least 2, in another embodiment of at least 3. Conveniently, n may be between 3 and 10.
- the term “entry DNA” refers to acceptor vectors of the level 1 reaction.
- the destination vector is an acceptor vector of the level 2 reaction.
- the entry DNAs and the at least one destination vector of the system are key components for performing both the level 1 and the level 2 reaction.
- the entry DNAs of the system of the invention allow combination of the multiple nucleic acid fragment constructs produced from multiple entry DNAs in a desired order and insertion into the level 2 destination vector in the level 2 reaction.
- each of said n entry DNAs comprises in this order: (i) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof;
- the cleavage sites of the type IIs restriction endonuclease recognition sites of item (iii) of entry DNAs 1 to n ⁇ 1 are complementary to the cleavage sites of the type IIs restriction endonuclease recognition sites of item (i) of entry DNAs 2 to n, respectively.
- Being complementary means that single-stranded overhangs produced by restriction with a type IIs restriction endonuclease recognising the recognition sites of the cleavage sites are complementary such that the single stranded overhangs can anneal and be ligated after annealing to form a linear DNA.
- the first entry DNA can anneal with its end represented by item (iii) to the end represented by item (i) of the second entry DNA.
- the second entry DNA can anneal with its end represented by item (iii) to the end represented by item (i) of the third entry DNA etc. This is illustrated by the dashed arrows in FIG. 4 that link the right hand side of the level 1 destination vectors with the left hand side of the level 1 destination vector underneath.
- all item (i) cleavage sites are unique and non-complementary among the n entry DNAs of the system of the invention, and all item (iii) cleavage sites of all entry DNAs are unique and non-complementary among the n entry DNAs of the invention.
- the cleavage sites of items (i) and (iii) are preferably non complementary in order to avoid ligating multiple identical fragment constructs contiguously.
- the recognition sites of items (i) and (iii) among all entry DNAs are recognitions sites of the same endonuclease so that the associated cleavage sites can be cleaved using the same type IIs restriction endonuclease.
- the recognition sites of different entry DNAs are recognition sites of different type IIs restriction endonucleases.
- multiple endonucleases will have to be used in a given level 2 reaction to ensure that all required cleavage sites are cleaved.
- the single-stranded overhangs formed from the cleavage sites by type IIs restriction enzyme cleavage are preferably non-palindromic.
- the cleavage site of the type IIs restriction endonuclease recognition site of item (iii) of entry DNA n is complementary to the cleavage site of the type IIs restriction endonuclease recognition site of item (i) of entry DNA 1 for allowing annealing of complementary single-stranded overhangs formed by restriction at recognition site (i) of entry DNA 1 and at recognition site (iii) of entry DNA n.
- This feature is illustrated in the long dashed arrow linking the TGCC cleavage site of the level 1 destination vector pL1F-7 with the TGCC cleavage site of the level 1 destination vector pL1F-1 ( FIG. 4 ).
- this allows reuse of the first and optionally further entry DNAs in a second level 2 reaction so that more nucleic acid fragment constructs can be combined to produce a nucleic acid construct of interest than the system has entry DNAs.
- the entry DNAs may be circular plasmids or vectors, wherein items (i) and (iii) of the entry DNAs are linked by a vector backbone.
- the vector backbone may contain a selectable marker allowing selection of cell clones containing the entry DNA or the nucleic acid fragment construct obtained therefrom in the level 1 reaction.
- the system further comprises a destination vector (level 2 destination vector) comprising in this order:
- the cleavage sites of items (I) and (III) are different and non-complementary.
- the recognition sites of items (I) and (III) are recognition sites of the same endonuclease so that the associated cleavage sites can be cleaved using the same type IIs restriction endonuclease.
- the recognition sites of items (I) and (III) are further recognitions sites of the same endonuclease as the recognition sites of items (i) and (iii) of the entry DNAs, so that the level 2 reaction can be performed using one type IIs restriction endonuclease.
- the type IIs restriction endonuclease recognising the recognition site (1) of said destination vector can produce a single-stranded overhang from the cleavage site of item (I) that is complementary to the single-stranded overhang producible by the type IIs restriction endonuclease recognising the recognition site (i) of entry DNA numbered 1 for enabling annealing of said complementary single-stranded overhangs and ligation of said destination vector with the DNA segment of item (ii) from entry DNA numbered 1.
- the cleavage site of item (i) of entry DNA 1 and the cleavage site of item (I) of the destination vector are complementary.
- the type IIs restriction endonuclease recognising the recognition site (1) of a destination vector can produce a single-stranded overhang from the cleavage site of item (I) that is complementary to the single-stranded overhang producible by the type IIs restriction endonuclease recognising the recognition site (i) of an entry DNA other than 1, such as 2 or 3.
- Such destination vectors are depicted in FIG. 4 .
- the cleavage site of item (III) of the destination vector may be made complementary to the cleavage site of the entry DNA that will be linked to the cleavage site of item (III).
- nucleic acid linkers may be used for this purpose, since suitable linkers allow to link any item (iii) cleavage site to the item (III) cleavage site of the destination vector without the need for producing a destination vector for each possible downstream (item (iii)) cleavage site of the entry DNAs.
- linkers Since the specific item (iii) cleavage site of an entry DNA or nucleic acid fragment construct depends, for a given set of entry DNAs, from the number of fragment constructs to be combined in the level 2 reaction, the linkers provide the system with a broad applicability to many different real life applications. Notably, a given system can be applied to cases with different numbers of fragment constructs to be recombined.
- An advantageous linker comprise in the following order:
- the linkers may be linear DNA molecules. Generally, however, the linkers are circular plasmids.
- the linkers comprise a pair of type IIs restriction endonuclease recognition sites (items (a) and (h)) and associated cleavage sites (items (b) and (g) at both ends for linking a given item (iii) site with an item (III).
- This pair of restriction sites is in convergent orientation, which means that the two cleavage sites are oriented toward the center of the linker, while the recognition sites are oriented towards the termini of the linker so that the recognition sites are removed upon restriction.
- Examples of linkers are linkers pELB-1 to -7 and pELR-1 to -7 shown in FIG. 4 .
- the linkers preferably comprise a further, different, pair of type IIs restriction sites flanked by the pair formed by items (a), (b), (g) and (h) of the linker.
- This further pair is formed by items (c) to (f) of the linker and is in divergent orientation, which allows to reopen a level 2 reaction product produced using such linker for insertion of further nucleic acid fragment constructs.
- the cleavage site of item (b) is complementary to an item (iii) cleavage site of an entry DNA for being capable of linking a cleavage site of item (iii) of one of a entry DNAs numbered 1 to n, preferably of number 1 to n ⁇ 1, to a cleavage site of item (III) of said destination vector.
- the cleavage site of item (g) of the linker is complementary to the cleavage site of item (III) of the linker.
- the linkers may be provided as part of a plasmid containing the linker elements (a) to (h) defined above and a plasmid backbone linking elements (a) and (h).
- the backbone may contain a selectable marker for selecting cells containing the plasmid using a selective agent. This allows storage and amplification of linkers in cells, notably bacterial cells.
- the system comprises a set of n nucleic acid linkers numbered 1 to n, each n-th linker comprising items (a) to (h), the cleavage site of item (iii) of each n-th entry DNA is complementary to the cleavage site of item (b) of the n-th linker; the cleavage site of item (g) of each n-th linker being complementary to the cleavage site of item (III) of the n-th destination vector.
- each n-th linker is capable of linking a cleavage site of item (iii) of the n-th entry DNA to a cleavage site of item (III) of each n-th destination vector.
- the system contains the same number of n entry DNAs and linkers.
- a linker is provided for each entry DNA of the set of n entry DNA.
- a linker is provided for each entry DNA of the set of n entry DNA.
- all item (g) cleavage sites of the set of n linkers can be identical.
- FIG. 4 shows a set of 7 entry vectors (level 1 destination vectors) PL1F-1 to -7 and a set of linkers pELB-1 to -7.
- Each of the n linkers of the set of n linkers may be part of a plasmid.
- the cleavage sites of items (b) and (c) within each linker may have the same sequence of nucleotides and may overlap such that one and the same sequence of nucleotides provides the cleavage site of items (b) and that of item (c).
- the cleavage sites of items (f) and (g) within each linker may have the same sequence of nucleotides and may overlap such that one and the same sequence of nucleotides provides the cleavage site of items (f) and that of item (g).
- the system of the invention may comprise from 1 to n multiple destination vectors numbered 1 to n, each of said 1 to n destination vectors having segments (I) to (III) as defined above and optionally a segment (IV) as defined above.
- the cleavage sites of item (III) of all destination vectors may be identical and all cleavage sites of item (I) of all n destination vectors may be unique among the cleavage sites of item (III).
- the n-th item (I) cleavage site of all n destination vectors is complementary to the n-th item (i) cleavage site of the entry DNA.
- FIG. 4 shows a set of 7 level 2 destination vectors having identical item (III) cleavage sites (GGGA) that are complementary to the GGGA cleavage sites of the linkers.
- the item (I) cleavage sites of each level 2 destination vector is complementary to the level 1 destination vector shown in the same line in the left-most column.
- the optional insert of item (IV) of the destination vector(s) may be any sequence linking items (III) and (I), whereby the destination vector will be a circular molecule of vector. Absence of the insert of item (IV) may mean that the destination vector is linear DNA molecule. Preferably, however, an insert is used that is or comprises a marker gene that allows to distinguish cell clones containing the destination vector from those containing the product of the level 2 reaction. Since the restriction sites of the destination vector are in divergent orientation with respect to the insert (see FIG. 4 ), the insert is lost in the level 2 reaction. Thus, destination vectors and level 2 reactions products can be distinguished by the absence of red color in cell clones containing the latter.
- entry DNAs and nucleic acid fragment constructs differ in that the latter contain a sequence segment (item (ii′)) of the nucleic acid construct of interest to be produced.
- each sequence portion of item (ii) of each entry DNA 1 to n generally comprises a further pair of two type IIs restriction endonuclease recognition sites oriented such that said further pair of recognition sites can be removed from said entry DNAs by treatment with type IIs restriction endonuclease(s) recognising said further pair of recognition sites.
- the further pair of two recognition sites are the Bsal sites in the level 1 destination vectors.
- sequence region of item (ii) is not particularly restricted and may be as short as a few nucleotides. In one embodiment, however, the sequence region of item (ii) contains a reporter gene or reporter genes allowing color selection of cell clones containing the reporter gene(s). Said further pair of recognition sites may flank the reporter gene for enabling selection of cell clones for the presence or absence of said reporter gene e.g. by color.
- the reporter gene is lacZ flanked by a pair of Bsal sites.
- the further pair of two type IIs restriction endonuclease recognition sites are recognition sites of endonucleases different from those of recognition sites of item (i) and item (iii) so that a given endonuclease does not cleave a cleavage site of the further pair and, at the same time, a recognition site of item (i) or (iii).
- the orientation of the further pair of recognition sites is divergent with respect to the marker gene, so that the reporter gene and the recognition sites is removed upon treatment with the type IIs endonuclease recognising these site, so that these recognition sites are not present in the nucleic acid fragment construct obtained in the level 1 reaction.
- the second type of reactants of the level 1 reaction is one or more modules that can be incorporated into the entry DNAs in the level 1 reaction using the known methodology described in Engler et al. PLoS ONE 4 (2009) e5553. These modules are also referred to herein as “level 0 modules”, since they can be produced in a level 0 reaction.
- An example of a level 1 reaction is schematically shown in FIG. 5 A .
- One or more level 0 modules are ligated together in a desired order with and into the entry DNA to produce the level 1 fragment constructs (level 1 construct), using the inner pair of type IIs restriction sites present in sequence portion of item (ii) of the entry DNA.
- a nucleic acid construct of interest is produced from at least m nucleic acid fragment constructs numbered 1 to m.
- Each nucleic acid construct of interest typically comprises a sequence segment to be incorporated into the nucleic acid construct of interest. These sequence segments may be numbered 1 to m as the nucleic acid fragment construct containing them in the order of occurrence in the nucleic acid construct of interest.
- Numeral m is an integer of at least 3, preferably at least 6, more preferably at least 10.
- Said method comprises the steps (A) to (C) as described in the following.
- step (A) the m nucleic acid fragment constructs are provided.
- Each of said m nucleic acid fragment constructs comprising in this order:
- the m nucleic acid fragment constructs can be provided in (separate) level 1 reactions using the entry DNAs of the system of the invention and at least one module per type of fragment construct that provides the core portion to the sequence segment of item (ii′).
- the level 1 reaction one or several such modules may be combined to generate the fragment constructs with the desired core portion comprising portions derived from multiple modules.
- the modules used in the level 1 reaction are also referred to herein as “level 0 modules”, as they can be prepared in a restriction and ligation step before the level 1 reaction.
- the level 1 reaction may be performed as explained with reference to FIGS. 7 and 8 using known methods and as described herein.
- the downstream cleavage sites of nucleic acid fragment constructs 1 to m ⁇ 1 are complementary to the upstream cleavage sites of nucleic acid fragment constructs 2 to m, respectively, for allowing assembly of the nucleic acid fragment constructs in the order corresponding to the numbering of the constructs in the subsequent step (B).
- the recognition sites of items (i′) and (iii′) as well as the upstream and downstream cleavage sites of item (ii′) are derived from the entry DNAs used, whereas the core portion is essentially derived from the level 0 modules.
- the products of the level 1 reaction are generally transformed into cells for amplification and purification. Typically, they are transformed into competent bacterial cells such as E. coli cells. After cell growth, the fragment constructs are isolated from the cells, e.g. using standard plasmid preparation protocols, for use in the following step (B).
- the method of the invention comprises two steps wherein fragment constructs are combined, namely the following steps (B) and (C).
- steps (B) and (C) at least one fragment construct is used in step (C) that is derived from the same entry DNA as a fragment construct used in step (B).
- the method of the invention allows reuse of entry DNAs for more than one nucleic acid fragment. It is an important aspect of the invention that many different fragment constructs can be combined with a relatively small number entry DNAs. However, in this embodiment, fragment constructs derived from the same entry DNA have the same upstream and downstream cleavage sites (ii′) and are therefore used in separate reactions to avoid statistical inclusion of either fragment construct at a selected position into the final nucleic acid construct of interest.
- the downstream cleavage site of a nucleic acid fragment construct u is complementary to the upstream cleavage site of the type IIs restriction endonuclease recognition site of item (ii′) of nucleic acid fragment 1 (illustrated in FIG. 4 by the long dashed arrow linking cleavage sites TGCC of pL1F-1 and pL1F-7).
- Step (B) is a level 2 reaction.
- step (B) is a level 2i-1 reaction.
- step (B) the sequence segment(s) of item (ii′) of nucleic acid fragment constructs 1 to s, wherein s is an integer ⁇ u, and said linker are ligated, in this order, and inserted into said destination vector.
- This may be done by reacting, in the presence of a type IIs restriction endonuclease recognising said type IIs restriction endonuclease recognition sites of items (i′) and (iii′) and items (I) and (III) of the destination vector defined below and in the presence a DNA ligase, in reaction medium compatible with activity of said type IIs restriction endonuclease and said ligase.
- a mixture comprising nucleic acid fragment constructs 1 to s, the destination vector and a linker may be treated with the type IIs restriction endonuclease and the DNA ligase in a reaction medium compatible with activity of the type IIs restriction endonuclease and the ligase.
- s defines the number of nucleic acid fragment constructs combined in step (B) with the (level 2) destination vector. Since s is smaller than u, nucleic acid fragment construct u+1 and higher will not be used in step (B), but in a subsequent step such as step (C).
- Cleavage site (b) of the linker may be complementary to the downstream cleavage site of item (ii′) of nucleic acid fragment construct s, and cleavage site (g) of said linker and the cleavage site of item (III) of the destination vector may complementary for connecting the downstream cleavage site of fragment construct s to site (III) of the destination vector.
- Step (B) may comprise transformation of the restriction and ligation product into cells for amplification and purification. Typically, it is transformed into competent bacterial cells such as E. coli cells. After cell growth, the level 2 construct is generally isolated from the cells, e.g. using standard plasmid preparation protocols, for use in the following step (C).
- Step (C) is a subsequent level 2 reaction.
- step (C) is a level 2-2 or level 2i-2 reaction.
- a mixture comprising the recombination product of step (B) (a “level 2i-1 construct”) and nucleic acid fragment construct(s) s+1 to m is treated with a type IIs restriction endonuclease recognising said type IIs restriction endonuclease recognition sites of items (i′) and (iii′), a type IIs restriction endonuclease recognising said type IIs restriction endonuclease recognition sites of items (d) and (e) of the linker and a DNA ligase in a reaction medium compatible with activity of said type IIs restriction endonucleases and said ligase.
- sequence segments of item (ii′) of nucleic acid fragment constructs s+1 to m and optionally a further linker as defined in item (3) are inserted into the cleavage sites provided by items (c) and (f) of the linker used in step (B).
- the recognition sites of items (i′) and (iii′) may be the same as the recognition sites of items (d) and (e) of the linker, whereby a type IIs restriction endonuclease recognising all these recognition sites can be used.
- the linker may be of the type pELE shown in FIG. 4 , whereby no further level 2 reaction can be performed with the reaction product of step (C).
- nucleic acid constructs of interest can be made from more than m nucleic acid fragment constructs.
- the possibility to use one or more further level 2 reactions is schematically shown in FIG. 6 .
- Step (C) may comprise transformation of the restriction and ligation product into cells for amplification and purification. Typically, it is transformed into competent bacterial cells such as E. coli cells. After cell growth, the construct of step (C) may be isolated from the cells, e.g. using standard plasmid preparation protocols.
- the present invention provides a further system for producing a nucleic acid construct of interest. Similar as with the system described above, consecutive repetitions of cloning steps and re-use of the cleavage sites from a predefined set of vectors allows to increase the number of fragments that make up a nucleic acid construct of interest in a vector.
- a set of n destination vectors is used that are referred to as “level M destination vectors”.
- Level M destination vectors differ from level 2 destination vectors in that an additional type IIs restriction endonuclease recognition site is present (compare the level 2 destination vectors of FIG. 4 with the “level M destination vectors” in FIG. 31 ).
- the additional recognition site has a cleavage site of the same nucleotide sequence as the cleavage site of item (I) of the level 2 destination vectors. Fragment constructs or entry DNAs are inserted into the level M destination vectors together with linkers referred to as “linkers M”.
- Linkers M differ from linkers such as linkers pELE shown in FIG. 4 in that they have an additional type IIs restriction endonuclease recognition site (compare linkers pELE of FIG. 4 with the “end liners M” in FIG. 31 ).
- the additional recognition site of linkers M has a cleavage site of the same nucleotide sequence as the cleavage site on the left hand side of linkers pELE of FIG. 4 .
- destination vectors M and linkers M allow excision of constructs cloned into the designation vectors M and introduction, preferably with other constructs produced in parallel level M reactions, into a further destination vector referred to as “level P destination vector” together with a suitable linker referred to as linker P.
- level P destination vector a further destination vector
- linker P a suitable linker referred to as linker P.
- destination vectors M and linkers M destination vectors P and linkers P are designed such that excision of constructs cloned into the designation vectors P is possible as well as reintroduction into a further level M destination vector. Since in each cloning step, multiple fragment constructs prepared in parallel preceding steps can be combined, the number of fragment constructs combined into a construct of interest can be increased multiplicatively, which is indicated by letter M in “destination vector M”.
- a set of n destination vectors M and a set of n linkers M allows reuse of a limited number of n cleavage sites such that a large number of fragment constructs (90 in FIG. 36 ) can be assembled with a small number of elements n in said sets.
- n is at least 2, preferably at least 3, more preferably at least 4.
- the versatility of the system increases with increasing n.
- n may be a number of from 3 to 20, preferably of from 4 to 10, more preferably of from 5 to 9 or from 6 to 8.
- the cleavage sites of items (II′) and (III′) of destination vectors M may overlap completely.
- one physical sequence of nucleotides provides the cleavage sites of two different type IIs restriction endonuclease recognition sites.
- one physical sequence of nucleotides may provide the cleavage sites of two different type IIs restriction endonuclease recognition sites, namely the cleavage sites of items (b′) and (c′), of items (II′) and (III′) and of items (b′′) and (c′′).
- This embodiment is used in the examples shown in the figures. However, it is also possible that the cleavage sites of the pairs mentioned before are adjacent separated cleavage sites.
- the number of entry DNAs to be used in not decisive in the system of this embodiment It is possible that one entry DNA is incorporated into a level M destination vector, optionally followed by incorporation of one level 1 construct into a level P destination vector. However, the main advantages of the system can be made use of if at least 2, at least 3, at least 4, or at least 5 entry DNAs are combined by introduction into a destination vector M.
- the recognition sites of items (i), (iii), (I′) and (VII′), (a′), and (f) may be recognition sites of the same type IIs restriction endonuclease.
- Multiple level M reactions can be conducted in parallel in separate reaction vessels as indicated in FIGS. 32 a and b .
- the separate level M constructs may be combined in a subsequent level 2 or level P reaction.
- the destination vector M used in the second level M reaction is selected such that it has a complementary cleavage site to the cleavage site provided by the linker M used in the first level M reaction.
- two or more level M constructs can be combined in a level 2 reaction ( FIG. 32 c ) or a level P reaction ( FIG. 34 a ).
- FIG. 32 c it is the TTAC cleavage site in the level M construct 1 that is derived from a linker M used in a first level M reaction.
- the TTAC cleavage site in the level M construct 2 is derived from a destination vector M used in the second level M reaction. Cleavage by Bsal allows ligation of the construct comprising TU1 to TU3 with the construct comprising TU4 and TU5 into the destination vector level 2.
- a destination vector P and linker P can be treated with Bsal and ligase together with level M constructs 1 and 2 to give the level P construct shown at the bottom of FIG. 34 a.
- n destination vectors P the same set of cleavage sites is used as in the destination vectors M.
- the cleavage sites (VI′′) of all n destination vectors P are identical and are at the same time identical to the cleavage sites of items (VI′) of destination vectors M.
- the set of n linkers P the same set of cleavage sites is used as in the linkers M and the destination vectors M.
- a limited set of n cleavage sites allows to combine a number of fragments constructs that can far exceed the number of n.
- FIG. 34 b multiple level P constructs can be combined into another destination vector M using a linker M.
- the recognition sites of items (I′′), (IV′), (d′), (a′′) and (f′′) may be recognition sites of the same type IIs restriction endonuclease, and the recognition sites of items (IV′′), (I′), (VII′), (a′) and (f′) may be recognition sites of the same type IIs restriction endonuclease.
- Item (VIII′) of destination vector M may have a marker allowing selection of ligation product of a level M reaction for absence of item (VIII′).
- the marker may be lacZ for blue/white selection.
- Item (VIII′′) of destination vector P may have a marker allowing selection of ligation product of a level P reaction for absence of item (VIII′′).
- the designation vectors and the linkers are circular molecules or plasmids containing a selectable marker in their backbone.
- An advantage of systems and methods of the invention is that cloning steps can be done as one-pot reactions, requiring only a simple incubation such as in a thermocycler. In particular, this avoids the need for labor-intensive and operations that are difficult to automate such as purification of DNA fragments from agarose gels. This mean that all the elements required for the design of a completely automatized cloning system are now in place. Operations that are employed are preparation of miniprep DNA, liquid handling and incubation to perform restriction-ligation, plating of transformations on plates, picking of colonies, and digestion and analysis of miniprep DNA. This last step may be replaced by DNA sequencing, as very few colonies need to be screened to obtain the desired construct (in the majority of cases, one colony is sufficient). All these operations can easily be handled by standard automation robots. This aspect promises to revolutionize the number of constructs that can be made within a given time as well as the production costs for these constructs, and therefore opens the door for new applications that will be required in the field of synthetic biology.
- Another advantage of the invention compared to more traditional cloning strategies is that complex design of specific construction strategies is not needed anymore, since the design is automatically defined by the number and the order of modules (or genes) that a user wants to assemble.
- the cloning strategy in fact can be easily and unambiguously determined by a simple computer program. This program can be directly linked to the automation robots that would physically make the construct.
- An advantage of such system is that the cloning strategy itself cannot become a limiting factor when constructs reach a large size, since the same principles and the same cloning vectors can be reused indefinitely. In fact, it is conceivable that the invention can be used to clone entire chromosomes.
- the system of the invention allows the production of nucleic acid constructs of interest from multiple nucleic acid fragments constructs using a combination of nucleic acid fragment constructs via single-stranded overhangs formed at both ends of the fragments using type IIs restriction endonucleases.
- type IIs restriction enzymes are used.
- the type IIs restriction endonuclease recognition site is a recognition site of a restriction endonuclease recognizing a double-stranded DNA and cleaving the double-stranded DNA at a cleavage site that is outside the recognition site on the double stranded DNA.
- the type IIs restriction endonuclease cleaves such that, depending on the specific type IIs restriction endonuclease, overhangs of from 3 to 6 nucleotides are produced. However, it is also possible to use type IIs endonucleases producing longer single-stranded overhangs.
- the nucleotide range that forms the overhangs upon cleavage is referred to herein as cleavage site. Since the nucleotides of the cleavage site are not part of the recognition site, they can be chosen as desired without destroying cleavage activity of the type IIs restriction endonuclease. Examples of type IIs restriction endonucleases suitable for the methods of the invention are given below.
- Bsal and Bpil any type IIs restriction enzyme that provides “sticky” ends sufficient for efficient ligation at its cleavage sites can be used.
- a selection of such enzymes is provided on the REBASE webpage (rebase.neb.com/cgi-bin/asymmlist) and in the review of Szybalsky et al. (1991 , Gene, 100:13-26).
- Type II restriction enzymes with asymmetric recognition sites e.g. those shown in this webpage
- have cleavage site outside of recognition site and provide upon cleavage of at least three, preferably 4 or more nucleotide residues overhangs e.g.
- Bli7361; BpuAI, VpaK321, SfaNI, etc. can be used in the invention. It is recommended that the recognition site contains at least 4, more preferably at least 6 or more base pairs in order to minimize the chance for such site to be found in a sequence portion of interest.
- Type IIs restriction nucleases with 5 bp recognition sites e.g. SfaNI
- Type IIs restriction endonucleases that produce 4 nt single-stranded overhangs at the extremities of digested fragments can theoretically generate ends with 256 possible sequences.
- Type IIs restriction enzymes having even longer recognition sites, e.g. comprising ten or more base pairs have been engineered.
- the largest recognition site among natural type IIs enzymes is for the enzyme SapI which has a 7 bp recognition site.
- a preferred solution is the use of artificial type IIs enzymes engineered to have a long recognition site (Lippow et al, 2009, Nucleic acides Res., 37:3061-3073).
- a type IIs enzyme with a 18 bp recognition sites would be expected to cut only a few times per eukaryotic genome at most, and would allow to make most entry modules without having to change any nucleotide of the native sequence.
- ligases to be used in the invention include T4 DNA ligase, E. coli DNA ligase, Taq DNA ligase, all of which are commercially available from New England Biolabs.
- FIG. 1 A shows elements of a system that allows re-use of the entry DNAs of the invention (level 1 destination vectors) for different inserted sequence segments.
- This system comprises:
- the cleavage site at the 3′ end (right hand side in the figures) of the last construct (na7) forms a single-stranded overhang compatible with the overhang created by cleavage of the cleavage site at the 5′ end (left hand side in the figures) of the first fragment construct na1 by restriction endonuclease digestion, as indicated by the same numbering “S1” at these sites; (2) a set of n ‘end-linkers’ (ELx, x indicating the numbering from 1 to 7) flanked on one side (5′ end) with a cleavage site compatible with the 3′ cleavage sites (S1 to S7) of the nucleic acid fragment constructs (as well as the underlying entry DNA) and on the other side (3′ end) with a unique site not compatible with any of the n entry DNAs (S8); (3) a destination vector with two cleavage sites, one site compatible with sites S1 (or S2, S3, S4, S5, S6, S7), and the other site
- FIG. 1 B provides an example for cloning of three nucleic acid fragment constructs into a destination vector. Cloning of the three nucleic acids fragment constructs employs ligation of the appropriate end-linker (end-linker 3). The resulting construct can be later re-opened at cleavage sites S4 and S8 by digestion with the appropriate type IIs endonuclease. All other sites lack a flanking type IIs endonuclease recognition site in the reaction product and are thus protected from digestion with the endonuclease used for the production of this reaction product.
- FIG. 1 B provides an example for cloning of three nucleic acid fragment constructs into a destination vector. Cloning of the three nucleic acids fragment constructs employs ligation of the appropriate end-linker (end-linker 3). The resulting construct can be later re-opened at cleavage sites S4 and S8 by digestion with the appropriate type IIs endonuclease. All other sites lack a flanking type II
- FIG. 1 C shows that further nucleic acid fragment constructs na4 to na7 and a further construct na1 that is based on the same entry vector number 1 as na1 ligated in FIG. 1 B can be cloned into the product vector obtained in FIG. 1 B .
- a different end-linker is used (ELx or ELx-b that differ by containing different internal type IIs restriction site for reopening the construct at the next stage).
- the structure of the end-linkerse will be more specifically explained in the following figures).
- FIG. 2 explains how type IIs restriction sites are depicted in the following figures.
- a type IIs restriction endonuclease site contains a recognition sequence (also referred to herein as “recognition site”) and a cleavage site located outside of the recognition sequence.
- the nucleotide sequence of the cleavage site is shown in a horizontally elongated box.
- the recognition site can be found on either side of the cleavage site, depending on the orientation of the asymmetrical recognition site in the DNA.
- the recognition site is illustrated as a vertically elongated box on the left half under the box representing the cleavage site.
- the recognition site When the recognition sequence is located on the right of the cleavage site, the recognition site is illustrated as a vertically extended box on the right half under the box representing the cleavage site.
- the nucleotide sequences of the top DNA strand are shown next to the represented type IIs restriction sites.
- two restriction sites are shown the cleavage sites of which are oriented towards each other, but are non overlapping and separated by an “optional sequence”.
- two restriction sites are shown the cleavage site of which overlap over the entire range of 4 base pairs.
- FIG. 3 shows a general embodiment wherein a eukaryotic multi-gene construct is produced as the nucleic acid fragment of interest.
- FIG. 3 illustrates a general strategy that can be used.
- Basic genetic elements such as promoters, 5′ untranslated regions (5′ UTRs), signal peptides (SP), open reading frames (ORFs), and terminators (T) are cloned as basic modules (“level 0 modules”).
- the libraries of ‘level 0’ entry modules may be stored until needed for cloning.
- genetic elements cloned as level 0 modules are chosen (second row in the FIG. 3 ), and are assembled, in a level 1 reaction, using a one-pot Golden Gate cloning reaction into an entry DNA.
- the level 1 reaction product (schematically shown in the third row), referred herein as nucleic acid fragment construct, contains an assembled transcription unit comprising the chosen elements in the desired order.
- two or more (11 in the depicted example) different transcription units are assembled in a desired order into a destination vector (level 2) to obtain a level 2 multigene construct of interest.
- the basic gene structures for secreted and cytosolic proteins are shown. Since the latter have no signal peptide (SP), the ORF level 0 modules for cytosolic proteins may have the cleavage site sequences of the signal peptides used for secreted proteins for allowing linking of the ORF module with the 3′ end of the module for the 5′ UTR in the level 1 reaction.
- SP signal peptide
- FIG. 4 depicts a set of level 1 destination vectors, level 2 destination vectors, and end-linkers.
- the level 1 destination vectors and end-linkers are cloned in plasmids with a carbenicillin resistance selectable marker, while level 2 destination vectors carry a kanamycin resistance gene. Other antibiotic resistance genes could also be used instead of the ones used in this example.
- Level 1 destination vectors of the series pL1F-1 to -7 and of the series pL1R-1 to -7 differ by the cleavage sites of the internal pair type IIs restriction sites (the Bsal sites). The sites of both series are designed for cloning of the same assembled transcription units, but in opposite orientation.
- the level 1 destination vectors have an internal lacZ reporter gene that is removed together with the recognition sites of the internal pair of type IIs restriction sites (the Bsal sites) in the level 1 reaction due to the divergent orientation of these sites with respect to LacZ. Thus, lacZ is not present in the level 1 reaction product, allowing blue/while selection of cell clones containing the reaction product.
- the outer restriction sites in the level 1 destination vectors (the Bpil sites) are in convergent orientation with respect to the Bsal sites and lacZ and are unchanged in the level 1 reaction, since the restriction endonuclease of the inner pair of restriction sites (Bsal), but not the restriction endonuclease of the outer pair of restriction sites (Bpil), is used in the level 1 reaction.
- Straight dashed arrows indicate the complementarity of the cleavages sites of the right-hand Bpil sites of each of entry DNAs 1 to 6 with the left-hand entry DNA directly underneath which allows the ligation of entry DNAs 1 to 7, as well as the level 1 constructs derived therefrom, in the level 2 reaction via complementary single-stranded overhangs produced by Bpil digestion.
- Complementarity of the right-hand cleavage site of the bottom entry DNA (of sequence TGCC) with the left-hand cleavage site of the first entry DNA allows to reuse the entry DNA 1 and the following entry DNAs in a second level 2 reaction.
- the reused entry DNAs may be provided in a level 1 reaction with a different insert compared to the insert (referred to as “core portion of the sequence segment” in item (ii′) of the method of the invention).
- core portion of the sequence segment in item (ii′) of the method of the invention.
- more nucleic acid fragment constructs can be combined into a nucleic acid construct of interest than the number of elements of the set of entry DNAs.
- “Divergent” herein means that the two cleavage sites of a pair of restriction sites are more remote than the recognition sites from a position between the two restriction sites.
- Convergent herein means that the two recognition sites of a pair of restriction sites are more remote than the cleavages sites from a position between the two restriction sites.
- the set of level 2 destination vectors shown has the same number of elements as the number of level 1 destination vectors.
- the level 2 destination vectors have a pair of divergent (with respect to the central portion in which nucleic acid fragment constructs are inserted in the level 2 reaction) type IIs restriction sites flanking genes (“CRed”) providing a red phenotype.
- CRed type IIs restriction sites flanking genes
- each entry DNA can be used to produce a nucleic acid fragment construct that will take position 1 in the level 2 reaction product.
- Three sets of end-linkers are depicted, each set generally having the same number of elements as the number of level 1 and level 2 destination vectors.
- Sets pELB and pELR are similar in that they have the same cleavage sites and outer recognition sites.
- Sets pELB and pELR both have a further inner recognition site that will be unchanged in the level 2 reaction, whereby they are present in the level 2 reaction product. Thus, they can be used for inserting, in a second or further level 2 reaction, further nucleic acid fragment constructs into the reaction product of the first level 2 reaction. This is not possible if an end-linker from the pELE set is used, since these lack the inner divergent pair of restriction sites.
- Sets pELB and pELR differ in that different inner recognition sites (Bsal versus Esp3I) are used and in that different central reporter genes for color selection of cell clones are used.
- All end-linkers can be used for joining the nucleic acid fragment constructs derived from the level 1 destination vectors to the downstream cleavage site of the level 2 destination vectors using cleavage site GGGA. Thus all destination vectors and all end-linkers have the same downstream cleavage site (GGGA). For each downstream Bpil cleavage site of the entry DNAs there is a linker having a complementary upstream Bpil cleavage site.
- FIG. 5 A illustrates the structure of the modules depicted in FIG. 3 in greater detail and shows how these modules can be assembled.
- constructs can be assembled by mixing in one tube all constructs or DNA fragments required, together with the appropriate type IIS enzyme (indicated above the horizontal reaction arrows) and ligase, and incubating the mix under conditions allowing restriction enzyme digestion and ligation.
- Level 0 modules have an insert of interest (for example a promoter sequence, P1) located between two convergent type IIs restriction sites (Bsal in the example shown).
- Level 0 modules can be cloned by a number of different procedures, and one example is shown here, starting from either PCR products or level-1 constructs (top row of the figure designated, level 0′′). In this example, cloning is performed using the enzyme Bpil in a compatible level 0 destination vector. Methods for such cloning are known from the literature, see e.g. Engler et al. PLoS ONE 4 (2009) e5553.
- level 0 modules are then assembled and cloned on level 1 into a level 1 destination vector using a Golden Gate cloning reaction with a second type IIs enzyme, here Bsal.
- Bsal a second type IIs enzyme
- the resulting level 1 constructs contain, for example, assembled transcriptional units (TUs).
- level 1 constructs in the present example, 2 such constructs indicated by “TU1” and “TU2”) are then assembled together with a selected end-linker (pELE-2, see FIG. 4 ) in a compatible level 2 destination vector (pL2-1 for example).
- a selected end-linker pELE-2, see FIG. 4
- pL2-1 a compatible level 2 destination vector
- both level 1 constructs have to be compatible for level 2 assembly, i.e. having convergent terminal cleavage sites.
- the first level 1 construct corresponds to position 1 (the level 0 modules were cloned in the level 1 destination vector pL1F-1 depicted in FIG. 4 ) and the second one corresponds to the next position (position 2), the level 0 modules of which were cloned in the level 1 destination vector pL1F-2).
- level reaction is referred to as, level 2-1′′ and the reaction product is referred to as, level 2-1 construct′′).
- level 2 reaction can however be made using end-linker pELB-2 rather than pELE-2 (see FIG. 4 ). Since this end-linker contains a pair of further internal type IIs restriction sites (Bsal), the resulting construct (“level 2i-1 construct”) will also contain such site, and therefore can be used again as level 2 destination vector for a second level 2 reaction for insertion of additional transcriptional units or other fragment constructs.
- end-linker pELB-2 rather than pELE-2 (see FIG. 4 ). Since this end-linker contains a pair of further internal type IIs restriction sites (Bsal), the resulting construct (“level 2i-1 construct”) will also contain such site, and therefore can be used again as level 2 destination vector for a second level 2 reaction for insertion of additional transcriptional units or other fragment constructs.
- cleavage site indicates that only one of several entry clones was drawn due to space limitation.
- Each cleavage site is shown as a box with the 4 nucleotides of the cleavage site; the two boxes below show which type IIS recognition sites flank the recombination sites on the left and right sides.
- P1-a/b stands for promoter fragment 1 or 2; UTR1 stands for 5′ untranslated sequence; T1 indicates a terminator; CRed stands for a red color visual marker encoding canthaxanthin biosynthetic genes.
- FIG. 5 B show nucleotide sequences at the junction of various modules for each step of cloning of FIG. 5 A .
- “Gene 1” is equivalent to “TU 1”.
- Sequence stretches (SS) designated SS1 to SS26 are SEQ ID NO: 31 to SEQ ID NO: 56, respectively.
- FIG. 6 is an overview of alternative cloning strategies. Every cloning step relies on three elements that are different from one level to the next: antibiotic selectable marker, type IS restriction enzyme, and visual selectable marker. Cloning after level 2i-1 requires the use of two type IIS enzymes, such as Bpil-Bsal or Bpil-Esp3I. Using the described set of level 1 destination vectors, level 2 destination vectors and end linkers, and the indefinitely repeatable cloning strategy provided, as many transcription units can be added to a construct as desired by a user, using as many cycles of cloning as required. Physical limits will ultimately be encountered due to handling of very large constructs, but such limits will not come from the cloning strategy itself. The level 2 reactions are numbered by the numeral at position x in level 2-x.
- Level 2-x stands for a level 2 reaction producing a level 2 reaction product that cannot be used for a further level 2 reaction due to the absence of a pair of type IIs restriction sites allowing reopening of the level 2 reaction product (e.g. due to the use of an end-linker of the pELE set, the last “E” indicating “end”).
- Level 2i-x stands for a level 2 reaction that produces a reaction product that is an intermediate (e.g. due to the use of an end-linker of the pELB set) and can thus be used for a further level 2 reaction.
- Each level 2i-x reaction product opens up two possibilities for a further level 2 reaction (indicated by the branching arrows). Depending on the use of the end-linker, the next level 2 reaction will either lead to an end (boxed level 2-x) or will lead to a further intermediate reaction product, allowing a still further level 2 reaction.
- FIG. 7 shows the structure of level 0 modules. All level 0 modules are flanked by two convergent Bsal sites (with respect to the insert such as P1). Five module classes, namely promoter modules, 5′UTR modules, signal peptide modules, ORF modules and terminator modules are depicted. All modules of a same class are flanked by the same cleavage sites (for promoter modules GGAG and TACT). This design allows to clone any module of a given class into a transcription unit using the same cloning strategy. It also allows the use of multiple modules from a module class for obtaining libraries of level 1 fragment constructs.
- Modules of different classes are designed to be compatible for assembly in a multi-fragment level 1 reaction, also referred to herein as “Golgen Gate” cloning reaction, i.e., the sites joining two modules form compatible (complementary) single-stranded overhangs after digestion with a type IIs restriction enzyme, here Bsal.
- a “set” designates a group consisting of 1 module from each of the five module classes.
- FIG. 8 A explains the assembly of transcriptional units from level 0 modules into level 1 destination vectors (entry DNAs).
- Module sets 1 to 9 (each consisting of a set of level 0 modules as shown in FIG. 7 ) are assembled in a level 1 destination vector.
- Modules sets 1 to 7 are assembled in level 1 destination vectors pL1F-1 to -7 (see FIG. 4 ), respectively.
- Module sets 8 and 9 (and optionall further sets) are cloned in the same series of level 1 destination vectors pLF-1, pLF-2 etc, respectively.
- level 1 destination vectors pL1F-1 and -2 (and optionally further vectors) are reused for a different module set than module sets 1 and 2.
- the reaction products are level 1 constructs having flanking Bpil restriction sites retained from the level 1 destination vectors. These flanking restriction sites define with their associated cleavage sites the order in which the level 1 constructs are ligated in the subsequent level 2 reaction. Each reaction shown is generally performed as a separate reaction.
- FIG. 8 B illustrates an example for the assembly of level 0 modules using a different set of level 1 destination vectors.
- the restriction sites for the enzyme Bpil are identical as in FIG. 1 A , but the 4-nucleotide cleavage sites of the restriction sites for the enzymes Bsal and Bpil overlap exactly in each destination vector.
- the cleavage sites of the restriction sites for Bsal are not compatible with the previously described level 0 modules (in FIG. 7 ).
- a set of adaptors that allow joining of the level 0 modules and of the level 1 destination vectors is employed.
- the resulting level 1 constructs have the same overall structure as the constructs made in FIG. 8 A : the different transcription units are all flanked by the same compatible sets of Bpil restriction sites. These sites are convergent in each construct, and are compatible from one construct to the next.
- FIG. 9 A assembly of 5 transcription units into a multigene level 2 construct in a level 2 reaction.
- An appropriate end linker is used for linking the right-hand side of the fifth level 1 construct to the GGGA cleavage site of the destination vector referred to as “Destination 2”.
- the type of end-linker used does not allow the reaction product to be used as a starting material for a further level 2 reaction.
- the term “Golden Gate” means that reactants having type IIs restriction sites with compatible cleavage sites are combined in desired order in a reaction comprising restriction and ligation.
- the level 2 construct is shown on the right hand side in two different representations with different degrees of detail.
- FIG. 9 B Assembly is performed with an end linker containing internal divergent Bsal restriction sites for allowing a further level 2 reaction in the resulting level 2 construct.
- FIG. 9 C shows examples of further rounds of cloning.
- a short version of the reaction of FIG. 9 B is shown, whereby a level 2i-1 construct is obtained.
- 6 transcription units designated TU6 to TU11
- TU12 to TU15 are added to the level 2i-1 construct, leading to a level 2i-2 construct.
- four further TUs are added, leading to the level 2i-3 construct.
- one further TU is added to obtain the level 2-4 construct.
- different end linkers have to be used, that may contain either internal Bsal or Esp3I restriction sites, or none if the final round of cloning is reached.
- FIG. 10 shows the efficiency of cloning for different levels of assembly.
- level 1 and level 2-1 all minipreps analyzed contained only correctly assembled constructs, even though multiple fragments were assembled in a one-pot one-step reaction for each construct.
- the last level 2-1 construct was obtained by assembly in one step of 6 transcription units and one end-linker in one destination vector.
- all minipreps analyzed contained only correct constructs.
- the final construct made contains 11 transcription units. All colonies analyzed contained correct constructs despite the large size of the construct (34 kb),
- FIGS. 11 to 17 show how the cloning system of this invention can be used to create constructs containing one or multiple operons.
- Applications include microbial strain construction for metabolic engineering.
- FIG. 11 illustrates the general strategy used for cloning of prokaryotic operons.
- Basic genetic elements such as promoters, open reading frames (ORFs), and terminators are cloned as level 0 modules. Libraries of level 0 modules or individual level 0 modules can be stored until needed for cloning. To make a desired construct, a selected number of genetic elements cloned as level 0 modules are chosen. Unlike for cloning of eukaryotic multigene constructs, promoters, open reading frames and terminators are not cloned together in a level 1 destination vector.
- operons may contain a different number of open reading frames, preventing the design of a fixed set of compatible cleavage sites for cloning. Rather, promoters, open reading frames and terminators are cloned separately in different level 1 destination vectors to obtain level 1 constructs. Rather than serving the purpose to assemble several level 0 modules together, cloning in level 1 destination vectors for prokaryotes serves mainly the purpose of providing positional information for the cloned level 1 construct for a subsequent level 2 assembly. Complete or partial operons are then cloned from several level 1 constructs in a one-pot one-step cloning reaction on level 2.
- FIG. 12 Comparison of level 1 destination vectors for eukaryotes (“MoClo” and prokaryotes (“MoClo Pro”).
- Level 1 destination vectors (entry DNAs) for cloning promoters and terminators for prokaryotic operons may be the same as level 1 destination vectors made for eukaryotes ( FIG. 4 ).
- level 1 destination vectors made for prokaryotic ORFs open reading frames
- they contain a ribosome binding site between the first Bpil and Bsal cleavage sites.
- all destination vectors also have the same general structure in common, namely two convergent type IIs enzymes flanking the DNA sequence of interest, and cleavage sites designed to fit the structure of nucleic acids 1 to 7 shown in FIG. 1 A .
- FIG. 13 illustrates the cloning of 5 level 1 constructs (shown in the lower part) from level 0 modules into level 1 destination vectors (shown in the middle part) in a level 1 reaction.
- Level 1 destination vectors shown in FIG. 12 are used.
- Five level 1 constructs, namely a promoter construct, 3 ORF constructs and a terminator construct are obtained with compatible cleavage sites for ligation, in the order given, in the subsequent level 2 reaction.
- FIG. 14 shows the level 2 reaction from the level 1 constructs made as shown in FIG. 13 to produce a functional operon in a level 2 construct.
- FIG. 15 shows a similar level 2 reaction as shown in FIG. 14 , but with an end-linker containing internal Bsal sites for allowing a further level 2 reaction.
- FIG. 16 illustrates the preparation of 3 additional level 1 constructs (designated 6, 7 and 1, respectively) containing a second promoter (P2) and two further ORFs (orf4 and orf5).
- the right hand cleavage site thereof having sequence TGCC allows reuse of the first level 1 destination vector (pL1 P-1 in FIG. 12 ) for orf5.
- FIG. 17 Cloning of the three additional level 1 constructs obtained according to FIG. 16 into the level 2i-1 construct made according to FIG. 15 .
- the reaction product shown at the bottom is a level 2i-2 construct, since the end-linker used allows a subsequent level 2 reaction.
- FIGS. 18 A- 18 C show the structure of transcriptional units, level 0 modules, and of destination vectors required for their cloning.
- FIG. 18 A The transcriptional units contain up of 5 basic modules separated by 4 nucleotides sequences that serve as recombination sites (shown in boxes).
- FIG. 18 B Level 0 modules shown on the first line are flanked by Bsal sites. The modules are cloned in a level 0 reaction using the enzyme Bpil and one of the level 0 destination vectors shown underneath.
- FIG. 18 C Strategy for removing internal type IIS recognition sequences. Removal of a Bsal site in a fragment of interest is done by amplifying two fragments with primers pr1 (SEQ ID NO: 27) and 2 (SEQ ID NO: 28) and primers pr3 (SEQ ID NO: 29) and 4 (SEQ ID NO: 30). Sequences of the Bpil recognition sites in the 5′ extensions in the primers (horizontal arrows) are shown in bold. The two fragments are cloned using Bpil in the appropriate level 0 destination vector, for example pL0-P in the present example.
- FIGS. 19 A- 19 D show an example for cloning of constructs of level 0, 1 and 2. Antibiotic resistances are indicated.
- FIG. 19 A Illustrates cloning of level 0 promoter modules.
- FIG. 19 B Illustrates cloning of a level 1 construct containing a transcription unit.
- FIG. 19 C Illustrates cloning of a level 2i-1 construct containing 5 transcription units, TU1 (containing GFP), TU2 (containing p19), TU3 (containing VP2), TU4 (containing VP5), TU5 (containing VP7) into destination vector pL2-1.
- FIG. 19 D Illustrates cloning of a level 2-2 construct containing 11 transcription units.
- the construct contains TU6 (transcription unit with VP3), TU7 (transcription unit with BAR), TU8 (transcription unit with antibody light chain), TU9 (transcription unit with antibody heavy chain), TU10 (transcription unit with TMV MP), TU11 (transcription unit with TMV CP).
- FIG. 20 A shows the structure of the 11 level 1 transcription units used in FIG. 19 B .
- FIG. 20 B shows the structure of construct plCH51811 that is obtained by cloning of the 11 transcription units cloned in a level 2-2 construct.
- FIG. 21 shows the expression of GFP in a Nicotiana benthamiana leaf after infiltration of level-2 constructs shown in FIG. 10 .
- the level-2 constructs were transformed in Agrobacterium tumefaciens , and the transformed bacteria infiltrated in leaves using a syringe without a needle. GFP expression was observed at 5 dpi under UV light. The number in parenthesis indicates the number of transcription units in each infiltrated construct.
- FIG. 22 shows prokaryotic genes cloned as level 0 entry modules.
- 3 genes from Pantoea ananatis crtE, I, and B are involved in lycopene biosynthesis.
- Other genes known to increase lycopene expression when overexpressed in E. coli were also cloned: dxs from Agrobacterium rhizogenes strain K84 and E. coli strain K12, the ispA genes from the same two species, and the idi and AppY genes from E. coli strain K12.
- Two promoters were also cloned as level 0 modules: the Lac Z promoter from pUC19, and a promoter from Pantoea ananatis.
- FIG. 23 A shows level 1 destination vectors for cloning of prokaryotic coding sequences. All constructs are in fact libraries (indicated by the L in the construct name) that have variable sequences flanking the RBS.
- FIG. 23 B shows two sets of end linkers that contain the LacZ terminator from pUC19.
- FIG. 24 shows a list of level 1 constructs made with the level 0 modules shown in FIG. 22 .
- FIG. 25 shows level 2i-1 constructs.
- Two libraries were made with genes for lycopene biosynthesis.
- Both constructs pICH5648L and pICH5850L are in fact libraries in which all clones obtained differ in the sequence flanking the RBS of all three cloned genes.
- FIGS. 26 A- 26 B show addition of two or three more genes to the operons obtained in FIG. 25 . Since a library of different constructs was placed in the cloning mixes (pICH5447L to pICH5452L in FIG. 26 A , or pICH5455L in FIG. 26 B ) the constructs obtained consist of libraries of constructs that differ in the genes present at positions 5 and 6 or 5, 6 and 7. The number of different gene combinations for library pICH5920L is in theory 216 possibilities, without even considering the variation provided by variability in the 5 RBS regions of the 5 genes per operon.
- FIG. 27 shows that level 2 cloning can be followed by a level 3.
- a new type IIs enzyme needs to be used, such as SapI in the case shown.
- FIG. 28 shows vectors required for level 3 cloning.
- Level two destination vectors are flanked by restriction sites of a new type IIs enzyme (SapI in this example).
- Level 3 destination vectors and end-linkers are similar as for level 2, but with the new type IIS enzyme.
- Numbers 1 to 8 represent eight different 3 nucleotide sequences as the cleavage site of SapI.
- FIG. 29 A shows a set of destination vectors and end-linkers for assembly of several level 1 transcription units (or more generally nucleic acids “na”) into a level M destination vector (M stands for multiplication).
- Level M destination vectors and corresponding end-linkers (ELM1 to 7) are designed in such a way that blocks of assembled transcription units (or nucleic acid fragments) cloned in level M destination vectors become flanked by cleavage sites for a type IIs restriction enzyme, which will be used for the next step of cloning.
- level 2 constructs where types IIs enzymes restriction sites are located at the end of the constructs in the end-linker here the type IIs enzymes flank the assembled nucleic acid fragments, allowing them to be further subcloned in a new vector.
- FIG. 29 B illustrates assembly of three transcription units (nucleic acid acids na1 to 3) in a level M destination vector.
- the linker is chosen from the set of linkers ELM1 to ELM7 such that na3 can be linked to cleavage site S8 of destination vector M.
- FIG. 29 C illustrates assembly of two transcription units (nucleic acids na4 and 5) in a level M destination vector.
- the linker is chosen from the set of linkers ELM1 to ELM7 such that na5 can be linked to cleavage site S8 of destination vector M.
- FIG. 29 D shows assembly of the two pre-assembled blocks of transcription units (the reaction products of the reactions shown in B and C) in a level 2 construct employing a level 2 destination vector. Details of the restriction sites of destination vectors and end-linkers are shown in FIGS. 31 and 32 (below).
- FIG. 30 shows that level M can be used as an intermediate step between level 1 and 2. Examples for color selection, antibiotic selection and type IIs enzymes for each step are shown.
- FIG. 31 shows the structure of the entry DNAs as well as vectors required for level M cloning.
- the orientation and position of type IS restriction sites in destination vectors and end-linkers is such that assembled transcription units (or nucleic acid fragments) in level M constructs can be excised by a type IIs enzyme, in this case Bsal.
- Entry DNAs are flanked by type IIs restriction sites in opposite orientations (here Bpil sites).
- the cleavage sites of these two sites are each identical to one of the seven cleavage sites present on the left-hand side of level M destination vectors and linkers.
- the 7 cleavage sites on the left-hand side of the 7 destination vectors are unique (and different from one another).
- the same set of 7 cleavages sites is present on the left-hand side of the linkers M.
- FIGS. 32 A-C show an example where blocks of three ( FIG. 32 A ) and two ( FIG. 32 B ) transcription units are separately pre-assembled into level M destination vectors before being assembled in a level 2 vector in a subsequent step ( FIG. 32 C ).
- the destination vectors and end-linkers for level M are as shown in FIG. 31 .
- the structure of entry DNAs for level P cloning is identical to the structure of entry DNAs for level M, except that restriction sites for a different type IIs enzymes are present (here for Bsal).
- Level M constructs have a structure corresponding to level P entry DNAs. Thus, level M constructs can be used as entry DNAs in a level P reaction. Level P constructs become entry DNAs for a next round of level M cloning.
- FIG. 34 A illustrates a level P reaction wherein two level M constructs are assembled into a level P destination vector using an end-linker P to give a level P construct.
- FIG. 34 B illustrates a level M reaction wherein two level P constructs are assembled into a destination vector M (“destination M”) using an end-linker M to give a Level M construct.
- FIG. 35 Cloning from level P to level M and vice-versa can be repeated indefinitely by reusing the same set of vectors and end-linkers shown in FIG. 33 .
- FIG. 36 illustrates the structure of constructs obtained in successive level M and level P cloning steps.
- 90 transcription units or nucleic acid fragments
- 5 Sets of 6 transcription units are assembled in parallel in separate reactions in the first level M cloning step.
- the resulting 5 constructs are then assembled in one construct in the following cloning reaction on level P.
- This construct and two other compatible constructs construction not shown for lack of space), each containing 30 assembled transcription units (or nucleic acid fragments) are then assembled in a third step of cloning in a level M destination vector.
- Boxes with numbers 1 to 7 represent nucleic acid fragments or transcription units, with such numbers referring only to the nature of the type IIs enzyme cleavage sites (position as defined earlier) flanking them, and not to the sequence of the nucleic acids contained between these two sites.
- Plasmid DNA preparations were made by using the NucleoSpin Plasmid Quick Pure kit (Macherey-Nagel, Düren, Germany) following the manufacturer protocol. Plasmid DNA concentration was measured using a Nano Drop® Spectrophotometer ND-1000 (Peqlab, Er Weg). The coding DNA for the coat proteins VP2, VP3, VP5 and VP7 of blue tongue virus serovar 8 was synthesised from Entelechon GmbH and lack all Bpil, Bsal and Esp3I restriction sites).
- Level-0 modules were sequenced with primers moclof (SEQ ID NO: 1: 5′-agcgaggaagcggaagagcg) and moclor (SEQ ID NO: 2: 5′-gccacctgacgtctaagaaacc).
- a one step—one pot restriction/ligation was setup with approximately 30 fmol ( ⁇ 100 ng for a 5 kb plasmid) of each fragment (PCR product or plasmid), Promega ligation buffer, 10 U of the respective restriction enzyme (Bsal, Bpil, or Esp3I), 10 U high concentrated T4 DNA ligase (Promega), in a 20 ⁇ l volume.
- the reaction was incubated for 5 hours at 37° C., 5 min 50° C. and 5 min 80° C.
- the mix was added to 100 ⁇ l chemical competent DH10b cells, incubated for 30 min on ice and transformed by heat shock. Two clones with the expected color were analysed by restriction analysis and optionally by sequencing.
- a DNA fragment coding for canthaxanthin biosynthesis was made by PCR amplification of 4 genes from Pantoea ananatis that are necessary for biosynthesis of p-carotene (genes crtE, crtY, crtI and crtB, Ref) and of one gene from Agrobacterium aurantiacum (crtW) necessary to convert ⁇ -carotene to canthaxanthin (ref).
- the gene crtW gene is used in addition to the 4 pantoea genes because the orange/red color of canthaxanthin is easier to see on agar plates than the yellow color of ⁇ -carotene.
- the Pantoea ananatis strain was obtained from the DSMZ (cat DSM 30080), and a fragment containing the crtW gene was synthesised by Mr. Gene GmbH.
- An artificial operon containing the genes crtE-W-Y-1-B under control of the P. ananatis native promoter was made by ligation of three fragments derived from PCR: fragment 1 containing the promoter and crtE gene was amplified from P.
- fragment 2 containing the crtW gene was amplified from a synthetic DNA fragment (sequence identical to the native sequence) with primers 5′-ttt ggtctc a atgagcgcacatgccctgcc (SEQ ID NO: 5) and 5′-ttt ggtctc a tcactcatgcggtgtccccttggt (SEQ ID NO: 6), and fragment 3 containing the genes crtY-1-B was amplified from Pantoea DNA using primers 5′-ttt ggtctc a gtgacttaagt
- Example 1 Generation of the Basic Parts: The Level 0 Modules
- a generalized eukaryotic transcriptional unit as the basis for our modular cloning system (MoClo). This unit was subdivided into five basic modules which cover the most important features of any transcriptional unit: promoter (P), 5′UTR (5U), signal peptide (SP), open reading frame (ORF), and terminator (T, which also includes 3′ untranslated sequences) ( FIG. 3 ). For each module, two unique flanking recombination sites were defined. We use here the term ‘recombination sites’ by functional analogy to the recombination sites used in recombination systems such as the phage P1 Cre-IoxP recombination system.
- ‘recombination sites’ can be any nucleotide sequence of choice of at least 3 base pairs in length, and will serve as the sequence where restriction enzyme digestion and ligation will take place; no recombinase will in fact be used. To guarantee efficient cloning, these sites were chosen to be non-palindromic and to share a maximum of two identical consecutive nucleotides.
- the recombination site between the promoter fragment and the signal peptide reads AATG and encodes a start codon preceded by an adenosine, which is the most common nucleotide in dicotyledonous plants at this position.
- the recombination site between the signal peptide and the rest of the protein (for secreted proteins) was chosen to be AGGT, with GGT coding for a glycine; this sequence was chosen as the last aminoacid of the signal peptide can usually be a glycine, and this allows producing secreted proteins with native sequence.
- the four remaining recombination site sequences chosen (GGAG, TACT, GCTT and CGCT) were selected without any special requirement other than being unique and non-palindromic, since they are all present in non-coding DNA.
- the designated DNA fragments are then amplified by PCR with primers designed to attach the specific recombination site and the recognition site sequence for the type IIS restriction enzyme Bpil ( FIGS. 5 A and 5 B ).
- a set of level 0 destination vectors was constructed (one vector for each type of level 0 module), each containing two Bpil restriction sites compatible with the amplified PCR fragments. Beside the destination plasmids for the five standard elements (pL0-P, pL0-5, pL0-S, pL0-O and pL0-T) further variants were included, which allow an easy adaptation of the MoClo system to a wide variety of projects ( FIG. 18 ).
- cytosolic proteins for example does not require a signal peptide, and these can be cloned into pL0-SO, spanning the SP and ORF position.
- All level 0 destination plasmids are based on a pUC19 backbone, confer spectinomycin resistance and encode a lacZ ⁇ fragment for blue/white selection.
- On both sites of the lacZ ⁇ fragment two different type IS recognition sequences—for the enzymes Bsal and Bpil—are positioned in inverse orientation relative to each other ( FIG. 5 A , B). Both recognition sequences are designed to create the identical four nucleotide recombination site upon digestion (for example, sequence GGAG on the left side of the destination vector for cloning of promoter modules).
- This design allows cloning of the PCR product(s) for the DNA fragment of interest efficiently via Bpil—removing the Bpil recognition sites and lacZ ⁇ in the process—but provides the possibility to release the cloned fragment with Bsal using the identical recombination sites it was cloned in.
- the level 0 modules should not contain any of the type IIS restriction sites used in the MoClo system within the sequence of the fragments of interest. Beside the already mentioned Bsal and Bpil, a third type IIs enzyme, Esp3I, is used in the process of assembly of higher order constructs (see below). Removal of these sites can be easily done at the time of cloning of level 0 modules by using primers overlapping the internal Bpil, Bsal or Esp3I sites, but containing a single silent nucleotide mismatch in the recognition site. An example for the removal of a single Bsal site is given in FIG. 18 C . The primers pr2 and pr3 introduce a guanine to cytosine exchange and destroy the Bsal recognition site.
- the level 0 module is then assembled from the PCR-fragments (n+1 PCR fragments when n sites need to be removed) in a one-step one-tube reaction using Bpil into the appropriate level 0 destination plasmid.
- the purification of the PCR reaction prior cloning is preferable, as primer dimers can be competitors of the PCR product in the cloning procedure.
- the 35S forward primer comprises: 5′-ttt GAAGACAAGGAG (SEQ ID NO: 9:) followed by bases specific for the 35S promoter
- 35S reverse comprises: 5′-ttt GAAGACAAAGTA (SEQ ID NO: 10:) followed by bases specific for the 35S promoter.
- the reaction was incubated for 5 hours at 37° C., and 10 min 80° C.
- the mix was added to 100 ⁇ l chemical competent DH10b cells, incubated for 30 min on ice and transformed by heat shock. After plating on LB agar plates containing spectinomycin (100 ⁇ g/ml) and growing over night at 37° C., two white clones were analyzed by restriction analysis and by sequencing.
- hygroscopicus 6 Phosphinothricin acetyltransferase plCH44022 P19 Tomato bushy stunt virus silencing inhibitor 7 plCH45502 BTV (blue tongue virus) VP2
- This work plCH45512 BTV blue tongue virus
- VP7 This work plCH48348 PVX CP
- This work plCH48367 PVX 25K MP This work plCH49488 IgG 1 Light chain with native signal peptide
- T Terminator
- Maize Spm transposable element has an enhancer-insensitive promoter.
- the next level of cloning consists of assembling several level 0 modules into a complete transcriptional unit in a level 1 reaction. Since assembly is performed by Golden Gate cloning using the enzyme Bsal, no Bsal restriction site is left in the resulting level 1 construct. Therefore, to be able to later subclone the assembled transcriptional unit into higher level constructs, two restriction sites of a second type IIS restriction enzyme also have to be present flanking the assembled fragment. We therefore created level 1 destination vectors containing two Bpil restriction sites flanking the lacZ ⁇ fragment, in addition to the two Bsal restriction sites needed for cloning of the transcription unit (pL1F-1 to pL1F-7, FIG. 4 ).
- level 0 destination vectors If, as for level 0 destination vectors, the cleavage sites of the two different type IIS enzymes (Bsal and Bpil) overlapped, all level 1 transcriptional unit fragments would be flanked by the same GGAG and CGCT recombination sites (which originate from the promoter and terminator modules, respectively). The presence of identical overhangs flanking each transcriptional unit would prohibit the creation of higher level multigene constructs with a defined linear order. Consequently a series of seven level 1 destination vectors was designed in which the Bpil cleavage sites generate two recombination sites with new specificities for each vector (for example sites TGCC and GCAA for level 1 vector position 1, FIGS. 4 and 5 B ).
- such a set of 7 level 1 destination vectors will allow an unlimited number (regarding the design, not the physical size of DNA) of transcriptional units to be cloned in a final nucleic acid construct of interest, although by incremental steps of up to 6 constructs at a time (see further description below for such assembly).
- level 1 constructs Since our transient plant expression system is based on Agrobacterium tumefaciens , all plasmids have a broad host range RK2 origin of DNA-replication and left border (LB) and right border (RB) T-DNA sequences to allow T-DNA transfer into the plant cell. These two features allow testing the functionality of level 1 constructs by plant infiltration. It is also possible to make similar vectors for allowing expression in other eukaryotic hosts such as yeast, insect or mammalian cells or in prokaryotes.
- the level 1 destination vectors encode an ampicillin resistance gene and a lacZ ⁇ fragment flanked by Bpil and Bsal sites.
- level 0 modules were assembled into level 1 transcriptional units (level 1 constructs).
- 11 artificial transcriptional units were designed (promoters and terminators were randomly assigned to ORFs without consideration for level of expression since all constructs in this study were made purely as an exercise to demonstrate the ability of the MoClo system for cloning of multigene constructs), and were (again randomly) assigned to one of the seven level 1 positions.
- the level 0 modules were combined with the respective level 1 destination vectors, T4-DNA ligase and the restriction enzyme Bsal in a one-tube one-step reaction ( FIG. 20 ).
- level 0 and level 1 destination plasmids in combination with the blue/white selection represent a very convenient way to screen for correctly assembled level 1 constructs.
- the reactions were spread on ampicillin and X-Gal containing plates and the numbers of white and blue clones were counted.
- the number of white, and therefore correct, colonies varied from approximately 16000 to 180000, whereas only a minor fraction ( ⁇ 1%) of blue colonies was present in two out of eleven reactions ( FIG. 10 ).
- Two white colonies from each reaction were analyzed by an analytical restriction digest with Bpil (which cleaves on both sides of the assembled transcriptional unit). All 22 tested plasmids contained a fragment of the expected size.
- the reaction was incubated for 5 hours at 37° C., 5 min 50° C. and 5 min 80° C.
- the mix was added to 100 ⁇ l chemical competent DH10b cells, incubated for 30 min on ice and transformed by heat shock. After plating on LB agar plates containing ampicillin (100 ⁇ g/ml) and growing over night at 37° C., two white clones were analyzed by restriction analysis and optionally by sequencing.
- multigene level 2 constructs are assembled from lower level (here level 1) constructs using a one-pot restriction-ligation. In this case, assembly is performed using the enzyme Bpil.
- Level 2 destination vectors carry a kanamycin resistance gene, in accordance with the principle that a specific selection marker is assigned to each level of cloning, allowing effective counter-selection against entry plasmid backbones.
- Level 2 destination vector backbones do not contain any type IIs restriction sites used in the MClo system, other that the recognition sites used for the cloning of the inserts.
- the visible selectable marker used for level 2 constructs is not a lacZ gene, but an artificial bacterial operon containing 5 genes (see Reference Example 2) that lead to biosynthesis of canthaxanthin, a red (more precisely salmon/orange) colored carotenoid pigment.
- a lacZ gene for blue-white selection would have also worked for this step, but the choice of a new color selectable marker is explained below in the paragraph on level 2i.
- the cantaxanthin operon in level 2 destination vector pL2-1 is flanked by two Bpil sites that create TGCC and GGGA overhangs after digestion ( FIG. 4 ).
- the TGCC overhang is compatible with the level 1 construct for position 1, while the GGGA represents an overhang which is unique to level 2 destination plasmids.
- the last overhang of the assembled DNA fragment will be different for each of the six possibilities.
- pELE-n a set of seven end-linkers
- End-linkers can consist of any DNA sequence, and serve the purpose of joining two chosen type IIs enzyme cleavage sites.
- sequence of linker pELE-1 is (SEQ ID NO: 11) gaagac aa GCAA gaggatgcacatgtgaccga GGGA tt gtcttc (bold is the Bpi recognition sites, underlined are the cleavage sites, and in between is the joining linker sequence).
- the end linkers are cloned in a pUC19 based plasmid (but could also be cloned in other backbones).
- level 1 constructs designed for a defined position cannot be reused in a different context.
- a level 1 construct made for subcloning at position three cannot be used without two other constructs for position 1 and 2.
- Placing the same transcriptional unit at position 1 could be done by recloning the same level 0 modules in a level 1 destination vector specific for position 1.
- a possibility to reduce the need for extensive recloning of the same construct for different positions is given by the periodical design of the level 1 overhangs.
- the relative position of, for example a level 1 position 3 construct can easily be shifted to the relative first position, when the left overhang from the level 2 destination vector would read ACTA instead of TGCC.
- FIG. 4 A set of seven level 2 destination vectors created for this purpose is shown in FIG. 4 (pL2-1 to pL2-7). Together with the seven end-linker plasmids, these 14 plasmids allow to realize every overhang combination in a level 2 destination vector. The identical flexibility only based on level 2 destination vectors without the end-linkers would require 36 different plasmids.
- restriction-ligation reactions were set up to clone from 2 up to 6 transcriptional units in a single step.
- the restriction-ligation reactions each contains from 2 to 6 level 1 transcriptional unit constructs (pICH50711, plCH50721, plCH49722, plCH49733, plCH50731, pICH50741 FIG. 10 ), one appropriate end-linker (pELE-2 to pELE-6), and the destination vector pL2-1.
- the reaction was incubated for 5 hours at 37° C. and 10 min 80° C.
- the mix was added to 100 ⁇ l chemical competent DH10b cells, incubated for 30 min on ice and transformed by heat shock.
- the transformation was plated on LB agar plates containing kanamycin (100 ⁇ g/ml) and the plate incubated over night at 37° C.
- the assembly of up to six transcription units to produce a 24 kb construct can be done in a one-step and one-tube level 2 reaction.
- the final level 2 constructs are in a “closed” status and no additional genes can be inserted since no type IIS restriction sites are left in the construct.
- An entry option can be provided when modified end-linkers containing additional type IIS restriction sites (pELB-n) are used in the assembly of the level 2 constructs ( FIG. 4 ).
- This type of construct was named level 2i since it is based on level 2 backbones but represents also an intermediate step to extend the number of genes in the construct beyond six.
- This intermediate construct can be used both as a destination vector for a final construct (level 2-2) or once again to construct the next level destination vector (level 2i-2), in case more than 6 more genes still need to be added to the construct.
- the same visual selectable marker such as LacZ ⁇ for blue/white selection
- a second color selectable marker that we developed based on carotenoid biosynthetic genes. With such marker, every cloning step can be performed using color selection, for example from red to white for level 2, red to blue for level 2i, blue to white for level 2-2, blue to red for level 2i-2, etc. . . .
- pELB-1 contains the sequence (SEQ ID NO: 12) gaagac aa tgcc t gagacc (bold Bpil recognition site, underlined cleavage site of Bpil and Bsal, italics Bsal recognition site) followed by a puc19 fragment containing the LacZ alpha fragment (gcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccc caggctttacactttatgcttccggctcgtatgttgtggaattgtgagcggataacaatttcacacaggaaacagctatgaccatgatta cgccaagcttgcatgcctgcagg
- red/blue selection is performed rather than red/white selection.
- red and blue colonies a few colonies had a dark green color. These contain incorrect plasmids that have both the canthaxanthin operon and a lacZ gene.
- the number of colonies containing correctly assembled plasmids (blue colonies), and the ratio of red to blue colonies for plCH51212 and plCH51226 were comparable with the level 2 construct made from a similar number of entry clones, plCH51191 and plCH51201.
- Example 5 The Level 2-2
- the level 2i-1 construct plCH51212 was chosen as a destination vector.
- This construct contains, beside five level 1 modules, a lacZ ⁇ end-linker providing two Bsal restriction sites.
- Bsal allows reopening the level 2i backbone and provides defined overhangs which are compatible with the level 1 modules released by Bpil. Since two type IIS restriction enzymes have to be used at the same time and the target plasmid has already a size of 22 kb, we again tested the efficiency of the Golden Gate cloning for the introduction of either one to up to six level 1 modules simultaneously.
- the mix was incubated in a thermocycler with the following parameters: incubation for 2 minutes at 37° C., 5 minutes at 16° C., both steps repeated 45 times, followed by incubation for 5 minutes at 80° C. and 10 minutes are 80° C.
- the reaction mix was transformed in E. coli chemically competent cells, and an aliquot of the transformation plated on a LB plate containing Kanamycing and X-gal. These parameters greatly increased cloning efficiency since 2685 white colonies were obtained (for the whole transformation) and no blue colony ( FIG. 10 ). Plasmid DNA from 6 randomly picked colonies was analyzed. All were found to contain the expected correct construct.
- the resulting level 0 constructs are shown in FIG. 22 .
- the genes for dxs and ispA were amplified and cloned from both E. coli strain K12 and from Agrobacterium rhizogenes strain K84.
- the genes for idi and AppY were cloned only from E. coli strain K12.
- the LacZ promoter present in pUC19 and the Pantoea ananatis promoter from the Zeaxanthin biosynthesis operon were also cloned as level 0 modules ( FIG. 22 ).
- Level 1 destination vectors for cloning of bacterial coding sequences are different from destination vectors for cloning eukaryotic transcription units, since they are designed for cloning of individual coding sequences rather than complete transcription units. Moreover, they also provide a bacterial ribosome binding site (RBS) to the cloned coding sequence.
- RBS bacterial ribosome binding site
- vector pICH4862L was made by PCR amplification of the LacZ fragment from pUC19 using primers prok1-1deg (SEQ ID NO: 15) (tttcgtctcattcagaagacat TGCC nv agga dnnnnn AATG ggagaccttatgaccatgattacgccaagc, in bold is the core sequence of the RBS and flanking it is degenerate sequence, the underlined sequence is the cleavage sites of the Bpil or Bsal restriction sites) and prok1-2 (SEQ ID NO: 16) (tttcgtctcacttagaagacaa TTGC AAGC tgagaccttatgcggcatcagagcagattgt).
- v stands for a, c or g
- d stands for a, t or g
- n stands for a, t, g or c.
- end-linkers and level 2 destination vectors were made that do not already contain carotenoid genes.
- End linkers in particular were made in which the sequence of the linker consists of the Lac Z terminator sequences from plasmid pUC19. The two sets of end linkers are shown in FIG. 23 B .
- a set of level 1 constructs was constructed from level 0 modules. Promoter modules were cloned in vector pL1F-1 ( FIG. 12 ) and the coding sequences in vectors pICH5025L to pICH2030L for the different positions expected in the final construct ( FIG. 24 ). Cloning was performed as a one-tube cloning reaction using the selected level 0 module and the chosen destination vector (or vector library) using the enzyme Bsal as described for cloning of level 1 eukaryotic transcription units. The 6 genes cloned as level 0 modules and known to increase lycopene production in E. coli were all cloned for positions 5, 6 and 7.
- the next step consists of adding two or three genes to the previous constructs to try to increase the amount of lycopene produced. Since we have 6 genes available cloned at three different positions, there are too many possible combinations to test them all individually. Instead, libraries can be made in which any of the 6 genes will be cloned randomly at any of the two or three positions (position 5 and 6, or 5, 6 and 7, FIG. 26 A and B). Four different libraries were made by setting up 4 reactions by adding into one tube all the constructs depicted in FIG.
Abstract
System for producing a nucleic acid construct of interest, said system comprising:
-
- a set of n entry DNAs numbered 1 to n, n being an integer of at least 2,
- each of said n entry DNAs comprising in this order:
- (i) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof;
- (ii) a sequence portion linking the cleavage site of said recognition site of item (i) with the cleavage site of the recognition site of the following item (iii), and
- (iii) a cleavage site of a further type IIs restriction endonuclease recognition site followed by the recognition site of said cleavage site;
- the cleavage sites of the type IIs restriction endonuclease recognition sites of item (iii) of entry DNAs 1 to n−1 are complementary to the cleavage sites of the type IIs restriction endonuclease recognition sites of item (i) of entry DNAs 2 to n, respectively;
- the cleavage site of the type IIs restriction endonuclease recognition site of item (iii) of entry DNA n is complementary to the cleavage site of the type IIs restriction endonuclease recognition site of item (i) of entry DNA 1 for allowing annealing of complementary single-stranded overhangs formed by restriction at recognition site (i) of entry DNA 1 and at recognition site (iii) of entry DNA n;
- said system further comprising a destination vector comprising in this order:
- (I) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof;
- (II) a vector backbone preferably comprising a selectable marker gene, said vector backbone linking the cleavage sites of said recognition sites of items (I) and the following item (III);
- (III) a further cleavage site of a type IIs restriction endonuclease recognition site followed by the recognition site of said cleavage site, and
- (IV) optionally, an insert between the recognition sites of item (III) and item (I);
- said cleavage sites of items (I) and (III) being different and non-complementary, said recognition sites of items (I) and (III) being preferably recognitions sites of the same endonuclease.
Description
- This application is a Division of U.S. patent application Ser. No. 15/634,803, filed Jun. 27, 2017, now U.S. Pat. No. 11,597,937, issued Mar. 7, 2023, which is a Division of U.S. patent application Ser. No. 13/703,123, now abandoned, which has a 35 U.S.C. § 371(c) date of May 14, 2013 and which is the U.S. National Stage of International Application PCT/EP2011/002843, filed Jun. 9, 2011, which designates the U.S. and was published by the International Bureau in English on Dec. 15, 2011, and which claims the benefit of European Patent Application Nos. 10006090.4, filed Jun. 11, 2010 and 10006955.8, filed Jul. 6, 2010, all of which are hereby incorporated by reference in their entirety.
- The contents of the electronic sequence listing (070313-0043SEQLST-2.xml; Size: 90,359 bytes; and Date of Creation: May 7, 2023) is herein incorporated by reference in its entirety.
- The present invention relates to a cloning system for producing a nucleic acid construct of interest using type IIs restriction endonucleases. The invention also provides a method of producing a nucleic acid construct of interest from at least three nucleic acid fragment constructs.
- Synthetic biology promises to revolutionize biotechnology through engineering of life forms with novel phenotypes not normally found in nature. Examples of applications include the microbial production of chemical precursors, novel antibiotics and, induction and fine-tuning of pluripotent stem cells and the engineering of a minimal free living cell. Such applications will require the ability to physically assemble complex DNA molecules containing large numbers of natural or artificial genes in a wide variety of arrangements.
- Although much progress has been made in the past few years, construction of recombinant DNA molecules is still a slow and labour-intensive process. Recombinant DNA molecules have traditionally been constructed using type II restriction enzymes and ligase. Although versatile, such approach is slow and tedious and only allows creation of constructs of relatively small size and containing only few genes. In particular, this approach is limited by the fact that designing cloning strategies becomes extremely difficult for large constructs, since all restriction enzymes available will cut many times in such constructs. In the past few years, a number of different approaches have been developed to overcome these limitations. These include recombinase-based cloning, ligation-independent cloning, cloning based of homologous recombination and PCR-based assembly. Recombinase-based cloning eliminates the problems coming from the multiple occurrence of restriction sites in large constructs but is limited by the fact that recombination sites are left in the final construct, preventing the seamless assembly of protein coding sequences. Moreover, recombinase-based cloning is limited by the fact that, so far, only 4 fragments can be assembled in one construct simultaneously. Ligation-independent cloning is also independent of restriction sites since restriction enzymes are not used, but is limited by the fact that it requires PCR and therefore requires sequencing of constructs made with this approach. Methods based on homologous recombination are valuable and allow to assemble extremely large DNA fragments of up to the size of entire bacterial genomes (Gibson et al., Science. 2010 Jul. 2; 329(5987):52-6), but are not well suited for combinatorial assembly of multiple independent basic genetic elements since they, but are not well suited for combinatorial assembly of multiple independent basic genetic elements, since they require a minimum amount of sequence in common between modules.
- Recently, cloning methods based on type IIs restriction enzymes have been developed (WO 2008/095927). Engler et al. PLoS ONE 4 (2009) e5553) describe a protocol to assemble in one step and one tube at least nine separate DNA fragments together into an acceptor vector using type IIs restriction enzymes by simply subjecting a mix of 10 undigested input plasmids to a restriction-ligation reaction and transforming the resulting mix into competent cells. This protocol was named “Golden Gate” cloning.
- Although methods that allow assembly of multiple DNA fragments in one step (such as Golden Gate cloning) are helpful for construction of recombinant DNA molecules, they still do not solve the problem that construct-specific cloning strategies need to be defined at each step of cloning. What is needed for synthetic biology are methods that eliminate the need for construct-specific cloning strategies. A step toward standardization of cloning strategies has been proposed with the BioBrick system (Knight TF, 2003, Idempotent Vector Design for Standard Assembly of BioBricks. MIT Synthetic Biology Working Group Technical Report). This system is based on hierarchical assembly of basic genetic parts, two parts at a time. Assembly of two basic parts using restriction enzymes and ligase results in a composite part that has the same structure as the basic part in terms of flanking restriction sites (the parts are therefore called idempotent). Since the structure of the composite parts is the same as the structure of the basic parts, the same assembly procedure can be repeated on composite parts to get increasingly complex constructs. However, because this strategy is based on idempotency of the DNA fragments, the BioBrick system is necessarily limited to assembly of two fragments at a time (addition of a part to a plasmid that already contains another part or composite part). This is a serious limitation since synthetic biology will require assembling very large number of DNA fragments, which will be very costly and impractical if assembly is performed two fragments at a time. Moreover, the ability to assemble a large number of fragments in a single step is useful for making combinatorial libraries, for example for making a construct containing all the genes encoding for a biochemical pathway. Such library can be made by assembling in one step all the genes necessary for a biochemical pathway, with multiple variants for each of the genes of the pathway.
- Departing from the prior art, it is an object of this invention to provide a system that overcomes the limitations of the prior art. Notably, it is an object to provide a system and method that is not limited to the combination of two fragments per reaction. It is another object to provide a system of DNA molecules that allows assembly of a large or even unlimited number of DNA fragments using a fixed set of cloning vectors. The system should allow assembly of multiple DNA fragments at each cloning step, and should allow as many successive steps of cloning as necessary to be performed, continually reusing the same set of vectors. Repetition of these cloning cycles should allow assembly of increasingly larger numbers of DNA fragments in any desired order, resulting in increasingly larger constructs.
- Accordingly, the present invention provides:
- (1) System for producing a nucleic acid construct of interest, said system comprising: a set of n entry DNAs numbered 1 to n, n being an integer of at least 2, each of said n entry DNAs comprising in this order:
- (i) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof;
- (ii) a sequence portion linking the cleavage site of said recognition site of item (i) with the cleavage site of the recognition site of the following item (iii), and
- (iii) a cleavage site of a further type IIs restriction endonuclease recognition site followed by the recognition site of said cleavage site;
- the cleavage sites of the type IIs restriction endonuclease recognition sites of item (iii) of
entry DNAs 1 to n−1 are complementary to the cleavage sites of the type IIs restriction endonuclease recognition sites of item (i) ofentry DNAs 2 to n, respectively; - the cleavage site of the type IIs restriction endonuclease recognition site of item (iii) of entry DNA n is complementary to the cleavage site of the type IIs restriction endonuclease recognition site of item (i) of
entry DNA 1 for allowing annealing of complementary single-stranded overhangs formed by restriction at recognition site (i) ofentry DNA 1 and at recognition site (iii) of entry DNA n; - said system further comprising a destination vector comprising in this order:
- (I) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof;
- (II) a vector backbone preferably comprising a selectable marker gene, said vector backbone linking the cleavage sites of said recognition sites of items (I) and the following item (III);
- (III) a further cleavage site of a type IIs restriction endonuclease recognition site followed by the recognition site of said cleavage site, and
- (IV) optionally, an insert between the recognition sites of item (III) and item (I); said cleavage sites of items (I) and (III) being different and non-complementary, said recognition sites of items (I) and (III) being preferably recognitions sites of the same endonuclease.
- the cleavage sites of the type IIs restriction endonuclease recognition sites of item (iii) of
- (2) The system according to (1), wherein a type IIs restriction endonuclease recognising the recognition site (I) of said destination vector can produce a single-stranded overhang from the cleavage site of item (I) that is complementary to the single-stranded overhang producible by the type IIs restriction endonuclease recognising the recognition site (i) of entry DNA numbered 1 for enabling annealing of said complementary single-stranded overhangs and ligation of said destination vector with the DNA segment of item (ii) from entry DNA numbered 1.
- (3) The system according to (1) or (2), said system comprising a nucleic acid linker comprising in the following order:
- (a) a type IIs restriction endonuclease recognition site;
- (b) a cleavage site of said recognition site of item (a);
- (c) a cleavage site of a further type IIs restriction endonuclease recognition site of the following item (d);
- (d) a type IIs restriction endonuclease recognition site defining the cleavage site of item (c) and being a recognition site of a type IIs restriction endonuclease different from that of item (a);
- (e) a type IIs restriction endonuclease recognition site, preferably of the same endonuclease as the recognition site of item (d);
- (f) a cleavage site of said recognition site of item (e);
- (g) a cleavage site of a further type IIs restriction endonuclease recognition site of the following item (h);
- (h) a type IIs restriction endonuclease recognition site defining the cleavage site of item (g), preferably of the same endonuclease as the recognition site of item (a);
- said linker being capable of linking a cleavage site of item (iii) of one of a entry DNA numbered 1 to n, preferably of
number 1 to n−1, to a cleavage site of item (III) of said destination vector.
- said linker being capable of linking a cleavage site of item (iii) of one of a entry DNA numbered 1 to n, preferably of
- (4) The system according to (3), wherein the cleavage site of item (iii) of one of said entry DNAs is complementary to the cleavage site of item (b) of said linker, and
- the cleavage site of item (g) of said linker is complementary to the cleavage site of item (III) of said destination vector.
- (5) The system according to any one of (1) to (4), comprising from 1 to n multiple destination vectors numbered 1 to n, each of said 1 to n destination vectors having segments (I) to (III) as defined in (1) and optionally a segment (IV) as defined in (1),
- wherein the cleavage sites of item (III) of all destination vectors are identical and all cleavage sites of item (I) of all destination vectors are unique among the cleavage sites of item (III).
- (6) The system according to (3), comprising a set of n nucleic acid linkers numbered 1 to n, each n-th linker comprising items (a) to (h) as defined in (3),
- the cleavage site of item (iii) of each n-th entry DNA is complementary to the cleavage site of item (b) of the n-th linker;
- the cleavage site of item (g) of each n-th linker being complementary to the cleavage site of item (III) of the n-th destination vector;
- whereby each n-th linker being capable of linking a cleavage site of item (iii) of the n-th entry DNA to a cleavage site of item (III) of each n-th destination vector.
- (7) The system according to any one of (3), (4) or (6), said linker(s) comprising a marker gene in between items (d) and (e) for enabling selection of cell clones for the presence or absence of said marker gene.
- (8) The system according to any one of (1) to (7), wherein each sequence portion of item (ii) of each
entry DNA 1 to n comprises a further pair of two type IIs restriction endonuclease recognition sites oriented such that said further pair of recognition sites can be removed from said entry DNAs by treatment with type IIs restriction endonuclease(s) recognising said further pair of recognition sites, said further pair of recognition sites may flank a marker gene for enabling selection of cell clones for the presence or absence of said marker gene;- wherein said further pair of two type IIs restriction endonuclease recognition sites are recognition sites of endonucleases different from the recognition sites of item (i) and item (iii) of (1).
- (9) The system according to any one of (1) to (8), wherein the cleavage sites of the recognition sites of item (i) are unique among the item (i) recognition sites of the set of n entry DNAs, and the cleavage sites of the recognition sites of item (iii) are unique among the item (iii) recognition sites within the set of n entry DNAs.
- (10) The system according to any one of (1) to (9), each of said n entry DNAs further comprising in the order defined in (1):
- (iv) a vector backbone comprising a selectable marker gene.
- (11) The system according to any one of (1) to (10), wherein the type IIs restriction endonuclease recognition sites of items (i) and (iii) are recognition sites of the same type IIs restriction endonuclease.
- (12) The system according to any one of (1) to (11), wherein the cleavage sites of the recognition sites of item (III) of all destination vectors are identical, and the cleavage sites of the recognition sites of item (I) of all destination vectors are non-identical.
- (13) The system according to (3), wherein the cleavage sites of items (b) and (c) have the same sequence and preferably overlap, and wherein the cleavage sites of items (f) and (g) have the same sequence and preferably overlap.
- (14) A method of producing a nucleic acid construct of interest from at least m nucleic acid fragment constructs numbered 1 to m, each nucleic acid construct of interest comprising a sequence segment numbered 1 to m in the order of occurrence in the nucleic acid construct of interest, m being an integer of at least 3;
- said method comprising the following steps (A) to (C):
- (A) providing said m nucleic acid fragment constructs, each of said m nucleic acid fragment constructs comprising in this order:
- (i′) a type IIs restriction endonuclease recognition site of the upstream cleavage site of item (ii′);
- (ii′) a sequence segment of said nucleic acid construct of interest, said sequence segment comprising, in this order, an upstream cleavage site of the recognition site of item (i′), a core portion of the sequence segment, and a downstream cleavage site of the recognition site of the following item (iii′), and
- (iii′) the type IIs restriction endonuclease recognition site of said downstream cleavage site of item (ii′);
- the downstream cleavage sites of nucleic acid fragment constructs 1 to m−1 are complementary to the upstream cleavage sites of nucleic acid fragment constructs 2 to m, respectively,
- the downstream cleavage site of a nucleic acid fragment construct u, wherein u is an integer that is <m and at least 2 is complementary to the upstream cleavage site of the type IIs restriction endonuclease recognition site of item (ii′) of
nucleic acid fragment 1;
- (B) combining nucleic acid fragment constructs 1 to s, wherein s is an integer<u, a destination vector and a linker in the presence of a type IIs restriction endonuclease recognising said type IIs restriction endonuclease recognition sites of items (i′) and (iii′) and items (I) and (III) of the destination vector defined below and in the presence a DNA ligase in reaction medium compatible with activity of said type IIs restriction endonuclease and said ligase for recombining and ligating, in the following order, the sequence segment(s) of item (ii′) of nucleic acid fragment constructs 1 to s and said linker into said destination vector;
- said destination vector comprising in this order:
- (I) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof complementary to the upstream cleavage site of item (ii′) of nucleic
acid fragment construct 1; - (II) a vector backbone comprising a selectable marker gene, said vector backbone linking the cleavage sites of said recognition sites of items (I) and the following item (III);
- (III) the further cleavage site of a type IIs restriction endonuclease recognition site followed by the recognition site of said cleavage site,
- said linker being as defined in item (3), wherein cleavage site (b) of said linker is complementary to the downstream cleavage site of item (ii′) of nucleic acid fragment construct s, and wherein cleavage site (g) of said linker and the cleavage site of item (III) of the destination vector are complementary; and
- (C) treating a mixture comprising the recombination product of step (B) and nucleic acid fragment construct(s) s+1 to m with a type IIs restriction endonuclease recognising said type IIs restriction endonuclease recognition sites of items (i′) and (iii′), a type IIs restriction endonuclease recognising said type IIs restriction endonuclease recognition sites of items (d) and (e) of the linker and a DNA ligase in a reaction medium compatible with activity of said type IIs restriction endonucleases and said ligase for inserting the sequence segments of item (ii′) of nucleic acid fragment constructs s+1 to m and optionally a further linker as defined in item (3) into the cleavage sites provided by items (c) and (f) of the linker used in step (B).
- (15) The method according to (14), wherein the recognition sites of all nucleic acid fragment constructs of items (i′) and (iii′), the recognition sites of items (a) and (h) of the linker and the recognition sites of item (I) and (III) of the destination vector are recognition sites of the same type IIs restriction endonuclease.
- (16) System for producing a nucleic acid construct of interest, said system comprising:
- a set of n entry DNAs numbered 1 to n, n being an integer of at least 2, preferably at least 3,
- each of said n entry DNAs comprising in this order:
- (i) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof;
- (ii) a DNA segment linking the cleavage site of said recognition site of item (i) with the cleavage site of the recognition site of the following item (iii), and
- (iii) a cleavage site of a further type IIs restriction endonuclease recognition site followed by the recognition site of said cleavage site;
- the cleavage sites of the type IIs restriction endonuclease recognition sites of item (iii) of
entry DNAs 1 to n−1 are complementary to the cleavage sites of the type IIs restriction endonuclease recognition sites of item (i) ofentry DNAs 2 to n, respectively; - all cleavages sites of item (i) are typically unique among said n entry DNAs, and all cleavage sites of item (iii) are typically unique among said n entry DNAs;
- said system further comprising a destination vector comprising in this order:
- (I) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof;
- (II) a vector backbone comprising a selectable marker gene, said vector backbone linking the cleavage sites of said recognition sites of items (I) and the following item (III);
- (III) a further cleavage site of a type IIs restriction endonuclease recognition site followed by the recognition site of said cleavage site, and
- (IV) optionally, a linker between the recognition sites of item (III) and item (I); said system further comprising a nucleic acid linker comprising in the following order:
- (a) a type IIs restriction endonuclease recognition site;
- (b) a cleavage site of said recognition site of item (a);
- (c) a cleavage site of a further type IIs restriction endonuclease recognition site of the following item (d);
- (d) a type IIs restriction endonuclease recognition site defining the cleavage site of item (c) and being a recognition site of a type IIs restriction endonuclease different from that of item (a);
- (e) a type IIs restriction endonuclease recognition site, preferably of the same endonuclease as the recognition site of item (d);
- (f) a cleavage site of said recognition site of item (e);
- (g) a cleavage site of a further type IIs restriction endonuclease recognition site of the following item (h);
- (h) a type IIs restriction endonuclease recognition site defining the cleavage site of item (g), preferably of the same endonuclease as the recognition site of item (a); said linker being capable of linking a cleavage site of item (iii) of one of a entry DNA numbered 1 to n, preferably of
number 1 to n−1, to a cleavage site of item (III) of said destination vector.
- (17) The system according to (16), comprising the same number n of said linkers as the system comprises entry DNAs, said linkers being numbered 1 to n,
- wherein all linkers have the same cleavage site (g) that is complementary to the cleavage site of item (III) of said destination vector for linking each linker to the recognition site of item (III) of said destination vector, and
- wherein each of said n linkers has a different cleavage site (b) that is complementary to the cleavage site of item (iii) of one of said n entry DNAs.
- (18) The system according to (17), said system further comprising n different destination vectors, each destination vector being defined by items (I) to (IV) and having the same cleavage site of item (III) that is complementary to the cleavage site (g) of all linkers,
- each of said destination vectors having a different cleavage site if item (I) that is complementary to the cleavage site of item (i) of one of said n entry DNAs.
- (19) System for producing a nucleic acid construct of interest, said system comprising: a set of m nucleic acid fragment constructs numbered 1 to m, m being an integer of at least 2, preferably at least 3,
- each of said m nucleic acid fragment constructs comprising in this order:
-
- (i′) a type IIs restriction endonuclease recognition site of the upstream cleavage site of item (ii′);
- (ii′) a sequence segment of said nucleic acid construct of interest, said sequence segment comprising, in this order, an upstream cleavage site of the recognition site of item (i′), a core portion of the sequence segment, and a downstream cleavage site of the recognition site of the following item (iii′), and
- (iii′) a type IIs restriction endonuclease recognition site of said downstream cleavage site of item (ii′);
- the downstream cleavage sites of nucleic acid fragment constructs 1 to m−1 are complementary to the upstream cleavage sites of nucleic acid fragment constructs 2 to m, respectively;
- said system further comprising a destination vector comprising in this order:
- (I) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof;
- (II) a vector backbone comprising a selectable marker gene, said vector backbone linking the cleavage sites of said recognition sites of items (I) and the following item (III);
- (III) a further cleavage site of a type IIs restriction endonuclease recognition site followed by the recognition site of said cleavage site, and
- (IV) optionally, an insert between the recognition sites of item (III) and item (I);
- said cleavage sites of items (I) and (III) being different and non-complementary, said recognition sites of items (I) and (III) being recognitions sites of the same endonuclease.
- (20) The system of item (19), wherein the downstream cleavage site of a nucleic acid fragment construct u, wherein u is an integer that is <m and at least 2 is complementary to the upstream cleavage site of the type IIs restriction endonuclease recognition site of item (ii′) of
nucleic acid fragment 1. - (21) System for producing a nucleic acid construct of interest, said system comprising:
- a set of n destination vectors (“destination vectors M”), n being an integer of at least 2, preferably at least 3,
- each of said n destination vectors M comprising in the following order:
- (I′) a type IIs restriction endonuclease recognition site defining the cleavage site of item (II′);
- (II′) the cleavage site of said recognition site of item (I′);
- (III′) a cleavage site of said recognition site of the following item (IV′);
- (IV′) a further type IIs restriction endonuclease recognition site defining the cleavage site of item (III′) and being a different recognition site of a type IIs restriction endonuclease from that of item (I′);
- (V′) a vector backbone comprising a selectable marker gene, said vector backbone linking the cleavage sites of said recognition sites of item and (IV′) and the following item (VI′);
- (VI′) a further type IIs restriction endonuclease cleavage site;
- (VII′) a type IIs restriction endonuclease recognition site of the cleavage site of item (VI′), preferably of the same type IIs restriction endonuclease as the recognition site of item (I′) and
- (VIII′) optionally, an insert between the recognition sites of item (VII′) and item (I′); and
- a set of n linkers M, n being as defined above, each linker M comprising in the following order:
- (a′) a type IIs restriction endonuclease recognition site defining the cleavage site of item (b′);
- (b′) the cleavage site of said recognition site of item (a′);
- (c′) a cleavage site of a further type IIs restriction endonuclease recognition site of item (d′), said cleavage site having the same sequence of nucleotides as the cleavage site of item (b′);
- (d′) the type IIs restriction endonuclease recognition site defining the cleavage site of item (c′) and being a different recognition site of a type IIs restriction endonuclease different from that of item (a′);
- (e′) a further cleavage site of a type IIs restriction endonuclease recognition site of the following item (f′);
- (f′) the type IIs restriction endonuclease recognition site defining the cleavage site of item (e′), that is preferably a recognition site of the same endonuclease as the recognition site of item (a′);
- wherein
- the cleavage sites (VI′) of all n destination vectors M are identical;
- the cleavage sites (e′) of all n linkers M are identical;
- the cleavage site of item (VI′) of each destination vector M is complementary to the cleavage site of item (e′) of each linker M for allowing annealing of single-stranded overhangs produced by the type IIs restriction endonuclease recognising recognition sites (VII′) and (f′);
- the cleavage sites of items (II′) and (III′) within each destination vector M have the same sequence of nucleotides and may overlap such that one and the same sequence of nucleotides provides the cleavage site of items (II′) and that of item (III′); and
- the cleavage sites of items (b′) and (c′) within each linker M have the same sequence of nucleotides and may overlap such that one and the same sequence of nucleotides provides the cleavage site of items (b′) and that of item (c′); and
- the cleavage site (II′) of each destination vector M is unique among the cleavage sites (II′) of the set of n destination vectors M such that there are n different cleavage sites (II′), wherein for each of said n different cleavage sites (II′), there is a linker M having a cleavage site (b′) of identical nucleotide sequence among the set of n linkers M.
- (22) The system according to item (21), comprising:
- a set of z entry DNAs numbered 1 to z, z being an integer of at least 2, preferably an integer of at least 3,
- each of said z entry DNAs comprising in this order:
- (i) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof;
- (ii) a sequence portion linking the cleavage site of said recognition site of item (i) with the cleavage site of the recognition site of the following item (iii), and
- (iii) a cleavage site of a further type IIs restriction endonuclease recognition site followed by the recognition site of said cleavage site;
- wherein
- the cleavage site of item (i) of each entry DNA is complementary to the cleavage site of item (II′) of one of the n destination vectors M for allowing annealing of single-stranded overhangs produced by the type IIs restriction endonuclease recognising recognition sites of items (i) and (I′),
- the recognition sites of item (i) of all z entry DNAs are preferably recognition sites of the same type IIs restriction endonuclease as the recognition sites of item (I′) and (VII′);
- the cleavage site of item (iii) of each entry DNA is complementary to the cleavage sites of item (b′) of one of the n linkers M for allowing annealing of single-stranded overhangs produced by the type IIs restriction endonuclease recognising recognition sites of items (iii) and (a′),
- the recognition sites of item (i) are recognition sites of the same type IIs restriction endonuclease as the recognition sites of item (a′) and (f′); and
- the recognition sites of items (i) and (iii) of all z entry DNAs are recognition sites of the same type IIs restriction endonuclease.
- (23) The system according to any one of items (21) or (22), further comprising a set of n destination vectors (“destination vectors P”), wherein n is as defined in item (21),
- each of said n destination vectors P comprising in the following order:
- (I″) a type IIs restriction endonuclease recognition site defining the cleavage site of item (II″);
- (II″) the cleavage site of said recognition site of item (I″);
- (III″) a cleavage site of said recognition site of the following item (IV″);
- (IV″) a further type IIs restriction endonuclease recognition site defining the cleavage site of item (III″) and being a different recognition site of a type IIs restriction endonuclease from that of item (I″);
- (V″) a vector backbone comprising a selectable marker gene, said vector backbone linking the cleavage sites of said recognition sites of item and
- (IV″) and the following item (VI″);
- (VI″) a further type IIs restriction endonuclease cleavage site;
- (VII″) a type IIs restriction endonuclease recognition site of the cleavage site of item (VI″), preferably of the same endonuclease as the recognition site of item (I″) and
- (VIII″) optionally, an insert between the recognition sites of item (VII″) and item (I″); and
- a set of n linkers P, each linker P comprising in the following order:
- (a″) a type IIs restriction endonuclease recognition site defining the cleavage site of item (b″);
- (b″) the cleavage site of said recognition site of item (a″);
- (c″) a cleavage site of a further type IIs restriction endonuclease recognition site of item (d″), said cleavage site having the same nucleotide sequence as the cleavage site of item (b″);
- (d″) the type IIs restriction endonuclease recognition site defining the cleavage site of item (c″) and being a different recognition site of a type IIs restriction endonuclease from that of item (a″);
- (e″) a further cleavage site of a type IIs restriction endonuclease recognition site of the following item (f″);
- (f″) the type IIs restriction endonuclease recognition site defining the cleavage site of item (e″), that is preferably a recognition site of the same endonuclease as the recognition site of item (a″);
- wherein
- the cleavage sites (VI″) of all n destination vectors P are identical;
- the cleavage sites (e″) of all n linkers P are identical;
- the cleavage site of item (VI″) of each destination vector P is complementary to the cleavage site of item (e″) of each linker P for allowing annealing of single-stranded overhangs produced by the type IIs restriction endonuclease recognising recognition sites (VII″) and (f″);
- the cleavage sites of items (II″) and (III″) within each destination vector P have the same sequence of nucleotides and may overlap such that one and the same sequence of nucleotides provides the cleavage site of item (II″) and the cleavage site of item (III″);
- the cleavage sites of items (b″) and (c″) within each linker P have the same sequence of nucleotides and may overlap such that one and the same sequence of nucleotides provides the cleavage site of items (b″) and the cleavage site of item (c″); and
- for each of said n different cleavage sites (b′) or (II′), there is a destination vector P having a cleavage site (II″) of identical nucleotide sequence as the nucleotide sequence of cleavage sites (b′) or (II′); and
- for each of said n different cleavage sites (b′) or (II′), there is a linker P having a cleavage site (b″) of identical nucleotide sequence as the nucleotide sequence of cleavage sites (b′) or (II′).
-
FIG. 1A shows elements of a system that allows re-use of the entry DNAs of the invention (level 1 destination vectors) for different inserted sequence segments. -
FIG. 1B provides an example for cloning of three nucleic acid fragment constructs na1, na2, and na3 into a destination vector using end-linker EL3. -
FIG. 1C shows that further nucleic acid fragment constructs na4 to na7 and a further construct na1 that is based on the sameentry vector number 1 as na1 ligated inFIG. 1B can be cloned into the product vector obtained inFIG. 1B . -
FIG. 2 explains how type IIs restriction sites are depicted in the following figures. -
FIG. 3 shows a general embodiment wherein a eukaryotic multi-gene construct is produced as the nucleic acid fragment of interest. -
FIG. 4 depicts a set oflevel 1 destination vectors,level 2 destination vectors, and end-linkers. -
FIG. 5A illustrates the structure of the modules depicted inFIG. 3 in greater detail and shows how these modules can be assembled. -
FIG. 5B show nucleotide sequences at the junction of various modules for each step of cloning ofFIG. 5A . “Gene 1” is equivalent to “TU 1”. Sequence stretches (SS) designated SS1 to SS26 are SEQ ID NO: 31 to SEQ ID NO: 56, respectively. -
FIG. 6 is an overview of alternative cloning strategies. -
FIG. 7 shows the structure oflevel 0 modules. -
FIG. 8A explains the assembly of transcriptional units fromlevel 0 modules intolevel 1 destination vectors (entry DNAs). -
FIG. 8B illustrates an example for the assembly oflevel 0 modules using a different set oflevel 1 destination vectors. -
FIG. 9A : assembly of 5 transcription units into amultigene level 2 construct in alevel 2 reaction. -
FIG. 9B : Assembly is performed with an end linker containing internal divergent Bsal restriction sites for allowing afurther level 2 reaction in the resultinglevel 2 construct. -
FIG. 9C shows examples of further rounds of cloning. -
FIG. 10 shows the efficiency of cloning for different levels of assembly. -
FIG. 11 illustrates the general strategy used for cloning of prokaryotic operons. -
FIG. 12 Comparison oflevel 1 destination vectors for eukaryotes (“MoClo” and prokaryotes (“MoClo Pro”). -
FIG. 13 illustrates the cloning of 5level 1 constructs (shown in the lower part) fromlevel 0 modules intolevel 1 destination vectors (shown in the middle part) in alevel 1 reaction. -
FIG. 14 shows thelevel 2 reaction from thelevel 1 constructs made as shown inFIG. 13 to produce a functional operon in alevel 2 construct. -
FIG. 15 shows asimilar level 2 reaction as shown inFIG. 14 , but with an end-linker containing internal Bsal sites for allowing afurther level 2 reaction. -
FIG. 16 illustrates the preparation of 3additional level 1 constructs (designated 6, 7 and 1, respectively) containing a second promoter (P2) and two further ORFs (orf4 and orf5). -
FIG. 17 . Cloning of the threeadditional level 1 constructs obtained according toFIG. 16 into thelevel 2i−1 construct made according toFIG. 15 . -
FIGS. 18A-18C show the structure of transcriptional units,level 0 modules, and of destination vectors required for their cloning.FIG. 18A : The transcriptional units contain up of 5 basic modules separated by 4 nucleotides sequences that serve as recombination sites (shown in boxes).FIG. 18B :Level 0 modules shown on the first line are flanked by Bsal sites. -
FIG. 18C : Strategy for removing internal type IIS recognition sequences. -
FIGS. 19A-19D show an example for cloning of constructs oflevel FIG. 19A : Illustrates cloning oflevel 0 promoter modules.FIG. 19B : Illustrates cloning of alevel 1 construct containing a transcription unit.FIG. 19C : Illustrates cloning of alevel 2i-1 construct containing 5 transcription units, TU1 (containing GFP), TU2 (containing p19), TU3 (containing VP2), TU4 (containing VP5), TU5 (containing VP7) into destination vector pL2-1.FIG. 19D : Illustrates cloning of a level 2-2 construct containing 11 transcription units. In addition to TU1 to TU5, the construct contains TU6 (transcription unit with VP3), TU7 (transcription unit with BAR), TU8 (transcription unit with antibody light chain), TU9 (transcription unit with antibody heavy chain), TU10 (transcription unit with TMV MP), TU11 (transcription unit with TMV CP). -
FIG. 20A shows the structure of the 11level 1 transcription units used inFIG. 19B . -
FIG. 20B shows the structure of construct plCH51811 that is obtained by cloning of the 11 transcription units cloned in a level 2-2 construct. -
FIG. 21 shows the expression of GFP in a Nicotiana benthamiana leaf after infiltration of level-2 constructs shown inFIG. 10 . -
FIG. 22 shows prokaryotic genes cloned aslevel 0 entry modules. -
FIG. 23A showslevel 1 destination vectors for cloning of prokaryotic coding sequences.FIG. 23B shows two sets of end linkers that contain the LacZ terminator from pUC19. -
FIG. 24 shows a list oflevel 1 constructs made with thelevel 0 modules shown inFIG. 22 . -
FIG. 25 showslevel 2i-1 constructs. -
FIGS. 26A-26B show addition of two or three more genes to the operons obtained inFIG. 25 . -
FIG. 27 shows thatlevel 2 cloning can be followed by alevel 3. -
FIG. 28 shows vectors required forlevel 3 cloning. -
FIG. 29A shows a set of destination vectors and end-linkers for assembly ofseveral level 1 transcription units (or more generally nucleic acids “na”) into a level M destination vector (M stands for multiplication).FIG. 29B illustrates assembly of three transcription units (nucleic acid acids na1 to 3) in a level M destination vector.FIG. 29C illustrates assembly of two transcription units (nucleic acids na4 and 5) in a level M destination vector. The linker is chosen from the set of linkers ELM1 to ELM7 such that na5 can be linked to cleavage site S8 of destination vector M.FIG. 29D shows assembly of the two pre-assembled blocks of transcription units (the reaction products of the reactions shown in B and C) in alevel 2 construct employing alevel 2 destination vector. Details of the restriction sites of destination vectors and end-linkers are shown inFIGS. 31 and 32 (below). -
FIG. 30 shows that level M can be used as an intermediate step betweenlevel -
FIG. 31 shows the structure of the entry DNAs as well as vectors required for level M cloning. -
FIGS. 32A-C show an example where blocks of three (FIG. 32A ) and two (FIG. 32B ) transcription units are separately pre-assembled into level M destination vectors before being assembled in alevel 2 vector in a subsequent step (FIG. 32C ). -
FIG. 33 illustrates a set of n=7 destination vectors and a set of n=7 end-linkers required for level M (left side of the figure) and P (right side of the figure). The destination vectors and end-linkers for level M (left side of the figure) are as shown inFIG. 31 . -
FIG. 34A illustrates a level P reaction wherein two level M constructs are assembled into a level P destination vector using an end-linker P to give a level P construct.FIG. 34B illustrates a level M reaction wherein two level P constructs are assembled into a destination vector M (“destination M”) using an end-linker M to give a Level M construct. -
FIG. 35 : Cloning from level P to level M and vice-versa can be repeated indefinitely by reusing the same set of vectors and end-linkers shown inFIG. 33 . -
FIG. 36 illustrates the structure of constructs obtained in successive level M and level P cloning steps. - The system of the invention comprises a defined set of components that have a high versatility and flexibility, whereby a given system can be easily applied to many different applications. Notably, a given system can be used for applications comprising different numbers of fragments to be assembled in a nucleic acid construct of interest. It is a great advantage of the invention that many different fragments can be combined with a number of acceptor vectors that is smaller than the number of fragments to be combined. Therefore, the system can be scaled to the combination of many different fragments and fragment numbers with little or no extra cloning work for the adaption of acceptor vectors to a large number of fragments.
- This system provides three advantages: (1) the cloning system allows to assemble constructs from multiple DNA fragments at each cloning step (using Golden Gate cloning), (2) the cloning procedure is automatically defined by the number of genetic elements that the user wants to assemble and does not require construct-specific cloning strategies and can therefore easily be automatized, (3) the cloning procedure can be repeated indefinitely using the same set of cloning vectors to make increasingly complex constructs (with an increasingly higher number of multigene and/or genetic elements.
- In the invention, a nucleic acid construct of interest is a DNA assembled from m nucleic acid fragment constructs, m being an integer of at least 3. Each nucleic acid fragment construct provides a sequence segment to the nucleic acid construct of interest. Typically, the nucleic acid construct of interest is present in a vector having in its backbone a selectable marker for allowing selection of cells containing the vector. The nucleic acid construct of interest is produced in the invention in a process comprising at least two, typically three, steps of restriction and ligation, for example departing from standardised pre-prepared modules. Restriction is catalysed by a type IIs restriction endonuclease, ligation is catalysed by a ligase.
- In a first step of restriction and ligation corresponding to step (A) of the method of the invention (also referred herein as “
level 1” or “level 1 reaction”), at least one, preferably at least 2, nucleic acid modules are linked by restriction and ligation and at the same time inserted into an acceptor vector. Acceptor vectors are referred to herein as “destination vectors”. Thus, the acceptor vectors of thelevel 1 reaction are also referred to herein as “level 1 destination vectors” or “level 1 acceptor vector”. Thelevel 1 acceptor or destination vectors are also referred to herein as “entry DNAs”. The reaction products of thelevel 1 reaction are referred to as “level 1 constructs” or “nucleic acid fragment constructs”, the latter terms being equivalent herein. The term “construct” herein indicates a reaction product of a restriction and ligation reaction. Thus, alevel 1 destination vector is a reactant of thelevel 1 reaction, and the nucleic acid fragment constructs are the products of thelevel 1 reaction.Multiple level 1 reactions are generally conducted separately to obtain at least two different nucleic acid fragment constructs to be assembled in the second step, referred to herein as “level 2” (see further below). One purpose of thelevel 1 reaction is to provide the nucleic acid fragment constructs to be assembled in the next step with suitable cleavage sites of a type IIs restriction endonuclease to allow ligation of the constructs obtained onlevel 1 in the desired order onlevel 2. In some embodiments, thelevel 1 reaction further serves the purpose of constructing nucleic acid fragment constructs from 2 or more modules. For example, if the nucleic acid construct of interest comprises several eukaryotic transcription units, multiple individual transcription units can be assembled inseparate level 1 reactions from 2 or more modules (such as promoter, 5′ UTR, signal peptide sequence etc.). Onlevel 2, two or more transcription units can then be combined. In afurther level 2 reaction, one or more further nucleic acid construct each containing a transcription unit can be combined with the reaction product of thefirst level 2 reaction. It is, however, not compulsory to produce the nucleic acid fragment constructs usingsuch level 1 reaction. It could also be considered to engineer them by other means or to synthesise them artificially de novo. - In the second step of restriction and ligation corresponding to step (B) of the method of the invention (referred herein as “
level 2”), at least 2 nucleic acid fragment constructs obtained in theprevious level 1 step are combined by restriction and ligation and, in the same reaction, inserted into an acceptor vector. This acceptor vector of thelevel 2 reaction is referred to as “level 2 destination vector”. If the term “destination vector” is used without reference to a particular level, it refers to alevel 2 destination vector. The reaction product of thelevel 2 reaction is referred to as “level 2 construct”. In some embodiments, thelevel 2 construct is the nucleic acid construct of interest. In other embodiments, such as in the method of the invention, thelevel 2 reaction is followed by a further reaction step (step (C)) that may be referred to as “level 2-2”, indicating asecond level 2 reaction. In thefirst level 2 reaction, a nucleic acid linker (also simply referred to as “linker” or “end-linker” herein) is preferably used that links one of the at least two nucleic acid fragment constructs to one of the cleavage sites of thelevel 2 destination vector. Use of the linker or multiple linkers significantly improves the versatility and flexibility of the systems of the invention in that different nucleic acid fragment constructs can be inserted into a given destination vector, whereby a given destination vector can be used independent of the cleavage site of the nucleic acid fragment construct. Moreover, the linkers allow introduction of a type IIs restriction site for re-opening thelevel 2 reaction product for insertion of further nucleic acid fragment constructs in a further step of restriction and ligation (step (C)) as will be described below. - The term “module” is used herein to refer to the starting compound of a
level 1 reaction other than thelevel 1 destination vector. Thus, a module is a reactant of a restriction and ligation reaction that reacts with alevel 1 destination vector. The modules of alevel 1 reaction can be produced in alevel 0 reaction (see further below). Thus, the modules of thelevel 1 reaction can be the products of alevel 0 reaction. - The system of the invention comprises a set of n entry DNAs and at least one destination vector. n is an integer of at least 2, in another embodiment of at least 3. Conveniently, n may be between 3 and 10. In the figures, examples with sets of n=7 entry DNAs are presented (
FIG. 4 ). As mentioned above, the term “entry DNA” refers to acceptor vectors of thelevel 1 reaction. The destination vector is an acceptor vector of thelevel 2 reaction. Thus, the entry DNAs and the at least one destination vector of the system are key components for performing both thelevel 1 and thelevel 2 reaction. The entry DNAs of the system of the invention allow combination of the multiple nucleic acid fragment constructs produced from multiple entry DNAs in a desired order and insertion into thelevel 2 destination vector in thelevel 2 reaction. The entry DNAs are numbered consecutively with integers starting from 1 in the order in which inserts inserted intolevel 1 reaction can be assembled into the nucleic acid construct of interest in a subsequent step. Entry DNAs differing in their numbering by 1 are referred to as contiguous entry DNAs. For allowing combination of entry DNAs and/or fragment constructs derived from the entry DNAs in a desired order in alevel 2 reaction, each of said n entry DNAs comprises in this order: (i) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof; -
- (ii) a sequence portion linking the cleavage site of said recognition site of item (i) with the cleavage site of the recognition site of the following item (iii), and
- (iii) a cleavage site of a further type IIs restriction endonuclease recognition site followed by the recognition site of said cleavage site.
- The cleavage sites of the type IIs restriction endonuclease recognition sites of item (iii) of
entry DNAs 1 to n−1 are complementary to the cleavage sites of the type IIs restriction endonuclease recognition sites of item (i) ofentry DNAs 2 to n, respectively. Being complementary means that single-stranded overhangs produced by restriction with a type IIs restriction endonuclease recognising the recognition sites of the cleavage sites are complementary such that the single stranded overhangs can anneal and be ligated after annealing to form a linear DNA. Thus, the first entry DNA can anneal with its end represented by item (iii) to the end represented by item (i) of the second entry DNA. The second entry DNA can anneal with its end represented by item (iii) to the end represented by item (i) of the third entry DNA etc. This is illustrated by the dashed arrows inFIG. 4 that link the right hand side of thelevel 1 destination vectors with the left hand side of thelevel 1 destination vector underneath. - Generally, all item (i) cleavage sites are unique and non-complementary among the n entry DNAs of the system of the invention, and all item (iii) cleavage sites of all entry DNAs are unique and non-complementary among the n entry DNAs of the invention. In a given entry DNA, the cleavage sites of items (i) and (iii) are preferably non complementary in order to avoid ligating multiple identical fragment constructs contiguously. It is also preferred that the recognition sites of items (i) and (iii) among all entry DNAs are recognitions sites of the same endonuclease so that the associated cleavage sites can be cleaved using the same type IIs restriction endonuclease. However, it is also possible that the recognition sites of different entry DNAs are recognition sites of different type IIs restriction endonucleases. In this case, multiple endonucleases will have to be used in a given
level 2 reaction to ensure that all required cleavage sites are cleaved. The single-stranded overhangs formed from the cleavage sites by type IIs restriction enzyme cleavage are preferably non-palindromic. - In a preferred embodiment, the cleavage site of the type IIs restriction endonuclease recognition site of item (iii) of entry DNA n is complementary to the cleavage site of the type IIs restriction endonuclease recognition site of item (i) of
entry DNA 1 for allowing annealing of complementary single-stranded overhangs formed by restriction at recognition site (i) ofentry DNA 1 and at recognition site (iii) of entry DNA n. This feature is illustrated in the long dashed arrow linking the TGCC cleavage site of thelevel 1 destination vector pL1F-7 with the TGCC cleavage site of thelevel 1 destination vector pL1F-1 (FIG. 4 ). As will be described in more detail below, this allows reuse of the first and optionally further entry DNAs in asecond level 2 reaction so that more nucleic acid fragment constructs can be combined to produce a nucleic acid construct of interest than the system has entry DNAs. - The entry DNAs may be circular plasmids or vectors, wherein items (i) and (iii) of the entry DNAs are linked by a vector backbone. The vector backbone may contain a selectable marker allowing selection of cell clones containing the entry DNA or the nucleic acid fragment construct obtained therefrom in the
level 1 reaction. - The system further comprises a destination vector (
level 2 destination vector) comprising in this order: - (I) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof;
- (II) a vector backbone comprising a selectable marker gene, said vector backbone linking the cleavage sites of said recognition sites of items (I) and the following item (III);
- (III) a further cleavage site of a type IIs restriction endonuclease recognition site followed by the recognition site of said cleavage site, and
- (IV) optionally, an insert between the recognition sites of item (III) and item (I);
- In the destination vector, the cleavage sites of items (I) and (III) are different and non-complementary. Preferably, the recognition sites of items (I) and (III) are recognition sites of the same endonuclease so that the associated cleavage sites can be cleaved using the same type IIs restriction endonuclease. For convenience, the recognition sites of items (I) and (III) are further recognitions sites of the same endonuclease as the recognition sites of items (i) and (iii) of the entry DNAs, so that the
level 2 reaction can be performed using one type IIs restriction endonuclease. - For enabling ligation of multiple nucleic acid fragment constructs into the destination vector, the type IIs restriction endonuclease recognising the recognition site (1) of said destination vector can produce a single-stranded overhang from the cleavage site of item (I) that is complementary to the single-stranded overhang producible by the type IIs restriction endonuclease recognising the recognition site (i) of entry DNA numbered 1 for enabling annealing of said complementary single-stranded overhangs and ligation of said destination vector with the DNA segment of item (ii) from entry DNA numbered 1. In the terminology used herein, the cleavage site of item (i) of
entry DNA 1 and the cleavage site of item (I) of the destination vector are complementary. Alternatively, the type IIs restriction endonuclease recognising the recognition site (1) of a destination vector can produce a single-stranded overhang from the cleavage site of item (I) that is complementary to the single-stranded overhang producible by the type IIs restriction endonuclease recognising the recognition site (i) of an entry DNA other than 1, such as 2 or 3. Such destination vectors are depicted inFIG. 4 . - For inserting a ligation product from multiple nucleic acid fragment constructs into the destination vector, the cleavage site of item (III) of the destination vector may be made complementary to the cleavage site of the entry DNA that will be linked to the cleavage site of item (III). However, in the present invention nucleic acid linkers may be used for this purpose, since suitable linkers allow to link any item (iii) cleavage site to the item (III) cleavage site of the destination vector without the need for producing a destination vector for each possible downstream (item (iii)) cleavage site of the entry DNAs. Since the specific item (iii) cleavage site of an entry DNA or nucleic acid fragment construct depends, for a given set of entry DNAs, from the number of fragment constructs to be combined in the
level 2 reaction, the linkers provide the system with a broad applicability to many different real life applications. Notably, a given system can be applied to cases with different numbers of fragment constructs to be recombined. An advantageous linker comprise in the following order: - (a) a type IIs restriction endonuclease recognition site;
- (b) a cleavage site of said recognition site of item (a);
- (c) a cleavage site of a further type IIs restriction endonuclease recognition site of the following item (d);
- (d) a type IIs restriction endonuclease recognition site defining the cleavage site of item (c) and being a recognition site of a type IIs restriction endonuclease different from that of item (a);
- (e) a type IIs restriction endonuclease recognition site, preferably of the same endonuclease as the recognition site of item (d);
- (f) a cleavage site of said recognition site of item (e);
- (g) a cleavage site of a further type IIs restriction endonuclease recognition site of the following item (h);
- (h) a type IIs restriction endonuclease recognition site defining the cleavage site of item (g), preferably of the same endonuclease as the recognition site of item (a).
- The linkers may be linear DNA molecules. Generally, however, the linkers are circular plasmids. The linkers comprise a pair of type IIs restriction endonuclease recognition sites (items (a) and (h)) and associated cleavage sites (items (b) and (g) at both ends for linking a given item (iii) site with an item (III). This pair of restriction sites is in convergent orientation, which means that the two cleavage sites are oriented toward the center of the linker, while the recognition sites are oriented towards the termini of the linker so that the recognition sites are removed upon restriction. Examples of linkers are linkers pELB-1 to -7 and pELR-1 to -7 shown in
FIG. 4 . - The linkers preferably comprise a further, different, pair of type IIs restriction sites flanked by the pair formed by items (a), (b), (g) and (h) of the linker. This further pair is formed by items (c) to (f) of the linker and is in divergent orientation, which allows to reopen a
level 2 reaction product produced using such linker for insertion of further nucleic acid fragment constructs. - The cleavage site of item (b) is complementary to an item (iii) cleavage site of an entry DNA for being capable of linking a cleavage site of item (iii) of one of a entry DNAs numbered 1 to n, preferably of
number 1 to n−1, to a cleavage site of item (III) of said destination vector. The cleavage site of item (g) of the linker is complementary to the cleavage site of item (III) of the linker. - The linkers may be provided as part of a plasmid containing the linker elements (a) to (h) defined above and a plasmid backbone linking elements (a) and (h). The backbone may contain a selectable marker for selecting cells containing the plasmid using a selective agent. This allows storage and amplification of linkers in cells, notably bacterial cells.
- In a preferred embodiment, the system comprises a set of n nucleic acid linkers numbered 1 to n, each n-th linker comprising items (a) to (h), the cleavage site of item (iii) of each n-th entry DNA is complementary to the cleavage site of item (b) of the n-th linker; the cleavage site of item (g) of each n-th linker being complementary to the cleavage site of item (III) of the n-th destination vector. Thus, each n-th linker is capable of linking a cleavage site of item (iii) of the n-th entry DNA to a cleavage site of item (III) of each n-th destination vector. In this embodiment, the system contains the same number of n entry DNAs and linkers. For each entry DNA of the set of n entry DNA, a linker is provided allowing linking the item (iii) cleavage site to the item (III) cleavage site of the destination vector. Thus for a given destination vector, all item (g) cleavage sites of the set of n linkers can be identical. As an example,
FIG. 4 shows a set of 7 entry vectors (level 1 destination vectors) PL1F-1 to -7 and a set of linkers pELB-1 to -7. Each of the n linkers of the set of n linkers may be part of a plasmid. - The cleavage sites of items (b) and (c) within each linker may have the same sequence of nucleotides and may overlap such that one and the same sequence of nucleotides provides the cleavage site of items (b) and that of item (c). Similarly, the cleavage sites of items (f) and (g) within each linker may have the same sequence of nucleotides and may overlap such that one and the same sequence of nucleotides provides the cleavage site of items (f) and that of item (g).
- In some embodiments, it may be desired to use a given entry DNA of number>1 at a
position 1 in the reaction product of thelevel 2 reaction. For this purpose, the system of the invention may comprise from 1 to n multiple destination vectors numbered 1 to n, each of said 1 to n destination vectors having segments (I) to (III) as defined above and optionally a segment (IV) as defined above. The cleavage sites of item (III) of all destination vectors may be identical and all cleavage sites of item (I) of all n destination vectors may be unique among the cleavage sites of item (III). Preferably, the n-th item (I) cleavage site of all n destination vectors is complementary to the n-th item (i) cleavage site of the entry DNA. An example of such embodiment is given inFIG. 4 that shows a set of 7level 2 destination vectors having identical item (III) cleavage sites (GGGA) that are complementary to the GGGA cleavage sites of the linkers. The item (I) cleavage sites of eachlevel 2 destination vector is complementary to thelevel 1 destination vector shown in the same line in the left-most column. - The optional insert of item (IV) of the destination vector(s) may be any sequence linking items (III) and (I), whereby the destination vector will be a circular molecule of vector. Absence of the insert of item (IV) may mean that the destination vector is linear DNA molecule. Preferably, however, an insert is used that is or comprises a marker gene that allows to distinguish cell clones containing the destination vector from those containing the product of the
level 2 reaction. Since the restriction sites of the destination vector are in divergent orientation with respect to the insert (seeFIG. 4 ), the insert is lost in thelevel 2 reaction. Thus, destination vectors andlevel 2 reactions products can be distinguished by the absence of red color in cell clones containing the latter. - In the invention, entry DNAs and nucleic acid fragment constructs differ in that the latter contain a sequence segment (item (ii′)) of the nucleic acid construct of interest to be produced. For allowing introduction of such sequence segment with its core portion into the entry DNA in a
level 1 reaction, each sequence portion of item (ii) of eachentry DNA 1 to n generally comprises a further pair of two type IIs restriction endonuclease recognition sites oriented such that said further pair of recognition sites can be removed from said entry DNAs by treatment with type IIs restriction endonuclease(s) recognising said further pair of recognition sites. InFIG. 4 , the further pair of two recognition sites are the Bsal sites in thelevel 1 destination vectors. The sequence region of item (ii) is not particularly restricted and may be as short as a few nucleotides. In one embodiment, however, the sequence region of item (ii) contains a reporter gene or reporter genes allowing color selection of cell clones containing the reporter gene(s). Said further pair of recognition sites may flank the reporter gene for enabling selection of cell clones for the presence or absence of said reporter gene e.g. by color. InFIG. 4 , the reporter gene is lacZ flanked by a pair of Bsal sites. The further pair of two type IIs restriction endonuclease recognition sites are recognition sites of endonucleases different from those of recognition sites of item (i) and item (iii) so that a given endonuclease does not cleave a cleavage site of the further pair and, at the same time, a recognition site of item (i) or (iii). The orientation of the further pair of recognition sites is divergent with respect to the marker gene, so that the reporter gene and the recognition sites is removed upon treatment with the type IIs endonuclease recognising these site, so that these recognition sites are not present in the nucleic acid fragment construct obtained in thelevel 1 reaction. - The second type of reactants of the
level 1 reaction is one or more modules that can be incorporated into the entry DNAs in thelevel 1 reaction using the known methodology described in Engler et al. PLoS ONE 4 (2009) e5553. These modules are also referred to herein as “level 0 modules”, since they can be produced in alevel 0 reaction. An example of alevel 1 reaction is schematically shown inFIG. 5A . One ormore level 0 modules are ligated together in a desired order with and into the entry DNA to produce thelevel 1 fragment constructs (level 1 construct), using the inner pair of type IIs restriction sites present in sequence portion of item (ii) of the entry DNA. This leaves the recognition and cleavage sites of items (i) and (iii) unchanged, whereby these are also present in the reaction product for use in thesubsequent level 2 reaction. Since at least 3 nucleic acid fragment constructs are employed in method of the invention, at least 3level 1 reactions are typically performed separately.Multiple level 1 constructs and alevel 2 destination vector are then combined in a one pot reaction onlevel 2. - In the method of the invention, a nucleic acid construct of interest is produced from at least m nucleic acid fragment constructs numbered 1 to m. Each nucleic acid construct of interest typically comprises a sequence segment to be incorporated into the nucleic acid construct of interest. These sequence segments may be numbered 1 to m as the nucleic acid fragment construct containing them in the order of occurrence in the nucleic acid construct of interest. Numeral m is an integer of at least 3, preferably at least 6, more preferably at least 10. Said method comprises the steps (A) to (C) as described in the following.
- In step (A), the m nucleic acid fragment constructs are provided. Each of said m nucleic acid fragment constructs comprising in this order:
- (i′) a type IIs restriction endonuclease recognition site of the upstream cleavage site of item (ii′);
- (ii′) a sequence segment of said nucleic acid construct of interest, said sequence segment comprising an upstream cleavage site of the recognition site of item (i′), a core portion of the sequence segment, and a downstream cleavage site of the recognition site of the following item (iii′), and
- (iii′) the type IIs restriction endonuclease recognition site of said downstream cleavage site of item (ii′).
- The m nucleic acid fragment constructs can be provided in (separate)
level 1 reactions using the entry DNAs of the system of the invention and at least one module per type of fragment construct that provides the core portion to the sequence segment of item (ii′). In thelevel 1 reaction one or several such modules may be combined to generate the fragment constructs with the desired core portion comprising portions derived from multiple modules. The modules used in thelevel 1 reaction are also referred to herein as “level 0 modules”, as they can be prepared in a restriction and ligation step before thelevel 1 reaction. Thelevel 1 reaction may be performed as explained with reference toFIGS. 7 and 8 using known methods and as described herein. It is also possible to use more than module having identical type IIs restriction endonuclease cleavage sites in one restriction and ligation reaction, allowing the generation of libraries of nucleic acid fragment constructs. For example, five promoter modules P1 to P5 as depicted inFIG. 7 may be combined in one reaction, whereby a mixture oflevel 1 fragment constructs is obtained that differ by having different promoters. Thus, the mixture of fragment constructs may be screened for the most suitable promoter function in the context of the remaining modules introduced into the fragment constructs. Similarly, libraries of fragment constructs containing different 5′UTRs, signal peptides, ORFs, terminators, combinations thereof or other elements may be produced and screened for a suitable property using the invention. - The downstream cleavage sites of nucleic acid fragment constructs 1 to m−1 are complementary to the upstream cleavage sites of nucleic acid fragment constructs 2 to m, respectively, for allowing assembly of the nucleic acid fragment constructs in the order corresponding to the numbering of the constructs in the subsequent step (B). In the nucleic acid fragment constructs, the recognition sites of items (i′) and (iii′) as well as the upstream and downstream cleavage sites of item (ii′) are derived from the entry DNAs used, whereas the core portion is essentially derived from the
level 0 modules. - If the nucleic acid fragment constructs are provided in a
level 1 reaction, the products of thelevel 1 reaction are generally transformed into cells for amplification and purification. Typically, they are transformed into competent bacterial cells such as E. coli cells. After cell growth, the fragment constructs are isolated from the cells, e.g. using standard plasmid preparation protocols, for use in the following step (B). - The method of the invention comprises two steps wherein fragment constructs are combined, namely the following steps (B) and (C). In these steps, at least one fragment construct is used in step (C) that is derived from the same entry DNA as a fragment construct used in step (B). Thus, the method of the invention allows reuse of entry DNAs for more than one nucleic acid fragment. It is an important aspect of the invention that many different fragment constructs can be combined with a relatively small number entry DNAs. However, in this embodiment, fragment constructs derived from the same entry DNA have the same upstream and downstream cleavage sites (ii′) and are therefore used in separate reactions to avoid statistical inclusion of either fragment construct at a selected position into the final nucleic acid construct of interest.
- For this purpose, the downstream cleavage site of a nucleic acid fragment construct u, wherein u is an integer that is <m and at least 2, is complementary to the upstream cleavage site of the type IIs restriction endonuclease recognition site of item (ii′) of nucleic acid fragment 1 (illustrated in
FIG. 4 by the long dashed arrow linking cleavage sites TGCC of pL1F-1 and pL1F-7). - Step (B) is a
level 2 reaction. In the terminology used with reference to the figures, step (B) is alevel 2i-1 reaction. In step (B), the sequence segment(s) of item (ii′) of nucleic acid fragment constructs 1 to s, wherein s is an integer<u, and said linker are ligated, in this order, and inserted into said destination vector. This may be done by reacting, in the presence of a type IIs restriction endonuclease recognising said type IIs restriction endonuclease recognition sites of items (i′) and (iii′) and items (I) and (III) of the destination vector defined below and in the presence a DNA ligase, in reaction medium compatible with activity of said type IIs restriction endonuclease and said ligase. For example, a mixture comprising nucleic acid fragment constructs 1 to s, the destination vector and a linker may be treated with the type IIs restriction endonuclease and the DNA ligase in a reaction medium compatible with activity of the type IIs restriction endonuclease and the ligase. Thus, s defines the number of nucleic acid fragment constructs combined in step (B) with the (level 2) destination vector. Since s is smaller than u, nucleic acid fragment construct u+1 and higher will not be used in step (B), but in a subsequent step such as step (C). - The linker that may be used in step (B) is as defined above. Cleavage site (b) of the linker may be complementary to the downstream cleavage site of item (ii′) of nucleic acid fragment construct s, and cleavage site (g) of said linker and the cleavage site of item (III) of the destination vector may complementary for connecting the downstream cleavage site of fragment construct s to site (III) of the destination vector.
- Step (B) may comprise transformation of the restriction and ligation product into cells for amplification and purification. Typically, it is transformed into competent bacterial cells such as E. coli cells. After cell growth, the
level 2 construct is generally isolated from the cells, e.g. using standard plasmid preparation protocols, for use in the following step (C). - Step (C) is a
subsequent level 2 reaction. In the terminology used with reference to the figures, step (C) is a level 2-2 orlevel 2i-2 reaction. In step (C), a mixture comprising the recombination product of step (B) (a “level 2i-1 construct”) and nucleic acid fragment construct(s) s+1 to m is treated with a type IIs restriction endonuclease recognising said type IIs restriction endonuclease recognition sites of items (i′) and (iii′), a type IIs restriction endonuclease recognising said type IIs restriction endonuclease recognition sites of items (d) and (e) of the linker and a DNA ligase in a reaction medium compatible with activity of said type IIs restriction endonucleases and said ligase. Thereby, the sequence segments of item (ii′) of nucleic acid fragment constructs s+1 to m and optionally a further linker as defined in item (3) are inserted into the cleavage sites provided by items (c) and (f) of the linker used in step (B). The recognition sites of items (i′) and (iii′) may be the same as the recognition sites of items (d) and (e) of the linker, whereby a type IIs restriction endonuclease recognising all these recognition sites can be used. The linker may be of the type pELE shown inFIG. 4 , whereby nofurther level 2 reaction can be performed with the reaction product of step (C). Alternatively, a linker as defined in item (3) may be used, whereby afurther level 2 reaction can be conducted. Thus, nucleic acid constructs of interest can be made from more than m nucleic acid fragment constructs. The possibility to use one or morefurther level 2 reactions is schematically shown inFIG. 6 . - Step (C) may comprise transformation of the restriction and ligation product into cells for amplification and purification. Typically, it is transformed into competent bacterial cells such as E. coli cells. After cell growth, the construct of step (C) may be isolated from the cells, e.g. using standard plasmid preparation protocols.
- The present invention provides a further system for producing a nucleic acid construct of interest. Similar as with the system described above, consecutive repetitions of cloning steps and re-use of the cleavage sites from a predefined set of vectors allows to increase the number of fragments that make up a nucleic acid construct of interest in a vector. In this system, a set of n destination vectors is used that are referred to as “level M destination vectors”. Level M destination vectors differ from
level 2 destination vectors in that an additional type IIs restriction endonuclease recognition site is present (compare thelevel 2 destination vectors ofFIG. 4 with the “level M destination vectors” inFIG. 31 ). The additional recognition site has a cleavage site of the same nucleotide sequence as the cleavage site of item (I) of thelevel 2 destination vectors. Fragment constructs or entry DNAs are inserted into the level M destination vectors together with linkers referred to as “linkers M”. Linkers M differ from linkers such as linkers pELE shown inFIG. 4 in that they have an additional type IIs restriction endonuclease recognition site (compare linkers pELE ofFIG. 4 with the “end liners M” inFIG. 31 ). The additional recognition site of linkers M has a cleavage site of the same nucleotide sequence as the cleavage site on the left hand side of linkers pELE ofFIG. 4 . The additional recognition sites of destination vectors M and linkers M allow excision of constructs cloned into the designation vectors M and introduction, preferably with other constructs produced in parallel level M reactions, into a further destination vector referred to as “level P destination vector” together with a suitable linker referred to as linker P. Similarly as destination vectors M and linkers M, destination vectors P and linkers P are designed such that excision of constructs cloned into the designation vectors P is possible as well as reintroduction into a further level M destination vector. Since in each cloning step, multiple fragment constructs prepared in parallel preceding steps can be combined, the number of fragment constructs combined into a construct of interest can be increased multiplicatively, which is indicated by letter M in “destination vector M”. In any event, a set of n destination vectors M and a set of n linkers M, preferably in combination with a set of n destination vectors P and set of n linkers P, allows reuse of a limited number of n cleavage sites such that a large number of fragment constructs (90 inFIG. 36 ) can be assembled with a small number of elements n in said sets. - n is at least 2, preferably at least 3, more preferably at least 4. The versatility of the system increases with increasing n. However, it is not necessary to have n>10. Thus, n may be a number of from 3 to 20, preferably of from 4 to 10, more preferably of from 5 to 9 or from 6 to 8. In the figures, embodiments with n=7 are exemplified, which is the most preferred embodiment.
- The cleavage sites of items (II′) and (III′) of destination vectors M may overlap completely. In this case, one physical sequence of nucleotides provides the cleavage sites of two different type IIs restriction endonuclease recognition sites. Analogously, one physical sequence of nucleotides may provide the cleavage sites of two different type IIs restriction endonuclease recognition sites, namely the cleavage sites of items (b′) and (c′), of items (II′) and (III′) and of items (b″) and (c″). This embodiment is used in the examples shown in the figures. However, it is also possible that the cleavage sites of the pairs mentioned before are adjacent separated cleavage sites.
- The number of entry DNAs to be used in not decisive in the system of this embodiment. It is possible that one entry DNA is incorporated into a level M destination vector, optionally followed by incorporation of one
level 1 construct into a level P destination vector. However, the main advantages of the system can be made use of if at least 2, at least 3, at least 4, or at least 5 entry DNAs are combined by introduction into a destination vector M. The recognition sites of items (i), (iii), (I′) and (VII′), (a′), and (f) may be recognition sites of the same type IIs restriction endonuclease. - Multiple level M reactions can be conducted in parallel in separate reaction vessels as indicated in
FIGS. 32 a and b . The separate level M constructs may be combined in asubsequent level 2 or level P reaction. The destination vector M used in the second level M reaction is selected such that it has a complementary cleavage site to the cleavage site provided by the linker M used in the first level M reaction. In this way, two or more level M constructs can be combined in alevel 2 reaction (FIG. 32 c ) or a level P reaction (FIG. 34 a ). InFIG. 32 c , it is the TTAC cleavage site in the level M construct 1 that is derived from a linker M used in a first level M reaction. The TTAC cleavage site in thelevel M construct 2 is derived from a destination vector M used in the second level M reaction. Cleavage by Bsal allows ligation of the construct comprising TU1 to TU3 with the construct comprising TU4 and TU5 into thedestination vector level 2. Alternatively, as shown inFIG. 34 a , a destination vector P and linker P can be treated with Bsal and ligase together with level M constructs 1 and 2 to give the level P construct shown at the bottom ofFIG. 34 a. - In the set of n destination vectors P, the same set of cleavage sites is used as in the destination vectors M. The cleavage sites (VI″) of all n destination vectors P are identical and are at the same time identical to the cleavage sites of items (VI′) of destination vectors M. In the set of n linkers P, the same set of cleavage sites is used as in the linkers M and the destination vectors M. Thus, a limited set of n cleavage sites allows to combine a number of fragments constructs that can far exceed the number of n. As shown in
FIG. 34 b , multiple level P constructs can be combined into another destination vector M using a linker M. - The recognition sites of items (I″), (IV′), (d′), (a″) and (f″) may be recognition sites of the same type IIs restriction endonuclease, and the recognition sites of items (IV″), (I′), (VII′), (a′) and (f′) may be recognition sites of the same type IIs restriction endonuclease.
- Item (VIII′) of destination vector M may have a marker allowing selection of ligation product of a level M reaction for absence of item (VIII′). The marker may be lacZ for blue/white selection. Item (VIII″) of destination vector P may have a marker allowing selection of ligation product of a level P reaction for absence of item (VIII″). Generally, the designation vectors and the linkers are circular molecules or plasmids containing a selectable marker in their backbone.
- An advantage of systems and methods of the invention is that cloning steps can be done as one-pot reactions, requiring only a simple incubation such as in a thermocycler. In particular, this avoids the need for labor-intensive and operations that are difficult to automate such as purification of DNA fragments from agarose gels. This mean that all the elements required for the design of a completely automatized cloning system are now in place. Operations that are employed are preparation of miniprep DNA, liquid handling and incubation to perform restriction-ligation, plating of transformations on plates, picking of colonies, and digestion and analysis of miniprep DNA. This last step may be replaced by DNA sequencing, as very few colonies need to be screened to obtain the desired construct (in the majority of cases, one colony is sufficient). All these operations can easily be handled by standard automation robots. This aspect promises to revolutionize the number of constructs that can be made within a given time as well as the production costs for these constructs, and therefore opens the door for new applications that will be required in the field of synthetic biology.
- Another advantage of the invention compared to more traditional cloning strategies is that complex design of specific construction strategies is not needed anymore, since the design is automatically defined by the number and the order of modules (or genes) that a user wants to assemble. The cloning strategy in fact can be easily and unambiguously determined by a simple computer program. This program can be directly linked to the automation robots that would physically make the construct. An advantage of such system is that the cloning strategy itself cannot become a limiting factor when constructs reach a large size, since the same principles and the same cloning vectors can be reused indefinitely. In fact, it is conceivable that the invention can be used to clone entire chromosomes.
- The system of the invention allows the production of nucleic acid constructs of interest from multiple nucleic acid fragments constructs using a combination of nucleic acid fragment constructs via single-stranded overhangs formed at both ends of the fragments using type IIs restriction endonucleases. In the invention, type IIs restriction enzymes are used. The type IIs restriction endonuclease recognition site is a recognition site of a restriction endonuclease recognizing a double-stranded DNA and cleaving the double-stranded DNA at a cleavage site that is outside the recognition site on the double stranded DNA. The type IIs restriction endonuclease cleaves such that, depending on the specific type IIs restriction endonuclease, overhangs of from 3 to 6 nucleotides are produced. However, it is also possible to use type IIs endonucleases producing longer single-stranded overhangs. The nucleotide range that forms the overhangs upon cleavage is referred to herein as cleavage site. Since the nucleotides of the cleavage site are not part of the recognition site, they can be chosen as desired without destroying cleavage activity of the type IIs restriction endonuclease. Examples of type IIs restriction endonucleases suitable for the methods of the invention are given below.
- For practicing the invention, not only Bsal and Bpil, but any type IIs restriction enzyme that provides “sticky” ends sufficient for efficient ligation at its cleavage sites can be used. A selection of such enzymes is provided on the REBASE webpage (rebase.neb.com/cgi-bin/asymmlist) and in the review of Szybalsky et al. (1991, Gene, 100:13-26). Type II restriction enzymes with asymmetric recognition sites (e.g. those shown in this webpage) that have cleavage site outside of recognition site and provide upon cleavage of at least three, preferably 4 or more nucleotide residues overhangs (e.g. Bli7361; BpuAI, VpaK321, SfaNI, etc.) can be used in the invention. It is recommended that the recognition site contains at least 4, more preferably at least 6 or more base pairs in order to minimize the chance for such site to be found in a sequence portion of interest. Type IIs restriction nucleases with 5 bp recognition sites (e.g. SfaNI) also can be used. Type IIs restriction endonucleases that produce 4 nt single-stranded overhangs at the extremities of digested fragments can theoretically generate ends with 256 possible sequences. Type IIs restriction enzymes having even longer recognition sites, e.g. comprising ten or more base pairs have been engineered. The largest recognition site among natural type IIs enzymes is for the enzyme SapI which has a 7 bp recognition site. A preferred solution is the use of artificial type IIs enzymes engineered to have a long recognition site (Lippow et al, 2009, Nucleic acides Res., 37:3061-3073). For example, a type IIs enzyme with a 18 bp recognition sites would be expected to cut only a few times per eukaryotic genome at most, and would allow to make most entry modules without having to change any nucleotide of the native sequence.
-
TABLE 1 List of usable type Ils restriction enzymes commercially available reach reach Recognition top bottom Name sequence strand strand extension BsaXI (9/12) 10 7 3 nt 3′ ACNNNN NCTCC (10/7) Bst6I CTCTTC 1 4 3 nt 5′ (1/4) Eam1104I CTCTTC 1 4 3 nt 5′ (1/4) EarI CTCTTC 1 4 3 nt 5′ (1/4) LguI GCTCTTC 1 4 3 nt 5′ (1/4) PciSI GCTCTTC 1 4 3 nt 5′ (1/4) BspQI GCTCTTC 1 4 3 nt 5′ (1/4) SapI GCTCTTC 1 4 3 nt 5′ (1/4) BveI ACCTGC 4 8 4 nt 5′ (4/8) Acc36I ACCTGC 4 8 4 nt 5′ (4/8) BfuAI ACCTGC 4 8 4 nt 5′ (4/8) BspMI ACCTGC 4 8 4 nt 5′ (4/8) AarI CACCTGC 4 8 4 nt 5′ (4/8) Esp3I CGTCTC 1 5 4 nt 5′ (1/5) BsmBI CGTCTC 1 5 4 nt 5′ (1/5) BstV2I GAAGAC 2 6 4 nt 5′ (2/6) BpiI GAAGAC 2 6 4 nt 5′ (2/6) BpuAI GAAGAC 2 6 4 nt 5′ (2/6) BbsI GAAGAC 2 6 4 nt 5′ (2/6) BseXI GCAGC 8 12 4 nt 5′ (8/12) Lsp1109I GCAGC 8 12 4 nt 5′ (8/12) BstV1I GCAGC 8 12 4 nt 5′ (8/12) BbvI GCAGC 8 12 4 nt 5′ (8/12) SfaNI GCATC 5 9 4 nt 5′ (5/9) LweI GCATC 5 9 4 nt 5′ (5/9) BtgZI GCGATG 10 14 4 nt 5′ (10/14) FokI GGATG 9 13 4 nt 5′ (9/13) FaqI GGGAC 10 14 4 nt 5′ (10/14) BsIFI GGGAC 10 14 4 nt 5′ (10/14) BsmFI GGGAC 10 14 4 nt 5′ (10/14) Bso31I GGTCTC 1 5 4 nt 5′ (1/5) BspTNI GGTCTC 1 5 4 nt 5′ (1/5) Eco31I GGTCTC 1 5 4 nt 5′ (1/5) BsaI GGTCTC 1 5 4 nt 5′ (1/5) Alw26I GTCTC 1 5 4 nt 5′ (1/5) BstMAI GTCTC 1 5 4 nt 5′ (1/5) BsmAI GTCTC 1 5 4 nt 5′ (1/5) BaeI (10/15) 12 7 5 nt 3′ ACNNNN GTAYC (12/7) PpiI (7/12) 13 8 5 nt 3′ GAACNNN NNCTC (13/8) PsrI (7/12) 12 7 5 nt 3′ GAACNNN NNNTAC (12/7) AloI (7/12) 12 7 5 nt 3′ GAACNNN NNNTCC (12/7) BarI (7/12) 12 7 5 nt 3′ GAAGNNN NNNTAC (12/7) AjuI (7/12) 11 6 5 nt 3′ GAANNNNN NNTTGG (11/6) TstI (8/13) 12 7 5 nt 3′ CACNNNN NNTCC (12/7) Hin4I (8/13) 13 8 5 nt 3′ GAYNNN NNVTC (13/8) HgaI GACGC 5 10 5 nt 5′ (5/10) CseI GACGC 5 10 5 nt 5′ (5/10) -
TABLE 2 Preferred type IIs restriction enzymes reach reach Recognition top bottom Name sequence strand strand extension LguI GCTCTTC (1/4) 1 4 3 nt 5′ PciSI GCTCTTC (1/4) 1 4 3 nt 5′ BspQI GCTCTTC (1/4) 1 4 3 nt 5′ SapI GCTCTTC (1/4) 1 4 3 nt 5′ BveI ACCTGC (4/8) 4 8 4 nt 5′ Acc36I ACCTGC (4/8) 4 8 4 nt 5′ BfuAI ACCTGC (4/8) 4 8 4 nt 5′ BspMI ACCTGC (4/8) 4 8 4 nt 5′ AarI CACCTGC (4/8) 4 8 4 nt 5′ Esp3I CGTCTC (1/5) 1 5 4 nt 5′ BsmBI CGTCTC (1/5) 1 5 4 nt 5′ BstV2I GAAGAC (2/6) 2 6 4 nt 5′ BpiI GAAGAC (2/6) 2 6 4 nt 5′ BpuAI GAAGAC (2/6) 2 6 4 nt 5′ BbsI GAAGAC (2/6) 2 6 4 nt 5′ BtgZI GCGATG (10/14) 10 14 4 nt 5′ Bso31I GGTCTC (1/5) 1 5 4 nt 5′ BspTNI GGTCTC (1/5) 1 5 4 nt 5′ Eco31I GGTCTC (1/5) 1 5 4 nt 5′ BsaI GGTCTC (1/5) 1 5 4 nt 5′ HgaI GACGC (5/10) 5 10 5 nt 5′ CseI GACGC (5/10) 5 10 5 nt 5′ - Most preferred are the following type IIs restriction endonucleases: SapI, BspMI, AarI, Esp3I, Bpil, Bsal and Hgal. Many of the cited restriction endonucleases are available from New England Biolabs. Sources of these enzymes can also be found on the REBASE webpage mentioned above.
- Examples of ligases to be used in the invention include T4 DNA ligase, E. coli DNA ligase, Taq DNA ligase, all of which are commercially available from New England Biolabs.
- In the following, the invention will be further described with reference to specific embodiments, examples and the figures.
-
FIG. 1A shows elements of a system that allows re-use of the entry DNAs of the invention (level 1 destination vectors) for different inserted sequence segments. This system comprises: - (1) n nucleic acid fragment constructs (“na”, shown for n=1 to 7), each flanked by two sequences Sx and Sy representing cleavage sites of a type IIs restriction endonuclease. After restriction endonuclease digestion, the cleavage sites form single-stranded overhangs that are complementary from one nucleic acid fragment constructs (as well as the underlying entry vector) to the next, which is indicated by the same index of “S”. The cleavage site at the 3′ end (right hand side in the figures) of the last construct (na7) forms a single-stranded overhang compatible with the overhang created by cleavage of the cleavage site at the 5′ end (left hand side in the figures) of the first fragment construct na1 by restriction endonuclease digestion, as indicated by the same numbering “S1” at these sites;
(2) a set of n ‘end-linkers’ (ELx, x indicating the numbering from 1 to 7) flanked on one side (5′ end) with a cleavage site compatible with the 3′ cleavage sites (S1 to S7) of the nucleic acid fragment constructs (as well as the underlying entry DNA) and on the other side (3′ end) with a unique site not compatible with any of the n entry DNAs (S8);
(3) a destination vector with two cleavage sites, one site compatible with sites S1 (or S2, S3, S4, S5, S6, S7), and the other site compatible with cleavage site S8 of the end-linkers. -
FIG. 1B provides an example for cloning of three nucleic acid fragment constructs into a destination vector. Cloning of the three nucleic acids fragment constructs employs ligation of the appropriate end-linker (end-linker 3). The resulting construct can be later re-opened at cleavage sites S4 and S8 by digestion with the appropriate type IIs endonuclease. All other sites lack a flanking type IIs endonuclease recognition site in the reaction product and are thus protected from digestion with the endonuclease used for the production of this reaction product.FIG. 1C shows that further nucleic acid fragment constructs na4 to na7 and a further construct na1 that is based on the sameentry vector number 1 as na1 ligated inFIG. 1B can be cloned into the product vector obtained inFIG. 1B . At each successive cloning step, a different end-linker is used (ELx or ELx-b that differ by containing different internal type IIs restriction site for reopening the construct at the next stage). The structure of the end-linkerse will be more specifically explained in the following figures). -
FIG. 2 explains how type IIs restriction sites are depicted in the following figures. A type IIs restriction endonuclease site contains a recognition sequence (also referred to herein as “recognition site”) and a cleavage site located outside of the recognition sequence. The nucleotide sequence of the cleavage site is shown in a horizontally elongated box. The recognition site can be found on either side of the cleavage site, depending on the orientation of the asymmetrical recognition site in the DNA. When the recognition sequence is located on the left of the cleavage site, the recognition site is illustrated as a vertically elongated box on the left half under the box representing the cleavage site. When the recognition sequence is located on the right of the cleavage site, the recognition site is illustrated as a vertically extended box on the right half under the box representing the cleavage site. The nucleotide sequences of the top DNA strand are shown next to the represented type IIs restriction sites. In the third and fourth row, two restriction sites are shown the cleavage sites of which are oriented towards each other, but are non overlapping and separated by an “optional sequence”. In the fifth and sixth row, two restriction sites are shown the cleavage site of which overlap over the entire range of 4 base pairs. - After cloning using a type IIs enzyme, the corresponding recognition site is usually eliminated during cloning. If recognition sites on both sides of the cleavage site are eliminated, only the sequence of the cleavage site (4 bases in the examples depicted) is left in the DNA as shown schematically at the bottom.
FIG. 3 shows a general embodiment wherein a eukaryotic multi-gene construct is produced as the nucleic acid fragment of interest.FIG. 3 illustrates a general strategy that can be used. Basic genetic elements such as promoters, 5′ untranslated regions (5′ UTRs), signal peptides (SP), open reading frames (ORFs), and terminators (T) are cloned as basic modules (“level 0 modules”). Libraries of each of these types of genetic elements may be provided as shown at the top. The libraries of ‘level 0’ entry modules may be stored until needed for cloning. When a construct needs to be made, genetic elements cloned aslevel 0 modules are chosen (second row in theFIG. 3 ), and are assembled, in alevel 1 reaction, using a one-pot Golden Gate cloning reaction into an entry DNA. Thelevel 1 reaction product (schematically shown in the third row), referred herein as nucleic acid fragment construct, contains an assembled transcription unit comprising the chosen elements in the desired order. In a second step of cloning, two or more (11 in the depicted example) different transcription units (that can be made inseparate level 1 reactions) are assembled in a desired order into a destination vector (level 2) to obtain alevel 2 multigene construct of interest. - At the bottom, the basic gene structures for secreted and cytosolic proteins are shown. Since the latter have no signal peptide (SP), the
ORF level 0 modules for cytosolic proteins may have the cleavage site sequences of the signal peptides used for secreted proteins for allowing linking of the ORF module with the 3′ end of the module for the 5′ UTR in thelevel 1 reaction. -
FIG. 4 depicts a set oflevel 1 destination vectors,level 2 destination vectors, and end-linkers. Thelevel 1 destination vectors and end-linkers are cloned in plasmids with a carbenicillin resistance selectable marker, whilelevel 2 destination vectors carry a kanamycin resistance gene. Other antibiotic resistance genes could also be used instead of the ones used in this example.Level 1 destination vectors of the series pL1F-1 to -7 and of the series pL1R-1 to -7 differ by the cleavage sites of the internal pair type IIs restriction sites (the Bsal sites). The sites of both series are designed for cloning of the same assembled transcription units, but in opposite orientation. Thelevel 1 destination vectors have an internal lacZ reporter gene that is removed together with the recognition sites of the internal pair of type IIs restriction sites (the Bsal sites) in thelevel 1 reaction due to the divergent orientation of these sites with respect to LacZ. Thus, lacZ is not present in thelevel 1 reaction product, allowing blue/while selection of cell clones containing the reaction product. The outer restriction sites in thelevel 1 destination vectors (the Bpil sites) are in convergent orientation with respect to the Bsal sites and lacZ and are unchanged in thelevel 1 reaction, since the restriction endonuclease of the inner pair of restriction sites (Bsal), but not the restriction endonuclease of the outer pair of restriction sites (Bpil), is used in thelevel 1 reaction. Straight dashed arrows indicate the complementarity of the cleavages sites of the right-hand Bpil sites of each ofentry DNAs 1 to 6 with the left-hand entry DNA directly underneath which allows the ligation ofentry DNAs 1 to 7, as well as thelevel 1 constructs derived therefrom, in thelevel 2 reaction via complementary single-stranded overhangs produced by Bpil digestion. Complementarity of the right-hand cleavage site of the bottom entry DNA (of sequence TGCC) with the left-hand cleavage site of the first entry DNA allows to reuse theentry DNA 1 and the following entry DNAs in asecond level 2 reaction. In asecond level 2 reaction, the reused entry DNAs may be provided in alevel 1 reaction with a different insert compared to the insert (referred to as “core portion of the sequence segment” in item (ii′) of the method of the invention). Thus, more nucleic acid fragment constructs can be combined into a nucleic acid construct of interest than the number of elements of the set of entry DNAs. “Divergent” herein means that the two cleavage sites of a pair of restriction sites are more remote than the recognition sites from a position between the two restriction sites. “Convergent” herein means that the two recognition sites of a pair of restriction sites are more remote than the cleavages sites from a position between the two restriction sites. - The set of
level 2 destination vectors shown has the same number of elements as the number oflevel 1 destination vectors. Thelevel 2 destination vectors have a pair of divergent (with respect to the central portion in which nucleic acid fragment constructs are inserted in thelevel 2 reaction) type IIs restriction sites flanking genes (“CRed”) providing a red phenotype. For each upstream Bpil cleavage site of the entry DNAs there is alevel 2 destination vector having a complementary upstream Bpil site. Thus, each entry DNA can be used to produce a nucleic acid fragment construct that will takeposition 1 in thelevel 2 reaction product. - Three sets of end-linkers are depicted, each set generally having the same number of elements as the number of
level 1 andlevel 2 destination vectors. Sets pELB and pELR are similar in that they have the same cleavage sites and outer recognition sites. Sets pELB and pELR both have a further inner recognition site that will be unchanged in thelevel 2 reaction, whereby they are present in thelevel 2 reaction product. Thus, they can be used for inserting, in a second orfurther level 2 reaction, further nucleic acid fragment constructs into the reaction product of thefirst level 2 reaction. This is not possible if an end-linker from the pELE set is used, since these lack the inner divergent pair of restriction sites. Sets pELB and pELR differ in that different inner recognition sites (Bsal versus Esp3I) are used and in that different central reporter genes for color selection of cell clones are used. All end-linkers can be used for joining the nucleic acid fragment constructs derived from thelevel 1 destination vectors to the downstream cleavage site of thelevel 2 destination vectors using cleavage site GGGA. Thus all destination vectors and all end-linkers have the same downstream cleavage site (GGGA). For each downstream Bpil cleavage site of the entry DNAs there is a linker having a complementary upstream Bpil cleavage site. -
FIG. 5A illustrates the structure of the modules depicted inFIG. 3 in greater detail and shows how these modules can be assembled. At each cloning step, constructs can be assembled by mixing in one tube all constructs or DNA fragments required, together with the appropriate type IIS enzyme (indicated above the horizontal reaction arrows) and ligase, and incubating the mix under conditions allowing restriction enzyme digestion and ligation. -
Level 0 modules have an insert of interest (for example a promoter sequence, P1) located between two convergent type IIs restriction sites (Bsal in the example shown).Level 0 modules can be cloned by a number of different procedures, and one example is shown here, starting from either PCR products or level-1 constructs (top row of the figure designated,level 0″). In this example, cloning is performed using the enzyme Bpil in acompatible level 0 destination vector. Methods for such cloning are known from the literature, see e.g. Engler et al. PLoS ONE 4 (2009) e5553. - Compatible sets of
level 0 modules are then assembled and cloned onlevel 1 into alevel 1 destination vector using a Golden Gate cloning reaction with a second type IIs enzyme, here Bsal. The resultinglevel 1 constructs contain, for example, assembled transcriptional units (TUs). -
Several level 1 constructs (in the present example, 2 such constructs indicated by “TU1” and “TU2”) are then assembled together with a selected end-linker (pELE-2, seeFIG. 4 ) in acompatible level 2 destination vector (pL2-1 for example). As discussed forFIG. 1 a , bothlevel 1 constructs have to be compatible forlevel 2 assembly, i.e. having convergent terminal cleavage sites. Thefirst level 1 construct corresponds to position 1 (thelevel 0 modules were cloned in thelevel 1 destination vector pL1F-1 depicted inFIG. 4 ) and the second one corresponds to the next position (position 2), thelevel 0 modules of which were cloned in thelevel 1 destination vector pL1F-2). Since pELE-2 does not contain internal type IIs restriction sites, no further nucleic acids can be cloned in the resultinglevel 2 construct. In such case, the level reaction is referred to as, level 2-1″ and the reaction product is referred to as, level 2-1 construct″). - A
similar level 2 reaction can however be made using end-linker pELB-2 rather than pELE-2 (seeFIG. 4 ). Since this end-linker contains a pair of further internal type IIs restriction sites (Bsal), the resulting construct (“level 2i-1 construct”) will also contain such site, and therefore can be used again aslevel 2 destination vector for asecond level 2 reaction for insertion of additional transcriptional units or other fragment constructs. - ++ indicates that only one of several entry clones was drawn due to space limitation. Each cleavage site is shown as a box with the 4 nucleotides of the cleavage site; the two boxes below show which type IIS recognition sites flank the recombination sites on the left and right sides. P1-a/b stands for
promoter fragment -
FIG. 5B show nucleotide sequences at the junction of various modules for each step of cloning ofFIG. 5A . “Gene 1” is equivalent to “TU 1”. Sequence stretches (SS) designated SS1 to SS26 are SEQ ID NO: 31 to SEQ ID NO: 56, respectively. -
FIG. 6 is an overview of alternative cloning strategies. Every cloning step relies on three elements that are different from one level to the next: antibiotic selectable marker, type IS restriction enzyme, and visual selectable marker. Cloning afterlevel 2i-1 requires the use of two type IIS enzymes, such as Bpil-Bsal or Bpil-Esp3I. Using the described set oflevel 1 destination vectors,level 2 destination vectors and end linkers, and the indefinitely repeatable cloning strategy provided, as many transcription units can be added to a construct as desired by a user, using as many cycles of cloning as required. Physical limits will ultimately be encountered due to handling of very large constructs, but such limits will not come from the cloning strategy itself. Thelevel 2 reactions are numbered by the numeral at position x in level 2-x. - Level 2-x stands for a
level 2 reaction producing alevel 2 reaction product that cannot be used for afurther level 2 reaction due to the absence of a pair of type IIs restriction sites allowing reopening of thelevel 2 reaction product (e.g. due to the use of an end-linker of the pELE set, the last “E” indicating “end”). -
Level 2i-x stands for alevel 2 reaction that produces a reaction product that is an intermediate (e.g. due to the use of an end-linker of the pELB set) and can thus be used for afurther level 2 reaction. - Each
level 2i-x reaction product opens up two possibilities for afurther level 2 reaction (indicated by the branching arrows). Depending on the use of the end-linker, thenext level 2 reaction will either lead to an end (boxed level 2-x) or will lead to a further intermediate reaction product, allowing a stillfurther level 2 reaction. -
FIG. 7 shows the structure oflevel 0 modules. Alllevel 0 modules are flanked by two convergent Bsal sites (with respect to the insert such as P1). Five module classes, namely promoter modules, 5′UTR modules, signal peptide modules, ORF modules and terminator modules are depicted. All modules of a same class are flanked by the same cleavage sites (for promoter modules GGAG and TACT). This design allows to clone any module of a given class into a transcription unit using the same cloning strategy. It also allows the use of multiple modules from a module class for obtaining libraries oflevel 1 fragment constructs. Modules of different classes are designed to be compatible for assembly in amulti-fragment level 1 reaction, also referred to herein as “Golgen Gate” cloning reaction, i.e., the sites joining two modules form compatible (complementary) single-stranded overhangs after digestion with a type IIs restriction enzyme, here Bsal. A “set” designates a group consisting of 1 module from each of the five module classes. -
FIG. 8A explains the assembly of transcriptional units fromlevel 0 modules intolevel 1 destination vectors (entry DNAs). Module sets 1 to 9 (each consisting of a set oflevel 0 modules as shown inFIG. 7 ) are assembled in alevel 1 destination vector. Modules sets 1 to 7 are assembled inlevel 1 destination vectors pL1F-1 to -7 (seeFIG. 4 ), respectively. Module sets 8 and 9 (and optionall further sets) are cloned in the same series oflevel 1 destination vectors pLF-1, pLF-2 etc, respectively. Thus,level 1 destination vectors pL1F-1 and -2 (and optionally further vectors) are reused for a different module set than module sets 1 and 2. The reaction products arelevel 1 constructs having flanking Bpil restriction sites retained from thelevel 1 destination vectors. These flanking restriction sites define with their associated cleavage sites the order in which thelevel 1 constructs are ligated in thesubsequent level 2 reaction. Each reaction shown is generally performed as a separate reaction. -
FIG. 8B illustrates an example for the assembly oflevel 0 modules using a different set oflevel 1 destination vectors. In this example, the restriction sites for the enzyme Bpil are identical as inFIG. 1A , but the 4-nucleotide cleavage sites of the restriction sites for the enzymes Bsal and Bpil overlap exactly in each destination vector. As a consequence, the cleavage sites of the restriction sites for Bsal are not compatible with the previously describedlevel 0 modules (inFIG. 7 ). In this embodiment, a set of adaptors that allow joining of thelevel 0 modules and of thelevel 1 destination vectors is employed. The resultinglevel 1 constructs have the same overall structure as the constructs made inFIG. 8A : the different transcription units are all flanked by the same compatible sets of Bpil restriction sites. These sites are convergent in each construct, and are compatible from one construct to the next. -
FIG. 9A : assembly of 5 transcription units into amultigene level 2 construct in alevel 2 reaction. An appropriate end linker is used for linking the right-hand side of thefifth level 1 construct to the GGGA cleavage site of the destination vector referred to as “Destination 2”. The type of end-linker used does not allow the reaction product to be used as a starting material for afurther level 2 reaction. The term “Golden Gate” means that reactants having type IIs restriction sites with compatible cleavage sites are combined in desired order in a reaction comprising restriction and ligation. Thelevel 2 construct is shown on the right hand side in two different representations with different degrees of detail. -
FIG. 9B : Assembly is performed with an end linker containing internal divergent Bsal restriction sites for allowing afurther level 2 reaction in the resultinglevel 2 construct. -
FIG. 9C shows examples of further rounds of cloning. At the top, a short version of the reaction ofFIG. 9B is shown, whereby alevel 2i-1 construct is obtained. In thenext level 2 reaction, 6 transcription units (designated TU6 to TU11) are added to thelevel 2i-1 construct, leading to alevel 2i-2 construct. In the next step, four further TUs (designated TU12 to TU15) are added, leading to thelevel 2i-3 construct. In a further step shown at the bottom, one further TU (TU16) is added to obtain the level 2-4 construct. At each successive cloning step, different end linkers have to be used, that may contain either internal Bsal or Esp3I restriction sites, or none if the final round of cloning is reached. -
FIG. 10 shows the efficiency of cloning for different levels of assembly. Forlevel 1 and level 2-1, all minipreps analyzed contained only correctly assembled constructs, even though multiple fragments were assembled in a one-pot one-step reaction for each construct. Moreover, the last level 2-1 construct was obtained by assembly in one step of 6 transcription units and one end-linker in one destination vector. Despite the large size and large number of components, all minipreps analyzed contained only correct constructs. The final construct made contains 11 transcription units. All colonies analyzed contained correct constructs despite the large size of the construct (34 kb), -
FIGS. 11 to 17 show how the cloning system of this invention can be used to create constructs containing one or multiple operons. Applications include microbial strain construction for metabolic engineering. -
FIG. 11 illustrates the general strategy used for cloning of prokaryotic operons. Basic genetic elements such as promoters, open reading frames (ORFs), and terminators are cloned aslevel 0 modules. Libraries oflevel 0 modules orindividual level 0 modules can be stored until needed for cloning. To make a desired construct, a selected number of genetic elements cloned aslevel 0 modules are chosen. Unlike for cloning of eukaryotic multigene constructs, promoters, open reading frames and terminators are not cloned together in alevel 1 destination vector. The reason for this design is that different operons may contain a different number of open reading frames, preventing the design of a fixed set of compatible cleavage sites for cloning. Rather, promoters, open reading frames and terminators are cloned separately indifferent level 1 destination vectors to obtainlevel 1 constructs. Rather than serving the purpose to assembleseveral level 0 modules together, cloning inlevel 1 destination vectors for prokaryotes serves mainly the purpose of providing positional information for the clonedlevel 1 construct for asubsequent level 2 assembly. Complete or partial operons are then cloned fromseveral level 1 constructs in a one-pot one-step cloning reaction onlevel 2. -
FIG. 12 Comparison oflevel 1 destination vectors for eukaryotes (“MoClo” and prokaryotes (“MoClo Pro”).Level 1 destination vectors (entry DNAs) for cloning promoters and terminators for prokaryotic operons may be the same aslevel 1 destination vectors made for eukaryotes (FIG. 4 ). In contrast,level 1 destination vectors made for prokaryotic ORFs (open reading frames) have different internal Bsal sites (AATG and GCTT). In addition, they contain a ribosome binding site between the first Bpil and Bsal cleavage sites. There are many possible designs forlevel 1 destination vectors other than the ones described here. However, despite these differences, all destination vectors also have the same general structure in common, namely two convergent type IIs enzymes flanking the DNA sequence of interest, and cleavage sites designed to fit the structure ofnucleic acids 1 to 7 shown inFIG. 1A . -
FIG. 13 illustrates the cloning of 5level 1 constructs (shown in the lower part) fromlevel 0 modules intolevel 1 destination vectors (shown in the middle part) in alevel 1 reaction.Level 1 destination vectors shown inFIG. 12 are used. Fivelevel 1 constructs, namely a promoter construct, 3 ORF constructs and a terminator construct are obtained with compatible cleavage sites for ligation, in the order given, in thesubsequent level 2 reaction. -
FIG. 14 shows thelevel 2 reaction from thelevel 1 constructs made as shown inFIG. 13 to produce a functional operon in alevel 2 construct. -
FIG. 15 shows asimilar level 2 reaction as shown inFIG. 14 , but with an end-linker containing internal Bsal sites for allowing afurther level 2 reaction. -
FIG. 16 illustrates the preparation of 3additional level 1 constructs (designated 6, 7 and 1, respectively) containing a second promoter (P2) and two further ORFs (orf4 and orf5). Thelevel 1 destination vector for orf4 is the last destination vector of a set of n=7 entry DNAs. The right hand cleavage site thereof having sequence TGCC allows reuse of thefirst level 1 destination vector (pL1 P-1 inFIG. 12 ) for orf5. -
FIG. 17 . Cloning of the threeadditional level 1 constructs obtained according toFIG. 16 into thelevel 2i-1 construct made according toFIG. 15 . The reaction product shown at the bottom is alevel 2i-2 construct, since the end-linker used allows asubsequent level 2 reaction. - The following figures further illustrate the examples.
-
FIGS. 18A-18C show the structure of transcriptional units,level 0 modules, and of destination vectors required for their cloning. -
FIG. 18A : The transcriptional units contain up of 5 basic modules separated by 4 nucleotides sequences that serve as recombination sites (shown in boxes). -
FIG. 18B :Level 0 modules shown on the first line are flanked by Bsal sites. The modules are cloned in alevel 0 reaction using the enzyme Bpil and one of thelevel 0 destination vectors shown underneath. -
FIG. 18C : Strategy for removing internal type IIS recognition sequences. Removal of a Bsal site in a fragment of interest is done by amplifying two fragments with primers pr1 (SEQ ID NO: 27) and 2 (SEQ ID NO: 28) and primers pr3 (SEQ ID NO: 29) and 4 (SEQ ID NO: 30). Sequences of the Bpil recognition sites in the 5′ extensions in the primers (horizontal arrows) are shown in bold. The two fragments are cloned using Bpil in theappropriate level 0 destination vector, for example pL0-P in the present example. -
FIGS. 19A-19D show an example for cloning of constructs oflevel -
FIG. 19A : Illustrates cloning oflevel 0 promoter modules. -
FIG. 19B : Illustrates cloning of alevel 1 construct containing a transcription unit. -
FIG. 19C : Illustrates cloning of alevel 2i-1 construct containing 5 transcription units, TU1 (containing GFP), TU2 (containing p19), TU3 (containing VP2), TU4 (containing VP5), TU5 (containing VP7) into destination vector pL2-1. -
FIG. 19D : Illustrates cloning of a level 2-2 construct containing 11 transcription units. In addition to TU1 to TU5, the construct contains TU6 (transcription unit with VP3), TU7 (transcription unit with BAR), TU8 (transcription unit with antibody light chain), TU9 (transcription unit with antibody heavy chain), TU10 (transcription unit with TMV MP), TU11 (transcription unit with TMV CP). -
FIG. 20A shows the structure of the 11level 1 transcription units used inFIG. 19B .FIG. 20B shows the structure of construct plCH51811 that is obtained by cloning of the 11 transcription units cloned in a level 2-2 construct. -
FIG. 21 shows the expression of GFP in a Nicotiana benthamiana leaf after infiltration of level-2 constructs shown inFIG. 10 . The level-2 constructs were transformed in Agrobacterium tumefaciens, and the transformed bacteria infiltrated in leaves using a syringe without a needle. GFP expression was observed at 5 dpi under UV light. The number in parenthesis indicates the number of transcription units in each infiltrated construct. -
FIG. 22 shows prokaryotic genes cloned aslevel 0 entry modules. 3 genes from Pantoea ananatis crtE, I, and B are involved in lycopene biosynthesis. Other genes known to increase lycopene expression when overexpressed in E. coli were also cloned: dxs from Agrobacterium rhizogenes strain K84 and E. coli strain K12, the ispA genes from the same two species, and the idi and AppY genes from E. coli strain K12. Two promoters were also cloned aslevel 0 modules: the Lac Z promoter from pUC19, and a promoter from Pantoea ananatis. -
FIG. 23A showslevel 1 destination vectors for cloning of prokaryotic coding sequences. All constructs are in fact libraries (indicated by the L in the construct name) that have variable sequences flanking the RBS.FIG. 23B shows two sets of end linkers that contain the LacZ terminator from pUC19. -
FIG. 24 shows a list oflevel 1 constructs made with thelevel 0 modules shown inFIG. 22 . -
FIG. 25 showslevel 2i-1 constructs. Two libraries were made with genes for lycopene biosynthesis. Both constructs pICH5648L and pICH5850L are in fact libraries in which all clones obtained differ in the sequence flanking the RBS of all three cloned genes. -
FIGS. 26A-26B show addition of two or three more genes to the operons obtained inFIG. 25 . Since a library of different constructs was placed in the cloning mixes (pICH5447L to pICH5452L inFIG. 26A , or pICH5455L inFIG. 26B ) the constructs obtained consist of libraries of constructs that differ in the genes present atpositions -
FIG. 27 shows thatlevel 2 cloning can be followed by alevel 3. For this, a new type IIs enzyme needs to be used, such as SapI in the case shown. -
FIG. 28 shows vectors required forlevel 3 cloning. Level two destination vectors are flanked by restriction sites of a new type IIs enzyme (SapI in this example).Level 3 destination vectors and end-linkers are similar as forlevel 2, but with the new type IIS enzyme.Numbers 1 to 8 represent eight different 3 nucleotide sequences as the cleavage site of SapI. -
FIG. 29A shows a set of destination vectors and end-linkers for assembly ofseveral level 1 transcription units (or more generally nucleic acids “na”) into a level M destination vector (M stands for multiplication). Level M destination vectors and corresponding end-linkers (ELM1 to 7) are designed in such a way that blocks of assembled transcription units (or nucleic acid fragments) cloned in level M destination vectors become flanked by cleavage sites for a type IIs restriction enzyme, which will be used for the next step of cloning. In contrast tolevel 2 constructs where types IIs enzymes restriction sites are located at the end of the constructs in the end-linker, here the type IIs enzymes flank the assembled nucleic acid fragments, allowing them to be further subcloned in a new vector. -
FIG. 29B illustrates assembly of three transcription units (nucleic acid acids na1 to 3) in a level M destination vector. The linker is chosen from the set of linkers ELM1 to ELM7 such that na3 can be linked to cleavage site S8 of destination vector M. -
FIG. 29C illustrates assembly of two transcription units (nucleic acids na4 and 5) in a level M destination vector. The linker is chosen from the set of linkers ELM1 to ELM7 such that na5 can be linked to cleavage site S8 of destination vector M. -
FIG. 29D shows assembly of the two pre-assembled blocks of transcription units (the reaction products of the reactions shown in B and C) in alevel 2 construct employing alevel 2 destination vector. Details of the restriction sites of destination vectors and end-linkers are shown inFIGS. 31 and 32 (below). -
FIG. 30 shows that level M can be used as an intermediate step betweenlevel -
FIG. 31 shows the structure of the entry DNAs as well as vectors required for level M cloning. A set of n=7 level M destination vectors and a set of n=7 linkers M is shown. Similarly aslevel 2 cloning, level M requires destination vectors and a set of compatible end-linkers, and allows from 1 to 6 (with sets of n=7 destination vectors and linkers as shown) transcription units (or nucleic acid fragments) to be assembled in one step. The orientation and position of type IS restriction sites in destination vectors and end-linkers is such that assembled transcription units (or nucleic acid fragments) in level M constructs can be excised by a type IIs enzyme, in this case Bsal. Entry DNAs are flanked by type IIs restriction sites in opposite orientations (here Bpil sites). The cleavage sites of these two sites are each identical to one of the seven cleavage sites present on the left-hand side of level M destination vectors and linkers. The 7 cleavage sites on the left-hand side of the 7 destination vectors are unique (and different from one another). The same set of 7 cleavages sites is present on the left-hand side of the linkers M. -
FIGS. 32A-C show an example where blocks of three (FIG. 32A ) and two (FIG. 32B ) transcription units are separately pre-assembled into level M destination vectors before being assembled in alevel 2 vector in a subsequent step (FIG. 32C ). -
FIG. 33 illustrates a set of n=7 destination vectors and a set of n=7 end-linkers required for level M (left side of the figure) and P (right side of the figure). The destination vectors and end-linkers for level M (left side of the figure) are as shown inFIG. 31 . The structure of entry DNAs for level P cloning is identical to the structure of entry DNAs for level M, except that restriction sites for a different type IIs enzymes are present (here for Bsal). Level M constructs have a structure corresponding to level P entry DNAs. Thus, level M constructs can be used as entry DNAs in a level P reaction. Level P constructs become entry DNAs for a next round of level M cloning. -
FIG. 34A illustrates a level P reaction wherein two level M constructs are assembled into a level P destination vector using an end-linker P to give a level P construct. -
FIG. 34B illustrates a level M reaction wherein two level P constructs are assembled into a destination vector M (“destination M”) using an end-linker M to give a Level M construct. -
FIG. 35 : Cloning from level P to level M and vice-versa can be repeated indefinitely by reusing the same set of vectors and end-linkers shown inFIG. 33 . -
FIG. 36 illustrates the structure of constructs obtained in successive level M and level P cloning steps. In this example, 90 transcription units (or nucleic acid fragments) are assembled in a construct (shown at the bottom) in three cloning steps. 5 Sets of 6 transcription units are assembled in parallel in separate reactions in the first level M cloning step. The resulting 5 constructs are then assembled in one construct in the following cloning reaction on level P. This construct and two other compatible constructs (construction not shown for lack of space), each containing 30 assembled transcription units (or nucleic acid fragments) are then assembled in a third step of cloning in a level M destination vector. Boxes withnumbers 1 to 7 represent nucleic acid fragments or transcription units, with such numbers referring only to the nature of the type IIs enzyme cleavage sites (position as defined earlier) flanking them, and not to the sequence of the nucleic acids contained between these two sites. - Restriction enzymes used in this study were purchased from New England Biolabs and Fermentas. T4 DNA ligase was purchased from Promega. Plasmid DNA preparations were made by using the NucleoSpin Plasmid Quick Pure kit (Macherey-Nagel, Düren, Germany) following the manufacturer protocol. Plasmid DNA concentration was measured using a Nano Drop® Spectrophotometer ND-1000 (Peqlab, Erlangen). The coding DNA for the coat proteins VP2, VP3, VP5 and VP7 of blue
tongue virus serovar 8 was synthesised from Entelechon GmbH and lack all Bpil, Bsal and Esp3I restriction sites). Level-0 modules were sequenced with primers moclof (SEQ ID NO: 1: 5′-agcgaggaagcggaagagcg) and moclor (SEQ ID NO: 2: 5′-gccacctgacgtctaagaaacc). - A one step—one pot restriction/ligation was setup with approximately 30 fmol (˜100 ng for a 5 kb plasmid) of each fragment (PCR product or plasmid), Promega ligation buffer, 10 U of the respective restriction enzyme (Bsal, Bpil, or Esp3I), 10 U high concentrated T4 DNA ligase (Promega), in a 20 μl volume. The reaction was incubated for 5 hours at 37° C., 5 min 50° C. and 5 min 80° C. The mix was added to 100 μl chemical competent DH10b cells, incubated for 30 min on ice and transformed by heat shock. Two clones with the expected color were analysed by restriction analysis and optionally by sequencing.
- A DNA fragment coding for canthaxanthin biosynthesis was made by PCR amplification of 4 genes from Pantoea ananatis that are necessary for biosynthesis of p-carotene (genes crtE, crtY, crtI and crtB, Ref) and of one gene from Agrobacterium aurantiacum (crtW) necessary to convert β-carotene to canthaxanthin (ref). The gene crtW gene is used in addition to the 4 pantoea genes because the orange/red color of canthaxanthin is easier to see on agar plates than the yellow color of β-carotene. The Pantoea ananatis strain was obtained from the DSMZ (cat DSM 30080), and a fragment containing the crtW gene was synthesised by Mr. Gene GmbH. An artificial operon containing the genes crtE-W-Y-1-B under control of the P. ananatis native promoter was made by ligation of three fragments derived from PCR:
fragment 1 containing the promoter and crtE gene was amplified from P. ananatis genomic DNA withprimers 5′-ttt ggtctc a ggag ggtaccgcacggtctgccaa (SEQ ID NO: 3) and 5′-ttt ggtctc a tcatgcagcatccttaactgacggcag (SEQ ID NO: 4),fragment 2 containing the crtW gene was amplified from a synthetic DNA fragment (sequence identical to the native sequence) withprimers 5′-ttt ggtctc a atgagcgcacatgccctgcc (SEQ ID NO: 5) and 5′-ttt ggtctc a tcactcatgcggtgtcccccttggt (SEQ ID NO: 6), andfragment 3 containing the genes crtY-1-B was amplified from PantoeaDNA using primers 5′-ttt ggtctc a gtgacttaagtgggagcggctatg (SEQ ID NO: 7) and 5′-ttt ggtctc a atgtagtcgctctttaacgatgag (SEQ ID NO: 8). The fragments were assembled by Golden Gate cloning in a target vector using Bsal. Two Bpil and one Esp3I site present in crtY were removed using primers containing silent mutations in the recognition sites. - To check that the constructs are working, at least for one of the transcriptional units (containing GFP), all level-2 constructs were introduced into Agrobacterium tumefaciens. Agrobacterium suspensions were infiltrated with a syringe without a needle into Nicotiana benthamiana leaves. GFP is expressed from all constructs, as expected from expression cassettes driven by the 35S promoter. Interestingly, the level of GFP expression was found to decrease for the largest constructs. This can be explained by the fact that the GFP gene was always located at the left border in all constructs; since T-DNA transfer to plant cells occurs from the right to the left border, and is sometimes incomplete, plant cells will acquire the GFP cassettes from large constructs less frequently than from smaller constructs.
- We defined in a first step a generalized eukaryotic transcriptional unit as the basis for our modular cloning system (MoClo). This unit was subdivided into five basic modules which cover the most important features of any transcriptional unit: promoter (P), 5′UTR (5U), signal peptide (SP), open reading frame (ORF), and terminator (T, which also includes 3′ untranslated sequences) (
FIG. 3 ). For each module, two unique flanking recombination sites were defined. We use here the term ‘recombination sites’ by functional analogy to the recombination sites used in recombination systems such as the phage P1 Cre-IoxP recombination system. However, for the purpose described here, ‘recombination sites’ can be any nucleotide sequence of choice of at least 3 base pairs in length, and will serve as the sequence where restriction enzyme digestion and ligation will take place; no recombinase will in fact be used. To guarantee efficient cloning, these sites were chosen to be non-palindromic and to share a maximum of two identical consecutive nucleotides. The recombination site between the promoter fragment and the signal peptide reads AATG and encodes a start codon preceded by an adenosine, which is the most common nucleotide in dicotyledonous plants at this position. The recombination site between the signal peptide and the rest of the protein (for secreted proteins) was chosen to be AGGT, with GGT coding for a glycine; this sequence was chosen as the last aminoacid of the signal peptide can usually be a glycine, and this allows producing secreted proteins with native sequence. The four remaining recombination site sequences chosen (GGAG, TACT, GCTT and CGCT) were selected without any special requirement other than being unique and non-palindromic, since they are all present in non-coding DNA. - The designated DNA fragments are then amplified by PCR with primers designed to attach the specific recombination site and the recognition site sequence for the type IIS restriction enzyme Bpil (
FIGS. 5A and 5B ). To allow efficient cloning of these PCR fragments, a set oflevel 0 destination vectors was constructed (one vector for each type oflevel 0 module), each containing two Bpil restriction sites compatible with the amplified PCR fragments. Beside the destination plasmids for the five standard elements (pL0-P, pL0-5, pL0-S, pL0-O and pL0-T) further variants were included, which allow an easy adaptation of the MoClo system to a wide variety of projects (FIG. 18 ). The expression of cytosolic proteins for example does not require a signal peptide, and these can be cloned into pL0-SO, spanning the SP and ORF position. Alllevel 0 destination plasmids are based on a pUC19 backbone, confer spectinomycin resistance and encode a lacZα fragment for blue/white selection. On both sites of the lacZα fragment two different type IS recognition sequences—for the enzymes Bsal and Bpil—are positioned in inverse orientation relative to each other (FIG. 5A , B). Both recognition sequences are designed to create the identical four nucleotide recombination site upon digestion (for example, sequence GGAG on the left side of the destination vector for cloning of promoter modules). This design allows cloning of the PCR product(s) for the DNA fragment of interest efficiently via Bpil—removing the Bpil recognition sites and lacZα in the process—but provides the possibility to release the cloned fragment with Bsal using the identical recombination sites it was cloned in. - The
level 0 modules should not contain any of the type IIS restriction sites used in the MoClo system within the sequence of the fragments of interest. Beside the already mentioned Bsal and Bpil, a third type IIs enzyme, Esp3I, is used in the process of assembly of higher order constructs (see below). Removal of these sites can be easily done at the time of cloning oflevel 0 modules by using primers overlapping the internal Bpil, Bsal or Esp3I sites, but containing a single silent nucleotide mismatch in the recognition site. An example for the removal of a single Bsal site is given inFIG. 18C . The primers pr2 and pr3 introduce a guanine to cytosine exchange and destroy the Bsal recognition site. Thelevel 0 module is then assembled from the PCR-fragments (n+1 PCR fragments when n sites need to be removed) in a one-step one-tube reaction using Bpil into theappropriate level 0 destination plasmid. The purification of the PCR reaction prior cloning is preferable, as primer dimers can be competitors of the PCR product in the cloning procedure. - To show the versatility of the system, we cloned a number of modules for all elements of the transcriptional unit. These include 11 ORFs representing a wide spectrum of biological functions like immunoglobulins (IgG, heavy and light chain), structural viral proteins from BTV and PVX (Potato Virus X), the silencing inhibitor p19, the bar resistance marker and GFP. As an example, we provide here how a promoter module can be cloned. The 35S promoter fragment was generated by PCR using 35S promoter specific primers which add the Bpil recognition sites (underlined) and the promoter module specific fusion sites (bold). The 35S forward primer comprises: 5′-ttt GAAGACAAGGAG (SEQ ID NO: 9:) followed by bases specific for the 35S promoter, 35S reverse comprises: 5′-ttt GAAGACAAAGTA (SEQ ID NO: 10:) followed by bases specific for the 35S promoter.
- In order to create the
level 0 module plCH41373 (pL0-P with the 35S promoter) by a Bpil dependent Golden gate cloning reaction, the following reaction mix was added into a single tube (FIG. 19A ): -
2 μl 10x T4-ligase buffer (Promega) 1 μl high concentrated T4-ligase (Promega; 10 U/μl) 1 μl Bpil restriction enzyme (Fermentas 10 U/μl) 1 μl 30 fmol pL0- P destination plasmid 1 μl specific PCR product (column-purified to eliminate primer dimers) of the 35S promoter (generated by standard PCR with primers describe above) 14 μl water 20 μl total - The reaction was incubated for 5 hours at 37° C., and 10 min 80° C. The mix was added to 100 μl chemical competent DH10b cells, incubated for 30 min on ice and transformed by heat shock. After plating on LB agar plates containing spectinomycin (100 μg/ml) and growing over night at 37° C., two white clones were analyzed by restriction analysis and by sequencing.
- In contrast to the number of ORFs, the number of commonly used promoters and terminator sequences available for expression of heterologous proteins in plants is much lower. To avoid repetitive sequences in planned multigene constructs, we therefore cloned several Arabidopsis thaliana promoter and terminator sequences from genes which show a high basic expression level. After sequencing, the
level 0 modules form the bottom level in the hierarchal MoClo system. A summary of alllevel 0 modules used in this study is presented in the table below: -
Module type and construct number Relevant characteristics Reference or accession number Promoter (P) plCH41373 35S promoter CaMV 1 plCH41551 ST-LS1 (Stem and leaf specific) promoter S. tuberosum; 2 plCH42755 34S promoter FMV 3 plCH42760 Spm promoter Zea mays 4 plCH44157 RBCS (RuBisCO Small subunit 1b) promoter A. thaliana; At5g38430; This work plCH45131 LHB1B2 promoter A. thaliana; At2g34420; This work plCH45145 LHCB5 promoter A. thaliana; At4g10340; This work plCH45167 RRM-containing protein promoter A. thaliana; At1g70200; This work plCH50581 ACT2 (Actin 2) promoter A. thaliana; At3g18780; This work 5′UTR (U) plCH46501 Tabacco mosaic virus □□fragment This work ORF plCH41531 sGFP codon optimized 5 plCH42222 Basta ™ resistance protein S. hygroscopicus 6 (Phosphinothricin acetyltransferase) plCH44022 P19 Tomato bushy stunt virus silencing inhibitor 7 plCH45502 BTV (blue tongue virus) VP2 This work plCH45512 BTV (blue tongue virus) VP3 This work plCH45526 BTV (blue tongue virus) VP5 This work plCH45531 BTV (blue tongue virus) VP7 This work plCH48348 PVX CP This work plCH48367 PVX 25K MP This work plCH49488 IgG1 Light chain with native signal peptide This work plCH49500 IgG1 Heavy chain with native signal peptide This work Terminator (T) plCH41432 Ocs terminator A. tumefaciens 8 plCH41414 35S terminator CaMV plCH44300 ACT2 (Actin 2) terminator A. thaliana; At3g18780; This work plCH44311 TGG1 (Thioglucoside Glucohydrolase 1) A. thaliana; At5g26000; This terminator work plCH44344 GCRP (glycine-rich protein) terminator A. thaliana; At1g67870; This work plCH44355 AAC1 (ADP/ATP carrier protein 1) terminator A. thaliana; At3g08580; This work plCH44377 PRXR1 (Peroxidase 1) terminator A. thaliana; At4g21960; This work plCH44388 AGP18 (Arabinogalactan protein 18) terminator A. thaliana; At4g37450; This work plCH44393 GAE6 (UDP-D-Glucoronate 4-Epimerase 6) A. thaliana; At3g23820; This terminator work plCH49344 Nos terminator A. tumefaciens 9 1 Guilley, H., Dudley, R. K., Jonard, G., Balazs, E. & Richards, K. E. Transcription of Cauliflower mosaic virus DNA: detection of promoter sequences, and characterization of transcripts. Cell 30, 763-773 (1982). 2 Stockhaus, J., Eckes, P., Blau, A., Schell, J. & Willmitzer, L. Organ-specific and dosage-dependent expression of a leaf/stem specific gene from potato after tagging and transfer into potato and tobacco plants. Nucleic Acids Res 15, 3479-3491 (1987).3 Sanger, M., Daubert, S. & Goodman, R. M. Characteristics of a strong promoter from figwort mosaic virus: comparison with the analogous 35S promoter from cauliflower mosaic virus and the regulated mannopine synthase promoter. Plant Mol Biol 14, 433-443 (1990).4 Raina, R., Cook, D. & Fedoroff, N. Maize Spm transposable element has an enhancer-insensitive promoter. Proc Natl Acad Sci USA 90, 6355-6359 (1993). 5 Chiu, W. et al. Engineered GFP as a vital reporter in plants. Curr Biol 6, 325-330 (1996).6 Thompson, C. J. et al. Characterization of the herbicide-resistance gene bar from Streptomyces hygroscopicus. Embo J 6, 2519-2523 (1987).7 Marillonnet, S., Thoeringer, C., Kandzia, R., Klimyuk, V. & Gleba, Y. Systemic Agrobacterium tumefaciens-mediated transfection of viral replicons for efficient transient expression in plants. Nat Biotechnol 23, 718-723 (2005).8 De Greve, H. et al. Nucleotide sequence and transcript map of the Agrobacterium tumefaciens Ti plasmid-encoded octopine synthase gene. J Mol Appl Genet 1, 499-511 (1982).9 Depicker, A., Stachel, S., Dhaese, P., Zambryski, P. & Goodman, H. M. Nopaline synthase: transcript mapping and DNA sequence. J Mol Appl Genet 1, 561-573 (1982). - The next level of cloning consists of assembling
several level 0 modules into a complete transcriptional unit in alevel 1 reaction. Since assembly is performed by Golden Gate cloning using the enzyme Bsal, no Bsal restriction site is left in the resultinglevel 1 construct. Therefore, to be able to later subclone the assembled transcriptional unit into higher level constructs, two restriction sites of a second type IIS restriction enzyme also have to be present flanking the assembled fragment. We therefore createdlevel 1 destination vectors containing two Bpil restriction sites flanking the lacZα fragment, in addition to the two Bsal restriction sites needed for cloning of the transcription unit (pL1F-1 to pL1F-7,FIG. 4 ). If, as forlevel 0 destination vectors, the cleavage sites of the two different type IIS enzymes (Bsal and Bpil) overlapped, alllevel 1 transcriptional unit fragments would be flanked by the same GGAG and CGCT recombination sites (which originate from the promoter and terminator modules, respectively). The presence of identical overhangs flanking each transcriptional unit would prohibit the creation of higher level multigene constructs with a defined linear order. Consequently a series of sevenlevel 1 destination vectors was designed in which the Bpil cleavage sites generate two recombination sites with new specificities for each vector (for example sites TGCC and GCAA forlevel 1vector position 1,FIGS. 4 and 5B ). These sites are compatible from one vector to the next so that assembled transcriptional units can be subcloned in a one-pot reaction from thelevel 1 constructs into alevel 2 construct. If the sites of alllevel 1 constructs were defined strictly for linear assembly, the number oflevel 1 destination vectors would need to be as high as the number of transcriptional units that an experimenter would wish to assemble in a final construct. However, to avoid the construction and consideration of toomany level 1 destination vectors, the spatial order of overhangs was designed to be circular instead of linear, as the first recombination site at position 1 (TGCC) is also the final site atposition 7. So, alevel 1 construct forposition 1 can be reused later at a virtual position “8” (FIG. 4 ). Therefore, such a set of 7level 1 destination vectors will allow an unlimited number (regarding the design, not the physical size of DNA) of transcriptional units to be cloned in a final nucleic acid construct of interest, although by incremental steps of up to 6 constructs at a time (see further description below for such assembly). - Between the upstream Bpil and Bsal sites, we also introduced additional restriction sites for analytical restriction digests: an EcoRI site is present in each
level 1 destination plasmid, whereas a second restriction site is specific for each position (FIG. 5B ). Beside the sevenlevel 1 destination vectors for the assembly of transcriptional units in the forward direction, a set oflevel 1 destination vectors for cloning of transcription units in the opposite orientation was also created. Here the Bsal overhangs are in reverse complement orientation, but accept thesame level 0 modules as for the other orientation (pL1R-1 to pL1-R7,FIG. 4 ). - Since our transient plant expression system is based on Agrobacterium tumefaciens, all plasmids have a broad host range RK2 origin of DNA-replication and left border (LB) and right border (RB) T-DNA sequences to allow T-DNA transfer into the plant cell. These two features allow testing the functionality of
level 1 constructs by plant infiltration. It is also possible to make similar vectors for allowing expression in other eukaryotic hosts such as yeast, insect or mammalian cells or in prokaryotes. Thelevel 1 destination vectors encode an ampicillin resistance gene and a lacZ□ fragment flanked by Bpil and Bsal sites. - To test the efficiency of the assembly of
level 0 modules intolevel 1 transcriptional units (level 1 constructs), 11 artificial transcriptional units were designed (promoters and terminators were randomly assigned to ORFs without consideration for level of expression since all constructs in this study were made purely as an exercise to demonstrate the ability of the MoClo system for cloning of multigene constructs), and were (again randomly) assigned to one of the sevenlevel 1 positions. In 11 independent cloning reactions, thelevel 0 modules were combined with therespective level 1 destination vectors, T4-DNA ligase and the restriction enzyme Bsal in a one-tube one-step reaction (FIG. 20 ). The different antibiotic resistances oflevel 0 andlevel 1 destination plasmids in combination with the blue/white selection represent a very convenient way to screen for correctly assembledlevel 1 constructs. After transformation, the reactions were spread on ampicillin and X-Gal containing plates and the numbers of white and blue clones were counted. The number of white, and therefore correct, colonies varied from approximately 16000 to 180000, whereas only a minor fraction (<1%) of blue colonies was present in two out of eleven reactions (FIG. 10 ). Two white colonies from each reaction were analyzed by an analytical restriction digest with Bpil (which cleaves on both sides of the assembled transcriptional unit). All 22 tested plasmids contained a fragment of the expected size. - We provide here as an example how the cloning reaction was set up for one of the transcription units. In order to create the
level 1 construct plCH50711 by a Bsal dependent Golden gate cloning reaction, the following reaction mix was added into a single tube (FIG. 19 B): -
2 μl 10x T4-ligase buffer (Promega) 1 μl high concentrated T4-ligase (Promega; 10 U/μl) 1 μl Bsal restriction enzyme (NEB; 10 U/μl) 1 μl 30 fmol pL1F-1 destination plasmid 1 μl 30 fmol plCH41373 ( level 0 promoter module, 35S promoter)1 μl 30 fmol plCH46501 ( level 0 5′UTR module, TMV untraslatedregion) 1 μl 30 fmol plCH41531 ( level 0 ORF module, GFP)1 μl 30 fmol plCH41432 ( level 0 terminator module, Ocs terminator)11 μl water 20 μl total - The reaction was incubated for 5 hours at 37° C., 5 min 50° C. and 5 min 80° C. The mix was added to 100 μl chemical competent DH10b cells, incubated for 30 min on ice and transformed by heat shock. After plating on LB agar plates containing ampicillin (100 μg/ml) and growing over night at 37° C., two white clones were analyzed by restriction analysis and optionally by sequencing.
- As all other MoClo constructs,
multigene level 2 constructs are assembled from lower level (here level 1) constructs using a one-pot restriction-ligation. In this case, assembly is performed using the enzyme Bpil.Level 2 destination vectors carry a kanamycin resistance gene, in accordance with the principle that a specific selection marker is assigned to each level of cloning, allowing effective counter-selection against entry plasmid backbones.Level 2 destination vector backbones do not contain any type IIs restriction sites used in the MClo system, other that the recognition sites used for the cloning of the inserts. In contrast tolevel 0 andlevel 1, the visible selectable marker used forlevel 2 constructs is not a lacZ gene, but an artificial bacterial operon containing 5 genes (see Reference Example 2) that lead to biosynthesis of canthaxanthin, a red (more precisely salmon/orange) colored carotenoid pigment. A lacZ gene for blue-white selection would have also worked for this step, but the choice of a new color selectable marker is explained below in the paragraph onlevel 2i. The cantaxanthin operon inlevel 2 destination vector pL2-1 is flanked by two Bpil sites that create TGCC and GGGA overhangs after digestion (FIG. 4 ). The TGCC overhang is compatible with thelevel 1 construct forposition 1, while the GGGA represents an overhang which is unique tolevel 2 destination plasmids. Depending on the number oflevel 1 constructs that are subcloned in one step (from one to a maximum of 6 transcription units), the last overhang of the assembled DNA fragment will be different for each of the six possibilities. To connect this end with the GGGA overhang of thelevel 2 destination vector, a set of seven end-linkers (pELE-n) was designed (FIG. 4 , the seventh linker is necessary only when using theother level 2 destination vectors described below). Like thelevel 1 modules, the end-linker plasmids confer ampicillin resistance, and are themselves flanked by Bpil sites. These two features make them compatible withstandard level 1 destination vectors (and constructs obtained therefrom) in a Bpil-basedlevel 2 Golden Gate cloning reaction. End-linkers can consist of any DNA sequence, and serve the purpose of joining two chosen type IIs enzyme cleavage sites. For example, the sequence of linker pELE-1 is (SEQ ID NO: 11) gaagac aa GCAA gaggatgcacatgtgaccga GGGA tt gtcttc (bold is the Bpi recognition sites, underlined are the cleavage sites, and in between is the joining linker sequence). The end linkers are cloned in a pUC19 based plasmid (but could also be cloned in other backbones). - At first glance the
level 1 constructs designed for a defined position cannot be reused in a different context. For example, alevel 1 construct made for subcloning at position three cannot be used without two other constructs forposition position 1 could be done by recloning thesame level 0 modules in alevel 1 destination vector specific forposition 1. However, a possibility to reduce the need for extensive recloning of the same construct for different positions is given by the periodical design of thelevel 1 overhangs. Here the relative position of, for example alevel 1position 3 construct, can easily be shifted to the relative first position, when the left overhang from thelevel 2 destination vector would read ACTA instead of TGCC. Here the first two positions would be virtually deleted, shiftingposition 3 to arelative position 1. A set of sevenlevel 2 destination vectors created for this purpose is shown inFIG. 4 (pL2-1 to pL2-7). Together with the seven end-linker plasmids, these 14 plasmids allow to realize every overhang combination in alevel 2 destination vector. The identical flexibility only based onlevel 2 destination vectors without the end-linkers would require 36 different plasmids. - To test cloning of
several level 1 transcriptional units intolevel 2 constructs, 5 different restriction-ligation reactions were set up to clone from 2 up to 6 transcriptional units in a single step. The restriction-ligation reactions each contains from 2 to 6level 1 transcriptional unit constructs (pICH50711, plCH50721, plCH49722, plCH49733, plCH50731, pICH50741FIG. 10 ), one appropriate end-linker (pELE-2 to pELE-6), and the destination vector pL2-1. - The reaction was incubated for 5 hours at 37° C. and 10 min 80° C. The mix was added to 100 μl chemical competent DH10b cells, incubated for 30 min on ice and transformed by heat shock. The transformation was plated on LB agar plates containing kanamycin (100 μg/ml) and the plate incubated over night at 37° C.
- Considering all
level 1 cloning experiments, the number of white colonies obtained per transformation, which gives a measure of the cloning efficiency, decreased from approximately 33000 (for twolevel 1 modules plus end linker) to 150 (six modules plus end-linker), and the percentage of red colonies raised from 0.02% to 10% (FIG. 10 ). Six white colonies were tested from eachlevel 2 assembly by an analytical endonuclease restriction digest and all were found to be correct. - As we have shown above, the assembly of up to six transcription units to produce a 24 kb construct (pICH51201) can be done in a one-step and one-
tube level 2 reaction. However thefinal level 2 constructs are in a “closed” status and no additional genes can be inserted since no type IIS restriction sites are left in the construct. An entry option can be provided when modified end-linkers containing additional type IIS restriction sites (pELB-n) are used in the assembly of thelevel 2 constructs (FIG. 4 ). This type of construct was namedlevel 2i since it is based onlevel 2 backbones but represents also an intermediate step to extend the number of genes in the construct beyond six. This intermediate construct can be used both as a destination vector for a final construct (level 2-2) or once again to construct the next level destination vector (level 2i-2), in case more than 6 more genes still need to be added to the construct. Given these alternative possibilities, the same visual selectable marker (such as LacZα for blue/white selection) cannot be used for all possible cloning steps. It is for this reason that we developed a second color selectable marker, that we developed based on carotenoid biosynthetic genes. With such marker, every cloning step can be performed using color selection, for example from red to white forlevel 2, red to blue forlevel 2i, blue to white for level 2-2, blue to red for level 2i-2, etc. . . . - As an example of end-linker sequence, we provide the sequence features of plasmid pELB-1. pELB-1 contains the sequence (SEQ ID NO: 12) gaagac aa tgcc t gagacc (bold Bpil recognition site, underlined cleavage site of Bpil and Bsal, italics Bsal recognition site) followed by a puc19 fragment containing the LacZ alpha fragment (gcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccc caggctttacactttatgcttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagctatgaccatgatta cgccaagcttgcatgcctgcaggtcgactctagaggatccccgggtaccgagctcgaattcactggccgtcgttttacaacgtcgtgac tgggaaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcacc gatcgcccttcccaacagttgcgcagcctgaatggcgaatggcgcctgatgcggtattttctccttacgcatctgtgcggtatttcacacc gcatatggtgcactctcagtacaatctgctctgatgccgcatagttaagccagccccgacacccgccaacacccgctgacgcgccct gacgggcttgtctgctcccggcatccgcttacagacaagctgtgac (SEQ ID NO: 13), followed by sequence ggtctc a ggga tt gtcttca (SEQ ID NO: 14) (as before, bold Bpil recognition site, underlined cleavage site of Bpil and Bsal, italics Bsal recognition site). The sequence of the end-linker shown above is cloned in a pUC19-based vector (that does not contain additional LacZ fragment sequences), but can also be cloned in other plasmid backbones).
- To test cloning of
level 2i constructs, two restriction-ligations were set up as forlevel 2, except that the end-linker was replaced by an end-linker containing Bsal sites and a lacZ gene (constructs plCH51212 and plCH51226,FIG. 10 ). We provide here in more detail the set up for cloning of plCH51212 as an example. In order to create thelevel 2i construct plCH51212 by a Bpil dependent Golden gate cloning reaction, the following reaction mix was added into a single tube (FIG. 19 C): -
2 μl 10x T4-ligase buffer (Promega) 1 μl high concentrated T4-ligase (Promega; 10 U/μl) 1 μl Bpil restriction enzyme (Fermentas 10 U/μl) 1 μl 30 fmol pL2-1 (destination vector) 1 μl 30 fmol plCH50711 ( level 1 construct forposition 1,transcription unit with GFP) 1 μl 30 fmol plCH50721 ( level 1 construct forposition 2,transcription unit with p19) 1 μl 30 fmol plCH49722 ( level 1 construct forposition 3,transcription unit with VP2) 1 μl 30 fmol plCH49733 ( level 1 construct forposition 4,transcription unit with VP5) 1 μl 30 fmol plCH50731 ( level 1 construct forposition 5,transcription unit with VP7) 1 μl 30 fmol pELB-5 (end-linker for position 6) 9 μl water 20 μl total - In contrast to previous constructs, red/blue selection is performed rather than red/white selection. In addition to red and blue colonies, a few colonies had a dark green color. These contain incorrect plasmids that have both the canthaxanthin operon and a lacZ gene. The number of colonies containing correctly assembled plasmids (blue colonies), and the ratio of red to blue colonies for plCH51212 and plCH51226 were comparable with the
level 2 construct made from a similar number of entry clones, plCH51191 and plCH51201. - As a starting point for the introduction of up to six
further level 1 constructs, thelevel 2i-1 construct plCH51212 was chosen as a destination vector. This construct contains, beside fivelevel 1 modules, a lacZα end-linker providing two Bsal restriction sites. In contrast to the previously described cloning oflevel 1,level 2 orlevel 2i constructs, which required either Bsal (level 1) or Bpil (level 2 andlevel 2i) alone, we have to use here both enzymes at the same time. Bsal allows reopening thelevel 2i backbone and provides defined overhangs which are compatible with thelevel 1 modules released by Bpil. Since two type IIS restriction enzymes have to be used at the same time and the target plasmid has already a size of 22 kb, we again tested the efficiency of the Golden Gate cloning for the introduction of either one to up to sixlevel 1 modules simultaneously. - The results for the construction of plasmids plCH51761 to plCH51811 show that the cloning efficiency decreases dependent on the number of incorporated transcription unit fragment constructs (
FIG. 10 ). Interestingly, the rate with which the cloning efficiency drops is similar to the earlier analyzed set of level 2-1 plasmids (plCH51161 to plCH51201) despite the different target plasmid size of 22 kb versus 4 kb. Since no positive clone was obtained for the largest construct, we repeated the cloning using different reaction conditions (FIG. 10 , for plCH1811*). - The following reaction mix was set up (
FIG. 19D ): -
2 μl 10x T4-ligase buffer (Promega) 1 μl high concentrated T4-ligase (Promega; 10 U/μl) 0.5 μl Bpil restriction enzyme (Fermentas 10 U/μl) 0.5 μl Bsal restriction enzyme (NEB 10 U/μl) 4.67 μl 40 fmol plCH51212 level 2i-1 construct0.72 μl 40 fmol plCH50741 ( level 1 construct for position 1)0.75 μl 40 fmol plCH50751 ( level 1 construct for position 2)0.69 μl 40 fmol plCH50761 ( level 1 construct for position 3)0.75 μl 40 fmol plCH50771 ( level 1 construct for position 4)0.58 μl 40 fmol plCH50781 ( level 1 construct for position 5)1.17 μl 40 fmol plCH50791 ( level 1 construct for position 6)0.70 μl 40 fmol pELE-4 (end linker) 5.97 μl water 20 μl total - The mix was incubated in a thermocycler with the following parameters: incubation for 2 minutes at 37° C., 5 minutes at 16° C., both steps repeated 45 times, followed by incubation for 5 minutes at 80° C. and 10 minutes are 80° C. The reaction mix was transformed in E. coli chemically competent cells, and an aliquot of the transformation plated on a LB plate containing Kanamycing and X-gal. These parameters greatly increased cloning efficiency since 2685 white colonies were obtained (for the whole transformation) and no blue colony (
FIG. 10 ). Plasmid DNA from 6 randomly picked colonies was analyzed. All were found to contain the expected correct construct. - We have therefore shown here that complex constructs containing many transcription units (eleven as shown here, consisting of 44 individual basic modules) can easily be assembled by a series of three easy-to-perform one-pot reactions, and with extremely high cloning efficiency. The largest construct made in this study (pICH51811) has a size of 34 kb. Considering the relative efficiency with which this construct and its precursors were obtained, it is likely that we have not yet reached the upper size limit for constructs that can be made using this technology. To make larger constructs starting from those that we have described here, the next step would be to remake the final construct (pICH51811), but using an end-linker that would add two restriction sites for the enzyme Esp3I (end-linker pELR-4,
FIG. 4 ), and use the resulting plasmid as a destination vector to add one or several additional genes. - To check the constructs, at least for one of the transcriptional units (containing GFP), all
level 2 constructs were introduced into Agrobacterium tumefaciens. Agrobacterium suspensions were infiltrated with a syringe without a needle into Nicotiana benthamiana leaves. GFP is expressed from all constructs (FIG. 21 ), as expected from expression cassettes driven by the 35S promoter. Interestingly, the level of GFP expression was found to decrease for the largest constructs. This can be explained by the fact that the GFP gene was always located at the left border in all constructs; since T-DNA transfer to plant cells occurs from the right to the left border, and is sometimes incomplete, plant cells will acquire the GFP cassettes from large constructs less frequently than from smaller constructs. - As an example for cloning of bacterial operons, we chose to work with a carotenoid biosynthesis pathway since carotenoids are easily visible in the colonies, for example as a red color for lycopene. We chose the Pantoea ananatis Zeaxanthin biosynthesis pathway, since all genes of the pathway are known and sequenced (Misawa et al., Journal of Bacteriology, 1990, 172:6704-6712). Three genes of this pathway are required for lycopene biosynthesis crtE, crtI and crtB. We PCR-amplified all three genes and cloned them in vector pL0-SO (
FIG. 18 ) as described for eukaryotic coding sequences. The resultinglevel 0 constructs are shown inFIG. 22 . We also cloned several genes known to increase synthesis of lycopene when over-expressed in E. coli cells expressing the three crt genes necessary for lycopene production. These genes are dxs, ispA, idi and AppY. The genes for dxs and ispA were amplified and cloned from both E. coli strain K12 and from Agrobacterium rhizogenes strain K84. The genes for idi and AppY were cloned only from E. coli strain K12. In addition, the LacZ promoter present in pUC19 and the Pantoea ananatis promoter from the Zeaxanthin biosynthesis operon were also cloned aslevel 0 modules (FIG. 22 ). -
Level 1 destination vectors for cloning of bacterial coding sequences are different from destination vectors for cloning eukaryotic transcription units, since they are designed for cloning of individual coding sequences rather than complete transcription units. Moreover, they also provide a bacterial ribosome binding site (RBS) to the cloned coding sequence. Instead of making vectors with a defined RBS sequence, we made vectors with a degenerate RBS to provide a range of expression levels (FIG. 23A ). For example, vector pICH4862L was made by PCR amplification of the LacZ fragment from pUC19 using primers prok1-1deg (SEQ ID NO: 15) (tttcgtctcattcagaagacat TGCC nv agga dnnnnnn AATG ggagaccttatgaccatgattacgccaagc, in bold is the core sequence of the RBS and flanking it is degenerate sequence, the underlined sequence is the cleavage sites of the Bpil or Bsal restriction sites) and prok1-2 (SEQ ID NO: 16) (tttcgtctcacttagaagacaa TTGC AAGC tgagaccttatgcggcatcagagcagattgt). In the above sequence, v stands for a, c or g; d stands for a, t or g; and n stands for a, t, g or c. This PCR product was cloned using DraIII in a pUC19-based plasmid backbone containing compatible DraIII sites. Single colonies were not picked since the result of this cloning is a library. Instead, the entire library was grown in E. coli and plasmid DNA prepared from the library. A similar procedure was repeated for the other 6 destination vectors, but using different primers for the different type IIs cleavage sites. - Since for this experiment we are cloning carotenoid genes, end-linkers and
level 2 destination vectors were made that do not already contain carotenoid genes. End linkers in particular were made in which the sequence of the linker consists of the Lac Z terminator sequences from plasmid pUC19. The two sets of end linkers are shown inFIG. 23B . - A set of
level 1 constructs was constructed fromlevel 0 modules. Promoter modules were cloned in vector pL1F-1 (FIG. 12 ) and the coding sequences in vectors pICH5025L to pICH2030L for the different positions expected in the final construct (FIG. 24 ). Cloning was performed as a one-tube cloning reaction using the selectedlevel 0 module and the chosen destination vector (or vector library) using the enzyme Bsal as described for cloning oflevel 1 eukaryotic transcription units. The 6 genes cloned aslevel 0 modules and known to increase lycopene production in E. coli were all cloned forpositions - Two constructs were made containing the lycopene biosynthesis genes crtB, crtE and crtI and either the lac Z promoter or the Pantoea ananatis promoter. No terminator cloned as
level 1 module was used since the terminator is provided here by the end-linker (FIG. 25 ). The resulting constructs, pICH5848L and pICH5850L are in fact two different libraries, in which all constructs have different RBS sequences for each of the three genes. The colonies displayed a wide variation in the amount of red color, as expected from such libraries. The libraries were grown and plasmid DNA prepared from them. - The next step consists of adding two or three genes to the previous constructs to try to increase the amount of lycopene produced. Since we have 6 genes available cloned at three different positions, there are too many possible combinations to test them all individually. Instead, libraries can be made in which any of the 6 genes will be cloned randomly at any of the two or three positions (
position FIG. 26 A and B). Four different libraries were made by setting up 4 reactions by adding into one tube all the constructs depicted inFIG. 26A (two reactions with either the LacZ promoter or the Pantoea promoter) or 26B (two reactions as well) and incubating the mixes in presence of Bsal and Bpil enzymes and ligase (as described previously forlevel 2i-2 for eukaryotic constructs). Colonies with a wide range of intensity for the red color were obtained (FIG. 26B ), with many colonies having a stronger red that the parental 2i-1 constructs. To test that genes (from the 6 selected genes) had been cloned randomly at the three positions, 8 constructs from intense red colonies were sequenced. All constructs contained a different combination of genes. All sequenced constructs contained a dxs gene at least at one of the three positions; this is not surprising, since dxs is known to be the gene with the strongest effect on lycopene production among all enhancer genes tested as reported in the literature.
Claims (9)
1. System for producing a nucleic acid construct of interest, said system comprising:
a set of n destination vectors (“destination vectors M”), n being an integer of at least 2, preferably at least 3,
each of said n destination vectors M comprising in the following order:
(I′) a type IIs restriction endonuclease recognition site defining the cleavage site of item (II′);
(II′) the cleavage site of said recognition site of item (I′);
(III′) a cleavage site of said recognition site of the following item (IV′);
(IV′) a further type IIs restriction endonuclease recognition site defining the cleavage site of item (III′) and being a different recognition site of a type IIs restriction endonuclease from that of item (I′);
(V′) a vector backbone comprising a selectable marker gene, said vector backbone linking the cleavage sites of said recognition sites of item and
(IV′) and the following item (VI′);
(VI′) a further type IIs restriction endonuclease cleavage site;
(VII′) a type IIs restriction endonuclease recognition site of the cleavage site of item (VI′) and
(VIII′) optionally, an insert between the recognition sites of item (VII′) and item (I′); and
a set of n linkers M, n being as defined above, each linker M comprising in the following order:
(a′) a type IIs restriction endonuclease recognition site defining the cleavage site of item (b′);
(b′) the cleavage site of said recognition site of item (a′);
(c′) a cleavage site of a further type IIs restriction endonuclease recognition site of item (d′), said cleavage site having the same sequence of nucleotides as the cleavage site of item (b′);
(d′) the type IIs restriction endonuclease recognition site defining the cleavage site of item (c′) and being a different recognition site of a type IIs restriction endonuclease different from that of item (a′);
(e′) a further cleavage site of a type IIs restriction endonuclease recognition site of the following item (f′);
(f′) the type IIs restriction endonuclease recognition site defining the cleavage site of item (e′), that is preferably a recognition site of the same endonuclease as the recognition site of item (a′);
wherein
the cleavage sites (VI′) of all n destination vectors M are identical;
the cleavage sites (e′) of all n linkers M are identical;
the cleavage site of item (VI′) of each destination vector M is complementary to the cleavage site of item (e′) of each linker M for allowing annealing of single-stranded overhangs produced by the type IIs restriction endonuclease recognising recognition sites (VII′) and (f′);
the cleavage sites of items (II′) and (III′) within each destination vector M have the same sequence of nucleotides and may overlap such that one and the same sequence of nucleotides provides the cleavage site of items (II′) and that of item (III′); and
the cleavage sites of items (b′) and (c′) within each linker M have the same sequence of nucleotides and may overlap such that one and the same sequence of nucleotides provides the cleavage site of items (b′) and that of item (c′); and
the cleavage site (II′) of each destination vector M is unique among the cleavage sites (II′) of the set of n destination vectors M such that there are n different cleavage sites (II′), wherein for each of said n different cleavage sites (II′), there is a linker M having a cleavage site (b′) of identical nucleotide sequence among the set of n linkers M.
2. The system according to claim 1 , wherein
(α) the recognition sites of items (a′) and (f′) of all n linkers M are recognition sites of the same type IIs restriction endonuclease;
(β) the recognition sites of items (d′) of all n linkers M are recognition sites of the same type IIs restriction endonuclease;
wherein the recognition sites of item (a) are different recognitions sites from those of item (β).
3. The system according to claim 1 , wherein
(γ) the recognition sites of items (I′) and (VII′) of all n destination vectors M are recognition sites of the same type IIs restriction endonuclease;
(δ) the recognition sites of items (IV′) of all n destination vectors M are recognition sites of the same type IIs restriction endonuclease;
wherein the recognition sites of item (γ) are different recognitions sites from those of item (δ).
4. The system according to claim 1 , wherein
the recognition sites of items (VII′) and (I′) of destination vectors M and of items (a′) and (f′) of the linkers M are recognition sites of the same type IIs restriction endonuclease;
the recognition sites of items (IV′) of destination vectors M and of items (d′) of the linkers M are recognition sites of the same type IIs restriction endonuclease.
5. The system according to claim 1 , comprising:
a set of z entry DNAs numbered 1 to z, z being an integer of at least 2, preferably an integer of at least 3,
each of said z entry DNAs comprising in this order:
(i) a type IIs restriction endonuclease recognition site followed by the cleavage site thereof;
(ii) a sequence portion linking the cleavage site of said recognition site of item (i) with the cleavage site of the recognition site of the following item (iii), and
(iii) a cleavage site of a further type IIs restriction endonuclease recognition site followed by the recognition site of said cleavage site;
wherein
the cleavage site of item (i) of each entry DNA is complementary to the cleavage site of item (II′) of one of the n destination vectors M for allowing annealing of single-stranded overhangs produced by the type IIs restriction endonuclease recognising recognition sites of items (i) and (I′),
the recognition sites of item (i) of all z entry DNAs are preferably recognition sites of the same type IIs restriction endonuclease as the recognition sites of item (I′) and (VII′);
the cleavage site of item (iii) of each entry DNA is complementary to the cleavage sites of item (b′) of one of the n linkers M for allowing annealing of single-stranded overhangs produced by the type IIs restriction endonuclease recognising recognition sites of items (iii) and (a′),
the recognition sites of item (i) are recognition sites of the same type IIs restriction endonuclease as the recognition sites of item (a′) and (f′); and
the recognition sites of items (i) and (iii) of all z entry DNAs are recognition sites of the same type IIs restriction endonuclease.
6. The system according to claim 1 , further comprising a set of n destination vectors (“destination vectors P”), wherein n is as defined in claim 1 ,
each of said n destination vectors P comprising in the following order:
(I″) a type IIs restriction endonuclease recognition site defining the cleavage site of item (II″);
(II″) the cleavage site of said recognition site of item (I″);
(III″) a cleavage site of said recognition site of the following item (IV″);
(IV″) a further type IIs restriction endonuclease recognition site defining the cleavage site of item (III″) and being a different recognition site of a type IIs restriction endonuclease from that of item (I″);
(V″) a vector backbone comprising a selectable marker gene, said vector backbone linking the cleavage sites of said recognition sites of item and (IV″) and the following item (VI″);
(VI″) a further type IIs restriction endonuclease cleavage site;
(VII″) a type IIs restriction endonuclease recognition site of the cleavage site of item (VI″), preferably of the same endonuclease as the recognition site of item (I″) and
(VIII″) optionally, an insert between the recognition sites of item (VII″) and item (I″); and
a set of n linkers P, each linker P comprising in the following order:
(a″) a type IIs restriction endonuclease recognition site defining the cleavage site of item (b″);
(b″) the cleavage site of said recognition site of item (a″);
(c″) a cleavage site of a further type IIs restriction endonuclease recognition site of item (d″), said cleavage site having the same nucleotide sequence as the cleavage site of item (b″);
(d″) the type IIs restriction endonuclease recognition site defining the cleavage site of item (c″) and being a different recognition site of a type IIs restriction endonuclease from that of item (a″);
(e″) a further cleavage site of a type IIs restriction endonuclease recognition site of the following item (f″);
(f″) the type IIs restriction endonuclease recognition site defining the cleavage site of item (e″), that is preferably a recognition site of the same endonuclease as the recognition site of item (a″);
wherein
the cleavage sites (VI″) of all n destination vectors P are identical;
the cleavage sites (e″) of all n linkers P are identical;
the cleavage site of item (VI″) of each destination vector P is complementary to the cleavage site of item (e″) of each linker P for allowing annealing of single-stranded overhangs produced by the type IIs restriction endonuclease recognising recognition sites (VII″) and (f″);
the cleavage sites of items (II″) and (III″) within each destination vector P have the same sequence of nucleotides and may overlap such that one and the same sequence of nucleotides provides the cleavage site of item (II″) and the cleavage site of item (III″);
the cleavage sites of items (b″) and (c″) within each linker P have the same sequence of nucleotides and may overlap such that one and the same sequence of nucleotides provides the cleavage site of items (b″) and the cleavage site of item (c″); and
for each of said n different cleavage sites (b′) or (II′), there is a destination vector P having a cleavage site (II″) of identical nucleotide sequence as the nucleotide sequence of cleavage sites (b′) or (II′); and
for each of said n different cleavage sites (b′) or (II′), there is a linker P having a cleavage site (b″) of identical nucleotide sequence as the nucleotide sequence of cleavage sites (b′) or (II′).
7. The system according to claim 6 , wherein
the recognition sites of items (I″) and (VII″) of all n destination vectors P are recognition sites of the same type IIs restriction endonuclease; and
the recognition sites of items (IV″) of all n destination vectors P are recognition sites of the same type IIs restriction endonuclease but different from the recognition sites of items (I″) and (VII″).
8. The system according to claim 6 , wherein
the recognition sites of items (I″), (IV′), (d′), (a″) and (f″) are recognition sites of the same type IIs restriction endonuclease;
the recognition sites of items (IV″), (I′), (VII′), (a′) and (f′) are recognition sites of the same type IIs restriction endonuclease.
9. Use of the system defined in claim 1 for cloning a nucleic acid construct of interest into a vector.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18/150,272 US20230257757A1 (en) | 2010-06-11 | 2023-01-05 | System and Method of Modular Cloning |
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP10006090.4 | 2010-06-11 | ||
EP10006090 | 2010-06-11 | ||
EP10006955.8 | 2010-07-06 | ||
EP10006955A EP2395087A1 (en) | 2010-06-11 | 2010-07-06 | System and method of modular cloning |
PCT/EP2011/002843 WO2011154147A1 (en) | 2010-06-11 | 2011-06-09 | System and method of modular cloning |
US201313703123A | 2013-05-14 | 2013-05-14 | |
US15/634,803 US11597937B2 (en) | 2010-06-11 | 2017-06-27 | System and method of modular cloning |
US18/150,272 US20230257757A1 (en) | 2010-06-11 | 2023-01-05 | System and Method of Modular Cloning |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/634,803 Division US11597937B2 (en) | 2010-06-11 | 2017-06-27 | System and method of modular cloning |
Publications (1)
Publication Number | Publication Date |
---|---|
US20230257757A1 true US20230257757A1 (en) | 2023-08-17 |
Family
ID=43242437
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/703,123 Abandoned US20130267021A1 (en) | 2010-06-11 | 2011-06-09 | System and method of modular cloning |
US15/634,803 Active 2032-02-08 US11597937B2 (en) | 2010-06-11 | 2017-06-27 | System and method of modular cloning |
US18/150,272 Pending US20230257757A1 (en) | 2010-06-11 | 2023-01-05 | System and Method of Modular Cloning |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/703,123 Abandoned US20130267021A1 (en) | 2010-06-11 | 2011-06-09 | System and method of modular cloning |
US15/634,803 Active 2032-02-08 US11597937B2 (en) | 2010-06-11 | 2017-06-27 | System and method of modular cloning |
Country Status (3)
Country | Link |
---|---|
US (3) | US20130267021A1 (en) |
EP (3) | EP2395087A1 (en) |
WO (1) | WO2011154147A1 (en) |
Families Citing this family (76)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2395087A1 (en) | 2010-06-11 | 2011-12-14 | Icon Genetics GmbH | System and method of modular cloning |
CN104204205B (en) | 2012-03-27 | 2017-06-27 | 帝斯曼知识产权资产管理有限公司 | Cloning process |
WO2014004393A1 (en) | 2012-06-25 | 2014-01-03 | Gen9, Inc. | Methods for nucleic acid assembly and high throughput sequencing |
US20140038240A1 (en) * | 2012-07-10 | 2014-02-06 | Pivot Bio, Inc. | Methods for multipart, modular and scarless assembly of dna molecules |
CN103215296B (en) * | 2013-03-22 | 2014-12-03 | 武汉伯远生物科技有限公司 | Multi-fragment DNA molecule assemble method and application |
US9206433B2 (en) * | 2013-05-01 | 2015-12-08 | Dna Twopointo, Inc. | Methods, compositions and kits for a one-step DNA cloning system |
US10253321B2 (en) | 2013-05-01 | 2019-04-09 | Dna2.0, Inc. | Methods, compositions and kits for a one-step DNA cloning system |
CN104419716B (en) * | 2013-08-27 | 2017-12-29 | 天津大学 | A kind of standardization, high-precision, general functional module construction method |
EP3050971B1 (en) | 2015-01-30 | 2020-06-24 | Symrise AG | Process for the enzymatic production of a 4-O-methylated cinnamic acid amide |
FR3032447B1 (en) | 2015-02-10 | 2020-02-21 | Universite Des Sciences Et Technologies De Lille-Lille 1 | METHOD OF MANUFACTURING DNA VECTORS FROM MOLECULAR BRICKS CONTAINING SEQUENCES OF INTEREST |
WO2016176325A1 (en) * | 2015-04-27 | 2016-11-03 | California Institute Of Technology | Methods and compositions for rapid assembly of genetic modules |
EP3097783B1 (en) | 2015-05-26 | 2019-11-13 | Nomad Bioscience GmbH | Colicin m for the control of ehec |
US9988624B2 (en) | 2015-12-07 | 2018-06-05 | Zymergen Inc. | Microbial strain improvement by a HTP genomic engineering platform |
WO2017100376A2 (en) | 2015-12-07 | 2017-06-15 | Zymergen, Inc. | Promoters from corynebacterium glutamicum |
US11208649B2 (en) | 2015-12-07 | 2021-12-28 | Zymergen Inc. | HTP genomic engineering platform |
EP3858996B1 (en) | 2015-12-07 | 2022-08-03 | Zymergen Inc. | Microbial strain improvement by a htp genomic engineering platform |
EP3178313A1 (en) | 2015-12-09 | 2017-06-14 | Leibniz-Institut für Pflanzenbiochemie | Gene and protein for the synthesis of oxidized zingiberene derivatives |
WO2017210148A1 (en) * | 2016-05-31 | 2017-12-07 | Igc Bio, Inc. | Compositions and methods for self-recombination plasmid transformation |
US11293021B1 (en) | 2016-06-23 | 2022-04-05 | Inscripta, Inc. | Automated cell processing methods, modules, instruments, and systems |
US10544411B2 (en) | 2016-06-30 | 2020-01-28 | Zymergen Inc. | Methods for generating a glucose permease library and uses thereof |
JP2019519242A (en) | 2016-06-30 | 2019-07-11 | ザイマージェン インコーポレイテッド | Method for generating a bacterial hemoglobin library and its use |
KR20190027843A (en) | 2016-07-05 | 2019-03-15 | 지머젠 인코포레이티드 | Genetic disturbance of RNA de Grado som protein complex |
NZ752583A (en) * | 2016-11-07 | 2023-10-27 | Genovie Ab | A two-part device for t-cell receptor synthesis and stable genomic integration to tcr-presenting cells |
WO2018147071A1 (en) * | 2017-02-08 | 2018-08-16 | Spiber株式会社 | Method for obtaining target dna fragments |
US20180362965A1 (en) * | 2017-02-22 | 2018-12-20 | Ingenza Ltd. | Method for assembly of polynucleic acid sequences using phosphorothioate bonds within linker oligos |
FR3064644A1 (en) | 2017-04-04 | 2018-10-05 | Inovactis | ANTI-TUMOR IMMUNOTHERAPY BASED ON YEASTS |
EP3625351A1 (en) | 2017-05-19 | 2020-03-25 | Zymergen Inc. | Genomic engineering of biosynthetic pathways leading to increased nadph |
ES2875579T3 (en) | 2017-06-06 | 2021-11-10 | Zymergen Inc | HTP genomic engineering platform to improve Escherichia coli |
EP3635110A2 (en) | 2017-06-06 | 2020-04-15 | Zymergen, Inc. | A high-throughput (htp) genomic engineering platform for improving saccharopolyspora spinosa |
JP7227162B2 (en) | 2017-06-06 | 2023-02-21 | ザイマージェン インコーポレイテッド | HTP Genome Engineering Platform for Improving Fungal Strains |
JP2020524494A (en) | 2017-06-06 | 2020-08-20 | ザイマージェン インコーポレイテッド | High throughput transposon mutagenesis |
US10011849B1 (en) | 2017-06-23 | 2018-07-03 | Inscripta, Inc. | Nucleic acid-guided nucleases |
US9982279B1 (en) | 2017-06-23 | 2018-05-29 | Inscripta, Inc. | Nucleic acid-guided nucleases |
ES2913457T3 (en) | 2017-06-30 | 2022-06-02 | Inscripta Inc | Automated Cell Processing Methods, Modules, Instruments and Systems |
JP7335224B2 (en) * | 2017-07-18 | 2023-08-29 | ジェノヴィー エービー | A Two-Component Vector Library System for Rapid Assembly and Diversification of Full-Length T-Cell Receptor Open Reading Frames |
US10738327B2 (en) | 2017-08-28 | 2020-08-11 | Inscripta, Inc. | Electroporation cuvettes for automation |
CN111372650A (en) | 2017-09-30 | 2020-07-03 | 因思科瑞普特公司 | Flow-through electroporation apparatus |
WO2019183183A1 (en) | 2018-03-20 | 2019-09-26 | Zymergen Inc. | A htp platform for the genetic engineering of chinese hamster ovary cells |
WO2019190874A1 (en) | 2018-03-29 | 2019-10-03 | Inscripta, Inc. | Automated control of cell growth rates for induction and transformation |
WO2019200004A1 (en) | 2018-04-13 | 2019-10-17 | Inscripta, Inc. | Automated cell processing instruments comprising reagent cartridges |
US11254953B2 (en) * | 2018-04-18 | 2022-02-22 | Utah State University | Compositions and methods for zika virus characterization and vaccine development |
US10501738B2 (en) | 2018-04-24 | 2019-12-10 | Inscripta, Inc. | Automated instrumentation for production of peptide libraries |
US10858761B2 (en) | 2018-04-24 | 2020-12-08 | Inscripta, Inc. | Nucleic acid-guided editing of exogenous polynucleotides in heterologous cells |
US10526598B2 (en) | 2018-04-24 | 2020-01-07 | Inscripta, Inc. | Methods for identifying T-cell receptor antigens |
EP3810750A4 (en) | 2018-06-06 | 2022-03-23 | Zymergen, Inc. | Manipulation of genes involved in signal transduction to control fungal morphology during fermentation and production |
EP4070802A1 (en) | 2018-06-30 | 2022-10-12 | Inscripta, Inc. | Instruments, modules, and methods for improved detection of edited sequences in live cells |
US11142740B2 (en) | 2018-08-14 | 2021-10-12 | Inscripta, Inc. | Detection of nuclease edited sequences in automated modules and instruments |
US10532324B1 (en) | 2018-08-14 | 2020-01-14 | Inscripta, Inc. | Instruments, modules, and methods for improved detection of edited sequences in live cells |
WO2020058438A1 (en) | 2018-09-20 | 2020-03-26 | Sanofi | Intron-based universal cloning methods and compositions |
US11214781B2 (en) | 2018-10-22 | 2022-01-04 | Inscripta, Inc. | Engineered enzyme |
CN113227368B (en) | 2018-10-22 | 2023-07-07 | 因思科瑞普特公司 | Engineered enzymes |
KR20210125494A (en) * | 2019-01-09 | 2021-10-18 | 아시모브 인코포레이티드 | Compositions of polynucleic acid vectors and uses thereof |
AU2020247900A1 (en) | 2019-03-25 | 2021-11-04 | Inscripta, Inc. | Simultaneous multiplex genome editing in yeast |
US11001831B2 (en) | 2019-03-25 | 2021-05-11 | Inscripta, Inc. | Simultaneous multiplex genome editing in yeast |
GB201904653D0 (en) * | 2019-04-02 | 2019-05-15 | Univ Oxford Innovation Ltd | Universal DNA assembly |
WO2020247587A1 (en) | 2019-06-06 | 2020-12-10 | Inscripta, Inc. | Curing for recursive nucleic acid-guided cell editing |
CN114008070A (en) | 2019-06-21 | 2022-02-01 | 因思科瑞普特公司 | Whole genome rationally designed mutations leading to increased lysine production in E.coli |
US10927385B2 (en) | 2019-06-25 | 2021-02-23 | Inscripta, Inc. | Increased nucleic-acid guided cell editing in yeast |
WO2021061694A1 (en) | 2019-09-23 | 2021-04-01 | Zymergen Inc. | Method for counterselection in microorganisms |
WO2021102059A1 (en) | 2019-11-19 | 2021-05-27 | Inscripta, Inc. | Methods for increasing observed editing in bacteria |
US10883095B1 (en) | 2019-12-10 | 2021-01-05 | Inscripta, Inc. | Mad nucleases |
US10704033B1 (en) | 2019-12-13 | 2020-07-07 | Inscripta, Inc. | Nucleic acid-guided nucleases |
IL292895A (en) | 2019-12-18 | 2022-07-01 | Inscripta Inc | Cascade/dcas3 complementation assays for in vivo detection of nucleic acid-guided nuclease edited cells |
US10689669B1 (en) | 2020-01-11 | 2020-06-23 | Inscripta, Inc. | Automated multi-module cell processing methods, instruments, and systems |
US11225674B2 (en) | 2020-01-27 | 2022-01-18 | Inscripta, Inc. | Electroporation modules and instrumentation |
US20210332388A1 (en) | 2020-04-24 | 2021-10-28 | Inscripta, Inc. | Compositions, methods, modules and instruments for automated nucleic acid-guided nuclease editing in mammalian cells |
US11787841B2 (en) | 2020-05-19 | 2023-10-17 | Inscripta, Inc. | Rationally-designed mutations to the thrA gene for enhanced lysine production in E. coli |
MX2023000708A (en) | 2020-07-16 | 2023-02-14 | Nomad Bioscience Gmbh | Products for oral consumption with reduced sugar content. |
US11479779B2 (en) | 2020-07-31 | 2022-10-25 | Zymergen Inc. | Systems and methods for high-throughput automated strain generation for non-sporulating fungi |
US11299731B1 (en) | 2020-09-15 | 2022-04-12 | Inscripta, Inc. | CRISPR editing to embed nucleic acid landing pads into genomes of live cells |
US11512297B2 (en) | 2020-11-09 | 2022-11-29 | Inscripta, Inc. | Affinity tag for recombination protein recruitment |
EP4243608A1 (en) | 2020-11-11 | 2023-09-20 | Leibniz-Institut für Pflanzenbiochemie (IPB) | Fusion protein for editing endogenous dna of a eukaryotic cell |
AU2021415461A1 (en) | 2021-01-04 | 2023-08-17 | Inscripta, Inc. | Mad nucleases |
US11332742B1 (en) | 2021-01-07 | 2022-05-17 | Inscripta, Inc. | Mad nucleases |
WO2022173636A1 (en) * | 2021-02-12 | 2022-08-18 | President And Fellows Of Harvard College | Compositions and methods for molecular cloning |
US11884924B2 (en) | 2021-02-16 | 2024-01-30 | Inscripta, Inc. | Dual strand nucleic acid-guided nickase editing |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020025561A1 (en) * | 2000-04-17 | 2002-02-28 | Hodgson Clague Pitman | Vectors for gene-self-assembly |
DE10145553B4 (en) * | 2001-09-16 | 2006-06-08 | Gene Architects Ag | Nucleic acid for a cloning vector |
US20040166567A1 (en) * | 2002-09-26 | 2004-08-26 | Santi Daniel V | Synthetic genes |
WO2008095927A1 (en) | 2007-02-05 | 2008-08-14 | Philipps-Universität Marburg | Method of cloning at least one nucleic acid molecule of interest using type iis restriction endonucleases, and corresponding cloning vectors, kits and system using type iis restriction endonucleases |
EP2395087A1 (en) | 2010-06-11 | 2011-12-14 | Icon Genetics GmbH | System and method of modular cloning |
-
2010
- 2010-07-06 EP EP10006955A patent/EP2395087A1/en not_active Withdrawn
-
2011
- 2011-06-09 WO PCT/EP2011/002843 patent/WO2011154147A1/en active Application Filing
- 2011-06-09 US US13/703,123 patent/US20130267021A1/en not_active Abandoned
- 2011-06-09 EP EP16200729.8A patent/EP3159412B1/en active Active
- 2011-06-09 EP EP11730558.1A patent/EP2580332B1/en active Active
-
2017
- 2017-06-27 US US15/634,803 patent/US11597937B2/en active Active
-
2023
- 2023-01-05 US US18/150,272 patent/US20230257757A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
EP2580332A1 (en) | 2013-04-17 |
WO2011154147A1 (en) | 2011-12-15 |
EP2395087A1 (en) | 2011-12-14 |
EP3159412B1 (en) | 2019-08-07 |
EP2580332B1 (en) | 2016-12-07 |
EP3159412A1 (en) | 2017-04-26 |
US20170369889A1 (en) | 2017-12-28 |
US20130267021A1 (en) | 2013-10-10 |
US11597937B2 (en) | 2023-03-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230257757A1 (en) | System and Method of Modular Cloning | |
Engler et al. | Golden gate shuffling: a one-pot DNA shuffling method based on type IIs restriction enzymes | |
Lampropoulos et al. | GreenGate-a novel, versatile, and efficient cloning system for plant transgenesis | |
Weber et al. | A modular cloning system for standardized assembly of multigene constructs | |
Sarrion-Perdigones et al. | Design and construction of multigenic constructs for plant biotechnology using the GoldenBraid cloning strategy | |
CN111094575B (en) | DNA assembly | |
EP3257942B1 (en) | Method for assembly of polynucleic acid sequences | |
US10041067B2 (en) | Methods and compositions for rapid assembly of genetic modules | |
JP2004121248A (en) | Method for preparation and application of multi-gene recombinant vector construct | |
Gualberto et al. | DNA-binding proteins in plant mitochondria: Implications for transcription | |
CN113564197B (en) | Construction method and application of CRISPR/Cas9 mediated plant polygene editing vector | |
Zhao et al. | Efficient assembly of long DNA fragments and multiple genes with improved nickase‐based cloning and Cre/loxP recombination | |
CN106103712A (en) | A kind of efficient gene cloning method and application thereof | |
CN109563508A (en) | By fixed point DNA cracking and repair targeting protein diversification in situ | |
US20240043834A1 (en) | Linked dna production method and vector combination for use therein | |
CN104109683A (en) | Primer-free gene synthesis method | |
Angulo et al. | Targeted mutagenesis of the Arabidopsis GROWTH-REGULATING FACTOR (GRF) gene family suggests competition of multiplexed sgRNAs for Cas9 apoprotein | |
Fernandez‐Moreno et al. | A rapid and scalable approach to build synthetic repetitive hormone‐responsive promoters | |
Curtis et al. | Recombinant DNA, vector design, and construction | |
JP7473953B2 (en) | Method for producing ligated DNA and combination of vectors for use therein | |
TWI686477B (en) | Cloning vector, kit, and method for specifically inducing mutagenesis in chloroplast genes, and transgenic plant cells and agrobacterium generated by the same | |
Ren et al. | Engineering the K+ uptake regulatory pathway by MultiRound Gateway | |
MANN | MARK D. CURTIS | |
Piepers et al. | GreenGate 2.0: Backwards compatible addons for assembly of complex transcriptional units and their stacking with GreenGate | |
CN116426552A (en) | Multi-gene vector assembly system and construction method of multi-gene expression vector |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |