OA19168A - Multigene expression in microalgae. - Google Patents
Multigene expression in microalgae. Download PDFInfo
- Publication number
- OA19168A OA19168A OA1201800273 OA19168A OA 19168 A OA19168 A OA 19168A OA 1201800273 OA1201800273 OA 1201800273 OA 19168 A OA19168 A OA 19168A
- Authority
- OA
- OAPI
- Prior art keywords
- expression
- nucleic acid
- host cell
- transgenes
- expression cassettes
- Prior art date
Links
- 230000014509 gene expression Effects 0.000 title claims abstract description 161
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 63
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 47
- 239000003550 marker Substances 0.000 claims description 29
- 235000014113 dietary fatty acids Nutrition 0.000 claims description 24
- 239000000194 fatty acid Substances 0.000 claims description 24
- 150000004665 fatty acids Chemical class 0.000 claims description 24
- 102000004190 Enzymes Human genes 0.000 claims description 15
- 108090000790 Enzymes Proteins 0.000 claims description 15
- 241000224474 Nannochloropsis Species 0.000 claims description 15
- 230000001131 transforming Effects 0.000 claims description 13
- 241000206744 Phaeodactylum tricornutum Species 0.000 claims description 11
- 241000710198 Foot-and-mouth disease virus Species 0.000 claims description 10
- 230000001851 biosynthetic Effects 0.000 claims description 10
- 230000037361 pathway Effects 0.000 claims description 10
- 230000000051 modifying Effects 0.000 claims description 3
- 230000015572 biosynthetic process Effects 0.000 abstract description 6
- 230000002018 overexpression Effects 0.000 abstract description 6
- 230000037348 biosynthesis Effects 0.000 abstract description 3
- 210000004027 cells Anatomy 0.000 description 94
- 229920001850 Nucleic acid sequence Polymers 0.000 description 17
- 108010084455 Zeocin Proteins 0.000 description 15
- VBJHPXDIVMXHJU-UHFFFAOYSA-N Zeocin Chemical compound N=1C(C=2SC=C(N=2)C(=O)NCCCCN=C(N)N)CSC=1CCNC(=O)C(C(O)C)NC(=O)C(C)C(O)C(C)NC(=O)C(C(OC1C(C(O)C(O)C(CO)O1)OC1C(C(OC(N)=O)C(O)C(CO)O1)O)C=1[N]C=NC=1)NC(=O)C1=NC(C(CC(N)=O)NCC(N)C(N)=O)=NC(N)=C1C VBJHPXDIVMXHJU-UHFFFAOYSA-N 0.000 description 15
- 230000003115 biocidal Effects 0.000 description 15
- 229920003013 deoxyribonucleic acid Polymers 0.000 description 15
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 12
- 239000002609 media Substances 0.000 description 12
- 238000004519 manufacturing process Methods 0.000 description 11
- 102000004169 proteins and genes Human genes 0.000 description 11
- 108090000623 proteins and genes Proteins 0.000 description 11
- 108090000765 processed proteins & peptides Proteins 0.000 description 9
- 102000004196 processed proteins & peptides Human genes 0.000 description 9
- 229920000160 (ribonucleotides)n+m Polymers 0.000 description 8
- 229920001405 Coding region Polymers 0.000 description 8
- 239000001963 growth media Substances 0.000 description 8
- 229940035893 Uracil Drugs 0.000 description 6
- NRAUADCLPJTGSF-ZPGVOIKOSA-N [(2R,3S,4R,5R,6R)-6-[[(3aS,7R,7aS)-7-hydroxy-4-oxo-1,3a,5,6,7,7a-hexahydroimidazo[4,5-c]pyridin-2-yl]amino]-5-[[(3S)-3,6-diaminohexanoyl]amino]-4-hydroxy-2-(hydroxymethyl)oxan-3-yl] carbamate Chemical compound NCCC[C@H](N)CC(=O)N[C@@H]1[C@@H](O)[C@H](OC(N)=O)[C@@H](CO)O[C@H]1\N=C/1N[C@H](C(=O)NC[C@H]2O)[C@@H]2N\1 NRAUADCLPJTGSF-ZPGVOIKOSA-N 0.000 description 6
- 239000003242 anti bacterial agent Substances 0.000 description 6
- OKTJSMMVPCPJKN-UHFFFAOYSA-N carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 6
- 229910052799 carbon Inorganic materials 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 230000001402 polyadenylating Effects 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 241000195585 Chlamydomonas Species 0.000 description 5
- 241000195649 Chlorella <Chlorellales> Species 0.000 description 5
- 241000700605 Viruses Species 0.000 description 5
- 238000009632 agar plate Methods 0.000 description 5
- 238000004520 electroporation Methods 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 241000195628 Chlorophyta Species 0.000 description 4
- 241000206751 Chrysophyceae Species 0.000 description 4
- 241000192700 Cyanobacteria Species 0.000 description 4
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N D-Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 4
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 4
- 108020004999 Messenger RNA Proteins 0.000 description 4
- 241001250129 Nannochloropsis gaditana Species 0.000 description 4
- 241000199919 Phaeophyceae Species 0.000 description 4
- 241000206572 Rhodophyta Species 0.000 description 4
- 241001491691 Thalassiosira Species 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000000855 fermentation Methods 0.000 description 4
- 230000004151 fermentation Effects 0.000 description 4
- 239000008103 glucose Substances 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 125000005647 linker group Chemical group 0.000 description 4
- 229920002106 messenger RNA Polymers 0.000 description 4
- 239000001301 oxygen Substances 0.000 description 4
- 229910052760 oxygen Inorganic materials 0.000 description 4
- MYMOFIZGZYHOMD-UHFFFAOYSA-N oxygen Chemical compound O=O MYMOFIZGZYHOMD-UHFFFAOYSA-N 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 229920001184 polypeptide Polymers 0.000 description 4
- 229960002920 sorbitol Drugs 0.000 description 4
- 101710035147 ura3 Proteins 0.000 description 4
- 230000003612 virological Effects 0.000 description 4
- 229940064005 Antibiotic throat preparations Drugs 0.000 description 3
- 229940083879 Antibiotics FOR TREATMENT OF HEMORRHOIDS AND ANAL FISSURES FOR TOPICAL USE Drugs 0.000 description 3
- 229940042052 Antibiotics for systemic use Drugs 0.000 description 3
- 229940042786 Antitubercular Antibiotics Drugs 0.000 description 3
- 241000206761 Bacillariophyta Species 0.000 description 3
- 241000195597 Chlamydomonas reinhardtii Species 0.000 description 3
- 241000199914 Dinophyceae Species 0.000 description 3
- 241000710188 Encephalomyocarditis virus Species 0.000 description 3
- 102000018711 Facilitative Glucose Transport Proteins Human genes 0.000 description 3
- 108091006272 Glucose transporter family Proteins 0.000 description 3
- 229940093922 Gynecological Antibiotics Drugs 0.000 description 3
- 229920002459 Intron Polymers 0.000 description 3
- 108020004391 Introns Proteins 0.000 description 3
- 101700031697 NAT1 Proteins 0.000 description 3
- 241000206731 Phaeodactylum Species 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 229920001914 Ribonucleotide Polymers 0.000 description 3
- 241000702670 Rotavirus Species 0.000 description 3
- 229940024982 Topical Antifungal Antibiotics Drugs 0.000 description 3
- 102000024070 binding proteins Human genes 0.000 description 3
- 108091007650 binding proteins Proteins 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 description 3
- 229930002868 chlorophyll a Natural products 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 239000005547 deoxyribonucleotide Substances 0.000 description 3
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 230000002708 enhancing Effects 0.000 description 3
- 101700029555 fcpA Proteins 0.000 description 3
- SJWWTRQNNRNTPU-ABBNZJFMSA-N fucoxanthin Chemical compound C[C@@]1(O)C[C@@H](OC(=O)C)CC(C)(C)C1=C=C\C(C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)C(=O)C[C@]1(C(C[C@H](O)C2)(C)C)[C@]2(C)O1 SJWWTRQNNRNTPU-ABBNZJFMSA-N 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 229940079866 intestinal antibiotics Drugs 0.000 description 3
- 238000000034 method Methods 0.000 description 3
- 229940005935 ophthalmologic Antibiotics Drugs 0.000 description 3
- 229920000023 polynucleotide Polymers 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 3
- 239000002336 ribonucleotide Substances 0.000 description 3
- 125000002652 ribonucleotide group Chemical group 0.000 description 3
- 235000000346 sugar Nutrition 0.000 description 3
- 150000008163 sugars Chemical class 0.000 description 3
- 230000005030 transcription termination Effects 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- 108010054662 2-acylglycerophosphate acyltransferase Proteins 0.000 description 2
- 102000000452 Acetyl-CoA Carboxylase Human genes 0.000 description 2
- 108010016219 Acetyl-CoA Carboxylase Proteins 0.000 description 2
- 241000317943 Acute bee paralysis virus Species 0.000 description 2
- 241000192542 Anabaena Species 0.000 description 2
- 240000009108 Chlorella vulgaris Species 0.000 description 2
- 235000007089 Chlorella vulgaris Nutrition 0.000 description 2
- NSMUHPMZFPKNMZ-VBYMZDBQSA-M Chlorophyll b Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C=O)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 NSMUHPMZFPKNMZ-VBYMZDBQSA-M 0.000 description 2
- 210000000349 Chromosomes Anatomy 0.000 description 2
- 241000710127 Cricket paralysis virus Species 0.000 description 2
- 241000195493 Cryptophyta Species 0.000 description 2
- FBPFZTCFMRRESA-KAZBKCHUSA-N D-Mannitol Natural products OC[C@@H](O)[C@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KAZBKCHUSA-N 0.000 description 2
- 241000907524 Drosophila C virus Species 0.000 description 2
- 241000195634 Dunaliella Species 0.000 description 2
- 241000195633 Dunaliella salina Species 0.000 description 2
- 102000002148 EC 2.3.1.20 Human genes 0.000 description 2
- 108010001348 EC 2.3.1.20 Proteins 0.000 description 2
- 108010052187 EC 2.3.1.23 Proteins 0.000 description 2
- 241000214054 Equine rhinitis A virus Species 0.000 description 2
- 241001239777 Erbovirus A Species 0.000 description 2
- 241000195620 Euglena Species 0.000 description 2
- 108009000552 Fatty Acid Biosynthesis Proteins 0.000 description 2
- BJHIKXHVCXFQLS-UYFOZJQFSA-N Fructose Natural products OC[C@@H](O)[C@@H](O)[C@H](O)C(=O)CO BJHIKXHVCXFQLS-UYFOZJQFSA-N 0.000 description 2
- 108010070792 GAT Proteins 0.000 description 2
- 102000000587 Glycerolphosphate Dehydrogenase Human genes 0.000 description 2
- 108010041921 Glycerolphosphate Dehydrogenase Proteins 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- 241000168517 Haematococcus lacustris Species 0.000 description 2
- 101700042226 INHA Proteins 0.000 description 2
- 241000726047 Infectious flacherie virus Species 0.000 description 2
- 102100002327 LPCAT3 Human genes 0.000 description 2
- 102100003688 MBOAT2 Human genes 0.000 description 2
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 2
- FBPFZTCFMRRESA-KVTDHHQDSA-N Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 2
- 241000502321 Navicula Species 0.000 description 2
- 108090000913 Nitrate Reductases Proteins 0.000 description 2
- 241000180701 Nitzschia <flatworm> Species 0.000 description 2
- 102100018840 PAX7 Human genes 0.000 description 2
- 101700064753 PAX7 Proteins 0.000 description 2
- 241000425347 Phyla <beetle> Species 0.000 description 2
- 208000008425 Protein Deficiency Diseases 0.000 description 2
- 102000012751 Pyruvate Dehydrogenase Complex Human genes 0.000 description 2
- 108010090051 Pyruvate Dehydrogenase Complex Proteins 0.000 description 2
- 241000195663 Scenedesmus Species 0.000 description 2
- 229920000972 Sense strand Polymers 0.000 description 2
- 241001648840 Thosea asigna virus Species 0.000 description 2
- 108090000848 Ubiquitin Proteins 0.000 description 2
- 102400000757 Ubiquitin Human genes 0.000 description 2
- 241000206764 Xanthophyceae Species 0.000 description 2
- 150000001413 amino acids Chemical class 0.000 description 2
- 101710025596 betaTub85D Proteins 0.000 description 2
- 108010006025 bovine growth hormone Proteins 0.000 description 2
- 239000001913 cellulose Substances 0.000 description 2
- 229920002678 cellulose Polymers 0.000 description 2
- 229930002869 chlorophyll b Natural products 0.000 description 2
- 230000004186 co-expression Effects 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 230000000102 heterotrophic Effects 0.000 description 2
- 239000000594 mannitol Substances 0.000 description 2
- 235000010355 mannitol Nutrition 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- -1 methylimino Chemical group 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 2
- 230000001105 regulatory Effects 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 230000002194 synthesizing Effects 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N β-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 2
- QRBLKGHRWFGINE-UGWAGOLRSA-N 2-[2-[2-[[2-[[4-[[2-[[6-amino-2-[3-amino-1-[(2,3-diamino-3-oxopropyl)amino]-3-oxopropyl]-5-methylpyrimidine-4-carbonyl]amino]-3-[(2R,3S,4S,5S,6S)-3-[(2S,3R,4R,5S)-4-carbamoyl-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-4,5-dihydroxy-6-(hydroxymethyl)- Chemical compound N=1C(C=2SC=C(N=2)C(N)=O)CSC=1CCNC(=O)C(C(C)=O)NC(=O)C(C)C(O)C(C)NC(=O)C(C(O[C@H]1[C@@]([C@@H](O)[C@H](O)[C@H](CO)O1)(C)O[C@H]1[C@@H]([C@](O)([C@@H](O)C(CO)O1)C(N)=O)O)C=1NC=NC=1)NC(=O)C1=NC(C(CC(N)=O)NCC(N)C(N)=O)=NC(N)=C1C QRBLKGHRWFGINE-UGWAGOLRSA-N 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N 289-95-2 Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 241001607836 Achnanthes Species 0.000 description 1
- UNFWWIHTNXNPBV-WXKVUWSESA-N Actinospectacin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 1
- 241000432074 Adeno-associated virus Species 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 206010001935 American trypanosomiasis Diseases 0.000 description 1
- 241000611184 Amphora Species 0.000 description 1
- 240000002900 Arthrospira platensis Species 0.000 description 1
- 241001465318 Aspergillus terreus Species 0.000 description 1
- 241000132092 Aster Species 0.000 description 1
- 241001425589 Auxenochlorella Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 241001536324 Botryococcus Species 0.000 description 1
- 101710005912 CDH2 Proteins 0.000 description 1
- 241000227752 Chaetoceros Species 0.000 description 1
- 241000195654 Chlorella sorokiniana Species 0.000 description 1
- 241000180279 Chlorococcum Species 0.000 description 1
- 241001608005 Cocconeis Species 0.000 description 1
- 229920002676 Complementary DNA Polymers 0.000 description 1
- 241001466512 Coscinodiscus Species 0.000 description 1
- 241000199913 Crypthecodinium Species 0.000 description 1
- 241001147476 Cyclotella Species 0.000 description 1
- 241000206743 Cylindrotheca Species 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 241000305506 Desmodesmus Species 0.000 description 1
- GRRNUXAQVGOGFE-UHFFFAOYSA-N Destomysin Chemical compound OC1C(NC)CC(N)C(O)C1OC1C2OC3(C(C(O)C(O)C(C(N)CO)O3)O)OC2C(O)C(CO)O1 GRRNUXAQVGOGFE-UHFFFAOYSA-N 0.000 description 1
- 108020001991 EC 1.3.3.4 Proteins 0.000 description 1
- 102000005135 EC 1.3.3.4 Human genes 0.000 description 1
- 108010055012 EC 4.1.1.23 Proteins 0.000 description 1
- 102000010719 EC 4.2.99.18 Human genes 0.000 description 1
- 108010063362 EC 4.2.99.18 Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 206010014599 Encephalitis Diseases 0.000 description 1
- 241000195623 Euglenida Species 0.000 description 1
- 108010042634 F2A4-K-NS peptide Proteins 0.000 description 1
- 241000692360 Fistulifera Species 0.000 description 1
- 241001105002 Fragilariopsis Species 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- 102100004985 GUSB Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- 239000005562 Glyphosate Substances 0.000 description 1
- XDDAORKBJWWYJS-UHFFFAOYSA-O Glyphosate Chemical compound OC(=O)C[NH2+]CP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-O 0.000 description 1
- 241001499732 Gyrosigma Species 0.000 description 1
- 241000206759 Haptophyceae Species 0.000 description 1
- 208000006454 Hepatitis Diseases 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 101710006546 Hsp70Aa Proteins 0.000 description 1
- 101710006547 Hsp70Ab Proteins 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- 229940097277 Hygromycin B Drugs 0.000 description 1
- 241001501885 Isochrysis Species 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N Kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- 241001466453 Laminaria Species 0.000 description 1
- 241001534525 Lampriscus Species 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- 241000708386 Ljungan virus Species 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 108060001084 Luciferase family Proteins 0.000 description 1
- 241001491708 Macrocystis Species 0.000 description 1
- LTYOQGRJFJAKNA-KKIMTKSISA-N Malonyl CoA Natural products S(C(=O)CC(=O)O)CCNC(=O)CCNC(=O)[C@@H](O)C(CO[P@](=O)(O[P@](=O)(OC[C@H]1[C@@H](OP(=O)(O)O)[C@@H](O)[C@@H](n2c3ncnc(N)c3nc2)O1)O)O)(C)C LTYOQGRJFJAKNA-KKIMTKSISA-N 0.000 description 1
- 230000036740 Metabolism Effects 0.000 description 1
- 230000035633 Metabolized Effects 0.000 description 1
- 241000013738 Monochrysis Species 0.000 description 1
- 241001478792 Monoraphidium Species 0.000 description 1
- 241000196305 Nannochloris Species 0.000 description 1
- 241000509521 Nannochloropsis sp. Species 0.000 description 1
- 241000195644 Neochloris Species 0.000 description 1
- 241001442227 Nephroselmis Species 0.000 description 1
- 241000059630 Nodularia <Cyanobacteria> Species 0.000 description 1
- 241000192656 Nostoc Species 0.000 description 1
- 241000326556 Odontella <springtail> Species 0.000 description 1
- 229920000272 Oligonucleotide Polymers 0.000 description 1
- 241000514008 Oocystis Species 0.000 description 1
- 241000658540 Ora Species 0.000 description 1
- 241001221668 Ostreococcus tauri Species 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 239000005590 Oxyfluorfen Substances 0.000 description 1
- OQMBBFQZGJFLBU-UHFFFAOYSA-N Oxyfluorfen Chemical compound C1=C([N+]([O-])=O)C(OCC)=CC(OC=2C(=CC(=CC=2)C(F)(F)F)Cl)=C1 OQMBBFQZGJFLBU-UHFFFAOYSA-N 0.000 description 1
- 101700052034 PPOB Proteins 0.000 description 1
- 101700006424 PPOD Proteins 0.000 description 1
- 101700017379 PPOE Proteins 0.000 description 1
- 101700039848 PPOF Proteins 0.000 description 1
- 102100018171 PPOX Human genes 0.000 description 1
- 101700040325 PPOX Proteins 0.000 description 1
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N PUROMYCIN Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 1
- 229950010131 PUROMYCIN Drugs 0.000 description 1
- 241000206766 Pavlova Species 0.000 description 1
- 102000035443 Peptidases Human genes 0.000 description 1
- 108091005771 Peptidases Proteins 0.000 description 1
- 108010035235 Phleomycins Proteins 0.000 description 1
- 241000709664 Picornaviridae Species 0.000 description 1
- 241000722208 Pleurochrysis Species 0.000 description 1
- 229920002873 Polyethylenimine Polymers 0.000 description 1
- 241001672814 Porcine teschovirus 1 Species 0.000 description 1
- 241000206609 Porphyra Species 0.000 description 1
- 241000206618 Porphyridium Species 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 241000192511 Pseudanabaena Species 0.000 description 1
- KDCGOANMDULRCW-UHFFFAOYSA-N Purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 1
- 241001509341 Pyramimonas Species 0.000 description 1
- 101710007435 RP883 Proteins 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 241001148696 Stichococcus Species 0.000 description 1
- 241000187310 Streptomyces noursei Species 0.000 description 1
- 102000003673 Symporters Human genes 0.000 description 1
- 108090000088 Symporters Proteins 0.000 description 1
- 241000192707 Synechococcus Species 0.000 description 1
- 241000192584 Synechocystis Species 0.000 description 1
- 241000196321 Tetraselmis Species 0.000 description 1
- 241000710209 Theiler's encephalomyelitis virus Species 0.000 description 1
- 241000096130 Toxopus brucei Species 0.000 description 1
- 241000192118 Trichodesmium Species 0.000 description 1
- 241000223109 Trypanosoma cruzi Species 0.000 description 1
- 101710028079 UMAG_05828 Proteins 0.000 description 1
- 102100005023 UMPS Human genes 0.000 description 1
- 241001261505 Undaria Species 0.000 description 1
- UOZODPSAJZTQNH-LSWIJEOBSA-N Zygomycin A1 Chemical compound N[C@@H]1[C@@H](O)[C@H](O)[C@H](CN)O[C@@H]1O[C@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](N)C[C@@H](N)[C@@H]2O)O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)N)O[C@@H]1CO UOZODPSAJZTQNH-LSWIJEOBSA-N 0.000 description 1
- 101710029159 aadA Proteins 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 101710025562 alphaTub85E Proteins 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-N ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 1
- 150000003863 ammonium salts Chemical class 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000000692 anti-sense Effects 0.000 description 1
- 101700069820 baeD Proteins 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 235000010633 broth Nutrition 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 230000035425 carbon utilization Effects 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 230000002759 chromosomal Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000003247 decreasing Effects 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-K dioxido-sulfanylidene-sulfido-$l^{5}-phosphane Chemical compound [O-]P([O-])([S-])=S NAGJZTKCGNOGPW-UHFFFAOYSA-K 0.000 description 1
- 229910001882 dioxygen Inorganic materials 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000000909 electrodialysis Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000004136 fatty acid synthesis Effects 0.000 description 1
- 108091006031 fluorescent proteins Proteins 0.000 description 1
- 102000034387 fluorescent proteins Human genes 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037240 fusion proteins Human genes 0.000 description 1
- 238000000769 gas chromatography-flame ionisation detection Methods 0.000 description 1
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 230000002068 genetic Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 229940097068 glyphosate Drugs 0.000 description 1
- 231100000283 hepatitis Toxicity 0.000 description 1
- 230000002363 herbicidal Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 101710013253 hup Proteins 0.000 description 1
- 238000009776 industrial production Methods 0.000 description 1
- 239000003456 ion exchange resin Substances 0.000 description 1
- 229920003303 ion-exchange polymer Polymers 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 1
- WHXSMMKQMYFTQS-UHFFFAOYSA-N lithium Chemical compound [Li] WHXSMMKQMYFTQS-UHFFFAOYSA-N 0.000 description 1
- 229910052744 lithium Inorganic materials 0.000 description 1
- 230000002934 lysing Effects 0.000 description 1
- LTYOQGRJFJAKNA-DVVLENMVSA-N malonyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-DVVLENMVSA-N 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 230000035786 metabolism Effects 0.000 description 1
- 125000000325 methylidene group Chemical group [H]C([H])=* 0.000 description 1
- YACKEPLHDIMKIO-UHFFFAOYSA-L methylphosphonate(2-) Chemical compound CP([O-])([O-])=O YACKEPLHDIMKIO-UHFFFAOYSA-L 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000006011 modification reaction Methods 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- XTGGILXPEMRCFM-UHFFFAOYSA-N morpholin-4-yl carbamate Chemical compound NC(=O)ON1CCOCC1 XTGGILXPEMRCFM-UHFFFAOYSA-N 0.000 description 1
- 238000001728 nano-filtration Methods 0.000 description 1
- 239000002105 nanoparticle Substances 0.000 description 1
- 101700077740 natR Proteins 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 230000001264 neutralization Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- NVGOPFQZYCNLDU-UHFFFAOYSA-N norflurazon Chemical compound O=C1C(Cl)=C(NC)C=NN1C1=CC=CC(C(F)(F)F)=C1 NVGOPFQZYCNLDU-UHFFFAOYSA-N 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 108010086644 oleoyl-(acyl-carrier-protein) hydrolase Proteins 0.000 description 1
- 150000002894 organic compounds Chemical class 0.000 description 1
- 229960001914 paromomycin Drugs 0.000 description 1
- 235000012162 pavlova Nutrition 0.000 description 1
- 230000000737 periodic Effects 0.000 description 1
- 150000004713 phosphodiesters Chemical class 0.000 description 1
- PTMHPRAIXMAOOB-UHFFFAOYSA-L phosphoramidate Chemical compound NP([O-])([O-])=O PTMHPRAIXMAOOB-UHFFFAOYSA-L 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 101700018374 pksD Proteins 0.000 description 1
- 229920003219 poly( p-phenylene oxide) Polymers 0.000 description 1
- 229920000117 poly(dioxanone) Polymers 0.000 description 1
- 229920001601 polyetherimide Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 101700080086 ppoA Proteins 0.000 description 1
- 101700053503 ppoC Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000000284 resting Effects 0.000 description 1
- 229920002477 rna polymer Polymers 0.000 description 1
- 230000003584 silencer Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 229960000268 spectinomycin Drugs 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- IIACRCGMVDHOTQ-UHFFFAOYSA-M sulfamate Chemical compound NS([O-])(=O)=O IIACRCGMVDHOTQ-UHFFFAOYSA-M 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 210000001519 tissues Anatomy 0.000 description 1
- 230000002103 transcriptional Effects 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 108091005946 yellow fluorescent protein Proteins 0.000 description 1
Abstract
The present application relates to an expression system for multigene overexpression in microalgae, which expression system comprises at least two nucleic acid expression cassettes, wherein each expression cassette comprises a promoter operably linked to three or more transgenes connected to one another by at least one sequence encoding a 2A peptide (i.e. a multicistronic construct). Also disclosed herein are vector systems comprising said expression systems, host cells transformed with said expression systems or comprising said vector systems, methods for producing these host cells, as well as their use for biosynthesis.
Description
MULTIGENE EXPRESSION IN MICROALGAE
FIELD OF THE INVENTION
The présent invention is directed to genetic engineering of microalgae, and in particular to engineering microalgae with multiple genes.
BACKGROUND
Multigene overexpression in eukaryotic microalgae is nowadays limited amongst others by the availability of only a small group of suitable promoters and a limited set of suitable selectable marker genes.
One approach that has been used in microalgae, in particular in Phaeodactylum tricornutum, to co-express two genes was the use of the same promoter for the different genes (Hamilton et al., 2014 Metab Eng. 22:3-9). It has however drawbacks for the coexpression of a larger number of genes, namely, that nucleic acid sequences with a large number of identical promoters often stick together and become unstable. Moreover, in general promoters increase the size ofthe nucleic acid construct, which may then become too large to be introduced in the host cell in a single step. Another strategy for bi-cistronic gene expression was shown in Chlamydomonas (Muto et al., 2009 BMC Biotechnology 9:26), wherein the 2 genes were genetically linked to resuit in a fusion protein that is cleavable by an endogenous protein. Yet another technology that has proven to be useful for multigene expression in microalgae comprises the expression of several proteins from a single open reading frame, wherein each protein is separated by so-called 2A sequences (Ryan et al., 1991 J Gen Virol. 72:2727-32). When the ribosome translates the 2A sequence, it releases the nascent peptide and continues translation of the downstream sequence. As a resuit, several different, separate proteins can be formed from a single open reading frame. This 2A sequence approach, more particularly using the foot-andmouth disease virus (FMDV) 2A peptide, was successfully applied in Chlamydomonas to overexpress two genes (Rasala et al. 2012 PLoS One 7:e43349). In 2014, the same authors described the use of 2 different 2A peptides, in particular F2A and E2A, in the same construct to genetically engineer Chlamydomonas (Rasala et al. 2014 PLoS One 9:e94028). They did engineering with reporter genes only however. They also used the résistance in their multigene expression cassette. In WO 2014026770, the use of 2A sequences for multigene insertion in microalgae species including Nannochloropsis and the diatom Phaeodactylum is described. It is reported that 2A peptides can be used to express two or more (up to 20 or more) functional proteins from a single mRNA. More than two 2A sequences could be used to increase the number of genes under the control of the same promoter. This increases however the size of the mRNA to be transcribed and could lead to exhaustion of the ribosome. In this case, prématuré stop in transcription could occur which would resuit in no synthesis of some proteins downstream.
In view of the above, it is clear that there remains a need in the art for multigene engineering in microalgae.
SUMMARY OF THE INVENTION
The instant invention aims to provide a system for multigene engineering in microalgae.
The inventors hâve identified particular methods involving the use of self-cleaving viral 2A peptides or 2A-like peptides that allow for efficient multigene overexpression in microalgae. The présent invention is in particular captured by any one or any combination of one or more of the below numbered aspects and embodiments (i) to wherein:
The présent invention is in particular captured by any one or any combination of one or more of the below numbered aspects and embodiments (i) to (xvi) wherein:
(i) A multigene expression system comprising at least two nucleic acid expression cassettes, wherein each expression cassette comprises a promoter operably linked to three or more transgenes connected to one another by at least one sequence encoding a 2A peptide.
(ii) The expression system according to (i), wherein each expression cassette comprises a promoter operably linked to three or more transgenes connected to one another by at least two successive sequences encoding a 2A peptide.
(iii) The expression system according to (i) or (ii), wherein each expression cassette comprises three transgenes.
(iv) The expression system according to any one of (i) to (iii), wherein the promoters of the expression cassettes are the same.
(v) The expression system according to any one of (i) to (iv), wherein the 2A peptide is derived from foot-and-mouth disease virus (FMDV 2A or F2A).
(vi) The expression system according to any one of (i) to (v), further comprising one or more nucleic acid expression cassettes comprising a selectable marker gene.
(vii) The expression system according to any one of (i) to (vi), wherein the transgenes encode for enzymes involved in a biosynthetic pathway.
(viii) The expression system according to (vii), wherein the transgenes encode enzymes involved in the fatty acid biosynthetic pathway.
(ix) A vector System comprising the expression system according to any one of (i) to (viii), said vector system comprising at least two vectors, wherein each vector comprises one of said at least two nucleic acid expression cassettes comprising a promoter operably linked to three or more transgenes connected to one another by at least one sequence encoding a 2A peptide.
(x) The vector system according to (ix), wherein each vector further comprises a nucleic acid expression cassette comprising a selectable marker gene.
(xi) The vector system according to (ix) or (x), wherein the vectors are plasmids.
(xii) A host cell comprising the expression system according to any one of (i) to (viii) or the vector system according to any one of (ix) to (xi).
(xiii) The host cell according to xii, wherein the host cell is a microalga, preferably a diatom such as Phaeodactylum tricornutum, or a Nannochloropsis species.
(xiv) A method for genetically modifying a host cell with multiple genes comprising the following steps:
- providing a host cell, and
- transforming the host cell with at least two nucleic acid expression cassettes, wherein each expression cassette comprises a promoter operably linked to three or more transgenes connected to one another by at least one sequence encoding a 2A peptide, and optionally one or more nucleic acid expression cassettes comprising a selectable marker gene.
(xv) The method according to (xiv), wherein the at least two nucleic acid expression cassettes and optionally the one or more nucleic acid expression cassettes comprising a selectable marker gene are co-transformed into the host cell.
(xvi) The method according to (xiv) or (xv), further comprising the step of selecting the host cells which hâve been transformed with said at least two nucleic acid cassettes and said one or more nucleic acid expression cassettes comprising a selectable marker gene by culturing the host cells on a sélective medium, wherein the ability of a host cell to be cultured on the sélective medium is dépendent on the expression ofthe selectable marker gene.
BRIEF DESCRIPTION OF THE FIGURES
The teaching of the application is illustrated by the following Figures which are to be considered as illustrative only and do not in any way limit the scope ofthe daims.
Figure 1: Schematic illustration of constructs for transformation in Nannochloropsis. UEP: reference vector comprising only the shble gene. pMA01 : vector comprising bsd and shble genes linked with 2A-linker. pMA02: vector comprising bsd, nat1 and shbe genes linked with 2A-linker. P= promoter; T= terminator; L=linker.
Figure 2: Spot test on f/2 agar plate without (f/2 control) or with 7 pg/ml zeocin (Zeo 7), 100 pg/ml blasticidin (Bsd 100), or 500 pg/ml nourseothricin (Nat 500) with WT Nannochloropsis (ΉΤ), or Nannochloropsis transformed with UEP vector, pMA01 vector, or pMA02 vector.
Figure 3: Schematic illustration of constructs for co-transformation (pMA03+pMA04) or transformation (pMA05) in Nannochloropsis. pMA03: construct comprising gene 1, gene 2 and shble gene linked with 2A-linker. pMA04: construct comprising gene 3, gene 4 and bsd gene linked with 2A-linker. pMA05: construct comprising gene 1, gene 2, shble gene, gene 3, gene 4 and bsd gene linked with 2A-linker. P= promoter; T= terminator; L=linker.
DETAILED DESCRIPTION OF THE INVENTION
Unless otherwise defined, ail terms used in disclosing the invention, including technical and scientific terms, hâve the meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. By means of further guidance, term définitions are included to better appreciate the teaching of the présent invention.
As used herein, the singular forms a, an, and the include both singular and plural referents unless the context clearly dictâtes otherwise.
The terms comprising, comprises and comprised of' as used herein are synonymous with including, includes or containing, contains, and are inclusive oropen-ended and do not exclude additional, non-recited members, éléments or method steps. Where reference is made to embodiments as comprising certain éléments or steps, this encompasses also embodiments which consist essentially of the recited éléments or steps.
The recitation of numerical ranges by endpoints includes ail numbers and fractions subsumed within the respective ranges, as well as the recited endpoints.
The term about as used herein when referring to a measurable value such as a parameter, an amount, a temporal duration, and the like, is meant to encompass variations of +/-10% or less, preferably +/-5% or less, more preferably +/-1% or less, and still more preferably +/-0.1% or less of and from the specified value, insofar such variations are appropriate to perform in the disclosed invention. It is to be understood that the value to which the modifier about refers is itself also specifically, and preferably, disclosed.
Ail documents cited in the présent spécification are hereby incorporated by reference in their entirety. In particular, the teachings of ail documents herein specifically referred to are incorporated by reference.
Standard reference work setting forth the general principles of recombinant DNA technology include Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, ed. Sambrook et al., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989; Current Protocols in Molecular Biology, ed. Ausubel et al., Greene Publishing and WileyInterscience, New York, 1992 (with periodic updates).
The terms polynucleotide and nucleic acid are used interchangeably herein and generally refer to a polymer of any length composed essentially of nucléotides, e.g., deoxyribonucleotides and/or ribonucleotides. Nucleic acids can comprise purine and/or pyrimidine bases, and/or other natural, chemically or biochemically modified (e.g., methylated), non-natural, or derivatised nucléotide bases. The backbone of nucleic acids can comprise sugars and phosphate groups, as can typically be found in RNA or DNA, and/or one or more modified or substituted (such as, 2'-O-alkylated, e.g., 2'-O-methylated or 2'-O-ethylated; or2'-O,4'-C-alkynelated, e.g., 2'-O,4'-C-ethylated) sugars or one or more modified or substituted phosphate groups. For example, backbone analogues in nucleic acids may include phosphodiester, phosphorothioate, phosphorodithioate, methylphosphonate, phosphoramidate, alkyl phosphotriester, sulfamate, 3'-thioacetal, methylene (methylimino), 3'-N-carbamate, morpholino carbamate, and peptide nucleic acids (PNAs). The term nucleic acid further specifically encompasses DNA, RNA and DNA/RNA hybrid molécules, specifically including hnRNA, pre-mRNA, mRNA, cDNA, genomic DNA, gene, amplification products, oligonucleotides, and synthetic (e.g. chemically synthesised) DNA, RNA or DNA/RNA hybrids. The terms ribonucleic acid and RNA as used herein mean a polymer of any length composed of ribonucleotides. The terms deoxyribonucleic acid and DNA as used herein mean a polymer of any length composed of deoxyribonucleotides. The term DNA/RNA hybrid as used herein mean a polymer of any length composed of one or more deoxyribonucleotides and one or more ribonucleotides. A nucleic acid can be naturally occurring, e.g., présent in or isolated from nature, can be recombinant, i.e., produced by recombinant DNA technology, and/or can be, partly or entirely, chemically or biochemically synthesized. A nucleic acid can be double-stranded, partly double stranded, or single-stranded. Where single-stranded, the nucleic acid can be the sense strand or the antisense strand. In addition, nucleic acid can be circular or linear.
As used herein, the term “nucleic acid expression cassette” refers to nucleic acid molécules that include one or more transcriptional control éléments (such as, but not limited to promoters, enhancers, polyadenylation sequences, and introns) that direct expression of a (trans)gene(s) to which they are operably linked.
The term “operably linked” as used herein refers to the arrangement of various nucleic acid molécule éléments relative to each such that the éléments are functionally connected and are able to interact with each other in the context of gene expression. Such éléments may include, without limitation, a promoter, an enhancer, a polyadenylation sequence, one or more introns, and a coding sequence of a gene of interest to be expressed (e.g., the (trans)gene). The nucleic acid sequence éléments, when properly oriented or operably linked, act together to ensure ormodulate expression of the coding sequence. By modulate is meant increasing, decreasing, or maintaining the level of activity of a particular element. The position of each element relative to other éléments may be expressed in terms of the 5' terminus and the 3' terminus of each element, and the distance between any particular éléments may be referenced by the number of intervening nucléotides, or base pairs, between the éléments.
The term “transgene” or “(trans)gene” as used herein refers to particular nucleic acid sequences encoding a polypeptide or a portion of a polypeptide to be expressed in a host cell into which the nucleic acid sequence is introduced. How the nucleic acid sequence is introduced into a host cell is not essential, it may for instance be through intégration in the genome or as an episomal plasmid. The term “transgene” is meant to include (1) a nucleic acid sequence that is not naturally found in the host cell (i.e., a heterologous nucleic acid sequence); (2) a nucleic acid sequence that is a mutant form of a nucleic acid sequence naturally found in the host cell into which it has been introduced ; (3) a nucleic acid sequence that serves to add additional copies of the same (i.e., homologous) or a similar nucleic acid sequence naturally occurring in the host cell into which it has been introduced; or (4) a silent naturally occurring or homologous nucleic acid sequence whose expression is induced in the host cell into which it has been introduced. Accordingly, a “transgene” is characterized by the fact that it does not naturally occur in the same location in the host cell. By “mutant form” is meant a nucleic acid sequence that contains one or more nucléotides that are different from the wild-type or naturally occurring sequence, i.e., the mutant nucleic acid sequence contains one or more nucléotide substitutions, délétions, and/or insertions.
The term “cistron” generally refers to nucleic acid sequences encoding a gene product (such as a protein or RNA molécule) and including upstream and downstream transcriptional control éléments. As used herein, the term “multicistron” refers to multiple nucleic acid sequences encoding gene products and including upstream and downstream transcriptional control éléments. Typical for a multicistron is that the multiple coding sequences are under the control of a single promoter. The term “tricistron as used herein specifically refers to a multicistron comprising three coding sequences.
The term “2A peptide” (also referred to as CHYSEL or cis-acting hydrolase element) refers to a viral sequence of about 18 to 22 amino acids which upon translation, médiates rapid intramolecular (cis) cleavage of a protein or polypeptide comprising the peptide to yield discrète mature proteins or polypeptides; this cleavage does not require any additional factors like proteases. The term 2A peptide as used herein also includes any modification of the sequence of the 2A peptide which may improve, increase or hâve a neutral effect regarding the functionality of the 2A peptide.
As used in the application, the term “promoter” refers to a nucleic acid sequence capable of binding RNA polymerase and that initiâtes the transcription of one or more nucleic acid coding sequences to which it is operably linked (e.g., a transgene). A promoter is usually located near the transcription start site of a gene on the same strand and upstream on the nucléotide coding sequence (5' in the sense strand). A promoter may function alone to regulate transcription or may be further regulated by one or more regulatory sequences (e.g. enhancers or silencers).
The term transcription termination sequence encompasses a control sequence at the end of a transcriptional unit, which signais 3' processing and termination of transcription.
As used herein, the term selectable marker gene includes any gene, which confers a phenotype on a host cell in which it is expressed to facilitate the identification and/or sélection of host cells which are transfected or transformed with a transgene.
By “vector” is meant a polynucleotide molécule, preferably a DNA molécule derived, for example, from a plasmid, bactériophage, or plant virus, into which a polynucleotide can be inserted or cloned. A vector preferably contains one or more unique restriction sites and can be capable of autonomous réplication in a defined host cell, or can be integrated within the genome ofthe defined host such thatthe cloned sequence is reproducible. The choice of the vector will typically dépend on the compatibility of the vector with the host cell into which the vector is to be introduced.
As used herein, the term “host cell” refers to those cells used for transformation, i.e. for expression of transgenes. A host cell may be an isolated cell or a cell line grown in culture, or a cell which résides in a living tissue or organism. In the context ofthe présent invention, the host cells are preferably cells that are capable of growth in culture.
The term “microalgae” as used herein refers to microscopie algae. “Microalgae” encompass, without limitation, organisme within (i) several eukaryotic phyla, including the Rhodophyta (red algae), Chlorophyta (green algae), Dinoflagellata, Haptophyta, (ii) several classes from the eukaryotic phylum Heterokontophyta which includes, without limitation, the classes Bacillariophycea (diatoms), Eustigmatophycea, Phaeophyceae (brown algae), Xanthophyceae (yellow-green algae) and Chrysophyceae (golden algae), and (iii) the prokaryotic phylum Cyanobacteria (blue-green algae). The term microalgae includes for example généra selected from: Achnanthes, Amphora, Anabaena, Anikstrodesmis, Arachnoidiscusm, Aster, Botryococcus, Chaetoceros, Chlamydomonas, Chlorella, Chlorococcum, Chorethron, Cocconeis, Coscinodiscus, Crypthecodinium, Cyclotella, Cylindrotheca, Desmodesmus, Dunaliella, Emiliana, Euglena, Fistulifera, Fragilariopsis, Gyrosigma, Hematococcus, Isochrysis, Lampriscus, Monochrysis, Monoraphidium, Nannochloris, Nannochloropsis, Navicula, Neochloris, Nephrochloris, Nephroselmis, Nitzschia, Nodularia, Nostoc, Odontella, Oochromonas, Oocystis, Oscillartoria, Pavlova, Phaeodactylum, Playtmonas, Pleurochrysis, Porhyra, Pseudoanabaena, Pyramimonas, Scenedesmus, Schyzochitrium, Stichococcus, Synechococcus, Synechocystis, Tetraselmis, Thalassiosira, and Trichodesmium.
The term transformation means introducing an exogenous nucleic acid into an organism so that the nucleic acid is replicable, either as an extrachromosomal element or by chromosomal intégration.
The présent application generally relates to multigene engineering, and more particularly to multigene overexpression, in microalgae.
More particularly, the application provides expression Systems for multigene overexpression in microalgae and vector Systems comprising said expression Systems, host cells transformed with said expression Systems or comprising said vector Systems, methods for producing these host cells, as well as their use for biosynthetic processes. The inventors hâve found that the use of at least two multicistronic expression cassettes such as tricistronic expression cassettes, wherein the transgenes are connected to one another by at least one sequence encoding a 2A peptide, and preferably driven by the same promoter, allows for efficient expression of multiple genes in microalgae, overcoming the existing limitations for multigene overexpression in these organisms. The different aspects of the invention are detailed herein below.
Expression system
The expression Systems provided in the context of the présent invention comprise at least two different multicistronic such as tricistronic expression cassettes. Accordingly, the expression system provided herein comprises at least a first nucleic acid expression cassette and a second nucleic acid expression cassette, wherein said first and said second expression cassettes are not copies of each other, and wherein each expression cassette comprises a promoter operably linked to three or more transgenes connected to one another by at least one sequence encoding a 2A peptide.
The multicistronic expression cassettes envisaged herein comprise three or more, such as three, four, five, six or more, transgenes of interest, which are under the control of a single promoter, and which are connected to one another by at least one sequence encoding a 2A peptide. In certain embodiments, the multicistronic expression cassettes are tricistronic expression cassettes which comprise three transgenes of interest under the control of a single promoter, wherein said three transgenes are connected to one another by at least one sequence encoding a 2A peptide.
The expression system of the présent invention is characterized in that it comprises at least two of these multicistronic or tricistronic expression cassettes. The two or more multicistronic or tricistronic expression cassettes are different and thus not copies of each other. The two or more multicistronic expression cassettes may comprise an equal number of transgenes, or the number of transgenes comprised in the two or more expression cassettes may be different.
Each of these multicistronic or tricistronic expression cassettes is operably linked to a promoter and the promoter for the different multicistronic or tricistronic expression cassettes may be the same or different. In certain embodiments, the promoters of the multicistronic or tricistronic expression cassettes of the expression system of the invention are the same. This is advantageous for expression in microalgae, for which only a limited list of suitable promoters is available. In this way a significant number of genes (at least 6) can be expressed simultaneously without the disadvantages of the use of multiple promoters which increase the size of the construct and may further compromise the stability of the construct where the use of identical promoters is envisaged.
The 2A peptide used in the expression Systems envisaged herein may be derived from a mammalian virus such as foot and mouth disease virus (FMDV), cardiovirus encephalomyocarditis virus (EMCV), Theiler’s murine encephalitis (TMEV), bovine type C rotavirus (BRCV), Porcine type C rotavirus (PRCV), Human type C rotavirus (HRCV), equine rhinitis A virus (ERAV), equine rhinitis B virus (ERBV) and porcine teschovirus-1 (PTV-1; formerly porcine enteovirus-1). The 2A peptide may also be derived from an insect virus selected from the group comprising Thoseaasigna virus (TaV), infectious flacherie virus (IFV), Drosophila C virus (DCV) acute bee paralysis virus (ABPV) and cricket paralysis virus (CrPV). The 2A peptide may also be derived from Trypansoma spp., including T. brucei (TSR1) and T. cruzi (AP endonuclease) as described in Ryan et al. (2002, in Molecular Biology of Picornaviruses Ed. Semler and Wimmer, p. 213-223) or from Ljungan virus (174F, 145SL, 87-012, M1146). Preferably, the 2A peptide is derived from FMDV or EMCV. In embodiments, the sequence encoding the 2A peptide is the FMDV 2A sequence (APVKQTLNFDLLKLAGDVESNPGP, SEQ ID NO:1).
The transgenes in the multicistronic expression cassettes such as the tricistronic expression cassettes envisaged herein are connected to one another by at least one sequence encoding a 2A peptide. In certain embodiments, the transgenes are connected to one another by at least two, such as two, three, four, five or more, successive sequences encoding a 2A peptide. The use of two or more successive 2A increases the likelihood of cleavage of the transgene products.
In embodiments, the successive 2A sequences are the same. In other embodiments, at least one 2A sequence of the successive 2A sequences is different. The use of different 2A sequences reduces the likelihood of homologous recombination events.
In embodiments, the 2A peptides in each of the two or more multicistronic expression cassettes are the same. In other embodiments, the 2A peptides présent in the two or more multicistronic expression cassette are different. In particular embodiments, the 2A peptides used within one multicistronic expression cassette are the same, but are different from the 2A peptides used in the other of the two or more multicistronic expression cassettes. In particular embodiments, the 2A peptides used within one multicistronic expression cassette are different from each other but the same in each of the two or more multicistronic expression cassettes.
The transgenes that are envisaged for transformation using the expression System of the présent invention are not critical. Indeed the présent System can allow multigene (over)expression in microalgae independent of the nature of the transgene.
In particular embodiments, the transgenes encode enzymes involved in biosynthetic pathways. Indeed, multigene transformation is of particular interest in the context of introducing biosynthetic pathways into a host organism. The expression Systems of the présent invention allow for the simultaneous and co-localized expression of several genes relating to a biosynthetic pathway. The co-expression of different enzymes involved in subséquent steps of a biosynthetic pathway significantly furthers their efficiency.
For example, the transgenes in the multicistronic or tricistronic expression cassettes encode enzymes involved in the fatty acid biosynthetic pathway (also referred to as fatty acid enzymes herein). These multicistronic or tricistronic expression cassettes are of particular interest for the recombinant production of fatty acids, e.g. through the (over)expression of said multicistronic or tricistronic expression cassettes in a recombinant host cell, as detailed below. Exemplary genes involved in fatty acid synthesis include, without limitation genes encoding pyruvate dehydrogenase complex (PDH), acetyl-CoA carboxylase (ACCase), malonyl-CoA:ACP transacylase (MAT), 3-ketoacyl-ACP synthase (KAS), 3-ketoacyl-ACP reductase (KAR), 3-hydroxyacyl-ACP dehydratase (HD), enoylACP reductase (ENR), fatty acyl-ACP thioesterase (FAT), glycerol-3phosphateacyltransferase (GPAT), lyso-phosphatidicacidacyltransferase (LPAAT), lysophosphatidylcholineacyltransferase (LPAT), diacylglycerolacyltransferase (DAGAT), or glycerol-3-phosphate dehydrogenase (G3PDH), as described e.g. in Radakovits et al. (2010 Eukaryotic Cell 486:501).
Promoters envisaged in the context ofthe présent invention will be determined by its ability to direct expression in the host cell of interest. Preferably, the promoter is a promoter from microalgae. Exemplary promoters include, without limitation, those from Chlamydomonas reinhardtii, and from Chlorella species including Chlorella vulgaris, Nannochloropsis sp, Phaeodactylum tricornutum, Thalassiosira sp, Dunaliella salina and Haematococcus pluvialis. Non-limiting examples of suitable promoters are the Hsp70A promoter, the RbcS2 promoter and the beta-2-tubulin (TUB2) promoter from Chlamydomonas reinhardtii, the fucoxanthin chlorophyll a/b-binding protein (fcp) promoters, Histone 4 (H4) promoter from Phaeodactylum tricornutum, the Nitrate reductase (NR) promoter from Thalassiosira, and ubiquitin extension protein (UEP) from Nannochloropsis sp. In embodiments, the promoter in the multicistronic or tricistrionic expression cassettes envisaged herein is the Histone 4 (H4) promoter from Phaeodactylum tricornutum or ubiquitin extension protein (UEP) from Nannochloropsis gaditana.
Other sequences may be incorporated in the multicistronic or tricistronic expression cassettes according to the invention. More particularly the inclusion of sequences which further increase orstabilize the expression ofthetransgene products (e.g. introns and/ora transcription termination sequence) is envisaged.
In particular embodiments, the multicistronic or tricistronic expression cassettes further comprise a transcription termination sequence. Any polyadenylation signal that directs the synthesis of a polyA tail is useful in the multicistronic or tricistronic expression cassettes described herein, examples of those are well known to one of skill in the art. Exemplary polyadenylation signais include, but are not limited to, the polyadenylation signal derived from the Simian virus 40 (SV40) late gene, and the bovine growth hormone (BGH) polyadenylation signal, or the terminator région of the fucoxanthin chlorophyll a/b-binding protein (fcp) gene, such as the fcpA terminator. In embodiments, the fcpA terminator is used in the multicistronic or tricistronic expression cassettes envisaged herein.
Preferably, the expression Systems envisaged herein comprise a selectable marker gene. Said selectable marker gene is preferably not comprised in the multicistronic or tricistronic expression cassettes ofthe expression system (i.e. the transgenes ofthe multicistronic or tricistronic expression cassettes do not encode for a selectable marker), but in a separate nucleic acid expression cassette. Accordingly, in embodiments, the expression Systems envisaged herein further comprise one or more nucleic acid expression cassettes comprising a selectable marker gene. Expression of the selectable marker gene(s) may indicate that the host cell has been transformed with the multicistronic or tricistronic expression cassettes and hence, allows for selecting transformed host cells. The selectable marker cassette typically further includes a promoter and transcription terminator sequence, operatively linked to the selectable marker gene, and which are opérable in the host cell of choice.
Suitable markers may be selected from markers that confer antibiotic résistance, herbicid résistance, Visual markers, or markers that complément auxotrophic deficiencies of a host cell. For example, the sélection marker may confer résistance to an antibiotic such as hygromycin B (such as the hph gene), zeocin/phleomycin (such as the ble gene), kanamycin or G418 (such as the nptll or aphVII\ genes), spectinomycin (such as the aadA gene), neomycin (such as the aphVIII gene), blasticidin (such as the bsd gene), nourseothricin (such as the natR gene), puromycin (such as pac gene) and paromomycin (such as the aphVIII gene). In other examples, the sélection marker may confer résistance to a herbicide such as glyphosate (such as GAT gene), oxyfluorfen (such as protox/PPO gene) and norflurazon (such as PDS gene). Visual markers may also be used and include for example beta-glucuronidase (GUS), luciferase and fluorescent proteins such as Green Fluorescent Protein (GFP), Yellow Fluorescent protein, etc. Two prominent examples of auxotrophic deficiencies are the amino acid leucine deficiency (e.g. LEU2 gene) or uracil deficiency (e.g. URA3 gene). Cells that are orotidine-5'-phosphate decarboxylase négative (ura3-) cannot grow on media lacking uracil. Thus a functional URA3 gene can be used as a sélection marker on a host cell having a uracil deficiency, and successful transformants can be selected on a medium lacking uracil. Only cells transformed with the functional URA3 gene are able to synthesize uracil and grow on such medium. If the wild-type strain does not hâve a uracil deficiency, an auxotrophic mutant having the deficiency must be made in order to use URA3 as a sélection marker for the strain. Methods for accomplishing this are well known in the art.
Vector system
The expression cassettes envisaged herein may be used as such, or typically, they may be part of (i.e. introduced into) a nucleic acid vector. The at least two multicistronic or tricistronic expression cassettes ofthe expression system disclosed herein may be located on the same vector or on different vectors. The présent invention particularly envisages a vector system comprising at least two vectors, wherein each vector comprises only one of said at least two multicistronic ortricistronic expression cassettes. The vectors ofthe vector system envisaged herein may be the same or different.
In embodiments, the vectors disclosed herein further comprise an expression cassette comprising a selectable marker gene, such as an antibiotic résistance cassette.
The vectors disclosed herein may further include an origin of réplication that is required for maintenance and/or réplication in a spécifie cell type. One example is when a vector is required to be maintained in a host cell as an episomal genetic element (e.g. plasmid or cosmid molécule). Exemplary origins of réplication include, but are not limited to the f 1 -ori, colE1 ori, and Gram+ bacteria origins of réplication.
The vectors taught herein may further contain restriction sites of various types for linearization or fragmentation.
Numerous vectors are known to practitioners skilled in the art and any such vector may be used. Sélection of an appropriate vector is a matter of choice. The vector may be a nonviral or viral vector. Non-viral vectors include but are not limited to plasmids, cationic lipids, liposomes, nanoparticles, PEG, PEI, etc. Viral vectors are derived from viruses including but not limited to: retrovirus, lentivirus, adeno- associated virus, adenovirus, herpesvirus, hepatitis virus or the like. Preferred vectors for this invention are vectors developed for algae such as the vectors commonly known by the skilled person as pPha-T1, pPha-T1HSP, pPha-T1-TUB and pPhaT1-UEP.
Construction of the vectors described herein containing or including the multicistronic or tricistronic expression cassettes, and optionally the selectable marker cassettes, and one or more of the above listed components employs standard ligation techniques. For example, isolated plasmids may be cleaved, tailored, and re-ligated in the form desired to generate the plasmids required.
Host cells and methods for makinq same
A further aspect of the présent invention relates to a host cell comprising an expression System or a vector System according to the invention, which host cells are genetically modified with multiple (trans)genes.
The sélection of the host cell may be determined by the envisaged application. Particular examples of host cells which may be used in accordance with the présent invention are microalgae. Non-limiting examples of microalgae are Chlamydomonas reinhardtii strains, Chlorella species including Chlorella vulgaris, Chlorella sorokiniana and Chlorella (Auxenochlorella) protothecoides, Dunaliella salina, Haematococcus pluvialis, Ostreococcus tauri, Nannochloropsis species such as Nannochloropsis gaditana, Scenedesmus species, and diatoms such as Phaeodactylum species, e.g. Phaeodactylum tricornutum. More preferably, the microalga is a Nannochloropsis species or a diatom such as Phaeodactylum tricornutum.
The microalgae may be for example, but without limitation, microalgae growing in photoautotrophic, mixotrophic or heterotrophic conditions. Most microalgae are photoautotrophs, i.e. their growth is strictly dépendent on the génération of photosynthetically-derived energy. Their cultivation hence requires a relatively controlled environment with a large input of light energy. For certain industrial applications, it is advantageous to use heterotrophic microalgae, which can be grown in conventional fermenters. Accordingly, in embodiments the microalgae hâve been metabolically engineered to grow heterotrophically (i.e. to utilize exogenous organic compounds (such as glucose, acetate, etc.) as an energy or carbon source). A method for metabolically engineering microalgae to grow heterotrophically has been described in US Patent No. 7,939,710, which is specifically incorporated by reference herein. In particular embodiments, the microalgae are further genetically engineered to comprise a recombinant nucleic acid encoding a glucose transporter, preferably a glucose transporter selected from the group consisting of Glut 1 (human érythrocyte glucose transporter 1) and Hup1 (Chlorella HUP1 Monosaccharide-H+ Symporter). The glucose transportera facilitate the uptake of glucose by the host cell, allowing the cells to metabolize exogenous organic carbon and to grow independent of light. This is particularly advantageous for obligate phototrophic microalgae. Lists of phototrophs may be found in a review by Droop (1974. Heterotrophy of Carbon. In Algal Physiology and Biochemistry, Botanical Monographs, 10:530-559, ed. Stewart, University of California Press, Berkeley), and include, for example butwithout limitation, organisms ofthe phyla Cyanophyta (Blue-green algae), including the species Spirulina and Anabaena', Chlorophyta (Green algae), including the species Dunaliella, Chlamydomonas, and Heamatococcus; Rhodophyta (Red algae), including the species Porphyridium, Porphyra, Euchema, and Graciliaria; Phaeophyta (Brown algae), including the species, Macrocystis, Laminaria, Undaria, and Fucus-, Baccilariophyta (Diatoms), including the species Nitzschia, Navicula, Thalassiosira, and Phaeodactylurrr, Dinophyta (Dinoflagellates), including the species Gonyaulax-, Chrysophyta (Golden algae), including the species Irsochrysis and Nannochloropsis; Cryptophyla, including the species Cryptomonas-, and Euglenophyta, including the species Euglena.
Also provided herein are methods for obtaining a genetically engineered host cell as described herein, which method may comprise transforming, preferably co-transforming, a host cell with the at least two multicistronic or tricistronic expression cassettes or the at least two vectors each comprising one of said at least two multicistronic or tricistronic expression cassettes, as taught herein above. The method may further comprise the step of selecting the cells which hâve taken up the exogenous nucleic acids. In embodiments wherein the host cells are co-transformed with the at leasttwo vectors ofthe vectorSystem envisaged herein, said at least two vectors preferably comprise a different selectable marker gene.
Methods used herein for transformation ofthe host cells are well known to a skilled person. For example, electroporation and/or chemical (such as calcium chloride- or lithium acetatebased) transformation methods or Agrobacterium tumefaciens-medïated transformation methods as known in the art can be used.
The multicistronic or tricistronic expression cassettes or vectors disclosed herein may either be integrated into the genome of the host cell or they may be maintained in some form (such as a plasmid) extrachromosomally. A stably transformed host cell is one in which the exogenous nucleic acid has become integrated into a chromosome so that it is inherited by daughter cells through chromosome réplication.
Successful transformants can be selected for in known manner, e.g. by taking advantage of the attributes contributed by the marker gene, or by other characteristics resulting from the introduced coding sequences (such as ability to produce fatty acids). Screening can also be performed by PCR or Southern analysis to confirm that the desired insertions hâve taken place, to confirm copy number and to identify the point of intégration of coding sequences into the host genome.
Producing fatty acids using recombinant host cells
As detailed above, in particular embodiments, it is envisaged to introduce the expression Systems of the présent invention in the context of biosynthesis, such as fatty acid production. Accordingly, the présent invention also relates to the use of an expression system, a vector system or a host cell according to the invention, for biosynthesis, such as the industrial production of fatty acids.
In a further aspect, the invention provides methods for the production of fatty acids, which method comprises providing a genetically engineered host cell wherein enzymes involved in fatty acid biosynthesis hâve been introduced using the multigene expression Systems as described above and culturing said genetically engineered host cell in a culture medium so as to allow the production of fatty acids. More particularly, the host cell is cultured under conditions suitable to ensure expression of the multicistronic or tricistronic expression cassettes, which expression cassettes comprise transgenes encoding enzymes involved in the fatty acid biosynthetic pathway envisaged herein.
In particular embodiments, the host cells ensure a rate of fatty acid production which is sufficiently high to be industrially valuable. Indeed, in particular embodiments, as a resuit ofthe coordinated expression ofthe different enzymes involved, the recombinant host cells disclosed herein are capable of ensuring a high yield at limited production costs.
The recombinant host cells are cultured under conditions suitable for the production of fatty acids by the host cells. More particularly this implies conditions sufficient to allow expression ofthe multicistronic ortricistronic expression cassettes (comprising transgenes encoding fatty acid enzymes), which means any condition that allows a host cell to (over)produce a fatty acid enzyme as described herein. Suitable conditions include, for example, fermentation conditions. Fermentation conditions can comprise many parameters, such as température ranges, levels of aération, and media composition. Each of these conditions, individually and in combination, allows the host cell to grow. To détermine if conditions are sufficient to allow (over)expression, a host cell can be cultured, for example, for about 4, 8, 12, 18, 24, 36, or 48 hours. During and/or after culturing, samples can be obtained and analyzed to détermine if the conditions allow (over)expression. For example, the host cells in the sample or the culture medium in which the host cells were grown can be tested for the presence of a desired product (e.g. a fatty acid). When testing for the presence of a desired product, assays, such as, but not limited to, sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE), TLC, HPLC, GC/FID, GC/MS, LC/MS, MS, can be used.
Exemplary culture media include broths or gels. The host cells may be grown in a culture medium comprising a carbon source to be used for growth of the host cell. Exemplary carbon sources include carbohydrates, such as glucose, fructose, cellulose, or the like, that can be directly metabolized by the host cell. In addition, enzymes can be added to the culture medium to facilitate the mobilization (e.g., the depolymerization of starch or cellulose to fermentable sugars) and subséquent metabolism of the carbon source. A culture medium may optionally contain further nutrients as required by the particular strain, including inorganic nitrogen sources such as ammonia or ammonium salts, and the like, and minerais and the like. In particular embodiments, wherein phototrophic microalgae are used as host cells, the method for the production of fatty acids may comprise providing microalgae genetically engineered to produce fatty acids as taught herein, and culturing said microalgae in photobioreactors or an open pond system using CO2 and sunlight as feedstock.
Other growth conditions, such as température, cell density, and the like are generally selected to provide an economical process. Températures during each of the growth phase and the production phase may range from above the freezing température of the medium to about 50°C.
The culturing step of the methods of the invention may be conducted aerobically, anaerobically, or substantially anaerobically. Briefly, anaérobie conditions refer to an environment devoid of oxygen. Substantially anaérobie conditions include, for example, a culture, batch fermentation or continuous fermentation such that the dissolved oxygen concentration in the medium remains between 0 and 10% of saturation. Substantially anaérobie conditions also includes growing or resting cells in liquid medium or on solid agar inside a sealed chamber maintained with an atmosphère of less than 1% oxygen. The percent of oxygen can be maintained by, for example, sparging the culture with an N2/CO2 mixture or other suitable non-oxygen gas or gasses.
The cultivation step of the methods described herein can be conducted continuously, batch-wise, or some combination thereof.
In further embodiments, methods are provided for producing fatty acids, which, in addition to the steps detailed above, further comprise the step of recovering the fatty acids from the host cell or the culture medium. Suitable purification can be carried out by methods known to the person skilled in the art such as by using lysis methods, extraction, ion exchange resins, electrodialysis, nanofiltration, etc.
Accordingly, methods are provided for the production of fatty acids which methods comprise the steps of:
(i) providing a genetically engineered host cell transformed using a multigene expression system which ensures expression of enzymes involved in fatty acid biosynthesis as described herein above;
(ii) culturing the host cells under conditions suitable for the production of fatty acids, and (iii) recovering the fatty acids from the host cell or the culture medium.
The invention will be further understood with reference to the following non-limiting examples.
EXAMPLES
The practice ofthe présent invention will employ, unless otherwise indicated, conventional techniques used in recombinant DNA technology, molecular biology, biological testing, and the like, which are within the skill of the art. Such techniques are explained fully in the literature.
Example 1 : Multigene expression in Phaeodactylum tricornutum
Materials and methods
DNA expression cassettes were constructed which comprise the genes bsd (from Aspergillus terreus), nat1 (from Streptomyces noursei) and shble (gene from pUT58 (Drocourt et al., 1990 Nucleic Acids Research 18: 4009), which confer résistance to respectively, the antibiotics blasticidin, nourseothricin and zeocin. The genes were separated by a nucléotide sequence encoding the F2A peptide (F2A; APVKQTLNFDLLKLAGDVESNPGP, SEQ ID NO:1), and were under the control of the histone 4 (H4p) promoter of Phaeodactylum tricornutum. The fucoxanthin chlorophyll a binding protein (fcpA) terminator was further integrated behind the tricistronic construct.
The expression cassettes were ligated into pCT2 vectors, each vector comprising only one tricistronic expression cassette, and further comprising an antibiotic résistance cassette.
Phaeodactylum tricornutum cells were transformed using an adapted NEPA21 electroporation protocol as described by Miyahara et al. (2013 Biosci Biotechnol Biochem. 77(4):874-876) with the constructed vectors and allowed to randomly insert the expression cassettes in their genome. Briefly, 1.107 cells were collected by centrifugation at 1500xg, washed twice with 1 ml 0,77 M mannitol, re-suspended in 150 μΙ 0,77 M mannitol and transferred to 0,2 cm electroporation cuvettes. 4 pg of vectors were electroporated. Cells were transferred to 50 ml tubes in 5 ml ESAW medium (Harrison et al. 1980 J. Phycol. 16:28-35) and allowed to recover for 20 hours at 20°C in a 12:12 darklight regimen while shaking at 100 RPM. Cells were collected and plated onto 100 pg/ml zeocin containing agar plates and incubated one month under the same light and température conditions. Clones résistant to zeocin were then re-plated on the 2 other antibiotics.
Results
The clones were résistant to the 3 antibiotics, which in ail likelihood is due to the expression of ail 3 genes linked by the F2A sequences.
Example 2: Multigene expression in Nannochloropsis
Materials and methods
The following DNA constructs were prepared (Fig. 1):
UEP construct: comprising a promoter operably linked to the shble gene that confers résistance to the antibiotic zeocin, and a terminator;
pMA01 construct: comprising a promoter operably linked to the bsd gene that confers résistance to the antibiotic blasticidin and the shBle gene fused to a His-tag, and a terminator; and pMA02 construct: comprising a promoter operably linked to the bsd gene, the nat1 gene that confers résistance to the antibiotic nourseothricin, and the shBle gene fused to a Histag, and a terminator.
The F2A sequence (APVKQTLNFDLLKLAGDVESNPGP; SEQ ID NO:1) was used as linker between the antibiotic résistance genes in the pMA01 and pMA02 constructs.
The constructs were transformed in Nannochloropsis gaditana 526. Briefly, 1.108 cells were collected by centrifugation at 3500xg for 13 min, washed twice in 1 ml 375 mM D-sorbitol and re-suspended in 100 μΙ D-sorbitol. 2pg of linearized DNA was added to the cells, kept on ice for 15 min and electroporated into 0,2 cm cuvettes, using 2400 V, 500 Ohms, 50 pF. After electroporation, cells were transferred to 5 ml of f/2 medium and incubated for 24 hours in constant light, at 100 RPM and 20°C. Cells were then pelleted (3500xg for 5min), and plated onto sélective 1% agar plate containing 7 pg/ml zeocin (Zeo 7), 100 pg/ml blasticidin (Bsd 100), or 500 pg/ml nourseothricin (Nat 500), and incubated under the same conditions.
Results
The pMA01 and pMA02 transformants were résistant to zeocin and blasticidin, and zeocin, blasticidin and nourseothricin, respectively (Fig. 2). These data show that the 2A sequenceseparated résistance genes were functional in Nannochloropsis.
Example 3: Co-expression of 2 multigene constructs in Nannochloropsis
Matériels and methods
The following DNA constructs are prepared (Fig.3):
pMA03 construct: comprising a promoter operably linked to gene 1, gene 2 and the shBle gene that confers résistance to the antibiotic zeocin, and a terminator.
pMA04 construct: comprising a promoter operably linked to gene 3, gene 4 and the bsd gene that confers résistance to the antibiotic blasticidin, and a terminator.
pMA05 construct: comprising a promoter operably linked to gene 1, gene 2, the shble gene that confers résistance to the antibiotic zeocin, gene 3, gene 4 and the bsd gene that confers résistance to the antibiotic blasticidin, and a terminator.
The F2A sequence (APVKQTLNFDLLKLAGDVESNPGP; SEQ ID NO:1) is used as linker between the genes in the pMA03, pMA04 and pMA05 constructs.
Nannochloropsis gaditana 526 are co-transformed with the constructs pMA03 and pMA04, or transformed with the construct pMA05. Briefly, 1.108 cells are collected by centrifugation at 3500xg for 13 min, washed twice in 1 ml 375 mM D-sorbitol and re-suspended in 100 pl D-sorbitol. 2pg of linearized DNA is added to the cells, kept on ice for 15 min and electroporated into 0,2 cm cuvettes, using 2400 V, 500 Ohms, 50 pF. After electroporation, cells are transferred to 5 ml of f/2 medium and incubated for 24 hours in constant light, at 100 RPM and 20°C. Cells are then pelleted (3500xg for 5min), and plated onto sélective 1% agar plate containing 7 pg/ml zeocin (Zeo 7) and incubated under the same conditions. After 1 month, zeocin résistant colonies are replated onto sélective 1% agar plate containing 100 pg/ml blasticidin (Bsd 100).
Results
Transformation of pMA05 or co-transformation of pMA03+pMA04 give rise to similar number of zeocin résistant colonies, but significantly more of the zeocin-resistant pMA03+pMA04 co-transformants are also résistant to blasticidin compared to the zeocinresistant pMA05 transformants. These data show that expression of the 6th gene on a multigene cassette is less efficient than using an expression System of the présent invention comprising two expression cassettes of 3 genes.
Claims (16)
1. A multigene expression System comprising at least two different nucleic acid expression cassettes, wherein each expression cassette comprises a promoter operably linked to three or more transgenes connected to one another by at least one sequence encoding a 2A peptide.
2. The expression System according to claim 1, wherein each expression cassette comprises a promoter operably linked to three or more transgenes connected to one another by at least two successive sequences encoding a 2A peptide.
3. The expression System according to any one of daims 1 or 2, wherein each expression cassette comprises three transgenes.
4. The expression System according to any one of daims 1 to 3, wherein the promoters of the expression cassettes are the same.
5. The expression System according to any one of daims 1 to 4, wherein the 2A peptide is derived from foot-and-mouth disease virus (FMDV 2A or F2A).
6. The expression System according to claim 1 to 5, further comprising one or more nucleic acid expression cassettes comprising a selectable marker gene.
7. The expression System according to any one of daims 1 to 6, wherein the transgenes encode for enzymes involved in a biosynthetic pathway.
8. The expression System according to claim 7, wherein the transgenes encode enzymes involved in the fatty acid biosynthetic pathway.
9. A vector System comprising the expression System according to any one of daims 1 to
8, said vector System comprising at least two vectors, wherein each vector comprises one of said at least two nucleic acid expression cassettes comprising a promoter operably linked to three or more transgenes connected to one another by at least one sequence encoding a 2A peptide.
10. The vector System according to claim 9, wherein each vector further comprises a nucleic acid expression cassette comprising a selectable marker gene.
11. The vector System according to claim 9 or 10, wherein the vectors are plasmids.
12. A host cell comprising the expression System according to any one of daims 1 to 8 or the vector System according to any one of daims 9 to 11.
13. The host cell according to claim 12, wherein the host cell is a microalga, preferably a diatom such as Phaeodactylum tricornutum, or a Nannochloropsis species.
14. A method for genetically modifying a host cell with multiple genes comprising the following steps:
providing a host cell, and
- transforming the host cell with at least two different nucleic acid expression cassettes, wherein each expression cassette comprises a promoter operably linked to three or more transgenes connected to one another by at least one sequence encoding a 2A peptide, and optionally one or more nucleic acid expression cassettes comprising a selectable marker gene.
15. The method according to claim 14, wherein the at least two nucleic acid expression cassettes and optionally the one or more nucleic acid expression cassettes comprising a selectable marker gene are co-transformed into the host cell.
16. The method according to claim 14 or 15, further comprising the step of selecting the host cells which hâve been transformed with said at least two nucleic acid cassettes and said one or more nucleic acid expression cassettes comprising a selectable marker gene by culturing the host cells on a sélective medium, wherein the ability of a host cell to be cultured on the sélective medium is dépendent on the expression of the selectable marker gene.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP16152953.2 | 2016-01-27 |
Publications (1)
Publication Number | Publication Date |
---|---|
OA19168A true OA19168A (en) | 2020-03-09 |
Family
ID=
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9487790B2 (en) | Nuclear based expression of genes for production of biofuels and process co-products in algae | |
EP3408394B1 (en) | Multigene expression in microalgae | |
Sun et al. | Expression of foreign genes in Dunaliella by electroporation | |
US11332764B2 (en) | Microorganisms having increased lipid productivity | |
US20200157558A1 (en) | Algal chloroplastic srp54 mutants | |
AU2011297126B2 (en) | Alga in which production of photosynthetic products is improved, and use for said alga | |
US10041079B2 (en) | Compositions and methods for expressing genes in algae | |
CN109321548B (en) | Cas9 protein, CRISPR/Cas9 system, mushroom gene editing method and application | |
Khatiwada et al. | Nuclear transformation of the versatile microalga Euglena gracilis | |
KR101567308B1 (en) | Recombinant vector for increasing biomass and lipid productivity of microalgae and uses thereof | |
WO2014016612A1 (en) | Methods and compositions for increasing productivity and viability of algal cells | |
OA19168A (en) | Multigene expression in microalgae. | |
US10590444B2 (en) | Increased triacylglycerol production in microalgae | |
US20140272949A1 (en) | Methods for Fully Segregating Recombinant Marine Cyanobacteria | |
KR20200098424A (en) | Method for gene editing in microalgae using particle bombardment | |
KR102617091B1 (en) | Promoter from Pyropia yezoensis U6 gene and uses thereof | |
EP4166658A1 (en) | Diatom-based genetic engineering system methodology for the eco-sustainable production of ovothiols | |
CN117616125A (en) | Toxin-free aloe and preparation method thereof | |
Nesamma et al. | Genetic engineering of microalgae for production of biofuels: Recent progress and practical limitations | |
KR20230162423A (en) | Expressible promoter in Esteya vermicola and uses thereof | |
KR20220098155A (en) | Nonviral transcriptional activation domains and related methods and uses | |
CN114807218A (en) | Method for improving high-temperature resistance of phaeodactylum tricornutum algae species by transferring exogenous gene | |
CN116656513A (en) | Monascus pilosus mutant strain and construction method and application thereof | |
OA19169A (en) | Increased triacylglycerol production in microalgae. |