WO2014164232A1 - Cyanobacteria that produce lipid packaging proteins - Google Patents
Cyanobacteria that produce lipid packaging proteins Download PDFInfo
- Publication number
- WO2014164232A1 WO2014164232A1 PCT/US2014/021423 US2014021423W WO2014164232A1 WO 2014164232 A1 WO2014164232 A1 WO 2014164232A1 US 2014021423 W US2014021423 W US 2014021423W WO 2014164232 A1 WO2014164232 A1 WO 2014164232A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- dgat
- seq
- acyl
- acp
- polypeptide
- Prior art date
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 237
- 150000002632 lipids Chemical class 0.000 title claims abstract description 168
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 146
- 238000004806 packaging method and process Methods 0.000 title claims abstract description 44
- 241000192700 Cyanobacteria Species 0.000 title abstract description 87
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 212
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 212
- 239000002157 polynucleotide Substances 0.000 claims abstract description 212
- 230000001965 increasing effect Effects 0.000 claims abstract description 36
- 108010001348 Diacylglycerol O-acyltransferase Proteins 0.000 claims description 190
- 102000002148 Diacylglycerol O-acyltransferase Human genes 0.000 claims description 187
- 238000004519 manufacturing process Methods 0.000 claims description 61
- 238000000034 method Methods 0.000 claims description 56
- 241001464430 Cyanobacterium Species 0.000 claims description 54
- 230000014509 gene expression Effects 0.000 claims description 50
- 230000007935 neutral effect Effects 0.000 claims description 44
- 230000030570 cellular localization Effects 0.000 claims description 34
- 239000000758 substrate Substances 0.000 claims description 34
- 241000589291 Acinetobacter Species 0.000 claims description 27
- 230000012010 growth Effects 0.000 claims description 14
- 101100297345 Caenorhabditis elegans pgl-2 gene Proteins 0.000 claims description 9
- 101150111781 PGL1 gene Proteins 0.000 claims description 9
- 241000187747 Streptomyces Species 0.000 claims description 8
- 108700037654 Acyl carrier protein (ACP) Proteins 0.000 claims description 7
- 102000048456 Acyl carrier protein (ACP) Human genes 0.000 claims description 7
- 241000611272 Alcanivorax Species 0.000 claims description 7
- 238000012258 culturing Methods 0.000 claims description 6
- 230000001939 inductive effect Effects 0.000 claims description 5
- 244000005700 microbiome Species 0.000 abstract description 126
- 230000000243 photosynthetic effect Effects 0.000 abstract description 113
- 235000014113 dietary fatty acids Nutrition 0.000 abstract description 83
- 229930195729 fatty acid Natural products 0.000 abstract description 83
- 239000000194 fatty acid Substances 0.000 abstract description 83
- 150000004665 fatty acids Chemical class 0.000 abstract description 77
- 150000003626 triacylglycerols Chemical class 0.000 abstract description 49
- 108700016155 Acyl transferases Proteins 0.000 abstract description 4
- 102000057234 Acyl transferases Human genes 0.000 abstract description 3
- 230000002194 synthesizing effect Effects 0.000 abstract 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 257
- 102000004196 processed proteins & peptides Human genes 0.000 description 225
- 229920001184 polypeptide Polymers 0.000 description 223
- 102000004190 Enzymes Human genes 0.000 description 139
- 108090000790 Enzymes Proteins 0.000 description 139
- 235000018102 proteins Nutrition 0.000 description 133
- 230000015572 biosynthetic process Effects 0.000 description 128
- 230000000694 effects Effects 0.000 description 112
- 235000001014 amino acid Nutrition 0.000 description 91
- 229940024606 amino acid Drugs 0.000 description 84
- 150000001413 amino acids Chemical class 0.000 description 82
- 108010016219 Acetyl-CoA carboxylase Proteins 0.000 description 70
- 102000000452 Acetyl-CoA carboxylase Human genes 0.000 description 67
- 108010018763 Biotin carboxylase Proteins 0.000 description 67
- 102000005488 Thioesterase Human genes 0.000 description 60
- 108020002982 thioesterase Proteins 0.000 description 60
- 210000004027 cell Anatomy 0.000 description 57
- 239000012634 fragment Substances 0.000 description 57
- 229920002527 Glycogen Polymers 0.000 description 55
- 102000011720 Lysophospholipase Human genes 0.000 description 55
- 108020002496 Lysophospholipase Proteins 0.000 description 55
- 229940096919 glycogen Drugs 0.000 description 55
- 101710146995 Acyl carrier protein Proteins 0.000 description 54
- 238000003786 synthesis reaction Methods 0.000 description 52
- -1 from just sunlight Substances 0.000 description 51
- 108020001507 fusion proteins Proteins 0.000 description 50
- 102000037865 fusion proteins Human genes 0.000 description 50
- 230000002255 enzymatic effect Effects 0.000 description 48
- DCXXMTOCNZCJGO-UHFFFAOYSA-N tristearoylglycerol Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(OC(=O)CCCCCCCCCCCCCCCCC)COC(=O)CCCCCCCCCCCCCCCCC DCXXMTOCNZCJGO-UHFFFAOYSA-N 0.000 description 48
- 239000013598 vector Substances 0.000 description 48
- 241000894006 Bacteria Species 0.000 description 42
- 108010069394 Phosphatidate Phosphatase Proteins 0.000 description 42
- 102000001107 Phosphatidate Phosphatase Human genes 0.000 description 42
- 101150112449 aas gene Proteins 0.000 description 42
- 239000012528 membrane Substances 0.000 description 42
- 210000004379 membrane Anatomy 0.000 description 42
- 238000012217 deletion Methods 0.000 description 38
- 230000037430 deletion Effects 0.000 description 38
- 210000000170 cell membrane Anatomy 0.000 description 37
- UFTFJSFQGQCHQW-UHFFFAOYSA-N triformin Chemical compound O=COCC(OC=O)COC=O UFTFJSFQGQCHQW-UHFFFAOYSA-N 0.000 description 34
- 102000015439 Phospholipases Human genes 0.000 description 33
- 108010064785 Phospholipases Proteins 0.000 description 33
- 239000002773 nucleotide Substances 0.000 description 33
- 125000003729 nucleotide group Chemical group 0.000 description 33
- 230000001580 bacterial effect Effects 0.000 description 32
- 230000035772 mutation Effects 0.000 description 30
- 238000006243 chemical reaction Methods 0.000 description 29
- 150000007523 nucleic acids Chemical group 0.000 description 29
- 125000002252 acyl group Chemical group 0.000 description 28
- 125000003275 alpha amino acid group Chemical group 0.000 description 28
- 108020002663 Aldehyde Dehydrogenase Proteins 0.000 description 27
- 238000006467 substitution reaction Methods 0.000 description 27
- 102000005369 Aldehyde Dehydrogenase Human genes 0.000 description 26
- 239000004164 Wax ester Substances 0.000 description 26
- 125000000539 amino acid group Chemical group 0.000 description 26
- 150000001982 diacylglycerols Chemical class 0.000 description 26
- 235000019386 wax ester Nutrition 0.000 description 26
- 101710129019 Long-chain acyl-[acyl-carrier-protein] reductase Proteins 0.000 description 25
- 241000196324 Embryophyta Species 0.000 description 24
- 108091028043 Nucleic acid sequence Proteins 0.000 description 24
- 239000000047 product Substances 0.000 description 24
- 238000003860 storage Methods 0.000 description 24
- 229910052799 carbon Inorganic materials 0.000 description 23
- ZIIUUSVHCHPIQD-UHFFFAOYSA-N 2,4,6-trimethyl-N-[3-(trifluoromethyl)phenyl]benzenesulfonamide Chemical compound CC1=CC(C)=CC(C)=C1S(=O)(=O)NC1=CC=CC(C(F)(F)F)=C1 ZIIUUSVHCHPIQD-UHFFFAOYSA-N 0.000 description 22
- 241000588724 Escherichia coli Species 0.000 description 22
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 22
- 102000039446 nucleic acids Human genes 0.000 description 22
- 108020004707 nucleic acids Proteins 0.000 description 22
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 21
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 21
- 230000002829 reductive effect Effects 0.000 description 21
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 20
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 20
- 108010001483 Glycogen Synthase Proteins 0.000 description 20
- 150000003904 phospholipids Chemical class 0.000 description 20
- 108020004414 DNA Proteins 0.000 description 19
- 241000192560 Synechococcus sp. Species 0.000 description 19
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 18
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 18
- 108010052285 Membrane Proteins Proteins 0.000 description 18
- 230000015556 catabolic process Effects 0.000 description 18
- 235000021588 free fatty acids Nutrition 0.000 description 18
- 101710153103 Long-chain-fatty-acid-CoA ligase FadD13 Proteins 0.000 description 17
- 230000001086 cytosolic effect Effects 0.000 description 17
- 125000005647 linker group Chemical group 0.000 description 17
- 150000002191 fatty alcohols Chemical class 0.000 description 16
- 108010082340 Arginine deiminase Proteins 0.000 description 15
- 102000004157 Hydrolases Human genes 0.000 description 15
- 108090000604 Hydrolases Proteins 0.000 description 15
- 241000192707 Synechococcus Species 0.000 description 15
- 101710137710 Thioesterase 1/protease 1/lysophospholipase L1 Proteins 0.000 description 15
- 230000003197 catalytic effect Effects 0.000 description 15
- 230000002018 overexpression Effects 0.000 description 15
- 230000001105 regulatory effect Effects 0.000 description 15
- 102000018697 Membrane Proteins Human genes 0.000 description 14
- 108010013690 Methyl-Accepting Chemotaxis Proteins Proteins 0.000 description 14
- 101150049837 PGM gene Proteins 0.000 description 14
- 238000003780 insertion Methods 0.000 description 14
- 230000037431 insertion Effects 0.000 description 14
- 230000037361 pathway Effects 0.000 description 14
- 241000192589 Synechococcus elongatus PCC 7942 Species 0.000 description 13
- 230000001413 cellular effect Effects 0.000 description 13
- 125000001924 fatty-acyl group Chemical group 0.000 description 13
- 230000006870 function Effects 0.000 description 12
- 239000008187 granular material Substances 0.000 description 12
- 230000003834 intracellular effect Effects 0.000 description 12
- 239000011159 matrix material Substances 0.000 description 12
- 241000219195 Arabidopsis thaliana Species 0.000 description 11
- 101100121991 Chlamydia pneumoniae glmM gene Proteins 0.000 description 11
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 11
- 108090001060 Lipase Proteins 0.000 description 11
- 102000004316 Oxidoreductases Human genes 0.000 description 11
- 108090000854 Oxidoreductases Proteins 0.000 description 11
- 229920000331 Polyhydroxybutyrate Polymers 0.000 description 11
- 108010076504 Protein Sorting Signals Proteins 0.000 description 11
- 230000035508 accumulation Effects 0.000 description 11
- 238000009825 accumulation Methods 0.000 description 11
- 238000007792 addition Methods 0.000 description 11
- 230000004075 alteration Effects 0.000 description 11
- 230000027455 binding Effects 0.000 description 11
- 210000000805 cytoplasm Anatomy 0.000 description 11
- 230000004136 fatty acid synthesis Effects 0.000 description 11
- 101150084612 gpmA gene Proteins 0.000 description 11
- 101150104722 gpmI gene Proteins 0.000 description 11
- 230000036961 partial effect Effects 0.000 description 11
- 239000005015 poly(hydroxybutyrate) Substances 0.000 description 11
- 210000002377 thylakoid Anatomy 0.000 description 11
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 10
- 102100036869 Diacylglycerol O-acyltransferase 1 Human genes 0.000 description 10
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 10
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 10
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 10
- 230000000875 corresponding effect Effects 0.000 description 10
- 101150013858 glgC gene Proteins 0.000 description 10
- 239000000543 intermediate Substances 0.000 description 10
- 230000000670 limiting effect Effects 0.000 description 10
- 230000004048 modification Effects 0.000 description 10
- 238000012986 modification Methods 0.000 description 10
- 239000003921 oil Substances 0.000 description 10
- 235000019198 oils Nutrition 0.000 description 10
- 101100434045 Danio rerio acp7 gene Proteins 0.000 description 9
- LTYOQGRJFJAKNA-KKIMTKSISA-N Malonyl CoA Natural products S(C(=O)CC(=O)O)CCNC(=O)CCNC(=O)[C@@H](O)C(CO[P@](=O)(O[P@](=O)(OC[C@H]1[C@@H](OP(=O)(O)O)[C@@H](O)[C@@H](n2c3ncnc(N)c3nc2)O1)O)O)(C)C LTYOQGRJFJAKNA-KKIMTKSISA-N 0.000 description 9
- 241000124008 Mammalia Species 0.000 description 9
- 241001465754 Metazoa Species 0.000 description 9
- 101100434047 Mus musculus Acp7 gene Proteins 0.000 description 9
- 108700021044 acyl-ACP thioesterase Proteins 0.000 description 9
- 230000004927 fusion Effects 0.000 description 9
- 230000003993 interaction Effects 0.000 description 9
- LTYOQGRJFJAKNA-DVVLENMVSA-N malonyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-DVVLENMVSA-N 0.000 description 9
- 239000000203 mixture Substances 0.000 description 9
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 9
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 8
- WFPZSXYXPSUOPY-ROYWQJLOSA-N ADP alpha-D-glucoside Chemical compound C([C@H]1O[C@H]([C@@H]([C@@H]1O)O)N1C=2N=CN=C(C=2N=C1)N)OP(O)(=O)OP(O)(=O)O[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O WFPZSXYXPSUOPY-ROYWQJLOSA-N 0.000 description 8
- 108700023224 Glucose-1-phosphate adenylyltransferases Proteins 0.000 description 8
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 8
- 102000004882 Lipase Human genes 0.000 description 8
- 239000004367 Lipase Substances 0.000 description 8
- 102100037883 Phospholipase B1, membrane-associated Human genes 0.000 description 8
- 241000192581 Synechocystis sp. Species 0.000 description 8
- HXXFSFRBOHSIMQ-UHFFFAOYSA-N [3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl] dihydrogen phosphate Chemical compound OCC1OC(OP(O)(O)=O)C(O)C(O)C1O HXXFSFRBOHSIMQ-UHFFFAOYSA-N 0.000 description 8
- 210000004899 c-terminal region Anatomy 0.000 description 8
- 238000004422 calculation algorithm Methods 0.000 description 8
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 8
- 235000019421 lipase Nutrition 0.000 description 8
- GYHFUZHODSMOHU-UHFFFAOYSA-N nonanal Chemical compound CCCCCCCCC=O GYHFUZHODSMOHU-UHFFFAOYSA-N 0.000 description 8
- 238000012216 screening Methods 0.000 description 8
- WFPZSXYXPSUOPY-UHFFFAOYSA-N ADP-mannose Natural products C1=NC=2C(N)=NC=NC=2N1C(C(C1O)O)OC1COP(O)(=O)OP(O)(=O)OC1OC(CO)C(O)C(O)C1O WFPZSXYXPSUOPY-UHFFFAOYSA-N 0.000 description 7
- 241000186361 Actinobacteria <class> Species 0.000 description 7
- 101100388296 Arabidopsis thaliana DTX51 gene Proteins 0.000 description 7
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 7
- 241000195493 Cryptophyta Species 0.000 description 7
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 7
- 241000206602 Eukaryota Species 0.000 description 7
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 7
- 108700026244 Open Reading Frames Proteins 0.000 description 7
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 7
- 101100215626 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ADP1 gene Proteins 0.000 description 7
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 7
- 229910002092 carbon dioxide Inorganic materials 0.000 description 7
- 239000005516 coenzyme A Substances 0.000 description 7
- 229940093530 coenzyme a Drugs 0.000 description 7
- 230000001419 dependent effect Effects 0.000 description 7
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 7
- 235000014304 histidine Nutrition 0.000 description 7
- 229920000642 polymer Polymers 0.000 description 7
- 235000004400 serine Nutrition 0.000 description 7
- QCGCETFHYOEVAI-GDVGLLTNSA-N (2s)-2-[(3-amino-3-carboxypropanoyl)amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound OC(=O)C(N)CC(=O)N[C@H](C(O)=O)CCCN=C(N)N QCGCETFHYOEVAI-GDVGLLTNSA-N 0.000 description 6
- 102100022089 Acyl-[acyl-carrier-protein] hydrolase Human genes 0.000 description 6
- 108010039224 Amidophosphoribosyltransferase Proteins 0.000 description 6
- 102000007132 Carboxyl and Carbamoyl Transferases Human genes 0.000 description 6
- 108010072957 Carboxyl and Carbamoyl Transferases Proteins 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 6
- 240000006262 Cuphea hookeriana Species 0.000 description 6
- 108050004099 Diacylglycerol O-acyltransferase 1 Proteins 0.000 description 6
- 108010039731 Fatty Acid Synthases Proteins 0.000 description 6
- 241000233866 Fungi Species 0.000 description 6
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 6
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 6
- 101710089395 Oleosin Proteins 0.000 description 6
- 108091007187 Reductases Proteins 0.000 description 6
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 6
- 239000004473 Threonine Substances 0.000 description 6
- XJLXINKUBYWONI-DQQFMEOOSA-N [[(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-3-hydroxy-4-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2s,3r,4s,5s)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphate Chemical compound NC(=O)C1=CC=C[N+]([C@@H]2[C@H]([C@@H](O)[C@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-DQQFMEOOSA-N 0.000 description 6
- 239000002253 acid Substances 0.000 description 6
- 230000035605 chemotaxis Effects 0.000 description 6
- 108010031546 cyanophycin Proteins 0.000 description 6
- 229920000976 cyanophycin polymer Polymers 0.000 description 6
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 6
- 235000011187 glycerol Nutrition 0.000 description 6
- 230000002209 hydrophobic effect Effects 0.000 description 6
- 210000004020 intracellular membrane Anatomy 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 210000001322 periplasm Anatomy 0.000 description 6
- 108010023506 peroxygenase Proteins 0.000 description 6
- 108091000115 phosphomannomutase Proteins 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 241000894007 species Species 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 230000008685 targeting Effects 0.000 description 6
- 235000008521 threonine Nutrition 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- 241000588626 Acinetobacter baumannii Species 0.000 description 5
- 101710120269 Acyl-CoA thioester hydrolase YbgC Proteins 0.000 description 5
- UXDDRFCJKNROTO-UHFFFAOYSA-N Glycerol 1,2-diacetate Chemical compound CC(=O)OCC(CO)OC(C)=O UXDDRFCJKNROTO-UHFFFAOYSA-N 0.000 description 5
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 5
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 5
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 5
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 5
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 5
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 5
- 101710091688 Patatin Proteins 0.000 description 5
- 108010058864 Phospholipases A2 Proteins 0.000 description 5
- 241001524101 Rhodococcus opacus Species 0.000 description 5
- 108010064926 acyl-CoA carboxylase Proteins 0.000 description 5
- 150000001335 aliphatic alkanes Chemical class 0.000 description 5
- 235000018417 cysteine Nutrition 0.000 description 5
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 5
- 230000002950 deficient Effects 0.000 description 5
- 238000004146 energy storage Methods 0.000 description 5
- 150000002192 fatty aldehydes Chemical class 0.000 description 5
- 101150019926 glgA gene Proteins 0.000 description 5
- 238000000338 in vitro Methods 0.000 description 5
- 230000004060 metabolic process Effects 0.000 description 5
- 238000007254 oxidation reaction Methods 0.000 description 5
- 239000005014 poly(hydroxyalkanoate) Substances 0.000 description 5
- 229920000903 polyhydroxyalkanoate Polymers 0.000 description 5
- 239000002243 precursor Substances 0.000 description 5
- 230000009467 reduction Effects 0.000 description 5
- 230000011664 signaling Effects 0.000 description 5
- 150000007970 thio esters Chemical class 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- 101710161460 3-oxoacyl-[acyl-carrier-protein] synthase Proteins 0.000 description 4
- 108010051457 Acid Phosphatase Proteins 0.000 description 4
- 241000588624 Acinetobacter calcoaceticus Species 0.000 description 4
- 241000588625 Acinetobacter sp. Species 0.000 description 4
- 239000004475 Arginine Substances 0.000 description 4
- 239000002028 Biomass Substances 0.000 description 4
- 108090000371 Esterases Proteins 0.000 description 4
- 241000192125 Firmicutes Species 0.000 description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 4
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 4
- 239000004471 Glycine Substances 0.000 description 4
- 102000007390 Glycogen Phosphorylase Human genes 0.000 description 4
- 108010046163 Glycogen Phosphorylase Proteins 0.000 description 4
- 102000051366 Glycosyltransferases Human genes 0.000 description 4
- 108700023372 Glycosyltransferases Proteins 0.000 description 4
- 101000927974 Homo sapiens Diacylglycerol O-acyltransferase 1 Proteins 0.000 description 4
- 101000582992 Homo sapiens Phospholipid phosphatase-related protein type 5 Proteins 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 4
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 4
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 4
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 4
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 4
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 4
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 4
- 239000004472 Lysine Substances 0.000 description 4
- 241000186367 Mycobacterium avium Species 0.000 description 4
- 241000187480 Mycobacterium smegmatis Species 0.000 description 4
- 229910019142 PO4 Inorganic materials 0.000 description 4
- 102100040153 Poly(A) polymerase gamma Human genes 0.000 description 4
- 244000253724 Saccharomyces cerevisiae S288c Species 0.000 description 4
- 235000004905 Saccharomyces cerevisiae S288c Nutrition 0.000 description 4
- 241000187398 Streptomyces lividans Species 0.000 description 4
- 244000098338 Triticum aestivum Species 0.000 description 4
- 102000014384 Type C Phospholipases Human genes 0.000 description 4
- 108010079194 Type C Phospholipases Proteins 0.000 description 4
- 244000025271 Umbellularia californica Species 0.000 description 4
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 4
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 4
- 235000004279 alanine Nutrition 0.000 description 4
- 150000001299 aldehydes Chemical class 0.000 description 4
- HXXFSFRBOHSIMQ-VFUOTHLCSA-N alpha-D-glucose 1-phosphate Chemical compound OC[C@H]1O[C@H](OP(O)(O)=O)[C@H](O)[C@@H](O)[C@@H]1O HXXFSFRBOHSIMQ-VFUOTHLCSA-N 0.000 description 4
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 4
- 235000009697 arginine Nutrition 0.000 description 4
- 125000003118 aryl group Chemical group 0.000 description 4
- 230000003115 biocidal effect Effects 0.000 description 4
- 239000002551 biofuel Substances 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 230000033228 biological regulation Effects 0.000 description 4
- 229960002685 biotin Drugs 0.000 description 4
- 235000020958 biotin Nutrition 0.000 description 4
- 239000011616 biotin Substances 0.000 description 4
- 150000007942 carboxylates Chemical class 0.000 description 4
- 108091008690 chemoreceptors Proteins 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 239000008103 glucose Substances 0.000 description 4
- 150000002327 glycerophospholipids Chemical class 0.000 description 4
- 230000009878 intermolecular interaction Effects 0.000 description 4
- 229960000310 isoleucine Drugs 0.000 description 4
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 4
- 108010062385 long-chain-alcohol O-fatty-acyltransferase Proteins 0.000 description 4
- 235000018977 lysine Nutrition 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 229930182817 methionine Natural products 0.000 description 4
- 238000002703 mutagenesis Methods 0.000 description 4
- 231100000350 mutagenesis Toxicity 0.000 description 4
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 4
- 230000003647 oxidation Effects 0.000 description 4
- 244000045947 parasite Species 0.000 description 4
- 239000010452 phosphate Substances 0.000 description 4
- 150000008103 phosphatidic acids Chemical class 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 108020003175 receptors Proteins 0.000 description 4
- 102000005962 receptors Human genes 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 125000003396 thiol group Chemical group [H]S* 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 4
- 235000002374 tyrosine Nutrition 0.000 description 4
- 239000004474 valine Substances 0.000 description 4
- DSSYKIVIOFKYAU-XCBNKYQSSA-N (R)-camphor Chemical compound C1C[C@@]2(C)C(=O)C[C@@H]1C2(C)C DSSYKIVIOFKYAU-XCBNKYQSSA-N 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- 241000023308 Acca Species 0.000 description 3
- 102000013563 Acid Phosphatase Human genes 0.000 description 3
- 108700028369 Alleles Proteins 0.000 description 3
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 3
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 3
- 102000014914 Carrier Proteins Human genes 0.000 description 3
- NBSCHQHZLSJFNQ-GASJEMHNSA-N D-Glucose 6-phosphate Chemical compound OC1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H](O)[C@H]1O NBSCHQHZLSJFNQ-GASJEMHNSA-N 0.000 description 3
- 102100035762 Diacylglycerol O-acyltransferase 2 Human genes 0.000 description 3
- 240000005636 Dryobalanops aromatica Species 0.000 description 3
- 230000005526 G1 to G0 transition Effects 0.000 description 3
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 3
- VFRROHXSMXFLSN-UHFFFAOYSA-N Glc6P Natural products OP(=O)(O)OCC(O)C(O)C(O)C(O)C=O VFRROHXSMXFLSN-UHFFFAOYSA-N 0.000 description 3
- 101710121996 Hexon protein p72 Proteins 0.000 description 3
- 241000282414 Homo sapiens Species 0.000 description 3
- 101000930020 Homo sapiens Diacylglycerol O-acyltransferase 2 Proteins 0.000 description 3
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 3
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 3
- 239000000232 Lipid Bilayer Substances 0.000 description 3
- 108030000162 Long-chain-fatty-acid-[acyl-carrier-protein] ligases Proteins 0.000 description 3
- 101710125418 Major capsid protein Proteins 0.000 description 3
- 241000187722 Micromonospora echinospora Species 0.000 description 3
- 101100000438 Mus musculus Acacb gene Proteins 0.000 description 3
- 241000186359 Mycobacterium Species 0.000 description 3
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 3
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 3
- 241000244206 Nematoda Species 0.000 description 3
- 241000187654 Nocardia Species 0.000 description 3
- 102000009569 Phosphoglucomutase Human genes 0.000 description 3
- 241000187563 Rhodococcus ruber Species 0.000 description 3
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 3
- 241000192584 Synechocystis Species 0.000 description 3
- 241000192593 Synechocystis sp. PCC 6803 Species 0.000 description 3
- 241000255588 Tephritidae Species 0.000 description 3
- 102000004357 Transferases Human genes 0.000 description 3
- 108090000992 Transferases Proteins 0.000 description 3
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 3
- 235000008674 Umbellularia californica Nutrition 0.000 description 3
- 230000002378 acidificating effect Effects 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- 230000010933 acylation Effects 0.000 description 3
- 238000005917 acylation reaction Methods 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 239000012736 aqueous medium Substances 0.000 description 3
- 235000009582 asparagine Nutrition 0.000 description 3
- 229960001230 asparagine Drugs 0.000 description 3
- 235000003704 aspartic acid Nutrition 0.000 description 3
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 3
- 125000004063 butyryl group Chemical group O=C([*])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 235000014633 carbohydrates Nutrition 0.000 description 3
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 3
- 230000021523 carboxylation Effects 0.000 description 3
- 238000006473 carboxylation reaction Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 239000013078 crystal Substances 0.000 description 3
- 230000030609 dephosphorylation Effects 0.000 description 3
- 238000006209 dephosphorylation reaction Methods 0.000 description 3
- 229940023064 escherichia coli Drugs 0.000 description 3
- 150000002148 esters Chemical class 0.000 description 3
- 150000002185 fatty acyl-CoAs Chemical class 0.000 description 3
- 230000002538 fungal effect Effects 0.000 description 3
- 101150065899 glgA1 gene Proteins 0.000 description 3
- 101150037310 glgM gene Proteins 0.000 description 3
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 3
- 235000013922 glutamic acid Nutrition 0.000 description 3
- 239000004220 glutamic acid Substances 0.000 description 3
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 3
- 235000004554 glutamine Nutrition 0.000 description 3
- 108010087116 holo-(acyl-carrier-protein) synthase Proteins 0.000 description 3
- 230000007062 hydrolysis Effects 0.000 description 3
- 238000006460 hydrolysis reaction Methods 0.000 description 3
- 230000004807 localization Effects 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 239000002207 metabolite Substances 0.000 description 3
- 210000004897 n-terminal region Anatomy 0.000 description 3
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 3
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 3
- WTJKGGKOPKCXLL-RRHRGVEJSA-N phosphatidylcholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCC=CCCCCCCCC WTJKGGKOPKCXLL-RRHRGVEJSA-N 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 231100000331 toxic Toxicity 0.000 description 3
- 230000002588 toxic effect Effects 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 108091005703 transmembrane proteins Proteins 0.000 description 3
- 102000035160 transmembrane proteins Human genes 0.000 description 3
- PORPENFLTBBHSG-MGBGTMOVSA-N 1,2-dihexadecanoyl-sn-glycerol-3-phosphate Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP(O)(O)=O)OC(=O)CCCCCCCCCCCCCCC PORPENFLTBBHSG-MGBGTMOVSA-N 0.000 description 2
- 102100030492 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase epsilon-1 Human genes 0.000 description 2
- 108010052386 2-haloacid dehalogenase Proteins 0.000 description 2
- 102100039377 28 kDa heat- and acid-stable phosphoprotein Human genes 0.000 description 2
- 101710176122 28 kDa heat- and acid-stable phosphoprotein Proteins 0.000 description 2
- WHBMMWSBFZVSSR-UHFFFAOYSA-M 3-hydroxybutyrate Chemical compound CC(O)CC([O-])=O WHBMMWSBFZVSSR-UHFFFAOYSA-M 0.000 description 2
- FUYCAQNCWDAOLQ-UHFFFAOYSA-N 4,8,12-trimethyltridecanoic acid Chemical compound CC(C)CCCC(C)CCCC(C)CCC(O)=O FUYCAQNCWDAOLQ-UHFFFAOYSA-N 0.000 description 2
- 241000186046 Actinomyces Species 0.000 description 2
- 241000611270 Alcanivorax borkumensis Species 0.000 description 2
- 102000016912 Aldehyde Reductase Human genes 0.000 description 2
- 108010053754 Aldehyde reductase Proteins 0.000 description 2
- 241000192542 Anabaena Species 0.000 description 2
- 241000219194 Arabidopsis Species 0.000 description 2
- 241000186063 Arthrobacter Species 0.000 description 2
- 108700040250 B-Specific) Enoyl-(Acyl-Carrier Protein) Reductase (NADPH Proteins 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 101710201279 Biotin carboxyl carrier protein Proteins 0.000 description 2
- 241000192685 Calothrix Species 0.000 description 2
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 2
- 102000004308 Carboxylic Ester Hydrolases Human genes 0.000 description 2
- 108090000863 Carboxylic Ester Hydrolases Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 241000186216 Corynebacterium Species 0.000 description 2
- 241000159506 Cyanothece Species 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- 102100039104 Dolichyl-diphosphooligosaccharide-protein glycosyltransferase subunit DAD1 Human genes 0.000 description 2
- 206010059866 Drug resistance Diseases 0.000 description 2
- 108020002908 Epoxide hydrolase Proteins 0.000 description 2
- 101710172177 Fasciclin-2 Proteins 0.000 description 2
- 241000187809 Frankia Species 0.000 description 2
- 229920001503 Glucan Polymers 0.000 description 2
- 108010042204 Glycosylphosphatidylinositol diacylglycerol-lyase Proteins 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- 101000884921 Homo sapiens Dolichyl-diphosphooligosaccharide-protein glycosyltransferase subunit DAD1 Proteins 0.000 description 2
- 241000913084 Limnothrix Species 0.000 description 2
- 102100025357 Lipid-phosphate phosphatase Human genes 0.000 description 2
- 108090001030 Lipoproteins Proteins 0.000 description 2
- 102000004895 Lipoproteins Human genes 0.000 description 2
- 241000186779 Listeria monocytogenes Species 0.000 description 2
- 108010011927 Long-chain-alcohol dehydrogenase Proteins 0.000 description 2
- 241000100287 Membras Species 0.000 description 2
- 241001139348 Microchaete Species 0.000 description 2
- 241000192041 Micrococcus Species 0.000 description 2
- BAWFJGJZGIEFAR-NNYOXOHSSA-O NAD(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-O 0.000 description 2
- 101710136418 NADPH-dependent 1-acyldihydroxyacetone phosphate reductase Proteins 0.000 description 2
- 241001478892 Nostoc sp. PCC 7120 Species 0.000 description 2
- 102100031254 Patatin-like phospholipase domain-containing protein 6 Human genes 0.000 description 2
- 108010093375 Phosphatidylinositol Diacylglycerol-Lyase Proteins 0.000 description 2
- 101710133727 Phospholipid:diacylglycerol acyltransferase Proteins 0.000 description 2
- 241000186429 Propionibacterium Species 0.000 description 2
- WHBMMWSBFZVSSR-UHFFFAOYSA-N R3HBA Natural products CC(O)CC(O)=O WHBMMWSBFZVSSR-UHFFFAOYSA-N 0.000 description 2
- 241000119319 Rhodococcus opacus PD630 Species 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- 102100022346 Serine/threonine-protein phosphatase 5 Human genes 0.000 description 2
- 101710129069 Serine/threonine-protein phosphatase 5 Proteins 0.000 description 2
- 101710199542 Serine/threonine-protein phosphatase T Proteins 0.000 description 2
- 108020004459 Small interfering RNA Proteins 0.000 description 2
- 244000300264 Spinacia oleracea Species 0.000 description 2
- 235000009337 Spinacia oleracea Nutrition 0.000 description 2
- 241001464990 Stanieria Species 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- 241000187432 Streptomyces coelicolor Species 0.000 description 2
- 241001446311 Streptomyces coelicolor A3(2) Species 0.000 description 2
- 241001453296 Synechococcus elongatus Species 0.000 description 2
- 241001453313 Synechococcus sp. PCC 7002 Species 0.000 description 2
- 241001303802 Synechococcus sp. RCC307 Species 0.000 description 2
- 241000970911 Trichormus variabilis ATCC 29413 Species 0.000 description 2
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 description 2
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 description 2
- 244000207951 Vigna marina Species 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 239000000370 acceptor Substances 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 101150071422 acp gene Proteins 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 150000001336 alkenes Chemical class 0.000 description 2
- AWUCVROLDVIAJX-UHFFFAOYSA-N alpha-glycerophosphate Natural products OCC(O)COP(O)(O)=O AWUCVROLDVIAJX-UHFFFAOYSA-N 0.000 description 2
- 230000004814 anther dehiscence Effects 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 description 2
- 238000003491 array Methods 0.000 description 2
- 229940009098 aspartate Drugs 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 125000001721 carboxyacetyl group Chemical group 0.000 description 2
- 150000001733 carboxylic acid esters Chemical class 0.000 description 2
- 210000002421 cell wall Anatomy 0.000 description 2
- 210000004671 cell-free system Anatomy 0.000 description 2
- 239000007795 chemical reaction product Substances 0.000 description 2
- 210000003763 chloroplast Anatomy 0.000 description 2
- 238000002742 combinatorial mutagenesis Methods 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 108091036078 conserved sequence Proteins 0.000 description 2
- 125000004122 cyclic group Chemical group 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000018044 dehydration Effects 0.000 description 2
- 238000006297 dehydration reaction Methods 0.000 description 2
- 239000001177 diphosphate Substances 0.000 description 2
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 2
- 235000011180 diphosphates Nutrition 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 235000020776 essential amino acid Nutrition 0.000 description 2
- 239000003797 essential amino acid Substances 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 229950010772 glucose-1-phosphate Drugs 0.000 description 2
- 150000004676 glycans Chemical class 0.000 description 2
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- IPCSVZSSVZVIGE-UHFFFAOYSA-M hexadecanoate Chemical compound CCCCCCCCCCCCCCCC([O-])=O IPCSVZSSVZVIGE-UHFFFAOYSA-M 0.000 description 2
- 150000002430 hydrocarbons Chemical group 0.000 description 2
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 229910052816 inorganic phosphate Inorganic materials 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 230000006372 lipid accumulation Effects 0.000 description 2
- 229940115931 listeria monocytogenes Drugs 0.000 description 2
- 125000000346 malonyl group Chemical group C(CC(=O)*)(=O)* 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 150000004667 medium chain fatty acids Chemical class 0.000 description 2
- 102000006240 membrane receptors Human genes 0.000 description 2
- 108020004084 membrane receptors Proteins 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 2
- 238000001823 molecular biology technique Methods 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 230000029553 photosynthesis Effects 0.000 description 2
- 238000010672 photosynthesis Methods 0.000 description 2
- 230000037039 plant physiology Effects 0.000 description 2
- 108010010718 poly(3-hydroxyalkanoic acid) synthase Proteins 0.000 description 2
- 229920000470 poly(p-phenylene terephthalate) polymer Polymers 0.000 description 2
- 229920001282 polysaccharide Polymers 0.000 description 2
- 239000005017 polysaccharide Substances 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 238000004064 recycling Methods 0.000 description 2
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- AWUCVROLDVIAJX-GSVOUGTGSA-N sn-glycerol 3-phosphate Chemical compound OC[C@@H](O)COP(O)(O)=O AWUCVROLDVIAJX-GSVOUGTGSA-N 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 150000003408 sphingolipids Chemical class 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 239000008107 starch Substances 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 125000001424 substituent group Chemical group 0.000 description 2
- 230000009897 systematic effect Effects 0.000 description 2
- 150000003573 thiols Chemical class 0.000 description 2
- 230000001052 transient effect Effects 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 239000001993 wax Substances 0.000 description 2
- IHNKQIMGVNPMTC-UHFFFAOYSA-N (2-hydroxy-3-octadecanoyloxypropyl) 2-(trimethylazaniumyl)ethyl phosphate Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(O)COP([O-])(=O)OCC[N+](C)(C)C IHNKQIMGVNPMTC-UHFFFAOYSA-N 0.000 description 1
- JARGNLJYKBUKSJ-KGZKBUQUSA-N (2r)-2-amino-5-[[(2r)-1-(carboxymethylamino)-3-hydroxy-1-oxopropan-2-yl]amino]-5-oxopentanoic acid;hydrobromide Chemical compound Br.OC(=O)[C@H](N)CCC(=O)N[C@H](CO)C(=O)NCC(O)=O JARGNLJYKBUKSJ-KGZKBUQUSA-N 0.000 description 1
- GHOKWGTUZJEAQD-ZETCQYMHSA-N (D)-(+)-Pantothenic acid Chemical group OCC(C)(C)[C@@H](O)C(=O)NCCC(O)=O GHOKWGTUZJEAQD-ZETCQYMHSA-N 0.000 description 1
- 102100025573 1-alkyl-2-acetylglycerophosphocholine esterase Human genes 0.000 description 1
- WRGQSWVCFNIUNZ-GDCKJWNLSA-N 1-oleoyl-sn-glycerol 3-phosphate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@@H](O)COP(O)(O)=O WRGQSWVCFNIUNZ-GDCKJWNLSA-N 0.000 description 1
- 108050004163 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase delta-1 Proteins 0.000 description 1
- 108050004626 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase delta-3 Proteins 0.000 description 1
- 108050004186 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase delta-4 Proteins 0.000 description 1
- 108050004161 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase epsilon-1 Proteins 0.000 description 1
- 108050004167 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase eta-1 Proteins 0.000 description 1
- 108050004168 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase eta-2 Proteins 0.000 description 1
- 108050004159 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase zeta-1 Proteins 0.000 description 1
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical compound CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- NPMFMWBCAQGTRI-UHFFFAOYSA-N 2-phenyldecanoic acid Chemical compound CCCCCCCCC(C(O)=O)C1=CC=CC=C1 NPMFMWBCAQGTRI-UHFFFAOYSA-N 0.000 description 1
- 108010093803 3-ketoacyl-acyl carrier protein synthase III Proteins 0.000 description 1
- XTWYTFMLZFPYCI-KQYNXXCUSA-N 5'-adenylphosphoric acid Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XTWYTFMLZFPYCI-KQYNXXCUSA-N 0.000 description 1
- 241000204392 Acetonema longum Species 0.000 description 1
- 241000897241 Acinetobacter sp. ADP1 Species 0.000 description 1
- XTWYTFMLZFPYCI-UHFFFAOYSA-N Adenosine diphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(O)=O)C(O)C1O XTWYTFMLZFPYCI-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 241000149144 Anabaenopsis Species 0.000 description 1
- 241000024188 Andala Species 0.000 description 1
- 108020000948 Antisense Oligonucleotides Proteins 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 241000192660 Aphanizomenon Species 0.000 description 1
- 241000192698 Aphanocapsa Species 0.000 description 1
- 241000192705 Aphanothece Species 0.000 description 1
- 101100334284 Arabidopsis thaliana FAR6 gene Proteins 0.000 description 1
- 101001131987 Arabidopsis thaliana Peroxygenase 1 Proteins 0.000 description 1
- 241001495180 Arthrospira Species 0.000 description 1
- 240000002900 Arthrospira platensis Species 0.000 description 1
- 235000016425 Arthrospira platensis Nutrition 0.000 description 1
- 108010024976 Asparaginase Proteins 0.000 description 1
- 241001555080 Aulosira Species 0.000 description 1
- 241000194103 Bacillus pumilus Species 0.000 description 1
- 241000961101 Bacteroides caccae ATCC 43185 Species 0.000 description 1
- 241000962975 Blautia obeum ATCC 29174 Species 0.000 description 1
- 241000589969 Borreliella burgdorferi Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- AFWTZXXDGQBIKW-UHFFFAOYSA-N C14 surfactin Natural products CCCCCCCCCCCC1CC(=O)NC(CCC(O)=O)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)NC(C(C)C)C(=O)NC(CC(O)=O)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)O1 AFWTZXXDGQBIKW-UHFFFAOYSA-N 0.000 description 1
- BHPQYMZQTOCNFJ-UHFFFAOYSA-N Calcium cation Chemical compound [Ca+2] BHPQYMZQTOCNFJ-UHFFFAOYSA-N 0.000 description 1
- 241000589875 Campylobacter jejuni Species 0.000 description 1
- 241000049608 Capsosira Species 0.000 description 1
- 241000010804 Caulobacter vibrioides Species 0.000 description 1
- 241001611009 Chamaesiphon Species 0.000 description 1
- 241000195597 Chlamydomonas reinhardtii Species 0.000 description 1
- 241000447613 Chlorobaculum parvum Species 0.000 description 1
- 241000003118 Chloroflexus aggregans DSM 9485 Species 0.000 description 1
- 241001447757 Chloroherpeton thalassium Species 0.000 description 1
- 241000531074 Chroococcidiopsis Species 0.000 description 1
- 241001219477 Chroococcus Species 0.000 description 1
- 241000048321 Chroogloeocystis Species 0.000 description 1
- 244000060011 Cocos nucifera Species 0.000 description 1
- 235000013162 Cocos nucifera Nutrition 0.000 description 1
- 241001470365 Coelosphaerium Species 0.000 description 1
- 241000264659 Coleodesmium <Cyanoacteria> Species 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241000065719 Crocosphaera Species 0.000 description 1
- 201000007336 Cryptococcosis Diseases 0.000 description 1
- 241000221204 Cryptococcus neoformans Species 0.000 description 1
- 241000219992 Cuphea Species 0.000 description 1
- 241000414116 Cyanobium Species 0.000 description 1
- 241000410565 Cyanodictyon Species 0.000 description 1
- 241000023723 Cyanosarcina Species 0.000 description 1
- 241000380046 Cyanospira Species 0.000 description 1
- 241000565779 Cylindrospermum Species 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- JDMUPRLRUUMCTL-VIFPVBQESA-N D-pantetheine 4'-phosphate Chemical compound OP(=O)(O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS JDMUPRLRUUMCTL-VIFPVBQESA-N 0.000 description 1
- 101710096830 DNA-3-methyladenine glycosylase Proteins 0.000 description 1
- 102100039128 DNA-3-methyladenine glycosylase Human genes 0.000 description 1
- 101710101803 DNA-binding protein J Proteins 0.000 description 1
- 241000721041 Dactylococcopsis Species 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- 241000530784 Dermocarpella Species 0.000 description 1
- 241000255601 Drosophila melanogaster Species 0.000 description 1
- 102000011426 Enoyl-CoA hydratase Human genes 0.000 description 1
- 108010023922 Enoyl-CoA hydratase Proteins 0.000 description 1
- 101710198510 Enoyl-[acyl-carrier-protein] reductase [NADH] Proteins 0.000 description 1
- 241000195619 Euglena gracilis Species 0.000 description 1
- 101150071111 FADD gene Proteins 0.000 description 1
- 101710172176 Fasciclin-1 Proteins 0.000 description 1
- 241000192601 Fischerella Species 0.000 description 1
- 241000956293 Fulda Species 0.000 description 1
- 241000892911 Geitlerinema Species 0.000 description 1
- 206010071602 Genetic polymorphism Diseases 0.000 description 1
- 241000626621 Geobacillus Species 0.000 description 1
- 241001134702 Gloeothece Species 0.000 description 1
- 102000030595 Glucokinase Human genes 0.000 description 1
- 108010021582 Glucokinase Proteins 0.000 description 1
- 101710090186 Glucose-1-phosphate adenylyltransferase large subunit Proteins 0.000 description 1
- 101710095003 Glucose-1-phosphate adenylyltransferase small subunit Proteins 0.000 description 1
- 102000005731 Glucose-6-phosphate isomerase Human genes 0.000 description 1
- 108010070600 Glucose-6-phosphate isomerase Proteins 0.000 description 1
- 102000000340 Glucosyltransferases Human genes 0.000 description 1
- 108010055629 Glucosyltransferases Proteins 0.000 description 1
- 108010070675 Glutathione transferase Proteins 0.000 description 1
- 102000005720 Glutathione transferase Human genes 0.000 description 1
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 1
- 241000045411 Hahella chejuensis Species 0.000 description 1
- 241000681677 Halomicronema Species 0.000 description 1
- 241000549847 Halospirulina Species 0.000 description 1
- 241000605178 Halothiobacillus neapolitanus Species 0.000 description 1
- 241000350665 Hapalosiphon Species 0.000 description 1
- 241001453258 Helicobacter hepaticus Species 0.000 description 1
- 101710154606 Hemagglutinin Proteins 0.000 description 1
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 1
- 102100036284 Hepcidin Human genes 0.000 description 1
- SQUHHTBVTRBESD-UHFFFAOYSA-N Hexa-Ac-myo-Inositol Natural products CC(=O)OC1C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C1OC(C)=O SQUHHTBVTRBESD-UHFFFAOYSA-N 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 108010072039 Histidine kinase Proteins 0.000 description 1
- 101001021253 Homo sapiens Hepcidin Proteins 0.000 description 1
- 101000599573 Homo sapiens InaD-like protein Proteins 0.000 description 1
- 101001017254 Homo sapiens Myb-binding protein 1A Proteins 0.000 description 1
- 101001129178 Homo sapiens Patatin-like phospholipase domain-containing protein 6 Proteins 0.000 description 1
- 102000004867 Hydro-Lyases Human genes 0.000 description 1
- 108090001042 Hydro-Lyases Proteins 0.000 description 1
- 102100037978 InaD-like protein Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 108010028688 Isoamylase Proteins 0.000 description 1
- 241000378111 Katagnymene Species 0.000 description 1
- 125000000998 L-alanino group Chemical group [H]N([*])[C@](C([H])([H])[H])([H])C(=O)O[H] 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 1
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 description 1
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 235000017858 Laurus nobilis Nutrition 0.000 description 1
- 241000589242 Legionella pneumophila Species 0.000 description 1
- 241000215457 Leptolyngbya Species 0.000 description 1
- 241001134698 Lyngbya Species 0.000 description 1
- 241001344131 Magnaporthe grisea Species 0.000 description 1
- 241001657388 Magnetospirillum magneticum Species 0.000 description 1
- OFOBLEOULBTSOW-UHFFFAOYSA-L Malonate Chemical compound [O-]C(=O)CC([O-])=O OFOBLEOULBTSOW-UHFFFAOYSA-L 0.000 description 1
- OFOBLEOULBTSOW-UHFFFAOYSA-N Malonic acid Chemical compound OC(=O)CC(O)=O OFOBLEOULBTSOW-UHFFFAOYSA-N 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 241000206589 Marinobacter Species 0.000 description 1
- 241000285148 Marinobacter adhaerens HP15 Species 0.000 description 1
- 241000426386 Marinobacter algicola DG893 Species 0.000 description 1
- 241000501784 Marinobacter sp. Species 0.000 description 1
- 241000243405 Mastigocladopsis Species 0.000 description 1
- 241000520876 Merismopedia Species 0.000 description 1
- 241000179980 Microcoleus Species 0.000 description 1
- 241000192701 Microcystis Species 0.000 description 1
- 241000192710 Microcystis aeruginosa Species 0.000 description 1
- 108010093369 Multienzyme Complexes Proteins 0.000 description 1
- 102000002568 Multienzyme Complexes Human genes 0.000 description 1
- 108700005084 Multigene Family Proteins 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 241001646722 Mycobacterium tuberculosis CDC1551 Species 0.000 description 1
- 241000511380 Myxosarcina Species 0.000 description 1
- 241001148162 Nitrobacter sp. Species 0.000 description 1
- 241000059630 Nodularia <Cyanobacteria> Species 0.000 description 1
- 241000192656 Nostoc Species 0.000 description 1
- 241000424623 Nostoc punctiforme Species 0.000 description 1
- 241000894763 Nostoc punctiforme PCC 73102 Species 0.000 description 1
- 241000243387 Nostochopsis Species 0.000 description 1
- 108091005461 Nucleic proteins Chemical group 0.000 description 1
- 108090000119 Nucleotidyltransferases Proteins 0.000 description 1
- 102000003832 Nucleotidyltransferases Human genes 0.000 description 1
- 241001272808 Oceanicola granulosus HTCC2516 Species 0.000 description 1
- 241000192497 Oscillatoria Species 0.000 description 1
- 241000192494 Oscillatoriales Species 0.000 description 1
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 1
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 1
- 101710116435 Outer membrane protein Proteins 0.000 description 1
- 101150073131 PAP gene Proteins 0.000 description 1
- 241000192608 Phormidium Species 0.000 description 1
- 108010044302 Phosphoinositide phospholipase C Proteins 0.000 description 1
- 108090000553 Phospholipase D Proteins 0.000 description 1
- 102000011420 Phospholipase D Human genes 0.000 description 1
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 1
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 1
- 102000004861 Phosphoric Diester Hydrolases Human genes 0.000 description 1
- 108090001050 Phosphoric Diester Hydrolases Proteins 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- 241001208362 Photobacterium profundum SS9 Species 0.000 description 1
- 241000913090 Planktothricoides Species 0.000 description 1
- 241000530769 Planktothrix Species 0.000 description 1
- 241000192665 Plectonema Species 0.000 description 1
- 241000179979 Pleurocapsa Species 0.000 description 1
- 241000511381 Pleurocapsales Species 0.000 description 1
- 241000192135 Prochloraceae Species 0.000 description 1
- 241000192138 Prochlorococcus Species 0.000 description 1
- 241000192141 Prochloron Species 0.000 description 1
- 241000192144 Prochlorothrix Species 0.000 description 1
- 101710176177 Protein A56 Proteins 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 241000192511 Pseudanabaena Species 0.000 description 1
- 241000006062 Pseudoalteromonas tunicata D2 Species 0.000 description 1
- 241001348364 Pseudooceanicola batsensis HTCC2597 Species 0.000 description 1
- 241000410561 Radiocystis Species 0.000 description 1
- 241000700157 Rattus norvegicus Species 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 241000049606 Rexia Species 0.000 description 1
- 241000051643 Rhabdoderma Species 0.000 description 1
- 241000900326 Rhizobium leguminosarum bv. trifolii WSM1325 Species 0.000 description 1
- 241000191043 Rhodobacter sphaeroides Species 0.000 description 1
- 241000391803 Richelia Species 0.000 description 1
- 241000598149 Roseobacter sp. Species 0.000 description 1
- 241001653978 Roseovarius sp. Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 241000235346 Schizosaccharomyces Species 0.000 description 1
- 241000970913 Schizothrix Species 0.000 description 1
- 241000192120 Scytonema Species 0.000 description 1
- 102000009105 Short Chain Dehydrogenase-Reductases Human genes 0.000 description 1
- 108010048287 Short Chain Dehydrogenase-Reductases Proteins 0.000 description 1
- 241000221095 Simmondsia Species 0.000 description 1
- 244000044822 Simmondsia californica Species 0.000 description 1
- 235000004433 Simmondsia californica Nutrition 0.000 description 1
- 241000794718 Snowella Species 0.000 description 1
- 108010039811 Starch synthase Proteins 0.000 description 1
- 241000122973 Stenotrophomonas maltophilia Species 0.000 description 1
- 229930182558 Sterol Natural products 0.000 description 1
- 241000243446 Stigonema Species 0.000 description 1
- 241000193998 Streptococcus pneumoniae Species 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- 239000005864 Sulphur Substances 0.000 description 1
- 241000243385 Symphyonema Species 0.000 description 1
- 241001512067 Symploca Species 0.000 description 1
- 241000158777 Synechococcus bigranulatus Species 0.000 description 1
- 241001453317 Synechococcus leopoliensis Species 0.000 description 1
- 241001464792 Synechococcus lividus Species 0.000 description 1
- 241001136511 Synechococcus rubescens Species 0.000 description 1
- 241001660016 Synechococcus sp. WH 8102 Species 0.000 description 1
- 241001504076 Thermosynechococcus elongatus BP-1 Species 0.000 description 1
- 241000204666 Thermotoga maritima Species 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 241000015793 Treponema primitia Species 0.000 description 1
- 241000192118 Trichodesmium Species 0.000 description 1
- 241000192117 Trichodesmium erythraeum Species 0.000 description 1
- 241001170687 Trichodesmium erythraeum IMS101 Species 0.000 description 1
- 241000078013 Trichormus variabilis Species 0.000 description 1
- 241000209140 Triticum Species 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 241000530641 Tychonema Species 0.000 description 1
- 108010089879 Type I Fatty Acid Synthase Proteins 0.000 description 1
- 102000008024 Type I Fatty Acid Synthase Human genes 0.000 description 1
- 108010018022 Type II Fatty Acid Synthase Proteins 0.000 description 1
- 241001106462 Ulmus Species 0.000 description 1
- 241001520827 Umezakia Species 0.000 description 1
- ZVNYJIZDIRKMBF-UHFFFAOYSA-N Vesnarinone Chemical compound C1=C(OC)C(OC)=CC=C1C(=O)N1CCN(C=2C=C3CCC(=O)NC3=CC=2)CC1 ZVNYJIZDIRKMBF-UHFFFAOYSA-N 0.000 description 1
- 241001272841 Vibrio alginolyticus 12G01 Species 0.000 description 1
- 241000607618 Vibrio harveyi Species 0.000 description 1
- 241000607284 Vibrio sp. Species 0.000 description 1
- 241000243390 Westiellopsis Species 0.000 description 1
- 241000794698 Woronichinia Species 0.000 description 1
- 241000511385 Xenococcus Species 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- MMWCIQZXVOZEGG-HOZKJCLWSA-N [(1S,2R,3S,4S,5R,6S)-2,3,5-trihydroxy-4,6-diphosphonooxycyclohexyl] dihydrogen phosphate Chemical compound O[C@H]1[C@@H](O)[C@H](OP(O)(O)=O)[C@@H](OP(O)(O)=O)[C@H](O)[C@H]1OP(O)(O)=O MMWCIQZXVOZEGG-HOZKJCLWSA-N 0.000 description 1
- ATBOMIWRCZXYSZ-XZBBILGWSA-N [1-[2,3-dihydroxypropoxy(hydroxy)phosphoryl]oxy-3-hexadecanoyloxypropan-2-yl] (9e,12e)-octadeca-9,12-dienoate Chemical compound CCCCCCCCCCCCCCCC(=O)OCC(COP(O)(=O)OCC(O)CO)OC(=O)CCCCCCC\C=C\C\C=C\CCCCC ATBOMIWRCZXYSZ-XZBBILGWSA-N 0.000 description 1
- 102000045404 acyltransferase activity proteins Human genes 0.000 description 1
- 108700014220 acyltransferase activity proteins Proteins 0.000 description 1
- 108060000200 adenylate cyclase Proteins 0.000 description 1
- 102000030621 adenylate cyclase Human genes 0.000 description 1
- 210000001789 adipocyte Anatomy 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 230000008848 allosteric regulation Effects 0.000 description 1
- 239000002776 alpha toxin Substances 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000000074 antisense oligonucleotide Substances 0.000 description 1
- 238000012230 antisense oligonucleotides Methods 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 229940114079 arachidonic acid Drugs 0.000 description 1
- 235000021342 arachidonic acid Nutrition 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-L aspartate group Chemical group N[C@@H](CC(=O)[O-])C(=O)[O-] CKLJMWTZIZZHCS-REOHCLBHSA-L 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 239000003225 biodiesel Substances 0.000 description 1
- 230000008436 biogenesis Effects 0.000 description 1
- 235000008429 bread Nutrition 0.000 description 1
- 229910001424 calcium ion Inorganic materials 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 239000001569 carbon dioxide Substances 0.000 description 1
- 235000021466 carotenoid Nutrition 0.000 description 1
- 150000001747 carotenoids Chemical class 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000036978 cell physiology Effects 0.000 description 1
- 230000003833 cell viability Effects 0.000 description 1
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 description 1
- 229930002868 chlorophyll a Natural products 0.000 description 1
- SUHOQUVVVLNYQR-MRVPVSSYSA-N choline alfoscerate Chemical compound C[N+](C)(C)CCOP([O-])(=O)OC[C@H](O)CO SUHOQUVVVLNYQR-MRVPVSSYSA-N 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000004581 coalescence Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 229940000425 combination drug Drugs 0.000 description 1
- 230000002301 combined effect Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000006854 communication Effects 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 230000000536 complexating effect Effects 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 239000002537 cosmetic Substances 0.000 description 1
- 230000006324 decarbonylation Effects 0.000 description 1
- 238000006606 decarbonylation reaction Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- 230000003292 diminished effect Effects 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000006353 environmental stress Effects 0.000 description 1
- 230000032050 esterification Effects 0.000 description 1
- 238000005886 esterification reaction Methods 0.000 description 1
- 108010083294 ethanol acyltransferase Proteins 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 235000019197 fats Nutrition 0.000 description 1
- 239000012847 fine chemical Substances 0.000 description 1
- 238000000799 fluorescence microscopy Methods 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 108010044804 gamma-glutamyl-seryl-glycine Proteins 0.000 description 1
- 238000004817 gas chromatography Methods 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- 125000005456 glyceride group Chemical group 0.000 description 1
- 108010054790 glycerol-3-phosphate oxidase Proteins 0.000 description 1
- 150000002313 glycerolipids Chemical class 0.000 description 1
- 229960004956 glycerylphosphorylcholine Drugs 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 239000000185 hemagglutinin Substances 0.000 description 1
- 229920000669 heparin Polymers 0.000 description 1
- 229960002897 heparin Drugs 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 108090001018 hexadecanal dehydrogenase (acylating) Proteins 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 229920001519 homopolymer Polymers 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 235000003642 hunger Nutrition 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000005660 hydrophilic surface Effects 0.000 description 1
- 230000005661 hydrophobic surface Effects 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 239000000976 ink Substances 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 229960000367 inositol Drugs 0.000 description 1
- CDAISMWEOUEBRE-GPIVLXJGSA-N inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000016507 interphase Effects 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 125000000468 ketone group Chemical group 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 229940115932 legionella pneumophila Drugs 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 108020001756 ligand binding domains Proteins 0.000 description 1
- 230000037356 lipid metabolism Effects 0.000 description 1
- 230000033001 locomotion Effects 0.000 description 1
- 150000004668 long chain fatty acids Chemical class 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 108010093769 lysophospholipase-transacylase Proteins 0.000 description 1
- 235000021073 macronutrients Nutrition 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- LTYOQGRJFJAKNA-VFLPNFFSSA-N malonyl-coa Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-VFLPNFFSSA-N 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 241001044666 marine gamma proteobacterium HTCC2080 Species 0.000 description 1
- 241001044661 marine gamma proteobacterium HTCC2143 Species 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 239000011785 micronutrient Substances 0.000 description 1
- 235000013369 micronutrients Nutrition 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 150000002759 monoacylglycerols Chemical class 0.000 description 1
- 102000035085 multipass transmembrane proteins Human genes 0.000 description 1
- 108091005494 multipass transmembrane proteins Proteins 0.000 description 1
- 108010070612 neurotoxic esterase Proteins 0.000 description 1
- VOFUROIFQGPCGE-UHFFFAOYSA-N nile red Chemical compound C1=CC=C2C3=NC4=CC=C(N(CC)CC)C=C4OC3=CC(=O)C2=C1 VOFUROIFQGPCGE-UHFFFAOYSA-N 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- XDUHQPOXLUAVEE-BPMMELMSSA-N oleoyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CCCCCCC\C=C/CCCCCCCC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 XDUHQPOXLUAVEE-BPMMELMSSA-N 0.000 description 1
- 238000006384 oligomerization reaction Methods 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 230000006320 pegylation Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 108010033574 phasin Proteins 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 125000001095 phosphatidyl group Chemical group 0.000 description 1
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000019525 primary metabolic process Effects 0.000 description 1
- 238000007639 printing Methods 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 238000009790 rate-determining step (RDS) Methods 0.000 description 1
- 239000003642 reactive oxygen metabolite Substances 0.000 description 1
- 230000008844 regulatory mechanism Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 238000007423 screening assay Methods 0.000 description 1
- CDAISMWEOUEBRE-UHFFFAOYSA-N scyllo-inosotol Natural products OC1C(O)C(O)C(O)C(O)C1O CDAISMWEOUEBRE-UHFFFAOYSA-N 0.000 description 1
- 125000000467 secondary amino group Chemical group [H]N([*:1])[*:2] 0.000 description 1
- 230000024053 secondary metabolic process Effects 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 102000035087 single-pass transmembrane proteins Human genes 0.000 description 1
- 108091005496 single-pass transmembrane proteins Proteins 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 1
- 229960000268 spectinomycin Drugs 0.000 description 1
- 230000004137 sphingolipid metabolism Effects 0.000 description 1
- 229940082787 spirulina Drugs 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 230000037351 starvation Effects 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 150000003432 sterols Chemical class 0.000 description 1
- 235000003702 sterols Nutrition 0.000 description 1
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 108020001568 subdomains Proteins 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- NJGWOFRZMQRKHT-UHFFFAOYSA-N surfactin Natural products CC(C)CCCCCCCCCC1CC(=O)NC(CCC(O)=O)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)NC(C(C)C)C(=O)NC(CC(O)=O)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)O1 NJGWOFRZMQRKHT-UHFFFAOYSA-N 0.000 description 1
- NJGWOFRZMQRKHT-WGVNQGGSSA-N surfactin C Chemical compound CC(C)CCCCCCCCC[C@@H]1CC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)O1 NJGWOFRZMQRKHT-WGVNQGGSSA-N 0.000 description 1
- 101150087812 tesA gene Proteins 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 238000009966 trimming Methods 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 150000004669 very long chain fatty acids Chemical class 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/20—Bacteria; Culture media therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
- C12N9/1029—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y203/00—Acyltransferases (2.3)
- C12Y203/01—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
- C12Y203/0102—Diacylglycerol O-acyltransferase (2.3.1.20)
Definitions
- Certain organisms can be utilized as a source of oil in the production of biofuels.
- algae naturally produce oil such as triglycerides as energy storage molecules, and certain biofuel-related technologies are presently focused on the use of algae as a feedstock for biofuels.
- Algae are photosynthetic organisms, and the use of triglyceride-producing organisms such as algae provides the ability to produce biodiesel from sunlight, water, C02, macronutrients, and micronutrients. Algae, however, cannot be readily genetically manipulated, and produce much less oil (i.e., triglycerides) under culture conditions than in the wild.
- Cyanobacteria obtain energy from photosynthesis, utilizing chlorophyll A and water to reduce C02. Certain Cyanobacteria can produce metabolites, such as carbohydrates, proteins, and fatty acids, from just sunlight, water C02, water, and inorganic salts. Unlike algae, Cyanobacteria can be genetically manipulated. For example, S. elongatus PCC 7942 (hereafter refered to as "5. elongatus PCC 7942”) is a genetically manipulate, oligotrophic Cyanobacterium that thrives in low nutrient level conditions, and in the wild accumulates fatty acids in the form of lipid membranes to about 4 to 8 % by dry weight. Cyanobacteria such as Synechococcus, however, produce no triglyceride energy storage molecules, since Cyanobacteria typically lack the essential enzymes involved in triglyceride synthesis.
- S. elongatus PCC 7942 (hereafter refered to as "5.
- modified photosynthetic microorganisms including Cyanobacteria, capable of producing oil with high efficiency, such as triglycerides, e.g., to be used as feedstock in the production of biofuels and/or various specialty chemicals.
- Embodiments of the present invention relate to the demonstration that photosynthetic microorganisms, including Cyanobacteria, can be genetically modified to increase fatty acid biosynthesis, and to produce neutral lipids and lipid packaging proteins.
- the modified photosynthetic microorganisms, e.g., Cyanobacteria, of the present invention comprise one or more polynucleotides encoding one or more enzymes associated with neutral lipid synthesis and lipid packaging protein.
- Embodiments concern a modified Cyanobacterium comprising an exogenous polynucleotide encoding a diacylglycerol acyltransferase (hereinafter referred to as "DGAT”) and at least one exogenous polynucleotide encoding at least one lipid packaging protein.
- DGAT diacylglycerol acyltransferase
- a method for producing a neutral lipid comprising culturing a Cyanobacterium comprising an exogenous polynucleotide encoding a DGAT and at least one exogenous polynucleotide encoding at least one lipid packaging protein to produce a neutral lipid.
- Still other embodiments concern a method for producing a modified Cyanobacterium, comprising modifying a Cyanobacterium by introducing into the Cyanobacterium an exogenous polynucleotide encoding a DGAT and at least one exogenous polynucleotide encoding at least one lipid packaging protein, wherein said Cyanobacterium produces a neutral lipid.
- the present invention relates, in part, to the demonstration that photosynthetic organisms, including but not limited to Cyanobacteria, such as Synechococcus, which do not naturally produce neutral lipids can be genetically modified to synthesize neutral lipids, such as triglycerides, and to produce lipid packaging proteins which encapsulate the produced neutral lipids and which increase the size and/or number of the neutral lipid bodies.
- photosynthetic organisms including but not limited to Cyanobacteria, such as Synechococcus, which do not naturally produce neutral lipids can be genetically modified to synthesize neutral lipids, such as triglycerides, and to produce lipid packaging proteins which encapsulate the produced neutral lipids and which increase the size and/or number of the neutral lipid bodies.
- Examples of enzymes associated with neutral lipid synthesis include enzymes having a phosphatidate phosphatase activity and enzymes having a DGAT activity .
- phosphatidate phosphatase enzymes catalyze the production of diacylglycerol molecules, an immediate pre-cursor to triglycerides, and DGAT enzymes catalyze the final step of triglyceride synthesis by converting the diacylglycerol precursors to triglycerides.
- an element means one element or more than one element.
- the term "about” means a q uantity, level, value, num ber, frequency, percentage, dimension, size, amount, weight or length that varies by as much as 30, 25, 20, 25, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 % to a reference quantity, level, value, number, frequency, percentage, dimension, size, amount, weight or length.
- biologically active fragment refers to a fragment that has at least about 0.1, 0.5, 1, 2, 5, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 96, 97, 98, 99, 100, 110, 120, 150, 200, 300, 400, 500, 600, 700, 800, 900, 1000% or more of the activity of a reference sequence.
- reference sequence refers generally to a nucleic acid coding sequence, or amino acid sequence, of any enzyme having a DGAT activity, a phosphatidate phosphatase activity, and/or an acetyl-CoA carboxylase activity, as described herein ⁇ see, e.g., SEQ ID NOS:l-9).
- I ncluded within the scope of the present invention are biologically active fragments of at least about 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 40, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, 200, 220, 240, 260, 280, 300, 320, 340, 360, 380, 400, 500, 600 or more contiguous nucleotides or amino acid residues in length, including all integers in between, which comprise or encode a polypeptide having an enzymatic activity of a reference polynucleotide or polypeptide.
- Representative biologically active fragments generally participate in an interaction, e.g., an intra-molecular or an inter-molecular interaction.
- An inter-molecular interaction can be a specific binding interaction or an enzymatic interaction.
- enzymatic interactions or activities include diacylglycerol acyltransferase activity, phosphatidate phosphatase activity, and/or acetyl-CoA carboxylase activity, as described herein.
- Coding seq uence means any nucleic acid sequence that contributes to the code for the polypeptide product of a gene.
- non-coding sequence refers to any nucleic acid sequence that does not contribute to the code for the polypeptide product of a gene.
- complementarity refers to polynucleotides (i.e., a sequence of nucleotides) related by the base-pairing rules.
- the seq uence “A-G-T” is complementary to the sequence “T-C-A.”
- Complementarity may be “partial,” in which only some of the nucleic acids' bases are matched according to the base pairing rules. Or, there may be “complete” or “total” complementarity between the nucleic acids. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands.
- Derivative means a polypeptide that has been derived from the basic seq uence by modification, for example by conjugation or complexing with other chemical moieties (e.g., pegylation) or by post-translational modification techniq ues as would be understood in the art.
- derivative also includes within its scope alterations that have been made to a parent sequence including additions or deletions that provide for functionally eq uivalent molecules.
- Enzyme reactive conditions means that any necessary conditions are available in an environment (i.e., such factors as temperature, pH, and lack of inhibiting substances) which will permit the enzyme to function. Enzyme reactive conditions can be either in vitro, such as in a test tube, or in vivo, as within a cell.
- Gene is means a unit of inheritance that occupies a specific locus on a chromosome and consists of transcriptional and/or translational regulatory sequences and/or a coding region and/or non-translated sequences (i.e., introns, 5' and 3' untranslated sequences).
- host cell includes an individual cell or cell culture which can be or has been a recipient of any recombinant vector(s) or isolated polynucleotide of the invention.
- Host cells include progeny of a single host cell, and the progeny may not necessarily be completely identical (in morphology or in total DNA complement) to the original parent cell due to natural, accidental, or deliberate mutation and/or change.
- a host cell includes cells transfected or infected in vivo or in vitro with a recombinant vector or a polynucleotide of the invention.
- a host cell which comprises a recombinant vector of the invention is a recombinant host cell.
- Isolated means material that is substantially or essentially free from components that normally accompany it in its native state.
- an "isolated polynucleotide”, as used herein, refers to a polynucleotide, which has been purified from the seq uences which flank it in a naturally-occurring state, e.g., a DNA fragment which has been removed from the sequences that are normally adjacent to the fragment.
- an "isolated peptide” or an “isolated polypeptide” and the like, as used herein refer to in vitro isolation and/or purification of a peptide or polypeptide molecule from its natural cellular environment, and from association with other components of the cell.
- “Increased” or “increasing” means the ability of one or more modified photosynthetic microorganisms, e.g., Cyanobacteria, to produce a greater amount of a given fatty acid, lipid molecule, or triglyceride as compared to a control Cyanobacteria, such as an unmodified Cyanobacteria or a differently modified Cyanobacteria.
- Production of fatty acids can be measured according to techniq ues known in the art, such as Nile Red staining and gas chromatography.
- Production of triglycerides can be measured, for example, using commercially available enzymatic tests, including colorimetric enzymatic tests using glycerol-3-phosphate-oxidase.
- neutral lipid means any lipid that is soluble only in solvents of very low polarity.
- Neutral lipids are divided into two main groups: (1) acylglycerols (glycerides), i.e. fatty-acid esters of glycerol; and (2) waxes, i.e. fatty-acid esters of long-chain monohydroxy alcohols.
- the neutral lipid may include a fatty acid ester arising from the joining of the fatty acid carboxyl group to a hydroxyl group to make an ester bond. This can occur, for example, between a fatty acid and glycerol to make mono-, di- and triglycerides, or between a fatty acid and an alcohol to make a wax ester.
- Obtained from means that a sample such as, for example, a polynucleotide extract or polypeptide extract is isolated from, or derived from, a particular source, such as a desired organism or a specific tissue within a desired orga nism. "Obtained from” can also refer to the situation in which a polynucleotide or polypeptide sequence is isolated from, or derived from, a particular organism or tissue within an organism.
- operably linked means placing a gene under the regulatory control of a promoter, which then controls the transcription and optionally the translation of the gene. I n the construction of heterologous promoter/structural gene combinations, it is generally preferred to position the genetic sequence or promoter at a distance from the gene transcription start site that is approximately the same as the distance between that genetic sequence or promoter and the gene it controls in its natural setting; i.e. the gene from which the genetic sequence or promoter is derived. As is known in the art, some variation in this distance can be accommodated without loss of function.
- a regulatory sequence element with respect to a heterologous gene to be placed under its control is defined by the positioning of the element in its natural setting; i.e., the genes from which it is derived.
- Constutive promoters are typically active, i.e., promote transcription, under most conditions.
- Inducible promoters are typically active only under certain conditions, such as in the presence of a given molecule factor (e.g., IPTG) or a given environmental condition (e.g., particular C0 2 concentration, nutrient levels, light, heat). I n the absence of that condition, inducible promoters typically do not allow significant or measurable levels of transcriptional activity.
- inducible promoters may be induced according to temperature, pH, a hormone, a metabolite (e.g., lactose, mannitol, an amino acid), light (e.g., wavelength specific), osmotic potential (e.g., salt induced), a heavy metal, or an antibiotic.
- a hormone e.g., lactose, mannitol, an amino acid
- light e.g., wavelength specific
- osmotic potential e.g., salt induced
- polynucleotide or “nucleic acid” as used herein designates mRNA, RNA, cRNA, rRNA, cDNA or DNA.
- the term typically refers to polymeric form of nucleotides of at least 10 bases in length, either ribonucleotides or deoxynucleotides or a modified form of either type of nucleotide.
- the term includes single and double stranded forms of DNA.
- polynucleotide variant and “variant” and the like refer to polynucleotides displaying substantial seq uence identity with a reference polynucleotide sequence or polynucleotides that hybridize with a reference seq uence under stringent conditions that are defined hereinafter. These terms also encompass polynucleotides that are distinguished from a reference polynucleotide by the addition, deletion or substitution of at least one nucleotide. Accordingly, the terms “polynucleotide variant” and “variant” include polynucleotides in which one or more nucleotides have been added or deleted, or replaced with different nucleotides.
- Polynucleotide variants include, for example, polynucleotides having at least 50% (and at least 51% to at least 99% and all integer percentages in between) sequence identity with a reference polynucleotide sequence that encodes a diacylglycerol acyltransferase, a phosphatidate phosphatase, and/or an acetyl-CoA carboxylase enzyme.
- polynucleotide variant and “variant” also include naturally-occurring allelic variants and orthologs that encode these enzymes.
- exogenous refers to a polynucleotide seq uence that does not naturally occur in a wild-type cell or organism, but is typically introduced into the cell by molecular biological techniq ues.
- exogenous polynucleotides include vectors, plasmids, and/or man-made nucleic acid constructs encoding a desired protein.
- endogenous or “native” refers to naturally occurring polynucleotide sequences that may be found in a given wild-type cell or organism.
- certain cyanobacterial species do not typically contain a DGAT gene, and, therefore, do not comprise an "endogenous" polynucleotide sequence that encodes a DGAT polypeptide.
- a particular polynucleotide sequence that is isolated from a first organism and transferred to second organism by molecular biological techniques is typically considered an "exogenous" polynucleotide with respect to the second organism.
- Polypeptide “polypeptide fragment,” “peptide” and “protein” are used interchangeably herein to refer to a polymer of amino acid residues and to variants and synthetic analogues of the same. Thus, these terms apply to amino acid polymers in which one or more amino acid residues are synthetic non-naturally occurring amino acids, such as a chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally-occurring amino acid polymers.
- polypeptides may include enzymatic polypeptides, or "enzymes,” which typically catalyze (i.e., increase the rate of) various chemical reactions.
- a polypeptide "variant" refers to polypeptides that are distinguished from a reference polypeptide seq uence by the addition, deletion or substitution of at least one amino acid residue.
- a polypeptide variant is distinguished from a reference polypeptide by one or more substitutions, which may be conservative or non-conservative.
- the polypeptide variant comprises conservative substitutions and, in this regard, it is well understood in the art that some amino acids may be changed to others with broadly similar properties without changing the nature of the activity of the polypeptide.
- Polypeptide variants also encompass polypeptides in which one or more amino acids have been added or deleted, or replaced with different amino acid residues.
- lipid packaging protein or “lipid body packaging protein” refers to structural components of intracellular lipid globules, or proteins that interact with or are evolutionarily related to the components of intracellular lipid globules, which encapsulate lipid and/or neutral lipids within the host cell.
- lipid packaging protein include Caleosin and Oleosin Proteins.
- the present invention contemplates the use in the methods described herein of variants of full-length enzymes having DGAT activity, phosphatidate phosphatase activity, and/or acetyl-CoA carboxylase activity, truncated fragments of these full-length polypeptides, variants of truncated fragments, as well as their related biologically active fragments.
- biologically active fragments of a polypeptide may participate in an interaction, for example, an intra-molecular or an inter-molecular interaction.
- An inter-molecular interaction can be a specific binding interaction or an enzymatic interaction (e.g., the interaction can be transient and a covalent bond is formed or broken).
- Biologically active fragments of a polypeptide/enzyme having a diacylglycerol acyltransferase activity, a phosphatidate phosphatase activity, and/or acetyl-CoA carboxylase activity include peptides comprising amino acid sequences sufficiently similar to, or derived from, the amino acid sequences of a (putative) full-length reference polypeptide sequence.
- biologically active fragments comprise a domain or motif with at least one activity of a DGAT, phosphatidate phosphatase polypeptide, and/or acetyl-coA carboxylase polypeptide, and may include one or more (and in some cases all) of the various active domains.
- a biologically active fragment of a DGAT, phosphatidate phosphatase, and/or acetyl-CoA carboxylase polypeptide can be a polypeptide fragment which is, for example, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29,30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 220, 240, 260, 280, 300, 320, 340, 360, 380, 400, 450, 500, 600 or more contiguous amino acids, including all integers in between, of a reference polypeptide sequence.
- a biologically active fragment comprises a conserved enzymatic sequence, domain, or motif, as described elsewhere herein and known in the art.
- the biologically-active fragment has no less than about 1%, 10%, 25%, 50% of an activity of the wild-type polypeptide from which it is derived.
- a “reference sequence,” as used herein, refers to a wild-type polynucleotide or polypeptide sequence from any organism, e.g., wherein the polynucleotide encodes a polypeptide having an acyl-ACP reductase, as described herein and known in the art.
- polypeptide "reference sequences” are provided herein, including the polynucleotide and polypepetide sequences of an acyl-ACP reductase of Synechococcus elongatus PCC7942 (see SEQ ID NOs:l and 2 for the polynucleotide and polypeptide sequences, respectively) and an acyl-ACP reductase of Synechocystis sp. PCC6803 (SEQ ID NOs:3 and 4 for the polynucleotide and polypeptide sequences, respectively), among others known to a person skilled in the art.
- sequence identity or, for example, comprising a “seq uence 50% identical to,” as used herein, refer to the extent that sequences are identical on a nucleotide-by-nucleotide basis or an amino acid-by-amino acid basis over a window of comparison.
- a "percentage of sequence identity” may be calculated by comparing two optimally aligned sequences over the window of comparison, determining the number of positions at which the identical nucleic acid base (e.g., A, T, C, G, I) or the identical amino acid residue (e.g., Ala, Pro, Ser, Thr, Gly, Val, Leu, lie, Phe, Tyr, Trp, Lys, Arg, His, Asp, Glu, Asn, Gin, Cys and Met) occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison (i.e., the window size), and multiplying the result by 100 to yield the percentage of sequence identity.
- the identical nucleic acid base e.g., A, T, C, G, I
- the identical amino acid residue e.g., Ala, Pro, Ser, Thr, Gly, Val, Leu, lie, Phe, Tyr, Trp, Lys, Arg,
- references to describe seq uence relationships between two or more polynucleotides or polypeptides include “reference seq uence”, “comparison window”, “seq uence identity”, “percentage of sequence identity” and “substantial identity”.
- a “reference sequence” is at least 12 but frequently 15 to 18 and often at least 25 monomer units, inclusive of nucleotides and amino acid residues, in length.
- two polynucleotides may each comprise (1) a sequence (i.e., only a portion of the complete polynucleotide seq uence) that is similar between the two polynucleotides, and (2) a seq uence that is divergent between the two polynucleotides
- sequence comparisons between two (or more) polynucleotides are typically performed by comparing sequences of the two polynucleotides over a "comparison window" to identify and compare local regions of seq uence similarity.
- a “comparison window” refers to a conceptual segment of at least 6 contiguous positions, usually about 50 to about 100, more usually about 100 to about 150 in which a sequence is compared to a reference seq uence of the same number of contiguous positions after the two seq uences are optimally aligned.
- the comparison window may comprise additions or deletions (i.e., gaps) of about 20% or less as compared to the reference seq uence (which does not comprise additions or deletions) for optimal alignment of the two sequences.
- Optimal alignment of sequences for aligning a comparison window may be conducted by computerized implementations of algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package Release 7.0, Genetics Computer Group, 575 Science Drive Madison, Wl, USA) or by inspection and the best alignment (i.e., resulting in the highest percentage homology over the comparison window) generated by any of the various methods selected.
- GAP Garnier et al.
- BESTFIT Pearson FASTA
- FASTA Pearson's Alignment of sequences
- TFASTA Pearson's Alignment of Altschul et al.
- a detailed discussion of sequence analysis can be found in Unit 19.3 of Ausubel et al., "Current Protocols in Molecular Biology", John Wiley & Sons I nc, 1994-1998, Chapter 15.
- Transformation refers to the permanent, heritable alteration in a cell resulting from the uptake and incorporation of foreign DNA into the host-cell genome; also, the transfer of an exogenous gene from one organism into the genome of another organism.
- Vector means a polynucleotide molecule, preferably a DNA molecule derived, for example, from a plasmid, bacteriophage, yeast or virus, into which a polynucleotide can be inserted or cloned.
- a vector preferably contains one or more unique restriction sites and can be capable of autonomous replication in a defined host cell including a target cell or tissue or a progenitor cell or tissue thereof, or be integrable with the genome of the defined host such that the cloned sequence is reproducible.
- the vector can be an autonomously replicating vector, i.e., a vector that exists as an extra-chromosomal entity, the replication of which is independent of chromosomal replication, e.g., a linear or closed circular plasmid, an extra-chromosomal element, a mini-chromosome, or an artificial chromosome.
- the vector can contain any means for assuring self-replication.
- the vector can be one which, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated.
- Such a vector may comprise specific sequences that allow recombination into a particular, desired site of the host chromosome.
- a vector system can comprise a single vector or plasmid, two or more vectors or plasmids, which together contain the total DNA to be introduced into the genome of the host cell, or a transposon.
- the choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced. I n the present case, the vector is preferably one which is operably functional in a bacterial cell, such as a cyanobacterial cell.
- the vector can include a reporter gene, such as a green fluorescent protein (GFP), which can be either fused in frame to one or more of the encoded polypeptides, or expressed separately.
- the vector can also include a selection marker such as an antibiotic resistance gene that can be used for selection of suitable transformants.
- wild-type and “naturally occurring” are used interchangeably to refer to a gene or gene product that has the characteristics of that gene or gene product when isolated from a naturally occurring source.
- a wild type gene or gene product e.g., a polypeptide
- a wild type gene or gene product is that which is most freq uently observed in a population and is thus arbitrarily designated as the "normal” or "wild-type” form of the gene.
- modified photosynthetic microorganisms e.g., Cyanobacteria
- the modified photosynthetic microorganism comprises one or more polynucleotides encoding one or more enzymes associated with lipid and/or neutral lipid biosynthesis (such as, for example triglyceride, fatty acid ester, wax ester, and/or carotenoid biosynthesis) and at least one exogenous polynucleotide encoding at least one protein associated with lipid packaging.
- enzymes associated with lipid and/or neutral lipid biosynthesis such as, for example triglyceride, fatty acid ester, wax ester, and/or carotenoid biosynthesis
- exogenous polynucleotide encoding at least one protein associated with lipid packaging.
- the one or more polynucleotides encoding one or more enzymes associated with triglyceride biosynthesis include but are not limited to a DGAT and/or a phosphatidate phosphatase.
- the at least one protein associated with lipid packaging includes polyhydroxyalkanoatel P (Phal P), triacylglycerol accum ulation deficient A (Tad A), Caleosin 1 1 (Clol 1), or proteins evolutionarily related to these proteins such as plastoglobulin 1 (pgl 1), plastoglobulin 2 (pgl 2), plastoglobulin-like (pgl-like), or a combination thereof.
- the present invention contemplates the use of naturally-occurring and non-naturally-occurring variants of these lipid packaging proteins and these DGAT and phosphatidate phosphatase enzymes, as well as variants of their encoding polynucleotides.
- the DGAT encoding polynucleotide sequence is derived from Acinetobacter baylii (ADP1-DGAT), and the phosphatidate phosphatase encoding polynucleotide sequence is from Saccharomyces cerevisiae (yPahl).
- enzyme encoding sequences may be derived from any organism having a suitable DGAT or phosphatidate phosphatase enzyme, and may also include any man-made variants thereof, such as any optimized coding sequences (i.e., codon-optimized polynucleotides) or optimized polypeptide sequences.
- polypeptide and polynucleotide sequences are described infra and include the lipid packaging protein derived from pgl 1 (Synechocystis) (see SEQ ID NOs:240 and 239 for polypeptide and polynucleotide sequence, respectively), pgl 2 (Synechocystis) (see SEQ I D NOs:242 and 241 for polypeptide and polynucleotide sequence, respectively), pgl-like (Microcystis aeruginosa) (see SEQ I D NOs:244 and 243 for polypeptide and polynucleotide sequence, respectively), pgl-like (Cyanothece sp) (see SEQ ID NOs:246 and 245 for polypeptide and polynucleotide sequence, respectively), pgl-like (Calothrix sp) (see SEQ I D NOs:248 and 247 for polypeptide and polynucle
- Eutropha see SEQ I D NOs:234 and 233 for polypeptide and polynucleotide sequence, respectively
- Tad A see SEQ I D NOs:236 and 235 for polypeptide and polynucleotide sequence, respectively
- Clol 1 (Arabidopsis) (see SEQ I D NOs:238 and 237 for polypeptide and polynucleotide sequence, respectively).
- These encoding sequences may be derived from any organism having suitable lipid packaging proteins, and may also include any man-made variants thereof, such as any optimized coding sequences (i.e., codon-optimized polynucleotides) or optimized polypeptide sequences. Exemplary polypeptide and polynucleotide sequences are described infra.
- a modified photosynthetic microorganism comprises modulated and/or introduced plastoglobulin 1 (pgl 1), plastoglobulin 2 (pgl 2) and plastoglobulin-like (pgl-like).
- the modified photosynthetic microorganism comprises overexpressed pgl 1, pgl 2, and pgl-like.
- the pgl 1 and pgl 2 in the cyanobacterium Synechocystis sp. PCC 6803 (slll568 and slrl024) encode polypeptides similar in sequence to plant plastoglobulins.
- a modified photosynthetic microorganism comprises modulated and/or introduced polyhydroxyalkanoatel P (Phal P).
- Pha 1 has been shown to be associated with TAG bodies if it is ectopically expressed in Acinetobacter.
- Pha 1 may be associated with a diversity of intracellular carbon storage bodies.
- Phal represents the major phasin on the surface of PHA inclusions in Ra.
- eutropha H16 plays an important role in the formation and structure of these inclusions, because its presence or absence affects the number and size of the inclusions, and the amount of PHB in the cells.
- PHA inclusions are formed from soluble PHA synthases that polymerize the 3-hydroxybutyrate (3HB) of 3HB-CoA to PHB, with concomitant release of CoA. Since PHA synthases remain covalently linked to the growing PHB chain, an amphiphilic complex composed of the hydrophilic synthase and the elongating polymer chain is formed. These complexes are thought to aggregate to form micelle-like structures, which enlarge to PHA granules due to subsequent extension of the PHA chains. During granule growth, phasins and phospholipids are thought to migrate into the exposed hydrophobic surface of the polymer core, thereby generating an interphase between the hydrophobic core and the cytoplasm. See, for example, Hanisch (Microbiology, 152: 3271-3280, 2006) and the references cited therein.
- a modified photosynthetic microorganism comprises modulated and/or introduced triacylglycerol accumulation deficient A (Tad A), which refers to a gene from Rhodococcus that encodes a putative heparin-binding 63 hemagglutinin homolog that is important for TAG accumulation.
- Tad A triacylglycerol accumulation deficient A
- a modified photosynthetic microorganism comprises modulated and/or introduced Caleosin and/or Oleosin protein from A. thaliana (Clol).
- Caleosin and oleosin are embedded in the half unit membranes of oil bodies in many plant seeds. Functionally, oleosin is believed to coat the oil body surface and thereby prevent coalescence of adjacent oil bodies. Both proteins have similar proposed structures including N-and C-terminal cytosolic ends, and long hydrophobic internal segments with centrally located proline knot motifs. It has been hypothesized that Caleosin's function in oil bodies involves oil body biogenesis from the ER.
- Caleosins are a multigene family in Arabidopsis, with a single isoform, Clol, predominantly expressed in seeds.
- Clol a lipid body associated transmembrane protein has been implicated in lipid body generation, size maintenance, and degradation of stored lipid; algal genomes, including C. reinhardtii genomes, lack genes encoding oleosin homologues but encode caleosin homologues. See, for example, Wang (EC, 8(12): 1856-1868, 2009) and the references cited therein.
- the modified photosynthetic microorganisms may comprise two or more polynucleotides that encode DGAT or a variant or fragment thereof.
- the two or more polynucletoides are identical or express the same DGAT.
- these two or more polynucleotides may be different or may encode two different DGAT polypeptides.
- one of the polynucleotides may encode ADGATd, while another polynucleotide may encode ScoDGAT.
- the following DGATs are coexpressed in modified photosynthetic microorganisms, e.g., Cyanobacteria, using one of the following double DGAT strains: ADGATd(NSl)::ADGATd(NS2); ADGATn(NSl): :ADGATn(NS2); ADGATn(NSl)::SDGAT(NS2); SDGAT(NSl)::ADGATn(NS2); SDGAT(NS1)::SDGAT(NS2).
- pAM2291 EcoRI follows ATG and is part of the open reading frame (ORF).
- pAM 1579 EcoRI follows ATG and is part of the ORF.
- a DGAT having EcoRI nucleotides following ATG may be cloned in either pAM2291 or pAM 1579; such a DGAT is referred to as ADGATd.
- Other embodiments utilize the vector, pAM2314FTrc3, which is an NS1 vector with Nde/8glll sites, or the vector, pAM1579FTrc3, which is the NS2 vector with Nde/8glll sites.
- a DGAT without EcoRI nucleotides may be cloned into either of these last two vectors. Such a DGAT is referred to as ADGATn.
- modified photosynthetic microorganisms expressing different DGATs express TAGs having different fatty acid compositions. Accordingly, certain embodiments of the present invention contemplate expressing two or more different DGATs, in order to produce DAGs having varied fatty acid compositions. Related embodiments contemplate expressing two or more different phosphatidate phosphatase and/or two or more different acetyl-CoA carboxylases.
- Embodiments of the present invention also relate to modified photosynthetic microorganisms, e.g., Cyanobacteria, and methods of use thereof, wherein the modified Cyanobacteria comprise one or more polynucleotides encoding enzymes associated with fatty acid biosynthesis and at least one polynucleotide encoding proteins associated with lipid packaging, wherein said polynucleotides are exogenous to the Cyanobacterium's native genome.
- the enzymes associated with fatty acid synthesis comprise an acetyl-CoA carboxylase (ACCase) activity, including naturally-occurring and non-naturally-occurring functional variants of such enzymes and their encoding polynucleotides.
- ACCase acetyl-CoA carboxylase
- the polynucleotide sequence encoding the ACCase enzyme is derived from Saccharomyces cerevisiae (yAccl). As above, however, these ACCase enzyme encoding sequences may be derived from any organism having a suitable ACCase enzyme, and may also include any man-made variants thereof, such as any optimized coding seq uences (i.e., codon-optimized polynucleotides) or optimized polypeptide sequences.
- the modified photosynthetic microorganisms may comprise at least one DGAT polypeptide fused to at least one heterologous intracellular localization domain, such as a bacterial membrane-targeting domain.
- Such fusion proteins can be partially or fully isolated from other cellular components, or expressed, for example, in cell-free systems or a host cell, such as a modified photosynthetic microorganism.
- the modified photosynthetic microorganisms may comprise at least one DGAT polypeptide fused to at least one lipid packaging protein.
- the modified photosynthetic microorganisms described herein can optionally comprise any combination of one or more overexpressed or introduced lipid biosynthesis proteins and/or one or more overexpressed or introduced proteins associated with glycogen breakdown.
- lipid biosynthesis proteins include acyl carrier proteins (ACP), acyl ACP synthases (Aas), acyl-ACP reductases, alcohol dehydrogenases, aldehyde dehydrogenases, aldehyde decarbonylases, thioesterases (TES), acetyl coenzyme A carboxylases (ACCase), phosphatidic acid phosphatases (PAP; or phosphatidate phosphatases), triacylglycerol (TAG) hydrolases, fatty acyl-CoA synthetases, and lipases/phospholipases, fatty acyl reductase (FAR), as described herein.
- ACP acyl carrier proteins
- Aas acyl ACP synthases
- TES acyl-ACP reductases
- alcohol dehydrogenases aldehyde dehydrogenases
- aldehyde decarbonylases thioesterases
- TES
- Cyanobacteria having increased fatty acid production may be utilized to improve the overall production of triglycerides. Accordingly, certain embodiments relate to modified Cyanobacteria, and methods of use thereof, wherein the Cyanobacteria comprise one or more polynucleotides encoding enzymes associated with fatty acid synthesis and triglyceride synthesis and at least one polynucleotide encoding proteins associated with lipid packaging.
- the modified Cyanobacteria of the present invention comprise one or more polynucleotides encoding enzymes that comprise a DGAT activity and/or a phosphatidate phosphatase activity, as well as an exogenous enzyme comprising an ACCase activity, and at least one polynucleotide encoding proteins that comprises a lipid packaging activity.
- Embodiments of the present invention also include methods of producing neutral lipids such as triglycerides in a photosynthetic microorganism, e.g., a Cyanobacterium, comprising introducing one or more polynucleotides encoding one or more enzymes associated with triglyceride biosynthesis and at least one polynucleotide encoding at least one protein associated with lipid packaging activities into a photosynthetic microorganism, incubating the photosynthetic microorganism for a time and under conditions sufficient to allow neutral lipids such as triglyceride production, thereby producing neutral lipids such as triglycerides in the photosynthetic microorganism.
- a photosynthetic microorganism e.g., a Cyanobacterium
- Also contemplated are methods of producing a neutral lipid in a photosynthetic microorganism e.g., a Cyanobacterium, comprising culturing a photosynthetic microorganism comprising one or more polynucleotides encoding one or more enzymes associated with triglyceride biosynthesis and at least one polynucleotide encoding at least one protein associated with lipid packaging activities, for a time and under conditions sufficient to allow neutral lipid production.
- the one or more enzymes comprise a DGAT enzymatic activity and/or a phosphatidate phosphatase enzymatic activity.
- the one or more enzymes comprise both a DGAT enzymatic activity and a phosphotidate phosphatase enzymatic activity.
- the one or more enzymes comprise an acyl-CoA carboxylase (ACCase) enzymatic activity, as well as a DGAT enzymatic activity and/or a phosphatidate phosphatase enzymatic activity.
- the one or more enzymes comprise a DGAT enzymatic activity, a phosphotidate phosphatase enzymatic activity, and an acyl-CoA carboxylase (ACCase) enzymatic activity.
- one or more of the polynucleotides are exogenous to the photosynthetic microorganism's native genome.
- the at least one protein comprises a lipid packaging protein.
- the at least one protein comprises pgl 1, pgl 2, pgl-like, Phal P, Tad A, Clol 1.
- the at least one polynucleotide is exogenous to the photosynthetic microorganism's native genome.
- the present invention also relates to methods of producing an increased amount of fatty acid, e.g., a free fatty acid, in a photosynthetic microorganism, e.g., a Cyanobacterium, comprising introducing one or more polynucleotides encoding one or more enzymes associated with fatty acid biosynthesis and at least one polynucleotide encoding at least one protein associated with lipid packaging activities into a photosynthetic microorganism, culturing the Cyanobacterium for a time and under conditions sufficient to allow increased neutral lipid production, thereby producing an increased amount of neutral lipids in the Cyanobacterium.
- a photosynthetic microorganism e.g., a Cyanobacterium
- a photosynthetic microorganism e.g., a Cyanobacterium
- culturing a photosynthetic microorganism comprising one or more polynucleotides encoding one or more enzymes associated with fatty acid biosynthesis and at least one polynucleotide encoding at least one protein associated with lipid packaging activities.
- one or more of the polynucleotides are exogenous to the photosynthetic microorganism's native genome.
- the one or more enzymes comprise an ACCase enzymatic
- the modified Cyanobacteria of the present invention may be cultured according to routine techniques known in the art and exemplified herein, such as photobioreactor based culture techniques.
- the at least one polynucleotide is exogenous to the photosynthetic microorganism's native genome.
- the at least one polynucleotide comprises a lipid packaging activity.
- the modified Cyanobacteria of the present invention may be cultured according to routine techniques known in the art and exemplified herein, such as photobioreactor based culture techniques.
- the present invention also relates to methods of preparing a modified photosynthetic microorganism, e.g., a modified Cyanobacterium, such as by genetic modification, to increase production of naturally-occurring neutral lipids, e.g., free fatty acids, and/or to produce neutral lipids such as triglycerides.
- Photosynthetic microorganisms such as Cyanobacteria
- genetic manipulation in Cyanobacteria can be performed by the introduction of non-replicating vectors which contain native Cyanobacterial seq uences, exogenous genes of interest, and drug resistance genes.
- the vectors may be integrated into the Cyanobacterial genome through homologous recombination. I n this way, the exogenous gene of interest and the drug resistance gene are stably integrated into the Cyanobacterial genome. Such recombinants cells can then be isolated from non-recombinant cells by drug selection. Examples of suitable vectors are provided herein.
- Embodiments of the present invention include methods of producing neutral lipids in a Cyanobacterium, comprising introducing one or more polynucleotides encoding one or more enzymes associated with triglyceride biosynthesis a nd at least one polynucleotide encoding at least one protein associated with lipid packaging activities into a Cyanobacterium, such as an enzyme having a phosphatidate phosphatase activity and/or an enzyme having a diacylglycerol acyltransferase activity, and the protein having lipid packaging activity.
- genetically modified photosynthetic microorganisms may be prepared by (i) introducing one or more desired polynucleotides encoding one or more enzymes associated with triglyceride biosynthesis and lipid packaging activity into a photosynthetic microorganism, and (ii) selecting for, and/or isolating, photosynthetic microorganisms that comprise the one or more desired polynucleotides.
- selection and isolation may include the use of antibiotic resistant markers known in the art (e.g., kanamycin, spectinomycin, and streptomycin).
- genetically modified photosynthetic microorganisms may be prepared by (i) introducing one or more desired, exogenous polynucleotides encoding one or more enzymes associated with fatty acid biosynthesis and at least one polynucleotide encoding at least a protein associated with lipid packaging activities into a photosynthetic microorganism, e.g., a Cyanobacteria, and (ii) selecting for, and/or isolating, photosynthetic microorganisms that comprise one or more desired, exogenous polynucleotides.
- a photosynthetic microorganism e.g., a Cyanobacteria
- genetically modified photosynthetic microorganisms may be prepared by (i) introducing one or more desired polynucleotides encoding one or more enzymes associated with fatty acid biosynthesis and triglyceride synthesis and at least one polynucleotide encoding at least a protein associated with lipid packaging activities into a photosynthetic microorganism, and (ii) selecting for, and/or isolating, photosynthetic microorganisms that comprise the one or more desired polynucleotides.
- the one or more enzymes associated with triglyceride synthesis com prise a diacylglycerol acyltransferase (DGAT) enzymatic activity or a phosphatidate phosphatase enzymatic activity.
- DGAT diacylglycerol acyltransferase
- the one or more enzymes associated with triglyceride synthesis comprise both a DGAT enzymatic activity and a phosphatidate phosphatase enzymatic activity.
- the one or more enzymes associated with fatty acid biosynthesis comprise an acyl-CoA carboxylase (ACCase) enzymatic activity.
- the one or more enzymes associated with triglyceride synthesis comprise an acyl-CoA carboxylase (ACCase) enzymatic activity and either a DGAT enzymatic activity or a phosphatidate phosphatase enzymatic activity.
- the one or more enzymes associated with triglyceride synthesis comprise an acyl-CoA carboxylase (ACCase) enzymatic activity, a DGAT enzymatic activity, and a phosphatidate phosphatase enzymatic activity.
- Modified photosynthetic microorganisms of the present invention may include intracellular localization domain-DGAT "fusion proteins,” comprising at least one DGAT polypeptide fused to at least one heterologous intracellular localization domain, such as a bacterial membrane-targeting domain or at least one DGAT polypeptide fused to at least one lipid packaging protein.
- fusion proteins comprising at least one DGAT polypeptide fused to at least one heterologous intracellular localization domain, such as a bacterial membrane-targeting domain or at least one DGAT polypeptide fused to at least one lipid packaging protein.
- Embodiments of the present invention include DGAT fusion proteins, comprising at least one DGAT polypeptide fused to at least one heterologous intracellular localization domain, such as a bacterial membrane-targeting domain.
- DGAT fusion proteins can be partially or fully isolated from other cellular components, or expressed, for example, in cell-free systems or a host cell, such as a modified photosynthetic microorganism.
- the modified photosynthetic microorganisms described herein can optionally comprise any combination of one or more overexpressed or introduced lipid biosynthesis proteins and/or one or more overexpressed or introduced proteins associated with glycogen breakdown.
- lipid biosynthesis proteins include acyl carrier proteins (ACP), acyl ACP synthases (Aas), acyl-ACP reductases, alcohol dehydrogenases, aldehyde dehydrogenases, aldehyde decarbonylases, thioesterases (TES), acetyl coenzyme A carboxylases (ACCase), phosphatidic acid phosphatases (PAP; or phosphatidate phosphatases), triacylglycerol (TAG) hydrolases, fatty acyl-CoA synthetases, and lipases/phospholipases, fatty acyl reductases (FAR), as described herein.
- ACP acyl carrier proteins
- Aas acyl ACP synthases
- TES acyl-ACP reductases
- alcohol dehydrogenases aldehyde dehydrogenases
- aldehyde decarbonylases thioesterases
- TES
- photosynthetic microorganisms may optionally comprise reduced, eliminated, or non-functional expression (e.g., expression of a deletion mutant with reduced or no functional activity) of one or more endogenous lipid biosynthesis proteins.
- modified photosynthetic microorganisms such as Synechococcus may optionally comprise reduced, eliminated, or non-functional expression of one or more aldehyde decarbonylases (e.g., orfl593), aldehyde dehydrogenases (e.g., orf0489), or both.
- a modified photosynthetic microorganism may optionally comprise reduced, eliminated, or non-functional expression of an Aas polypeptide.
- modified photosynthetic microorganisms may optionally comprise reduced, eliminated, or non-functional expression of one or more proteins associated with glycogen biosynthesis, either alone or in combination with overexpressed lipid biosynthesis proteins and/or overexpressed glycogen breakdown proteins, or in combination with any other polypeptide-related modification described herein.
- modified photosynthetic microorganisms of the present invention may comprise any combination of one or more of the additional modifications noted herein, typically as long as they express at least one intracellular localization domain-DGAT fusion protein. It is further understood that the compositions and methods of the present invention may be practiced using biologically active variants and/or fragments of any of the polypeptides described herein.
- embodiments of the present invention include intracellular localization domain-DGAT "fusion proteins,” comprising at least one DGAT polypeptide fused to at least one heterologous intracellular localization domain, such as a bacterial membrane-targeting domain.
- Fusion proteins are defined elsewhere herein and well known in the art, as are methods of making fusion proteins. Fusion proteins may be prepared using standard techniques. For example, DNA sequences encoding the polypeptide components of a desired fusion may be assembled separately, and ligated into an appropriate expression vector. The 3' end of the DNA sequence encoding one polypeptide component can be ligated, with or without a peptide linker (described below), to the 5' end of a DNA sequence encoding the second (or third, fourth, etc.) polypeptide component so that the reading frames of the sequences are in phase. This permits translation into a single fusion protein that retains the biological activity of both component polypeptides.
- the ligated DNA sequences may be operably linked to suitable transcriptional or translational regulatory elements.
- the regulatory elements responsible for expression of DNA are typically located 5' to the DNA sequence encoding the first polypeptide (e.g., the membrane-targeting domain).
- stop codons required to end translation and transcription termination signals are typically present 3' to the DNA seq uence encoding the second (or third, fourth, etc.) polypeptide.
- the intracellular localization or targeting domain can be fused to the N-terminus of the DGAT polypeptide, the C-terminus of the DGAT polypeptide, internally, or any combination thereof.
- the intracellular localization or targeting domain can be fused to the DGAT polypeptide within the N-terminal region (e.g., within about the first 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100 or so amino acids), at an internal region (between the N-terminal and C-terminal regions), and/or within the C-terminal region (e.g., within about the last 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100 or so amino acids).
- the intracellular localization or targeting domain is fused to the N-terminus of the DGAT polypeptide, or near the N-terminus, for example, within about the 1-100 amino acids.
- the intracellular localization or targeting domain is fused to the second amino acid of a DGAT polypeptide, resulting in the removal of the first residue (the methionine residue of the AUG codon).
- a DGAT fusion protein is "targeted to" or “selectively localizes" to one or more defined intracellular region(s) of a photosynthetic microorganism, where at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, or nearly 100% of the DGAT fusion protein can be found associated with the defined intracellular region(s), relative to a cytoplasmic or a soluble fraction of the microorganism.
- the intracellular region is one or more of the plasma membrane, a thylakoid, a vesicle, a lipid body, a glycogen granule, a polyhydroxybutyrate (PHB) body, a carboxysome, a cyanophycin granule, and/or an intracellular membrane, such as an intracellular membrane associated with a thylakoid, vesicle, lipid body, glycogen granule, PHB body, carboxysome, and/or cyanophycin granule.
- the enzymatic domain(s) of the DGAT fusion protein selectively localizes to the cytoplasmic side of the intracellular region(s) or associated membranes.
- a DGAT fusion protein is "targeted to" or “selectively localizes" to one or more membrane(s) of a photosynthetic microorganism, where at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, or nearly 100% of the fusion protein can be found associated with at least one membrane (e.g., a membrane fraction) upon expression in a given photosynthetic microorganism (e.g., Cyanobacteria), relative to other cellular spaces, such as a cytoplasmic or soluble fraction of the cell.
- a membrane fraction e.g., a membrane fraction
- membranes include the plasma membrane and any intracellular membranes, such as intracellular membranes associated with a thylakoid, vesicle, lipid body, glycogen granule, PHB body, carboxysome, and/or cyanophycin granule.
- the enzymatic domain(s) of the DGAT fusion protein selectively localize to the cytoplasmic side of the membrane.
- a DGAT fusion protein is "targeted to" or “selectively localizes” to the plasma membrane of a photosynthetic microorganism, where at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, or nearly 100% of the fusion protein can be found at the plasma membrane (or associated with the plasma membrane) upon expression in a given photosynthetic microorganism (e.g.,.
- Cyanobacteria relative to other cellular spaces, such as the cytoplasm, vesicles, lipid bodies, thylakoids, glycogen granules, PHB bodies, carboxysomes, cyanophycin granules, other intracellular membranes (e.g., thylakoid membranes, lipid body membranes, vesicle membranes), or any combination thereof.
- Thylakoids consist of a thylakoid membrane surrounding a thylakoid lumen, and are the site of photosynthesis.
- the enzymatic domain(s) of the DGAT fusion protein selectively localize to the cytoplasmic side of the plasma membrane.
- the fusion to a heterologous intracellular localization domain limits the potential for DGAT-mediated photosystem disruption, generation of reactive oxygen species, and loss of cell viability, and thereby improves the cell growth phenotype of DGAT-expressing and lipid-producing photosynthetic microorganisms.
- the intracellular localization domain sequences of the DGAT fusion proteins described herein can be obtained from any one or more signal or other sequences that selectively localize a given protein to a defined intracellular region (for instance, relative to dispersal throughout the cytoplasm), such as the plasma membrane, a thylakoid, a vesicle, a lipid body, a glycogen granule, a PHB body, a carboxysome, a cyanophycin granule, or an intracellular membrane.
- a defined intracellular region for instance, relative to dispersal throughout the cytoplasm
- Particular examples thus include membrane-, thylakoid-, vesicle-, lipid body-, glycogen granule-, polyhydroxybutyrate (PHB) body-, carboxysome-, and cyanophycin granule-targeting domains, including domains that target DGAT to the membranes associated with these intracellular regions.
- the intracellular localization domain selectively localizes the active domain(s) of DGAT to the cytoplasmic side of the intracellular region, so that DGAT can interact with lipid-producing substrates in the cytoplasm.
- the intracellular localization domain can be any length that is sufficient to selectively localize the DGAT fusion protein(s) to an intracellular region of a membrane, such as the plasma membrane, and/or alter its relation to other cell membranes or substrates, and allow the enzymatic portions of the DGAT polypeptide to interact with lipid-producing substrates in the cytoplasm.
- the intracellular localization domain can be anywhere from about 10-1000 amino acids in length, including about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890,
- the intracellular localization domain is a membrane-targeting or plasma membrane (PM)-targeting domain.
- membrane-targeting sequences can be obtained or derived from any combination of N-terminal leader sequence(s), transmembrane domain sequence(s), and/or integral membrane seq uence(s) of a bacterial membrane protein, such as a bacterial plasma membrane protein.
- bacterial plasma membrane proteins in their endogenous state selectively localize to the plasma membrane, and are characterized by having at least one C-terminal region that is localized to the cytoplasmic side of a bacterial plasma membrane, and/or the periplasmic side of the outer membrane (for plasma membrane proteins derived from gram-negative bacteria).
- a bacterial plasma membrane protein "selectively localizes" to the plasma membrane where at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, or nearly 100% of the protein can be found at the plasma membrane upon (preferably endogenous) expression in the bacteria from which it is derived (e.g., gram-positive bacteria, gram-negative bacteria, photosynthetic bacteria, Cyanobacteria), relative to other cellular spaces, such as the cytoplasm, the cell wall, other cellular Organelles' or membranes, such as thylakoid membranes for certain photosynthetic bacteria, or any combination thereof.
- endogenous expression in the bacteria from which it is derived e.g., gram-positive bacteria, gram-negative bacteria, photosynthetic bacteria, Cyanobacteria
- other cellular spaces such as the cytoplasm, the cell wall, other cellular Organelles' or membranes, such as thylakoid membranes for certain photosynthetic bacteria, or any combination thereof
- the membrane-targeting or PM -targeting domain may com prise an amino acid seq uence of an N-terminal leader sequence, an amino acid seq uence of one or more transmembrane domains (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more transmembrane domains), an amino acid seq uence of one or more integral membrane domains, or any combination thereof.
- the sequences of the N-terminal leader, the transmembrane domain(s), and/or the integral membra ne domain(s) can be from the same or different bacterial plasma membrane protein(s).
- Membrane-targeting or PM-targeting domain seq uences can be obtained from (or derived from) the signal sequences, transmembrane domains, or integral membrane domains of any variety of bacterial membrane proteins.
- the bacterial membrane protein can be an integral membrane protein (I M P), such as a transmembrane protein (TP).
- I M P integral membrane protein
- TP transmembrane protein
- the membrane-targeting domain is obtained from a single-pass transmembrane protein, having only one domain that spans the lipid bilayer of the plasma membrane, or a multi-pass transmembrane protein, having about 2, 3, 4, 5, 6, 7, 8, 9, 10 or more domains that span the lipid bilayer of the plasma membrane.
- the membrane-targeting domain can comprise any one or more of the multiple transmembrane domains.
- the membrane-targeting domain or transmembrane domain can comprise an alpha-helical transmembrane structure, or a beta-barrel transmembrane structure, the latter typically deriving from gram-negative outer membrane proteins.
- the membrane-targeting or PM-targeting domain does not span the entire lipid bilayer, but inserts into or attaches to the cytoplasmic side of the membrane, such as the plasma membrane.
- Examples include membrane-targeting domains that interact with the membrane by an amphipathic helix (e.g., parallel to the membrane plane), membrane-targeting domains that interact with the membrane by a hydrophobic loop, and membrane-targeting domains that interact with the membrane by electrostatic or ionic interactions, for example, through calcium ions.
- the membrane-targeting domain sequence is obtained from a membrane protein or plasma membrane protein of one or more gram-negative bacteria, gram-positive bacteria, or other bacteria, such as Cyanobacteria. Exemplary bacteria are described elsewhere herein and known in the art.
- the membrane-targeting or PM-targeting domain sequence is obtained from a membrane protein or plasma membrane protein of a photosynthetic bacteria, such as a Cyanobacteria from the genera Aphanocapsa, Aphanothece, Chamaesiphon, Chroococcus, Chroogloeocystis, Coelosphaerium, Crocosphaera, Cyanobacterium, Cyanobium, Cyanodictyon, Cyanosarcina, Cyanothece, Dactylococcopsis, Gloecapsa, Gloeothece, Merismopedia, Microcystis, Radiocystis, Rhabdoderma, Snowella, Synychococcus, Synechocystis, Thermosenechococcus, and Woronichinia; Nostacales Cyanobacteria from the genera Anabaena, Anabaenopsis, Aphanizomenon, Aulosira, Calothrix, Coleodesmium
- the Cyanobacterium is from the genus Synechococcus, including, but not limited to Synechococcus bigranulatus, Synechococcus elongatus, Synechococcus leopoliensis, Synechococcus lividus, Synechococcus nidulans, and Synechococcus rubescens.
- the membrane-targeting domain sequence is derived from a plasma membrane protein from 5. elongatus sp. strain PCC7942 or Synechococcus sp. PCC 7002 (originally known as Agmenellum quadruplicatum).
- the membrane protein is a plasma membrane receptor protein, such as a chemoreceptor or chemotaxis protein.
- a plasma membrane receptor protein such as a chemoreceptor or chemotaxis protein.
- Particular examples include integral membrane chemoreceptors, e.g., transmembrane chemoreceptors.
- Examples of chemoreceptors or chemotaxis proteins include methyl-accepting chemotaxis proteins and amino acid chemotaxis receptors, such as serine chemotaxis receptors (e.g., Tsr receptor from Escherichia coli) and aspartate chemotaxis receptors.
- the membrane-targeting domain can thus be obtained from the signal sequence and/or transmembrane domains of any one or more of such bacterial plasma membrane receptors.
- the membrane-targeting domain is obtained from (or derived from) a methyl-accepting chemotaxis protein (MCP).
- MCPs can be classified by topology type (see Zhulin, Adv Microb Physiol. 45:157-198, 2001) and signaling domain class (see Alexander and Zhulin, PNAS USA. 104:2885-2890, 2007).
- Topology type I MCPs have large periplasmic ligand-binding domains and an elongated cytoplasmic region consisting of a HAMP domain (i.e., histidine kinases, adenylyl cyclases, methyl-binding proteins, and phosphatases) followed by a signaling domain, which in turn is composed of "methylation,” “flexible bundle,” and “signaling” sub-domains (see Alexander and Zhulin, supra; and Hazelbauer et al., Trends Biochem Sci. 33:9-19, 2008). MCPs cluster together with other chemotaxis proteins in large arrays at the cell pole.
- HAMP domain i.e., histidine kinases, adenylyl cyclases, methyl-binding proteins, and phosphatases
- MCP arrays from variety of bacteria have been well-characterized, including, for example, E. coli, C. crescentus, Thermotoga maritima, Magnetospirillum magneticum, Rhodobacter sphaeroides, Treponema monia, Listeria monocytogenes, Helicobacter hepaticus, Acetonema longum, Borrelia burgdorferi, Halothiobacillus neapolitanus, and Campylobacter jejuni (see Briegel et al., PNAS USA. 106:17181-17186, 2009).
- the membrane-targeting domain can thus be derived from the signal sequence and/or transmembrane domains of an MCP from any one or more of these bacteria, or an MCP from any other bacteria described herein or known in the art.
- the MCP is encoded by PCC7942-0858 or PCC7942-1015 from 5. elongatus.
- the polypeptide and polynucleotide sequence of the 5. elongatus PCC7942-0858 MCP are set forth in SEQ ID NOS:199 and 200, respectively, and the polypeptide and polynucleotide sequence of the 5. elongatus PCC7942-1015 MCP are set forth in SEQ ID NOS:201 and 202, respectively.
- the membrane-targeting domain comprises or consists essentially of the N-terminal leader sequence, the first (N-terminal) transmembrane domain, and/or the second transmembrane domain of PCC7942-0858, singly or in combination together.
- the bacterial membrane-targeting domain comprises or consists essentially of about the N-terminal 43-53 amino acids of the MCP encoded by PCC7942-0858, for example, about residues 1-43, 4-44, 1-45, 1-46, 1-47, 1-48, 1-49, 1-50, 1-51, 1-52, 1-53, 1-54, 1-55, 1-56, 1-57, 1-58, 1-59, 1-60 of SEQ ID NO:199.
- the bacterial membrane-targeting domain comprises or consists essentially of the N-terminal signal sequence of the MCP encoded by PCC7942-0858, for example, about residues 1-43 of SEQ ID NO:199.
- DGAT Polypeptides include any protein, polypeptide or peptide, obtainable from any cell source, which demonstrates the ability to catalyze the production of triacylglycerol from 1, 2-diacylglycerol and fatty acyl substrates under enzyme reactive conditions, in addition to any naturally-occurring (e.g., allelic variants, orthologs) or non-naturally occurring variants of a diacylglycerol acyltransferase sequence having such ability.
- naturally-occurring e.g., allelic variants, orthologs
- non-naturally occurring variants of a diacylglycerol acyltransferase sequence having such ability e.g., allelic variants, orthologs
- DGAT polypeptides of the present invention also include bi-functional proteins, such as those bi-functional proteins that exhibit a DGAT activity as well as a CoA:fatty alcohol acyltransferase activity, e.g., a wax ester synthesis (WES) activity, as often found in many TAG producing bacteria.
- bi-functional proteins such as those bi-functional proteins that exhibit a DGAT activity as well as a CoA:fatty alcohol acyltransferase activity, e.g., a wax ester synthesis (WES) activity, as often found in many TAG producing bacteria.
- WES wax ester synthesis
- DGATs are members of the O-acyltransferase superfamily, which esterify either sterols or diacyglycerols in an oleoyl-CoA-dependent manner.
- DGAT in particular esterifies diacylglycerols, which reaction represents the fina l enzymatic step in the production of triacylglycerols in plants, fungi and mammals.
- DGAT is responsible for transferring an acyl group from acyl-coenzyme-A to the sn-3 position of 1, 2-diacylglycerol (DAG) to form triacylglycerol (TAG).
- DAG 2-diacylglycerol
- TAG triacylglycerol
- DGAT is an integral membrane protein that has been generally described in Harwood (Biochem. Biophysics.
- DGAT is associated with the membrane and lipid body fractions. I n catalyzing TAGs, DGAT contributes mainly to the storage of carbon used as energy reserves. In animals, however, the role of DGAT is more complex. DGAT not only plays a role in lipoprotein assembly and the regulation of plasma triacylglycerol concentration (Bell, R. M ., et al. ), but participates as well in the regulation of diacylglycerol levels (see Brindley, Biochemistry of Lipids, Lipoproteins and Membranes, eds. Vance, D. E. & Vance, J. E. (Elsevier, Amsterdam), 171-203; and Nishizuka, Science 258:607-614, 1992, each of which are incorporated by reference).
- DGAT1, DGAT2, and PDAT independent DGAT gene families
- Yeast contain all three of DGAT1, DGAT2, and PDAT, but the expression levels of these gene families varies during different phases of the life cycle (Dahlqvst, A., et al. Proc. Natl. Acad. Sci. USA 97:6487-6492, 2000, incorporated by reference).
- WS/DGAT from Acinetobacter calcoaceticus ADP1 represents the first identified member of a widespread class of bacterial wax ester and TAG biosynthesis enzymes.
- This enzyme comprises a putative membrane-spanning region but shows no sequence homology to the DGAT1 and DGAT2 families from eukaryotes.
- WS/DGAT shows a broad capability of utilizing a large variety of fatty alcohols, and even thiols as acceptors of the acyl moieties of various acyl-CoA thioesters.
- WS/DGAT acyltransferase enzymes exhibit extraordinarily broad substrate specificity. Genes for homologous acyltransferases have been found in almost all bacteria capable of accumulating neutral lipids, including, for example, Acinetobacter baylii, A. baumanii, and M. avium, and M.
- tuberculosis CDC1551 in which about 15 functional homologues are present (see, e.g., Daniel et al., J. Bacteriol. 186:5017-5030, 2004; and Kalscheuer et al., J. Biol. Chem. 287:8075-8082, 2003).
- DGAT proteins may utilize a variety of acyl substrates in a host cell, including fatty acyl-CoA and fatty acyl-ACP molecules.
- the acyl substrates acted upon by DGAT enzymes may have varying carbon chain lengths and degrees of saturation, although DGAT may demonstrate preferential activity towards certain molecules.
- eukaryotic DGAT polypeptides typically contain a FYxDWWN (SEQ ID NO:15) heptapeptide retention motif, as well as a histidine (or tyrosine)-serine-phenylalanine (H/YSF) tripeptide motif, as described in Zhongmin et al. (Journal of Lipid Research, 42:1282-1291, 2001) (herein incorporated by reference).
- the highly conserved FYxDWWN (SEQ ID NO:15) is believed to be involved in fatty Acyl-CoA binding.
- the DGAT polypeptide portion of the fusion proteins described herein may thus comprise one or more these motifs.
- DGAT polypeptides utilized according to the fusion proteins described herein may be isolated from any organism, including eukaryotic and prokaryotic organisms.
- Eukaryotic organisms having a DGAT gene are well-known in the art, and include various animals (e.g., mammals, fruit flies, nematodes), plants, parasites, and fungi (e.g., yeast such as 5. cerevisiae and Schizosaccharomyces pombe).
- prokaryotic organisms include certain actinomycetes, a group of Gram-positive bacteria with high G+C ratio, such as those from the representative genera Actinomyces, Arthrobacter, Corynebacterium, Frankia, Micrococcus, Mocrimonospora, Mycobacterium, Nocardia, Propionibacterium, Rhodococcus and Streptomyces.
- actinomycetes that have one or more genes encoding a DGAT activity include, for example, Mycobacterium tuberculosis, M. avium, M. smegmatis, Micromonospora echinospora, Rhodococcus opacus, R. ruber, and Streptomyces lividans.
- prokaryotic organisms that encode one or more enzymes having a DGAT activity include members of the genera Acinetobacter, such as A. calcoaceticus, A. baumanii, A. baylii, and members of the generua Alcanivorax.
- a DGAT polypeptide is from Acinetobacter baylii sp.
- ADP1 a gram-negative triglyceride forming prokaryote, which contains a well-characterized DGAT (AtfA).
- the DGAT polypeptide is an Acinetobacter DGAT (ADGAT), a
- the DGAT polypeptide comprises or consists of a polypeptide sequence set forth in any one of SEQ ID NOs:58, 59, 60, or 61, or a fragment or variant thereof.
- SEQ I D NO:58 is the seq uence of DGATn
- SEQ ID NO:59 is the seq uence of Streptomyces coelicolor DGAT (ScoDGAT or SDGAT)
- SEQ ID NO:60 is the sequence of Alcanivorax borkumensis DGAT (AboDGAT)
- SEQ ID NO:61 is the sequence of DGATd.
- the modified photosynthetic microorganisms of the present invention may express two or more intracellular localization domain-DGAT fusion proteins.
- the DGAT polypeptides may by the same or different.
- the following intracellular localization domain-DGAT fusions are co-expressed in modified photosynthetic microorganisms, e.g., Cyanobacteria, using one of the following double DGAT strains: ADGATd: : ScoDGAT; ADGATd(NSl): :ADGATd(NS2); ADG/ ⁇ 7n(NSl)::ADGATn(NS2);
- pAM2291 EcoRI follows ATG and is part of the open reading frame (ORF).
- pAM 1579 EcoRI follows ATG and is part of the ORF.
- a DGAT having EcoRI nucleotides following ATG may be cloned in either pAM2291 or pAM 1579; such a DGAT is referred to as ADGATd.
- em bodiments utilize the vector, pAM2314FTrc3, which is an NS1 vector with Nde/Bglll sites, or the vector, pAM 1579FTrc3, which is the NS2 vector with Nde/Bgll l sites.
- a DGAT without EcoRI nucleotides may be cloned into either of these last two vectors.
- Modified photosynthetic microorganisms expressing different DGATs express TAGs having different fatty acid compositions. Accordingly, certain embodiments contemplate expressing two or more different intracellular localization domain-DGAT fusions, in order to produce TAGs having varied fatty acid compositions.
- a peptide linker sequence may be employed to separate the DGAT polypeptide(s) and the heterologous intracellular localization domain(s) by a distance sufficient to ensure that each polypeptide folds into its desired secondary and tertiary structures.
- a peptide linker sequence can be incorporated into the fusion protein using standard techniques well known in the art.
- Certain peptide linker seq uences may be chosen based on the following exemplary factors: (1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a secondary structure that could interact with functional epitopes on the first and second polypeptides; (3) their physiological stability; and (4) the lack of hydrophobic or charged residues that might react with the polypeptide functional epitopes, or other features. See, e.g., George and Heringa, J Protein Eng. 15:871-879, 2002.
- the linker seq uence can be essentially any length, but is generally from about 1 to about 300 amino acids in length.
- Particular linkers can have an overall amino acid length of about 1-300 amino acids, 1-250, 1-200 amino acids, 1-150 amino acids, 1-100 amino acids, 1-90 amino acids, 1-80 amino acids, 1-70 amino acids, 1-60 amino acids, 1-50 amino acids, 1-40 amino acids, 1-30 amino acids, 1-20 amino acids, 1-10 amino acids, 1-5 amino acids, 1-4 amino acids, 1-3 amino acids, or about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 ,17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280
- Certain amino acid seq uences which may be usefully employed as linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; M urphy et al., PNAS USA. 83:8258-8262, 1986; U.S. Pat. No. 4,935,233 and U.S. Pat. No. 4,751,180.
- Particular peptide linker sequences contain Gly, Ser, and/or Asn residues.
- Other near neutral amino acids, such as Thr and Ala may also be employed in the peptide linker sequence, if desired.
- Certain exemplary linkers include Gly, Ser and/or Asn-containing linkers, as follows: [G] x , [Six, [N] x , [GS] X , [GGS] X , [GSS] X , [GSGS] X (SEQ I D NO:203), [GGSG] X (SEQ I D NO:204), [GGGS] X (SEQ I D NO:205), [GGGGS] X (SEQ I D NO:206), [GN] X , [GGN] X , [GNN] X , [GNGN] X (SEQ I D NO:207), [GGNG] X (SEQ I D NO:208), [GGGN] X (SEQ ID NO:209), [GGGGN] X (SEQ I D NO:210) linkers, where x is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50 or more. Other combinations of these and related amino acids will be
- any one or more of the peptide linkers are optional.
- linker seq uences may not required when the bacterial membrane protein (from which the membrane- or PM -targeting domain is derived) and the DGAT polypeptides have non-essential regions (e.g., N-terminal and/or C-terminal amino acids, or for the plasma membrane proteins, regions just downstream of the signal seq uences and/or transmembrane domains) that can be used to separate the functional domains and prevent steric interference.
- modified photosynthetic microorganisms of the present invention further comprise one or more exogenous (e.g., introduced) or overexpressed nucleic acids that encode a lipid biosynthesis protein, e.g., a polypeptide having an activity associated with triglyceride biosynthesis or fatty acid biosynthesis, including but not limited to any of those described herein.
- a modified photosynthetic microorganism may comprise reduced expression and/or activity of one or more selected lipid biosynthesis proteins. Certain of these proteins are described in greater detail below.
- the exogenous nucleic acid does not comprise a nucleic acid sequence that is native to the microorganism's genome.
- the exogenous nucleic acid comprises a nucleic acid sequence that is native to the microorganism's genome, but it has been introduced into the microorganism, e.g., in a vector or by molecular biology techniques, for example, to increase expression of the nucleic acid and/or its encoded polypeptide in the microorganism.
- the expression of a native or endogenous nucleic acid and its corresponding protein can be increased by introducing a heterologous promoter upstream of the native gene.
- lipid biosynthesis proteins can be involved in triglyceride biosynthesis, fatty acid synthesis, wax ester synthesis, or any combination thereof.
- Triglyceride Biosynthesis Triglycerides, or triacylglycerols (TAGs), consist primarily of glycerol esterified with three fatty acids, and yield more energy upon oxidation than either carbohydrates or proteins. Triglycerides provide an important mechanism of energy storage for most eukaryotic organisms. In mammals, TAGs are synthesized and stored in several cell types, including adipocytes and hepatocytes (Bell et al. Annu. Rev. Biochem. 49:459-487,1980) (incorporated by reference). I n plants, TAG production is mainly important for the generation of seed oils.
- TAGs triacylglycerols
- I n contrast to eukaryotes, the observation of triglyceride production in prokaryotes has been limited to certain actinomycetes, such as members of the genera Mycobacterium, Nocardia, Rhodococcus and Streptomyces, in addition to certain members of the genus Acinetobacter.
- actinomycetes such as members of the genera Mycobacterium, Nocardia, Rhodococcus and Streptomyces, in addition to certain members of the genus Acinetobacter.
- triglycerides may accumulate to nearly 80% of the dry cell weight, but accumulate to only about 15% of the dry cell weight in Acinetobacter.
- I n general, triglycerides are stored in spherical lipid bodies, with quantities and diameters depending on the respective species, growth stage, and cultivation conditions.
- cells of Rhodococcus opacus and Streptomyces lividans contain only few TAGs when cultivated in complex media with a high content of carbon and nitrogen; however, the lipid content and the number of TAG bodies increase drastically when the cells are cultivated in mineral salt medium with a low nitrogen-to-carbon ratio, yielding a maximum in the late stationary growth phase.
- cells can be almost completely filled with lipid bodies exhibiting diameters ranging from 50 to 400 nm.
- One example is R. opacus PD630, in which lipids can reach more than 70% of the total cellular dry weight.
- TAG formation typically starts with the docking of a diacylglycerol acyltransferase enzyme to the plasma mem brane, followed by formation of small lipid droplets (SLDs). These SLDs are only some nanometers in diameter and remain associated with the membrane-docked enzyme. In this phase of lipid accumulation, SLDs typically form an emulsive, oleogenous layer at the plasma membrane. During prolonged lipid synthesis, SLDs leave the membrane-associated acyltransferase and conglomerate to membrane-bound lipid prebodies. These lipid prebodies reach distinct sizes, e.g., about 200 nm in A. calcoaceticus and about 300 nm in R.
- SLDs small lipid droplets
- Free and membrane-bound lipid prebodies correspond to the lipid domains occurring in the cytoplasm and at the cell wall, as observed in M. smegmatis during fluorescence microscopy and also confirmed in R. opacus PD630 and A. calcoaceticus ADPl (see, e.g., Christensen et al., Mol. Microbiol. 31:1561-1572, 1999; and Waltermann et al., Mol. Microbiol. 55:750-763, 2005).
- SLDs coalesce with each other to form the homogenous lipid core found in mature lipid bodies, which often appear opaque in electron microscopy.
- compositions and structures of bacterial TAGs vary considerably depending on the microorganism and on the carbon source.
- unusual acyl moieties such as phenyldecanoic acid and 4,8,12 trimethyl tridecanoic acid, may also contribute to the structural diversity of bacterial TAGs (see, e.g., Alvarez et al., Appl Microbiol Biotechnol. 60:367-76, 2002).
- TAGs As with eukaryotes, the main function of TAGs in prokaryotes is to serve as a storage compound for energy and carbon. TAGs, however, may provide other functions in prokaryotes. For example, lipid bodies may act as a deposit for toxic or useless fatty acids formed during growth on recalcitrant carbon sources, which must be excluded from the plasma membrane and phospholipid (PL) biosynthesis. Furthermore, many TAG-accumulating bacteria are ubiquitous in soil, and in this habitat, water deficiency causing dehydration is a frequent environmental stress. Storage of evaporation-resistant lipids might be a strategy to maintain a basic water supply, since oxidation of the hydrocarbon chains of the lipids under conditions of dehydration would generate considerable amounts of water. Cyanobacteria such as Synechococcus, however, do not produce triglycerides, because these organisms lack the enzymes necessary for triglyceride biosynthesis.
- Triglycerides are synthesized from fatty acids and glycerol.
- TAG triglyceride
- sequential acylation of glycerol-3-phosphate via the "Kennedy Pathway" leads to the formation of phosphatidate.
- Phosphatidate is then dephosphorylated by the enzyme phosphatidate phosphatase to yield 1,2 diacylglycerol (DAG).
- DAG 1,2 diacylglycerol
- DAG 1,2 diacylglycerol
- DAG 1,2 diacylglycerol
- an enzyme having diacylglycerol acyltransferase (DGAT) activity catalyzes the acylation of DAG using acyl-CoA as a substrate.
- DGAT diacylglycerol acyltransferase
- DGAT enzymes combine acyl-CoA with 1,2 diacylglycerol molecule to form a TAG.
- Acyl-CoA-independent TAG synthesis may be mediated by a phospholipid:DAG acyltransferase found in yeast and plants, which uses phospholipids as acyl donors for DAG esterification.
- Another mechanism of TAG synthesis in animals and plants may be mediated by a DAG-DAG-transacylase, which uses DAG as both an acyl donor and acceptor, yielding TAG and monoacylglycerol.
- Modified photosynthetic microorganisms e.g., Cyanobacteria
- Modified photosynthetic microorganisms may comprise one or more exogenous polynucleotides encoding polypeptides comprising one or more of the polypeptides and enzymes described herein.
- embodiments of the present invention include genetically modified Cyanobacteria that comprise polynucleotides encoding one or more DGAT fusion proteins, optionally in combination with one or more enzymes having a fatty acyl-CoA synthetase activity.
- the one or more exogenous polynucleotides encode a DGAT fusion protein described herein and a fatty acyl-CoA synthetase, or a functional variant or fragment thereof.
- triglycerides are typically formed from fatty acids
- the level of fatty acid biosynthesis in a cell may limit the production of triglycerides. I ncreasing the level of fatty acid biosynthesis may, therefore, allow increased production of triglycerides.
- acetyl-CoA carboxylase catalyzes the commitment step to fatty acid biosynthesis.
- certain embodiments of the present invention include Cyanobacterium, and methods of use thereof, comprising polynucleotides that encode one or more enzymes having acetyl-CoA carboxylase activity to increase fatty acid biosynthesis and lipid production, in addition to one or more DGAT fusion proteins and optionally one or more enzymes having fatty acyl-CoA synthetase activity, to catalyze triglyceride production.
- DGAT fusion proteins optionally one or more enzymes having fatty acyl-CoA synthetase activity
- the negative charge is located at a carboxyl end group and is typically deprotonated at physiological pH values (pK ⁇ 2-3).
- physiological pH values pK ⁇ 2-3.
- the length of the fatty acid 'tail' determines its water solubility (or rather insolubility) and amphipathic characteristics.
- Fatty acids are components of phospholipids and sphingolipids, which form part of biological membranes, as well as triglycerides, which are primarily used as energy storage molecules inside cells.
- Fatty acids are formed from acetyl-CoA and malonyl-CoA precursors.
- Malonyl-CoA is a carboxylated form of acetyl-CoA, and contains a 3-carbon dicarboxylic acid, malonate, bound to Coenzyme A.
- Acetyl-CoA carboxylase catalyzes the 2-step reaction by which acetyl-CoA is carboxylated to form malonyl-CoA.
- ma lonate is formed from acetyl-CoA by the addition of C0 2 using the biotin cofactor of the enzyme acetyl-CoA carboxylase.
- Fatty acid synthase carries out the chain elongation steps of fatty acid biosynthesis.
- FAS is a large multienzyme complex. I n mammals, FAS contains two subunits, each containing multiple enzyme activities.
- individual proteins which associate into a large complex, catalyze the individual steps of the synthesis scheme.
- the acyl carrier protein is a smaller, independent protein.
- Fatty acid synthesis starts with acetyl-CoA, and the chain grows from the "tail end" so that carbon 1 and the alpha-carbon of the complete fatty acid are added last.
- the first reaction is the transfer of an acetyl group to a pantothenate group of acyl carrier protein (ACP), a region of the large mammalian fatty acid synthase (FAS) protein.
- ACP acyl carrier protein
- FAS large mammalian fatty acid synthase
- acetyl CoA is added to a cysteine -SH group of the condensing enzyme (CE) domain: acetyl CoA + CE-cys-SH -> acetyl-cys-CE + CoASH.
- this is a two step process, in which the group is first transferred to the ACP (acyl carrier peptide), and then to the cysteine -SH group of the condensing enzyme domain.
- malonyl CoA is added to the ACP sulfhydryl group: malonyl CoA + ACP-SH -> malonyl ACP + CoASH.
- This -SH group is pa rt of a phosphopantethenic acid prosthetic group of the ACP.
- the acetyl group is transferred to the malonyl group with the release of carbon dioxide: malonyl ACP + acetyl-cys-CE -> beta-ketobutyryl-ACP + C0 2 .
- the keto group is reduced to a hydroxyl group by the beta-ketoacyl reductase activity: beta-ketobutyryl-ACP + NADPH + H + -> beta-hydroxybutyryl-ACP + NAD + .
- beta-hydroxybutyryl-ACP is dehydrated to form a trans- monounsaturated fatty acyl group by the beta-hydroxyacyl dehydratase activity: beta-hydroxybutyryl-ACP -> 2-butenoyl-ACP + H 2 0.
- the double bond is reduced by NADPH, yielding a saturated fatty acyl group two carbons longer than the initial one (an acetyl group was converted to a butyryl group in this case): 2-butenoyl-ACP + NADPH + H + -> butyryl-ACP + NADP + .
- the butyryl group is then transferred from the ACP sulfhydryl group to the CE sulfhydryl: butyryl-ACP + CE-cys-SH -> ACP-SH + butyryl-cys-CE. This step is catalyzed by the same transferase activity utilized previously for the original acetyl group.
- the butyryl group is now ready to condense with a new malonyl group (third reaction above) to repeat the process.
- a thioesterase activity hydrolyses it, forming free palmitate: palmitoyl-ACP + H 2 0 -> palmitate + ACP-SH.
- Fatty acid molecules can undergo further modification, such as elongation and/or desaturation.
- Modified photosynthetic microorganisms may comprise one or more exogenous polynucleotides encoding any of the above polypeptides or enzymes involved in fatty acid synthesis.
- the enzyme is an acetyl-CoA carboxylase or a variant or functional fragment thereof.
- Wax esters are esters of a fatty acid and a long-chain alcohol. These neutral lipids are composed of aliphatic alcohols and acids, with both moieties usually long-chain (e.g., C 16 and C i8 ) or very-long-chain (C 20 and longer) carbon structures, though medium-chain-containing wax esters are included (e.g., C w , C 12 and C i4 ). Wax esters have diverse biological functions in bacteria, insects, mammals, and terrestrial plants and are also important substrates for a variety of industrial applications. Various types of wax ester are widely used in the manufacture of fine chemicals such as cosmetics, candles, printing inks, lubricants, coating stuffs, and others.
- acyl-CoA acyl coenzyme A
- acyl-CoA reductase aldehyde reductase
- wax ester biosynthesis involves elongation of saturated Ci 6 and Ci 8 fatty acyl-CoAs to very-long-chain fatty acid wax precursors between 24 and 34 carbons in length, and their subsequent modification by either the alkane-forming (decarbonylation) or the alcohol-forming (acyl reduction) pathway (see Li et al., Plant Physiology 148:97-107, 2008).
- Modified photosynthetic microorganisms e.g., Cyanobacteria, may therefore comprise one or more exogenous polynucleotides encoding any of the above polypeptides or enzymes involved in wax ester synthesis.
- Embodiments of the present invention optionally include one or more exogenous (e.g., recombinantly introduced) or overexpressed ACP proteins. These proteins play crucial roles in fatty acid synthesis. Fatty acid synthesis in bacteria, including Cyanobacteria, is carried out by highly conserved enzymes of the type II fatty acid synthase system (FAS I I; consisting of about 19 genes) in a sequential, regulated manner. Acyl carrier protein (ACP) plays a central role in this process by carrying all the intermediates as thioesters attached to the terminus of its 4'-phosphopantetheine prosthetic group (ACP-thioesters).
- exogenous e.g., recombinantly introduced
- ACP proteins play crucial roles in fatty acid synthesis. Fatty acid synthesis in bacteria, including Cyanobacteria, is carried out by highly conserved enzymes of the type II fatty acid synthase system (FAS I I; consisting of about 19 genes) in a sequential,
- Apo-ACP the product of acp gene
- PPT phosphopantetheinyl transferease
- AcpS acyl carrier protein synthase
- Sfp surfactin type
- Cyanobacteria posses an Sfp-like PPT, which is understood to act in both primary and secondary metabolism.
- Embodiments of the present invention therefore include overexpression of PPTs such as AcpS and/or Sfp-type PPTs in combination with overexpression of cognate ACP encoding genes, such as ACP.
- the ACP-thioesters are substrates for all of the enzymes of the FAS II system.
- the end product of fatty acid synthesis is a long acyl chain typically consisting of about 14-18 carbons attached to ACP by a thioester bond.
- At least three enzymes of the FAS II system in other bacteria can be subject to feedback inhibition by acyl-ACPs: 1) the ACCase complex - a heterotetramer of the AccABCD genes that catalyzes the production of malonyl-coA, the first step in the pathway; 2) the product of the FabH gene ( ⁇ -ketoacyl-ACP synthase III), which catalyzes the condensation of acetyl-CoA with malonyl-ACP; and 3) the product of the Fabl gene (enoyl-ACP reductase), which catalyzes the final elongation step in each round of elongation.
- Certain proteins such as acyl-ACP reductase are capable of increasing fatty acid production in photosynthetic bacteria such as Cyanobacteria, and it is believed that overexpression of ACP in combination with this protein and possibly other biosynthesis proteins will further increase fatty acid production in such strains.
- An ACP can be derived from a variety of eukaryotic organisms, microorganisms (e.g., bacteria, fungi), or plants.
- an ACP polynucleotide sequence and its corresponding polypeptide sequence are derived from Cyanobacteria such as Synechococcus.
- ACPs can be derived from plants such as spinach.
- SEQ ID NOS:5-12 provide the nucleotide and polypeptide sequences of exemplary bacterial ACPs from Synechococcus and Acinetobacter, and SEQ ID NOS:13-14 provide the same for an exemplary plant ACP from Spinacia oleracea (spinach).
- SEQ ID NOS:5 and 6 derive from Synechococcus elongatus PCC7942, and SEQ ID NOS:7-12 derive from Acinetobacter sp.
- ADP1 ADP1.
- prokaryotic organisms having an ACP include certain actinomycetes, a group of Gram-positive bacteria with high G+C ratio, such as those from the representative genera Actinomyces, Arthrobacter, Corynebacterium, Frankia, Micrococcus, Mocrimonospora, Mycobacterium, Nocardia, Propionibacterium, Rhodococcus and Streptomyces.
- actinomycetes that have one or more genes encoding an ACP activity include, for example, Mycobacterium tuberculosis, M. avium, M. smegmatis, Micromonospora echinospora, Rhodococcus opacus, R.
- prokaryotic organisms that encode one or more enzymes having an ACP activity include members of the genera Acinetobacter, such as A. calcoaceticus, A. baumanii, A. baylii, and members of the generua Alcanivorax.
- an ACP gene or enzyme is isolated from Acinetobacter baylii sp.
- ADP1 a gram-negative triglyceride forming prokaryote.
- Acyl ACP Synthases (Aas).
- Acyl-ACP synthetases (Aas) catalyze the ATP-dependent acylation of the thiol of acyl carrier protein (ACP) with fatty acids, including those fatty acids having chain lengths from about C4 to C18.
- Aas enzymes not only directly incorporate exogenous fatty acids from the culture medium into other lipids, but also play a role in the recycling of acyl chains from lipid membranes. Deletion of Aas in cyanobacteria can lead to secretion of free fatty acids into the culture medium. See, e.g., Kaczmarzyk and Fulda, Plant Physiology 152:1598-1610, 2010.
- Certain embodiments may overexpress one or more Aas polypeptides described herein and known in the art.
- overexpression of Aas in com bination with overexpression of ACP leads to increased TAG production in DGAT-expressing strains, for example, by boosting acyl-ACP levels.
- Overexpression of Aas in optional combination with overexpression of ACP may likewise increase wax ester formation, for example, when com bined with overexpression of one or more alcohol dehydrogenase(s) and wax ester synthase(s), such as a bi-functional DGAT.
- Certain embodiments therefore include modified photosynthetic microorganisms comprising overexpressed Aas polypeptide(s), optionally in combination with overexpressed ACP polypeptide(s), especially when combined with overexpression of alcohol dehydrogenase, acyl-ACP reductase (e.g., orfl594), and wax ester synthase (e.g., aDGAT).
- modified photosynthetic microorganisms comprising overexpressed Aas polypeptide(s), optionally in combination with overexpressed ACP polypeptide(s), especially when combined with overexpression of alcohol dehydrogenase, acyl-ACP reductase (e.g., orfl594), and wax ester synthase (e.g., aDGAT).
- bacterial Aas enzymes include those derived from E. coli, Acinetobacter, and Vibrio sp. such as V. harveyi (see, e.g., Shanklin, Protein Expression and Purification. 18:355-360, 2000; Jiang et al., Biochemistry. 45:10008-10019, 2006).
- SEQ I D NOS:43 and 44 respectively, provide the nucleotide and polypeptide sequences of an exemplary Aas from Synechococcus elongatus PCC 7942 (0918).
- the Aas is derived from the same organism as the overexpressed ACP, DGAT, and /or the TES, if any one of these polypeptides is employed in combination with an Aas. Accordingly, certain embodiments include Aas sequences from any of the organisms described herein for deriving a DGAT or TES, including, for example, various animals (e.g., mammals, fruit flies, nematodes), plants, parasites, and fungi (e.g., yeast such as 5. cerevisiae and Schizosaccharomyces pombe).
- prokaryotic organisms include certain actinomycetes, a group of Gram-positive bacteria with high G+C ratio, such as those from the representative genera Actinomyces, Arthrobacter, Corynebacterium, Frankia, Micrococcus, Mocrimonospora, Mycobacterium, Nocardia, Propionibacterium, Rhodococcus and Streptomyces.
- actinomycetes that have one or more genes encoding an Aas activity include, for example, Mycobacterium tuberculosis, M. avium, M. smegmatis, Micromonospora echinospora, Rhodococcus opacus, R. ruber, and Streptomyces lividans.
- prokaryotic organisms that encode one or more enzymes having an Aas activity include members of the genera Acinetobacter, such as ,4. calcoaceticus, A. baumanii, A. baylii, and members of the generua Alcanivorax.
- an Aas gene or enzyme is isolated from Acinetobacter baylii sp.
- ADP1 a gram-negative triglyceride forming prokaryote.
- an endogenous aldehyde dehydrogenase may be acting on the excess acyl-aldehydes generated by overexpressed orfl594 and converting them to free fatty acids.
- the normal role of such a dehydrogenase might involve removing or otherwise dealing with damaged lipids. I n this scenario, it is then likely that the Aas gene product recycles these free fatty acids by ligating them to ACP. Accordingly, reducing or eliminating expression of the Aas gene product might ultimately increase production of fatty acids, by reducing or preventing their transfer to ACP.
- certain aspects include mutations (e.g., genomic) such as point mutations or insertions that reduce or eliminate the enzymatic activity of one or more endogenous acyl-ACP synthetases (or synthases). Also included are full or partial deletions of an endogenous gene encoding an Aas protein.
- mutations e.g., genomic
- point mutations or insertions that reduce or eliminate the enzymatic activity of one or more endogenous acyl-ACP synthetases (or synthases).
- full or partial deletions of an endogenous gene encoding an Aas protein are also included.
- Acyl-ACP reductases are members of the reductase or short-chain dehydrogenase family, and are key enzymes of the type I I fatty acid synthesis (FAS) system.
- FOS fatty acid synthesis
- an "acyl-ACP reductase” or “acyl-ACP dehydrogenase” as used herein is capable of catalyzing the conversion (reduction) of acyl-ACP to an acyl aldehyde (see Schirmer et al., supra) and the concomitant oxidation of NAD(P)H to NADP + .
- the acyl-ACP reductase preferentially interacts with acyl-ACP, and does not interact significantly with acyl-CoA, i.e., it does not significantly catalyze the conversion of acyl-CoA to acyl aldehyde.
- Acyl-ACP reductases can be derived from a variety of plants and bacteria, included photosynthetic microorganisms such as Cyanobacteria.
- One exemplary acyl-ACP reductase is encoded by orfl594 of Synechococcus elongatus PCC7942 (see SEQ ID NOs:l and 2 for the polynucleotide and polypeptide seq uences, respectively).
- Another exemplary acyl-ACP reductase is encoded by orfsll0209 of Synechocystis sp. PCC6803 (SEQ I D NOs:3 and 4 for the polynucleotide and polypeptide sequences, respectively).
- Embodiments of the present invention optionally include one or more alcohol dehydrogenase polypeptides.
- alcohol dehydrogenases include those capable of using acyl or fatty aldehydes (e.g., one or more of nonyl-aldehyde, C 12 , C i4 , C 16 , Ci 8 , C 20 att Y aldehyde) as a substrate, and converting them into fatty alcohols.
- Specific examples include long-chain alcohol dehydrogenases, capable of using long-chain aldehydes (e.g., Ci 6 , Ci 8 , C 20 ) as substrates.
- the alcohol dehydrogenase is naturally-occurring or endogenous to the modified microorganism, and is sufficient to convert increased acyl aldehydes (produced by acyl-ACP reductase) into fatty alcohols, and thereby contribute to increased wax ester production and overall satisfactory growth characteristics.
- the alcohol dehydrogenase is derived from a microorganism that differs from the one being modified.
- expression or overexpression of an alcohol dehydrogenase may increase shunting of acyl aldehydes towards production of fatty alcohols, and away from production of other products such as alkanes, fatty acids, or triglycerides.
- alcohol dehydrogenases may contribute to production of wax esters. They may also reduce accum ulation of potentially toxic acyl aldehydes, and thereby improve growth characteristics of a modified microorganism.
- Non-limiting examples of alcohol dehydrogenases include those encoded by slrll92 of Synechocystis sp. PCC6803 (SEQ I D NOS:104-105) and ACIAD3612 of Acinetobacter bayiyi (SEQ I D NOS:106-107). Also included are homologs or paralogs thereof, functional equivalents thereof, and fragments or variants thereofs.
- Functional equivalents can include alcohol dehydrogenases with the ability to efficiently convert acyl aldehydes (e.g., C 6 , C 8 , Ci 0 , C 12 , C i4 ,Ci 6 , i 8 , C 2 o aldehydes) into fatty alcohols.
- Specific examples of functional equivalents include long-chain alcohol dehydrogenases, having the ability to utilize long-chain aldehydes (e.g., C 16 , Ci8, C 2 o) as substrates.
- the alcohol dehydrogenase has the amino acid sequence of SEQ ID NO:105 (encoded by the polynucleotide sequence of SEQ ID NO:104), or an active fragment or variant of this seq uence.
- the alcohol dehydrogenase has the amino acid seq uence of SEQ ID NO:107 (encoded by the polynucleotide sequence of SEQ ID NO:106), or an active fragment or variant of this sequence.
- Embodiments of the present invention optionally include one or more overexpressed or introduced aldehyde dehydrogenases.
- aldehyde dehydrogenases include enzymes capable of using acyl aldehydes (e.g., nonyl-aldehyde, C16 fatty aldehyde) as a substrate, and converting them into fatty acids.
- the aldehyde dehydrogenase is naturally-occurring or endogenous to the modified microorganism, and is sufficient to convert increased acyl aldehydes (produced by acyl-ACP reductase) into fatty acids, and thereby contribute to increased fatty acid production and overall satisfactory growth characteristics.
- the aldehyde dehydrogenase can be overexpressed, for example, by recombinantly introducing a polynucleotide that encodes the enzyme, increasing expression of an endogenous enzyme, or both.
- An aldehyde dehydrogenase can be overexpressed in a strain that already expresses a naturally-occurring or endogenous enzyme, to further increase fatty acid production of an acyl-ACP reductase over-expressing strain and/or improve its growth characteristics, relative, for example, to an acyl-ACP reductase-overexpressing strain that only expresses endogenous aldehyde dehydrogenase.
- aldehyde dehydrogenase can also be expressed or overexpressed in a strain that does not have a naturally occurring aldehyde dehydrogenase of that type, e.g., it does not naturally express an enzyme that is capable of efficiently converting acyl aldehydes such as nonyl-aldehyde into fatty acids.
- expression or overexpression of an aldehyde dehydrogenase may increase shunting of acyl aldehydes towards production of fatty acids, and away from production of other products such as alkanes. It may also reduce accumulation of potentially toxic acyl aldehydes, and thereby improve growth characteristics of a modified microorganism.
- aldehyde dehydrogenase is encoded by orf0489 of Synechococcus elongatus PCC7942. Also included are homologs or paralogs thereof, functional eq uivalents thereof, and fragments or variants thereof. Functional equivalents can include aldehyde dehydrogenases with the ability to efficiently convert acyl aldehydes (e.g., nonyl-aldehyde) into fatty acids.
- the aldehyde dehydrogenase has the amino acid sequence of SEQ ID NO:103 (encoded by the polynucleotide sequence of SEQ I D NO:102), or an active fragment or variant of this sequence.
- Particular embodiments include photosynthetic microorganisms having reduced expression and/or activity of one or more aldehyde dehydrogenases, for instance, in the production of wax esters.
- I ncluded are mutations (e.g., genomic) that reduce or eliminate the enzymatic activity of one or more endogenous aldehyde dehydrogenases, such as point mutations, insertions, or full or partial deletion m utations.
- Specific embodiments include a modified Synechococcus elongatus PCC7942 having a full or partial deletion of orf0489.
- Aldehyde Decarbonylases include photosynthetic microorganisms having reduced expression and/or activity of one or more aldehyde decarbonylases.
- an "aldehyde decarbonylase” is capable of catalyzing the conversion of an acyl aldehyde (or fatty aldehyde) to an alkane or alkene.
- I ncluded are members of the ferritin-like or ribonucleotide reductase-like family of nonheme diiron enzymes (see, e.g., Stubbe et ai, Trends Biochem Sci. 23 :438-43, 1998).
- a ldehyde decarbonylase encoded by PCC7942_orfl593 (from Synechococcus) or PCC6803_orfsll0208 (from Synechostis sp. PCC6803) utilizes acyl aldehyde as a substrate for alkane or alkene production
- reducing expression of this protein may further increase yields of free fatty acids by shunting acyl aldehydes (produced by acyl-ACP reductase) away from an alkane-producing pathway, and towards a fatty acid-producing pathway.
- PCC7942_orfl593 and PCC6803_orfsll0208 orthologs can be found, for exam ple, in N. punctiforme PCC73102, Thermosynechococcus elongatus BP-1, Synechococcus sp. Ja-3-3AB, P. marinus M IT9313, P. marinus NATL2A, and Synechococcus sp. RS 9117, the latter having at least two paralogs (RS 9117-1 and -2).
- Particular embodiments include mutations (e.g., genomic) that reduce or eliminate the enzymatic activity of one or more endogenous aldehyde decarbonylases, for instance, in the production of fatty acids or wax esters, optionally in combination with reduced expression of one or more endogenous aldehyde dehydrogenases. Also included are point mutations, insertions, and full or partial deletions of an endogenous gene encoding an aldehyde decarbonylase. Specific embodiments include a modified Synechococcus elongatus PCC7942 having a full or partial deletion of orfl593.
- Thioesterases include one or more exogenous or overexpressed thioesterase enzymes, optionally in combination with at least one of an introduced ACP enzyme, an introduced Aas enzyme, or both.
- an introduced ACP and/or Aas to increase the growth and/or fatty acid production of a free fatty acid producing TES strain, such as a TesA strain or a FatB strain (i.e., a strain having an introduced TesA or FatB).
- Thioesterases exhibit esterase activity (splitting of an ester into acid and alcohol, in the presence of water) specifically at a thiol group.
- Fatty acids are often attached to cofactor molecules, such as coenzyme A (CoA) and acyl carrier protein (ACP), by thioester linkages during the process of de novo fatty acid synthesis.
- Certain embodiments employ thioesterases having acyl-ACP thioesterase activity, acyl-CoA thioesterase activity, or both activities. Examples of thioesterases having both activities (i.e., acyl-ACP/acyl-CoA thioesterases) include TesA and related embodiments.
- a selected thioesterase has acyl-ACP thioesterase activity but not acyl-CoA thioesterase activity. Examples of thioesterases having only acyl-ACP thioesterase activity include the FatB thioesterases and related embodiments.
- Certain thioesterases have both thioesterase activity and lysophospholipase activity.
- Specific examples of thioesterases include TesA, TesB, and related embodiments.
- Certain embodiments may employ periplasmically-localized or cytoplasmically-localized enzymes that thioesterase activity, such as E. coli TesA or E. coli TesB.
- wild type TesA being localized to the periplasm, is normally used to hydrolyze thioester linkages of fatty acid-ACP (acyl-ACP) or fatty acid-CoA (acyl-CoA) compounds scavenged from the environment.
- PldC(*TesA) results in hydrolysis of acyl groups from endogenous acyl-ACP and acyl-CoA molecules.
- Cells expressing PldC(*TesA) must channel additional cellular carbon and energy to maintain production of acyl-ACP and acyl-coA molecules, which are required for membrane lipid synthesis.
- PldC(*TesA) expression results in a net increase in total cellular lipid content.
- PldC(*TesA) expressed alone in Synechococcus doubles the total lipid content from 10% of biomass to 20% of biomass, a result that can be further increased by combining *TesA or related molecules with an introduced ACP and/or an introduced Aas.
- certain embodiments employ an exogenous or overexpressed cytoplasmic TesA (such as *TesA) in combination with an exogenous or overexpressed ACP, an exogenous or overexpressed Aas, or both.
- a thioesterase is an acyl-ACP thioesterase and/or an acyl-CoA thioesterase.
- the TES is a TesA or TesB polypeptide from E. coli, or a cytoplasmic TesA variant (*TesA) variant having the sequence set forth in SEQ ID NO:121, or a fragment or variant thereof.
- Certain thioesterases have thioesterase activity only, i.e., they have little or no lysophospholipase activity. Examples of these thioesterases include enzymes of the FatB family.
- FatB encoded enzymes typically hydrolyze saturated C14-C18 ACPs, preferentially 16:0 ACP, but they can also hydrolyze 18:1 ACP.
- the production of medium chain (C8-C12) fatty acids in plants or seeds such as those of Cuphea spp. often results of FatB enzymes that have chain length specificities for medium chain fatty acyl-ACPs.
- These medium chain FatB thioesterases are present in many species with medium-chain fatty acids in their oil, including, for example, California bay laurel, coconut, and elm, among others. Hence, FatB sequences may be derived from these and other organisms.
- the TES is a FatB polypeptide, such as a C8, C12, C14, C16, or C18 FatB.
- FatB thioesterases include the Cuphea hookeriana C8/C10 FatB thioesterase, the Umbellularia californica C12 FatBl thioesterase, the Cinnamomum camphora C14 FatBl thioesterase, and the Cuphea hookeriana CIS FatBl thioesterase.
- the thioesterase is a Cuphea hookeriana C8/C10 FatB, comprising the amino acid sequence of SEQ ID NO:108 (full-length protein) or SEQ ID NO:109 (mature protein without signal sequence).
- the thioesterase is a Umbellularia californica C12 FatBl, comprising the amino acid sequence of SEQ I D NO:110 (full-length protein) or SEQ ID NO:lll (mature protein without signal sequence).
- the thioesterase is a Cinnamomum camphora C14 FatBl, comprising the amino acid sequence of SEQ I D NO:112 (full-length protein) or SEQ I D NO:113 (mature protein without signal sequence).
- the thioesterase is a Cuphea hookeriana CIS FatBl, comprising the amino acid sequence of SEQ ID NO:114 (full-length protein) or SEQ ID NO:115 (mature protein without signal sequence).
- Acetyl Coenzyme A carboxylases ACCase
- Embodiments of the present invention optionally include one or more exogenous (e.g., recombinantly introduced) or overexpressed ACCase proteins.
- an "acetyl CoA carboxylase" gene includes any polynucleotide sequence encoding amino acids, such as protein, polypeptide or peptide, obtainable from any cell source, which demonstrates the ability to cata lyze the carboxylation of acetyl-CoA to produce malonyl-CoA under enzyme reactive conditions, and further includes any naturally-occurring or non-naturally occurring variants of an acetyl-CoA carboxylase sequence having such ability.
- Acetyl-CoA carboxylase is a biotin-dependent enzyme that catalyses the irreversible carboxylation of acetyl-CoA to produce malonyl-CoA through its two catalytic activities, biotin carboxylase (BC) and carboxyltransferase (CT).
- BC biotin carboxylase
- CT carboxyltransferase
- the biotin carboxylase (BC) domain catalyzes the first step of the reaction: the carboxylation of the biotin prosthetic group that is covalently linked to the biotin carboxyl carrier protein (BCCP) domain.
- the carboxyltransferase (CT) domain catalyzes the transfer of the carboxyl group from (carboxy) biotin to acetyl-CoA.
- CRC carboxyltransferase
- ACCase acetyl-CoA carboxylase
- ACCase is a multi-subunit enzyme, whereas in most eukaryotes it is a large, multi-domain enzyme.
- yeast the crystal structure of the CT domain of yeast ACCase has been determined at 2.7A resolution (Zhang et ai, Science, 299:2064-2067 (2003). This structure contains two domains, which share the same backbone fold. This fold belongs to the crotonase/CIpP family of proteins, with a b-b-a superhelix.
- the CT domain contains many insertions on its surface, which are important for the dimerization of ACCase.
- the active site of the enzyme is located at the dimer interface.
- Cyanobacteria such as Synechococcus
- a native ACCase enzyme these bacteria typically do not produce or accumulate significant amounts of fatty acids.
- Synechococcus in the wild accumulates fatty acids in the form of lipid membranes to a total of about 4% by dry weight.
- embodiments of the present invention include methods of increasing the production of fatty acid biosynthesis, and, thus, lipid production, in Cyanobacteria by introducing one or more polynucleotides that encode an ACCase enzyme that is exogenous to the Cyanobacterium's native genome.
- embodiments of the present invention also include a modified Cyanobacterium, and compositions comprising said Cyanobacterium, comprising one or more polynucleotides that encode an ACCase enzyme that is exogenous to the Cyanobacterium's native genome.
- a polynucleotide encoding an ACCase enzyme may be isolated or obtained from any organism, such as any prokaryotic or eukaryotic organism that contains an endogenous ACCase gene.
- eukaryotic organisms having an ACCase gene are well-known in the art, and include various animals (e.g., mammals, fruit flies, nematodes), plants, parasites, and fungi (e.g., yeast such as 5. cerevisiae and Schizosaccharomyces pombe).
- the ACCase encoding polynucleotide sequences are obtained from Synechococcus sp. PCC7002.
- prokaryotic organisms that may be utilized to obtain a polynucleotide encoding an enzyme having ACCase activity include, but are not limited to, Escherichia coli, Legionella pneumophila, Listeria monocytogenes, Streptococcus pneumoniae, Bacillus subtilis, Ruminococcus obeum ATCC 29174, marine gamma proteobacterium HTCC2080, Roseovarius sp. HTCC2601, Oceanicola granulosus HTCC2516, Bacteroides caccae ATCC 43185, Vibrio alginolyticus 12G01, Pseudoalteromonas tunicata D2, Marinobacter sp.
- ELB17 marine gamma proteobacterium HTCC2143, Roseobacter sp. SK209-2-6, Oceanicola batsensis HTCC2597, Rhizobium leguminosarum bv. trifolii WSM 1325, Nitrobacter sp. Nb-311A, Chloroflexus aggregans DSM 9485, Chlorobaculum parvum, Chloroherpeton thalassium, Acinetobacter baumannii, Geobacillus, and Stenotrophomonas maltophilia, among others.
- acetyl-CoA carboxylases comprise or consist of a polypeptide sequence set forth in any of SEQ ID NOs:55, 45, 46, 47, 48 or 49, or a fragment or variant thereof.
- SEQ ID NO:55 is the sequence of Saccharomyces cerevisiae acetyl-CoA carboxylase (yAccl);
- SEQ ID NO:45 is Synechococcus sp.
- SEQ ID NO:46 is Synechococcus sp.
- SEQ I D NO:47 is Synechococcus sp.
- Phosphatidate phosphatases catalyze the dephosphorylation of phosphatidate (PtdOH), yielding diacylglycerol (DAG) and inorganic phosphate.
- PAP 3-s/i-phosphatidate phosphohydrolase
- PAG diacylglycerol
- This enzyme belongs to the family of hydrolases, specifically those acting on phosphoric monoester bonds. The systematic name of this enzyme class is 3-sn-phosphatidate phosphohydrolase. Other names in common use include phosphatic acid phosphatase, acid phosphatidyl phosphatase, and phosphatic acid phosphohydrolase.
- This enzyme participates in at least 4 metabolic pathways: glycerolipid metabolism, glycerophospholipid metabolism, ether lipid metabolism, and sphingolipid metabolism.
- PAP enzymes have roles in both the synthesis of phospholipids and triacylglycerol through its product diacylglycerol, as well as the generation or degradation of lipid-signaling molecules in eukaryotic cells.
- PAP enzymes are typically classified as either Mg 2+ -dependent (referred to as PAP1 enzymes) or Mg 2+ -independent (PAP2 or lipid phosphate phosphatase (LPP) enzymes) with respect to their cofactor requirement for catalytic activity.
- PAP1 enzymes Mg 2+ -dependent
- LPP lipid phosphate phosphatase
- PAP1 enzymes such as those found in Saccharomyces cerevisiae, play a role in de novo lipid synthesis (Han, et al. J Biol Chem. 281:9210-9218, 2006), thereby revealing that the two types of PAP are responsible for different physiological functions.
- the PAP reaction is the committed step in the synthesis of the storage lipid triacylglycerol (TAG), which is formed from PtdOH through the intermediate DAG.
- TAG storage lipid triacylglycerol
- the reaction product DAG is also used in the synthesis of the membrane phospholipids phosphatidylcholine (PtdCho) and phosphatidylethanolamine.
- the substrate PtdOH is used for the synthesis of all membrane phospholipids (and the derivative inositol-containing sphingolipids) through the intermediate CDP-DAG.
- regulation of PAP activity might govern whether cells make storage lipids and phospholipids through DAG or phospholipids through CDP-DAG.
- PAP is involved in the transcriptional regulation of phospholipid synthesis.
- PAPl enzymes have been purified and characterized from the membrane and cytosolic fractions of yeast, including a gene (Pahl, formerly known as Smp2) been identified to encode a PAPl enzyme in 5. cerevisiae.
- the PaM-encoded PAPl enzyme is found in the cytosolic and membrane fractions of the cell, and its association with the membrane is peripheral in nature.
- ⁇ mutants still contain PAPl activity, indicating the presence of an additional gene or genes encoding enzymes having PAPl activity.
- Mg 2+ ions as a cofactor for PAP enzymes
- the PaW-encoded PAPl enzyme has a DxDxT (SEQ ID NO:198) catalytic motif within a haloacid dehalogenase (HAD)-like domain ("x" is any amino acid).
- HAD haloacid dehalogenase
- This motif is found in a superfamily of Mg 2+ -dependent phosphatase enzymes, and its first aspartate residue is responsible for binding the phosphate moiety in the phosphatase reaction.
- the DPPl-and .PPl-encoded PAP2 enzymes contain a three-domain lipid phosphatase motif that is localized to the hydrophilic surface of the membrane.
- This catalytic motif which comprises the consensus sequences KxxxxxxRP (domain 1) (SEQ ID NO:116), PSGH (domain 2) (SEQ ID NO:117), and SRxxxxxHxxxD (domain 3) (SEQ ID NO:118), is shared by a superfamily of lipid phosphatases that do not req rind ions for activity.
- the conserved arginine residue in domain 1 and the conserved histidine residues in domains 2 and 3 may be essential for the catalytic activity of PAP2 enzymes.
- a phosphatidate phosphatase polypeptide may comprise one or more of the above-described catalytic motifs.
- a polypeptide having a phosphatidate phosphatase enzymatic activity may be obtained from any organism having a suitable, endogenous phosphatidate phosphatase gene.
- organisms that may be used to obtain a phosphatidate phosphatase encoding polynucleotide sequence include, but are not limited to, Homo sapiens, Mus musculus, Rattus norvegicus, Bos taurus, Drosophila melanogaster, Arabidopsis thaliana, Magnaporthe grisea, Saccharomyces cerevisiae, Schizosaccharomyces pom be, Cryptococcus neoformans, and Bacillus pumilus, among others.
- PAP enzymes include Pahl from 5. cerevisiae, PgpB from E. coli, and PAP from PCC6803.
- a phosphatidate phosphatase polypeptide comprises or consists of a polypeptide sequence set forth in SEQ I D NO:131, or a fragment or variant thereof.
- SEQ I D NO:131 is the sequence of Saccharomyces cerevisiae phosphatidate phosphatase (yPahl).
- the polypeptide seq uence of the PAP is encoded by the E. coli PgpB gene, and/or the PAP gene from Synechocystis sp. PCC6803.
- TAG hydrolases Triacylglycerol (TAG) Hydrolases.
- TAG hydrolases or TAG lipases
- Certain embodiments relate to the use of exogenous or overexpressed TAG hydrolases (or TAG lipases) to increase production of TAGs in a TAG-producing strain.
- specific embodiments may utilize a TAG hydrolase in combination with a DGAT, and optionally a TES. These embodiments may then further utilize an ACP, an Aas, or both, any of the lipid biosynthesis proteins described herein, and/or any of the modifications to glycogen production and storage described herein.
- TAG hydrolases may be used in TAG-producing strains (e.g., DGAT-expressing strains) with or without an ACP or Aas.
- TAG hydrolases are carboxylesterases that are typically specific for insoluble long chain fatty acid TAGs.
- Carboxylesterases catalyze the chemical reaction: carboxylic ester + H 2 0 alcohol + carboxylate
- the two substrates of this enzyme are carboxylic ester and H 2 0, whereas its two products are alcohol and carboxylate.
- TAG hydrolase expression or overexpression in a TAG producing strain (e.g., DGAT/ACP, DGAT/Aas, DGAT/ACP/Aas) releases acyl chains to not only increase accumulation of free fatty acids (FFA), but also increase the amount of free 1, 2 diacylglycerol (DAG).
- FFA free fatty acids
- DAG free 1, 2 diacylglycerol
- This free DAG then serves as a substrate for DGAT, and thereby allows increased TAG production, especially in the presence of over-expressed ACP, Aas, or both.
- TAG hydrolase produces increased amounts of TAG, relative, for example, to a DGAT only-expressing microorganism.
- the TAG hydrolase is specific for TAG and not DAG, i.e., it preferentially acts on TAG relative to DAG.
- TAG hydrolases include SDP1 (SUGAR-DEPENDENT1) triacylglycerol lipase from Arabidopsis thaliana (SEQ I D NO:170), ACIAD1335 from Acinetobacter sp. ADP1 (SEQ I D NO:171), TG14P from S. cerevisiae (SEQ ID NO:172), and RHAl_ro04722 (YP_704665) TAG lipase from Rhodococcus (SEQ ID NO:173).
- SDP1 SUGAR-DEPENDENT1
- ADP1 SEQ I D NO:171
- TG14P from S. cerevisiae
- YP_704665 RHAl_ro04722
- Rhodococcus Additional putative lipases/esterases from Rhodococcus include RHAl_ro01602 lipase/esterase (see SEQ ID NOs:156 and 174 for polynucleotide and polypeptide sequence, respectively), and RHAl_ro06856 lipase/esterase (see SEQ ID NOs:119 and 120 for polynucleotide and polypeptide seq uence, respectively).
- Fatty Acyl-CoA Synthetases relate to the use of overexpressed fatty acyl-CoA synthetases to increase activation of fatty acids, and thereby increase production of TAGs in a TAG-producing strain (e.g., a DGAT-expressing strain).
- a TAG-producing strain e.g., a DGAT-expressing strain
- specific embodiments may utilize an acyl-ACP reductase in combination with a fatty acyl-CoA synthetase and a DGAT. These embodiments may then further utilize an ACP, an ACCase, or both, and/or any of the modifications to glycogen production and storage or glycogen breakdown described herein.
- Fatty acyl-CoA synthetases activate fatty acids for metabolism by catalyzing the formation of fatty acyl-CoA thioesters. Fatty acyl-CoA thioesters can then serve not only as substrates for beta-oxidation, at least in bacteria capable of growing on fatty acids as a sole source of carbon (e.g., E. coli, Salmonella), but also as acyl donors in phospholipid biosynthesis. Many fatty acyl-CoA synthetases are characterized by two highly conserved sequence elements, an ATP/AM P binding motif, which is common to enzymes that form an adenylated intermediate, and a fatty acid binding motif.
- certain embodiments may employ fatty acyl-CoA synthetases to increase activation of free fatty acids, which can then be incorporated into TAGs, mainly by the DGAT-expressing (and thus TAG-producing) photosynthetic microorganisms described herein.
- fatty acyl-CoA synthetases can be used in any of the embodiments described herein, such as those that produce increased levels of free fatty acids, where it is desirable to turn free fatty acids into TAGs.
- these free fatty acids can then be activated by fatty acyl-CoA synthetases to generate acyl-CoA thioesters, which can then serve as substrates by DGAT to produce increased levels of TAGs.
- One exemplary fatty acyl-CoA synthetase includes the FadD gene from E. coli (SEQ ID NOS:16 and 17 for nucleotide and polypeptide sequence, respectively), which encodes a fatty acyl-CoA synthetase having substrate specificity for medium and long chain fatty acids.
- Other exemplary fatty acyl-CoA synthetases include those derived from 5.
- modified photosynthetic microorganisms e.g., Cyanobacteria
- modified photosynthetic microorganisms of the present invention further comprise one or more exogenous or introduced nucleic acids that encode a polypeptide having a lipase or phospholipase activity, or a fragment or variant thereof.
- Lipases including phospholipases, lysophospholipases, thioesterases, and enzymes having one, two, or all three of these activities, typically catalyze the hydrolysis of ester chemical bonds in lipid substrates.
- the expression of one or more phospholipases can generate fatty acids from membra ne lipids, which may then be used by the ACP and/or Aas to make acyl-ACPs. These acyl-ACPs, for example, can then feed into the triglyceride synthesis pathways, thereby increasing triglyceride (TAG) production.
- TAG triglyceride
- a phospholipase is an enzyme that hydrolyzes phospholipids into fatty acids and other lipophilic substances.
- Phospholipase Al cleaves the SN-1 acyl chain
- Phospholipase A2 cleaves the SN-2 acyl chain, releasing arachidonic acid.
- Phospholipase B cleaves both SN-1 and SN-2 acyl chains, and is also known as a lysophospholipase.
- Phospholipase C cleaves before the phosphate, releasing diacylglycerol and a phosphate-containing head group.
- Phospholipases C play a central role in signal transduction, releasing the second messenger, inositol triphosphate.
- Phospholipase D cleaves after the phosphate, releasing phosphatidic acid and an alcohol. Types C and D are considered phosphodiesterases. In various embodiments, one or more phospholipase from any one of these classes may be used, alone or in any combination.
- phospholipases act on phospholipids of different kinds including phosphatidyl glycerol, the major phospholipid in Cyanobacteria, by cleaving the acyl chains off the snl or sn2 positions (carbon 1 or 2 on the glycerol backbone); some are selective for snl or sn2, others act on both.
- Lysophospholipases act on lysophospholipids, which can be the product of phospholipases or on lysophosphatidic acid, a normal intermediate of the de novo phosphatidic acid synthesis pathway, e.g., l-acyl-DAG-3-phosphate.
- phospholipases and/or lysophospholipases can cleave off acyl chains from phospholipids or lysophospholipids and thus deregulate the normal recycling of the lipid membranes, including both cell membrane and thylakoid membranes, which then leads to accumulation of free fatty acids (FFAs).
- FFAs free fatty acids
- FFAs can be converted into acyl-ACPs by acyl ACP synthase (Aas) in a strain that also over-expresses ACP.
- acyl ACP synthase e.g., DGAT-containing microorganisms
- these acyl-ACPs can then serve as substrates for DGAT to make TAGs.
- phospholipases can be over-expressed to generate lyshophospholipids and acyl chains. The lysophospholipids can then serve as substrates for a lysophospholipase, which cleaves off the remaining acyl chain.
- these acyl chains can either accumulate as FFAs, or in other embodiments may serve as substrates of Acyl ACP synthase (Aas) to generate acyl-ACPs, which can then be used by DGAT to make TAGs.
- Acyl ACP synthase Acyl ACP synthase
- phospholipase C enzymes include those derived from eukaryotes such as mammals and parasites, in addition to those derived from bacteria.
- examples include phosphoinositide phospholipase C (EC 3.1.4.11), the main form found in eukaryotes, especially mammals, the zinc-dependent phospholipase C family of bacterial enzymes (EC 3.1.4.3) that includes alpha toxins, phosphatidylinositol diacylglycerol-lyase (EC 4.6.1.13), a related bacterial enzyme, and glycosylphosphatidylinositol diacylglycerol-lyase (EC 4.6.1.14), a trypanosomal enzyme.
- the present invention contemplates using a lysophospholipase.
- a lysophospholipase is an enzyme that catalyzes the chemical reaction :
- the two substrates of this enzyme are 2-lysophosphatidylcholine and H 2 0, whereas its two products are glycerophosphocholine and carboxylate.
- Lysophospholipase are members of the hydrolase family, specifically those acting on carboxylic ester bonds. Lysophospholipases participate in glycerophospholipid metabolism. Examples of lysophospholipases include, but are not limited to, 2-Lysophosphatidylcholine acylhydrolase, Lecithinase B, Lysolecithinase, Phospholipase B, Lysophosphatidase, Lecitholipase, Phosphatidase B, Lysophosphatidylcholine hydrolase, Lysophospholipase Al, Lysophospholipase LI (TesA), Lysophopholipase L2, TesB, Lysophospholipase transacylase, Neuropathy target esterase, NTE, NTE-LysoPLA, NTE-lysophospholipase, and Vu Patatin 1 protein.
- lysophospholipases utilized according to the present invention are derived from a bacteria, e.g., E. coli, or a plant. Any of these lysophospholipases may be used according to various embodiments of the present invention.
- Lysophospholipases such as Lysophospholipase LI (also referred to as PldC or TesA) are periplasmically-localized or cytoplasmically-localized enzymes that have both lysophospholipase and thioesterase activity, as described above.
- certain thioesterases such as TesA can also be characterized as lysophospholipases.
- PldC(*TesA) is not exported to the periplasm due to deletion of an N-terminal amino acid sequence required for proper transport of TesA from the cytoplasm to the periplasm.
- PldC(*TesA) results in hydrolysis of acyl groups from endogenous acyl-ACP and acyl-CoA molecules.
- Cells expressing PldC(*TesA) must channel additional cellular carbon and energy to maintain production of acyl-ACP and acyl-coA molecules, which are required for membrane lipid synthesis.
- PldC(*TesA) expression results in a net increase in cellular lipid content.
- PldC(*TesA) is expressed in Synechococcus lipid content doubles from 10% of biomass to 20% of biomass.
- lysophospholipases utilized according to the present invention have both phospholipase and thioesterase activities.
- lysophospholipases that have both activities include, e.g., Lysophospholipase LI (TesA), such as E. coli Lysophospholipase LI, as well as fragments and variants thereof, including those described in the paragraph above.
- TesA Lysophospholipase LI
- certain embodiments may employ TesA variants having only lysophospholipase activity, including variants with reduced or no thioesterase activity.
- the phospholipase is a bacterial phospholipase, e.g., lysophospholipase, or a fragment or variant thereof, e.g., a phospholipase derived from Escherichia coli, 5. cerevisiae, Rhodococcus, Streptomyces or Acinetobacter species.
- phospholipases include phospholipase Al (PldA) from Acinetobacter sp. ADP1, phospholipase A (PldA) from E. coli, phospholipase from Streptomyces coelicolor A3(2), phospholipase A2 (PLA2-a) from Arabidopsis thaliana; phospholipase Al/ triacylglycerol lipase (DAD1; Defective Anther Dehiscence 1) from Arabidopsis thaliana, chloroplast DONGLE from Arabidopsis thaliana, patatin-like protein from Arabidopsis thaliana, and patatin from Anabaena variabilis ATCC 29413.
- PLD1 phospholipase Al
- chloroplast DONGLE from Arabidopsis thaliana, patatin-like protein from Arabidops
- lysophospholipases include phospholipase B (Plblp) from Saccharomyces cerevisiae S288c, phospholipase B (Plb2p) from Saccharomyces cerevisiae S288c, ACIAD1057 (tesA homolog) from Acinetobacter ADPl, ACIAD1943 lysophospholipase from Acinetobacter ADPl, and a lysophospholipase (YP_702320; RHAl_ro02357) from Rhodococcus.
- Plblp phospholipase B
- Plb2p phospholipase B
- ACIAD1057 tesA homolog
- Acinetobacter ADPl ACIAD1943 lysophospholipase from Acinetobacter ADPl
- YP_702320; RHAl_ro02357 from Rhodococcus.
- the encoded phospholipase comprises or consists of a Lysophospholipase LI (TesA), Lysophospholipase L2, TesB, or Vu patatin 1 protein, or a homolog, fragment, or variant thereof.
- the Lysophospholipase LI (TesA), Lysophospholipase L2, or TesB is a bacterial Lysophospholipase LI (TesA), Lysophospholipase L2, or TesB, such as an E.coli Lysophospholipase LI (TesA) having the wild-type sequence set forth in SEQ ID NO:133, an E.
- Vu patatin 1 protein has the wild-type sequenc set forth in SEQ ID NO:138.
- the phospholipase is modified such that it localizes predominantly to the cytoplasm instead of the periplasm.
- the phospholipase may have a deletion or mutation in a region associated with periplasmic localization.
- the phospholipase variant is derived from Lysophospholipase LI (TesA) or TesB.
- the Lysophospholipase LI (TesA) or TesB variant is a bacterial Lysophospholipase LI (TesA) or TesB variant, such as a cytoplasmic E.coli Lysophospholipase LI (PldC(*TesA)) variant having the sequence set forth in SEQ ID NO:121.
- phospholipase polypeptide sequences include phospholipase Al (PIdA) from Acinetobacter sp. ADPl (SEQ ID NO:157), phospholipase A (PIdA) from E. coli (SEQ ID NO:158), phospholipase from Streptomyces coelicolor A3(2) (SEQ ID NO:159), phospholipase A2 (PLA2-a) from Arabidopsis thaliana (SEQ ID NO:160).
- PIdA phospholipase Al
- ADPl SEQ ID NO:157
- phospholipase A from E. coli
- phospholipase from Streptomyces coelicolor A3(2)
- PDA2-a phospholipase A2 from Arabidopsis thaliana
- DAD1 Defective Anther Dehiscence 1
- Arabidopsis thaliana SEQ ID NO:161
- chloroplast DONGLE from Arabidopsis thaliana
- patatin from Anabaena variabilis ATCC 29413 SEQ ID NO:164.
- lysophospholipase polypeptide sequences include phospholipase B (Plblp) from Saccharomyces cerevisiae S288c (SEQ ID NO:165), phospholipase B (Plb2p) from Saccharomyces cerevisiae S288c (SEQ ID NO:166), ACIAD1057 (TesA homolog) from Acinetobacter ADPl (SEQ ID NO:167), ACIAD1943 lysophospholipase from Acinetobacter ADP1 (SEQ I D NO:168), and a lysophospholipase (YP_702320; RHAl_ro02357) from Rhodococcus (SEQ ID NO:169).
- Fatty acyl reductase Certain embodiments relate to the use of overexpressed fatty acyl reductases to increase synthesis of fatty alcohols, and thereby increase production of wax esters in a WE-producing strain (e.g., a DGAT-expressing strain). For instance, certain embodiments may utilize a fatty acyl reductase, possibly in combination with an acyl-ACP reductase, and a DGAT. These embodiments may then further utilize an ACP, an ACCase, or both, and/or any of the modifications to glycogen production and storage or glycogen breakdown described herein.
- a WE-producing strain e.g., a DGAT-expressing strain.
- a fatty acyl reductase possibly in combination with an acyl-ACP reductase, and a DGAT.
- These embodiments may then further utilize an ACP, an ACCase, or both, and/or any of the modifications to glycogen production and
- Fatty Acyl Reductases catalyze the two step reduction of acyl-ACP's or acyl-COA's to acyl alcohols, also known as fatty alcohols.
- the first step proceeds via an acyl aldehyde intermediate, which is then converted in a second step to a fatty alcohol.
- These same enzymes can also directly reduce fatty aldehydes to fatty a lcohols (i.e. step two only). In this case they are sometimes referred to as fatty aldehyde reductases.
- Fatty alcohols can serve as a substrate for wax ester biosynthesis by a DGAT.
- Many fatty acyl reductases are characterized by three conserved seq uence elements.
- NADPH binding motif a motif characteristic of the catalytic site of NADP-utilizing enzymes, and a conserved C-terminal domain, referred to as the Male Sterile 2 domain, that is of unknown function (see Hofvander et al., FEBS Letters (2011) pp 3583-3543)
- certain embodiments may employ fatty acyl reductases to increase synthesis of fatty alcohols, which can then be incorporated into WE's, mainly by the DGAT-expressing (and thus WE-producing) photosynthetic microorganisms described herein.
- fatty acyl reductases can be used in any of the embodiments described herein, such as those that produce increased levels of free fatty alcohols, where it is desirable to turn these into WE's.
- these free fatty alcohols can then be esterified to fatty acids (in the form of acyl-ACP) by DGATs to generate WE's.
- One exemplary fatty acyl reductase includes a gene from Marinobacter aq uaeolei VT8, genbank accession number YP_959769.1 (see SEQ ID NOs:211 and 212 for polypeptide and polynucleotide sequence, respectively).
- Others include a gene from Simmondsia chinesis (Jojoba) AF149917 (see SEQ I D NOs:213 and 214 for polypeptide and polynucleotide seq uence, respectively); a gene from Euglena gracilis GU733919 (see SEQ ID NOs:215 and 216 for polypeptide and polynucleotide seq uence, respectively); a gene from Hahella chejuensis YP 436183 (see SEQ I D NOs:217 and 218 for polypeptide and polynucleotide sequence, respectively); a gene from Photobacterium profundum SS9 YP 130411.1 (see SEQ I D NOs:219 and 220 for polypeptide and polynucleotide sequence, respectively); a gene from Marinobacter algicola DG893 ZP 01892457 (see SEQ ID NOs:221 and 222 for polypeptide and polynucleotide sequence
- a modified photosynthetic microorganism further comprises additional modifications, such that it has reduced expression of one or more genes associated with a glycogen synthesis or storage pathway and/or increased expression of one or more polynucleotides that encode a protein associated with a glycogen breakdown pathway, or a functional variant of fragment thereof.
- modified photosynthetic microorganisms e.g., Cyanobacteria
- these modified photosynthetic microorganisms have reduced expression of one or more genes associated with glycogen synthesis and/or storage.
- these modified photosynthetic microorganisms have a mutated or deleted gene associated with glycogen synthesis and/or storage.
- these modified photosynthetic microorganisms comprise a vector that includes a portion of a mutated or deleted gene, e.g., a targeting vector used to generate a knockout or knockdown of one or more alleles of the mutated or deleted gene.
- these modified photosynthetic microorganisms comprise an antisense RNA or siRNA that binds to an mRNA expressed by a gene associated with glycogen synthesis and/or storage.
- modified photosynthetic microorganisms e.g., Cyanobacteria
- modified photosynthetic microorganisms comprise one or more exogenous or introduced nucleic acids that encode a polypeptide having an activity associated with a glycogen breakdown or triglyceride or fatty acid biosynthesis, including but not limited to any of those described herein.
- the exogenous nucleic acid does not comprise a nucleic acid sequence that is native to the microorganism's genome.
- the exogenous nucleic acid comprises a nucleic acid sequence that is native to the microorganism's genome, but it has been introduced into the microorganism, e.g., in a vector or by molecular biology techniques, for example, to increase expression of the nucleic acid and/or its encoded polypeptide in the microorganism.
- Glycogen Biosynthesis and Storage.
- Glycogen is a polysaccharide of glucose, which functions as a means of carbon and energy storage in most cells, including animal and bacterial cells. More specifically, glycogen is a very large branched glucose homopolymer containing about 90% a-l,4-glucosidic linkages and 10% a- 1,6 linkages.
- the biosynthesis and storage of glycogen in the form of a-l,4-polyglucans represents an important strategy to cope with transient starvation conditions in the environment.
- Glycogen biosynthesis involves the action of several enzymes. For instance, bacterial glycogen biosynthesis occurs generally through the following general steps: (1) formation of glucose-l-phosphate, catalyzed by phosphoglucomutase (Pgm), followed by (2) ADP-glucose synthesis from ATP and glucose 1-phosphate, catalyzed by glucose-l-phosphate adenylyltransferase (GlgC), followed by (3) transfer of the glucosyl moiety from ADP-glucose to a pre-existing a- 1,4 glucan primer, catalyzed by glycogen synthase (GIgA).
- Pgm phosphoglucomutase
- GlgC glucose-l-phosphate adenylyltransferase
- GgA glycogen synthase
- This latter step of glycogen synthesis typically occurs by utilizing ADP-glucose as the glucosyl donor for elongation of the a-l,4-glucosidic chain.
- the main regulatory step in glycogen synthesis takes place at the level of ADP-glucose synthesis, or step (2) above, is the reaction catalyzed by glucose-l-phosphate adenylyltransferase (GlgC), also known as ADP-glucose pyrophosphorylase (see, e.g., Ballicora et al., Microbiology and Molecular Biology Reviews 6:213-225, 2003).
- the main regulatory step in mammalian glycogen synthesis occurs at the level of glycogen synthase.
- the carbon that would have otherwise been stored as glycogen can be utilized by said photosynthetic microorganism to synthesize other carbon-based storage molecules, such as lipids, fatty acids, and triglycerides.
- modified photosynthetic microorganisms e.g., Cyanobacteria
- certain modified photosynthetic microorganisms may comprise a mutation, deletion, or any other alteration that disrupts one or more of these steps (i.e., renders the one or more steps "non-functional" with respect to glycogen biosynthesis and/or storage), or alters any one or more of the enzymes directly involved in these steps, or the genes encoding them.
- modified photosynthetic microorganisms e.g., Cyanobacteria
- Certain exemplary glycogen biosynthesis genes are described below.
- a modified photosynthetic microorganism e.g., a Cyanobacteria
- it may comprise a mutation or deletion in the phosphoglucomutase gene, including any of its regulatory elements (e.g., promoters, enhancers, transcription factors, positive or negative regulatory proteins, etc.).
- Phosphoglucomutase encoded by the gene pgm, catalyzes the reversible transformation of glucose 1-phosphate into glucose 6-phosphate, typically via the enzyme-bound intermediate, glucose 1,6-biphosphate (see, e.g., Lu et al., Journal of Bacteriology 176:5847-5851, 1994). Although this reaction is reversible, the formation of glucose-6-phosphate is markedly favored. However, typically when a large amount of glucose-6-phosphate is present, Pgm catalyzes the phosphorylation of the 1-carbon and the dephosphorylation of the c-carbon, resulting in glucose-l-phosphate.
- the resulting glucose-l-phosphate is then converted to UDP-glucose by a number of intermediate steps, including the catalytic activity of GIgC, which can then be added to a glycogen storage molecule by the activity of glycogen synthase, described below.
- the Pgm enzyme plays an intermediary role in the biosynthesis and storage of glycogen.
- the pgm gene is expressed in a wide variety of organisms, including most, if not all, Cyanobacteria.
- the pgm gene is also fairly conserved among Cyanobacteria, as can be appreciated upon comparison of SEQ ID NOs:24 (5. elongatus PCC7942), 25 (Synechocystis sp. PCC6803), and 26 ⁇ Synechococcus sp. WH8102), 79 ⁇ Synechococcus RCC307), and 80 (Synechococcus 7002),which provide the polynucleotide sequences of various pgm genes from Cyanobacteria.
- Cyanobacteria such as Synechococcus
- Glucose-l-Phosphate Adenylyltransferase glgC
- a modified photosynthetic microorganism e.g., a Cyanobacteria
- it may comprise a m utation or deletion in the glgC gene, including any of its regulatory elements.
- the enzyme encoded by the glgC gene (e.g., EC 2.7.7.27) participates generally in starch, glycogen and sucrose metabolism by catalyzing the following chemical reaction:
- the two substrates of this enzyme are ATP and alpha-D-glucose 1-phosphate, whereas its two products are diphosphate and ADP-glucose.
- the g/gC-encoded enzyme catalyzes the first committed and rate-limiting step in starch biosynthesis in plants and glycogen biosynthesis in bacteria. It is the enzymatic site for regulation of storage polysaccharide accum ulation in plants and bacteria, being allosterically activated or inhibited by metabolites of energy flux.
- the enzyme encoded by the glgC gene belongs to a family of transferases, specifically those transferases that transfer phosphorus-containing nucleotide groups (i.e., nucleotidyl-transferases).
- the systematic name of this enzyme class is typically referred to as ATP :alpha-D-glucose-l-phosphate adenylyltransferase.
- ADP glucose pyrophosphorylase glucose 1-phosphate adenylyltransferase, adenosine diphosphate glucose pyrophosphorylase, adenosine diphosphoglucose pyrophosphorylase, ADP-glucose pyrophosphorylase, ADP-glucose synthase, ADP-glucose synthetase, ADPG pyrophosphorylase, and ADP:alpha-D-glucose-l-phosphate adenylyltransferase.
- the glgC gene is expressed in a wide variety of plants and bacteria, including most, if not all, Cyanobacteria.
- the glgC gene is also fairly conserved among Cyanobacteria, as can be appreciated upon comparison of SEQ ID NOs:27 (5. elongatus PCC7942), 28 (Synechocystis sp. PCC6803), 29 ⁇ Synechococcus sp. PCC 7002), 30 ⁇ Synechococcus sp. WH8102), 31 (Synechococcus sp.
- RCC 307 32 (Trichodesmium erythraeum I MS 101), 33 (Anabaena varibilis), and 34 (Nostoc sp. PCC 7120), which describe the polynucleotide seq uences of various glgC genes from Cyanobacteria.
- a modified photosynthetic microorganism e.g., a Cyanobacteria
- expresses a reduced amount of a glycogen synthase gene in particular embodiments, it may comprise a deletion or mutation in the glycogen synthase gene, including any of is regulatory elements.
- Glycogen synthase also known as UDP-glucose-glycogen glucosyltransferase, is a glycosyltransferase enzyme that catalyses the reaction of UDP-glucose and (l,4-a-D-glucosyl) n to yield UDP and (l,4-a-D-glucosyl) n+ i.
- Glycogen synthase is an a-retaining glucosyltransferase that uses ADP-glucose to incorporate additional glucose monomers onto the growing glycogen polymer.
- GlgA catalyzes the final step of converting excess glucose residues one by one into a polymeric chain for storage as glycogen.
- glycogen synthases or a-l,4-glucan synthases
- a-l,4-glucan synthases have been divided into two families, animal/fungal glycogen synthases and bacterial/plant starch synthases, according to differences in seq uence, sugar donor specificity and regulatory mechanisms.
- detailed sequence analysis, predicted secondary structure comparisons, and threading analysis show that these two families are structurally related and that some domains of animal/fungal synthases were acquired to meet the particular regulatory requirements of those cell types.
- the C-terminal domain shows a similar fold with a six-stranded parallel ⁇ -sheet and nine a-helices. The last a-helix of this domain undergoes a kink at position 457-460, with the final 17 residues of the protein (461-477) crossing over to the N-terminal domain and continuing as a-helix, a typical feature of glycosyltransferase enzymes.
- glycogen synthase has a much wider catalytic cleft, which is predicted to undergo an important interdomain 'closure' movement during the catalytic cycle.
- the glgA gene is expressed in a wide variety of cells, including animal, plant, fungal, and bacterial cells, including most, if not all, Cyanobacteria.
- the glgA gene is also fairly conserved among Cyanobacteria, as can be appreciated upon com parison of SEQ I D NOs:35 (5. elongatus PCC7942), 36 (Synechocystis sp. PCC6803), 37 (Synechococcus sp. PCC 7002), 38 (Synechococcus sp. WH8102), 39 (Synechococcus sp.
- RCC 307 Trichodesmium erythraeum IMS 101
- 41 Arabaena variabilis
- 42 Nostoc sp. PCC 7120
- a modified photosynthetic microorganism of the present invention expresses an increased amount of one or more polypeptides associated with a glycogen breakdown pathway.
- said one or more polypeptides include glycogen phosphorylase (GlgP), glycogen isoamylase (GlgX), glucanotransferase (MalQ), phosphoglucomutase (Pgm), glucokinase (Glk), and/or phosphoglucose isomerase (Pgi), or a functional fragment or variant thereof, including, for example, those provided in SEQ ID NOs:68, 70, 72, 73, 83 or 85.
- Pgm polypeptide sequences useful according to the present invention are provided in SEQ ID NOs:74, 76, 77, 79, and 81.
- Pgm, Glk, and Pgi are bidirectional enzymes that can promote glycogen synthesis or breakdown depending on conditions.
- embodiments of the present invention include variants and fragments of any of the reference polypeptides and polynucleotides described herein (see, e.g., the Sequence Listing).
- Variant polypeptides are biologically active, that is, they continue to possess the enzymatic activity of a reference polypeptide.
- Such variants may result from, for example, genetic polymorphism and/or from human manipulation.
- Biologically active variants of a reference polypeptide will have at least 40%, 50%, 60%, 70%, generally at least 75%, 80%, 85%, usually about 90% to 95% or more, and typically about 97% or 98% or more sequence similarity or seq uence identity to the amino acid sequence for a reference protein as determined by sequence alignment programs described elsewhere herein using default parameters.
- a biologically active variant of a reference polypeptide may differ from that protein generally by as much 200, 100, 50 or 20 amino acid residues or suitably by as few as 1-15 amino acid residues, as few as 1-10, such as 6-10 amino acid residues, including about 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, or even 1 amino acid residues.
- a variant polypeptide differs from the reference seq uences referred to herein (see, e.g., the Sequence Listing) by at least one but by less than 15, 10 or 5 amino acid residues. I n other embodiments, it differs from the reference seq uences by at least one residue but less than 20%, 15%, 10% or 5% of the residues.
- a biologically active fragment can be a polypeptide fragment which is, for example, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29,30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 220, 240, 260, 280, 300, 320, 340, 360, 380, 400, 450, 500, 600 or more contiguous amino acids, including all integers in between, of a reference polypeptide sequence.
- a reference polypeptide may be altered in various ways including amino acid substitutions, deletions, truncations, and insertions. Methods for such manipulations are generally known in the art. For example, amino acid sequence variants of a reference polypeptide can be prepared by mutations in the DNA. Methods for mutagenesis and nucleotide seq uence alterations are well known in the art. See, for example, Kunkel (PNAS USA. 82: 488-492, 1985); Kunkel et al., (Methods in Enzymol. 154: 367-382, 1987), U.S. Pat. No. 4,873,192, Watson, J. D.
- Polypeptide variants may contain conservative amino acid substitutions at various locations along their sequence, as compared to a reference amino acid sequence.
- a "conservative amino acid substitution” is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art, which can be generally sub-classified as follows:
- Acidic The residue has a negative charge due to loss of H ion at physiological pH and the residue is attracted by aqueous solution so as to seek the surface positions in the conformation of a peptide in which it is contained when the peptide is in aqueous medium at physiological pH.
- Amino acids having an acidic side chain include glutamic acid and aspartic acid.
- the residue has a positive charge due to association with H ion at physiological pH or within one or two pH units thereof (e.g., histidine) and the residue is attracted by aq ueous solution so as to seek the surface positions in the conformation of a peptide in which it is contained when the peptide is in aq ueous medium at physiological pH.
- Amino acids having a basic side chain include arginine, lysine and histidine.
- the residues are charged at physiological pH and, therefore, include amino acids having acidic or basic side chains (i.e., glutamic acid, aspartic acid, arginine, lysine and histidine).
- amino acids having acidic or basic side chains i.e., glutamic acid, aspartic acid, arginine, lysine and histidine.
- Hydrophobic The residues are not charged at physiological pH and the residue is repelled by aq ueous solution so as to seek the inner positions in the conformation of a peptide in which it is contained when the peptide is in aqueous medium.
- Amino acids having a hydrophobic side chain include tyrosine, valine, isoleucine, leucine, methionine, phenylalanine and tryptophan.
- Neutral/polar The residues are not charged at physiological pH, but the residue is not sufficiently repelled by aqueous solutions so that it would seek inner positions in the conformation of a peptide in which it is contained when the peptide is in aqueous medium.
- Amino acids having a neutral/polar side chain include asparagine, glutamine, cysteine, histidine, serine and threonine.
- proline This description also characterizes certain amino acids as “small” since their side chains are not sufficiently large, even if polar groups are lacking, to confer hydrophobicity.
- "small” amino acids are those with four carbons or less when at least one polar group is on the side chain and three carbons or less when not.
- Amino acids having a small side chain include glycine, serine, alanine and threonine.
- the gene-encoded secondary amino acid proline is a special case due to its known effects on the secondary conformation of peptide chains.
- the structure of proline differs from all the other naturally-occurring amino acids in that its side chain is bonded to the nitrogen of the a-amino group, as well as the a-carbon.
- the degree of attraction or repulsion req uired for classification as polar or nonpolar is arbitrary and, therefore, amino acids specifically contem plated by the invention have been classified as one or the other. Most amino acids not specifically named can be classified on the basis of known behaviour.
- Amino acid residues can be further sub-classified as cyclic or non-cyclic, and aromatic or non-aromatic, self-explanatory classifications with respect to the side-chain substituent groups of the residues, and as small or large.
- the residue is considered small if it contains a total of four carbon atoms or less, inclusive of the carboxyl carbon, provided an additional polar substituent is present; three or less if not.
- Small residues are, of course, always non-aromatic.
- amino acid residues may fall in two or more classes. For the naturally-occurring protein amino acids, sub-classification according to this scheme is presented in Table A.
- Conservative amino acid substitution also includes groupings based on side chains.
- a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids having sulphur-containing side chains is cysteine and methionine.
- Amino acid substitutions falling within the scope of the invention are, in general, accomplished by selecting substitutions that do not differ significantly in their effect on maintaining (a) the structure of the peptide backbone in the area of the substitution, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. After the substitutions are introduced, the variants are screened for biological activity.
- similar amino acids for making conservative substitutions can be grouped into three categories based on the identity of the side chains.
- the first group includes glutamic acid, aspartic acid, arginine, lysine, histidine, which all have charged side chains;
- the second group includes glycine, serine, threonine, cysteine, tyrosine, glutamine, asparagine;
- the third group includes leucine, isoleucine, valine, alanine, proline, phenylalanine, tryptophan, methionine, as described in Zubay, G., Biochemistry, third edition, Wm.C. Brown Publishers (1993).
- a predicted non-essential amino acid residue in reference polypeptide is typically replaced with another amino acid residue from the same side chain family.
- mutations can be introduced randomly along all or part of a coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for an activity of the parent polypeptide to identify mutants which retain that activity.
- the encoded peptide can be expressed recombinantly and the activity of the peptide can be determined.
- a "non-essential" amino acid residue is a residue that can be altered from the wild-type sequence of an embodiment polypeptide without abolishing or substantially altering one or more of its activities.
- the alteration does not substantially abolish one of these activities, for example, the activity is at least 20%, 40%, 60%, 70% or 80% 100%, 500%, 1000% or more of wild-type.
- An "essential" amino acid residue is a residue that, when altered from the wild-type sequence of a reference polypeptide, results in abolition of an activity of the parent molecule such that less than 20% of the wild-type activity is present.
- such essential amino acid residues may include those that are conserved in reference polypeptides across different species, including those sequences that are conserved in the enzymatic sites of reference polypeptides from various sources.
- the present invention also contemplates variants of the naturally-occurring reference polypeptide sequences or their biologically-active fragments, wherein the variants are distinguished from the naturally-occurring sequence by the addition, deletion, or substitution of one or more amino acid residues.
- variants will display at least about 30, 40, 50, 55, 60, 65, 70, 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99% similarity or sequence identity to a reference polypeptide sequence.
- sequences differing from the native or parent sequences by the addition, deletion, or substitution of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 30, 40, 50, 60, 70, 80, 90, 100 or more amino acids but which retain the properties or activities of a parent or reference polypeptide sequence are contemplated.
- variant polypeptides differ from reference sequence by at least one but by less than 50, 40, 30, 20, 15, 10, 8, 6, 5, 4, 3 or 2 amino acid residue(s). In other embodiments, variant polypeptides differ from a reference sequence by at least 1% but less than 20%, 15%, 10% or 5% of the residues. (If this comparison requires alignment, the sequences should be aligned for maximum similarity. "Looped" out sequences from deletions or insertions, or mismatches, are considered differences.)
- sequence similarity or seq uence identity between sequences are performed as follows. To determine the percent identity of two amino acid sequences, or of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid seq uence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). I n certain embodiments, the length of a reference sequence aligned for comparison purposes is at least 30%, preferably at least 40%, more preferably at least 50%, 60%, and even more preferably at least 70%, 80%, 90%, 100% of the length of the reference sequence.
- amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second seq uence, then the molecules are identical at that position.
- the percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal a lignment of the two sequences.
- the comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm.
- the percent identity between two amino acid seq uences is determined using the Needleman and Wunsch, (J. Mol. Biol. 48: 444-453, 1970) algorithm which has been incorporated into the GAP program in the GCG software package, using either a Blossum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6.
- the percent identity between two nucleotide seq uences is determined using the GAP program in the GCG software package, using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6.
- a particularly preferred set of parameters are a Blossum 62 scoring matrix with a gap penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5.
- the percent identity between two amino acid or nucleotide seq uences can be determined using the algorithm of E. Meyers and W. Miller (Cabios. 4:11-17, 1989) which has been incorporated into the ALIGN program (version 2.0), using a PAM 120 weight residue table, a gap length penalty of 12 and a gap penalty of 4.
- nucleic acid and protein sequences described herein can be used as a "q uery sequence" to perform a search against public databases to, for example, identify other family members or related sequences.
- Such searches can be performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al., (1990, J. Mol. Biol, 215: 403-10).
- Gapped BLAST can be utilized as described in Altschul et al., ⁇ Nucleic Acids Res. 25: 3389-3402, 1997).
- the default parameters of the respective programs e.g., XBLAST and NBLAST.
- polynucleotides and/or polypeptides can be evaluated using a BLAST alignment tool.
- a local alignment consists simply of a pair of seq uence segments, one from each of the sequences being compared.
- a modification of Smith-Waterman or Sellers algorithms will find all segment pairs whose scores cannot be improved by extension or trimming, called high-scoring segment pairs (HSPs).
- HSPs high-scoring segment pairs
- the results of the BLAST alignments include statistical measures to indicate the likelihood that the BLAST score can be expected from chance alone.
- the raw score, S is calculated from the number of gaps and substitutions associated with each aligned sequence wherein higher similarity scores indicate a more significant alignment. Substitution scores are given by a look-up table (see PAM, BLOSUM). Gap scores are typically calculated as the sum of G, the gap opening penalty and L, the gap extension penalty. For a gap of length n, the gap cost would be G+Ln.
- the choice of gap costs, G and L is empirical, but it is customary to choose a high value for G (10-15), e.g., 11, and a low value for L (1-2) e.g., 1.
- bit score is derived from the raw alignment score S in which the statistical properties of the scoring system used have been taken into account. Bit scores are normalized with respect to the scoring system, therefore they can be used to compare alignment scores from different searches. The terms "bit score” and “similarity score” are used interchangeably. The bit score provides an indication of how good the alignment is; (i.e., the higher the score, the better the alignment).
- the E-Value describes the likelihood that a sequence with a similar score will occur in the database by chance. It is a prediction of the number of different alignments with scores equivalent to or better than S that are expected to occur in a database search by chance. The smaller the E-Value, the more significant the alignment. For example, an alignment having an E value of e 117 means that a seq uence with a similar score is very unlikely to occur simply by chance. Additionally, the expected score for aligning a random pair of amino acids is required to be negative, otherwise long alignments would tend to have high score independently of whether the segments aligned were related. Additionally, the BLAST algorithm uses an appropriate substitution matrix, nucleotide or amino acid and for gapped alignments uses gap creation and extension penalties. For example, BLAST alignment and comparison of polypeptide sequences are typically done using the BLOSUM62 matrix, a gap existence penalty of 11 and a gap extension penalty of 1.
- sequence similarity scores are reported from BLAST analyses done using the BLOSUM62 matrix, a gap existence penalty of 11 and a gap extension penalty of 1.
- sequence identity/similarity scores provided herein refer to the value obtained using GAP Version 10 (GCG, Accelrys, San Diego, Calif.) using the following parameters: % identity and % similarity for a nucleotide seq uence using GAP Weight of 50 and Length Weight of 3, and the nwsgapdna.cmp scoring matrix; % identity and % similarity for an amino acid sequence using GAP Weight of 8 and Length Weight of 2, and the BLOSUM62 scoring matrix (Henikoff and Henikoff, PNAS USA. 89:10915-10919, 1992).
- GAP uses the algorithm of Needleman and Wunsch (J Mol Biol. 48:443-453, 1970) to find the alignment of two complete sequences that maximizes the number of matches and minimizes the number of gaps.
- the variant polypeptide comprises an amino acid sequence that can be optimally aligned with a reference polypeptide sequence (see, e.g., Sequence Listing) to generate a BLAST bit scores or sequence similarity scores of at least about 50, 60, 70, 80, 90, 100, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740,
- Variants of a reference polypeptide can be identified by screening combinatorial libraries of mutants of a reference polypeptide. Libraries or fragments e.g., N terminal, C terminal, or internal fragments, of protein coding sequence can be used to generate a variegated population of fragments for screening and subsequent selection of variants of a reference polypeptide.
- a "chimeric protein” or “fusion protein” includes a reference polypeptide, or a polypeptide fragment linked to either another reference polypeptide (e.g., to create multiple fragments), to a non-reference polypeptide, or to both.
- a reference polypeptide can be fused to a heterologous polypeptide seq uence.
- a "heterologous polypeptide” typically has an amino acid sequence corresponding to a protein which is different from the reference protein sequence, and which can be derived from the same or a different organism.
- the reference polypeptide of the fusion protein can correspond to all or a portion of a biologically active amino acid sequence.
- a fusion protein includes at least one or two biologically active portions of reference protein.
- the polypeptides forming the fusion protein are typically linked C-terminus to N-terminus, although they can also be linked C-terminus to C-terminus, N-terminus to N-terminus, or N-terminus to C-terminus.
- the polypeptides of the fusion protein can be in any order.
- the fusion partner may be designed and included for essentially any desired purpose provided they do not adversely affect the enzymatic activity of the polypeptide.
- a fusion partner may comprise a sequence that assists in expressing the protein (an expression enhancer) at higher yields than the native recombinant protein.
- Other fusion partners may be selected so as to increase the solubility or stability of the protein or to enable the protein to be targeted to desired intracellular compartments.
- the fusion protein can include a moiety which has a high affinity for a ligand.
- the fusion protein can be a GST-fusion protein in which the reference polypeptide sequences are fused to the C-terminus of the GST sequences.
- Such fusion proteins can facilitate the purification and/or identification of the resulting polypeptide.
- the fusion protein can be reference polypeptide containing a heterologous signal sequence at its N-terminus. In certain host cells, expression and/or secretion of such proteins can be increased through use of a heterologous signal sequence.
- Fusion proteins may generally be prepared using standard techniques, described elsewhere herein.
- a peptide linker sequence may be employed to separate the first and second polypeptide components by a distance sufficient to ensure that each polypeptide folds into its secondary and tertiary structures, if desired. Exemplary peptide linkers are described elsewhere herein.
- Polynucleotides and Vectors include polynucleotides encoding a DGAT fusion protein described herein, the fusion protein comprising at least one DGAT polypeptide fused to at least one intracellular localization domain, such as a bacteria membrane- or bacterial plasma membrane (PM)-targeting domain.
- a DGAT fusion protein described herein the fusion protein comprising at least one DGAT polypeptide fused to at least one intracellular localization domain, such as a bacteria membrane- or bacterial plasma membrane (PM)-targeting domain.
- Such polynucleotides can be partially or fully isolated from other cellular components, within a vector, for exa mple, a composition comprising such a vector (e.g., in a tube or kit), or in a host cell, such as modified photosynthetic microorganism.
- polynucleotides and modified photosynthetic microorganisms comprising the same may optionally comprise one or more (introduced) polynucleotides encoding a lipid biosynthesis protein, and/or one or more (introduced) polynucleotides encoding a polypeptide associated with glycogen breakdown.
- nucleotide sequences that encode any functional naturally-occurring variants or fragments (e.g., allelic variants, orthologs, splice variants) or non-naturally occurring variants or fragments of these native polynucleotides (i.e., optimized by engineering), as well as com positions comprising such polynucleotides, including, for example, cloning and expression vectors.
- modified photosynthetic microorganisms described herein may optionally com prise a mutation or deletion in one or more genes associated with glycogen biosynthesis or storage, alone or in combination with the presence of overexpressed proteins associated with lipid biosynthesis proteins and/or glycogen breakdown.
- Certain modified photosynthetic microorganisms for example, for the production of wax esters, may optionally comprise a mutation or deletion in or more genes encoding an aldehyde decarbonylase, an aldehyde dehydrogenase, or both, either alone or in combination with the presence of overexpressed proteins associated with lipid biosynthesis proteins and/or glycogen breakdown.
- mutants in this context refer generally to those changes or alterations in a photosynthetic microorganism, e.g., a Cyanobacterium, that render the product of that gene non-functional or having reduced function.
- Such changes or alterations include nucleotide substitutions, deletions, or additions/insertions to the coding or regulatory sequences of a targeted gene (e.g., glgA, glgC, pgm, aldehyde decarbonylase, aldehyde dehydrogenase), in whole or in part, which disrupt, eliminate, down-regulate, or significantly reduce the expression of the polypeptide encoded by that gene, whether at the level of transcription, translation, post-translational modification, or protein stability.
- Such alterations can also reduce the enzymatic activity or other functional characteristic of the protein (e.g., localization), with or without reducing expression.
- Techniq ues for producing such alterations or changes, such as by recombination with a vector having a selectable marker are exemplified herein and known in the molecular biological art.
- one or more alleles of a gene e.g., two or all alleles, may be mutated or deleted within a photosynthetic microorganism.
- modified photosynthetic microorganisms, e.g., Cyanobacteria of the present invention are merodiploids or partial diploids.
- targeted gene or polypeptide may also be accomplished by targeting the m RNA of that gene, such as by using various antisense technologies (e.g., antisense oligonucleotides and siRNA) known in the art. Accordingly, targeted genes may be considered “non-functional" when the polypeptide or enzyme encoded by that gene is not expressed by the modified photosynthetic microorganism, or is expressed in negligible amounts.
- antisense technologies e.g., antisense oligonucleotides and siRNA
- DNA and “polynucleotide” and “nucleic acid” include a DNA molecule that has been isolated free of total genomic DNA of a particular species. Therefore, a DNA segment encoding a polypeptide refers to a DNA segment that contains one or more coding seq uences yet is substantially isolated away from, or purified free from, total genomic DNA of the species from which the DNA segment is obtained. Included within the terms “DNA segment” and “polynucleotide” are DNA segments and smaller fragments of such segments, and also recombinant vectors, including, for example, plasmids, cosmids, phagemids, phage, viruses, and the like.
- polynucleotide sequences of this invention can include genomic seq uences, extra-genomic and plasmid-encoded sequences and smaller engineered gene segments that express, or may be adapted to express, proteins, polypeptides, peptides and the like. Such segments may be naturally isolated, or modified synthetically by the hand of man.
- Polynucleotides may be single-stranded (coding or a ntisense) or double-stranded, and may be DNA (genomic, cDNA or synthetic) or RNA molecules. Additional coding or non-coding sequences may, but need not, be present within a polynucleotide of the present invention, and a polynucleotide may, but need not, be linked to other molecules and/or support materials.
- Polynucleotides may comprise a native seq uence (e.g., an endogenous sequence that encodes protein described herein) or may comprise a variant or fragment, or a biological functional eq uivalent of such a sequence.
- Polynucleotide variants may contain one or more substitutions, additions, deletions and/or insertions, as further described herein, preferably such that the enzymatic activity of the encoded polypeptide is not substantially diminished relative to the unmodified or reference polypeptide.
- the effect on the enzymatic activity of the encoded polypeptide may generally be assessed as described herein and known in the art.
- Embodiments of the present invention include polynucleotides (e.g., isolated polynucleotides) that encode any of the intracellular localization domain-DGAT fusion proteins described herein, such as the membrane-targeting domain-DGAT fusion proteins. These polynucleotides comprise at least one seq uence encoding a heterologous intracellular localization domain described herein, which is fused in-frame to at least one sequence encoding a DGAT polypeptide, or an active fragment or variant thereof.
- polynucleotides e.g., isolated polynucleotides
- These polynucleotides comprise at least one seq uence encoding a heterologous intracellular localization domain described herein, which is fused in-frame to at least one sequence encoding a DGAT polypeptide, or an active fragment or variant thereof.
- Certain embodiments thus include polynucleotides that encode any one or more of the intracellular localization domains described herein, where such polynucleotide(s) are fused in-frame to a DGAT-encoding polynucleotide.
- Exemplary seq uences that encode a membrane-targeting domain can be found within SEQ ID NOs:200 or 202, the respective PCC7942-0858 and PCC7942-1015 coding sequences of two methyl-accepting chemotaxis (MCP) proteins from 5. elongatus.
- the polynucleotide sequence may include about the N-terminal 129 nucleotides of SEQ ID NO:200 (PCC7942-0858), which encodes the leader sequence of the MCP protein encoded by the PCC7942-0858 gene; additional sequences can also be included, for instance, about the N-terminal 132, 135, 138, 141, 144, 147, 150, 153, 156, 159, 162, 165, 168, 171, 174, 177, 180 or more nucleotides of SEQ ID NO:200 (PCC7942-0858).
- DGAT-encoding portion of the fusion protein encodes a DGAT comprising or consisting of a polypeptide sequence set forth in any one of SEQ ID NOs:58, 59, 60 or 61, or a fragment or variant thereof.
- SEQ ID NO:58 is the sequence of DGATn;
- SEQ ID NO: 59 is the sequence of Streptomyces coelicolor DGAT (ScoDGAT or SDGAT);
- SEQ ID NO:60 is the sequence of Alcanivorax borkumensis DGAT (AboDGAT); and
- SEQ ID NO:61 is the sequence of DGATd (Acinetobacter baylii sp.).
- a DGAT-encoding portion of the fusion protein comprises or consists of a polynucleotide sequence set forth in any one of SEQ ID NOs:62, 63, 64, 65 or 66, or a fragment or variant thereof.
- SEQ ID NO:62 is a codon-optimized for expression in Cyanobacteria sequence that encodes DGATn; SEQ ID NO:63 has homology to SEQ ID NO:62; SEQ ID NO:64 is a codon-optimized for expression in Cyanobacteria sequence that encodes ScoDGAT; SEQ ID NO:65 is a codon-optimized for expression in Cyanobacteria sequence that encodes AboDGAT; and SEQ ID NO:66 is a codon-optimized for expression in Cyanobacteria sequence that encodes DGATd.
- DGATn and DGATd correspond to Acinetobacter baylii DGAT and a modified form thereof, which includes two additional amino acid residues immediately following the initiator methionine.
- a modified photosynthetic microorganism comprises an introduced polynucleotide that encodes one or more lipid biosynthesis proteins.
- a modified photosynthetic microorganism comprises an endogenous polynucleotide that encodes a lipid biosynthesis gene, where a regulatory element such as a promoter is introduced upstream of that polynucleotide to regulate or alter expression of the encoded protein.
- a modified photosynthetic microorganism comprises reduced or eliminated expression or activity of a lipid biosynthesis polypeptide. Included are full or partial deletions, and point mutations or insertions of an endogenous lipid biosynthesis gene that reduce or eliminate expression and/or activity of the encoded polypeptide.
- Exemplary lipid biosynthesis genes encode polypeptides such as acyl carrier proteins
- ACP acyl ACP synthases
- Aas acyl-ACP reductases
- alcohol dehydrogenases aldehyde dehydrogenases
- aldehyde decarbonylases thioesterases
- TES acetyl coenzyme A carboxylases
- ACCase phosphatidic acid phosphatases
- PAP phosphatidic acid phosphatases
- TAG triacylglycerol
- a modified photosynthetic microorganism comprises one or more polynucleotides encoding one or more acyl carrier proteins (ACP).
- ACP nucleotide seq uences include SEQ I D NO:5 from Synechococcus elongatus PCC7942, SEQ I D NOS:7, 9, and 11 from Acinetobacter sp. ADP1, and SEQ I D NO:13 from Spinacia oleracea.
- a modified photosynthetic microorganism comprises one or more polynucleotides encoding one or more acyl-ACP synthetase (Aas) enzymes.
- the Aas nucleotide sequence is derived from the Se918 gene of Synechococcus elongatus.
- One exem plary Aas sequence nucleotide sequence is SEQ I D NO:43 from Synechococcus elongatus PCC 7942.
- a modified photosynthetic microorganism of the present invention has a mutation such as a point mutation, insertion, or full or partial deletion of one or more endogenous Aas genes, for instance, the Se918 gene of S. elongatus PCC7942, to reduce or eliminate expression and/or activity of the encoded Aas polypeptide.
- a modified photosynthetic microorganism comprises one or more polynucleotides encoding one or more acyl-ACP reductase polypeptides.
- acyl-ACP reductase nucleotide sequences include orfl594 from Synechococcus elongatus PCC7942 (SEQ ID NO:l), and orfsll0209 from Synechocystis sp. PCC6803 (SEQ ID NO:3).
- Alcohol Dehydrogenases Certain embodiments may employ one or more alcohol dehydrogenase encoding polynucleotide seq uences.
- Exemplary alcohol dehydrogenases include slrll92 of Synechocystis sp. PCC6803 (SEQ ID NO:104) and ACIAD3612 from Acinetobacter baylii (SEQ I D NO:106).
- Aldehyde Dehydrogenases may employ one or more aldehyde dehydrogenase encoding polynucleotide seq uences. Certain embodiments, for example, for the production of triglycerides or wax esters, may comprise mutations such as point mutations, insertions, or full or partial deletions of one or more endogenous aldehyde dehydrogenase genes.
- One exemplary aldehyde dehydrogenase is orf0489 of Synechococcus elongatus PCC7942 (SEQ ID NO:102).
- Aldehyde Decarbonylases may comprise mutations such as point mutations, insertions, or full or partial deletions of one or more endogenous aldehyde decarbonylase genes.
- an aldehyde decarbonylase is encoded by orfl593 in 5.
- elongatus PCC7942 is encoded by orfsll0208 in Synechocystis sp. PCC6803.
- a modified photosynthetic microorganism comprises one or more polynucleotides encoding one or more thioesterases (TES) including acyl-ACP thioesterases and/or acyl-CoA thioesterases.
- TES thioesterases
- the polynucleotide sequence of the TES encodes a TesA or TesB polypeptide from E. coli, or a cytoplasmic TesA variant (*TesA) having the sequence set forth in SEQ ID NO:121.
- the polynucleotide sequence of the TES comprises that of the FatB gene, encoding a FatB enzyme, such as a C8, C12, C14, C16, or C18 FatB enzyme.
- the polynucleotide encodes a thioesterase (e.g., FatB thioesterase), having only thioesterase activity and little or no lysophospholipase activity.
- the thioesterase is a FatB acyl-ACP thioesterase, which can hydrolyze acyl-ACP but not acyl-CoA.
- SEQ ID NO:197 is an exemplary nucleotide sequence of a C8/C10 FatB2 thioesterase derived from Cuphea hookeriana, and SEQ ID NO:122 is codon-optimized for expression in Cyanobacteria.
- SEQ I D NO:123 is an exemplary nucleotide sequence of a C12 FatBl acyl-ACP thioesterase derived from Umbellularia californica, and SEQ ID NO:124 is a codon-optimized version of SEQ I D NO:123 for optimal expression in Cyanobacteria.
- SEQ I D NO:126 is an exemplary nucleotide sequence of a C14 FatBl thioesterase derived from Cinnamomum camphora, and SEQ:125 is a codon-optimized version of SEQ I D NO:126.
- SEQ I D NO:127 is an exemplary nucleotide sequence of a C16 FatBl thioesterase derived from Cuphea hookeriana, and SEQ ID NO:128 is a codon-optimized version of SEQ ID NO:127.
- one or more FatB seq uences are operably linked to a strong promoter, such as a Ptrc promoter.
- one or more FatB sequences are operably linked to a relatively weak promoter, such as an arabinose promoter.
- a polynucleotide encodes an acetyl-CoA carboxylase (ACCase) comprising or consisting of a polypeptide sequence set forth in any of SEQ ID NOs:55, 45, 46, 47, 48 or 49, or a fragment or variant thereof.
- ACCase polynucleotide comprises or consists of a polynucleotide sequence set forth in any of SEQ I D NOs:56, 57, 50, 51, 52, 53 or 54, or a fragment or variant thereof.
- SEQ I D NO:55 is the seq uence of Saccharomyces cerevisiae acetyl-CoA carboxylase (yAccl); and SEQ ID NO:56 is a codon-optimized for expression in Cyanobacteria sequence that encodes yAccl.
- SEQ ID NO:45 is Synechococcus sp. PCC 7002 AccA;
- SEQ ID NO:46 is Synechococcus sp. PCC 7002 AccB;
- SEQ ID NO:47 is Synechococcus sp. PCC 7002 AccC;
- SEQ I D NO:48 is Synechococcus sp. PCC 7002 AccD.
- SEQ I D NO:50 encodes Synechococcus sp. PCC 7002 AccA; SEQ ID NO:51 encodes Synechococcus sp. PCC 7002 AccB; SEQ ID NO:52 encodes Synechococcus sp. PCC 7002 AccC; and SEQ ID NO:53 encodes Synechococcus sp. PCC 7002 AccD.
- SEQ ID NO:49 is a Triticum aestivum ACCase; and SEQ ID NO:54 encodes this Triticum aestivum ACCase.
- a polynucleotide encodes a phosphatidate phosphatase (also referred to as a phosphatidic acid phosphatase; PAP) comprising or consisting of a polypeptide sequence set forth in SEQ I D NO:131, or a fragment or variant thereof.
- a phosphatidate phosphatase polynucleotide comprises or consists of a polynucleotide sequence set forth in SEQ I D NO:129 or SEQ I D NO:130, or a fragment or variant thereof.
- SEQ ID NO:131 is the sequence of Saccharomyces cerevisiae phosphatidate phosphatase (yPAHl), and SEQ ID NO:129 is a codon-optimized for expression in Cyanobacteria seq uence that encodes yPAHl.
- the nucleotide seq uence of the PAP is derived from the E. coli PgpB gene, and/or the PAP gene from Synechocystis sp. PCC6803.
- TAG hydrolases employ one or more TAG hydrolase encoding polynucleotide sequences.
- TAG hydrolase polynucleotide sequences include SDP1 (SUGAR-DEPENDENT1) triacylglycerol lipase from Arabidopsis thaliana (SEQ ID NO:153), ACIAD1335 from Acinetobacter sp. ADPl (SEQ ID NO:154), TG14P from S. cerevisiae (SEQ I D NO:175), and RHAl_ro04722 (YP_704665) TAG lipase from Rhodococcus (SEQ ID NO:155).
- Additional polynucleotide sequences for exemplary lipases/esterases include RHAl_ro01602 lipase/esterase from Rhodococcus sp. (see SEQ ID NO:156), and the RHAl_ro06856 lipase/esterase ⁇ see SEQ I D NO:119) from Rhodococcus sp.
- Fatty Acyl-CoA Synthetases employ one or more fatty acyl-CoA synthetase encoding polynucleotide sequences.
- One exemplary fatty acyl-CoA synthetase includes the FadD gene from E. coli (SEQ ID NO:16) which encodes a fatty acyl-CoA synthetase having substrate specificity for medium and long chain fatty acids.
- Other exemplary fatty acyl-CoA synthetases include those derived from 5.
- the Faalp coding sequence is set forth in SEQ I D NO:18
- the Faa2p coding sequence is set forth in SEQ I D NO:20
- the Faa3p is set forth in SEQ ID NO:22.
- SEQ I D NO:22 is codon-optimized for expression in 5. elongatus PCC7942.
- a modified photosynthetic microorganism comprises one or more polynucleotides encoding one or more lipases or phospholipases, including lysophospholipases, or a fragment or variant thereof.
- the encoded lysophospholipase is Lysophospholipase LI (TesA), Lysophospholipase L2, TesB, Vu Patatin 1 protein, or a homolog thereof.
- the encoded phospholipase e.g., a lysophospholipase
- the polynucleotide comprises a bacterial phospholipase polynucleotide sequence, e.g., a sequence derived from Escherichia coli, Enterococcus faecalis, or Lactobacillus plantarum.
- the encoded phospholipase is Lysophospholipase LI (TesA), Lysophospholipase L2, TesB, Vu Patatin 1 protein, or a functional fragment thereof.
- a lysophospholipase is a bacterial Lysophospholipase LI (TesA) or TesB, such as an E.coli Lysophospholipase LI encoded by a polynucleotide (pldC) having the wild-type sequence set forth in SEQ ID NO:196, or an E. coli TesB encoded by a polynucleotide having the wild-type sequence set forth in SEQ ID NO:132.
- the polypeptide sequence of E. coli Lysophospholipase LI is provided in SEQ ID NO:133
- the polypeptide sequence of E. coli TesB is provided in SEQ ID NO:134.
- a lysophospholipase is a Lysophospholipase L2, such as an E. coli Lysophospholipase L2 encoded by a polynucleotide (pldB) having the wild-type sequence set forth in SEQ ID NO:135, or a Vu patatin 1 protein encoded by a polynucleotide having the wild-type sequence set forth in SEQ ID NO:136.
- the polypeptide sequence of E. coli Lysophospholipase L2 is provided in SEQ ID NO:137
- the polypeptide sequence of Vu patatin 1 protein is provided in SEQ ID NO:138.
- the polynucleotide encoding the phospholipase variant is modified such that it encodes a phospholipase that localizes predominantly to the cytoplasm instead of the periplasm.
- it may encode a phospholipase having a deletion or mutation in a region associated with periplasmic localization.
- the encoded phospholipase variant is derived from Lysophospholipase LI (TesA).
- the Lysophospholipase LI (TesA) variant is a bacterial TesA, such as an E.coli Lysophospholipase (TesA) variant encoded by a polynucleotide having the sequence set forth in SEQ ID NO:139.
- the polypeptide sequence of the Lysophospholipase LI variant is provided in SEQ ID NO:121 (PldC(*TesA)).
- phospholipase-encoding polynucleotide sequences include phospholipase Al (PldA) from Acinetobacter sp. ADP1 (SEQ ID NO:140), phospholipase A (PldA) from E. coli (SEQ ID NO:141), phospholipase from Streptomyces coelicolor A3(2) (SEQ ID NO:142), phospholipase A2 (PLA2-a) from Arabidopsis thaliana (SEQ ID NO:143).
- DADl Defective Anther Dehiscence 1
- Arabidopsis thaliana SEQ ID NO:144
- chloroplast DONGLE from Arabidopsis thaliana
- patatin from Anabaena variabilis ATCC 29413 SEQ ID NO:147
- lysophospholipase-encoding polynucleotide sequences include phospholipase B (Plblp) from Saccharomyces cerevisiae S288c (SEQ I D NO:148), phospholipase B (Plb2p) from Saccharomyces cerevisiae S288c (SEQ ID NO:149), ACIAD1057 (TesA homolog) from Acinetobacter ADP1 (SEQ I D NO:150), ACIAD1943 lysophospholipase from Acinetobacter ADP1 (SEQ I D NO:151), and a lysophospholipase (YP_702320; RHAl_ro02357) from Rhodococcus (SEQ ID NO:152).
- Glycogen Biosynthesis and Storage Genes include reduced or eliminated expression and/or activity of one or more polypeptides associated with glycogen biosynthesis and/or storage, for instance, by mutation of one or more genes that encode such polypeptides. Included are full or partial deletions, and point mutations or insertions of one or more glycogen biosynthesis/storage genes that reduce or eliminate expression and/or biological activity of the encoded protein(s).
- Exemplary genes associated with glycogen synthesis and/or storage include glgC, pgm, and glgA.
- SEQ ID NOs: 25 (Synechocystis sp. PCC6803), 75 (Synechococcus elongatus PCC7942), 26 (Synechococcus sp. WH8102), 78 (Synechococcus RCC307), and 80 (Synechococcus 7002), which respectively encode Pgm polypeptides having sequences set forth in SEQ I D NOs:74, 76, 77, 79 and 81.
- a modified photosynthetic microorganism comprise one or more polynucleotides encoding one or more polypeptides associated with a glycogen breakdown, or a fragment or variant thereof.
- the one or more polypeptides are glycogen phosphorylase (GIgP), glycogen isoamylase (GIgX), glucanotransferase (MalQ), phosphoglucomutase (Pgm), glucokinase (Glk), and/or phosphoglucose isomerase (Pgi), or a functional fragment or variant thereof.
- a representative glgP polynucleotide sequence is provided in SEQ ID NO:67, and a representative GIgP polypeptide sequence is provided in SEQ ID NO:68.
- a representative glgX polynucleotide sequence is provided in SEQ I D NO:69, and a representative GIgX polypeptide sequence is provided in SEQ I D NO:70.
- a representative malQ polynucleotide seq uence is provided in SEQ ID NO:71, and a representative MalQ polypeptide sequence is provide in SEQ I D NO:72.
- a representative phosphoglucomutase (pgm) polynucleotide seq uence is provided in SEQ ID NO:24, and a representative phosphoglucomutase (Pgm) polypeptide sequence is provided in SEQ I D NO:73, with others provided infra (SEQ ID NOs:25, 26, 74-81).
- a representative glk polynucleotide sequence is provided in SEQ I D NO:82, and a representative Glk polypeptide seq uence is provided in SEQ ID NO:83.
- a representative pgi polynucleotide sequence is provided in SEQ I D NO:84, and a representative Pgi polypeptide sequence is provided in SEQ ID NO:85.
- a polynucleotide comprises one of these polynucleotide sequences, or a fragment or variant thereof, or encodes one of these polypeptide sequences, or a fragment or variant thereof.
- nucleotide seq uences that encode the proteins and enzymes of the application encompass full-length reference polynucleotides, as well as portions of the full-length or substantially full-length nucleotide sequences of these genes or their transcripts or DNA copies of these transcripts. Portions of a nucleotide seq uence may encode polypeptide portions or segments that retain the biological activity of the reference polypeptide.
- a portion of a nucleotide sequence that encodes a biologically active fragment of an enzyme provided herein may encode at least about 20, 21, 22, 23, 24, 25, 30, 40, 50, 60, 70, 80, 90, 100, 120, 150, 200, 300, 400, 500, 600, or more contiguous amino acid residues, almost up to the total number of amino acids present in a full-length enzyme.
- intermediate lengths in this context and in all other contexts used herein, means any length between the quoted values, such as 101, 102, 103, etc.; 151, 152, 153, etc.; 201, 202, 203, etc.
- polynucleotides described herein regardless of the length of the coding sequence itself, may be combined with other DNA seq uences, such as promoters, polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other coding segments, and the like, such that their overall length may vary considerably. It is therefore contemplated that a polynucleotide fragment of almost any length may be employed, with the total length preferably being limited by the ease of preparation and use in the intended recombinant DNA protocol.
- the invention also contemplates variants of the reference polynucleotide sequences described herein (see, e.g., the Sequence Listing).
- Nucleic acid variants can be naturally-occurring, such as allelic variants (same locus), homologs (different locus), and orthologs (different organism) or can be non natura lly-occurring.
- Naturally occurring variants such as these can be identified and isolated using well-known molecular biology techniq ues including, for example, various polymerase chain reaction (PCR) and hybridization-based techniques as known in the art.
- Naturally occurring variants can be isolated from any organism that encodes one or more genes having an activity of a reference polypeptide.
- Embodiments of the present invention therefore, encompass Cyanobacteria comprising such naturally-occurring polynucleotide variants.
- Non-naturally occurring variants can be made by mutagenesis techniques, including those applied to polynucleotides, cells, or organisms.
- the variants can contain nucleotide substitutions, deletions, inversions and insertions. Variation can occur in either or both the coding and non-coding regions.
- non-naturally occurring variants may have been optimized for use in Cyanobacteria, such as by engineering and screening the enzymes for increased activity, stability, or any other desirable feature.
- variations can produce both conservative and non-conservative amino acid substitutions (as compared to the originally encoded product).
- conservative variants include those sequences that, because of the degeneracy of the genetic code, encode the amino acid sequence of a reference polypeptide.
- variant nucleotide sequences also include synthetically derived polynucleotide sequences, such as those generated, for example, by using site-directed mutagenesis but which still encode a biologically active reference polypeptide, as described elsewhere herein.
- variants of a particular polynucleotide sequence will have at least about 30%, 40% 50%, 55%, 60%, 65%, 70%, generally at least about 75%, 80%, 85%, 90%, 95% or 98% or more sequence identity to a reference polynucleotide seq uence as determined by sequence alignment programs described elsewhere herein using default parameters.
- telomere sequences can be used to isolate corresponding sequences and alleles from other organisms, particularly other microorganisms. Methods are readily available in the art for the hybridization of nucleic acid sequences. Coding sequences from other organisms may be isolated according to well known techniques based on their sequence identity with the coding sequences set forth herein. In these techniq ues all or part of the known coding sequence is used as a probe which selectively hybridizes to other reference coding seq uences present in a population of cloned genomic DNA fragments or cDNA fragments (i.e., genomic or cDNA libraries) from a chosen organism.
- Low stringency conditions include and encompass from at least about 1% v/v to at least about 15% v/v formamide and from at least about 1 M to at least about 2 M salt for hybridization at 42° C, and at least about 1 M to at least about 2 M salt for washing at 42° C.
- Low stringency conditions also may include 1% Bovine Serum Albumin (BSA), 1 mM EDTA, 0.5 M NaHP0 4 (pH 7.2), 7% SDS for hybridization at 65° C, and (i) 2 x SSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM EDTA, 40 mM NaHP0 4 (pH 7.2), 5% SDS for washing at room temperature.
- BSA Bovine Serum Albumin
- One embodiment of low stringency conditions includes hybridization in 6 x sodium chloride/sodium citrate (SSC) at about 45 ⁇ C, followed by two washes in 0.2 x SSC, 0.1% SDS at least at 50°C (the temperature of the washes can be increased to 55° C for low stringency conditions).
- SSC sodium chloride/sodium citrate
- Medium stringency conditions include and encompass from at least about 16% v/v to at least about 30% v/v formamide and from at least about 0.5 M to at least about 0.9 M salt for hybridization at 42° C, and at least about 0.1 M to at least about 0.2 M salt for washing at 55° C.
- Medium stringency conditions also may include 1% Bovine Serum Albumin (BSA), 1 mM EDTA, 0.5 M NaHP0 4 (pH 7.2), 7% SDS for hybridization at 65° C, and (i) 2 x SSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM EDTA, 40 mM NaHP0 4 (pH 7.2), 5% SDS for washing at 60-65° C.
- BSA Bovine Serum Albumin
- 1 mM EDTA 1 mM EDTA, 0.5 M NaHP0 4 (pH 7.2), 7% SDS for hybridization at 65° C
- 2 x SSC 0.1% SDS
- BSA
- High stringency conditions include and encompass from at least about 31% v/v to at least about 50% v/v formamide and from about 0.01 M to about 0.15 M salt for hybridization at 42° C, and about 0.01 M to about 0.02 M salt for washing at 55° C.
- High stringency conditions also may include 1% BSA, 1 mM EDTA, 0.5 M NaHP0 4 (pH 7.2), 7% SDS for hybridization at 65° C, and (i) 0.2 x SSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM EDTA, 40 mM NaHP0 4 (pH 7.2), 1% SDS for washing at a temperature in excess of 65°C.
- One embodiment of high stringency conditions includes hybridizing in 6 x SSC at about 45 ⁇ C, followed by one or more washes in 0.2 x SSC, 0.1% SDS at 65°C.
- a reference polypeptide or enzyme described herein is encoded by a polynucleotide that hybridizes to a disclosed nucleotide sequence under very high stringency conditions.
- very high stringency conditions includes hybridizing in 0.5 M sodium phosphate, 7% SDS at 65°C, followed by one or more washes in 0.2 x SSC, 1% SDS at 65° C.
- T m 81.5 + 16.6 (logi 0 M) + 0.41 (%G+C) - 0.63 (% formamide) - (600/length)
- M is the concentration of Na + , preferably in the range of 0.01 molar to 0.4 molar
- %G+C is the sum of guanosine and cytosine bases as a percentage of the total number of bases, within the range between 30% and 75% G+C
- % formamide is the percent formamide concentration by volume
- length is the number of base pairs in the DNA duplex.
- a membrane e.g., a nitrocellulose membrane or a nylon membrane
- immobilized DNA is hybridized overnight at 42° C in a hybridization buffer (50% deionized formamide, 5 x SSC, 5 x Reinhardt's solution (0.1% fecal, 0.1% polyvinylpyrollidone and 0.1% bovine serum albumin), 0.1% SDS and 200 mg/mL denatured salmon sperm DNA) containing a labeled probe.
- a hybridization buffer 50% deionized formamide, 5 x SSC, 5 x Reinhardt's solution (0.1% fecal, 0.1% polyvinylpyrollidone and 0.1% bovine serum albumin), 0.1% SDS and 200 mg/mL denatured salmon sperm DNA
- the membrane is then subjected to two sequential medium stringency washes (i.e., 2 x SSC, 0.1% SDS for 15 min at 45° C, followed by 2 x SSC, 0.1% SDS for 15 min at 50° C), followed by two seq uential higher stringency washes (i.e., 0.2 x SSC, 0.1% SDS for 12 min at 55° C followed by 0.2 x SSC and 0.1% SDS solution for 12 min at 65-68°C).
- 2 x SSC 0.1% SDS for 15 min at 45° C
- 2 x SSC 0.1% SDS for 15 min at 50° C
- two seq uential higher stringency washes i.e., 0.2 x SSC, 0.1% SDS for 12 min at 55° C followed by 0.2 x SSC and 0.1% SDS solution for 12 min at 65-68°C.
- Polynucleotides and fusions thereof may be prepared, manipulated and/or expressed using any of a variety of well established techniques known and available in the art.
- polynucleotide sequences which encode polypeptides of the invention, or fusion proteins or functional eq uivalents thereof may be used in recombinant DNA molecules to direct expression of a triglyceride or lipid biosynthesis enzyme in appropriate host cells. Due to the inherent degeneracy of the genetic code, other DNA sequences that encode substantially the same or a functionally eq uivalent amino acid sequence may be produced and these sequences may be used to clone and express a given polypeptide.
- codons preferred by a particular prokaryotic or eukaryotic host can be selected to increase the rate of protein expression or to produce a recombinant RNA transcript having desirable properties, such as a half-life which is longer than that of a transcript generated from the naturally occurring seq uence.
- Such nucleotides are typically referred to as "codon-optimized.”
- polynucleotide sequences described herein can be engineered using methods generally known in the art in order to alter polypeptide encoding sequences for a variety of reasons, including but not limited to, alterations which modify the cloning, processing, expression and/or activity of the gene product.
- a nucleotide sequence encoding the polypeptide, or a functional eq uivalent may be inserted into appropriate expression vector, i.e., a vector that contains the necessary elements for the transcription and translation of the inserted coding sequence.
- appropriate expression vector i.e., a vector that contains the necessary elements for the transcription and translation of the inserted coding sequence.
- polynucleotide sequences may be introduced and expressed in Cyanobacterial systems.
- the present invention contemplates the use of vector and plasmid systems having regulatory sequences (e.g., promoters and enhancers) that are suitable for use in various Cyanobacteria (see, e.g., Koksharova et al., Applied Microbiol Biotechnol 58:123-37, 2002).
- the promiscuous RSF1010 plasmid provides autonomous replication in several Cyanobacteria of the genera Synechocystis and Synechococcus (see, e.g., Mermet-Bouvier et al., Curr Microbiol 26:323-327, 1993).
- the pFCl expression vector is based on the promiscuous plasmid RSF1010.
- pFCl harbors the lambda cl857 repressor-encoding gene and pR promoter, followed by the lambda cro ribosome-binding site and ATG translation initiation codon (see, e.g., Mermet-Bouvier et al., Curr Microbiol 28:145-148, 1994).
- the latter is located within the unique Ndel restriction site (CATATG) of pFCl and can be exposed after cleavage with this enzyme for in-frame fusion with the protein-coding sequence to be expressed.
- CAATG unique Ndel restriction site
- control elements or "regulatory sequences” present in an expression vector (or employed separately) are those non-translated regions of the vector, such as enhancers, promoters, 5' and 3' untranslated regions, which interact with host cellular proteins to carry out transcription and translation. Such elements may vary in their strength and specificity. Depending on the vector system and host utilized, any number of suitable transcription and translation elements, including constitutive and inducible promoters, may be used. Generally, it is well-known that strong E. coli promoters work well in Cyanobacteria.
- inducible promoters such as the hybrid lacZ promoter of the PBLUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or PSPORT1 plasmid (Gibco BRL, Gaithersburg, Md.) and the like may be used.
- Other vectors containing IPTG inducible promoters such as pAM1579 and pAM2991trc, may be utilized according to the present invention.
- Certain embodiments may employ a temperature inducible system or temperature inducible regulatory sequences (e.g., promoters, enhancers, repressors).
- an operon with the bacterial phage left-ward promoter (P L ) and a temperature sensitive repressor gene CI857 may be employed to produce a temperature inducible system for producing fatty acids and/or triglycerides in Cyanobacteria (see, e.g., U.S. Patent No. 6,306,639, herein incorporated by reference). It is believed that at a non-permissible temperature (low temperature, 30 degrees Celsius), the repressor binds to the operator seq uence, and thus prevents RNA polymerase from initiating transcription at the P L promoter. Therefore, the expression of encoded gene or genes is repressed. When the cell culture is transferred to a permissible temperature (37-42 degrees Celsius), the repressor cannot bind to the operator. Under these conditions, RNA polymerase can initiate the transcription of the encoded gene or genes.
- a permissible temperature 37-42 degrees Celsius
- a number of expression vectors or regulatory sequences may be selected depending upon the use intended for the expressed polypeptide.
- vectors or regulatory seq uences which direct high level expression of encoded proteins may be used.
- overexpression of ACCase enzymes may be utilized to increase fatty acid biosynthesis.
- Such vectors include, but are not limited to, the multifunctional E.
- coli cloning and expression vectors such as BLUESCRI PT (Stratagene), in which the sequence encoding the polypeptide of interest may be ligated into the vector in frame with sequences for the amino-terminal Met and the subsequent 7 residues of ⁇ -galactosidase so that a hybrid protein is produced; pI N vectors (Van Heeke & Schuster, J. Biol. Chem. 264:5503 5509 (1989)); and the like.
- pGEX Vectors may also be used to express foreign polypeptides as fusion proteins with glutathione S-transferase (GST).
- Certain embodiments may employ the nirA promoter from Synechococcus sp. strain PCC7942, which is repressed by ammonium and induced by nitrite (see, e.g., Maeda et al., J. Bacteriol. 180:4080-4088, 1998; and Qi et al., Applied and Environmental Microbiology 71:5678-5684, 2005).
- the efficiency of expression may be enhanced by the inclusion of enhancers which are appropriate for the particular Cyanobacterial cell system which is used, such as those described in the literature.
- expression vectors or introduced promoters utilized to overexpress an exogenous or endogenous reference polypeptide, or fragment or variant thereof comprise a weak promoter under non-inducible conditions, e.g., to avoid toxic effects of long-term overexpression of any of these polypeptides.
- a weak promoter under non-inducible conditions
- expression levels from any given promoter may be determined, e.g., by performing quantitative polymerase chain reaction (q PCR) to determine the amount of transcript or mRNA produced by a promoter, e.g., before and after induction.
- a weak promoter is defined as a promoter that has a basal level of expression of a gene or transcript of interest, in the absence of inducer, that is ⁇ 2.0% of the expression level produced by the promoter of the rnpB gene in 5.
- elongatus PCC7942 is defined as a promoter that has a basal level of expression of a gene or transcript of interest, in the absence of inducer, that is ⁇ 5.0% of the expression level produced by the promoter of the rnpB gene in 5. elongatus PCC7942.
- Specific initiation signals may also be used to achieve more efficient translation of sequences encoding a polypeptide of interest. Such signals include the ATG initiation codon and adjacent sequences. I n cases where sequences encoding the polypeptide, its initiation codon, and upstream sequences are inserted into the appropriate expression vector, no additional transcriptional or translational control signals may be needed. However, in cases where only coding sequence, or a portion thereof, is inserted, exogenous translational control signals including the ATG initiation codon should be provided. Furthermore, the initiation codon should be in the correct reading frame to ensure translation of the entire insert. Exogenous translational elements and initiation codons may be of various origins, both natural and synthetic.
- Means for producing labeled hybridization or PCR probes for detecting sequences related to polynucleotides include oligolabeling, nick translation, end-labeling or PCR amplification using a labeled nucleotide.
- the sequences, or any portions thereof may be cloned into a vector for the production of an mRNA probe.
- Such vectors are known in the art, are commercially available, and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides.
- reporter molecules or labels include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents as well as substrates, cofactors, inhibitors, magnetic particles, and the like.
- Cyanobacterial host cells transformed with a polynucleotide sequence of interest may be cultured under conditions suitable for the expression and recovery of the protein from cell culture.
- the protein produced by a recombinant cell may be secreted or contained intracellularly depending on the sequence and/or the vector used.
- expression vectors containing polynucleotides of the invention may be designed to contain signal sequences which direct localization of the encoded polypeptide to a desired site within the cell.
- Other recombinant constructions may be used to join sequences encoding a polypeptide of interest to nucleotide sequence encoding a polypeptide domain which will direct secretion of the encoded protein.
- Certain embodiments relate to modified photosynthetic microorganisms, including Cyanobacteria, and methods of use thereof, wherein the modified photosynthetic microorganisms comprise one or more over-expressed, exogenous or introduced intracellular localization domain-DGAT fusion proteins, and a corresponding polynucleotide that encodes the same, where the DGAT fusion protein comprises a heterologous intracellular localization domain and a DGAT polypeptide, or variant or fragment thereof.
- the DGAT fusion protein is a membrane-targeting domain- or plasma membrane (PM)-targeting domain-DGAT fusion proteins.
- the DGAT polypeptide variant or fragment retains at least 50% of one or more activities of the wild-type DGAT polypeptide.
- the DGAT-fusion protein-expressing photosynthetic microorganisms described herein can further comprise one or more introduced or overexpressed lipid biosynthesis proteins.
- lipid biosynthesis proteins include, without limitation, acyl carrier proteins (ACP), acyl ACP synthases (Aas), acyl-ACP reductases, alcohol dehydrogenases, aldehyde dehydrogenases, aldehyde decarbonylases, thioesterases (TES), acetyl coenzyme A carboxylases (ACCase), phosphatidic acid phosphatases (PAP; or phosphatidate phosphatases), triacylglycerol (TAG) hydrolases, fatty acyl-CoA synthetases, and lipases/phospholipases, including any combinations thereof.
- ACP acyl carrier proteins
- Aas acyl ACP synthases
- TES acyl-ACP reductases
- alcohol dehydrogenases alde
- Certain preferred combinations include modified photosynthetic microorganisms having an exogenous or overexpressed DGAT fusion protein described herein in combination with an exogenous or overexpressed ACP; a DGAT fusion protein in combination with an Aas; a DGAT fusion protein in combination with an ACP and an Aas; a DGAT fusion protein in combination with an ACP and a TES such as *TesA or a FatB; a DGAT fusion protein in combination with an Aas and a TES such as *TesA or a FatB; and/or a DGAT fusion protein in combination with an ACP, an Aas, and a TES.
- TAG hydrolases include modified photosynthetic microorganisms having a DGAT fusion protein described herein, an exogenous or overexpressed ACP, Aas, or both, in combination with an exogenous or over-expressed TAG hydrolase, and optionally a TES.
- Certain embodiments, however, may employ a DGAT fusion protein and an over-expressed or exogenous TAG hydrolase, and optionally a TES, such as TesA (or *TesA) or any one or more of the FatB seq uences, with or without an ACP or Aas.
- these and related embodiments may be employed separately from those that require an ACP, an Aas, or both.
- certain embodiments may comprise a DGAT fusion protein and TAG hydrolase, and optionally a TES. Any one of these embodiments can be further combined with one or more additional lipid biosynthesis proteins, such as an ACCase, a PAP, a fatty acyl-CoA synthetase, and/or a PL such as PLC.
- Certain combinations incorporate one or more fatty acyl-CoA synthetases (e.g., FadD) into a DGAT fusion protein-expressing photosynthetic microorganism.
- fatty acyl-CoA synthetases e.g., FadD
- certain embodiments include modified photosynthetic microorganisms having an exogenous or overexpressed ACP, Aas, or both, in combination with a DGAT fusion protein and a fatty acyl-CoA synthetase, and optionally a TES and/or a TAG hydrolase.
- Certain embodiments may employ a DGAT fusion protein and an over-expressed or exogenous a fatty acyl-CoA synthetase, and optionally a TES, such as TesA (or *TesA) or any one or more of the FatB sequences, with or without an ACP or Aas.
- these and related embodiments may be employed separately from those that require an ACP, Aas, or both.
- certain embodiments may comprise a DGAT fusion protein and a fatty acyl-CoA synthetase, and optionally a TES (e.g., TesA, FatB). Any one of these embodiments can be further combined with one or more additional lipid biosynthesis proteins, such as an ACCase, a PAP, a TAG hydrolase, and/or a PL such as PLC.
- any one of these embodiments can also be combined with one or more introduced or overexpressed polynucleotides encoding a protein involved in a glycogen breakdown pathway, and/or with a strain having reduced expression of glycogen biosynthesis or storage pathways (e.g., full or partial deletion of glucose-l-phosphate adenyltransferase (glgC) gene and/or a phosphoglucomutase (pgm) gene).
- a specific modified photosynthetic microorganism could comprise a DGAT fusion protein described herein, an exogenous or overexpressed ACP, Aas, DGAT and PAP, combined with a full or partial deletion of the glgC gene and/or the pgm gene.
- Photosynthetic microorganisms of the present invention can also be modified to increase the production of fatty acids by introducing one or more exogenous polynucleotide sequences that encode one or more enzymes associated with fatty acid synthesis.
- the exogenous polynucleotide sequence encodes an enzyme that comprises an acyl-CoA carboxylase (ACCase) activity, typically a llowing increased ACCase expression, and, thus, increased intracellular ACCase activity. I ncreased intracellular ACCase activity contributes to the increased production of fatty acids because this enzyme catalyzes the "commitment step" of fatty acid synthesis.
- ACCase acyl-CoA carboxylase
- a modified photosynthetic microorganism comprising a DGAT fusion protein described herein and one of the following: an exogenous or overexpressed ACP in combination with an exogenous or overexpressed ACCase; an Aas in combination with an ACCase; an ACP and an Aas in combination with an ACCase; an ACP in combination with a PAP; an Aas in combination with a PAP; an ACP and an Aas in combination with a PAP; an ACP in combination with a PL such as PLA, PLB, or PLC; an Aas in combination with a PL; and an ACP and an Aas in combination with a PL.
- a modified photosynthetic microorganism comprising a DGAT fusion protein described herein and one of the following: an exogenous or overexpressed ACP in combination with an exogenous or overexpressed ACCase; an Aas in combination with an ACCase; an ACP
- Any one of these embodiments can be combined with each other (e.g., ACP, Aas, ACCase, and PAP), and/or further combined with an exogenous or overexpressed TES. Any one of these embodiments can also be combined with one or more introduced polynucleotides encoding a protein involved in a glycogen breakdown pathway, and/or with a strain having reduced expression of glycogen biosynthesis or storage pathways (e.g., full or partial deletion of glucose-l-phosphate adenyltransferase (glgC) gene and/or a phosphoglucomutase (pgm) gene).
- glgC glucose-l-phosphate adenyltransferase
- pgm phosphoglucomutase
- any one of the above embodiments can also be combined with a strain having reduced expression of an aldehyde decarbonylase.
- a strain having reduced expression of an aldehyde decarbonylase I n certain embodiments, such as Cyanobacteria including 5.
- elongatus PCC7942, orfl593 resides directly upstream of orfl594 (acyl-ACP reductase coding region) and encodes an aldehyde decarbonylase.
- the aldehyde decarbonylase encoded by orfl593 utilizes acyl aldehyde as a substrate for alkane production
- reducing expression of this protein may further increase yields of free fatty acids by shunting acyl aldehydes (produced by acyl-ACP reductase) away from an alkane-producing pathway, and towards a fatty acid-producing and storage pathway.
- PCC7942_orfl593 orthologs can be found, for example, in Synechocystis sp. PCC6803 (encoded by orfsll0208), N.
- a specific modified photosynthetic microorganism could comprise an overexpressed acyl-ACP reductase, combined with a full or partial deletion of the glgC gene and/or the pgm gene, optionally combined with an overexpressed ACP, ACCase, DGAT/acyl-CoA synthetase, or all of the foregoing, and optionally combined with a full or partial deletion of a gene encoding an aldehyde decarbonylase (e.g., PCC7942_orfl593, PCC6803_orfsll0208).
- an aldehyde decarbonylase e.g., PCC7942_orfl593, PCC6803_orfsll0208
- any one of these embodiments can also be combined with a strain having reduced expression of an acyl-ACP synthetase (Aas).
- an endogenous aldehyde dehydrogenase is acting on the acyl-aldehydes generated by orfl594 and converting them to free fatty acids.
- the normal role of such a dehydrogenase might involve removing or otherwise dealing with damaged lipids. I n this scenario, it is then likely that the Aas gene product recycles these free fatty acids by ligating them to ACP. Accordingly, reducing or eliminating expression of the Aas gene product might ultimately increase production of fatty acids, by reducing or preventing their transfer to ACP.
- a specific modified photosynthetic microorganism could comprise an overexpressed acyl-ACP reductase, combined with a full or partial deletion of the glgC gene and/or the pgm gene, optionally combined with an overexpressed ACP, ACCase, DGAT/acyl-CoA synthetase, or all of the foregoing, optionally combined with a full or partial deletion of a gene encoding an aldehyde decarbonylase (e.g., PCC7942_orfl593, PCC6803_orfsll0208), and optionally combined with a full or partial deletion of an Aas gene encoding an acyl-ACP synthetase.
- an aldehyde decarbonylase e.g., PC
- any one or more of these embodiments can also be combined with a strain having increased expression of an aldehyde dehydrogenase.
- One exemplary aldehyde dehydrogenase is encoded by orf0489 of Synechococcus elongatus PCC7942. Also included are homologs or paralogs thereof, functional equivalents thereof, and fragments or variants thereof. Functional equivalents can include aldehyde dehydrogenases with the ability to convert acyl aldehydes (e.g., nonyl-aldehyde) into fatty acids.
- the aldehyde dehydrogenase has the amino acid sequence of SEQ ID NO:103 (encoded by the polynucleotide sequence of SEQ I D NO:102), or an active fragment or variant of this seq uence
- modified photosynthetic microorganisms may comprise a DGAT fusion protein described herein and an introduced or overexpressed acyl-ACP reductase, to increase production of triglycerides; optionally in further combination with an introduced or overexpressed alcohol dehydrogenase, for instance, to produce wax esters relative to other lipids. Certain of these and related embodiments may be combined with reduced expression and/or activity of at least one endogenous aldehyde decarbonylase, endogenous aldehyde dehydrogenase, or both.
- particular modified photosynthetic microorganisms may comprise a DGAT fusion protein described herein in combination with an overexpressed or introduced acyl-ACP reductase and an overexpressed or introduced alcohol dehydrogenase, and in further combination with at least one mutation (e.g., point mutation, insertion, full or partial deletion) that reduces the expression and/or activity of an endogenous aldehyde decarbonylase.
- at least one mutation e.g., point mutation, insertion, full or partial deletion
- Certain modified photosynthetic microorganisms may comprise a DGAT fusion protein in combination with an overexpressed or introduced acyl-ACP reductase and an overexpressed or introduced alcohol dehydrogenase, and in further combination with at least one mutation (e.g., point mutation, insertion, full or partial deletion) that reduces the expression and/or activity of an endogenous aldehyde dehydrogenase.
- at least one mutation e.g., point mutation, insertion, full or partial deletion
- Some embodiments may include modified photosynthetic microorganisms that comprises a DGAT fusion protein in combination with an overexpressed or introduced acyl-ACP reductase and an overexpressed or introduced alcohol dehydrogenase, in further combination with at least one mutation that reduces the expression and/or activity of an endogenous aldehyde dehydrogenase and at least one mutation that reduces the expression and/or activity of an endogenous aldehyde decarbonylase.
- the modified photosynthetic microorganism is 5. elongatus, the aldehyde dehydrogenase is encoded by orf0489 and the aldehyde decarbonylase is encoded by orfl593 of S. elongatus.
- a modified photosynthetic microorganism comprising a DGAT fusion protein described herein and reduced glycogen accumulation, in combination with one more of an overexpressed ACP; an overexpressed acyl-ACP reductase in combination with an overexpressed ACP; an acyl-ACP reductase in combination with an ACCase; an acyl-ACP reductase in combination with an ACP and an ACCase; an overexpressed acyl-ACP reductase in combination with an overexpressed acyl-CoA synthetase (e.g., a membrane-targeting domain-DGAT fusion/acyl-CoA synthetase combination); an overexpressed acyl-ACP reductase with an overexpressed ACCase optionally in combination with an overexpressed acyl-CoA synthetase; and an overexpressed acyl-ACP reductase with an overexpressed ACP;
- Acyl-ACP reductase and DGAT-overexpressing strains optionally in combination with an overexpressed acyl-CoA synthetase, typically produce increased triglycerides relative to DGAT-only overexpressing strains.
- Any one of these embodiments can be combined with one or more introduced polynucleotides encoding a protein involved in a glycogen breakdown pathway, and/or with a strain having reduced expression of glycogen biosynthesis or storage pathways (e.g., full or partial deletion of glucose-l-phosphate adenyltransferase (glgC) gene and/or a phosphoglucomutase (pgm) gene).
- the present invention contemplates the use of any type of polynucleotide encoding a protein or enzyme associated with glycogen breakdown, removal, and/or elimination, as long as the modified photosynthetic microorganism accumulates a reduced amount of glycogen as compared to the wild type photosynthetic microorganism (e.g., under stress conditions).
- Increased expression or overexpression can be achieved a variety of ways. For example, by introducing a polynucleotide into the microorganism, modifying an endogenous gene to overexpress the polypeptide (e.g., by introducing an exogenous regulatory element such as a promoter), or both. For instance, one or more copies of an otherwise endogenous polynucleotide sequence can be introduced by recombinant techniques to increase expression, that is, to create additional copies of the otherwise endogenous polynucleotide sequence. Decreased expression and/or activity can also be achieved a variety of ways, described elsewhere herein and known in the art, including by mutation of coding and/or regulatory sequences of a gene of interest, and/or by RNA inhibition.
- Modified photosynthetic microorganisms of the present invention may be produced using any type of photosynthetic microorganism. These include, but are not limited to photosynthetic bacteria, green algae, and cyanobacteria.
- the photosynthetic microorganism can be, for example, a naturally photosynthetic microorganism, such as a Cyanobacterium, or an engineered photosynthetic microorganism, such as an artificially photosynthetic bacterium.
- microorganisms that are either naturally photosynthetic or can be engineered to be photosynthetic include, but are not limited to, bacteria; fungi; archaea; protists; eukaryotes, such as a green algae; and animals such as plankton, planarian, and amoeba.
- Examples of naturally occurring photosynthetic microorganisms include, but are not limited to, Spirulina maximum, Spirulina platensis, Dunaliella salina, Botrycoccus braunii, Chlorella vulgaris, Chlorella pyrenoidosa, Serenastrum capricomutum, Scenedesmus auadricauda, Porphyridium cruentum, Scenedesmus acutus, Dunaliella sp., Scenedesmus obliquus, Anabaenopsis, Aulosira, Cylindrospermum, Synechococcus sp., Synechocystis sp., and/or Tolypothrix.
- a modified Cyanobacteria of the present invention may be from any genera or species of Cyanobacteria that is genetically manipulable, i.e., permissible to the introduction and expression of exogenous genetic material.
- Examples of Cyanobacteria that can be engineered according to the methods of the present invention include, but are not limited to, the genus Synechocystis, Synechococcus, Thermosynechococcus, Nostoc, Prochlorococcu, Microcystis, Anabaena, Spirulina, and Gloeobacter.
- Cyanobacteria also known as blue-green algae, blue-green bacteria, or Cyanophyta, is a phylum of bacteria that obtain their energy through photosynthesis. Cyanobacteria can produce metabolites, such as carbohydrates, proteins, lipids and nucleic acids, from C0 2 , water, inorganic salts and light. Any Cyanobacteria may be used according to the present invention.
- Cyanobacteria include both unicellular and colonial species. Colonies may form filaments, sheets or even hollow balls. Some filamentous colonies show the ability to differentiate into several different cell types, such as vegetative cells, the normal, photosynthetic cells that are formed under favorable growing conditions; akinetes, the climate-resistant spores that may form when environmental conditions become harsh; and thick-walled heterocysts, which contain the enzyme nitrogenase, vital for nitrogen fixation.
- Heterocysts may also form under the appropriate environmental conditions (e.g., anoxic) whenever nitrogen is necessary. Heterocyst-forming species are specialized for nitrogen fixation and are able to fix nitrogen gas, which cannot be used by plants, into ammonia (NH 3 ), nitrites (N0 2 ⁇ ), or nitrates (N0 3 ⁇ ), which can be absorbed by plants and converted to protein and nucleic acids.
- anoxic e.g., anoxic
- Cyanobacteria also form motile filaments, called hormogonia, which travel away from the main biomass to bud and form new colonies elsewhere.
- the cells in a hormogonium are often thinner than in the vegetative state, and the cells on either end of the motile chain may be tapered.
- a hormogonium In order to break away from the parent colony, a hormogonium often must tear apart a weaker cell in a filament, called a necridium.
- Each individual Cyanobacterial cell typically has a thick, gelatinous cell wall.
- Cyanobacteria differ from other gram-negative bacteria in that the quorum sensing molecules autoinducer-2 and acyl-homoserine lactones are absent. They lack flagella, but hormogonia and some unicellular species may move about by gliding along surfaces. In water columns, some Cyanobacteria float by forming gas vesicles, like in archaea. Cyanobacteria have an elaborate and highly organized system of internal membranes that function in photosynthesis. Photosynthesis in Cyanobacteria generally uses water as an electron donor and produces oxygen as a by-product, though some Cyanobacteria may also use hydrogen sulfide, similar to other photosynthetic bacteria. Carbon dioxide is reduced to form carbohydrates via the Calvin cycle.
- thylakoids the photosynthetic machinery is embedded into folds of the cell membrane, called thylakoids. Due to their ability to fix nitrogen in aerobic conditions, Cyanobacteria are often found as symbionts with a number of other groups of organisms such as fungi (e.g., lichens), corals, pteridophytes (e.g., Azolla), and angiosperms (e.g., Gunnera), among others.
- fungi e.g., lichens
- corals e.g., corals
- pteridophytes e.g., Azolla
- angiosperms e.g., Gunnera
- Cyanobacteria are the only group of organisms that are able to reduce nitrogen and carbon in aerobic conditions.
- the water-oxidizing photosynthesis is accomplished by coupling the activity of photosystem (PS) II and I (Z-scheme). I n anaerobic conditions, Cyanobacteria are also able to use only PS I (i.e., cyclic photophosphorylation) with electron donors other than water (e.g., hydrogen sulfide, thiosulphate, or molecular hydrogen), similar to purple photosynthetic bacteria.
- Cyanobacteria share an archaeal property; the ability to reduce elemental sulfur by anaerobic respiration in the dark.
- the Cyanobacterial photosynthetic electron transport system shares the same compartment as the components of respiratory electron transport. Typically, the plasma membrane contains only components of the respiratory chain, while the thylakoid membrane hosts both respiratory and photosynthetic electron transport.
- Phycobilisomes attached to the thylakoid membrane, act as light harvesting antennae for the photosystems of Cyanobacteria.
- the phycobilisome components are responsible for the blue-green pigmentation of most Cyanobacteria. Color variations are mainly due to carotenoids and phycoerythrins, which may provide the cells with a red-brownish coloration. I n some Cyanobacteria, the color of light influences the composition of phycobilisomes. In green light, the cells accumulate more phycoerythrin, whereas in red light they produce more phycocyanin. Thus, the bacteria appear green in red light and red in green light.
- the Cyanobacteria may be, e.g., a marine form of Cyanobacteria or a fresh water form of Cyanobacteria.
- Examples of marine forms of Cyanobacteria include, but are not limited to Synechococcus WH8102, Synechococcus RCC307, Synechococcus N KBG 15041c, and Trichodesmium.
- Examples of fresh water forms of Cyanobacteria include, but are not limited to, 5.
- Exogenous genetic material encoding the desired enzymes or polypeptides may be introduced either transiently, such as in certain self-replicating vectors, or stably, such as by integration (e.g., recombination) into the Cyanobacterium's native genome.
- a genetically modified Cyanobacteria of the present invention may be capable of growing in brackish or salt water.
- the overall net cost for production of triglycerides will depend on both the nutrients req uired to grow the culture and the price for freshwater.
- Two such alternatives include: (1) the use of waste water from treatment plants; and (2) the use of salt or brackish water.
- Salt water in the oceans can range in salinity between 3.1% and 3.8%, the average being 3.5%, and this is mostly, but not entirely, made up of sodium chloride (NaCI) ions.
- Brackish water has more salinity than freshwater, but not as much as seawater. Brackish water contains between about 0.5% and 3% salinity, and thus includes a large range of salinity regimes and is therefore not precisely defined.
- Waste water is any water that has undergone human influence. It consists of liquid waste released from domestic and commercial properties, industry, and/or agriculture and can encompass a wide range of possible contaminants at varying concentrations.
- Synechococcus sp. PCC 7002 (formerly known as Agmenellum quadruplicatum strain PR-6) grows in brackish water, is unicellular and has an optimal growing temperature of 38°C. While this strain is well suited to grow in conditions of high salt, it will grow slowly in freshwater. I n particular embodiments, the present invention contemplates the use of a Cyanobacteria 5. elongatus PCC 7942, altered in a way that allows for growth in either waste water or salt/brackish water. A 5.
- elongatus PCC 7942 mutant resistant to sodium chloride stress has been described (Bagchi, S.N. et al., Photosynth Res. 2007, 92:87-101), and a genetically modified 5.
- elongatus PCC 7942 tolerant of growth in salt water has been described (Waditee, R. et al., PNAS. 2002, 99:4109-4114).
- a salt water tolerant strain is capable of growing in water or media having a salinity in the range of 0.5% to 4.0% salinity, although it is not necessarily capable of growing in all salinities encompassed by this range.
- a salt tolerant strain is capable of growth in water or media having a salinity in the range of 1.0% to 2.0% salinity. In another embodiment, a salt water tolerant strain is capable of growth in water or media having a salinity in the range of 2.0% to 3.0% salinity.
- Cyanobacteria that may be utilized and/or genetically modified according to the methods described herein include, but are not limited to, Chroococcales Cyanobacteria from the genera Aphanocapsa, Aphanothece, Chamaesiphon, Chroococcus, Chroogloeocystis, Coelosphaerium, Crocosphaera, Cyanobacterium, Cyanobium, Cyanodictyon, Cyanosarcina, Cyanothece, Dactylococcopsis, Gloecapsa, Gloeothece, Merismopedia, Microcystis, Radiocystis, Rhabdoderma, Snowella, Synychococcus, Synechocystis, Thermosenechococcus, and Woronichinia; Nostacales Cyanobacteria from the genera Anabaena, Anabaenopsis, Aphanizomenon, Aulosira, Calothrix, Coleodesmium,
- the Cyanobacterium is from the genus Synechococcus, including, but not limited to Synechococcus bigranulatus, Synechococcus elongatus, Synechococcus leopoliensis, Synechococcus lividus, Synechococcus nidulans, and Synechococcus rubescens.
- the Cyanobacterium is Anabaena sp. strain PCC 7120,
- Cyanobacterium is S. elongatus sp. strain PCC 7942.
- Cyanobacteria that may be utilized in the methods provided herein include, but are not limited to, Synechococcus sp. strains WH7803, WH8102, WH8103 (typically genetically modified by conjugation), Baeocyte-forming Chroococcidiopsis spp. (typically modified by conjugation/electroporation), non-heterocyst-forming filamentous strains Planktothrix sp., Plectonema boryanum M101 (typically modified by electroporation), and Heterocyst-forming strains Anabaena sp. strains ATCC 29413 (typically modified by conjugation), Tolypothrix sp. strain PCC 7601 (typically modified by conjugation/electroporation) and Nostoc punctiforme strain ATCC 29133 (typically modified by conjugation/electroporation).
- the Cyanobacterium may be 5. elongatus sp. strain PCC 7942 or Synechococcus sp. PCC 7002 (originally known as Agmenellum quadruplicatum).
- the genetically modified, photosynthetic microorganism, e.g., Cyanobacteria, of the present invention may be used to produce triglycerides and/or other carbon-based products from just sunlight, water, air, and minimal nutrients, using routine culture techniques of any reasonably desired scale.
- the present invention contemplates using spontaneous mutants of photosynthetic microorganisms that demonstrate a growth advantage under a defined growth condition.
- the modified photosynthetic microorganism e.g., Cyanobacteria
- the modified photosynthetic microorganism of the present invention a readily manageable and efficient source of feedstock in the subsequent production of both biofuels, such as biodiesel, as well as specialty chemicals, such as glycerin.
- Embodiments of the present invention also include methods of producing the modified photosynthetic microorganisms (e.g., Cyanobacterium) described herein.
- modified photosynthetic microorganisms e.g., Cyanobacterium
- the present invention comprises methods of modifying a photosynthetic microorganism to produce a modified photosynthetic microorganism that produces an increased amount of lipids, e.g., triglycerides, relative to a corresponding wild type photosynthetic microorganism or a differently modified photosynthetic microorganism (e.g., one that expresses DGAT but not a form that selectively localizes to an intracellular region such as a membrane, including the plasma membrane), comprising introducing into said microorganism one or more polynucleotides encoding a intracellular localization domain-DGAT fusion protein described herein, including active fragments or variants thereof.
- lipids e.g., triglycerides
- a heterologous intracellular localization domain e.g., a wild-type DGAT
- the methods may further comprise a step of selecting for photosynthetic microorganisms in which the one or more desired polynucleotides were successfully introduced, where the polynucleotides were, e.g., present in a vector that expressed a selectable marker, such as an antibiotic resistance gene.
- selection and isolation may include the use of antibiotic resistant markers known in the art (e.g., kanamycin, spectinomycin, and streptomycin).
- such photosynthetic microorganisms can be further modified by increasing the expression of one or more lipid biosynthesis proteins, for instance, by introducing an exogenous copy of a polynucleotide that encodes a lipid biosynthesis protein, by increasing expression of an endogenous lipid biosynthesis protein, or both.
- such photosynthetic microorganisms can be further modified by increasing the expression of one or more proteins associated with glycogen breakdown, for instance, by introducing an exogenous copy of a polynucleotide that encodes a glycogen breakdown protein, by increasing expression of an endogenous glycogen breakdown protein, or both.
- the present invention includes methods of producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, and (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous lipid biosynthesis protein coding sequence, and/or introducing one or more polynucleotides encoding a lipid biosynthesis protein, or a fragment or variant thereof.
- a modified photosynthetic microorganism e.g., a Cyanobacteria
- operatively linked promoters e.g., inducible or regulable promoters
- Exemplary lipid biosynthesis proteins include any one or more of acyl carrier proteins (ACP), acyl ACP synthetases (Aas), acyl-ACP reductases, alcohol dehydrogenases, aldehyde dehydrogenases, aldehyde decarbonylases, thioesterases (TES), acetyl coenzyme A carboxylases (ACCase), phosphatidic acid phosphatases (PAP; or phosphatidate phosphatases), triacylglycerol (TAG) hydrolases, fatty acyl-CoA synthetases, and lipases/phospholipases, including any combination thereof.
- ACP acyl carrier proteins
- Aas acyl ACP synthetases
- TES acyl-ACP reductases
- alcohol dehydrogenases aldehyde dehydrogenases
- aldehyde decarbonylases thioesterases
- TES acet
- Certain embodiments include methods of producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, and (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous glycogen breakdown protein coding sequence, and/or introducing one or more polynucleotides encoding a glycogen breakdown protein, or a fragment or variant thereof.
- operatively linked promoters e.g., inducible or regulable promoters
- glycogen breakdown proteins include any one or more of glycogen phosphorylase (GlgP), glycogen isoamylase (GlgX), glucanotransferase (MalQ), phosphoglucomutase (Pgm), glucokinase (Glk), and/or phosphoglucose isomerase (Pgi), including any combination thereof.
- GlgP glycogen phosphorylase
- GlgX glycogen isoamylase
- MalQ glucanotransferase
- Pgm phosphoglucomutase
- Glk glucokinase
- Pgi phosphoglucose isomerase
- Particular embodiments include methods of producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous lipid biosynthesis protein coding sequence, and/or introducing one or more polynucleotides encoding a lipid biosynthesis protein, or a fragment or variant thereof, and (3) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous glycogen breakdown protein coding seq uence, and/or introducing one or more polynucleotides encoding a
- the lipid biosynthesis protein is an acyl carrier protein (ACP), an acyl-ACP synthetase (Aas), or both.
- ACP acyl carrier protein
- Aas acyl-ACP synthetase
- certain embodiments include methods for producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous ACP coding seq uence, and/or introducing one or more polynucleotides encoding an ACP, or a fragment or variant thereof.
- operatively linked promoters e.g., inducible or regula
- These and related methods can further comprise (3) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous TES, ACCase, TAG hydrolase, fatty acyl CoA synthetase, PAP, and/or phospholipase coding seq uence, and/or introducing one or more polynucleotides encoding TES, ACCase, TAG hydrolase, fatty acyl CoA synthetase, PAP, and/or phospholipase, or a fragment or variant thereof.
- operatively linked promoters e.g., inducible or regulable promoters
- Some embodiments include methods for producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous Aas coding sequence, and/or introducing one or more polynucleotides encoding an Aas polypeptide, or a fragment or variant thereof.
- operatively linked promoters e.g., inducible or regulable promoters
- These and related methods can further comprise (3) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous TES, ACCase, TAG hydrolase, fatty acyl CoA synthetase, PAP, and/or phospholipase coding sequence, and/or introducing one or more polynucleotides encoding TES, ACCase, TAG hydrolase, fatty acyl CoA synthetase, PAP, and/or phospholipase, or a fragment or variant thereof.
- operatively linked promoters e.g., inducible or regulable promoters
- Specific embodiments include methods of producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous ACP coding seq uence, and/or introducing one or more polynucleotides encoding an ACP, or a fragment or variant thereof, and (3) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous Aas coding sequence, and/or introducing one or more polynucleotides encoding an Aas polypeptide, or
- These and related methods can further comprise (4) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous TES, ACCase, TAG hydrolase, fatty acyl CoA synthetase, PAP, and/or phospholipase coding sequence, and/or introducing one or more polynucleotides encoding TES, ACCase, TAG hydrolase, fatty acyl CoA synthetase, PAP, and/or phospholipase, or a fragment or variant thereof.
- operatively linked promoters e.g., inducible or regulable promoters
- the lipid biosynthesis protein is an acyl-ACP reductase, optionally in combination with an overexpressed alcohol dehydrogenase, for instance, to increase production of triglycerides and/or produce wax esters.
- Certain embodiments thus include methods of producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, and (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous acyl-ACP reductase coding sequence, and/or introducing one or more polynucleotides encoding an acyl-ACP reductase, or a fragment or variant thereof.
- a modified photosynthetic microorganism e.g., a Cyanobacteria
- methods of producing a modified photosynthetic microorganism comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous acyl-ACP reductase coding sequence, and/or introducing one or more polynucleotides encoding an acyl-ACP reductase, or a fragment or variant thereof, and (3) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous alcohol dehydrogenase coding sequence, and/or
- any of the photosynthetic microorganisms described herein can be further modified by reducing expression and/or activity of one or more endogenous genes/proteins associated with glycogen synthesis and/or storage, one or more endogenous aldehyde dehydrogenases, one or more endogenous aldehyde decarbonylases, and/or one or more endogenous Aas polypeptides.
- exemplary genes/proteins associated with glycogen synthesis and/or storage include glgA, glgC, and pgm.
- expression or activity is reduced by knocking out or knocking down one or more alleles of said one or more genes.
- expression or activity of the one or more genes is reduced by contacting the photosynthetic microorganism with an antisense oligonucleotide or interfering RNA, e.g., an siRNA, that targets said one or more genes.
- an antisense oligonucleotide or interfering RNA e.g., an siRNA
- a vector that expresses a polynucleotide that hybridizes to said one or more genes e.g., an antisense oligonucleotide or an siRNA is introduced into said photosynthetic microorganism.
- mutants such as point mutants, insertions, or full or partial deletions of a gene of interest and/or one or more of its regulatory elements (e.g., promoters, enhancers), to reduce expression and/or activity of a protein of interest.
- mutants such as point mutants, insertions, or full or partial deletions of a gene of interest and/or one or more of its regulatory elements (e.g., promoters, enhancers), to reduce expression and/or activity of a protein of interest.
- Natural selection or directed selection can also be used to identify naturally-occurring mutants having reduced expression and/or activity of a protein of interest.
- particular embodiments include methods for producing a modified photosynthetic microorganism having reduced expression and/or activity of an aldehyde dehydrogenase, an aldehyde decarbonylase, or both.
- These and related embodiments may comprise (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, and (2) introducing one or more mutations into an endogenous gene encoding an aldehyde dehydrogenase, such as a point mutation, insertion, or full or partial deletion, which reduces expression and/or activity of the aldehyde dehydrogenase, e.g., renders the aldehyde dehydrogenase "non-functional," as described herein.
- Also included are methods for producing a modified photosynthetic microorganism comprising (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, and (2) introducing one or more mutations into an endogenous gene encoding an aldehyde decarbonylase, such as a point mutation, insertion, or full or partial deletion, which reduces expression and/or activity of the aldehyde decarbonylase.
- Some embodiments include methods for producing a modified photosynthetic microorganism, comprising (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing one or more mutations into an endogenous gene encoding an aldehyde dehydrogenase, such as a point mutation, insertion, or full or partial deletion, which reduces expression and/or activity of the aldehyde dehydrogenase, and (3) introducing one or more mutations into an endogenous gene encoding an aldehyde decarbonylase, such as a point mutation, insertion, or full or partial deletion, which reduces expression and/or activity of the aldehyde decarbonylase.
- Particular methods include producing a modified photosynthetic microorganism having increased expression of an acyl-ACP reductase and an alcohol dehydrogenase, in combination with reduced expression and/or activity of an aldehyde dehydrogenase, reduced expression and/or activity of an aldehyde decarbonylase, or both.
- These and related embodiments can be useful in the production of wax esters, as described herein.
- Some embodiments thus include methods for producing a modified photosynthetic microorganism, comprising (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous acyl-ACP reductase coding sequence, and/or introducing one or more polynucleotides encoding an acyl-ACP reductase, or a fragment or variant thereof, (3) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous alcohol dehydrogenase coding sequence, and/or introducing one or more polynucleotides encoding an alcohol dehydrogenase
- the photosynthetic microorganism is 5. elongatus, the aldehyde dehydrogenase is encoded by orf0489, and the aldehyde decarbonylase is encoded by orfl593.
- Photosynthetic microorganisms may be genetically modified according to techniques known in the art, e.g., delete a portion or all of a gene or to introduce a polynucleotide that expresses a functional polypeptide.
- genetic manipulation in photosynthetic microorganisms e.g., Cyanobacteria
- non-replicating vectors which contain native photosynthetic microorganism sequences, exogenous genes of interest, and selectable markers or drug resistance genes.
- the vectors may be integrated into the photosynthetic microorganism's genome through homologous recombination. I n this way, an exogenous gene of interest and the drug resistance gene are stably integrated into the photosynthetic microorganism's genome. Such recombinants cells can then be isolated from non-recombinant cells by drug selection.
- Cyanobacteria Molecular Biology, Genetics, and Evolution (eds. Antonio Herrera and Enrique Flores) Caister Academic Press, 2008, each of which is incorporated by reference for their description on gene transfer into Cyanobacteria, and other information on Cyanobacteria).
- an endogenous version of a protein e.g., ACP, Aas, TES, ACCase, TAG hydrolase, fatty acyl-CoA synthetase, PAP, PL
- a protein e.g., ACP, Aas, TES, ACCase, TAG hydrolase, fatty acyl-CoA synthetase, PAP, PL
- promoters may be constitutive or inducible.
- deletions or mutations of any of the one or more genes associated with the biosynthesis or storage of glycogen can be accomplished according to a variety of methods known in the art, including the use of a non-replicating, selectable vector system that is targeted to the upstream and downstream flanking regions of a given gene (e.g., glgC, pgm), and which recombines with the Cyanobacterial genome at those flanking regions to replace the endogenous coding sequence with the vector seq uence.
- a selectable marker in the vector seq uence such as a drug selectable marker
- Cyanobacterial cells containing the gene deletion can be readily isolated, identified and characterized.
- Such selectable vector-based recom bination methods need not be limited to targeting upstream and downstream flanking regions, but may also be targeted to internal sequences within a given gene, as long as that gene is rendered "non-functional," as described herein.
- deletions or mutations can also be accomplished using antisense-based technology.
- Cyanobacteria have been shown to contain natural regulatory events that rely on antisense regulation, such as a 177-nt ncRNA that is transcribed in antisense to the central portion of an iron-regulated transcript and blocks its accumulation through extensive base pairing (see, e.g., Duhring, et ai, Proc. Natl. Acad. Sci.
- antisense molecules encompass both single and double-stranded polynucleotides comprising a strand having a sequence that is complementary to a target coding strand of a gene or mRNA.
- antisense molecules include both single-stranded antisense oligonucleotides and double-stranded siRNA molecules.
- the modified photosynthetic microorganisms and methods of the present invention may be used to produce lipids, such as fatty acids, triglycerides, and/or wax esters. Accordingly, the present invention provides methods of producing lipids, comprising culturing any of the modified photosynthetic microorganisms of the present invention (described elsewhere herein)
- the modified photosynthetic microorganism is a Cyanobacterium that produces or accumulates increased lipids relative to an unmodified or wild-type Cyanobacterium of the same species, or a differently modified Cyanobacterium of the same species.
- the modified photosynthetic microorganism is a Cyanobacterium that produces or accumulates increased triglycerides relative to an unmodified or wild-type Cyanobacterium of the same species, or a differently modified Cyanobacterium of the same species.
- the differently modified Cyanobacterium expresses a wild-type DGAT, and no other form(s) of DGAT.
- Other examples of differently modified Cyanobacteria are described herein.
- increased triglyceride production is associated with improved cell growth characteristics relative to the differently modified Cyanobacterium, e.g., increased cell survival over time, and is thus measured over time, e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or more days post-culture or post-induction of DGAT expression, or in a continuous culture system.
- the modified photosynthetic microorganism is a Cyanobacterium that produces or accumulates increased wax esters relative to an unmodified or wild-type Cyanobacterium of the same species, or a differently modified Cyanobacterium of the same species.
- the Cyanobacterium overexpresses an acyl-ACP reductase and an alcohol dehydrogenase, in combination with an intracellular localization domain-DGAT fusion protein.
- the differently modified Cyanobacterium is one that expresses DGAT in combination with an acyl-ACP reductase and an alcohol dehydrogenase, and thus produces wax esters, but expresses a wild-type DGAT and no other form of DGAT.
- increased wax ester production is associated with improved cell growth characteristics relative to the differently modified Cyanobacterium, e.g., increased cell survival over time, and is thus measured over time, e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or more days post-culture or post-induction of DGAT expression, or in a continuous culture system.
- the one or more introduced polynucleotides are present in one or more expression constructs.
- the one or more expression constructs comprises one or more inducible promoters.
- the one or more expression constructs are stably integrated into the genome of said modified photosynthetic microorganism.
- the introduced polynucleotide encoding an introduced protein is present in an expression construct comprising a weak promoter under non-induced conditions.
- one or more of the introduced polynucleotides are codon-optimized for expression in a Cyanobacterium, e.g., a Synechococcus elongatus.
- the photosynthetic microorganism is a Synechococcus elongatus, such as Synechococcus elongatus strain PCC7942 or a salt tolerant variant of Synechococcus elongatus strain PCC7942.
- the photosynthetic microorganism is a Synechococcus sp. PCC 7002 or a Synechocystis sp. PCC6803.
- Photosynthetic microorganisms may be cultured according to techniques known in the art.
- Cyanobacteria may be cultured or cultivated according to techniques known in the art, such as those described in Acreman et al. (Journal of Industrial Microbiology and Biotechnology 13:193-194, 1994), in addition to photobioreactor based techniques, such as those described in Nedbal et al. (Biotechnol Bioeng. 100:902-10, 2008).
- One example of typical laboratory culture conditions for Cyanobacterium is growth in BG-11 medium (ATCC Medium 616) at 30° C in a vented culture flask with constant agitation and constant illumination at 30-100 ⁇ photons m "2 sec "1 .
- a wide variety of mediums are available for culturing Cyanobacteria, including, for example, Aiba and Ogawa (AO) Medium, Allen and Arnon Medium plus Nitrate (ATCC Medium 1142), Antia's (ANT) Medium, Aquil Medium, Ashbey's Nitrogen-free Agar, ASN-III Medium, ASP 2 Medium, ASW Medium (Artificial Seawater and derivatives), ATCC Medium 617 (BG-11 for Marine Blue-Green Algae; Modified ATCC Medium 616 [BG-11 medium]), ATCC Medium 819 (Blue-green Nitrogen-fixing Medium; ATCC Medium 616 [BG-11 medium] without N0 3 ), ATCC Medium 854 (ATCC Medium 616 [BG-11 medium] with Vitamin B i2 ), ATCC Medium 1047 (ATCC Medium 957 [MN marine medium] with Vitamin B i2 ), ATCC Medium 1077 (Nitrogen-fixing marine medium; ATCC Medium 957 [MN marine medium] without N0 3 ), ATCC Medium 12
- the modified photosynthetic microorganisms are cultured under conditions suitable for inducing expression of the introduced polynucleotide(s), e.g., wherein the introduced polynucleotide(s) comprise an inducible promoter.
- Conditions and reagents suitable for inducing inducible promoters are known and available in the art.
- auto-inductive systems for example, where a metabolite represses expression of the introduced polynucleotide, and the use of that metabolite by the microorganism over time decreases its concentration and thus its repressive activities, thereby allowing increased expression of the polynucleotide sequence.
- modified photosynthetic microorganisms e.g., Cyanobacteria
- light intensity is between 100 and 2000 uE/m2/s, or between 200 and 1000 uE/m2/s.
- the pH range of culture media is between 7.0 and 10.0.
- C0 2 is injected into the culture apparatus to a level in the range of 1% to 10%.
- the range of C0 2 is between 2.5% and 5%.
- nutrient supplementation is performed during the linear phase of growth. Each of these conditions may be desirable for triglyceride production.
- the modified photosynthetic microorganisms are cultured, at least for some time, under static growth conditions as opposed to shaking conditions.
- the modified photosynthetic microorganisms may be cultured under static conditions prior to inducing expression of an introduced polynucleotide (e.g., intracellular localization domain-DGAT fusion, acyl-ACP reductase, ACP, Aas, ACP/Aas, glycogen breakdown protein, ACCase, DGAT, fatty acyl-CoA synthetase, aldehyde dehydrogenase, alcohol dehydrogenase) and/or the modified photosynthetic microorganism may be cultured under static conditions while expression of an introduced polynucleotide is being induced, or during a portion of the time period during which expression on an introduced polynucleotide is being induced.
- Static growth conditions may be defined, for example, as growth without shaking or growth wherein the cells are shake
- the modified photosynthetic microorganisms are cultured, at least for some time, in media supplemented with varying amounts of bicarbonate.
- the modified photosynthetic microorganisms may be cultured with bicarbonate at 5, 10, 20, 50, 75, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000 mM bicarbonate prior to inducing expression of an introduced polynucleotide (e.g., membrane-targeting domain-DGAT fusion protein, acyl-ACP reductase, aldehyde dehydrogenase, ACP, Aas, ACP/Aas, glycogen breakdown protein, ACCase, DGAT, fatty acyl-CoA synthetase, alcohol dehydrogenase) and/or the modified photosynthetic microorganism may be cultured with aforementioned bicarbonate concentrations while expression of an introduced polynucleotide is being induced, or during a
- modified photosynthetic organisms and methods of the present invention may be used in the production of a biofuel and/or a specialty chemical, such as glycerin or a wax ester.
- a method of producing a biofuel comprises culturing any of the modified photosynthetic microorganisms of the present invention under conditions wherein the modified photosynthetic microorganism accumulates an increased amount of total cellular lipid, fatty acid, wax ester, and/or triglyceride, as compared to a corresponding wild-type photosynthetic microorganism, obtaining cellular lipid, fatty acid, wax ester, and/or triglyceride from said microorganism, and processing the obtained cellular lipid, fatty acid, wax ester, and/or triglyceride to produce a biofuel.
- a method of producing a biofuel comprises processing lipids, fatty acids, wax esters, and/or triglycerides produced by a modified photosynthetic microorganism of the present invention to produce a biofuel.
- a method of producing a biofuel comprises obtaining lipid, fatty acid, wax esters, and/or triglyceride produced by a modified photosynthetic microorganism of the present invention, and processing the obtained cellular lipid, fatty acid, wax ester, and/or triglyceride to produce a biofuel.
- the modified photosynthetic organism is grown under conditions wherein it has reduced growth but maintains photosynthesis.
- lipids from microorganisms to produce a biofuel or specialty chemical e.g., biodiesel
- triglycerides may be transesterified to produce biodiesel.
- Transesterification may be carried out by any one of the methods known in the art, such as alkali-, acid-, or lipase-catalysis (see, e.g., Singh et al., Recent Pat Biotechnol. 2008, 2(2):130-143).
- Various methods of transesterification utilize, for example, use of a batch reactor, a supercritical alcohol, an ultrasonic reactor, or microwave irradiation (Such methods are described, e.g., in Jeong and Park, Appl Biochem Biotechnol. 2006, 131(l-3):668-679; Fukuda et al., Journal of Bioscience and Engineering. 2001, 92(5):405-416; Shah and Gupta, Chemistry Central Journal. 2008, 2(l):l-9; and Carrillo-Munoz et al., J Org Chem. 1996, 61(22):7746-7749).
- the biodiesel may be further processed or purified, e.g., by distillation, and/or a biodiesel stabilizer may be added to the biodiesel, as described in U.S. Patent Application Publication No. 2008/0282606.
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Medicinal Chemistry (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Tropical Medicine & Parasitology (AREA)
- Virology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
This disclosure describes genetically modified photosynthetic microorganisms, including Cyanobacteria, that contain one or more exogenous genes encoding a diacyglycerol acyltransferase and at least one exogenous polynucleotide encoding at least one lipid packaging protein, and which are capable of producing increased amounts of fatty acids and/or synthesizing triglycerides.
Description
Cyanobacteria That Produce Lipid Packaging Proteins
CROSS-REFERENCE TO RELATED PATENT APPLICATIONS
This application claims priority to US Provisional Patent Application No. 61/780,712 filed on March 13, 2013, entitled "Cyanobacteria That Produce Lipid Packaging Proteins," which is hereby incorporated by reference in their entirety.
SEQUENCE LISTING
The Sequence Listing associated with this application is provided in text format in lieu of a paper copy, and is hereby incorporated by reference into the specification. The name of the text file containing the Sequence Listing is M077-0012USPl_Sequence Listing_ST25.txt. The text file is about 635 KB, was created on March 12, 2013, and is being submitted electronically via EFS-Web.
BACKGROUND
Certain organisms can be utilized as a source of oil in the production of biofuels. For example, algae naturally produce oil such as triglycerides as energy storage molecules, and certain biofuel-related technologies are presently focused on the use of algae as a feedstock for biofuels. Algae are photosynthetic organisms, and the use of triglyceride-producing organisms such as algae provides the ability to produce biodiesel from sunlight, water, C02, macronutrients, and micronutrients. Algae, however, cannot be readily genetically manipulated, and produce much less oil (i.e., triglycerides) under culture conditions than in the wild.
Like algae, Cyanobacteria obtain energy from photosynthesis, utilizing chlorophyll A and water to reduce C02. Certain Cyanobacteria can produce metabolites, such as carbohydrates, proteins, and fatty acids, from just sunlight, water C02, water, and inorganic salts. Unlike algae, Cyanobacteria can be genetically manipulated. For example, S. elongatus PCC 7942 (hereafter refered to as "5. elongatus PCC 7942") is a genetically manipulate, oligotrophic Cyanobacterium that thrives in low nutrient level conditions, and in the wild accumulates fatty
acids in the form of lipid membranes to about 4 to 8 % by dry weight. Cyanobacteria such as Synechococcus, however, produce no triglyceride energy storage molecules, since Cyanobacteria typically lack the essential enzymes involved in triglyceride synthesis.
Clearly, therefore, there is a need in the art for modified photosynthetic microorganisms, including Cyanobacteria, capable of producing oil with high efficiency, such as triglycerides, e.g., to be used as feedstock in the production of biofuels and/or various specialty chemicals.
BRIEF SUMMARY
Embodiments of the present invention relate to the demonstration that photosynthetic microorganisms, including Cyanobacteria, can be genetically modified to increase fatty acid biosynthesis, and to produce neutral lipids and lipid packaging proteins. Generally, the modified photosynthetic microorganisms, e.g., Cyanobacteria, of the present invention comprise one or more polynucleotides encoding one or more enzymes associated with neutral lipid synthesis and lipid packaging protein.
Embodiments concern a modified Cyanobacterium comprising an exogenous polynucleotide encoding a diacylglycerol acyltransferase (hereinafter referred to as "DGAT") and at least one exogenous polynucleotide encoding at least one lipid packaging protein.
Other embodiments concerns a method for producing a neutral lipid, comprising culturing a Cyanobacterium comprising an exogenous polynucleotide encoding a DGAT and at least one exogenous polynucleotide encoding at least one lipid packaging protein to produce a neutral lipid.
Still other embodiments concern a method for producing a modified Cyanobacterium, comprising modifying a Cyanobacterium by introducing into the Cyanobacterium an exogenous polynucleotide encoding a DGAT and at least one exogenous polynucleotide encoding at least one lipid packaging protein, wherein said Cyanobacterium produces a neutral lipid.
DETAILED DESCRIPTION
The present invention relates, in part, to the demonstration that photosynthetic organisms, including but not limited to Cyanobacteria, such as Synechococcus, which do not naturally produce neutral lipids can be genetically modified to synthesize neutral lipids, such as triglycerides, and to produce lipid packaging proteins which encapsulate the produced neutral lipids and which increase the size and/or number of the neutral lipid bodies. Examples of enzymes associated with neutral lipid synthesis include enzymes having a phosphatidate phosphatase activity and enzymes having a DGAT activity . Specifically, phosphatidate phosphatase enzymes catalyze the production of diacylglycerol molecules, an immediate pre-cursor to triglycerides, and DGAT enzymes catalyze the final step of triglyceride synthesis by converting the diacylglycerol precursors to triglycerides.
Definitions
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by those of ordinary skill in the art to which the invention belongs. Although any methods and materials simila r or equivalent to those described herein can be used in the practice or testing of the present invention, preferred methods and materials are described. For the purposes of the present invention, the following terms are defined below.
The articles "a" and "an" are used herein to refer to one or to more than one (i.e. to at least one) of the grammatical object of the article. By way of example, "an element" means one element or more than one element.
The term "about" means a q uantity, level, value, num ber, frequency, percentage, dimension, size, amount, weight or length that varies by as much as 30, 25, 20, 25, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 % to a reference quantity, level, value, number, frequency, percentage, dimension, size, amount, weight or length.
The term "biologically active fragment", as applied to fragments of a reference polynucleotide or polypeptide sequence, refers to a fragment that has at least about 0.1, 0.5, 1, 2, 5, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 96, 97, 98, 99, 100, 110, 120, 150, 200, 300, 400, 500, 600, 700, 800, 900, 1000% or more of the
activity of a reference sequence. The term "reference sequence" refers generally to a nucleic acid coding sequence, or amino acid sequence, of any enzyme having a DGAT activity, a phosphatidate phosphatase activity, and/or an acetyl-CoA carboxylase activity, as described herein {see, e.g., SEQ ID NOS:l-9).
I ncluded within the scope of the present invention are biologically active fragments of at least about 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 40, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, 200, 220, 240, 260, 280, 300, 320, 340, 360, 380, 400, 500, 600 or more contiguous nucleotides or amino acid residues in length, including all integers in between, which comprise or encode a polypeptide having an enzymatic activity of a reference polynucleotide or polypeptide. Representative biologically active fragments generally participate in an interaction, e.g., an intra-molecular or an inter-molecular interaction. An inter-molecular interaction can be a specific binding interaction or an enzymatic interaction. Examples of enzymatic interactions or activities include diacylglycerol acyltransferase activity, phosphatidate phosphatase activity, and/or acetyl-CoA carboxylase activity, as described herein.
"Coding seq uence" means any nucleic acid sequence that contributes to the code for the polypeptide product of a gene. By contrast, the term "non-coding sequence" refers to any nucleic acid sequence that does not contribute to the code for the polypeptide product of a gene.
Throughout this specification, unless the context requires otherwise, the words
"comprise", "comprises" and "comprising" will be understood to imply the inclusion of a stated step or element or group of steps or elements but not the exclusion of any other step or element or group of steps or elements.
"Consisting of" means including, and limited to, whatever follows the phrase "consisting of." Thus, the phrase "consisting of" indicates that the listed elements are required or mandatory, and that no other elements may be present.
"Consisting essentially of" means including any elements listed after the phrase, and limited to other elements that do not interfere with or contribute to the activity or action specified in the disclosure for the listed elements. Thus, the phrase "consisting essentially of"
indicates that the listed elements are required or mandatory, but that other elements are optional and may or may not be present depending upon whether or not they affect the activity or action of the listed elements.
The terms "complementary" and "complementarity" refer to polynucleotides (i.e., a sequence of nucleotides) related by the base-pairing rules. For example, the seq uence "A-G-T," is complementary to the sequence "T-C-A." Complementarity may be "partial," in which only some of the nucleic acids' bases are matched according to the base pairing rules. Or, there may be "complete" or "total" complementarity between the nucleic acids. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands.
"Corresponds to" or "corresponding to" means (a) a polynucleotide having a nucleotide sequence that is substantially identical or complementary to all or a portion of a reference polynucleotide sequence or encoding an amino acid sequence identical to an amino acid sequence in a peptide or protein; or (b) a peptide or polypeptide having an amino acid sequence that is substantially identical to a seq uence of amino acids in a reference peptide or protein.
"Derivative" means a polypeptide that has been derived from the basic seq uence by modification, for example by conjugation or complexing with other chemical moieties (e.g., pegylation) or by post-translational modification techniq ues as would be understood in the art. The term "derivative" also includes within its scope alterations that have been made to a parent sequence including additions or deletions that provide for functionally eq uivalent molecules.
"Enzyme reactive conditions" means that any necessary conditions are available in an environment (i.e., such factors as temperature, pH, and lack of inhibiting substances) which will permit the enzyme to function. Enzyme reactive conditions can be either in vitro, such as in a test tube, or in vivo, as within a cell.
As used herein, the terms "function" and "functional" refer to a biological, enzymatic, or therapeutic function.
"Gene" is means a unit of inheritance that occupies a specific locus on a chromosome and consists of transcriptional and/or translational regulatory sequences and/or a coding region and/or non-translated sequences (i.e., introns, 5' and 3' untranslated sequences).
"Homology" refers to the percentage number of amino acids that are identical or constitute conservative substitutions. Homology may be determined using sequence comparison programs such as GAP (Deveraux et al., 1984, Nucleic Acids Research 12, 387-395) which is incorporated herein by reference. I n this way seq uences of a similar or substantially different length to those cited herein could be compared by insertion of gaps into the alignment, such gaps being determined, for example, by the comparison algorithm used by GAP.
The term "host cell" includes an individual cell or cell culture which can be or has been a recipient of any recombinant vector(s) or isolated polynucleotide of the invention. Host cells include progeny of a single host cell, and the progeny may not necessarily be completely identical (in morphology or in total DNA complement) to the original parent cell due to natural, accidental, or deliberate mutation and/or change. A host cell includes cells transfected or infected in vivo or in vitro with a recombinant vector or a polynucleotide of the invention. A host cell which comprises a recombinant vector of the invention is a recombinant host cell.
"Isolated" means material that is substantially or essentially free from components that normally accompany it in its native state. For example, an "isolated polynucleotide", as used herein, refers to a polynucleotide, which has been purified from the seq uences which flank it in a naturally-occurring state, e.g., a DNA fragment which has been removed from the sequences that are normally adjacent to the fragment. Alternatively, an "isolated peptide" or an "isolated polypeptide" and the like, as used herein, refer to in vitro isolation and/or purification of a peptide or polypeptide molecule from its natural cellular environment, and from association with other components of the cell.
"Increased" or "increasing" means the ability of one or more modified photosynthetic microorganisms, e.g., Cyanobacteria, to produce a greater amount of a given fatty acid, lipid molecule, or triglyceride as compared to a control Cyanobacteria, such as an unmodified Cyanobacteria or a differently modified Cyanobacteria. Production of fatty acids can be
measured according to techniq ues known in the art, such as Nile Red staining and gas chromatography. Production of triglycerides can be measured, for example, using commercially available enzymatic tests, including colorimetric enzymatic tests using glycerol-3-phosphate-oxidase.
As used herein, "neutral lipid" means any lipid that is soluble only in solvents of very low polarity. Neutral lipids are divided into two main groups: (1) acylglycerols (glycerides), i.e. fatty-acid esters of glycerol; and (2) waxes, i.e. fatty-acid esters of long-chain monohydroxy alcohols. The neutral lipid may include a fatty acid ester arising from the joining of the fatty acid carboxyl group to a hydroxyl group to make an ester bond. This can occur, for example, between a fatty acid and glycerol to make mono-, di- and triglycerides, or between a fatty acid and an alcohol to make a wax ester.
"Obtained from" means that a sample such as, for example, a polynucleotide extract or polypeptide extract is isolated from, or derived from, a particular source, such as a desired organism or a specific tissue within a desired orga nism. "Obtained from" can also refer to the situation in which a polynucleotide or polypeptide sequence is isolated from, or derived from, a particular organism or tissue within an organism. For example, a polynucleotide seq uence encoding a DGAT, phosphatidate phosphatase, and/or acetyl-CoA carboxylase enzyme may be isolated from a variety of prokaryotic or eukaryotic organisms, or from particular tissues or cells within certain eukaryotic organism.
The term "operably linked" as used herein means placing a gene under the regulatory control of a promoter, which then controls the transcription and optionally the translation of the gene. I n the construction of heterologous promoter/structural gene combinations, it is generally preferred to position the genetic sequence or promoter at a distance from the gene transcription start site that is approximately the same as the distance between that genetic sequence or promoter and the gene it controls in its natural setting; i.e. the gene from which the genetic sequence or promoter is derived. As is known in the art, some variation in this distance can be accommodated without loss of function. Similarly, the preferred positioning of a regulatory sequence element with respect to a heterologous gene to be placed under its control is defined by the positioning of the element in its natural setting; i.e., the genes from
which it is derived. "Constitutive promoters" are typically active, i.e., promote transcription, under most conditions. "Inducible promoters" are typically active only under certain conditions, such as in the presence of a given molecule factor (e.g., IPTG) or a given environmental condition (e.g., particular C02 concentration, nutrient levels, light, heat). I n the absence of that condition, inducible promoters typically do not allow significant or measurable levels of transcriptional activity. For example, inducible promoters may be induced according to temperature, pH, a hormone, a metabolite (e.g., lactose, mannitol, an amino acid), light (e.g., wavelength specific), osmotic potential (e.g., salt induced), a heavy metal, or an antibiotic. Numerous standard inducible promoters will be known to one of skill in the art.
The recitation "polynucleotide" or "nucleic acid" as used herein designates mRNA, RNA, cRNA, rRNA, cDNA or DNA. The term typically refers to polymeric form of nucleotides of at least 10 bases in length, either ribonucleotides or deoxynucleotides or a modified form of either type of nucleotide. The term includes single and double stranded forms of DNA.
The terms "polynucleotide variant" and "variant" and the like refer to polynucleotides displaying substantial seq uence identity with a reference polynucleotide sequence or polynucleotides that hybridize with a reference seq uence under stringent conditions that are defined hereinafter. These terms also encompass polynucleotides that are distinguished from a reference polynucleotide by the addition, deletion or substitution of at least one nucleotide. Accordingly, the terms "polynucleotide variant" and "variant" include polynucleotides in which one or more nucleotides have been added or deleted, or replaced with different nucleotides. I n this regard, it is well understood in the art that certain alterations inclusive of mutations, additions, deletions and substitutions can be made to a reference polynucleotide whereby the altered polynucleotide retains the biological function or activity of the reference polynucleotide, or has increased activity in relation to the reference polynucleotide (i.e., optimized). Polynucleotide variants include, for example, polynucleotides having at least 50% (and at least 51% to at least 99% and all integer percentages in between) sequence identity with a reference polynucleotide sequence that encodes a diacylglycerol acyltransferase, a phosphatidate phosphatase, and/or an acetyl-CoA carboxylase enzyme. The terms
"polynucleotide variant" and "variant" also include naturally-occurring allelic variants and orthologs that encode these enzymes.
Regarding polynucleotides, the term "exogenous" refers to a polynucleotide seq uence that does not naturally occur in a wild-type cell or organism, but is typically introduced into the cell by molecular biological techniq ues. Examples of exogenous polynucleotides include vectors, plasmids, and/or man-made nucleic acid constructs encoding a desired protein. Regarding polynucleotides, the term "endogenous" or "native" refers to naturally occurring polynucleotide sequences that may be found in a given wild-type cell or organism. For example, certain cyanobacterial species do not typically contain a DGAT gene, and, therefore, do not comprise an "endogenous" polynucleotide sequence that encodes a DGAT polypeptide. Also, a particular polynucleotide sequence that is isolated from a first organism and transferred to second organism by molecular biological techniques is typically considered an "exogenous" polynucleotide with respect to the second organism.
"Polypeptide," "polypeptide fragment," "peptide" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues and to variants and synthetic analogues of the same. Thus, these terms apply to amino acid polymers in which one or more amino acid residues are synthetic non-naturally occurring amino acids, such as a chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally-occurring amino acid polymers. I n certain aspects, polypeptides may include enzymatic polypeptides, or "enzymes," which typically catalyze (i.e., increase the rate of) various chemical reactions.
A polypeptide "variant" refers to polypeptides that are distinguished from a reference polypeptide seq uence by the addition, deletion or substitution of at least one amino acid residue. In certain embodiments, a polypeptide variant is distinguished from a reference polypeptide by one or more substitutions, which may be conservative or non-conservative. I n certain embodiments, the polypeptide variant comprises conservative substitutions and, in this regard, it is well understood in the art that some amino acids may be changed to others with broadly similar properties without changing the nature of the activity of the polypeptide.
Polypeptide variants also encompass polypeptides in which one or more amino acids have been added or deleted, or replaced with different amino acid residues.
As used herein, "lipid packaging protein" or "lipid body packaging protein" refers to structural components of intracellular lipid globules, or proteins that interact with or are evolutionarily related to the components of intracellular lipid globules, which encapsulate lipid and/or neutral lipids within the host cell. Examples of lipid packaging protein include Caleosin and Oleosin Proteins.
The present invention contemplates the use in the methods described herein of variants of full-length enzymes having DGAT activity, phosphatidate phosphatase activity, and/or acetyl-CoA carboxylase activity, truncated fragments of these full-length polypeptides, variants of truncated fragments, as well as their related biologically active fragments. Typically, biologically active fragments of a polypeptide may participate in an interaction, for example, an intra-molecular or an inter-molecular interaction. An inter-molecular interaction can be a specific binding interaction or an enzymatic interaction (e.g., the interaction can be transient and a covalent bond is formed or broken). Biologically active fragments of a polypeptide/enzyme having a diacylglycerol acyltransferase activity, a phosphatidate phosphatase activity, and/or acetyl-CoA carboxylase activity include peptides comprising amino acid sequences sufficiently similar to, or derived from, the amino acid sequences of a (putative) full-length reference polypeptide sequence. Typically, biologically active fragments comprise a domain or motif with at least one activity of a DGAT, phosphatidate phosphatase polypeptide, and/or acetyl-coA carboxylase polypeptide, and may include one or more (and in some cases all) of the various active domains. A biologically active fragment of a DGAT, phosphatidate phosphatase, and/or acetyl-CoA carboxylase polypeptide can be a polypeptide fragment which is, for example, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29,30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 220, 240, 260, 280, 300, 320, 340, 360, 380, 400, 450, 500, 600 or more contiguous amino acids, including all integers in between, of a reference polypeptide sequence. In certain embodiments, a biologically active fragment comprises a conserved enzymatic sequence, domain, or motif, as described elsewhere herein and known in the art. Suitably, the biologically-active fragment
has no less than about 1%, 10%, 25%, 50% of an activity of the wild-type polypeptide from which it is derived.
A "reference sequence," as used herein, refers to a wild-type polynucleotide or polypeptide sequence from any organism, e.g., wherein the polynucleotide encodes a polypeptide having an acyl-ACP reductase, as described herein and known in the art. Exemplary polypeptide "reference sequences" are provided herein, including the polynucleotide and polypepetide sequences of an acyl-ACP reductase of Synechococcus elongatus PCC7942 (see SEQ ID NOs:l and 2 for the polynucleotide and polypeptide sequences, respectively) and an acyl-ACP reductase of Synechocystis sp. PCC6803 (SEQ ID NOs:3 and 4 for the polynucleotide and polypeptide sequences, respectively), among others known to a person skilled in the art.
The recitations "sequence identity" or, for example, comprising a "seq uence 50% identical to," as used herein, refer to the extent that sequences are identical on a nucleotide-by-nucleotide basis or an amino acid-by-amino acid basis over a window of comparison. Thus, a "percentage of sequence identity" may be calculated by comparing two optimally aligned sequences over the window of comparison, determining the number of positions at which the identical nucleic acid base (e.g., A, T, C, G, I) or the identical amino acid residue (e.g., Ala, Pro, Ser, Thr, Gly, Val, Leu, lie, Phe, Tyr, Trp, Lys, Arg, His, Asp, Glu, Asn, Gin, Cys and Met) occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison (i.e., the window size), and multiplying the result by 100 to yield the percentage of sequence identity.
Terms used to describe seq uence relationships between two or more polynucleotides or polypeptides include "reference seq uence", "comparison window", "seq uence identity", "percentage of sequence identity" and "substantial identity". A "reference sequence" is at least 12 but frequently 15 to 18 and often at least 25 monomer units, inclusive of nucleotides and amino acid residues, in length. Because two polynucleotides may each comprise (1) a sequence (i.e., only a portion of the complete polynucleotide seq uence) that is similar between the two polynucleotides, and (2) a seq uence that is divergent between the two polynucleotides,
sequence comparisons between two (or more) polynucleotides are typically performed by comparing sequences of the two polynucleotides over a "comparison window" to identify and compare local regions of seq uence similarity. A "comparison window" refers to a conceptual segment of at least 6 contiguous positions, usually about 50 to about 100, more usually about 100 to about 150 in which a sequence is compared to a reference seq uence of the same number of contiguous positions after the two seq uences are optimally aligned. The comparison window may comprise additions or deletions (i.e., gaps) of about 20% or less as compared to the reference seq uence (which does not comprise additions or deletions) for optimal alignment of the two sequences. Optimal alignment of sequences for aligning a comparison window may be conducted by computerized implementations of algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package Release 7.0, Genetics Computer Group, 575 Science Drive Madison, Wl, USA) or by inspection and the best alignment (i.e., resulting in the highest percentage homology over the comparison window) generated by any of the various methods selected. Reference also may be made to the BLAST family of programs as for example disclosed by Altschul et al., 1997, Nucl. Acids Res. 25 :3389. A detailed discussion of sequence analysis can be found in Unit 19.3 of Ausubel et al., "Current Protocols in Molecular Biology", John Wiley & Sons I nc, 1994-1998, Chapter 15.
As used herein, the term "triglyceride" (triacylglycerol or neutral fat) refers to a fatty acid triester of glycerol. Triglycerides are typically non-polar and water-insoluble. Phosphoglycerides (or glycerophospholipids) are major lipid components of biological membranes.
"Transformation" refers to the permanent, heritable alteration in a cell resulting from the uptake and incorporation of foreign DNA into the host-cell genome; also, the transfer of an exogenous gene from one organism into the genome of another organism.
"Vector" means a polynucleotide molecule, preferably a DNA molecule derived, for example, from a plasmid, bacteriophage, yeast or virus, into which a polynucleotide can be inserted or cloned. A vector preferably contains one or more unique restriction sites and can be capable of autonomous replication in a defined host cell including a target cell or tissue or a progenitor cell or tissue thereof, or be integrable with the genome of the defined host such that
the cloned sequence is reproducible. Accordingly, the vector can be an autonomously replicating vector, i.e., a vector that exists as an extra-chromosomal entity, the replication of which is independent of chromosomal replication, e.g., a linear or closed circular plasmid, an extra-chromosomal element, a mini-chromosome, or an artificial chromosome. The vector can contain any means for assuring self-replication. Alternatively, the vector can be one which, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated. Such a vector may comprise specific sequences that allow recombination into a particular, desired site of the host chromosome. A vector system can comprise a single vector or plasmid, two or more vectors or plasmids, which together contain the total DNA to be introduced into the genome of the host cell, or a transposon. The choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced. I n the present case, the vector is preferably one which is operably functional in a bacterial cell, such as a cyanobacterial cell. The vector can include a reporter gene, such as a green fluorescent protein (GFP), which can be either fused in frame to one or more of the encoded polypeptides, or expressed separately. The vector can also include a selection marker such as an antibiotic resistance gene that can be used for selection of suitable transformants.
The terms "wild-type" and "naturally occurring" are used interchangeably to refer to a gene or gene product that has the characteristics of that gene or gene product when isolated from a naturally occurring source. A wild type gene or gene product (e.g., a polypeptide) is that which is most freq uently observed in a population and is thus arbitrarily designated as the "normal" or "wild-type" form of the gene.
Modified Photosynthetic Microorganisms and Methods of Producing Neutral Lipids
Certain embodiments of the present invention relate to modified photosynthetic microorganisms, e.g., Cyanobacteria, and methods of use thereof, wherein the modified photosynthetic microorganism comprises one or more polynucleotides encoding one or more enzymes associated with lipid and/or neutral lipid biosynthesis (such as, for example triglyceride, fatty acid ester, wax ester, and/or carotenoid biosynthesis) and at least one
exogenous polynucleotide encoding at least one protein associated with lipid packaging. According to certain embodiments, the one or more polynucleotides encoding one or more enzymes associated with triglyceride biosynthesis include but are not limited to a DGAT and/or a phosphatidate phosphatase. According to other embodiments, the at least one protein associated with lipid packaging includes polyhydroxyalkanoatel P (Phal P), triacylglycerol accum ulation deficient A (Tad A), Caleosin 1 1 (Clol 1), or proteins evolutionarily related to these proteins such as plastoglobulin 1 (pgl 1), plastoglobulin 2 (pgl 2), plastoglobulin-like (pgl-like), or a combination thereof. The present invention contemplates the use of naturally-occurring and non-naturally-occurring variants of these lipid packaging proteins and these DGAT and phosphatidate phosphatase enzymes, as well as variants of their encoding polynucleotides. In certain aspects, the DGAT encoding polynucleotide sequence is derived from Acinetobacter baylii (ADP1-DGAT), and the phosphatidate phosphatase encoding polynucleotide sequence is from Saccharomyces cerevisiae (yPahl). These enzyme encoding sequences, however, may be derived from any organism having a suitable DGAT or phosphatidate phosphatase enzyme, and may also include any man-made variants thereof, such as any optimized coding sequences (i.e., codon-optimized polynucleotides) or optimized polypeptide sequences. Exemplary polypeptide and polynucleotide sequences are described infra and include the lipid packaging protein derived from pgl 1 (Synechocystis) (see SEQ ID NOs:240 and 239 for polypeptide and polynucleotide sequence, respectively), pgl 2 (Synechocystis) (see SEQ I D NOs:242 and 241 for polypeptide and polynucleotide sequence, respectively), pgl-like (Microcystis aeruginosa) (see SEQ I D NOs:244 and 243 for polypeptide and polynucleotide sequence, respectively), pgl-like (Cyanothece sp) (see SEQ ID NOs:246 and 245 for polypeptide and polynucleotide sequence, respectively), pgl-like (Calothrix sp) (see SEQ I D NOs:248 and 247 for polypeptide and polynucleotide sequence, respectively), pgl-like (Nostoc punctiforme) (see SEQ I D NOs:250 and 249 for polypeptide and polynucleotide sequence, respectively), Phal P (Ra. Eutropha) (see SEQ I D NOs:234 and 233 for polypeptide and polynucleotide sequence, respectively), Tad A (Rhodococcus) (see SEQ I D NOs:236 and 235 for polypeptide and polynucleotide sequence, respectively), and Clol 1 (Arabidopsis) (see SEQ I D NOs:238 and 237 for polypeptide and polynucleotide sequence, respectively). These
encoding sequences, however, may be derived from any organism having suitable lipid packaging proteins, and may also include any man-made variants thereof, such as any optimized coding sequences (i.e., codon-optimized polynucleotides) or optimized polypeptide sequences. Exemplary polypeptide and polynucleotide sequences are described infra.
Plastofilobulin Proteins
In certain embodiments, a modified photosynthetic microorganism comprises modulated and/or introduced plastoglobulin 1 (pgl 1), plastoglobulin 2 (pgl 2) and plastoglobulin-like (pgl-like). In specific embodiments, the modified photosynthetic microorganism comprises overexpressed pgl 1, pgl 2, and pgl-like. The pgl 1 and pgl 2 in the cyanobacterium Synechocystis sp. PCC 6803 (slll568 and slrl024) encode polypeptides similar in sequence to plant plastoglobulins. These two genes may be inactivated individually and in combination by insertion of antibiotic resistance cassettes into convenient restriction enzyme sites in their open reading frames. See, for example, Cunningham (JB, 192, No. 6: 1700-1709, 2010) and the references cited therein.
Polyhydroxyalkanoate Proteins
In certain embodiments, a modified photosynthetic microorganism comprises modulated and/or introduced polyhydroxyalkanoatel P (Phal P). Pha 1 has been shown to be associated with TAG bodies if it is ectopically expressed in Acinetobacter. In addition, Pha 1 may be associated with a diversity of intracellular carbon storage bodies. Phal represents the major phasin on the surface of PHA inclusions in Ra. eutropha H16, plays an important role in the formation and structure of these inclusions, because its presence or absence affects the number and size of the inclusions, and the amount of PHB in the cells. According to the most accepted model, PHA inclusions are formed from soluble PHA synthases that polymerize the 3-hydroxybutyrate (3HB) of 3HB-CoA to PHB, with concomitant release of CoA. Since PHA synthases remain covalently linked to the growing PHB chain, an amphiphilic complex composed of the hydrophilic synthase and the elongating polymer chain is formed. These complexes are thought to aggregate to form micelle-like structures, which enlarge to PHA granules due to subsequent extension of the PHA chains. During granule growth, phasins and phospholipids are thought to migrate into the exposed hydrophobic surface of the polymer
core, thereby generating an interphase between the hydrophobic core and the cytoplasm. See, for example, Hanisch (Microbiology, 152: 3271-3280, 2006) and the references cited therein.
Triacylfilycerol accumulation deficient Proteins
I n certain embodiments, a modified photosynthetic microorganism comprises modulated and/or introduced triacylglycerol accumulation deficient A (Tad A), which refers to a gene from Rhodococcus that encodes a putative heparin-binding 63 hemagglutinin homolog that is important for TAG accumulation. See, for example, MacEachran (AEM, 76(21): 7217-7227, 2010) and the references cited therein.
Caleosin and/or Oleosin Proteins
I n certain embodiments, a modified photosynthetic microorganism comprises modulated and/or introduced Caleosin and/or Oleosin protein from A. thaliana (Clol). Caleosin and oleosin are embedded in the half unit membranes of oil bodies in many plant seeds. Functionally, oleosin is believed to coat the oil body surface and thereby prevent coalescence of adjacent oil bodies. Both proteins have similar proposed structures including N-and C-terminal cytosolic ends, and long hydrophobic internal segments with centrally located proline knot motifs. It has been hypothesized that Caleosin's function in oil bodies involves oil body biogenesis from the ER. Caleosins are a multigene family in Arabidopsis, with a single isoform, Clol, predominantly expressed in seeds. Clol, a lipid body associated transmembrane protein has been implicated in lipid body generation, size maintenance, and degradation of stored lipid; algal genomes, including C. reinhardtii genomes, lack genes encoding oleosin homologues but encode caleosin homologues. See, for example, Wang (EC, 8(12): 1856-1868, 2009) and the references cited therein.
I n certain embodiments, the modified photosynthetic microorganisms may comprise two or more polynucleotides that encode DGAT or a variant or fragment thereof. In particular embodiments, the two or more polynucletoides are identical or express the same DGAT. I n certain embodiments, these two or more polynucleotides may be different or may encode two different DGAT polypeptides. For example, in one embodiment, one of the polynucleotides may encode ADGATd, while another polynucleotide may encode ScoDGAT. In particular
embodiments, the following DGATs are coexpressed in modified photosynthetic microorganisms, e.g., Cyanobacteria, using one of the following double DGAT strains: ADGATd(NSl)::ADGATd(NS2); ADGATn(NSl): :ADGATn(NS2); ADGATn(NSl)::SDGAT(NS2); SDGAT(NSl)::ADGATn(NS2); SDGAT(NS1)::SDGAT(NS2). For the NS1 vector, pAM2291, EcoRI follows ATG and is part of the open reading frame (ORF). For the NS2 vector, pAM 1579, EcoRI follows ATG and is part of the ORF. A DGAT having EcoRI nucleotides following ATG may be cloned in either pAM2291 or pAM 1579; such a DGAT is referred to as ADGATd. Other embodiments utilize the vector, pAM2314FTrc3, which is an NS1 vector with Nde/8glll sites, or the vector, pAM1579FTrc3, which is the NS2 vector with Nde/8glll sites. A DGAT without EcoRI nucleotides may be cloned into either of these last two vectors. Such a DGAT is referred to as ADGATn. As shown in the accompanying Examples, modified photosynthetic microorganisms expressing different DGATs express TAGs having different fatty acid compositions. Accordingly, certain embodiments of the present invention contemplate expressing two or more different DGATs, in order to produce DAGs having varied fatty acid compositions. Related embodiments contemplate expressing two or more different phosphatidate phosphatase and/or two or more different acetyl-CoA carboxylases.
Embodiments of the present invention also relate to modified photosynthetic microorganisms, e.g., Cyanobacteria, and methods of use thereof, wherein the modified Cyanobacteria comprise one or more polynucleotides encoding enzymes associated with fatty acid biosynthesis and at least one polynucleotide encoding proteins associated with lipid packaging, wherein said polynucleotides are exogenous to the Cyanobacterium's native genome. In certain aspects, the enzymes associated with fatty acid synthesis comprise an acetyl-CoA carboxylase (ACCase) activity, including naturally-occurring and non-naturally-occurring functional variants of such enzymes and their encoding polynucleotides. In certain embodiments, the polynucleotide sequence encoding the ACCase enzyme is derived from Saccharomyces cerevisiae (yAccl). As above, however, these ACCase enzyme encoding sequences may be derived from any organism having a suitable ACCase enzyme, and may also include any man-made variants thereof, such as any optimized coding seq uences (i.e., codon-optimized polynucleotides) or optimized polypeptide sequences.
In certain embodiments, the modified photosynthetic microorganisms may comprise at least one DGAT polypeptide fused to at least one heterologous intracellular localization domain, such as a bacterial membrane-targeting domain. Such fusion proteins can be partially or fully isolated from other cellular components, or expressed, for example, in cell-free systems or a host cell, such as a modified photosynthetic microorganism. In other embodiments, the modified photosynthetic microorganisms may comprise at least one DGAT polypeptide fused to at least one lipid packaging protein. In certain aspects, the modified photosynthetic microorganisms described herein can optionally comprise any combination of one or more overexpressed or introduced lipid biosynthesis proteins and/or one or more overexpressed or introduced proteins associated with glycogen breakdown. Examples of lipid biosynthesis proteins include acyl carrier proteins (ACP), acyl ACP synthases (Aas), acyl-ACP reductases, alcohol dehydrogenases, aldehyde dehydrogenases, aldehyde decarbonylases, thioesterases (TES), acetyl coenzyme A carboxylases (ACCase), phosphatidic acid phosphatases (PAP; or phosphatidate phosphatases), triacylglycerol (TAG) hydrolases, fatty acyl-CoA synthetases, and lipases/phospholipases, fatty acyl reductase (FAR), as described herein. Exemplary proteins associated with glycogen breakdown are described infra.
Since, as noted above, fatty acids provide the starting material for neutral lipids such as triglyceride production, genetically modified Cyanobacteria having increased fatty acid production may be utilized to improve the overall production of triglycerides. Accordingly, certain embodiments relate to modified Cyanobacteria, and methods of use thereof, wherein the Cyanobacteria comprise one or more polynucleotides encoding enzymes associated with fatty acid synthesis and triglyceride synthesis and at least one polynucleotide encoding proteins associated with lipid packaging. As such, in certain embodiments, the modified Cyanobacteria of the present invention comprise one or more polynucleotides encoding enzymes that comprise a DGAT activity and/or a phosphatidate phosphatase activity, as well as an exogenous enzyme comprising an ACCase activity, and at least one polynucleotide encoding proteins that comprises a lipid packaging activity.
Embodiments of the present invention also include methods of producing neutral lipids such as triglycerides in a photosynthetic microorganism, e.g., a Cyanobacterium, comprising
introducing one or more polynucleotides encoding one or more enzymes associated with triglyceride biosynthesis and at least one polynucleotide encoding at least one protein associated with lipid packaging activities into a photosynthetic microorganism, incubating the photosynthetic microorganism for a time and under conditions sufficient to allow neutral lipids such as triglyceride production, thereby producing neutral lipids such as triglycerides in the photosynthetic microorganism. Also contemplated are methods of producing a neutral lipid in a photosynthetic microorganism, e.g., a Cyanobacterium, comprising culturing a photosynthetic microorganism comprising one or more polynucleotides encoding one or more enzymes associated with triglyceride biosynthesis and at least one polynucleotide encoding at least one protein associated with lipid packaging activities, for a time and under conditions sufficient to allow neutral lipid production. In certain embodiments, the one or more enzymes comprise a DGAT enzymatic activity and/or a phosphatidate phosphatase enzymatic activity. I n particular embodiments, the one or more enzymes comprise both a DGAT enzymatic activity and a phosphotidate phosphatase enzymatic activity. In certain embodiments the one or more enzymes comprise an acyl-CoA carboxylase (ACCase) enzymatic activity, as well as a DGAT enzymatic activity and/or a phosphatidate phosphatase enzymatic activity. I n one embodiment, the one or more enzymes comprise a DGAT enzymatic activity, a phosphotidate phosphatase enzymatic activity, and an acyl-CoA carboxylase (ACCase) enzymatic activity. I n particular embodiments, one or more of the polynucleotides are exogenous to the photosynthetic microorganism's native genome. In certain embodiments, the at least one protein comprises a lipid packaging protein. In particular embodiments, the at least one protein comprises pgl 1, pgl 2, pgl-like, Phal P, Tad A, Clol 1. In particular embodiments, the at least one polynucleotide is exogenous to the photosynthetic microorganism's native genome.
The present invention also relates to methods of producing an increased amount of fatty acid, e.g., a free fatty acid, in a photosynthetic microorganism, e.g., a Cyanobacterium, comprising introducing one or more polynucleotides encoding one or more enzymes associated with fatty acid biosynthesis and at least one polynucleotide encoding at least one protein associated with lipid packaging activities into a photosynthetic microorganism, culturing the Cyanobacterium for a time and under conditions sufficient to allow increased neutral lipid
production, thereby producing an increased amount of neutral lipids in the Cyanobacterium. Also contemplated are methods of producing an increased amount of neutral lipids in a photosynthetic microorganism, e.g., a Cyanobacterium, comprising culturing a photosynthetic microorganism comprising one or more polynucleotides encoding one or more enzymes associated with fatty acid biosynthesis and at least one polynucleotide encoding at least one protein associated with lipid packaging activities. In certain embodiments, one or more of the polynucleotides are exogenous to the photosynthetic microorganism's native genome. In certain embodiments, the one or more enzymes comprise an ACCase enzymatic activity. I n producing neutral lipids such as triglycerides, the modified Cyanobacteria of the present invention may be cultured according to routine techniques known in the art and exemplified herein, such as photobioreactor based culture techniques. In certain embodiments, the at least one polynucleotide is exogenous to the photosynthetic microorganism's native genome. In certain embodiments, the at least one polynucleotide comprises a lipid packaging activity. I n producing neutral lipids, the modified Cyanobacteria of the present invention may be cultured according to routine techniques known in the art and exemplified herein, such as photobioreactor based culture techniques.
The present invention also relates to methods of preparing a modified photosynthetic microorganism, e.g., a modified Cyanobacterium, such as by genetic modification, to increase production of naturally-occurring neutral lipids, e.g., free fatty acids, and/or to produce neutral lipids such as triglycerides. Photosynthetic microorganisms, such as Cyanobacteria, can be genetically modified according to routine techniques known in the art, such as by transformation of vectors suitable for use in Cyanobacteria. In certain aspects, genetic manipulation in Cyanobacteria can be performed by the introduction of non-replicating vectors which contain native Cyanobacterial seq uences, exogenous genes of interest, and drug resistance genes. Upon introduction into the Cyanobacterial cell, the vectors may be integrated into the Cyanobacterial genome through homologous recombination. I n this way, the exogenous gene of interest and the drug resistance gene are stably integrated into the Cyanobacterial genome. Such recombinants cells can then be isolated from non-recombinant cells by drug selection. Examples of suitable vectors are provided herein.
Embodiments of the present invention include methods of producing neutral lipids in a Cyanobacterium, comprising introducing one or more polynucleotides encoding one or more enzymes associated with triglyceride biosynthesis a nd at least one polynucleotide encoding at least one protein associated with lipid packaging activities into a Cyanobacterium, such as an enzyme having a phosphatidate phosphatase activity and/or an enzyme having a diacylglycerol acyltransferase activity, and the protein having lipid packaging activity.
I n certain aspects, genetically modified photosynthetic microorganisms, e.g., Cyanobacteria, may be prepared by (i) introducing one or more desired polynucleotides encoding one or more enzymes associated with triglyceride biosynthesis and lipid packaging activity into a photosynthetic microorganism, and (ii) selecting for, and/or isolating, photosynthetic microorganisms that comprise the one or more desired polynucleotides. As one example, selection and isolation may include the use of antibiotic resistant markers known in the art (e.g., kanamycin, spectinomycin, and streptomycin). In certain embodiments, genetically modified photosynthetic microorganisms, e.g., Cyanobacteria, may be prepared by (i) introducing one or more desired, exogenous polynucleotides encoding one or more enzymes associated with fatty acid biosynthesis and at least one polynucleotide encoding at least a protein associated with lipid packaging activities into a photosynthetic microorganism, e.g., a Cyanobacteria, and (ii) selecting for, and/or isolating, photosynthetic microorganisms that comprise one or more desired, exogenous polynucleotides. In certain embodiments, genetically modified photosynthetic microorganisms may be prepared by (i) introducing one or more desired polynucleotides encoding one or more enzymes associated with fatty acid biosynthesis and triglyceride synthesis and at least one polynucleotide encoding at least a protein associated with lipid packaging activities into a photosynthetic microorganism, and (ii) selecting for, and/or isolating, photosynthetic microorganisms that comprise the one or more desired polynucleotides.
I n certain embodiments, the one or more enzymes associated with triglyceride synthesis com prise a diacylglycerol acyltransferase (DGAT) enzymatic activity or a phosphatidate phosphatase enzymatic activity. I n some embodiments, the one or more enzymes associated with triglyceride synthesis comprise both a DGAT enzymatic activity and a phosphatidate
phosphatase enzymatic activity. In certain embodiments the one or more enzymes associated with fatty acid biosynthesis comprise an acyl-CoA carboxylase (ACCase) enzymatic activity. In particular embodiments, the one or more enzymes associated with triglyceride synthesis comprise an acyl-CoA carboxylase (ACCase) enzymatic activity and either a DGAT enzymatic activity or a phosphatidate phosphatase enzymatic activity. In one embodiment, the one or more enzymes associated with triglyceride synthesis comprise an acyl-CoA carboxylase (ACCase) enzymatic activity, a DGAT enzymatic activity, and a phosphatidate phosphatase enzymatic activity.
Modified photosynthetic microorganisms of the present invention may include intracellular localization domain-DGAT "fusion proteins," comprising at least one DGAT polypeptide fused to at least one heterologous intracellular localization domain, such as a bacterial membrane-targeting domain or at least one DGAT polypeptide fused to at least one lipid packaging protein. Polypeptides
Embodiments of the present invention include DGAT fusion proteins, comprising at least one DGAT polypeptide fused to at least one heterologous intracellular localization domain, such as a bacterial membrane-targeting domain. Such fusion proteins can be partially or fully isolated from other cellular components, or expressed, for example, in cell-free systems or a host cell, such as a modified photosynthetic microorganism.
Further to comprising a DGAT fusion protein described herein, the modified photosynthetic microorganisms described herein can optionally comprise any combination of one or more overexpressed or introduced lipid biosynthesis proteins and/or one or more overexpressed or introduced proteins associated with glycogen breakdown. Examples of lipid biosynthesis proteins include acyl carrier proteins (ACP), acyl ACP synthases (Aas), acyl-ACP reductases, alcohol dehydrogenases, aldehyde dehydrogenases, aldehyde decarbonylases, thioesterases (TES), acetyl coenzyme A carboxylases (ACCase), phosphatidic acid phosphatases (PAP; or phosphatidate phosphatases), triacylglycerol (TAG) hydrolases, fatty acyl-CoA
synthetases, and lipases/phospholipases, fatty acyl reductases (FAR), as described herein. Exemplary proteins associated with glycogen breakdown are described infra.
In specific instances, photosynthetic microorganisms may optionally comprise reduced, eliminated, or non-functional expression (e.g., expression of a deletion mutant with reduced or no functional activity) of one or more endogenous lipid biosynthesis proteins. In particular aspects, for example, in the production of wax esters, modified photosynthetic microorganisms such as Synechococcus may optionally comprise reduced, eliminated, or non-functional expression of one or more aldehyde decarbonylases (e.g., orfl593), aldehyde dehydrogenases (e.g., orf0489), or both. In specific aspects, a modified photosynthetic microorganism may optionally comprise reduced, eliminated, or non-functional expression of an Aas polypeptide.
Any of these modified photosynthetic microorganisms may optionally comprise reduced, eliminated, or non-functional expression of one or more proteins associated with glycogen biosynthesis, either alone or in combination with overexpressed lipid biosynthesis proteins and/or overexpressed glycogen breakdown proteins, or in combination with any other polypeptide-related modification described herein.
As will be apparent, modified photosynthetic microorganisms of the present invention may comprise any combination of one or more of the additional modifications noted herein, typically as long as they express at least one intracellular localization domain-DGAT fusion protein. It is further understood that the compositions and methods of the present invention may be practiced using biologically active variants and/or fragments of any of the polypeptides described herein.
(i) Intracellular Localization Domain-DGA T Fusion Proteins
As noted above, embodiments of the present invention include intracellular localization domain-DGAT "fusion proteins," comprising at least one DGAT polypeptide fused to at least one heterologous intracellular localization domain, such as a bacterial membrane-targeting domain.
"Fusion proteins" are defined elsewhere herein and well known in the art, as are methods of making fusion proteins. Fusion proteins may be prepared using standard techniques. For example, DNA sequences encoding the polypeptide components of a desired
fusion may be assembled separately, and ligated into an appropriate expression vector. The 3' end of the DNA sequence encoding one polypeptide component can be ligated, with or without a peptide linker (described below), to the 5' end of a DNA sequence encoding the second (or third, fourth, etc.) polypeptide component so that the reading frames of the sequences are in phase. This permits translation into a single fusion protein that retains the biological activity of both component polypeptides.
The ligated DNA sequences may be operably linked to suitable transcriptional or translational regulatory elements. The regulatory elements responsible for expression of DNA are typically located 5' to the DNA sequence encoding the first polypeptide (e.g., the membrane-targeting domain). Similarly, stop codons required to end translation and transcription termination signals are typically present 3' to the DNA seq uence encoding the second (or third, fourth, etc.) polypeptide.
I n the DGAT fusion proteins described herein, the intracellular localization or targeting domain can be fused to the N-terminus of the DGAT polypeptide, the C-terminus of the DGAT polypeptide, internally, or any combination thereof. For internal fusions, the intracellular localization or targeting domain can be fused to the DGAT polypeptide within the N-terminal region (e.g., within about the first 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100 or so amino acids), at an internal region (between the N-terminal and C-terminal regions), and/or within the C-terminal region (e.g., within about the last 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100 or so amino acids). Preferably, the intracellular localization or targeting domain is fused to the N-terminus of the DGAT polypeptide, or near the N-terminus, for example, within about the 1-100 amino acids. In particular embodiments, the intracellular localization or targeting domain is fused to the second amino acid of a DGAT polypeptide, resulting in the removal of the first residue (the methionine residue of the AUG codon).
The intracellular localization domains described herein alter the intracellular localization of the DGAT protein(s) to which they are fused. Such alterations can thus be measured relative to the localization of the corresponding wild-type DGAT protein(s). I n the most general aspects, a DGAT fusion protein is "targeted to" or "selectively localizes" to one or more defined intracellular region(s) of a photosynthetic microorganism, where at least about 30%, 40%, 50%,
60%, 70%, 80%, 90%, 95%, 98%, or nearly 100% of the DGAT fusion protein can be found associated with the defined intracellular region(s), relative to a cytoplasmic or a soluble fraction of the microorganism. In particular aspects, the intracellular region is one or more of the plasma membrane, a thylakoid, a vesicle, a lipid body, a glycogen granule, a polyhydroxybutyrate (PHB) body, a carboxysome, a cyanophycin granule, and/or an intracellular membrane, such as an intracellular membrane associated with a thylakoid, vesicle, lipid body, glycogen granule, PHB body, carboxysome, and/or cyanophycin granule. I n certain embodiments, the enzymatic domain(s) of the DGAT fusion protein selectively localizes to the cytoplasmic side of the intracellular region(s) or associated membranes.
I n more particular embodiments, a DGAT fusion protein is "targeted to" or "selectively localizes" to one or more membrane(s) of a photosynthetic microorganism, where at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, or nearly 100% of the fusion protein can be found associated with at least one membrane (e.g., a membrane fraction) upon expression in a given photosynthetic microorganism (e.g., Cyanobacteria), relative to other cellular spaces, such as a cytoplasmic or soluble fraction of the cell. Examples of membranes include the plasma membrane and any intracellular membranes, such as intracellular membranes associated with a thylakoid, vesicle, lipid body, glycogen granule, PHB body, carboxysome, and/or cyanophycin granule. In some embodiments, the enzymatic domain(s) of the DGAT fusion protein selectively localize to the cytoplasmic side of the membrane.
In more specific embodiments, a DGAT fusion protein is "targeted to" or "selectively localizes" to the plasma membrane of a photosynthetic microorganism, where at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, or nearly 100% of the fusion protein can be found at the plasma membrane (or associated with the plasma membrane) upon expression in a given photosynthetic microorganism (e.g.,. Cyanobacteria), relative to other cellular spaces, such as the cytoplasm, vesicles, lipid bodies, thylakoids, glycogen granules, PHB bodies, carboxysomes, cyanophycin granules, other intracellular membranes (e.g., thylakoid membranes, lipid body membranes, vesicle membranes), or any combination thereof. Thylakoids consist of a thylakoid membrane surrounding a thylakoid lumen, and are the site of
photosynthesis. In certain of these and related embodiments, the enzymatic domain(s) of the DGAT fusion protein selectively localize to the cytoplasmic side of the plasma membrane.
In particular aspects, the fusion to a heterologous intracellular localization domain limits the potential for DGAT-mediated photosystem disruption, generation of reactive oxygen species, and loss of cell viability, and thereby improves the cell growth phenotype of DGAT-expressing and lipid-producing photosynthetic microorganisms.
Intracellular Localization Domains. Generally, the intracellular localization domain sequences of the DGAT fusion proteins described herein can be obtained from any one or more signal or other sequences that selectively localize a given protein to a defined intracellular region (for instance, relative to dispersal throughout the cytoplasm), such as the plasma membrane, a thylakoid, a vesicle, a lipid body, a glycogen granule, a PHB body, a carboxysome, a cyanophycin granule, or an intracellular membrane. Particular examples thus include membrane-, thylakoid-, vesicle-, lipid body-, glycogen granule-, polyhydroxybutyrate (PHB) body-, carboxysome-, and cyanophycin granule-targeting domains, including domains that target DGAT to the membranes associated with these intracellular regions. In specific instances, the intracellular localization domain selectively localizes the active domain(s) of DGAT to the cytoplasmic side of the intracellular region, so that DGAT can interact with lipid-producing substrates in the cytoplasm.
The intracellular localization domain can be any length that is sufficient to selectively localize the DGAT fusion protein(s) to an intracellular region of a membrane, such as the plasma membrane, and/or alter its relation to other cell membranes or substrates, and allow the enzymatic portions of the DGAT polypeptide to interact with lipid-producing substrates in the cytoplasm. For instance, in certain embodiments, the intracellular localization domain can be anywhere from about 10-1000 amino acids in length, including about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 980, 990, 1000 or more amino acids in
length, including all integers and ranges in between (e.g., 20-100, 30-100, 40-100, 50-100, 20-200, 30-200, 40-200, 50-200 amino acids in length).
I n particular embodiments, the intracellular localization domain is a membrane-targeting or plasma membrane (PM)-targeting domain. Such membrane-targeting sequences can be obtained or derived from any combination of N-terminal leader sequence(s), transmembrane domain sequence(s), and/or integral membrane seq uence(s) of a bacterial membrane protein, such as a bacterial plasma membrane protein. In certain instances, such bacterial plasma membrane proteins (in their endogenous state) selectively localize to the plasma membrane, and are characterized by having at least one C-terminal region that is localized to the cytoplasmic side of a bacterial plasma membrane, and/or the periplasmic side of the outer membrane (for plasma membrane proteins derived from gram-negative bacteria).
A bacterial plasma membrane protein "selectively localizes" to the plasma membrane where at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, or nearly 100% of the protein can be found at the plasma membrane upon (preferably endogenous) expression in the bacteria from which it is derived (e.g., gram-positive bacteria, gram-negative bacteria, photosynthetic bacteria, Cyanobacteria), relative to other cellular spaces, such as the cytoplasm, the cell wall, other cellular Organelles' or membranes, such as thylakoid membranes for certain photosynthetic bacteria, or any combination thereof.
I n certain embodiments, the membrane-targeting or PM -targeting domain may com prise an amino acid seq uence of an N-terminal leader sequence, an amino acid seq uence of one or more transmembrane domains (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more transmembrane domains), an amino acid seq uence of one or more integral membrane domains, or any combination thereof. When combined, the sequences of the N-terminal leader, the transmembrane domain(s), and/or the integral membra ne domain(s) can be from the same or different bacterial plasma membrane protein(s).
Membrane-targeting or PM-targeting domain seq uences can be obtained from (or derived from) the signal sequences, transmembrane domains, or integral membrane domains of any variety of bacterial membrane proteins. For instance, the bacterial membrane protein can be an integral membrane protein (I M P), such as a transmembrane protein (TP). I n some
instances, the membrane-targeting domain is obtained from a single-pass transmembrane protein, having only one domain that spans the lipid bilayer of the plasma membrane, or a multi-pass transmembrane protein, having about 2, 3, 4, 5, 6, 7, 8, 9, 10 or more domains that span the lipid bilayer of the plasma membrane. For the latter, the membrane-targeting domain can comprise any one or more of the multiple transmembrane domains. In certain aspects, the membrane-targeting domain or transmembrane domain (TMD) can comprise an alpha-helical transmembrane structure, or a beta-barrel transmembrane structure, the latter typically deriving from gram-negative outer membrane proteins.
In some embodiments, the membrane-targeting or PM-targeting domain does not span the entire lipid bilayer, but inserts into or attaches to the cytoplasmic side of the membrane, such as the plasma membrane. Examples include membrane-targeting domains that interact with the membrane by an amphipathic helix (e.g., parallel to the membrane plane), membrane-targeting domains that interact with the membrane by a hydrophobic loop, and membrane-targeting domains that interact with the membrane by electrostatic or ionic interactions, for example, through calcium ions.
In some embodiments, the membrane-targeting domain sequence is obtained from a membrane protein or plasma membrane protein of one or more gram-negative bacteria, gram-positive bacteria, or other bacteria, such as Cyanobacteria. Exemplary bacteria are described elsewhere herein and known in the art.
In particular embodiments the membrane-targeting or PM-targeting domain sequence is obtained from a membrane protein or plasma membrane protein of a photosynthetic bacteria, such as a Cyanobacteria from the genera Aphanocapsa, Aphanothece, Chamaesiphon, Chroococcus, Chroogloeocystis, Coelosphaerium, Crocosphaera, Cyanobacterium, Cyanobium, Cyanodictyon, Cyanosarcina, Cyanothece, Dactylococcopsis, Gloecapsa, Gloeothece, Merismopedia, Microcystis, Radiocystis, Rhabdoderma, Snowella, Synychococcus, Synechocystis, Thermosenechococcus, and Woronichinia; Nostacales Cyanobacteria from the genera Anabaena, Anabaenopsis, Aphanizomenon, Aulosira, Calothrix, Coleodesmium, Cyanospira, Cylindrospermosis, Cylindrospermum, Fremyella, Gleotrichia, Microchaete, Nodularia, Nostoc, Rexia, Richelia, Scytonema, Sprirestis, Toypothrix, Oscillatoriales; Cyanobacteria from the
genera Arthrospira, Geitlerinema, Halomicronema, Halospirulina, Katagnymene, Leptolyngbya, Limnothrix, Lyngbya, Microcoleus, Oscillatoria, Phormidium, Planktothricoides, Planktothrix, Plectonema, Pseudoanabaena/Limnothrix, Schizothrix, Spirulina, Symploca, Trichodesmium, Tychonema; Pleurocapsales Canobacterium from the genera Chroococcidiopsis, Dermocarpa, Dermocarpella, Myxosarcina, Pleurocapsa, Stanieria, Xenococcus, Prochlorophytes; Cyanobacterium from the genera Prochloron, Prochlorococcus, Prochlorothrix; and Stigonematales cyanobacterium from the genera Capsosira, Chlorogeoepsis, Fischerella, Hapalosiphon, Mastigocladopsis, Nostochopsis, Stigonema, Symphyonema, Symphonemopsis, Umezakia, and Westiellopsis. In certain embodiments, the Cyanobacterium is from the genus Synechococcus, including, but not limited to Synechococcus bigranulatus, Synechococcus elongatus, Synechococcus leopoliensis, Synechococcus lividus, Synechococcus nidulans, and Synechococcus rubescens. In specific embodiments, the membrane-targeting domain sequence is derived from a plasma membrane protein from 5. elongatus sp. strain PCC7942 or Synechococcus sp. PCC 7002 (originally known as Agmenellum quadruplicatum).
In some embodiments, the membrane protein is a plasma membrane receptor protein, such as a chemoreceptor or chemotaxis protein. Particular examples include integral membrane chemoreceptors, e.g., transmembrane chemoreceptors. Examples of chemoreceptors or chemotaxis proteins include methyl-accepting chemotaxis proteins and amino acid chemotaxis receptors, such as serine chemotaxis receptors (e.g., Tsr receptor from Escherichia coli) and aspartate chemotaxis receptors. The membrane-targeting domain can thus be obtained from the signal sequence and/or transmembrane domains of any one or more of such bacterial plasma membrane receptors.
In particular embodiments, the membrane-targeting domain is obtained from (or derived from) a methyl-accepting chemotaxis protein (MCP). MCPs can be classified by topology type (see Zhulin, Adv Microb Physiol. 45:157-198, 2001) and signaling domain class (see Alexander and Zhulin, PNAS USA. 104:2885-2890, 2007). Topology type I MCPs have large periplasmic ligand-binding domains and an elongated cytoplasmic region consisting of a HAMP domain (i.e., histidine kinases, adenylyl cyclases, methyl-binding proteins, and phosphatases) followed by a signaling domain, which in turn is composed of "methylation," "flexible bundle,"
and "signaling" sub-domains (see Alexander and Zhulin, supra; and Hazelbauer et al., Trends Biochem Sci. 33:9-19, 2008). MCPs cluster together with other chemotaxis proteins in large arrays at the cell pole.
MCP arrays from variety of bacteria have been well-characterized, including, for example, E. coli, C. crescentus, Thermotoga maritima, Magnetospirillum magneticum, Rhodobacter sphaeroides, Treponema primitia, Listeria monocytogenes, Helicobacter hepaticus, Acetonema longum, Borrelia burgdorferi, Halothiobacillus neapolitanus, and Campylobacter jejuni (see Briegel et al., PNAS USA. 106:17181-17186, 2009). The membrane-targeting domain can thus be derived from the signal sequence and/or transmembrane domains of an MCP from any one or more of these bacteria, or an MCP from any other bacteria described herein or known in the art.
In specific embodiments, the MCP is encoded by PCC7942-0858 or PCC7942-1015 from 5. elongatus. The polypeptide and polynucleotide sequence of the 5. elongatus PCC7942-0858 MCP are set forth in SEQ ID NOS:199 and 200, respectively, and the polypeptide and polynucleotide sequence of the 5. elongatus PCC7942-1015 MCP are set forth in SEQ ID NOS:201 and 202, respectively.
In some embodiments, the membrane-targeting domain comprises or consists essentially of the N-terminal leader sequence, the first (N-terminal) transmembrane domain, and/or the second transmembrane domain of PCC7942-0858, singly or in combination together. In certain instances, the bacterial membrane-targeting domain comprises or consists essentially of about the N-terminal 43-53 amino acids of the MCP encoded by PCC7942-0858, for example, about residues 1-43, 4-44, 1-45, 1-46, 1-47, 1-48, 1-49, 1-50, 1-51, 1-52, 1-53, 1-54, 1-55, 1-56, 1-57, 1-58, 1-59, 1-60 of SEQ ID NO:199. In specific instances, the bacterial membrane-targeting domain comprises or consists essentially of the N-terminal signal sequence of the MCP encoded by PCC7942-0858, for example, about residues 1-43 of SEQ ID NO:199.
DGAT Polypeptides. As used herein, a "diacylglycerol acyltransferase" (DGAT) polypeptide includes any protein, polypeptide or peptide, obtainable from any cell source, which demonstrates the ability to catalyze the production of triacylglycerol from 1,
2-diacylglycerol and fatty acyl substrates under enzyme reactive conditions, in addition to any naturally-occurring (e.g., allelic variants, orthologs) or non-naturally occurring variants of a diacylglycerol acyltransferase sequence having such ability. DGAT polypeptides of the present invention also include bi-functional proteins, such as those bi-functional proteins that exhibit a DGAT activity as well as a CoA:fatty alcohol acyltransferase activity, e.g., a wax ester synthesis (WES) activity, as often found in many TAG producing bacteria.
DGATs are members of the O-acyltransferase superfamily, which esterify either sterols or diacyglycerols in an oleoyl-CoA-dependent manner. DGAT in particular esterifies diacylglycerols, which reaction represents the fina l enzymatic step in the production of triacylglycerols in plants, fungi and mammals. Specifically, DGAT is responsible for transferring an acyl group from acyl-coenzyme-A to the sn-3 position of 1, 2-diacylglycerol (DAG) to form triacylglycerol (TAG). DGAT is an integral membrane protein that has been generally described in Harwood (Biochem. Biophysics. Acta, 1301:7-56, 1996), Daum et al. (Yeast 16:1471-1510, 1998), and Coleman et al. (Annu. Rev. Nutr. 20:77-103, 2000) (each of which are herein incorporated by reference).
I n plants and fungi, DGAT is associated with the membrane and lipid body fractions. I n catalyzing TAGs, DGAT contributes mainly to the storage of carbon used as energy reserves. In animals, however, the role of DGAT is more complex. DGAT not only plays a role in lipoprotein assembly and the regulation of plasma triacylglycerol concentration (Bell, R. M ., et al. ), but participates as well in the regulation of diacylglycerol levels (see Brindley, Biochemistry of Lipids, Lipoproteins and Membranes, eds. Vance, D. E. & Vance, J. E. (Elsevier, Amsterdam), 171-203; and Nishizuka, Science 258:607-614, 1992, each of which are incorporated by reference).
I n eukaryotes, at least three independent DGAT gene families (DGAT1, DGAT2, and PDAT) have been described that encode proteins with the capacity to form TAG. Yeast contain all three of DGAT1, DGAT2, and PDAT, but the expression levels of these gene families varies during different phases of the life cycle (Dahlqvst, A., et al. Proc. Natl. Acad. Sci. USA 97:6487-6492, 2000, incorporated by reference).
In prokaryotes, WS/DGAT from Acinetobacter calcoaceticus ADP1 represents the first identified member of a widespread class of bacterial wax ester and TAG biosynthesis enzymes. This enzyme comprises a putative membrane-spanning region but shows no sequence homology to the DGAT1 and DGAT2 families from eukaryotes. Under in vitro conditions, WS/DGAT shows a broad capability of utilizing a large variety of fatty alcohols, and even thiols as acceptors of the acyl moieties of various acyl-CoA thioesters. WS/DGAT acyltransferase enzymes exhibit extraordinarily broad substrate specificity. Genes for homologous acyltransferases have been found in almost all bacteria capable of accumulating neutral lipids, including, for example, Acinetobacter baylii, A. baumanii, and M. avium, and M. tuberculosis CDC1551, in which about 15 functional homologues are present (see, e.g., Daniel et al., J. Bacteriol. 186:5017-5030, 2004; and Kalscheuer et al., J. Biol. Chem. 287:8075-8082, 2003).
DGAT proteins may utilize a variety of acyl substrates in a host cell, including fatty acyl-CoA and fatty acyl-ACP molecules. In addition, the acyl substrates acted upon by DGAT enzymes may have varying carbon chain lengths and degrees of saturation, although DGAT may demonstrate preferential activity towards certain molecules.
Like other members of the eukaryotic O-acyltransferase superfamily, eukaryotic DGAT polypeptides typically contain a FYxDWWN (SEQ ID NO:15) heptapeptide retention motif, as well as a histidine (or tyrosine)-serine-phenylalanine (H/YSF) tripeptide motif, as described in Zhongmin et al. (Journal of Lipid Research, 42:1282-1291, 2001) (herein incorporated by reference). The highly conserved FYxDWWN (SEQ ID NO:15) is believed to be involved in fatty Acyl-CoA binding. In certain instances, the DGAT polypeptide portion of the fusion proteins described herein may thus comprise one or more these motifs.
DGAT polypeptides utilized according to the fusion proteins described herein may be isolated from any organism, including eukaryotic and prokaryotic organisms. Eukaryotic organisms having a DGAT gene are well-known in the art, and include various animals (e.g., mammals, fruit flies, nematodes), plants, parasites, and fungi (e.g., yeast such as 5. cerevisiae and Schizosaccharomyces pombe). Examples of prokaryotic organisms include certain actinomycetes, a group of Gram-positive bacteria with high G+C ratio, such as those from the representative genera Actinomyces, Arthrobacter, Corynebacterium, Frankia, Micrococcus,
Mocrimonospora, Mycobacterium, Nocardia, Propionibacterium, Rhodococcus and Streptomyces. Particular examples of actinomycetes that have one or more genes encoding a DGAT activity include, for example, Mycobacterium tuberculosis, M. avium, M. smegmatis, Micromonospora echinospora, Rhodococcus opacus, R. ruber, and Streptomyces lividans.
Additional examples of prokaryotic organisms that encode one or more enzymes having a DGAT activity include members of the genera Acinetobacter, such as A. calcoaceticus, A. baumanii, A. baylii, and members of the generua Alcanivorax. I n certain embodiments, a DGAT polypeptide is from Acinetobacter baylii sp. ADP1, a gram-negative triglyceride forming prokaryote, which contains a well-characterized DGAT (AtfA).
I n particular embodiments, the DGAT polypeptide is an Acinetobacter DGAT (ADGAT), a
Streptomyces DGAT, or an Alcanivorax DGAT. I n specific embodiments, the DGAT polypeptide comprises or consists of a polypeptide sequence set forth in any one of SEQ ID NOs:58, 59, 60, or 61, or a fragment or variant thereof. SEQ I D NO:58 is the seq uence of DGATn; SEQ ID NO:59 is the seq uence of Streptomyces coelicolor DGAT (ScoDGAT or SDGAT); SEQ ID NO:60 is the sequence of Alcanivorax borkumensis DGAT (AboDGAT); and SEQ ID NO:61 is the sequence of DGATd.
I n certain embodiments, the modified photosynthetic microorganisms of the present invention may express two or more intracellular localization domain-DGAT fusion proteins. The DGAT polypeptides may by the same or different. In particular embodiments, the following intracellular localization domain-DGAT fusions are co-expressed in modified photosynthetic microorganisms, e.g., Cyanobacteria, using one of the following double DGAT strains: ADGATd: : ScoDGAT; ADGATd(NSl): :ADGATd(NS2); ADG/\7n(NSl)::ADGATn(NS2);
ADGATn(NSl): :SDGAT(NS2); SDGAT(NSl)::ADGATn(NS2); SDGAT(NS1): :SDGAT(NS2). For the NS1 vector, pAM2291, EcoRI follows ATG and is part of the open reading frame (ORF). For the NS2 vector, pAM 1579, EcoRI follows ATG and is part of the ORF. A DGAT having EcoRI nucleotides following ATG may be cloned in either pAM2291 or pAM 1579; such a DGAT is referred to as ADGATd. Other em bodiments utilize the vector, pAM2314FTrc3, which is an NS1 vector with Nde/Bglll sites, or the vector, pAM 1579FTrc3, which is the NS2 vector with Nde/Bgll l sites. A DGAT without EcoRI nucleotides may be cloned into either of these last two
vectors. Such a DGATis referred to as ADGATn. Modified photosynthetic microorganisms expressing different DGATs express TAGs having different fatty acid compositions. Accordingly, certain embodiments contemplate expressing two or more different intracellular localization domain-DGAT fusions, in order to produce TAGs having varied fatty acid compositions.
Peptide Linkers. In certain embodiments, a peptide linker sequence may be employed to separate the DGAT polypeptide(s) and the heterologous intracellular localization domain(s) by a distance sufficient to ensure that each polypeptide folds into its desired secondary and tertiary structures. Such a peptide linker sequence can be incorporated into the fusion protein using standard techniques well known in the art.
Certain peptide linker seq uences may be chosen based on the following exemplary factors: (1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a secondary structure that could interact with functional epitopes on the first and second polypeptides; (3) their physiological stability; and (4) the lack of hydrophobic or charged residues that might react with the polypeptide functional epitopes, or other features. See, e.g., George and Heringa, J Protein Eng. 15:871-879, 2002.
The linker seq uence can be essentially any length, but is generally from about 1 to about 300 amino acids in length. Particular linkers can have an overall amino acid length of about 1-300 amino acids, 1-250, 1-200 amino acids, 1-150 amino acids, 1-100 amino acids, 1-90 amino acids, 1-80 amino acids, 1-70 amino acids, 1-60 amino acids, 1-50 amino acids, 1-40 amino acids, 1-30 amino acids, 1-20 amino acids, 1-10 amino acids, 1-5 amino acids, 1-4 amino acids, 1-3 amino acids, or about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 ,17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300 or more amino acids in length.
Certain amino acid seq uences which may be usefully employed as linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; M urphy et al., PNAS USA. 83:8258-8262, 1986; U.S. Pat. No. 4,935,233 and U.S. Pat. No. 4,751,180. Particular peptide linker sequences contain Gly, Ser, and/or Asn residues. Other near neutral amino acids, such as Thr and Ala may also be employed in the peptide linker sequence, if desired.
Certain exemplary linkers include Gly, Ser and/or Asn-containing linkers, as follows: [G]x, [Six, [N]x, [GS]X, [GGS]X, [GSS]X, [GSGS]X (SEQ I D NO:203), [GGSG]X (SEQ I D NO:204), [GGGS]X (SEQ I D NO:205), [GGGGS]X (SEQ I D NO:206), [GN]X, [GGN]X, [GNN]X, [GNGN]X (SEQ I D NO:207), [GGNG]X (SEQ I D NO:208), [GGGN]X (SEQ ID NO:209), [GGGGN]X (SEQ I D NO:210) linkers, where x is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50 or more. Other combinations of these and related amino acids will be apparent to persons skilled in the art.
In certain embodiments, however, any one or more of the peptide linkers are optional. For instance, linker seq uences may not required when the bacterial membrane protein (from which the membrane- or PM -targeting domain is derived) and the DGAT polypeptides have non-essential regions (e.g., N-terminal and/or C-terminal amino acids, or for the plasma membrane proteins, regions just downstream of the signal seq uences and/or transmembrane domains) that can be used to separate the functional domains and prevent steric interference.
(ii) Lipid Biosynthesis Proteins
In various embodiments, modified photosynthetic microorganisms of the present invention further comprise one or more exogenous (e.g., introduced) or overexpressed nucleic acids that encode a lipid biosynthesis protein, e.g., a polypeptide having an activity associated with triglyceride biosynthesis or fatty acid biosynthesis, including but not limited to any of those described herein. I n particular instances, a modified photosynthetic microorganism may comprise reduced expression and/or activity of one or more selected lipid biosynthesis proteins. Certain of these proteins are described in greater detail below.
In particular embodiments, the exogenous nucleic acid does not comprise a nucleic acid sequence that is native to the microorganism's genome. In some embodiments, the exogenous nucleic acid comprises a nucleic acid sequence that is native to the microorganism's genome, but it has been introduced into the microorganism, e.g., in a vector or by molecular biology techniques, for example, to increase expression of the nucleic acid and/or its encoded polypeptide in the microorganism. In certain embodiments, the expression of a native or endogenous nucleic acid and its corresponding protein can be increased by introducing a heterologous promoter upstream of the native gene. As noted above, lipid biosynthesis
proteins can be involved in triglyceride biosynthesis, fatty acid synthesis, wax ester synthesis, or any combination thereof.
Triglyceride Biosynthesis. Triglycerides, or triacylglycerols (TAGs), consist primarily of glycerol esterified with three fatty acids, and yield more energy upon oxidation than either carbohydrates or proteins. Triglycerides provide an important mechanism of energy storage for most eukaryotic organisms. In mammals, TAGs are synthesized and stored in several cell types, including adipocytes and hepatocytes (Bell et al. Annu. Rev. Biochem. 49:459-487,1980) (incorporated by reference). I n plants, TAG production is mainly important for the generation of seed oils.
I n contrast to eukaryotes, the observation of triglyceride production in prokaryotes has been limited to certain actinomycetes, such as members of the genera Mycobacterium, Nocardia, Rhodococcus and Streptomyces, in addition to certain members of the genus Acinetobacter. I n certain Actinomycetes species, triglycerides may accumulate to nearly 80% of the dry cell weight, but accumulate to only about 15% of the dry cell weight in Acinetobacter. I n general, triglycerides are stored in spherical lipid bodies, with quantities and diameters depending on the respective species, growth stage, and cultivation conditions. For example, cells of Rhodococcus opacus and Streptomyces lividans contain only few TAGs when cultivated in complex media with a high content of carbon and nitrogen; however, the lipid content and the number of TAG bodies increase drastically when the cells are cultivated in mineral salt medium with a low nitrogen-to-carbon ratio, yielding a maximum in the late stationary growth phase. At this stage, cells can be almost completely filled with lipid bodies exhibiting diameters ranging from 50 to 400 nm. One example is R. opacus PD630, in which lipids can reach more than 70% of the total cellular dry weight.
I n bacteria, TAG formation typically starts with the docking of a diacylglycerol acyltransferase enzyme to the plasma mem brane, followed by formation of small lipid droplets (SLDs). These SLDs are only some nanometers in diameter and remain associated with the membrane-docked enzyme. In this phase of lipid accumulation, SLDs typically form an emulsive, oleogenous layer at the plasma membrane. During prolonged lipid synthesis, SLDs leave the
membrane-associated acyltransferase and conglomerate to membrane-bound lipid prebodies. These lipid prebodies reach distinct sizes, e.g., about 200 nm in A. calcoaceticus and about 300 nm in R. opacus, before they lose contact with the membrane and are released into the cytoplasm. Free and membrane-bound lipid prebodies correspond to the lipid domains occurring in the cytoplasm and at the cell wall, as observed in M. smegmatis during fluorescence microscopy and also confirmed in R. opacus PD630 and A. calcoaceticus ADPl (see, e.g., Christensen et al., Mol. Microbiol. 31:1561-1572, 1999; and Waltermann et al., Mol. Microbiol. 55:750-763, 2005). Inside the lipid prebodies, SLDs coalesce with each other to form the homogenous lipid core found in mature lipid bodies, which often appear opaque in electron microscopy.
The compositions and structures of bacterial TAGs vary considerably depending on the microorganism and on the carbon source. In addition, unusual acyl moieties, such as phenyldecanoic acid and 4,8,12 trimethyl tridecanoic acid, may also contribute to the structural diversity of bacterial TAGs (see, e.g., Alvarez et al., Appl Microbiol Biotechnol. 60:367-76, 2002).
As with eukaryotes, the main function of TAGs in prokaryotes is to serve as a storage compound for energy and carbon. TAGs, however, may provide other functions in prokaryotes. For example, lipid bodies may act as a deposit for toxic or useless fatty acids formed during growth on recalcitrant carbon sources, which must be excluded from the plasma membrane and phospholipid (PL) biosynthesis. Furthermore, many TAG-accumulating bacteria are ubiquitous in soil, and in this habitat, water deficiency causing dehydration is a frequent environmental stress. Storage of evaporation-resistant lipids might be a strategy to maintain a basic water supply, since oxidation of the hydrocarbon chains of the lipids under conditions of dehydration would generate considerable amounts of water. Cyanobacteria such as Synechococcus, however, do not produce triglycerides, because these organisms lack the enzymes necessary for triglyceride biosynthesis.
Triglycerides are synthesized from fatty acids and glycerol. One mechanism of triglyceride (TAG) synthesis, sequential acylation of glycerol-3-phosphate via the "Kennedy Pathway" leads to the formation of phosphatidate. Phosphatidate is then dephosphorylated by the enzyme phosphatidate phosphatase to yield 1,2 diacylglycerol (DAG). Using DAG as a
substrate, at least three different classes of enzymes are capable of mediating TAG formation. As one example, an enzyme having diacylglycerol acyltransferase (DGAT) activity catalyzes the acylation of DAG using acyl-CoA as a substrate. Essentially, DGAT enzymes combine acyl-CoA with 1,2 diacylglycerol molecule to form a TAG. As an alternative, Acyl-CoA-independent TAG synthesis may be mediated by a phospholipid:DAG acyltransferase found in yeast and plants, which uses phospholipids as acyl donors for DAG esterification. Another mechanism of TAG synthesis in animals and plants may be mediated by a DAG-DAG-transacylase, which uses DAG as both an acyl donor and acceptor, yielding TAG and monoacylglycerol.
Modified photosynthetic microorganisms, e.g., Cyanobacteria, of the present invention may comprise one or more exogenous polynucleotides encoding polypeptides comprising one or more of the polypeptides and enzymes described herein.
Since wild type Cyanobacteria do not typically encode the enzymes necessary for triglyceride synthesis, such as the enzymes having diacylglycerol acyltransferase activity, embodiments of the present invention include genetically modified Cyanobacteria that comprise polynucleotides encoding one or more DGAT fusion proteins, optionally in combination with one or more enzymes having a fatty acyl-CoA synthetase activity. In particular embodiments, the one or more exogenous polynucleotides encode a DGAT fusion protein described herein and a fatty acyl-CoA synthetase, or a functional variant or fragment thereof.
Moreover, since triglycerides are typically formed from fatty acids, the level of fatty acid biosynthesis in a cell may limit the production of triglycerides. I ncreasing the level of fatty acid biosynthesis may, therefore, allow increased production of triglycerides. As discussed below, acetyl-CoA carboxylase catalyzes the commitment step to fatty acid biosynthesis. Thus, certain embodiments of the present invention include Cyanobacterium, and methods of use thereof, comprising polynucleotides that encode one or more enzymes having acetyl-CoA carboxylase activity to increase fatty acid biosynthesis and lipid production, in addition to one or more DGAT fusion proteins and optionally one or more enzymes having fatty acyl-CoA synthetase activity, to catalyze triglyceride production. These and related embodiments are detailed below.
Fatty Acid Biosynthesis. Fatty acids are a group of negatively charged, linear hydrocarbon chains of various length and various degrees of oxidation states. The negative charge is located at a carboxyl end group and is typically deprotonated at physiological pH values (pK ~ 2-3).The length of the fatty acid 'tail' determines its water solubility (or rather insolubility) and amphipathic characteristics. Fatty acids are components of phospholipids and sphingolipids, which form part of biological membranes, as well as triglycerides, which are primarily used as energy storage molecules inside cells.
Fatty acids are formed from acetyl-CoA and malonyl-CoA precursors. Malonyl-CoA is a carboxylated form of acetyl-CoA, and contains a 3-carbon dicarboxylic acid, malonate, bound to Coenzyme A. Acetyl-CoA carboxylase catalyzes the 2-step reaction by which acetyl-CoA is carboxylated to form malonyl-CoA. I n particular, ma lonate is formed from acetyl-CoA by the addition of C02 using the biotin cofactor of the enzyme acetyl-CoA carboxylase.
Fatty acid synthase (FAS) carries out the chain elongation steps of fatty acid biosynthesis. FAS is a large multienzyme complex. I n mammals, FAS contains two subunits, each containing multiple enzyme activities. In bacteria and plants, individual proteins, which associate into a large complex, catalyze the individual steps of the synthesis scheme. For example, in bacteria and plants, the acyl carrier protein is a smaller, independent protein.
Fatty acid synthesis starts with acetyl-CoA, and the chain grows from the "tail end" so that carbon 1 and the alpha-carbon of the complete fatty acid are added last. The first reaction is the transfer of an acetyl group to a pantothenate group of acyl carrier protein (ACP), a region of the large mammalian fatty acid synthase (FAS) protein. I n this reaction, acetyl CoA is added to a cysteine -SH group of the condensing enzyme (CE) domain: acetyl CoA + CE-cys-SH -> acetyl-cys-CE + CoASH. Mechanistically, this is a two step process, in which the group is first transferred to the ACP (acyl carrier peptide), and then to the cysteine -SH group of the condensing enzyme domain.
I n the second reaction, malonyl CoA is added to the ACP sulfhydryl group: malonyl CoA + ACP-SH -> malonyl ACP + CoASH. This -SH group is pa rt of a phosphopantethenic acid prosthetic group of the ACP.
I n the third reaction, the acetyl group is transferred to the malonyl group with the release of carbon dioxide: malonyl ACP + acetyl-cys-CE -> beta-ketobutyryl-ACP + C02.
I n the fourth reaction, the keto group is reduced to a hydroxyl group by the beta-ketoacyl reductase activity: beta-ketobutyryl-ACP + NADPH + H+ -> beta-hydroxybutyryl-ACP + NAD+.
I n the fifth reaction, the beta-hydroxybutyryl-ACP is dehydrated to form a trans- monounsaturated fatty acyl group by the beta-hydroxyacyl dehydratase activity: beta-hydroxybutyryl-ACP -> 2-butenoyl-ACP + H20.
I n the sixth reaction, the double bond is reduced by NADPH, yielding a saturated fatty acyl group two carbons longer than the initial one (an acetyl group was converted to a butyryl group in this case): 2-butenoyl-ACP + NADPH + H+ -> butyryl-ACP + NADP+. The butyryl group is then transferred from the ACP sulfhydryl group to the CE sulfhydryl: butyryl-ACP + CE-cys-SH -> ACP-SH + butyryl-cys-CE. This step is catalyzed by the same transferase activity utilized previously for the original acetyl group. The butyryl group is now ready to condense with a new malonyl group (third reaction above) to repeat the process. When the fatty acyl group becomes 16 carbons long, a thioesterase activity hydrolyses it, forming free palmitate: palmitoyl-ACP + H20 -> palmitate + ACP-SH. Fatty acid molecules can undergo further modification, such as elongation and/or desaturation.
Modified photosynthetic microorganisms, e.g., Cyanobacteria, may comprise one or more exogenous polynucleotides encoding any of the above polypeptides or enzymes involved in fatty acid synthesis. In particular embodiments, the enzyme is an acetyl-CoA carboxylase or a variant or functional fragment thereof.
Wax Ester Synthesis. Wax esters are esters of a fatty acid and a long-chain alcohol. These neutral lipids are composed of aliphatic alcohols and acids, with both moieties usually long-chain (e.g., C16 and Ci8) or very-long-chain (C20 and longer) carbon structures, though medium-chain-containing wax esters are included (e.g., Cw, C12 and Ci4). Wax esters have diverse biological functions in bacteria, insects, mammals, and terrestrial plants and are also important substrates for a variety of industrial applications. Various types of wax ester are
widely used in the manufacture of fine chemicals such as cosmetics, candles, printing inks, lubricants, coating stuffs, and others.
I n certain organisms, such as Acinetobacter, the pathway for wax ester synthesis of Acinetobacter spp. has been assumed to start from acyl coenzyme A (acyl-CoA), which is then reduced to the corresponding alcohol via acyl-CoA reductase and aldehyde reductase. In other organisms, for example, wax ester biosynthesis involves elongation of saturated Ci6 and Ci8 fatty acyl-CoAs to very-long-chain fatty acid wax precursors between 24 and 34 carbons in length, and their subsequent modification by either the alkane-forming (decarbonylation) or the alcohol-forming (acyl reduction) pathway (see Li et al., Plant Physiology 148:97-107, 2008).
In certain aspects, wax ester synthesis can occur via the acyl-ACP => acyl aldehyde pathway. I n this pathway, acyl-ACP reductase overexpression increases conversion of acyl-ACP into acyl aldehydes, alcohol dehydrogenase overexpression then increases conversion of acyl aldehydes into fatty alcohols, and DGAT overexpression cooperatively increases conversion of the fatty alcohols into their corresponding wax esters. Modified photosynthetic microorganisms, e.g., Cyanobacteria, may therefore comprise one or more exogenous polynucleotides encoding any of the above polypeptides or enzymes involved in wax ester synthesis.
Acyl Carrier Proteins. Embodiments of the present invention optionally include one or more exogenous (e.g., recombinantly introduced) or overexpressed ACP proteins. These proteins play crucial roles in fatty acid synthesis. Fatty acid synthesis in bacteria, including Cyanobacteria, is carried out by highly conserved enzymes of the type II fatty acid synthase system (FAS I I; consisting of about 19 genes) in a sequential, regulated manner. Acyl carrier protein (ACP) plays a central role in this process by carrying all the intermediates as thioesters attached to the terminus of its 4'-phosphopantetheine prosthetic group (ACP-thioesters). Apo-ACP, the product of acp gene, is typically activated by a phosphopantetheinyl transferease (PPT) such as the acyl carrier protein synthase (AcpS) type found in E. coli or the Sfp (surfactin type) PTT as characterized in Bacillus subtilis. Cyanobacteria posses an Sfp-like PPT, which is understood to act in both primary and secondary metabolism. Embodiments of the present
invention therefore include overexpression of PPTs such as AcpS and/or Sfp-type PPTs in combination with overexpression of cognate ACP encoding genes, such as ACP.
The ACP-thioesters are substrates for all of the enzymes of the FAS II system. The end product of fatty acid synthesis is a long acyl chain typically consisting of about 14-18 carbons attached to ACP by a thioester bond.
At least three enzymes of the FAS II system in other bacteria can be subject to feedback inhibition by acyl-ACPs: 1) the ACCase complex - a heterotetramer of the AccABCD genes that catalyzes the production of malonyl-coA, the first step in the pathway; 2) the product of the FabH gene (β-ketoacyl-ACP synthase III), which catalyzes the condensation of acetyl-CoA with malonyl-ACP; and 3) the product of the Fabl gene (enoyl-ACP reductase), which catalyzes the final elongation step in each round of elongation. Certain proteins such as acyl-ACP reductase are capable of increasing fatty acid production in photosynthetic bacteria such as Cyanobacteria, and it is believed that overexpression of ACP in combination with this protein and possibly other biosynthesis proteins will further increase fatty acid production in such strains.
An ACP can be derived from a variety of eukaryotic organisms, microorganisms (e.g., bacteria, fungi), or plants. In certain embodiments, an ACP polynucleotide sequence and its corresponding polypeptide sequence are derived from Cyanobacteria such as Synechococcus. In certain embodiments, ACPs can be derived from plants such as spinach. SEQ ID NOS:5-12 provide the nucleotide and polypeptide sequences of exemplary bacterial ACPs from Synechococcus and Acinetobacter, and SEQ ID NOS:13-14 provide the same for an exemplary plant ACP from Spinacia oleracea (spinach). SEQ ID NOS:5 and 6 derive from Synechococcus elongatus PCC7942, and SEQ ID NOS:7-12 derive from Acinetobacter sp. ADP1.
Examples of prokaryotic organisms having an ACP include certain actinomycetes, a group of Gram-positive bacteria with high G+C ratio, such as those from the representative genera Actinomyces, Arthrobacter, Corynebacterium, Frankia, Micrococcus, Mocrimonospora, Mycobacterium, Nocardia, Propionibacterium, Rhodococcus and Streptomyces. Particular examples of actinomycetes that have one or more genes encoding an ACP activity include, for example, Mycobacterium tuberculosis, M. avium, M. smegmatis, Micromonospora echinospora,
Rhodococcus opacus, R. ruber, and Streptomyces lividans. Additional examples of prokaryotic organisms that encode one or more enzymes having an ACP activity include members of the genera Acinetobacter, such as A. calcoaceticus, A. baumanii, A. baylii, and members of the generua Alcanivorax. I n certain embodiments, an ACP gene or enzyme is isolated from Acinetobacter baylii sp. ADP1, a gram-negative triglyceride forming prokaryote.
Acyl ACP Synthases (Aas). Acyl-ACP synthetases (Aas) catalyze the ATP-dependent acylation of the thiol of acyl carrier protein (ACP) with fatty acids, including those fatty acids having chain lengths from about C4 to C18. I n Cyanobacteria, among other functions, Aas enzymes not only directly incorporate exogenous fatty acids from the culture medium into other lipids, but also play a role in the recycling of acyl chains from lipid membranes. Deletion of Aas in cyanobacteria can lead to secretion of free fatty acids into the culture medium. See, e.g., Kaczmarzyk and Fulda, Plant Physiology 152:1598-1610, 2010.
Certain embodiments may overexpress one or more Aas polypeptides described herein and known in the art. According to one non-limiting theory, overexpression of Aas in com bination with overexpression of ACP leads to increased TAG production in DGAT-expressing strains, for example, by boosting acyl-ACP levels. Overexpression of Aas in optional combination with overexpression of ACP may likewise increase wax ester formation, for example, when com bined with overexpression of one or more alcohol dehydrogenase(s) and wax ester synthase(s), such as a bi-functional DGAT. Certain embodiments therefore include modified photosynthetic microorganisms comprising overexpressed Aas polypeptide(s), optionally in combination with overexpressed ACP polypeptide(s), especially when combined with overexpression of alcohol dehydrogenase, acyl-ACP reductase (e.g., orfl594), and wax ester synthase (e.g., aDGAT).
Examples of bacterial Aas enzymes include those derived from E. coli, Acinetobacter, and Vibrio sp. such as V. harveyi (see, e.g., Shanklin, Protein Expression and Purification. 18:355-360, 2000; Jiang et al., Biochemistry. 45:10008-10019, 2006). SEQ I D NOS:43 and 44, respectively, provide the nucleotide and polypeptide sequences of an exemplary Aas from Synechococcus elongatus PCC 7942 (0918).
I n specific embodiments, the Aas is derived from the same organism as the overexpressed ACP, DGAT, and /or the TES, if any one of these polypeptides is employed in combination with an Aas. Accordingly, certain embodiments include Aas sequences from any of the organisms described herein for deriving a DGAT or TES, including, for example, various animals (e.g., mammals, fruit flies, nematodes), plants, parasites, and fungi (e.g., yeast such as 5. cerevisiae and Schizosaccharomyces pombe). Examples of prokaryotic organisms include certain actinomycetes, a group of Gram-positive bacteria with high G+C ratio, such as those from the representative genera Actinomyces, Arthrobacter, Corynebacterium, Frankia, Micrococcus, Mocrimonospora, Mycobacterium, Nocardia, Propionibacterium, Rhodococcus and Streptomyces. Particular examples of actinomycetes that have one or more genes encoding an Aas activity include, for example, Mycobacterium tuberculosis, M. avium, M. smegmatis, Micromonospora echinospora, Rhodococcus opacus, R. ruber, and Streptomyces lividans. Additional examples of prokaryotic organisms that encode one or more enzymes having an Aas activity include members of the genera Acinetobacter, such as ,4. calcoaceticus, A. baumanii, A. baylii, and members of the generua Alcanivorax. In certain embodiments, an Aas gene or enzyme is isolated from Acinetobacter baylii sp. ADP1, a gram-negative triglyceride forming prokaryote.
According to one non-limiting theory, an endogenous aldehyde dehydrogenase may be acting on the excess acyl-aldehydes generated by overexpressed orfl594 and converting them to free fatty acids. The normal role of such a dehydrogenase might involve removing or otherwise dealing with damaged lipids. I n this scenario, it is then likely that the Aas gene product recycles these free fatty acids by ligating them to ACP. Accordingly, reducing or eliminating expression of the Aas gene product might ultimately increase production of fatty acids, by reducing or preventing their transfer to ACP. Hence, certain aspects include mutations (e.g., genomic) such as point mutations or insertions that reduce or eliminate the enzymatic activity of one or more endogenous acyl-ACP synthetases (or synthases). Also included are full or partial deletions of an endogenous gene encoding an Aas protein.
Acyl-ACP Reductases. Acyl-ACP reductases (or acyl-ACP dehydrogenases) are members of the reductase or short-chain dehydrogenase family, and are key enzymes of the type I I fatty
acid synthesis (FAS) system. Among other potential catalytic activities, an "acyl-ACP reductase" or "acyl-ACP dehydrogenase" as used herein is capable of catalyzing the conversion (reduction) of acyl-ACP to an acyl aldehyde (see Schirmer et al., supra) and the concomitant oxidation of NAD(P)H to NADP+. In some embodiments, the acyl-ACP reductase preferentially interacts with acyl-ACP, and does not interact significantly with acyl-CoA, i.e., it does not significantly catalyze the conversion of acyl-CoA to acyl aldehyde.
Acyl-ACP reductases can be derived from a variety of plants and bacteria, included photosynthetic microorganisms such as Cyanobacteria. One exemplary acyl-ACP reductase is encoded by orfl594 of Synechococcus elongatus PCC7942 (see SEQ ID NOs:l and 2 for the polynucleotide and polypeptide seq uences, respectively). Another exemplary acyl-ACP reductase is encoded by orfsll0209 of Synechocystis sp. PCC6803 (SEQ I D NOs:3 and 4 for the polynucleotide and polypeptide sequences, respectively).
Alcohol Dehydrogenases. Embodiments of the present invention optionally include one or more alcohol dehydrogenase polypeptides. Examples of alcohol dehydrogenases include those capable of using acyl or fatty aldehydes (e.g., one or more of nonyl-aldehyde, C12, Ci4, C16, Ci8, C20 attY aldehyde) as a substrate, and converting them into fatty alcohols. Specific examples include long-chain alcohol dehydrogenases, capable of using long-chain aldehydes (e.g., Ci6, Ci8, C20) as substrates. I n certain embodiments, the alcohol dehydrogenase is naturally-occurring or endogenous to the modified microorganism, and is sufficient to convert increased acyl aldehydes (produced by acyl-ACP reductase) into fatty alcohols, and thereby contribute to increased wax ester production and overall satisfactory growth characteristics. I n certain embodiments, the alcohol dehydrogenase is derived from a microorganism that differs from the one being modified.
I n these and related embodiments, expression or overexpression of an alcohol dehydrogenase may increase shunting of acyl aldehydes towards production of fatty alcohols, and away from production of other products such as alkanes, fatty acids, or triglycerides. When combined with one or more wax ester synthases, such as DGAT or other enzyme having wax ester synthase activity (e.g., the ability to convert fatty alcohols into wax esters), alcohol dehydrogenases may contribute to production of wax esters. They may also reduce
accum ulation of potentially toxic acyl aldehydes, and thereby improve growth characteristics of a modified microorganism.
Non-limiting examples of alcohol dehydrogenases include those encoded by slrll92 of Synechocystis sp. PCC6803 (SEQ I D NOS:104-105) and ACIAD3612 of Acinetobacter bayiyi (SEQ I D NOS:106-107). Also included are homologs or paralogs thereof, functional equivalents thereof, and fragments or variants thereofs. Functional equivalents can include alcohol dehydrogenases with the ability to efficiently convert acyl aldehydes (e.g., C6, C8, Ci0, C12, Ci4 ,Ci6, i8, C2o aldehydes) into fatty alcohols. Specific examples of functional equivalents include long-chain alcohol dehydrogenases, having the ability to utilize long-chain aldehydes (e.g., C16, Ci8, C2o) as substrates.
I n particular embodiments, the alcohol dehydrogenase has the amino acid sequence of SEQ ID NO:105 (encoded by the polynucleotide sequence of SEQ ID NO:104), or an active fragment or variant of this seq uence. I n some embodiments, the alcohol dehydrogenase has the amino acid seq uence of SEQ ID NO:107 (encoded by the polynucleotide sequence of SEQ ID NO:106), or an active fragment or variant of this sequence.
Aldehyde Dehydrogenases. Embodiments of the present invention optionally include one or more overexpressed or introduced aldehyde dehydrogenases. Examples of aldehyde dehydrogenases include enzymes capable of using acyl aldehydes (e.g., nonyl-aldehyde, C16 fatty aldehyde) as a substrate, and converting them into fatty acids. In certain embodiments, the aldehyde dehydrogenase is naturally-occurring or endogenous to the modified microorganism, and is sufficient to convert increased acyl aldehydes (produced by acyl-ACP reductase) into fatty acids, and thereby contribute to increased fatty acid production and overall satisfactory growth characteristics.
In certain embodiments, the aldehyde dehydrogenase can be overexpressed, for example, by recombinantly introducing a polynucleotide that encodes the enzyme, increasing expression of an endogenous enzyme, or both. An aldehyde dehydrogenase can be overexpressed in a strain that already expresses a naturally-occurring or endogenous enzyme, to further increase fatty acid production of an acyl-ACP reductase over-expressing strain and/or improve its growth characteristics, relative, for example, to an acyl-ACP
reductase-overexpressing strain that only expresses endogenous aldehyde dehydrogenase. An aldehyde dehydrogenase can also be expressed or overexpressed in a strain that does not have a naturally occurring aldehyde dehydrogenase of that type, e.g., it does not naturally express an enzyme that is capable of efficiently converting acyl aldehydes such as nonyl-aldehyde into fatty acids.
I n these and related embodiments, expression or overexpression of an aldehyde dehydrogenase may increase shunting of acyl aldehydes towards production of fatty acids, and away from production of other products such as alkanes. It may also reduce accumulation of potentially toxic acyl aldehydes, and thereby improve growth characteristics of a modified microorganism.
One exemplary aldehyde dehydrogenase is encoded by orf0489 of Synechococcus elongatus PCC7942. Also included are homologs or paralogs thereof, functional eq uivalents thereof, and fragments or variants thereof. Functional equivalents can include aldehyde dehydrogenases with the ability to efficiently convert acyl aldehydes (e.g., nonyl-aldehyde) into fatty acids. I n specific embodiments, the aldehyde dehydrogenase has the amino acid sequence of SEQ ID NO:103 (encoded by the polynucleotide sequence of SEQ I D NO:102), or an active fragment or variant of this sequence.
Particular embodiments include photosynthetic microorganisms having reduced expression and/or activity of one or more aldehyde dehydrogenases, for instance, in the production of wax esters. I ncluded are mutations (e.g., genomic) that reduce or eliminate the enzymatic activity of one or more endogenous aldehyde dehydrogenases, such as point mutations, insertions, or full or partial deletion m utations. Specific embodiments include a modified Synechococcus elongatus PCC7942 having a full or partial deletion of orf0489.
Aldehyde Decarbonylases. Certain embodiments include photosynthetic microorganisms having reduced expression and/or activity of one or more aldehyde decarbonylases. As used herein, an "aldehyde decarbonylase" is capable of catalyzing the conversion of an acyl aldehyde (or fatty aldehyde) to an alkane or alkene. I ncluded are members of the ferritin-like or ribonucleotide reductase-like family of nonheme diiron enzymes (see, e.g., Stubbe et ai, Trends Biochem Sci. 23 :438-43, 1998).
According to one non-limiting theory, because the a ldehyde decarbonylase encoded by PCC7942_orfl593 (from Synechococcus) or PCC6803_orfsll0208 (from Synechostis sp. PCC6803) utilizes acyl aldehyde as a substrate for alkane or alkene production, reducing expression of this protein may further increase yields of free fatty acids by shunting acyl aldehydes (produced by acyl-ACP reductase) away from an alkane-producing pathway, and towards a fatty acid-producing pathway. PCC7942_orfl593 and PCC6803_orfsll0208 orthologs can be found, for exam ple, in N. punctiforme PCC73102, Thermosynechococcus elongatus BP-1, Synechococcus sp. Ja-3-3AB, P. marinus M IT9313, P. marinus NATL2A, and Synechococcus sp. RS 9117, the latter having at least two paralogs (RS 9117-1 and -2).
Particular embodiments include mutations (e.g., genomic) that reduce or eliminate the enzymatic activity of one or more endogenous aldehyde decarbonylases, for instance, in the production of fatty acids or wax esters, optionally in combination with reduced expression of one or more endogenous aldehyde dehydrogenases. Also included are point mutations, insertions, and full or partial deletions of an endogenous gene encoding an aldehyde decarbonylase. Specific embodiments include a modified Synechococcus elongatus PCC7942 having a full or partial deletion of orfl593.
Thioesterases. Certain embodiment include one or more exogenous or overexpressed thioesterase enzymes, optionally in combination with at least one of an introduced ACP enzyme, an introduced Aas enzyme, or both. For instance, one embodiment relates to the use an introduced ACP and/or Aas to increase the growth and/or fatty acid production of a free fatty acid producing TES strain, such as a TesA strain or a FatB strain (i.e., a strain having an introduced TesA or FatB). Thioesterases, as referred to herein, exhibit esterase activity (splitting of an ester into acid and alcohol, in the presence of water) specifically at a thiol group. Fatty acids are often attached to cofactor molecules, such as coenzyme A (CoA) and acyl carrier protein (ACP), by thioester linkages during the process of de novo fatty acid synthesis. Certain embodiments employ thioesterases having acyl-ACP thioesterase activity, acyl-CoA thioesterase activity, or both activities. Examples of thioesterases having both activities (i.e., acyl-ACP/acyl-CoA thioesterases) include TesA and related embodiments. I n certain embodiments, a selected thioesterase has acyl-ACP thioesterase activity but not acyl-CoA
thioesterase activity. Examples of thioesterases having only acyl-ACP thioesterase activity include the FatB thioesterases and related embodiments.
Certain thioesterases have both thioesterase activity and lysophospholipase activity. Specific examples of thioesterases include TesA, TesB, and related embodiments. Certain embodiments may employ periplasmically-localized or cytoplasmically-localized enzymes that thioesterase activity, such as E. coli TesA or E. coli TesB. For instance, wild type TesA, being localized to the periplasm, is normally used to hydrolyze thioester linkages of fatty acid-ACP (acyl-ACP) or fatty acid-CoA (acyl-CoA) compounds scavenged from the environment. A mutant thioesterase, PldC (referred to interchangeably as PldC/*TesA or *TesA), is not exported to the periplasm due to deletion of an N-terminal amino acid sequence required for proper transport of TesA from the cytoplasm to the periplasm. This deletion results in a cytoplasmic-localized PldC(*TesA) protein that has access to endogenous acyl-ACP and acyl-CoA intermediates. Other mutations or deletions in the N-terminal region of TesA can be used to achieve the same result, i.e., a cytoplasmic TesA.
Overexpressed PldC(*TesA) results in hydrolysis of acyl groups from endogenous acyl-ACP and acyl-CoA molecules. Cells expressing PldC(*TesA) must channel additional cellular carbon and energy to maintain production of acyl-ACP and acyl-coA molecules, which are required for membrane lipid synthesis. Thus, PldC(*TesA) expression results in a net increase in total cellular lipid content. For instance, PldC(*TesA) expressed alone in Synechococcus doubles the total lipid content from 10% of biomass to 20% of biomass, a result that can be further increased by combining *TesA or related molecules with an introduced ACP and/or an introduced Aas. Hence, certain embodiments employ an exogenous or overexpressed cytoplasmic TesA (such as *TesA) in combination with an exogenous or overexpressed ACP, an exogenous or overexpressed Aas, or both.
I n certain embodiments, a thioesterase (TES) is an acyl-ACP thioesterase and/or an acyl-CoA thioesterase. In particular embodiments, the TES is a TesA or TesB polypeptide from E. coli, or a cytoplasmic TesA variant (*TesA) variant having the sequence set forth in SEQ ID NO:121, or a fragment or variant thereof.
Certain thioesterases have thioesterase activity only, i.e., they have little or no lysophospholipase activity. Examples of these thioesterases include enzymes of the FatB family. FatB encoded enzymes typically hydrolyze saturated C14-C18 ACPs, preferentially 16:0 ACP, but they can also hydrolyze 18:1 ACP. The production of medium chain (C8-C12) fatty acids in plants or seeds such as those of Cuphea spp. often results of FatB enzymes that have chain length specificities for medium chain fatty acyl-ACPs. These medium chain FatB thioesterases are present in many species with medium-chain fatty acids in their oil, including, for example, California bay laurel, coconut, and elm, among others. Hence, FatB sequences may be derived from these and other organisms. Particular examples include plant FatB acyl-ACP thioesterases such as C8, C12, C14, and C16 FatB thioesterases. Hence, in certain embodiments, the TES is a FatB polypeptide, such as a C8, C12, C14, C16, or C18 FatB.
Specific examples of FatB thioesterases include the Cuphea hookeriana C8/C10 FatB thioesterase, the Umbellularia californica C12 FatBl thioesterase, the Cinnamomum camphora C14 FatBl thioesterase, and the Cuphea hookeriana CIS FatBl thioesterase. I n specific embodiments, the thioesterase is a Cuphea hookeriana C8/C10 FatB, comprising the amino acid sequence of SEQ ID NO:108 (full-length protein) or SEQ ID NO:109 (mature protein without signal sequence). In particular embodiments, the thioesterase is a Umbellularia californica C12 FatBl, comprising the amino acid sequence of SEQ I D NO:110 (full-length protein) or SEQ ID NO:lll (mature protein without signal sequence). I n certain embodiments, the thioesterase is a Cinnamomum camphora C14 FatBl, comprising the amino acid sequence of SEQ I D NO:112 (full-length protein) or SEQ I D NO:113 (mature protein without signal sequence). I n particular embodiments, the thioesterase is a Cuphea hookeriana CIS FatBl, comprising the amino acid sequence of SEQ ID NO:114 (full-length protein) or SEQ ID NO:115 (mature protein without signal sequence).
Acetyl Coenzyme A carboxylases (ACCase). Embodiments of the present invention optionally include one or more exogenous (e.g., recombinantly introduced) or overexpressed ACCase proteins. As used herein, an "acetyl CoA carboxylase" gene includes any polynucleotide sequence encoding amino acids, such as protein, polypeptide or peptide, obtainable from any cell source, which demonstrates the ability to cata lyze the carboxylation of acetyl-CoA to
produce malonyl-CoA under enzyme reactive conditions, and further includes any naturally-occurring or non-naturally occurring variants of an acetyl-CoA carboxylase sequence having such ability.
Acetyl-CoA carboxylase (ACCase) is a biotin-dependent enzyme that catalyses the irreversible carboxylation of acetyl-CoA to produce malonyl-CoA through its two catalytic activities, biotin carboxylase (BC) and carboxyltransferase (CT). The biotin carboxylase (BC) domain catalyzes the first step of the reaction: the carboxylation of the biotin prosthetic group that is covalently linked to the biotin carboxyl carrier protein (BCCP) domain. In the second step of the reaction, the carboxyltransferase (CT) domain catalyzes the transfer of the carboxyl group from (carboxy) biotin to acetyl-CoA. Formation of malonyl-CoA by acetyl-CoA carboxylase (ACCase) represents the commitment step for fatty acid synthesis, because malonyl-CoA has no metabolic role other than serving as a precursor to fatty acids. Because of this reason, acetyl-CoA carboxylase represents a pivotal enzyme in the synthesis of fatty acids.
In most prokaryotes, ACCase is a multi-subunit enzyme, whereas in most eukaryotes it is a large, multi-domain enzyme. In yeast, the crystal structure of the CT domain of yeast ACCase has been determined at 2.7A resolution (Zhang et ai, Science, 299:2064-2067 (2003). This structure contains two domains, which share the same backbone fold. This fold belongs to the crotonase/CIpP family of proteins, with a b-b-a superhelix. The CT domain contains many insertions on its surface, which are important for the dimerization of ACCase. The active site of the enzyme is located at the dimer interface.
Although Cyanobacteria, such as Synechococcus, express a native ACCase enzyme, these bacteria typically do not produce or accumulate significant amounts of fatty acids. For example, Synechococcus in the wild accumulates fatty acids in the form of lipid membranes to a total of about 4% by dry weight.
Given the role of ACCase in the commitment step of fatty acid biosynthesis, embodiments of the present invention include methods of increasing the production of fatty acid biosynthesis, and, thus, lipid production, in Cyanobacteria by introducing one or more polynucleotides that encode an ACCase enzyme that is exogenous to the Cyanobacterium's native genome. Embodiments of the present invention also include a modified
Cyanobacterium, and compositions comprising said Cyanobacterium, comprising one or more polynucleotides that encode an ACCase enzyme that is exogenous to the Cyanobacterium's native genome.
A polynucleotide encoding an ACCase enzyme may be isolated or obtained from any organism, such as any prokaryotic or eukaryotic organism that contains an endogenous ACCase gene. Examples of eukaryotic organisms having an ACCase gene are well-known in the art, and include various animals (e.g., mammals, fruit flies, nematodes), plants, parasites, and fungi (e.g., yeast such as 5. cerevisiae and Schizosaccharomyces pombe). In certain embodiments, the ACCase encoding polynucleotide sequences are obtained from Synechococcus sp. PCC7002.
Examples of prokaryotic organisms that may be utilized to obtain a polynucleotide encoding an enzyme having ACCase activity include, but are not limited to, Escherichia coli, Legionella pneumophila, Listeria monocytogenes, Streptococcus pneumoniae, Bacillus subtilis, Ruminococcus obeum ATCC 29174, marine gamma proteobacterium HTCC2080, Roseovarius sp. HTCC2601, Oceanicola granulosus HTCC2516, Bacteroides caccae ATCC 43185, Vibrio alginolyticus 12G01, Pseudoalteromonas tunicata D2, Marinobacter sp. ELB17, marine gamma proteobacterium HTCC2143, Roseobacter sp. SK209-2-6, Oceanicola batsensis HTCC2597, Rhizobium leguminosarum bv. trifolii WSM 1325, Nitrobacter sp. Nb-311A, Chloroflexus aggregans DSM 9485, Chlorobaculum parvum, Chloroherpeton thalassium, Acinetobacter baumannii, Geobacillus, and Stenotrophomonas maltophilia, among others.
Particular exemplary acetyl-CoA carboxylases (ACCase) comprise or consist of a polypeptide sequence set forth in any of SEQ ID NOs:55, 45, 46, 47, 48 or 49, or a fragment or variant thereof. SEQ ID NO:55 is the sequence of Saccharomyces cerevisiae acetyl-CoA carboxylase (yAccl); SEQ ID NO:45 is Synechococcus sp. PCC 7002 AccA; SEQ ID NO:46 is Synechococcus sp. PCC 7002 AccB; SEQ I D NO:47 is Synechococcus sp. PCC 7002 AccC; and SEQ I D NO:48 is Synechococcus sp. PCC 7002 AccD; and SEQ ID NO:49 is a Triticum aestivum ACCase. I n certain embodiments, the introduced ACCase is not native to the genome of the modified photosynthetic microorganism.
Phosphatide Acid Phosphatases (PAP). As used herein, a "phosphatidate phosphatase" or "phosphatide acid phosphatase" gene includes any polynucleotide sequence encoding
amino acids, such as protein, polypeptide or peptide, obtainable from any cell source, which demonstrates the ability to catalyze the dephosphorylation of phosphatidate (PtdOH) under enzyme reactive conditions, yielding diacylglycerol (DAG) and inorganic phosphate, and further includes any naturally-occurring or non-naturally occurring variants of a phosphatidate phosphatase sequence having such ability.
Phosphatidate phosphatases (PAP, 3-s/i-phosphatidate phosphohydrolase) catalyze the dephosphorylation of phosphatidate (PtdOH), yielding diacylglycerol (DAG) and inorganic phosphate. This enzyme belongs to the family of hydrolases, specifically those acting on phosphoric monoester bonds. The systematic name of this enzyme class is 3-sn-phosphatidate phosphohydrolase. Other names in common use include phosphatic acid phosphatase, acid phosphatidyl phosphatase, and phosphatic acid phosphohydrolase. This enzyme participates in at least 4 metabolic pathways: glycerolipid metabolism, glycerophospholipid metabolism, ether lipid metabolism, and sphingolipid metabolism.
PAP enzymes have roles in both the synthesis of phospholipids and triacylglycerol through its product diacylglycerol, as well as the generation or degradation of lipid-signaling molecules in eukaryotic cells. PAP enzymes are typically classified as either Mg2+-dependent (referred to as PAP1 enzymes) or Mg2+-independent (PAP2 or lipid phosphate phosphatase (LPP) enzymes) with respect to their cofactor requirement for catalytic activity. In both yeast and mammalian systems, PAP2 enzymes are known to be involved in lipid signaling. By contrast, PAP1 enzymes, such as those found in Saccharomyces cerevisiae, play a role in de novo lipid synthesis (Han, et al. J Biol Chem. 281:9210-9218, 2006), thereby revealing that the two types of PAP are responsible for different physiological functions.
In both yeast and higher eukaryotic cells, the PAP reaction is the committed step in the synthesis of the storage lipid triacylglycerol (TAG), which is formed from PtdOH through the intermediate DAG. The reaction product DAG is also used in the synthesis of the membrane phospholipids phosphatidylcholine (PtdCho) and phosphatidylethanolamine. The substrate PtdOH is used for the synthesis of all membrane phospholipids (and the derivative inositol-containing sphingolipids) through the intermediate CDP-DAG. Thus, regulation of PAP activity might govern whether cells make storage lipids and phospholipids through DAG or
phospholipids through CDP-DAG. In addition, PAP is involved in the transcriptional regulation of phospholipid synthesis.
PAPl enzymes have been purified and characterized from the membrane and cytosolic fractions of yeast, including a gene (Pahl, formerly known as Smp2) been identified to encode a PAPl enzyme in 5. cerevisiae. The PaM-encoded PAPl enzyme is found in the cytosolic and membrane fractions of the cell, and its association with the membrane is peripheral in nature. As expected from the multiple forms of PAPl that have been purified from yeast, ραΜΔ mutants still contain PAPl activity, indicating the presence of an additional gene or genes encoding enzymes having PAPl activity.
Analysis of mutants lacking the PaM-encoded PAPl has provided evidence that this enzyme generates the DAG used for lipid synthesis. Cells containing the ραΜΔ mutation accumulate PtdOH and have reduced amounts of DAG and its acylated derivative TAG. Phospholipid synthesis predominates over the synthesis of TAG in exponentially growing yeast, whereas TAG synthesis predominates over the synthesis of phospholipids in the stationary phase of growth. The effects of the ραΜΔ mutation on TAG content are most evident in the stationary phase. For example, stationary phase cells devoid of the Pahl gene show a reduction of >90% in TAG content. Likewise, the ραΜΔ mutation shows the most marked effects on phospholipid composition (e.g. the consequent reduction in PtdCho content) in the exponential phase of growth. The importance of the PaM-encoded PAPl enzyme to cell physiology is further emphasized because of its role in the transcriptional regulation of phospholipid synthesis.
The requirement of Mg2+ ions as a cofactor for PAP enzymes is correlated with the catalytic motifs that govern the phosphatase reactions of these enzymes. For example, the PaW-encoded PAPl enzyme has a DxDxT (SEQ ID NO:198) catalytic motif within a haloacid dehalogenase (HAD)-like domain ("x" is any amino acid). This motif is found in a superfamily of Mg2+-dependent phosphatase enzymes, and its first aspartate residue is responsible for binding the phosphate moiety in the phosphatase reaction. By contrast, the DPPl-and .PPl-encoded PAP2 enzymes contain a three-domain lipid phosphatase motif that is localized to the hydrophilic surface of the membrane. This catalytic motif, which comprises the consensus
sequences KxxxxxxRP (domain 1) (SEQ ID NO:116), PSGH (domain 2) (SEQ ID NO:117), and SRxxxxxHxxxD (domain 3) (SEQ ID NO:118), is shared by a superfamily of lipid phosphatases that do not req uire Mg2+ ions for activity. The conserved arginine residue in domain 1 and the conserved histidine residues in domains 2 and 3 may be essential for the catalytic activity of PAP2 enzymes. Accordingly, a phosphatidate phosphatase polypeptide may comprise one or more of the above-described catalytic motifs.
A polypeptide having a phosphatidate phosphatase enzymatic activity may be obtained from any organism having a suitable, endogenous phosphatidate phosphatase gene. Examples of organisms that may be used to obtain a phosphatidate phosphatase encoding polynucleotide sequence include, but are not limited to, Homo sapiens, Mus musculus, Rattus norvegicus, Bos taurus, Drosophila melanogaster, Arabidopsis thaliana, Magnaporthe grisea, Saccharomyces cerevisiae, Schizosaccharomyces pom be, Cryptococcus neoformans, and Bacillus pumilus, among others. Specific examples of PAP enzymes include Pahl from 5. cerevisiae, PgpB from E. coli, and PAP from PCC6803.
I n specific embodiments, a phosphatidate phosphatase polypeptide comprises or consists of a polypeptide sequence set forth in SEQ I D NO:131, or a fragment or variant thereof. SEQ I D NO:131 is the sequence of Saccharomyces cerevisiae phosphatidate phosphatase (yPahl). I n certain embodiments, the polypeptide seq uence of the PAP is encoded by the E. coli PgpB gene, and/or the PAP gene from Synechocystis sp. PCC6803.
Triacylglycerol (TAG) Hydrolases. Certain embodiments relate to the use of exogenous or overexpressed TAG hydrolases (or TAG lipases) to increase production of TAGs in a TAG-producing strain. For instance, specific embodiments may utilize a TAG hydrolase in combination with a DGAT, and optionally a TES. These embodiments may then further utilize an ACP, an Aas, or both, any of the lipid biosynthesis proteins described herein, and/or any of the modifications to glycogen production and storage described herein. Hence, as noted above, TAG hydrolases may be used in TAG-producing strains (e.g., DGAT-expressing strains) with or without an ACP or Aas.
TAG hydrolases are carboxylesterases that are typically specific for insoluble long chain fatty acid TAGs. Carboxylesterases catalyze the chemical reaction:
carboxylic ester + H20 alcohol + carboxylate
Thus, the two substrates of this enzyme are carboxylic ester and H20, whereas its two products are alcohol and carboxylate. According to one non-limiting theory, it is understood that TAG hydrolase expression (or overexpression) in a TAG producing strain (e.g., DGAT/ACP, DGAT/Aas, DGAT/ACP/Aas) releases acyl chains to not only increase accumulation of free fatty acids (FFA), but also increase the amount of free 1, 2 diacylglycerol (DAG). This free DAG then serves as a substrate for DGAT, and thereby allows increased TAG production, especially in the presence of over-expressed ACP, Aas, or both. Accordingly, certain embodiments employing a TAG hydrolase produce increased amounts of TAG, relative, for example, to a DGAT only-expressing microorganism. I n specific embodiments, the TAG hydrolase is specific for TAG and not DAG, i.e., it preferentially acts on TAG relative to DAG.
Non-limiting examples of TAG hydrolases include SDP1 (SUGAR-DEPENDENT1) triacylglycerol lipase from Arabidopsis thaliana (SEQ I D NO:170), ACIAD1335 from Acinetobacter sp. ADP1 (SEQ I D NO:171), TG14P from S. cerevisiae (SEQ ID NO:172), and RHAl_ro04722 (YP_704665) TAG lipase from Rhodococcus (SEQ ID NO:173). Additional putative lipases/esterases from Rhodococcus include RHAl_ro01602 lipase/esterase (see SEQ ID NOs:156 and 174 for polynucleotide and polypeptide sequence, respectively), and RHAl_ro06856 lipase/esterase (see SEQ ID NOs:119 and 120 for polynucleotide and polypeptide seq uence, respectively).
Fatty Acyl-CoA Synthetases. Certain embodiments relate to the use of overexpressed fatty acyl-CoA synthetases to increase activation of fatty acids, and thereby increase production of TAGs in a TAG-producing strain (e.g., a DGAT-expressing strain). For instance, specific embodiments may utilize an acyl-ACP reductase in combination with a fatty acyl-CoA synthetase and a DGAT. These embodiments may then further utilize an ACP, an ACCase, or both, and/or any of the modifications to glycogen production and storage or glycogen breakdown described herein.
Fatty acyl-CoA synthetases activate fatty acids for metabolism by catalyzing the formation of fatty acyl-CoA thioesters. Fatty acyl-CoA thioesters can then serve not only as substrates for beta-oxidation, at least in bacteria capable of growing on fatty acids as a sole
source of carbon (e.g., E. coli, Salmonella), but also as acyl donors in phospholipid biosynthesis. Many fatty acyl-CoA synthetases are characterized by two highly conserved sequence elements, an ATP/AM P binding motif, which is common to enzymes that form an adenylated intermediate, and a fatty acid binding motif.
According to one non-limiting theory, certain embodiments may employ fatty acyl-CoA synthetases to increase activation of free fatty acids, which can then be incorporated into TAGs, mainly by the DGAT-expressing (and thus TAG-producing) photosynthetic microorganisms described herein. Hence, fatty acyl-CoA synthetases can be used in any of the embodiments described herein, such as those that produce increased levels of free fatty acids, where it is desirable to turn free fatty acids into TAGs. As noted above, these free fatty acids can then be activated by fatty acyl-CoA synthetases to generate acyl-CoA thioesters, which can then serve as substrates by DGAT to produce increased levels of TAGs.
One exemplary fatty acyl-CoA synthetase includes the FadD gene from E. coli (SEQ ID NOS:16 and 17 for nucleotide and polypeptide sequence, respectively), which encodes a fatty acyl-CoA synthetase having substrate specificity for medium and long chain fatty acids. Other exemplary fatty acyl-CoA synthetases include those derived from 5. cerevisiae; Faalp can use C12-C16 acyl-chains in vitro (see SEQ I D NOS:18 and 19 for nucleotide and polypeptide sequence, respectively), Faa2p shows a less restricted specificity ranging from C7-C17 (see SEQ I D NOS:20 and 21 for nucleotide and polypeptide seq uence, respectively), and Faa3p, together with that of DGAT1, enhances lipid accumulation in the presence of exogenous fatty acids in 5. cerevisiae (see SEQ ID NO:22 and 23 for nucleotide and polypeptide sequence, respectively). SEQ I D NO:22 is codon-optimized for expression in 5. elongatus PCC7942.
Li pases/Phospholi pases. In various embodiments, modified photosynthetic microorganisms, e.g., Cyanobacteria, of the present invention further comprise one or more exogenous or introduced nucleic acids that encode a polypeptide having a lipase or phospholipase activity, or a fragment or variant thereof. Lipases, including phospholipases, lysophospholipases, thioesterases, and enzymes having one, two, or all three of these activities, typically catalyze the hydrolysis of ester chemical bonds in lipid substrates. Without wishing to be bound by any one theory, in certain exemplary embodiments the expression of one or more
phospholipases can generate fatty acids from membra ne lipids, which may then be used by the ACP and/or Aas to make acyl-ACPs. These acyl-ACPs, for example, can then feed into the triglyceride synthesis pathways, thereby increasing triglyceride (TAG) production.
A phospholipase is an enzyme that hydrolyzes phospholipids into fatty acids and other lipophilic substances. There are four major classes, termed A, B, C and D distinguished by what type of reaction they catalyze. Phospholipase Al cleaves the SN-1 acyl chain, while Phospholipase A2 cleaves the SN-2 acyl chain, releasing arachidonic acid. Phospholipase B cleaves both SN-1 and SN-2 acyl chains, and is also known as a lysophospholipase. Phospholipase C cleaves before the phosphate, releasing diacylglycerol and a phosphate-containing head group. Phospholipases C play a central role in signal transduction, releasing the second messenger, inositol triphosphate. Phospholipase D cleaves after the phosphate, releasing phosphatidic acid and an alcohol. Types C and D are considered phosphodiesterases. In various embodiments, one or more phospholipase from any one of these classes may be used, alone or in any combination.
As noted above, phospholipases (PLA1,2) act on phospholipids of different kinds including phosphatidyl glycerol, the major phospholipid in Cyanobacteria, by cleaving the acyl chains off the snl or sn2 positions (carbon 1 or 2 on the glycerol backbone); some are selective for snl or sn2, others act on both. Lysophospholipases act on lysophospholipids, which can be the product of phospholipases or on lysophosphatidic acid, a normal intermediate of the de novo phosphatidic acid synthesis pathway, e.g., l-acyl-DAG-3-phosphate.
By way of non-limiting theory, it is understood that in certain embodiments, phospholipases and/or lysophospholipases can cleave off acyl chains from phospholipids or lysophospholipids and thus deregulate the normal recycling of the lipid membranes, including both cell membrane and thylakoid membranes, which then leads to accumulation of free fatty acids (FFAs). I n certain embodiments (e.g., TesA strains), these FFAs may accumulate extracellularly. In other embodiments (e.g., ACP and/or Aas over-expressing microorganisms), FFAs can be converted into acyl-ACPs by acyl ACP synthase (Aas) in a strain that also over-expresses ACP. In specific embodiments (e.g., DGAT-containing microorganisms), these acyl-ACPs can then serve as substrates for DGAT to make TAGs.
I n other embodiments, phospholipases can be over-expressed to generate lyshophospholipids and acyl chains. The lysophospholipids can then serve as substrates for a lysophospholipase, which cleaves off the remaining acyl chain. In some embodiments, these acyl chains can either accumulate as FFAs, or in other embodiments may serve as substrates of Acyl ACP synthase (Aas) to generate acyl-ACPs, which can then be used by DGAT to make TAGs.
Particular examples of phospholipase C enzymes include those derived from eukaryotes such as mammals and parasites, in addition to those derived from bacteria. Examples include phosphoinositide phospholipase C (EC 3.1.4.11), the main form found in eukaryotes, especially mammals, the zinc-dependent phospholipase C family of bacterial enzymes (EC 3.1.4.3) that includes alpha toxins, phosphatidylinositol diacylglycerol-lyase (EC 4.6.1.13), a related bacterial enzyme, and glycosylphosphatidylinositol diacylglycerol-lyase (EC 4.6.1.14), a trypanosomal enzyme.
I n particular embodiments, the present invention contemplates using a lysophospholipase. A lysophospholipase is an enzyme that catalyzes the chemical reaction :
2-lysophosphatidic acid + H20 glycerol-3-phosphate + a carboxylate
Thus, the two substrates of this enzyme are 2-lysophosphatidylcholine and H20, whereas its two products are glycerophosphocholine and carboxylate.
Lysophospholipase are are members of the hydrolase family, specifically those acting on carboxylic ester bonds. Lysophospholipases participate in glycerophospholipid metabolism. Examples of lysophospholipases include, but are not limited to, 2-Lysophosphatidylcholine acylhydrolase, Lecithinase B, Lysolecithinase, Phospholipase B, Lysophosphatidase, Lecitholipase, Phosphatidase B, Lysophosphatidylcholine hydrolase, Lysophospholipase Al, Lysophospholipase LI (TesA), Lysophopholipase L2, TesB, Lysophospholipase transacylase, Neuropathy target esterase, NTE, NTE-LysoPLA, NTE-lysophospholipase, and Vu Patatin 1 protein. I n particular embodiments, lysophospholipases utilized according to the present invention are derived from a bacteria, e.g., E. coli, or a plant. Any of these lysophospholipases may be used according to various embodiments of the present invention.
Certain lysophospholipases, such as Lysophospholipase LI (also referred to as PldC or TesA) are periplasmically-localized or cytoplasmically-localized enzymes that have both
lysophospholipase and thioesterase activity, as described above. Hence, certain thioesterases such as TesA can also be characterized as lysophospholipases. A mutant lysophospholipase described herein, PldC(*TesA), is not exported to the periplasm due to deletion of an N-terminal amino acid sequence required for proper transport of TesA from the cytoplasm to the periplasm. This results in a cytoplasmic-localized PldC(*TesA) protein that has access to endogenous acyl-ACP and acyl-CoA intermediates. Overexpressed PldC(*TesA) results in hydrolysis of acyl groups from endogenous acyl-ACP and acyl-CoA molecules. Cells expressing PldC(*TesA) must channel additional cellular carbon and energy to maintain production of acyl-ACP and acyl-coA molecules, which are required for membrane lipid synthesis. Thus, PldC(*TesA) expression results in a net increase in cellular lipid content. As described herein, PldC(*TesA) is expressed in Synechococcus lipid content doubles from 10% of biomass to 20% of biomass.
In certain embodiments, lysophospholipases utilized according to the present invention have both phospholipase and thioesterase activities. Examples of lysophospholipases that have both activities include, e.g., Lysophospholipase LI (TesA), such as E. coli Lysophospholipase LI, as well as fragments and variants thereof, including those described in the paragraph above. As a phospholipase, certain embodiments may employ TesA variants having only lysophospholipase activity, including variants with reduced or no thioesterase activity.
In particular embodiments, the phospholipase is a bacterial phospholipase, e.g., lysophospholipase, or a fragment or variant thereof, e.g., a phospholipase derived from Escherichia coli, 5. cerevisiae, Rhodococcus, Streptomyces or Acinetobacter species.
Additional non-limiting examples of phospholipases include phospholipase Al (PldA) from Acinetobacter sp. ADP1, phospholipase A (PldA) from E. coli, phospholipase from Streptomyces coelicolor A3(2), phospholipase A2 (PLA2-a) from Arabidopsis thaliana; phospholipase Al/ triacylglycerol lipase (DAD1; Defective Anther Dehiscence 1) from Arabidopsis thaliana, chloroplast DONGLE from Arabidopsis thaliana, patatin-like protein from Arabidopsis thaliana, and patatin from Anabaena variabilis ATCC 29413. Additional non-limiting examples of lysophospholipases include phospholipase B (Plblp) from Saccharomyces cerevisiae S288c, phospholipase B (Plb2p) from Saccharomyces cerevisiae S288c, ACIAD1057
(tesA homolog) from Acinetobacter ADPl, ACIAD1943 lysophospholipase from Acinetobacter ADPl, and a lysophospholipase (YP_702320; RHAl_ro02357) from Rhodococcus.
In particular embodiments, the encoded phospholipase comprises or consists of a Lysophospholipase LI (TesA), Lysophospholipase L2, TesB, or Vu patatin 1 protein, or a homolog, fragment, or variant thereof. In certain embodiments, the Lysophospholipase LI (TesA), Lysophospholipase L2, or TesB is a bacterial Lysophospholipase LI (TesA), Lysophospholipase L2, or TesB, such as an E.coli Lysophospholipase LI (TesA) having the wild-type sequence set forth in SEQ ID NO:133, an E. coli Lysophospholipase L2 having the wild-type sequence set forth in SEQ ID NO:137, or an E. coli TesB having the wild-type sequence set forth in SEQ ID NO:134. In particular embodiment, the Vu patatin 1 protein has the wild-type sequenc set forth in SEQ ID NO:138.
In particular embodiments, the phospholipase is modified such that it localizes predominantly to the cytoplasm instead of the periplasm. For example, the phospholipase may have a deletion or mutation in a region associated with periplasmic localization. In particular embodiments, the phospholipase variant is derived from Lysophospholipase LI (TesA) or TesB. In certain embodiments, the Lysophospholipase LI (TesA) or TesB variant is a bacterial Lysophospholipase LI (TesA) or TesB variant, such as a cytoplasmic E.coli Lysophospholipase LI (PldC(*TesA)) variant having the sequence set forth in SEQ ID NO:121.
Additional examples of phospholipase polypeptide sequences include phospholipase Al (PIdA) from Acinetobacter sp. ADPl (SEQ ID NO:157), phospholipase A (PIdA) from E. coli (SEQ ID NO:158), phospholipase from Streptomyces coelicolor A3(2) (SEQ ID NO:159), phospholipase A2 (PLA2-a) from Arabidopsis thaliana (SEQ ID NO:160). phospholipase Al/ triacylglycerol lipase (DAD1; Defective Anther Dehiscence 1) from Arabidopsis thaliana (SEQ ID NO:161), chloroplast DONGLE from Arabidopsis thaliana (SEQ ID NO:162), patatin-like protein from Arabidopsis thaliana (SEQ ID NO:163), and patatin from Anabaena variabilis ATCC 29413 (SEQ ID NO:164). Additional non-limiting examples of lysophospholipase polypeptide sequences include phospholipase B (Plblp) from Saccharomyces cerevisiae S288c (SEQ ID NO:165), phospholipase B (Plb2p) from Saccharomyces cerevisiae S288c (SEQ ID NO:166), ACIAD1057 (TesA homolog) from Acinetobacter ADPl (SEQ ID NO:167), ACIAD1943 lysophospholipase from Acinetobacter
ADP1 (SEQ I D NO:168), and a lysophospholipase (YP_702320; RHAl_ro02357) from Rhodococcus (SEQ ID NO:169).
Fatty acyl reductase. Certain embodiments relate to the use of overexpressed fatty acyl reductases to increase synthesis of fatty alcohols, and thereby increase production of wax esters in a WE-producing strain (e.g., a DGAT-expressing strain). For instance, certain embodiments may utilize a fatty acyl reductase, possibly in combination with an acyl-ACP reductase, and a DGAT. These embodiments may then further utilize an ACP, an ACCase, or both, and/or any of the modifications to glycogen production and storage or glycogen breakdown described herein.
Fatty Acyl Reductases catalyze the two step reduction of acyl-ACP's or acyl-COA's to acyl alcohols, also known as fatty alcohols. The first step proceeds via an acyl aldehyde intermediate, which is then converted in a second step to a fatty alcohol. These same enzymes can also directly reduce fatty aldehydes to fatty a lcohols (i.e. step two only). In this case they are sometimes referred to as fatty aldehyde reductases. Fatty alcohols can serve as a substrate for wax ester biosynthesis by a DGAT. Many fatty acyl reductases are characterized by three conserved seq uence elements. There is an NADPH binding motif, a motif characteristic of the catalytic site of NADP-utilizing enzymes, and a conserved C-terminal domain, referred to as the Male Sterile 2 domain, that is of unknown function (see Hofvander et al., FEBS Letters (2011) pp 3583-3543)
According to one non-limiting theory, certain embodiments may employ fatty acyl reductases to increase synthesis of fatty alcohols, which can then be incorporated into WE's, mainly by the DGAT-expressing (and thus WE-producing) photosynthetic microorganisms described herein. Hence, fatty acyl reductases can be used in any of the embodiments described herein, such as those that produce increased levels of free fatty alcohols, where it is desirable to turn these into WE's. As noted above, these free fatty alcohols can then be esterified to fatty acids (in the form of acyl-ACP) by DGATs to generate WE's.
One exemplary fatty acyl reductase includes a gene from Marinobacter aq uaeolei VT8, genbank accession number YP_959769.1 (see SEQ ID NOs:211 and 212 for polypeptide and polynucleotide sequence, respectively). Others include a gene from Simmondsia chinesis
(Jojoba) AF149917 (see SEQ I D NOs:213 and 214 for polypeptide and polynucleotide seq uence, respectively); a gene from Euglena gracilis GU733919 (see SEQ ID NOs:215 and 216 for polypeptide and polynucleotide seq uence, respectively); a gene from Hahella chejuensis YP 436183 (see SEQ I D NOs:217 and 218 for polypeptide and polynucleotide sequence, respectively); a gene from Photobacterium profundum SS9 YP 130411.1 (see SEQ I D NOs:219 and 220 for polypeptide and polynucleotide sequence, respectively); a gene from Marinobacter algicola DG893 ZP 01892457 (see SEQ ID NOs:221 and 222 for polypeptide and polynucleotide sequence, respectively); a gene from Marinobacter adhaerens HP15 ADP96574 (see SEQ ID NOs:223 and 224 for polypeptide and polynucleotide sequence, respectively); a gene from Arabidopsis thaliana CERF NM 119537 (see SEQ ID NOs:225 and 226 for polypeptide and polynucleotide seq uence, respectively); a gene from Arabidopsis thaliana At3g56700 NC 003074 (see SEQ I D NOs:227 and 228 for polypeptide and polynucleotide sequence, respectively); a gene from Aradopsis thaliana Atg22500 NC 003076 (see SEQ I D NOs:229 and 230 for polypeptide and polynucleotide sequence, respectively); and a gene from Triticum aestivum (Wheat bread) AJ459250 (see SEQ ID NOs:231 and 232 for polypeptide and polynucleotide seq uence, respectively).
(Hi) Glycogen Synthesis, Storage, and Breakdown
I n particular embodiments, a modified photosynthetic microorganism further comprises additional modifications, such that it has reduced expression of one or more genes associated with a glycogen synthesis or storage pathway and/or increased expression of one or more polynucleotides that encode a protein associated with a glycogen breakdown pathway, or a functional variant of fragment thereof.
In various embodiments, modified photosynthetic microorganisms, e.g., Cyanobacteria, of the present invention have reduced expression of one or more genes associated with glycogen synthesis and/or storage. In particular embodiments, these modified photosynthetic microorganisms have a mutated or deleted gene associated with glycogen synthesis and/or storage. I n particular embodiments, these modified photosynthetic microorganisms comprise a vector that includes a portion of a mutated or deleted gene, e.g., a targeting vector used to
generate a knockout or knockdown of one or more alleles of the mutated or deleted gene. In certain embodiments, these modified photosynthetic microorganisms comprise an antisense RNA or siRNA that binds to an mRNA expressed by a gene associated with glycogen synthesis and/or storage.
I n certain embodiments, modified photosynthetic microorganisms, e.g., Cyanobacteria, of the present invention comprise one or more exogenous or introduced nucleic acids that encode a polypeptide having an activity associated with a glycogen breakdown or triglyceride or fatty acid biosynthesis, including but not limited to any of those described herein. In particular embodiments, the exogenous nucleic acid does not comprise a nucleic acid sequence that is native to the microorganism's genome. In particular embodiments, the exogenous nucleic acid comprises a nucleic acid sequence that is native to the microorganism's genome, but it has been introduced into the microorganism, e.g., in a vector or by molecular biology techniques, for example, to increase expression of the nucleic acid and/or its encoded polypeptide in the microorganism.
Glycogen Biosynthesis and Storage. Glycogen is a polysaccharide of glucose, which functions as a means of carbon and energy storage in most cells, including animal and bacterial cells. More specifically, glycogen is a very large branched glucose homopolymer containing about 90% a-l,4-glucosidic linkages and 10% a- 1,6 linkages. For bacteria in particular, the biosynthesis and storage of glycogen in the form of a-l,4-polyglucans represents an important strategy to cope with transient starvation conditions in the environment.
Glycogen biosynthesis involves the action of several enzymes. For instance, bacterial glycogen biosynthesis occurs generally through the following general steps: (1) formation of glucose-l-phosphate, catalyzed by phosphoglucomutase (Pgm), followed by (2) ADP-glucose synthesis from ATP and glucose 1-phosphate, catalyzed by glucose-l-phosphate adenylyltransferase (GlgC), followed by (3) transfer of the glucosyl moiety from ADP-glucose to a pre-existing a- 1,4 glucan primer, catalyzed by glycogen synthase (GIgA). This latter step of glycogen synthesis typically occurs by utilizing ADP-glucose as the glucosyl donor for elongation of the a-l,4-glucosidic chain.
In bacteria, the main regulatory step in glycogen synthesis takes place at the level of ADP-glucose synthesis, or step (2) above, is the reaction catalyzed by glucose-l-phosphate adenylyltransferase (GlgC), also known as ADP-glucose pyrophosphorylase (see, e.g., Ballicora et al., Microbiology and Molecular Biology Reviews 6:213-225, 2003). In contrast, the main regulatory step in mammalian glycogen synthesis occurs at the level of glycogen synthase. As shown herein, by altering the regulatory and/or other active components in the glycogen synthesis pathway of photosynthetic microorganisms such as Cyanobacteria, and thereby reducing the biosynthesis and storage of glycogen, the carbon that would have otherwise been stored as glycogen can be utilized by said photosynthetic microorganism to synthesize other carbon-based storage molecules, such as lipids, fatty acids, and triglycerides.
Therefore, certain modified photosynthetic microorganisms, e.g., Cyanobacteria, of the present invention may comprise a mutation, deletion, or any other alteration that disrupts one or more of these steps (i.e., renders the one or more steps "non-functional" with respect to glycogen biosynthesis and/or storage), or alters any one or more of the enzymes directly involved in these steps, or the genes encoding them. As noted above, such modified photosynthetic microorganisms, e.g., Cyanobacteria, are typically capable of producing and/or accumulating an increased amount of lipids, such as fatty acids, as compared to a wild type photosynthetic microorganism. Certain exemplary glycogen biosynthesis genes are described below.
Phosphoglucomutase gene (pgm). In one embodiment, a modified photosynthetic microorganism, e.g., a Cyanobacteria, expresses a reduced amount of the phosphoglucomutase gene. In particular embodiments, it may comprise a mutation or deletion in the phosphoglucomutase gene, including any of its regulatory elements (e.g., promoters, enhancers, transcription factors, positive or negative regulatory proteins, etc.). Phosphoglucomutase (Pgm), encoded by the gene pgm, catalyzes the reversible transformation of glucose 1-phosphate into glucose 6-phosphate, typically via the enzyme-bound intermediate, glucose 1,6-biphosphate (see, e.g., Lu et al., Journal of Bacteriology 176:5847-5851, 1994). Although this reaction is reversible, the formation of glucose-6-phosphate is markedly favored.
However, typically when a large amount of glucose-6-phosphate is present, Pgm catalyzes the phosphorylation of the 1-carbon and the dephosphorylation of the c-carbon, resulting in glucose-l-phosphate. The resulting glucose-l-phosphate is then converted to UDP-glucose by a number of intermediate steps, including the catalytic activity of GIgC, which can then be added to a glycogen storage molecule by the activity of glycogen synthase, described below. Thus, under certain conditions, the Pgm enzyme plays an intermediary role in the biosynthesis and storage of glycogen.
The pgm gene is expressed in a wide variety of organisms, including most, if not all, Cyanobacteria. The pgm gene is also fairly conserved among Cyanobacteria, as can be appreciated upon comparison of SEQ ID NOs:24 (5. elongatus PCC7942), 25 (Synechocystis sp. PCC6803), and 26 {Synechococcus sp. WH8102), 79 {Synechococcus RCC307), and 80 (Synechococcus 7002),which provide the polynucleotide sequences of various pgm genes from Cyanobacteria.
Deletion of the pgm gene in Cyanobacteria, such as Synechococcus, has been demonstrated herein for the first time to reduce the accumulation of glycogen in said Cyanobacteria, and also to increase the production of other carbon-based products, such as lipids and fatty acids.
Glucose-l-Phosphate Adenylyltransferase (glgC). In one embodiment, a modified photosynthetic microorganism, e.g., a Cyanobacteria, expresses a reduced amount of a glucose-l-phosphate adenylyltransferase (glgC) gene. In certain embodiments, it may comprise a m utation or deletion in the glgC gene, including any of its regulatory elements. The enzyme encoded by the glgC gene (e.g., EC 2.7.7.27) participates generally in starch, glycogen and sucrose metabolism by catalyzing the following chemical reaction:
ATP + alpha-D-glucose 1-phosphate diphosphate + ADP-glucose
Thus, the two substrates of this enzyme are ATP and alpha-D-glucose 1-phosphate, whereas its two products are diphosphate and ADP-glucose. The g/gC-encoded enzyme catalyzes the first committed and rate-limiting step in starch biosynthesis in plants and glycogen biosynthesis in bacteria. It is the enzymatic site for regulation of storage polysaccharide
accum ulation in plants and bacteria, being allosterically activated or inhibited by metabolites of energy flux.
The enzyme encoded by the glgC gene belongs to a family of transferases, specifically those transferases that transfer phosphorus-containing nucleotide groups (i.e., nucleotidyl-transferases). The systematic name of this enzyme class is typically referred to as ATP :alpha-D-glucose-l-phosphate adenylyltransferase. Other names in common use include ADP glucose pyrophosphorylase, glucose 1-phosphate adenylyltransferase, adenosine diphosphate glucose pyrophosphorylase, adenosine diphosphoglucose pyrophosphorylase, ADP-glucose pyrophosphorylase, ADP-glucose synthase, ADP-glucose synthetase, ADPG pyrophosphorylase, and ADP:alpha-D-glucose-l-phosphate adenylyltransferase.
The glgC gene is expressed in a wide variety of plants and bacteria, including most, if not all, Cyanobacteria. The glgC gene is also fairly conserved among Cyanobacteria, as can be appreciated upon comparison of SEQ ID NOs:27 (5. elongatus PCC7942), 28 (Synechocystis sp. PCC6803), 29 {Synechococcus sp. PCC 7002), 30 {Synechococcus sp. WH8102), 31 (Synechococcus sp. RCC 307), 32 (Trichodesmium erythraeum I MS 101), 33 (Anabaena varibilis), and 34 (Nostoc sp. PCC 7120), which describe the polynucleotide seq uences of various glgC genes from Cyanobacteria.
Deletion of the glgC gene in Cyanobacteria, such as Synechococcus, reduces the accumulation of glycogen in said Cyanobacteria, and increases the production of other carbon-based products, such as lipids and fatty acids.
Glycogen Synthase (glgA). In one embodiment, a modified photosynthetic microorganism, e.g., a Cyanobacteria, expresses a reduced amount of a glycogen synthase gene. In particular embodiments, it may comprise a deletion or mutation in the glycogen synthase gene, including any of is regulatory elements. Glycogen synthase (GlgA), also known as UDP-glucose-glycogen glucosyltransferase, is a glycosyltransferase enzyme that catalyses the reaction of UDP-glucose and (l,4-a-D-glucosyl)n to yield UDP and (l,4-a-D-glucosyl)n+i. Glycogen synthase is an a-retaining glucosyltransferase that uses ADP-glucose to incorporate additional glucose monomers onto the growing glycogen polymer. Essentially, GlgA catalyzes
the final step of converting excess glucose residues one by one into a polymeric chain for storage as glycogen.
Classically, glycogen synthases, or a-l,4-glucan synthases, have been divided into two families, animal/fungal glycogen synthases and bacterial/plant starch synthases, according to differences in seq uence, sugar donor specificity and regulatory mechanisms. However, detailed sequence analysis, predicted secondary structure comparisons, and threading analysis show that these two families are structurally related and that some domains of animal/fungal synthases were acquired to meet the particular regulatory requirements of those cell types.
Crystal structures have been established for certain bacterial glycogen synthases (see, e.g., Buschiazzo et al., The EMBO Journal 23, 3196-3205, 2004). These structures show that reported glycogen synthase folds into two Rossmann-fold domains organized as in glycogen phosphorlyase and other glycosyltransferases of the glycosyltransferases superfamily, with a deep fissure between both domains that includes the catalytic center. The core of the N-terminal domain of this glycogen synthase consists of a nine-stranded, predominantly parallel, central |J -sheet flanked on both sides by seven a-helices. The C-terminal domain (residues 271-456) shows a similar fold with a six-stranded parallel β-sheet and nine a-helices. The last a-helix of this domain undergoes a kink at position 457-460, with the final 17 residues of the protein (461-477) crossing over to the N-terminal domain and continuing as a-helix, a typical feature of glycosyltransferase enzymes.
These structures also show that the overall fold and the active site architecture of glycogen synthase are remarkably similar to those of glycogen phosphorylase, the latter playing a central role in the mobilization of carbohydrate reserves, indicating a common catalytic mechanism and comparable substrate-binding properties. In contrast to glycogen phosphorylase, however, glycogen synthase has a much wider catalytic cleft, which is predicted to undergo an important interdomain 'closure' movement during the catalytic cycle.
Crystal structures have been established for certain GlgA enzymes (see, e.g., Jin et al., EMBO J 24:694-704, 2005, incorporated by reference). These studies show that the N-terminal catalytic domain of GlgA resembles a dinucleotide-binding Rossmann fold and the C-terminal domain adopts a left-handed parallel beta helix that is involved in cooperative allosteric
regulation and a unique oligomerization. Also, comm unication between the regulator-binding sites and the active site involves several distinct regions of the enzyme, including the N-terminus, the glucose-l-phosphate-binding site, a nd the ATP-binding site.
The glgA gene is expressed in a wide variety of cells, including animal, plant, fungal, and bacterial cells, including most, if not all, Cyanobacteria. The glgA gene is also fairly conserved among Cyanobacteria, as can be appreciated upon com parison of SEQ I D NOs:35 (5. elongatus PCC7942), 36 (Synechocystis sp. PCC6803), 37 (Synechococcus sp. PCC 7002), 38 (Synechococcus sp. WH8102), 39 (Synechococcus sp. RCC 307), 40 (Trichodesmium erythraeum IMS 101), 41 (Anabaena variabilis), and 42 (Nostoc sp. PCC 7120), which describe the polynucleotide sequences of various glgA genes from Cyanobacteria.
Glycogen Breakdown. In certain embodiments, a modified photosynthetic microorganism of the present invention expresses an increased amount of one or more polypeptides associated with a glycogen breakdown pathway. In particular embodiments, said one or more polypeptides include glycogen phosphorylase (GlgP), glycogen isoamylase (GlgX), glucanotransferase (MalQ), phosphoglucomutase (Pgm), glucokinase (Glk), and/or phosphoglucose isomerase (Pgi), or a functional fragment or variant thereof, including, for example, those provided in SEQ ID NOs:68, 70, 72, 73, 83 or 85. Examples of additional Pgm polypeptide sequences useful according to the present invention are provided in SEQ ID NOs:74, 76, 77, 79, and 81. Pgm, Glk, and Pgi are bidirectional enzymes that can promote glycogen synthesis or breakdown depending on conditions.
(iv) Polypeptide Variants and Fragments
As noted above, embodiments of the present invention include variants and fragments of any of the reference polypeptides and polynucleotides described herein (see, e.g., the Sequence Listing). Variant polypeptides are biologically active, that is, they continue to possess the enzymatic activity of a reference polypeptide. Such variants may result from, for example, genetic polymorphism and/or from human manipulation.
Biologically active variants of a reference polypeptide will have at least 40%, 50%, 60%, 70%, generally at least 75%, 80%, 85%, usually about 90% to 95% or more, and typically about
97% or 98% or more sequence similarity or seq uence identity to the amino acid sequence for a reference protein as determined by sequence alignment programs described elsewhere herein using default parameters. A biologically active variant of a reference polypeptide may differ from that protein generally by as much 200, 100, 50 or 20 amino acid residues or suitably by as few as 1-15 amino acid residues, as few as 1-10, such as 6-10 amino acid residues, including about 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, or even 1 amino acid residues. In some embodiments, a variant polypeptide differs from the reference seq uences referred to herein (see, e.g., the Sequence Listing) by at least one but by less than 15, 10 or 5 amino acid residues. I n other embodiments, it differs from the reference seq uences by at least one residue but less than 20%, 15%, 10% or 5% of the residues.
A biologically active fragment can be a polypeptide fragment which is, for example, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29,30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 220, 240, 260, 280, 300, 320, 340, 360, 380, 400, 450, 500, 600 or more contiguous amino acids, including all integers in between, of a reference polypeptide sequence.
A reference polypeptide may be altered in various ways including amino acid substitutions, deletions, truncations, and insertions. Methods for such manipulations are generally known in the art. For example, amino acid sequence variants of a reference polypeptide can be prepared by mutations in the DNA. Methods for mutagenesis and nucleotide seq uence alterations are well known in the art. See, for example, Kunkel (PNAS USA. 82: 488-492, 1985); Kunkel et al., (Methods in Enzymol. 154: 367-382, 1987), U.S. Pat. No. 4,873,192, Watson, J. D. et al., ("Molecular Biology of the Gene," Fourth Edition, Benjamin/Cummings, Menlo Park, Calif., 1987) and the references cited therein. Guidance as to appropriate amino acid substitutions that do not affect biological activity of the protein of interest may be found in the model of Dayhoff et al., (1978) Atlas of Protein Seq uence and Structure (Natl. Biomed. Res. Found., Washington, D.C.).
Methods for screening gene products of combinatoria l libraries made by point mutations or truncation, and for screening cDNA libraries for gene products having a selected property are known in the art. Such methods are adaptable for rapid screening of the gene
libraries generated by combinatorial mutagenesis of reference polypeptides. Recursive ensemble mutagenesis (REM), a technique which enhances the frequency of functional mutants in the libraries, can be used in combination with the screening assays to identify polypeptide variants (Arkin and Yourvan, PNAS USA 89: 7811-7815, 1992; Delgrave et al., Protein Engineering. 6: 327-331, 1993). Conservative substitutions, such as exchanging one amino acid with another having similar properties, may be desirable as discussed in more detail below.
Polypeptide variants may contain conservative amino acid substitutions at various locations along their sequence, as compared to a reference amino acid sequence. A "conservative amino acid substitution" is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art, which can be generally sub-classified as follows:
Acidic: The residue has a negative charge due to loss of H ion at physiological pH and the residue is attracted by aqueous solution so as to seek the surface positions in the conformation of a peptide in which it is contained when the peptide is in aqueous medium at physiological pH. Amino acids having an acidic side chain include glutamic acid and aspartic acid.
Basic: The residue has a positive charge due to association with H ion at physiological pH or within one or two pH units thereof (e.g., histidine) and the residue is attracted by aq ueous solution so as to seek the surface positions in the conformation of a peptide in which it is contained when the peptide is in aq ueous medium at physiological pH. Amino acids having a basic side chain include arginine, lysine and histidine.
Charged: The residues are charged at physiological pH and, therefore, include amino acids having acidic or basic side chains (i.e., glutamic acid, aspartic acid, arginine, lysine and histidine).
Hydrophobic: The residues are not charged at physiological pH and the residue is repelled by aq ueous solution so as to seek the inner positions in the conformation of a peptide in which it is contained when the peptide is in aqueous medium. Amino acids having a hydrophobic side chain include tyrosine, valine, isoleucine, leucine, methionine, phenylalanine and tryptophan.
Neutral/polar: The residues are not charged at physiological pH, but the residue is not sufficiently repelled by aqueous solutions so that it would seek inner positions in the conformation of a peptide in which it is contained when the peptide is in aqueous medium. Amino acids having a neutral/polar side chain include asparagine, glutamine, cysteine, histidine, serine and threonine.
This description also characterizes certain amino acids as "small" since their side chains are not sufficiently large, even if polar groups are lacking, to confer hydrophobicity. With the exception of proline, "small" amino acids are those with four carbons or less when at least one polar group is on the side chain and three carbons or less when not. Amino acids having a small side chain include glycine, serine, alanine and threonine. The gene-encoded secondary amino acid proline is a special case due to its known effects on the secondary conformation of peptide chains. The structure of proline differs from all the other naturally-occurring amino acids in that its side chain is bonded to the nitrogen of the a-amino group, as well as the a-carbon. Several amino acid similarity matrices (e.g., PAM 120 matrix and PAM250 matrix as disclosed for example by Dayhoff et al., (1978), A model of evolutionary change in proteins. Matrices for determining distance relationships In M . 0. Dayhoff, (ed.), Atlas of protein sequence and structure, Vol. 5, pp. 345-358, National Biomedical Research Foundation, Washington DC; and by Gonnet et al., (Science. 256: 14430-1445, 1992), however, include proline in the same group as glycine, serine, alanine and threonine. Accordingly, for the purposes of the present invention, proline is classified as a "small" amino acid.
The degree of attraction or repulsion req uired for classification as polar or nonpolar is arbitrary and, therefore, amino acids specifically contem plated by the invention have been classified as one or the other. Most amino acids not specifically named can be classified on the basis of known behaviour.
Amino acid residues can be further sub-classified as cyclic or non-cyclic, and aromatic or non-aromatic, self-explanatory classifications with respect to the side-chain substituent groups of the residues, and as small or large. The residue is considered small if it contains a total of four carbon atoms or less, inclusive of the carboxyl carbon, provided an additional polar substituent is present; three or less if not. Small residues are, of course, always non-aromatic.
Dependent on their structural properties, amino acid residues may fall in two or more classes. For the naturally-occurring protein amino acids, sub-classification according to this scheme is presented in Table A.
Table A. Amino acid sub-classification
Conservative amino acid substitution also includes groupings based on side chains. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids having sulphur-containing side chains is cysteine and methionine. For example, it is reasonable to expect that replacement of a leucine with an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar replacement of an amino acid with a structurally related amino acid will not have a major effect on the properties of the resulting variant polypeptide. Whether an amino acid change results in a functional truncated and/or variant polypeptide can readily be determined by assaying its enzymatic activity, as described herein. Conservative substitutions are shown in Table B under the heading of exemplary substitutions. Amino acid substitutions falling within the scope of the invention, are, in general, accomplished by selecting substitutions that do not differ significantly in their
effect on maintaining (a) the structure of the peptide backbone in the area of the substitution, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. After the substitutions are introduced, the variants are screened for biological activity.
Table B. Exemplary Amino Acid Substitutions
Alternatively, similar amino acids for making conservative substitutions can be grouped into three categories based on the identity of the side chains. The first group includes glutamic acid, aspartic acid, arginine, lysine, histidine, which all have charged side chains; the second group includes glycine, serine, threonine, cysteine, tyrosine, glutamine, asparagine; and the third group includes leucine, isoleucine, valine, alanine, proline, phenylalanine, tryptophan, methionine, as described in Zubay, G., Biochemistry, third edition, Wm.C. Brown Publishers (1993).
Thus, a predicted non-essential amino acid residue in reference polypeptide is typically replaced with another amino acid residue from the same side chain family. Alternatively, mutations can be introduced randomly along all or part of a coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for an activity of the parent polypeptide to identify mutants which retain that activity. Following mutagenesis of the coding sequences, the encoded peptide can be expressed recombinantly and the activity of the peptide can be determined. A "non-essential" amino acid residue is a residue that can be altered from the wild-type sequence of an embodiment polypeptide without abolishing or substantially altering one or more of its activities. Suitably, the alteration does not substantially abolish one of these activities, for example, the activity is at least 20%, 40%, 60%, 70% or 80% 100%, 500%, 1000% or more of wild-type. An "essential" amino acid residue is a residue that, when altered from the wild-type sequence of a reference polypeptide, results in abolition of an activity of the parent molecule such that less than 20% of the wild-type activity is present. For example, such essential amino acid residues may include those that are conserved in reference polypeptides across different species, including those sequences that are conserved in the enzymatic sites of reference polypeptides from various sources.
Accordingly, the present invention also contemplates variants of the naturally-occurring reference polypeptide sequences or their biologically-active fragments, wherein the variants are distinguished from the naturally-occurring sequence by the addition, deletion, or substitution of one or more amino acid residues. In general, as noted above, variants will display at least about 30, 40, 50, 55, 60, 65, 70, 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99% similarity or sequence identity to a reference polypeptide sequence. Moreover, sequences differing from the native or parent sequences by the addition, deletion, or substitution of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 30, 40, 50, 60, 70, 80, 90, 100 or more amino acids but which retain the properties or activities of a parent or reference polypeptide sequence are contemplated.
In some embodiments, variant polypeptides differ from reference sequence by at least one but by less than 50, 40, 30, 20, 15, 10, 8, 6, 5, 4, 3 or 2 amino acid residue(s). In other embodiments, variant polypeptides differ from a reference sequence by at least 1% but less
than 20%, 15%, 10% or 5% of the residues. (If this comparison requires alignment, the sequences should be aligned for maximum similarity. "Looped" out sequences from deletions or insertions, or mismatches, are considered differences.)
Calculations of sequence similarity or seq uence identity between sequences (the terms are used interchangeably herein) are performed as follows. To determine the percent identity of two amino acid sequences, or of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid seq uence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). I n certain embodiments, the length of a reference sequence aligned for comparison purposes is at least 30%, preferably at least 40%, more preferably at least 50%, 60%, and even more preferably at least 70%, 80%, 90%, 100% of the length of the reference sequence. The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second seq uence, then the molecules are identical at that position.
The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal a lignment of the two sequences.
The comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm. In a preferred embodiment, the percent identity between two amino acid seq uences is determined using the Needleman and Wunsch, (J. Mol. Biol. 48: 444-453, 1970) algorithm which has been incorporated into the GAP program in the GCG software package, using either a Blossum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent identity between two nucleotide seq uences is determined using the GAP program in the GCG software package, using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. A particularly preferred set of parameters (and the one that should be used unless otherwise
specified) are a Blossum 62 scoring matrix with a gap penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5.
The percent identity between two amino acid or nucleotide seq uences can be determined using the algorithm of E. Meyers and W. Miller (Cabios. 4:11-17, 1989) which has been incorporated into the ALIGN program (version 2.0), using a PAM 120 weight residue table, a gap length penalty of 12 and a gap penalty of 4.
The nucleic acid and protein sequences described herein can be used as a "q uery sequence" to perform a search against public databases to, for example, identify other family members or related sequences. Such searches can be performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al., (1990, J. Mol. Biol, 215: 403-10). BLAST nucleotide searches can be performed with the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences homologous to nucleic acid molecules of the invention. BLAST protein searches can be performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino acid sequences homologous to protein molecules of the invention. To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al., {Nucleic Acids Res. 25: 3389-3402, 1997). When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used.
I n one embodiment, as noted above, polynucleotides and/or polypeptides can be evaluated using a BLAST alignment tool. A local alignment consists simply of a pair of seq uence segments, one from each of the sequences being compared. A modification of Smith-Waterman or Sellers algorithms will find all segment pairs whose scores cannot be improved by extension or trimming, called high-scoring segment pairs (HSPs). The results of the BLAST alignments include statistical measures to indicate the likelihood that the BLAST score can be expected from chance alone.
The raw score, S, is calculated from the number of gaps and substitutions associated with each aligned sequence wherein higher similarity scores indicate a more significant alignment. Substitution scores are given by a look-up table (see PAM, BLOSUM).
Gap scores are typically calculated as the sum of G, the gap opening penalty and L, the gap extension penalty. For a gap of length n, the gap cost would be G+Ln. The choice of gap costs, G and L is empirical, but it is customary to choose a high value for G (10-15), e.g., 11, and a low value for L (1-2) e.g., 1.
The bit score, S', is derived from the raw alignment score S in which the statistical properties of the scoring system used have been taken into account. Bit scores are normalized with respect to the scoring system, therefore they can be used to compare alignment scores from different searches. The terms "bit score" and "similarity score" are used interchangeably. The bit score provides an indication of how good the alignment is; (i.e., the higher the score, the better the alignment).
The E-Value, or expected value, describes the likelihood that a sequence with a similar score will occur in the database by chance. It is a prediction of the number of different alignments with scores equivalent to or better than S that are expected to occur in a database search by chance. The smaller the E-Value, the more significant the alignment. For example, an alignment having an E value of e 117 means that a seq uence with a similar score is very unlikely to occur simply by chance. Additionally, the expected score for aligning a random pair of amino acids is required to be negative, otherwise long alignments would tend to have high score independently of whether the segments aligned were related. Additionally, the BLAST algorithm uses an appropriate substitution matrix, nucleotide or amino acid and for gapped alignments uses gap creation and extension penalties. For example, BLAST alignment and comparison of polypeptide sequences are typically done using the BLOSUM62 matrix, a gap existence penalty of 11 and a gap extension penalty of 1.
I n one embodiment, sequence similarity scores are reported from BLAST analyses done using the BLOSUM62 matrix, a gap existence penalty of 11 and a gap extension penalty of 1.
I n a particular embodiment, sequence identity/similarity scores provided herein refer to the value obtained using GAP Version 10 (GCG, Accelrys, San Diego, Calif.) using the following parameters: % identity and % similarity for a nucleotide seq uence using GAP Weight of 50 and Length Weight of 3, and the nwsgapdna.cmp scoring matrix; % identity and % similarity for an amino acid sequence using GAP Weight of 8 and Length Weight of 2, and the BLOSUM62
scoring matrix (Henikoff and Henikoff, PNAS USA. 89:10915-10919, 1992). GAP uses the algorithm of Needleman and Wunsch (J Mol Biol. 48:443-453, 1970) to find the alignment of two complete sequences that maximizes the number of matches and minimizes the number of gaps.
I n one particular embodiment, the variant polypeptide comprises an amino acid sequence that can be optimally aligned with a reference polypeptide sequence (see, e.g., Sequence Listing) to generate a BLAST bit scores or sequence similarity scores of at least about 50, 60, 70, 80, 90, 100, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 980, 990, 1000, or more, including all integers and ranges in between, wherein the BLAST alignment used the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1.
Variants of a reference polypeptide can be identified by screening combinatorial libraries of mutants of a reference polypeptide. Libraries or fragments e.g., N terminal, C terminal, or internal fragments, of protein coding sequence can be used to generate a variegated population of fragments for screening and subsequent selection of variants of a reference polypeptide.
Methods for screening gene products of combinatoria l libraries made by point mutation or truncation, and for screening cDNA libraries for gene products having a selected property are known in the art. Such methods are adaptable for ra pid screening of the gene libraries generated by combinatorial mutagenesis of polypeptides.
The present invention also contemplates the use of chimeric or fusion proteins of the reference polypeptides described herein. As used herein, a "chimeric protein" or "fusion protein" includes a reference polypeptide, or a polypeptide fragment linked to either another reference polypeptide (e.g., to create multiple fragments), to a non-reference polypeptide, or to both. In certain embodiments, a reference polypeptide can be fused to a heterologous polypeptide seq uence. A "heterologous polypeptide" typically has an amino acid sequence
corresponding to a protein which is different from the reference protein sequence, and which can be derived from the same or a different organism. The reference polypeptide of the fusion protein can correspond to all or a portion of a biologically active amino acid sequence.
In certain embodiments, a fusion protein includes at least one or two biologically active portions of reference protein. The polypeptides forming the fusion protein are typically linked C-terminus to N-terminus, although they can also be linked C-terminus to C-terminus, N-terminus to N-terminus, or N-terminus to C-terminus. The polypeptides of the fusion protein can be in any order.
The fusion partner may be designed and included for essentially any desired purpose provided they do not adversely affect the enzymatic activity of the polypeptide. For example, in one embodiment, a fusion partner may comprise a sequence that assists in expressing the protein (an expression enhancer) at higher yields than the native recombinant protein. Other fusion partners may be selected so as to increase the solubility or stability of the protein or to enable the protein to be targeted to desired intracellular compartments.
The fusion protein can include a moiety which has a high affinity for a ligand. For example, the fusion protein can be a GST-fusion protein in which the reference polypeptide sequences are fused to the C-terminus of the GST sequences. Such fusion proteins can facilitate the purification and/or identification of the resulting polypeptide. Alternatively, the fusion protein can be reference polypeptide containing a heterologous signal sequence at its N-terminus. In certain host cells, expression and/or secretion of such proteins can be increased through use of a heterologous signal sequence.
Fusion proteins may generally be prepared using standard techniques, described elsewhere herein. A peptide linker sequence may be employed to separate the first and second polypeptide components by a distance sufficient to ensure that each polypeptide folds into its secondary and tertiary structures, if desired. Exemplary peptide linkers are described elsewhere herein.
Polynucleotides and Vectors
Embodiments of the present invention include polynucleotides encoding a DGAT fusion protein described herein, the fusion protein comprising at least one DGAT polypeptide fused to at least one intracellular localization domain, such as a bacteria membrane- or bacterial plasma membrane (PM)-targeting domain. Such polynucleotides can be partially or fully isolated from other cellular components, within a vector, for exa mple, a composition comprising such a vector (e.g., in a tube or kit), or in a host cell, such as modified photosynthetic microorganism.
These polynucleotides and modified photosynthetic microorganisms comprising the same may optionally comprise one or more (introduced) polynucleotides encoding a lipid biosynthesis protein, and/or one or more (introduced) polynucleotides encoding a polypeptide associated with glycogen breakdown.
Also included are nucleotide sequences that encode any functional naturally-occurring variants or fragments (e.g., allelic variants, orthologs, splice variants) or non-naturally occurring variants or fragments of these native polynucleotides (i.e., optimized by engineering), as well as com positions comprising such polynucleotides, including, for example, cloning and expression vectors.
Also, the modified photosynthetic microorganisms described herein may optionally com prise a mutation or deletion in one or more genes associated with glycogen biosynthesis or storage, alone or in combination with the presence of overexpressed proteins associated with lipid biosynthesis proteins and/or glycogen breakdown. Certain modified photosynthetic microorganisms, for example, for the production of wax esters, may optionally comprise a mutation or deletion in or more genes encoding an aldehyde decarbonylase, an aldehyde dehydrogenase, or both, either alone or in combination with the presence of overexpressed proteins associated with lipid biosynthesis proteins and/or glycogen breakdown.
The recitations "mutation" or "deletion," in this context refer generally to those changes or alterations in a photosynthetic microorganism, e.g., a Cyanobacterium, that render the product of that gene non-functional or having reduced function. Examples of such changes or alterations include nucleotide substitutions, deletions, or additions/insertions to the coding or regulatory sequences of a targeted gene (e.g., glgA, glgC, pgm, aldehyde decarbonylase, aldehyde dehydrogenase), in whole or in part, which disrupt, eliminate, down-regulate, or
significantly reduce the expression of the polypeptide encoded by that gene, whether at the level of transcription, translation, post-translational modification, or protein stability. Such alterations can also reduce the enzymatic activity or other functional characteristic of the protein (e.g., localization), with or without reducing expression.
Techniq ues for producing such alterations or changes, such as by recombination with a vector having a selectable marker, are exemplified herein and known in the molecular biological art. I n particular embodiments, one or more alleles of a gene, e.g., two or all alleles, may be mutated or deleted within a photosynthetic microorganism. In particular embodiments, modified photosynthetic microorganisms, e.g., Cyanobacteria, of the present invention are merodiploids or partial diploids.
The "deletion" of a targeted gene or polypeptide may also be accomplished by targeting the m RNA of that gene, such as by using various antisense technologies (e.g., antisense oligonucleotides and siRNA) known in the art. Accordingly, targeted genes may be considered "non-functional" when the polypeptide or enzyme encoded by that gene is not expressed by the modified photosynthetic microorganism, or is expressed in negligible amounts.
As used herein, the terms "DNA" and "polynucleotide" and "nucleic acid" include a DNA molecule that has been isolated free of total genomic DNA of a particular species. Therefore, a DNA segment encoding a polypeptide refers to a DNA segment that contains one or more coding seq uences yet is substantially isolated away from, or purified free from, total genomic DNA of the species from which the DNA segment is obtained. Included within the terms "DNA segment" and "polynucleotide" are DNA segments and smaller fragments of such segments, and also recombinant vectors, including, for example, plasmids, cosmids, phagemids, phage, viruses, and the like.
As will be understood by those skilled in the art, the polynucleotide sequences of this invention can include genomic seq uences, extra-genomic and plasmid-encoded sequences and smaller engineered gene segments that express, or may be adapted to express, proteins, polypeptides, peptides and the like. Such segments may be naturally isolated, or modified synthetically by the hand of man.
Polynucleotides may be single-stranded (coding or a ntisense) or double-stranded, and may be DNA (genomic, cDNA or synthetic) or RNA molecules. Additional coding or non-coding sequences may, but need not, be present within a polynucleotide of the present invention, and a polynucleotide may, but need not, be linked to other molecules and/or support materials.
Polynucleotides may comprise a native seq uence (e.g., an endogenous sequence that encodes protein described herein) or may comprise a variant or fragment, or a biological functional eq uivalent of such a sequence. Polynucleotide variants may contain one or more substitutions, additions, deletions and/or insertions, as further described herein, preferably such that the enzymatic activity of the encoded polypeptide is not substantially diminished relative to the unmodified or reference polypeptide. The effect on the enzymatic activity of the encoded polypeptide may generally be assessed as described herein and known in the art.
(i) Intracellular Localization Domain-DGA T Fusion Polynucleotides
Embodiments of the present invention include polynucleotides (e.g., isolated polynucleotides) that encode any of the intracellular localization domain-DGAT fusion proteins described herein, such as the membrane-targeting domain-DGAT fusion proteins. These polynucleotides comprise at least one seq uence encoding a heterologous intracellular localization domain described herein, which is fused in-frame to at least one sequence encoding a DGAT polypeptide, or an active fragment or variant thereof.
Certain embodiments thus include polynucleotides that encode any one or more of the intracellular localization domains described herein, where such polynucleotide(s) are fused in-frame to a DGAT-encoding polynucleotide. Exemplary seq uences that encode a membrane-targeting domain can be found within SEQ ID NOs:200 or 202, the respective PCC7942-0858 and PCC7942-1015 coding sequences of two methyl-accepting chemotaxis (MCP) proteins from 5. elongatus. For instance, in specific embodiments, the polynucleotide sequence may include about the N-terminal 129 nucleotides of SEQ ID NO:200 (PCC7942-0858), which encodes the leader sequence of the MCP protein encoded by the PCC7942-0858 gene; additional sequences can also be included, for instance, about the N-terminal 132, 135, 138,
141, 144, 147, 150, 153, 156, 159, 162, 165, 168, 171, 174, 177, 180 or more nucleotides of SEQ ID NO:200 (PCC7942-0858).
Also included are polynucleotides that encode any one or more of the DGAT polypeptides described herein, where such polynucleotide(s) are fused in-frame to a heterologous intracellular localization domain-encoding polynucleotide. In specific embodiments, a DGAT-encoding portion of the fusion protein encodes a DGAT comprising or consisting of a polypeptide sequence set forth in any one of SEQ ID NOs:58, 59, 60 or 61, or a fragment or variant thereof. SEQ ID NO:58 is the sequence of DGATn; SEQ ID NO: 59 is the sequence of Streptomyces coelicolor DGAT (ScoDGAT or SDGAT); SEQ ID NO:60 is the sequence of Alcanivorax borkumensis DGAT (AboDGAT); and SEQ ID NO:61 is the sequence of DGATd (Acinetobacter baylii sp.).
In certain embodiments, a DGAT-encoding portion of the fusion protein comprises or consists of a polynucleotide sequence set forth in any one of SEQ ID NOs:62, 63, 64, 65 or 66, or a fragment or variant thereof. SEQ ID NO:62 is a codon-optimized for expression in Cyanobacteria sequence that encodes DGATn; SEQ ID NO:63 has homology to SEQ ID NO:62; SEQ ID NO:64 is a codon-optimized for expression in Cyanobacteria sequence that encodes ScoDGAT; SEQ ID NO:65 is a codon-optimized for expression in Cyanobacteria sequence that encodes AboDGAT; and SEQ ID NO:66 is a codon-optimized for expression in Cyanobacteria sequence that encodes DGATd. DGATn and DGATd correspond to Acinetobacter baylii DGAT and a modified form thereof, which includes two additional amino acid residues immediately following the initiator methionine.
(ii) Lipid Biosynthesis Genes
In certain embodiments, a modified photosynthetic microorganism comprises an introduced polynucleotide that encodes one or more lipid biosynthesis proteins. In some instances, a modified photosynthetic microorganism comprises an endogenous polynucleotide that encodes a lipid biosynthesis gene, where a regulatory element such as a promoter is introduced upstream of that polynucleotide to regulate or alter expression of the encoded protein.
I n particular embodiments, a modified photosynthetic microorganism comprises reduced or eliminated expression or activity of a lipid biosynthesis polypeptide. Included are full or partial deletions, and point mutations or insertions of an endogenous lipid biosynthesis gene that reduce or eliminate expression and/or activity of the encoded polypeptide.
Exemplary lipid biosynthesis genes encode polypeptides such as acyl carrier proteins
(ACP), acyl ACP synthases (Aas), acyl-ACP reductases, alcohol dehydrogenases, aldehyde dehydrogenases, aldehyde decarbonylases, thioesterases (TES), acetyl coenzyme A carboxylases (ACCase), phosphatidic acid phosphatases (PAP; or phosphatidate phosphatases), triacylglycerol (TAG) hydrolases, fatty acyl-CoA synthetases, and lipases/phospholipases, as described herein.
Acyl Carrier Proteins. I n certain embodiments, a modified photosynthetic microorganism comprises one or more polynucleotides encoding one or more acyl carrier proteins (ACP). Exemplary ACP nucleotide seq uences include SEQ I D NO:5 from Synechococcus elongatus PCC7942, SEQ I D NOS:7, 9, and 11 from Acinetobacter sp. ADP1, and SEQ I D NO:13 from Spinacia oleracea.
Acyl ACP Synthases (Aas). I n certain embodiments, a modified photosynthetic microorganism comprises one or more polynucleotides encoding one or more acyl-ACP synthetase (Aas) enzymes. In certain embodiments, the Aas nucleotide sequence is derived from the Se918 gene of Synechococcus elongatus. One exem plary Aas sequence nucleotide sequence is SEQ I D NO:43 from Synechococcus elongatus PCC 7942.
I n particular embodiments, a modified photosynthetic microorganism of the present invention has a mutation such as a point mutation, insertion, or full or partial deletion of one or more endogenous Aas genes, for instance, the Se918 gene of S. elongatus PCC7942, to reduce or eliminate expression and/or activity of the encoded Aas polypeptide.
Acyl-ACP Reductases. I n certain embodiments, a modified photosynthetic microorganism comprises one or more polynucleotides encoding one or more acyl-ACP reductase polypeptides. Exemplary acyl-ACP reductase nucleotide sequences include orfl594 from Synechococcus elongatus PCC7942 (SEQ ID NO:l), and orfsll0209 from Synechocystis sp. PCC6803 (SEQ ID NO:3).
Alcohol Dehydrogenases. Certain embodiments may employ one or more alcohol dehydrogenase encoding polynucleotide seq uences. Exemplary alcohol dehydrogenases include slrll92 of Synechocystis sp. PCC6803 (SEQ ID NO:104) and ACIAD3612 from Acinetobacter baylii (SEQ I D NO:106).
Aldehyde Dehydrogenases. Certain embodiments may employ one or more aldehyde dehydrogenase encoding polynucleotide seq uences. Certain embodiments, for example, for the production of triglycerides or wax esters, may comprise mutations such as point mutations, insertions, or full or partial deletions of one or more endogenous aldehyde dehydrogenase genes. One exemplary aldehyde dehydrogenase is orf0489 of Synechococcus elongatus PCC7942 (SEQ ID NO:102).
Aldehyde Decarbonylases. Certain embodiments, for example, for the production of triglycerides or wax esters, may comprise mutations such as point mutations, insertions, or full or partial deletions of one or more endogenous aldehyde decarbonylase genes. One example of an aldehyde decarbonylase is encoded by orfl593 in 5. elongatus PCC7942. Another example is an aldehyde decarbonylase encoded by orfsll0208 in Synechocystis sp. PCC6803.
Thioesterases (TES). I n certain embodiments, a modified photosynthetic microorganism comprises one or more polynucleotides encoding one or more thioesterases (TES) including acyl-ACP thioesterases and/or acyl-CoA thioesterases. In certain embodiments, the polynucleotide sequence of the TES encodes a TesA or TesB polypeptide from E. coli, or a cytoplasmic TesA variant (*TesA) having the sequence set forth in SEQ ID NO:121.
I n certain embodiments, the polynucleotide sequence of the TES comprises that of the FatB gene, encoding a FatB enzyme, such as a C8, C12, C14, C16, or C18 FatB enzyme. In certain embodiments, the polynucleotide encodes a thioesterase (e.g., FatB thioesterase), having only thioesterase activity and little or no lysophospholipase activity. I n specific embodiments, the thioesterase is a FatB acyl-ACP thioesterase, which can hydrolyze acyl-ACP but not acyl-CoA. SEQ ID NO:197 is an exemplary nucleotide sequence of a C8/C10 FatB2 thioesterase derived from Cuphea hookeriana, and SEQ ID NO:122 is codon-optimized for expression in Cyanobacteria. SEQ I D NO:123 is an exemplary nucleotide sequence of a C12 FatBl acyl-ACP thioesterase derived from Umbellularia californica, and SEQ ID NO:124 is a codon-optimized
version of SEQ I D NO:123 for optimal expression in Cyanobacteria. SEQ I D NO:126 is an exemplary nucleotide sequence of a C14 FatBl thioesterase derived from Cinnamomum camphora, and SEQ:125 is a codon-optimized version of SEQ I D NO:126. SEQ I D NO:127 is an exemplary nucleotide sequence of a C16 FatBl thioesterase derived from Cuphea hookeriana, and SEQ ID NO:128 is a codon-optimized version of SEQ ID NO:127. I n certain embodiments, one or more FatB seq uences are operably linked to a strong promoter, such as a Ptrc promoter. I n other embodiments, one or more FatB sequences are operably linked to a relatively weak promoter, such as an arabinose promoter.
Acetyl Coenzyme A carboxylases (ACCase). In certain embodiments, a polynucleotide encodes an acetyl-CoA carboxylase (ACCase) comprising or consisting of a polypeptide sequence set forth in any of SEQ ID NOs:55, 45, 46, 47, 48 or 49, or a fragment or variant thereof. I n particular embodiments, a ACCase polynucleotide comprises or consists of a polynucleotide sequence set forth in any of SEQ I D NOs:56, 57, 50, 51, 52, 53 or 54, or a fragment or variant thereof. SEQ I D NO:55 is the seq uence of Saccharomyces cerevisiae acetyl-CoA carboxylase (yAccl); and SEQ ID NO:56 is a codon-optimized for expression in Cyanobacteria sequence that encodes yAccl. SEQ ID NO:45 is Synechococcus sp. PCC 7002 AccA; SEQ ID NO:46 is Synechococcus sp. PCC 7002 AccB; SEQ ID NO:47 is Synechococcus sp. PCC 7002 AccC; and SEQ I D NO:48 is Synechococcus sp. PCC 7002 AccD. SEQ I D NO:50 encodes Synechococcus sp. PCC 7002 AccA; SEQ ID NO:51 encodes Synechococcus sp. PCC 7002 AccB; SEQ ID NO:52 encodes Synechococcus sp. PCC 7002 AccC; and SEQ ID NO:53 encodes Synechococcus sp. PCC 7002 AccD. SEQ ID NO:49 is a Triticum aestivum ACCase; and SEQ ID NO:54 encodes this Triticum aestivum ACCase.
Phosphatide Acid Phosphatases (PAP). In certain embodiments, a polynucleotide encodes a phosphatidate phosphatase (also referred to as a phosphatidic acid phosphatase; PAP) comprising or consisting of a polypeptide sequence set forth in SEQ I D NO:131, or a fragment or variant thereof. In particular embodiments, a phosphatidate phosphatase polynucleotide comprises or consists of a polynucleotide sequence set forth in SEQ I D NO:129 or SEQ I D NO:130, or a fragment or variant thereof. SEQ ID NO:131 is the sequence of Saccharomyces cerevisiae phosphatidate phosphatase (yPAHl), and SEQ ID NO:129 is a
codon-optimized for expression in Cyanobacteria seq uence that encodes yPAHl. I n certain embodiments, the nucleotide seq uence of the PAP is derived from the E. coli PgpB gene, and/or the PAP gene from Synechocystis sp. PCC6803.
Triacylglycerol (TAG) Hydrolases. Certain embodiments employ one or more TAG hydrolase encoding polynucleotide sequences. Non-limiting examples of TAG hydrolase polynucleotide sequences include SDP1 (SUGAR-DEPENDENT1) triacylglycerol lipase from Arabidopsis thaliana (SEQ ID NO:153), ACIAD1335 from Acinetobacter sp. ADPl (SEQ ID NO:154), TG14P from S. cerevisiae (SEQ I D NO:175), and RHAl_ro04722 (YP_704665) TAG lipase from Rhodococcus (SEQ ID NO:155). Additional polynucleotide sequences for exemplary lipases/esterases include RHAl_ro01602 lipase/esterase from Rhodococcus sp. (see SEQ ID NO:156), and the RHAl_ro06856 lipase/esterase {see SEQ I D NO:119) from Rhodococcus sp.
Fatty Acyl-CoA Synthetases. Certain embodiments employ one or more fatty acyl-CoA synthetase encoding polynucleotide sequences. One exemplary fatty acyl-CoA synthetase includes the FadD gene from E. coli (SEQ ID NO:16) which encodes a fatty acyl-CoA synthetase having substrate specificity for medium and long chain fatty acids. Other exemplary fatty acyl-CoA synthetases include those derived from 5. cerevisiae; for example, the Faalp coding sequence is set forth in SEQ I D NO:18, the Faa2p coding sequence is set forth in SEQ I D NO:20, and the Faa3p is set forth in SEQ ID NO:22. SEQ I D NO:22 is codon-optimized for expression in 5. elongatus PCC7942.
Lipases/Phospholipases. I n certain embodiments, a modified photosynthetic microorganism comprises one or more polynucleotides encoding one or more lipases or phospholipases, including lysophospholipases, or a fragment or variant thereof. In certain embodiments, the encoded lysophospholipase is Lysophospholipase LI (TesA), Lysophospholipase L2, TesB, Vu Patatin 1 protein, or a homolog thereof.
In particular embodiments, the encoded phospholipase, e.g., a lysophospholipase, is a bacterial phospholipase, or a fragment or variant thereof, and the polynucleotide comprises a bacterial phospholipase polynucleotide sequence, e.g., a sequence derived from Escherichia coli, Enterococcus faecalis, or Lactobacillus plantarum. In particular embodiments, the encoded
phospholipase is Lysophospholipase LI (TesA), Lysophospholipase L2, TesB, Vu Patatin 1 protein, or a functional fragment thereof.
In certain embodiments, a lysophospholipase is a bacterial Lysophospholipase LI (TesA) or TesB, such as an E.coli Lysophospholipase LI encoded by a polynucleotide (pldC) having the wild-type sequence set forth in SEQ ID NO:196, or an E. coli TesB encoded by a polynucleotide having the wild-type sequence set forth in SEQ ID NO:132. The polypeptide sequence of E. coli Lysophospholipase LI is provided in SEQ ID NO:133, and the polypeptide sequence of E. coli TesB is provided in SEQ ID NO:134. In other embodiments, a lysophospholipase is a Lysophospholipase L2, such as an E. coli Lysophospholipase L2 encoded by a polynucleotide (pldB) having the wild-type sequence set forth in SEQ ID NO:135, or a Vu patatin 1 protein encoded by a polynucleotide having the wild-type sequence set forth in SEQ ID NO:136. The polypeptide sequence of E. coli Lysophospholipase L2 is provided in SEQ ID NO:137, and the polypeptide sequence of Vu patatin 1 protein is provided in SEQ ID NO:138.
In particular embodiments, the polynucleotide encoding the phospholipase variant is modified such that it encodes a phospholipase that localizes predominantly to the cytoplasm instead of the periplasm. For example, it may encode a phospholipase having a deletion or mutation in a region associated with periplasmic localization. In particular embodiments, the encoded phospholipase variant is derived from Lysophospholipase LI (TesA). In certain embodiments, the Lysophospholipase LI (TesA) variant is a bacterial TesA, such as an E.coli Lysophospholipase (TesA) variant encoded by a polynucleotide having the sequence set forth in SEQ ID NO:139. The polypeptide sequence of the Lysophospholipase LI variant is provided in SEQ ID NO:121 (PldC(*TesA)).
Additional examples of phospholipase-encoding polynucleotide sequences include phospholipase Al (PldA) from Acinetobacter sp. ADP1 (SEQ ID NO:140), phospholipase A (PldA) from E. coli (SEQ ID NO:141), phospholipase from Streptomyces coelicolor A3(2) (SEQ ID NO:142), phospholipase A2 (PLA2-a) from Arabidopsis thaliana (SEQ ID NO:143). phospholipase Al/ triacylglycerol lipase (DADl; Defective Anther Dehiscence 1) from Arabidopsis thaliana (SEQ ID NO:144), chloroplast DONGLE from Arabidopsis thaliana (SEQ ID NO:145), patatin-like protein from Arabidopsis thaliana (SEQ ID NO:146), and patatin from Anabaena variabilis ATCC
29413 (SEQ ID NO:147). Additional non-limiting exam ples of lysophospholipase-encoding polynucleotide sequences include phospholipase B (Plblp) from Saccharomyces cerevisiae S288c (SEQ I D NO:148), phospholipase B (Plb2p) from Saccharomyces cerevisiae S288c (SEQ ID NO:149), ACIAD1057 (TesA homolog) from Acinetobacter ADP1 (SEQ I D NO:150), ACIAD1943 lysophospholipase from Acinetobacter ADP1 (SEQ I D NO:151), and a lysophospholipase (YP_702320; RHAl_ro02357) from Rhodococcus (SEQ ID NO:152).
(Hi) Glycogen Biosynthesis, Storage, and Breakdown Genes
Glycogen Biosynthesis and Storage Genes. As noted above, certain embodiments include reduced or eliminated expression and/or activity of one or more polypeptides associated with glycogen biosynthesis and/or storage, for instance, by mutation of one or more genes that encode such polypeptides. Included are full or partial deletions, and point mutations or insertions of one or more glycogen biosynthesis/storage genes that reduce or eliminate expression and/or biological activity of the encoded protein(s). Exemplary genes associated with glycogen synthesis and/or storage include glgC, pgm, and glgA.
Examples of such glgC polynucleotide sequences are provided in SEQ I D NOs:28 {Synechocystis sp. PCC6803), 34 {Nostoc sp. PCC 7120), 33 {Anabaena variabilis), 32 (Trichodesmium erythraeum IMS 101), 27 (Synechococcus elongatus PCC7942), 30 (Synechococcus sp. WH8102), 31 (Synechococcus sp. RCC 307), and 29 (Synechococcus sp. PCC 7002), which respectively encode GlgC polypeptides having sequences set forth in SEQ ID NOs: 86, 87, 88, 89, 90, 91, 92, and 93.
Examples of such pgm polynucleotide sequences are provided in SEQ ID NOs: 25 (Synechocystis sp. PCC6803), 75 (Synechococcus elongatus PCC7942), 26 (Synechococcus sp. WH8102), 78 (Synechococcus RCC307), and 80 (Synechococcus 7002), which respectively encode Pgm polypeptides having sequences set forth in SEQ I D NOs:74, 76, 77, 79 and 81.
Examples of such glgA polynucleotide sequences are provided in SEQ I D NOs:36 (Synechocystis sp. PCC6803), 42 (Nostoc sp. PCC 7120), 41 (Anabaena variabilis), 40 (Trichodesmium erythraeum IMS 101), 35 (Synechococcus elongatus PCC7942), 38 (Synechococcus sp. WH8102), 39 (Synechococcus sp. RCC 307), and 37 (Synechococcus sp. PCC
7002), which respectively encode GlgA polypeptides having sequences set forth in SEQ ID NOs:94, 95, 96, 97, 98, 99, 100 and 101.
Glycogen Breakdown Genes. I n certain embodiments, a modified photosynthetic microorganism comprise one or more polynucleotides encoding one or more polypeptides associated with a glycogen breakdown, or a fragment or variant thereof. In particular embodiments, the one or more polypeptides are glycogen phosphorylase (GIgP), glycogen isoamylase (GIgX), glucanotransferase (MalQ), phosphoglucomutase (Pgm), glucokinase (Glk), and/or phosphoglucose isomerase (Pgi), or a functional fragment or variant thereof.
A representative glgP polynucleotide sequence is provided in SEQ ID NO:67, and a representative GIgP polypeptide sequence is provided in SEQ ID NO:68. A representative glgX polynucleotide sequence is provided in SEQ I D NO:69, and a representative GIgX polypeptide sequence is provided in SEQ I D NO:70. A representative malQ polynucleotide seq uence is provided in SEQ ID NO:71, and a representative MalQ polypeptide sequence is provide in SEQ I D NO:72. A representative phosphoglucomutase (pgm) polynucleotide seq uence is provided in SEQ ID NO:24, and a representative phosphoglucomutase (Pgm) polypeptide sequence is provided in SEQ I D NO:73, with others provided infra (SEQ ID NOs:25, 26, 74-81). A representative glk polynucleotide sequence is provided in SEQ I D NO:82, and a representative Glk polypeptide seq uence is provided in SEQ ID NO:83. A representative pgi polynucleotide sequence is provided in SEQ I D NO:84, and a representative Pgi polypeptide sequence is provided in SEQ ID NO:85.
(iv) Polynucleotide Variants, Fragments, Vectors, and Expression Systems
I n particular embodiments, a polynucleotide comprises one of these polynucleotide sequences, or a fragment or variant thereof, or encodes one of these polypeptide sequences, or a fragment or variant thereof.
Exemplary nucleotide seq uences that encode the proteins and enzymes of the application encompass full-length reference polynucleotides, as well as portions of the full-length or substantially full-length nucleotide sequences of these genes or their transcripts or DNA copies of these transcripts. Portions of a nucleotide seq uence may encode polypeptide
portions or segments that retain the biological activity of the reference polypeptide. A portion of a nucleotide sequence that encodes a biologically active fragment of an enzyme provided herein may encode at least about 20, 21, 22, 23, 24, 25, 30, 40, 50, 60, 70, 80, 90, 100, 120, 150, 200, 300, 400, 500, 600, or more contiguous amino acid residues, almost up to the total number of amino acids present in a full-length enzyme. It will be readily understood that "intermediate lengths," in this context and in all other contexts used herein, means any length between the quoted values, such as 101, 102, 103, etc.; 151, 152, 153, etc.; 201, 202, 203, etc.
The polynucleotides described herein, regardless of the length of the coding sequence itself, may be combined with other DNA seq uences, such as promoters, polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other coding segments, and the like, such that their overall length may vary considerably. It is therefore contemplated that a polynucleotide fragment of almost any length may be employed, with the total length preferably being limited by the ease of preparation and use in the intended recombinant DNA protocol.
The invention also contemplates variants of the reference polynucleotide sequences described herein (see, e.g., the Sequence Listing). Nucleic acid variants can be naturally-occurring, such as allelic variants (same locus), homologs (different locus), and orthologs (different organism) or can be non natura lly-occurring. Naturally occurring variants such as these can be identified and isolated using well-known molecular biology techniq ues including, for example, various polymerase chain reaction (PCR) and hybridization-based techniques as known in the art. Naturally occurring variants can be isolated from any organism that encodes one or more genes having an activity of a reference polypeptide. Embodiments of the present invention, therefore, encompass Cyanobacteria comprising such naturally-occurring polynucleotide variants.
Non-naturally occurring variants can be made by mutagenesis techniques, including those applied to polynucleotides, cells, or organisms. The variants can contain nucleotide substitutions, deletions, inversions and insertions. Variation can occur in either or both the coding and non-coding regions. I n certain aspects, non-naturally occurring variants may have
been optimized for use in Cyanobacteria, such as by engineering and screening the enzymes for increased activity, stability, or any other desirable feature.
The variations can produce both conservative and non-conservative amino acid substitutions (as compared to the originally encoded product). For nucleotide sequences, conservative variants include those sequences that, because of the degeneracy of the genetic code, encode the amino acid sequence of a reference polypeptide. Variant nucleotide sequences also include synthetically derived polynucleotide sequences, such as those generated, for example, by using site-directed mutagenesis but which still encode a biologically active reference polypeptide, as described elsewhere herein. Generally, variants of a particular polynucleotide sequence will have at least about 30%, 40% 50%, 55%, 60%, 65%, 70%, generally at least about 75%, 80%, 85%, 90%, 95% or 98% or more sequence identity to a reference polynucleotide seq uence as determined by sequence alignment programs described elsewhere herein using default parameters.
Known reference polynucleotide sequences (e.g., described herein) can be used to isolate corresponding sequences and alleles from other organisms, particularly other microorganisms. Methods are readily available in the art for the hybridization of nucleic acid sequences. Coding sequences from other organisms may be isolated according to well known techniques based on their sequence identity with the coding sequences set forth herein. In these techniq ues all or part of the known coding sequence is used as a probe which selectively hybridizes to other reference coding seq uences present in a population of cloned genomic DNA fragments or cDNA fragments (i.e., genomic or cDNA libraries) from a chosen organism.
Accordingly, the present invention also contemplates polynucleotides that hybridize to reference nucleotide sequences, or to their complements, under stringency conditions described below. As used herein, the term "hybridizes under low stringency, medium stringency, high stringency, or very high stringency conditions" describes conditions for hybridization and washing. Guidance for performing hybridization reactions can be found in Ausubel et ai, (1998, supra), Sections 6.3.1-6.3.6. Aqueous and non-aq ueous methods are described in that reference and either can be used.
Reference herein to "low stringency" conditions include and encompass from at least about 1% v/v to at least about 15% v/v formamide and from at least about 1 M to at least about 2 M salt for hybridization at 42° C, and at least about 1 M to at least about 2 M salt for washing at 42° C. Low stringency conditions also may include 1% Bovine Serum Albumin (BSA), 1 mM EDTA, 0.5 M NaHP04 (pH 7.2), 7% SDS for hybridization at 65° C, and (i) 2 x SSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM EDTA, 40 mM NaHP04 (pH 7.2), 5% SDS for washing at room temperature. One embodiment of low stringency conditions includes hybridization in 6 x sodium chloride/sodium citrate (SSC) at about 45^C, followed by two washes in 0.2 x SSC, 0.1% SDS at least at 50°C (the temperature of the washes can be increased to 55° C for low stringency conditions).
"Medium stringency" conditions include and encompass from at least about 16% v/v to at least about 30% v/v formamide and from at least about 0.5 M to at least about 0.9 M salt for hybridization at 42° C, and at least about 0.1 M to at least about 0.2 M salt for washing at 55° C. Medium stringency conditions also may include 1% Bovine Serum Albumin (BSA), 1 mM EDTA, 0.5 M NaHP04 (pH 7.2), 7% SDS for hybridization at 65° C, and (i) 2 x SSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM EDTA, 40 mM NaHP04 (pH 7.2), 5% SDS for washing at 60-65° C. One embodiment of medium stringency conditions includes hybridizing in 6 x SSC at about 45^C, followed by one or more washes in 0.2 x SSC, 0.1% SDS at 60°C.
"High stringency" conditions include and encompass from at least about 31% v/v to at least about 50% v/v formamide and from about 0.01 M to about 0.15 M salt for hybridization at 42° C, and about 0.01 M to about 0.02 M salt for washing at 55° C. High stringency conditions also may include 1% BSA, 1 mM EDTA, 0.5 M NaHP04 (pH 7.2), 7% SDS for hybridization at 65° C, and (i) 0.2 x SSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM EDTA, 40 mM NaHP04 (pH 7.2), 1% SDS for washing at a temperature in excess of 65°C. One embodiment of high stringency conditions includes hybridizing in 6 x SSC at about 45^C, followed by one or more washes in 0.2 x SSC, 0.1% SDS at 65°C.
In certain embodiments, a reference polypeptide or enzyme described herein is encoded by a polynucleotide that hybridizes to a disclosed nucleotide sequence under very high stringency conditions. One embodiment of very high stringency conditions includes hybridizing
in 0.5 M sodium phosphate, 7% SDS at 65°C, followed by one or more washes in 0.2 x SSC, 1% SDS at 65° C.
Other stringency conditions are well known in the art and the skilled artisan will recognize that various factors can be manipulated to optimize the specificity of the hybridization. Optimization of the stringency of the final washes can serve to ensure a high degree of hybridization. For detailed examples, see Ausubel et al., supra at pages 2.10.1 to 2.10.16 and Sambrook et al. (1989, supra) at sections 1.101 to 1.104.
While stringent washes are typically carried out at temperatures from about 42°C to 68° C, one skilled in the art will appreciate that other temperatures may be suitable for stringent conditions. Maximum hybridization rate typically occurs at about 20°C to 25°C below the Tm for formation of a DNA-DNA hybrid. It is well known in the art that the Tm is the melting temperature, or temperature at which two complementary polynucleotide sequences dissociate. Methods for estimating Tm are well known in the art (see Ausubel et al., supra at page 2.10.8).
In general, the Tm of a perfectly matched duplex of DNA may be predicted as an approximation by the formula: Tm = 81.5 + 16.6 (logi0 M) + 0.41 (%G+C) - 0.63 (% formamide) - (600/length) wherein: M is the concentration of Na+, preferably in the range of 0.01 molar to 0.4 molar; %G+C is the sum of guanosine and cytosine bases as a percentage of the total number of bases, within the range between 30% and 75% G+C; % formamide is the percent formamide concentration by volume; length is the number of base pairs in the DNA duplex. The Tm of a duplex DNA decreases by approximately 1°C with every increase of 1% in the number of randomly mismatched base pairs. Washing is generally carried out at Tm - 15° C for high stringency, or Tm - 30°C for moderate stringency.
In one example of a hybridization procedure, a membrane (e.g., a nitrocellulose membrane or a nylon membrane) containing immobilized DNA is hybridized overnight at 42° C in a hybridization buffer (50% deionized formamide, 5 x SSC, 5 x Reinhardt's solution (0.1% fecal, 0.1% polyvinylpyrollidone and 0.1% bovine serum albumin), 0.1% SDS and 200 mg/mL denatured salmon sperm DNA) containing a labeled probe. The membrane is then subjected to two sequential medium stringency washes (i.e., 2 x SSC, 0.1% SDS for 15 min at 45° C, followed
by 2 x SSC, 0.1% SDS for 15 min at 50° C), followed by two seq uential higher stringency washes (i.e., 0.2 x SSC, 0.1% SDS for 12 min at 55° C followed by 0.2 x SSC and 0.1% SDS solution for 12 min at 65-68°C).
Polynucleotides and fusions thereof may be prepared, manipulated and/or expressed using any of a variety of well established techniques known and available in the art. For example, polynucleotide sequences which encode polypeptides of the invention, or fusion proteins or functional eq uivalents thereof, may be used in recombinant DNA molecules to direct expression of a triglyceride or lipid biosynthesis enzyme in appropriate host cells. Due to the inherent degeneracy of the genetic code, other DNA sequences that encode substantially the same or a functionally eq uivalent amino acid sequence may be produced and these sequences may be used to clone and express a given polypeptide.
As will be understood by those of skill in the art, it may be advantageous in some instances to produce polypeptide-encoding nucleotide sequences possessing non-naturally occurring codons. For example, codons preferred by a particular prokaryotic or eukaryotic host can be selected to increase the rate of protein expression or to produce a recombinant RNA transcript having desirable properties, such as a half-life which is longer than that of a transcript generated from the naturally occurring seq uence. Such nucleotides are typically referred to as "codon-optimized."
Moreover, the polynucleotide sequences described herein can be engineered using methods generally known in the art in order to alter polypeptide encoding sequences for a variety of reasons, including but not limited to, alterations which modify the cloning, processing, expression and/or activity of the gene product.
I n order to express a desired polypeptide, a nucleotide sequence encoding the polypeptide, or a functional eq uivalent, may be inserted into appropriate expression vector, i.e., a vector that contains the necessary elements for the transcription and translation of the inserted coding sequence. Methods which are well known to those skilled in the art may be used to construct expression vectors containing seq uences encoding a polypeptide of interest and appropriate transcriptional and translational control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination.
Such techniques are described in Sambrook et al., Molecular Cloning, A Laboratory Manual (1989), and Ausubel et al., Current Protocols in Molecular Biology (1989).
A variety of expression vector/host systems are known and may be utilized to contain and express polynucleotide sequences. In certain embodiments, the polynucleotides of the present invention may be introduced and expressed in Cyanobacterial systems. As such, the present invention contemplates the use of vector and plasmid systems having regulatory sequences (e.g., promoters and enhancers) that are suitable for use in various Cyanobacteria (see, e.g., Koksharova et al., Applied Microbiol Biotechnol 58:123-37, 2002). For example, the promiscuous RSF1010 plasmid provides autonomous replication in several Cyanobacteria of the genera Synechocystis and Synechococcus (see, e.g., Mermet-Bouvier et al., Curr Microbiol 26:323-327, 1993). As another example, the pFCl expression vector is based on the promiscuous plasmid RSF1010. pFCl harbors the lambda cl857 repressor-encoding gene and pR promoter, followed by the lambda cro ribosome-binding site and ATG translation initiation codon (see, e.g., Mermet-Bouvier et al., Curr Microbiol 28:145-148, 1994). The latter is located within the unique Ndel restriction site (CATATG) of pFCl and can be exposed after cleavage with this enzyme for in-frame fusion with the protein-coding sequence to be expressed.
The "control elements" or "regulatory sequences" present in an expression vector (or employed separately) are those non-translated regions of the vector, such as enhancers, promoters, 5' and 3' untranslated regions, which interact with host cellular proteins to carry out transcription and translation. Such elements may vary in their strength and specificity. Depending on the vector system and host utilized, any number of suitable transcription and translation elements, including constitutive and inducible promoters, may be used. Generally, it is well-known that strong E. coli promoters work well in Cyanobacteria. Also, when cloning in Cyanobacterial systems, inducible promoters such as the hybrid lacZ promoter of the PBLUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or PSPORT1 plasmid (Gibco BRL, Gaithersburg, Md.) and the like may be used. Other vectors containing IPTG inducible promoters, such as pAM1579 and pAM2991trc, may be utilized according to the present invention.
Certain embodiments may employ a temperature inducible system or temperature inducible regulatory sequences (e.g., promoters, enhancers, repressors). As one example, an operon with the bacterial phage left-ward promoter (PL) and a temperature sensitive repressor gene CI857 may be employed to produce a temperature inducible system for producing fatty acids and/or triglycerides in Cyanobacteria (see, e.g., U.S. Patent No. 6,306,639, herein incorporated by reference). It is believed that at a non-permissible temperature (low temperature, 30 degrees Celsius), the repressor binds to the operator seq uence, and thus prevents RNA polymerase from initiating transcription at the PL promoter. Therefore, the expression of encoded gene or genes is repressed. When the cell culture is transferred to a permissible temperature (37-42 degrees Celsius), the repressor cannot bind to the operator. Under these conditions, RNA polymerase can initiate the transcription of the encoded gene or genes.
I n Cyanobacterial systems, a number of expression vectors or regulatory sequences may be selected depending upon the use intended for the expressed polypeptide. When large quantities are needed, vectors or regulatory seq uences which direct high level expression of encoded proteins may be used. For example, overexpression of ACCase enzymes may be utilized to increase fatty acid biosynthesis. Such vectors include, but are not limited to, the multifunctional E. coli cloning and expression vectors such as BLUESCRI PT (Stratagene), in which the sequence encoding the polypeptide of interest may be ligated into the vector in frame with sequences for the amino-terminal Met and the subsequent 7 residues of β-galactosidase so that a hybrid protein is produced; pI N vectors (Van Heeke & Schuster, J. Biol. Chem. 264:5503 5509 (1989)); and the like. pGEX Vectors (Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion proteins with glutathione S-transferase (GST).
Certain embodiments may employ Cyanobacterial promoters or regulatory operons. I n certain embodiments, a promoter may comprise an rbcLS operon of Synechococcus, as described, for example, in Ronen-Tarazi et al. (Plant Physiology 18:1461-1469, 1995), or a cpc operon of Synechocystis sp. strain PCC 6714, as described, for example, in I mashimizu et al. (J Bacteriol. 185:6477-80, 2003). In certain embodiments, the tRNApro gene from Synechococcus may also be utilized as a promoter, as described in Chungjatupornchai et al. (Curr Microbiol.
38:210-216, 1999). Certain embodiments may employ the nirA promoter from Synechococcus sp. strain PCC7942, which is repressed by ammonium and induced by nitrite (see, e.g., Maeda et al., J. Bacteriol. 180:4080-4088, 1998; and Qi et al., Applied and Environmental Microbiology 71:5678-5684, 2005). The efficiency of expression may be enhanced by the inclusion of enhancers which are appropriate for the particular Cyanobacterial cell system which is used, such as those described in the literature.
I n certain embodiments, expression vectors or introduced promoters utilized to overexpress an exogenous or endogenous reference polypeptide, or fragment or variant thereof, comprise a weak promoter under non-inducible conditions, e.g., to avoid toxic effects of long-term overexpression of any of these polypeptides. One example of such a vector for use in Cyanobacteria is the pBAD vector system. Expression levels from any given promoter may be determined, e.g., by performing quantitative polymerase chain reaction (q PCR) to determine the amount of transcript or mRNA produced by a promoter, e.g., before and after induction. I n certain instances, a weak promoter is defined as a promoter that has a basal level of expression of a gene or transcript of interest, in the absence of inducer, that is < 2.0% of the expression level produced by the promoter of the rnpB gene in 5. elongatus PCC7942. In other embodiments, a weak promoter is defined as a promoter that has a basal level of expression of a gene or transcript of interest, in the absence of inducer, that is < 5.0% of the expression level produced by the promoter of the rnpB gene in 5. elongatus PCC7942.
It will be apparent that further to their use in vectors, any of the regulatory elements described herein (e.g., promoters, enhancers, repressors, ribosome binding sites, transcription termination sites) may be introduced directly into the genome of a photosynthetic microorganism (e.g., Cyanobacterium), typically in a region surrounding (e.g., upstream or downstream of) an endogenous or naturally-occurring reference gene/polynucleotide seq uence described herein, to regulate expression (e.g., facilitate overexpression) of that gene.
Specific initiation signals may also be used to achieve more efficient translation of sequences encoding a polypeptide of interest. Such signals include the ATG initiation codon and adjacent sequences. I n cases where sequences encoding the polypeptide, its initiation codon, and upstream sequences are inserted into the appropriate expression vector, no additional
transcriptional or translational control signals may be needed. However, in cases where only coding sequence, or a portion thereof, is inserted, exogenous translational control signals including the ATG initiation codon should be provided. Furthermore, the initiation codon should be in the correct reading frame to ensure translation of the entire insert. Exogenous translational elements and initiation codons may be of various origins, both natural and synthetic.
A variety of protocols for detecting and measuring the expression of polynucleotide-encoded products, using either polyclonal or monoclonal antibodies specific for the product are known in the art. Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence activated cell sorting (FACS). These and other assays are described, among other places, in Hampton et al., Serological Methods, a Laboratory Manual (1990) and Maddox et al., J. Exp. Med. 158:1211-1216 (1983). The presence or expression levels of a desired polynucleotide may also be confirmed by PCR.
A wide variety of labels and conjugation techniques are known by those skilled in the art and may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization or PCR probes for detecting sequences related to polynucleotides include oligolabeling, nick translation, end-labeling or PCR amplification using a labeled nucleotide. Alternatively, the sequences, or any portions thereof may be cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are commercially available, and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of commercially available kits. Suitable reporter molecules or labels, which may be used include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents as well as substrates, cofactors, inhibitors, magnetic particles, and the like.
Cyanobacterial host cells transformed with a polynucleotide sequence of interest may be cultured under conditions suitable for the expression and recovery of the protein from cell culture. The protein produced by a recombinant cell may be secreted or contained intracellularly depending on the sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors containing polynucleotides of the invention may be
designed to contain signal sequences which direct localization of the encoded polypeptide to a desired site within the cell. Other recombinant constructions may be used to join sequences encoding a polypeptide of interest to nucleotide sequence encoding a polypeptide domain which will direct secretion of the encoded protein.
Modified Photosynthetic Microorganisms
Certain embodiments relate to modified photosynthetic microorganisms, including Cyanobacteria, and methods of use thereof, wherein the modified photosynthetic microorganisms comprise one or more over-expressed, exogenous or introduced intracellular localization domain-DGAT fusion proteins, and a corresponding polynucleotide that encodes the same, where the DGAT fusion protein comprises a heterologous intracellular localization domain and a DGAT polypeptide, or variant or fragment thereof. In particular embodiments, the DGAT fusion protein is a membrane-targeting domain- or plasma membrane (PM)-targeting domain-DGAT fusion proteins. In some embodiments, the DGAT polypeptide variant or fragment retains at least 50% of one or more activities of the wild-type DGAT polypeptide.
In certain aspects, the DGAT-fusion protein-expressing photosynthetic microorganisms described herein can further comprise one or more introduced or overexpressed lipid biosynthesis proteins. Examples of lipid biosynthesis proteins include, without limitation, acyl carrier proteins (ACP), acyl ACP synthases (Aas), acyl-ACP reductases, alcohol dehydrogenases, aldehyde dehydrogenases, aldehyde decarbonylases, thioesterases (TES), acetyl coenzyme A carboxylases (ACCase), phosphatidic acid phosphatases (PAP; or phosphatidate phosphatases), triacylglycerol (TAG) hydrolases, fatty acyl-CoA synthetases, and lipases/phospholipases, including any combinations thereof.
Certain preferred combinations include modified photosynthetic microorganisms having an exogenous or overexpressed DGAT fusion protein described herein in combination with an exogenous or overexpressed ACP; a DGAT fusion protein in combination with an Aas; a DGAT fusion protein in combination with an ACP and an Aas; a DGAT fusion protein in combination with an ACP and a TES such as *TesA or a FatB; a DGAT fusion protein in combination with an
Aas and a TES such as *TesA or a FatB; and/or a DGAT fusion protein in combination with an ACP, an Aas, and a TES.
Also included are combinations that incorporate one or more TAG hydrolases into a TAG-producing strain. For example, certain embodiments include modified photosynthetic microorganisms having a DGAT fusion protein described herein, an exogenous or overexpressed ACP, Aas, or both, in combination with an exogenous or over-expressed TAG hydrolase, and optionally a TES. Certain embodiments, however, may employ a DGAT fusion protein and an over-expressed or exogenous TAG hydrolase, and optionally a TES, such as TesA (or *TesA) or any one or more of the FatB seq uences, with or without an ACP or Aas. Hence, these and related embodiments may be employed separately from those that require an ACP, an Aas, or both. For instance, certain embodiments may comprise a DGAT fusion protein and TAG hydrolase, and optionally a TES. Any one of these embodiments can be further combined with one or more additional lipid biosynthesis proteins, such as an ACCase, a PAP, a fatty acyl-CoA synthetase, and/or a PL such as PLC.
Certain combinations incorporate one or more fatty acyl-CoA synthetases (e.g., FadD) into a DGAT fusion protein-expressing photosynthetic microorganism. For instance, certain embodiments include modified photosynthetic microorganisms having an exogenous or overexpressed ACP, Aas, or both, in combination with a DGAT fusion protein and a fatty acyl-CoA synthetase, and optionally a TES and/or a TAG hydrolase. Certain embodiments, however, may employ a DGAT fusion protein and an over-expressed or exogenous a fatty acyl-CoA synthetase, and optionally a TES, such as TesA (or *TesA) or any one or more of the FatB sequences, with or without an ACP or Aas. Hence, these and related embodiments may be employed separately from those that require an ACP, Aas, or both. For instance, certain embodiments may comprise a DGAT fusion protein and a fatty acyl-CoA synthetase, and optionally a TES (e.g., TesA, FatB). Any one of these embodiments can be further combined with one or more additional lipid biosynthesis proteins, such as an ACCase, a PAP, a TAG hydrolase, and/or a PL such as PLC.
Any one of these embodiments can also be combined with one or more introduced or overexpressed polynucleotides encoding a protein involved in a glycogen breakdown pathway,
and/or with a strain having reduced expression of glycogen biosynthesis or storage pathways (e.g., full or partial deletion of glucose-l-phosphate adenyltransferase (glgC) gene and/or a phosphoglucomutase (pgm) gene). For instance, a specific modified photosynthetic microorganism could comprise a DGAT fusion protein described herein, an exogenous or overexpressed ACP, Aas, DGAT and PAP, combined with a full or partial deletion of the glgC gene and/or the pgm gene.
Photosynthetic microorganisms of the present invention can also be modified to increase the production of fatty acids by introducing one or more exogenous polynucleotide sequences that encode one or more enzymes associated with fatty acid synthesis. In certain aspects, the exogenous polynucleotide sequence encodes an enzyme that comprises an acyl-CoA carboxylase (ACCase) activity, typically a llowing increased ACCase expression, and, thus, increased intracellular ACCase activity. I ncreased intracellular ACCase activity contributes to the increased production of fatty acids because this enzyme catalyzes the "commitment step" of fatty acid synthesis. Similarly, in some aspects, modified photosynthetic microorganisms may comprise a DGAT fusion protein described herein in combination with an acyl-ACP reductase, for instance, to increase the production of fatty acids, a starting material for triglycerides, and thereby increase production of triglycerides.
Other combinations include, for example, a modified photosynthetic microorganism comprising a DGAT fusion protein described herein and one of the following: an exogenous or overexpressed ACP in combination with an exogenous or overexpressed ACCase; an Aas in combination with an ACCase; an ACP and an Aas in combination with an ACCase; an ACP in combination with a PAP; an Aas in combination with a PAP; an ACP and an Aas in combination with a PAP; an ACP in combination with a PL such as PLA, PLB, or PLC; an Aas in combination with a PL; and an ACP and an Aas in combination with a PL. Any one of these embodiments can be combined with each other (e.g., ACP, Aas, ACCase, and PAP), and/or further combined with an exogenous or overexpressed TES. Any one of these embodiments can also be combined with one or more introduced polynucleotides encoding a protein involved in a glycogen breakdown pathway, and/or with a strain having reduced expression of glycogen biosynthesis or storage
pathways (e.g., full or partial deletion of glucose-l-phosphate adenyltransferase (glgC) gene and/or a phosphoglucomutase (pgm) gene).
Any one of the above embodiments can also be combined with a strain having reduced expression of an aldehyde decarbonylase. I n certain embodiments, such as Cyanobacteria including 5. elongatus PCC7942, orfl593 resides directly upstream of orfl594 (acyl-ACP reductase coding region) and encodes an aldehyde decarbonylase. According to one non-limiting theory, because the aldehyde decarbonylase encoded by orfl593 utilizes acyl aldehyde as a substrate for alkane production, reducing expression of this protein may further increase yields of free fatty acids by shunting acyl aldehydes (produced by acyl-ACP reductase) away from an alkane-producing pathway, and towards a fatty acid-producing and storage pathway. PCC7942_orfl593 orthologs can be found, for example, in Synechocystis sp. PCC6803 (encoded by orfsll0208), N. punctiforme PCC 73102, Thermosynechococcus elongatus BP-1, Synechococcus sp. Ja-3-3AB, P. marinus M IT9313, P. marinus NATL2A, and Synechococcus sp. RS 9117, the latter having at least two paralogs (RS 9117-1 and -2). Included are strains having mutations or full or partial deletions of one or more genes encoding these and other aldehyde decarbonylases, such as 5. elongatus PCC7942 having a full or partial deletion of orfl593, and Synechocystis sp. PCC6803 having a full or partial deletion of orfsll0208). For instance, a specific modified photosynthetic microorganism could comprise an overexpressed acyl-ACP reductase, combined with a full or partial deletion of the glgC gene and/or the pgm gene, optionally combined with an overexpressed ACP, ACCase, DGAT/acyl-CoA synthetase, or all of the foregoing, and optionally combined with a full or partial deletion of a gene encoding an aldehyde decarbonylase (e.g., PCC7942_orfl593, PCC6803_orfsll0208).
Any one of these embodiments can also be combined with a strain having reduced expression of an acyl-ACP synthetase (Aas). Without wishing to be bound by any one theory, an endogenous aldehyde dehydrogenase is acting on the acyl-aldehydes generated by orfl594 and converting them to free fatty acids. The normal role of such a dehydrogenase might involve removing or otherwise dealing with damaged lipids. I n this scenario, it is then likely that the Aas gene product recycles these free fatty acids by ligating them to ACP. Accordingly, reducing or eliminating expression of the Aas gene product might ultimately increase production of fatty
acids, by reducing or preventing their transfer to ACP. I ncluded are mutations and full or partial deletions of one or more Aas genes, such as the Aas gene of Synechococcus elongatus PCC 7942. As one example, a specific modified photosynthetic microorganism could comprise an overexpressed acyl-ACP reductase, combined with a full or partial deletion of the glgC gene and/or the pgm gene, optionally combined with an overexpressed ACP, ACCase, DGAT/acyl-CoA synthetase, or all of the foregoing, optionally combined with a full or partial deletion of a gene encoding an aldehyde decarbonylase (e.g., PCC7942_orfl593, PCC6803_orfsll0208), and optionally combined with a full or partial deletion of an Aas gene encoding an acyl-ACP synthetase.
Any one or more of these embodiments can also be combined with a strain having increased expression of an aldehyde dehydrogenase. One exemplary aldehyde dehydrogenase is encoded by orf0489 of Synechococcus elongatus PCC7942. Also included are homologs or paralogs thereof, functional equivalents thereof, and fragments or variants thereof. Functional equivalents can include aldehyde dehydrogenases with the ability to convert acyl aldehydes (e.g., nonyl-aldehyde) into fatty acids. In specific embodiments, the aldehyde dehydrogenase has the amino acid sequence of SEQ ID NO:103 (encoded by the polynucleotide sequence of SEQ I D NO:102), or an active fragment or variant of this seq uence
Some modified photosynthetic microorganisms may comprise a DGAT fusion protein described herein and an introduced or overexpressed acyl-ACP reductase, to increase production of triglycerides; optionally in further combination with an introduced or overexpressed alcohol dehydrogenase, for instance, to produce wax esters relative to other lipids. Certain of these and related embodiments may be combined with reduced expression and/or activity of at least one endogenous aldehyde decarbonylase, endogenous aldehyde dehydrogenase, or both.
For instance, particular modified photosynthetic microorganisms may comprise a DGAT fusion protein described herein in combination with an overexpressed or introduced acyl-ACP reductase and an overexpressed or introduced alcohol dehydrogenase, and in further combination with at least one mutation (e.g., point mutation, insertion, full or partial deletion) that reduces the expression and/or activity of an endogenous aldehyde decarbonylase. Certain
modified photosynthetic microorganisms may comprise a DGAT fusion protein in combination with an overexpressed or introduced acyl-ACP reductase and an overexpressed or introduced alcohol dehydrogenase, and in further combination with at least one mutation (e.g., point mutation, insertion, full or partial deletion) that reduces the expression and/or activity of an endogenous aldehyde dehydrogenase. Some embodiments may include modified photosynthetic microorganisms that comprises a DGAT fusion protein in combination with an overexpressed or introduced acyl-ACP reductase and an overexpressed or introduced alcohol dehydrogenase, in further combination with at least one mutation that reduces the expression and/or activity of an endogenous aldehyde dehydrogenase and at least one mutation that reduces the expression and/or activity of an endogenous aldehyde decarbonylase. In specific embodiments, for instance, where the modified photosynthetic microorganism is 5. elongatus, the aldehyde dehydrogenase is encoded by orf0489 and the aldehyde decarbonylase is encoded by orfl593 of S. elongatus.
Other combinations include, for example, a modified photosynthetic microorganism comprising a DGAT fusion protein described herein and reduced glycogen accumulation, in combination with one more of an overexpressed ACP; an overexpressed acyl-ACP reductase in combination with an overexpressed ACP; an acyl-ACP reductase in combination with an ACCase; an acyl-ACP reductase in combination with an ACP and an ACCase; an overexpressed acyl-ACP reductase in combination with an overexpressed acyl-CoA synthetase (e.g., a membrane-targeting domain-DGAT fusion/acyl-CoA synthetase combination); an overexpressed acyl-ACP reductase with an overexpressed ACCase optionally in combination with an overexpressed acyl-CoA synthetase; and an overexpressed acyl-ACP reductase with an overexpressed ACP and ACCase, optionally in combination with an overexpressed acyl-CoA synthetase. Acyl-ACP reductase and DGAT-overexpressing strains, optionally in combination with an overexpressed acyl-CoA synthetase, typically produce increased triglycerides relative to DGAT-only overexpressing strains. Any one of these embodiments can be combined with one or more introduced polynucleotides encoding a protein involved in a glycogen breakdown pathway, and/or with a strain having reduced expression of glycogen biosynthesis or storage pathways (e.g., full or partial deletion of glucose-l-phosphate adenyltransferase (glgC) gene
and/or a phosphoglucomutase (pgm) gene). The present invention contemplates the use of any type of polynucleotide encoding a protein or enzyme associated with glycogen breakdown, removal, and/or elimination, as long as the modified photosynthetic microorganism accumulates a reduced amount of glycogen as compared to the wild type photosynthetic microorganism (e.g., under stress conditions).
Increased expression or overexpression can be achieved a variety of ways. For example, by introducing a polynucleotide into the microorganism, modifying an endogenous gene to overexpress the polypeptide (e.g., by introducing an exogenous regulatory element such as a promoter), or both. For instance, one or more copies of an otherwise endogenous polynucleotide sequence can be introduced by recombinant techniques to increase expression, that is, to create additional copies of the otherwise endogenous polynucleotide sequence. Decreased expression and/or activity can also be achieved a variety of ways, described elsewhere herein and known in the art, including by mutation of coding and/or regulatory sequences of a gene of interest, and/or by RNA inhibition.
Modified photosynthetic microorganisms of the present invention may be produced using any type of photosynthetic microorganism. These include, but are not limited to photosynthetic bacteria, green algae, and cyanobacteria. The photosynthetic microorganism can be, for example, a naturally photosynthetic microorganism, such as a Cyanobacterium, or an engineered photosynthetic microorganism, such as an artificially photosynthetic bacterium. Exemplary microorganisms that are either naturally photosynthetic or can be engineered to be photosynthetic include, but are not limited to, bacteria; fungi; archaea; protists; eukaryotes, such as a green algae; and animals such as plankton, planarian, and amoeba. Examples of naturally occurring photosynthetic microorganisms include, but are not limited to, Spirulina maximum, Spirulina platensis, Dunaliella salina, Botrycoccus braunii, Chlorella vulgaris, Chlorella pyrenoidosa, Serenastrum capricomutum, Scenedesmus auadricauda, Porphyridium cruentum, Scenedesmus acutus, Dunaliella sp., Scenedesmus obliquus, Anabaenopsis, Aulosira, Cylindrospermum, Synechococcus sp., Synechocystis sp., and/or Tolypothrix.
A modified Cyanobacteria of the present invention may be from any genera or species of Cyanobacteria that is genetically manipulable, i.e., permissible to the introduction and
expression of exogenous genetic material. Examples of Cyanobacteria that can be engineered according to the methods of the present invention include, but are not limited to, the genus Synechocystis, Synechococcus, Thermosynechococcus, Nostoc, Prochlorococcu, Microcystis, Anabaena, Spirulina, and Gloeobacter.
Cyanobacteria, also known as blue-green algae, blue-green bacteria, or Cyanophyta, is a phylum of bacteria that obtain their energy through photosynthesis. Cyanobacteria can produce metabolites, such as carbohydrates, proteins, lipids and nucleic acids, from C02, water, inorganic salts and light. Any Cyanobacteria may be used according to the present invention.
Cyanobacteria include both unicellular and colonial species. Colonies may form filaments, sheets or even hollow balls. Some filamentous colonies show the ability to differentiate into several different cell types, such as vegetative cells, the normal, photosynthetic cells that are formed under favorable growing conditions; akinetes, the climate-resistant spores that may form when environmental conditions become harsh; and thick-walled heterocysts, which contain the enzyme nitrogenase, vital for nitrogen fixation.
Heterocysts may also form under the appropriate environmental conditions (e.g., anoxic) whenever nitrogen is necessary. Heterocyst-forming species are specialized for nitrogen fixation and are able to fix nitrogen gas, which cannot be used by plants, into ammonia (NH3), nitrites (N02 ~), or nitrates (N03 ~), which can be absorbed by plants and converted to protein and nucleic acids.
Many Cyanobacteria also form motile filaments, called hormogonia, which travel away from the main biomass to bud and form new colonies elsewhere. The cells in a hormogonium are often thinner than in the vegetative state, and the cells on either end of the motile chain may be tapered. In order to break away from the parent colony, a hormogonium often must tear apart a weaker cell in a filament, called a necridium.
Each individual Cyanobacterial cell typically has a thick, gelatinous cell wall.
Cyanobacteria differ from other gram-negative bacteria in that the quorum sensing molecules autoinducer-2 and acyl-homoserine lactones are absent. They lack flagella, but hormogonia and some unicellular species may move about by gliding along surfaces. In water columns, some Cyanobacteria float by forming gas vesicles, like in archaea.
Cyanobacteria have an elaborate and highly organized system of internal membranes that function in photosynthesis. Photosynthesis in Cyanobacteria generally uses water as an electron donor and produces oxygen as a by-product, though some Cyanobacteria may also use hydrogen sulfide, similar to other photosynthetic bacteria. Carbon dioxide is reduced to form carbohydrates via the Calvin cycle. I n most forms, the photosynthetic machinery is embedded into folds of the cell membrane, called thylakoids. Due to their ability to fix nitrogen in aerobic conditions, Cyanobacteria are often found as symbionts with a number of other groups of organisms such as fungi (e.g., lichens), corals, pteridophytes (e.g., Azolla), and angiosperms (e.g., Gunnera), among others.
Cyanobacteria are the only group of organisms that are able to reduce nitrogen and carbon in aerobic conditions. The water-oxidizing photosynthesis is accomplished by coupling the activity of photosystem (PS) II and I (Z-scheme). I n anaerobic conditions, Cyanobacteria are also able to use only PS I (i.e., cyclic photophosphorylation) with electron donors other than water (e.g., hydrogen sulfide, thiosulphate, or molecular hydrogen), similar to purple photosynthetic bacteria. Furthermore, Cyanobacteria share an archaeal property; the ability to reduce elemental sulfur by anaerobic respiration in the dark. The Cyanobacterial photosynthetic electron transport system shares the same compartment as the components of respiratory electron transport. Typically, the plasma membrane contains only components of the respiratory chain, while the thylakoid membrane hosts both respiratory and photosynthetic electron transport.
Phycobilisomes, attached to the thylakoid membrane, act as light harvesting antennae for the photosystems of Cyanobacteria. The phycobilisome components (phycobiliproteins) are responsible for the blue-green pigmentation of most Cyanobacteria. Color variations are mainly due to carotenoids and phycoerythrins, which may provide the cells with a red-brownish coloration. I n some Cyanobacteria, the color of light influences the composition of phycobilisomes. In green light, the cells accumulate more phycoerythrin, whereas in red light they produce more phycocyanin. Thus, the bacteria appear green in red light and red in green light. This process is known as complementary chromatic adaptation and represents a way for the cells to maximize the use of available light for photosynthesis.
I n particular embodiments, the Cyanobacteria may be, e.g., a marine form of Cyanobacteria or a fresh water form of Cyanobacteria. Examples of marine forms of Cyanobacteria include, but are not limited to Synechococcus WH8102, Synechococcus RCC307, Synechococcus N KBG 15041c, and Trichodesmium. Examples of fresh water forms of Cyanobacteria include, but are not limited to, 5. elongatus PCC 7942, Synechocystis PCC 6803, Plectonema boryanum, and Anabaena sp. Exogenous genetic material encoding the desired enzymes or polypeptides may be introduced either transiently, such as in certain self-replicating vectors, or stably, such as by integration (e.g., recombination) into the Cyanobacterium's native genome.
In other embodiments, a genetically modified Cyanobacteria of the present invention may be capable of growing in brackish or salt water. When using a fresh water form of Cyanobacteria, the overall net cost for production of triglycerides will depend on both the nutrients req uired to grow the culture and the price for freshwater. One can foresee freshwater being a limited resource in the future, and in that case it would be more cost effective to find an alternative to freshwater. Two such alternatives include: (1) the use of waste water from treatment plants; and (2) the use of salt or brackish water.
Salt water in the oceans can range in salinity between 3.1% and 3.8%, the average being 3.5%, and this is mostly, but not entirely, made up of sodium chloride (NaCI) ions. Brackish water, on the other hand, has more salinity than freshwater, but not as much as seawater. Brackish water contains between about 0.5% and 3% salinity, and thus includes a large range of salinity regimes and is therefore not precisely defined. Waste water is any water that has undergone human influence. It consists of liquid waste released from domestic and commercial properties, industry, and/or agriculture and can encompass a wide range of possible contaminants at varying concentrations.
There is a broad distribution of Cyanobacteria in the oceans, with Synechococcus filling just one niche. Specifically, Synechococcus sp. PCC 7002 (formerly known as Agmenellum quadruplicatum strain PR-6) grows in brackish water, is unicellular and has an optimal growing temperature of 38°C. While this strain is well suited to grow in conditions of high salt, it will grow slowly in freshwater. I n particular embodiments, the present invention contemplates the
use of a Cyanobacteria 5. elongatus PCC 7942, altered in a way that allows for growth in either waste water or salt/brackish water. A 5. elongatus PCC 7942 mutant resistant to sodium chloride stress has been described (Bagchi, S.N. et al., Photosynth Res. 2007, 92:87-101), and a genetically modified 5. elongatus PCC 7942 tolerant of growth in salt water has been described (Waditee, R. et al., PNAS. 2002, 99:4109-4114). According to the present invention, a salt water tolerant strain is capable of growing in water or media having a salinity in the range of 0.5% to 4.0% salinity, although it is not necessarily capable of growing in all salinities encompassed by this range. In one embodiment, a salt tolerant strain is capable of growth in water or media having a salinity in the range of 1.0% to 2.0% salinity. In another embodiment, a salt water tolerant strain is capable of growth in water or media having a salinity in the range of 2.0% to 3.0% salinity.
Examples of Cyanobacteria that may be utilized and/or genetically modified according to the methods described herein include, but are not limited to, Chroococcales Cyanobacteria from the genera Aphanocapsa, Aphanothece, Chamaesiphon, Chroococcus, Chroogloeocystis, Coelosphaerium, Crocosphaera, Cyanobacterium, Cyanobium, Cyanodictyon, Cyanosarcina, Cyanothece, Dactylococcopsis, Gloecapsa, Gloeothece, Merismopedia, Microcystis, Radiocystis, Rhabdoderma, Snowella, Synychococcus, Synechocystis, Thermosenechococcus, and Woronichinia; Nostacales Cyanobacteria from the genera Anabaena, Anabaenopsis, Aphanizomenon, Aulosira, Calothrix, Coleodesmium, Cyanospira, Cylindrospermosis, Cylindrospermum, Fremyella, Gleotrichia, Microchaete, Nodularia, Nostoc, Rexia, Richelia, Scytonema, Sprirestis, and Toypothrix; Oscillatoriales Cyanobacteria from the genera Arthrospira, Geitlerinema, Halomicronema, Halospirulina, Katagnymene, Leptolyngbya, Limnothrix, Lyngbya, Microcoleus, Oscillatoria, Phormidium, Planktothricoides, Planktothrix, Plectonema, Pseudoanabaena/Limnothrix, Schizothrix, Spirulina, Symploca, Trichodesmium, Tychonema; Pleurocapsales cyanobacterium from the genera Chroococcidiopsis, Dermocarpa, Dermocarpella, Myxosarcina, Pleurocapsa, Stanieria, Xenococcus; Prochlorophytes Cyanobacterium from the genera Prochloron, Prochlorococcus, Prochlorothrix; and Stigonematales cyanobacterium from the genera Capsosira, Chlorogeoepsis, Fischerella, Hapalosiphon, Mastigocladopsis, Nostochopsis, Stigonema, Symphyonema, Symphonemopsis,
Umezakia, and Westiellopsis. In certain embodiments, the Cyanobacterium is from the genus Synechococcus, including, but not limited to Synechococcus bigranulatus, Synechococcus elongatus, Synechococcus leopoliensis, Synechococcus lividus, Synechococcus nidulans, and Synechococcus rubescens.
In certain embodiments, the Cyanobacterium is Anabaena sp. strain PCC 7120,
Synechocystis sp. strain PCC 6803, Nostoc muscorum, Nostoc ellipsosporum, or Nostoc sp. strain PCC 7120. In certain preferred embodiments, the Cyanobacterium is S. elongatus sp. strain PCC 7942.
Additional examples of Cyanobacteria that may be utilized in the methods provided herein include, but are not limited to, Synechococcus sp. strains WH7803, WH8102, WH8103 (typically genetically modified by conjugation), Baeocyte-forming Chroococcidiopsis spp. (typically modified by conjugation/electroporation), non-heterocyst-forming filamentous strains Planktothrix sp., Plectonema boryanum M101 (typically modified by electroporation), and Heterocyst-forming strains Anabaena sp. strains ATCC 29413 (typically modified by conjugation), Tolypothrix sp. strain PCC 7601 (typically modified by conjugation/electroporation) and Nostoc punctiforme strain ATCC 29133 (typically modified by conjugation/electroporation).
In certain preferred embodiments, the Cyanobacterium may be 5. elongatus sp. strain PCC 7942 or Synechococcus sp. PCC 7002 (originally known as Agmenellum quadruplicatum).
In particular embodiments, the genetically modified, photosynthetic microorganism, e.g., Cyanobacteria, of the present invention may be used to produce triglycerides and/or other carbon-based products from just sunlight, water, air, and minimal nutrients, using routine culture techniques of any reasonably desired scale. In specific embodiments, the present invention contemplates using spontaneous mutants of photosynthetic microorganisms that demonstrate a growth advantage under a defined growth condition. Among other benefits, the ability to produce large amounts of triglycerides from minimal energy and nutrient input makes the modified photosynthetic microorganism, e.g., Cyanobacteria, of the present invention a readily manageable and efficient source of feedstock in the subsequent production of both biofuels, such as biodiesel, as well as specialty chemicals, such as glycerin.
Methods of Producing Modified Photosynthetic Microorganisms
Embodiments of the present invention also include methods of producing the modified photosynthetic microorganisms (e.g., Cyanobacterium) described herein.
I n certain embodiments, the present invention comprises methods of modifying a photosynthetic microorganism to produce a modified photosynthetic microorganism that produces an increased amount of lipids, e.g., triglycerides, relative to a corresponding wild type photosynthetic microorganism or a differently modified photosynthetic microorganism (e.g., one that expresses DGAT but not a form that selectively localizes to an intracellular region such as a membrane, including the plasma membrane), comprising introducing into said microorganism one or more polynucleotides encoding a intracellular localization domain-DGAT fusion protein described herein, including active fragments or variants thereof.
Also included are methods of modifying a photosynthetic microorganism to produce a modified photosynthetic microorganism that has improved cell growth characteristics, relative to a corresponding, DGAT-expressing modified photosynthetic microorganism where the DGAT does not have a heterologous intracellular localization domain (e.g., a wild-type DGAT), comprising introducing into said microorganism one or more polynucleotides encoding a intracellular localization domain-DGAT fusion protein described herein, including active fragments or variants thereof.
The methods may further comprise a step of selecting for photosynthetic microorganisms in which the one or more desired polynucleotides were successfully introduced, where the polynucleotides were, e.g., present in a vector that expressed a selectable marker, such as an antibiotic resistance gene. As one example, selection and isolation may include the use of antibiotic resistant markers known in the art (e.g., kanamycin, spectinomycin, and streptomycin).
In certain aspects, such photosynthetic microorganisms can be further modified by increasing the expression of one or more lipid biosynthesis proteins, for instance, by introducing an exogenous copy of a polynucleotide that encodes a lipid biosynthesis protein, by increasing expression of an endogenous lipid biosynthesis protein, or both. In some aspects,
such photosynthetic microorganisms can be further modified by increasing the expression of one or more proteins associated with glycogen breakdown, for instance, by introducing an exogenous copy of a polynucleotide that encodes a glycogen breakdown protein, by increasing expression of an endogenous glycogen breakdown protein, or both.
Thus, in certain embodiments, the present invention includes methods of producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, and (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous lipid biosynthesis protein coding sequence, and/or introducing one or more polynucleotides encoding a lipid biosynthesis protein, or a fragment or variant thereof. Exemplary lipid biosynthesis proteins include any one or more of acyl carrier proteins (ACP), acyl ACP synthetases (Aas), acyl-ACP reductases, alcohol dehydrogenases, aldehyde dehydrogenases, aldehyde decarbonylases, thioesterases (TES), acetyl coenzyme A carboxylases (ACCase), phosphatidic acid phosphatases (PAP; or phosphatidate phosphatases), triacylglycerol (TAG) hydrolases, fatty acyl-CoA synthetases, and lipases/phospholipases, including any combination thereof.
Certain embodiments include methods of producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, and (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous glycogen breakdown protein coding sequence, and/or introducing one or more polynucleotides encoding a glycogen breakdown protein, or a fragment or variant thereof. Exemplary glycogen breakdown proteins include any one or more of glycogen phosphorylase (GlgP), glycogen isoamylase (GlgX), glucanotransferase (MalQ), phosphoglucomutase (Pgm), glucokinase (Glk), and/or phosphoglucose isomerase (Pgi), including any combination thereof.
Particular embodiments include methods of producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous lipid biosynthesis protein coding sequence, and/or introducing one or more polynucleotides encoding a lipid biosynthesis protein, or a fragment or variant thereof, and (3) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous glycogen breakdown protein coding seq uence, and/or introducing one or more polynucleotides encoding a glycogen breakdown protein, or a fragment or variant thereof.
In particular embodiments, the lipid biosynthesis protein is an acyl carrier protein (ACP), an acyl-ACP synthetase (Aas), or both. For instance, certain embodiments include methods for producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous ACP coding seq uence, and/or introducing one or more polynucleotides encoding an ACP, or a fragment or variant thereof. These and related methods can further comprise (3) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous TES, ACCase, TAG hydrolase, fatty acyl CoA synthetase, PAP, and/or phospholipase coding seq uence, and/or introducing one or more polynucleotides encoding TES, ACCase, TAG hydrolase, fatty acyl CoA synthetase, PAP, and/or phospholipase, or a fragment or variant thereof.
Some embodiments include methods for producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing into said photosynthetic microorganism one or
more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous Aas coding sequence, and/or introducing one or more polynucleotides encoding an Aas polypeptide, or a fragment or variant thereof. These and related methods can further comprise (3) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous TES, ACCase, TAG hydrolase, fatty acyl CoA synthetase, PAP, and/or phospholipase coding sequence, and/or introducing one or more polynucleotides encoding TES, ACCase, TAG hydrolase, fatty acyl CoA synthetase, PAP, and/or phospholipase, or a fragment or variant thereof.
Specific embodiments include methods of producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous ACP coding seq uence, and/or introducing one or more polynucleotides encoding an ACP, or a fragment or variant thereof, and (3) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous Aas coding sequence, and/or introducing one or more polynucleotides encoding an Aas polypeptide, or a fragment or variant thereof. These and related methods can further comprise (4) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous TES, ACCase, TAG hydrolase, fatty acyl CoA synthetase, PAP, and/or phospholipase coding sequence, and/or introducing one or more polynucleotides encoding TES, ACCase, TAG hydrolase, fatty acyl CoA synthetase, PAP, and/or phospholipase, or a fragment or variant thereof.
In some embodiments, the lipid biosynthesis protein is an acyl-ACP reductase, optionally in combination with an overexpressed alcohol dehydrogenase, for instance, to increase production of triglycerides and/or produce wax esters. Certain embodiments thus include methods of producing a modified photosynthetic microorganism, e.g., a Cyanobacteria,
comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, and (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous acyl-ACP reductase coding sequence, and/or introducing one or more polynucleotides encoding an acyl-ACP reductase, or a fragment or variant thereof.
For wax ester production, also included are methods of producing a modified photosynthetic microorganism, e.g., a Cyanobacteria, comprising: (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous acyl-ACP reductase coding sequence, and/or introducing one or more polynucleotides encoding an acyl-ACP reductase, or a fragment or variant thereof, and (3) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous alcohol dehydrogenase coding sequence, and/or introducing one or more polynucleotides encoding an alcohol dehydrogenase, or a fragment or variant thereof.
Any of the photosynthetic microorganisms described herein can be further modified by reducing expression and/or activity of one or more endogenous genes/proteins associated with glycogen synthesis and/or storage, one or more endogenous aldehyde dehydrogenases, one or more endogenous aldehyde decarbonylases, and/or one or more endogenous Aas polypeptides. Exemplary genes/proteins associated with glycogen synthesis and/or storage include glgA, glgC, and pgm.
In particular embodiments, expression or activity is reduced by knocking out or knocking down one or more alleles of said one or more genes. I n particular embodiments, expression or activity of the one or more genes is reduced by contacting the photosynthetic microorganism with an antisense oligonucleotide or interfering RNA, e.g., an siRNA, that targets said one or more genes. I n certain embodiments, a vector that expresses a polynucleotide that hybridizes to said one or more genes, e.g., an antisense oligonucleotide or an siRNA is introduced into said
photosynthetic microorganism. Also included is the generation of mutants, such as point mutants, insertions, or full or partial deletions of a gene of interest and/or one or more of its regulatory elements (e.g., promoters, enhancers), to reduce expression and/or activity of a protein of interest. Natural selection or directed selection can also be used to identify naturally-occurring mutants having reduced expression and/or activity of a protein of interest.
For instance, particular embodiments include methods for producing a modified photosynthetic microorganism having reduced expression and/or activity of an aldehyde dehydrogenase, an aldehyde decarbonylase, or both. These and related embodiments may comprise (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, and (2) introducing one or more mutations into an endogenous gene encoding an aldehyde dehydrogenase, such as a point mutation, insertion, or full or partial deletion, which reduces expression and/or activity of the aldehyde dehydrogenase, e.g., renders the aldehyde dehydrogenase "non-functional," as described herein. Also included are methods for producing a modified photosynthetic microorganism, comprising (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, and (2) introducing one or more mutations into an endogenous gene encoding an aldehyde decarbonylase, such as a point mutation, insertion, or full or partial deletion, which reduces expression and/or activity of the aldehyde decarbonylase.
Some embodiments include methods for producing a modified photosynthetic microorganism, comprising (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing one or more mutations into an endogenous gene encoding an aldehyde dehydrogenase, such as a point mutation, insertion, or full or partial deletion, which reduces expression and/or activity of the aldehyde dehydrogenase, and (3) introducing one or more mutations into an endogenous gene encoding an aldehyde decarbonylase, such as a point mutation, insertion, or full or partial deletion, which reduces expression and/or activity of the aldehyde decarbonylase.
Particular methods include producing a modified photosynthetic microorganism having increased expression of an acyl-ACP reductase and an alcohol dehydrogenase, in combination with reduced expression and/or activity of an aldehyde dehydrogenase, reduced expression and/or activity of an aldehyde decarbonylase, or both. These and related embodiments can be useful in the production of wax esters, as described herein. Some embodiments thus include methods for producing a modified photosynthetic microorganism, comprising (1) introducing into said photosynthetic microorganism one or more polynucleotides encoding one or more intracellular localization domain-DGAT fusion proteins, (2) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous acyl-ACP reductase coding sequence, and/or introducing one or more polynucleotides encoding an acyl-ACP reductase, or a fragment or variant thereof, (3) introducing into said photosynthetic microorganism one or more operatively linked promoters (e.g., inducible or regulable promoters) into a region upstream of an endogenous alcohol dehydrogenase coding sequence, and/or introducing one or more polynucleotides encoding an alcohol dehydrogenase, or a fragment or variant thereof, and either or both of (4) introducing one or more mutations into an endogenous gene encoding an aldehyde dehydrogenase, such as a point mutation, insertion, or full or partial deletion, which reduces expression and/or activity of the aldehyde dehydrogenase, and (5) introducing one or more mutations into an endogenous gene encoding an aldehyde decarbonylase, such as a point mutation, insertion, or full or partial deletion, which reduces expression and/or activity of the aldehyde decarbonylase. In specific embodiments, for instance, where the photosynthetic microorganism is 5. elongatus, the aldehyde dehydrogenase is encoded by orf0489, and the aldehyde decarbonylase is encoded by orfl593.
Photosynthetic microorganisms, e.g., Cyanobacteria, may be genetically modified according to techniques known in the art, e.g., delete a portion or all of a gene or to introduce a polynucleotide that expresses a functional polypeptide. As noted above, in certain aspects, genetic manipulation in photosynthetic microorganisms, e.g., Cyanobacteria, can be performed by the introduction of non-replicating vectors which contain native photosynthetic microorganism sequences, exogenous genes of interest, and selectable markers or drug
resistance genes. U pon introduction into the photosynthetic microorganism, the vectors may be integrated into the photosynthetic microorganism's genome through homologous recombination. I n this way, an exogenous gene of interest and the drug resistance gene are stably integrated into the photosynthetic microorganism's genome. Such recombinants cells can then be isolated from non-recombinant cells by drug selection. Cell transformation methods and selectable markers for Cyanobacteria are also well known in the art (see, e.g., Wirth, Mol Gen Genet 216:175-7, 1989; and Koksharova, Appl Microbiol Biotechnol 58:123-37, 2002; and The Cyanobacteria : Molecular Biology, Genetics, and Evolution (eds. Antonio Herrera and Enrique Flores) Caister Academic Press, 2008, each of which is incorporated by reference for their description on gene transfer into Cyanobacteria, and other information on Cyanobacteria).
In certain embodiments, an endogenous version of a protein (e.g., ACP, Aas, TES, ACCase, TAG hydrolase, fatty acyl-CoA synthetase, PAP, PL), if present, can be overexpressed by introducing a heterologous or other promoter upstream of the endogenous gene encoding that protein, i.e., the naturally-occurring version of that gene. Such promoters may be constitutive or inducible.
Generation of deletions or mutations of any of the one or more genes associated with the biosynthesis or storage of glycogen can be accomplished according to a variety of methods known in the art, including the use of a non-replicating, selectable vector system that is targeted to the upstream and downstream flanking regions of a given gene (e.g., glgC, pgm), and which recombines with the Cyanobacterial genome at those flanking regions to replace the endogenous coding sequence with the vector seq uence. Given the presence of a selectable marker in the vector seq uence, such as a drug selectable marker, Cyanobacterial cells containing the gene deletion can be readily isolated, identified and characterized. Such selectable vector-based recom bination methods need not be limited to targeting upstream and downstream flanking regions, but may also be targeted to internal sequences within a given gene, as long as that gene is rendered "non-functional," as described herein.
The generation of deletions or mutations can also be accomplished using antisense-based technology. For instance, Cyanobacteria have been shown to contain natural
regulatory events that rely on antisense regulation, such as a 177-nt ncRNA that is transcribed in antisense to the central portion of an iron-regulated transcript and blocks its accumulation through extensive base pairing (see, e.g., Duhring, et ai, Proc. Natl. Acad. Sci. USA 103:7054-7058, 2006), as well as a alrl690 mRNA that overlaps with, and is complementary to, the complete furA gene, which acts as an antisense RNA ( -furA RNA) interfering with furA transcript translation (see, e.g., Hernandez et ai, Journal of Molecular Biology 355:325-334, 2006). Thus, the incorporation of antisense molecules targeted to genes involved in glycogen biosynthesis or storage would be similarly expected to negatively regulate the expression of these genes, rendering them "non-functional," as described herein.
As used herein, antisense molecules encompass both single and double-stranded polynucleotides comprising a strand having a sequence that is complementary to a target coding strand of a gene or mRNA. Thus, antisense molecules include both single-stranded antisense oligonucleotides and double-stranded siRNA molecules.
Other modifications described herein may be produced using standard procedures and reagents, e.g., vectors, available in the art. Related methods are described in PCT Application No. WO 2010/075440, which is hereby incorporated by reference in its entirety.
Methods of Producing Lipids
The modified photosynthetic microorganisms and methods of the present invention may be used to produce lipids, such as fatty acids, triglycerides, and/or wax esters. Accordingly, the present invention provides methods of producing lipids, comprising culturing any of the modified photosynthetic microorganisms of the present invention (described elsewhere herein) In one embodiment, the modified photosynthetic microorganism is a Cyanobacterium that produces or accumulates increased lipids relative to an unmodified or wild-type Cyanobacterium of the same species, or a differently modified Cyanobacterium of the same species. In certain embodiments, the modified photosynthetic microorganism is a Cyanobacterium that produces or accumulates increased triglycerides relative to an unmodified or wild-type Cyanobacterium of the same species, or a differently modified Cyanobacterium of the same species. In specific instances, the differently modified Cyanobacterium expresses a
wild-type DGAT, and no other form(s) of DGAT. Other examples of differently modified Cyanobacteria are described herein. In certain aspects, increased triglyceride production is associated with improved cell growth characteristics relative to the differently modified Cyanobacterium, e.g., increased cell survival over time, and is thus measured over time, e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or more days post-culture or post-induction of DGAT expression, or in a continuous culture system.
I n one embodiment, the modified photosynthetic microorganism is a Cyanobacterium that produces or accumulates increased wax esters relative to an unmodified or wild-type Cyanobacterium of the same species, or a differently modified Cyanobacterium of the same species. In these and related embodiments, the Cyanobacterium overexpresses an acyl-ACP reductase and an alcohol dehydrogenase, in combination with an intracellular localization domain-DGAT fusion protein. I n some embodiments, the differently modified Cyanobacterium is one that expresses DGAT in combination with an acyl-ACP reductase and an alcohol dehydrogenase, and thus produces wax esters, but expresses a wild-type DGAT and no other form of DGAT. In some aspects, increased wax ester production is associated with improved cell growth characteristics relative to the differently modified Cyanobacterium, e.g., increased cell survival over time, and is thus measured over time, e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or more days post-culture or post-induction of DGAT expression, or in a continuous culture system.
I n certain embodiments, the one or more introduced polynucleotides are present in one or more expression constructs. I n particular embodiments, the one or more expression constructs comprises one or more inducible promoters. In certain embodiments, the one or more expression constructs are stably integrated into the genome of said modified photosynthetic microorganism.
I n certain embodiments, the introduced polynucleotide encoding an introduced protein is present in an expression construct comprising a weak promoter under non-induced conditions. In certain embodiments, one or more of the introduced polynucleotides are codon-optimized for expression in a Cyanobacterium, e.g., a Synechococcus elongatus.
In particular embodiments, the photosynthetic microorganism is a Synechococcus elongatus, such as Synechococcus elongatus strain PCC7942 or a salt tolerant variant of Synechococcus elongatus strain PCC7942. In particular embodiments, the photosynthetic microorganism is a Synechococcus sp. PCC 7002 or a Synechocystis sp. PCC6803.
Photosynthetic microorganisms may be cultured according to techniques known in the art. For example, Cyanobacteria may be cultured or cultivated according to techniques known in the art, such as those described in Acreman et al. (Journal of Industrial Microbiology and Biotechnology 13:193-194, 1994), in addition to photobioreactor based techniques, such as those described in Nedbal et al. (Biotechnol Bioeng. 100:902-10, 2008). One example of typical laboratory culture conditions for Cyanobacterium is growth in BG-11 medium (ATCC Medium 616) at 30° C in a vented culture flask with constant agitation and constant illumination at 30-100 μιηοΐθ photons m"2 sec"1.
A wide variety of mediums are available for culturing Cyanobacteria, including, for example, Aiba and Ogawa (AO) Medium, Allen and Arnon Medium plus Nitrate (ATCC Medium 1142), Antia's (ANT) Medium, Aquil Medium, Ashbey's Nitrogen-free Agar, ASN-III Medium, ASP 2 Medium, ASW Medium (Artificial Seawater and derivatives), ATCC Medium 617 (BG-11 for Marine Blue-Green Algae; Modified ATCC Medium 616 [BG-11 medium]), ATCC Medium 819 (Blue-green Nitrogen-fixing Medium; ATCC Medium 616 [BG-11 medium] without N03), ATCC Medium 854 (ATCC Medium 616 [BG-11 medium] with Vitamin Bi2), ATCC Medium 1047 (ATCC Medium 957 [MN marine medium] with Vitamin Bi2), ATCC Medium 1077 (Nitrogen-fixing marine medium; ATCC Medium 957 [MN marine medium] without N03), ATCC Medium 1234 (BG-11 Uracil medium; ATCC Medium 616 [BG-11 medium] with uracil), Beggiatoa Medium (ATCC Medium 138), Beggiatoa Medium 2 (ATCC Medium 1193), BG-11 Medium for Blue Green Algae (ATCC Medium 616), Blue-Green (BG) Medium, Bold's Basal (BB) Medium, Castenholtz D Medium, Castenholtz D Medium Modified (Halophilic cyanobacteria), Castenholtz DG Medium, Castenholtz DGN Medium, Castenholtz ND Medium, Chloroflexus Broth, Chloroflexus Medium (ATCC Medium 920), Chu's #10 Medium (ATCC Medium 341), Chu's #10 Medium Modified, Chu's #11 Medium Modified, DCM Medium, DYIV Medium, E27 Medium, E31 Medium and Derivatives, f/2 Medium, f/2 Medium Derivatives, Fraquil Medium (Freshwater Trace
Metal-Buffered Medium), Gorham's Medium for Algae (ATCC Medium 625), h/2 Medium, Jaworski's (JM) Medium, K Medium, LI Medium and Derivatives, M N Marine Medium (ATCC Medium 957), Plymouth Erdschreiber (PE) Medium, Prochlorococcus PC Medium, Proteose Peptone (PP) Medium, Prov Medium, Prov Medium Derivatives, S77 plus Vitamins Medium, S88 plus Vitamins Medium, Saltwater Nutrient Agar (SNA) Medium and Derivatives, SES Medium, SN Medium, Modified SN Medium, SNAX Medium, Soil/Water Biphasic (S/W) Medium and Derivatives, SOT Medium for Spirulina: ATCC Medium 1679, Spirulina (SP) Medium, van Rijn and Cohen (RC) Medium, Walsby's Medium, Yopp Medium, and Z8 Medium, among others.
I n particular embodiments, the modified photosynthetic microorganisms are cultured under conditions suitable for inducing expression of the introduced polynucleotide(s), e.g., wherein the introduced polynucleotide(s) comprise an inducible promoter. Conditions and reagents suitable for inducing inducible promoters are known and available in the art. Also included are the use of auto-inductive systems, for example, where a metabolite represses expression of the introduced polynucleotide, and the use of that metabolite by the microorganism over time decreases its concentration and thus its repressive activities, thereby allowing increased expression of the polynucleotide sequence.
I n certain embodiments, modified photosynthetic microorganisms, e.g., Cyanobacteria, are grown under conditions favorable for producing lipids, triglycerides and/or fatty acids. I n particular embodiments, light intensity is between 100 and 2000 uE/m2/s, or between 200 and 1000 uE/m2/s. In particular embodiments, the pH range of culture media is between 7.0 and 10.0. I n certain embodiments, C02 is injected into the culture apparatus to a level in the range of 1% to 10%. In particular embodiments, the range of C02 is between 2.5% and 5%. In certain embodiments, nutrient supplementation is performed during the linear phase of growth. Each of these conditions may be desirable for triglyceride production.
I n certain embodiments, the modified photosynthetic microorganisms are cultured, at least for some time, under static growth conditions as opposed to shaking conditions. For example, the modified photosynthetic microorganisms may be cultured under static conditions prior to inducing expression of an introduced polynucleotide (e.g., intracellular localization domain-DGAT fusion, acyl-ACP reductase, ACP, Aas, ACP/Aas, glycogen breakdown protein,
ACCase, DGAT, fatty acyl-CoA synthetase, aldehyde dehydrogenase, alcohol dehydrogenase) and/or the modified photosynthetic microorganism may be cultured under static conditions while expression of an introduced polynucleotide is being induced, or during a portion of the time period during which expression on an introduced polynucleotide is being induced. Static growth conditions may be defined, for example, as growth without shaking or growth wherein the cells are shaken at less than or equal to 30 rpm or less than or equal to 50 rpm.
I n certain embodiments, the modified photosynthetic microorganisms are cultured, at least for some time, in media supplemented with varying amounts of bicarbonate. For example, the modified photosynthetic microorganisms may be cultured with bicarbonate at 5, 10, 20, 50, 75, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000 mM bicarbonate prior to inducing expression of an introduced polynucleotide (e.g., membrane-targeting domain-DGAT fusion protein, acyl-ACP reductase, aldehyde dehydrogenase, ACP, Aas, ACP/Aas, glycogen breakdown protein, ACCase, DGAT, fatty acyl-CoA synthetase, alcohol dehydrogenase) and/or the modified photosynthetic microorganism may be cultured with aforementioned bicarbonate concentrations while expression of an introduced polynucleotide is being induced, or during a portion of the time period during which expression on an introduced polynucleotide is being induced.
In related embodiments, modified photosynthetic organisms and methods of the present invention may be used in the production of a biofuel and/or a specialty chemical, such as glycerin or a wax ester. Thus, in particular embodiments, a method of producing a biofuel comprises culturing any of the modified photosynthetic microorganisms of the present invention under conditions wherein the modified photosynthetic microorganism accumulates an increased amount of total cellular lipid, fatty acid, wax ester, and/or triglyceride, as compared to a corresponding wild-type photosynthetic microorganism, obtaining cellular lipid, fatty acid, wax ester, and/or triglyceride from said microorganism, and processing the obtained cellular lipid, fatty acid, wax ester, and/or triglyceride to produce a biofuel. In another embodiment, a method of producing a biofuel comprises processing lipids, fatty acids, wax esters, and/or triglycerides produced by a modified photosynthetic microorganism of the present invention to produce a biofuel. I n a further embodiment, a method of producing a
biofuel comprises obtaining lipid, fatty acid, wax esters, and/or triglyceride produced by a modified photosynthetic microorganism of the present invention, and processing the obtained cellular lipid, fatty acid, wax ester, and/or triglyceride to produce a biofuel. In particular embodiments, the modified photosynthetic organism is grown under conditions wherein it has reduced growth but maintains photosynthesis.
Methods of processing lipids from microorganisms to produce a biofuel or specialty chemical, e.g., biodiesel, are known and available in the art. For example, triglycerides may be transesterified to produce biodiesel. Transesterification may be carried out by any one of the methods known in the art, such as alkali-, acid-, or lipase-catalysis (see, e.g., Singh et al., Recent Pat Biotechnol. 2008, 2(2):130-143). Various methods of transesterification utilize, for example, use of a batch reactor, a supercritical alcohol, an ultrasonic reactor, or microwave irradiation (Such methods are described, e.g., in Jeong and Park, Appl Biochem Biotechnol. 2006, 131(l-3):668-679; Fukuda et al., Journal of Bioscience and Engineering. 2001, 92(5):405-416; Shah and Gupta, Chemistry Central Journal. 2008, 2(l):l-9; and Carrillo-Munoz et al., J Org Chem. 1996, 61(22):7746-7749). The biodiesel may be further processed or purified, e.g., by distillation, and/or a biodiesel stabilizer may be added to the biodiesel, as described in U.S. Patent Application Publication No. 2008/0282606.
Certain embodiments of the present invention now will be illustrated by the following Examples. The present invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.
Claims
1. A modified Cyanobacterium comprising an exogenous polynucleotide encoding a diacylglycerol acyltransferase (DGAT) and at least one exogenous polynucleotide encoding at least one lipid packaging protein, wherein the modified Cyanobacterium has an increased growth rate and/or produces or accumulates an increased amount of neutral lipids as compared to a wild-type Cyanobacterium expressing only an exogenous DGAT.
2. The modified Cyanobacterium according to claim 1, wherein said DGAT is fused to a heterologous intracellular localization domain.
3. The modified Cyanobacterium according to claim 1, wherein said DGAT is fused to at least one lipid packaging protein.
4. The modified Cyanobacterium according to any one of claims 1-3, wherein said DGAT uses acyl-ACP as a substrate.
5. The modified Cyanobacterium according to any one of claims 1-4, wherein said DGAT is a prokaryotic DGAT.
6. The modified Cyanobacterium according to claim 5, wherein said DGAT is an
Acinetobacter DGAT, a Streptomyces DGAT, or an Alcanivorax DGAT.
7. The modified Cyanobacterium according to any one of claims 1-6, wherein said packaging protein is pgl 1, pgl 2, pgl-like, Phal P, Tad A, Clol 1 or a combination thereof.
8. The modified Cyanobacterium according to any one of claims 1-7, wherein said exogenous polynucleotide encoding DGAT is present in one or more expression constructs.
9. The modified Cyanobacterium according to any one of claims 1-8, wherein said at least one exogenous polynucleotide encoding at least one lipid packaging protein are present in one or more expression constructs.
10. The modified Cyanobacterium according to claim 8 or claim 9, wherein said one or more expression constructs are stably integrated into the Cyanobacterium genome.
11. The modified Cyanobacterium according to claim 8 or claim 9, wherein said one or more expression constructs comprise a constitutive promoter or an inducible promoter.
12. The modified Cyanobacterium according to any one of claims 1-11, further comprises an exogenous polynucleotide encoding an acyl carrier protein (ACP).
13. A method for producing a neutral lipid, comprising culturing the Cyanobacterium according to any one of claims 1-12 to produce a neutral lipid.
14. The method according to claim 13, wherein said DGAT uses acyl-ACP as a substrate.
15. A method for producing a neutral lipid, comprising culturing a Cyanobacterium comprising an exogenous polynucleotide encoding a diacylglycerol acyltransferase (DGAT) and at least one exogenous polynucleotide encoding at least one lipid packaging protein to produce a neutral lipid, wherein the modified Cyanobacterium has an increased growth rate and/or produces or accumulates an increased amount of neutral lipids as compared to a wild-type Cyanobacterium expressing only an exogenous DGAT.
16. The method according to claim 15, wherein said DGAT uses acyl-ACP as a substrate.
17. The method according to claim 15, wherein said DGAT is fused to a heterologous intracellular localization domain.
18. The method according to claim 15, wherein said DGAT is fused to said at least one lipid packaging protein.
19. The method according to any one of claims 15-18, wherein said DGAT is a prokaryotic DGAT.
20. The method according to any one of claims 15-19, wherein said DGAT is an Acinetobacter DGAT, a Streptomyces DGAT, or an Alcanivorax DGAT.
21. The method according to any one of claims 15-20 wherein said packaging protein is pgl 1, pgl 2, pgl-like, Phal P, Tad A, Clol 1 or a combination thereof.
22. The method according to any one of claims 15-21, wherein said Cyanobacterium further comprises an exogenous polynucleotide encoding an acyl carrier protein (ACP).
23. A method for making a modified Cyanobacterium, comprising modifying a
Cyanobacterium by introducing into the Cyanobacterium an exogenous polynucleotide encoding a diacylglycerol acyltransferase (DGAT) and at least one exogenous polynucleotide encoding at least one lipid packaging protein, wherein said Cyanobacterium produces a neutral lipid, wherein the modified Cyanobacterium has an increased growth rate and/or produces or
accumulates an increased amount of neutral lipids as compared to a wild-type Cyanobacterium expressing only an exogenous DGAT.
24. The method according to claim 23, wherein said DGAT uses acyl-ACP as a substrate.
25. The method according to claim 23, wherein said DGAT is fused to a heterologous intracellular localization domain.
26. The method according to claim 23, wherein said DGAT is fused to said at least one lipid packaging protein.
27. The method according to any one of claims 23-26, wherein said DGAT is a prokaryotic DGAT.
28. The method according to any one of claims 23-27, wherein said DGAT is an Acinetobacter DGAT, a Streptomyces DGAT, or an Alcanivorax DGAT.
29. The method according to any one of claims 23-28, wherein said packaging protein is pgl 1, pgl 2, pgl-like, Phal P, Tad A, Clol 1 or a combination thereof.
30. The method according to any one of claims 23-29, wherein said Cyanobacterium further comprises an exogenous polynucleotide encoding an acyl carrier protein (ACP).
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201361780712P | 2013-03-13 | 2013-03-13 | |
US61/780,712 | 2013-03-13 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014164232A1 true WO2014164232A1 (en) | 2014-10-09 |
Family
ID=50434276
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2014/021423 WO2014164232A1 (en) | 2013-03-13 | 2014-03-06 | Cyanobacteria that produce lipid packaging proteins |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2014164232A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10131870B2 (en) | 2014-09-09 | 2018-11-20 | Lumen Bioscience, Inc. | Targeted mutagenesis in Spirulina |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4751180A (en) | 1985-03-28 | 1988-06-14 | Chiron Corporation | Expression using fused genes providing for protein product |
US4873192A (en) | 1987-02-17 | 1989-10-10 | The United States Of America As Represented By The Department Of Health And Human Services | Process for site specific mutagenesis without phenotypic selection |
US4935233A (en) | 1985-12-02 | 1990-06-19 | G. D. Searle And Company | Covalently linked polypeptide cell modulators |
US6306639B1 (en) | 1997-02-19 | 2001-10-23 | Enol Energy Inc. | Genetically modified cyanobacteria for the production of ethanol, the constructs and method thereof |
US20080282606A1 (en) | 2007-04-16 | 2008-11-20 | Plaza John P | System and process for producing biodiesel |
WO2010075440A1 (en) | 2008-12-23 | 2010-07-01 | Targeted Growth, Inc. | Modified photosynthetic microorganisms with reduced glycogen and their use in producing carbon-based products |
WO2010078584A1 (en) * | 2009-01-05 | 2010-07-08 | Arizona Board Of Regents | Cyanobacterium that produces neutral lipids |
WO2011127118A1 (en) * | 2010-04-06 | 2011-10-13 | Algenetix, Inc. | Methods of producing oil in non-plant organisms |
-
2014
- 2014-03-06 WO PCT/US2014/021423 patent/WO2014164232A1/en active Application Filing
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4751180A (en) | 1985-03-28 | 1988-06-14 | Chiron Corporation | Expression using fused genes providing for protein product |
US4935233A (en) | 1985-12-02 | 1990-06-19 | G. D. Searle And Company | Covalently linked polypeptide cell modulators |
US4873192A (en) | 1987-02-17 | 1989-10-10 | The United States Of America As Represented By The Department Of Health And Human Services | Process for site specific mutagenesis without phenotypic selection |
US6306639B1 (en) | 1997-02-19 | 2001-10-23 | Enol Energy Inc. | Genetically modified cyanobacteria for the production of ethanol, the constructs and method thereof |
US20080282606A1 (en) | 2007-04-16 | 2008-11-20 | Plaza John P | System and process for producing biodiesel |
WO2010075440A1 (en) | 2008-12-23 | 2010-07-01 | Targeted Growth, Inc. | Modified photosynthetic microorganisms with reduced glycogen and their use in producing carbon-based products |
WO2010078584A1 (en) * | 2009-01-05 | 2010-07-08 | Arizona Board Of Regents | Cyanobacterium that produces neutral lipids |
WO2011127118A1 (en) * | 2010-04-06 | 2011-10-13 | Algenetix, Inc. | Methods of producing oil in non-plant organisms |
Non-Patent Citations (81)
Title |
---|
"The Cyanobacteria: Molecular Biology, Genetics, and Evolution", 2008, CAISTER ACADEMIC PRESS |
ACREMAN ET AL., JOURNAL OF INDUSTRIAL MICROBIOLOGY AND BIOTECHNOLOGY, vol. 13, 1994, pages 193 - 194 |
ALEXANDER; ZHULIN, PNAS USA, vol. 104, 2007, pages 2885 - 2890 |
ALTSCHUL ET AL., J. MOL. BIOL, vol. 215, 1990, pages 403 - 10 |
ALTSCHUL ET AL., NUCLEIC ACIDS RES., vol. 25, 1997, pages 3389 - 3402 |
ALTSCHUL ET AL.: "Nucl. Acids Res.", vol. 25, 1997, pages: 3389 |
ALVAREZ ET AL., APPL MICROBIOL BIOTECHNOL., vol. 60, 2002, pages 367 - 76 |
ARKIN; YOURVAN, PNAS USA, vol. 89, 1992, pages 7811 - 7815 |
AUSUBEL ET AL., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, 1989 |
AUSUBEL ET AL.: "Current Protocols in Molecular Biology", 1994, JOHN WILEY & SONS INC |
BAGCHI, S.N. ET AL., PHOTOSYNTH RES., vol. 92, 2007, pages 87 - 101 |
BALLICORA ET AL., MICROBIOLOGY AND MOLECULAR BIOLOGY REVIEWS, vol. 6, 2003, pages 213 - 225 |
BELL ET AL., ANNU. REV. BIOCHEM., vol. 49, 1980, pages 459 - 487 |
BRIEGEL ET AL., PNAS USA, vol. 106, 2009, pages 17181 - 17186 |
BRINDLEY: "Biochemistry of Lipids, Lipoproteins and Membranes", ELSEVIER, pages: 171 - 203 |
BUSCHIAZZO ET AL., THE EMBO JOURNAL, vol. 23, 2004, pages 3196 - 3205 |
CARRILLO-MUNOZ ET AL., J ORG CHEM., vol. 61, no. 22, 1996, pages 7746 - 7749 |
CHRISTENSEN ET AL., MOL. MICROBIOL., vol. 31, 1999, pages 1561 - 1572 |
CHUNGJATUPORNCHAI ET AL., CURR MICROBIOL., vol. 38, 1999, pages 210 - 216 |
COLEMAN ET AL., ANNU. REV. NUTR., vol. 20, 2000, pages 77 - 103 |
CUNNINGHAM, JB, vol. 192, no. 6, 2010, pages 1700 - 1709 |
DAHLQVST, A. ET AL., PROC. NATL. ACAD. SCI. USA, vol. 97, 2000, pages 6487 - 6492 |
DANIEL ET AL., J. BACTERIOL., vol. 186, 2004, pages 5017 - 5030 |
DAUM ET AL., YEAST, vol. 16, 1998, pages 1471 - 1510 |
DAYHOFF ET AL.: "Atlas of Protein Sequence and Structure", 1978, NATL. BIOMED. RES. FOUND. |
DAYHOFF ET AL.: "Atlas of protein sequence and structure", vol. 5, 1978, NATIONAL BIOMEDICAL RESEARCH FOUNDATION, article "A model of evolutionary change in proteins. Matrices for determining distance relationships", pages: 345 - 358 |
DELGRAVE ET AL., PROTEIN ENGINEERING, vol. 6, 1993, pages 327 - 331 |
DEVERAUX ET AL., NUCLEIC ACIDS RESEARCH, vol. 12, 1984, pages 387 - 395 |
DUHRING ET AL., PROC. NATL. ACAD. SCI. USA, vol. 103, 2006, pages 7054 - 7058 |
E. MEYERS; W. MILLER, CABIOS, vol. 4, 1989, pages 11 - 17 |
FUKUDA ET AL., JOURNAL OF BIOSCIENCE AND ENGINEERING, vol. 92, no. 5, 2001, pages 405 - 416 |
GEORGE; HERINGA, J PROTEIN ENG., vol. 15, 2002, pages 871 - 879 |
GONNET ET AL., SCIENCE, vol. 256, 1992, pages 14430 - 1445 |
HAMPTON ET AL.: "Serological Methods, a Laboratory Manual", 1990 |
HAN ET AL., J BIOL CHEM., vol. 281, 2006, pages 9210 - 9218 |
HANISCH, MICROBIOLOGY, vol. 152, 2006, pages 3271 - 3280 |
HARWOOD, BIOCHEM. BIOPHYSICS. ACTA, vol. 1301, 1996, pages 7 - 56 |
HAZELBAUER ET AL., TRENDS BIOCHEM SCI., vol. 33, 2008, pages 9 - 19 |
HENIKOFF; HENIKOFF, PNAS USA, vol. 89, 1992, pages 10915 - 10919 |
HERNANDEZ ET AL., JOURNAL OF MOLECULAR BIOLOGY, vol. 355, 2006, pages 325 - 334 |
HOFVANDER ET AL., FEBS LETTERS, 2011, pages 3583 - 3543 |
IMASHIMIZU ET AL., J BACTERIOL., vol. 185, 2003, pages 6477 - 80 |
JEONG; PARK, APPL BIOCHEM BIOTECHNOL., vol. 131, no. 1-3, 2006, pages 668 - 679 |
JIANG ET AL., BIOCHEMISTRY, vol. 45, 2006, pages 10008 - 10019 |
JIN ET AL., EMBO J, vol. 24, 2005, pages 694 - 704 |
KACZMARZYK; FULDA, PLANT PHYSIOLOGY, vol. 152, 2010, pages 1598 - 1610 |
KALSCHEUER ET AL., J. BIOL. CHEM., vol. 287, 2003, pages 8075 - 8082 |
KOKSHAROVA ET AL., APPLIED MICROBIOL BIOTECHNOL, vol. 58, 2002, pages 123 - 37 |
KOKSHAROVA, APPL MICROBIOL BIOTECHNOL, vol. 58, 2002, pages 123 - 37 |
KUNKEL ET AL., METHODS IN ENZYMOL., vol. 154, 1987, pages 367 - 382 |
KUNKEL, PNAS USA., vol. 82, 1985, pages 488 - 492 |
LI ET AL., PLANT PHYSIOLOGY, vol. 148, 2008, pages 97 - 107 |
LU ET AL., JOURNAL OF BACTERIOLOGY, vol. 176, 1994, pages 5847 - 5851 |
MACEACHRAN, AEM, vol. 76, no. 21, 2010, pages 7217 - 7227 |
MADDOX ET AL., J. EXP. MED., vol. 158, 1983, pages 1211 - 1216 |
MAEDA ET AL., J. BACTERIOL., vol. 180, 1998, pages 4080 - 4088 |
MARATEA ET AL., GENE, vol. 40, 1985, pages 39 - 46 |
MERMET-BOUVIER ET AL., CURR MICROBIOL, vol. 26, 1993, pages 323 - 327 |
MERMET-BOUVIER ET AL., CURR MICROBIOL, vol. 28, 1994, pages 145 - 148 |
MURPHY ET AL., PNAS USA, vol. 83, 1986, pages 8258 - 8262 |
NEDBAL ET AL., BIOTECHNOL BIOENG., vol. 100, 2008, pages 902 - 10 |
NEEDLEMAN; WUNSCH, J MOL BIOL., vol. 48, 1970, pages 443 - 453 |
NEEDLEMAN; WUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 444 - 453 |
NISHIZUKA, SCIENCE, vol. 258, 1992, pages 607 - 614 |
QI ET AL., APPLIED AND ENVIRONMENTAL MICROBIOLOGY, vol. 71, 2005, pages 5678 - 5684 |
RONEN-TARAZI ET AL., PLANT PHYSIOLOGY, vol. 18, 1995, pages 1461 - 1469 |
SAMBROOK ET AL.: "Molecular Cloning, A Laboratory Manual", 1989 |
SHAH; GUPTA, CHEMISTRY CENTRAL JOURNAL., vol. 2, no. 1, 2008, pages 1 - 9 |
SHANKLIN, PROTEIN EXPRESSION AND PURIFICATION, vol. 18, 2000, pages 355 - 360 |
SINGH ET AL., RECENT PAT BIOTECHNOL., vol. 2, no. 2, 2008, pages 130 - 143 |
STUBBE ET AL., TRENDS BIOCHEM SCI., vol. 23, 1998, pages 438 - 43 |
VAN HEEKE; SCHUSTER, J. BIOL. CHEM., vol. 264, 1989, pages 5503 5509 |
WADITEE, R. ET AL., PNAS, vol. 99, 2002, pages 4109 - 4114 |
WANG, EC, vol. 8, no. 12, 2009, pages 1856 - 1868 |
WATSON, J. D. ET AL.: "Molecular Biology of the Gene", 1987 |
WILTERMANN ET AL., MOL. MICROBIOL., vol. 55, 2005, pages 750 - 763 |
WIRTH, MOL GEN GENET, vol. 216, 1989, pages 175 - 7 |
ZHANG ET AL., SCIENCE, vol. 299, 2003, pages 2064 - 2067 |
ZHONGMIN ET AL., JOURNAL OF LIPID RESEARCH, vol. 42, 2001, pages 1282 - 1291 |
ZHULIN, ADV MICROB PHYSIOL., vol. 45, 2001, pages 157 - 198 |
ZUBAY, G.: "Biochemistry", 1993, WM.C. BROWN PUBLISHERS |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10131870B2 (en) | 2014-09-09 | 2018-11-20 | Lumen Bioscience, Inc. | Targeted mutagenesis in Spirulina |
US10336982B2 (en) | 2014-09-09 | 2019-07-02 | Lumen Bioscience, Inc. | Targeted mutagenesis in spirulina |
US10415012B2 (en) | 2014-09-09 | 2019-09-17 | Lumen Bioscience, Inc. | Targeted mutagenesis in spirulina |
US10415013B2 (en) | 2014-09-09 | 2019-09-17 | Lumen Biosciences, Inc. | Targeted mutagenesis in Spirulina |
US12065638B2 (en) | 2014-09-09 | 2024-08-20 | Lumen Bioscience, Inc. | Targeted mutagenesis in spirulina |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8980613B2 (en) | Modified photosynthetic microorganisms for producing lipids | |
AU2011349463B2 (en) | Modified photosynthetic microorganisms for producing lipids | |
US8835137B2 (en) | Modified photosynthetic microorganisms with reduced glycogen and their use in producing carbon-based products | |
AU2009308236B2 (en) | Modified photosynthetic microorganisms for producing triglycerides | |
US11279912B2 (en) | Cyanobacteria having improved photosynthetic activity | |
US20180051057A1 (en) | Cyanobacteria having improved photosynthetic activity | |
AU2011349444B2 (en) | Modified photosynthetic microorganisms for producing lipids | |
EP2809790A2 (en) | Modified photosynthetic microorganisms for continuous production of carbon-containing compounds | |
US20150024442A1 (en) | Modified diacylglycerol acyltransferase proteins and methods of use thereof | |
WO2014164232A1 (en) | Cyanobacteria that produce lipid packaging proteins |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14715162 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 14715162 Country of ref document: EP Kind code of ref document: A1 |