WO2011127409A2 - Methods and compositions related to fatty alcohol biosynthetic enzymes - Google Patents
Methods and compositions related to fatty alcohol biosynthetic enzymes Download PDFInfo
- Publication number
- WO2011127409A2 WO2011127409A2 PCT/US2011/031794 US2011031794W WO2011127409A2 WO 2011127409 A2 WO2011127409 A2 WO 2011127409A2 US 2011031794 W US2011031794 W US 2011031794W WO 2011127409 A2 WO2011127409 A2 WO 2011127409A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- fatty
- seq
- polypeptide
- fatty alcohol
- fatty acid
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 158
- 150000002191 fatty alcohols Chemical class 0.000 title claims description 317
- 230000001851 biosynthetic effect Effects 0.000 title claims description 171
- 239000000203 mixture Substances 0.000 title abstract description 34
- 102000004190 Enzymes Human genes 0.000 title description 68
- 108090000790 Enzymes Proteins 0.000 title description 68
- 244000005700 microbiome Species 0.000 claims abstract description 67
- 210000004027 cell Anatomy 0.000 claims description 362
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 312
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 309
- 229920001184 polypeptide Polymers 0.000 claims description 308
- 239000000194 fatty acid Substances 0.000 claims description 213
- 235000014113 dietary fatty acids Nutrition 0.000 claims description 212
- 229930195729 fatty acid Natural products 0.000 claims description 212
- 150000002192 fatty aldehydes Chemical class 0.000 claims description 207
- 150000004665 fatty acids Chemical class 0.000 claims description 203
- 108090000623 proteins and genes Proteins 0.000 claims description 195
- 150000002430 hydrocarbons Chemical class 0.000 claims description 99
- 229930195733 hydrocarbon Natural products 0.000 claims description 98
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 88
- 102000040430 polynucleotide Human genes 0.000 claims description 79
- 108091033319 polynucleotide Proteins 0.000 claims description 79
- 239000002157 polynucleotide Substances 0.000 claims description 79
- 229910052799 carbon Inorganic materials 0.000 claims description 77
- 239000004215 Carbon black (E152) Substances 0.000 claims description 72
- 230000014509 gene expression Effects 0.000 claims description 70
- 230000000694 effects Effects 0.000 claims description 60
- 150000001336 alkenes Chemical class 0.000 claims description 54
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 48
- 150000001335 aliphatic alkanes Chemical class 0.000 claims description 47
- 125000003729 nucleotide group Chemical group 0.000 claims description 38
- 101710129019 Long-chain acyl-[acyl-carrier-protein] reductase Proteins 0.000 claims description 36
- 239000002773 nucleotide Substances 0.000 claims description 36
- 102000007698 Alcohol dehydrogenase Human genes 0.000 claims description 34
- 239000000446 fuel Substances 0.000 claims description 33
- 108010021809 Alcohol dehydrogenase Proteins 0.000 claims description 32
- 230000001965 increasing effect Effects 0.000 claims description 24
- 235000021122 unsaturated fatty acids Nutrition 0.000 claims description 22
- 150000004670 unsaturated fatty acids Chemical class 0.000 claims description 21
- 239000002551 biofuel Substances 0.000 claims description 20
- 102000005488 Thioesterase Human genes 0.000 claims description 19
- 238000012217 deletion Methods 0.000 claims description 19
- 108020002982 thioesterase Proteins 0.000 claims description 19
- 230000037430 deletion Effects 0.000 claims description 18
- 230000002238 attenuated effect Effects 0.000 claims description 17
- 238000012258 culturing Methods 0.000 claims description 13
- 108010053754 Aldehyde reductase Proteins 0.000 claims description 12
- 102100027265 Aldo-keto reductase family 1 member B1 Human genes 0.000 claims description 12
- FWWQKRXKHIRPJY-UHFFFAOYSA-N octadecanal Chemical compound CCCCCCCCCCCCCCCCCC=O FWWQKRXKHIRPJY-UHFFFAOYSA-N 0.000 claims description 12
- 229920006395 saturated elastomer Polymers 0.000 claims description 12
- 230000001580 bacterial effect Effects 0.000 claims description 11
- 238000006467 substitution reaction Methods 0.000 claims description 11
- 150000004671 saturated fatty acids Chemical class 0.000 claims description 9
- 241000233866 Fungi Species 0.000 claims description 8
- 239000003502 gasoline Substances 0.000 claims description 8
- 238000003780 insertion Methods 0.000 claims description 7
- ADOBXTDBFNCOBN-UHFFFAOYSA-N 1-heptadecene Chemical compound CCCCCCCCCCCCCCCC=C ADOBXTDBFNCOBN-UHFFFAOYSA-N 0.000 claims description 6
- PJLHTVIBELQURV-UHFFFAOYSA-N 1-pentadecene Chemical compound CCCCCCCCCCCCCC=C PJLHTVIBELQURV-UHFFFAOYSA-N 0.000 claims description 6
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 6
- NDJKXXJCMXVBJW-UHFFFAOYSA-N heptadecane Chemical compound CCCCCCCCCCCCCCCCC NDJKXXJCMXVBJW-UHFFFAOYSA-N 0.000 claims description 6
- 230000037431 insertion Effects 0.000 claims description 6
- LQERIDTXQFOHKA-UHFFFAOYSA-N nonadecane Chemical compound CCCCCCCCCCCCCCCCCCC LQERIDTXQFOHKA-UHFFFAOYSA-N 0.000 claims description 6
- YCOZIPAWZNQLMR-UHFFFAOYSA-N pentadecane Chemical compound CCCCCCCCCCCCCCC YCOZIPAWZNQLMR-UHFFFAOYSA-N 0.000 claims description 6
- UHUFTBALEZWWIH-UHFFFAOYSA-N tetradecanal Chemical compound CCCCCCCCCCCCCC=O UHUFTBALEZWWIH-UHFFFAOYSA-N 0.000 claims description 6
- 210000005253 yeast cell Anatomy 0.000 claims description 6
- 125000002887 hydroxy group Chemical group [H]O* 0.000 claims description 5
- QFPVVMKZTVQDTL-UHFFFAOYSA-N (Z)-9-hexadecenal Natural products CCCCCCC=CCCCCCCCC=O QFPVVMKZTVQDTL-UHFFFAOYSA-N 0.000 claims description 4
- 101710181816 Pyruvate-formate-lyase deactivase Proteins 0.000 claims description 4
- 238000007792 addition Methods 0.000 claims description 4
- 210000005254 filamentous fungi cell Anatomy 0.000 claims description 4
- 235000021354 omega 7 monounsaturated fatty acids Nutrition 0.000 claims description 4
- IIYFAKIEWZDVMP-UHFFFAOYSA-N tridecane Chemical compound CCCCCCCCCCCCC IIYFAKIEWZDVMP-UHFFFAOYSA-N 0.000 claims description 4
- GFBZXBXGGYYWMU-UHFFFAOYSA-N 2-methylhexadec-2-enal Chemical compound CCCCCCCCCCCCCC=C(C)C=O GFBZXBXGGYYWMU-UHFFFAOYSA-N 0.000 claims description 3
- SYINEDRCBKSZPF-UHFFFAOYSA-N 2-methylicosanal Chemical compound CCCCCCCCCCCCCCCCCCC(C)C=O SYINEDRCBKSZPF-UHFFFAOYSA-N 0.000 claims description 3
- LUDOTVMKXVQXLC-UHFFFAOYSA-N 2-methyloctadec-2-enal Chemical compound CCCCCCCCCCCCCCCC=C(C)C=O LUDOTVMKXVQXLC-UHFFFAOYSA-N 0.000 claims description 3
- TZXFTUHLVMYUGE-UHFFFAOYSA-N 2-methyloctadecanal Chemical compound CCCCCCCCCCCCCCCCC(C)C=O TZXFTUHLVMYUGE-UHFFFAOYSA-N 0.000 claims description 3
- YITMLDIGEJSENC-UHFFFAOYSA-N hexadec-2-ene Chemical compound CCCCCCCCCCCCCC=CC YITMLDIGEJSENC-UHFFFAOYSA-N 0.000 claims description 3
- NIOYUNMRJMEDGI-UHFFFAOYSA-N hexadecanal Chemical compound CCCCCCCCCCCCCCCC=O NIOYUNMRJMEDGI-UHFFFAOYSA-N 0.000 claims description 3
- DCAYPVUWAIABOU-UHFFFAOYSA-N hexadecane Chemical compound CCCCCCCCCCCCCCCC DCAYPVUWAIABOU-UHFFFAOYSA-N 0.000 claims description 3
- FWBUWJHWAKTPHI-UHFFFAOYSA-N icosanal Chemical compound CCCCCCCCCCCCCCCCCCCC=O FWBUWJHWAKTPHI-UHFFFAOYSA-N 0.000 claims description 3
- CBFCDTFDPHXCNY-UHFFFAOYSA-N icosane Chemical compound CCCCCCCCCCCCCCCCCCCC CBFCDTFDPHXCNY-UHFFFAOYSA-N 0.000 claims description 3
- KUQIWULJSBTNPX-UHFFFAOYSA-N octadec-2-ene Chemical compound CCCCCCCCCCCCCCCC=CC KUQIWULJSBTNPX-UHFFFAOYSA-N 0.000 claims description 3
- RZJRJXONCZWCBN-UHFFFAOYSA-N octadecane Chemical compound CCCCCCCCCCCCCCCCCC RZJRJXONCZWCBN-UHFFFAOYSA-N 0.000 claims description 3
- YSSVMXHKWSNHLH-UHFFFAOYSA-N octadecenal Natural products CCCCCCC=CCCCCCCCCCC=O YSSVMXHKWSNHLH-UHFFFAOYSA-N 0.000 claims description 3
- BGHCVCJVXZWKCC-UHFFFAOYSA-N tetradecane Chemical compound CCCCCCCCCCCCCC BGHCVCJVXZWKCC-UHFFFAOYSA-N 0.000 claims description 3
- KLJFYXOVGVXZKT-CCEZHUSRSA-N trans-hexadec-2-enal Chemical compound CCCCCCCCCCCCC\C=C\C=O KLJFYXOVGVXZKT-CCEZHUSRSA-N 0.000 claims description 3
- 102100037611 Lysophospholipase Human genes 0.000 claims 1
- 229940053200 antiepileptics fatty acid derivative Drugs 0.000 abstract description 20
- 241000588724 Escherichia coli Species 0.000 description 86
- 239000000758 substrate Substances 0.000 description 72
- 229940088598 enzyme Drugs 0.000 description 67
- 238000004519 manufacturing process Methods 0.000 description 65
- 239000000047 product Substances 0.000 description 57
- -1 gaseous Substances 0.000 description 56
- 235000001014 amino acid Nutrition 0.000 description 55
- 229940024606 amino acid Drugs 0.000 description 48
- 150000007523 nucleic acids Chemical group 0.000 description 48
- 150000001413 amino acids Chemical class 0.000 description 46
- 102000039446 nucleic acids Human genes 0.000 description 44
- 108020004707 nucleic acids Proteins 0.000 description 44
- 102000004169 proteins and genes Human genes 0.000 description 42
- 102000004316 Oxidoreductases Human genes 0.000 description 39
- 108090000854 Oxidoreductases Proteins 0.000 description 39
- 238000006243 chemical reaction Methods 0.000 description 37
- 235000018102 proteins Nutrition 0.000 description 37
- 108020004414 DNA Proteins 0.000 description 32
- 239000003208 petroleum Substances 0.000 description 32
- 238000000855 fermentation Methods 0.000 description 31
- 230000004151 fermentation Effects 0.000 description 31
- 102000053602 DNA Human genes 0.000 description 30
- 239000013598 vector Substances 0.000 description 30
- 150000001875 compounds Chemical class 0.000 description 29
- BAWFJGJZGIEFAR-NNYOXOHSSA-N NAD zwitterion Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-N 0.000 description 27
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 27
- 239000013604 expression vector Substances 0.000 description 26
- 239000012634 fragment Substances 0.000 description 26
- 108010019670 Chimeric Antigen Receptors Proteins 0.000 description 25
- 239000000463 material Substances 0.000 description 25
- 239000013612 plasmid Substances 0.000 description 25
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 24
- 239000000126 substance Substances 0.000 description 24
- 150000001299 aldehydes Chemical class 0.000 description 22
- HFJRKMMYBMWEAD-UHFFFAOYSA-N dodecanal Chemical compound CCCCCCCCCCCC=O HFJRKMMYBMWEAD-UHFFFAOYSA-N 0.000 description 22
- 150000002148 esters Chemical class 0.000 description 22
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 21
- 239000002609 medium Substances 0.000 description 21
- 241000196324 Embryophyta Species 0.000 description 20
- XJLXINKUBYWONI-DQQFMEOOSA-N [[(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-3-hydroxy-4-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2s,3r,4s,5s)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphate Chemical compound NC(=O)C1=CC=C[N+]([C@@H]2[C@H]([C@@H](O)[C@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-DQQFMEOOSA-N 0.000 description 20
- 230000027455 binding Effects 0.000 description 20
- 230000001419 dependent effect Effects 0.000 description 19
- 150000002194 fatty esters Chemical class 0.000 description 19
- 125000000539 amino acid group Chemical group 0.000 description 18
- 230000012010 growth Effects 0.000 description 18
- 101100267415 Bacillus subtilis (strain 168) yjgB gene Proteins 0.000 description 17
- 101100001273 Escherichia coli (strain K12) ahr gene Proteins 0.000 description 17
- 239000012074 organic phase Substances 0.000 description 17
- 101150087812 tesA gene Proteins 0.000 description 17
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 16
- 230000015572 biosynthetic process Effects 0.000 description 16
- 239000008103 glucose Substances 0.000 description 16
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 15
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 15
- 238000002703 mutagenesis Methods 0.000 description 15
- 231100000350 mutagenesis Toxicity 0.000 description 15
- 230000002829 reductive effect Effects 0.000 description 15
- 241000894006 Bacteria Species 0.000 description 14
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 14
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 14
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 14
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 14
- 239000005516 coenzyme A Substances 0.000 description 14
- 229940093530 coenzyme a Drugs 0.000 description 14
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 14
- LQZZUXJYWNFBMV-UHFFFAOYSA-N dodecan-1-ol Chemical compound CCCCCCCCCCCCO LQZZUXJYWNFBMV-UHFFFAOYSA-N 0.000 description 14
- 230000004927 fusion Effects 0.000 description 14
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 14
- 230000002018 overexpression Effects 0.000 description 14
- 230000037361 pathway Effects 0.000 description 14
- 229920002477 rna polymer Polymers 0.000 description 14
- 239000004094 surface-active agent Substances 0.000 description 14
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 13
- 239000003599 detergent Substances 0.000 description 13
- 239000002028 Biomass Substances 0.000 description 12
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 12
- 229960003669 carbenicillin Drugs 0.000 description 12
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 12
- 229960005091 chloramphenicol Drugs 0.000 description 12
- 230000002255 enzymatic effect Effects 0.000 description 12
- 239000006166 lysate Substances 0.000 description 12
- 150000002894 organic compounds Chemical class 0.000 description 12
- 102000005602 Aldo-Keto Reductases Human genes 0.000 description 11
- 108010084469 Aldo-Keto Reductases Proteins 0.000 description 11
- 238000003556 assay Methods 0.000 description 11
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 11
- 230000035772 mutation Effects 0.000 description 11
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 description 11
- 238000000746 purification Methods 0.000 description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 description 10
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 10
- 235000010633 broth Nutrition 0.000 description 10
- 229910002092 carbon dioxide Inorganic materials 0.000 description 10
- 230000001413 cellular effect Effects 0.000 description 10
- 102000014914 Carrier Proteins Human genes 0.000 description 9
- 108010078791 Carrier Proteins Proteins 0.000 description 9
- 108010039731 Fatty Acid Synthases Proteins 0.000 description 9
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 9
- 150000001298 alcohols Chemical class 0.000 description 9
- 150000001720 carbohydrates Chemical class 0.000 description 9
- 239000003814 drug Substances 0.000 description 9
- 239000003550 marker Substances 0.000 description 9
- 108020004999 messenger RNA Proteins 0.000 description 9
- 235000000346 sugar Nutrition 0.000 description 9
- 102000009105 Short Chain Dehydrogenase-Reductases Human genes 0.000 description 8
- 108010048287 Short Chain Dehydrogenase-Reductases Proteins 0.000 description 8
- 239000000654 additive Substances 0.000 description 8
- 230000008827 biological function Effects 0.000 description 8
- 101150058049 car gene Proteins 0.000 description 8
- 125000004122 cyclic group Chemical group 0.000 description 8
- 101150015067 fabB gene Proteins 0.000 description 8
- 239000002816 fuel additive Chemical class 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 238000009396 hybridization Methods 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 238000000926 separation method Methods 0.000 description 8
- 229910052717 sulfur Inorganic materials 0.000 description 8
- 239000011593 sulfur Substances 0.000 description 8
- 101100319874 Escherichia coli (strain K12) yahK gene Proteins 0.000 description 7
- 238000009825 accumulation Methods 0.000 description 7
- 235000014633 carbohydrates Nutrition 0.000 description 7
- 239000001569 carbon dioxide Substances 0.000 description 7
- 230000000295 complement effect Effects 0.000 description 7
- BXWNKGSJHAJOGX-UHFFFAOYSA-N hexadecan-1-ol Chemical compound CCCCCCCCCCCCCCCCO BXWNKGSJHAJOGX-UHFFFAOYSA-N 0.000 description 7
- 238000002955 isolation Methods 0.000 description 7
- 150000002632 lipids Chemical class 0.000 description 7
- 229920000642 polymer Polymers 0.000 description 7
- 239000002243 precursor Substances 0.000 description 7
- 238000012360 testing method Methods 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 102000000452 Acetyl-CoA carboxylase Human genes 0.000 description 6
- 108010016219 Acetyl-CoA carboxylase Proteins 0.000 description 6
- 241000588625 Acinetobacter sp. Species 0.000 description 6
- 241000193830 Bacillus <bacterium> Species 0.000 description 6
- 108010018763 Biotin carboxylase Proteins 0.000 description 6
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 6
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 6
- 241000660147 Escherichia coli str. K-12 substr. MG1655 Species 0.000 description 6
- 101150071111 FADD gene Proteins 0.000 description 6
- 108010011927 Long-chain-alcohol dehydrogenase Proteins 0.000 description 6
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 6
- 239000003242 anti bacterial agent Substances 0.000 description 6
- 239000000356 contaminant Substances 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 230000004133 fatty acid degradation Effects 0.000 description 6
- 235000019387 fatty acid methyl ester Nutrition 0.000 description 6
- 239000000499 gel Substances 0.000 description 6
- 229910052760 oxygen Inorganic materials 0.000 description 6
- 239000002994 raw material Substances 0.000 description 6
- 230000009467 reduction Effects 0.000 description 6
- 230000010076 replication Effects 0.000 description 6
- 239000011780 sodium chloride Substances 0.000 description 6
- 239000002904 solvent Substances 0.000 description 6
- 238000003786 synthesis reaction Methods 0.000 description 6
- 239000001993 wax Substances 0.000 description 6
- 102100025573 1-alkyl-2-acetylglycerophosphocholine esterase Human genes 0.000 description 5
- 102100022089 Acyl-[acyl-carrier-protein] hydrolase Human genes 0.000 description 5
- 101100160356 Bacillus subtilis (strain 168) yncB gene Proteins 0.000 description 5
- 229920000324 Cellulosome Polymers 0.000 description 5
- 241000195493 Cryptophyta Species 0.000 description 5
- 101710088194 Dehydrogenase Proteins 0.000 description 5
- 101000836720 Dictyostelium discoideum Aldose reductase A Proteins 0.000 description 5
- 101100329777 Escherichia coli (strain K12) curA gene Proteins 0.000 description 5
- 241000238631 Hexapoda Species 0.000 description 5
- 241000223198 Humicola Species 0.000 description 5
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 5
- 241000235395 Mucor Species 0.000 description 5
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 5
- 239000004473 Threonine Substances 0.000 description 5
- 239000007983 Tris buffer Substances 0.000 description 5
- 230000002378 acidificating effect Effects 0.000 description 5
- 230000000996 additive effect Effects 0.000 description 5
- 125000000217 alkyl group Chemical group 0.000 description 5
- 125000003118 aryl group Chemical group 0.000 description 5
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 5
- 239000003225 biodiesel Substances 0.000 description 5
- 101150070764 carB gene Proteins 0.000 description 5
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 5
- 210000000166 cellulosome Anatomy 0.000 description 5
- 238000010276 construction Methods 0.000 description 5
- 230000003247 decreasing effect Effects 0.000 description 5
- 230000007613 environmental effect Effects 0.000 description 5
- 210000003527 eukaryotic cell Anatomy 0.000 description 5
- 239000012530 fluid Substances 0.000 description 5
- 239000001963 growth medium Substances 0.000 description 5
- 150000002576 ketones Chemical class 0.000 description 5
- 230000000670 limiting effect Effects 0.000 description 5
- 210000004962 mammalian cell Anatomy 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 239000003921 oil Substances 0.000 description 5
- JRZJOMJEPLMPRA-UHFFFAOYSA-N olefin Natural products CCCCCCCC=C JRZJOMJEPLMPRA-UHFFFAOYSA-N 0.000 description 5
- 239000001301 oxygen Substances 0.000 description 5
- 238000005192 partition Methods 0.000 description 5
- 239000012071 phase Substances 0.000 description 5
- 239000004014 plasticizer Substances 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 239000012925 reference material Substances 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 229920005989 resin Polymers 0.000 description 5
- 239000011347 resin Substances 0.000 description 5
- 230000002441 reversible effect Effects 0.000 description 5
- 241000894007 species Species 0.000 description 5
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 5
- 238000012795 verification Methods 0.000 description 5
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 4
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 4
- 235000014469 Bacillus subtilis Nutrition 0.000 description 4
- 101100493788 Bacillus subtilis (strain 168) bdhA gene Proteins 0.000 description 4
- 101100488142 Escherichia coli (strain K12) ydjL gene Proteins 0.000 description 4
- 102000004195 Isomerases Human genes 0.000 description 4
- 108090000769 Isomerases Proteins 0.000 description 4
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 4
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 4
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 4
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 4
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 4
- LTYOQGRJFJAKNA-KKIMTKSISA-N Malonyl CoA Natural products S(C(=O)CC(=O)O)CCNC(=O)CCNC(=O)[C@@H](O)C(CO[P@](=O)(O[P@](=O)(OC[C@H]1[C@@H](OP(=O)(O)O)[C@@H](O)[C@@H](n2c3ncnc(N)c3nc2)O1)O)O)(C)C LTYOQGRJFJAKNA-KKIMTKSISA-N 0.000 description 4
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 4
- 101710137500 T7 RNA polymerase Proteins 0.000 description 4
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 4
- 101150070497 accC gene Proteins 0.000 description 4
- 230000004075 alteration Effects 0.000 description 4
- 125000003368 amide group Chemical group 0.000 description 4
- 239000012298 atmosphere Substances 0.000 description 4
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 125000004432 carbon atom Chemical group C* 0.000 description 4
- 230000003915 cell function Effects 0.000 description 4
- 210000000349 chromosome Anatomy 0.000 description 4
- 229940079593 drug Drugs 0.000 description 4
- 230000009977 dual effect Effects 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 239000000284 extract Substances 0.000 description 4
- 101150090981 fabG gene Proteins 0.000 description 4
- 238000002744 homologous recombination Methods 0.000 description 4
- 230000006801 homologous recombination Effects 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 230000001939 inductive effect Effects 0.000 description 4
- 239000000543 intermediate Substances 0.000 description 4
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 4
- 229960000310 isoleucine Drugs 0.000 description 4
- 239000000314 lubricant Substances 0.000 description 4
- 101150068528 mabA gene Proteins 0.000 description 4
- LTYOQGRJFJAKNA-DVVLENMVSA-N malonyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-DVVLENMVSA-N 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 238000005259 measurement Methods 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 4
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 4
- 230000000243 photosynthetic effect Effects 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 210000001236 prokaryotic cell Anatomy 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 4
- 229960000268 spectinomycin Drugs 0.000 description 4
- HLZKNKRTKFSKGZ-UHFFFAOYSA-N tetradecan-1-ol Chemical compound CCCCCCCCCCCCCCO HLZKNKRTKFSKGZ-UHFFFAOYSA-N 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 4
- 239000004474 valine Substances 0.000 description 4
- 230000003612 virological effect Effects 0.000 description 4
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 description 3
- 241001165345 Acinetobacter baylyi Species 0.000 description 3
- 101100388296 Arabidopsis thaliana DTX51 gene Proteins 0.000 description 3
- 239000004475 Arginine Substances 0.000 description 3
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 3
- 241000228212 Aspergillus Species 0.000 description 3
- 244000063299 Bacillus subtilis Species 0.000 description 3
- 101100127701 Bacillus subtilis (strain 168) lcfB gene Proteins 0.000 description 3
- 101100052833 Bacillus subtilis (strain 168) yhdH gene Proteins 0.000 description 3
- UHOVQNZJYSORNB-UHFFFAOYSA-N Benzene Chemical class C1=CC=CC=C1 UHOVQNZJYSORNB-UHFFFAOYSA-N 0.000 description 3
- 241000123346 Chrysosporium Species 0.000 description 3
- 241000588722 Escherichia Species 0.000 description 3
- 101100378410 Escherichia coli (strain K12) acuI gene Proteins 0.000 description 3
- 101100502354 Escherichia coli (strain K12) fadK gene Proteins 0.000 description 3
- 101100201842 Escherichia coli (strain K12) rspB gene Proteins 0.000 description 3
- 241001646716 Escherichia coli K-12 Species 0.000 description 3
- 102000015303 Fatty Acid Synthases Human genes 0.000 description 3
- 241000223218 Fusarium Species 0.000 description 3
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 3
- 102000004867 Hydro-Lyases Human genes 0.000 description 3
- 108090001042 Hydro-Lyases Proteins 0.000 description 3
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 3
- 241000235649 Kluyveromyces Species 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 3
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 3
- 241000186660 Lactobacillus Species 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 239000004472 Lysine Substances 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 241001646725 Mycobacterium tuberculosis H37Rv Species 0.000 description 3
- 241000221960 Neurospora Species 0.000 description 3
- 241000228143 Penicillium Species 0.000 description 3
- 241000222385 Phanerochaete Species 0.000 description 3
- 241000235648 Pichia Species 0.000 description 3
- 241000222350 Pleurotus Species 0.000 description 3
- 241000589516 Pseudomonas Species 0.000 description 3
- 241000235402 Rhizomucor Species 0.000 description 3
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 3
- 241000235070 Saccharomyces Species 0.000 description 3
- 101100215626 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ADP1 gene Proteins 0.000 description 3
- 241000235346 Schizosaccharomyces Species 0.000 description 3
- 241000187747 Streptomyces Species 0.000 description 3
- 229930006000 Sucrose Natural products 0.000 description 3
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 3
- 241000222354 Trametes Species 0.000 description 3
- 241000223259 Trichoderma Species 0.000 description 3
- 241000235013 Yarrowia Species 0.000 description 3
- 240000008042 Zea mays Species 0.000 description 3
- 125000002252 acyl group Chemical group 0.000 description 3
- 235000004279 alanine Nutrition 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 230000000692 anti-sense effect Effects 0.000 description 3
- 229940088710 antibiotic agent Drugs 0.000 description 3
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 3
- 235000009582 asparagine Nutrition 0.000 description 3
- 229960001230 asparagine Drugs 0.000 description 3
- 235000003704 aspartic acid Nutrition 0.000 description 3
- 238000007845 assembly PCR Methods 0.000 description 3
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 230000010261 cell growth Effects 0.000 description 3
- 239000013592 cell lysate Substances 0.000 description 3
- 229920002678 cellulose Polymers 0.000 description 3
- 239000001913 cellulose Substances 0.000 description 3
- 235000010980 cellulose Nutrition 0.000 description 3
- 229960000541 cetyl alcohol Drugs 0.000 description 3
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 3
- 239000000975 dye Substances 0.000 description 3
- 239000002024 ethyl acetate extract Substances 0.000 description 3
- 101150016526 fadE gene Proteins 0.000 description 3
- 108010075712 fatty acid reductase Proteins 0.000 description 3
- 239000010408 film Substances 0.000 description 3
- 238000000769 gas chromatography-flame ionisation detection Methods 0.000 description 3
- 235000013922 glutamic acid Nutrition 0.000 description 3
- 239000004220 glutamic acid Substances 0.000 description 3
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 3
- 239000005431 greenhouse gas Substances 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 239000001257 hydrogen Substances 0.000 description 3
- 229910052739 hydrogen Inorganic materials 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 235000011073 invertase Nutrition 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 229940039696 lactobacillus Drugs 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 230000014759 maintenance of location Effects 0.000 description 3
- 238000001819 mass spectrum Methods 0.000 description 3
- 230000002503 metabolic effect Effects 0.000 description 3
- 239000000178 monomer Substances 0.000 description 3
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 3
- 229940049964 oleate Drugs 0.000 description 3
- 239000003348 petrochemical agent Substances 0.000 description 3
- 239000004033 plastic Substances 0.000 description 3
- 229920003023 plastic Polymers 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 230000001568 sexual effect Effects 0.000 description 3
- 238000002741 site-directed mutagenesis Methods 0.000 description 3
- 125000005480 straight-chain fatty acid group Chemical group 0.000 description 3
- 239000005720 sucrose Substances 0.000 description 3
- 239000004753 textile Substances 0.000 description 3
- 238000004809 thin layer chromatography Methods 0.000 description 3
- 239000002699 waste material Substances 0.000 description 3
- 101150052877 ycjQ gene Proteins 0.000 description 3
- 101150077217 yhfL gene Proteins 0.000 description 3
- GGQQNYXPYWCUHG-RMTFUQJTSA-N (3e,6e)-deca-3,6-diene Chemical compound CCC\C=C\C\C=C\CC GGQQNYXPYWCUHG-RMTFUQJTSA-N 0.000 description 2
- AMTITFMUKRZZEE-WAYWQWQTSA-N (Z)-hexadec-11-enal Chemical compound CCCC\C=C/CCCCCCCCCC=O AMTITFMUKRZZEE-WAYWQWQTSA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- 241000897241 Acinetobacter sp. ADP1 Species 0.000 description 2
- 241000186361 Actinobacteria <class> Species 0.000 description 2
- 101710146995 Acyl carrier protein Proteins 0.000 description 2
- 108700037654 Acyl carrier protein (ACP) Proteins 0.000 description 2
- 102000048456 Acyl carrier protein (ACP) Human genes 0.000 description 2
- 229920000936 Agarose Polymers 0.000 description 2
- XKRFYHLGVUSROY-UHFFFAOYSA-N Argon Chemical compound [Ar] XKRFYHLGVUSROY-UHFFFAOYSA-N 0.000 description 2
- 241001513093 Aspergillus awamori Species 0.000 description 2
- 241000892910 Aspergillus foetidus Species 0.000 description 2
- 241001225321 Aspergillus fumigatus Species 0.000 description 2
- 241000351920 Aspergillus nidulans Species 0.000 description 2
- 241000228245 Aspergillus niger Species 0.000 description 2
- 240000006439 Aspergillus oryzae Species 0.000 description 2
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 2
- 241000193744 Bacillus amyloliquefaciens Species 0.000 description 2
- 101100012355 Bacillus anthracis fabH1 gene Proteins 0.000 description 2
- 241000193752 Bacillus circulans Species 0.000 description 2
- 241001328122 Bacillus clausii Species 0.000 description 2
- 241000193749 Bacillus coagulans Species 0.000 description 2
- 241000193422 Bacillus lentus Species 0.000 description 2
- 241000194107 Bacillus megaterium Species 0.000 description 2
- 241000194103 Bacillus pumilus Species 0.000 description 2
- 101100000756 Bacillus subtilis (strain 168) acpA gene Proteins 0.000 description 2
- 101100012357 Bacillus subtilis (strain 168) fabHA gene Proteins 0.000 description 2
- 101100066244 Bacillus subtilis (strain 168) fadF gene Proteins 0.000 description 2
- 241000193388 Bacillus thuringiensis Species 0.000 description 2
- 241000193764 Brevibacillus brevis Species 0.000 description 2
- 101100490145 Clostridium perfringens (strain 13 / Type A) ackA2 gene Proteins 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 241001464430 Cyanobacterium Species 0.000 description 2
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 2
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 108020005199 Dehydrogenases Proteins 0.000 description 2
- 101100108235 Escherichia coli (strain K12) adhP gene Proteins 0.000 description 2
- 101100016532 Escherichia coli (strain K12) hcaF gene Proteins 0.000 description 2
- 101100096644 Escherichia coli (strain K12) srlD gene Proteins 0.000 description 2
- 101100431554 Escherichia coli (strain K12) ybdR gene Proteins 0.000 description 2
- 101100267667 Escherichia coli (strain K12) yohF gene Proteins 0.000 description 2
- 229930091371 Fructose Natural products 0.000 description 2
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 2
- 239000005715 Fructose Substances 0.000 description 2
- 230000005526 G1 to G0 transition Effects 0.000 description 2
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 101000937642 Homo sapiens Malonyl-CoA-acyl carrier protein transacylase, mitochondrial Proteins 0.000 description 2
- 241001480714 Humicola insolens Species 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- 101100433987 Latilactobacillus sakei subsp. sakei (strain 23K) ackA1 gene Proteins 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 239000006142 Luria-Bertani Agar Substances 0.000 description 2
- 102100027329 Malonyl-CoA-acyl carrier protein transacylase, mitochondrial Human genes 0.000 description 2
- 229910021380 Manganese Chloride Inorganic materials 0.000 description 2
- GLFNIEUTAYBVOC-UHFFFAOYSA-L Manganese chloride Chemical compound Cl[Mn]Cl GLFNIEUTAYBVOC-UHFFFAOYSA-L 0.000 description 2
- 241001025881 Mycobacterium smegmatis str. MC2 155 Species 0.000 description 2
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- 241001520808 Panicum virgatum Species 0.000 description 2
- 101100462488 Phlebiopsis gigantea p2ox gene Proteins 0.000 description 2
- 241000235403 Rhizomucor miehei Species 0.000 description 2
- 241001524101 Rhodococcus opacus Species 0.000 description 2
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 2
- 101100066242 Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) fadE gene Proteins 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- 241000187398 Streptomyces lividans Species 0.000 description 2
- 241001468239 Streptomyces murinus Species 0.000 description 2
- 101100297542 Streptomyces viridochromogenes (strain DSM 40736 / JCM 4977 / BCRC 1201 / Tue 494) phpC gene Proteins 0.000 description 2
- RAHZWNYVWXNFOC-UHFFFAOYSA-N Sulphur dioxide Chemical compound O=S=O RAHZWNYVWXNFOC-UHFFFAOYSA-N 0.000 description 2
- 241000192560 Synechococcus sp. Species 0.000 description 2
- 108010006785 Taq Polymerase Proteins 0.000 description 2
- MUMGGOZAMZWBJJ-DYKIIFRCSA-N Testostosterone Chemical compound O=C1CC[C@]2(C)[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CCC2=C1 MUMGGOZAMZWBJJ-DYKIIFRCSA-N 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 241000378866 Trichoderma koningii Species 0.000 description 2
- 241000223262 Trichoderma longibrachiatum Species 0.000 description 2
- 241000499912 Trichoderma reesei Species 0.000 description 2
- 241000223261 Trichoderma viride Species 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- AMTITFMUKRZZEE-UHFFFAOYSA-N Z11-16:Ald Natural products CCCCC=CCCCCCCCCCC=O AMTITFMUKRZZEE-UHFFFAOYSA-N 0.000 description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 2
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 2
- 238000004760 accelerator mass spectrometry Methods 0.000 description 2
- 101150006213 ackA gene Proteins 0.000 description 2
- 101150023061 acpP gene Proteins 0.000 description 2
- 101150051130 acpP1 gene Proteins 0.000 description 2
- 101150014383 adhE gene Proteins 0.000 description 2
- 239000000853 adhesive Substances 0.000 description 2
- 230000001070 adhesive effect Effects 0.000 description 2
- 238000005273 aeration Methods 0.000 description 2
- 238000003915 air pollution Methods 0.000 description 2
- 125000001931 aliphatic group Chemical group 0.000 description 2
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 2
- 239000008346 aqueous phase Substances 0.000 description 2
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 2
- 230000001651 autotrophic effect Effects 0.000 description 2
- 229940054340 bacillus coagulans Drugs 0.000 description 2
- 229940097012 bacillus thuringiensis Drugs 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 2
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 2
- 230000002457 bidirectional effect Effects 0.000 description 2
- 239000012620 biological material Substances 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 101150014229 carA gene Proteins 0.000 description 2
- 150000001721 carbon Chemical group 0.000 description 2
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 2
- 150000007942 carboxylates Chemical group 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 238000004523 catalytic cracking Methods 0.000 description 2
- 239000006143 cell culture medium Substances 0.000 description 2
- 210000003850 cellular structure Anatomy 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 235000005822 corn Nutrition 0.000 description 2
- 239000002537 cosmetic Substances 0.000 description 2
- 238000005336 cracking Methods 0.000 description 2
- 150000001924 cycloalkanes Chemical class 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- KSMVZQYAVGTKIV-UHFFFAOYSA-N decanal Chemical compound CCCCCCCCCC=O KSMVZQYAVGTKIV-UHFFFAOYSA-N 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 239000000645 desinfectant Substances 0.000 description 2
- 239000002283 diesel fuel Substances 0.000 description 2
- 239000013024 dilution buffer Substances 0.000 description 2
- 150000002016 disaccharides Chemical class 0.000 description 2
- 101150084197 dkgA gene Proteins 0.000 description 2
- 101150019698 dkgB gene Proteins 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 101150026389 fabF gene Proteins 0.000 description 2
- 101150035981 fabH gene Proteins 0.000 description 2
- 101150115959 fadR gene Proteins 0.000 description 2
- 239000003925 fat Substances 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- 230000037433 frameshift Effects 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 229960001031 glucose Drugs 0.000 description 2
- 230000034659 glycolysis Effects 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 101150082055 hcaB gene Proteins 0.000 description 2
- 108090001018 hexadecanal dehydrogenase (acylating) Proteins 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 229940088597 hormone Drugs 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 230000000155 isotopic effect Effects 0.000 description 2
- 238000011031 large-scale manufacturing process Methods 0.000 description 2
- 239000003446 ligand Substances 0.000 description 2
- 238000004811 liquid chromatography Methods 0.000 description 2
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 2
- 239000010687 lubricating oil Substances 0.000 description 2
- 229910001629 magnesium chloride Inorganic materials 0.000 description 2
- 239000011565 manganese chloride Substances 0.000 description 2
- 238000004949 mass spectrometry Methods 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 239000013028 medium composition Substances 0.000 description 2
- 230000037353 metabolic pathway Effects 0.000 description 2
- 239000004530 micro-emulsion Substances 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 150000002772 monosaccharides Chemical class 0.000 description 2
- 235000021281 monounsaturated fatty acids Nutrition 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- GLDOVTGHNKAZLK-UHFFFAOYSA-N octadecan-1-ol Chemical compound CCCCCCCCCCCCCCCCCCO GLDOVTGHNKAZLK-UHFFFAOYSA-N 0.000 description 2
- TVMXDCGIABBOFY-UHFFFAOYSA-N octane Chemical compound CCCCCCCC TVMXDCGIABBOFY-UHFFFAOYSA-N 0.000 description 2
- 210000003463 organelle Anatomy 0.000 description 2
- 150000007524 organic acids Chemical class 0.000 description 2
- 235000005985 organic acids Nutrition 0.000 description 2
- 235000006408 oxalic acid Nutrition 0.000 description 2
- 150000002913 oxalic acids Chemical class 0.000 description 2
- 125000004430 oxygen atom Chemical group O* 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 239000002304 perfume Substances 0.000 description 2
- 239000011846 petroleum-based material Substances 0.000 description 2
- 239000013520 petroleum-based product Substances 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 2
- 229920001282 polysaccharide Polymers 0.000 description 2
- 239000005017 polysaccharide Substances 0.000 description 2
- 150000004804 polysaccharides Chemical class 0.000 description 2
- 101150060030 poxB gene Proteins 0.000 description 2
- 239000003755 preservative agent Substances 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 230000018612 quorum sensing Effects 0.000 description 2
- 239000011535 reaction buffer Substances 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 239000001488 sodium phosphate Substances 0.000 description 2
- 229910000162 sodium phosphate Inorganic materials 0.000 description 2
- 230000000087 stabilizing effect Effects 0.000 description 2
- 239000008107 starch Substances 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- POOSGDOYLQNASK-UHFFFAOYSA-N tetracosane Chemical compound CCCCCCCCCCCCCCCCCCCCCCCC POOSGDOYLQNASK-UHFFFAOYSA-N 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 230000011637 translesion synthesis Effects 0.000 description 2
- IIYFAKIEWZDVMP-NJFSPNSNSA-N tridecane Chemical group CCCCCCCCCCCC[14CH3] IIYFAKIEWZDVMP-NJFSPNSNSA-N 0.000 description 2
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 2
- 101150046028 umuD gene Proteins 0.000 description 2
- 241000701447 unidentified baculovirus Species 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 210000003501 vero cell Anatomy 0.000 description 2
- 239000011782 vitamin Substances 0.000 description 2
- 229940088594 vitamin Drugs 0.000 description 2
- 235000013343 vitamin Nutrition 0.000 description 2
- 229930003231 vitamin Natural products 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 239000002023 wood Substances 0.000 description 2
- 101150010642 ybbO gene Proteins 0.000 description 2
- 101150040649 ydjJ gene Proteins 0.000 description 2
- 101150097894 yghA gene Proteins 0.000 description 2
- ALSTYHKOOCGGFT-KTKRTIGZSA-N (9Z)-octadecen-1-ol Chemical compound CCCCCCCC\C=C/CCCCCCCCO ALSTYHKOOCGGFT-KTKRTIGZSA-N 0.000 description 1
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 1
- CQKHFONAFZDDKV-VAWYXSNFSA-N (e)-dodec-1-en-1-ol Chemical compound CCCCCCCCCC\C=C\O CQKHFONAFZDDKV-VAWYXSNFSA-N 0.000 description 1
- GWSURTDMLUFMJH-FOCLMDBBSA-N (e)-hexadec-1-en-1-ol Chemical compound CCCCCCCCCCCCCC\C=C\O GWSURTDMLUFMJH-FOCLMDBBSA-N 0.000 description 1
- JEGNXMUWVCVSSQ-ISLYRVAYSA-N (e)-octadec-1-en-1-ol Chemical compound CCCCCCCCCCCCCCCC\C=C\O JEGNXMUWVCVSSQ-ISLYRVAYSA-N 0.000 description 1
- GWSURTDMLUFMJH-NXVVXOECSA-N (z)-hexadec-1-en-1-ol Chemical compound CCCCCCCCCCCCCC\C=C/O GWSURTDMLUFMJH-NXVVXOECSA-N 0.000 description 1
- MXHRCPNRJAMMIM-SHYZEUOFSA-N 2'-deoxyuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-SHYZEUOFSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-SHYZEUOFSA-N 2'‐deoxycytidine Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-SHYZEUOFSA-N 0.000 description 1
- WIIZWVCIJKGZOK-IUCAKERBSA-N 2,2-dichloro-n-[(1s,2s)-1,3-dihydroxy-1-(4-nitrophenyl)propan-2-yl]acetamide Chemical compound ClC(Cl)C(=O)N[C@@H](CO)[C@@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-IUCAKERBSA-N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 1
- YAXXOCZAXKLLCV-UHFFFAOYSA-N 3-dodecyloxolane-2,5-dione Chemical class CCCCCCCCCCCCC1CC(=O)OC1=O YAXXOCZAXKLLCV-UHFFFAOYSA-N 0.000 description 1
- LUCHPKXVUGJYGU-XLPZGREQSA-N 5-methyl-2'-deoxycytidine Chemical compound O=C1N=C(N)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 LUCHPKXVUGJYGU-XLPZGREQSA-N 0.000 description 1
- 102000004567 6-phosphogluconate dehydrogenase Human genes 0.000 description 1
- 108020001657 6-phosphogluconate dehydrogenase Proteins 0.000 description 1
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 108010001058 Acyl-CoA Dehydrogenase Proteins 0.000 description 1
- 102000002296 Acyl-CoA Dehydrogenases Human genes 0.000 description 1
- 102000004539 Acyl-CoA Oxidase Human genes 0.000 description 1
- 108020001558 Acyl-CoA oxidase Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 101710124383 Alcohol dehydrogenase YqhD Proteins 0.000 description 1
- 108010025188 Alcohol oxidase Proteins 0.000 description 1
- 108020002663 Aldehyde Dehydrogenase Proteins 0.000 description 1
- 102000005369 Aldehyde Dehydrogenase Human genes 0.000 description 1
- 108020004306 Alpha-ketoglutarate dehydrogenase Chemical class 0.000 description 1
- 102000006589 Alpha-ketoglutarate dehydrogenase Human genes 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- 101100214752 Arabidopsis thaliana ABCG12 gene Proteins 0.000 description 1
- 101001062362 Arabidopsis thaliana Berberine bridge enzyme-like 3 Proteins 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 241000423334 Bacillus halodurans C-125 Species 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 101100223766 Bacillus subtilis (strain 168) des gene Proteins 0.000 description 1
- 101100280474 Bacillus subtilis (strain 168) fabL gene Proteins 0.000 description 1
- 101100098786 Bacillus subtilis (strain 168) tapA gene Proteins 0.000 description 1
- 101100159320 Bacillus subtilis (strain 168) ybdG gene Proteins 0.000 description 1
- 101100213149 Bacillus subtilis (strain 168) ydbC gene Proteins 0.000 description 1
- 229920001342 Bakelite® Polymers 0.000 description 1
- DKPFZGUDAPQIHT-UHFFFAOYSA-N Butyl acetate Natural products CCCCOC(C)=O DKPFZGUDAPQIHT-UHFFFAOYSA-N 0.000 description 1
- 241000244203 Caenorhabditis elegans Species 0.000 description 1
- 101100322583 Caenorhabditis elegans add-2 gene Proteins 0.000 description 1
- UGFAIRIUMAVXCW-UHFFFAOYSA-N Carbon monoxide Chemical compound [O+]#[C-] UGFAIRIUMAVXCW-UHFFFAOYSA-N 0.000 description 1
- 241000195597 Chlamydomonas reinhardtii Species 0.000 description 1
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 1
- 241000191382 Chlorobaculum tepidum Species 0.000 description 1
- 235000001258 Cinchona calisaya Nutrition 0.000 description 1
- 241000186566 Clostridium ljungdahlii Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 241000589519 Comamonas Species 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- JDMUPRLRUUMCTL-VIFPVBQESA-N D-pantetheine 4'-phosphate Chemical compound OP(=O)(O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS JDMUPRLRUUMCTL-VIFPVBQESA-N 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- CKTSBUTUHBMZGZ-UHFFFAOYSA-N Deoxycytidine Natural products O=C1N=C(N)C=CN1C1OC(CO)C(O)C1 CKTSBUTUHBMZGZ-UHFFFAOYSA-N 0.000 description 1
- 108010089760 Electron Transport Complex I Proteins 0.000 description 1
- 102000008013 Electron Transport Complex I Human genes 0.000 description 1
- 101100296146 Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) pab1 gene Proteins 0.000 description 1
- 108010013369 Enteropeptidase Proteins 0.000 description 1
- 102100029727 Enteropeptidase Human genes 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 101100069069 Escherichia coli (strain K12) gnsB gene Proteins 0.000 description 1
- 101100338498 Escherichia coli (strain K12) hcxA gene Proteins 0.000 description 1
- 101100018413 Escherichia coli (strain K12) idnO gene Proteins 0.000 description 1
- 101100181863 Escherichia coli (strain K12) lgoD gene Proteins 0.000 description 1
- 101100463242 Escherichia coli (strain K12) pdxI gene Proteins 0.000 description 1
- 101100159722 Escherichia coli (strain K12) yeaE gene Proteins 0.000 description 1
- 101100320410 Escherichia coli (strain K12) ygcW gene Proteins 0.000 description 1
- 101100267109 Escherichia coli (strain K12) ygfF gene Proteins 0.000 description 1
- 101100320445 Escherichia coli (strain K12) yghD gene Proteins 0.000 description 1
- 101100269244 Escherichia coli (strain K12) yiaY gene Proteins 0.000 description 1
- 101100321116 Escherichia coli (strain K12) yqhD gene Proteins 0.000 description 1
- 101100545052 Escherichia coli (strain K12) yuaV gene Proteins 0.000 description 1
- 241000701988 Escherichia virus T5 Species 0.000 description 1
- VGGSQFUCUMXWEO-UHFFFAOYSA-N Ethene Chemical compound C=C VGGSQFUCUMXWEO-UHFFFAOYSA-N 0.000 description 1
- 239000005977 Ethylene Substances 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108091029865 Exogenous DNA Proteins 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 102100027297 Fatty acid 2-hydroxylase Human genes 0.000 description 1
- 108010087894 Fatty acid desaturases Proteins 0.000 description 1
- 102000009114 Fatty acid desaturases Human genes 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 108091072448 Gfo/Idh/MocA family Proteins 0.000 description 1
- 102000038737 Gfo/Idh/MocA family Human genes 0.000 description 1
- 108010023021 Glutamyl-tRNA reductase Proteins 0.000 description 1
- 108010063907 Glutathione Reductase Proteins 0.000 description 1
- 102100036442 Glutathione reductase, mitochondrial Human genes 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 241000606768 Haemophilus influenzae Species 0.000 description 1
- 241000282414 Homo sapiens Species 0.000 description 1
- 101000937693 Homo sapiens Fatty acid 2-hydroxylase Proteins 0.000 description 1
- 241000701109 Human adenovirus 2 Species 0.000 description 1
- 108010087227 IMP Dehydrogenase Proteins 0.000 description 1
- 102000006674 IMP dehydrogenase Human genes 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- JVTAAEKCZFNVCJ-UHFFFAOYSA-M Lactate Chemical compound CC(O)C([O-])=O JVTAAEKCZFNVCJ-UHFFFAOYSA-M 0.000 description 1
- 101100393312 Lactobacillus delbrueckii subsp. bulgaricus (strain ATCC 11842 / DSM 20081 / BCRC 10696 / JCM 1002 / NBRC 13953 / NCIMB 11778 / NCTC 12712 / WDCM 00102 / Lb 14) gpsA1 gene Proteins 0.000 description 1
- 235000019738 Limestone Nutrition 0.000 description 1
- 102000004882 Lipase Human genes 0.000 description 1
- 108090001060 Lipase Proteins 0.000 description 1
- 239000004367 Lipase Substances 0.000 description 1
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 108020000290 Mannitol dehydrogenase Proteins 0.000 description 1
- 108091027974 Mature messenger RNA Proteins 0.000 description 1
- 241001575980 Mendoza Species 0.000 description 1
- 241001074116 Miscanthus x giganteus Species 0.000 description 1
- 101710160066 Mitochondrial holo-[acyl-carrier-protein] synthase Proteins 0.000 description 1
- ZOKXTWBITQBERF-UHFFFAOYSA-N Molybdenum Chemical compound [Mo] ZOKXTWBITQBERF-UHFFFAOYSA-N 0.000 description 1
- 102000002568 Multienzyme Complexes Human genes 0.000 description 1
- 108010093369 Multienzyme Complexes Proteins 0.000 description 1
- 108700035964 Mycobacterium tuberculosis HsaD Proteins 0.000 description 1
- XCOBLONWWXQEBS-KPKJPENVSA-N N,O-bis(trimethylsilyl)trifluoroacetamide Chemical compound C[Si](C)(C)O\C(C(F)(F)F)=N\[Si](C)(C)C XCOBLONWWXQEBS-KPKJPENVSA-N 0.000 description 1
- VZUNGTLZRAYYDE-UHFFFAOYSA-N N-methyl-N'-nitro-N-nitrosoguanidine Chemical compound O=NN(C)C(=N)N[N+]([O-])=O VZUNGTLZRAYYDE-UHFFFAOYSA-N 0.000 description 1
- 102000002023 NADH:ubiquinone oxidoreductases Human genes 0.000 description 1
- 108050009313 NADH:ubiquinone oxidoreductases Proteins 0.000 description 1
- 101710192343 NADPH:adrenodoxin oxidoreductase, mitochondrial Proteins 0.000 description 1
- 102100036777 NADPH:adrenodoxin oxidoreductase, mitochondrial Human genes 0.000 description 1
- 102000004459 Nitroreductase Human genes 0.000 description 1
- CTQNGGLPUBDAKN-UHFFFAOYSA-N O-Xylene Chemical class CC1=CC=CC=C1C CTQNGGLPUBDAKN-UHFFFAOYSA-N 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 1
- 241000228150 Penicillium chrysogenum Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- 241000209504 Poaceae Species 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 101710104207 Probable NADPH:adrenodoxin oxidoreductase, mitochondrial Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 101710150451 Protein Bel-1 Proteins 0.000 description 1
- 102100037681 Protein FEV Human genes 0.000 description 1
- 101710198166 Protein FEV Proteins 0.000 description 1
- 101710188315 Protein X Proteins 0.000 description 1
- 241001209206 Pseudomonas fluorescens Pf0-1 Species 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical class CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 description 1
- LOUPRKONTZGTKE-WZBLMQSHSA-N Quinine Natural products C([C@H]([C@H](C1)C=C)C2)C[N@@]1[C@@H]2[C@H](O)C1=CC=NC2=CC=C(OC)C=C21 LOUPRKONTZGTKE-WZBLMQSHSA-N 0.000 description 1
- 241000589771 Ralstonia solanacearum Species 0.000 description 1
- 108091007187 Reductases Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 241000191023 Rhodobacter capsulatus Species 0.000 description 1
- 241000187561 Rhodococcus erythropolis Species 0.000 description 1
- 241000190932 Rhodopseudomonas Species 0.000 description 1
- 241000190950 Rhodopseudomonas palustris Species 0.000 description 1
- 241000190984 Rhodospirillum rubrum Species 0.000 description 1
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 241001466077 Salina Species 0.000 description 1
- 241000831652 Salinivibrio sharmensis Species 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 239000004902 Softening Agent Substances 0.000 description 1
- 241000838225 Stenotrophomonas maltophilia R551-3 Species 0.000 description 1
- 241000193998 Streptococcus pneumoniae Species 0.000 description 1
- 241001453296 Synechococcus elongatus Species 0.000 description 1
- 101100433719 Synechococcus elongatus (strain PCC 7942 / FACHB-805) Synpcc7942_1594 gene Proteins 0.000 description 1
- 241000192581 Synechocystis sp. Species 0.000 description 1
- 241000520244 Tatumella citrea Species 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 241001313706 Thermosynechococcus Species 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 108700029229 Transcriptional Regulatory Elements Proteins 0.000 description 1
- 101710195626 Transcriptional activator protein Proteins 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- DRQXUCVJDCRJDB-UHFFFAOYSA-N Turanose Natural products OC1C(CO)OC(O)(CO)C1OC1C(O)C(O)C(O)C(CO)O1 DRQXUCVJDCRJDB-UHFFFAOYSA-N 0.000 description 1
- 101150105063 Ufc1 gene Proteins 0.000 description 1
- ZVNYJIZDIRKMBF-UHFFFAOYSA-N Vesnarinone Chemical compound C1=C(OC)C(OC)=CC=C1C(=O)N1CCN(C=2C=C3CCC(=O)NC3=CC=2)CC1 ZVNYJIZDIRKMBF-UHFFFAOYSA-N 0.000 description 1
- 101100119785 Vibrio anguillarum (strain ATCC 68554 / 775) fatB gene Proteins 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- 239000004164 Wax ester Chemical group 0.000 description 1
- 108010091383 Xanthine dehydrogenase Proteins 0.000 description 1
- 102000005773 Xanthine dehydrogenase Human genes 0.000 description 1
- 108010093894 Xanthine oxidase Proteins 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- 241000588902 Zymomonas mobilis Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 101150057540 aar gene Proteins 0.000 description 1
- 101150095244 ac gene Proteins 0.000 description 1
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 150000007824 aliphatic compounds Chemical class 0.000 description 1
- 150000001345 alkine derivatives Chemical class 0.000 description 1
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- 229910052786 argon Inorganic materials 0.000 description 1
- 239000012300 argon atmosphere Substances 0.000 description 1
- 101150090235 aroB gene Proteins 0.000 description 1
- 150000001491 aromatic compounds Chemical class 0.000 description 1
- 125000004429 atom Chemical group 0.000 description 1
- 239000004637 bakelite Substances 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 230000006860 carbon metabolism Effects 0.000 description 1
- 229910002091 carbon monoxide Inorganic materials 0.000 description 1
- 150000004649 carbonic acid derivatives Chemical class 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 230000021523 carboxylation Effects 0.000 description 1
- 238000006473 carboxylation reaction Methods 0.000 description 1
- 239000012159 carrier gas Substances 0.000 description 1
- 238000012219 cassette mutagenesis Methods 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000001833 catalytic reforming Methods 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 230000004700 cellular uptake Effects 0.000 description 1
- 150000001793 charged compounds Chemical class 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 239000012707 chemical precursor Substances 0.000 description 1
- 238000012824 chemical production Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- LOUPRKONTZGTKE-UHFFFAOYSA-N cinchonine Natural products C1C(C(C2)C=C)CCN2C1C(O)C1=CC=NC2=CC=C(OC)C=C21 LOUPRKONTZGTKE-UHFFFAOYSA-N 0.000 description 1
- ALSTYHKOOCGGFT-UHFFFAOYSA-N cis-oleyl alcohol Natural products CCCCCCCCC=CCCCCCCCCO ALSTYHKOOCGGFT-UHFFFAOYSA-N 0.000 description 1
- 238000012411 cloning technique Methods 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 101150048956 coaA gene Proteins 0.000 description 1
- 239000003245 coal Substances 0.000 description 1
- 238000002485 combustion reaction Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 239000010779 crude oil Substances 0.000 description 1
- 235000021438 curry Nutrition 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 239000007857 degradation product Substances 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 101150097617 desA gene Proteins 0.000 description 1
- MXHRCPNRJAMMIM-UHFFFAOYSA-N desoxyuridine Natural products C1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-UHFFFAOYSA-N 0.000 description 1
- VILAVOFMIJHSJA-UHFFFAOYSA-N dicarbon monoxide Chemical compound [C]=C=O VILAVOFMIJHSJA-UHFFFAOYSA-N 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 108010003123 dihydrolipoamide acyltransferase Proteins 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 108010066830 dimethyl sulfoxide reductase Proteins 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-N dithiophosphoric acid Chemical group OP(O)(S)=S NAGJZTKCGNOGPW-UHFFFAOYSA-N 0.000 description 1
- 229920001971 elastomer Polymers 0.000 description 1
- 239000000806 elastomer Substances 0.000 description 1
- 239000003974 emollient agent Substances 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000007824 enzymatic assay Methods 0.000 description 1
- 230000009483 enzymatic pathway Effects 0.000 description 1
- 150000002118 epoxides Chemical class 0.000 description 1
- 239000003797 essential amino acid Substances 0.000 description 1
- 235000020776 essential amino acid Nutrition 0.000 description 1
- 108010083294 ethanol acyltransferase Proteins 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 108010052305 exodeoxyribonuclease III Proteins 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 101150104461 fabM gene Proteins 0.000 description 1
- 101150061398 fabR gene Proteins 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 230000008713 feedback mechanism Effects 0.000 description 1
- 239000002921 fermentation waste Substances 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 239000003063 flame retardant Substances 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 235000013355 food flavoring agent Nutrition 0.000 description 1
- 239000002803 fossil fuel Substances 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 239000003205 fragrance Substances 0.000 description 1
- 231100000221 frame shift mutation induction Toxicity 0.000 description 1
- 235000021588 free fatty acids Nutrition 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- FTSSQIKWUOOEGC-RULYVFMPSA-N fructooligosaccharide Chemical compound OC[C@H]1O[C@@](CO)(OC[C@@]2(OC[C@@]3(OC[C@@]4(OC[C@@]5(OC[C@@]6(OC[C@@]7(OC[C@@]8(OC[C@@]9(OC[C@@]%10(OC[C@@]%11(O[C@H]%12O[C@H](CO)[C@@H](O)[C@H](O)[C@H]%12O)O[C@H](CO)[C@@H](O)[C@@H]%11O)O[C@H](CO)[C@@H](O)[C@@H]%10O)O[C@H](CO)[C@@H](O)[C@@H]9O)O[C@H](CO)[C@@H](O)[C@@H]8O)O[C@H](CO)[C@@H](O)[C@@H]7O)O[C@H](CO)[C@@H](O)[C@@H]6O)O[C@H](CO)[C@@H](O)[C@@H]5O)O[C@H](CO)[C@@H](O)[C@@H]4O)O[C@H](CO)[C@@H](O)[C@@H]3O)O[C@H](CO)[C@@H](O)[C@@H]2O)[C@@H](O)[C@@H]1O FTSSQIKWUOOEGC-RULYVFMPSA-N 0.000 description 1
- 229940107187 fructooligosaccharide Drugs 0.000 description 1
- 239000000295 fuel oil Substances 0.000 description 1
- ZZUFCTLCJUWOSV-UHFFFAOYSA-N furosemide Chemical compound C1=C(Cl)C(S(=O)(=O)N)=CC(C(O)=O)=C1NCC1=CC=CO1 ZZUFCTLCJUWOSV-UHFFFAOYSA-N 0.000 description 1
- 235000021255 galacto-oligosaccharides Nutrition 0.000 description 1
- 150000003271 galactooligosaccharides Chemical class 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 238000004817 gas chromatography Methods 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 101150036612 gnl gene Proteins 0.000 description 1
- 101150097553 gnsA gene Proteins 0.000 description 1
- 101150095733 gpsA gene Proteins 0.000 description 1
- 230000009643 growth defect Effects 0.000 description 1
- 101150116274 gspA gene Proteins 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 239000001307 helium Substances 0.000 description 1
- 229910052734 helium Inorganic materials 0.000 description 1
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 1
- 125000000623 heterocyclic group Chemical group 0.000 description 1
- GAPJZAHIPBLCCF-UHFFFAOYSA-N hexadec-1-en-1-one Chemical compound CCCCCCCCCCCCCCC=C=O GAPJZAHIPBLCCF-UHFFFAOYSA-N 0.000 description 1
- FUZZWVXGSFPDMH-UHFFFAOYSA-M hexanoate Chemical compound CCCCCC([O-])=O FUZZWVXGSFPDMH-UHFFFAOYSA-M 0.000 description 1
- 238000005984 hydrogenation reaction Methods 0.000 description 1
- 150000001261 hydroxy acids Chemical class 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 239000012770 industrial material Substances 0.000 description 1
- 239000004434 industrial solvent Substances 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 239000001573 invertase Substances 0.000 description 1
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 238000002307 isotope ratio mass spectrometry Methods 0.000 description 1
- 101150046877 kduD gene Proteins 0.000 description 1
- 239000004922 lacquer Substances 0.000 description 1
- 239000010410 layer Substances 0.000 description 1
- 235000021190 leftovers Nutrition 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 239000006028 limestone Substances 0.000 description 1
- 235000019421 lipase Nutrition 0.000 description 1
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 1
- 239000010871 livestock manure Substances 0.000 description 1
- 230000033001 locomotion Effects 0.000 description 1
- 150000004668 long chain fatty acids Chemical class 0.000 description 1
- 239000006210 lotion Substances 0.000 description 1
- 230000028744 lysogeny Effects 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 229920000609 methyl cellulose Polymers 0.000 description 1
- 239000001923 methylcellulose Substances 0.000 description 1
- 235000010981 methylcellulose Nutrition 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 239000006151 minimal media Substances 0.000 description 1
- 229910052750 molybdenum Inorganic materials 0.000 description 1
- 239000011733 molybdenum Substances 0.000 description 1
- 125000004573 morpholin-4-yl group Chemical group N1(CCOCC1)* 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 108020001162 nitroreductase Proteins 0.000 description 1
- 239000002736 nonionic surfactant Substances 0.000 description 1
- JEGNXMUWVCVSSQ-UHFFFAOYSA-N octadec-1-en-1-ol Chemical compound CCCCCCCCCCCCCCCCC=CO JEGNXMUWVCVSSQ-UHFFFAOYSA-N 0.000 description 1
- 239000002674 ointment Substances 0.000 description 1
- XMLQWXUVTXCDDL-UHFFFAOYSA-N oleyl alcohol Natural products CCCCCCC=CCCCCCCCCCCO XMLQWXUVTXCDDL-UHFFFAOYSA-N 0.000 description 1
- 229920001542 oligosaccharide Polymers 0.000 description 1
- 150000002482 oligosaccharides Chemical class 0.000 description 1
- 239000012044 organic layer Substances 0.000 description 1
- 238000012261 overproduction Methods 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 239000003973 paint Substances 0.000 description 1
- 238000007427 paired t-test Methods 0.000 description 1
- SECPZKHBENQXJG-FPLPWBNLSA-M palmitoleate Chemical compound CCCCCC\C=C/CCCCCCCC([O-])=O SECPZKHBENQXJG-FPLPWBNLSA-M 0.000 description 1
- 125000000913 palmityl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 229940014662 pantothenate Drugs 0.000 description 1
- 235000019161 pantothenic acid Nutrition 0.000 description 1
- 239000011713 pantothenic acid Substances 0.000 description 1
- 239000012188 paraffin wax Substances 0.000 description 1
- 229920001277 pectin Polymers 0.000 description 1
- 239000001814 pectin Substances 0.000 description 1
- 235000010987 pectin Nutrition 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 150000002972 pentoses Chemical class 0.000 description 1
- 239000003209 petroleum derivative Substances 0.000 description 1
- 238000005191 phase separation Methods 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- SXADIBFZNXBEGI-UHFFFAOYSA-N phosphoramidous acid Chemical compound NP(O)O SXADIBFZNXBEGI-UHFFFAOYSA-N 0.000 description 1
- 230000029553 photosynthesis Effects 0.000 description 1
- 238000010672 photosynthesis Methods 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 239000010773 plant oil Substances 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 101150112552 plsB gene Proteins 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 239000008057 potassium phosphate buffer Substances 0.000 description 1
- 230000001376 precipitating effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000002028 premature Effects 0.000 description 1
- 239000013615 primer Substances 0.000 description 1
- 239000002987 primer (paints) Substances 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- QQONPFPTGQHPMA-UHFFFAOYSA-N propylene Natural products CC=C QQONPFPTGQHPMA-UHFFFAOYSA-N 0.000 description 1
- 125000004805 propylene group Chemical group [H]C([H])([H])C([H])([*:1])C([H])([H])[*:2] 0.000 description 1
- 238000000734 protein sequencing Methods 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 101150108780 pta gene Proteins 0.000 description 1
- 150000003212 purines Chemical class 0.000 description 1
- 150000003222 pyridines Chemical class 0.000 description 1
- 229960000948 quinine Drugs 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000007670 refining Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 239000011435 rock Substances 0.000 description 1
- 235000003441 saturated fatty acids Nutrition 0.000 description 1
- 229930195734 saturated hydrocarbon Natural products 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000011218 seed culture Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000010865 sewage Substances 0.000 description 1
- 235000019812 sodium carboxymethyl cellulose Nutrition 0.000 description 1
- 229920001027 sodium carboxymethylcellulose Polymers 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 238000009987 spinning Methods 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 238000003153 stable transfection Methods 0.000 description 1
- 238000004230 steam cracking Methods 0.000 description 1
- 239000010902 straw Substances 0.000 description 1
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- KDYFGRWQOYBRFD-UHFFFAOYSA-L succinate(2-) Chemical compound [O-]C(=O)CCC([O-])=O KDYFGRWQOYBRFD-UHFFFAOYSA-L 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 150000003512 tertiary amines Chemical class 0.000 description 1
- 101150026728 tesB gene Proteins 0.000 description 1
- 229960003604 testosterone Drugs 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 239000002562 thickening agent Substances 0.000 description 1
- 150000003573 thiols Chemical class 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 150000003626 triacylglycerols Chemical class 0.000 description 1
- RULSWEULPANCDV-PIXUTMIVSA-N turanose Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](C(=O)CO)O[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O RULSWEULPANCDV-PIXUTMIVSA-N 0.000 description 1
- 238000009281 ultraviolet germicidal irradiation Methods 0.000 description 1
- 101150115617 umuC gene Proteins 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 239000002966 varnish Substances 0.000 description 1
- 238000010792 warming Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 235000019386 wax ester Nutrition 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 229920001221 xylan Polymers 0.000 description 1
- 150000004823 xylans Chemical class 0.000 description 1
- 239000008096 xylene Chemical class 0.000 description 1
- 101150090985 ydfG gene Proteins 0.000 description 1
- 101150101608 ydhF gene Proteins 0.000 description 1
- 101150065440 ydjA gene Proteins 0.000 description 1
- 101150078527 ydjG gene Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P5/00—Preparation of hydrocarbons or halogenated hydrocarbons
- C12P5/02—Preparation of hydrocarbons or halogenated hydrocarbons acyclic
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P5/00—Preparation of hydrocarbons or halogenated hydrocarbons
- C12P5/02—Preparation of hydrocarbons or halogenated hydrocarbons acyclic
- C12P5/026—Unsaturated compounds, i.e. alkenes, alkynes or allenes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
- C12P7/04—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/24—Preparation of oxygen-containing organic compounds containing a carbonyl group
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/62—Carboxylic acid esters
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6409—Fatty acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6436—Fatty acid esters
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6436—Fatty acid esters
- C12P7/6445—Glycerides
- C12P7/6463—Glycerides obtained from glyceride producing microorganisms, e.g. single cell oil
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6436—Fatty acid esters
- C12P7/649—Biodiesel, i.e. fatty acid alkyl esters
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12R—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
- C12R2001/00—Microorganisms ; Processes using microorganisms
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/10—Biofuels, e.g. bio-diesel
Definitions
- compositions, methods and systems effective to produce fatty acid derivatives [0001] Compositions, methods and systems effective to produce fatty acid derivatives.
- Petroleum is a limited, natural resource found in the Earth in liquid, gaseous, or solid forms. Petroleum is a valuable resource for producing various industrial materials. But petroleum products are developed at considerable costs, both financial and environmental. In addition to the economic cost, petroleum exploration carries a high environmental cost. In its natural form, crude petroleum extracted from the Earth has few commercial uses. It is a mixture of hydrocarbons (e.g. , paraffins (or alkanes), olefins (or alkenes), alkynes, napthenes (or cycloalkanes), aliphatic compounds, aromatic compounds, etc.) of varying length and
- Crude petroleum is also a primary source of raw materials for producing petrochemicals.
- the two main classes of raw materials derived from petroleum are short chain olefins (e.g., ethylene and propylene) and aromatics (e.g., benzene and xylene isomers). These raw materials are derived from longer chain hydrocarbons in crude petroleum by cracking it at considerable expense using a variety of methods, such as catalytic cracking, steam cracking, or catalytic reforming. These raw materials are used to make petrochemicals, which cannot be directly refined from crude petroleum, such as monomers, solvents, detergents, or adhesives.
- specialty chemicals such as plastics, resins, fibers, elastomers, pharmaceuticals, lubricants, or gels.
- specialty chemicals such as plastics, resins, fibers, elastomers, pharmaceuticals, lubricants, or gels.
- specialty chemicals that can be produced from petrochemical raw materials are fatty acids, hydrocarbons (e.g. , long chain, branched chain, saturated, unsaturated, etc.), fatty alcohols, esters, fatty aldehydes, ketones, lubricants, etc.
- Fatty alcohols have many commercial uses.
- the shorter chain fatty alcohols are used in the cosmetic and food industries as emulsifiers, emollients, and thickeners. Due to their amphiphilic nature, fatty alcohols behave as nonionic surfactants, which are useful in personal care and household products, for example, detergents.
- fatty alcohols are used in waxes, gums, resins, pharmaceutical salves and lotions, lubricating oil additives, textile antistatic and finishing agents, plasticizers, cosmetics, industrial solvents, and solvents for fats.
- Hydrocarbons have many commercial uses. For example, shorter chain alkanes are used as fuels. Longer chain alkanes (e.g., from five to sixteen carbons) are used as transportation fuels (e.g., gasoline, diesel, or aviation fuel). Alkanes having more than sixteen carbon atoms are important components of fuel oils and lubricating oils. Even longer alkanes, which are solid at room temperature, can be used, for example, as a paraffin wax. In addition, longer chain alkanes can be cracked to produce commercially valuable shorter chain hydrocarbons.
- shorter chain alkanes are used as fuels. Longer chain alkanes (e.g., from five to sixteen carbons) are used as transportation fuels (e.g., gasoline, diesel, or aviation fuel). Alkanes having more than sixteen carbon atoms are important components of fuel oils and lubricating oils. Even longer alkanes, which are solid at room temperature, can be used, for example, as a paraffin wax. In addition, longer chain alkane
- short chain alkenes are used in transportation fuels.
- Longer chain alkenes are used in plastics, lubricants, and synthetic lubricants.
- alkenes are used as a feedstock to produce alcohols, esters, plasticizers, surfactants, tertiary amines, enhanced oil recovery agents, fatty acids, thiols, alkenylsuccinic anhydrides, epoxides, chlorinated alkanes, chlorinated alkenes, waxes, fuel additives, and drag flow reducers.
- esters have many commercial uses.
- biodiesel an alternative fuel, is comprised of esters (e.g., fatty acid methyl esters, fatty acid ethyl esters, etc).
- esters are volatile with a pleasant odor, which makes them useful as fragrances or flavoring agents.
- esters are used as solvents for lacquers, paints, and varnishes.
- some naturally occurring substances such as waxes, fats, and oils are comprised of esters.
- Esters are also used as softening agents in resins and plasticizers, flame retardants, and additives in gasoline and oil.
- esters can be used in the manufacture of polymers, films, textiles, dyes, and pharmaceuticals.
- Aldehydes are used to produce many specialty chemicals. For example, aldehydes are used to produce polymers, resins (e.g., Bakelite), dyes, flavorings, plasticizers, perfumes, pharmaceuticals, and other chemicals. Some are used as solvents, preservatives, or disinfectants. Some natural and synthetic compounds, such as vitamins and hormones, are aldehydes.
- the invention provides recombinant microorganisms engineered to produce fatty acid derivatives and methods of use wherein the recombinant microorganisms comprise
- polynucleotide sequences encoding: (a) a fatty aldehyde biosynthetic polypeptide and (b) a fatty alcohol biosynthetic polypeptide, wherein the expression of the polypeptides is modified relative to the corresponding wild type polypeptides and the microorganism produces an increased titer of the fatty acid derivative relative to a wild type microorganism.
- microorganisms may further comprise a thioesterase (EC 3.1.1.5).
- Exemplary fatty aldehyde biosynthetic polypeptides (a) have at least 90% sequence identity to the amino acid sequence presented as SEQ ID NO: 41 , 43, 45, 47, 49, 51, 53, 55, 57, 59, 61 , 63, 65, 69, 71 , 73, 75, 77, 79, 81 , 83, 85, 87, 89, 91 , 93, 97, 99, 101 , 103, 105, 107, 109, 1 1 1 , 1 13, 1 15, 1 17, 119, 121 , 123, 125, or 127; (b) comprise an amino acid sequence motif with a sequence presented as (1) SEQ ID NO: 129, SEQ ID NO:130, SEQ ID N0:131 , and SEQ ID NO:132; (2) SEQ ID NO: 133; SEQ ID NO: 134; SEQ ID NO:135; SEQ ID NO: 136; or (3) SEQ ID NO: 129,
- Methods for producing a fatty alcohol comprising culturing such an engineered microorganism in the presence of a carbon source, under conditions wherein the fatty alcohol is produced at a titer of at least 300mg/L, are further provided.
- the engineered microorganism may be modified: (a) to express an attenuated level of an acyl-CoA synthase (EC 2.3.1.86) or (b) to further comprise an acyl-ACP reductase polypeptide, wherein (i) the acyl-ACP reductase polypeptide has amino acid sequence with at least 90% sequence identity to SEQ ID NO: 137, 139, 141, 143, 145, 147, 149, 151, or 153, (ii) the acyl-ACP reductase polypeptide has an amino acid motif presented as SEQ ID NO: 155, 156, 157, 158, 159, 160, 161,162, 163, 164, or 165, or (iii) the acyl-ACP reductase polypeptide is encoded by a polynucleotide having at least 90% sequence identity to SEQ ID NO: 138, 140, 142, 144
- expression of a fatty aldehyde reductase or alcohol dehydrogenase (EC 1.1.1.1 ) in the engineered microorganism may be increased or attenuated relative to the corresponding wild type polypeptide, or the gene encoding the fatty alcohol biosynthetic polypeptide may be knocked-out.
- the fatty alcohol biosynthetic polypeptide may (a) have at least 90% sequence identity to a polypeptide sequence selected from the group consisting of SEQ ID NO:l , 3, 5, 7, 9, 1 1 , 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 , 33, 35, 37, and 39, or (b) be encoded by a polynucleotide having at least 90% sequence identity to the nucleotide sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40.
- the fatty alcohol produced by the claimed method may (a) comprise a C -C]8 fatty alcohol (e.g., a C 6 , C 8 , Qo, C )2 , C ]3 , C ]4 , Ci 5 , C 16 , C n , or C ]8 fatty alcohol); (b) have the hydroxyl group is in the primary (Ci) position; (c) be a saturated or unsaturated fatty alcohol; (d) be unsaturated at the omega-7 position; or (e) comprise a cis double bond.
- a C -C]8 fatty alcohol e.g., a C 6 , C 8 , Qo, C )2 , C ]3 , C ]4 , Ci 5 , C 16 , C n , or C ]8 fatty alcohol
- the invention further provides recombinant microorganisms engineered to produce hydrocarbons and methods of use wherein the recombinant microorganism further comprises (a) a hydrocarbon biosynthetic polypeptide having the amino acid sequence of SEQ ID NO: 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, or 200 with one or more amino acid substitutions, additions, deletions, or insertions; (b) a polynucleotide sequence encoding a hydrocarbon biosynthetic polypeptide, having at least 90% sequence identity to the amino acid sequence of SEQ ID NO: 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, or 200, or (c) a hydrocarbon biosynthetic polypeptide having the amino acid sequence of
- Methods for producing a hydrocarbon comprising culturing such engineered
- microorganisms in the presence of a carbon source, under conditions wherein the hydrocarbon is produced are further provided.
- the hydrocarbon produced by the claimed methods may (a) be an alkane or an alkene, e.g., a C13-C21 alkane or alkene, (b) have a 5 !3 C of -15.4 or greater, or (c) have a fM 14 C of at least 1.003.
- the hydrocarbon produced by the claimed methods may be used in a biofuel, for example, a diesel, gasoline, or jet fuel.
- the invention further provides the use of microorganisms such as a yeast cell, a fungus cell, a filamentous fungi cell, or a bacterial cell in practicing the claimed methods.
- microorganisms such as a yeast cell, a fungus cell, a filamentous fungi cell, or a bacterial cell in practicing the claimed methods.
- FIG. 1 A is a graphic representation of pathways for fatty alcohol production.
- FIG. I B is a graphic representation of pathways for hydrocarbon production.
- FIG. 2 includes a table listing exemplary homologs of E.coli K-12 MG 1655 ethanol- active dehydrogenase/acetaldehyde-active reductase AdhP [GenBank Accession No.
- FIG. 3 includes a table listing exemplary homologs of E.coli K- 12 MG 1655 2,5-diketo- D-gluconate reductase A, DkgA [GenBank Accession No. NP_417485.4].
- FIG. 4 includes a table listing exemplary homologs of E.coli K- 12 MG 1655 2,5-diketo- D-gluconate reductase B, DkgB [GenBank Accession No. NP 414743.1 ].
- FIG. 5 includes a table listing exemplary homologs of E.coli K-12 MG 1655 E.coli K-12 MG 1655 aldo-keto reductase Tas [GenBank Accession No. NP_41731 1 .1 ].
- FIG. 6 includes a table listing exemplary homologs of E.coli K-12 MG 1655 Zn- dependent and NAD(P)-binding oxidoreductase RspB [GenBank Accession No. NP 416097.1 ].
- FIG. 7 includes a table listing exemplary homologs of E.coli K-12 MG 1655 Zn- dependent and NAD(P)-binding oxidoreductase YahK [GenBank Accession No. NP 414859.1 ].
- FIG. 8 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NAD(P)- binding oxidoreductase YbbO [GenBank Accession No. NP 415026.1].
- FIG. 9 includes a table listing exemplary homologs of E.coli K-12 MG 1655
- oxidoreductase YbdH [GenBank Accession No. NP_415132.1 ].
- FIG. 10 includes a table listing exemplary homologs of E. coli K-12 MG 1655 Zn- dependent and NAD(P)-binding oxidoreductase YbdR [GenBank Accession No.
- FIG. 1 1 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NAD(P)- binding oxidoreductase YgfF [GenBank Accession No. NP_417378.1].
- FIG. 12 includes a table listing exemplary homologs of E.coli K-12 MG 1655 Zn- dependent and NAD(P)-binding oxidoreductase YhdH [Genbank Accession No. NP_417719.1 ].
- FIG. 13 includes a table listing exemplary homologs of E.coli K-12 MG 1655 Zn- dependent and NAD(P)-binding alcohol dehydrogenase YjgB [GenBank Accession No.
- FIG. 14 includes a table listing exemplary homologs of E.coli K-12 MG 1655 3- dehroquinate synthase AroB [GenBank Accession No. NP_417848.1].
- FIG. 15 includes a table listing exemplary homologs of E.coli K-12 MG 1655 Zn- dependent and NAD(P)-binding oxidoreductase YcjQ [GenBank Accession No. NP_415829.1 ].
- FIG. 16 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NAD(P)- binding oxidoreductase YdbC [GenBank Accession No. NP_415924.1].
- FIG. 17 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NADH-dependent alpha-keto reductase YdjG [GenBank Accession No. NP__416285.1].
- FIG. 18 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NADPH-dependent aldo-keto reductase YeaE [GenBank Accession No. NP_416295.1 ].
- FIG. 19 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NADP-dependent, Zn-dependent oxidoreductase YncB [GenBank Accession No. NP_415966.6].
- FIG. 20 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NAD(P)-dependent alcohol dehydrogenase YqhD [GenBank Accession No. NP 417484.1].
- FIG. 21 includes a table listing exemplary homologs of E.coli K-12 MG 1655 Zn- dependent and NAD(P)-binding oxidoreductase YdjL [GenBank Accession No. NP_416290.1 ].
- FIG. 22 includes tables listing E.coli dehydratase/isomerase enzymes and
- FIG. 23 includes table listing E.coli keto-ACP synthase enzymes and keto-ACP synthase enzymes from other organisms.
- FIG. 24A is a graphic representation of fatty alcohols produced by a recombinant E.coli strain transformed with pETDuet-l -'tesA-alrAadpl and pACYCDuet-l -CarB.
- FIG. 24B is a GC/MS trace of fatty alcohol produced by a recombinant E.coli strain transformed with pETDuet-l-'tesA-alrAadpl and pACYCDuet-l -CarB as compared to the control strain, which did not express an alrAadpl .
- FIG. 25 is a graphic representation of fatty alcohols produced by a recombinant E.coli strain transformed with pETDuet-l -'tesA-yjgB and pACYCDuet-l -CarB.
- FIG. 26A is a GC/MS trace of fatty alcohol production in MG1655 (DE3, /3 ⁇ 4 Z ) )/pETDuet-l -tesA and pACYCDuet-l -CarB cells.
- FIG. 26A is a GC/MS trace of fatty alcohol production in MG1655 (DE3, /3 ⁇ 4 Z ) )/pETDuet-l -tesA and pACYCDuet-l -CarB cells.
- 26B is a GC/MS trace of fatty alcohol production in MG1655 (DE3, AfadD, j//g5: :kan)/pETDuet-l -tesA and pACYCDuet-1 - CarB cells.
- FIG. 26C is a GC/MS trace of fatty alcohol production in MG1655 (DE3, AfadD, jyg5: :kan)/pETDuet-l -'tesA-yjgB and pACYCDuet-l -CarB cells.
- the arrows in FIGs. 26A, 26B, and 26C indicate the absence of C12:0 fatty aldehydes.
- FIG. 27 is a graphic representation of fatty alcohol production in various deletion mutants of E. co li.
- FIG. 28 is a graphic representation of fatty alcohol production in various deletion mutants of E.coli.
- FIGs. 29A-29X are graphs depicting of the amount of fatty aldehydes converted to fatty alcohol using the enzymatic assays as described in Example 5.
- the title of each graph indicates the co-factor and substrate that were used in the assay.
- "CI 2" indicates a dodecanal substrate.
- "CI 6: 1” indicates a 1 1-cis-hexadecenal substrate.
- the tables accompanying the graphs indicate the percentages of fatty aldehydes that were converted into fatty alcohols at the marked concentrations, as measured by GC-FID.
- the tables also indicate the p-values for the samples' capacity to catalyze the conversion of fatty aldehydes to fatty alcohols.
- the invention is based, at least in part, on the discovery that altering the level of expression of one or more of a fatty alcohol biosynthetic polypeptide, a fatty aldehyde biosynthetic polypeptide, an acyl-ACP reductase polypeptide (EC 6.4.1.2) and a hydrocarbon biosynthetic polypeptide, e.g., a decarbonylase, in the microorganism host cell facilitates enhanced production of fatty acids and fatty acid derivatives by the microorganism.
- a fatty alcohol biosynthetic polypeptide e.g., a fatty aldehyde biosynthetic polypeptide
- an acyl-ACP reductase polypeptide EC 6.4.1.2
- a hydrocarbon biosynthetic polypeptide e.g., a decarbonylase
- Polypeptide names include all polypeptides that have the same activity (e.g., that catalyze the same fundamental chemical reaction).
- accession numbers referenced herein are derived from the NCBI database (National Center for Biotechnology Information) maintained by the National Institute of Health, U.S.A. Unless otherwise indicated, the accession numbers are as provided in the database as of October 2008.
- NC-IUBMB Nomenclature Committee of the International Union of Biochemistry and Molecular Biology
- Genomics sponsored in part by the University of Tokyo. Unless otherwise indicated, the EC numbers are as provided in the database as of October 2008.
- acyl CoA refers to an acyl thioester formed between the carbonyl carbon of alkyl chain and the sulfydryl group of the 4'-phosphopantethionyl moiety of coenzyme A (CoA), which has the formula R-C(0)S-CoA, where R is any alkyl group having at least 4 carbon atoms.
- CoA coenzyme A
- an acyl CoA will be an intermediate in the synthesis of fully saturated acyl CoAs, including, but not limited to 3-keto-acyl CoA, a 3 -hydroxy acyl CoA, a delta-2-trans-enoyl-CoA, or an alkyl acyl CoA.
- the carbon chain will have about 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, or 26 carbons.
- the acyl CoA will be branched.
- the branched acyl CoA is an isoacyl CoA, in another it is an anti-isoacyl CoA.
- Each of these "acyl CoAs” are substrates for enzymes that convert them to fatty acid derivatives such as those described herein.
- the term "alcohol dehydrogenase” (EC 1.1.1.*) is a peptide capable of catalyzing the conversion of a fatty aldehyde to an alcohol (e.g., fatty alcohol).
- alcohol dehydrogenases will catalyze other reactions as well.
- some alcohol dehydrogenases will accept other substrates in addition to fatty aldehydes.
- Such non-specific alcohol dehydrogenases are, therefore, also included in this definition.
- Nucleic acid sequences encoding alcohol dehydrogenases are known in the art, and such alcohol dehydrogenases are publicly available.
- the aldehyde is any aldehyde made from a fatty acid or fatty acid derivative.
- the R group is at least about 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20 carbons in length.
- polynucleotide is a nucleic acid that encodes an aldehyde biosynthetic polypeptide.
- an "aldehyde biosynthetic polypeptide is a polypeptide that is a part of the biosynthetic pathway of an aldehyde. Such polypeptide can act on a biological substrate to yield an aldehyde. In some instances, the aldehyde biosynthetic polypeptide has reductase activity.
- alkane means saturated hydrocarbons or compounds that consist only of carbon (C) and hydrogen (H), wherein these atoms are linked together by single bonds (i.e., they are saturated compounds).
- altered level of expression and “modified level of expression” are used interchangeably and mean that a polynucleotide, polypeptide, or hydrocarbon is present in a different concentration in an engineered microorganism as compared to its concentration in a corresponding wild-type cell under the same conditions.
- the term "attenuate” means to weaken, reduce or diminish.
- a polypeptide can be attenuated by modifying the polypeptide to reduce its activity (e.g., by modifying a nucleotide sequence that encodes the polypeptide).
- the polypeptide, polynucleotide, or hydrocarbon having an altered level of expression is "attenuated” or has a “decreased level of expression.”
- attenuate and “decreasing the level of expression” mean to express or cause to be expressed a polynucleotide, polypeptide, or hydrocarbon in a cell at a lesser concentration than is normally expressed in a corresponding wild-type cell under the same conditions.
- the degree of overexpression or attenuation can be 1.5-fold or more, e.g., 2-fold or more, 3-fold or more, 5- fold or more, 10-fold or more, or 15-fold or more.
- the degree of overexpression or attenuation can be 500-fold or less, e.g., 100-fold or less, 50-fold or less, 25- fold or less, or 20-fold or less.
- the degree of overexpression or attenuation can be bounded by any two of the above endpoints.
- the degree of overexpression or attenuation can be 1.5-500-fold, 2-50-fold, 10-25-fold, or 15-20-fold.
- biodiesel means a biofuel that can be a substitute of diesel, which is derived from petroleum.
- Biodiesel can be used in internal combustion diesel engines in either a pure form, which is referred to as “neat” biodiesel, or as a mixture in any concentration with petroleum-based diesel.
- Biodiesel can include esters or hydrocarbons, such as alcohols.
- biofuel refers to any fuel derived from biomass.
- Biofuels can be substituted for petroleum based fuels.
- biofuels are inclusive of transportation fuels (e.g., gasoline, diesel, jet fuel, etc.), heating fuels, and electricity-generating fuels.
- transportation fuels e.g., gasoline, diesel, jet fuel, etc.
- heating fuels e.g., heating fuels
- electricity-generating fuels e.g., electricity-generating fuels.
- Biofuels are a renewable energy source.
- biomass refers to any biological material from which a carbon source is derived.
- a biomass is processed into a carbon source, which is suitable for bioconversion.
- the biomass does not require further processing into a carbon source.
- the carbon source can be converted into a biofuel.
- An exemplary source of biomass is plant matter or vegetation, such as corn, sugar cane, or switchgrass.
- Another exemplary source of biomass is metabolic waste products, such as animal matter (e.g., cow manure).
- Further exemplary sources of biomass include algae and other marine plants.
- Biomass also includes waste products from industry, agriculture, forestry, and households, including, but not limited to, fermentation waste, ensilage, straw, lumber, sewage, garbage, cellulosic urban waste, and food leftovers.
- biomass also can refer to sources of carbon, such as carbohydrates (e.g., monosaccharides, disaccharides, or
- Branched chains may have more than one point of branching and may include cyclic branches.
- the branched fatty acid, branched fatty aldehyde, or branched fatty alcohol comprises a C 6 , C 7 , C 8 , C 9 , Cjo, Cn, C ]2 , C 13 , C ]4 , C ]5 , C 16 , C 17 , C ] 8 , C ]9 , C 20 , C 21 , C 22 , C 23 , C 24 , C 25 , or a C 26 branched fatty acid, branched fatty aldehyde, or branched fatty alcohol.
- the branched fatty acid, branched fatty aldehyde, or branched fatty alcohol is a C 6 , C 8 , C 10 , Ci 2 , C 13 , Ci 4 , C 15 , C 16 , C 17 , or C 18 branched fatty acid, branched fatty aldehyde, or branched fatty alcohol.
- the hydroxyl group of the branched fatty acid, branched fatty aldehyde, or branched fatty alcohol is in the primary (Ci) position.
- the branched fatty acid, branched fatty aldehyde, or branched fatty alcohol is an iso-fatty acid, iso-fatty aldehyde, or iso-fatty alcohol, or an antesio- fatty acid, an anteiso-fatty aldehyde, or anteiso-fatty alcohol.
- the branched fatty acid, branched fatty aldehyde, or branched fatty alcohol is selected from iso-C 7: o, iso-C 8 ;o, iso-C 9: o, iso-Cio:o, iso-Cii;o, iso-C] 2:0 , iso-Ci 3:0 , iso-C] 4:0 , iso-Ci 5:0 , iso-Ci 6:0 , iso-Ci 7:0 , iso-Ci 8: o, iso-Ci9;o, anteiso-C 7: o, anteiso-C 8:0 , anteiso-C9 : o, anteiso-Cio : o, anteiso-C 1 1 : o,anteiso- Ci 2:0 , anteiso-Ci 3: o, anteiso-Ci 4: o, ante
- carbon source refers to a substrate or compound suitable to be used as a source of carbon for prokaryotic or simple eukaryotic cell growth.
- Carbon sources can be in various forms, including, but not limited to polymers, carbohydrates, acids, alcohols, aldehydes, ketones, amino acids, peptides, and gases (e.g., CO and C0 ).
- Exemplary carbon sources include, but are not limited to, monosaccharides, such as glucose, fructose, mannose, galactose, xylose, and arabinose; oligosaccharides, such as fructo-oligosaccharide and galacto- oligosaccharide; polysaccharides such as starch, cellulose, pectin, and xylan; disaccharides, such as sucrose, maltose, and turanose; cellulosic material and variants such as methyl cellulose and sodium carboxymethyl cellulose; saturated or unsaturated fatty acid esters, succinate, lactate, and acetate; alcohols, such as ethanol, methanol, and glycerol, or mixtures thereof.
- the carbon source can also be a product of photosynthesis, such as glucose.
- monosaccharides such as glucose, fructose, mannose, galactose, xylose, and arabinose
- oligosaccharides such
- the carbon source is biomass. In other preferred embodiments, the carbon source is glucose.
- a “cloud point lowering additive” is an additive added to a composition to decrease or lower the cloud point of a solution.
- cloud point of a fluid means the temperature at which dissolved solids are no longer completely soluble. Below this temperature, solids begin precipitating as a second phase giving the fluid a cloudy appearance.
- cloud point refers to the temperature below which a solidified material or other heavy
- the presence of solidified materials influences the flowing behavior of the fluid, the tendency of the fluid to clog fuel filters, injectors, etc., the accumulation of solidified materials on cold surfaces (e.g., a pipeline or heat exchanger fouling), and the emulsion characteristics of the fluid with water.
- a nucleotide sequence is "complementary" to another nucleotide sequence if each of the bases of the two sequences matches (i.e., is capable of forming Watson Crick base pairs).
- the term "complementary strand” is used herein interchangeably with the term “complement”.
- the complement of a nucleic acid strand can be the complement of a coding strand or the
- condition sufficient to allow expression means any conditions that allow a microorganism host cell to produce a desired product, such as a polypeptide or fatty aldehyde described herein. Suitable conditions include, for example, fermentation conditions. Fermentation conditions can comprise many parameters, such as temperature ranges, levels of aeration, and media composition. Each of these conditions, individually and in combination, allows the host cell to grow. Exemplary culture media include broths or gels. Generally, the medium includes a carbon source, such as glucose, fructose, cellulose, or the like, that can be metabolized by a host cell directly.
- a carbon source such as glucose, fructose, cellulose, or the like
- a host cell can be cultured, for example, for about 4, 8, 12, 24, 36, or 48 hours. During and/or after culturing, samples can be obtained and analyzed to determine if the conditions allow expression. For example, the host cells in the sample or the medium in which the host cells were grown can be tested for the presence of a desired product. When testing for the presence of a product, assays, such as, but not limited to, TLC, HPLC, GC/FID, GC/MS, LC/MS, MS, can be used.
- control element means a transcriptional control element.
- Control elements include promoters and enhancers.
- the term “promoter element,” “promoter,” or “promoter sequence” refers to a DNA sequence that functions as a switch that activates the expression of a gene. If the gene is activated, it is said to be transcribed or participating in transcription. Transcription involves the synthesis of mRNA from the gene. A promoter, therefore, serves as a transcriptional regulatory element and also provides a site for initiation of transcription of the gene into mRNA. Control elements interact specifically with cellular proteins involved in transcription (Maniatis et al, Science 236: 1237, 1987).
- fatty acid means a carboxylic acid having the formula
- RCOOH represents an aliphatic group, preferably an alkyl group.
- R can comprise between about 4 and about 22 carbon atoms.
- Fatty acids can be saturated, monounsaturated, or polyunsaturated.
- the fatty acid is made from a fatty acid biosynthetic pathway.
- fatty acid biosynthetic pathway means a biosynthetic pathway that produces fatty acids.
- the fatty acid biosynthetic pathway includes fatty acid synthases that can be engineered, as described herein, to produce fatty acids, and in some embodiments can be expressed with additional enzymes to produce fatty acids having desired carbon chain
- fatty acid degradation enzyme means an enzyme involved in the breakdown or conversion of a fatty acid or fatty acid derivative into another product.
- a nonlimiting example of a fatty acid degradation enzyme is an acyl-CoA synthase (EC 2.3.1.86). Additional examples of fatty acid degradation enzymes are described herein.
- fatty acid derivative means products made in part from the fatty acid biosynthetic pathway of the production host organism.
- “Fatty acid derivative” also includes products made in part from acyl-ACP or acyl-ACP derivatives.
- the fatty acid biosynthetic pathway includes fatty acid synthase enzymes which can be engineered as described herein to produce fatty acid derivatives, and in some examples can be expressed with additional enzymes to produce fatty acid derivatives having desired carbon chain characteristics.
- Exemplary fatty acid derivatives include for example, fatty acids, acyl-CoA, fatty aldehyde, short and long chain alcohols, hydrocarbons, fatty alcohols, and esters (e.g., waxes, fatty acid esters, or fatty esters).
- esters e.g., waxes, fatty acid esters, or fatty esters.
- fatty acid derivative enzyme means any enzyme that may be expressed or overexpressed in the production of fatty acid derivatives. These enzymes may be part of the fatty acid biosynthetic pathway.
- Non-limiting examples of fatty acid derivative enzymes include fatty acid synthases, thioesterases (EC 3.1.
- acyl-CoA synthases (EC 2.3.1.86), acyl-CoA reductases, alcohol dehydrogenases, alcohol acyltransferases, fatty alcohol-forming acyl-CoA reductases, fatty acid (carboxylic acid) reductases, acyl-ACP reductases (EC 6.4.1.2), fatty acid hydroxylases, acyl- CoA desaturases, acyl-ACP desaturases, acyl-CoA oxidases, acyl-CoA dehydrogenases, ester synthases, and alkane biosynthetic polypeptides, etc.
- Fatty acid derivative enzymes can convert a substrate into a fatty acid derivative.
- the substrate may be a fatty acid derivative that the fatty acid derivative enzyme converts into a different fatty acid derivative.
- Exemplary suitable substrates include, C6-C 2 6 fatty aldehydes.
- fatty acid enzyme means any enzyme involved in fatty acid
- Fatty acid enzymes can be modified in host cells to produce fatty acids.
- Non- limiting examples of fatty acid enzymes include fatty acid synthases and thioesterases (EC 3.1. 2.14 or EC 3.1.1.5). Additional examples of fatty acid enzymes are described herein.
- fatty acid or derivative thereof means a "fatty acid” or a "fatty acid derivative.”
- fatty acid means a carboxylic acid having the formula RCOOH.
- R represents an aliphatic group, preferably an alkyl group.
- R can comprise between about 4 and about 22 carbon atoms.
- Fatty acids can be saturated, monounsaturated, or polyunsaturated.
- the fatty acid is made from a fatty acid biosynthetic pathway.
- fatty alcohol means an alcohol having the formula ROH.
- the fatty alcohol is any alcohol made from a fatty acid or fatty acid derivative.
- the R group of a fatty acid, fatty aldehyde, or fatty alcohol is at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 1 1 , at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, or at least 19, carbons in length.
- the R group is 20 or less, 19 or less, 18 or less, 17 or less, 16 or less, 15 or less, 14 or less, 13 or less, 12 or less, 1 1 or less, 10 or less, 9 or less, 8 or less, 7 or less, or 6 or less carbons in length.
- the R group can have an R group bounded by any two of the above endpoints.
- the R group can be 6-16 carbons in length, 10-14 carbons in length, or 12-18 carbons in length.
- the fatty acid, fatty aldehyde, or fatty alcohol is a C6, C7, C8, C9, CI O, Cl l , C12, C13, C14, C15, C16, C17, C18, C19, C20, C21 , C22, C23, C24, C25, or a C26 fatty acid, fatty aldehyde, or fatty alcohol.
- the fatty acid, fatty aldehyde, or fatty alcohol is a C6, C8, CI O, C12, C13, C14, C15, C16, C17, or C18 fatty acid, fatty aldehyde, or fatty alcohol.
- the R group of a fatty acid, fatty aldehyde, or fatty alcohol can be a straight chain or a branched chain.
- the fatty aldehyde is any aldehyde made from a fatty acid or fatty acid derivative.
- the R group is at least about 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20 carbons in length.
- R can be straight or branched chain.
- the branched chains may have one or more points of branching.
- the branched chains may include cyclic branches.
- R can be saturated or unsaturated. If unsaturated, the R can have one or more points of unsaturation.
- the fatty aldehyde is produced biosynthetically.
- Fatty aldehydes have many uses.
- fatty aldehydes can be used to produce many specialty chemicals.
- fatty aldehydes are used to produce polymers, resins, dyes, flavorings, plasticizers, perfumes, pharmaceuticals, and other chemicals. Some are used as solvents, preservatives, or disinfectants.
- Some natural and synthetic compounds, such as vitamins and hormones, are aldehydes.
- a fatty ester may be used in reference to an ester.
- a fatty ester is any ester made from a fatty acid, for example a fatty acid ester.
- a fatty ester contains an A side and a B side.
- an "A side” of an ester refers to the carbon chain attached to the carboxylate oxygen of the ester.
- a "B side” of an ester refers to the carbon chain comprising the parent carboxylate of the ester.
- the A side is contributed by an alcohol
- the B side is contributed by a fatty acid.
- any alcohol can be used to form the A side of the fatty esters.
- the alcohol can be derived from the fatty acid biosynthetic pathway.
- the alcohol can be produced through non- fatty acid biosynthetic pathways.
- the alcohol can be provided exogenously.
- the alcohol can be supplied in the fermentation broth in instances where the fatty ester is produced by an organism.
- a carboxylic acid such as a fatty acid or acetic acid, can be supplied exogenously in instances where the fatty ester is produced by an organism that can also produce alcohol.
- the carbon chains comprising the A side or B side can be of any length.
- the A side of the ester is at least about 1 , 2, 3, 4, 5, 6, 7, 8, 10, 12, 14, 16, or 18 carbons in length.
- the A side of the ester is 1 carbon in length.
- the A side of the ester is 2 carbons in length.
- the B side of the ester can be at least about 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, or 26 carbons in length.
- the A side and/or the B side can be straight or branched chain.
- the branched chains can have one or more points of branching.
- the branched chains can include cyclic branches.
- the A side and/or B side can be saturated or unsaturated. If unsaturated, the A side and/or B side can have one or more points of
- the fatty acid ester is a fatty acid methyl ester (FAME) or a fatty acid ethyl ester (FAEE).
- FAME is a beta-hydroxy (B-OH) FAME.
- the fatty ester is a wax.
- the wax can be derived from a long chain alcohol and a long chain fatty acid.
- the fatty ester is a fatty acid thioester, for example fatty acyl Coenzyme A (Co A).
- the fatty ester is a fatty acyl pantothenate, an acyl carrier protein (ACP), or a fatty phosphate ester.
- Gene knockout refers to a procedure by which a gene encoding a target protein is modified or inactivated so to reduce or eliminate the function of the intact protein. Inactivation of the gene may be performed by general methods such as mutagenesis by UV irradiation or treatment with N-methyl-N'-nitro-N-nitrosoguanidine, site-directed mutagenesis, homologous recombination, insertion-deletion mutagenesis, or "Red-driven integration"
- a construct is introduced into a host cell, such that it is possible to select for homologous recombination events in the host cell.
- a knock-out construct including both positive and negative selection genes for efficiently selecting transfected cells that undergo a homologous recombination event with the construct.
- the alteration in the host cell may be obtained, for example, by replacing through a single or double crossover recombination a wild type DNA sequence by a DNA sequence containing the alteration.
- the alteration may, for example, be a DNA sequence encoding an antibiotic resistance marker or a gene complementing a possible auxotrophy of the host cell.
- Mutations include, but are not limited to, deletion-insertion mutations.
- An example of such an alteration includes a gene disruption, i.e., a perturbation of a gene such that the product that is normally produced from this gene is not produced in a functional form. This could be due to a complete deletion, a deletion and insertion of a selective marker, an insertion of a selective marker, a frameshift mutation, an in-frame deletion, or a point mutation that leads to premature termination.
- the entire mRNA for the gene is absent. In other situations, the amount of mRNA produced varies.
- a "host cell” is a cell used to produce a product described herein (e.g., a fatty alcohol described herein).
- a host cell can be modified to express or overexpress selected genes or to have attenuated expression of selected genes.
- Non-limiting examples of host cells include plant, animal, human, bacteria, yeast, or filamentous fungi cells.
- a polypeptide described herein has “increased level of activity.”
- “increased level of activity” is meant that a polypeptide has a higher level of biochemical or biological function (e.g., DNA binding or enzymatic activity) in an engineered host cell as compared to its level of biochemical and/or biological function in a corresponding wild-type host cell under the same conditions.
- the degree of enhanced activity can be about 10% or more, about 20% or more, about 50% or more, about 75% or more, about 100% or more, about 200% or more, about 500% or more, about 1000% or more, or any range therein.
- isolated refers to molecules separated from other DNAs or RNAs, respectively that are present in the natural source of the nucleic acid.
- isolated nucleic acid refers to include nucleic acid fragments, which are not naturally occurring as fragments and would not be found in the natural state.
- isolated is also used herein to refer to polypeptides, which are isolated from other cellular proteins and is meant to encompass both purified and recombinant polypeptides.
- isolated as used herein also refers to a nucleic acid or peptide that is substantially free of cellular material, viral material, or culture medium when produced by recombinant DNA techniques.
- isolated as used herein also refers to a nucleic acid or peptide that is substantially free of chemical precursors or other chemicals when chemically synthesized.
- isolated as used herein with respect to products, such as fatty alcohols, refers to products that are isolated from cellular components, cell culture media, or chemical or synthetic precursors.
- the "level of expression of a gene” refers to the level of mRNA, pre- mRNA nascent transcript(s), transcript processing intermediates, mature mRNA(s), and degradation products encoded by the gene.
- microorganism means prokaryotic and eukaryotic microbial species from the domains Archaea, Bacteria and Eucarya, the latter including yeast and filamentous fungi, protozoa, algae, or higher Protista.
- microbial cells ⁇ i.e. , cells from microbes
- microbes are used interchangeably and refer to cells or small organisms that can only be seen with the aid of a microscope.
- nucleic acid refers to polynucleotides, such as
- DNA deoxyribonucleic acid
- RNA ribonucleic acid
- the term should also be understood to include, as equivalents, analogs of RNAs or DNAs made from nucleotide analogs, and, as applicable to the embodiment being described, single (sense or antisense) and double-stranded polynucleotides, ESTs, chromosomes, cDNAs, mRNAs, and rRNAs.
- nucleotide refers to a monomeric unit of a polynucleotide that consists of a heterocyclic base, a sugar, and one or more phosphate groups.
- the naturally occurring bases are typically derivatives of purine or pyrimidine, though it should be understood that naturally and non-naturally occurring base analogs are also included.
- the naturally occurring sugar is the pentose (five-carbon sugar) deoxyribose (which forms DNA) or ribose (which forms RNA), though it should be understood that naturally and non-naturally occurring sugar analogs are also included.
- Nucleic acids are typically linked via phosphate bonds to form nucleic acids or polynucleotides, though many other linkages are known in the art (e.g., phosphorothioates, boranophosphates, and the like).
- Polynucleotides described herein may comprise degenerate nucleotides which are defined according to the IUPAC code for nucleotide degeneracy wherein B is C, G, or T; D is A, G, or T; H is A, C, or T; K is G or T; M is A or C; N is A, C, G, or T; R is A or G; S is C or G; V is A, C, or G; W is A or T; and Y is C or T.
- hydrocarbons containing at least one carbon-to-carbon double bond i.e., they are unsaturated compounds.
- operably linked means that selected nucleotide sequence (e.g. , encoding a polypeptide described herein) is in proximity with a promoter to allow the promoter to regulate expression of the selected DNA.
- the promoter is located upstream of the selected nucleotide sequence in terms of the direction of transcription and translation.
- operably linked is meant that a nucleotide sequence and a regulatory sequence(s) are connected in such a way as to permit gene expression when the appropriate molecules (e.g., transcriptional activator proteins) are bound to the regulatory sequence(s).
- the polypeptide, polynucleotide, or hydrocarbon having an altered or modified level of expression is "overexpressed” or has an "increased level of expression.”
- overexpress and “increasing the level of expression” mean to express or cause to be expressed a polynucleotide, polypeptide, or hydrocarbon in a cell at a greater concentration than is normally expressed in a corresponding wild-type cell under the same conditions.
- a polypeptide can be "overexpressed” in an engineered host cell when the polypeptide is present in a greater concentration in the engineered host cell as compared to its concentration in a non-engineered host cell of the same species under the same conditions.
- partition coefficient is defined as the equilibrium concentration of a compound in an organic phase divided by the concentration at equilibrium in an aqueous phase (e.g., fermentation broth).
- aqueous phase e.g., fermentation broth
- the organic phase is formed by the fatty aldehyde during the production process.
- an organic phase can be provided, such as by providing a layer of octane, to facilitate product separation.
- the partition characteristics of a compound can be described as logP. For example, a compound with a logP of 1 would partition 10: 1 to the organic phase. A compound with a logP of -1 would partition 1 : 10 to the organic phase.
- Polynucleotide refers to a polymer of DNA or RNA, which can be single-stranded or double-stranded and which can contain non-natural or altered nucleotides.
- polynucleotide refers to a polymeric form of nucleotides of any length, either ribonucleotides (RNA) or deoxyribonucleotides (DNA). These terms refer to the primary structure of the molecule, and thus include double- and single-stranded DNA, and double- and single-stranded RNA.
- the terms include, as equivalents, analogs of either RNA or DNA made from nucleotide analogs and modified polynucleotides such as, though not limited to methylated and/or capped
- the polynucleotide can be in any form, including but not limited to plasmid, viral, chromosomal, EST, cDNA, mRNA, and rRNA.
- polypeptide and protein refer to a polymer of amino acid residues.
- recombinant polypeptide refers to a polypeptide that is produced by recombinant DNA techniques, wherein generally DNA encoding the expressed protein or RNA is inserted into a suitable expression vector that is in turn used to transform a host cell to produce the polypeptide or RNA.
- purify means the removal or isolation of a molecule from its environment by, for example, isolation or separation.
- Substantially purified molecules are at least about 60% free, preferably at least about 75% free, and more preferably at least about 90% free from other components with which they are associated. As used herein, these terms also refer to the removal of contaminants from a sample. For example, the removal of contaminants can result in an increase in the percentage of fatty alcohol in a sample. For example, when fatty alcohols are produced in a host cell, the fatty alcohols can be purified by the removal of host cell proteins. After purification, the percentage of fatty alcohols in the sample is increased. The terms “purify,” “purified,” and “purification” do not require absolute purity. They are relative terms. Thus, for example, when fatty alcohols are produced in host cells, a purified fatty alcohol is one that is substantially separated from other cellular components (e.g., nucleic acids, polypeptides, lipids, carbohydrates, or other cellular components (e.g., nucleic acids, polypeptides, lipids, carbohydrates, or other cellular components.
- a purified fatty alcohol preparation is one in which the fatty alcohol is substantially free from contaminants, such as those that might be present following fermentation.
- a fatty alcohol is purified when at least about 50% by weight of a sample is composed of the fatty alcohol.
- a fatty alcohol is purified when at least about 60%, 70%, 80%, 85%, 90%, 92%, 95%, 98%, or 99% or more by weight of a sample is composed of the fatty alcohol.
- recombinant polypeptide refers to a polypeptide that is produced by recombinant DNA techniques, wherein generally DNA encoding the expressed protein or RNA is inserted into a suitable expression vector and that is in turn used to transform a host cell to produce the polypeptide or RNA.
- the R group of a branched or unbranched fatty acid, branched or unbranched fatty aldehyde, or branched or unbranched fatty alcohol can be "saturated” or "unsaturated”. If unsaturated, the R group can have one or more than one point of unsaturation.
- the unsaturated fatty acid, unsaturated fatty aldehyde, or unsaturated fatty alcohol is a monounsaturated fatty acid, monounsaturated fatty aldehyde, or monounsaturated fatty alcohol.
- the unsaturated fatty acid, unsaturated fatty aldehyde, or unsaturated fatty alcohol is a C6:l, C7: l, C8: l , C9: l, C10:l, CI 1 : 1 , C12:l , C13:l , C14:l , C15:l, C16: l, C17: l, C18:l, C19:l , C20:l, C21 :l, C22:l, C23:l, C24: l , C25: l , or a C26:l unsaturated fatty acid, unsaturated fatty aldehyde, or unsaturated fatty alcohol.
- the unsaturated fatty acid, unsaturated fatty aldehyde, or unsaturated fatty alcohol is C10:l , C12:l, C14: l, C16: l , or C18: l .
- the unsaturated fatty acid, unsaturated fatty aldehyde, or unsaturated fatty alcohol is unsaturated at the omega-7 position.
- the unsaturated fatty acid, unsaturated fatty aldehyde, or unsaturated fatty alcohol comprises a cis double bond.
- the term “substantially identical” is used to refer to a first amino acid or nucleotide sequence that contains a sufficient number of identical or equivalent (e.g., with a similar side chain, e.g., conserved amino acid substitutions) amino acid residues or nucleotides to a second amino acid or nucleotide sequence such that the first and second amino acid or nucleotide sequences have similar activities.
- synthase means an enzyme which catalyzes a synthesis process.
- synthase includes synthases, synthetases, and ligases.
- terminal olefin a-olefin
- terminal alkene terminal alkene
- 1-alkene are used interchangeably herein with reference to a-olefins or alkenes with a chemical formula C X H2 X , distinguished from other olefins with a similar molecular formula by linearity of the hydrocarbon chain and the position of the double bond at the primary or alpha position.
- transfection means the introduction of a nucleic acid (e.g., via an expression vector) into a recipient cell by nucleic acid-mediated gene transfer.
- transformation refers to a process in which a cell's genotype is changed as a result of the cellular uptake of exogenous DNA or RNA. This may result in the transformed cell expressing a recombinant form of an RNA or polypeptide. In the case of antisense expression from the transferred gene, the expression of a naturally-occurring form of the polypeptide is disrupted.
- a "transport protein” is a polypeptide that facilitates the movement of one or more compounds in and/or out of a cellular organelle and/or a cell.
- vector refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked.
- a useful vector is an episome ⁇ i.e., a nucleic acid capable of extra-chromosomal replication).
- Useful vectors are those capable of autonomous replication and/or expression of nucleic acids to which they are linked.
- Vectors capable of directing the expression of genes to which they are operatively linked are referred to herein as "expression vectors”. In general, expression vectors of utility in
- plasmids refer generally to circular double stranded DNA loops that, in their vector form, are not bound to the chromosome.
- vector refers generally to circular double stranded DNA loops that, in their vector form, are not bound to the chromosome.
- plasmid and vector are used interchangeably, as the plasmid is the most commonly used form of vector.
- vectors also included are such other forms of expression vectors that serve equivalent functions and that become known in the art subsequently hereto.
- the invention is based, at least in part, on the identification of a number of fatty alcohol biosynthetic enzymes or polypeptides that are capable of catalyzing the conversion of a fatty aldehyde to a fatty alcohol under suitable conditions, for example, in the presence of suitable substrates and/or co-factors.
- the fatty alcohols can be produced by one or more or all of the fatty alcohol biosynthesis pathways in E. coli that utilize, in part, genes that encode fatty aldehyde biosynthetic polypeptides, acyl-ACP reductases (EC 6.4.1.2), or the fatty alcohol biosynthetic enzymes of the present invention.
- the fatty alcohols are produced by a biosynthetic pathway depicted in FIG. 1 A
- a fatty acid is first activated by ATP and then reduced to generate a fatty aldehyde.
- the fatty aldehyde can then be further reduced into a fatty alcohol by a fatty alcohol biosynthetic polypeptide of the present invention, such as, for example, a fatty aldehyde reductase, an alcohol dehydrogenase, an oxidoreductase, an aldo-keto reductase, or a short-chain dehydrogenase.
- the fatty alcohols are produced by an alternative biosynthesis pathway depicted in FIG. 1 A.
- an acyl-ACP is converted into a fatty aldehyde catalyzed by an acyl- ACP reductase (EC 6.4.1.2).
- the fatty aldehyde is further reduced into a fatty alcohol by a fatty alcohol biosynthetic polypeptide of the present invention, for example, by a fatty aldehyde reductase, an alcohol dehydrogenase, an oxidoreductase, an aldo-keto reductase, or a short-chain dehydrogenase.
- Exemplary embodiments of fatty alcohol biosynthetic enzymes of the present invention includes, without limitation, adhP, dkgA, dkgB, rspB, yahK, ybbO, ybdH, ybdR, ygfF, yhdH, yjgB, aroB, ycjQ, ydbC, ydjG, yeaE, yncB, yghD, ydjL, Tas, among others.
- Suitable substrates of these enzymes include fatty aldehydes, for example fatty aldehydes with carbon chain lengths from C 10 to C 18 .
- Suitable co-factors include, without limitation, NAD, NAD(P), NADH, or NADPH.
- the methods described herein can be used to produce fatty alcohols in an engineered microorganism by conversion of fatty aldehydes into fatty alcohols.
- the fatty alcohol is produced by a fatty alcohol biosynthetic polypeptide having an amino acid sequence listed provided herein, as well as polypeptide variant thereof.
- an acyl- ACP reductase polypeptide is one that includes one or more of the amino acid motifs disclosed herein.
- the polypeptide can comprise one or more of SEQ ID NO: 155, 156, 157, 158, 159, 160, 161 , 162, 163, 164, or 165.
- a fatty alcohol is produced by expressing a gene encoding a fatty alcohol biosynthetic polypeptide that is capable of catalyzing the enzymatic conversion of a fatty aldehyde to a fatty alcohol.
- the method further includes isolating the fatty alcohol from the host cell.
- the fatty alcohol is present in the extracellular environment.
- the fatty alcohol is isolated from the extracellular environment.
- the fatty alcohol is spontaneously secreted, partially or completely, from the host cell.
- the fatty alcohol is transported into the extracellular environment.
- the fatty alcohol is passively transported into the extracellular environment.
- the method further includes purifying the fatty alcohol.
- the fatty alcohol biosynthetic polypeptide is about 200 amino acids to about 800 amino acids in length. In certain embodiments, the polypeptide is about 250 amino acids to about 700 amino acids in length, for example, is about 300 to about 600 amino acids in length, about 350 to about 500 amino acids in length, or about 350 to about 450 amino acids in length. In other embodiments, the fatty alcohol biosynthetic polypeptide is up to about 800 amino acids in length, for example, up to about 700 amino acids in length, about 600 amino acids in length, about 500 amino acids in length, about 450 amino acids in length, about 400 amino acids in length, about 350 amino acids in length, about 300 amino acids in length, about 250 amino acids in length, or about 200 amino acids in length.
- the fatty alcohol biosynthetic polypeptide is more than about 200 amino acids in length, for example, more than about 250 amino acids in length, about 300 amino acids in length, about 350 amino acids in length, about 400 amino acids in length, about 450 amino acids in length, about 500 amino acids in length, about 600 amino acids in length, about 700 amino acids in length, or about 800 amino acids in length.
- the fatty alcohol biosynthetic polypeptide comprises the amino acid sequence of SEQ ID NO: 1 , 3, 5, 7, 9, 1 1 , 13, 15, 17, 19, 21 , 23, 25, 27, 29, 31, 33, 35, 37, or 39, with one or more amino acid substitutions, additions, insertions, or deletions, wherein the polypeptide is capable of catalyzing the enzymatic conversion of a fatty aldehyde to a fatty alcohol under suitable conditions, for example, in the presence of suitable substrates and/or co- factors.
- the polypeptide is capable of catalyzing the enzymatic conversion of a fatty aldehyde into a fatty alcohol under suitable conditions, for example, in the presence of suitable substrates and/or co-factors.
- the polypeptide is a fatty aldehyde reductase and/or has fatty aldehyde reductase activity (EC 1.1.1.1).
- the polypeptide is an alcohol dehydrogenase and/or has alcohol dehydrogenase activity.
- the polypeptide is an aldo-keto reductase and/or has aldo-keto reductase activity.
- the polypeptide is a short-chain dehydrogenase and/or has short-chain dehydrogenase activity. In yet other embodiments, the polypeptide is an oxidoreductase and/or has oxidoreductase activity. In certain further embodiments, the polypeptide comprises one or more NAD(P)- or NAD(P)H- binding domains and/or is associated with an NAD(P) or NAD(P)H co-factor. In yet further embodiments, the three-dimensional or the predicted three-dimensional structure of the polypeptide comprises a Rossman fold.
- the fatty alcohol biosynthetic is a mutant or variant.
- Various known activity assays can be used to determine the enzymatic activity of a putative fatty alcohol biosynthetic polypeptide. These assays can be suitable or useful for determining, for example, the expression or level of various fatty alcohol biosynthetic polypeptides in an engineered host cell or microorganism.
- a 1.0 mL reaction mixture consisting of 5 mM aldehyde substrate, 40 mM potassium phosphate buffer, pH7.0, 125 ⁇ NADPH and enzyme can be prepared.
- One unit can be defined as the amount of enzyme activity catalyzing the conversion of 1.0 ⁇ of pyridine nucleotide per minute.
- a similar assay with somewhat different conditions can be carried out to determine the fatty alcohol biosynthetic enzymatic activity. See, e.g., Wahlen et al., App. Environ. Microbiol. 75(9):2758-2764 (2009).
- the assay can be run under an argon atmosphere in septum-sealed vials overnight at room temperature with constant and gentle mixing.
- the products of the reaction can then be extracted from the buffer by adding an equal volume of hexane, and organic layer components can be analyzed by gas chromatography equipped with a flame ionization detector (30 m by 0.32 mm inner diameter with 0.5 ⁇ film thickness, with argon as a carrier and a temperature ramp of, for example, from 60°C to 360°C, increasing at 10°C per minute).
- a continuous spectrophotometric assay can also be developed to determine a given polypeptide's capacity to convert a fatty aldehyde into a fatty alcohol.
- the activity assays and conditions described in the examples herein are also suitable for this determination.
- the fatty alcohol biosynthetic polypeptide has an amino acid sequence that is at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91 %, at least about 92%, at least about 93%, at least about 94%, at least about 95%), at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity to SEQ ID NO: 1 , 3, 5, 7, 9, 1 1 , 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, or 39.
- the polypeptide has the amino acid sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21 , 23, 25, 27, 29, 31, 33, 35, 37, or 39.
- the nucleotide sequence has at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity to SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40.
- the nucleotide sequence is SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40.
- the nucleotide sequence hybridizes to a complement of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40, or to a fragment thereof, for example, under low stringency, medium stringency, high stringency, or very high stringency conditions, wherein the polynucleotide encodes a polypeptide that is capable of catalyzing the enzymatic conversion of a fatty aldehyde to a fatty alcohol under suitable conditions, for example, in the presence of suitable substrates and/or co-factors.
- the polynucleotide encodes a fatty alcohol biosynthetic enzyme.
- the polynucleotide encodes a fatty aldehyde reductase and/or encodes a polypeptide having fatty aldehyde reductase activity.
- the polynucleotide encodes a fatty alcohol biosynthetic enzyme.
- the polynucleotide
- polynucleotides encodes an alcohol dehydrogenase and/or encodes a polypeptide having alcohol dehydrogenase activity. In other embodiments, the polynucleotide encodes an oxidoreductase and/or a polypeptide having oxidoreductase activity. In certain embodiments, the polynucleotide encodes an aldo-keto reductase and/or a polypeptide having aldo-keto reductase activity. In certain other embodiments, the polynucleotide encodes a short-chain dehydrogenase and/or a polypeptide having short-chain dehydrogenase activity.
- the polypeptide comprises one or more NAD(P)- or NAD(P)H- binding domains or is associated with an NAD(P) or NAD(P)H co-factors.
- the three-dimensional structure or the predicted three-dimensional structure of the polypeptide comprises a Rossman fold.
- the method can produce fatty alcohols comprising a C 6 -C 26 fatty alcohol.
- the fatty alcohol comprises a C 6 , C 7 , C 8 , C9, do, C 1 1 , C 12 , Ci3, Ci4, C 15 , Ci 6 , Cn, C 18 , C ⁇ C 20 , C 2 j, C 22 , C 23 , C 24 , C 2 5, or a C 26 fatty alcohol.
- the fatty alcohol is a C 6 , C 8 , C 10 , C 12, d 3 , CM, C 15 , C 16 , C 17 , or Ci8 fatty alcohol.
- the hydroxyl group of the fatty alcohol is in the primary (Ci) position.
- the fatty alcohol comprises a straight chain fatty alcohol.
- the fatty alcohol comprises a branched chain fatty alcohol.
- the fatty alcohol comprises a cyclic moiety.
- the fatty alcohol is an unsaturated fatty alcohol.
- the fatty alcohol is a monounsaturated fatty alcohol.
- the unsaturated fatty alcohol is a C6: l, C7: l , C8: l , C9: l , C10: l , Cl l : l , C12: l , C13 : l , C14: l , C15: l , C16: l , C17: l , C18: l , C19: l , C20: l , C21 : l , C22: l, C23 : l , C24: l , C25: l , or a C26: l unsaturated fatty alcohol.
- the fatty alcohol is unsaturated at the omega-7 position.
- the unsaturated fatty alcohol comprises a cis double
- the fatty alcohol is a saturated fatty alcohol.
- a suitable substrate for the polypeptide can be a fatty aldehyde.
- the fatty aldehyde comprises a C 6 - C 2 fatty aldehyde.
- the fatty aldehyde comprises a C 6 , C 7 , C 8 , C9, do, Cn, C12, C13, Cn, Cis, Ci6, Cn, de, C19, C 20 , C 2 ⁇ , C 22 , C 23 , C 24 , C 25 , or a C 26 fatty aldehyde.
- the fatty aldehyde is a C 6 , C 8 , do, C12, C 13 , Cj 4 , C 15 , C] 6 , C 17 , or C) 8 fatty aldehyde.
- the fatty aldehyde comprises a straight chain fatty aldehyde. In other embodiments, the fatty aldehyde comprises a branched chain fatty aldehyde. In yet other embodiments, the fatty aldehyde comprise one or more cyclic moieties. [00134] In some embodiments, the fatty aldehyde is an unsaturated fatty aldehyde. In other embodiments, the fatty aldehyde substrate is a monounsaturated fatty aldehyde. In yet other embodiments, the fatty aldehyde is a saturated fatty aldehyde.
- a suitable co-factor for the fatty alcohol biosynthetic polypeptide can be, for example, NAD, NADP, NADH, and/or NADPH.
- the polypeptide comprises a co-factor binding domain or is associated with one of more of the co-factors.
- the three-dimensional structure or the predicted three-dimensional structure of the polypeptide comprises a Rossman fold.
- the invention features an engineered microorganism comprising an exogenous control sequence stably incorporated into the genomic DNA of the microorganism upstream of a fatty alcohol biosynthetic polynucleotide comprising a nucleotide sequence having at least about 50% sequence identity to the nucleotide sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40, wherein the microorganism produces an increased level of a fatty alcohol relative to a wild-type microorganism.
- the nucleotide sequence has at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91 %, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity to SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40.
- the nucleotide sequence is SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40.
- the fatty alcohol biosynthetic polynucleotide is endogenous to the microorganism.
- the microorganism is engineered to express a modified level of a gene encoding a fatty acid derivative enzyme.
- modifying the expression of a gene encoding a fatty acid derivative enzyme includes expressing a gene encoding a fatty acid derivative enzyme and/or increasing the expression or activity of an endogenous fatty acid derivative enzyme.
- modifying the expression of a gene encoding a fatty acid derivative enzyme includes attenuating a gene encoding a fatty acid derivative enzyme and/or decreasing the expression or activity of an endogenous fatty acid derivative enzyme.
- the fatty acid derivative enzyme is a fatty acid synthase.
- the fatty acid derivative enzyme is a thioesterase (EC 3.1. 2.14 or EC 3.1.1.5).
- the thioesterase is encoded by tesA, tesA without leader sequence, tesB, fatB, fatB2, fatB3, fatA, or fatAl.
- one or more of the fatty alcohol biosynthetic polypeptides are overexpressed relative to expression in a wild type host cell.
- the fatty alcohol biosynthetic polypeptide described herein produce fatty alcohols from substrate via a reduction mechanism.
- the substrate is a fatty aldehyde or a derivative thereof, a fatty alcohol having particular branching patterns and carbon chain lengths can be produced from a fatty aldehyde having those characteristics that would result in a particular fatty alcohol.
- the fatty aldehyde substrates can, in turn, be obtained from another reaction mechanism, including, for example, via a reaction converting a fatty acid catalyzed by a fatty aldehyde biosynthetic enzyme or via a reaction converting an acyl-ACP substrate catalyzed by an acyl-ACP reductase.
- each step within a biosynthetic pathway that leads to the production of a fatty aldehyde derivative substrate can be modified to produce or overproduce the substrate of interest.
- known genes involved in the fatty acid biosynthetic pathway or the fatty aldehyde pathway can be expressed, overexpressed, or attenuated in host cells to produce a desired substrate ⁇ see, e.g., various enzymes described in PCT/US08/058788, incorporated by reference herein).
- a suitable fatty acid substrate can be converted into a fatty aldehyde substrate by, for example, a fatty aldehyde biosynthetic polypeptide such as a carboxylic acid reductase, or an acyl-ACP reductase.
- a fatty aldehyde biosynthetic polypeptide such as a carboxylic acid reductase, or an acyl-ACP reductase.
- the fatty aldehyde biosynthetic polypeptide can be selected from those described herein, or variants thereof.
- the acyl-ACP reductase can be one selected from those described herein, or a variant thereof.
- the fatty aldehyde substrate can be converted into a fatty alcohol by, for example, a gene encoding a fatty alcohol biosynthetic polypeptide of the present invention.
- a gene encoding a fatty alcohol biosynthetic polypeptide described herein can be expressed in a host cell that expresses an endogenous fatty alcohol biosynthetic polypeptide capable of converting a fatty aldehyde produced by the fatty aldehyde biosynthetic polypeptide into a corresponding fatty alcohol.
- a gene encoding a fatty alcohol biosynthetic polypeptide described herein such as an amino acid sequence selected from SEQ ID NO: l , 3, 5, 7, 9, 1 1 , 13, 15, 17, 19, 21 , 23, 25, 27, 29, 31 , 33, 35, 37, or 39, or a variant thereof.
- the fatty alcohol biosynthetic polypeptide described herein can be encoded by a polynucleotide comprising a sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40, or a variant thereof.
- the fatty alcohol biosynthetic polypeptide can be one selected from an AdhP homolog of FIG. 2, a DkgA homolog of FIG. 3, a DkgB homolog of FIG. 4, a Tas homolog of FIG. 5, an RspB homolog of FIG. 6, a YahK homolog of FIG. 7, a YbbO homolog of FIG. 8, a YbdH homolog of FIG. 9, a YbdR homolog of FIG. 10, a YgfF homolog of FIG. 1 1 , a YhdH homomolg of FIG. 12, a YjgB homolog of FIG. 13, an AroB homolog of FIG.
- the gene encoding a fatty alcohol biosynthetic polypeptide can be co-expressed in a host cell with a gene encoding a fatty aldehyde biosynthetic polypeptide or with a gene encoding an acyl-ACP reductase polypeptide described herein.
- the gene has a nucleotide sequence selected from those described herein, as well as polynucleotide variants thereof.
- the fatty alcohol biosynthetic gene is one encoding an AdhP homolog of FIG. 2, as well as polynucleotide variants thereof.
- the fatty alcohol biosynthetic gene is one encoding a DkgA homolog of FIG. 3, or one encoding a DkgB homolog of FIG. 4, or one encoding a Tas homolog of FIG. 5, or one encoding a RspB homolog of FIG. 6, or one encoding a YahK homolog of FIG.
- Suitable variants such as those listed in, for example, FIGs. 2-21 , can be identified using bioinformatic tools such as described hereinbelow.
- Fatty acid synthase is a group of polypeptides that catalyze the initiation and elongation of acyl chains (Marrakchi et al, Biochemical Society, 30: 1050-1055, 2002).
- the acyl carrier protein (ACP) along with the enzymes in the FAS pathway control the length, degree of saturation, and branching of the fatty acid derivatives produced.
- the fatty acid biosynthetic pathway involves the precursors acetyl-CoA and malonyl-CoA.
- the steps in this pathway are catalyzed by enzymes of the fatty acid biosynthesis (fab) and acetyl-CoA carboxylase (ace) gene families (see, e.g., Heath et al , Prog. Lipid Res. 40(6):467-97 (2001)).
- fab fatty acid biosynthesis
- ace acetyl-CoA carboxylase
- Host cells can be engineered to express fatty acid derivative substrates by
- fatty acid synthase genes such as acetyl-CoA and/or malonyl-CoA synthase genes.
- acetyl-CoA a multi enzyme complex comprising aceEF (which encodes the Elp dehydrogenase component, the E2p dihydrolipoamide acyltransferase component of the pyruvate and 2-oxoglutarate dehydrogenase complexes, and Ipd), panK, fabH, fabB, fabD, fabG, acpP, and fabF.
- GenBank accession numbers for these genes are: pdh (BAB34380, AAC73227, AAC73226), panK (also known as CoA, AAC76952), aceEF (AAC73227, AAC73226), /3 ⁇ 4H (AAC74175), fabB (P0A953), fabD (AAC74176), fabG (AAC74177), acpP (AAC74178),/ab (AAC74179).
- the expression levels of fadE, gpsA, IdhA, pflb, adhE, pta, poxB, ackA, and/or ackB can be attenuated or knocked-out in an engineered host cell by transformation with conditionally replicative or non-replicative plasmids containing null or deletion mutations of the corresponding genes or by substituting promoter or enhancer sequences.
- GenBank accession numbers for these genes are: fadE (AAC73325), gspA (AAC76632), IdhA (AAC74462), pflb (AAC73989), adhE (AAC74323J, pta ⁇ AAC15351), poxB (AAC73958), ackA (AAC75356), and ackB (BAB81430).
- the resulting host cells will have increased acetyl-CoA production levels when grown in an appropriate environment.
- Malonyl-CoA overexpression can be affected by introducing accABCD ⁇ e.g., accession number AAC73296, EC 6.4.1.2) into a host cell.
- Fatty acids can be further overexpressed in host cells by introducing into the host cell a DNA sequence encoding a lipase ⁇ e.g., accession numbers CAA89087, CAA98876).
- PlsB can lead to an increase in the levels of long chain acyl- ACP, which will inhibit early steps in the pathway ⁇ e.g., accABCD, fabH, and fabl).
- the plsB ⁇ e.g., accession number AAC7701 1) D31 IE mutation can be used to increase the amount of available fatty acids.
- a host cell can be engineered to overexpress a sfa gene (suppressor of fabA, e.g., accession number AAN79592) to increase production of monounsaturated fatty acids (Rock et al, J. Bacteriology 178:5382-5387, 1996).
- a sfa gene suppressor of fabA, e.g., accession number AAN79592
- the chain length of a fatty acid derivative substrate can be selected for by modifying the expression of selected thioesterases (EC 3.1. 2.14 or EC 3.1.1.5).
- the thioesterase influences the chain length of fatty acids produced.
- host cells can be engineered to express, overexpress, have attenuated expression, or not to express one or more selected thioesterases to increase the production of a preferred fatty acid derivative substrate.
- C 10 fatty acids can be produced by expressing a thioesterase that has a preference for producing Cio fatty acids and attenuating thioesterases that have a preference for producing fatty acids other than Cio fatty acids ⁇ e.g., a thioesterase which prefers to produce Ci 4 fatty acids). This would result in a relatively homogeneous population of fatty acids that have a carbon chain length of 10.
- C) 4 fatty acids can be produced by attenuating endogenous thioesterases that produce non-Cn fatty acids and expressing the thioesterases that use C14-ACP.
- C 12 fatty acids can be produced by expressing thioesterases that use C12-ACP and attenuating thioesterases that produce non-Ci 2 fatty acids.
- Acetyl-CoA, malonyl-CoA, and fatty acid overproduction can be verified using methods known in the art, for example, by using radioactive precursors, HPLC, or GC-MS subsequent to cell lysis.
- Non-limiting examples of thioesterases that can be used in the methods described herein are listed in Table 1.
- a fatty alcohol biosynthetic polypeptide, variant, or a fragment thereof is expressed in a host cell that contains a naturally occurring mutation that results in an increased level of fatty acids in the host cell.
- the host cell is genetically engineered to increase the level of fatty acids in the host cell relative to a corresponding wild- type host cell.
- the host cell can be genetically engineered to express a reduced level of an acyl-CoA synthase (EC 2.3.1.86) relative to a corresponding wild-type host cell.
- the host cell can be genetically engineered to express a reduced level of an acyl-CoA synthase relative to a corresponding wild-type host cell.
- the level of expression of one or more genes ⁇ e.g., an acyl-CoA synthase gene) is reduced by genetically engineering a "knock out" host cell.
- acyl-CoA synthase gene can be reduced or knocked out in a host cell.
- Non-limiting examples of acyl-CoA synthase genes include fa dD,fadK, BH3103, yhfL, Pfl-4354, EA V15023,fadDl adD2, RPCJ074 adDD35,fadDD22 aa3p or the gene encoding the protein ZP_01644857.
- Specific examples of acyl-CoA synthase genes include fadDD35 from M. tuberculosis H37Rv [NP_217021], fa dDD22 from M.
- tuberculosis H37Rv [ ⁇ _217464],/ ⁇ /£> from E. coli [NP_416319], fadK from E. coli [Y?_4 ⁇ 62 ⁇ 6], fadD from Acinetobacter sp.
- Bacillus subtilis [CAA99571 ], or those described in Shockey et al, Plant. Physiol. 129: 1710- 1722, 2002; Caviglia et al, J. Biol. Chem. 279: 1 163-1 169, 2004; Knoll et al. , J. Biol. Chem. 269(23): 16348-56, 1994; Johnson et al. , J. Biol. Chem. 269: 18037-18046, 1994; and Black et al., J. Biol Chem. 267: 25513-25520, 1992.
- Fatty aldehyde biosynthetic polypeptides refer to a group of polypeptides that can catalyze the enzymatic conversion of suitable fatty acid substrates into fatty aldehydes.
- Host cells can be engineered to express fatty aldehyde substrates by recombinantly expressing or overexpressing one or more fatty aldehyde biosynthetic genes, such as carboxylic acid reductases or fatty acid reductases.
- a fatty acid is first activated by ATP and then reduced by a carboxylic acid reductase (CAR)-like enzyme to generate a fatty aldehyde.
- CAR carboxylic acid reductase
- a fatty aldehyde is produced by expressing a fatty aldehyde biosynthetic gene, for example, a carboxylic acid reductase gene (car gene), having a nucleotide sequence provided herein, as well as nucleotide variants thereof.
- Examplary genes encode a polypeptide comprising SEQ ID NO: 41 , 43, 45, 47, 49, 51 , 53, 55, 57, 59, 61 , 63, 65, 69, 71 , 73, 75, 77, 79, 81 , 83, 85,
- the gene can comprise a polynucleotide sequence of SEQ ID NO: 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86,
- the fatty aldehyde biosynthetic polypeptide can comprise one or more of the amino acid motifs depicted herein in SEQ ID NO: 129-135.
- the fatty aldehyde biosynthetic gene can encode a polypeptide comprising SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO:131 , and SEQ ID NO: 132; SEQ ID NO: 133; SEQ ID NO:134; SEQ ID NO: 135; SEQ ID NO: 136; and/or SEQ ID NO:129, SEQ ID NO:131, SEQ ID NO: 132, and SEQ ID NO: 133.
- fatty aldehyde substrates can be produced using an enzymatic pathway involving an acyl-ACP reductase.
- a fatty aldehyde can be produced from a suitable substrate, including, for example, an acyl-ACP, an acyl-CoA, or others, by expressing an acyl-ACP reductase gene (aar gene), having a nucleotide sequence provided herein, as well as nucleotide variants thereof.
- the acyl-ACP reductase gene can encode a polypeptide comprising SEQ ID NO: 155, 156, 157, 158, 159, 160, 161 , 162, 163, 164, or 165.
- acyl-ACP acyl-CoA
- fatty aldehydes fatty alcohols
- fatty alcohols which are described in, for example, PCT/US08/058788, the disclosure of which is incorporated herein by reference.
- the host cell is genetically engineered to express an attenuated level of a fatty acid degradation enzyme relative to a wild type host cell. In some embodiments, the host cell is genetically engineered to express an attenuated level of an acyl-CoA synthase (EC 2.3.1.86) relative to a wild type host cell.
- the host cell expresses an attenuated level of an acyl-CoA synthase encoded by fadD,fadK, BH3103, yhfL, Pfl-4354, EA V15023,fadDl,fadD2, RPC_4074,fadDD35,fadDD22,faa3p or the gene encoding the protein ZP_01644857.
- the genetically engineered host cell comprises a knockout of one or more genes encoding a fatty acid degradation enzyme, such as the
- the host cell is genetically engineered to express an attenuated level of a dehydratase/isomerase enzyme, such as an enzyme encoded by fabA or by a gene listed in the table of FIG. 22.
- the host cell comprises a knockout of fabA or a gene listed in the table of FIG. 22.
- the host cell is genetically engineered to express an attenuated level of a ketoacyl-ACP synthase, such as an enzyme encoded by fabB or by a gene listed in the table of FIG. 23.
- the host cell comprises a knockout of fabB or a gene listed in the table of FIG. 23.
- the host cell is genetically engineered to express a modified level of a gene encoding a desaturase enzyme, such as desA.
- Fatty alcohols can be produced from fatty aldehydes substrates that contain branched points by using a fatty alcohol biosynthetic polypeptide as described herein.
- the branched fatty aldehydes can be made from branched fatty acid derivatives as substrates for a fatty aldehyde biosynthetic polypeptide as described herein.
- E.coli naturally produces straight chain fatty acids (sFAs)
- E.coli can be engineered to produce branched chain fatty acids (brFAs) by introducing and expressing or overexpressing genes that provide branched precursors in the E.coli ⁇ e.g., bkd, ilv, icm, and fab gene families).
- a host cell can be engineered to express or overexpress genes encoding proteins for the elongation of brFAs ⁇ e.g., ACP, FabF, etc.) and/or to delete or attenuate the corresponding host cell genes that normally lead to sFAs.
- brFAs ⁇ e.g., ACP, FabF, etc.
- the degree of saturation in fatty acids can be controlled by regulating the degree of saturation of fatty acid intermediates.
- the sfa, gns, and fab families of genes can be expressed, overexpressed, or expressed at reduced levels, to control the saturation of fatty acids.
- Non-limiting examples of these genes include sfa [GenBank Accession No. AAN 79592, AAC 44390], gnsA [GenBank Accession No. ABD 18647.1], gnsB [GenBank Accession No. AAC 74076A],fabB [GenBank Accession No.
- host cells can be engineered to produce unsaturated fatty acids by engineering the production host to overexpress fabB or by growing the production host at low temperatures ⁇ e.g., less than 37 °C).
- FabB has preference to cis- 3decenoyl-ACP and results in unsaturated fatty acid production in E. coli.
- Overexpression of fabB results in the production of a significant percentage of unsaturated fatty acids (de Mendoza et al, J. Biol. Chem. 258:2098- 2101, 1983).
- the gene fabB may be inserted into and expressed in host cells not naturally having the gene. These unsaturated fatty acids can then be used as intermediates in host cells that are engineered to produce fatty acid derivatives, such as fatty aldehydes.
- a repressor of fatty acid biosynthesis for example, fabR (GenBank accession NP 418398 ), can be deleted, which will also result in increased unsaturated fatty acid production in E. coli (Zhang et al, J. Biol. Chem. 277: 15558, 2002). Similar deletions may be made in other host cells.
- a further increase in unsaturated fatty acids may be achieved, for example, by overexpressing «0 (trans-2, cis-3-decenoyl-ACP isomerase, GenBank accession DAA05501) and controlled expression of fabK (trans-2-enoyl-ACP reductase II, GenBank accession NP_357969) from Streptococcus pneumoniae (Marrakchi et al, J. Biol. Chem. 277: 44809, 2002), while deleting E. coli fabl (trans-2-enoyl-ACP reductase, GenBank accession NP_415804).
- the endogenous fabF gene can be attenuated, thus increasing the percentage of palmitoleate (CI 6: 1) produced.
- host cells can be engineered to produce saturated fatty acids by reducing the expression of an sfa, gns, and/or fab gene.
- Cyclic fatty alcohols can be produced from cyclic fatty aldehydes using cyclic fatty acid derivatives as substrates for a fatty aldehyde biosynthetic polypeptide described herein.
- genes that provide cyclic precursors ⁇ e.g., the ans, chc, and plm gene families) can be introduced into the host cell and expressed to allow initiation of fatty acid biosynthesis from cyclic precursors.
- the microorganism is further engineered to express a modified level of a gene encoding a fatty aldehyde biosynthesis polypeptide.
- modifying the expression of a gene encoding a fatty aldehyde biosynthesis polypeptide includes expressing a gene encoding a fatty aldehyde biosynthetic enzyme and/or increasing the expression or activity of an endogenous fatty aldehyde biosynthetic enzyme.
- the fatty aldehyde biosynthesis gene encodes a carboxylic acid reductase.
- the fatty aldehyde biosynthetic gene encodes a fatty acid reductase.
- the fatty aldehyde biosynthetic polypeptide comprises the amino acid sequence of SEQ ID NO: 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119,
- the fatty aldehyde biosynthetic polypeptide comprises an amino acid sequence having at least about 80% (e.g., at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%) sequence identity to the amino acid sequence of SEQ ID NO: 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, or 127.
- the fatty aldehyde biosynthetic polypeptide is encoded by a polynucleotide having the sequence of SEQ ID NO:42, 44, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, or 128, or by a variant thereof.
- the fatty aldehyde biosynthetic polypeptide is encoded by a polynucleotide having at least 80% sequence identity to the sequence of SEQ ID NO:42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, or 128.
- the method further comprises expressing a gene encoding a fatty aldehyde biosynthesis polypeptide in the host cell.
- the fatty aldehyde biosynthetic polypeptide comprises the amino acid sequence of SEQ ID NO: 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 69, 71, 73, 75, 77, 79, 81,83, 85, 87, 89,91,93,97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, or a variant thereof.
- the fatty aldehyde biosynthetic polypeptide comprises an amino acid sequence having at least about 80% sequence identity to SEQ ID NO: 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 97,99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, or 127.
- the fatty aldehyde biosynthetic polypeptide is encoded by a polynucleotide having the sequence of SEQ ID NO: 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78,80, 82, 84, 86, 88,90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120,
- the fatty aldehyde 122, 124, 126, 128, or by a variant thereof.
- the fatty aldehyde 122, 124, 126, 128, or by a variant thereof.
- biosynthetic polypeptide is encoded by a polynucleotide having at least about 80%> sequence identity to SEQ ID NO: 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, or 128.
- the method comprises expressing a gene encoding a fatty aldehyde biosynthetic polypeptide comprising one or more of the amino acid motifs provided herein.
- the fatty aldehyde biosynthetic gene can encode a polypeptide comprising SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131 , and SEQ ID NO: 132; SEQ ID NO: 133 ; SEQ ID NO: 134; SEQ ID NO: 135; SEQ ID NO: 136; and/or SEQ ID NO: 129, SEQ ID NO: 131 , SEQ ID NO: 132 and SEQ ID NO: 133.
- SEQ ID NO: 131 includes a reductase domain
- SEQ ID NO: 132 includes an NADP-binding domain
- SEQ ID NO: 133 includes a
- SEQ ID NO: 134 includes an AMP-binding domain.
- the invention further includes expressing in a host cell a gene encoding an acyl-ACP reductase polypeptide in the host cell.
- the acyl-ACP reductase polypeptide comprises the amino acid sequence of SEQ ID NO: 137, 139, 141 , 143, 145, 147, 149, 151 , 153, or a variant thereof.
- the acyl-ACP reductase polypeptide comprises an amino acid sequence that has at least about 70% (e.g., at least about 70%, at least about 75%>, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or at least about 99%) sequence identity to SEQ ID NO: 137, 139, 141 , 143, 145, 147, 149, 151 , or 153.
- the acyl-ACP reductase polypeptide is encoded by a polynucleotide having the sequence of SEQ ID NO: 138, 140, 142, 144, 146, 148, 150, 152, or 154, or by a variant thereof. In some embodiments, the acyl-ACP reductase polypeptide is encoded by a
- polynucleotide having at least about 70% sequence identity to the sequence of SEQ ID NO: 138, 140, 142, 144, 146, 148, 150, 152, or 154.
- the method includes expressing in a host cell an acyl- ACP reductase gene encoding a polypeptide comprising one or more of the amino acid motifs disclosed herein.
- the polypeptide can comprise one or more of SEQ ID NO: 155, 156, 157, 158, 159, 160, 161 , 162, 163, 164, or 165.
- compositions and methods described herein can be used to produce
- hydrocarbons including, for example, alkanes and alkenes, from an appropriate substrate.
- the invention is based, at least in part, on the identification of a number of fatty alcohol biosynthetic enzymes or polypeptides that are capable of catalyzing the conversion of a fatty aldehyde to a fatty alcohol under suitable conditions, for example, in the presence of suitable substrates and/or co-factors.
- fatty alcohol biosynthetic enzymes or polypeptides capable of catalyzing the conversion of a fatty aldehyde to a fatty alcohol under suitable conditions, for example, in the presence of suitable substrates and/or co-factors.
- polypeptides can be attenuated or deleted from the host cell, which expresses or overexpresses one or more hydrocarbon biosynthetic polypeptides, optionally also expresses or overexpresses one or more fatty aldehyde biosynthetic polypeptides or one or more acyl-ACP reductases.
- the resulting host cell can be used to produce hydrocarbons such as, for example, alkanes or alkenes.
- the hydrocarbons are produced by a biosynthetic pathway depicted in FIG. IB.
- a fatty acid is first activated by ATP and then reduced by a fatty aldehyde biosynthetic polypeptide such as a carboxylic acid reductase (CAR)-like enzyme to generate a fatty aldehyde.
- CAR carboxylic acid reductase
- the fatty aldehyde can then be subject to a hydrocarbon biosynthetic polypeptide such as a decarbonylase and be reduced into a hydrocarbon.
- hydrocarbons are produced by an alternative biosynthesis pathway depicted in FIG. IB.
- an acyl-ACP is converted into a fatty aldehyde catalyzed by an acyl- ACP reductase.
- the fatty aldehyde is further subject to a hydrocarbon biosynthetic polypeptide and converts to a hydrocarbon such as an alkane or an alkene.
- a hydrocarbon biosynthetic polypeptide such as an alkane or an alkene.
- the fatty aldehydes can, in the presence of endogenous fatty alcohol biosynthetic enzyme activity, be converted into fatty alcohols. Therefore, attenuating one or more fatty alcohol biosynthetic polypeptides, or in particular embodiments, deleting one or more fatty alcohol biosynthetic polypeptides from the host cell can improve the production of hydrocarbons.
- the method further includes culturing the host cell in the presence of at least one biological substrate of the hydrocarbon biosynthetic polypeptide, the fatty aldehyde biosynthetic polypeptide, and/or the acyl-ACP reductase polypeptide.
- suitable substrates include, without limitation, a fatty acid derivative, an acyl-ACP, a fatty acid, an acyl-CoA, a fatty aldehyde, a fatty alcohol, or a fatty ester.
- the invention features a method of producing a hydrocarbon, the method comprising expressing an attenuated level of one or more fatty alcohol biosynthetic genes or a mutant and variant thereof in a host cell. In certain embodiments, the method further comprises deleting one or more fatty alcohol biosynthetic genes or a mutant and variant thereof from the host cell. Fatty alcohol biosynthetic genes, polypeptides, sequence motifs, mutants and variants thereof, are described hereinabove.
- the host cell is engineered such that it comprises no detectable level of fatty alcohol biosynthetic enzyme activity, for example, a fatty aldehyde reductase activity, an alcohol dehydrogenase activity, an aldo-keto reductase activity, an oxidoreductase activity, or a short-chain dehydrogenase activity.
- fatty alcohol biosynthetic enzyme activity for example, a fatty aldehyde reductase activity, an alcohol dehydrogenase activity, an aldo-keto reductase activity, an oxidoreductase activity, or a short-chain dehydrogenase activity.
- the method further comprises expressing a gene encoding a hydrocarbon biosynthetic polypeptide in the host cell.
- the hydrocarbon biosynthetic polypeptide comprises the amino acid sequence of SEQ ID NO: 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, or a variant thereof.
- the hydrocarbon biosynthetic polypeptide comprises at least about 70% sequence identity to SEQ ID NO: 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, or 200.
- the hydrocarbon biosynthetic polypeptide is encoded by a polynucleotide having the sequence of SEQ ID NO: 167, 169, 171, 173, 175, 177, 179, 181 , 183, 185, 187, 189, 191, 193, 195, 197, 199, or 201, or by a variant thereof.
- the hydrocarbon biosynthetic polypeptide is encoded by a polynucleotide having at least about 70% sequence identity to SEQ ID NO: 167, 169, 171 , 173, 175, 177, 179, 181 , 183, 185, 187, 189, 191, 193, 195, 197, 199, or 201.
- the method comprises expressing a gene encoding a hydrocarbon biosynthetic polypeptide comprising one or more amino acid motifs disclosed herein.
- the hydrocarbon biosynthetic polypeptide can comprise the amino acid sequence motifs of: (1) SEQ ID NO: 202; or (2) SEQ ID NO: 203 or SEQ ID NO:204, or SEQ ID NO:205; or (3) SEQ ID NO:206, and any one of SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205; or (4) SEQ ID NO:207 or SEQ ID NO:208, or SEQ ID NO:209, or SEQ ID NO:210.
- the hydrocarbon biosynthetic polypeptide has decarbonylase activity.
- the method further comprises isolating the hydrocarbon from the host cell. [00178] In some embodiments, the method further comprises expressing a gene encoding a fatty aldehyde biosynthesis polypeptide in the host cell. Fatty aldehyde biosynthetic genes, polypeptides, sequence motifs, mutants, and variants thereof are described hereinabove.
- the method can produce hydrocarbons.
- the hydrocarbon produced is an alkane.
- the alkane is a C 3 -C 2 5 alkane.
- the alkane is a C 3 , C 4 , C 5 , C 6 , C 7 , C 8 , C9, Cio, Cn, C 12 , Ci 3 , C]4, C] 5 , Cj6, Cn, Ci 8 , C19, C 2 o, C 2 i , C 22 , C 23 , C 24 , or C 25 alkane.
- the alkane is tridecane, methyltridecane, nonadecane, methylnonadecane, heptadecane, methylheptadecane, pentadecane, or methylpentadecane.
- the method further comprising culturing the host cell in the presence of a saturated fatty acid derivative, and the hydrocarbon produced is an alkane.
- the saturated fatty acid derivative is a C 6 -C 26 fatty acid derivative substrate.
- the fatty acid derivative substrate is a C 6 , C 7 , C 8 , C9, Cio, Cn, C12, Q3, C 14 , C 15 , Ci 6 , Cn, C 18 , Ci9, C 20 , C 2 i, C 22 , C 23 , C 24 , C 25 , or a C 26 fatty acid derivative substrate.
- the fatty acid derivative substrate is 2-methylicosanal, icosanal, octadecanal, tetradecanal, 2-methyloctadecanal, stearaldehyde, or palmitaldehyde.
- the method further includes isolating the alkane from the host cell or from the culture medium. In certain embodiments, the method further includes cracking or refining the alkane.
- the hydrocarbon carbon produced can be an alkene.
- the alkene is a C 3 -C 25 alkene.
- the alkene is a C 3 , C 4 , C 5 , C 6 , C 7 , C 8 , C9, Cio, Cn, C] 2 , C13, CM, Ci 5 , Ci 6 , C ⁇ i, Cj , C19, C 2 o, C 21 , C 22 , C 23 , C 24 , or C 2 alkene.
- the alkene is pentadecene, heptadecene, methylpentadecene, or methylheptadecene.
- the alkene is a straight chain alkene, a branched chain alkene, or a cyclic alkene.
- the method further comprises culturing the host cell in the presence of an unsaturated fatty acid derivative, and the hydrocarbon produced is an alkene.
- the unsaturated fatty acid derivative is a C 6 -C 2 6 fatty acid derivative substrate.
- the fatty acid derivative substrate is a C 6 , C 7 , C 8 , C9, Ci 0 , Q 1 , C 12 , C 13 , C 14 , C] 5 , Ci6, C] 7 , Ci8 ; Ci 9, C 20 , C 2 i, C 22 , C23, C 2 4, C25, or a C 26 unsaturated fatty acid derivative substrate.
- the fatty acid derivative substrate is octadecenal, hexadecenal, methylhexadecenal, or methyloctadecenal.
- the invention features a genetically engineered microorganism wherein the microorganism produces an increased level of a hydrocarbon relative to a wild-type microorganism.
- the invention features a method of making a hydrocarbon, the method comprising culturing a genetically engineered microorganism described herein under conditions suitable for gene expression, and isolating the hydrocarbon.
- the method comprising culturing the genetically engineered microorganism in the presence of a suitable biological substrate for the hydrocarbon biosynthetic polypeptide, the fatty aldehyde biosynthetic polypeptide, and/or the acyl-ACP reductase.
- the biological substrate is a fatty acid derivative, an acyl-ACP, a fatty acid, an acyl-CoA, a fatty aldehyde, a fatty alcohol, or a fatty ester.
- the substrate is a saturated fatty acid derivative
- the hydrocarbon produced is an alkane, for example, a C 3 -C 25 alkane.
- the alkane is a C 3 , C 4 , C 5 , C 6 , C 7 , C 8 , C 3 ⁇ 4 C] 0 , C11 , C12, Co, Ci4, C 15 , C 16 , C 17 , Ci 8 , C19, C20, C 21 , C 22 , C23, C 24 , or C 25 alkane.
- the alkane is tridecane, methyltridecane, nonadecane, methylnonadecane, heptadecane, methylheptadecane, pentadecane, or methylpentadecane.
- the alkane is a straight chain alkane, a branched chain alkane, or a cyclic alkane.
- the saturated fatty acid derivative is 2-methylicosanal, icosanal, octadecanal, tetradecanal, 2-methyloctadecanal, stearaldehyde, or palmitaldehyde.
- the biological substrate is an unsaturated fatty acid derivative and the hydrocarbon produced by the microorganism is an alkene, for example, a C3-C25 alkene.
- the alkene is a C 3 , C 4 , C 5 , C 6 , C 7 , C 8 , C 9 , d 0 , C n , C ]2 , C13, C ]4 , C ]5 , C ]6 , C ] 7 , Ci 8 , Ci9, C20, C 2 i, C22, C23, C2 4 , or C25 alkene.
- the alkene is pentadecene, heptadecene, methylpentadecene, or methylheptadecene.
- the alkene is a straight chain alkene, a branched chain alkene, or a cyclic alkene.
- the unsaturated fatty acid derivative is octadecenal, hexadecenal, methylhexadecenal, or methyloctadecenal.
- the invention features a hydrocarbon produced by any of the methods or microorganisms described herein.
- the hydrocarbon is an alkane or an alkene having a 6 13 C of about -15.4 or greater.
- the alkane or alkene has a 6 13 C of about -15.4 to about -10.9, or of about -13.92 to about -13.84.
- the alkane or alkene has an fjvi 14 C of at least about 1.003. In certain embodiments, the alkene or alkene has an fM i4 C of at least about 1.01 or at least about 1.5. In some embodiments, the alkane or alkene has an fM 14 C of about 1.1 11 to about 1.124.
- the invention features a biofuel comprising a hydrocarbon produced by any of the methods or microorganisms described herein.
- the hydrocarbon is an alkane or an alkene having a 5 13 C of about -15.4 or greater.
- the alkane or alkene has a ⁇ C of about -15.4 to about -10.9, or of about -13.92 to about -13.84.
- the alkane or alkene has an fM 14 C of at least about 1.003.
- the alkane or alkene has an fM 14 C of at least about 1.003.
- the alkane or alkene has an fM l4 C of at least about 1.01 or at least about 1.5.
- the alkane or alkene has an fwi 14 C of about 1.111 to about 1.124.
- a hydrocarbon is produced in a host cell or a microorganism described herein from a carbon source.
- a "variant" of polypeptide X refers to a polypeptide having the amino acid sequence of peptide X in which one or more amino acid residues is altered.
- the variant may have conservative changes or nonconservative changes.
- Guidance in determining which amino acid residues may be substituted, inserted, or deleted without affecting biological activity may be found using computer programs well known in the art, for example, LASERGENE software (DNASTAR).
- the term "variant,” when used in the context of a polynucleotide sequence may encompass a polynucleotide sequence related to that of a gene or the coding sequence thereof.
- This definition may also include, for example, "allelic,” “splice,” “species,” or “polymorphic” variants.
- a splice variant may have significant identity to a reference polynucleotide, but will generally have a greater or fewer number of polynucleotides due to alternative splicing of exons during mRNA processing.
- the corresponding polypeptide may possess additional functional domains or an absence of domains.
- Species variants are polynucleotide sequences that vary from one species to another. The resulting polypeptides generally will have significant amino acid identity relative to each other.
- a polymorphic variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given species.
- Suitable variants can be identified using bioinformatic tools such as searching for the "bidirectional best hits" against the public databases, such as for example, the Kyoto Encyclopedia of Gene & Genomes (KEGG) database, and selecting bidirectional best hits having a Smith-Waterman score of, for example, above 1000.
- bioinformatics tools known to those skilled in the art, including for example, a bi-directional blast against known genome databases and the E.coli genome, can also be used for this purpose to identify homologs.
- Variants can be naturally occurring or created in vitro.
- such variants can be created using genetic engineering techniques, such as site directed mutagenesis, random chemical mutagenesis, Exonuclease III deletion procedures, or standard cloning techniques.
- such variants, fragments, analogs, or derivatives can be created using chemical synthesis or modification procedures.
- Methods of making variants are well known in the art. These include procedures in which nucleic acid sequences obtained from natural isolates are modified to generate nucleic acids that encode polypeptides having characteristics that enhance their value in industrial or laboratory applications. In such procedures, a large number of variant sequences having one or more nucleotide differences with respect to the sequence obtained from the natural isolate are generated and characterized. Typically, these nucleotide differences result in amino acid changes with respect to the polypeptides encoded by the nucleic acids from the natural isolates.
- variants can be created using error prone PCR (see, e.g., Leung et al, Technique 1 : 11-15, 1989; and Caldwell et al, PCR Methods Applic. 2:28-33, 1992).
- error prone PCR PCR is performed under conditions where the copying fidelity of the DNA polymerase is low, such that a high rate of point mutations is obtained along the entire length of the PCR product.
- nucleic acids to be mutagenized e.g., a fatty aldehyde biosynthetic polynucleotide sequence
- PCR primers e.g., a fatty aldehyde biosynthetic polynucleotide sequence
- reaction buffer MgCl2, MnCl 2 , Taq polymerase, and an appropriate concentration of dNTPs for achieving a high rate of point mutation along the entire length of the PCR product.
- the reaction can be performed using 20 fmoles of nucleic acid to be mutagenized (e.g., a fatty aldehyde biosynthetic polynucleotide sequence), 30 pmole of each PCR primer, a reaction buffer comprising 50 mM C1, 10 mM Tris HC1 (pH 8.3), and 0.01 % gelatin, 7 mM MgCl 2 , 0.5 mM MnCl 2 , 5 units of Taq polymerase, 0.2 mM dGTP, 0.2 mM dATP, 1 mM dCTP, and 1 mM dTTP.
- nucleic acid to be mutagenized e.g., a fatty aldehyde biosynthetic polynucleotide sequence
- a reaction buffer comprising 50 mM C1, 10 mM Tris HC1 (pH 8.3), and 0.01 % gelatin, 7 mM MgCl 2 ,
- PCR can be performed for 30 cycles of 94°C for 1 min, 45°C for 1 min, and 72°C for 1 min. However, it will be appreciated that these parameters can be varied as appropriate.
- the mutagenized nucleic acids are then cloned into an appropriate vector and the activities of the polypeptides encoded by the mutagenized nucleic acids are evaluated.
- Variants can also be created using oligonucleotide directed mutagenesis to generate site-specific mutations in any cloned DNA of interest.
- Oligonucleotide mutagenesis is described in, for example, Reidhaar-Olson et al, Science 241 :53-57, 1988. Briefly, in such procedures a plurality of double stranded oligonucleotides bearing one or more mutations to be introduced into the cloned DNA are synthesized and inserted into the cloned DNA to be mutagenized (e.g. , a fatty aldehyde biosynthetic polynucleotide sequence). Clones containing the mutagenized DNA are recovered, and the activities of the polypeptides they encode are assessed.
- mutagenized e.g. , a fatty aldehyde biosynthetic polynucleotide sequence
- Assembly PCR involves the assembly of a PCR product from a mixture of small DNA fragments. A large number of different PCR reactions occur in parallel in the same vial, with the products of one reaction priming the products of another reaction. Assembly PCR is described in, for example, U.S. Pat. No. 5,965,408.
- Still another method of generating variants is sexual PCR mutagenesis.
- sexual PCR mutagenesis forced homologous recombination occurs between DNA molecules of different, but highly related, DNA sequence in vitro as a result of random fragmentation of the DNA molecule based on sequence homology. This is followed by fixation of the crossover by primer extension in a PCR reaction.
- Sexual PCR mutagenesis is described in, for example, Stemmer, PNAS, USA 91 : 10747-10751 , 1994.
- Recursive ensemble mutagenesis can also be used to generate variants.
- Recursive ensemble mutagenesis is an algorithm for protein engineering (i.e., protein mutagenesis) developed to produce diverse populations of phenotypically related mutants whose members differ in amino acid sequence. This method uses a feedback mechanism to control successive rounds of combinatorial cassette mutagenesis. Recursive ensemble mutagenesis is described in, for example, Arkin et al, PNAS, USA 89:781 1 -7815, 1992.
- variants are created using exponential ensemble mutagenesis.
- Exponential ensemble mutagenesis is a process for generating combinatorial libraries with a high percentage of unique and functional mutants, wherein small groups of residues are randomized in parallel to identify, at each altered position, amino acids which lead to functional proteins.
- Exponential ensemble mutagenesis is described in, for example, Delegrave et al , Biotech. Res. 1 1 : 1548-1552, 1993. Random and site-directed mutageneses are described in, for example, Arnold, Curr. Opin. Biotech. 4:450-455, 1993.
- variants are created using shuffling procedures wherein portions of a plurality of nucleic acids that encode distinct polypeptides are fused together to create chimeric nucleic acid sequences that encode chimeric polypeptides as described in, for example, U.S. Pat. Nos. 5,965,408 and 5,939,250.
- Polynucleotide variants also include nucleic acid analogs.
- Nucleic acid analogs can be modified at the base moiety, sugar moiety, or phosphate backbone to improve, for example, stability, hybridization, or solubility of the nucleic acid. Modifications at the base moiety include deoxyuridine for deoxythymidine and 5-methyl-2'-deoxycytidine or 5-bromo-2'- doxycytidine for deoxycytidine. Modifications of the sugar moiety include modification of the 2' hydroxyl of the ribose sugar to form 2'-0-methyl or 2'-0-allyl sugars.
- the deoxyribose phosphate backbone can be modified to produce morpholino nucleic acids, in which each base moiety is linked to a six-membered, morpholino ring, or peptide nucleic acids, in which the deoxyphosphate backbone is replaced by a pseudopeptide backbone and the four bases are retained.
- morpholino nucleic acids in which each base moiety is linked to a six-membered, morpholino ring, or peptide nucleic acids, in which the deoxyphosphate backbone is replaced by a pseudopeptide backbone and the four bases are retained.
- deoxyphosphate backbone can be replaced with, for example, a phosphorothioate or phosphorodithioate backbone, a phosphoroamidite, or an alkyl phosphotriester backbone.
- Biosynthetic polypeptide variants can be variants in which one or more amino acid residues are substituted with a conserved or non-conserved amino acid residues.
- biosynthetic polypeptide variants are variants in which one or more amino acid residues are substituted with a conserved amino acid residue.
- Such substituted amino acid residue may or may not be one encoded by a genetic code.
- polypeptide by another amino acid of similar characteristics Typical conservative substitutions are the following replacements: replacement of an aliphatic amino acid, such as alanine, valine, leucine, and isoleucine, with another aliphatic amino acid; replacement of a serine with a threonine or vice versa; replacement of an acidic residue, such as aspartic acid and glutamic acid, with another acidic residue; replacement of a residue bearing an amide group, such as asparagine and glutamine, with another residue bearing an amide group; exchange of a basic residue, such as lysine and arginine, with another basic residue; and replacement of an aromatic residue, such as phenylalanine and tyrosine, with another aromatic residue.
- replacement of an aliphatic amino acid such as alanine, valine, leucine, and isoleucine
- replacement of an acidic residue such as aspartic acid and glutamic
- polypeptide variants are those in which one or more amino acid residues include a substituent group. Still other polypeptide variants are those in which the polypeptide is associated with another compound, such as a compound to increase the half-life of the polypeptide (e.g., polyethylene glycol).
- a compound to increase the half-life of the polypeptide e.g., polyethylene glycol
- Additional polypeptide variants are those in which additional amino acids are fused to the polypeptide, such as a leader sequence, a secretory sequence, a proprotein sequence, or a sequence which facilitates purification, enrichment, or stabilization of the polypeptide.
- the polypeptide variants retain the same biological function as a the native polypeptide, for example, retain fatty alcohol biosynthetic activity, such as fatty aldehyde reductase, alcohol dehydrogenase, aldo-keto reductase, short-chain alcohol
- the polypeptide variants have at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%), at least about 85%», at least about 90%>, at least about 91 %, at least about 92%», at least about 93%, at least about 94%», at least about 95%, or more than about 95%» homology to the native or wild-type sequence.
- the polypeptide variants include a fragment comprising at least about 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof.
- polypeptide variants or fragments thereof can be obtained by isolating nucleic acids encoding them using techniques described herein or by expressing synthetic nucleic acids encoding them. Alternatively, polypeptide variants or fragments thereof can be obtained through biochemical enrichment or purification procedures. The sequence of polypeptide variants or fragments can be determined by proteolytic digestion, gel electrophoresis, and/or
- sequence of the polypeptide variants or fragments can then be compared to the native or wild-type sequence using any of the programs described herein.
- the polypeptide variants and fragments thereof can be assayed for fatty aldehydes producing activity, fatty alcohol producing activity or hydrocarbon producing activity using routine methods.
- the polypeptide variants or fragment can be contacted with a substrate (e.g., a fatty acid or fatty aldehyde substrate) under conditions that allow the polypeptide variant to function.
- a substrate e.g., a fatty acid or fatty aldehyde substrate
- a decreased in the level of the substrate or an increase in the level of a fatty aldehydes, fatty alcohol or hydrocarbon, respectively, can be measured to determine the biological activity of the variant or fragment.
- homolog refers to a polynucleotide or a polypeptide comprising a sequence that is at least about 80%» homologous to the corresponding polynucleotide or polypeptide sequence.
- homology can be performed as follows. The sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and nonhomologous sequences can be disregarded for comparison purposes).
- gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and nonhomologous sequences can be disregarded for comparison purposes.
- the length of a first sequence that is aligned for comparison purposes is at least about 30%, preferably at least about 40%, more preferably at least about 50%, even more preferably at least about 60%, and even more preferably at least about 70%, at least about 80%, at least about 90%, or about 100% of the length of a second sequence.
- the amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions of the first and second sequences are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position (as used herein, amino acid or nucleic acid "identity" is equivalent to amino acid or nucleic acid "homology").
- the percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps and the length of each gap, which need to be introduced for optimal alignment of the two sequences.
- the comparison of sequences and determination of percent homology between two sequences can be accomplished using a mathematical algorithm, such as BLAST (Altschul et al, J. Mol Biol, 215(3): 403-410 (1990)).
- the percent homology between two amino acid sequences also can be determined using the Needleman and Wunsch algorithm that has been incorporated into the GAP program in the GCG software package, using either a Blossum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1 , 2, 3,4, 5, or 6 (Needleman and Wunsch, J. Mol Biol, 48: 444-453 (1970)).
- the percent homology between two nucleotide sequences also can be determined using the GAP program in the GCG software package, using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1 , 2, 3, 4, 5, or 6.
- One of ordinary skill in the art can perform initial homology calculations and adjust the algorithm parameters accordingly.
- a preferred set of parameters (and the one that should be used if a practitioner is uncertain about which parameters should be applied to determine if a molecule is within a homology limitation of the claims) are a Blossum 62 scoring matrix with a gap penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5.
- hybridizes under low stringency, medium stringency, high stringency, or very high stringency conditions describes conditions for hybridization and washing.
- Guidance for performing hybridization reactions can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1 - 6.3.6. Aqueous and nonaqueous methods are described in that reference and either method can be used.
- hybridization conditions referred to herein are as follows: 1 ) low stringency hybridization conditions in 6X sodium chloride/sodium citrate (SSC) at about 45 °C, followed by two washes in 0.2X SSC, 0.1 % SDS at least at 50 °C (the temperature of the washes can be increased to 55 °C for low stringency conditions); 2) medium stringency hybridization conditions in 6X SSC at about 45 °C, followed by one or more washes in 0.2X SSC, 0.1 % SDS at 60 °C; 3) high stringency hybridization conditions in 6X SSC at about 45 °C, followed by one or more washes in 0.2.X SSC, 0.1 % SDS at 65 °C; and preferably 4) very high stringency hybridization conditions are 0.5M sodium phosphate, 7% SDS at 65 °C, followed by one or more washes at 0.2X SSC, 1 % SDS at 65 °C. Very high stringency conditions (4) are the pref
- the polypeptide is a fragment of any of the polypeptides described herein.
- fragment refers to a shorter portion of a full-length polypeptide or protein ranging in size from four amino acid residues to the entire amino acid sequence minus one amino acid residue.
- a fragment refers to the entire amino acid sequence of a domain of a polypeptide or protein (e.g., a substrate binding domain or a catalytic domain).
- the polypeptide is a mutant or a variant of any of the polypeptides described herein.
- mutant and variant refer to a polypeptide having an amino acid sequence that differs from a wild-type polypeptide by at least one amino acid.
- the mutant or variant can comprise one or more of the following conservative amino acid substitutions: replacement of an aliphatic amino acid, such as alanine, valine, leucine, and isoleucine, with another aliphatic amino acid; replacement of a serine with a threonine; replacement of a threonine with a serine; replacement of an acidic residue, such as aspartic acid and glutamic acid, with another acidic residue; replacement of a residue bearing an amide group, such as asparagine and glutamine, with another residue bearing an amide group; exchange of a basic residue, such as lysine and arginine, with another basic residue; and replacement of an aromatic residue, such as phenylalanine and tyrosine, with another aromatic residue.
- the mutant polypeptide has about 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, or more amino acid substitutions, additions, insertions, or deletions
- Preferred fragments or mutants of a polypeptide retain some or all of the biological function (e.g., enzymatic activity) of the corresponding wild-type polypeptide. In some embodiments, the fragment or mutant retains at least 75%, at least 80%, at least 90%, at least 95%, or at least 98% or more of the biological function of the corresponding wild-type polypeptide. In other embodiments, the fragment or mutant retains about 100% of the biological function of the corresponding wild-type polypeptide. Guidance in determining which amino acid residues may be substituted, inserted, or deleted without affecting biological activity may be found using computer programs well known in the art, for example, LASERGENETM software (DNASTAR, Inc., Madison, WI).
- a fragment or mutant exhibits increased biological function as compared to a corresponding wild-type polypeptide.
- a fragment or mutant may display at least a 10%, at least a 25%, at least a 50%, at least a 75%, or at least a 90%
- the fragment or mutant displays at least 100% (e.g., at least 200%, or at least 500%) improvement in enzymatic activity as compared to the corresponding wild-type polypeptide.
- polypeptides described herein may have additional conservative or non-essential amino acid substitutions, which do not have a substantial effect on the polypeptide function. Whether or not a particular substitution will be tolerated (i.e., will not adversely affect desired biological function, such as DNA binding or enzyme activity) can be determined as described in Bowie et al. (Science, 247: 1306-1310 (1990)).
- a "conservative amino acid substitution” is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain.
- Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine), and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine).
- the fatty acid or fatty acid derivative biosynthetic polypeptide or polynucleotide is from a bacterium, a cyanobacterium, an algae, a plant, an insect, a yeast, a fungus, or a mammal.
- the polypeptide is from a mammalian cell, plant cell, insect cell, fungus cell, cyanobacterial cell, algal cell, bacterial cell, or any other organisms described herein.
- a polynucleotide (or gene) sequence is provided to the host cell by way of a recombinant vector, which comprises a promoter operably linked to the polynucleotide sequence.
- the promoter is a developmentally-regulated, an organelle-specific, a tissue-specific, an inducible, a constitutive, or a cell-specific promoter.
- the recombinant vector comprises at least one sequence selected from the group consisting of (a) an expression control sequence operatively coupled to the polynucleotide sequence; (b) a selection marker operatively coupled to the polynucleotide sequence; (c) a marker sequence operatively coupled to the polynucleotide sequence; (d) a purification moiety operatively coupled to the polynucleotide sequence; (e) a secretion sequence operatively coupled to the polynucleotide sequence; and (f) a targeting sequence operatively coupled to the polynucleotide sequence.
- the expression vectors described herein include a polynucleotide sequence described herein in a form suitable for expression of the polynucleotide sequence in a host cell. It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression of polypeptide desired, etc.
- the expression vectors described herein can be introduced into host cells to produce polypeptides, including fusion polypeptides, encoded by the polynucleotide sequences as described herein.
- Fusion vectors add a number of amino acids to a polypeptide encoded therein, usually to the amino- or carboxy- terminus of the recombinant polypeptide.
- Such fusion vectors typically serve one or more of the following three purposes: (1 ) to increase expression of the recombinant polypeptide; (2) to increase the solubility of the recombinant polypeptide; and (3) to aid in the purification of the recombinant polypeptide by acting as a ligand in affinity purification.
- a proteolytic cleavage site is introduced at the junction of the fusion moiety and the recombinant polypeptide. This enables separation of the recombinant polypeptide from the fusion moiety after purification of the fusion polypeptide.
- enzymes include Factor Xa, thrombin, and enterokinase.
- Exemplary fusion expression vectors include pGEX (Pharmacia Biotech, Inc., Piscataway, NJ; Smith et al., Gene, 67: 31 -40 (1988)), pMAL (New England Biolabs, Beverly, MA), and pRITS (Pharmacia Biotech, Inc., Piscataway, N.J.), which fuse glutathione S-transferase (GST), maltose E binding protein, or protein A,
- Examples of inducible, non-fusion E. coli expression vectors include pTrc (Amann et al, Gene (1988) 69:301 -315) and pET l i d (Studier et al , Gene Expression Technology:
- Target gene expression from the pTrc vector relies on host RNA polymerase transcription from a hybrid trp- lac fusion promoter.
- Target gene expression from the pET l id vector relies on transcription from a T7 gnlO-lac fusion promoter mediated by a coexpressed viral RNA polymerase (T7 gnl ). This viral polymerase is supplied by host strains BL21(DE3) or HMS 174(DE3) from a resident ⁇ prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 promoter.
- Suitable expression systems for both prokaryotic and eukaryotic cells are well known in the art; see, e.g., Sambrook et al., "Molecular Cloning: A Laboratory Manual," second edition, Cold Spring Harbor Laboratory, (1989).
- Examples of inducible, non-fusion E. coli expression vectors include pTrc (Amann et al., Gene, 69: 301 -315 (1988)) and PET 1 Id (Studier et al., Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA, pp. 60- 89 (1990)).
- a polynucleotide sequence of the invention is operably linked to a promoter derived from bacteriophage T5.
- the host cell is a yeast cell.
- the expression vector is a yeast expression vector.
- yeast expression vectors for expression in yeast include pYepSecl (Baldari et al, EMBO J., 6: 229-234 (1 87)), pMFa (Kurjan et al., Cell, 30: 933-943 (1982)), pJRY88 (Schultz et al., Gene, 54: 1 13-123 (1987)), pYES2 (Invitrogen Corp., San Diego, CA), and picZ (Invitrogen Corp., San Diego, CA).
- a polypeptide described herein can be expressed in insect cells using baculovirus expression vectors.
- Baculovirus vectors available for expression of proteins in cultured insect cells include, for example, the pAc series (Smith et al , Mol. Cell Biol. (1983) 3 :2156-2165) and the pVL series (Lucklow et al. , Virology (1989) 170:31 -39).
- the nucleic acids described herein can be expressed in mammalian cells using a mammalian expression vector.
- mammalian expression vectors include pCDM8 (Seed, Nature (1987) 329:840) and pMT2PC (Kaufman et al , EMBO J. (1987) 6: 187-195).
- the expression vector's control functions can be provided by viral regulatory elements.
- commonly used promoters are derived from polyoma, Adenovirus 2, cytomegalovirus, and Simian Virus 40.
- Vectors can be introduced into prokaryotic or eukaryotic cells via a variety of art- recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell. Suitable methods for transforming or transfecting host cells can be found in, for example, Sambrook et al. (supra).
- a gene that encodes a selectable marker e.g., resistance to an antibiotic
- selectable markers include those that confer resistance to drugs such as, but not limited to, ampicillin, kanamycin, chloramphenicol, or tetracycline.
- Nucleic acids encoding a selectable marker can be introduced into a host cell on the same vector as that encoding a polypeptide described herein or can be introduced on a separate vector. Cells stably transformed with the introduced nucleic acid can be identified by growth in the presence of an appropriate selection drug. [00238] Similarly, for stable transfection of mammalian cells, it is known that, depending upon the expression vector and transfection technique used, only a small fraction of cells may integrate the foreign DNA into their genome. In order to identify and select these integrants, a gene that encodes a selectable marker (e.g., resistance to an antibiotic) can be introduced into the host cells along with the gene of interest.
- a selectable marker e.g., resistance to an antibiotic
- Preferred selectable markers include those which confer resistance to drugs, such as G418, hygromycin, and methotrexate.
- Nucleic acids encoding a selectable marker can be introduced into a host cell on the same vector as that encoding a polypeptide described herein or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid can be identified by growth in the presence of an appropriate selection drug.
- a "host cell” is a cell used to produce a product described herein (e.g., a fatty aldehydes, a fatty alcohol or a hydrocarbon).
- a host cell is referred to as an "engineered host cell” or a “recombinant host cell” if the expression of one or more polynucleotides or polypeptides in the host cell are altered or modified as compared to their expression in a corresponding wild-type host cell under the same conditions.
- the host cell can be selected from the group consisting of a eukaryotic plant, algae, cyanobacterium, green-sulfur bacterium, green non-sulfur bacterium, purple sulfur bacterium, purple non-sulfur bacterium, extremophile, yeast, fungus, engineered organisms thereof, or a synthetic organism.
- the host cell is light dependent or fixes carbon.
- the host cell is light dependent or fixes carbon.
- the host cell has autotrophic activity.
- Various host cells can be used to produce fatty aldehydes, fatty alcohols and hydrocarbons, as described herein.
- a host cell can be any prokaryotic or eukaryotic cell.
- a gene encoding a polypeptide described herein e.g., a fatty aldehyde biosynthetic polypeptide, or an acyl-ACP reductase polypeptide, and/or a fatty alcohol biosynthetic polypeptide
- bacterial cells such as E.
- coli coli
- insect cells yeast
- yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) cells, COS cells, VERO cells, BHK cells, HeLa cells, Cvl cells, MDCK cells, 293 cells, 3T3 cells, or PC 12 cells).
- mammalian cells such as Chinese hamster ovary cells (CHO) cells, COS cells, VERO cells, BHK cells, HeLa cells, Cvl cells, MDCK cells, 293 cells, 3T3 cells, or PC 12 cells).
- Exemplary host cells can be from the genus Escherichia, Bacillus, Lactobacillus, Rhodococcus, Pseudomonas, Aspergillus, Trichoderma, Neurospora, Fusarium, Humicola, Rhizomucor, Kluyveromyces, Pichia, Mucor, Myceliophtora, Penicillium, Phanerochaete, Pleurotus, Trametes, Chrysosporium, Saccharomyces, Schizosaccharomyces, Yarrowia, or Streptomyces.
- the host cell is a Gram-positive bacterial cell. In other embodiments, the host cell is a Gram-negative bacterial cell.
- the host cell is selected from the genus Escherichia, Bacillus, Lactobacillus, Rhodococcus, Pseudomonas, Aspergillus, Trichoderma, Neurospora, Fusarium, Humicola, Rhizomucor, Kluyveromyces, Pichia, Mucor, Myceliophtora, Penicillium,
- Phanerochaete Pleurotus, Trametes, Chrysosporium, Saccharomyces, Stenotrophamonas, Schizosaccharomyces, Yarrowia, or Streptomyces.
- the host cell is a Bacillus lentus cell, a Bacillus brevis cell, a Bacillus stearothermophilus cell, a Bacillus licheniformis cell, a Bacillus alkalophilus cell, a Bacillus coagulans cell, a Bacillus circulans cell, a Bacillus pumilis cell, a Bacillus thuringiensis cell, a Bacillus clausii cell, a Bacillus megaterium cell, a Bacillus subtilis cell, or a Bacillus amyloliquefaciens cell.
- the host cell is a Trichoderma koningii cell, a Trichoderma viride cell, a Trichoderma reesei cell, a Trichoderma longibrachiatum cell, an Aspergillus awamori cell, an Aspergillus fumigates cell, an Aspergillus foetidus cell, an Aspergillus nidulans cell, an Aspergillus niger cell, an Aspergillus oryzae cell, a Humicola insolens cell, a Humicola lanuginose cell, a Rhodococcus opacus cell, a Rhizomucor miehei cell, or a Mucor michei cell.
- the host cell is a Streptomyces lividans cell or a
- the host cell is an Actinomycetes cell.
- the host cell is a Saccharomyces cerevisiae cell.
- Additional host cells that can be used in the methods described herein are described in WO2009/1 1 1513 and WO2009/1 1 1672.
- Transport proteins can export polypeptides and organic compounds ⁇ e.g. , fatty alcohols) out of a host cell.
- Many transport and efflux proteins serve to excrete a wide variety of compounds and can be naturally modified to be selective for particular types of hydrocarbons.
- Non-limiting examples of suitable transport proteins are ATP-Binding Cassette (ABC) transport proteins, efflux proteins, and fatty acid transporter proteins (FATP). Additional non-limiting examples of suitable transport proteins include the ABC transport proteins from organisms such as Caenorhabditis elegans, Arabidopsis thalania, Alkaligenes eutrophus, and Rhodococcus erythropolis. Exemplary ABC transport proteins include, without limitation, CER5 [Accession No: Atl g 51510, AY734542, At3g 2190, or Atl g51460], AtMRP5 [Accession No.
- Host cells can also be chosen for their endogenous ability to secrete organic compounds.
- the efficiency of organic compound production and secretion into the host cell environment can be expressed as a ratio of intracellular product to extracellular product. In some examples, the ratio can be about 5: 1 , 4: 1 , 3 : 1 , 2: 1 , 1 : 1 , 1 :2, 1 :3, 1 :4, or 1 :5.
- the production and isolation of fatty alcohols can be enhanced by employing beneficial fermentation techniques.
- One method for maximizing production while reducing costs is increasing the percentage of the carbon source that is converted to hydrocarbon products.
- Genes that can be activated to stop cell replication and growth in E. coli include umuDC genes. The overexpression of umuDC genes stops the progression from stationary phase to exponential growth (Murli et al, J. of Bad. 182: 1 127, 2000).
- UmuC is a DNA polymerase that can carry out translesion synthesis over non-coding lesions - the mechanistic basis of most UV and chemical mutagenesis.
- the umuDC gene products are involved in the process of translesion synthesis and also serve as a DNA sequence damage checkpoint.
- the umuDC gene products include UmuC, UmuD, umuD', UmuD' 2 C, UmuD' 2 , and UmuD 2 .
- product-producing genes can be activated, thus minimizing the need for replication and maintenance pathways to be used while a fatty aldehyde is being made.
- Host cells can also be engineered to express umuC and umuD from E. coli in pBAD24 under the prpBCDE promoter system through de novo synthesis of this gene with the appropriate end-product production genes.
- the percentage of input carbons converted to fatty alcohols can be a cost driver.
- the more efficient the process is i.e., the higher the percentage of input carbons converted to fatty alcohols), the less expensive the process will be.
- oxygen-containing carbon sources e.g., glucose and other carbohydrate based sources
- the oxygen must be released in the form of carbon dioxide.
- a carbon atom is also released leading to a maximal theoretical metabolic efficiency of approximately 34% (w/w) (for fatty acid derived products). This figure, however, changes for other organic compounds and carbon sources. Typical efficiencies in the literature are approximately less than 5%.
- Host cells engineered to produce fatty alcohols can have greater than about 1 , 3, 5, 10, 15, 20, 25, and 30% efficiency. In one example, host cells can exhibit an efficiency of about 10% to about 25%. In other examples, such host cells can exhibit an efficiency of about 25% to about 30%. In other examples, host cells can exhibit greater than 30% efficiency.
- the host cell can be additionally engineered to express recombinant cellulosomes, such as those described in PCT application number PCT/US2007/003736. These cellulosomes can allow the host cell to use cellulosic material as a carbon source.
- the host cell can be additionally engineered to express invertases (EC 3.2.1.26) so that sucrose can be used as a carbon source.
- the host cell can be engineered using the teachings described in U.S. Patent Nos. 5,000,000; 5,028,539; 5,424,202; 5,482,846; and 5,602,030; so that the host cell can assimilate carbon efficiently and use cellulosic materials as carbon sources.
- the fermentation chamber can enclose a fermentation that is undergoing a continuous reduction.
- a stable reductive environment can be created.
- the electron balance can be maintained by the release of carbon dioxide (in gaseous form).
- Efforts to augment the NAD/H and NADP/H balance can also facilitate in stabilizing the electron balance.
- the availability of intracellular NADPH can also be enhanced by engineering the host cell to express an NADH:NADPH transhydrogenase.
- the expression of one or more NADH:NADPH transhydrogenases converts the NADH produced in glycolysis to NADPH, which can enhance the production of fatty alcohols.
- the engineered host cells can be grown in batches of, for example, about 100 mL, 500 mL, 1 L, 2 L, 5 L, or 10 L; fermented; and induced to express desired fatty aldehyde biosynthetic genes and/or an alcohol dehydrogenase genes based on the specific genes encoded in the appropriate plasmids.
- the engineered host cells can be grown in batches of about 10 L, 100 L, 1000 L, 10,000 L, 100,000 L, 1,000,000 L or larger; fermented; and induced to express desired fatty aldehyde biosynthetic genes and/or alcohol dehydrogenase genes based on the specific genes encoded in the appropriate plasmids or incorporated into the host cell's genome.
- a suitable production host such as E. coli cells, harboring plasmids containing the desired genes or having the genes integrated in its chromosome can be incubated in a suitable reactor, for example a 1 L reactor, for 20 hours at 37 °C in M9 medium
- the production host can be induced with IPTG alcohol. After incubation, the spent media can be extracted and the organic phase can be examined for the presence of fatty alcohols using GC-MS.
- aliquots of no more than about 10% of the total cell volume can be removed each hour and allowed to sit without agitation to allow the fatty alcohols to rise to the surface and undergo a spontaneous phase separation or precipitation.
- the fatty alcohol component can then be collected, and the aqueous phase returned to the reaction chamber.
- the reaction chamber can be operated continuously. When the OD 6 oo drops below 0.6, the cells can be replaced with a new batch grown from a seed culture.
- a fatty alcohol can be produced using a purified polypeptide ⁇ e.g., a fatty alcohol biosynthetic polypeptide) described herein and a substrate (e.g., fatty aldehyde), produced, for example, by a method described herein.
- a host cell can be engineered to express a fatty alcohol biosynthetic polypeptide or variant as described herein.
- the host cell can be cultured under conditions suitable to allow expression of the polypeptide.
- Cell free extracts can then be generated using known methods.
- the host cells can be lysed using detergents or by sonication.
- the expressed polypeptides can be purified using known methods.
- substrates described herein can be added to the cell free extracts and maintained under conditions to allow conversion of the substrates (e.g., fatty aldehydes) to fatty alcohols.
- the fatty alcohols can then be separated and purified using known techniques.
- a fatty aldehyde can be converted into a fatty alcohol by contacting the fatty aldehyde with a fatty alcohol biosynthetic polypeptide provided herein, or a variant thereof.
- a fatty aldehyde can be converted into a fatty alcohol by contacting the fatty aldehyde with a fatty alcohol biosynthetic polypeptide that is an AdhP homolog of FIG. 2, a DkgA homolog of FIG. 3, a DkgB homolog of FIG. 4, a Tas homolog of FIG. 5, an RspB homolog of FIG. 6, a Yah homolog of FIG. 7, a YbbO homolog of FIG.
- the fatty alcohols produced during fermentation can be separated from the fermentation media. Any known technique for separating fatty alcohols from aqueous media can be used.
- One exemplary separation process is a two phase (bi-phasic) separation process. This process involves fermenting the genetically engineered host cells under conditions sufficient to produce fatty alcohols, allowing the fatty alcohol to collect in an organic phase, and separating the organic phase from the aqueous fermentation broth. This method can be practiced in both a batch and continuous fermentation processes.
- Bi-phasic separation uses the relative immiscibility of fatty alcohols to facilitate separation. Immiscible refers to the relative inability of a compound to dissolve in water and is defined by the compound's partition coefficient.
- the fatty alcohols produced by the methods described herein can be relatively immiscible in the fermentation broth, as well as in the cytoplasm. Therefore, the fatty alcohol can collect in an organic phase either intracellularly or extracellularly. The collection of the products in the organic phase can lessen the impact of the fatty alcohol on cellular function and can allow the host cell to produce more product.
- the methods described herein can result in the production of homogeneous compounds wherein at least about 60%, 70%, 80%, 90%, or 95% of the fatty alcohols produced will have carbon chain lengths that vary by less than about 6 carbons, less than about 4 carbons, or less than about 2 carbons. These compounds can also be produced with a relatively uniform degree of saturation. These compounds can be used directly as fuels, fuel additives, starting materials for production of other chemical compounds (e.g., polymers, surfactants, plastics, textiles, solvents, adhesives, etc.), or personal care additives. These compounds can also be used as feedstock for subsequent reactions, for example, hydrogenation, catalytic cracking (e.g., via hydro genati on, pyrolisis, or both), to make other products.
- chemical compounds e.g., polymers, surfactants, plastics, textiles, solvents, adhesives, etc.
- the fatty alcohols produced using methods described herein can contain between about 50% and about 90% carbon; or between about 5% and about 25% hydrogen. In other embodiments, the fatty alcohols produced using methods described herein can contain between about 65% and about 85% carbon; or between about 10% and about 15% hydrogen.
- the host cell is a Gram-positive bacterial cell. In other embodiments, the host cell is a Gram-negative bacterial cell.
- the host cell is selected from the genus Escherichia, Bacillus, Lactobacillus, Rhodococcus, Pseudomonas, Aspergillus, Trichoderma, Neurospora, Fusarium, Humicola, Rhizomucor, Kluyveromyces, Pichia, Mucor, Myceliophtora, Penicillium, Phanerochaete, Pleurotus, Trametes, Chrysosporium, Saccharomyces, Stenotrophamonas, Schizosaccharomyces, Yarrowia, or Streptomyces.
- the host cell is a Bacillus lentus cell, a Bacillus brevis cell, a Bacillus stearothermophilus cell, a Bacillus lichen formis cell, a Bacillus alkalophilus cell, a Bacillus coagulans cell, a Bacillus circulans cell, a Bacillus pumilis cell, a Bacillus thuringiensis cell, a Bacillus clausii cell, a Bacillus megaterium cell, a Bacillus subtilis cell, or a Bacillus amyloliquefaciens cell.
- the host cell is a Trichoderma koningii cell, a Trichoderma viride cell, a Trichoderma reesei cell, a Trichoderma longibrachiatum cell, an Aspergillus awamori cell, an Aspergillus fumigates cell, an Aspergillus foetidus cell, an Aspergillus nidulans cell, an Aspergillus niger cell, an Aspergillus oryzae cell, a Humicola insolens cell, a Humicola lanuginose cell, a Rhodococcus opacus cell, a Rhizomucor miehei cell, or a Mucor michei cell.
- the host cell is a Streptomyces lividans cell or a Streptomyces murinus cell.
- the host cell is an Actinomycetes cell.
- the host cell is a Saccharomyces cerevisiae cell. In some embodiments, the host cell is a Saccharomyces cerevisiae cell.
- the host cell is a CHO cell, a COS cell, a VERO cell, a BHK cell, a HeLa cell, a Cvl cell, an MDCK cell, a 293 cell, a 3T3 cell, or a PC 12 cell.
- the host cell is a cell from an eukaryotic plant, algae,
- the host cell is light-dependent or fixes carbon. In some embodiments, the host cell is light-dependent or fixes carbon. In some embodiments, the host cell has autotrophic activity. In some embodiments, the host cell has photoautotrophic activity, such as in the presence of light. In some embodiments, the host cell is heterotrophic or mixotrophic in the absence of light. In certain embodiments, the host cell is a cell from
- PCC 6803 Thermosynechococcus elongates BP-1, Chlorobium tepidum, Chlorojlexus auranticus, Chromatiumm vinosum, Rhodospirillum rubrum, Rhodobacter capsulatus, Rhodopseudomonas palusris, Clostridium ljungdahlii, Clostridiuthermocellum, Penicillium chrysogenum, Pichia pastoris, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Pseudomonasjluorescens, or Zymomonas mobilis.
- the host cell is an E. coli cell.
- the E. coli cell is a strain B, a strain C, a strain K, or a strain W E. coli cell.
- the host cell is a Pantoea citrea cell.
- condition permissive for the production means any conditions that allow a host cell to produce a desired product, such as a fatty acid or a fatty acid derivative.
- condition in which the polynucleotide sequence of a vector is expressed means any conditions that allow a host cell to synthesize a polypeptide. Suitable conditions include, for example, fermentation conditions. Fermentation conditions can comprise many parameters, such as temperature ranges, levels of aeration, and media composition. Each of these conditions, individually and in combination, allows the host cell to grow. Exemplary culture media include broths or gels. Generally, the medium includes a carbon source that can be metabolized by a host cell directly
- a host cell can be cultured, for example, for about 4, 8, 12, 24, 36, 48, 72, or more hours. During and/or after culturing, samples can be obtained and analyzed to determine if the conditions allow production or expression. For example, the host cells in the sample or the medium in which the host cells were grown can be tested for the presence of a desired product.
- assays such as, but not limited to, MS, thin layer chromatography (TLC), high-performance liquid chromatography (HPLC), liquid chromatography (LC), GC coupled with a flame ionization detector (FID), GC-MS, and LC-MS can be used.
- TLC thin layer chromatography
- HPLC high-performance liquid chromatography
- LC liquid chromatography
- FID flame ionization detector
- GC-MS LC-MS
- LC-MS LC-MS
- the host cell can be additionally engineered to express a recombinant cellulosome, which can allow the host cell to use cellulosic material as a carbon source.
- exemplary cellulosomes suitable for use in the methods of the invention include, e.g, the cellulosomes described in International Patent Application Publication WO 2008/100251.
- the host cell also can be engineered to assimilate carbon efficiently and use cellulosic materials as carbon sources according to methods described in U.S. Patents 5,000,000; 5,028,539; 5,424,202; 5,482,846; and 5,602,030.
- the host cell can be engineered to express an invertase so that sucrose can be used as a carbon source.
- the fermentation chamber encloses a fermentation that is undergoing a continuous reduction, thereby creating a stable reductive environment.
- the electron balance can be maintained by the release of carbon dioxide (in gaseous form).
- Efforts to augment the NAD/H and NADP/H balance can also facilitate in stabilizing the electron balance.
- the availability of intracellular NADPH can also be enhanced by engineering the host cell to express an NADH:NADPH transhydrogenase.
- the expression of one or more NADH:NADPH transhydrogenases converts the NADH produced in glycolysis to NADPH, which can enhance the production of fatty aldehydes and fatty alcohols.
- the engineered host cells can be grown in batches of, for example, about 100 mL, 500 mL, 1 L, 2 L, 5 L, or 10 L; fermented; and induced to express a desired polynucleotide sequence, such as a polynucleotide sequence encoding a PPTase.
- a desired polynucleotide sequence such as a polynucleotide sequence encoding a PPTase.
- the engineered host cells can be grown in batches of about 10 L, 100 L, 1000 L, 10,000 L, 100,000 L, 1 ,000,000 L or larger; fermented; and induced to express a desired polynucleotide sequence.
- the fatty acids and derivatives thereof produced by the methods of invention generally are isolated from the host cell.
- isolated as used herein with respect to products, such as fatty acids and derivatives thereof, refers to products that are separated from cellular components, cell culture media, or chemical or synthetic precursors.
- the fatty acids and derivatives thereof produced by the methods described herein can be relatively immiscible in the fermentation broth, as well as in the cytoplasm. Therefore, the fatty acids and derivatives thereof can collect in an organic phase either intracellularly or extracellularly. The collection of the products in the organic phase can lessen the impact of the fatty acid or fatty acid derivative on cellular function and can allow the host cell to produce more product.
- the fatty acids and fatty acid derivatives produced by the methods of invention are purified.
- purify As used herein, the term “purify,” “purified,” or
- purification means the removal or isolation of a molecule from its environment by, for example, isolation or separation.
- substantially purified molecules are at least about 60% free (e.g., at least about 70% free, at least about 75% free, at least about 85% free, at least about 90% free, at least about 95% free, at least about 97% free, at least about 99% free) from other components with which they are associated.
- these terms also refer to the removal of contaminants from a sample. For example, the removal of contaminants can result in an increase in the percentage of a fatty aldehyde or a fatty alcohol in a sample.
- the fatty aldehyde or fatty alcohol when a fatty aldehyde or a fatty alcohol is produced in a host cell, the fatty aldehyde or fatty alcohol can be purified by the removal of host cell proteins. After purification, the percentage of a fatty acid or derivative thereof in the sample is increased.
- a purified fatty acid or derivative thereof is a fatty acid or derivative thereof that is substantially separated from other cellular components (e.g., nucleic acids, polypeptides, lipids, carbohydrates, or other hydrocarbons).
- a purified fatty acid preparation or a purified fatty acid derivative preparation is a fatty acid preparation or a fatty acid derivative preparation in which the fatty acid or derivative thereof is substantially free from contaminants, such as those that might be present following fermentation.
- a fatty acid or derivative thereof is purified when at least about 50% by weight of a sample is composed of the fatty acid or fatty acid derivative.
- a fatty acid or derivative thereof is purified when at least about 60%, e.g., at least about 70%, at least about 80%, at least about 85%, at least about 90%, at least about 92% or more by weight of a sample is composed of the fatty acid or derivative thereof.
- a fatty acid or derivative thereof is purified when less than about 100%, e.g., less than about 99%, less than about 98%, less than about 95%, less than about 90%, or less than about 80%) by weight of a sample is composed of the fatty acid or derivative thereof.
- a purified fatty acid or derivative thereof can have a purity level bounded by any two of the above endpoints.
- a fatty acid or derivative thereof can be purified when at least about 80%-95%, at least about 85%-99%, or at least about 90%-98% of a sample is composed of the fatty acid or fatty acid derivative.
- the fatty acid or derivative thereof may be present in the extracellular environment, or it may be isolated from the extracellular environment of the host cell. In certain embodiments, a fatty acid or derivative thereof is secreted from the host cell. In other embodiments, a fatty acid or derivative thereof is transported into the extracellular environment. In yet other embodiments, the fatty acid or derivative thereof is passively transported into the extracellular environment.
- a fatty acid or derivative thereof can be isolated from a host cell using methods known in the art, such as those disclosed in International Patent Application Publications WO 2010/042664 and WO 2010/062480.
- the methods described herein can result in the production of homogeneous compounds wherein at least about 60%, at least about 70%, at least about 80%>, at least about 90%, or at least about 95%, of the fatty acids or fatty acid derivatives produced will have carbon chain lengths that vary by less than 6 carbons, less than 5 carbons, less than 4 carbons, less than 3 carbons, or less than about 2 carbons.
- the methods described herein can result in the production of homogeneous compounds wherein less than about 98%, less than about 95%, less than about 90%, less than about 80%, or less than about 70% of the fatty acids or fatty acid derivatives produced will have carbon chain lengths that vary by less than 6 carbons, less than 5 carbons, less than 4 carbons, less than 3 carbons, or less than about 2 carbons.
- the fatty acids or fatty acid derivatives can have a degree of homogeneity bounded by any two of the above endpoints.
- the fatty acid or fatty acid derivative can have a degree of homogeneity wherein about 70%-95%, about 80%-98%, or about 90%-95% of the fatty acids or fatty acid derivatives produced will have carbon chain lengths that vary by less than 6 carbons, less than 5 carbons, less than 4 carbons, less than 3 carbons, or less than about 2 carbons. These compounds can also be produced with a relatively uniform degree of saturation.
- one or more of the titer, yield, or productivity of the fatty acid or derivative thereof produced by the engineered host cell having an altered level of expression of a FadR polypeptide is increased relative to that of the
- titer refers to the quantity of fatty acid or fatty acid derivative produced per unit volume of host cell culture.
- a fatty acid or a fatty acid derivative such as a terminal olefin, a fatty aldehyde, a fatty alcohol, an alkane, a fatty ester, a ketone or an internal olefins is produced at a titer of about 25 mg/L, about 50 mg/L, about 75 mg/L, about 100 mg/L, about 125 mg/L, about 150 mg/L, about 175 mg/L, about 200 mg/L, about 225 mg/L, about 250 mg/L, about 275 mg/L, about 300 mg/L, about 325 mg/L, about 350 mg/L, about 375 mg/L, about 400 mg/L, about 425 mg/L, about 450 mg/L, about 475 mg/L, about 500 mg/L, about 525
- a fatty acid or fatty acid derivative is produced at a titer of more than 2000 mg/L, more than 5000 mg/L, more than 10,000 mg/L, or higher, such as 50 g/L, 70 g/L, 100 g/L, 120 g/L, 150 g/L, or 200 g/L.
- yield of the fatty acid or derivative thereof produced by a host cell refers to the efficiency by which an input carbon source is converted to product (i.e., fatty acid or fatty acid derivative such as fatty alcohol or fatty ester) in a host cell.
- product i.e., fatty acid or fatty acid derivative such as fatty alcohol or fatty ester
- yield of the fatty acid or derivative thereof produced by a host cell refers to the efficiency by which an input carbon source is converted to product (i.e., fatty acid or fatty acid derivative such as fatty alcohol or fatty ester) in a host cell.
- oxygen-containing carbon sources e.g., glucose and other carbohydrate based sources
- the oxygen must be released in the form of carbon dioxide.
- a carbon atom is also released leading to a maximal theoretical metabolic efficiency of approximately 34% (w/w) (for fatty acid derived products). This figure, however, changes for other organic compounds and carbon sources. Typical yield reported in the literature are approximately less than 5%.
- Host cells engineered to produce fatty acids and fatty acid derivatives according to the methods of the invention can have a yield of at least about 3%, at least about 5%, at least about 10%, at least about 15%, at least about 18%, or at least about 20%. Alternatively, or in addition, the yield is about 30%) or less, about 27% or less, about 25% or less, or about 22%> or less. Thus, the yield can be bounded by any two of the above endpoints.
- the yield of the fatty acid or derivative thereof produced by the engineered host cell according to the methods of the invention can be about 5% to about 25%, about 10% to about 25%, about 10% to about 22%, about 15% to about 27%, or about 18% to about 22%. In other embodiments, the yield is greater than 30%.
- productivity of the fatty acid or derivative thereof produced by a host cell refers to the quantity of fatty acid or fatty acid derivative produced per unit volume of host cell culture per unit density of host cell culture.
- productivity of a fatty acid or a fatty acid derivative such as an olefin, a fatty aldehyde, a fatty alcohol, an alkane, a fatty ester, or a ketone produced by an engineered host cells is at least about at least about 3 mg/L/OD 6 oo, at least about 6 mg/L/OD 6 oo, at least about 9 mg/L/OD 6 oo, at least about 12 mg/L/OD 6 oo, or at least about 15 mg/L/OD oo-
- the productivity is about 50 mg/L/OD 6 oo or less, about 40 mg/L/OD 6 oo or less, about 30 mg/L/OD 6 o
- the productivity can be bounded by any two of the above endpoints.
- the productivity can be about 3 to about 30 mg/L/OD 6 oo, about 6 to about 20 mg/L/OD 6 oo, or about 15 to about 30 mg/L/OD 600 .
- compositions and methods of the invention the production and isolation of a desired fatty acid or derivative thereof (e.g., acyl-CoA, fatty acids, terminal olefins, fatty aldehydes, fatty alcohols, alkanes, alkenes, wax esters, ketones and internal olefins) can be enhanced by altering the expression of one or more genes involved in the regulation of fatty acid, fatty ester, alkane, alkene, olefin fatty alcohol production, degradation and/or secretion in the engineered host cell.
- a desired fatty acid or derivative thereof e.g., acyl-CoA, fatty acids, terminal olefins, fatty aldehydes, fatty alcohols, alkanes, alkenes, wax esters, ketones and internal olefins
- a desired fatty acid or derivative thereof e.g., acyl-CoA, fatty acids, terminal olefins, fatty
- Bioproducts e.g., fatty alcohols
- biologically produced organic compounds particularly fatty alcohols biologically produced using the fatty acid biosynthetic pathway
- hydrocarbons (and/or fatty aldehydes) described herein can be used as or converted into a fuel or as a specialty chemical.
- a fuel or specialty chemical One of ordinary skill in the art will appreciate that, depending upon the intended purpose of the fuel or specialty chemical, different
- hydrocarbons and/or fatty aldehydes
- a branched hydrocarbon may be desirable for automobile fuels that are intended to be used in cold climates.
- hydrocarbons when hydrocarbons are used as a feedstock for fuel and specialty chemical production, one of ordinarly skill in the art will appreciate that the characteristics of the hydrocarbon will affect the characteristics of the fuel or specialty chemicals produced. Hence the characteristics of the fuel or specialty chemical product can be selected for by producing particular hydrocarbons (and/or fatty aldehydes) for use as a feedstock.
- biofuels having desired fuel qualities can be produced from hydrocarbons (and/or fatty aldehydes). These thus represent a new source of biofuels, which can be used as jet fuels, diesel, or gasoline. Some biofuels made using hydrocarbons (and/or fatty aldehydes) thus prepared have not been produced from renewable sources and are new compositions of matter. These new fuels or specialty chemicals can be distinguished from fuels or specialty chemicals derived from petrochemical carbon on the basis of dual carbon-isotopic fingerprinting. Additionally, the specific source of biosourced carbon (e.g. , glucose vs. glycerol) can be determined by dual carbon-isotopic fingerprinting (see, e.g., U.S. Patent No. 7,169,588, which is herein incorporated by reference).
- biosourced carbon e.g., glucose vs. glycerol
- bioproducts can be distinguished from organic compounds derived from petrochemical carbon on the basis of dual carbon-isotopic fingerprinting ( 13 C/ 12 C) or 14 C dating. Additionally, the specific source of biosourced carbon (e.g., glucose vs. glycerol) can be determined by dual carbon-isotopic fingerprinting (see, e.g., U.S. Patent No. 7,169,588, which is herein incorporated by reference).
- biosourced carbon e.g., glucose vs. glycerol
- Bioproducts can be distinguished from petroleum based organic compounds by
- CI C stable carbon isotope ratio
- the CI C ratio in a given bioproduct is a consequence of the 13 C/ 12 C ratio in atmospheric carbon dioxide at the time the carbon dioxide is fixed. It also reflects the precise metabolic pathway. Regional variations also occur. Petroleum, C 3 plants (the broadleaf), C 4 plants (the grasses), and marine carbonates all show significant differences in 13 C/ 12 C and the corresponding 5 13 C values. Furthermore, lipid matter of C 3 and C 4 plants analyze differently than materials derived from the carbohydrate components of the same plants as a consequence of the metabolic pathway.
- 13 C shows large variations due to isotopic fractionation effects, the most significant of which for bioproducts is the photosynthetic mechanism.
- the major cause of differences in the carbon isotope ratio in plants is closely associated with differences in the pathway of photosynthetic carbon metabolism in the plants, particularly the reaction occurring during the primary carboxylation (i.e. , the initial fixation of atmospheric C0 2 ).
- Two large classes of vegetation are those that incorporate the "C3"(or Calvin- Benson) photosynthetic cycle and those that incorporate the "C 4 " (or Hatch-Slack) photosynthetic cycle.
- Both C 4 and C 3 plants exhibit a range of 13 C/ 12 C isotopic ratios, but typical values are about -7 to about -13 per mil for C 4 plants and about -19 to about -27 per mil for C 3 plants (see, e.g., Stuiver et al, Radiocarbon 19:355, 1977). Coal and petroleum fall generally in this latter range.
- the 13 C measurement scale was originally defined by a zero set by Pee Dee Belemnite (PDB) limestone, where values are given in parts per thousand deviations from this material.
- PDB Pee Dee Belemnite
- the "6 13 C” values are expressed in parts per thousand (per mil), abbreviated, %o, and are calculated as follows:
- compositions described herein include bioproducts produced by any of the methods described herein.
- the bioproduct can have a ⁇ C of about -28 or greater, about -27 or greater, -20 or greater, -18 or greater, -15 or greater, -13 or greater, -10 or greater, or
- the bioproduct can have a ⁇ 13 C of about -30 to about -15, about -27 to about -19, about -25 to about -21, about -15 to about -5, about -13 to about -7, or about -13 to about -10.
- the bioproduct can have a 6 13 C of about -10, -1 1, -12, or -12.3.
- Bioproducts can also be distinguished from petroleum based organic compounds by comparing the amount of 14 C in each compound. Because 14 C has a nuclear half life of 5730 years, petroleum based fuels containing "older” carbon can be distinguished from bioproducts which contain "newer” carbon (see, e.g., Currie, “Source Apportionment of Atmospheric Particles”, Characterization of Environmental Particles, J. Buffle and H. P. van Leeuwen, Eds., 1 of Vol. I of the IUPAC Environmental Analytical Chemistry Series (Lewis Publishers, Inc) (1992) 3-74).
- the fundamental definition relates to 0.95 times the 14 C / 12 C isotope ratio HOxI (referenced to AD 1950). This is roughly equivalent to decay-corrected pre- Industrial Revolution wood.
- fM is approximately 1.1.
- compositions described herein include bioproducts that can have an fM 14 C of at least about 1.
- the bioproduct can have an fM 14 C of at least about 1.01 , an fM 14 C of about 1 to about 1.5, an f M 14 C of about 1.04 to about 1.18, or an f M 14 C of about 1.1 1 1 to about 1.124.
- a biologically based carbon content is derived by assigning "100%" equal to 107.5 pMC and "0%" equal to 0 pMC. For example, a sample measuring 99 pMC will give an equivalent biologically based carbon content of 93%. This value is referred to as the mean biologically based carbon result and assumes all the components within the analyzed material originated either from present day biological material or petroleum based material.
- a bioproduct described herein can have a pMC of at least about 50, 60, 70, 75, 80, 85, 90, 95, 96, 97, 98, 99, or 100. In other instances, a bioproduct described herein can have a pMC of between about 50 and about 100; about 60 and about 100; about 70 and about 100; about 80 and about 100; about 85 and about 100; about 87 and about 98; or about 90 and about 95. In yet other instances, a bioproduct described herein can have a pMC of about 90, 91 , 92, 93, 94, or 94.2.
- the fatty alcohols described herein can be used as or converted into a surfactant or detergent composition.
- a surfactant or detergent composition One of ordinary skill in the art will appreciate that, depending upon the intended purpose of the surfactant or detergent, different fatty alcohols can be produced and used.
- the characteristics of the fatty alcohol feedstock will affect the characteristics of the surfactant or detergent produced.
- the characteristics of the surfactant or detergent product can be selected for by producing particular fatty alcohols for use as a feedstock.
- Fuel additives are used to enhance the performance of a fuel or engine.
- fuel additives can be used to alter the freezing/gelling point, cloud point, lubricity, viscosity, oxidative stability, ignition quality, octane level, and/or flash point.
- all fuel additives must be registered with Environmental Protection Agency.
- the names of fuel additives and the companies that sell the fuel additives are publicly available by contacting the EPA or by viewing the agency's website.
- the fatty alcohol-based biofuels described herein can be mixed with one or more fuel additives to impart a desired quality.
- fatty alcohol-based surfactants and/or detergents described herein can be mixed with other surfactants and/or detergents well known in the art.
- the mixture can include at least about 10%, 15%, 20%, 30%, 40%, 50%, or 60% by weight of the fatty alcohol.
- a surfactant or detergent composition can be made that includes at least about 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 90% or 95% of a fatty alcohol that includes a carbon chain that is 8, 10, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22 carbons in length.
- Such surfactant or detergent compositions can additionally include at least one additive selected from a surfactant; a microemulsion; at least about 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 90%, or 95% of surfactant or detergent from nonmicrobial sources such as plant oils or petroleum.
- hydrocarbon (and/or fatty aldehyde)-based biofuel described herein can be mixed with other fuels, such as various alcohols, such as ethanol and butanol, and petroleum derived products, such as gasoline, diesel, or jet fuel.
- fuels such as various alcohols, such as ethanol and butanol
- petroleum derived products such as gasoline, diesel, or jet fuel.
- the mixture can include at least about 10%, 15%, 20%>, 30%, 40%, 50%, or 60% by weight of the hydrocarbon (and/or fatty aldehydes).
- a biofuel composition can be made that includes at least about 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%), 80%, 85%, 90% or 95% of a hydrocarbon such as an alkane or an alkene that includes a carbon chain that is 8, 10, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22 carbons in length.
- Such biofuel composition can additionally include at least one additive selected from a cloud point lowering additive that can lower the cloud point to less than about 5°C, or 0°C; a surfactant, a microemulsion; at least about 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%), 90%) or 95% diesel fuel from triglycerides; petroleum-derived gasoline; or diesel fuel from petroleum.
- a cloud point lowering additive that can lower the cloud point to less than about 5°C, or 0°C
- a surfactant a microemulsion
- This example describes an experiment verifying that co-expression of a heterologous carboxylic acid reductase from Acinetobacter baylyi ADPl , AlrAadpl (a homolog of
- Acinetobacter sp. M-1 AlrA) and a CarB homolog resulted in fatty alcohol production in E.coli.
- Table 7 CAR-like Protein and the corresponding coding sequences.
- the fadD9 gene was amplified from genomic DNA of Mycobacterium tuberculosis H37Rv (obtained from The University of British Columbia, and Vancouver, BC Canada) using the primers fadD9F and FadDR (see Table 8).
- the PCR product was first cloned into PCR-blunt (Invitrogen) and then released as an Ndel-Avrll fragment.
- the Ndel-Avrll fragment was then cloned between the Ndel and ⁇ vrll sites of pACYCDuet- 1 (Novogen) to generate pACYCDuet- l-fadD9.
- the car A gene was amplified from the genomic DNA of Mycobacterium smegmatis MC2 155 (obtained from the ATCC (ATCC 23037D-5)) using primers CARMCaF and
- CARMCaR (see Table 8).
- the carB gene was amplified from the genomic DNA of
- Mycobacterium smegmatis MC2 155 obtained from the ATCC (ATCC 23037D-5)) using primers CARMCbF and CARMCbR (see Table 8). Each PCR product was first cloned into PCR-blunt and then released as an Ndel-Avrll fragment. Each of the two fragments was then subcloned between the Ndel and ⁇ 4vrII sites of pACYCDuet- 1 (Novogen) to generate pACYCDuet- 1-carA and pACYCDuet- 1-carB.
- Plasmid pETDuet-l-'tesA-alrAadpl was carried out with the protocol below.
- the gene alrAadpl was amplified from the genomic DNA of Acinetobacter baylyi ADP1 by a two-step PCR procedure.
- the first set of PCR reactions eliminated an internal Ncol site at bp 632-636 with the following primer pairs:
- the plasmid pACYCDuet-l-carA encoding the CAR homolog carA
- pETDuet-l-'tesA-alrAadpl see, e.g., FIG. 27A
- the plasmid pACYCDuet-l -carB, encoding the CAR homolog, carB was co-transformed with pETDuet-1 - 'tesA.
- pACYCDuet-l-carB was also separately co-transformed with pETDuet-1 - 'tesA-alrAadpl .
- pACYCDuet-l -carB was co-transformed with the empty vector pETDuet-1 (see, e.g., FIG. 27A).
- pACYCDuet- 1- fadD9 was also separately co -transformed with pETDuet-l-'tesA-alrAadpl .
- pACYCDuet-1 - fadD9 was co-transformed with the empty vector pETDuet-1 (see, e.g. , FIG. 27A).
- E. coli transformants were grown in 3 mL of LB medium supplemented with carbenicillin (100 mg/L) and chloramphenicol (34 mg/L) at 37 °C. After overnight growth, 15 ⁇ of culture was transferred into 2 mL of fresh LB medium supplemented with carbenicillin and chloramphenicol. After 3.5 hours of growth, 2 mL of culture were transferred into a 125 mL flask containing 20 mL of M9 medium with 2% glucose and with carbenicillin and
- Table 9 Acyl-composition of fatty alcohols produced by recombinant E.coli strains.
- GC/MS was performed using an Agilent 5975B MSD system equipped with a 30mx0.25mm (0.1 ⁇ film) DB-5 column.
- the column temperature was 3 min isothermal at 100°C.
- the column was programmed to rise from 100 °C to 320 °C at a rate of 20 °C/min.
- each compound was confirmed by matching the compound's mass spectrum to a standard's mass spectrum in the mass spectra library (e.g., C12:0, C12: l , C13:0, C14:0, C14:l , C15:0. C16:0, C16: l , C17:0, C18:0 and C18:l alcohols).
- a standard's mass spectrum in the mass spectra library e.g., C12:0, C12: l , C13:0, C14:0, C14:l , C15:0.
- C16:0, C16: l , C17:0, C18:0 and C18:l alcohols e.g., C12:0, C12: l , C13:0, C14:0, C14:l , C15:0.
- C16:0, C16: l , C17:0, C18:0 and C18:l alcohols e.g.
- This example describes the identification of a fatty alcohol biosynthetic polypeptide, YjgB, in E.coli.
- E. coli contains multiple enzymes that catalyze the reversible oxidoreduction of fatty aldehydes and fatty alcohols.
- a BLAST search and comparison of the E.coli K12 genomic and protein databases for homologs of Acinetobacter sp. M-l AlrA revealed that the E.coli enzyme YjgB might be the closest homolog with an about 57% sequence identity.
- This example sought to verify the fatty alcohol biosynthetic activity of E.coli YjgB by overexpressing YjgB with a CarB in E.coli and measure the accumulation of fatty aldehyde and production of fatty alcohols.
- the gene yjgB (GenBank accession number, NP_418690) insert was amplified using PCR from the genomic DNA of E. coli K-12 using the following primers.
- the plasmid pACYCDuet-l-fadD9 encoding the CAR homolog fadD9, was co- transformed with pETDuet-l-'tesA.
- pACYCDuet-1- fadD9 was also separately co- transformed with pETDuet-l-'tesA-yjgB.
- pACYCDuet-1- fadD9 was co- transformed with the empty vector pETDuet-1 (see, e.g., FIG. 28).
- the E. coli transformants were grown in 3 mL of LB medium supplemented with carbenicillin (100 mg/L) and chloramphenicol (34 mg/L) at 37 °C. After overnight growth, 15 of culture was transferred into 2 mL of fresh LB medium supplemented with carbenicillin and chloramphenicol. After 3.5 hours of growth, 2 mL of culture were transferred into a 125 mL flask containing 20 mL of M9 medium with 2% glucose and with carbenicillin and
- the measured retention times were 6.79 minutes for cis-5-dodecen-l-ol, 6.868 minutes for 1-dodecanol, 8.058 minutes for cis-7-tetradecen-l-ol, 8.19 minutes for 1- tetradecanol, 9.208 minutes for cis-9-hexadecen-l-ol, 9.30 minutes for 1-hexadecanol, and 10.209 minutes for cis-1 1-octadecen-l-ol.
- EXAMPLE 3 [00345] This example describes the identification of other fatty alcohol biosynthetic polypeptides in E.coli.
- Synechococcus elongatus (Synpcc7942_1594) (SEQ ID NO: 137).
- LB cultures were grown overnight at 37 °C, and 55 of stationary phase cultures were used to inoculate four independent 5.5 mL of LB. Those 5.5 mL cultures were then grown to an OD 6 oo of 0.8-1.0 and were then used to inoculate a corresponding number of 2 L baffled shakefiasks, each with 500 mL Hu-9 minimal media. 20 hrs after induction the cells were pelleted at 4,000 x g for 20 min.
- the cell pellet was resuspended in 30 mL of 100 mM phosphate buffer at pH 7.2 with lx Bacterial Protease Arrest (G Biosciences).
- the cells were lysed in a French press at 15,000 psi with two passes through the instrument.
- the cell debris was then removed by centrifuging at 10,000 x g for 20 mins.
- the cell lysate was loaded onto two HiTrapQ columns (GE Healthcare) connected in series.
- the following buffers were used to elute proteins: (A) 50 mM Tris, pH 7.5 and (B) 50 mM Tris, pH 7.5 with 1 M NaCl. A gradient from 0 % B to 100% B was ran over 5 column volumes at a flow rate of 3 mL/min while 4 mL fractions were collected.
- the fractions were assayed for alcohol biosynthetic enzymatic ⁇ e.g., aldehyde reductase/alcohol dehydrogenase) activity by taking 190 ⁇ of a protein fraction and adding 5 ⁇ of a 20 mM NADPH (Sigma) solution and 5 i of a 20 mM dodecanal (Fluka) solution in DMSO. The reactions were incubated at 37 °C for 1 hr. They were then extracted with 100 of ethyl acetate and analyzed for dodecanol via GC/MS. Fractions eluting around 350 mM NaCl contained a fatty alcohol biosynthetic enzyme activity.
- alcohol biosynthetic enzymatic ⁇ e.g., aldehyde reductase/alcohol dehydrogenase
- This example describes the verification of YjgB and YahK as fatty alcohol biosynthetic polypeptides.
- AATATCCTCCTTTAGTTCC-3 * (SEQ ID NO:224).
- This PCR product was electroporated into E. coli MG1655 (pKD46). The cells were plated on L-chloramphenicol (30 ⁇ g/mL)(L-Cm) and grown overnight at 37 °C. Individual colonies were picked on to another L-Cm plate and grown at 42 °C. These colonies were then patched to L-Cm and L- carbenicillin (100 mg/mL) (L-Cb) plates and grown at 37 °C overnight. Colonies that were Cm K and Cb ⁇ were evaluated further by PCR to ensure the PCR product inserted at the correct site.
- PCR verification was performed on colony lysates of these bacteria using the primers fadF (5 ' - ⁇ ) (SEQ ID NO:225) and fadR (5'- TCGCAACCTTTTCGTTGG-3 ' ) (SEQ ID NO:226). Expected size of the AfadDwCm deletion was about 1200 bp. The chloramphenicol resistance gene was eliminated using a FLP helper plasmid as described in Datsenko et al, Proc. Natl. Acad. Sci. USA 97:6640-6645 (2000). PCR verification of the deletion was performed with primers fadF and fadR.
- the MG1655 AfadD strain was unable to grow on M9 + oleate agar plates (oleate as carbon source). It was also unable to grow in M9 + oleate liquid media. The growth defect was complemented by an E. coli fadD gene supplied in trans (in pCL1920-Ptrc).
- the ⁇ 3 Lysogenization Kit (Novagen) was utilized, which is designed for site-specific integration of DE3 prophage into an E. coli host chromosome, such that the lysogenized host can be used to express target genes cloned in T7 expression vectors.
- DE3 is a recombinant phage carrying the cloned gene for T7 RNA polymerase under lacUVS control. Briefly, the host strain was cultured in LB supplemented with 0.2% maltose, 10 mM MgS0 4 , and antibiotics at 37 °C to an OD 600 of 0.5.
- 10 8 pfu ⁇ 3, 10 pfu Helper Phage, and 10 pfu Selection Phage were incubated with 10 ⁇ host cells.
- the host/phage mixture was incubated at 37 °C for 20 min to allow phage to adsorb to host.
- the mixture was pipeted onto an LB plate supplemented with antibiotics. The mixture was spread evenly using plating beads, and the plates were inverted plates and incubated at 37 °C overnight.
- T7 Tester Phage is a T7 phage deletion mutant that is completely defective unless active T7 RNA polymerase is provided by the host cell.
- the T7 Tester Phage makes very large plaques on authentic ⁇ 3 lysogens in the presence of IPTG, while much smaller plaques are observed in the absence of inducer.
- the relative size of the plaques in the absence of IPTG is an indication of the basal level expression of T7 RNA polymerase in the lysogen, and can vary widely between different host cell backgrounds.
- the yjgB knockout strain was constructed by using the following lambda red system ⁇ Datsenko et al , Proc. Natl. Acad. Sci. USA 97:6640- 6645 (2000)):
- kanamycin resistant gene from pKDl 3 was amplified with the primers yjgBRn: (5 ' -GCGCCTC AGATC AGCGCTGCGAATGATTTTCA AA AATCGGCTTTC AACACTG TAGGCTGGAGCTGCTTCG-3 , ) (SEQ ID NO:227), and yjgBFn: (S -CTGCCATGCTCTA CACTTCCCAAACAACACCAGAGAAGGACCAAAAAATGATTCCGGGGATCCGTCGAC
- PCR verification was performed on colony lysates of these bacteria using the primers BF ( - gtgrtggcgataCGACAAAACA-3 ' ) (SEQ ID NO:229) and BR (5 ⁇ - CCCCGCCCTGCCATGCTCTACAC-3 ⁇ ) (SEQ ID NO:230).
- the expected size of the yjgBy.kan knockout was about 1450 bp.
- Example 2 a fadE deletion strain was used for fatty aldehyde and fatty alcohol production from 'TesA, CAR homologs, and endogenous YjgB in E. coli.
- CAR homologs used fatty acids instead of acyl-CoA as a substrate
- the gene encoding for acyl-CoA synthase in E. coli (fadD) was deleted so that the fatty acids produced were not activated with CoA.
- E. coli strain MG1655(DE3, AfadD) was transformed with pETDuet-1 - 'tesA and pACYCDuet-l -carB .
- transformants were evaluated for fatty alcohol production using the methods described herein. These transformants produced about 360 mg/L of fatty alcohols (dodecanol, dodecenol, tetredecanol, tetredecenol, cetyl, hexadecenol, and octadecenol).
- YjgB was an alcohol dehydrogenase responsible for converting fatty aldehydes into their corresponding fatty alcohols
- pETDuet-l -'tesA and pACYCDuet-l -fadD9 were co-transformed into either MG1655(DE3, AfadD) or MG1655(DE3, AfadD, yjgB: ⁇ m).
- MG1655 (DE3, AfadD, yjgB ::kan) was transformed with both pETDuet-l -'tesA- yjgB and pACYCDuet-l -fadD9.
- the E. coli transformants were grown in 3 mL of LB medium supplemented with carbenicillin (100 mg/L) and chloramphenicol (34 mg/L) at 37 °C. After overnight growth, 15 of culture was transferred into 2 mL of fresh LB medium supplemented with carbenicillin and chloramphenicol. After 3.5 hrs of growth, 2 mL of culture was transferred into a 125 mL flask containing 20 mL of M9 medium with 2% glucose, carbenicillin, and chloramphenicol. When the OD 6 oo of the culture reached 0.9, 1 mM of IPTG was added to each flask.
- the yjgB knockout strain resulted in significant accumulation of dodecanal and a lower fatty alcohol titer (FIG. 29).
- the expression oiyjgB from plasmid pETDuet-l-'tesA-yjgB in the yjgB knockout strain effectively removed the accumulation of dodecanal (FIG. 29).
- the arrows in FIG. 29 indicate the GC trace of dodecanal (C12:0 aldehyde).
- yahK was knocked out in E. coli MG1655(DE3, AfadD, AyjgB) (control strain).
- the yahK knock-out strain MG1655(DE3, AfadD, Ayjg,B AyahK) was constructed with the lambda red system (Datsenko et al, supra) using the following primers: yahKJF: (CATATCAGGCGTTGCCAAATACACATAGCTAATCAGGAGTAAACACAATG) (SEQ ID NO:231); and yahK_R: (AATCGCACACTAACAGACTGAAAAAATTAATA AATACCCTGTGGTTTAAC) (SEQ ID NO:232).
- the AyahK strain did not convert dodecanal to dodecanol, while the wild type strain had this activity.
- each lysate was run on a HiTrapQ column as described above.
- the wild type lysate appeared to have fatty alcohol biosynthetic activity in fractions eluting around 350 mM NaCl, while the AyahK lysate appeared to have no fatty alcohol biosynthetic activity in this region.
- Further protein families that were likely to include potential alcohol biosynthetic polypeptides in E.coli may include, for example, the dehydroquinone synthase family (Pfam 01761), the phospho gluconate dehydrogenase family (Pfam 03446), the hydroxyacid dehydrogenase family (Pfam 02826, Pfam 00389), the aldehyde dehydrogenase family (Pfam 00171), the glutamyl-tRNA reductase family (Pfam 01488, Pfam 08501), the GFO/IDH/MOCA family (Pfam 01408, Pfam 02894), the mannitol dehydrogenase family (Pfam 01232, Pfam 08125), the IMP dehydrogenase family (Pfam 00478), the dehydroquinone synthase family (Pfam 01761), the phospho gluconate dehydrogenase family (Pfam 03
- oxidoreductase family (Pfam 10722), the epimerase family (Pfam 001370), the alcohol oxidase family (Pfam 00732, Pfam 05199), the PQQ dehydrogenase family (Pfam 0101 1), the xanthine dehydrogenase family (Pfam 00941), the FAD/NAD(P) -binding oxidoreductase family (Pfam 01266), the flavin/NADH-binding oxidoreductase family (Pfam 01613), the FAD-linked oxidoreductase family (Pfam 01565, Pfam 02913), the ferredoxin reductase family (Pfam 00175, Pfam 00970, Pfam 001 1 1), the anaerobic dehydrogenase family (Pfam 00384, Pfam 01568), the molybdenum-binding oxidore
- a control strain lacking a candidate alcohol dehydrogenase was also included in the experiment. 1 mL of each overnight culture was used to inoculate 50 mL of fresh LB with carbanecillin. The cultures were shaken at 37 °C until reaching an OD 6 oo of 0.8-1. The cultures were then transferred to 18 °C, induced with 1 mM IPTG, and shaken overnight.
- Cell free lysates were prepared by centrifuging the cultures at 4,000 x g for 20 mins. The cultures were then resuspended in 1 mL of Bugbuster (Novagen) and gently shaken at room temperature for 5 min. The cell debris was removed by spinning at 15,000 x g for 10 min. The resulting lysates were assayed for alcohol dehydrogenase activity by mixing 88 ⁇ of lysate, 2 of 40 mM cis-11-hexadecenal in DMSO, and 10 of 20 mM NADPH. The samples were incubated at 37 °C for 30 min. and were then extracted with 100 ⁇ , of ethyl acetate. The extracts were analyzed using GC/MS.
- MG1655(DE3, AfadD) (described above) was tested by transforming it with a plasmid expressing acyl-ACP reductase YP_400611 and analyzing fatty aldehyde and fatty alcohol titers.
- the test strain also contained a plasmid expressing a decarbonylase. This double knock-out mutant showed slightly higher fatty aldehyde titers in several experiments (see, e.g., FIG. 30), confirming that these two putative alcohol dehydrogenases contribute to fatty alcohol dehydrogenase activity in E. coli under production conditions.
- yncB and ydjA were deleted in the yjgB yahK double mutant.
- YdjA which is not a member of the four protein families mentioned above, demonstrated slightly elevated fatty aldehyde levels (see FIG. 30), indicating that it may also contribute to fatty alcohol dehydrogenase activity in E. coli under production conditions.
- FIG. 31 The first figure.
- a larger and more comprehensive set of putative fatty alcohol biosynthetic polypeptides were selected for an overexpression study to identify the members of various protein families that contribute to the reduction of fatty aldehydes to fatty alcohols in E.coli. Specifically, each of the fatty alcohol biosynthetic genes in Table 12 below were overexpressed and analyzed for fatty aldehyde conversion and/or fatty alcohol production.
- Table 12 Putative Fatty Alcohol Biosynthetic Genes That Were Overexpressed (including members of the 4 families mentioned above, with the most likely candidates for fatty alcohol biosynthetic enes
- ac gene was c one nto t e expression vector OP-80 (SEQ ID NO:233), which was digested with the restriction enzymes Ncol and EcoRI.
- the genes were amplified using PCR from E.coli MG1655 genomic DNA using the primers listed in Table 13.
- tdh f TAAGGAGGAATAAACCATGAAAGCGTTATCCAAACTGAAAGCGGAAG
- tdh r CGGGCCCAAGCTTCGAATTTTAATCCCAGCTCAGAATAACTTTCCCGGAC
- ucpA_f TAAGGAGGAATAAACCATGGGTAAACTCACGGGCAAGACAG
- ucpA_r CGGGCCCAAGCTTCGAATTTCAGATACCGACGCTAACCGTCTCC
- yahK_f TAAGGAGGAATAAACCATGAAGATCAAAGCTGTTGGTGCATATTCCG
- ybbO_f TAAGGAGGAATAAACCATGACTCATAAAGCAACGGAGATCCTGACAG (SEQ ID NO:282)
- ybbO r CGGGCCCAAGCTTCGAATTTCACCCCTGCAATATTTTGTCCATCACG (SEQ ID NO:283)
- ybdH_f TAAGGAGGAATAAACCATGCCTCACAATCCTATCCGCGTG (SEQ ID NO:284)
- ybdR f TAAGGAGGAATAAACCATGAAAGCATTGACTTATCACGGCCCAC (SEQ ID NO:286)
- ybdR_r CGGGCCCAAGCTTCGAATTTCATATTGTTCCCCCCGGCATCG (SEQ ID NO:287)
- yggP_f TAAGGAGGAATAAACCATGAAAACCAAAGTTGCTGCTATTTATGGCAAGC
- yggPjr CGGGCCCAAGCTTCGAATTTCATTGCGCGGCCTCCC
- yghZ_r CGGGCCCAAGCTTCGAATTTCATTTATCGGAAGACGCCTGCCAC (SEQ ID NO:315) yhdH f TAAGGAGGAATAAACCATGCAGGCGTTACTTTTAGAACAGCAGG(SEQ ID NO:316) yhdH_r CGGGCCCAAGCTTCGAATTTTAGTTAACCTTCACCAGCGTGCGAC(SEQ ID NO:317) yiaY_f TAAGGAGGAATAAACCATGGCAGCTTCAACGTTCTTTATTCCTTCTG (SEQ ID NO:318) yiaY r CGGGCCCAAGCTTCGAATTTTACATCGCTGCGCGATAAATCGCC (SEQ ID NO:319)
- yphC_f TAAGGAGGAATAAACCATGAAAACGATGCTGGCAGCTTATTTACCAG
- yphC_r CGGGCCCAAGCTTCGAATTTTAATCCGGGAAGTTAATCACAACTTTCCCGC
- yqhD_f TAAGGAGGAATAAACCATGAACAACTTTAATCTGCACACCCCAACC
- yqhD_r CGGGCCCAAGCTTCGAATTTTAGCGGGCGGCTTCGTATATACG
- Each primer was designed to contain 15 bases of overlap with the expression vector, enabling restrictionless cloning using the InFusion cloning kit (Clontech). Excess nucleotides and primers were removed from the PCR products using the ZR-96 DCC kit (Zymo Research). After ligation of the PCR products into the linearized OP-80, the resulting DNA was transformed into NEB Turbo competent cells (New England Biolabs, Inc. Ipswich, MA), and plated onto LB agar medium supplemented with 100 ⁇ ⁇ . spectinomycin and 1 % (w/v) glucose. Plasmid clones containing the appropriate inserts were identified using PCR, verified by sequencing and mini-prepped.
- the sequence verified plasmids were transformed into the expression strain, E.coli (DE3) AyjgB AyahK AydhD AdkgA, and plated onto LB agar medium supplemented with 100 ⁇ g/mL spectinomycin and 1 % (w/v) glucose. Individual colonies were picked and grown overnight at 37°C in LB liquid medium supplemented with 100 ⁇ g/mL spectinomycin and 1 % (w/v) glucose. The culture was then diluted 1 : 1000 into fresh LB with 100 ⁇ g/mL
- the cells were subsequently harvested by centrifugation for 10 minutes at 4,500 rpm. The supernatant was discarded and the cells were resuspended in 2.5 mL BugBuster lysis reagent (Novagen). The cell suspensions were placed on a vertical rotator for 45 minutes at 4°C to lyse the cells. Cell debris were removed by centrifugation for 10 minutes at 4,500 rpm, and the clarified lysates were used for activity assays.
- Each sample was evaluated in vitro to determine its ability to convert dodecanal or 1 1 -cis-hexadecenal into dodecanol or 1 1 -cis-hexadecenol, respectively, using the cell lysates as described above.
- the negative control consisted of a lysate prepared from cells transformed with an empty OP-80 expression vector.
- Each reaction contained 5-40 ih of cell lysate, 20 ⁇ , 20 mM dodecanal or 1 1 -cis- hexadecenal, 10 ⁇ , 20 mM NADH or NADPH, and sufficient dilution buffer (100 mM sodium phosphate, pH 7.0, 0.25% (v/v) Triton X-100) to bring the total volume to 400 ⁇ ,.
- sufficient dilution buffer 100 mM sodium phosphate, pH 7.0, 0.25% (v/v) Triton X-100
- a 50 ⁇ L sample of the organic phase was derivatized with BSTFA ( ⁇ , ⁇ - bis(trimethylsilyl)trifluoroacetamide) and analyzed on a GC/FID equipped with a Trace UFC-1 column (Thermo Scientific). Samples were run using a split ratio of 1 :300 and a program consisting of an initial temperature of 140°C for 0.3 minute, a ramp up of 150°C/min to 300°C, then holding at a constant temperature of 300°C for 0.05 minutes.
- BSTFA ⁇ , ⁇ - bis(trimethylsilyl)trifluoroacetamide
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biomedical Technology (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- Molecular Biology (AREA)
- Virology (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Tropical Medicine & Parasitology (AREA)
- Medicinal Chemistry (AREA)
- Cell Biology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Compositions and methods for producing fatty acid derivatives using recombinant microorganisms are described herein.
Description
METHODS AND COMPOSITIONS RELATED TO FATTY ALCOHOL
BIOSYNTHETIC ENZYMES
BACKGROUND OF THE INVENTION
Field of the Invention
[0001] Compositions, methods and systems effective to produce fatty acid derivatives.
[0002] This application claims priority benefit to U.S. Provisional Application Serial Nos. 61/321 ,877, and 61/321 , 878, filed on April 8, 2010, which are expressly incorporated by reference herein in their entirety.
Background of the Technology
[0003] Petroleum is a limited, natural resource found in the Earth in liquid, gaseous, or solid forms. Petroleum is a valuable resource for producing various industrial materials. But petroleum products are developed at considerable costs, both financial and environmental. In addition to the economic cost, petroleum exploration carries a high environmental cost. In its natural form, crude petroleum extracted from the Earth has few commercial uses. It is a mixture of hydrocarbons (e.g. , paraffins (or alkanes), olefins (or alkenes), alkynes, napthenes (or cycloalkanes), aliphatic compounds, aromatic compounds, etc.) of varying length and
complexity. Hence, crude petroleum must be refined and purified before it can be used commercially. Crude petroleum is also a primary source of raw materials for producing petrochemicals. The two main classes of raw materials derived from petroleum are short chain olefins (e.g., ethylene and propylene) and aromatics (e.g., benzene and xylene isomers). These raw materials are derived from longer chain hydrocarbons in crude petroleum by cracking it at considerable expense using a variety of methods, such as catalytic cracking, steam cracking, or catalytic reforming. These raw materials are used to make petrochemicals, which cannot be directly refined from crude petroleum, such as monomers, solvents, detergents, or adhesives.
[0004] These petrochemicals can then be used to make specialty chemicals, such as plastics, resins, fibers, elastomers, pharmaceuticals, lubricants, or gels. Particular specialty chemicals that can be produced from petrochemical raw materials are fatty acids, hydrocarbons (e.g. , long
chain, branched chain, saturated, unsaturated, etc.), fatty alcohols, esters, fatty aldehydes, ketones, lubricants, etc.
[0005] Fatty alcohols have many commercial uses. The shorter chain fatty alcohols are used in the cosmetic and food industries as emulsifiers, emollients, and thickeners. Due to their amphiphilic nature, fatty alcohols behave as nonionic surfactants, which are useful in personal care and household products, for example, detergents. In addition, fatty alcohols are used in waxes, gums, resins, pharmaceutical salves and lotions, lubricating oil additives, textile antistatic and finishing agents, plasticizers, cosmetics, industrial solvents, and solvents for fats.
[0006] Hydrocarbons have many commercial uses. For example, shorter chain alkanes are used as fuels. Longer chain alkanes (e.g., from five to sixteen carbons) are used as transportation fuels (e.g., gasoline, diesel, or aviation fuel). Alkanes having more than sixteen carbon atoms are important components of fuel oils and lubricating oils. Even longer alkanes, which are solid at room temperature, can be used, for example, as a paraffin wax. In addition, longer chain alkanes can be cracked to produce commercially valuable shorter chain hydrocarbons.
[0007] Like short chain alkanes, short chain alkenes are used in transportation fuels. Longer chain alkenes are used in plastics, lubricants, and synthetic lubricants. In addition, alkenes are used as a feedstock to produce alcohols, esters, plasticizers, surfactants, tertiary amines, enhanced oil recovery agents, fatty acids, thiols, alkenylsuccinic anhydrides, epoxides, chlorinated alkanes, chlorinated alkenes, waxes, fuel additives, and drag flow reducers.
[0008] Esters have many commercial uses. For example, biodiesel, an alternative fuel, is comprised of esters (e.g., fatty acid methyl esters, fatty acid ethyl esters, etc). Some low molecular weight esters are volatile with a pleasant odor, which makes them useful as fragrances or flavoring agents. In addition, esters are used as solvents for lacquers, paints, and varnishes. Furthermore, some naturally occurring substances, such as waxes, fats, and oils are comprised of esters. Esters are also used as softening agents in resins and plasticizers, flame retardants, and additives in gasoline and oil. In addition, esters can be used in the manufacture of polymers, films, textiles, dyes, and pharmaceuticals.
[0009] Aldehydes are used to produce many specialty chemicals. For example, aldehydes are used to produce polymers, resins (e.g., Bakelite), dyes, flavorings, plasticizers, perfumes,
pharmaceuticals, and other chemicals. Some are used as solvents, preservatives, or disinfectants. Some natural and synthetic compounds, such as vitamins and hormones, are aldehydes.
[0010] Obtaining specialty chemicals from crude petroleum requires a significant financial investment as well as a great deal of energy. It is also an inefficient process because frequently the long chain hydrocarbons in crude petroleum are cracked to produce smaller monomers.
These monomers are then used as the raw material to manufacture the more complex specialty chemicals.
[0011] Finally, the burning of petroleum based fuels releases greenhouse gases (e.g., carbon dioxide) and other forms of air pollution (e.g., carbon monoxide, sulfur dioxide, etc.). As the world's demand for fuel increases, the emission of greenhouse gases and other forms of air pollution also increases. The accumulation of greenhouse gases in the atmosphere can lead to an increase global warming. Hence, in addition to damaging the environment locally (e.g., oil spills, dredging of marine environments, etc.), burning petroleum also damages the environment globally.
[0012] Due to the inherent challenges posed by petroleum, there is a need for a renewable petroleum source. For similar reasons, there is also a need for a renewable source of chemicals which are typically derived from petroleum. The current invention addresses these needs.
Brief Summary Of The Invention.
[0013] The invention provides recombinant microorganisms engineered to produce fatty acid derivatives and methods of use wherein the recombinant microorganisms comprise
polynucleotide sequences encoding: (a) a fatty aldehyde biosynthetic polypeptide and (b) a fatty alcohol biosynthetic polypeptide, wherein the expression of the polypeptides is modified relative to the corresponding wild type polypeptides and the microorganism produces an increased titer of the fatty acid derivative relative to a wild type microorganism. The recombinant
microorganisms may further comprise a thioesterase (EC 3.1.1.5).
[0014] Exemplary fatty aldehyde biosynthetic polypeptides: (a) have at least 90% sequence identity to the amino acid sequence presented as SEQ ID NO: 41 , 43, 45, 47, 49, 51, 53, 55, 57, 59, 61 , 63, 65, 69, 71 , 73, 75, 77, 79, 81 , 83, 85, 87, 89, 91 , 93, 97, 99, 101 , 103, 105, 107, 109, 1 1 1 , 1 13, 1 15, 1 17, 119, 121 , 123, 125, or 127; (b) comprise an amino acid sequence motif with
a sequence presented as (1) SEQ ID NO: 129, SEQ ID NO:130, SEQ ID N0:131 , and SEQ ID NO:132; (2) SEQ ID NO: 133; SEQ ID NO: 134; SEQ ID NO:135; SEQ ID NO: 136; or (3) SEQ ID NO: 129, SEQ ID NO: 131 , SEQ ID NO: 132 or SEQ ID NO:133; or (c) are encoded by a polynucleotide having at least 90% sequence identity to the nucleotide sequence presented as SEQ ID NO: 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 1 16, 118, 120, 122, 124, 126, or 128.
[0015] Methods for producing a fatty alcohol, comprising culturing such an engineered microorganism in the presence of a carbon source, under conditions wherein the fatty alcohol is produced at a titer of at least 300mg/L, are further provided.
[0016] In practicing the claimed methods, the engineered microorganism may be modified: (a) to express an attenuated level of an acyl-CoA synthase (EC 2.3.1.86) or (b) to further comprise an acyl-ACP reductase polypeptide, wherein (i) the acyl-ACP reductase polypeptide has amino acid sequence with at least 90% sequence identity to SEQ ID NO: 137, 139, 141, 143, 145, 147, 149, 151, or 153, (ii) the acyl-ACP reductase polypeptide has an amino acid motif presented as SEQ ID NO: 155, 156, 157, 158, 159, 160, 161,162, 163, 164, or 165, or (iii) the acyl-ACP reductase polypeptide is encoded by a polynucleotide having at least 90% sequence identity to SEQ ID NO: 138, 140, 142, 144, 146, 148, 150, 152, or 154.
[0017] In practicing the claimed methods, expression of a fatty aldehyde reductase or alcohol dehydrogenase (EC 1.1.1.1 ) in the engineered microorganism may be increased or attenuated relative to the corresponding wild type polypeptide, or the gene encoding the fatty alcohol biosynthetic polypeptide may be knocked-out.
[0018] The fatty alcohol biosynthetic polypeptide may (a) have at least 90% sequence identity to a polypeptide sequence selected from the group consisting of SEQ ID NO:l , 3, 5, 7, 9, 1 1 , 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 , 33, 35, 37, and 39, or (b) be encoded by a polynucleotide having at least 90% sequence identity to the nucleotide sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40.
[0019] The fatty alcohol produced by the claimed method may (a) comprise a C -C]8 fatty alcohol (e.g., a C6, C8, Qo, C)2, C]3, C]4, Ci5, C16, Cn, or C]8 fatty alcohol); (b) have the
hydroxyl group is in the primary (Ci) position; (c) be a saturated or unsaturated fatty alcohol; (d) be unsaturated at the omega-7 position; or (e) comprise a cis double bond.
[0020] The invention further provides recombinant microorganisms engineered to produce hydrocarbons and methods of use wherein the recombinant microorganism further comprises (a) a hydrocarbon biosynthetic polypeptide having the amino acid sequence of SEQ ID NO: 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, or 200 with one or more amino acid substitutions, additions, deletions, or insertions; (b) a polynucleotide sequence encoding a hydrocarbon biosynthetic polypeptide, having at least 90% sequence identity to the amino acid sequence of SEQ ID NO: 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, or 200, or (c) a hydrocarbon biosynthetic polypeptide having the amino acid motif sequences presented as (1) SEQ ID NO:202; (2) SEQ ID NO:203 or SEQ ID NO:204, or SEQ ID NO:205; (3) SEQ ID NO:206, and any one of SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205; or (4) SEQ ID NO:207 or SEQ ID NO:208, or SEQ ID NO:209, or SEQ ID NO:210, wherein the hydrocarbon biosynthetic polypeptide has
decarbonylase activity.
[0021] Methods for producing a hydrocarbon, comprising culturing such engineered
microorganisms in the presence of a carbon source, under conditions wherein the hydrocarbon is produced, are further provided.
[0022] The hydrocarbon produced by the claimed methods may (a) be an alkane or an alkene, e.g., a C13-C21 alkane or alkene, (b) have a 5!3C of -15.4 or greater, or (c) have a fM 14C of at least 1.003.
[0023] The hydrocarbon produced by the claimed methods may be used in a biofuel, for example, a diesel, gasoline, or jet fuel.
[0024] The invention further provides the use of microorganisms such as a yeast cell, a fungus cell, a filamentous fungi cell, or a bacterial cell in practicing the claimed methods.
Brief Description of the Figures
[0025] FIG. 1 A is a graphic representation of pathways for fatty alcohol production. FIG. I B is a graphic representation of pathways for hydrocarbon production.
[0026] FIG. 2 includes a table listing exemplary homologs of E.coli K-12 MG 1655 ethanol- active dehydrogenase/acetaldehyde-active reductase AdhP [GenBank Accession No.
NP_415995.4].
[0027] FIG. 3 includes a table listing exemplary homologs of E.coli K- 12 MG 1655 2,5-diketo- D-gluconate reductase A, DkgA [GenBank Accession No. NP_417485.4].
[0028] FIG. 4 includes a table listing exemplary homologs of E.coli K- 12 MG 1655 2,5-diketo- D-gluconate reductase B, DkgB [GenBank Accession No. NP 414743.1 ].
[0029] FIG. 5 includes a table listing exemplary homologs of E.coli K-12 MG 1655 E.coli K-12 MG 1655 aldo-keto reductase Tas [GenBank Accession No. NP_41731 1 .1 ].
[0030] FIG. 6 includes a table listing exemplary homologs of E.coli K-12 MG 1655 Zn- dependent and NAD(P)-binding oxidoreductase RspB [GenBank Accession No. NP 416097.1 ].
[0031] FIG. 7 includes a table listing exemplary homologs of E.coli K-12 MG 1655 Zn- dependent and NAD(P)-binding oxidoreductase YahK [GenBank Accession No. NP 414859.1 ].
[0032] FIG. 8 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NAD(P)- binding oxidoreductase YbbO [GenBank Accession No. NP 415026.1].
[0033] FIG. 9 includes a table listing exemplary homologs of E.coli K-12 MG 1655
oxidoreductase YbdH [GenBank Accession No. NP_415132.1 ].
[0034] FIG. 10 includes a table listing exemplary homologs of E. coli K-12 MG 1655 Zn- dependent and NAD(P)-binding oxidoreductase YbdR [GenBank Accession No.
NP_4155141.1 ].
[0035] FIG. 1 1 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NAD(P)- binding oxidoreductase YgfF [GenBank Accession No. NP_417378.1].
[0036] FIG. 12 includes a table listing exemplary homologs of E.coli K-12 MG 1655 Zn- dependent and NAD(P)-binding oxidoreductase YhdH [Genbank Accession No. NP_417719.1 ].
[0037] FIG. 13 includes a table listing exemplary homologs of E.coli K-12 MG 1655 Zn- dependent and NAD(P)-binding alcohol dehydrogenase YjgB [GenBank Accession No.
NP_418690.4].
[0038] FIG. 14 includes a table listing exemplary homologs of E.coli K-12 MG 1655 3- dehroquinate synthase AroB [GenBank Accession No. NP_417848.1].
[0039] FIG. 15 includes a table listing exemplary homologs of E.coli K-12 MG 1655 Zn- dependent and NAD(P)-binding oxidoreductase YcjQ [GenBank Accession No. NP_415829.1 ].
[0040] FIG. 16 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NAD(P)- binding oxidoreductase YdbC [GenBank Accession No. NP_415924.1].
[0041] FIG. 17 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NADH- dependent alpha-keto reductase YdjG [GenBank Accession No. NP__416285.1].
[0042] FIG. 18 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NADPH- dependent aldo-keto reductase YeaE [GenBank Accession No. NP_416295.1 ].
[0043] FIG. 19 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NADP- dependent, Zn-dependent oxidoreductase YncB [GenBank Accession No. NP_415966.6].
[0044] FIG. 20 includes a table listing exemplary homologs of E.coli K-12 MG 1655 NAD(P)- dependent alcohol dehydrogenase YqhD [GenBank Accession No. NP 417484.1].
[0045] FIG. 21 includes a table listing exemplary homologs of E.coli K-12 MG 1655 Zn- dependent and NAD(P)-binding oxidoreductase YdjL [GenBank Accession No. NP_416290.1 ].
[0046] FIG. 22 includes tables listing E.coli dehydratase/isomerase enzymes and
dehydratase/isomerase enzymes from other organisms.
[0047] FIG. 23 includes table listing E.coli keto-ACP synthase enzymes and keto-ACP synthase enzymes from other organisms.
[0048] FIG. 24A is a graphic representation of fatty alcohols produced by a recombinant E.coli strain transformed with pETDuet-l -'tesA-alrAadpl and pACYCDuet-l -CarB. FIG. 24B is a GC/MS trace of fatty alcohol produced by a recombinant E.coli strain transformed with pETDuet-l-'tesA-alrAadpl and pACYCDuet-l -CarB as compared to the control strain, which did not express an alrAadpl .
[0049] FIG. 25 is a graphic representation of fatty alcohols produced by a recombinant E.coli strain transformed with pETDuet-l -'tesA-yjgB and pACYCDuet-l -CarB.
[0050] FIG. 26A is a GC/MS trace of fatty alcohol production in MG1655 (DE3, /¾ Z))/pETDuet-l -tesA and pACYCDuet-l -CarB cells. FIG. 26B is a GC/MS trace of fatty alcohol production in MG1655 (DE3, AfadD, j//g5: :kan)/pETDuet-l -tesA and pACYCDuet-1 - CarB cells. FIG. 26C is a GC/MS trace of fatty alcohol production in MG1655 (DE3, AfadD, jyg5: :kan)/pETDuet-l -'tesA-yjgB and pACYCDuet-l -CarB cells. The arrows in FIGs. 26A, 26B, and 26C indicate the absence of C12:0 fatty aldehydes.
[0051] FIG. 27 is a graphic representation of fatty alcohol production in various deletion mutants of E. co li.
[0052] FIG. 28 is a graphic representation of fatty alcohol production in various deletion mutants of E.coli.
[0053] FIGs. 29A-29X are graphs depicting of the amount of fatty aldehydes converted to fatty alcohol using the enzymatic assays as described in Example 5. The title of each graph indicates the co-factor and substrate that were used in the assay. "CI 2" indicates a dodecanal substrate. "CI 6: 1 " indicates a 1 1-cis-hexadecenal substrate. The tables accompanying the graphs indicate the percentages of fatty aldehydes that were converted into fatty alcohols at the marked concentrations, as measured by GC-FID. The tables also indicate the p-values for the samples' capacity to catalyze the conversion of fatty aldehydes to fatty alcohols.
DETAILED DESCRIPTION OF THE INVENTION
[0054] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of the ordinary skill in the art to which this invention belongs. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein, including GenBank database sequences, are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.
[0055] Other features and advantages of the invention will be apparent from the following detailed description, and from the claims.
[0056] The invention is based, at least in part, on the discovery that altering the level of expression of one or more of a fatty alcohol biosynthetic polypeptide, a fatty aldehyde biosynthetic polypeptide, an acyl-ACP reductase polypeptide (EC 6.4.1.2) and a hydrocarbon biosynthetic polypeptide, e.g., a decarbonylase, in the microorganism host cell facilitates enhanced production of fatty acids and fatty acid derivatives by the microorganism.
Definitions
[0057] Throughout the specification, a reference may be made using an abbreviated gene name or polypeptide name, but it is understood that such an abbreviated gene or polypeptide name represents the genus of genes or polypeptides. Such gene names include all genes encoding the same polypeptide and homologous polypeptides having the same physiological function.
Polypeptide names include all polypeptides that have the same activity (e.g., that catalyze the same fundamental chemical reaction).
[0058] Unless otherwise indicated, the accession numbers referenced herein are derived from the NCBI database (National Center for Biotechnology Information) maintained by the National Institute of Health, U.S.A. Unless otherwise indicated, the accession numbers are as provided in the database as of October 2008.
[0059] EC numbers are established by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB) (available at
http://www.chem.qmul.ac.uk/iubmb/enzyme/). The EC numbers referenced herein are derived from the KEGG Ligand database, maintained by the Kyoto Encyclopedia of Genes and
Genomics, sponsored in part by the University of Tokyo. Unless otherwise indicated, the EC numbers are as provided in the database as of October 2008.
[0060] The articles "a" and "an" are used herein to refer to one or to more than one (i.e. , to at least one) of the grammatical object of the article. By way of example, "an element" means one element or more than one element.
[0061] As used herein "acyl CoA" refers to an acyl thioester formed between the carbonyl carbon of alkyl chain and the sulfydryl group of the 4'-phosphopantethionyl moiety of coenzyme A (CoA), which has the formula R-C(0)S-CoA, where R is any alkyl group having at least 4 carbon atoms. In some instances an acyl CoA will be an intermediate in the synthesis of fully saturated acyl CoAs, including, but not limited to 3-keto-acyl CoA, a 3 -hydroxy acyl CoA, a delta-2-trans-enoyl-CoA, or an alkyl acyl CoA. In some embodiments, the carbon chain will have about 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, or 26 carbons. In other embodiments the acyl CoA will be branched. In one embodiment the branched acyl CoA is an isoacyl CoA, in another it is an anti-isoacyl CoA. Each of these "acyl CoAs" are substrates for enzymes that convert them to fatty acid derivatives such as those described herein.
[0062] As used herein, the term "alcohol dehydrogenase" (EC 1.1.1.*) is a peptide capable of catalyzing the conversion of a fatty aldehyde to an alcohol (e.g., fatty alcohol). Additionally, one of ordinary skill in the art will appreciate that some alcohol dehydrogenases will catalyze other reactions as well. For example, some alcohol dehydrogenases will accept other substrates in addition to fatty aldehydes. Such non-specific alcohol dehydrogenases are, therefore, also included in this definition. Nucleic acid sequences encoding alcohol dehydrogenases are known in the art, and such alcohol dehydrogenases are publicly available. Exemplary GenBank
Accession Numbers include those provided in the figures.
[0063] As used herein, the term "aldehyde" means a hydrocarbon having the formula RCHO characterized by an unsaturated carbonyl group (C=0). In a preferred embodiment, the aldehyde is any aldehyde made from a fatty acid or fatty acid derivative. In one embodiment, the R group is at least about 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20 carbons in length.
[0064] As used herein, an "aldehyde biosynthetic gene" or an "aldehyde biosynthetic
polynucleotide" is a nucleic acid that encodes an aldehyde biosynthetic polypeptide.
[0065] As used herein, an "aldehyde biosynthetic polypeptide is a polypeptide that is a part of the biosynthetic pathway of an aldehyde. Such polypeptide can act on a biological substrate to yield an aldehyde. In some instances, the aldehyde biosynthetic polypeptide has reductase activity.
[0066] As used herein, the term "alkane" means saturated hydrocarbons or compounds that consist only of carbon (C) and hydrogen (H), wherein these atoms are linked together by single bonds (i.e., they are saturated compounds).
[0067] The terms "altered level of expression" and "modified level of expression" are used interchangeably and mean that a polynucleotide, polypeptide, or hydrocarbon is present in a different concentration in an engineered microorganism as compared to its concentration in a corresponding wild-type cell under the same conditions.
[0068] As used herein, the term "attenuate" means to weaken, reduce or diminish. For example, a polypeptide can be attenuated by modifying the polypeptide to reduce its activity (e.g., by modifying a nucleotide sequence that encodes the polypeptide).
[0069] In other embodiments, the polypeptide, polynucleotide, or hydrocarbon having an altered level of expression is "attenuated" or has a "decreased level of expression." As used herein,
"attenuate" and "decreasing the level of expression" mean to express or cause to be expressed a polynucleotide, polypeptide, or hydrocarbon in a cell at a lesser concentration than is normally expressed in a corresponding wild-type cell under the same conditions. The degree of overexpression or attenuation can be 1.5-fold or more, e.g., 2-fold or more, 3-fold or more, 5- fold or more, 10-fold or more, or 15-fold or more. Alternatively, or in addition, the degree of overexpression or attenuation can be 500-fold or less, e.g., 100-fold or less, 50-fold or less, 25- fold or less, or 20-fold or less. Thus, the degree of overexpression or attenuation can be bounded by any two of the above endpoints. For example, the degree of overexpression or attenuation can be 1.5-500-fold, 2-50-fold, 10-25-fold, or 15-20-fold.
[0070] As used herein, the term "biodiesel" means a biofuel that can be a substitute of diesel, which is derived from petroleum. Biodiesel can be used in internal combustion diesel engines in either a pure form, which is referred to as "neat" biodiesel, or as a mixture in any concentration with petroleum-based diesel. Biodiesel can include esters or hydrocarbons, such as alcohols.
[0071] As used therein, the term "biofuel" refers to any fuel derived from biomass. Biofuels can be substituted for petroleum based fuels. For example, biofuels are inclusive of transportation fuels (e.g., gasoline, diesel, jet fuel, etc.), heating fuels, and electricity-generating fuels. Biofuels are a renewable energy source.
[0072] As used herein, the term "biomass" refers to any biological material from which a carbon source is derived. In some embodiments, a biomass is processed into a carbon source, which is suitable for bioconversion. In other embodiments, the biomass does not require further processing into a carbon source. The carbon source can be converted into a biofuel. An exemplary source of biomass is plant matter or vegetation, such as corn, sugar cane, or switchgrass. Another exemplary source of biomass is metabolic waste products, such as animal matter (e.g., cow manure). Further exemplary sources of biomass include algae and other marine plants. Biomass also includes waste products from industry, agriculture, forestry, and households, including, but not limited to, fermentation waste, ensilage, straw, lumber, sewage, garbage, cellulosic urban waste, and food leftovers. The term "biomass" also can refer to sources of carbon, such as carbohydrates (e.g., monosaccharides, disaccharides, or
polysaccharides).
[0073] "Branched chains" may have more than one point of branching and may include cyclic branches. In some embodiments, the branched fatty acid, branched fatty aldehyde, or branched
fatty alcohol comprises a C6, C7, C8, C9, Cjo, Cn, C]2, C13, C]4, C]5, C16, C17, C] 8, C]9, C20, C21, C22, C23, C24, C25, or a C26 branched fatty acid, branched fatty aldehyde, or branched fatty alcohol. In particular embodiments, the branched fatty acid, branched fatty aldehyde, or branched fatty alcohol is a C6, C8, C10, Ci2, C13, Ci4, C15, C16, C17, or C18 branched fatty acid, branched fatty aldehyde, or branched fatty alcohol. In certain embodiments, the hydroxyl group of the branched fatty acid, branched fatty aldehyde, or branched fatty alcohol is in the primary (Ci) position. In certain embodiments, the branched fatty acid, branched fatty aldehyde, or branched fatty alcohol is an iso-fatty acid, iso-fatty aldehyde, or iso-fatty alcohol, or an antesio- fatty acid, an anteiso-fatty aldehyde, or anteiso-fatty alcohol. In exemplary embodiments, the branched fatty acid, branched fatty aldehyde, or branched fatty alcohol is selected from iso-C7:o, iso-C8;o, iso-C9:o, iso-Cio:o, iso-Cii;o, iso-C]2:0, iso-Ci3:0, iso-C]4:0, iso-Ci5:0, iso-Ci6:0, iso-Ci7:0, iso-Ci8:o, iso-Ci9;o, anteiso-C7:o, anteiso-C8:0, anteiso-C9:o, anteiso-Cio:o, anteiso-C1 1 :o,anteiso- Ci2:0, anteiso-Ci3:o, anteiso-Ci4:o, anteiso-Cj5:o, anteiso-Ci6:0, anteiso-Ci7:o, anteiso-Ci8:o, and anteiso-Cj9:o branched fatty acid, branched fatty aldehyde or branched fatty alcohol.
[0074] As used herein, the phrase "carbon source" refers to a substrate or compound suitable to be used as a source of carbon for prokaryotic or simple eukaryotic cell growth. Carbon sources can be in various forms, including, but not limited to polymers, carbohydrates, acids, alcohols, aldehydes, ketones, amino acids, peptides, and gases (e.g., CO and C0 ). Exemplary carbon sources include, but are not limited to, monosaccharides, such as glucose, fructose, mannose, galactose, xylose, and arabinose; oligosaccharides, such as fructo-oligosaccharide and galacto- oligosaccharide; polysaccharides such as starch, cellulose, pectin, and xylan; disaccharides, such as sucrose, maltose, and turanose; cellulosic material and variants such as methyl cellulose and sodium carboxymethyl cellulose; saturated or unsaturated fatty acid esters, succinate, lactate, and acetate; alcohols, such as ethanol, methanol, and glycerol, or mixtures thereof. The carbon source can also be a product of photosynthesis, such as glucose. In certain preferred
embodiments, the carbon source is biomass. In other preferred embodiments, the carbon source is glucose.
[0075] As used herein, a "cloud point lowering additive" is an additive added to a composition to decrease or lower the cloud point of a solution.
[0076] As used herein, the phrase "cloud point of a fluid" means the temperature at which dissolved solids are no longer completely soluble. Below this temperature, solids begin
precipitating as a second phase giving the fluid a cloudy appearance. In the petroleum industry, cloud point refers to the temperature below which a solidified material or other heavy
hydrocarbon crystallizes in a crude oil, refined oil, or fuel to form a cloudy appearance. The presence of solidified materials influences the flowing behavior of the fluid, the tendency of the fluid to clog fuel filters, injectors, etc., the accumulation of solidified materials on cold surfaces (e.g., a pipeline or heat exchanger fouling), and the emulsion characteristics of the fluid with water.
[0077] A nucleotide sequence is "complementary" to another nucleotide sequence if each of the bases of the two sequences matches (i.e., is capable of forming Watson Crick base pairs). The term "complementary strand" is used herein interchangeably with the term "complement". The complement of a nucleic acid strand can be the complement of a coding strand or the
complement of a non-coding strand.
[0078] As used herein, the term "conditions sufficient to allow expression" means any conditions that allow a microorganism host cell to produce a desired product, such as a polypeptide or fatty aldehyde described herein. Suitable conditions include, for example, fermentation conditions. Fermentation conditions can comprise many parameters, such as temperature ranges, levels of aeration, and media composition. Each of these conditions, individually and in combination, allows the host cell to grow. Exemplary culture media include broths or gels. Generally, the medium includes a carbon source, such as glucose, fructose, cellulose, or the like, that can be metabolized by a host cell directly. In addition, enzymes can be used in the medium to facilitate the mobilization (e.g., the depolymerization of starch or cellulose to fermentable sugars) and subsequent metabolism of the carbon source. To determine if conditions are sufficient to allow expression, a host cell can be cultured, for example, for about 4, 8, 12, 24, 36, or 48 hours. During and/or after culturing, samples can be obtained and analyzed to determine if the conditions allow expression. For example, the host cells in the sample or the medium in which the host cells were grown can be tested for the presence of a desired product. When testing for the presence of a product, assays, such as, but not limited to, TLC, HPLC, GC/FID, GC/MS, LC/MS, MS, can be used.
[0079] As used herein, "control element" means a transcriptional control element. Control elements include promoters and enhancers. The term "promoter element," "promoter," or "promoter sequence" refers to a DNA sequence that functions as a switch that activates the
expression of a gene. If the gene is activated, it is said to be transcribed or participating in transcription. Transcription involves the synthesis of mRNA from the gene. A promoter, therefore, serves as a transcriptional regulatory element and also provides a site for initiation of transcription of the gene into mRNA. Control elements interact specifically with cellular proteins involved in transcription (Maniatis et al, Science 236: 1237, 1987).
[0080] As used herein, the term "fatty acid" means a carboxylic acid having the formula
RCOOH. R represents an aliphatic group, preferably an alkyl group. R can comprise between about 4 and about 22 carbon atoms. Fatty acids can be saturated, monounsaturated, or polyunsaturated. In a preferred embodiment, the fatty acid is made from a fatty acid biosynthetic pathway.
[0081] As used herein, the term "fatty acid biosynthetic pathway" means a biosynthetic pathway that produces fatty acids. The fatty acid biosynthetic pathway includes fatty acid synthases that can be engineered, as described herein, to produce fatty acids, and in some embodiments can be expressed with additional enzymes to produce fatty acids having desired carbon chain
characteristics.
[0082] As used herein, the term "fatty acid degradation enzyme" means an enzyme involved in the breakdown or conversion of a fatty acid or fatty acid derivative into another product. A nonlimiting example of a fatty acid degradation enzyme is an acyl-CoA synthase (EC 2.3.1.86). Additional examples of fatty acid degradation enzymes are described herein.
[0083] As used herein, the term "fatty acid derivative" means products made in part from the fatty acid biosynthetic pathway of the production host organism. "Fatty acid derivative" also includes products made in part from acyl-ACP or acyl-ACP derivatives. The fatty acid biosynthetic pathway includes fatty acid synthase enzymes which can be engineered as described herein to produce fatty acid derivatives, and in some examples can be expressed with additional enzymes to produce fatty acid derivatives having desired carbon chain characteristics.
Exemplary fatty acid derivatives include for example, fatty acids, acyl-CoA, fatty aldehyde, short and long chain alcohols, hydrocarbons, fatty alcohols, and esters (e.g., waxes, fatty acid esters, or fatty esters).
[0084] As used herein, the term "fatty acid derivative enzyme" means any enzyme that may be expressed or overexpressed in the production of fatty acid derivatives. These enzymes may be part of the fatty acid biosynthetic pathway. Non-limiting examples of fatty acid derivative enzymes include fatty acid
synthases, thioesterases (EC 3.1. 2.14 or EC 3.1.1.5), acyl-CoA synthases (EC 2.3.1.86), acyl-CoA reductases, alcohol dehydrogenases, alcohol acyltransferases, fatty alcohol-forming acyl-CoA reductases, fatty acid (carboxylic acid) reductases, acyl-ACP reductases (EC 6.4.1.2), fatty acid hydroxylases, acyl- CoA desaturases, acyl-ACP desaturases, acyl-CoA oxidases, acyl-CoA dehydrogenases, ester synthases, and alkane biosynthetic polypeptides, etc. Fatty acid derivative enzymes can convert a substrate into a fatty acid derivative. In some examples, the substrate may be a fatty acid derivative that the fatty acid derivative enzyme converts into a different fatty acid derivative. Exemplary suitable substrates include, C6-C26 fatty aldehydes.
[0085] As used herein, "fatty acid enzyme" means any enzyme involved in fatty acid
biosynthesis. Fatty acid enzymes can be modified in host cells to produce fatty acids. Non- limiting examples of fatty acid enzymes include fatty acid synthases and thioesterases (EC 3.1. 2.14 or EC 3.1.1.5). Additional examples of fatty acid enzymes are described herein.
[0086] As used herein, the term "fatty acid or derivative thereof means a "fatty acid" or a "fatty acid derivative." The term "fatty acid" means a carboxylic acid having the formula RCOOH. R represents an aliphatic group, preferably an alkyl group. R can comprise between about 4 and about 22 carbon atoms. Fatty acids can be saturated, monounsaturated, or polyunsaturated. In a preferred embodiment, the fatty acid is made from a fatty acid biosynthetic pathway.
[0087] As used herein, "fatty alcohol" means an alcohol having the formula ROH. In some embodiments, the fatty alcohol is any alcohol made from a fatty acid or fatty acid derivative. In certain embodiments, the R group of a fatty acid, fatty aldehyde, or fatty alcohol is at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 1 1 , at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, or at least 19, carbons in length. Alternatively, or in addition, the R group is 20 or less, 19 or less, 18 or less, 17 or less, 16 or less, 15 or less, 14 or less, 13 or less, 12 or less, 1 1 or less, 10 or less, 9 or less, 8 or less, 7 or less, or 6 or less carbons in length. Thus, the R group can have an R group bounded by any two of the above endpoints. For example, the R group can be 6-16 carbons in length, 10-14 carbons in length, or 12-18 carbons in length. In some embodiments, the fatty acid, fatty aldehyde, or fatty alcohol is a C6, C7, C8, C9, CI O, Cl l , C12, C13, C14, C15, C16, C17, C18, C19, C20, C21 , C22, C23, C24, C25, or a C26 fatty acid, fatty aldehyde, or fatty alcohol. In certain embodiments, the fatty acid, fatty aldehyde, or fatty alcohol is a C6, C8, CI O, C12, C13, C14, C15, C16, C17, or C18 fatty acid, fatty aldehyde, or fatty alcohol. The R group of a fatty acid, fatty aldehyde, or fatty alcohol can be a straight chain or a branched chain.
[0088] As used herein, "fatty aldehyde" means an aldehyde having the formula RCHO characterized by an unsaturated carbonyl group (C=0). In a preferred embodiment, the fatty aldehyde is any aldehyde made from a fatty acid or fatty acid derivative. In one embodiment, the R group is at least about 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20 carbons in length. R can be straight or branched chain. The branched chains may have one or more points of branching. In addition, the branched chains may include cyclic branches.
Furthermore, R can be saturated or unsaturated. If unsaturated, the R can have one or more points of unsaturation. In one embodiment, the fatty aldehyde is produced biosynthetically. Fatty aldehydes have many uses. For example, fatty aldehydes can be used to produce many specialty chemicals. For example, fatty aldehydes are used to produce polymers, resins, dyes, flavorings, plasticizers, perfumes, pharmaceuticals, and other chemicals. Some are used as solvents, preservatives, or disinfectants. Some natural and synthetic compounds, such as vitamins and hormones, are aldehydes.
[0089] As used herein, the term "fatty ester" may be used in reference to an ester. In a preferred embodiment, a fatty ester is any ester made from a fatty acid, for example a fatty acid ester. In some embodiments, a fatty ester contains an A side and a B side. As used herein, an "A side" of an ester refers to the carbon chain attached to the carboxylate oxygen of the ester. As used herein, a "B side" of an ester refers to the carbon chain comprising the parent carboxylate of the ester. In embodiments where the fatty ester is derived from the fatty acid biosynthetic pathway, the A side is contributed by an alcohol, and the B side is contributed by a fatty acid. Any alcohol can be used to form the A side of the fatty esters. For example, the alcohol can be derived from the fatty acid biosynthetic pathway. Alternatively, the alcohol can be produced through non- fatty acid biosynthetic pathways. Moreover, the alcohol can be provided exogenously. For example, the alcohol can be supplied in the fermentation broth in instances where the fatty ester is produced by an organism. Alternatively, a carboxylic acid, such as a fatty acid or acetic acid, can be supplied exogenously in instances where the fatty ester is produced by an organism that can also produce alcohol. The carbon chains comprising the A side or B side can be of any length. In one embodiment, the A side of the ester is at least about 1 , 2, 3, 4, 5, 6, 7, 8, 10, 12, 14, 16, or 18 carbons in length. When the fatty ester is a fatty acid methyl ester, the A side of the ester is 1 carbon in length. When the fatty ester is a fatty acid ethyl ester, the A side of the ester is 2 carbons in length. The B side of the ester can be at least about 4, 6, 8, 10, 12, 14, 16, 18, 20,
22, 24, or 26 carbons in length. The A side and/or the B side can be straight or branched chain. The branched chains can have one or more points of branching. In addition, the branched chains can include cyclic branches. Furthermore, the A side and/or B side can be saturated or unsaturated. If unsaturated, the A side and/or B side can have one or more points of
unsaturation. In some embodiments, the fatty acid ester is a fatty acid methyl ester (FAME) or a fatty acid ethyl ester (FAEE). In certain embodiments, the FAME is a beta-hydroxy (B-OH) FAME. In one embodiment, the fatty ester is a wax. The wax can be derived from a long chain alcohol and a long chain fatty acid. In another embodiment, the fatty ester is a fatty acid thioester, for example fatty acyl Coenzyme A (Co A). In other embodiments, the fatty ester is a fatty acyl pantothenate, an acyl carrier protein (ACP), or a fatty phosphate ester.
[0090] As used herein, "fraction of modern carbon" or has the same meaning as defined by National Institute of Standards and Technology (NIST) Standard Reference Materials (SRMs) 4990B and 4990C, known as oxalic acids standards HOxI and HOxII, respectively. The fundamental definition relates to 0.95 times the 14C /12C isotope ratio HOxI (referenced to AD 1950). This is roughly equivalent to decay- corrected pre- Industrial Revolution wood. For the current living biosphere (plant material), fM is approximately 1.1.
[0091] "Gene knockout", as used herein, refers to a procedure by which a gene encoding a target protein is modified or inactivated so to reduce or eliminate the function of the intact protein. Inactivation of the gene may be performed by general methods such as mutagenesis by UV irradiation or treatment with N-methyl-N'-nitro-N-nitrosoguanidine, site-directed mutagenesis, homologous recombination, insertion-deletion mutagenesis, or "Red-driven integration"
(Datsenko et al, Proc. Natl. Acad. Sci. USA, 97:6640-45, 2000). For example, in one embodiment, a construct is introduced into a host cell, such that it is possible to select for homologous recombination events in the host cell. One of skill in the art can readily design a knock-out construct including both positive and negative selection genes for efficiently selecting transfected cells that undergo a homologous recombination event with the construct. The alteration in the host cell may be obtained, for example, by replacing through a single or double crossover recombination a wild type DNA sequence by a DNA sequence containing the alteration. For convenient selection of transformants, the alteration may, for example, be a DNA sequence encoding an antibiotic resistance marker or a gene complementing a possible auxotrophy of the host cell. Mutations include, but are not limited to, deletion-insertion
mutations. An example of such an alteration includes a gene disruption, i.e., a perturbation of a gene such that the product that is normally produced from this gene is not produced in a functional form. This could be due to a complete deletion, a deletion and insertion of a selective marker, an insertion of a selective marker, a frameshift mutation, an in-frame deletion, or a point mutation that leads to premature termination. In some instances, the entire mRNA for the gene is absent. In other situations, the amount of mRNA produced varies.
[0092] As used herein, a "host cell" is a cell used to produce a product described herein (e.g., a fatty alcohol described herein). A host cell can be modified to express or overexpress selected genes or to have attenuated expression of selected genes. Non-limiting examples of host cells include plant, animal, human, bacteria, yeast, or filamentous fungi cells.
[0093] In some embodiments, a polypeptide described herein has "increased level of activity." By "increased level of activity" is meant that a polypeptide has a higher level of biochemical or biological function (e.g., DNA binding or enzymatic activity) in an engineered host cell as compared to its level of biochemical and/or biological function in a corresponding wild-type host cell under the same conditions. The degree of enhanced activity can be about 10% or more, about 20% or more, about 50% or more, about 75% or more, about 100% or more, about 200% or more, about 500% or more, about 1000% or more, or any range therein.
[0094] The term "isolated" as used herein with respect to nucleic acids, such as DNA or RNA, refers to molecules separated from other DNAs or RNAs, respectively that are present in the natural source of the nucleic acid. Moreover, by an "isolated nucleic acid" is meant to include nucleic acid fragments, which are not naturally occurring as fragments and would not be found in the natural state. The term "isolated" is also used herein to refer to polypeptides, which are isolated from other cellular proteins and is meant to encompass both purified and recombinant polypeptides. The term "isolated" as used herein also refers to a nucleic acid or peptide that is substantially free of cellular material, viral material, or culture medium when produced by recombinant DNA techniques. The term "isolated" as used herein also refers to a nucleic acid or peptide that is substantially free of chemical precursors or other chemicals when chemically synthesized. The term "isolated", as used herein with respect to products, such as fatty alcohols, refers to products that are isolated from cellular components, cell culture media, or chemical or synthetic precursors.
[0095] As used herein, the "level of expression of a gene" refers to the level of mRNA, pre- mRNA nascent transcript(s), transcript processing intermediates, mature mRNA(s), and degradation products encoded by the gene.
[0096] As used herein, the term "microorganism" means prokaryotic and eukaryotic microbial species from the domains Archaea, Bacteria and Eucarya, the latter including yeast and filamentous fungi, protozoa, algae, or higher Protista. The terms "microbial cells" {i.e. , cells from microbes) and "microbes" are used interchangeably and refer to cells or small organisms that can only be seen with the aid of a microscope.
[0097] As used herein, the term "nucleic acid" refers to polynucleotides, such as
deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The term should also be understood to include, as equivalents, analogs of RNAs or DNAs made from nucleotide analogs, and, as applicable to the embodiment being described, single (sense or antisense) and double-stranded polynucleotides, ESTs, chromosomes, cDNAs, mRNAs, and rRNAs.
[0098] The term "nucleotide" as used herein refers to a monomeric unit of a polynucleotide that consists of a heterocyclic base, a sugar, and one or more phosphate groups. The naturally occurring bases (guanine, (G), adenine, (A), cytosine, (C), thymine, (T), and uracil (U)) are typically derivatives of purine or pyrimidine, though it should be understood that naturally and non-naturally occurring base analogs are also included. The naturally occurring sugar is the pentose (five-carbon sugar) deoxyribose (which forms DNA) or ribose (which forms RNA), though it should be understood that naturally and non-naturally occurring sugar analogs are also included. Nucleic acids are typically linked via phosphate bonds to form nucleic acids or polynucleotides, though many other linkages are known in the art (e.g., phosphorothioates, boranophosphates, and the like). Polynucleotides described herein may comprise degenerate nucleotides which are defined according to the IUPAC code for nucleotide degeneracy wherein B is C, G, or T; D is A, G, or T; H is A, C, or T; K is G or T; M is A or C; N is A, C, G, or T; R is A or G; S is C or G; V is A, C, or G; W is A or T; and Y is C or T.
[0099] The terms "olefin" and "alkene" are used interchangeably herein, and refer to
hydrocarbons containing at least one carbon-to-carbon double bond (i.e., they are unsaturated compounds).
[00100] As used herein, the term "operably linked" means that selected nucleotide sequence (e.g. , encoding a polypeptide described herein) is in proximity with a promoter to allow the
promoter to regulate expression of the selected DNA. In addition, the promoter is located upstream of the selected nucleotide sequence in terms of the direction of transcription and translation. By "operably linked" is meant that a nucleotide sequence and a regulatory sequence(s) are connected in such a way as to permit gene expression when the appropriate molecules (e.g., transcriptional activator proteins) are bound to the regulatory sequence(s).
[00101] The term "or" is used herein to mean, and is used interchangeably with, the term "and/or," unless context clearly indicates otherwise.
[00102] In some embodiments, the polypeptide, polynucleotide, or hydrocarbon having an altered or modified level of expression is "overexpressed" or has an "increased level of expression." As used herein, "overexpress" and "increasing the level of expression" mean to express or cause to be expressed a polynucleotide, polypeptide, or hydrocarbon in a cell at a greater concentration than is normally expressed in a corresponding wild-type cell under the same conditions. For example, a polypeptide can be "overexpressed" in an engineered host cell when the polypeptide is present in a greater concentration in the engineered host cell as compared to its concentration in a non-engineered host cell of the same species under the same conditions.
[00103] As used herein, "partition coefficient" or "P," is defined as the equilibrium concentration of a compound in an organic phase divided by the concentration at equilibrium in an aqueous phase (e.g., fermentation broth). In one embodiment of a bi-phasic system described herein, the organic phase is formed by the fatty aldehyde during the production process.
However, in some examples, an organic phase can be provided, such as by providing a layer of octane, to facilitate product separation. When describing a two phase system, the partition characteristics of a compound can be described as logP. For example, a compound with a logP of 1 would partition 10: 1 to the organic phase. A compound with a logP of -1 would partition 1 : 10 to the organic phase. By choosing an appropriate fermentation broth and organic phase, a fatty aldehyde with a high logP value can separate into the organic phase even at very low concentrations in the fermentation vessel.
[00104] "Polynucleotide" refers to a polymer of DNA or RNA, which can be single-stranded or double-stranded and which can contain non-natural or altered nucleotides. The terms "polynucleotide," "nucleic acid," and "nucleic acid molecule" are used herein interchangeably to refer to a polymeric form of nucleotides of any length, either ribonucleotides (RNA) or
deoxyribonucleotides (DNA). These terms refer to the primary structure of the molecule, and thus include double- and single-stranded DNA, and double- and single-stranded RNA. The terms include, as equivalents, analogs of either RNA or DNA made from nucleotide analogs and modified polynucleotides such as, though not limited to methylated and/or capped
polynucleotides. The polynucleotide can be in any form, including but not limited to plasmid, viral, chromosomal, EST, cDNA, mRNA, and rRNA.
[00105] The terms "polypeptide" and "protein" refer to a polymer of amino acid residues. The term "recombinant polypeptide" refers to a polypeptide that is produced by recombinant DNA techniques, wherein generally DNA encoding the expressed protein or RNA is inserted into a suitable expression vector that is in turn used to transform a host cell to produce the polypeptide or RNA.
[00106] As used herein, the term "purify," "purified," or "purification" means the removal or isolation of a molecule from its environment by, for example, isolation or separation.
"Substantially purified" molecules are at least about 60% free, preferably at least about 75% free, and more preferably at least about 90% free from other components with which they are associated. As used herein, these terms also refer to the removal of contaminants from a sample. For example, the removal of contaminants can result in an increase in the percentage of fatty alcohol in a sample. For example, when fatty alcohols are produced in a host cell, the fatty alcohols can be purified by the removal of host cell proteins. After purification, the percentage of fatty alcohols in the sample is increased. The terms "purify," "purified," and "purification" do not require absolute purity. They are relative terms. Thus, for example, when fatty alcohols are produced in host cells, a purified fatty alcohol is one that is substantially separated from other cellular components (e.g., nucleic acids, polypeptides, lipids, carbohydrates, or other
hydrocarbons). In another example, a purified fatty alcohol preparation is one in which the fatty alcohol is substantially free from contaminants, such as those that might be present following fermentation. In some embodiments, a fatty alcohol is purified when at least about 50% by weight of a sample is composed of the fatty alcohol. In other embodiments, a fatty alcohol is purified when at least about 60%, 70%, 80%, 85%, 90%, 92%, 95%, 98%, or 99% or more by weight of a sample is composed of the fatty alcohol.
[00107] As used herein, the term "recombinant polypeptide" refers to a polypeptide that is produced by recombinant DNA techniques, wherein generally DNA encoding the expressed
protein or RNA is inserted into a suitable expression vector and that is in turn used to transform a host cell to produce the polypeptide or RNA.
[00108] The R group of a branched or unbranched fatty acid, branched or unbranched fatty aldehyde, or branched or unbranched fatty alcohol can be "saturated" or "unsaturated". If unsaturated, the R group can have one or more than one point of unsaturation. In some embodiments, the unsaturated fatty acid, unsaturated fatty aldehyde, or unsaturated fatty alcohol is a monounsaturated fatty acid, monounsaturated fatty aldehyde, or monounsaturated fatty alcohol. In certain embodiments, the unsaturated fatty acid, unsaturated fatty aldehyde, or unsaturated fatty alcohol is a C6:l, C7: l, C8: l , C9: l, C10:l, CI 1 : 1 , C12:l , C13:l , C14:l , C15:l, C16: l, C17: l, C18:l, C19:l , C20:l, C21 :l, C22:l, C23:l, C24: l , C25: l , or a C26:l unsaturated fatty acid, unsaturated fatty aldehyde, or unsaturated fatty alcohol. In certain preferred embodiments, the unsaturated fatty acid, unsaturated fatty aldehyde, or unsaturated fatty alcohol is C10:l , C12:l, C14: l, C16: l , or C18: l . In yet other embodiments, the unsaturated fatty acid, unsaturated fatty aldehyde, or unsaturated fatty alcohol is unsaturated at the omega-7 position. In certain embodiments, the unsaturated fatty acid, unsaturated fatty aldehyde, or unsaturated fatty alcohol comprises a cis double bond.
[00109] As used herein, the term "substantially identical" (or "substantially homologous") is used to refer to a first amino acid or nucleotide sequence that contains a sufficient number of identical or equivalent (e.g., with a similar side chain, e.g., conserved amino acid substitutions) amino acid residues or nucleotides to a second amino acid or nucleotide sequence such that the first and second amino acid or nucleotide sequences have similar activities.
[00110] As used herein, the term "synthase" means an enzyme which catalyzes a synthesis process. As used herein, the term synthase includes synthases, synthetases, and ligases.
[00111] The terms "terminal olefin," "a-olefin", "terminal alkene" and "1-alkene" are used interchangeably herein with reference to a-olefins or alkenes with a chemical formula CXH2X, distinguished from other olefins with a similar molecular formula by linearity of the hydrocarbon chain and the position of the double bond at the primary or alpha position.
[00112] As used herein, the term "transfection" means the introduction of a nucleic acid (e.g., via an expression vector) into a recipient cell by nucleic acid-mediated gene transfer.
[00113] As used herein, "transformation" refers to a process in which a cell's genotype is changed as a result of the cellular uptake of exogenous DNA or RNA. This may result in the
transformed cell expressing a recombinant form of an RNA or polypeptide. In the case of antisense expression from the transferred gene, the expression of a naturally-occurring form of the polypeptide is disrupted.
[00114] As used herein, a "transport protein" is a polypeptide that facilitates the movement of one or more compounds in and/or out of a cellular organelle and/or a cell.
[00115] As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of useful vector is an episome {i.e., a nucleic acid capable of extra-chromosomal replication). Useful vectors are those capable of autonomous replication and/or expression of nucleic acids to which they are linked. Vectors capable of directing the expression of genes to which they are operatively linked are referred to herein as "expression vectors". In general, expression vectors of utility in
recombinant DNA techniques are often in the form of "plasmids," which refer generally to circular double stranded DNA loops that, in their vector form, are not bound to the chromosome. In the present specification, "plasmid" and "vector" are used interchangeably, as the plasmid is the most commonly used form of vector. However, also included are such other forms of expression vectors that serve equivalent functions and that become known in the art subsequently hereto.
[00116]
DESCRIPTION OF EXEMPLARY EMBODIMENTS
Production Of Fatty Alcohols
[00117] The invention is based, at least in part, on the identification of a number of fatty alcohol biosynthetic enzymes or polypeptides that are capable of catalyzing the conversion of a fatty aldehyde to a fatty alcohol under suitable conditions, for example, in the presence of suitable substrates and/or co-factors. The fatty alcohols can be produced by one or more or all of the fatty alcohol biosynthesis pathways in E. coli that utilize, in part, genes that encode fatty aldehyde biosynthetic polypeptides, acyl-ACP reductases (EC 6.4.1.2), or the fatty alcohol biosynthetic enzymes of the present invention. In certain embodiments, the fatty alcohols are produced by a biosynthetic pathway depicted in FIG. 1 A In this pathway, a fatty acid is first activated by ATP and then reduced to generate a fatty aldehyde. The fatty aldehyde can then be further reduced into a fatty alcohol by a fatty alcohol biosynthetic polypeptide of the present
invention, such as, for example, a fatty aldehyde reductase, an alcohol dehydrogenase, an oxidoreductase, an aldo-keto reductase, or a short-chain dehydrogenase. In certain other embodiments, the fatty alcohols are produced by an alternative biosynthesis pathway depicted in FIG. 1 A. In this pathway, an acyl-ACP is converted into a fatty aldehyde catalyzed by an acyl- ACP reductase (EC 6.4.1.2). The fatty aldehyde is further reduced into a fatty alcohol by a fatty alcohol biosynthetic polypeptide of the present invention, for example, by a fatty aldehyde reductase, an alcohol dehydrogenase, an oxidoreductase, an aldo-keto reductase, or a short-chain dehydrogenase. Exemplary embodiments of fatty alcohol biosynthetic enzymes of the present invention includes, without limitation, adhP, dkgA, dkgB, rspB, yahK, ybbO, ybdH, ybdR, ygfF, yhdH, yjgB, aroB, ycjQ, ydbC, ydjG, yeaE, yncB, yghD, ydjL, Tas, among others. Suitable substrates of these enzymes include fatty aldehydes, for example fatty aldehydes with carbon chain lengths from C10 to C18. Suitable co-factors include, without limitation, NAD, NAD(P), NADH, or NADPH.
[00118] The methods described herein can be used to produce fatty alcohols in an engineered microorganism by conversion of fatty aldehydes into fatty alcohols. In some instances, the fatty alcohol is produced by a fatty alcohol biosynthetic polypeptide having an amino acid sequence listed provided herein, as well as polypeptide variant thereof.
[00119] In other instances, the methods described herein can be used to produce fatty alcohols in an engineered microorganism using an acyl-ACP reductase polypeptide having an amino acid sequence provided herein, as well as a polypeptide variant thereof. In some instances, an acyl- ACP reductase polypeptide is one that includes one or more of the amino acid motifs disclosed herein. For example, the polypeptide can comprise one or more of SEQ ID NO: 155, 156, 157, 158, 159, 160, 161 , 162, 163, 164, or 165.
Fatty Alcohol Biosynthetic Genes And Polypeptides.
[00120] In some instances, a fatty alcohol is produced by expressing a gene encoding a fatty alcohol biosynthetic polypeptide that is capable of catalyzing the enzymatic conversion of a fatty aldehyde to a fatty alcohol.
[00121] In some embodiments, the method further includes isolating the fatty alcohol from the host cell. In some embodiments, the fatty alcohol is present in the extracellular environment. In certain embodiments, the fatty alcohol is isolated from the extracellular environment. In certain
embodiments, the fatty alcohol is spontaneously secreted, partially or completely, from the host cell. In alternative embodiments, the fatty alcohol is transported into the extracellular environment. In other embodiments, the fatty alcohol is passively transported into the extracellular environment. In some embodiments, the method further includes purifying the fatty alcohol.
[00122] In some embodiments, the fatty alcohol biosynthetic polypeptide is about 200 amino acids to about 800 amino acids in length. In certain embodiments, the polypeptide is about 250 amino acids to about 700 amino acids in length, for example, is about 300 to about 600 amino acids in length, about 350 to about 500 amino acids in length, or about 350 to about 450 amino acids in length. In other embodiments, the fatty alcohol biosynthetic polypeptide is up to about 800 amino acids in length, for example, up to about 700 amino acids in length, about 600 amino acids in length, about 500 amino acids in length, about 450 amino acids in length, about 400 amino acids in length, about 350 amino acids in length, about 300 amino acids in length, about 250 amino acids in length, or about 200 amino acids in length. In other embodiments, the fatty alcohol biosynthetic polypeptide is more than about 200 amino acids in length, for example, more than about 250 amino acids in length, about 300 amino acids in length, about 350 amino acids in length, about 400 amino acids in length, about 450 amino acids in length, about 500 amino acids in length, about 600 amino acids in length, about 700 amino acids in length, or about 800 amino acids in length.
[00123] In some embodiments, the fatty alcohol biosynthetic polypeptide comprises the amino acid sequence of SEQ ID NO: 1 , 3, 5, 7, 9, 1 1 , 13, 15, 17, 19, 21 , 23, 25, 27, 29, 31, 33, 35, 37, or 39, with one or more amino acid substitutions, additions, insertions, or deletions, wherein the polypeptide is capable of catalyzing the enzymatic conversion of a fatty aldehyde to a fatty alcohol under suitable conditions, for example, in the presence of suitable substrates and/or co- factors. In certain embodiments, the polypeptide is capable of catalyzing the enzymatic conversion of a fatty aldehyde into a fatty alcohol under suitable conditions, for example, in the presence of suitable substrates and/or co-factors. In certain embodiments, the polypeptide is a fatty aldehyde reductase and/or has fatty aldehyde reductase activity (EC 1.1.1.1). In some embodiments, the polypeptide is an alcohol dehydrogenase and/or has alcohol dehydrogenase activity. In certain embodiments, the polypeptide is an aldo-keto reductase and/or has aldo-keto
reductase activity. In certain other embodiments, the polypeptide is a short-chain dehydrogenase and/or has short-chain dehydrogenase activity. In yet other embodiments, the polypeptide is an oxidoreductase and/or has oxidoreductase activity. In certain further embodiments, the polypeptide comprises one or more NAD(P)- or NAD(P)H- binding domains and/or is associated with an NAD(P) or NAD(P)H co-factor. In yet further embodiments, the three-dimensional or the predicted three-dimensional structure of the polypeptide comprises a Rossman fold.
[00124] In some embodiments, the fatty alcohol biosynthetic is a mutant or variant.
[00125] Various known activity assays can be used to determine the enzymatic activity of a putative fatty alcohol biosynthetic polypeptide. These assays can be suitable or useful for determining, for example, the expression or level of various fatty alcohol biosynthetic polypeptides in an engineered host cell or microorganism. For example, the capacity of a polypeptide to convert a fatty aldehyde into a fatty alcohol can be determined by measuring the rate of increase or decrease of NAD(P)H at 340 nm (ε =6.22 nM'1 cm"1) using aldehydes as substrates at 25°C. See, e.g., Schweiger et al., Appl. Microbiol. Biotechnol. (published online 31 July 2009). Specifically, a 1.0 mL reaction mixture consisting of 5 mM aldehyde substrate, 40 mM potassium phosphate buffer, pH7.0, 125 μΜ NADPH and enzyme can be prepared. One unit can be defined as the amount of enzyme activity catalyzing the conversion of 1.0 μηιοΐ of pyridine nucleotide per minute. Alternatively, a similar assay with somewhat different conditions can be carried out to determine the fatty alcohol biosynthetic enzymatic activity. See, e.g., Wahlen et al., App. Environ. Microbiol. 75(9):2758-2764 (2009). Specifically, about 50 μg of purified enzyme can be added to a reaction mixture containing 100 mM Tris buffer at pH 7.9, 100 mM NaCl, 2.4 mM of either NADPH or NADH as a reactant, and decanal, oleic acid, and hexadecanol as possible substrates. Optionally the assay can be run under an argon atmosphere in septum-sealed vials overnight at room temperature with constant and gentle mixing. The products of the reaction can then be extracted from the buffer by adding an equal volume of hexane, and organic layer components can be analyzed by gas chromatography equipped with a flame ionization detector (30 m by 0.32 mm inner diameter with 0.5 μπι film thickness, with argon as a carrier and a temperature ramp of, for example, from 60°C to 360°C, increasing at 10°C per minute). A continuous spectrophotometric assay can also be developed to determine a
given polypeptide's capacity to convert a fatty aldehyde into a fatty alcohol. The activity assays and conditions described in the examples herein are also suitable for this determination.
[00126] In some embodiments, the fatty alcohol biosynthetic polypeptide has an amino acid sequence that is at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91 %, at least about 92%, at least about 93%, at least about 94%, at least about 95%), at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity to SEQ ID NO: 1 , 3, 5, 7, 9, 1 1 , 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, or 39. In some embodiments, the polypeptide has the amino acid sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21 , 23, 25, 27, 29, 31, 33, 35, 37, or 39.
[00127] In some embodiments, the nucleotide sequence has at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity to SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40. In some embodiments, the nucleotide sequence is SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40.
[00128] In other embodiments, the nucleotide sequence hybridizes to a complement of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40, or to a fragment thereof, for example, under low stringency, medium stringency, high stringency, or very high stringency conditions, wherein the polynucleotide encodes a polypeptide that is capable of catalyzing the enzymatic conversion of a fatty aldehyde to a fatty alcohol under suitable conditions, for example, in the presence of suitable substrates and/or co-factors. In some embodiments, the polynucleotide encodes a fatty alcohol biosynthetic enzyme. In certain embodiments, the polynucleotide encodes a fatty aldehyde reductase and/or encodes a polypeptide having fatty aldehyde reductase activity. In some embodiments, the
polynucleotides encodes an alcohol dehydrogenase and/or encodes a polypeptide having alcohol dehydrogenase activity. In other embodiments, the polynucleotide encodes an oxidoreductase and/or a polypeptide having oxidoreductase activity. In certain embodiments, the polynucleotide encodes an aldo-keto reductase and/or a polypeptide having aldo-keto reductase activity. In
certain other embodiments, the polynucleotide encodes a short-chain dehydrogenase and/or a polypeptide having short-chain dehydrogenase activity. In yet further embodiments, the polypeptide comprises one or more NAD(P)- or NAD(P)H- binding domains or is associated with an NAD(P) or NAD(P)H co-factors. In other embodiments, the three-dimensional structure or the predicted three-dimensional structure of the polypeptide comprises a Rossman fold.
[00129] In any of the aspects of the invention described herein, the method can produce fatty alcohols comprising a C6-C26 fatty alcohol. In some embodiments, the fatty alcohol comprises a C6, C7, C8, C9, do, C 1 1 , C12, Ci3, Ci4, C15, Ci6, Cn, C18, C\ C20, C2j, C22, C23, C24, C25, or a C26 fatty alcohol. In particular embodiments, the fatty alcohol is a C6, C8, C10, C 12, d3, CM, C15, C16, C17, or Ci8 fatty alcohol. In certain embodiments, the hydroxyl group of the fatty alcohol is in the primary (Ci) position. In other embodiments, the fatty alcohol comprises a straight chain fatty alcohol. In other embodiments, the fatty alcohol comprises a branched chain fatty alcohol. In yet other embodiments, the fatty alcohol comprises a cyclic moiety.
[00130] In some embodiments, the fatty alcohol is an unsaturated fatty alcohol. In other embodiments, the fatty alcohol is a monounsaturated fatty alcohol. In certain embodiments, the unsaturated fatty alcohol is a C6: l, C7: l , C8: l , C9: l , C10: l , Cl l : l , C12: l , C13 : l , C14: l , C15: l , C16: l , C17: l , C18: l , C19: l , C20: l , C21 : l , C22: l, C23 : l , C24: l , C25: l , or a C26: l unsaturated fatty alcohol. In yet other embodiments, the fatty alcohol is unsaturated at the omega-7 position. In certain embodiments, the unsaturated fatty alcohol comprises a cis double bond.
[00131] In yet other embodiments, the fatty alcohol is a saturated fatty alcohol.
[00132] In any of the aspects of the invention described herein, a suitable substrate for the polypeptide can be a fatty aldehyde. In some embodiments, the fatty aldehyde comprises a C6- C2 fatty aldehyde. In some embodiments, the fatty aldehyde comprises a C6, C7, C8, C9, do, Cn, C12, C13, Cn, Cis, Ci6, Cn, de, C19, C20, C2\, C22, C23, C24, C25, or a C26 fatty aldehyde. In particular embodiments, the fatty aldehyde is a C6, C8, do, C12, C13, Cj4, C15, C]6, C17, or C)8 fatty aldehyde.
[00133] In other embodiments, the fatty aldehyde comprises a straight chain fatty aldehyde. In other embodiments, the fatty aldehyde comprises a branched chain fatty aldehyde. In yet other embodiments, the fatty aldehyde comprise one or more cyclic moieties.
[00134] In some embodiments, the fatty aldehyde is an unsaturated fatty aldehyde. In other embodiments, the fatty aldehyde substrate is a monounsaturated fatty aldehyde. In yet other embodiments, the fatty aldehyde is a saturated fatty aldehyde.
[00135] In any of the aspects of the invention described herein, a suitable co-factor for the fatty alcohol biosynthetic polypeptide can be, for example, NAD, NADP, NADH, and/or NADPH. In some embodiments, the polypeptide comprises a co-factor binding domain or is associated with one of more of the co-factors. In particular embodiments, the three-dimensional structure or the predicted three-dimensional structure of the polypeptide comprises a Rossman fold.
[00136] In another aspect, the invention features an engineered microorganism comprising an exogenous control sequence stably incorporated into the genomic DNA of the microorganism upstream of a fatty alcohol biosynthetic polynucleotide comprising a nucleotide sequence having at least about 50% sequence identity to the nucleotide sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40, wherein the microorganism produces an increased level of a fatty alcohol relative to a wild-type microorganism.
[00137] In some embodiments, the nucleotide sequence has at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91 %, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99% identity to SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40. In some embodiments, the nucleotide sequence is SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40.
[00138] In some embodiments, the fatty alcohol biosynthetic polynucleotide is endogenous to the microorganism.
[00139] In other embodiments, the microorganism is engineered to express a modified level of a gene encoding a fatty acid derivative enzyme. In certain embodiments, modifying the expression of a gene encoding a fatty acid derivative enzyme includes expressing a gene encoding a fatty acid derivative enzyme and/or increasing the expression or activity of an endogenous fatty acid derivative enzyme. In alternative embodiments, modifying the expression
of a gene encoding a fatty acid derivative enzyme includes attenuating a gene encoding a fatty acid derivative enzyme and/or decreasing the expression or activity of an endogenous fatty acid derivative enzyme. In some embodiments, the fatty acid derivative enzyme is a fatty acid synthase. In other embodiments, the fatty acid derivative enzyme is a thioesterase (EC 3.1. 2.14 or EC 3.1.1.5). In particular embodiments, the thioesterase is encoded by tesA, tesA without leader sequence, tesB, fatB, fatB2, fatB3, fatA, or fatAl.
[00140] In certain embodiments, one or more of the fatty alcohol biosynthetic polypeptides are overexpressed relative to expression in a wild type host cell.
[00141] While not wishing to be bound by theory, it is believed that the fatty alcohol biosynthetic polypeptide described herein produce fatty alcohols from substrate via a reduction mechanism. In some instances, the substrate is a fatty aldehyde or a derivative thereof, a fatty alcohol having particular branching patterns and carbon chain lengths can be produced from a fatty aldehyde having those characteristics that would result in a particular fatty alcohol. The fatty aldehyde substrates can, in turn, be obtained from another reaction mechanism, including, for example, via a reaction converting a fatty acid catalyzed by a fatty aldehyde biosynthetic enzyme or via a reaction converting an acyl-ACP substrate catalyzed by an acyl-ACP reductase.
[00142] In addition, each step within a biosynthetic pathway that leads to the production of a fatty aldehyde derivative substrate can be modified to produce or overproduce the substrate of interest. For example, known genes involved in the fatty acid biosynthetic pathway or the fatty aldehyde pathway can be expressed, overexpressed, or attenuated in host cells to produce a desired substrate {see, e.g., various enzymes described in PCT/US08/058788, incorporated by reference herein).
[00143] A suitable fatty acid substrate can be converted into a fatty aldehyde substrate by, for example, a fatty aldehyde biosynthetic polypeptide such as a carboxylic acid reductase, or an acyl-ACP reductase. For example, the fatty aldehyde biosynthetic polypeptide can be selected from those described herein, or variants thereof. Alternatively, the acyl-ACP reductase can be one selected from those described herein, or a variant thereof. Then, the fatty aldehyde substrate can be converted into a fatty alcohol by, for example, a gene encoding a fatty alcohol biosynthetic polypeptide of the present invention. In some example, a gene encoding a fatty alcohol biosynthetic polypeptide described herein can be expressed in a host cell that expresses
an endogenous fatty alcohol biosynthetic polypeptide capable of converting a fatty aldehyde produced by the fatty aldehyde biosynthetic polypeptide into a corresponding fatty alcohol. In other instances, a gene encoding a fatty alcohol biosynthetic polypeptide described herein, such as an amino acid sequence selected from SEQ ID NO: l , 3, 5, 7, 9, 1 1 , 13, 15, 17, 19, 21 , 23, 25, 27, 29, 31 , 33, 35, 37, or 39, or a variant thereof. In certain embodiments, the fatty alcohol biosynthetic polypeptide described herein can be encoded by a polynucleotide comprising a sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40, or a variant thereof.
[00144] In yet a further embodiment, the fatty alcohol biosynthetic polypeptide can be one selected from an AdhP homolog of FIG. 2, a DkgA homolog of FIG. 3, a DkgB homolog of FIG. 4, a Tas homolog of FIG. 5, an RspB homolog of FIG. 6, a YahK homolog of FIG. 7, a YbbO homolog of FIG. 8, a YbdH homolog of FIG. 9, a YbdR homolog of FIG. 10, a YgfF homolog of FIG. 1 1 , a YhdH homomolg of FIG. 12, a YjgB homolog of FIG. 13, an AroB homolog of FIG. 14, a YcjQ homolog of FIG. 15, a YdbC Homolog of FIG. 16, a YdjG homolog of FIG. 17, a YeaE homolog of FIG. 18, aYncB homolog of FIG. 19, a YqhD homolog of FIG. 20, a YdjL homolog of FIG. 21 , or a variant thereof. In certain embodiments, the gene encoding a fatty alcohol biosynthetic polypeptide can be co-expressed in a host cell with a gene encoding a fatty aldehyde biosynthetic polypeptide or with a gene encoding an acyl-ACP reductase polypeptide described herein.
[00145] In certain embodiment, the gene has a nucleotide sequence selected from those described herein, as well as polynucleotide variants thereof. In exemplary embodiments, the fatty alcohol biosynthetic gene is one encoding an AdhP homolog of FIG. 2, as well as polynucleotide variants thereof. In other exemplary embodiments, the fatty alcohol biosynthetic gene is one encoding a DkgA homolog of FIG. 3, or one encoding a DkgB homolog of FIG. 4, or one encoding a Tas homolog of FIG. 5, or one encoding a RspB homolog of FIG. 6, or one encoding a YahK homolog of FIG. 7, or one encoding a YbbO homolog of FIG. 8, or one encoding a YbdH homolog of FIG. 9, or one encoding a YbdR homolog of FIG. 10, or one encoding a YgfF homolog of FIG. 1 1 , or one encoding a YhdH homolog of FIG. 12, or one encoding a YjgB homolog of FIG. 13, or one encoding an AroB homolog of FIG. 14, or one encoding the YcjQ homolog of FIG. 15, or one encoding a YdbC homolog of FIG. 16, or one
encoding a YdjG homolog of FIG. 17, or one encoding a YeaE homolog of FIG. 18, or one encoding a YncB homolog of FIG. 19, or one encoding a YqhD homolog of FIG. 20, or one encoding a YdjL homolog of FIG. 21 , or a variant thereof, can be used as a fatty alcohol biosynthetic polynucleotide in the methods described herein.
[00146] Suitable variants, such as those listed in, for example, FIGs. 2-21 , can be identified using bioinformatic tools such as described hereinbelow.
Synthesis of Substrates
[00147] Fatty acid synthase (FAS) is a group of polypeptides that catalyze the initiation and elongation of acyl chains (Marrakchi et al, Biochemical Society, 30: 1050-1055, 2002). The acyl carrier protein (ACP) along with the enzymes in the FAS pathway control the length, degree of saturation, and branching of the fatty acid derivatives produced. The fatty acid biosynthetic pathway involves the precursors acetyl-CoA and malonyl-CoA. The steps in this pathway are catalyzed by enzymes of the fatty acid biosynthesis (fab) and acetyl-CoA carboxylase (ace) gene families (see, e.g., Heath et al , Prog. Lipid Res. 40(6):467-97 (2001)).
[00148] Host cells can be engineered to express fatty acid derivative substrates by
recombinantly expressing or overexpressing one or more fatty acid synthase genes, such as acetyl-CoA and/or malonyl-CoA synthase genes. For example, to increase acetyl-CoA production, one or more of the following genes can be expressed in a host cell: pdh (a multi enzyme complex comprising aceEF (which encodes the Elp dehydrogenase component, the E2p dihydrolipoamide acyltransferase component of the pyruvate and 2-oxoglutarate dehydrogenase complexes, and Ipd), panK, fabH, fabB, fabD, fabG, acpP, and fabF. Exemplary GenBank accession numbers for these genes are: pdh (BAB34380, AAC73227, AAC73226), panK (also known as CoA, AAC76952), aceEF (AAC73227, AAC73226), /¾H (AAC74175), fabB (P0A953), fabD (AAC74176), fabG (AAC74177), acpP (AAC74178),/ab (AAC74179). Additionally, the expression levels of fadE, gpsA, IdhA, pflb, adhE, pta, poxB, ackA, and/or ackB can be attenuated or knocked-out in an engineered host cell by transformation with conditionally replicative or non-replicative plasmids containing null or deletion mutations of the corresponding
genes or by substituting promoter or enhancer sequences. Exemplary GenBank accession numbers for these genes are: fadE (AAC73325), gspA (AAC76632), IdhA (AAC74462), pflb (AAC73989), adhE (AAC74323J, pta {AAC15351), poxB (AAC73958), ackA (AAC75356), and ackB (BAB81430). The resulting host cells will have increased acetyl-CoA production levels when grown in an appropriate environment.
[00149] Malonyl-CoA overexpression can be affected by introducing accABCD {e.g., accession number AAC73296, EC 6.4.1.2) into a host cell. Fatty acids can be further overexpressed in host cells by introducing into the host cell a DNA sequence encoding a lipase {e.g., accession numbers CAA89087, CAA98876).
[00150] In addition, inhibiting PlsB can lead to an increase in the levels of long chain acyl- ACP, which will inhibit early steps in the pathway {e.g., accABCD, fabH, and fabl). The plsB {e.g., accession number AAC7701 1) D31 IE mutation can be used to increase the amount of available fatty acids.
[00151] In addition, a host cell can be engineered to overexpress a sfa gene (suppressor of fabA, e.g., accession number AAN79592) to increase production of monounsaturated fatty acids (Rock et al, J. Bacteriology 178:5382-5387, 1996).
[00152] The chain length of a fatty acid derivative substrate can be selected for by modifying the expression of selected thioesterases (EC 3.1. 2.14 or EC 3.1.1.5). The thioesterase influences the chain length of fatty acids produced. Hence, host cells can be engineered to express, overexpress, have attenuated expression, or not to express one or more selected thioesterases to increase the production of a preferred fatty acid derivative substrate. For example, C10 fatty acids can be produced by expressing a thioesterase that has a preference for producing Cio fatty acids and attenuating thioesterases that have a preference for producing fatty acids other than Cio fatty acids {e.g., a thioesterase which prefers to produce Ci4 fatty acids). This would result in a relatively homogeneous population of fatty acids that have a carbon chain length of 10. In other instances, C)4 fatty acids can be produced by attenuating endogenous thioesterases that produce non-Cn fatty acids and expressing the thioesterases that use C14-ACP. In some situations, C 12 fatty acids can be produced by expressing thioesterases that use C12-ACP and attenuating thioesterases that produce non-Ci2 fatty acids. Acetyl-CoA, malonyl-CoA, and fatty acid overproduction can be verified using methods known in the art, for example, by using
radioactive precursors, HPLC, or GC-MS subsequent to cell lysis. Non-limiting examples of thioesterases that can be used in the methods described herein are listed in Table 1.
[00153] Table 1: Thioesterases
Mayer et al. , BMC Plant Biology 7:1-11, 2007
[00154] In other instances, a fatty alcohol biosynthetic polypeptide, variant, or a fragment thereof, is expressed in a host cell that contains a naturally occurring mutation that results in an increased level of fatty acids in the host cell. In some instances, the host cell is genetically engineered to increase the level of fatty acids in the host cell relative to a corresponding wild- type host cell. For example, the host cell can be genetically engineered to express a reduced level of an acyl-CoA synthase (EC 2.3.1.86) relative to a corresponding wild-type host cell. For example, the host cell can be genetically engineered to express a reduced level of an acyl-CoA synthase relative to a corresponding wild-type host cell. In one embodiment, the level of expression of one or more genes {e.g., an acyl-CoA synthase gene) is reduced by genetically engineering a "knock out" host cell.
[00155] Any known acyl-CoA synthase gene can be reduced or knocked out in a host cell. Non-limiting examples of acyl-CoA synthase genes include fa dD,fadK, BH3103, yhfL, Pfl-4354, EA V15023,fadDl adD2, RPCJ074 adDD35,fadDD22 aa3p or the gene encoding the protein ZP_01644857. Specific examples of acyl-CoA synthase genes include fadDD35 from M. tuberculosis H37Rv [NP_217021], fa dDD22 from M. tuberculosis H37Rv [ΝΡ_217464],/αί/£> from E. coli [NP_416319], fadK from E. coli [Y?_4\62\ 6], fadD from Acinetobacter sp. ADPl [Y?_045024], fadD from Haemophilus influenza RdkW20 [NP_ 438551],/ac Z) from
Rhodopseudomonas palustris Bis B18 [YP_533919], BH3101 from Bacillus halodurans C-125 [NP_243969], Pfl-4354 from Pseudomonas fluorescens Pfo-1 [YP_350082], EA V15023 from Comamonas testosterone KF- 1 [ZP_01520072], yhfL from B. subtilis [NP_388908],/ d£>7 from P. aeruginosa PAOl [NP_251989], fadDl from Ralstonia solanacearum GM1 1000
[NP_520978],/ad£>2 from P. aeruginosa PAOl [NP_251990], the gene encoding the protein ZP_01644857 from Stenotrophomonas maltophilia R551-3, faa3p from Saccharomyces cerevisiae [NP_012257],,/¾ //? from Saccharomyces cerevisiae [NP_014962], IcfA from
Bacillus subtilis [CAA99571 ], or those described in Shockey et al, Plant. Physiol. 129: 1710- 1722, 2002; Caviglia et al, J. Biol. Chem. 279: 1 163-1 169, 2004; Knoll et al. , J. Biol. Chem. 269(23): 16348-56, 1994; Johnson et al. , J. Biol. Chem. 269: 18037-18046, 1994; and Black et al., J. Biol Chem. 267: 25513-25520, 1992.
Fatty Aldehyde Substrates
[00156] Fatty aldehyde biosynthetic polypeptides refer to a group of polypeptides that can catalyze the enzymatic conversion of suitable fatty acid substrates into fatty aldehydes. Host cells can be engineered to express fatty aldehyde substrates by recombinantly expressing or overexpressing one or more fatty aldehyde biosynthetic genes, such as carboxylic acid reductases or fatty acid reductases.
[00157] In this pathway, a fatty acid is first activated by ATP and then reduced by a carboxylic acid reductase (CAR)-like enzyme to generate a fatty aldehyde. In some
embodiments, a fatty aldehyde is produced by expressing a fatty aldehyde biosynthetic gene, for example, a carboxylic acid reductase gene (car gene), having a nucleotide sequence provided herein, as well as nucleotide variants thereof. Examplary genes encode a polypeptide comprising SEQ ID NO: 41 , 43, 45, 47, 49, 51 , 53, 55, 57, 59, 61 , 63, 65, 69, 71 , 73, 75, 77, 79, 81 , 83, 85,
87, 89, 91 , 93, 97, 99, 101 , 103, 105, 107, 109, 1 1 1 , 1 13, 1 15, 1 17, 1 19, 121 , 123, 125, 127, or a variant thereof. In another example, the gene can comprise a polynucleotide sequence of SEQ ID NO: 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86,
88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, or 128, or a variant thereof. In further embodiments, the fatty aldehyde biosynthetic polypeptide can comprise one or more of the amino acid motifs depicted herein in SEQ ID NO: 129-135. For example, the fatty aldehyde biosynthetic gene can encode a polypeptide comprising SEQ ID
NO: 129, SEQ ID NO: 130, SEQ ID NO:131 , and SEQ ID NO: 132; SEQ ID NO: 133; SEQ ID NO:134; SEQ ID NO: 135; SEQ ID NO: 136; and/or SEQ ID NO:129, SEQ ID NO:131, SEQ ID NO: 132, and SEQ ID NO: 133.
[00158] Alternatively, fatty aldehyde substrates can be produced using an enzymatic pathway involving an acyl-ACP reductase. In some embodiments, a fatty aldehyde can be produced from a suitable substrate, including, for example, an acyl-ACP, an acyl-CoA, or others, by expressing an acyl-ACP reductase gene (aar gene), having a nucleotide sequence provided herein, as well as nucleotide variants thereof. For example, the acyl-ACP reductase gene can encode a polypeptide comprising SEQ ID NO: 155, 156, 157, 158, 159, 160, 161 , 162, 163, 164, or 165.
[00159] Other substrates that can be used to produce fatty aldehydes and fatty alcohols in the methods described herein are acyl-ACP, acyl-CoA, fatty aldehydes, or fatty alcohols, which are described in, for example, PCT/US08/058788, the disclosure of which is incorporated herein by reference.
Fatty Acid Degradation Enzymes
[00160] In some embodiments, the host cell is genetically engineered to express an attenuated level of a fatty acid degradation enzyme relative to a wild type host cell. In some embodiments, the host cell is genetically engineered to express an attenuated level of an acyl-CoA synthase (EC 2.3.1.86) relative to a wild type host cell. In particular embodiments, the host cell expresses an attenuated level of an acyl-CoA synthase encoded by fadD,fadK, BH3103, yhfL, Pfl-4354, EA V15023,fadDl,fadD2, RPC_4074,fadDD35,fadDD22,faa3p or the gene encoding the protein ZP_01644857. In certain embodiments, the genetically engineered host cell comprises a knockout of one or more genes encoding a fatty acid degradation enzyme, such as the
aforementioned acyl-CoA synthase genes.
[00161] In yet other embodiments, the host cell is genetically engineered to express an attenuated level of a dehydratase/isomerase enzyme, such as an enzyme encoded by fabA or by a gene listed in the table of FIG. 22. In some embodiments, the host cell comprises a knockout of fabA or a gene listed in the table of FIG. 22. In other embodiments, the host cell is genetically engineered to express an attenuated level of a ketoacyl-ACP synthase, such as an enzyme encoded by fabB or by a gene listed in the table of FIG. 23. In certain embodiments, the host cell
comprises a knockout of fabB or a gene listed in the table of FIG. 23. In yet other embodiments, the host cell is genetically engineered to express a modified level of a gene encoding a desaturase enzyme, such as desA.
Formation of Branched Fatty Alcohols
[00162] Fatty alcohols can be produced from fatty aldehydes substrates that contain branched points by using a fatty alcohol biosynthetic polypeptide as described herein. In turn, the branched fatty aldehydes can be made from branched fatty acid derivatives as substrates for a fatty aldehyde biosynthetic polypeptide as described herein. For example, although E.coli naturally produces straight chain fatty acids (sFAs), E.coli can be engineered to produce branched chain fatty acids (brFAs) by introducing and expressing or overexpressing genes that provide branched precursors in the E.coli {e.g., bkd, ilv, icm, and fab gene families).
Additionally, a host cell can be engineered to express or overexpress genes encoding proteins for the elongation of brFAs {e.g., ACP, FabF, etc.) and/or to delete or attenuate the corresponding host cell genes that normally lead to sFAs.
Fatty Alcohol Saturation Levels
[00163] The degree of saturation in fatty acids (which can then be converted into fatty aldehydes and then fatty alcohols as described herein) can be controlled by regulating the degree of saturation of fatty acid intermediates. For example, the sfa, gns, and fab families of genes can be expressed, overexpressed, or expressed at reduced levels, to control the saturation of fatty acids. Non-limiting examples of these genes include sfa [GenBank Accession No. AAN 79592, AAC 44390], gnsA [GenBank Accession No. ABD 18647.1], gnsB [GenBank Accession No. AAC 74076A],fabB [GenBank Accession No. BAA 16180, EC 23.1.41], fabK [GenBank Accession No. AAF 98273, EC \ .3A .9],fabL [GenBank Accession No. AAG 39821, EC 1.3.1.9], or fabM [GenBank Accession No. DAA 05501, EC 4.2.1.17].
[00164] For example, host cells can be engineered to produce unsaturated fatty acids by engineering the production host to overexpress fabB or by growing the production host at low temperatures {e.g., less than 37 °C). FabB has preference to cis- 3decenoyl-ACP and results in unsaturated fatty acid production in E. coli. Overexpression of fabB results in the production of a significant percentage of unsaturated fatty acids (de Mendoza et al, J. Biol. Chem. 258:2098-
2101, 1983). The gene fabB may be inserted into and expressed in host cells not naturally having the gene. These unsaturated fatty acids can then be used as intermediates in host cells that are engineered to produce fatty acid derivatives, such as fatty aldehydes.
[00165] In other instances, a repressor of fatty acid biosynthesis, for example, fabR (GenBank accession NP 418398 ), can be deleted, which will also result in increased unsaturated fatty acid production in E. coli (Zhang et al, J. Biol. Chem. 277: 15558, 2002). Similar deletions may be made in other host cells. A further increase in unsaturated fatty acids may be achieved, for example, by overexpressing «0 (trans-2, cis-3-decenoyl-ACP isomerase, GenBank accession DAA05501) and controlled expression of fabK (trans-2-enoyl-ACP reductase II, GenBank accession NP_357969) from Streptococcus pneumoniae (Marrakchi et al, J. Biol. Chem. 277: 44809, 2002), while deleting E. coli fabl (trans-2-enoyl-ACP reductase, GenBank accession NP_415804). In some examples, the endogenous fabF gene can be attenuated, thus increasing the percentage of palmitoleate (CI 6: 1) produced.
[00166] In yet other examples, host cells can be engineered to produce saturated fatty acids by reducing the expression of an sfa, gns, and/or fab gene.
Formation of Cyclic Fatty Alcohols
[00167] Cyclic fatty alcohols can be produced from cyclic fatty aldehydes using cyclic fatty acid derivatives as substrates for a fatty aldehyde biosynthetic polypeptide described herein. To produce cyclic fatty acid derivative substrates, genes that provide cyclic precursors {e.g., the ans, chc, and plm gene families) can be introduced into the host cell and expressed to allow initiation of fatty acid biosynthesis from cyclic precursors.
FATTY ALDEHYDE BIOSYNTHETIC GENES AND POLYPEPTIDES.
[00168] In some embodiments, the microorganism is further engineered to express a modified level of a gene encoding a fatty aldehyde biosynthesis polypeptide. In certain embodiments, modifying the expression of a gene encoding a fatty aldehyde biosynthesis polypeptide includes expressing a gene encoding a fatty aldehyde biosynthetic enzyme and/or increasing the expression or activity of an endogenous fatty aldehyde biosynthetic enzyme. In some embodiments, the fatty aldehyde biosynthesis gene encodes a carboxylic acid reductase. In further embodiments, the fatty aldehyde biosynthetic gene encodes a fatty acid reductase.
[00169] In particular embodiments, the fatty aldehyde biosynthetic polypeptide comprises the amino acid sequence of SEQ ID NO: 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119,
121, 123, 125, 127, or a variant thereof. In some embodiments, the fatty aldehyde biosynthetic polypeptide comprises an amino acid sequence having at least about 80% (e.g., at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%) sequence identity to the amino acid sequence of SEQ ID NO: 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, or 127.
[00170] In another embodiment, the fatty aldehyde biosynthetic polypeptide is encoded by a polynucleotide having the sequence of SEQ ID NO:42, 44, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, or 128, or by a variant thereof. In some embodiments, the fatty aldehyde biosynthetic polypeptide is encoded by a polynucleotide having at least 80% sequence identity to the sequence of SEQ ID NO:42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, or 128. In some embodiments, the method further comprises expressing a gene encoding a fatty aldehyde biosynthesis polypeptide in the host cell. In particular embodiments, the fatty aldehyde biosynthetic polypeptide comprises the amino acid sequence of SEQ ID NO: 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 69, 71, 73, 75, 77, 79, 81,83, 85, 87, 89,91,93,97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, or a variant thereof. In some embodiment, the fatty aldehyde biosynthetic polypeptide comprises an amino acid sequence having at least about 80% sequence identity to SEQ ID NO: 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 97,99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, or 127. In another embodiment, the fatty aldehyde biosynthetic polypeptide is encoded by a polynucleotide having the sequence of SEQ ID NO: 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78,80, 82, 84, 86, 88,90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120,
122, 124, 126, 128, or by a variant thereof. In some embodiments, the fatty aldehyde
biosynthetic polypeptide is encoded by a polynucleotide having at least about 80%> sequence
identity to SEQ ID NO: 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, or 128. In further embodiments, the method comprises expressing a gene encoding a fatty aldehyde biosynthetic polypeptide comprising one or more of the amino acid motifs provided herein. For example, the fatty aldehyde biosynthetic gene can encode a polypeptide comprising SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131 , and SEQ ID NO: 132; SEQ ID NO: 133 ; SEQ ID NO: 134; SEQ ID NO: 135; SEQ ID NO: 136; and/or SEQ ID NO: 129, SEQ ID NO: 131 , SEQ ID NO: 132 and SEQ ID NO: 133. SEQ ID NO: 131 includes a reductase domain; SEQ ID NO: 132 includes an NADP-binding domain; SEQ ID NO: 133 includes a
phosphopantetheine attachment site; and SEQ ID NO: 134 includes an AMP-binding domain.
ACYL-ACP REDUCTASE GENES AND POLYPEPTIDES.
[00171] In certain other embodiments, the invention further includes expressing in a host cell a gene encoding an acyl-ACP reductase polypeptide in the host cell. In some embodiments, the acyl-ACP reductase polypeptide comprises the amino acid sequence of SEQ ID NO: 137, 139, 141 , 143, 145, 147, 149, 151 , 153, or a variant thereof. In some embodiments, the acyl-ACP reductase polypeptide comprises an amino acid sequence that has at least about 70% (e.g., at least about 70%, at least about 75%>, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or at least about 99%) sequence identity to SEQ ID NO: 137, 139, 141 , 143, 145, 147, 149, 151 , or 153. In another embodiment, the acyl-ACP reductase polypeptide is encoded by a polynucleotide having the sequence of SEQ ID NO: 138, 140, 142, 144, 146, 148, 150, 152, or 154, or by a variant thereof. In some embodiments, the acyl-ACP reductase polypeptide is encoded by a
polynucleotide having at least about 70% sequence identity to the sequence of SEQ ID NO: 138, 140, 142, 144, 146, 148, 150, 152, or 154.
[00172] In yet further embodiments, the method includes expressing in a host cell an acyl- ACP reductase gene encoding a polypeptide comprising one or more of the amino acid motifs disclosed herein. For example, the polypeptide can comprise one or more of SEQ ID NO: 155, 156, 157, 158, 159, 160, 161 , 162, 163, 164, or 165.
HYDROCARBON BIOSYNTHETIC GENES AND POLYPEPTIDES.
[00173] The compositions and methods described herein can be used to produce
hydrocarbons, including, for example, alkanes and alkenes, from an appropriate substrate.
[00174] The invention is based, at least in part, on the identification of a number of fatty alcohol biosynthetic enzymes or polypeptides that are capable of catalyzing the conversion of a fatty aldehyde to a fatty alcohol under suitable conditions, for example, in the presence of suitable substrates and/or co-factors. One or more of these fatty alcohol biosynthetic
polypeptides can be attenuated or deleted from the host cell, which expresses or overexpresses one or more hydrocarbon biosynthetic polypeptides, optionally also expresses or overexpresses one or more fatty aldehyde biosynthetic polypeptides or one or more acyl-ACP reductases. The resulting host cell can be used to produce hydrocarbons such as, for example, alkanes or alkenes. In certain embodiments, the hydrocarbons are produced by a biosynthetic pathway depicted in FIG. IB. In this pathway, a fatty acid is first activated by ATP and then reduced by a fatty aldehyde biosynthetic polypeptide such as a carboxylic acid reductase (CAR)-like enzyme to generate a fatty aldehyde. The fatty aldehyde can then be subject to a hydrocarbon biosynthetic polypeptide such as a decarbonylase and be reduced into a hydrocarbon. In certain other embodiments, hydrocarbons are produced by an alternative biosynthesis pathway depicted in FIG. IB. In this pathway, an acyl-ACP is converted into a fatty aldehyde catalyzed by an acyl- ACP reductase. The fatty aldehyde is further subject to a hydrocarbon biosynthetic polypeptide and converts to a hydrocarbon such as an alkane or an alkene. In both of these pathways, the fatty aldehydes can, in the presence of endogenous fatty alcohol biosynthetic enzyme activity, be converted into fatty alcohols. Therefore, attenuating one or more fatty alcohol biosynthetic polypeptides, or in particular embodiments, deleting one or more fatty alcohol biosynthetic polypeptides from the host cell can improve the production of hydrocarbons. In some embodiments, the method further includes culturing the host cell in the presence of at least one biological substrate of the hydrocarbon biosynthetic polypeptide, the fatty aldehyde biosynthetic polypeptide, and/or the acyl-ACP reductase polypeptide. Exemplary suitable substrates include, without limitation, a fatty acid derivative, an acyl-ACP, a fatty acid, an acyl-CoA, a fatty aldehyde, a fatty alcohol, or a fatty ester.
[00175] In another aspect, the invention features a method of producing a hydrocarbon, the method comprising expressing an attenuated level of one or more fatty alcohol biosynthetic genes or a mutant and variant thereof in a host cell. In certain embodiments, the method further comprises deleting one or more fatty alcohol biosynthetic genes or a mutant and variant thereof from the host cell. Fatty alcohol biosynthetic genes, polypeptides, sequence motifs, mutants and variants thereof, are described hereinabove.
[00176] In certain other embodiments, the host cell is engineered such that it comprises no detectable level of fatty alcohol biosynthetic enzyme activity, for example, a fatty aldehyde reductase activity, an alcohol dehydrogenase activity, an aldo-keto reductase activity, an oxidoreductase activity, or a short-chain dehydrogenase activity.
[00177] In some embodiments, the method further comprises expressing a gene encoding a hydrocarbon biosynthetic polypeptide in the host cell. In particular embodiments, the hydrocarbon biosynthetic polypeptide comprises the amino acid sequence of SEQ ID NO: 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, or a variant thereof. In some embodiments, the hydrocarbon biosynthetic polypeptide comprises at least about 70% sequence identity to SEQ ID NO: 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, or 200. In another embodiment, the hydrocarbon biosynthetic polypeptide is encoded by a polynucleotide having the sequence of SEQ ID NO: 167, 169, 171, 173, 175, 177, 179, 181 , 183, 185, 187, 189, 191, 193, 195, 197, 199, or 201, or by a variant thereof. In some embodiments, the hydrocarbon biosynthetic polypeptide is encoded by a polynucleotide having at least about 70% sequence identity to SEQ ID NO: 167, 169, 171 , 173, 175, 177, 179, 181 , 183, 185, 187, 189, 191, 193, 195, 197, 199, or 201. In further
embodiments, the method comprises expressing a gene encoding a hydrocarbon biosynthetic polypeptide comprising one or more amino acid motifs disclosed herein. For example, the hydrocarbon biosynthetic polypeptide can comprise the amino acid sequence motifs of: (1) SEQ ID NO: 202; or (2) SEQ ID NO: 203 or SEQ ID NO:204, or SEQ ID NO:205; or (3) SEQ ID NO:206, and any one of SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205; or (4) SEQ ID NO:207 or SEQ ID NO:208, or SEQ ID NO:209, or SEQ ID NO:210. In certain embodiments, the hydrocarbon biosynthetic polypeptide has decarbonylase activity. In some embodiments, the method further comprises isolating the hydrocarbon from the host cell.
[00178] In some embodiments, the method further comprises expressing a gene encoding a fatty aldehyde biosynthesis polypeptide in the host cell. Fatty aldehyde biosynthetic genes, polypeptides, sequence motifs, mutants, and variants thereof are described hereinabove.
[00179] In any of the aspects of the invention described herein, the method can produce hydrocarbons. In some embodiments, the hydrocarbon produced is an alkane. In some embodiments, the alkane is a C3-C25 alkane. For example, the alkane is a C3, C4, C5, C6, C7, C8, C9, Cio, Cn, C12, Ci3, C]4, C]5, Cj6, Cn, Ci8, C19, C2o, C2i , C22, C23, C24, or C25 alkane. In some embodiments, the alkane is tridecane, methyltridecane, nonadecane, methylnonadecane, heptadecane, methylheptadecane, pentadecane, or methylpentadecane.
[00180] In certain embodiments, the method further comprising culturing the host cell in the presence of a saturated fatty acid derivative, and the hydrocarbon produced is an alkane. In certain embodiments, the saturated fatty acid derivative is a C6-C26 fatty acid derivative substrate. For example, the fatty acid derivative substrate is a C6, C7, C8, C9, Cio, Cn, C12, Q3, C14, C15, Ci6, Cn, C18, Ci9, C20, C2i, C22, C23, C24, C25, or a C26 fatty acid derivative substrate. In particular embodiments, the fatty acid derivative substrate is 2-methylicosanal, icosanal, octadecanal, tetradecanal, 2-methyloctadecanal, stearaldehyde, or palmitaldehyde.
[00181] In some embodiments, the method further includes isolating the alkane from the host cell or from the culture medium. In certain embodiments, the method further includes cracking or refining the alkane.
[00182] In any of the aspects of the invention herein, the hydrocarbon carbon produced can be an alkene. In some embodiments, the alkene is a C3-C25 alkene. For example, the alkene is a C3, C4, C5, C6, C7, C8, C9, Cio, Cn, C]2, C13, CM, Ci5, Ci6, C\i, Cj , C19, C2o, C21, C22, C23, C24, or C2 alkene. In some embodiments, the alkene is pentadecene, heptadecene, methylpentadecene, or methylheptadecene.
[00183] In some embodiments, the alkene is a straight chain alkene, a branched chain alkene, or a cyclic alkene.
[00184] In certain embodiments, the method further comprises culturing the host cell in the presence of an unsaturated fatty acid derivative, and the hydrocarbon produced is an alkene. In certain embodiments, the unsaturated fatty acid derivative is a C6-C26 fatty acid derivative
substrate. For example, the fatty acid derivative substrate is a C6, C7, C8, C9, Ci0, Q 1 , C12, C13, C14, C]5, Ci6, C]7, Ci8; Ci 9, C20, C2i, C22, C23, C24, C25, or a C26 unsaturated fatty acid derivative substrate. In particular embodiments, the fatty acid derivative substrate is octadecenal, hexadecenal, methylhexadecenal, or methyloctadecenal.
[00185] In another aspect, the invention features a genetically engineered microorganism wherein the microorganism produces an increased level of a hydrocarbon relative to a wild-type microorganism.
[00186] In another aspect, the invention features a method of making a hydrocarbon, the method comprising culturing a genetically engineered microorganism described herein under conditions suitable for gene expression, and isolating the hydrocarbon. In certain embodiments, the method comprising culturing the genetically engineered microorganism in the presence of a suitable biological substrate for the hydrocarbon biosynthetic polypeptide, the fatty aldehyde biosynthetic polypeptide, and/or the acyl-ACP reductase.
[00187] In some embodiments, the biological substrate is a fatty acid derivative, an acyl-ACP, a fatty acid, an acyl-CoA, a fatty aldehyde, a fatty alcohol, or a fatty ester.
[00188] In some embodiments, the substrate is a saturated fatty acid derivative, and the hydrocarbon produced is an alkane, for example, a C3-C25 alkane. For example, the alkane is a C3, C4, C5, C6, C7, C8, C¾ C]0, C11 , C12, Co, Ci4, C15, C16, C17, Ci8, C19, C20, C21, C22, C23, C24, or C25 alkane. In some embodiments, the alkane is tridecane, methyltridecane, nonadecane, methylnonadecane, heptadecane, methylheptadecane, pentadecane, or methylpentadecane.
[00189] In some embodiments, the alkane is a straight chain alkane, a branched chain alkane, or a cyclic alkane.
[00190] In some embodiments, the saturated fatty acid derivative is 2-methylicosanal, icosanal, octadecanal, tetradecanal, 2-methyloctadecanal, stearaldehyde, or palmitaldehyde.
[00191] In other embodiments, the biological substrate is an unsaturated fatty acid derivative and the hydrocarbon produced by the microorganism is an alkene, for example, a C3-C25 alkene. For example, the alkene is a C3, C4, C5, C6, C7, C8, C9, d0, Cn, C]2, C13, C]4, C]5, C]6, C] 7, Ci8, Ci9, C20, C2i, C22, C23, C24, or C25 alkene. In some embodiments, the alkene is pentadecene, heptadecene, methylpentadecene, or methylheptadecene.
[00192] In some embodiments, the alkene is a straight chain alkene, a branched chain alkene, or a cyclic alkene. In some embodiments, the unsaturated fatty acid derivative is octadecenal, hexadecenal, methylhexadecenal, or methyloctadecenal.
[00193] In another aspect, the invention features a hydrocarbon produced by any of the methods or microorganisms described herein. In particular embodiments, the hydrocarbon is an alkane or an alkene having a 613C of about -15.4 or greater. In certain embodiments, the alkane or alkene has a 613C of about -15.4 to about -10.9, or of about -13.92 to about -13.84.
[00194] In other embodiments, the alkane or alkene has an fjvi14C of at least about 1.003. In certain embodiments, the alkene or alkene has an fMi4C of at least about 1.01 or at least about 1.5. In some embodiments, the alkane or alkene has an fM14C of about 1.1 11 to about 1.124.
[00195] In another aspect, the invention features a biofuel comprising a hydrocarbon produced by any of the methods or microorganisms described herein. In particular embodiments, the hydrocarbon is an alkane or an alkene having a 513C of about -15.4 or greater. In exemplary embodiments, the alkane or alkene has a δ C of about -15.4 to about -10.9, or of about -13.92 to about -13.84. In other embodiments, the alkane or alkene has an fM14C of at least about 1.003. For example, the alkane or alkene has an fM14C of at least about 1.003. For example, the alkane or alkene has an fMl4C of at least about 1.01 or at least about 1.5. In some embodiments, the alkane or alkene has an fwi14C of about 1.111 to about 1.124.
[00196] In any of the aspects described herein, a hydrocarbon is produced in a host cell or a microorganism described herein from a carbon source.
Variants
[00197] As used herein, a "variant" of polypeptide X refers to a polypeptide having the amino acid sequence of peptide X in which one or more amino acid residues is altered. The variant may have conservative changes or nonconservative changes. Guidance in determining which amino acid residues may be substituted, inserted, or deleted without affecting biological activity may be found using computer programs well known in the art, for example, LASERGENE software (DNASTAR). The term "variant," when used in the context of a polynucleotide sequence, may encompass a polynucleotide sequence related to that of a gene or the coding sequence thereof. This definition may also include, for example, "allelic," "splice," "species," or
"polymorphic" variants. A splice variant may have significant identity to a reference polynucleotide, but will generally have a greater or fewer number of polynucleotides due to alternative splicing of exons during mRNA processing. The corresponding polypeptide may possess additional functional domains or an absence of domains. Species variants are polynucleotide sequences that vary from one species to another. The resulting polypeptides generally will have significant amino acid identity relative to each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given species.
[00198] Suitable variants, such as those described herein, for example in FIGs. 2-21 , can be identified using bioinformatic tools such as searching for the "bidirectional best hits" against the public databases, such as for example, the Kyoto Encyclopedia of Gene & Genomes (KEGG) database, and selecting bidirectional best hits having a Smith-Waterman score of, for example, above 1000. Other bioinformatics tools known to those skilled in the art, including for example, a bi-directional blast against known genome databases and the E.coli genome, can also be used for this purpose to identify homologs.
[00199] Variants can be naturally occurring or created in vitro. In particular, such variants can be created using genetic engineering techniques, such as site directed mutagenesis, random chemical mutagenesis, Exonuclease III deletion procedures, or standard cloning techniques. Alternatively, such variants, fragments, analogs, or derivatives can be created using chemical synthesis or modification procedures.
[00200] Methods of making variants are well known in the art. These include procedures in which nucleic acid sequences obtained from natural isolates are modified to generate nucleic acids that encode polypeptides having characteristics that enhance their value in industrial or laboratory applications. In such procedures, a large number of variant sequences having one or more nucleotide differences with respect to the sequence obtained from the natural isolate are generated and characterized. Typically, these nucleotide differences result in amino acid changes with respect to the polypeptides encoded by the nucleic acids from the natural isolates.
[00201] For example, variants can be created using error prone PCR (see, e.g., Leung et al, Technique 1 : 11-15, 1989; and Caldwell et al, PCR Methods Applic. 2:28-33, 1992). In error prone PCR, PCR is performed under conditions where the copying fidelity of the DNA
polymerase is low, such that a high rate of point mutations is obtained along the entire length of the PCR product. Briefly, in such procedures, nucleic acids to be mutagenized (e.g., a fatty aldehyde biosynthetic polynucleotide sequence), are mixed with PCR primers, reaction buffer, MgCl2, MnCl2, Taq polymerase, and an appropriate concentration of dNTPs for achieving a high rate of point mutation along the entire length of the PCR product. For example, the reaction can be performed using 20 fmoles of nucleic acid to be mutagenized (e.g., a fatty aldehyde biosynthetic polynucleotide sequence), 30 pmole of each PCR primer, a reaction buffer comprising 50 mM C1, 10 mM Tris HC1 (pH 8.3), and 0.01 % gelatin, 7 mM MgCl2, 0.5 mM MnCl2, 5 units of Taq polymerase, 0.2 mM dGTP, 0.2 mM dATP, 1 mM dCTP, and 1 mM dTTP. PCR can be performed for 30 cycles of 94°C for 1 min, 45°C for 1 min, and 72°C for 1 min. However, it will be appreciated that these parameters can be varied as appropriate. The mutagenized nucleic acids are then cloned into an appropriate vector and the activities of the polypeptides encoded by the mutagenized nucleic acids are evaluated.
[00202] Variants can also be created using oligonucleotide directed mutagenesis to generate site-specific mutations in any cloned DNA of interest. Oligonucleotide mutagenesis is described in, for example, Reidhaar-Olson et al, Science 241 :53-57, 1988. Briefly, in such procedures a plurality of double stranded oligonucleotides bearing one or more mutations to be introduced into the cloned DNA are synthesized and inserted into the cloned DNA to be mutagenized (e.g. , a fatty aldehyde biosynthetic polynucleotide sequence). Clones containing the mutagenized DNA are recovered, and the activities of the polypeptides they encode are assessed.
[00203] Another method for generating variants is assembly PCR. Assembly PCR involves the assembly of a PCR product from a mixture of small DNA fragments. A large number of different PCR reactions occur in parallel in the same vial, with the products of one reaction priming the products of another reaction. Assembly PCR is described in, for example, U.S. Pat. No. 5,965,408.
[00204] Still another method of generating variants is sexual PCR mutagenesis. In sexual PCR mutagenesis, forced homologous recombination occurs between DNA molecules of different, but highly related, DNA sequence in vitro as a result of random fragmentation of the DNA molecule based on sequence homology. This is followed by fixation of the crossover by
primer extension in a PCR reaction. Sexual PCR mutagenesis is described in, for example, Stemmer, PNAS, USA 91 : 10747-10751 , 1994.
[00205] Recursive ensemble mutagenesis can also be used to generate variants. Recursive ensemble mutagenesis is an algorithm for protein engineering (i.e., protein mutagenesis) developed to produce diverse populations of phenotypically related mutants whose members differ in amino acid sequence. This method uses a feedback mechanism to control successive rounds of combinatorial cassette mutagenesis. Recursive ensemble mutagenesis is described in, for example, Arkin et al, PNAS, USA 89:781 1 -7815, 1992.
[00206] In some embodiments, variants are created using exponential ensemble mutagenesis. Exponential ensemble mutagenesis is a process for generating combinatorial libraries with a high percentage of unique and functional mutants, wherein small groups of residues are randomized in parallel to identify, at each altered position, amino acids which lead to functional proteins.
Exponential ensemble mutagenesis is described in, for example, Delegrave et al , Biotech. Res. 1 1 : 1548-1552, 1993. Random and site-directed mutageneses are described in, for example, Arnold, Curr. Opin. Biotech. 4:450-455, 1993.
[00207] In some embodiments, variants are created using shuffling procedures wherein portions of a plurality of nucleic acids that encode distinct polypeptides are fused together to create chimeric nucleic acid sequences that encode chimeric polypeptides as described in, for example, U.S. Pat. Nos. 5,965,408 and 5,939,250.
[00208] Polynucleotide variants also include nucleic acid analogs. Nucleic acid analogs can be modified at the base moiety, sugar moiety, or phosphate backbone to improve, for example, stability, hybridization, or solubility of the nucleic acid. Modifications at the base moiety include deoxyuridine for deoxythymidine and 5-methyl-2'-deoxycytidine or 5-bromo-2'- doxycytidine for deoxycytidine. Modifications of the sugar moiety include modification of the 2' hydroxyl of the ribose sugar to form 2'-0-methyl or 2'-0-allyl sugars. The deoxyribose phosphate backbone can be modified to produce morpholino nucleic acids, in which each base moiety is linked to a six-membered, morpholino ring, or peptide nucleic acids, in which the deoxyphosphate backbone is replaced by a pseudopeptide backbone and the four bases are retained. (See, e.g. , Summerton et al , Antisense Nucleic Acid Drug Dev. (1997) 7: 187-195; and Hyrup et al , Bioorgan. Med. Chem. (1996) 4:5-23.) In addition, the deoxyphosphate backbone
can be replaced with, for example, a phosphorothioate or phosphorodithioate backbone, a phosphoroamidite, or an alkyl phosphotriester backbone.
[00209] Biosynthetic polypeptide variants can be variants in which one or more amino acid residues are substituted with a conserved or non-conserved amino acid residues. In preferred embodiments, biosynthetic polypeptide variants are variants in which one or more amino acid residues are substituted with a conserved amino acid residue. Such substituted amino acid residue may or may not be one encoded by a genetic code.
[00210] Conservative substitutions are those that substitute a given amino acid in a
polypeptide by another amino acid of similar characteristics. Typical conservative substitutions are the following replacements: replacement of an aliphatic amino acid, such as alanine, valine, leucine, and isoleucine, with another aliphatic amino acid; replacement of a serine with a threonine or vice versa; replacement of an acidic residue, such as aspartic acid and glutamic acid, with another acidic residue; replacement of a residue bearing an amide group, such as asparagine and glutamine, with another residue bearing an amide group; exchange of a basic residue, such as lysine and arginine, with another basic residue; and replacement of an aromatic residue, such as phenylalanine and tyrosine, with another aromatic residue.
[00211] Other polypeptide variants are those in which one or more amino acid residues include a substituent group. Still other polypeptide variants are those in which the polypeptide is associated with another compound, such as a compound to increase the half-life of the polypeptide (e.g., polyethylene glycol).
[00212] Additional polypeptide variants are those in which additional amino acids are fused to the polypeptide, such as a leader sequence, a secretory sequence, a proprotein sequence, or a sequence which facilitates purification, enrichment, or stabilization of the polypeptide.
[00213] In some instances, the polypeptide variants retain the same biological function as a the native polypeptide, for example, retain fatty alcohol biosynthetic activity, such as fatty aldehyde reductase, alcohol dehydrogenase, aldo-keto reductase, short-chain alcohol
dehydrogenases, or oxidoreductase activity or retain fatty aldehyde biosynthetic activity, such as carboxylic acid or fatty acid reductase activity, and have amino acid sequences substantially identical thereto.
[00214] In other instances, the polypeptide variants have at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%), at least about 85%», at least about 90%>, at least about 91 %, at least about 92%», at least about 93%, at least about 94%», at least about 95%, or more than about 95%» homology to the native or wild-type sequence. In another embodiment, the polypeptide variants include a fragment comprising at least about 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof.
[00215] The polypeptide variants or fragments thereof can be obtained by isolating nucleic acids encoding them using techniques described herein or by expressing synthetic nucleic acids encoding them. Alternatively, polypeptide variants or fragments thereof can be obtained through biochemical enrichment or purification procedures. The sequence of polypeptide variants or fragments can be determined by proteolytic digestion, gel electrophoresis, and/or
microsequencing. The sequence of the polypeptide variants or fragments can then be compared to the native or wild-type sequence using any of the programs described herein.
[00216] The polypeptide variants and fragments thereof can be assayed for fatty aldehydes producing activity, fatty alcohol producing activity or hydrocarbon producing activity using routine methods. For example, the polypeptide variants or fragment can be contacted with a substrate (e.g., a fatty acid or fatty aldehyde substrate) under conditions that allow the polypeptide variant to function. A decreased in the level of the substrate or an increase in the level of a fatty aldehydes, fatty alcohol or hydrocarbon, respectively, can be measured to determine the biological activity of the variant or fragment.
[00217] The terms "homolog," "homologue," and "homologous" as used herein refer to a polynucleotide or a polypeptide comprising a sequence that is at least about 80%» homologous to the corresponding polynucleotide or polypeptide sequence. One of ordinary skill in the art is well aware of methods to determine homology between two or more sequences. Briefly, calculations of "homology" between two sequences can be performed as follows. The sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and nonhomologous sequences can be disregarded for comparison purposes). In a preferred
embodiment, the length of a first sequence that is aligned for comparison purposes is at least
about 30%, preferably at least about 40%, more preferably at least about 50%, even more preferably at least about 60%, and even more preferably at least about 70%, at least about 80%, at least about 90%, or about 100% of the length of a second sequence. The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions of the first and second sequences are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position (as used herein, amino acid or nucleic acid "identity" is equivalent to amino acid or nucleic acid "homology"). The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps and the length of each gap, which need to be introduced for optimal alignment of the two sequences.
[00218] The comparison of sequences and determination of percent homology between two sequences can be accomplished using a mathematical algorithm, such as BLAST (Altschul et al, J. Mol Biol, 215(3): 403-410 (1990)). The percent homology between two amino acid sequences also can be determined using the Needleman and Wunsch algorithm that has been incorporated into the GAP program in the GCG software package, using either a Blossum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1 , 2, 3,4, 5, or 6 (Needleman and Wunsch, J. Mol Biol, 48: 444-453 (1970)). The percent homology between two nucleotide sequences also can be determined using the GAP program in the GCG software package, using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1 , 2, 3, 4, 5, or 6. One of ordinary skill in the art can perform initial homology calculations and adjust the algorithm parameters accordingly. A preferred set of parameters (and the one that should be used if a practitioner is uncertain about which parameters should be applied to determine if a molecule is within a homology limitation of the claims) are a Blossum 62 scoring matrix with a gap penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5. Additional methods of sequence alignment are known in the biotechnology arts (see, e.g., Rosenberg, BMC Bioinformatics, 6: 278 (2005); Altschul et al., FEBS J., 272(20): 5101-5109 (2005)).
[00219] As used herein, the term "hybridizes under low stringency, medium stringency, high stringency, or very high stringency conditions" describes conditions for hybridization and
washing. Guidance for performing hybridization reactions can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1 - 6.3.6. Aqueous and nonaqueous methods are described in that reference and either method can be used. Specific hybridization conditions referred to herein are as follows: 1 ) low stringency hybridization conditions in 6X sodium chloride/sodium citrate (SSC) at about 45 °C, followed by two washes in 0.2X SSC, 0.1 % SDS at least at 50 °C (the temperature of the washes can be increased to 55 °C for low stringency conditions); 2) medium stringency hybridization conditions in 6X SSC at about 45 °C, followed by one or more washes in 0.2X SSC, 0.1 % SDS at 60 °C; 3) high stringency hybridization conditions in 6X SSC at about 45 °C, followed by one or more washes in 0.2.X SSC, 0.1 % SDS at 65 °C; and preferably 4) very high stringency hybridization conditions are 0.5M sodium phosphate, 7% SDS at 65 °C, followed by one or more washes at 0.2X SSC, 1 % SDS at 65 °C. Very high stringency conditions (4) are the prefen'ed conditions unless otherwise specified.
[00220] In some embodiments, the polypeptide is a fragment of any of the polypeptides described herein. The term "fragment" refers to a shorter portion of a full-length polypeptide or protein ranging in size from four amino acid residues to the entire amino acid sequence minus one amino acid residue. In certain embodiments of the invention, a fragment refers to the entire amino acid sequence of a domain of a polypeptide or protein (e.g., a substrate binding domain or a catalytic domain).
[00221] In some embodiments, the polypeptide is a mutant or a variant of any of the polypeptides described herein. The terms "mutant" and "variant" as used herein refer to a polypeptide having an amino acid sequence that differs from a wild-type polypeptide by at least one amino acid. For example, the mutant or variant can comprise one or more of the following conservative amino acid substitutions: replacement of an aliphatic amino acid, such as alanine, valine, leucine, and isoleucine, with another aliphatic amino acid; replacement of a serine with a threonine; replacement of a threonine with a serine; replacement of an acidic residue, such as aspartic acid and glutamic acid, with another acidic residue; replacement of a residue bearing an amide group, such as asparagine and glutamine, with another residue bearing an amide group; exchange of a basic residue, such as lysine and arginine, with another basic residue; and replacement of an aromatic residue, such as phenylalanine and tyrosine, with another aromatic
residue. In some embodiments, the mutant polypeptide has about 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, or more amino acid substitutions, additions, insertions, or deletions.
[00222] Preferred fragments or mutants of a polypeptide retain some or all of the biological function (e.g., enzymatic activity) of the corresponding wild-type polypeptide. In some embodiments, the fragment or mutant retains at least 75%, at least 80%, at least 90%, at least 95%, or at least 98% or more of the biological function of the corresponding wild-type polypeptide. In other embodiments, the fragment or mutant retains about 100% of the biological function of the corresponding wild-type polypeptide. Guidance in determining which amino acid residues may be substituted, inserted, or deleted without affecting biological activity may be found using computer programs well known in the art, for example, LASERGENE™ software (DNASTAR, Inc., Madison, WI).
[00223] In yet other embodiments, a fragment or mutant exhibits increased biological function as compared to a corresponding wild-type polypeptide. For example, a fragment or mutant may display at least a 10%, at least a 25%, at least a 50%, at least a 75%, or at least a 90%
improvement in enzymatic activity as compared to the corresponding wild-type polypeptide. In other embodiments, the fragment or mutant displays at least 100% (e.g., at least 200%, or at least 500%) improvement in enzymatic activity as compared to the corresponding wild-type polypeptide.
[00224] It is understood that the polypeptides described herein may have additional conservative or non-essential amino acid substitutions, which do not have a substantial effect on the polypeptide function. Whether or not a particular substitution will be tolerated (i.e., will not adversely affect desired biological function, such as DNA binding or enzyme activity) can be determined as described in Bowie et al. (Science, 247: 1306-1310 (1990)).
[00225] A "conservative amino acid substitution" is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline,
phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine), and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine).
[00226] In some embodiments, the fatty acid or fatty acid derivative biosynthetic polypeptide or polynucleotide is from a bacterium, a cyanobacterium, an algae, a plant, an insect, a yeast, a fungus, or a mammal. In certain embodiments, the polypeptide is from a mammalian cell, plant cell, insect cell, fungus cell, cyanobacterial cell, algal cell, bacterial cell, or any other organisms described herein.
Vectors and expression
[00227] In some embodiments, a polynucleotide (or gene) sequence is provided to the host cell by way of a recombinant vector, which comprises a promoter operably linked to the polynucleotide sequence. In certain embodiments, the promoter is a developmentally-regulated, an organelle-specific, a tissue-specific, an inducible, a constitutive, or a cell-specific promoter.
[00228] In some embodiments, the recombinant vector comprises at least one sequence selected from the group consisting of (a) an expression control sequence operatively coupled to the polynucleotide sequence; (b) a selection marker operatively coupled to the polynucleotide sequence; (c) a marker sequence operatively coupled to the polynucleotide sequence; (d) a purification moiety operatively coupled to the polynucleotide sequence; (e) a secretion sequence operatively coupled to the polynucleotide sequence; and (f) a targeting sequence operatively coupled to the polynucleotide sequence.
[00229] The expression vectors described herein include a polynucleotide sequence described herein in a form suitable for expression of the polynucleotide sequence in a host cell. It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression of polypeptide desired, etc. The expression vectors described herein can be introduced into host cells to produce polypeptides, including fusion polypeptides, encoded by the polynucleotide sequences as described herein.
[00230] Expression of genes encoding polypeptides in prokaryotes, for example, E. coli, is most often carried out with vectors containing constitutive or inducible promoters directing the expression of either fusion or non-fusion polypeptides. Fusion vectors add a number of amino
acids to a polypeptide encoded therein, usually to the amino- or carboxy- terminus of the recombinant polypeptide. Such fusion vectors typically serve one or more of the following three purposes: (1 ) to increase expression of the recombinant polypeptide; (2) to increase the solubility of the recombinant polypeptide; and (3) to aid in the purification of the recombinant polypeptide by acting as a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the recombinant polypeptide. This enables separation of the recombinant polypeptide from the fusion moiety after purification of the fusion polypeptide. Examples of such enzymes, and their cognate recognition sequences, include Factor Xa, thrombin, and enterokinase. Exemplary fusion expression vectors include pGEX (Pharmacia Biotech, Inc., Piscataway, NJ; Smith et al., Gene, 67: 31 -40 (1988)), pMAL (New England Biolabs, Beverly, MA), and pRITS (Pharmacia Biotech, Inc., Piscataway, N.J.), which fuse glutathione S-transferase (GST), maltose E binding protein, or protein A,
respectively, to the target recombinant polypeptide.
[00231] Examples of inducible, non-fusion E. coli expression vectors include pTrc (Amann et al, Gene (1988) 69:301 -315) and pET l i d (Studier et al , Gene Expression Technology:
Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 60-89). Target gene expression from the pTrc vector relies on host RNA polymerase transcription from a hybrid trp- lac fusion promoter. Target gene expression from the pET l id vector relies on transcription from a T7 gnlO-lac fusion promoter mediated by a coexpressed viral RNA polymerase (T7 gnl ). This viral polymerase is supplied by host strains BL21(DE3) or HMS 174(DE3) from a resident λ prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 promoter.
[00232] Suitable expression systems, for both prokaryotic and eukaryotic cells are well known in the art; see, e.g., Sambrook et al., "Molecular Cloning: A Laboratory Manual," second edition, Cold Spring Harbor Laboratory, (1989). Examples of inducible, non-fusion E. coli expression vectors include pTrc (Amann et al., Gene, 69: 301 -315 (1988)) and PET 1 Id (Studier et al., Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA, pp. 60- 89 (1990)). In certain embodiments, a polynucleotide sequence of the invention is operably linked to a promoter derived from bacteriophage T5.
[00233] In another embodiment, the host cell is a yeast cell. In this embodiment, the expression vector is a yeast expression vector. Examples of vectors for expression in yeast
include pYepSecl (Baldari et al, EMBO J., 6: 229-234 (1 87)), pMFa (Kurjan et al., Cell, 30: 933-943 (1982)), pJRY88 (Schultz et al., Gene, 54: 1 13-123 (1987)), pYES2 (Invitrogen Corp., San Diego, CA), and picZ (Invitrogen Corp., San Diego, CA).
[00234] Alternatively, a polypeptide described herein can be expressed in insect cells using baculovirus expression vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g. , Sf9 cells) include, for example, the pAc series (Smith et al , Mol. Cell Biol. (1983) 3 :2156-2165) and the pVL series (Lucklow et al. , Virology (1989) 170:31 -39).
[00235] In yet another embodiment, the nucleic acids described herein can be expressed in mammalian cells using a mammalian expression vector. Examples of mammalian expression vectors include pCDM8 (Seed, Nature (1987) 329:840) and pMT2PC (Kaufman et al , EMBO J. (1987) 6: 187-195). When used in mammalian cells, the expression vector's control functions can be provided by viral regulatory elements. For example, commonly used promoters are derived from polyoma, Adenovirus 2, cytomegalovirus, and Simian Virus 40. Other suitable expression systems for both prokaryotic and eukaryotic cells are described in chapters 16 and 17 of Sambrook et al, eds., Molecular Cloning: A Laboratory Manual. 2nd, ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989.
[00236] Vectors can be introduced into prokaryotic or eukaryotic cells via a variety of art- recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell. Suitable methods for transforming or transfecting host cells can be found in, for example, Sambrook et al. (supra).
[00237] For stable transformation of bacterial cells, it is known that, depending upon the expression vector and transformation technique used, only a small fraction of cells will take-up and replicate the expression vector. In order to identify and select these transformants, a gene that encodes a selectable marker (e.g., resistance to an antibiotic) can be introduced into the host cells along with the gene of interest. Selectable markers include those that confer resistance to drugs such as, but not limited to, ampicillin, kanamycin, chloramphenicol, or tetracycline.
Nucleic acids encoding a selectable marker can be introduced into a host cell on the same vector as that encoding a polypeptide described herein or can be introduced on a separate vector. Cells stably transformed with the introduced nucleic acid can be identified by growth in the presence of an appropriate selection drug.
[00238] Similarly, for stable transfection of mammalian cells, it is known that, depending upon the expression vector and transfection technique used, only a small fraction of cells may integrate the foreign DNA into their genome. In order to identify and select these integrants, a gene that encodes a selectable marker (e.g., resistance to an antibiotic) can be introduced into the host cells along with the gene of interest. Preferred selectable markers include those which confer resistance to drugs, such as G418, hygromycin, and methotrexate. Nucleic acids encoding a selectable marker can be introduced into a host cell on the same vector as that encoding a polypeptide described herein or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid can be identified by growth in the presence of an appropriate selection drug.
Host Cells
[00239] As used herein, a "host cell" is a cell used to produce a product described herein (e.g., a fatty aldehydes, a fatty alcohol or a hydrocarbon).
[00240] A host cell is referred to as an "engineered host cell" or a "recombinant host cell" if the expression of one or more polynucleotides or polypeptides in the host cell are altered or modified as compared to their expression in a corresponding wild-type host cell under the same conditions.
[00241] In any of the aspects of the invention described herein, the host cell can be selected from the group consisting of a eukaryotic plant, algae, cyanobacterium, green-sulfur bacterium, green non-sulfur bacterium, purple sulfur bacterium, purple non-sulfur bacterium, extremophile, yeast, fungus, engineered organisms thereof, or a synthetic organism. In some embodiments, the host cell is light dependent or fixes carbon. In some embodiments, the host cell is light dependent or fixes carbon. In some embodiments, the host cell has autotrophic activity.
[00242] Various host cells can be used to produce fatty aldehydes, fatty alcohols and hydrocarbons, as described herein. A host cell can be any prokaryotic or eukaryotic cell. For example, a gene encoding a polypeptide described herein (e.g., a fatty aldehyde biosynthetic polypeptide, or an acyl-ACP reductase polypeptide, and/or a fatty alcohol biosynthetic polypeptide) can be expressed in bacterial cells (such as E. coli), insect cells, yeast, or
mammalian cells (such as Chinese hamster ovary cells (CHO) cells, COS cells, VERO cells, BHK cells, HeLa cells, Cvl cells, MDCK cells, 293 cells, 3T3 cells, or PC 12 cells).
[00243] Exemplary host cells can be from the genus Escherichia, Bacillus, Lactobacillus, Rhodococcus, Pseudomonas, Aspergillus, Trichoderma, Neurospora, Fusarium, Humicola, Rhizomucor, Kluyveromyces, Pichia, Mucor, Myceliophtora, Penicillium, Phanerochaete, Pleurotus, Trametes, Chrysosporium, Saccharomyces, Schizosaccharomyces, Yarrowia, or Streptomyces.
[00244] In some embodiments, the host cell is a Gram-positive bacterial cell. In other embodiments, the host cell is a Gram-negative bacterial cell.
[00245] In some embodiments, the host cell is selected from the genus Escherichia, Bacillus, Lactobacillus, Rhodococcus, Pseudomonas, Aspergillus, Trichoderma, Neurospora, Fusarium, Humicola, Rhizomucor, Kluyveromyces, Pichia, Mucor, Myceliophtora, Penicillium,
Phanerochaete, Pleurotus, Trametes, Chrysosporium, Saccharomyces, Stenotrophamonas, Schizosaccharomyces, Yarrowia, or Streptomyces.
[00246] In certain embodiments, the host cell is a Bacillus lentus cell, a Bacillus brevis cell, a Bacillus stearothermophilus cell, a Bacillus licheniformis cell, a Bacillus alkalophilus cell, a Bacillus coagulans cell, a Bacillus circulans cell, a Bacillus pumilis cell, a Bacillus thuringiensis cell, a Bacillus clausii cell, a Bacillus megaterium cell, a Bacillus subtilis cell, or a Bacillus amyloliquefaciens cell.
[00247] In other embodiments, the host cell is a Trichoderma koningii cell, a Trichoderma viride cell, a Trichoderma reesei cell, a Trichoderma longibrachiatum cell, an Aspergillus awamori cell, an Aspergillus fumigates cell, an Aspergillus foetidus cell, an Aspergillus nidulans cell, an Aspergillus niger cell, an Aspergillus oryzae cell, a Humicola insolens cell, a Humicola lanuginose cell, a Rhodococcus opacus cell, a Rhizomucor miehei cell, or a Mucor michei cell.
[00248] In yet other embodiments, the host cell is a Streptomyces lividans cell or a
Streptomyces murinus cell.
[00249] In yet other embodiments, the host cell is an Actinomycetes cell. [00250] In some embodiments, the host cell is a Saccharomyces cerevisiae cell.
[00251] Additional host cells that can be used in the methods described herein are described in WO2009/1 1 1513 and WO2009/1 1 1672.
Transport Proteins
[00252] Transport proteins can export polypeptides and organic compounds {e.g. , fatty alcohols) out of a host cell. Many transport and efflux proteins serve to excrete a wide variety of compounds and can be naturally modified to be selective for particular types of hydrocarbons.
[00253] Non-limiting examples of suitable transport proteins are ATP-Binding Cassette (ABC) transport proteins, efflux proteins, and fatty acid transporter proteins (FATP). Additional non-limiting examples of suitable transport proteins include the ABC transport proteins from organisms such as Caenorhabditis elegans, Arabidopsis thalania, Alkaligenes eutrophus, and Rhodococcus erythropolis. Exemplary ABC transport proteins include, without limitation, CER5 [Accession No: Atl g 51510, AY734542, At3g 2190, or Atl g51460], AtMRP5 [Accession No. NP_171908], AmiS2 [Accession No: JC5491 ], and AtPGPl [Accession No: NP_181228]. Host cells can also be chosen for their endogenous ability to secrete organic compounds. The efficiency of organic compound production and secretion into the host cell environment (e.g., culture medium, fermentation broth) can be expressed as a ratio of intracellular product to extracellular product. In some examples, the ratio can be about 5: 1 , 4: 1 , 3 : 1 , 2: 1 , 1 : 1 , 1 :2, 1 :3, 1 :4, or 1 :5.
Fermentation
[00254] The production and isolation of fatty alcohols can be enhanced by employing beneficial fermentation techniques. One method for maximizing production while reducing costs is increasing the percentage of the carbon source that is converted to hydrocarbon products.
[00255] During normal cellular lifecycles, carbon is used in cellular functions, such as producing lipids, saccharides, proteins, organic acids, and nucleic acids. Reducing the amount of carbon necessary for growth-related activities can increase the efficiency of carbon source conversion to product. This can be achieved by, for example, first growing host cells to a desired density (for example, a density achieved at the peak of the log phase of growth). At such a point, replication checkpoint genes can be harnessed to stop the growth of cells. Specifically, quorum sensing mechanisms (reviewed in Camilli et al , Science 31 1 : 1 1 13, 2006; Venturi FEMS
Microbio. Rev. 30:274-291 , 2006; and Reading et al , FEMS Microbiol. Lett. 254: 1 - 1 1 , 2006) can be used to activate checkpoint genes, such as p53, p21, or other checkpoint genes.
[00250] Genes that can be activated to stop cell replication and growth in E. coli include umuDC genes. The overexpression of umuDC genes stops the progression from stationary phase to exponential growth (Murli et al, J. of Bad. 182: 1 127, 2000). UmuC is a DNA polymerase that can carry out translesion synthesis over non-coding lesions - the mechanistic basis of most UV and chemical mutagenesis. The umuDC gene products are involved in the process of translesion synthesis and also serve as a DNA sequence damage checkpoint. The umuDC gene products include UmuC, UmuD, umuD', UmuD'2C, UmuD'2, and UmuD2. Simultaneously, product-producing genes can be activated, thus minimizing the need for replication and maintenance pathways to be used while a fatty aldehyde is being made. Host cells can also be engineered to express umuC and umuD from E. coli in pBAD24 under the prpBCDE promoter system through de novo synthesis of this gene with the appropriate end-product production genes.
[00251] The percentage of input carbons converted to fatty alcohols can be a cost driver. The more efficient the process is (i.e., the higher the percentage of input carbons converted to fatty alcohols), the less expensive the process will be. For oxygen-containing carbon sources (e.g., glucose and other carbohydrate based sources), the oxygen must be released in the form of carbon dioxide. For every 2 oxygen atoms released, a carbon atom is also released leading to a maximal theoretical metabolic efficiency of approximately 34% (w/w) (for fatty acid derived products). This figure, however, changes for other organic compounds and carbon sources. Typical efficiencies in the literature are approximately less than 5%. Host cells engineered to produce fatty alcohols can have greater than about 1 , 3, 5, 10, 15, 20, 25, and 30% efficiency. In one example, host cells can exhibit an efficiency of about 10% to about 25%. In other examples, such host cells can exhibit an efficiency of about 25% to about 30%. In other examples, host cells can exhibit greater than 30% efficiency.
[00252] The host cell can be additionally engineered to express recombinant cellulosomes, such as those described in PCT application number PCT/US2007/003736. These cellulosomes can allow the host cell to use cellulosic material as a carbon source. For example, the host cell can be additionally engineered to express invertases (EC 3.2.1.26) so that sucrose can be used as a carbon source. Similarly, the host cell can be engineered using the teachings described in U.S. Patent Nos. 5,000,000; 5,028,539; 5,424,202; 5,482,846; and 5,602,030; so that the host cell can assimilate carbon efficiently and use cellulosic materials as carbon sources.
[00253] In one example, the fermentation chamber can enclose a fermentation that is undergoing a continuous reduction. In this instance, a stable reductive environment can be created. The electron balance can be maintained by the release of carbon dioxide (in gaseous form). Efforts to augment the NAD/H and NADP/H balance can also facilitate in stabilizing the electron balance. The availability of intracellular NADPH can also be enhanced by engineering the host cell to express an NADH:NADPH transhydrogenase. The expression of one or more NADH:NADPH transhydrogenases converts the NADH produced in glycolysis to NADPH, which can enhance the production of fatty alcohols.
[00254] For small scale production, the engineered host cells can be grown in batches of, for example, about 100 mL, 500 mL, 1 L, 2 L, 5 L, or 10 L; fermented; and induced to express desired fatty aldehyde biosynthetic genes and/or an alcohol dehydrogenase genes based on the specific genes encoded in the appropriate plasmids. For large scale production, the engineered host cells can be grown in batches of about 10 L, 100 L, 1000 L, 10,000 L, 100,000 L, 1,000,000 L or larger; fermented; and induced to express desired fatty aldehyde biosynthetic genes and/or alcohol dehydrogenase genes based on the specific genes encoded in the appropriate plasmids or incorporated into the host cell's genome.
[00255] For example, a suitable production host, such as E. coli cells, harboring plasmids containing the desired genes or having the genes integrated in its chromosome can be incubated in a suitable reactor, for example a 1 L reactor, for 20 hours at 37 °C in M9 medium
supplemented with 2% glucose, carbenicillin, and chloramphenicol. When the OD6oo of the culture reaches 0.9, the production host can be induced with IPTG alcohol. After incubation, the spent media can be extracted and the organic phase can be examined for the presence of fatty alcohols using GC-MS.
[00256] In some instances, after the first hour of induction, aliquots of no more than about 10% of the total cell volume can be removed each hour and allowed to sit without agitation to allow the fatty alcohols to rise to the surface and undergo a spontaneous phase separation or precipitation. The fatty alcohol component can then be collected, and the aqueous phase returned to the reaction chamber. The reaction chamber can be operated continuously. When the OD6oo drops below 0.6, the cells can be replaced with a new batch grown from a seed culture.
Producing Fatty Alcohols Using Cell-free Methods
[00257] In some methods described herein, a fatty alcohol can be produced using a purified polypeptide {e.g., a fatty alcohol biosynthetic polypeptide) described herein and a substrate (e.g., fatty aldehyde), produced, for example, by a method described herein. For example, a host cell can be engineered to express a fatty alcohol biosynthetic polypeptide or variant as described herein. The host cell can be cultured under conditions suitable to allow expression of the polypeptide. Cell free extracts can then be generated using known methods. For example, the host cells can be lysed using detergents or by sonication. The expressed polypeptides can be purified using known methods. After obtaining the cell free extracts, substrates described herein can be added to the cell free extracts and maintained under conditions to allow conversion of the substrates (e.g., fatty aldehydes) to fatty alcohols. The fatty alcohols can then be separated and purified using known techniques.
[00258] In some instances, a fatty aldehyde can be converted into a fatty alcohol by contacting the fatty aldehyde with a fatty alcohol biosynthetic polypeptide provided herein, or a variant thereof. In other instances, a fatty aldehyde can be converted into a fatty alcohol by contacting the fatty aldehyde with a fatty alcohol biosynthetic polypeptide that is an AdhP homolog of FIG. 2, a DkgA homolog of FIG. 3, a DkgB homolog of FIG. 4, a Tas homolog of FIG. 5, an RspB homolog of FIG. 6, a Yah homolog of FIG. 7, a YbbO homolog of FIG. 8, a YbdH homolog of FIG. 9, a YbdR homolog of FIG. 10, a YgfF homolog of FIG. 1 1 , a YhdH homolog of FIG. 12, a YjgB homolog of FIG. 13, an AroB homolog of FIG. 14, a YcjQ homolog of FIG. 15, a YdbC homolog of FIG. 16, a YdjG homolog of FIG. 17, a YeaE homolog of FIG. 18, a YncB homolog of FIG. 19, a YqhD homolog of FIG. 20, a YdjL homolog of FIG. 21 , or a variant thereof.
Post-Production Processing
[00259] The fatty alcohols produced during fermentation can be separated from the fermentation media. Any known technique for separating fatty alcohols from aqueous media can be used. One exemplary separation process is a two phase (bi-phasic) separation process. This process involves fermenting the genetically engineered host cells under conditions sufficient to produce fatty alcohols, allowing the fatty alcohol to collect in an organic phase, and separating the organic phase from the aqueous fermentation broth. This method can be practiced in both a batch and continuous fermentation processes.
[00260] Bi-phasic separation uses the relative immiscibility of fatty alcohols to facilitate separation. Immiscible refers to the relative inability of a compound to dissolve in water and is defined by the compound's partition coefficient. One of ordinary skill in the art will appreciate that by choosing a fermentation broth and organic phase, such that the fatty alcohol being produced has a high logP value, the fatty alcohol can separate into the organic phase, even at very low concentrations, in the fermentation vessel.
[00261] The fatty alcohols produced by the methods described herein can be relatively immiscible in the fermentation broth, as well as in the cytoplasm. Therefore, the fatty alcohol can collect in an organic phase either intracellularly or extracellularly. The collection of the products in the organic phase can lessen the impact of the fatty alcohol on cellular function and can allow the host cell to produce more product.
[00262] The methods described herein can result in the production of homogeneous compounds wherein at least about 60%, 70%, 80%, 90%, or 95% of the fatty alcohols produced will have carbon chain lengths that vary by less than about 6 carbons, less than about 4 carbons, or less than about 2 carbons. These compounds can also be produced with a relatively uniform degree of saturation. These compounds can be used directly as fuels, fuel additives, starting materials for production of other chemical compounds (e.g., polymers, surfactants, plastics, textiles, solvents, adhesives, etc.), or personal care additives. These compounds can also be used as feedstock for subsequent reactions, for example, hydrogenation, catalytic cracking (e.g., via hydro genati on, pyrolisis, or both), to make other products.
[00263] In some embodiments, the fatty alcohols produced using methods described herein can contain between about 50% and about 90% carbon; or between about 5% and about 25% hydrogen. In other embodiments, the fatty alcohols produced using methods described herein can contain between about 65% and about 85% carbon; or between about 10% and about 15% hydrogen.
[00264] In some embodiments, the host cell is a Gram-positive bacterial cell. In other embodiments, the host cell is a Gram-negative bacterial cell.
[0265] In some embodiments, the host cell is selected from the genus Escherichia, Bacillus, Lactobacillus, Rhodococcus, Pseudomonas, Aspergillus, Trichoderma, Neurospora, Fusarium, Humicola, Rhizomucor, Kluyveromyces, Pichia, Mucor, Myceliophtora, Penicillium,
Phanerochaete, Pleurotus, Trametes, Chrysosporium, Saccharomyces, Stenotrophamonas, Schizosaccharomyces, Yarrowia, or Streptomyces.
[0266] In other embodiments, the host cell is a Bacillus lentus cell, a Bacillus brevis cell, a Bacillus stearothermophilus cell, a Bacillus lichen formis cell, a Bacillus alkalophilus cell, a Bacillus coagulans cell, a Bacillus circulans cell, a Bacillus pumilis cell, a Bacillus thuringiensis cell, a Bacillus clausii cell, a Bacillus megaterium cell, a Bacillus subtilis cell, or a Bacillus amyloliquefaciens cell.
[0267] In other embodiments, the host cell is a Trichoderma koningii cell, a Trichoderma viride cell, a Trichoderma reesei cell, a Trichoderma longibrachiatum cell, an Aspergillus awamori cell, an Aspergillus fumigates cell, an Aspergillus foetidus cell, an Aspergillus nidulans cell, an Aspergillus niger cell, an Aspergillus oryzae cell, a Humicola insolens cell, a Humicola lanuginose cell, a Rhodococcus opacus cell, a Rhizomucor miehei cell, or a Mucor michei cell.
[0268] In yet other embodiments, the host cell is a Streptomyces lividans cell or a Streptomyces murinus cell.
[0269] In yet other embodiments, the host cell is an Actinomycetes cell.
[0270] In some embodiments, the host cell is a Saccharomyces cerevisiae cell. In some embodiments, the host cell is a Saccharomyces cerevisiae cell.
[0271] In still other embodiments, the host cell is a CHO cell, a COS cell, a VERO cell, a BHK cell, a HeLa cell, a Cvl cell, an MDCK cell, a 293 cell, a 3T3 cell, or a PC 12 cell.
[0272] In other embodiments, the host cell is a cell from an eukaryotic plant, algae,
cyanolacterium, green-sulfur bacterium, green non-sulfur bacterium, purple sulfur bacterium, purple non-sulfur bacterium, extremophile, yeast, fungus, an engineered organism thereof, or a synthetic organism. In some embodiments, the host cell is light-dependent or fixes carbon. In some embodiments, the host cell is light-dependent or fixes carbon. In some embodiments, the host cell has autotrophic activity. In some embodiments, the host cell has photoautotrophic activity, such as in the presence of light. In some embodiments, the host cell is heterotrophic or mixotrophic in the absence of light. In certain embodiments, the host cell is a cell from
Avabidopsis thaliana, Panicum virgatum, Miscanthus giganteus, Zea mays, Botryococcuse braunii, Chlamydomonas reinhardtii, Dunaliela salina, Synechococcus Sp. PCC 7002,
Synechococcus Sp. PCC 7942, Synechocystis Sp. PCC 6803, Thermosynechococcus elongates BP-1, Chlorobium tepidum, Chlorojlexus auranticus, Chromatiumm vinosum, Rhodospirillum rubrum, Rhodobacter capsulatus, Rhodopseudomonas palusris, Clostridium ljungdahlii, Clostridiuthermocellum, Penicillium chrysogenum, Pichia pastoris, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Pseudomonasjluorescens, or Zymomonas mobilis.
[0273] In certain preferred embodiments, the host cell is an E. coli cell. In some embodiments, the E. coli cell is a strain B, a strain C, a strain K, or a strain W E. coli cell.
[0274] In other embodiments, the host cell is a Pantoea citrea cell.
Production of Fatty Acids Deriviatives in Host Cells.
[0275] As used herein, the term "conditions permissive for the production" means any conditions that allow a host cell to produce a desired product, such as a fatty acid or a fatty acid derivative. Similarly, the term "conditions in which the polynucleotide sequence of a vector is expressed" means any conditions that allow a host cell to synthesize a polypeptide. Suitable conditions include, for example, fermentation conditions. Fermentation conditions can comprise many parameters, such as temperature ranges, levels of aeration, and media composition. Each of these conditions, individually and in combination, allows the host cell to grow. Exemplary culture media include broths or gels. Generally, the medium includes a carbon source that can be metabolized by a host cell directly
[00276] To determine if conditions are sufficient to allow production of a product or expression of a polypeptide, a host cell can be cultured, for example, for about 4, 8, 12, 24, 36, 48, 72, or more hours. During and/or after culturing, samples can be obtained and analyzed to determine if the conditions allow production or expression. For example, the host cells in the sample or the medium in which the host cells were grown can be tested for the presence of a desired product. When testing for the presence of a fatty acid or fatty acid derivative, assays, such as, but not limited to, MS, thin layer chromatography (TLC), high-performance liquid chromatography (HPLC), liquid chromatography (LC), GC coupled with a flame ionization detector (FID), GC-MS, and LC-MS can be used. When testing for the expression of a polypeptide, techniques such as, but not limited to, Western blotting and dot blotting may be used.
[00277] In the compositions and methods of the invention, the production and isolation of fatty acids and fatty acid derivatives can be enhanced by optimizing fermentation conditions. In some embodiments, fermentation conditions are optimized to increase the percentage of the carbon source that is converted to hydrocarbon products. During normal cellular lifecycles, carbon is used in cellular functions, such as producing lipids, saccharides, proteins, organic acids, and nucleic acids. Reducing the amount of carbon necessary for growth-related activities can increase the efficiency of carbon source conversion to product. This can be achieved by, for example, first growing host cells to a desired density (for example, a density achieved at the peak of the log phase of growth). At such a point, replication checkpoint genes can be harnessed to stop the growth of cells. Specifically, quorum sensing mechanisms (reviewed in Camilli et al., Science 31 1 : 1 1 13 (2006); Venturi, FEMS Microbiol. Rev., 30: 274-291 (2006); and Reading et al., FEMS Microbiol. Lett., 254: 1 -1 1 (2006)) can be used to activate checkpoint genes, such as p53, p21, or other checkpoint genes.
[00278] The host cell can be additionally engineered to express a recombinant cellulosome, which can allow the host cell to use cellulosic material as a carbon source. Exemplary cellulosomes suitable for use in the methods of the invention include, e.g, the cellulosomes described in International Patent Application Publication WO 2008/100251. The host cell also can be engineered to assimilate carbon efficiently and use cellulosic materials as carbon sources according to methods described in U.S. Patents 5,000,000; 5,028,539; 5,424,202; 5,482,846; and 5,602,030. In addition, the host cell can be engineered to express an invertase so that sucrose can be used as a carbon source.
[00279] In some embodiments of the fermentation methods of the invention, the fermentation chamber encloses a fermentation that is undergoing a continuous reduction, thereby creating a stable reductive environment. The electron balance can be maintained by the release of carbon dioxide (in gaseous form). Efforts to augment the NAD/H and NADP/H balance can also facilitate in stabilizing the electron balance. The availability of intracellular NADPH can also be enhanced by engineering the host cell to express an NADH:NADPH transhydrogenase. The expression of one or more NADH:NADPH transhydrogenases converts the NADH produced in glycolysis to NADPH, which can enhance the production of fatty aldehydes and fatty alcohols.
[00280] For small scale production, the engineered host cells can be grown in batches of, for example, about 100 mL, 500 mL, 1 L, 2 L, 5 L, or 10 L; fermented; and induced to express a desired polynucleotide sequence, such as a polynucleotide sequence encoding a PPTase. For large scale production, the engineered host cells can be grown in batches of about 10 L, 100 L, 1000 L, 10,000 L, 100,000 L, 1 ,000,000 L or larger; fermented; and induced to express a desired polynucleotide sequence.
[00281] The fatty acids and derivatives thereof produced by the methods of invention generally are isolated from the host cell. The term "isolated" as used herein with respect to products, such as fatty acids and derivatives thereof, refers to products that are separated from cellular components, cell culture media, or chemical or synthetic precursors. The fatty acids and derivatives thereof produced by the methods described herein can be relatively immiscible in the fermentation broth, as well as in the cytoplasm. Therefore, the fatty acids and derivatives thereof can collect in an organic phase either intracellularly or extracellularly. The collection of the products in the organic phase can lessen the impact of the fatty acid or fatty acid derivative on cellular function and can allow the host cell to produce more product.
[00282] In some embodiments, the fatty acids and fatty acid derivatives produced by the methods of invention are purified. As used herein, the term "purify," "purified," or
"purification" means the removal or isolation of a molecule from its environment by, for example, isolation or separation. "Substantially purified" molecules are at least about 60% free (e.g., at least about 70% free, at least about 75% free, at least about 85% free, at least about 90% free, at least about 95% free, at least about 97% free, at least about 99% free) from other components with which they are associated. As used herein, these terms also refer to the removal of contaminants from a sample. For example, the removal of contaminants can result in an increase in the percentage of a fatty aldehyde or a fatty alcohol in a sample. For example, when a fatty aldehyde or a fatty alcohol is produced in a host cell, the fatty aldehyde or fatty alcohol can be purified by the removal of host cell proteins. After purification, the percentage of a fatty acid or derivative thereof in the sample is increased.
[00283] As used herein, the terms "purify," "purified," and "purification" are relative terms which do not require absolute purity. Thus, for example, when a fatty acid or derivative thereof is produced in host cells, a purified fatty acid or derivative thereof is a fatty acid or derivative
thereof that is substantially separated from other cellular components (e.g., nucleic acids, polypeptides, lipids, carbohydrates, or other hydrocarbons).
[00284] Additionally, a purified fatty acid preparation or a purified fatty acid derivative preparation is a fatty acid preparation or a fatty acid derivative preparation in which the fatty acid or derivative thereof is substantially free from contaminants, such as those that might be present following fermentation. In some embodiments, a fatty acid or derivative thereof is purified when at least about 50% by weight of a sample is composed of the fatty acid or fatty acid derivative. In other embodiments, a fatty acid or derivative thereof is purified when at least about 60%, e.g., at least about 70%, at least about 80%, at least about 85%, at least about 90%, at least about 92% or more by weight of a sample is composed of the fatty acid or derivative thereof. Alternatively, or in addition, a fatty acid or derivative thereof is purified when less than about 100%, e.g., less than about 99%, less than about 98%, less than about 95%, less than about 90%, or less than about 80%) by weight of a sample is composed of the fatty acid or derivative thereof. Thus, a purified fatty acid or derivative thereof can have a purity level bounded by any two of the above endpoints. For example, a fatty acid or derivative thereof can be purified when at least about 80%-95%, at least about 85%-99%, or at least about 90%-98% of a sample is composed of the fatty acid or fatty acid derivative.
[00285] The fatty acid or derivative thereof may be present in the extracellular environment, or it may be isolated from the extracellular environment of the host cell. In certain embodiments, a fatty acid or derivative thereof is secreted from the host cell. In other embodiments, a fatty acid or derivative thereof is transported into the extracellular environment. In yet other embodiments, the fatty acid or derivative thereof is passively transported into the extracellular environment.
[00286] A fatty acid or derivative thereof can be isolated from a host cell using methods known in the art, such as those disclosed in International Patent Application Publications WO 2010/042664 and WO 2010/062480.
[00287] The methods described herein can result in the production of homogeneous compounds wherein at least about 60%, at least about 70%, at least about 80%>, at least about 90%, or at least about 95%, of the fatty acids or fatty acid derivatives produced will have carbon chain lengths that vary by less than 6 carbons, less than 5 carbons, less than 4 carbons, less than
3 carbons, or less than about 2 carbons. Alternatively, or in addition, the methods described herein can result in the production of homogeneous compounds wherein less than about 98%, less than about 95%, less than about 90%, less than about 80%, or less than about 70% of the fatty acids or fatty acid derivatives produced will have carbon chain lengths that vary by less than 6 carbons, less than 5 carbons, less than 4 carbons, less than 3 carbons, or less than about 2 carbons. Thus, the fatty acids or fatty acid derivatives can have a degree of homogeneity bounded by any two of the above endpoints. For example, the fatty acid or fatty acid derivative can have a degree of homogeneity wherein about 70%-95%, about 80%-98%, or about 90%-95% of the fatty acids or fatty acid derivatives produced will have carbon chain lengths that vary by less than 6 carbons, less than 5 carbons, less than 4 carbons, less than 3 carbons, or less than about 2 carbons. These compounds can also be produced with a relatively uniform degree of saturation.
[00288] As a result of the methods of the present invention, one or more of the titer, yield, or productivity of the fatty acid or derivative thereof produced by the engineered host cell having an altered level of expression of a FadR polypeptide is increased relative to that of the
corresponding wild-type host cell.
[00289] The term "titer" refers to the quantity of fatty acid or fatty acid derivative produced per unit volume of host cell culture. In any aspect of the compositions and methods described herein, a fatty acid or a fatty acid derivative such as a terminal olefin, a fatty aldehyde, a fatty alcohol, an alkane, a fatty ester, a ketone or an internal olefins is produced at a titer of about 25 mg/L, about 50 mg/L, about 75 mg/L, about 100 mg/L, about 125 mg/L, about 150 mg/L, about 175 mg/L, about 200 mg/L, about 225 mg/L, about 250 mg/L, about 275 mg/L, about 300 mg/L, about 325 mg/L, about 350 mg/L, about 375 mg/L, about 400 mg/L, about 425 mg/L, about 450 mg/L, about 475 mg/L, about 500 mg/L, about 525 mg/L, about 550 mg/L, about 575 mg/L, about 600 mg/L, about 625 mg/L, about 650 mg/L, about 675 mg/L, about 700 mg/L, about 725 mg/L, about 750 mg/L, about 775 mg/L, about 800 mg/L, about 825 mg/L, about 850 mg/L, about 875 mg/L, about 900 mg/L, about 925 mg/L, about 950 mg/L, about 975 mg/L, about 1000 g/L, about 1050 mg/L, about 1075 mg/L, about 1 100 mg/L, about 1 125 mg/L, about 1 150 mg/L, about 1 175 mg/L, about 1200 mg/L, about 1225 mg/L, about 1250 mg/L, about 1275 mg/L, about 1300 mg/L, about 1325 mg/L, about 1350 mg/L, about 1375 mg/L, about 1400 mg/L,
about 1425 mg/L, about 1450 mg/L, about 1475 mg/L, about 1500 mg/L, about 1525 mg/L, about 1550 mg/L, about 1575 mg/L, about 1600 mg/L, about 1625 mg/L, about 1650 mg/L, about 1675 mg/L, about 1700 mg/L, about 1725 mg/L, about 1750 mg/L, about 1775 mg/L, about 1800 mg/L, about 1825 mg/L, about 1850 mg/L, about 1875 mg/L, about 1900 mg/L, about 1925 mg/L, about 1950 mg/L, about 1975 mg/L, about 2000 mg/L, or a range bounded by any two of the foregoing values. In other embodiments, a fatty acid or fatty acid derivative is produced at a titer of more than 2000 mg/L, more than 5000 mg/L, more than 10,000 mg/L, or higher, such as 50 g/L, 70 g/L, 100 g/L, 120 g/L, 150 g/L, or 200 g/L.
[00290] The term "yield of the fatty acid or derivative thereof produced by a host cell" refers to the efficiency by which an input carbon source is converted to product (i.e., fatty acid or fatty acid derivative such as fatty alcohol or fatty ester) in a host cell. For oxygen-containing carbon sources (e.g., glucose and other carbohydrate based sources), the oxygen must be released in the form of carbon dioxide. For every 2 oxygen atoms released, a carbon atom is also released leading to a maximal theoretical metabolic efficiency of approximately 34% (w/w) (for fatty acid derived products). This figure, however, changes for other organic compounds and carbon sources. Typical yield reported in the literature are approximately less than 5%. Host cells engineered to produce fatty acids and fatty acid derivatives according to the methods of the invention can have a yield of at least about 3%, at least about 5%, at least about 10%, at least about 15%, at least about 18%, or at least about 20%. Alternatively, or in addition, the yield is about 30%) or less, about 27% or less, about 25% or less, or about 22%> or less. Thus, the yield can be bounded by any two of the above endpoints. For example, the yield of the fatty acid or derivative thereof produced by the engineered host cell according to the methods of the invention can be about 5% to about 25%, about 10% to about 25%, about 10% to about 22%, about 15% to about 27%, or about 18% to about 22%. In other embodiments, the yield is greater than 30%.
[00291] The term "productivity of the fatty acid or derivative thereof produced by a host cell" refers to the quantity of fatty acid or fatty acid derivative produced per unit volume of host cell culture per unit density of host cell culture. In any aspect of the compositions and methods described herein, the productivity of a fatty acid or a fatty acid derivative such as an olefin, a fatty aldehyde, a fatty alcohol, an alkane, a fatty ester, or a ketone produced by an engineered host cells is at least about at least about 3 mg/L/OD6oo, at least about 6 mg/L/OD6oo, at least
about 9 mg/L/OD6oo, at least about 12 mg/L/OD6oo, or at least about 15 mg/L/OD oo- Alternatively, or in addition, the productivity is about 50 mg/L/OD6oo or less, about 40 mg/L/OD6oo or less, about 30 mg/L/OD6oo or less, or about 20 mg/L/OD6oo or less. Thus, the productivity can be bounded by any two of the above endpoints. For example, the productivity can be about 3 to about 30 mg/L/OD6oo, about 6 to about 20 mg/L/OD6oo, or about 15 to about 30 mg/L/OD600.
[00292] In the compositions and methods of the invention, the production and isolation of a desired fatty acid or derivative thereof (e.g., acyl-CoA, fatty acids, terminal olefins, fatty aldehydes, fatty alcohols, alkanes, alkenes, wax esters, ketones and internal olefins) can be enhanced by altering the expression of one or more genes involved in the regulation of fatty acid, fatty ester, alkane, alkene, olefin fatty alcohol production, degradation and/or secretion in the engineered host cell.
Characterization and Utility of Fatty Acids and Derivatives Thereof
[00293] Bioproducts (e.g., fatty alcohols) comprising biologically produced organic compounds, particularly fatty alcohols biologically produced using the fatty acid biosynthetic pathway, have not been produced from renewable sources and, as such, are new compositions of matter.
[00294] The hydrocarbons (and/or fatty aldehydes) described herein can be used as or converted into a fuel or as a specialty chemical. One of ordinary skill in the art will appreciate that, depending upon the intended purpose of the fuel or specialty chemical, different
hydrocarbons (and/or fatty aldehydes) can be produced and used. For example, a branched hydrocarbon may be desirable for automobile fuels that are intended to be used in cold climates. In addition, when hydrocarbons are used as a feedstock for fuel and specialty chemical production, one of ordinarly skill in the art will appreciate that the characteristics of the hydrocarbon will affect the characteristics of the fuel or specialty chemicals produced. Hence the characteristics of the fuel or specialty chemical product can be selected for by producing particular hydrocarbons (and/or fatty aldehydes) for use as a feedstock.
[00295] Using the methods described herein, biofuels having desired fuel qualities can be produced from hydrocarbons (and/or fatty aldehydes). These thus represent a new source of biofuels, which can be used as jet fuels, diesel, or gasoline. Some biofuels made using
hydrocarbons (and/or fatty aldehydes) thus prepared have not been produced from renewable sources and are new compositions of matter. These new fuels or specialty chemicals can be distinguished from fuels or specialty chemicals derived from petrochemical carbon on the basis of dual carbon-isotopic fingerprinting. Additionally, the specific source of biosourced carbon (e.g. , glucose vs. glycerol) can be determined by dual carbon-isotopic fingerprinting (see, e.g., U.S. Patent No. 7,169,588, which is herein incorporated by reference).
[00296] The ability to distinguish bioproducts from petroleum based organic compounds is beneficial in tracking these materials in commerce. For example, organic compounds or chemicals comprising both biologically based and petroleum based carbon isotope profiles may be distinguished from organic compounds and chemicals made only of petroleum based materials. Hence, the instant materials may be followed in commerce on the basis of their unique carbon isotope profile.
[00297] These new bioproducts can be distinguished from organic compounds derived from petrochemical carbon on the basis of dual carbon-isotopic fingerprinting (13C/12C) or 14C dating. Additionally, the specific source of biosourced carbon (e.g., glucose vs. glycerol) can be determined by dual carbon-isotopic fingerprinting (see, e.g., U.S. Patent No. 7,169,588, which is herein incorporated by reference).
[00298] Bioproducts can be distinguished from petroleum based organic compounds by
13 12 13 12
comparing the stable carbon isotope ratio ( CI C) in each fuel. The CI C ratio in a given bioproduct is a consequence of the 13C/12C ratio in atmospheric carbon dioxide at the time the carbon dioxide is fixed. It also reflects the precise metabolic pathway. Regional variations also occur. Petroleum, C3 plants (the broadleaf), C4 plants (the grasses), and marine carbonates all show significant differences in 13C/12C and the corresponding 513C values. Furthermore, lipid matter of C3 and C4 plants analyze differently than materials derived from the carbohydrate components of the same plants as a consequence of the metabolic pathway.
[00299] Within the precision of measurement, 13C shows large variations due to isotopic fractionation effects, the most significant of which for bioproducts is the photosynthetic mechanism. The major cause of differences in the carbon isotope ratio in plants is closely associated with differences in the pathway of photosynthetic carbon metabolism in the plants, particularly the reaction occurring during the primary carboxylation (i.e. , the initial fixation of atmospheric C02). Two large classes of vegetation are those that incorporate the "C3"(or Calvin-
Benson) photosynthetic cycle and those that incorporate the "C4" (or Hatch-Slack) photosynthetic cycle.
[00300] Both C4 and C3 plants exhibit a range of 13C/12C isotopic ratios, but typical values are about -7 to about -13 per mil for C4 plants and about -19 to about -27 per mil for C3 plants (see, e.g., Stuiver et al, Radiocarbon 19:355, 1977). Coal and petroleum fall generally in this latter range. The 13C measurement scale was originally defined by a zero set by Pee Dee Belemnite (PDB) limestone, where values are given in parts per thousand deviations from this material. The "613C" values are expressed in parts per thousand (per mil), abbreviated, %o, and are calculated as follows:
513C (%o) = [(13C/12C) sample- (13C/12C) standard]/ ('¾/'¾) sta„dard * 1000
[00301] Since the PDB reference material (RM) has been exhausted, a series of alternative PvMs have been developed in cooperation with the IAEA, USGS, NIST, and other selected international isotope laboratories. Notations for the per mil deviations from PDB is δ C.
Measurements are made on C02 by high precision stable ratio mass spectrometry (IRMS) on molecular ions of masses 44, 45, and 46.
[00302] The compositions described herein include bioproducts produced by any of the methods described herein. Specifically, the bioproduct can have a δ C of about -28 or greater, about -27 or greater, -20 or greater, -18 or greater, -15 or greater, -13 or greater, -10 or greater, or
-8 or greater. For example, the bioproduct can have a δ 13 C of about -30 to about -15, about -27 to about -19, about -25 to about -21, about -15 to about -5, about -13 to about -7, or about -13 to about -10. In other instances, the bioproduct can have a 613C of about -10, -1 1, -12, or -12.3.
[00303] Bioproducts can also be distinguished from petroleum based organic compounds by comparing the amount of 14C in each compound. Because 14C has a nuclear half life of 5730 years, petroleum based fuels containing "older" carbon can be distinguished from bioproducts which contain "newer" carbon (see, e.g., Currie, "Source Apportionment of Atmospheric Particles", Characterization of Environmental Particles, J. Buffle and H. P. van Leeuwen, Eds., 1 of Vol. I of the IUPAC Environmental Analytical Chemistry Series (Lewis Publishers, Inc) (1992) 3-74).
[00304] The basic assumption in radiocarbon dating is that the constancy of 14C concentration in the atmosphere leads to the constancy of 14C in living organisms. However, because of atmospheric nuclear testing since 1950 and the burning of fossil fuel since 1850, 14C has
acquired a second, geochemical time characteristic. Its concentration in atmospheric C02, and hence in the living biosphere, approximately doubled at the peak of nuclear testing, in the mid- 1960s. It has since been gradually returning to the steady-state cosmogenic (atmospheric) baseline isotope rate (14C /12C) of about 1.2 x 10"12, with an approximate relaxation "half-life" of 7-10 years. (This latter half-life must not be taken literally; rather, one must use the detailed atmospheric nuclear input/decay function to trace the variation of atmospheric and biospheric 14C since the onset of the nuclear age.)
[00305] It is this latter biospheric 14C time characteristic that holds out the promise of annual dating of recent biospheric carbon. 14C can be measured by accelerator mass spectrometry (AMS), with results given in units of "fraction of modern carbon" ^M). fM is defined by National Institute of Standards and Technology (NIST) Standard Reference Materials (SRMs) 4990B and 4990C. As used herein, "fraction of modern carbon" or "fivi" has the same meaning as defined by National Institute of Standards and Technology (NIST) Standard Reference Materials (SRMs) 4990B and 4990C, known as oxalic acids standards HOxI and HOxII, respectively. The fundamental definition relates to 0.95 times the 14C /12C isotope ratio HOxI (referenced to AD 1950). This is roughly equivalent to decay-corrected pre- Industrial Revolution wood. For the current living biosphere (plant material), fM is approximately 1.1.
[00306] The compositions described herein include bioproducts that can have an fM 14C of at least about 1. For example, the bioproduct can have an fM 14C of at least about 1.01 , an fM 14C of about 1 to about 1.5, an fM 14C of about 1.04 to about 1.18, or an fM 14C of about 1.1 1 1 to about 1.124.
[00307] Another measurement of 14C is known as the percent of modern carbon, pMC. For an archaeologist or geologist using 14C dates, AD 1950 equals "zero years old". This also represents 100 pMC. "Bomb carbon" in the atmosphere reached almost twice the normal level in 1963 at the peak of thermo-nuclear weapons. Its distribution within the atmosphere has been approximated since its appearance, showing values that are greater than 100 pMC for plants and animals living since AD 1950. It has gradually decreased over time with today's value being near 107.5 pMC. This means that a fresh biomass material, such as corn, would give a 14C signature near 107.5 pMC. Petroleum based compounds will have a pMC value of zero.
Combining fossil carbon with present day carbon will result in a dilution of the present day pMC content. By presuming 107.5 pMC represents the 14C content of present day biomass materials
and 0 pMC represents the 14C content of petroleum based products, the measured pMC value for that material will reflect the proportions of the two component types. For example, a material derived 100% from present day soybeans would give a radiocarbon signature near 107.5 pMC. If that material was diluted 50% with petroleum based products, it would give a radiocarbon signature of approximately 54 pMC.
[00308] A biologically based carbon content is derived by assigning "100%" equal to 107.5 pMC and "0%" equal to 0 pMC. For example, a sample measuring 99 pMC will give an equivalent biologically based carbon content of 93%. This value is referred to as the mean biologically based carbon result and assumes all the components within the analyzed material originated either from present day biological material or petroleum based material.
[00309] A bioproduct described herein can have a pMC of at least about 50, 60, 70, 75, 80, 85, 90, 95, 96, 97, 98, 99, or 100. In other instances, a bioproduct described herein can have a pMC of between about 50 and about 100; about 60 and about 100; about 70 and about 100; about 80 and about 100; about 85 and about 100; about 87 and about 98; or about 90 and about 95. In yet other instances, a bioproduct described herein can have a pMC of about 90, 91 , 92, 93, 94, or 94.2.
[00310] The fatty alcohols described herein can be used as or converted into a surfactant or detergent composition. One of ordinary skill in the art will appreciate that, depending upon the intended purpose of the surfactant or detergent, different fatty alcohols can be produced and used. For example, when the fatty alcohols described herein are used as a feedstock for surfactant or detergent production, one of ordinary skill in the art will appreciate that the characteristics of the fatty alcohol feedstock will affect the characteristics of the surfactant or detergent produced. Hence, the characteristics of the surfactant or detergent product can be selected for by producing particular fatty alcohols for use as a feedstock.
[00311] Fuel additives are used to enhance the performance of a fuel or engine. For example, fuel additives can be used to alter the freezing/gelling point, cloud point, lubricity, viscosity, oxidative stability, ignition quality, octane level, and/or flash point. In the United States, all fuel additives must be registered with Environmental Protection Agency. The names of fuel additives and the companies that sell the fuel additives are publicly available by contacting the EPA or by viewing the agency's website. One of ordinary skill in the art will appreciate that the fatty
alcohol-based biofuels described herein can be mixed with one or more fuel additives to impart a desired quality.
[00312] The fatty alcohol-based surfactants and/or detergents described herein can be mixed with other surfactants and/or detergents well known in the art.
[00313] In some examples, the mixture can include at least about 10%, 15%, 20%, 30%, 40%, 50%, or 60% by weight of the fatty alcohol. In other examples, a surfactant or detergent composition can be made that includes at least about 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 90% or 95% of a fatty alcohol that includes a carbon chain that is 8, 10, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22 carbons in length. Such surfactant or detergent compositions can additionally include at least one additive selected from a surfactant; a microemulsion; at least about 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 90%, or 95% of surfactant or detergent from nonmicrobial sources such as plant oils or petroleum.
[00314] The hydrocarbon (and/or fatty aldehyde)-based biofuel described herein can be mixed with other fuels, such as various alcohols, such as ethanol and butanol, and petroleum derived products, such as gasoline, diesel, or jet fuel.
[00315] In some examples, the mixture can include at least about 10%, 15%, 20%>, 30%, 40%, 50%, or 60% by weight of the hydrocarbon (and/or fatty aldehydes). In other examples, a biofuel composition can be made that includes at least about 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%), 80%, 85%, 90% or 95% of a hydrocarbon such as an alkane or an alkene that includes a carbon chain that is 8, 10, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22 carbons in length. Such biofuel composition can additionally include at least one additive selected from a cloud point lowering additive that can lower the cloud point to less than about 5°C, or 0°C; a surfactant, a microemulsion; at least about 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%), 90%) or 95% diesel fuel from triglycerides; petroleum-derived gasoline; or diesel fuel from petroleum.
[00316] Although the foregoing has been described in some detail by way of illustration and example for purposes of clarity and understanding, it will be apparent to those skilled in the art that certain changes and modifications may be practiced. Various aspects of the invention have been achieved by a series of experiments, some of which are described by way of the following non-limiting examples. Therefore, the description and examples should not be construed as limiting the scope of the invention, which is delineated by the appended description of exemplary embodiments.
EXAMPLES
EXAMPLE 1
[00317] An AlrA enzyme from Acinetobacter sp. M-1 has been shown to catalyze the reduction of fatty aldehyde into fatty alcohols in vitro at neutral or low pH conditions. (Tani et al. Appl. Environ. Microbiol. 66(12):5231 -5 (2000)). However, E.coli fatty alcohol biosynthetic polypeptides, which are capable of catalyzing the reduction of fatty aldehydes to fatty alcohols, were not identified, although it has been reported that E.coli constitutively expresses such a reductase activity. (Naccarato et al, Lipids 9(6):419-28 (1974)). It had also been reported that the E.coli reductase activity was NADPH-dependent. Id.
[00318] A BLAST search of the Acinetobacter baylyi ADPl genomic and protein databases for homologs of Acinetobacter sp. M-1 AlrA revealed an Acinetobacter baylyi ADPl homolog, AlrAadpl (GenPept Accession Number CAG 70248.1), has about 79% identity to the
Acinetobacter sp. M-1 AlrA.
[00319] This example describes an experiment verifying that co-expression of a heterologous carboxylic acid reductase from Acinetobacter baylyi ADPl , AlrAadpl (a homolog of
Acinetobacter sp. M-1 AlrA) and a CarB homolog resulted in fatty alcohol production in E.coli.
CAR Plasmid Construction
[00320] Three E. coli expression plasmids were constructed to express the genes encoding the CAR homologs listed in Table 7.
Table 7: CAR-like Protein and the corresponding coding sequences.
[00321] The fadD9 gene was amplified from genomic DNA of Mycobacterium tuberculosis H37Rv (obtained from The University of British Columbia, and Vancouver, BC Canada) using the primers fadD9F and FadDR (see Table 8). The PCR product was first cloned into PCR-blunt (Invitrogen) and then released as an Ndel-Avrll fragment. The Ndel-Avrll fragment was then
cloned between the Ndel and ^vrll sites of pACYCDuet- 1 (Novogen) to generate pACYCDuet- l-fadD9.
[00322] The car A gene was amplified from the genomic DNA of Mycobacterium smegmatis MC2 155 (obtained from the ATCC (ATCC 23037D-5)) using primers CARMCaF and
CARMCaR (see Table 8). The carB gene was amplified from the genomic DNA of
Mycobacterium smegmatis MC2 155 (obtained from the ATCC (ATCC 23037D-5)) using primers CARMCbF and CARMCbR (see Table 8). Each PCR product was first cloned into PCR-blunt and then released as an Ndel-Avrll fragment. Each of the two fragments was then subcloned between the Ndel and ^4vrII sites of pACYCDuet- 1 (Novogen) to generate pACYCDuet- 1-carA and pACYCDuet- 1-carB.
Table 8: Primers used to amplify genes encoding CAR homologs
[00323] Construction of plasmid pETDuet-l-'tesA-alrAadpl was carried out with the protocol below. The plasmid pETDuet-l-'tesA-alrAadpl was prepared by inserting the alrAadpl gene (gene locus-tag= "ACIAD3612"), a homolog of Acinetobacter baylyi ADP1, into the Nc l and Hindlll sites of pETDuet-l-'tesA.
[00324] The gene alrAadpl was amplified from the genomic DNA of Acinetobacter baylyi ADP1 by a two-step PCR procedure. The first set of PCR reactions eliminated an internal Ncol site at bp 632-636 with the following primer pairs:
ADP1 Air mutl reverse:
5'-GACCACGTGATCGGCCCCCATAGCTTTGAGCTCATC (SEQ ID NO:217)
ADP1 Air] mutl forward:
5 ' -G ATG AGCTC AAAGCTATGGGGGCCG ATCACGTGGTC (SEQ ID NO:218)
[00325] The PCR products were then isolated, purified using the Qiagen gel extraction kit, and used as inputs for a second PCR reaction with the following primers to produce full-length AlrAadpl with a C->T mutation at position 633 :
NcoIADPl Alrl forward:
5 ' -AATACC ATGGC AAC AACTA ATGTG ATTC ATGCTT ATGCTGC A (SEQ ID NO:219) Hindlll ADP1 Alrl reverse:
5 ' -ATAAAAGCTTTTAAAA ATCGGCTTTAAGTACAATCCGATAAC (SEQ ID NO:220)
Evaluation of fatty alcohol production
[00326] In order to evaluate the affect of carboxylic acid reductases and alcohol dehydrogenases on the production of fatty alcohols, various combinations of the prepared plasmids were transformed in the E. coli strain C41 (DE3, AfadE) (described in
PCT/US08/058788).
[00327] For example, the plasmid pACYCDuet-l-carA, encoding the CAR homolog carA, was co-transformed with pETDuet-l-'tesA-alrAadpl (see, e.g., FIG. 27A). The plasmid pACYCDuet-l -carB, encoding the CAR homolog, carB, was co-transformed with pETDuet-1 - 'tesA. In addition, pACYCDuet-l-carB was also separately co-transformed with pETDuet-1 - 'tesA-alrAadpl . As a control, pACYCDuet-l -carB was co-transformed with the empty vector pETDuet-1 (see, e.g., FIG. 27A). The plasmid pACYCDuet-l -fadD9, encoding the CAR homolog fadD9, was also co-transformed with pETDuet-l -'tesA. In addition, pACYCDuet- 1- fadD9 was also separately co -transformed with pETDuet-l-'tesA-alrAadpl . As a control, pACYCDuet-1 - fadD9was co-transformed with the empty vector pETDuet-1 (see, e.g. , FIG. 27A).
[00328] The E. coli transformants were grown in 3 mL of LB medium supplemented with carbenicillin (100 mg/L) and chloramphenicol (34 mg/L) at 37 °C. After overnight growth, 15 μΕ of culture was transferred into 2 mL of fresh LB medium supplemented with carbenicillin and chloramphenicol. After 3.5 hours of growth, 2 mL of culture were transferred into a 125 mL flask containing 20 mL of M9 medium with 2% glucose and with carbenicillin and
chloramphenicol. When the OD 0o of the culture reached 0.9, 1 mM of IPTG was added to each flask. After 20 hours of growth at 37 °C, 20 mL of ethyl acetate (with 1 % of acetic acid, v/v)
was added to each flask to extract the fatty alcohols produced during the fermentation. The crude ethyl acetate extract was directly analyzed with GC/MS as described herein.
[00329] The expression of carA or carB with the leaderless tesA and alrAadpl resulted in fatty alcohol titers of greater than 700 mg/L and reduced fatty aldehyde production (see, e.g., FIG. 27A). Likewise, fadD9 co-expressed with the leaderless tesA and alrAadpl produced over 300 mg/L of fatty alcohol. When expressed without the leaderless tesA, neither carB nor fadD9 produced more than 10 mg/L of fatty alcohols (possibly resulting from the accumulation of free fatty acids in the cell due to endogenous tesA). Taken together, this data indicates that fatty acids are the substrates for these CAR homologs and that overexpression of a thioesterase, such as 'tesA (to release fatty acids from acyl-ACP), achieves significant production of fatty alcohols.
[00330] Depending upon the CAR homolog expressed in E. coli strain C41 (DE3, Δ/adE) (described below in Example 2), different mixtures of fatty alcohols were produced. Different compositions of fatty alcohols were observed among the three CAR homologs evaluated (see Table 9). FadD9 produced more Cj2 fatty alcohols relative to other fatty alcohols with carbon chain lengths greater than 12. Both CarA and CarB produced a wider range in chain length of fatty alcohols than was observed when expressing FadD9.
Table 9: Acyl-composition of fatty alcohols produced by recombinant E.coli strains.
* the leaderless TesA. C12, including C12:0 and C12: l fatty alcohols
Quantification and Identification of Fatty Alcohols
[00331] GC/MS was performed using an Agilent 5975B MSD system equipped with a 30mx0.25mm (0.1 Ομιη film) DB-5 column. The column temperature was 3 min isothermal at 100°C. The column was programmed to rise from 100 °C to 320 °C at a rate of 20 °C/min.
When the final temperature was reached, the column remained isothermal for 5 minutes at 320°C. The injection volume was 1 μί. The carrier gas, helium, was released at 1.3 mL/min. The mass spectrometer was equipped with an electron impact ionization source. The ionization source temperature was set at 300 °C.
[00332] Prior to quantification, various alcohols were identified using two methods. First, the GC retention time of each compound was compared to the retention time of a known standard, such as a cetyl alcohol, dodecanol, tetradecanol, octadecanol, or cis-9-octadecenol. Second, identification of each compound was confirmed by matching the compound's mass spectrum to a standard's mass spectrum in the mass spectra library (e.g., C12:0, C12: l , C13:0, C14:0, C14:l , C15:0. C16:0, C16: l , C17:0, C18:0 and C18:l alcohols).
EXAMPLE 2
[00333] This example describes the identification of a fatty alcohol biosynthetic polypeptide, YjgB, in E.coli.
[00334] E. coli contains multiple enzymes that catalyze the reversible oxidoreduction of fatty aldehydes and fatty alcohols. A BLAST search and comparison of the E.coli K12 genomic and protein databases for homologs of Acinetobacter sp. M-l AlrA revealed that the E.coli enzyme YjgB might be the closest homolog with an about 57% sequence identity. This example sought to verify the fatty alcohol biosynthetic activity of E.coli YjgB by overexpressing YjgB with a CarB in E.coli and measure the accumulation of fatty aldehyde and production of fatty alcohols.
[00335] The plasmid pETDuet-l-'tesA-yjgB carrying 'tesA and yjgB (a putative alcohol dehydrogenase; GenBank accession number, NP 418690; GenPept accession number
AAC77226) from the E. coli K12 strain was prepared.
[00336] The gene yjgB (GenBank accession number, NP_418690) insert was amplified using PCR from the genomic DNA of E. coli K-12 using the following primers.
Ncol YjgB forward:
aatccTGGCATCGATGATAAAAAGCTATGCCGCAAAAG (SEQ ID NO:221)
Hindlll YjgB reverse:
ataaaagctTTCAAAAATCGGCTTTCAACACCACGCGG (SEQ ID NO:222)
[00337] The PCR product was then subcloned into the Ncol and Hindlll sites of pETDuet-1-
'tesA to generate pETDuet-l-'tesA-yjgB.
[00338] In order to evaluate the affect of carboxylic acid reductases and alcohol
dehydrogenases on the production of fatty alcohols, various combinations of the prepared plasmids were transformed in the E. coli strain C41 (DE3, AfadE) (described in
PCT/US08/058788).
[00339] The plasmid pACYCDuet-l-carB, encoding the CAR homolog carB, was co- transformed with pETDuet-l-'tesA. In addition, pACYCDuet-l-carB was also separately co- transformed with pETDuet-l-'tesA-yjgB. As a control, pACYCDuet-l -carB was co-transformed with the empty vector pETDuet-1 (see, e.g., FIG. 28).
[00340] The plasmid pACYCDuet-l-fadD9, encoding the CAR homolog fadD9, was co- transformed with pETDuet-l-'tesA. In addition, pACYCDuet-1- fadD9 was also separately co- transformed with pETDuet-l-'tesA-yjgB. As a control, pACYCDuet-1- fadD9was co- transformed with the empty vector pETDuet-1 (see, e.g., FIG. 28).
[00341] As an additional control, pETDuet-l-'tesA-yjgB was co-transformed with the empty vector pACYCDuet-1.
[00342] The E. coli transformants were grown in 3 mL of LB medium supplemented with carbenicillin (100 mg/L) and chloramphenicol (34 mg/L) at 37 °C. After overnight growth, 15 of culture was transferred into 2 mL of fresh LB medium supplemented with carbenicillin and chloramphenicol. After 3.5 hours of growth, 2 mL of culture were transferred into a 125 mL flask containing 20 mL of M9 medium with 2% glucose and with carbenicillin and
chloramphenicol. When the OD6oo of the culture reached 0.9, 1 mM of IPTG was added to each flask. After 20 hours of growth at 37 °C, 20 mL of ethyl acetate (with 1% of acetic acid, v/v) was added to each flask to extract the fatty alcohols produced during the fermentation. The crude ethyl acetate extract was directly analyzed with GC/MS as described herein.
[00343] The measured retention times were 6.79 minutes for cis-5-dodecen-l-ol, 6.868 minutes for 1-dodecanol, 8.058 minutes for cis-7-tetradecen-l-ol, 8.19 minutes for 1- tetradecanol, 9.208 minutes for cis-9-hexadecen-l-ol, 9.30 minutes for 1-hexadecanol, and 10.209 minutes for cis-1 1-octadecen-l-ol.
[00344] As can be concluded from this example, the production of fatty alcohols from fatty aldehydes in the E. coli strains described above may have been catalyzed by more than one endogenous fatty alcohol biosynthetic polypeptides. On the other hand, it has been demonstrated that overexpression of YjgB with CarB and leaded ess TesA significantly reduced the accumulation of fatty aldehydes, as compared to control strains that did not overexpress YjgB. But it was also noted that overexpression of YjgB appeared to reduce the overall fatty alcohol production.
EXAMPLE 3
[00345] This example describes the identification of other fatty alcohol biosynthetic polypeptides in E.coli.
[00346] A reverse genetic approach was used to identify potential fatty alcohol biosynthetic genes in E. coli MG1655 cells by expressing the acyl-ACP reductase YP_40061 1 from
Synechococcus elongatus (Synpcc7942_1594) (SEQ ID NO: 137). Four 3 mL LB cultures were grown overnight at 37 °C, and 55
of stationary phase cultures were used to inoculate four independent 5.5 mL of LB. Those 5.5 mL cultures were then grown to an OD6oo of 0.8-1.0 and were then used to inoculate a corresponding number of 2 L baffled shakefiasks, each with 500 mL Hu-9 minimal media. 20 hrs after induction the cells were pelleted at 4,000 x g for 20 min. The cell pellet was resuspended in 30 mL of 100 mM phosphate buffer at pH 7.2 with lx Bacterial Protease Arrest (G Biosciences). The cells were lysed in a French press at 15,000 psi with two passes through the instrument. The cell debris was then removed by centrifuging at 10,000 x g for 20 mins. The cell lysate was loaded onto two HiTrapQ columns (GE Healthcare) connected in series. The following buffers were used to elute proteins: (A) 50 mM Tris, pH 7.5 and (B) 50 mM Tris, pH 7.5 with 1 M NaCl. A gradient from 0 % B to 100% B was ran over 5 column volumes at a flow rate of 3 mL/min while 4 mL fractions were collected.
[00347] The fractions were assayed for alcohol biosynthetic enzymatic {e.g., aldehyde reductase/alcohol dehydrogenase) activity by taking 190 μΤ of a protein fraction and adding 5 μΐ of a 20 mM NADPH (Sigma) solution and 5 i of a 20 mM dodecanal (Fluka) solution in DMSO. The reactions were incubated at 37 °C for 1 hr. They were then extracted with 100 of ethyl acetate and analyzed for dodecanol via GC/MS. Fractions eluting around 350 mM NaCl contained a fatty alcohol biosynthetic enzyme activity.
[00348] Fractions containing fatty alcohol biosynthetic enzyme activity were pooled and loaded onto a 1 mL ResourceQ column (GE Healthcare). The same conditions used for the HiTrapQ column were used, except 0.5 mL fractions were collected. Protein fractions demonstrating a capacity of converting fatty aldehydes to fatty alcohols were then pooled and concentrated using Ami con (Milipore) protein concentrators (10,000 kDa cutoffs) to a volume of 1 mL. The solution was then loaded onto a HiPrep 200 size exclusion column (GE Healthcare). A buffer solution containing 50 mM Tris, pH 7.5, and 150 mM NaCl was run through the column at a rate of 0.3 mL per min. 2 mL fractions were collected. Two protein fractions were identified as having fatty alcohol biosynthetic enzyme activity. These two fractions, plus
fractions before and after these two fractions, were loaded onto a polyacrylamide gel and stained with SimplySafe Commassie stain (Invitrogen).
[00349] Comparing the bands in the active and inactive fractions, one protein band, which appeared in the active fraction, was not seen in the inactive fraction. This protein band was cut from gel and submitted to the Stanford Mass Spectroscopy Facility for LC/MS/MS protein sequencing. One of the proteins identified in this analysis was YahK. E.coli YahK was determined to be the closest paralogs of YjgB, with about 31% sequence identity to the latter.
EXAMPLE 4
[00350] This example describes the verification of YjgB and YahK as fatty alcohol biosynthetic polypeptides.
Construction offadD deletion strain
[00351] The fadD gene of E. coli MG1655 was deleted using the lambda red system
{Datsenko et al., Proc. Natl. Acad. Sci. USA. 97: 6640-6645 (2000)) as follows:
[00352] The chloramphenicol acetyltransferase gene from pKD3 was amplified with the primers fadl : (5 ' -TA ACCGGCGTCTGACGACTGACTTAACGCTC AGGCTTTATT
GTCCACTTTGTGTAGGCTGGAGCTGCTTCG-3") (SEQ ID NO:223), and fad2: (5'-
CATTTGGGGTTGCGATGACGACGAACACGCATTTTAGAGGTGAAGAATTGCATATG
AATATCCTCCTTTAGTTCC-3 *) (SEQ ID NO:224).
[00353] This PCR product was electroporated into E. coli MG1655 (pKD46). The cells were plated on L-chloramphenicol (30 μg/mL)(L-Cm) and grown overnight at 37 °C. Individual colonies were picked on to another L-Cm plate and grown at 42 °C. These colonies were then patched to L-Cm and L- carbenicillin (100 mg/mL) (L-Cb) plates and grown at 37 °C overnight. Colonies that were CmK and Cb^ were evaluated further by PCR to ensure the PCR product inserted at the correct site.
[00354] PCR verification was performed on colony lysates of these bacteria using the primers fadF (5'- ΰΟΤ^ΟΤΟΟΤΑΑΤΟΑΤΤΤΟΟ^) (SEQ ID NO:225) and fadR (5'- TCGCAACCTTTTCGTTGG-3 ') (SEQ ID NO:226). Expected size of the AfadDwCm deletion was about 1200 bp. The chloramphenicol resistance gene was eliminated using a FLP helper plasmid as described in Datsenko et al, Proc. Natl. Acad. Sci. USA 97:6640-6645 (2000). PCR
verification of the deletion was performed with primers fadF and fadR. The MG1655 AfadD strain was unable to grow on M9 + oleate agar plates (oleate as carbon source). It was also unable to grow in M9 + oleate liquid media. The growth defect was complemented by an E. coli fadD gene supplied in trans (in pCL1920-Ptrc).
Construction ofMG1655(DE3, AfadD) strain
[00355] To generate a T7 -responsive strain, the λϋΕ3 Lysogenization Kit (Novagen) was utilized, which is designed for site-specific integration of DE3 prophage into an E. coli host chromosome, such that the lysogenized host can be used to express target genes cloned in T7 expression vectors. DE3 is a recombinant phage carrying the cloned gene for T7 RNA polymerase under lacUVS control. Briefly, the host strain was cultured in LB supplemented with 0.2% maltose, 10 mM MgS04, and antibiotics at 37 °C to an OD600 of 0.5. Next, 108 pfu λϋΕ3, 10 pfu Helper Phage, and 10 pfu Selection Phage were incubated with 10 μΐ host cells. The host/phage mixture was incubated at 37 °C for 20 min to allow phage to adsorb to host. Finally, the mixture was pipeted onto an LB plate supplemented with antibiotics. The mixture was spread evenly using plating beads, and the plates were inverted plates and incubated at 37 °C overnight.
[00356] λϋΕ3 lysogen candidates were evaluated by their ability to support the growth of the T7 Tester Phage. T7 Tester Phage is a T7 phage deletion mutant that is completely defective unless active T7 RNA polymerase is provided by the host cell. The T7 Tester Phage makes very large plaques on authentic λϋΕ3 lysogens in the presence of IPTG, while much smaller plaques are observed in the absence of inducer. The relative size of the plaques in the absence of IPTG is an indication of the basal level expression of T7 RNA polymerase in the lysogen, and can vary widely between different host cell backgrounds.
[00357] The following procedure was used to determine the presence of DE3 lysogeny. First, candidate colonies were grown in LB supplemented with 0.2% maltose, 10 mM MgS0 , and antibiotics at 37 °C to an OD6oo of 0.5. An aliquot of T7 Tester Phage was then diluted in IX Phage Dilution Buffer to a titer of 2 x 10 pfu/mL. In duplicate tubes, 100 host cells were mixed with 100 \\L diluted phage. The host/phage mixture was incubated at room temperature for 10 min to allow phage to adsorb to host. Next, 3 raL of molten top agarose was added to each tube containing host and phage. The contents of one duplicate were plated onto an LB plate and the other duplicate onto an LB plate supplemented with 0.4 mM IPTG (isopropyl-b-
thiogalactopyranoside) to evaluate induction of T7 RNA polymerase. Plates were allowed to sit undisturbed for 5 min until the top agarose hardened. The plates were then inverted at 30 °C overnight.
Construction ofMG1655(DE3, AfadD, yjgB::kan) strain
[00358] The yjgB knockout strain, MG1655(DE3, AfadD, yjgB .kan), was constructed by using the following lambda red system {Datsenko et al , Proc. Natl. Acad. Sci. USA 97:6640- 6645 (2000)):
[00359] The kanamycin resistant gene from pKDl 3 was amplified with the primers yjgBRn: (5 ' -GCGCCTC AGATC AGCGCTGCGAATGATTTTCA AA AATCGGCTTTC AACACTG TAGGCTGGAGCTGCTTCG-3,) (SEQ ID NO:227), and yjgBFn: (S -CTGCCATGCTCTA CACTTCCCAAACAACACCAGAGAAGGACCAAAAAATGATTCCGGGGATCCGTCGAC
C-3') (SEQ ID NO:228). The PCR product was then electroporated into E. coli MG1655 (DE3, AfadD)/pKD46. The cells were plated on kanamycin (50 μg/mL) (L-Kan) and grown overnight at 37 °C. Individual colonies were picked on to another L-Kan plate and grown at 42 °C. These colonies were then patched to L-Kan and carbenicillin (100 mg/mL) (L-Cb) plates and grown at 37 °C overnight. Colonies that were kanK and Cb"5 were evaluated further by PCR to ensure the PCR product was inserted at the correct site.
[00360] PCR verification was performed on colony lysates of these bacteria using the primers BF ( - gtgrtggcgataCGACAAAACA-3 ') (SEQ ID NO:229) and BR (5Λ- CCCCGCCCTGCCATGCTCTACAC-3^) (SEQ ID NO:230). The expected size of the yjgBy.kan knockout was about 1450 bp.
[00361] In Example 2, a fadE deletion strain was used for fatty aldehyde and fatty alcohol production from 'TesA, CAR homologs, and endogenous YjgB in E. coli. Here, to demonstrate that CAR homologs used fatty acids instead of acyl-CoA as a substrate, the gene encoding for acyl-CoA synthase in E. coli (fadD) was deleted so that the fatty acids produced were not activated with CoA. E. coli strain MG1655(DE3, AfadD) was transformed with pETDuet-1 - 'tesA and pACYCDuet-l -carB . The transformants were evaluated for fatty alcohol production using the methods described herein. These transformants produced about 360 mg/L of fatty alcohols (dodecanol, dodecenol, tetredecanol, tetredecenol, cetyl, hexadecenol, and octadecenol). Confirming YjgB as a fatty alcohol bio synthetic polypeptide
[00362] To confirm that YjgB was an alcohol dehydrogenase responsible for converting fatty aldehydes into their corresponding fatty alcohols, pETDuet-l -'tesA and pACYCDuet-l -fadD9 were co-transformed into either MG1655(DE3, AfadD) or MG1655(DE3, AfadD, yjgB: ±m). At the same time, MG1655(DE3, AfadD, yjgB ::kan) was transformed with both pETDuet-l -'tesA- yjgB and pACYCDuet-l -fadD9.
[00363] The E. coli transformants were grown in 3 mL of LB medium supplemented with carbenicillin (100 mg/L) and chloramphenicol (34 mg/L) at 37 °C. After overnight growth, 15 of culture was transferred into 2 mL of fresh LB medium supplemented with carbenicillin and chloramphenicol. After 3.5 hrs of growth, 2 mL of culture was transferred into a 125 mL flask containing 20 mL of M9 medium with 2% glucose, carbenicillin, and chloramphenicol. When the OD6oo of the culture reached 0.9, 1 mM of IPTG was added to each flask. After 20 hrs of growth at 37 °C, 20 mL of ethyl acetate (with 1 % of acetic acid, v/v) was added to each flask to extract the fatty alcohols produced during the fermentation. The crude ethyl acetate extract was directly analyzed with GC/MS as described herein.
[00364] The yjgB knockout strain resulted in significant accumulation of dodecanal and a lower fatty alcohol titer (FIG. 29). The expression oiyjgB from plasmid pETDuet-l-'tesA-yjgB in the yjgB knockout strain effectively removed the accumulation of dodecanal (FIG. 29).
Dodecanal accumulated in the yjgB knockout strain, but it was not observed in either the wild- type strain (MG1655(DE3, AfadD)) or the yjgB knockout strain with theyjgB expression plasmid. The arrows in FIG. 29 indicate the GC trace of dodecanal (C12:0 aldehyde).
[00365] This data confirms that YjgB was involved in converting dodecanal into dodecanol, although there may be other alcohol dehydrogenase(s) present in E. coli to convert other aldehydes into alcohols.
Confirming YahK as a fatty alcohol bio synthetic polypeptide
[00366] To verify that YahK was indeed an alcohol dehydrogenase, yahK was knocked out in E. coli MG1655(DE3, AfadD, AyjgB) (control strain). The yahK knock-out strain MG1655(DE3, AfadD, Ayjg,B AyahK) was constructed with the lambda red system (Datsenko et al, supra) using the following primers:
yahKJF: (CATATCAGGCGTTGCCAAATACACATAGCTAATCAGGAGTAAACACAATG) (SEQ ID NO:231); and yahK_R: (AATCGCACACTAACAGACTGAAAAAATTAATA AATACCCTGTGGTTTAAC) (SEQ ID NO:232).
[00367] This AyahK strain and the control strain, both expressing the acyl-ACP reductase YP 40061 1, were cultured under conditions described above. Cell free lysates were made from both strains, and each lysate was assayed for fatty alcohol biosynthetic activity as discussed above.
[00368] The AyahK strain did not convert dodecanal to dodecanol, while the wild type strain had this activity. For additional verification, each lysate was run on a HiTrapQ column as described above. The wild type lysate appeared to have fatty alcohol biosynthetic activity in fractions eluting around 350 mM NaCl, while the AyahK lysate appeared to have no fatty alcohol biosynthetic activity in this region.
EXAMPLE 5
[00369] This example describes the identification of further fatty alcohol biosynthetic polypeptides in E.coli.
Bioinformatics
[00370] It was reasoned that potential fatty alcohol biosynthetic polypeptides in E. coli were most likely members of of the following four protein families: Zn-dependent alcohol dehydrogenases (Pfam 00107 and 08240), Fe-dependent alcohol dehydrogenases (Pfam 00465), aldo-keto reductases (Pfam 00248) and short-chain dehydrogenases (Pfam 00106) (Pfam = protein family according to "pfam.sanger.ac.uk"). Further protein families that were likely to include potential alcohol biosynthetic polypeptides in E.coli may include, for example, the dehydroquinone synthase family (Pfam 01761), the phospho gluconate dehydrogenase family (Pfam 03446), the hydroxyacid dehydrogenase family (Pfam 02826, Pfam 00389), the aldehyde dehydrogenase family (Pfam 00171), the glutamyl-tRNA reductase family (Pfam 01488, Pfam 08501), the GFO/IDH/MOCA family (Pfam 01408, Pfam 02894), the mannitol dehydrogenase family (Pfam 01232, Pfam 08125), the IMP dehydrogenase family (Pfam 00478), the
oxidoreductase family (Pfam 10722), the epimerase family (Pfam 001370), the alcohol oxidase family (Pfam 00732, Pfam 05199), the PQQ dehydrogenase family (Pfam 0101 1), the xanthine dehydrogenase family (Pfam 00941), the FAD/NAD(P) -binding oxidoreductase family (Pfam
01266), the flavin/NADH-binding oxidoreductase family (Pfam 01613), the FAD-linked oxidoreductase family (Pfam 01565, Pfam 02913), the ferredoxin reductase family (Pfam 00175, Pfam 00970, Pfam 001 1 1), the anaerobic dehydrogenase family (Pfam 00384, Pfam 01568), the molybdenum-binding oxidoreductase family (Pfam 01315, Pfam 02738), the DMSO reductase family (Pfam 02976), the nitroreductase family (Pfam 00881), the FeS-binding oxidoreductase family (Pfam 00037, Pfam 07992), another oxidoreductase family (Pfam 00037, Pfam 01558, Pfam 01855, Pfam 02775, Pfam 10371 ), the Fe-S oxidoreductase family (Pfam 04055), the NADH-ubiquinone oxidoreductase family (Pfam 02508), the NAD(P)H:quinine oxidoreductase family (Pfam 05368), the NADH:ubiquinone oxidoreductase family (Pfam 01512, Pfam 10531 , Pfam 10589), the glutathione reductase family (Pfam 02852, Pfam 07992), or a number of other predicted oxidoreductase families including, for example those within Pfam 03006, Pfam 03960, Pfam 00070. The potential families from which a fatty alcohol biosynthetic polypeptide of E.coli can be isolated are listed in Table 10 below.
Table 10: Protein families that may contain additional Fatty Alcohol Biosynthetic
Polypeptides
[00371] The following 8 candidates were chosen for initial experimental analysis: yahK, yjgB, adhP, dkgA, dkgB, yhdH, ydjL, and yqhD (Table \ 2).
[00372] To determine if these genes could reduce fatty aldehydes to fatty alcohols, these 8 genes were cloned into a pET-Duet vector along with E. coli 'tesA. These genes were then transformed into E. coli (DE3) MG1655 AyjgBAyahK cells. Next 3 mL overnight starter cultures were grown in LB with carbanecillin (100 mg/L) at 37 °C. A control strain lacking a candidate alcohol dehydrogenase was also included in the experiment. 1 mL of each overnight culture was used to inoculate 50 mL of fresh LB with carbanecillin. The cultures were shaken at 37 °C until reaching an OD6oo of 0.8-1. The cultures were then transferred to 18 °C, induced with 1 mM IPTG, and shaken overnight.
[00373] Cell free lysates were prepared by centrifuging the cultures at 4,000 x g for 20 mins. The cultures were then resuspended in 1 mL of Bugbuster (Novagen) and gently shaken at room temperature for 5 min. The cell debris was removed by spinning at 15,000 x g for 10 min. The resulting lysates were assayed for alcohol dehydrogenase activity by mixing 88 μΕ of lysate, 2 of 40 mM cis-11-hexadecenal in DMSO, and 10 of 20 mM NADPH. The samples were incubated at 37 °C for 30 min. and were then extracted with 100 μΐ, of ethyl acetate. The extracts were analyzed using GC/MS.
[00374] All proteins showed significantly better conversion of cis- 1 1 -hexadecenal to cis- 1 1 - hexadecanol as compared with the 'TesA only control (see Table 11). These results were confirmed in assays using dodecanal instead of cis-11-hexadecenal as the substrate (see Table 11).
[00375] To investigate how these enzymes contribute to fatty alcohol dehydrogenase activity in E. coli under production conditions, first the yjgB yahK double knock-out strain in
MG1655(DE3, AfadD) (described above) was tested by transforming it with a plasmid expressing acyl-ACP reductase YP_400611 and analyzing fatty aldehyde and fatty alcohol titers. The test strain also contained a plasmid expressing a decarbonylase. This double knock-out mutant showed slightly higher fatty aldehyde titers in several experiments (see, e.g., FIG. 30), confirming that these two putative alcohol dehydrogenases contribute to fatty alcohol dehydrogenase activity in E. coli under production conditions. Next, two additional genes, yncB and ydjA, were deleted in the yjgB yahK double mutant. YdjA, which is not a member of the four protein families mentioned above, demonstrated slightly elevated fatty aldehyde levels (see FIG. 30), indicating that it may also contribute to fatty alcohol dehydrogenase activity in E. coli under production conditions.
Overexpression of select fatty alcohol biosynthetic polypeptide candidates
[00376] Additionally, the active fatty alcohol dehydrogenases from Table 11 were also deleted in MG1655 (DE3, AfadD, Ayjg,B AyahK) and tested as described above. Several of these deletion strains showed slightly elevated fatty aldehyde levels, suggesting that these may also contribute to fatty alcohol dehydrogenase activity in E. coli under production conditions
FIG. 31).
Table 11: Overexpression of putative fatty alcohol dehydrogenase genes
EXAMPLE 6
[00377] This example describes an overexpression study of a more comprehensive set of putative fatty alcohol biosynthetic polypeptides in E.coli
[00378] A larger and more comprehensive set of putative fatty alcohol biosynthetic polypeptides were selected for an overexpression study to identify the members of various protein families that contribute to the reduction of fatty aldehydes to fatty alcohols in E.coli. Specifically, each of the fatty alcohol biosynthetic genes in Table 12 below were overexpressed and analyzed for fatty aldehyde conversion and/or fatty alcohol production.
Table 12: Putative Fatty Alcohol Biosynthetic Genes That Were Overexpressed (including members of the 4 families mentioned above, with the most likely candidates for fatty alcohol biosynthetic enes
pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam00106 pfam01761 pfam01761 pfam01761
[00379] ac gene was c one nto t e expression vector OP-80 (SEQ ID NO:233), which was digested with the restriction enzymes Ncol and EcoRI. The genes were amplified using PCR from E.coli MG1655 genomic DNA using the primers listed in Table 13.
Table 13: primers
qor _f TAAGGAGGAATAAACCATGGCAACACGAATTGAATTTCACAAGCACG (SEQ ID NO:268) qor _r CGGGCCCAAGCTTCGAATTTTATGGAATCAGCAGGCTGGAACCTTG (SEQ ID NO:269)
TAAGGAGGAATAAACCATGAAAAGCATATTAATTGAAAAACCGAATCAACTGGC (SEQ ID rspB _f NO:270)
CGGGCCCAAGCTTCGAATTTATTCAGAAAAAGTGAGTAAGACTTTGCAGCAATGTTTTTG
rspB r (SEQ ID NO:271)
srlD f TAAGGAGGAATAAACCATGAATCAGGTTGCCGTTGTCATCGG (SEQ ID NO:272) srlD r CGGGCCCAAGCTTCGAATTTCAGAACATCACCTGACCGCCG (SEQ ID NO:273)
tdh f TAAGGAGGAATAAACCATGAAAGCGTTATCCAAACTGAAAGCGGAAG (SEQ ID NO:274) tdh r CGGGCCCAAGCTTCGAATTTTAATCCCAGCTCAGAATAACTTTCCCGGAC (SEQ ID NO:275) ucpA_f TAAGGAGGAATAAACCATGGGTAAACTCACGGGCAAGACAG (SEQ ID NO:276) ucpA_r CGGGCCCAAGCTTCGAATTTCAGATACCGACGCTAACCGTCTCC (SEQ ID NO:277) yahK_f TAAGGAGGAATAAACCATGAAGATCAAAGCTGTTGGTGCATATTCCG (SEQ ID NO:278)
CGGGCCCAAGCTTCGAATTTCAGTCTGTTAGTGTGCGATTATCGATAACAAAACG (SEQ ID yahK r NO:279)
yajO_f TAAGGAGGAATAAACCATGCAATACAACCCCTTAGGAAAAACCGAC (SEQ ID NO:280)
CGGGCCCAAGCTTCGAATTTTATTTAAATCCTACGACAGGATGCGGTTTATACGG (SEQ ID yajO_r NO:281)
ybbO_f TAAGGAGGAATAAACCATGACTCATAAAGCAACGGAGATCCTGACAG (SEQ ID NO:282) ybbO r CGGGCCCAAGCTTCGAATTTCACCCCTGCAATATTTTGTCCATCACG (SEQ ID NO:283) ybdH_f TAAGGAGGAATAAACCATGCCTCACAATCCTATCCGCGTG (SEQ ID NO:284)
CGGGCCCAAGCTTCGAATTTCAGGCTTTAAACGATTCCACTTTTTTGAACGC (SEQ ID ybdH__r NO:285)
ybdR f TAAGGAGGAATAAACCATGAAAGCATTGACTTATCACGGCCCAC (SEQ ID NO:286) ybdR_r CGGGCCCAAGCTTCGAATTTCATATTGTTCCCCCCGGCATCG (SEQ ID NO:287)
TAAGGAGGAATAAACCATGCATTACCAGCCAAAACAAGATTTACTCAATGATC (SEQ ID yciK_f NO:288)
yciK_r CGGGCCCAAGCTTCGAATTTCATTGGGAAATTCCTGGTTTACGGCC (SEQ ID NO:289) ycjQ„f TAAGGAGGAATAAACCATGAAAAAGTTAGTAGCCACAGCACCGC (SEQ ID NO:290)
CGGGCCCAAGCTTCGAATT TAAAACGTAACGCCCATTTTGATGCTCTGTTC (SEQ ID ycjQ r NO:291)
TAAGGAGGAATAAACCATGAGCAGCAATACATTTACTCTCGGTACAAAATC (SEQ ID ydt>C_f NO:292)
CGGGCCCAAGCTTCGAATTTTATTCTCGCGAAATACCATCCAACGTAGACAAC (SEQ ID ydbC_r NO:293)
ydfij_f TAAGGAGGAATAAACCATGATCGTTTTAGTAACTGGAGCAACGGCAG (SEQ ID NO:294) ydfG^r CGGGCCCAAGCTTCGAATTTTACTGACGGTGGACATTCAGTCCG (SEQ ID NO:295) ydhF f TAAGGAGGAATAAACCATGGTTCAGCGTATTACTATTGCGCCG (SEQ ID NO:296) ydhF_r CGGGCCCAAGCTTCGAATTTTACGGTACGTCGTACCCCAGTG (SEQ ID NO:297)
TAAGGAGGAATAAACCATGAAAAAGATACCTTTAGGCACAACGGATATTACGC (SEQ ID ydjG_f NO:298)
ydjG_r CGGGCCCAAGCTTCGAATTTTAACGCTCCAGGGCCTCTGC (SEQ ID NO:299)
ydjJ f TAAGGAGGAATAAACCATGAAAAATTCAAAAGCAATATTGCAGGTGCCG (SEQ ID NO:300)
CGGGCCCAAGCTTCGAATTAATCGCTAATTTTAATAACGCCTTTAATAATGTCGCGTTTG
ydjJ r (SEQ ID NO:301 )
ydjL„f TAAGGAGGAATAAACCATGAAAGCACTGGCTCGGTTTGGC (SEQ ID NO:302)
CGGGCCCAAGCTTCGAATTTTATTCATCAAAGTCGTAAGTCATGATCACTTTGATTGCG (SEQ ydjL r ID NO:303)
TAAGGAGGAATAAACCATGCAACAAAAAATGATTCAATTTAGTGGCGATGTCTC (SEQ ID yeaE_f NO:304)
yeaEjr CGGGCCCAAGCTTCGAATTTCACACCATATCCAGCGCAGTTTTTCC (SEQ ID NO:305) ygcW_f TAAGGAGGAATAAACCATGTCAATCGAATCTCTCAATGCGTTCTCAATG (SEQ ID NO:306) ygcW r CGGGCCCAAGCTTCGAATTTTAGCGCACTAAATAACCGCCATCAACC (SEQ ID NO:307) ygff_f TAAGGAGGAATAAACCATGGCTATAGCACTTGTGACTGGTGG (SEQ ID NO:308)
ygfF_r CGGGCCCAAGCTTCGAATTTTATTTCCCGCCCGCCAAATCG (SEQ ID NO:309)
yggP_f TAAGGAGGAATAAACCATGAAAACCAAAGTTGCTGCTATTTATGGCAAGC (SEQ ID NO:310) yggPjr CGGGCCCAAGCTTCGAATTTCATTGCGCGGCCTCCC (SEQ ID NO:31 1)
yghA f TAAGGAGGAATAAACCATGTCTCATTTAAAAGACCCGACCACGCAG (SEQ ID NO:312) yghA r CGGGCCCAAGCTTCGAATTTTAACCTAAATGCTCGCCGCCG (SEQ ID NO:313)
yghz__f TAAGGAGGAATAAACCATGGTCTGGTTAGCGAATCCCGAAC (SEQ ID NO:314)
yghZ_r CGGGCCCAAGCTTCGAATTTCATTTATCGGAAGACGCCTGCCAC (SEQ ID NO:315) yhdH f TAAGGAGGAATAAACCATGCAGGCGTTACTTTTAGAACAGCAGG(SEQ ID NO:316) yhdH_r CGGGCCCAAGCTTCGAATTTTAGTTAACCTTCACCAGCGTGCGAC(SEQ ID NO:317) yiaY_f TAAGGAGGAATAAACCATGGCAGCTTCAACGTTCTTTATTCCTTCTG (SEQ ID NO:318) yiaY r CGGGCCCAAGCTTCGAATTTTACATCGCTGCGCGATAAATCGCC (SEQ ID NO:319)
yjgB„f TAAGGAGGAATAAACCATGTCGATGATAAAAAGCTATGCCGCAAAAGAAG (SEQ ID NO:320) yj B^r CGGGCCCAAGCTTCGAATTTCAAAAATCGGCTTTCAACACCACGC (SEQ ID NO:321) yjgi_f TAAGGAGGAATAAACCATGGGCGCTTTTACAGGTAAGACAGTTC (SEQ ID NO:322) yjgU" CGGGCCCAAGCTTCGAATTTTATGCGCCAAACGCGCCATC (SEQ ID NO:323)
yjjN^f TAAGGAGGAATAAACCATGTCTACGATGAATGTTTTAATTTGCCAGCAGC (SEQ ID NO:324)
CGGGCCCAAGCTTCGAATTTCAGAAAGTAATTACGCCTTTAATTAACTCACGATTGTTAA
yjj r (SEQ ID NO:325)
yncB f TAAGGAGGAATAAACCATGGGGCAACAAAAGCAGCGTAATC (SEQ ID NO:326)
yncB r CGGGCCCAAGCTTCGAATTTTAATCATCACCCGCCACGCG (SEQ ID NO:327)
yohF f TAAGGAGGAATAAACCATGGCACAGGTTGCGATTATTACCGC (SEQ ID NO:328)
yohF r CGGGCCCAAGCTTCGAATTCTATTCTGGGTTGAACTGTGGATTCGCC (SEQ ID NO:329) tas f TAAGGAGGAATAAACCATGCAATATCACCGTATACCCCACAGTTCG (SEQ ID NO:330)
CGGGCCCAAGCTTCGAATTTTATGGTGCCGGATAAGTATAAACCTGATGCAC (SEQ ID tas r NO:331)
hcaB f TAAGGAGGAATAAACCATGAGCGATCTGCATAACGAGTCCATTTTTATTAC (SEQ ID NO:332) hcaB r CGGGCCCAAGCTTCGAATTTTAAAGATCCAGCCCAGCCGCTAC (SEQ ID NO:333) fabG f TAAGGAGGAATAAACCATGAATTTTGAAGGAAAAATCGCACTGGTAACCG (SEQ ID NO:334) fabG r CGGGCCCAAGCTTCGAATTTCAGACCATGTACATCCCGCCG (SEQ ID NO:335)
yphC_f TAAGGAGGAATAAACCATGAAAACGATGCTGGCAGCTTATTTACCAG (SEQ ID NO:336) yphC_r CGGGCCCAAGCTTCGAATTTTAATCCGGGAAGTTAATCACAACTTTCCCGC (SEQ ID NO:337) yqhD_f TAAGGAGGAATAAACCATGAACAACTTTAATCTGCACACCCCAACC (SEQ ID NO:338) yqhD_r CGGGCCCAAGCTTCGAATTTTAGCGGGCGGCTTCGTATATACG (SEQ ID NO:339)
[00380] Each primer was designed to contain 15 bases of overlap with the expression vector, enabling restrictionless cloning using the InFusion cloning kit (Clontech). Excess nucleotides and primers were removed from the PCR products using the ZR-96 DCC kit (Zymo Research). After ligation of the PCR products into the linearized OP-80, the resulting DNA was transformed into NEB Turbo competent cells (New England Biolabs, Inc. Ipswich, MA), and plated onto LB agar medium supplemented with 100 μ τηΐ. spectinomycin and 1 % (w/v) glucose. Plasmid clones containing the appropriate inserts were identified using PCR, verified by sequencing and mini-prepped.
[00381] The sequence verified plasmids were transformed into the expression strain, E.coli (DE3) AyjgB AyahK AydhD AdkgA, and plated onto LB agar medium supplemented with 100 μg/mL spectinomycin and 1 % (w/v) glucose. Individual colonies were picked and grown overnight at 37°C in LB liquid medium supplemented with 100 μg/mL spectinomycin and 1 % (w/v) glucose. The culture was then diluted 1 : 1000 into fresh LB with 100 μg/mL
spectinomycin and 1 % (w/v) glucose and grown in a shaker for 5-6 hours at 37°C. The culture was then induced with 1 mM isopropyl β-D-l -thiogalactopyranoside (IPTG) and grown in a shaker for 18 hours at 18°C.
[00382] The cells were subsequently harvested by centrifugation for 10 minutes at 4,500 rpm. The supernatant was discarded and the cells were resuspended in 2.5 mL BugBuster lysis reagent (Novagen). The cell suspensions were placed on a vertical rotator for 45 minutes at 4°C to lyse the cells. Cell debris were removed by centrifugation for 10 minutes at 4,500 rpm, and the clarified lysates were used for activity assays.
[00383] Each sample was evaluated in vitro to determine its ability to convert dodecanal or 1 1 -cis-hexadecenal into dodecanol or 1 1 -cis-hexadecenol, respectively, using the cell lysates as described above. The negative control consisted of a lysate prepared from cells transformed with an empty OP-80 expression vector.
[00384] Each reaction contained 5-40 ih of cell lysate, 20 μΐ, 20 mM dodecanal or 1 1 -cis- hexadecenal, 10 μΐ, 20 mM NADH or NADPH, and sufficient dilution buffer (100 mM sodium phosphate, pH 7.0, 0.25% (v/v) Triton X-100) to bring the total volume to 400 μΐ,. The mixture was incubated for 2 hours at 37°C with constant shaking at 250 rpm.
[00385] To prepare samples for analysis, 40 μL 1 M HC1 and 400 μL butyl acetate was added. Tetracosane (a C24 alkane) was added as an internal standard (at 500 mg/L). The mixture was shaken for 15 minutes at 2,000 rpm, then centrifuged at 4,500 rpm for 10 minutes at 20°C.
[00386] A 50 μL sample of the organic phase was derivatized with BSTFA (Ν,Ο- bis(trimethylsilyl)trifluoroacetamide) and analyzed on a GC/FID equipped with a Trace UFC-1 column (Thermo Scientific). Samples were run using a split ratio of 1 :300 and a program consisting of an initial temperature of 140°C for 0.3 minute, a ramp up of 150°C/min to 300°C, then holding at a constant temperature of 300°C for 0.05 minutes.
[00387] The percentage of aldehyde substrate that had been converted to alcohol was calculated for each sample, and a paired t-test was used to identify candidates that had converted
the most aldehyde into alcohol as compared to the negative control, using a p value of less than or equal to about 0.05. The candidate that displayed statistically significant levels of fatty alcohol biosynthetic enzyme activity were identified and listed below in Table 14.
TABLE 14: Fatty alcohol biosynthetic polypeptides identified using various substrates
yncB
Claims
1. A microorganism engineered to produce a fatty acid derivative, said microorganism
comprising, polynucleotide sequences encoding:
(a) a thioesterase (EC 3.1.1.5);
(b) a fatty aldehyde biosynthetic polypeptide; and
(c) a fatty alcohol biosynthetic polypeptide,
wherein expression of said polypeptides is modified relative to the corresponding wild type polypeptide, and said microorganism produces an increased titer of the fatty acid derivative relative to a wild type microorganism.
2. The engineered microorganism according to claim 1 , wherein said fatty aldehyde
biosynthetic polypeptide has at least 90% sequence identity to the amino acid sequence presented as SEQ ID NO: 41, 43, 45, 47, 49, 51 , 53, 55, 57, 59, 61 , 63, 65, 69, 71, 73, 75, 77, 79, 81 , 83, 85, 87, 89, 91 , 93, 97, 99, 101, 103, 105, 107, 109, 111, 113, 1 15, 1 17, 1 19, 121 , 123, 125, or 127.
3. The engineered microorganism according to claim 1, wherein said fatty aldehyde
biosynthetic polypeptide comprises an amino acid sequence motif with a sequence presented as (1) SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO: 131, and SEQ ID NO:132; (2) SEQ ID NO:133; SEQ ID NO:134; SEQ ID NO:135; SEQ ID NO: 136; or (3) SEQ ID NO:129, SEQ ID NO:131 , SEQ ID NO: 132 or SEQ ID NO: 133.
4. The engineered microorganism according to claim 1 , wherein said fatty aldehyde
biosynthetic polypeptide is encoded by a polynucleotide having at least 90% sequence identity to the nucleotide sequence presented as SEQ ID NO: 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 112, 1 14, 1 16, 118, 120, 122, 124, 126, or 128.
5. A microorganism engineered to produce a fatty acid derivative, said microorganism
comprising, polynucleotide sequences encoding:
(a) a fatty aldehyde biosynthetic polypeptide; and
(b) a fatty alcohol biosynthetic polypeptide,
wherein expression of said polypeptides is modified relative to the corresponding wild type polypeptides, and said microorganism produces an increased titer of the fatty acid derivative relative to a wild type microorganism.
6. The engineered microorganism according to claim 5, wherein said fatty aldehyde
biosynthetic polypeptide is an acyl-ACP reductase.
7. The engineered microorganism according to claim 6, wherein the acyl-ACP reductase has an amino acid sequence having at least 90% sequence identity to a sequence presented as SEQ ID NO: 137, 139, 141 , 143, 145, 147, 149, 151, or 153.
8. The engineered microorganism according to claim 6, wherein the acyl-ACP reductase
polypeptide comprises an amino acid motif presented as SEQ ID NO: 155, 156, 157, 158, 159, 160, 161 ,162, 163, 164, or 165.
9. The engineered microorganism according to claim 6, wherein the acyl-ACP reductase
polypeptide is encoded by a polynucleotide having at least 90% sequence identity to a sequence presented as SEQ ID NO: 138, 140, 142, 144, 146, 148, 150, 152, or 154.
10. A method of producing a fatty alcohol, the method comprising;
culturing an engineered microorganism according to any one of claims 1 to 9 in the presence of a carbon source, under conditions wherein said fatty alcohol is produced at a titer of at least 300mg/L.
11. The method according to claim 10, wherein the engineered microorganism is modified to express an attenuated level of an acyl-CoA synthase (EC 2.3.1.86).
12. The method according to any one of claims 1 to 11, wherein the fatty alcohol biosynthetic polypeptide is a fatty aldehyde reductase or alcohol dehydrogenase (EC 1.1.1.1) and the expression of polypeptide is increased relative to the corresponding wild type polypeptide.
13. The method according to claim 12, wherein the fatty alcohol biosynthetic polypeptide has at least 90% sequence identity to a polypeptide sequence selected from the group consisting of SEQ ID NO: l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, and 39.
14. The method according to claim 12, wherein fatty alcohol biosynthetic polypeptide is
encoded by a polynucleotide having at least 90% sequence identity to the nucleotide sequence presented as SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, or 40.
15. The method of claim 10, wherein the fatty alcohol is spontaneously secreted from the
microorganism, actively transported into the extracellular environment, or passively transported into the extracellular environment.
16. The method of claim 10, further comprising isolating the fatty alcohol from the culture.
17. The method of claim 10, wherein the fatty alcohol comprises a C6-Cig fatty alcohol.
18. The method of claim 17, wherein the fatty alcohol is a C6, C8, C10, Cn, Cn, C]4, Cj5, Ci6, C17, or C]8 fatty alcohol.
19. The method of claim 10, wherein the hydroxyl group is in the primary (d) position.
20. The method of claim 10, wherein the fatty alcohol is an unsaturated fatty alcohol.
21. The method of claim 20, wherein the unsaturated fatty alcohol is C10:l , C12:l , C14: l, C16:l, or C18:l .
22. The method of claim 20, wherein the fatty alcohol is unsaturated at the omega-7 position.
23. The method of claim 20, wherein the unsaturated fatty alcohol comprises a cis double bond.
24. The method of claim 10, wherein the fatty alcohol is a saturated fatty alcohol.
25. The method of any one of claims 10-24, wherein the microorganism is selected from the group consisting of a yeast cell, a fungus cell, a filamentous fungi cell, and a bacterial cell.
26. An engineered microorganism according to any one of claims 1 to 9, wherein the fatty
alcohol biosynthetic polypeptide is a fatty aldehyde reductase or alcohol dehydrogenase (EC 1.1.1.1) and the gene encoding said polypeptide is knocked-out.
27. The engineered microorganism according to claim 26, further comprising a polynucleotide sequence encoding a hydrocarbon biosynthetic polypeptide, having at least 90% sequence identity to the amino acid sequence of SEQ ID NO: 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, or 200.
28. The engineered microorganism according to claim 27, wherein the hydrocarbon
biosynthetic polypeptide has the amino acid sequence of SEQ ID NO: 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, or 200 with one or more amino acid substitutions, additions, deletions, or insertions.
29. The engineered microorganism according to claim 26, wherein the hydrocarbon
biosynthetic polypeptide has amino acid sequence having the amino acid motif sequences of (1) SEQ ID NO:202; (2) SEQ ID NO:203 or SEQ ID NO:204, or SEQ ID NO:205; (3) SEQ ID NO:206, and any one of SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205; or (4) SEQ ID NO:207 or SEQ ID NO:208, or SEQ ID NO:209, or SEQ ID NO:210; wherein the hydrocarbon biosynthetic polypeptide has decarbonylase activity.
30. A method of producing a hydrocarbon, the method comprising;
culturing an engineered microorganism according to any one of claims 27 to 29 in the presence of a carbon source, under conditions wherein said hydrocarbon is spontaneously secreted from the microorganism, actively transported into the extracellular environment, or passively transported into the extracellular environment.
31. The method of claim 30, wherein the hydrocarbon is secreted by the microorganism.
32. The method of claim 30, wherein the hydrocarbon is an alkane.
33. The method of claim 32, wherein the alkane comprises a Ci3-C2i alkane.
34. The method of claim 32, wherein the alkane is selected from the group consisting of
tridecane, methyltridecane, nonadecane, methylnonadecane, heptadecane,
methylheptadecane, pentadecane, and methylpentadecane.
35. The method of claim 30, further comprising culturing the microorganism in the presence of a saturated fatty acid derivative.
36. The method of claim 35, wherein the saturated fatty acid derivative is a Ci4-C22 saturated fatty acid derivative.
37. The method of claim 35, wherein the saturated fatty acid derivative is selected from the group consisting of 2-methylicosanal, icosanal, octadecanal, tetradecanal, 2- methyloctadecanal, stearaldehyde, palmitaldehyde, and their derivatives.
38. The method of claim 30, wherein the hydrocarbon is an alkene.
39. The method of claim 38, wherein the alkene comprises a Ci3-C22 alkene.
40. The method of claim 38, wherein the alkene is selected form the group consisting of
pentadecene, heptadecene, methylpentadecene, and methylheptadecene.
41. The method of claim 30, further comprising culturing the microorganism in the presence of an unsaturated fatty acid derivative.
42. The method of claim 41 , wherein the unsaturated fatty acid derivative is a Ci4-C22
unsaturated fatty acid derivative.
43. The method of claim 41, wherein the unsaturated fatty acid derivative is selected from the group consisting of octadecenal, hexadecenal, methylhexadecenal, and methyloctadecenal.
44. A hydrocarbon produced by any one of the methods of claims 30-43.
45. A biofuel comprising the hydrocarbon of claim 44.
46. The biofuel of claim 45, wherein the biofuel is a diesel, gasoline, or jet fuel.
47. The biofuel of claim 46, wherein the hydrocarbon has 513C of -15.4 or greater.
48. The biofuel of claim 47, wherein the hydrocarbon has a fivi14C of at least 1.003.
49. The method of claim 30, wherein the microorganism is selected from the group consisting of a yeast cell, a fungus cell, a filamentous fungi cell, and a bacterial cell.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US32187810P | 2010-04-08 | 2010-04-08 | |
US32187710P | 2010-04-08 | 2010-04-08 | |
US61/321,877 | 2010-04-08 | ||
US61/321,878 | 2010-04-08 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2011127409A2 true WO2011127409A2 (en) | 2011-10-13 |
WO2011127409A3 WO2011127409A3 (en) | 2012-06-07 |
Family
ID=44761197
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2011/031794 WO2011127409A2 (en) | 2010-04-08 | 2011-04-08 | Methods and compositions related to fatty alcohol biosynthetic enzymes |
Country Status (3)
Country | Link |
---|---|
US (1) | US20110250663A1 (en) |
AR (1) | AR084377A1 (en) |
WO (1) | WO2011127409A2 (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012135760A1 (en) | 2011-03-30 | 2012-10-04 | Ls9, Inc. | Compositions comprising and methods for producing beta-hydroxy fatty acid esters |
DE102012207921A1 (en) | 2012-05-11 | 2013-11-14 | Evonik Industries Ag | Multi-stage synthesis process with synthesis gas |
JP2014121325A (en) * | 2012-12-21 | 2014-07-03 | Evonik Industries Ag | PRODUCTION OF ω-AMINO FATTY ACID |
WO2014113571A3 (en) * | 2013-01-16 | 2014-11-13 | Ls9, Inc. | Acyl-acp reductase with improved properties |
WO2015057155A1 (en) * | 2013-10-18 | 2015-04-23 | Biopetrolia Ab | Engineering of hydrocarbon metabolism in yeast |
WO2015085271A1 (en) * | 2013-12-05 | 2015-06-11 | REG Life Sciences, LLC | Microbial production of fatty amines |
JP2020022473A (en) * | 2013-06-14 | 2020-02-13 | アールイージー ライフ サイエンシズ リミテッド ライアビリティ カンパニー | METHODS OF PRODUCING ω-HYDROXYLATED FATTY ACID DERIVATIVES |
US10787648B2 (en) | 2015-12-15 | 2020-09-29 | Genomatica, Inc. | Omega-hydroxylase-related fusion polypeptide variants with improved properties |
US11421206B2 (en) | 2014-06-16 | 2022-08-23 | Genomatica, Inc. | Omega-hydroxylase-related fusion polypeptides with improved properties |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2655605A1 (en) * | 2010-12-20 | 2013-10-30 | Matrix Genetics, LLC | Modified photosynthetic microorganisms for producing lipids |
EP3674399B1 (en) | 2012-04-02 | 2021-09-22 | Genomatica, Inc. | Car enzymes and improved production of fatty alcohols |
WO2014015278A1 (en) * | 2012-07-20 | 2014-01-23 | Codexis, Inc. | Production of fatty alcohols from engineered microorganisms |
KR20150069015A (en) * | 2012-10-15 | 2015-06-22 | 게노마티카 인코포레이티드 | Microorganisms and methods for production of specific length fatty alcohols and related compounds |
US9034629B2 (en) * | 2013-01-25 | 2015-05-19 | Joule Unlimited Technologies, Inc. | Recombinant synthesis of medium chain-length alkanes |
AU2014225436B2 (en) | 2013-03-07 | 2018-05-10 | Genomatica, Inc. | Downstream processing of fatty alcohol compositions produced by recombinant host cells |
US20160376600A1 (en) | 2013-11-25 | 2016-12-29 | Genomatica, Inc. | Methods for enhancing microbial production of specific length fatty alcohols in the presence of methanol |
AU2015289430B2 (en) * | 2014-07-18 | 2020-03-05 | Genomatica, Inc. | Microbial production of fatty diols |
EP3342873A1 (en) * | 2016-12-29 | 2018-07-04 | Metabolic Explorer | Conversion of methylglyoxal into hydroxyacetone using enzymes and applications thereof |
CA3058950A1 (en) | 2017-04-03 | 2018-10-11 | Genomatica, Inc. | Thioesterase variants having improved activity for the production of medium-chain fatty acid derivatives |
WO2020047304A1 (en) | 2018-08-31 | 2020-03-05 | Genomatica, Inc. | Xylr mutant for improved xylose utilization or improved co-utilization of glucose and xylose |
WO2020123563A1 (en) * | 2018-12-10 | 2020-06-18 | The Regents Of The University Of California | Engineered polypeptides that exhibit increased catalytic efficiency for unnatural cofactors and uses thereof |
CN112410223A (en) * | 2020-10-21 | 2021-02-26 | 中国科学院广州地球化学研究所 | Method capable of separating functional microorganisms capable of degrading polycyclic aromatic hydrocarbons in situ |
Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5000000A (en) | 1988-08-31 | 1991-03-19 | University Of Florida | Ethanol production by Escherichia coli strains co-expressing Zymomonas PDC and ADH genes |
US5028539A (en) | 1988-08-31 | 1991-07-02 | The University Of Florida | Ethanol production using engineered mutant E. coli |
US5424202A (en) | 1988-08-31 | 1995-06-13 | The University Of Florida | Ethanol production by recombinant hosts |
US5482846A (en) | 1988-08-31 | 1996-01-09 | University Of Florida | Ethanol production in Gram-positive microbes |
US5602030A (en) | 1994-03-28 | 1997-02-11 | University Of Florida Research Foundation | Recombinant glucose uptake system |
US5939250A (en) | 1995-12-07 | 1999-08-17 | Diversa Corporation | Production of enzymes having desired activities by mutagenesis |
US5965408A (en) | 1996-07-09 | 1999-10-12 | Diversa Corporation | Method of DNA reassembly by interrupting synthesis |
US20070003736A1 (en) | 2004-03-18 | 2007-01-04 | Sca Hygiene Products Ab | Method and device for producing a multi-ply web of flexible material, such as paper and nonwoven, and multi-ply material produced by the method |
US7169588B2 (en) | 1995-05-12 | 2007-01-30 | E. I. Du Pont De Nemours And Company | Bioconversion of a fermentable carbon source to 1,3-propanediol by a single microorganism |
WO2008100251A1 (en) | 2007-02-13 | 2008-08-21 | Ls9, Inc. | Modified microorganism uses therefor |
WO2009111513A1 (en) | 2008-03-03 | 2009-09-11 | Joule Biotechnologies, Inc. | Engineered co2 fixing microorganisms producing carbon-based products of interest |
WO2009111672A1 (en) | 2008-03-05 | 2009-09-11 | Genomatica, Inc. | Primary alcohol producing organisms |
WO2010042664A2 (en) | 2008-10-07 | 2010-04-15 | Ls9, Inc. | Method and compositions for producing fatty aldehydes |
WO2010062480A2 (en) | 2008-10-28 | 2010-06-03 | Ls9, Inc. | Methods and compositions for producing fatty alcohols |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7056714B2 (en) * | 2003-03-11 | 2006-06-06 | University Of Iowa Research Foundation, Inc. | Carboxylic acid reductase polypeptide, nucleotide sequence encoding same and methods of use |
WO2007142784A1 (en) * | 2006-05-31 | 2007-12-13 | Archer-Daniels-Midland Company | Enzymatic method of making aldehydes from fatty acids |
AU2008230735A1 (en) * | 2007-03-28 | 2008-10-02 | Ls9, Inc. | Enhanced production of fatty acid derivatives |
CA2722441C (en) * | 2008-05-16 | 2021-01-26 | Ls9, Inc. | Methods and compositions for producing hydrocarbons |
-
2011
- 2011-04-08 US US13/083,066 patent/US20110250663A1/en not_active Abandoned
- 2011-04-08 WO PCT/US2011/031794 patent/WO2011127409A2/en active Application Filing
- 2011-04-08 AR ARP110101186A patent/AR084377A1/en unknown
Patent Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5028539A (en) | 1988-08-31 | 1991-07-02 | The University Of Florida | Ethanol production using engineered mutant E. coli |
US5424202A (en) | 1988-08-31 | 1995-06-13 | The University Of Florida | Ethanol production by recombinant hosts |
US5482846A (en) | 1988-08-31 | 1996-01-09 | University Of Florida | Ethanol production in Gram-positive microbes |
US5000000A (en) | 1988-08-31 | 1991-03-19 | University Of Florida | Ethanol production by Escherichia coli strains co-expressing Zymomonas PDC and ADH genes |
US5602030A (en) | 1994-03-28 | 1997-02-11 | University Of Florida Research Foundation | Recombinant glucose uptake system |
US7169588B2 (en) | 1995-05-12 | 2007-01-30 | E. I. Du Pont De Nemours And Company | Bioconversion of a fermentable carbon source to 1,3-propanediol by a single microorganism |
US5939250A (en) | 1995-12-07 | 1999-08-17 | Diversa Corporation | Production of enzymes having desired activities by mutagenesis |
US5965408A (en) | 1996-07-09 | 1999-10-12 | Diversa Corporation | Method of DNA reassembly by interrupting synthesis |
US20070003736A1 (en) | 2004-03-18 | 2007-01-04 | Sca Hygiene Products Ab | Method and device for producing a multi-ply web of flexible material, such as paper and nonwoven, and multi-ply material produced by the method |
WO2008100251A1 (en) | 2007-02-13 | 2008-08-21 | Ls9, Inc. | Modified microorganism uses therefor |
WO2009111513A1 (en) | 2008-03-03 | 2009-09-11 | Joule Biotechnologies, Inc. | Engineered co2 fixing microorganisms producing carbon-based products of interest |
WO2009111672A1 (en) | 2008-03-05 | 2009-09-11 | Genomatica, Inc. | Primary alcohol producing organisms |
WO2010042664A2 (en) | 2008-10-07 | 2010-04-15 | Ls9, Inc. | Method and compositions for producing fatty aldehydes |
WO2010062480A2 (en) | 2008-10-28 | 2010-06-03 | Ls9, Inc. | Methods and compositions for producing fatty alcohols |
Non-Patent Citations (54)
Title |
---|
"Current Protocols in Molecular Biology", 1989, JOHN WILEY & SONS, pages: 6.3.1 - 6.3.6 |
"Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS |
ALTSCHUL ET AL., FEBS J., vol. 272, no. 20, 2005, pages 5101 - 5109 |
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, no. 3, 1990, pages 403 - 410 |
AMANN ET AL., GENE, vol. 69, 1988, pages 301 - 315 |
ARKIN ET AL., PNAS, USA, vol. 89, 1992, pages 7811 - 7815 |
ARNOLD, CURR. OPIN. BIOTECH., vol. 4, 1993, pages 450 - 455 |
BALDARI ET AL., EMBO J., vol. 6, 1987, pages 229 - 234 |
BLACK ET AL., J. BIOL CHEM., vol. 267, 1992, pages 25513 - 25520 |
BOWIE ET AL., SCIENCE, vol. 247, 1990, pages 1306 - 1310 |
CALDWELL ET AL., PCR METHODS APPLIC., vol. 2, 1992, pages 28 - 33 |
CAMILLI ET AL., SCIENCE, vol. 311, 2006, pages 1113 |
CAVIGLIA ET AL., J. BIOL. CHEM., vol. 279, 2004, pages 1163 - 1169 |
CURRIE: "Characterization of Environmental Particles", 1992, LEWIS PUBLISHERS, INC, article "Source Apportionment of Atmospheric Particles", pages: 3 - 74 |
DATSENKO ET AL., PROC. NATL. ACAD. SCI. USA, vol. 97, 2000, pages 6640 - 45 |
DATSENKO ET AL., PROC. NATL. ACAD. SCI. USA, vol. 97, 2000, pages 6640 - 6645 |
DATSENKO ET AL., PROC. NATL. ACAD. SCI. USA., vol. 97, 2000, pages 6640 - 6645 |
DELEGRAVE ET AL., BIOTECH. RES., vol. 11, 1993, pages 1548 - 1552 |
HEATH ET AL., PROG. LIPID RES., vol. 40, no. 6, 2001, pages 467 - 97 |
HYRUP ET AL., BIOORGAN. MED. CHEM., vol. 4, 1996, pages 5 - 23 |
JOHNSON ET AL., J. BIOL. CHEM., vol. 269, 1994, pages 18037 - 18046 |
KAUFMAN ET AL., EMBO J., vol. 6, 1987, pages 187 - 195 |
KNOLL ET AL., J. BIOL. CHEM., vol. 269, no. 23, 1994, pages 16348 - 56 |
KURJAN ET AL., CELL, vol. 30, 1982, pages 933 - 943 |
LEUNG ET AL., TECHNIQUE, vol. 1, 1989, pages 11 - 15 |
LUCKLOW ET AL., VIROLOGY, vol. 170, 1989, pages 31 - 39 |
MANIATIS ET AL., SCIENCE, vol. 236, 1987, pages 1237 |
MARRAKCHI ET AL., BIOCHEMICAL SOCIETY, vol. 30, 2002, pages 1050 - 1055 |
MARRAKCHI ET AL., J. BIOL. CHEM., vol. 277, 2002, pages 44809 |
MENDOZA ET AL., J. BIOL. CHEM., vol. 258, 1983, pages 2098 - 2101 |
MURLI ET AL., J. OF BACT., vol. 182, 2000, pages 1127 |
NACCARATO ET AL., LIPIDS, vol. 9, no. 6, 1974, pages 419 - 28 |
NEEDLEMAN, WUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 444 - 453 |
READING ET AL., FEMS MICROBIOL. LETT., vol. 254, 2006, pages 1 - 11 |
READING ET AL., FEMSMICROBIOL. LETT., vol. 254, 2006, pages 1 - 11 |
REIDHAAR-OLSON ET AL., SCIENCE, vol. 241, 1988, pages 53 - 57 |
ROCK ET AL., J. BACTERIOLOGY, vol. 178, 1996, pages 5382 - 5387 |
ROSENBERG, BMC BIOINFORMATICS, vol. 6, 2005, pages 278 |
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY |
SCHULTZ ET AL., GENE, vol. 54, 1987, pages 113 - 123 |
SCHWEIGER ET AL., APPL. MICROBIOL. BIOTECHNOL., 31 July 2009 (2009-07-31) |
SEED, NATURE, vol. 329, 1987, pages 840 |
SHOCKEY ET AL., PLANT. PHYSIOL., vol. 129, 2002, pages 1710 - 1722 |
SMITH ET AL., GENE, vol. 67, 1988, pages 31 - 40 |
SMITH ET AL., MOL. CELL BIOL., vol. 3, 1983, pages 2156 - 2165 |
STEMMER, PNAS, USA, vol. 91, 1994, pages 10747 - 10751 |
STUDIER ET AL.: "Gene Expression Technology: Methods in Enzymology", vol. 185, 1990, ACADEMIC PRESS, pages: 60 - 89 |
STUIVER ET AL., RADIOCARBON, vol. 19, 1977, pages 355 |
SUMMERTON ET AL., ANTISENSE NUCLEIC ACID DRUG DEV., vol. 7, 1997, pages 187 - 195 |
TANI ET AL., APPL. ENVIRON. MICROBIOL., vol. 66, no. 12, 2000, pages 5231 - 5 |
VENTURI, FEMS MICROBIO. REV, vol. 30, 2006, pages 274 - 291 |
VENTURI, FEMS MICROBIOL. REV., vol. 30, 2006, pages 274 - 291 |
WAHLEN ET AL., APP. ENVIRON. MICROBIOL., vol. 75, no. 9, 2009, pages 2758 - 2764 |
ZHANG ET AL., J. BIOL. CHEM., vol. 277, 2002, pages 15558 |
Cited By (42)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012135760A1 (en) | 2011-03-30 | 2012-10-04 | Ls9, Inc. | Compositions comprising and methods for producing beta-hydroxy fatty acid esters |
DE102012207921A1 (en) | 2012-05-11 | 2013-11-14 | Evonik Industries Ag | Multi-stage synthesis process with synthesis gas |
WO2013167663A2 (en) | 2012-05-11 | 2013-11-14 | Evonik Industries Ag | Multi-stage synthesis method with synthesis gas |
US10787688B2 (en) | 2012-05-11 | 2020-09-29 | Evonik Operations Gmbh | Multi-stage synthesis method with synthesis gas |
EP2847340A2 (en) * | 2012-05-11 | 2015-03-18 | Evonik Industries AG | Multi-stage fermentation process starting from synthesis gas |
JP2014121325A (en) * | 2012-12-21 | 2014-07-03 | Evonik Industries Ag | PRODUCTION OF ω-AMINO FATTY ACID |
KR20140090935A (en) * | 2012-12-21 | 2014-07-18 | 에보닉 인두스트리에스 아게 | Production of ω-amino fatty acids |
KR102252150B1 (en) | 2012-12-21 | 2021-05-17 | 에보니크 오퍼레이션즈 게엠베하 | PRODUCTION OF ω-AMINO FATTY ACIDS |
US11130944B2 (en) | 2013-01-16 | 2021-09-28 | Genomatica, Inc. | Acyl-ACP reductase with improved properties |
JP2021006067A (en) * | 2013-01-16 | 2021-01-21 | アールイージー ライフ サイエンシズ リミテッド ライアビリティ カンパニー | Acyl-ACP reductase with improved properties |
KR102265148B1 (en) | 2013-01-16 | 2021-06-15 | 게노마티카 인코포레이티드 | Acyl-acp reductase with improved properties |
EP3103867A1 (en) * | 2013-01-16 | 2016-12-14 | REG Life Sciences, LLC | Acyl-acp reductase with improved properties |
KR102082247B1 (en) | 2013-01-16 | 2020-02-28 | 알이지 라이프 사이언시스, 엘엘씨 | Acyl-acp reductase with improved properties |
JP2017042170A (en) * | 2013-01-16 | 2017-03-02 | アールイージー ライフ サイエンシズ リミテッド ライアビリティ カンパニー | Acyl-ACP reductase with improved properties |
KR101746040B1 (en) | 2013-01-16 | 2017-06-13 | 알이지 라이프 사이언시스, 엘엘씨 | Acyl-acp reductase with improved properties |
KR20170066694A (en) * | 2013-01-16 | 2017-06-14 | 알이지 라이프 사이언시스, 엘엘씨 | Acyl-acp reductase with improved properties |
US9683219B2 (en) | 2013-01-16 | 2017-06-20 | REG Life Sciences, LLC | Acyl-ACP reductase with improved properties |
JP2016503663A (en) * | 2013-01-16 | 2016-02-08 | アールイージー ライフ サイエンシズ リミテッド ライアビリティ カンパニー | Acyl-ACP reductase with improved properties |
JP2017209118A (en) * | 2013-01-16 | 2017-11-30 | アールイージー ライフ サイエンシズ リミテッド ライアビリティ カンパニー | Acyl-ACP reductase with improved properties |
US9873865B2 (en) | 2013-01-16 | 2018-01-23 | REG Life Sciences, LLC | Acyl-ACP reductase with improved properties |
JP7094343B2 (en) | 2013-01-16 | 2022-07-01 | ジェノマティカ, インコーポレイテッド | Acyl-ACP reductase with improved properties |
EP3385375A1 (en) * | 2013-01-16 | 2018-10-10 | REG Life Sciences, LLC | Acyl-acp reductase with improved properties |
US10208294B2 (en) | 2013-01-16 | 2019-02-19 | REG Life Sciences, LLC | Acyl-ACP reductase with improved properties |
JP2019141083A (en) * | 2013-01-16 | 2019-08-29 | アールイージー ライフ サイエンシズ リミテッド ライアビリティ カンパニー | Acyl-ACP reductase with improved properties |
WO2014113571A3 (en) * | 2013-01-16 | 2014-11-13 | Ls9, Inc. | Acyl-acp reductase with improved properties |
KR20200020998A (en) * | 2013-01-16 | 2020-02-26 | 알이지 라이프 사이언시스, 엘엘씨 | Acyl-acp reductase with improved properties |
US11981952B2 (en) | 2013-06-14 | 2024-05-14 | Genomatica, Inc. | Methods of producing omega-hydroxylated fatty acid derivatives |
JP2020022473A (en) * | 2013-06-14 | 2020-02-13 | アールイージー ライフ サイエンシズ リミテッド ライアビリティ カンパニー | METHODS OF PRODUCING ω-HYDROXYLATED FATTY ACID DERIVATIVES |
JP7168539B2 (en) | 2013-06-14 | 2022-11-09 | ジェノマティカ, インコーポレイテッド | Method for Producing ω-Hydroxylated Fatty Acid Derivatives |
US9957513B2 (en) | 2013-10-18 | 2018-05-01 | Biopetrolia Ab | Engineering of hydrocarbon metabolism in yeast |
US9777283B2 (en) | 2013-10-18 | 2017-10-03 | Biopetrolia Ab | Engineering of hydrocarbon metabolism in yeast |
WO2015057155A1 (en) * | 2013-10-18 | 2015-04-23 | Biopetrolia Ab | Engineering of hydrocarbon metabolism in yeast |
CN105874075A (en) * | 2013-12-05 | 2016-08-17 | Reg生命科学有限责任公司 | Microbial production of fatty amines |
US10900057B2 (en) | 2013-12-05 | 2021-01-26 | Genomatica, Inc. | Recombinant microorganisms for the production of fatty amines |
JP2021177785A (en) * | 2013-12-05 | 2021-11-18 | ジェノマティカ, インコーポレイテッド | Microbial production method of fatty amines |
JP2016538870A (en) * | 2013-12-05 | 2016-12-15 | アールイージー ライフ サイエンシズ リミテッド ライアビリティ カンパニー | Microbial production method of fatty amines |
US11814660B2 (en) | 2013-12-05 | 2023-11-14 | Genomatica, Inc. | Recombinant microorganisms for the production of fatty amines |
WO2015085271A1 (en) * | 2013-12-05 | 2015-06-11 | REG Life Sciences, LLC | Microbial production of fatty amines |
US11421206B2 (en) | 2014-06-16 | 2022-08-23 | Genomatica, Inc. | Omega-hydroxylase-related fusion polypeptides with improved properties |
US11441130B2 (en) | 2014-06-16 | 2022-09-13 | Genomatica, Inc. | Omega-hydroxylase-related fusion polypeptides with improved properties |
US10787648B2 (en) | 2015-12-15 | 2020-09-29 | Genomatica, Inc. | Omega-hydroxylase-related fusion polypeptide variants with improved properties |
US11384341B2 (en) | 2015-12-15 | 2022-07-12 | Genomatica, Inc. | Omega-hydroxylase-related fusion polypeptide variants with improved properties |
Also Published As
Publication number | Publication date |
---|---|
WO2011127409A3 (en) | 2012-06-07 |
US20110250663A1 (en) | 2011-10-13 |
AR084377A1 (en) | 2013-05-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210324430A1 (en) | Methods And Compositions For Producing Fatty Alcohols | |
US20110250663A1 (en) | Methods and compositions related to fatty alcohol biosynthetic enzymes | |
CA2738938C (en) | Methods and compositions for producing fatty aldehydes | |
US20160340694A1 (en) | Methods and compositions for producing olefins | |
EP2417246A1 (en) | Production of fatty acid derivatives | |
US20130035513A1 (en) | Methods and compositions for enhanced production of fatty aldehydes and fatty alcohols |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11715364 Country of ref document: EP Kind code of ref document: A2 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 11715364 Country of ref document: EP Kind code of ref document: A2 |