OA18507A - Production of α-olefins. - Google Patents
Production of α-olefins. Download PDFInfo
- Publication number
- OA18507A OA18507A OA1201700501 OA18507A OA 18507 A OA18507 A OA 18507A OA 1201700501 OA1201700501 OA 1201700501 OA 18507 A OA18507 A OA 18507A
- Authority
- OA
- OAPI
- Prior art keywords
- olefins
- seq
- free fatty
- fatty acid
- host cell
- Prior art date
Links
- 239000004711 α-olefin Substances 0.000 title claims abstract description 144
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 89
- 235000014113 dietary fatty acids Nutrition 0.000 claims abstract description 264
- 239000000194 fatty acid Substances 0.000 claims abstract description 264
- 150000004665 fatty acids Chemical class 0.000 claims abstract description 252
- 230000000694 effects Effects 0.000 claims abstract description 103
- 229920001184 polypeptide Polymers 0.000 claims abstract description 101
- 102000004190 Enzymes Human genes 0.000 claims description 114
- 108090000790 Enzymes Proteins 0.000 claims description 114
- 150000007523 nucleic acids Chemical class 0.000 claims description 109
- 239000000758 substrate Substances 0.000 claims description 100
- 108020004707 nucleic acids Proteins 0.000 claims description 98
- 238000006243 chemical reaction Methods 0.000 claims description 33
- 239000000314 lubricant Substances 0.000 claims description 27
- 230000014509 gene expression Effects 0.000 claims description 25
- 239000000203 mixture Substances 0.000 claims description 25
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 23
- 239000001963 growth media Substances 0.000 claims description 19
- 241000195493 Cryptophyta Species 0.000 claims description 18
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 18
- 102000005488 thioesterase family Human genes 0.000 claims description 18
- 108020002982 thioesterase family Proteins 0.000 claims description 18
- 239000002609 media Substances 0.000 claims description 14
- 229920001850 Nucleic acid sequence Polymers 0.000 claims description 11
- 241000894006 Bacteria Species 0.000 claims description 10
- 241000196324 Embryophyta Species 0.000 claims description 9
- 241000233866 Fungi Species 0.000 claims description 8
- 230000002018 overexpression Effects 0.000 claims description 7
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims description 5
- 238000009776 industrial production Methods 0.000 claims description 4
- 239000000178 monomer Substances 0.000 claims description 3
- 230000001105 regulatory Effects 0.000 claims description 3
- 238000005829 trimerization reaction Methods 0.000 claims description 2
- 241000058981 Jeotgalicoccus sp. Species 0.000 claims 1
- 230000015572 biosynthetic process Effects 0.000 abstract description 11
- 230000037348 biosynthesis Effects 0.000 abstract description 6
- 102000004169 proteins and genes Human genes 0.000 description 29
- 108090000623 proteins and genes Proteins 0.000 description 29
- 235000018102 proteins Nutrition 0.000 description 27
- 150000001336 alkenes Chemical class 0.000 description 23
- OKTJSMMVPCPJKN-UHFFFAOYSA-N carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 21
- PEDCQBHIVMGVHV-UHFFFAOYSA-N glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 20
- 239000000047 product Substances 0.000 description 18
- 239000002253 acid Substances 0.000 description 16
- -1 potymers Substances 0.000 description 15
- 229920001405 Coding region Polymers 0.000 description 13
- 238000000034 method Methods 0.000 description 12
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 10
- 150000002430 hydrocarbons Chemical class 0.000 description 10
- 230000004048 modification Effects 0.000 description 10
- 238000006011 modification reaction Methods 0.000 description 10
- 241000588724 Escherichia coli Species 0.000 description 9
- 238000000746 purification Methods 0.000 description 9
- 108020004999 Messenger RNA Proteins 0.000 description 8
- 235000001014 amino acid Nutrition 0.000 description 8
- 239000003550 marker Substances 0.000 description 8
- 239000000463 material Substances 0.000 description 8
- 229920002106 messenger RNA Polymers 0.000 description 8
- 230000037361 pathway Effects 0.000 description 8
- 101710038710 MPAO1 Proteins 0.000 description 7
- 101700011072 PAO Proteins 0.000 description 7
- 101700031972 PAOX Proteins 0.000 description 7
- 102100014567 PAOX Human genes 0.000 description 7
- 101700076839 SMOX Proteins 0.000 description 7
- 150000001413 amino acids Chemical class 0.000 description 7
- 230000001851 biosynthetic Effects 0.000 description 7
- 230000000875 corresponding Effects 0.000 description 7
- 241000894007 species Species 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 235000000346 sugar Nutrition 0.000 description 7
- 150000008163 sugars Chemical class 0.000 description 7
- RAXXELZNTBOGNW-UHFFFAOYSA-N Imidazole Chemical compound C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- XEKOWRVHYACXOJ-UHFFFAOYSA-N acetic acid ethyl ester Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 6
- 150000007513 acids Chemical class 0.000 description 6
- 238000007792 addition Methods 0.000 description 6
- 229910052799 carbon Inorganic materials 0.000 description 6
- 230000003197 catalytic Effects 0.000 description 6
- 229920003013 deoxyribonucleic acid Polymers 0.000 description 6
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 6
- 230000001131 transforming Effects 0.000 description 6
- 108090000604 Hydrolases Proteins 0.000 description 5
- 102000004157 Hydrolases Human genes 0.000 description 5
- 229940035893 Uracil Drugs 0.000 description 5
- 125000000539 amino acid group Chemical group 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 239000011575 calcium Substances 0.000 description 5
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 5
- 229920000642 polymer Polymers 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 229920000160 (ribonucleotides)n+m Polymers 0.000 description 4
- 241000195628 Chlorophyta Species 0.000 description 4
- 241000206751 Chrysophyceae Species 0.000 description 4
- 108020004705 Codon Proteins 0.000 description 4
- 229920002676 Complementary DNA Polymers 0.000 description 4
- 241000192700 Cyanobacteria Species 0.000 description 4
- FDGQSTZJBFJUBT-UHFFFAOYSA-N Hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 4
- SBUJHOSQTJFQJX-NOAMYHISSA-N Kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 4
- POULHZVOKOAJMA-UHFFFAOYSA-N Lauric acid Chemical compound CCCCCCCCCCCC(O)=O POULHZVOKOAJMA-UHFFFAOYSA-N 0.000 description 4
- TUNFSRHWOTWDNC-UHFFFAOYSA-N Myristic acid Chemical compound CCCCCCCCCCCCCC(O)=O TUNFSRHWOTWDNC-UHFFFAOYSA-N 0.000 description 4
- 241000199919 Phaeophyceae Species 0.000 description 4
- 241000206572 Rhodophyta Species 0.000 description 4
- LRFVTYWOQMYALW-UHFFFAOYSA-N Xanthine Chemical compound O=C1NC(=O)NC2=C1NC=N2 LRFVTYWOQMYALW-UHFFFAOYSA-N 0.000 description 4
- 241000206764 Xanthophyceae Species 0.000 description 4
- 239000002199 base oil Substances 0.000 description 4
- 239000002551 biofuel Substances 0.000 description 4
- 125000004432 carbon atoms Chemical group C* 0.000 description 4
- 238000004113 cell culture Methods 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 238000010790 dilution Methods 0.000 description 4
- 239000000284 extract Substances 0.000 description 4
- 230000004136 fatty acid synthesis Effects 0.000 description 4
- 238000000855 fermentation Methods 0.000 description 4
- 230000004151 fermentation Effects 0.000 description 4
- 230000005017 genetic modification Effects 0.000 description 4
- 235000013617 genetically modified food Nutrition 0.000 description 4
- 238000005805 hydroxylation reaction Methods 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 229960000318 kanamycin Drugs 0.000 description 4
- 244000005700 microbiome Species 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 239000001301 oxygen Substances 0.000 description 4
- 229910052760 oxygen Inorganic materials 0.000 description 4
- MYMOFIZGZYHOMD-UHFFFAOYSA-N oxygen Chemical compound O=O MYMOFIZGZYHOMD-UHFFFAOYSA-N 0.000 description 4
- 239000008188 pellet Substances 0.000 description 4
- 238000003259 recombinant expression Methods 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 101710035147 ura3 Proteins 0.000 description 4
- 229940064005 Antibiotic throat preparations Drugs 0.000 description 3
- 229940083879 Antibiotics FOR TREATMENT OF HEMORRHOIDS AND ANAL FISSURES FOR TOPICAL USE Drugs 0.000 description 3
- 229940042052 Antibiotics for systemic use Drugs 0.000 description 3
- 229940042786 Antitubercular Antibiotics Drugs 0.000 description 3
- 235000017934 Bacillus subtilis subsp subtilis str 168 Nutrition 0.000 description 3
- 241000276408 Bacillus subtilis subsp. subtilis str. 168 Species 0.000 description 3
- 229910014454 Ca-Cu Inorganic materials 0.000 description 3
- 108090000489 Carboxy-Lyases Proteins 0.000 description 3
- 102000004031 Carboxy-Lyases Human genes 0.000 description 3
- 240000006262 Cuphea hookeriana Species 0.000 description 3
- 102000003849 Cytochrome P450 Human genes 0.000 description 3
- 108050008488 Cytochrome P450 Proteins 0.000 description 3
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 3
- 229940093922 Gynecological Antibiotics Drugs 0.000 description 3
- 241000195644 Neochloris Species 0.000 description 3
- 241000206766 Pavlova Species 0.000 description 3
- 241001148696 Stichococcus Species 0.000 description 3
- 229940024982 Topical Antifungal Antibiotics Drugs 0.000 description 3
- 239000003242 anti bacterial agent Substances 0.000 description 3
- 238000004166 bioassay Methods 0.000 description 3
- 230000003115 biocidal Effects 0.000 description 3
- 235000010633 broth Nutrition 0.000 description 3
- UIIMBOGNXHQVGW-UHFFFAOYSA-M buffer Substances [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 235000014633 carbohydrates Nutrition 0.000 description 3
- 239000003054 catalyst Substances 0.000 description 3
- 239000007795 chemical reaction product Substances 0.000 description 3
- 101700001741 fatB Proteins 0.000 description 3
- 239000000446 fuel Substances 0.000 description 3
- 239000007789 gas Substances 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 238000010353 genetic engineering Methods 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 229940079866 intestinal antibiotics Drugs 0.000 description 3
- 150000002632 lipids Chemical class 0.000 description 3
- 230000000813 microbial Effects 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 229940005935 ophthalmologic Antibiotics Drugs 0.000 description 3
- 235000012162 pavlova Nutrition 0.000 description 3
- 239000012071 phase Substances 0.000 description 3
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 3
- 229910000162 sodium phosphate Inorganic materials 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 230000002194 synthesizing Effects 0.000 description 3
- CZPWVGJYEJSRLH-UHFFFAOYSA-N 289-95-2 Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 2
- 241001607836 Achnanthes Species 0.000 description 2
- 241000640374 Alicyclobacillus acidocaldarius Species 0.000 description 2
- 241000611184 Amphora Species 0.000 description 2
- 241000192542 Anabaena Species 0.000 description 2
- 241000219195 Arabidopsis thaliana Species 0.000 description 2
- 241000203069 Archaea Species 0.000 description 2
- 241000132092 Aster Species 0.000 description 2
- 241000206761 Bacillariophyta Species 0.000 description 2
- 239000002028 Biomass Substances 0.000 description 2
- 241001536324 Botryococcus Species 0.000 description 2
- 241000227752 Chaetoceros Species 0.000 description 2
- 241000195585 Chlamydomonas Species 0.000 description 2
- 241000195649 Chlorella <Chlorellales> Species 0.000 description 2
- 241000180279 Chlorococcum Species 0.000 description 2
- 241001608005 Cocconeis Species 0.000 description 2
- 241001147476 Cyclotella Species 0.000 description 2
- 241000206743 Cylindrotheca Species 0.000 description 2
- QIVBCDIJIAJPQS-SECBINFHSA-N D-tryptophane Chemical compound C1=CC=C2C(C[C@@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-SECBINFHSA-N 0.000 description 2
- 241000199914 Dinophyceae Species 0.000 description 2
- 235000019749 Dry matter Nutrition 0.000 description 2
- 241000195634 Dunaliella Species 0.000 description 2
- 241000195620 Euglena Species 0.000 description 2
- 102100018265 FA2H Human genes 0.000 description 2
- 101700026954 FA2H Proteins 0.000 description 2
- 241000692360 Fistulifera Species 0.000 description 2
- 241001105002 Fragilariopsis Species 0.000 description 2
- BJHIKXHVCXFQLS-UYFOZJQFSA-N Fructose Natural products OC[C@@H](O)[C@@H](O)[C@H](O)C(=O)CO BJHIKXHVCXFQLS-UYFOZJQFSA-N 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N HCl Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- 241000206759 Haptophyceae Species 0.000 description 2
- KEMQGTRYUADPNZ-UHFFFAOYSA-N Heptadecanoic acid Chemical compound CCCCCCCCCCCCCCCCC(O)=O KEMQGTRYUADPNZ-UHFFFAOYSA-N 0.000 description 2
- 229960000310 ISOLEUCINE Drugs 0.000 description 2
- UGQMRVRMYYASKQ-KMPDEGCQSA-N Inosine Natural products O[C@H]1[C@H](O)[C@@H](CO)O[C@@H]1N1C(N=CNC2=O)=C2N=C1 UGQMRVRMYYASKQ-KMPDEGCQSA-N 0.000 description 2
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 2
- 241001501885 Isochrysis Species 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- 241001534525 Lampriscus Species 0.000 description 2
- 241000013738 Monochrysis Species 0.000 description 2
- 235000021360 Myristic acid Nutrition 0.000 description 2
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 2
- 241000196305 Nannochloris Species 0.000 description 2
- 241000502321 Navicula Species 0.000 description 2
- 241000180701 Nitzschia <flatworm> Species 0.000 description 2
- 241000059630 Nodularia <Cyanobacteria> Species 0.000 description 2
- 241000192656 Nostoc Species 0.000 description 2
- IPCSVZSSVZVIGE-UHFFFAOYSA-N Palmitic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 2
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 2
- 241000425347 Phyla <beetle> Species 0.000 description 2
- 108010078762 Protein Precursors Proteins 0.000 description 2
- 102000014961 Protein Precursors Human genes 0.000 description 2
- 241000192511 Pseudanabaena Species 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N Purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- 229920001914 Ribonucleotide Polymers 0.000 description 2
- 229920000972 Sense strand Polymers 0.000 description 2
- JVBXVOWTABLYPX-UHFFFAOYSA-L Sodium dithionite Chemical compound [Na+].[Na+].[O-]S(=O)S([O-])=O JVBXVOWTABLYPX-UHFFFAOYSA-L 0.000 description 2
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 2
- 229920000978 Start codon Polymers 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 239000000654 additive Substances 0.000 description 2
- 239000000853 adhesive Substances 0.000 description 2
- 230000001070 adhesive Effects 0.000 description 2
- 150000001335 aliphatic alkanes Chemical class 0.000 description 2
- 102000005840 alpha-Galactosidase Human genes 0.000 description 2
- 108010030291 alpha-Galactosidase Proteins 0.000 description 2
- 230000000692 anti-sense Effects 0.000 description 2
- 230000002238 attenuated Effects 0.000 description 2
- 230000001580 bacterial Effects 0.000 description 2
- 239000011203 carbon fibre reinforced carbon Substances 0.000 description 2
- 230000001413 cellular Effects 0.000 description 2
- 239000001913 cellulose Substances 0.000 description 2
- 229920002678 cellulose Polymers 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 2
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 2
- 230000002255 enzymatic Effects 0.000 description 2
- 238000007824 enzymatic assay Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000004817 gas chromatography Methods 0.000 description 2
- 230000002068 genetic Effects 0.000 description 2
- 230000004077 genetic alteration Effects 0.000 description 2
- 231100000118 genetic alteration Toxicity 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 230000003899 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 229910000041 hydrogen chloride Inorganic materials 0.000 description 2
- 230000000640 hydroxylating Effects 0.000 description 2
- 238000000099 in vitro assay Methods 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 229960003786 inosine Drugs 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 150000004667 medium chain fatty acids Chemical class 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 230000003287 optical Effects 0.000 description 2
- 239000012074 organic phase Substances 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 230000000865 phosphorylative Effects 0.000 description 2
- 239000004033 plastic Substances 0.000 description 2
- 229920003023 plastic Polymers 0.000 description 2
- 150000003881 polyketide derivatives Chemical class 0.000 description 2
- 229930001119 polyketides Natural products 0.000 description 2
- OZAIFHULBGXAKX-UHFFFAOYSA-N precursor Substances N#CC(C)(C)N=NC(C)(C)C#N OZAIFHULBGXAKX-UHFFFAOYSA-N 0.000 description 2
- 108090000765 processed proteins & peptides Proteins 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 230000002829 reduced Effects 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 108091007521 restriction endonucleases Proteins 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 238000007086 side reaction Methods 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 239000012536 storage buffer Substances 0.000 description 2
- 239000004094 surface-active agent Substances 0.000 description 2
- 239000004753 textile Substances 0.000 description 2
- 230000014621 translational initiation Effects 0.000 description 2
- 238000000870 ultraviolet spectroscopy Methods 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 229940075420 xanthine Drugs 0.000 description 2
- DCTOHCCUXLBQMS-UHFFFAOYSA-N 1-undecene Chemical compound CCCCCCCCCC=C DCTOHCCUXLBQMS-UHFFFAOYSA-N 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 229960001230 Asparagine Drugs 0.000 description 1
- 241001474374 Blennius Species 0.000 description 1
- KGBXLFKZBHKPEV-UHFFFAOYSA-N Boric acid Chemical compound OB(O)O KGBXLFKZBHKPEV-UHFFFAOYSA-N 0.000 description 1
- 210000001736 Capillaries Anatomy 0.000 description 1
- 239000004215 Carbon black (E152) Substances 0.000 description 1
- 229960005091 Chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N Chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 229920002759 Circular DNA Polymers 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 240000007170 Cocos nucifera Species 0.000 description 1
- 235000013162 Cocos nucifera Nutrition 0.000 description 1
- ARUVKPQLZAKDPS-UHFFFAOYSA-L Copper(II) sulfate Chemical compound [Cu+2].[O-][S+2]([O-])([O-])[O-] ARUVKPQLZAKDPS-UHFFFAOYSA-L 0.000 description 1
- 241001466512 Coscinodiscus Species 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N D-Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UWTATZPHSA-N D-serine Chemical compound OC[C@@H](N)C(O)=O MTCFGRXMJLQNBG-UWTATZPHSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 108010055012 EC 4.1.1.23 Proteins 0.000 description 1
- 239000005977 Ethylene Substances 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108009000552 Fatty Acid Biosynthesis Proteins 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- 229960002989 Glutamic Acid Drugs 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 241001499732 Gyrosigma Species 0.000 description 1
- 240000006669 Helianthus annuus Species 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- 108091006038 His-tagged proteins Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- 239000005639 Lauric acid Substances 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M Lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 229940071257 Lithium acetate Drugs 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- LTYOQGRJFJAKNA-KKIMTKSISA-N Malonyl CoA Natural products S(C(=O)CC(=O)O)CCNC(=O)CCNC(=O)[C@@H](O)C(CO[P@](=O)(O[P@](=O)(OC[C@H]1[C@@H](OP(=O)(O)O)[C@@H](O)[C@@H](n2c3ncnc(N)c3nc2)O1)O)O)(C)C LTYOQGRJFJAKNA-KKIMTKSISA-N 0.000 description 1
- 230000036740 Metabolism Effects 0.000 description 1
- 230000035633 Metabolized Effects 0.000 description 1
- 241001478792 Monoraphidium Species 0.000 description 1
- 229910004619 Na2MoO4 Inorganic materials 0.000 description 1
- 241000326556 Odontella <springtail> Species 0.000 description 1
- 229920000272 Oligonucleotide Polymers 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 241000315040 Omura Species 0.000 description 1
- 241000514008 Oocystis Species 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 241000206731 Phaeodactylum Species 0.000 description 1
- 229960005190 Phenylalanine Drugs 0.000 description 1
- 241000209504 Poaceae Species 0.000 description 1
- 208000008425 Protein Deficiency Diseases 0.000 description 1
- 241001509341 Pyramimonas Species 0.000 description 1
- 241000221523 Rhodotorula toruloides Species 0.000 description 1
- 241000195663 Scenedesmus Species 0.000 description 1
- TVXXNOYZHKPKGW-UHFFFAOYSA-N Sodium molybdate Chemical compound [Na+].[Na+].[O-][Mo]([O-])(=O)=O TVXXNOYZHKPKGW-UHFFFAOYSA-N 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 241000507187 Staphylococcus massiliensis Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 241000203644 Streptoalloteichus hindustanus Species 0.000 description 1
- 241000192707 Synechococcus Species 0.000 description 1
- 241000192560 Synechococcus sp. Species 0.000 description 1
- 241000192584 Synechocystis Species 0.000 description 1
- 241000196321 Tetraselmis Species 0.000 description 1
- JZRWCGZRTZMZEH-UHFFFAOYSA-N Thiamine Natural products CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N JZRWCGZRTZMZEH-UHFFFAOYSA-N 0.000 description 1
- 231100000765 Toxin Toxicity 0.000 description 1
- 241000192118 Trichodesmium Species 0.000 description 1
- 102100005023 UMPS Human genes 0.000 description 1
- 241000218199 Umbellularia Species 0.000 description 1
- 241000235013 Yarrowia Species 0.000 description 1
- VBJHPXDIVMXHJU-UHFFFAOYSA-N Zeocin Chemical compound N=1C(C=2SC=C(N=2)C(=O)NCCCCN=C(N)N)CSC=1CCNC(=O)C(C(O)C)NC(=O)C(C)C(O)C(C)NC(=O)C(C(OC1C(C(O)C(O)C(CO)O1)OC1C(C(OC(N)=O)C(O)C(CO)O1)O)C=1[N]C=NC=1)NC(=O)C1=NC(C(CC(N)=O)NCC(N)C(N)=O)=NC(N)=C1C VBJHPXDIVMXHJU-UHFFFAOYSA-N 0.000 description 1
- 108010084455 Zeocin Proteins 0.000 description 1
- 230000002378 acidificating Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 239000002647 aminoglycoside antibiotic agent Substances 0.000 description 1
- ZGXJTSGNIOSYLO-UHFFFAOYSA-N aminolevulinic acid Chemical compound NCC(=O)CCC(O)=O ZGXJTSGNIOSYLO-UHFFFAOYSA-N 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-N ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 1
- 150000003863 ammonium salts Chemical class 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229920000704 biodegradable plastic Polymers 0.000 description 1
- 230000005587 bubbling Effects 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- UGFAIRIUMAVXCW-UHFFFAOYSA-N carbon monoxide Chemical compound [O+]#[C-] UGFAIRIUMAVXCW-UHFFFAOYSA-N 0.000 description 1
- 229910002091 carbon monoxide Inorganic materials 0.000 description 1
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000003245 coal Substances 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 229910000366 copper(II) sulfate Inorganic materials 0.000 description 1
- 229920003020 cross-linked polyethylene Polymers 0.000 description 1
- 239000004703 cross-linked polyethylene Substances 0.000 description 1
- 239000010779 crude oil Substances 0.000 description 1
- 238000010192 crystallographic characterization Methods 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 230000003247 decreasing Effects 0.000 description 1
- 238000011033 desalting Methods 0.000 description 1
- 229910001882 dioxygen Inorganic materials 0.000 description 1
- 230000002222 downregulating Effects 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003797 essential amino acid Substances 0.000 description 1
- 235000020776 essential amino acid Nutrition 0.000 description 1
- VGGSQFUCUMXWEO-UHFFFAOYSA-N ethene Chemical compound C=C VGGSQFUCUMXWEO-UHFFFAOYSA-N 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 150000002190 fatty acyls Chemical group 0.000 description 1
- 238000005755 formation reaction Methods 0.000 description 1
- SXMOKYXNAPLNCW-GORZOVPNSA-N formyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 SXMOKYXNAPLNCW-GORZOVPNSA-N 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 230000002538 fungal Effects 0.000 description 1
- 238000000769 gas chromatography-flame ionisation detection Methods 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000010358 genetic engineering technique Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 125000004435 hydrogen atoms Chemical class [H]* 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 239000003456 ion exchange resin Substances 0.000 description 1
- 229920003303 ion-exchange polymer Polymers 0.000 description 1
- 125000000468 ketone group Chemical group 0.000 description 1
- 230000000670 limiting Effects 0.000 description 1
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- LTYOQGRJFJAKNA-DVVLENMVSA-N malonyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-DVVLENMVSA-N 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000001404 mediated Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 230000035786 metabolism Effects 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 238000001728 nano-filtration Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 108010086644 oleoyl-(acyl-carrier-protein) hydrolase Proteins 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000003415 peat Substances 0.000 description 1
- 239000003208 petroleum Substances 0.000 description 1
- 229920000098 polyolefin Polymers 0.000 description 1
- 239000012474 protein marker Substances 0.000 description 1
- 230000002285 radioactive Effects 0.000 description 1
- 239000012429 reaction media Substances 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000284 resting Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000002002 slurry Substances 0.000 description 1
- 239000011684 sodium molybdate Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 238000006277 sulfonation reaction Methods 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 230000004083 survival Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 101700084814 tesB Proteins 0.000 description 1
- KYMBYSLLVAOCFI-UHFFFAOYSA-N thiamine Chemical compound CC1=C(CCO)SCN1CC1=CN=C(C)N=C1N KYMBYSLLVAOCFI-UHFFFAOYSA-N 0.000 description 1
- 235000019157 thiamine Nutrition 0.000 description 1
- 229960003495 thiamine Drugs 0.000 description 1
- 239000011721 thiamine Substances 0.000 description 1
- 210000001519 tissues Anatomy 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 108020003112 toxins Proteins 0.000 description 1
- 238000004450 types of analysis Methods 0.000 description 1
- 238000002525 ultrasonication Methods 0.000 description 1
- 150000004670 unsaturated fatty acids Chemical class 0.000 description 1
- 235000021122 unsaturated fatty acids Nutrition 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N β-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
Abstract
The present invention relates to the biosynthesis of α-olefins. In particular, the invention provides methods for the production of medium-chain αolefins, more particularly C11 α-olefins, using a polypeptide with decarboxylase activity on free fatty acids with 8 to 14 carbons, in particular on C8-C12 free fatty acids, more particularly on C12 free fatty acids, or a genetically engineered host cell expressing or overexpressing said polypeptide.
Description
DUPLICATA
DEMANDE DE BREVET D’INVENTION PCT/EP2016/065389 DU 30/06/2016
Nom du Déposant(s):
1. TOTAL RAFFINAGE CHIMIE
2. Q1NGDAO 1NSTITUTE OF B1OENERGY AND B1OPROCESS TECHNOLOGY (QIBEBT)
Titre d’invention: PRODUCTION OF a-OLEFINS
Noms d’inventeur(s):
1. FOURAGE, Laurent
2. LAEUFFER, Frédéric
3. STRUB, Henri
4. WANG, Yun
5. XU, Jian
6. XU, Huifang ,
7. Ll, Shengying
Mandataire : SCP AKKUM, AKKUM & Associâtes
B.P 4966 Yaoundé - Cameroun.
PRODUCTION OF α-OLEFINS
TECHNICAL FIELD
The application generally relates to biosynthetic processes, ln particular, the application relates to the biosynthesis of a-olefins.
BACKGROUND
Unsaturated hydrocarbons such as α-olefins are an Industrially Important group of molécules which can serve as precursors for lubricants and surfactants ln addition to 10 being used ln fuels, or as feedstock for further chemical conversion to e.g. fuels, potymers, plastics, textiles, solvents, adhesives, etc.
α-Olefins, more particularty linear α-olefins, hâve traditionally been produced from petroleum sources through oligomérization of ethylene but it would be advantageous to fmd alternative methods for producing such α-olefins, notably from renewabte 15 resources. Moreover, the chemical synthesis results in a mixture of linear a-olefins which exhibit different carbon chain length. Such mixture may not be desired as feedstock for fùrther chemical conversion such as the production of poly-a-olefîns of Interest.
Microbial production of olefins has been reported. Examples include a three-gene 20 cluster responsible for generating alkenes with Internat double bonds through the headto-head condensation of two fatty acyl-coenzyme A (acyl-CoA) molécules in Micmcoccus luteus (Beller et al. 2010 Appl Environ Microbiol 76:1212-1223); a unique P450 decarboxytase OleTjE from Jaotgalicoccus sp. ATCC 6456, which directly decarboxylates 0« to C» free fatty acids to form α-olefins in presence of H2O2 (WO 25 2009/085278, Rude et al. 2011 Appl Environ Microbiol 77:1718-1727, Uu et al. 2014
Biotechnology for Biofuels 7:28); another enzyme of the cyp152 P450 enzyme family, namely P450esp from Bacillus subtilis subsp. subtilis str. 168, for which decarboxytase activity on palmitic acid has been shown (WO 2009/085278, Rude et al. 2011); and a type I polyketide synthase from Synechococcus sp. PCC 7002, which is capable of 30 transforming fatty acyl-ACPs Into α-olefins via sequential polyketide synthase chain élongation, keto réduction, sulfonation mediated by its sutfotransferase domain, and
the coupled hydrolysis and décarboxylation catalyzed by the thloesterase domain (Mendez-Perez et al. 2011 Appl Environ Microbiol 77:4264-4267).
ln general, there is a need for alternative production methods of α-olefins, and more particularly improved processes that allow production of α-olefins at a lower cost and/or 5 that are more environmental friendly. It is also desired to provide production methods which allow tailoring of the carbon chain length of the α-otefins, more particularly a production method for medium-chain α-olefins (i.e. a-otefins having a carbon chain length comprised between 7 and 13), more particularly for Cn a-olefins.
SUMMARY OF THE INVENTION
The présent invention solves one or more of the above described problems of the prior art. ln particular, methods are provided for the production of α-olefins which allow controlling the carbon chain length of the a-olefins.
The présent invention is based, at least In part, on the discovery that certain genomic 15 sequences of Alicyclobacilius acidocaldarius and Staphylococcus massiliensis encode enzymes having free fatty acid decarboxylase activity. None of these organisms hâve previously been reported to produce terminal olefins. It has further been found that certain polypeptides can be identified which hâve spécifie decarboxylase activity on medium-chain free fatty acids, in particular Ce to C12 free fatty acids, more particularly 20 on C12 free fatty acids, thereby producing medium-chain α-olefîns, in particular C? to
C11 α-olefins, more particularly Cn α-olefins. The application provides examples thereof, more particularly an olefin-producing enzyme (Sm46) identified in Staphilococcus massiliensis, the P450bsp fatty acid hydroxylase (Bs168) from Bacillus subtilis subsp. subtilis str. 168 and the decarboxylase enzyme identified in 25 Alicyciobaciilus acidocaldarius, hâve spécifie decarboxylase activity on C12 free fatty acids, thereby producing Cn α-olefins. The identification of these enzymes facilitâtes the further identification of similar enzymes from other organisms.
The présent invention is in particular captured by any one or any combination of one or more of the below numbered aspects and embodiments (i) to (xvii) wherein:
(i) A method for the production of C7-C11 α-olefins comprising culturing a recombinant host cell comprising a recombinant nucleic acid encoding a decarboxylase enzyme under conditions suitable for the production of C7 to Ch α-olefins by said host cell, wherein the preferred substrate of said decarboxylase enzyme Is an CrC12 free fatty acid.
(ii) The method according to (i), wherein said nucleic acid encoding a decarboxylase enzyme comprises a nucléotide sequence having at least about 75%, preferably at least about 80%, more preferably at least about 95%, sequence identity to SEQ ID NO:10, SEQ ID NO:7, SEQ ID NO: 3 or SEQ ID NO: 5 and wherein said recombinant nucleic acid ensures expression or overexpression of said decarboxylase.
(iii) The method according to (i) or (ii), wherein said nucleic acid encodes a polypeptide comprising an amino acid sequence having at least 80%, preferably at least 90% Identity, to SEQ ID NO:13, SEQ ID NO: 8, SEQ ID NO: 4 or SEQ ID NO: 6 or an active fragment of said polypeptide.
(iv) The method according to any one of (i) to (iii), wherein said nucleic acid encodes a polypeptide comprising the amino add sequence of SEQ ID NO:13.
(v) The method according to any one of (i) to (iv), wherein C,2 free fatty acids are the preferred substrate of said decarboxylase, and wherein said α-olefins are Ch a-olefins.
(vi) The method according to any one of (i) to (v), wherein said host cell Is cultivated ln a medium comprising Ce-Ci2free fatty acids, preferably Cufree fatty acids.
(vii) The method according to any one of (i) to (vi), wherein the host cell has further been genetically engineered to produce or overproduce Ce-Cu free fatty acids, preferably Cufree fatty acids.
(viii) The method according to (vii), wherein the host cell comprises a recombinant nucleic acid encoding an enzyme involved in the production of free fatty acids with a carbon chain length comprised between 8 and 12, more particularly a C12 free fatty acid.
(ix) The method according to claim (viii), wherein the host ceil comprises a recombinant nucieic acid encoding a thioesterase having activity on Ce to Cu acyl-ACP, preferably on Cn acyl-ACP.
(x) The method according to any one of (i) to (ix), wherein the host cell is an oieaginous host ceil.
(xi) The method according to any one of (I) to (x), wherein the host ceil Is selected from the group comprising bacteria, yeasts, fungi, piants and aigae.
(xii) The method according to any one of (i) to (xi), further comprising the step of recovering the α-olefins from the host ceil or the culture medium.
(xiii) A method for the production of poly-a-oiefins comprising the foilowing steps:
i) producing Ct-Ch α-olefins according to a method according to any one of (i) to (xii); and ii) performîng an oligomérization reaction using the α-olefins obtained in step i) as monomer to produce an oligomer; and optionally, iii) hydrogenating the oiigomer produced in step ii).
(xiv) The method according to (xiii), wherein the poly-a-olefins are C33 poly-a-olefins, wherein step i) comprises the production of Cn α-olefins, and wherein the oligomérization reaction in step ii) is a trimerization reaction.
(xv) A composition comprising poly-a-olefins obtainabie by the method according to (xiv), wherein at least 85%; preferably at least 90%, more preferably at least 95%, of the poly-a-oiefins are C33 poly-a-olefins.
(xvi) A poiypeptide having decarboxylase activity, wherein said polypeptide comprises an amino acid sequence having at least 80%, preferably at ieast 90%, identity to SEQ ID NO:13.
(xvii) An isolated nucieic acid encoding an enzyme having decarboxyiase activity, 35 wherein said nucieic acid comprises a nucléotide sequence having at least about 80%,
s more preferably at least about 95%, sequence identity to SEQ ID NO: 7 or SEQ ID NO:
5.
(xviii) The polypeptide according to (xvi) or the Isoiated nucleic acid according to (xvii), 5 wherein C8 to Cu free fatty acids, preferably Ce to C12 free fatty acids, more preferably Cu free fatty acids are the preferred substrate of said decarboxylase.
(xix) The isoiated nucleic acid according to (xvii) or (xviii), wherein said nucleic acid comprises a nucléotide sequence having at least about 95% sequence identity to SEQ 10 ID NO: 7.
(xx) A vector comprising at least one regulatory sequence operatively coupled to the nucleic acid sequence of any of (xvii) to (xix).
(xxi) A host cell comprising lhe nucleic acid according to any of claims (xvii) to (xix) Integrated Into its genome or the vector according to claim 20.
(xxii) The host cell of (xxi), which is an oleaglnous eukaryotic microalgae or oleaginous yeast.
(xxiii) Use of a host cell according to (xxi) or (xxii) for the industrial production of lubricants.
(xxiv) Lubricant comprising poly-a-olefins obtalnable by a method according to (xiii) or 25 (xiv).
(xxv) Lubricant according to (xxiv), wherein said poly-a-olefins are biosourced polya-olefïns.
(xxvi) Lubricant according to (xxiv) or (xxv), wherein at least 50%, preferably at least 85%, of said poly-a-olefins consist of CJ3 poly-a-olefins.
(xxvii) Lubricant according to any one of (xxiv) to (xxvi), wherein said lubricant is an automotive lubricant.
(ix) Th© method according to claim (viii), wherein the host cell comprises a recombinant nucleic acid encoding a thioesterase having activity on Ca to C« acyl-ACP, preferably on Ci2 acyl-ACP.
(x) The method according to any one of (i) to (ix), wherein the host ceil is an oleaginous host cell. - - ' (xi) The method according to any one of (i) to (x), wherein the host cell is selected from the group comprising bacteria, yeasts, fungi. plants and algae.
(xii) The method according to any one of (i) to (xi), further comprising the step of recovering the α-olefins from the host cell or the culture medium.
(xiii) A method for the production of poly-a-olefins comprising the following steps:
i) producing C7-C11 α-olefins according to a method according to any one of (i) to (xii); and li) performing an oligomérization reaction using the α-olefins obtained In step i) as monomer to produce an oligomer; and optionally, iii) hydrogenating the oligomer produced in step Ii).
(xiv) The method according to (xiii), wherein the poly-a-olefins are C» poly-a-olefins, wherjein step i) comprises the production of Cu α-oleftns, and wherein the oligomérization réaction In step ii) is a trimerlzation reaction.
(xv) A composition comprising poly-a-olefins obtainable by the method according to (xiv), wherein at least 85%, preferably at least 90%, more preferably at least 95%, of the poly-a-olefins are C33 poly-a-olefins.
(xvi) A polypeptide having decarboxylase activity, wherein said polypeptide comprises an amino acid sequence having at ieast 80%, preferably at least 90%, identify to SEQ IDNO:13.
(xvii) An isolated nucleic acid encoding an enzyme having decarboxylase activity, wherein said nucleic acid comprises a nucléotide sequence having at least about 80%, more preferably at least about 95%, sequence Identity to SEQ ID NO: 7 or SEQ ID NO:
5.
(xviii) The polypeptide according to (xvi) or the Isolated nucleic acid according to (xvii), wherein C# to Cu free fatty acids, preferably Ce to C« free fatty acids, more preferably C12 free fatty acids are the prefemed substrate of said decarboxylase.
(xix) The Isolated nucleic acid according to (xvii) or (xviii), wherein said nucleic acid comprises a nucléotide sequence having at least about 95% sequence Identity to SEQ ID NO: 7, (xx) A vector comprising at.least one regulatory sequence operatively coupled to the nucleic acid sequence of any of (xvii) to (xix).
(xxi) A host cell comprising the nucleic acid according to any of claims (xvii) to (xlx) integrated Into Its genome or the vector according to claim 20.
(xxii) The host cell of (xxl), which Is an oleaglnous eukaryotic microalgae or oleaglnous yeast.
(xxlil) Use of a host cell according to (xxi) or (xxii) for the Industrial production of lubricants.
(xxiv) Lubricant comprising poly-a-olefins obtainable by a method according to (xiii) or (xiv).
(xxv) Lubricant according to (xxiv), wherein said poly-a-olefins are biosourced polya-oleflns.
(xxvl) Lubricant according to (xxiv) or (xxv), wherein at least 50%, preferably at least 85%, of said poly-a-olefins conslst of C33 poly-a-olefins.
(xxvfî) Lubricant according to any one of (xxiv) to (xxvi), wherein said lubricant Is an automotive lubricant.
*
BRIEF DESCRIPTION OF THE FIGURES
The teaching of the application Is illustrated by the following Figures which are to be consldered as illustrative only and do not In any way limit the scope of the claims.
Figure 1: Nucléotide (A,C,E,G) and amino acid (0,D,F,H) sequences of OleTx (Α,θ), 5 Bs168 (C,D), Aa162 (E,F) and Sm46 (G,H).
Figure 2: Plasmid map of pET28a.
Figure 3: Heterologous expression of genes encoding OleTje, 0s168, Aa162 and Sm46 in E. coli. SDS-PAGE analysis of lysâtes of E. coli cells carrying a recombinant plasmid comprising a gene encoding OleTJ, 0s168, Aa162 or Sm46, cultured under 10 conditions to induce expression of the transgene (0.4 mM IPTG) or not (unlnduced). M:
protein marker.
Figure 4: In vitro production of α-olefins by OleTjE, 0s168, Aa162 and Sm46. Genes encoding OleTJ, 0s168, Aa162 and Sm46 were recombinantly expressed in E coli. The supematant of the E. coli lysâtes was used in an in vitro assay for α-olefin 15 biosynthesis In the presence of C12· Cm or Ci® fatty acid substrate (1 mM) and H2O2 (500 pM). Cu, Cu and Cu α-olefin production by the different enzymes Is indicated as the area under the gas chromatographie (GC) peak.
Figure 5: (A) Codon-optimlzed (for expression in E. coli) nucléotide sequences encoding Sm46 and a truncated variant Sm46-del29 that has the N-terminal 29 amino 20 acids deleted (SEQ ID NO:9 and SEQ ID NO:10, respectively) with flanking restriction sites. Codon-optimlzed (for expression in E. coli) nucléotide sequence (SEQ ID NO: 9) encoding Sm46 is shown; Nde I and Xho I restriction sites are indicated in boid, the deleted sequence to encode for the truncated variant Sm46-del29 is underiined. (B) Amino acid sequence of Sm46-del29 (SEQ ID NO: 13).
Figure 6: In vitro conversion of free fatty acids Into α-olefins by a truncated variant of Sm46: Sm46-del29. Conversion percentages of the free fatty acid substrates (A) and the corresponding α-alkene products (B) are shown.
Figure 7: In vitro production of α-olefïns by Aa162, 0s168 and Sm46-del29.
Recombinantly expressed and purified enzyme (0.2 pM) was used in an in vitro assay for α-olefin biosynthesis in the presence of C« fatty acid substrate. The conversion
percentages are shown for, from left to right, Cu free fatty acid substrate, Cu a-olefin, a-hydroxy Cu fatty acid, β-hydroxy Cu fatty acid.
DETA1LED DESCRIPTION OF THE INVENTION
Unless otherwise defined, ail terms used In discloslng the Invention, including technical and scientific terms, hâve the meaning as commonly understood by one of ordinary skill in the art to which this Invention belongs. By means of further guidance, term définitions are Included to better appreciate the teaching of the présent invention.
As used herein, the singular forms a, an, and the include both singular and plural 10 referents unless the context clearly dictâtes otherwise.
The terms comprising, comprises and comprised of as used herein are synonymous with including, Includes or containing, contains, and are inclusive or open-ended and do not exclude additional, non-recited members, éléments or method steps. Where reference is made to embodiments as comprising certain éléments or 15 steps, this encompasses also embodiments which consist essentially of the recited éléments or steps.
The recitation of numerical ranges by endpolnts includes ail numbers and fractions subsumed within the respective ranges, as well as the recited endpolnts.
The term about as used herein when referring to a measurable value such as a 20 parameter, an amount, a temporal duration, and the like, Is meant to encompass variations of +/-10% or less, preferabiy +/-5% or less, more preferably +M% or less, and still more preferably +/-0.1% or less of and from the specified value, insofar such variations are appropriate to perform In the disclosed Invention, it is to be understood that the value to which the modifier about refers Is itself also specifically, and 25 preferably, disclosed.
AU documents cited In the présent spécification are hereby Incorporated by reference in their entirety.
Standard reference work setting forth the general principles of biochemistry includes
Biochemical Pathways: An Atlas of Biochemistry and Molecular Biology, ed. Michal, G,
John Wiley and Sons, Inc., New York, US, 1999.
β
The term ‘olefin’ or “alkene refera herein to molécules composed of carbon and hydrogen, containing at least one carbon-carbon double bond. Olefins containing one carbon-carbon double bond are denoted herein as mono-unsaturated hydrocarbons and hâve the chemical formula CnHîn. where n equals at least two.
Alpha-olefins, “α-olefins, 1-alkenes or “terminal olefins’ are used as synonyms herein and dénoté olefins or alkenes having a double bond at the primary or alpha (a) position. “Linear α-olefins or “LAO as used herein refer to α-olefins that hâve a linear hydrocarbon chain, whereas “branched α-olefins hâve a branch on one or more carbon atoms of the hydrocarbon chain. The term “medium-chain α-olefins is used herein to dénoté α-olefins with 7 to 13 carbons and encompasses any one or more of C? α-olefins, Ce a-olefins, C» α-olefins, Cio α-olefins, Cn α-olefins, C12 a-olefins, and C13 α-olefins. The term “uneven-numbered α-olefins refera to α-olefins wherein the number of carbon atoms is not even ln number. Thus, uneven-numbered mediumchain α-olefins encompass C7, Ce, C11 and C13 a-olefins.
As used herein, the term fatty acid or “free fatty acid means a carboxylic acid having the formula RCOOH, or a sait (RCOO-) thereof. R represents an aliphatic group, preferably an alkyl group. Fatty acids can be saturated, mono-unsaturated, or polyunsaturated. The term ‘medium-chain fatty acid or “medium-chain free fatty acid as used herein dénotés a fatty acid or free fatty acid having 8 to 14 carbon atoms. The 20 term “even-numbered fatty acids refera to fatty acids wherein the number of carbon atoms Is even in number. Thus, even-numbered medium-chain fatty acids encompass Ca, C10, C12 and Cu fatty acids.
As used herein, the term “host ceil’ refera to a cell that can be used to produce an aolefin as described herein. A host cell may be an isolated cell or a cell line grown in 25 culture, or a cell which résides in a living tissue or organism.
As used herein, the terms microbial, microbial organism or micro-organism are intended to mean any organism that existe as a microscopie cell that Is included within the domains of archaea, bacteria or eukaryotes. Therefore, the term Is Intended to encompass prokaryotlc or eukaryotic ceils or organisms having a microscopie size and • 30 includes bacteria, archaea and eubacteria such as cyanobacteria of ail species as well as eukaryotic micro-organisms such as fungi, Including yeasts, and algae. The term
also includes cell cultures of any species that can be cultured for the production of a blochemlcal.
The term “oleaginous as used herein with reference to a host cell dénotés cells characterized by their lipid accumulation capability. Typically, their biomass contains 5 over 20% lipids In dry matter.
The algae group encompasses, without limitation, (i) several eukaryotic phyla, Including the Rhodophyta (red algae), Chlorophyta (green algae), Dinoflagellata, Haptophyta, (ii) several classes from the eukaryotic phylum Heterokontophyta which include without limitation the classes Bacillariophycea (diatoms), Eustigmatophycea, 10 Phaeophyceae (brown algae), Xanthophyceae (yellow-green algae) and
Chrysophyceae (golden algae), and (iii) the prokaryotic phylum Cyanobacteria (bluegreen algae).
The term algae includes for example généra selected from : Achnanthes, Amphora, Anabaena, Anikstrodasmls, Arachnoidiscusm, Aster. Botryococcus, Chaetoceros, 15 Chlamydomonas, Chlorella, Chlomcoccum, Chorathron, Cocconeis, Coscinodiscus,
Cyclotella, Cylindrotheca, Dunalialla, Emiliana, Euglena, Fistulifara, Fragilariopsis, Gyrosigma, Hamatococcus, Isochrysis, Lampriscus, Monochrysis, Monoraphidium, Nannochloris, Nannnochloropsis, Navicula, Neochloris, Naphrochloris, Nephrosalmls, Nitzschia, Nodularia, Nostoc, Odontalla, Oochromonas, Oocystls, Oscillartoria, 20 Pavlova, Phaaodactylum, Playtmonas, Pleumchrysis, Porhyra, Pseudoanabaena,
Pyramimonas, Stichococcus, Synachococcus. Synechocystis, Tetraselmis, Thalassioslra, and Trichodasmlum.
The terms genetically engineered or genetically modified’ or ‘Tecombinanf as used herein with reference to a host cell dénoté a non-naturally occurring host cell, as well 25 as its recombinant progeny, that has at least one genetic alteration not found in a naturally occurring strain of the referenced species, including wild-type strains of the referenced species. Such genetic modification Is typically achieved by technical means (i.e. non-naturally) through human intervention and may include, e.g., the introduction of an exogenous nucleic acid and/or the modification, over-expression, or délétion of 30 an endogenous nucleic acid.
The term exogenous or ’foreign’ as used herein is intended to mean that the referenced molécule, in particular nucleic acid, is not naturally présent in the host cell.
The term endogenous or “native as used herein dénotés that the referenced molécule, in particular nucleic acid, is présent in the host cell.
By “recombinant nucleic acid when referring to a nucleic acid in a recombinant host cell, is meant that at least part of said nucleic acid is not naturally présent in the host 5 cell in the same genomic location. For instance a recombinant nucleic acid can comprise a coding sequence naturally occurring in the host cell under control of an exogenous promotor, or it can be an additional copy of a gene naturally occurring in the host cell, or a recombinant nucleic acid can comprise an exogenous coding sequence under the control of an endogenous promoter.
By nucleic acid’ is meant ollgomers and polymers of any length composed essentiaily of nucléotides, e.g., deoxyribonucleotides and/or ribonucleotides. Nucleic acids can comprise purine and/or pyrimidine bases and/or other natural (e.g., xanthine, inosine, hypoxanthine), chemically or biochemically modified (e.g., methylated), non-natural, or derivatised nucléotide bases. The backbone of nucleic acids can comprise sugars and phosphate groups, as can typically be found in RNA or DNA, and/or one or more modified or substituted sugars and/or one or more modified or substituted phosphate groups. Modifications of phosphate groups or sugars may be introduced to improve stability, résistance tô enzymatic dégradation, or some other useful property. A nucleic acid can be for example double-stranded, partly double stranded, or single-stranded.
Where single-stranded, the nucleic acid can be the sense strand or the antisense strand. The nucleic acid can be circular or linear. The term nucleic acid as used herein preferably encompasses DNA and RNA, specifically including genomic, hnRNA, pre-mRNA, mRNA, cDNA, recombinant or synthetic nucleic acids, including vectors.
By encoding is meant that a nucleic acid sequence or part(s) thereof corresponds, by 25 virtue of the genetic code of an organism in question, to a particular amino acid sequence, e.g., the amino acid sequence of a desired polypeptide or protein. By means of example, nucleic acids encoding” a particular polypeptide or protein, e.g. an enzyme, may encompass genomic, hnRNA, pre-mRNA, mRNA, cDNA, recombinant or synthetic nucleic acids.
Preferably, a nucleic acid encoding a particular polypeptide or protein may comprise an open reading frame (ORF) encoding said polypeptide or protein. An “open reading frame or “ORF refers to a succession of coding nucléotide triplets (codons) starting with a translation initiation codon and closlng with a translation termination codon
known per se, and not containing any internai ln-frame translation termination codon, and potentially capable of encoding a polypeptide or protein. Hence, the term may be synonymous with coding sequence* as used ln the art.
The nucleic acids taught herein may encode more than one polypeptide or protein.
Such nucleic acids are denoted as polycistronic* nucleic acids and typically comprise several ORFs or coding sequences, each encoding a polypeptide or protein.
The terms polypeptide and protein are used Interchangeably herein and generally refer to a polymer of amino acid residues linked by peptide bonds, and are not limited to a minimum length of the product. Thus, peptides, oiigopeptides, polypeptides, 10 dimers (hetero- and homo-), multimers (hetero- and homo-), and the like, are included within the définition. Both full-length proteins and fragments thereof are encompassed by the définition. The terms also include post-expression modifications of the polypeptide, for example, glycosylation, acétylation, phosphorylation, etc. Furthermore, for purposes of the présent invention, the terms also refer to such when including 15 modifications, such as délétions, additions and substitutions (e.g., conservative in nature), to the sequence of a native protein or polypeptide.
The term enzyme* as used herein dénotés a biological molecufe that catalyzes e chemical reaction. The term encompasses single enzymes, I.e. single catalytic entities, as well as Systems comprising more than one catalytic entity. The enzymes described 20 herein can naturally possess the recited activity or they can be englneered to exhibit said activity.
As used herein, fatty acid enzyme means any enzyme Involved in fatty acid blosynthesls. As used herein, the term fatty acid biosynthetic pathway means a biosynthetic pathway that produces fatty acids. Fatty acid enzymes can be expressed 25 or over-expressed in a host cell to produce fatty acids.
As used herein, the terms purify, purified, or purification means the removal or isolation of a molécule from its environment by, for example, isolation or séparation. As used herein, these terms also refer to the removal of contaminants from a sample. For example, when α-oleflns are produced in a host cell, the oiefins can be purified by the 30 removal of other cellular components (e.g., nucleic acids, polypeptides, iipids, carbohydrates, or other hydrocarbons). The terms purify, purified, and purification* do not require absolute purity. They are relative terms.
also includes cell cultures of any species that can be cultured for the production of a biochemical.
The term oleaginous as used herein with reference to a host cell dénotés cells characterized by their lipid accumulation capability. Typically, their biomass contains 5 over 20% liplds In dry matter.
* The algae group encompasses, without limitation, (i) several eukaryotic phyla, including the Rhodophyta (red algae), Chlorophyta (green algae), Dinoflagellata, Haptophyta, (ii) several classes from the eukaryotic phylum Heterokontophyta which include without limitation the classes Bacillariophycea (diatoms), Eustigmatophycea, 10 Phaeophyceae (brown algae), Xanthophyceae (yellow-green algae) and
Chrysophyceae (golden algae), and (iii) the prokaryotic phylum Cyanobacterla (bluegreen algae).
The term algae includes for example généra selected from : Achnanthes, Amphora, Anabaena, Anikstrodesmis, Arachnoidiscusm, Aster, Bobyococcus, Chaetoceros, 15 Chlamydomonas, Chlorella, Chlorococcum, Chorethron, Cocconeis, Cosclnodlscus, Cyclotella, CylindrOtheca, Dunaliella, Emlliana, Euglena, Fistulifera, Fragilariopsis, Gyroslgma, Hematococcus, Isochrysis, Lampriscus, Monochrysis, Monoraphîdlum, Nannochloris, Nannnochloropsis, Navicula, Neochloris, Nephrochloris, Nephroselmls, Nitzschla, Nodularia, Nostoc, Odontella, Oochromonas, Oocystis, Oscillartoria, ' ' 20 Pavlova, Phaeodactylum, Playtmonas, Pleurochrysls, Portiyra, Pseudoanabaena, Pyramlmonas, Stichococcus, Synechococcus, Synechocystls, Tetraselmls, Thalassioslra, and Trichodesmium.
The terms “genetically engineered or “genetically modified or recombinant as used herein with reference to a host cell dénoté a non-naturally occurring host cell, as well 25 as its recombinant progeny, that has at least one genetic alteration not found in a naturally occurring strain of the referenced species, including wild-type strains of the referenced species. Such genetic modification Is typically achieved by technical means (i.e. non-naturally) through human intervention and may Include, e.g., the introduction of an exogenous nucleic acid and/or the modification, over-expression, or délétion of 30 an endogenous nucleic acid.
The term exogenous or foreign as used herein Is intended to mean that the referenced molécule, in particular nucleic acid, is not naturally présent ln the host cell.
The term “endogenous or ‘native* as used herein dénotés that the referenced molécule, ln particular nucleic add, Is présent ln the host cell.
By ‘recombinant nucleic add* when referring to a nucleic add in a recombinant host cell, is meant that at least part of said nucleic acid is not naturally présent ln the host *. 5 cell in the same genomic location. For instance a recombinant nudelc acid can comprise a coding sequence naturally occurring ln the host cell under control of an exogenous promotor, or It can be an additional copy of a gene naturally occurring ln the host cell, or a recombinant nucleic acid can comprise an exogenous coding sequence under the control of an endogenous promoter.
By ‘nudeic add* Is meant oligomers and polymers of any length composed essentially of nucléotides, e.g., deoxyribonucleotides and/or ribonucleotides. Nudeic acids can comprise purine and/or pyrimidine bases and/or other natural (e.g., xanthine, inosine, hypoxanthine), chemically or biochemically modified (e.g., methylated), non-natural, or derivatised nucléotide bases. The backbone of nudeic acids can comprise sugars and phosphate groups, as can typically be found in RNA or DNA, and/or one or more modified or substituted sugars and/or one or more modified or substituted phosphate groups. Modifications of phosphate groups or sugars may be introduced to improve stability, résistance to enzymatic dégradation, or some other useful property. A nucleic ’· acid can be for example double-stranded, partly double stranded, or single-stranded. .20 Where single-stranded, the nudeic add can be the sense strand or the antisense strand. The ‘nucleic acid* can be circular or linear. The term nucleic add* as used herein preferably encompasses DNA and RNA, specificaîly including genomic, hnRNA, pre-mRNA, mRNA, cDNA, recombinant or synthetic nucleic acids, Including vectors.
By encoding* Is meant that a nucleic acid sequence or part(s) thereof corresponds, by virtue of the genetic code of an organism ln question, to a particular amino acid sequence, e.g., the amino acid sequence of a desired polypeptide or protein. By means of example, nudeic acids ‘encoding* a particular polypeptide or protein, e.g. an enzyme, may encompass genomic, hnRNA, pre-mRNA, mRNA, cDNA, recombinant or synthetic nudeic acids.
Preferably, a nucleic acid encoding a particular polypeptide or protein may comprise an open reading frame (ORF) encoding said polypeptide or protein. An ‘open reading frame’ or ‘ORF refera to a succession of coding nudeotide triplets (codons) starting with a translation initiation codon and dosing with a translation termination codon
known per se, and not containing any internai in-frame translation termination codon, and potentially capable of encoding a polypeptide or protein. Hence, the term may be synonymous with ‘coding sequence as used in the art.
The nucleic acids taught herein may encode more than one polypeptide or protein.
Such nucleic acids are denoted as “polycistronic* nucleic acids and typically comprise several ORFs or coding sequences, each encoding a polypeptide or protein.
The terms polypeptide and protein are used interchangeably herein and generally refer to a polymer of amino acid residues linked by peptide bonds, and are not limited to a minimum length of the product. Thus, peptides, oligopeptides, polypeptides, 10 dimers (hetero- and homo-), multimers (hetero- and homo-), and the like, are Included within the définition. Both fuil-length proteins and fragments thereof are encompassed by the définition. The terms also include post-expression modifications of the polypeptide, for example, glycosylation, acétylation, phosphorylation, etc. Furthermore, for purposes of the present invention, the terms also refer to such when including 15 modifications, such as délétions, additions and substitutions (e.g., conservative In nature), to the sequence of a native protein or polypeptide.
The term enzyme as used herein dénotés a biological molécule that catalyzes a chemical reaction. The term encompasses single enzymes, I.e. single catalytic entities, as well as Systems comprising more than one catalytic entity. The enzymes described 20 herein can naturally possess the redted activity or they can be engineered to exhibit said activity.
As used herein, fatty acid enzyme means any enzyme involved in fatty acid biosynthesis. As used herein, the term fatty acid biosynthetic pathway means a biosynthetic pathway that produces fatty acids. Fatty acid enzymes can be expressed 25 or over-expressed in a host cell to produce fatty acids.
As used herein, the terms purify, purified, or purification means the removal or isolation of a molécule from its environment by, for example, isolation or séparation. As used herein, these terms also refer to the removal of contaminants from a sample. For example, when α-olefins are produced in a host cell, the olefins can be purified by the 30 removal of other celiular components (e.g., nucleic acids, polypeptides, iipids, carbohydrates, or other hydrocarbons). The terms purify, purified, and purification do not require absolute purity. They are relative terms.
As used herein, the terms identity” and identical and the like refer to the sequence similarity between two polymeric molécules, e.g., between two nucieic acid molécules or polypeptides. Methods for comparing sequences and determining sequence identity are well known in the art. By. means of example, percentage of sequence Identity refers 5 to a percentage of Identical nucieic acids or amino acids between two sequences after alignmenf of these sequences. Alignments and percentages of identity can be performed and calculated with various different programs and algorithme known ln the art. Preferred alignment algorithme include BLAST (Altschul, 1990; available for instance at the NCBI website) and Clustal (reviewed in Chenna, 2003; available for 10 instance at the EBI website). Preferably, BLAST is used to calculate the percentage of
Identity between two sequences, such as the Blast 2 sequences algorithm described by Tatusova and Madden 1999 (FEMS Microbiol Lett 174: 247-250), for example using the published default settings or other suitable settlngs (such as, e.g., for the BLASTN algorithm: cost to open a gap = 5, cost to extend a gap 2, penalty for a mlsmatch = 15 2, reward for a match = 1, gap x_dropoff = 50, expectation value ~ 10.0, word size 28; or for the BLASTP algorithm: matrix = Blosum62, cost to open a gap = 11, cost to extend a gap = 1, expectation value = 10.0, word size = 3).
The term “renewable is used herein to refer to a material (e.g. a molécule, a composition or a product) that can be produced or Is derivable from a natural resource 20 which is periodically (e.g., annually or perennially) replenished through the actions of plants of terrestrial, aquatic or oceanic ecosystems (e.g., agricultural crops, edible and non-edible grasses, forest products, seaweed, or algae), or microorganisms (e.g., bacteria, fungi, or yeast).
The term renewable resource refers to a natural resource that can be replenished 25 within a 100 year time frame. The resource may be replenished naturally,· or via agricultural techniques. Renewable resources Include, for example but without limitation, plants, animais, fish, bacteria, fungi, yeasts, algae and forestry products. They may be naturally occurring, hybrids, or geneticaily engineered organisme. Natural resources such as crude oil, coal, and peat which take longer than 100 years to form 30 are not considered to be renewable resources.
The term bio-based content refers herein to the amount of carbon from a renewable resource in a material as a percentage of the mass of the total organic carbon ln the material, as determined by standard ASTM D6866.
The term biosourced with respect to a material (e.g. a molécule, a composition or a product) means that such material Is derived from starting matériels of renewable origin (I.e. from renewable resources). Accordingly, subject to typical measurement errors, a biosourced material has a bio-based content of at least 90%, preferably at least 95%, 5 more preferably at least about 96%, 97% or 98%, even more preferably at least about
99% such as about 100%. ’
The présent application generally relates to the biosynthesis of olefins, in particular aoiefins, more particularly medium-chaln a-olefins.
More particularly, the application provides nucléotide sequences encoding mediumchain free fatty acid decarboxylases and the polypeptides encoded thereby, recombinant organisms comprising said nucléotide sequences, methods of production of medîum-chain α-olefins using said polypeptides or said recombinant organisms and products obtained by these methods.
Nucléotide sequences
The application provides nucleic acids encoding enzymes having decarboxylase activity on medium-chain free fatty acids, which are of interest for the production of the corresponding medium-chain α-olefins. Indeed, the inventors hâve identified a number of nucleic acid sequences which encode enzymes having decarboxylase activity on 20 medium-chain free fatty acids, ln particular embodiments these nucleic acids encode decarboxylase enzymes for which medium-chain free fatty acids are preferred substrates.
Nucléotide sequences encoding enzymes having medium-chain free fatty acid decarboxylase activity, in particular CrCn free fatty acid decarboxylase activity, more 25 particularly C12 free fatty acid decarboxylase activity, and accordingly suitable for use ln the methods envisaged herein include the sequence encoding the P450bsh fatty acid hydroxylase (Bs168) from Bacillus subtilis subsp. subtilis str. 168 (SEQ ID NO:3); a genomic sequence of Alicyclobacillus acidocaldarius encoding an olefin-producing enzyme (Aa162) (SEQ ID NO:5); and a genomic sequence of Staphilococcus 30 massiliensls encoding an olefin-producing enzyme (Sm46) (SEQ ID NO:7), as well as variants of these sequences. Variant nucléotide sequences may for Instance be codonoptimized sequences for recombinant expression in a host cell of cholce. For instance,
nucléotide sequences having at least about 70% or 75%, preferably at least about 80%, more preferably at least about 85%, 90% or 95%, even more preferably at least about 96%, 97%, 98% or 99% sequence Identity to SEQ ID NO:3, SEQ ID NO:5 or SEQ ID NO:7 and encoding enzymes having medium-chain free fatty acid decarboxylase activity, in particular Cs-Cu free fatty acid decarboxylase activity, more particularly C12 free fatty acid decarboxylase activity, are envisaged for use in the methods disclosed herein. An exemplary variant nucléotide sequence of SEQ ID NO:7 Is SEQ ID NO:9, which has been codon-optimized for recombinant expression in E. coll.
ln particular embodiments, the decarboxylase enzymes encoded by the nucleic acid sequences envisaged herein hâve substrate preference for medium-chain free fatty acids, preferably CrCu free fatty acids, more preferably Ci2 free fatty acids.
The inventors are the first to Identify the nucleic acid sequence of Allcyclobacillus acidocaldarius (SEQ ID NO:5) and a genomlc sequence of Staphilococcus massiliensls 15 (SEQ ID NO:7) encoding olefin-produclng enzymes (termed Aa162 and Sm46, respectively). More particularly, the nucleic acid sequence of Alicyclobacillus acidocaldarius (SEQ ID NO:5) was found to encode a decarboxylase (Aa162, SEQ ID NO:6) acting preferably on medium-chain free fatty acids, more particularly on C,2 free fatty acids, which can ensure an Increased efficiency of the production of medium20 chain α-olefins, more particularly Cu α-olefins, and/or Increased purity of said aolefins. The genomic sequence of Staphilococcus massiliensls (SEQ ID NO:7) was found to encode an olefin-produclng enzyme (Sm46, SEQ ID NO:8) that has decarboxylase activity on medium-chain free fatty acids, ln particular, it was shown that Sm46 shows substrate preference for Ca-Cu free fatty acids, more particularly Ci2 free 25 fatty acid was found to be the preferred substrate for Sm46: Also advantageously, the
Sm46 polypeptide shows spécifie decarboxylase activity, i.e. the formation of coproducts, in particular hydroxy fatty acids such as a- and β-hydroxy fatty acids, when converting free fatty acid substrate is minimal with said polypeptide.
These novel sequences allow the production of medium-chain α-olefins, ln particular
C7-C11 α-olefins, from the corresponding fatty acid substrates such as the production of
C11 α-olefins from Cu free fatty acids, ln addition, these sequences can be used to further Identify sequences encoding enzymes having medium-chain free fatty acid
decarboxylase activity, Including decarboxylases for which C12 free fatty acids are the preferred substrate.
Variants (or mutants) of sequences Îdentified herein can be naturally occurring or they can be man-made e.g. using genetic engineering techniques. Such techniques are weil 5 known in the art and include, for example but without limitation, site directed mutagenesis, random chemical mutagenesis, and standard cloning techniques. Other exemplary techniques for mutagenesis are recombination techniques such as DNA shuffling that use fragments of existing sequences and mix them ln novel combinations.
Particularly envisaged herein are variant nucléotide sequences obtained by screening 10 a shuffling library between the nucléotide sequences set forth în SEQ ID NO:1 and
SEQ ID NO:7, and a shuffling library between the nucléotide sequences set forth in SEQ ID NO:3 and SEQ ID NO:5 for recombinant nucléotide sequences encoding recomblnant polypeptides having a desired biological activity, in particular decarboxylase activity, more particularly (spécifie) medium-chain free fatty acid 15 decarboxylase activity such as CrCu free fatty acid decarboxylase activity, even more preferably the preferred substrate of these decarboxylases is a Ce-Cu free fatty acid. The technique of DNA shuffling îs weil known in the art. Reference can be made to Stemmer (1994. Nature 370:389-391) for an exemplary shuffling technique.
The nucléotide sequences envisaged herein encoding enzymes having medium-chain 20 free fatty acid decarboxylase activity are of particular interest for the recombinant production of medium-chain α-olefins, e.g. through the (over)expression of said nucleic acid sequences in a recombinant host cell, as detailed below.
Recombinant host cells
The application provides geneticalïy engineered host cells capable of producing 25 medium-chain α-olefins, wherein said host cells are characterized în that they comprise a recombinant nucleic acid encoding an enzyme having decarboxylase activity on Ce to Cu free fatty acids, preferably on C12 free fatty acids as described hereinabove. For instance, these recombinant host celis may comprise an exogenous nucleic acid encoding a medium-chain free fatty acid decarboxylase, or they may over-express an 30 endogenous nucleic acid encoding a medium-chain free fatty acid decarboxylase. ln particular embodiments, the recombinant host cells comprise a recombinant nucleic acid comprising a nucléotide sequence having at least about 70% or 75%, preferably at
least about 80%, more preferably at least about 85%, 90% or 95%, even more preferably at least about 96%, 97%, 98% or 99% sequence identity to SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO:9 or SEQ ID NQ:10 and encoding an enzyme having decarboxylase activity on free fatty acids with a carbon chain length 5 comprised between 8 and 14, preferably on Ca-Cu free fatty acids, more preferably on
Cu free fatty acids, ln particular embodiments, the recombinant host cells comprise a recombinant nucleic acid comprising a nucléotide sequence having at least about 70% or 75%, preferably at least about 80%, more preferably at least about 85%, 90% or 95%, even more preferably at least about 96%, 97%, 98% or 99% sequence identity to 10 SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO: 3. or SEQ ID NO: 5 and encoding a decarboxylase with preferred activity on Ce- Cu free fatty acids, preferably on Ca- Ci2 free fatty acids, more preferably on C12 free fatty acids, ln particular embodiments, the recombinant host cells comprise a recombinant nucleic acid comprising a nucléotide sequence that encodes a polypeptide comprising an amino 15 acid sequence of SEQ ID NO:8, SEQ ID NO:13, SEQ ID NO:4 or SEQ ID NO:6 or a functional variant, ln particular a functional fragment, of said polypeptide.
The genetically engineered host cells disclosed herein may be further genetically modified through expression of one or more recombinant nucleic acids encoding an enzyme Involved in a fatty acid biosynthetic pathway. For example, the recombinant 20 host cells may be further modified to overexpress a nucleic acid encoding an enzyme involved in an endogenous fatty acid biosynthetic pathway, or they may be further modified by introduction into the host cell of an exogenous nucleic acid encoding an enzyme involved in fatty acid synthesis. When expressed, the recombinant nucleic acid encoding the enzyme involved ln fatty acid synthesis confers to the host cell the ability 25 to produce or overproduce a fatty acid.
Each step within a fatty acid biosynthetic pathway can be modified to produce or overproduce a fatty acid of Interest. For example, known genes Involved in the synthesis of fatty acids can be expressed or overexpressed ln a host cell to produce a desired free fatty acid, or attenuated to inhibit production of a non-desired fatty acid.
For instance, production of hydrocarbons starting from acyl-ACP is ensured by thioesterases, whereafter the production of terminal olefins is catalyzed by a decarboxylase. Accordingly, exempiary genes involved In fatty acid synthesis include genes encoding thioesterase.
In a preferred embodiment, the host cells are further genetically engineered to express or overexpress a thioesterase to induce or increase free fatty acid production. The chain length of a fatty acid substrate is controlled by thioesterase, and hence, by (over)expressing a suitable thioesterase, a free fatty acid with desired carbon chain 5 length can be obtained. Non-limiting examples of thloesterases are provided in Table 1.
Preferably, medium-chain fatty add (MCFA)-specific thioesterases are used for (over)expression In the recombinant host cells according to the invention.
Table 1: Thloesterases
GenBank accession number (UniProtKB/SwissProt) | Source organlsm | Gene | Preferential product produced |
AAC73596 | E. coli | fesA without leader sequence | Cia i |
AAC73555 | Ecoli | tesB | |
AAA34215 (Q41635); AAC49001 | Umbellularia Califomica | fatB | C120 |
AAC49269 (Q39513) | Cuphea hookeriana | fatB2 | Ca o — Cio o |
AAC49269; AAC72881 | Cuphea hookeriana | fatB3 | Ci4 0~ Ciao |
AAC49151 (Q39473) | Cinnamonum camphorum | fatB | Cl4 0 |
CAA85388 | Arabidopsis thaliana | fatB | Cia i |
NP_189147; NP_193041 | Arabidopsis thaliana | fatA | Cia 1 |
CAC39106 | Bradyrhiizobium japonicum | fatA | Cia 1 |
AAC72883 | Cuphea hookeriana | fatA | Cu 1 |
AAL79381 | Helianthus annus | fatA1 | |
JF338905 | Cocos nucifera | C120 |
The thloesterase may be a thloesterase that Is naturally présent In higher plants. Two familles of acyi-ACP thloesterases are présent In higher plants: the Class I acyl-ACP thioesterases encoded by FatA genes, which are responsible for cleaving long-chain (for example, Ci« and Cie) unsaturated fatty acids from acyi-ACP, and the Class II acyl-ACP thloesterases encoded by FatB genes, which are active on saturated fatty acyl chains, and which can be spécifie for medium-chain (Ce-Cu) acyl-ACPs or which can be active on both medium- and long-chain fatty acyl-ACPs. Non-limiting examples of thloesterases which are MCFA-specifîc and naturally présent ln plants, and hence suitable for (over)expresslon ln the recombinant host cells described herein, are thioesterases encoded by FatB genes, or the thioesterases described for Instance In Voelker et al. (1992 Science 257:72-74) and Jing et al. (2011 Biochemistry 12:44). The thloesterase may also be an englneered thloesterase as described for instance In Voelker et al. (1994 Journal of Bacteriology 176:7320-7327),
Depending on the α-olefin of Interest, the expression of thloesterase enzymes may be either induced (by introduction into the host cell of an exogenous nucleic acid encoding said enzyme), stimulated (by overexpresslon of an endogenous gene encoding said enzyme) or attenuated (by modification of an endogenous gene encoding said enzyme).
in some situations, C12 free fatty acids can be produced by expressing or overexpressing thloesterases that use C12-ACP (for example, accession numbers Q41635 and JF3389Ô5) and attenuating thioesterases that produce non-Cu fatty acids, in other instances, Cu free fatty acids can be produced by attenuating endogenous thloesterases that produce non-Cu fatty acids and (over)expressing the thloesterases that use Cu- ACP (for example, accession number Q39473).
Acetyi-CoA, malonyl-CoA, and free fatty acid overproduction can be verified using methods known In the art, for example, by using radioactive precursors, HPLC, and
GC-MS subséquent to cell lysls.
For the production of Cu α-olefins, the host cells are preferably modified by the introduction of an exogenous nucleic acid encoding a thioesterase having preferentiel hydrolase activity towards C12 acyl-ACP substrate such as Q41635 or JF338905 and/or upregulating endogenous genes encoding a thioesterase having preferential hydrolase 5 activity towards C12 acyl-ACP, and optionally downregulating endogenous genes encoding thioesterases that produce non-Cu fatty acids.
The medium-chaln free fatty acids produced by the fatty acid enzymes In the host cells envisaged herein arethe substratesofthe decarboxylase enzymes described herein.
Accordingly, particularly preferred host cells for the production of medium-chain a10 olefins are recombinant host cells comprising:
- a recombinant nucleic acid encoding a thioesterase having preferential hydrolase activity towards medium-chain acyl-ACP substrate; and
- a recombinant nucleic acid encoding a medium-chain free fatty acid decarboxylase.
Particularly preferred host cells for the production of Cu α-olefins are recombinant host cells comprising:
- a recombinant nucleic acid encoding a thioesterase having preferential hydrolase activity towards C12 acyl-ACP substrate; and
- a recombinant nucleic add encoding a C12 free fatty acid decarboxylase.
In particular embodiments, the free fatty acid decarboxylase is an enzyme having at least about 70%, preferably at least about 80%, more preferably at least about 85%, 90% or 95%, even more preferably at least about 96%, 97%, 98% or 99% sequence identity to SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 or SEQ ID NO:13 and having medium-chain free fatty acid decarboxylase activity, In particular CrC« free fatty acid decarboxylase activity. In particular embodiments, the fatty acid decarboxylase Is Sm46 (SEQ ID NO:8), Sm46-del29 (SEQ ID NO:13), Bs168 (SEQ ID NO:4) or Aa162 (SEQ ID NO:6) or a functional variant thereof since spécifie production of mediumchain α-olefins, In particular C7-C11 α-olefins, more particularly C11 α-olefins, was shown herein for these enzymes. Accordingly, in particular embodiments, the 30 application provides a genetically engineered host cell comprising a recombinant nucleic add comprising a nucléotide sequence having at least about 70% or 75%, preferably at least about 80%, more preferably at least about 85%, 90% or 95%, even
more preferably at least about 96%, 97%, 98% or 99% sequence Identity to, SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:7, SEQ ID N0:9 or SEQ ID NO:10 and encoding an enzyme having medium-chain free fatty acid decarboxylase activity, In particular Ca-Cu free fatty acid decarboxylase activity. in further particular embodiments, the 5 recombinant nucleic acid comprises a nucléotide sequence having at least about 70% or 75%, preferably at least about 80%, more preferably at least about 85%, 90% or 95%, even more preferably at least about 96%, 97%, 98% or 99% sequence identity to SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO: 3 or SEQ ID NO: 5 and encodes an enzyme having decarboxylase activity for which Ce-Cu free fatty acids, 10 more particularly Cn free fatty acids, are the preferred substrate, ln preferred embodiments, the host cell comprises a recombinant nucleic acid encoding a polypeptide having an amino acid sequence comprising SEQ ID NO:8, SEQ ID NO: 13, SEQ ID NO:4 or SEQ ID NO:6 or a functional variant thereof. ln further embodiments, the genetlcally engineered host celi further comprises a recombinant nucleic acid 15 encoding a thloesterase having preferentiai hydrolase activity towards medium-chain . acyl-ACP substrate, preferably towards Ce-Cu acyl-ACP substrate, more preferably towards C12 acyl-ACP substrate.
ln particular embodiments, where the above nucieotide sequences are endogenously expressed by the host cell of the invention', It Is envlsaged that expression of these 20 sequences can be Increased specifically so as to ensure commercially relevant medium-chain α-olefin production.
More particularly, In particular embodiments, the host cell is selected to hâve a high endogenous thloesterase activity. Methods of selecting cells having particular properties are known in the art.
Àny celi that can be suitably transformed with and/or genetically engineered to ensure (over)expression of one or more of the described recombinant nucleic acids can be used ln the context of the présent invention. The host cells disclosed herein can be any prokaryotic or eukaryotic organisai or cell. Non-limiting examples of host celis Include plant cells, bacterial cells, yeast cells, fungal celis, and algal cells. ln embodiments, the 30 host cells are genetically engineered bacteria, or genetically engineered fungi, in particular yeasts, genetically engineered algae, or genetically engineered plant cells.
Preferably, the host cells are oleaginous host cells. For example, the host celi may be an oleaginous bacterium, an oleaginous fungus, oleaginous yeast or an oleaginous
alga. Non-limiting examples of oleaginous yeasts include Lipomycas starkeyl, Rhodosporidium toruloides, Rhodotoruia glutinis, and Yarrowia fipolytica. Non-limitîng examples of oleaginous algae généra include Botryococcus, Chaetocems, Chfore/ta, Chlorococcum, Cyiindrotheca, Dunaliella, Fistulifera, Isochrysls, Nannochloropsls, 5 Neochloris, Nitzschia, Pavlova, Scenedesmus, Skelatonema, Stichococcus and Tetra salmis.
Thus, the genetically engineered host cells disclosed herein comprise a recombinant nucleic acid encoding a medium-chain free fatty acid decarboxylase disclosed herein, and optionally one or more recombinant nucleic acids encoding a fatty acid enzyme, 10 I.e. an enzyme involved in fatty acid synthesis. Additionally or altematively the expression of one or more genes encoding enzymes Involved ln the production of fatty acids other than medium-chain free fatty acids may be suppressed, decreased or limited.
The methods for generating the genetically engineered host celis described herein 15 involve standard genetic modifications, for which well-established methods are available to the skilled person.
Genetic engineering of the host cells to contain a recombinant nucleic acid encoding a polypeptide or a fatty acid enzyme as described herein Is accomplished in one or more steps via the design and construction of appropriate vectors and transformation of the 20 host cells with those vectors.
Electroporation and/or chemical (such as calcium chloride- or lithium acetate-based) transformation methods or Agrobacterium tumefaciens-medialed transformation methods as known ln the art can be used.
The recombinant nucleic acid(s) encoding a decarboxylase enzyme and/or a fatty acid 25 enzyme that may be provided in the host cells in the context of the présent invention typically comprises a coding sequence encoding the decarboxylase enzyme and/or the fatty acid enzyme placed under the transcriptional control of one or more promoters and one or more terminators, both of which are functional in the host cell.
Promoter and terminator sequences may be native to the host cell or exogenous to the 30 host cell. Useful promoter and terminator sequences include those that are highly identical (I.e. having an Identifies score of 90% or more, preferably 95% or more, most preferably 99% or more) ln their functional portions compared to the functional portions
of promoter and terminator sequences, respectively, that are native to the host cell, particularly when the insertion of the recombinant nucleic acid is targeted at a spécifie site ln the host genome. The use of native (to the host) promoters and terminators, together with their respective upstream and downstream flanking régions, can permit 5 the targeted Intégration of the recombinant nucleic acid Into spécifie loci of the host genome.
Additionally or altematively, the coding sequence may be native to the host cell or exogenous to the host cell.
Numerous vectors are known to practitioners skilled in the art, and sélection of an 10 appropriate vector Is a matter of choice. The vectors can either be eut with particular restriction enzymes or used as circular DNA.
The vectors taught herein preferably comprise (a combination of) a recombinant nucleic acid as described herein. in particular, a vector comprises (a combination of) the coding sequence of a polypeptide or a fatty add enzyme as described herein and 15 associated promoter and terminator sequences. The vector may contain restriction sites of various types for linearization or fragmentation. Vectors may further contain a backbone portion (such as for propagation ln E. coli) many of which are conveniently obtained from commercially available yeast or bacterial vectors. The vector preferably comprises one or more sélection marker gene cassettes. A sélection marker gene Is 20 one that encodes a protein needed for the survival and/or growth of the transformed cell in a sélective culture medium. Typical sélection marker genes encode proteins that (a) confer résistance to antibiotics or other toxins such as chloramphenicol, zeocin (sh ble gene from Streptoalloteichus hindustanus), genetecin, melibiase (MEL5), hygromydn (aminoglycoside antibiotic résistance gene from E. coli), ampiciliin, 25 tétracycline, or kanamydn (kanamycin résistance gene of Tn903), (b) complément auxotrophic deficiencies of the cell. Two prominent examples of auxotrophlc deficiendes are the amino add leucine deficiency (e.g. LEU2 gene) or uracil deficiency (e.g. URA3 gene). Cells that are orotidine-5'-phosphate decarboxylase négative (ura3-) cannot grow on media lacklng uracil. Thus a functional URA3 gene can be used as a 30 marker on a cell having a uracil deficiency, and successful transformants can be selected on a medium lacking uracil. Only cells transformed with the functional URA3 gene are able to synthesize uracil and grow on such medium. If the wild-type strain does not hâve a uradl deficiency (as Is the case with /. orlentalis, for example), an
auxotrophlc mutant having the deficiency must be made in order to use URA3 as a sélection marker for the strain. Methods for accomplishing this are well known In the art. The sélection marker cassette typically further Includes a promoter and terminator sequence, operatively linked to the sélection marker gene, and which are opérable ln 5 the host.
Successful transformants can be selected for in known manner, by taking advantage of the attributes contributed by the marker gene, or by other characteristics (such as ability to produce α-olefins) contributed by the inserted recombinant nucleic acids. Screening can also be performed by PCR or Southem analysis to confirm that the 10 desired insertions, and optionally délétions hâve taken place, to confirm copy number and to Identify the point of intégration of coding sequences Into the host genome. Activity (such as α-olefin-producing activity) of the polypeptide encoded by the inserted coding sequence can be confirmed using known assay methods as described elsewhere herein.
Also disclosed herein are methods for obtaining a genetically engineered host cell capable of producing an α-olefin of Interest as described herein, which method may comprise transforming a host cell with a recombinant nucleic acid encoding an enzyme having medium-chain free fatty acid decarboxylase activity as taught herein, more particularly a decarboxylase generating the α-olefin of Interest and optionally one or 20 more recombinant nucleic acids encoding a fatty acid enzyme as taught herein, more particularly a fatty acid enzyme involved In the synthesis of the substrate for the free fatty acid decarboxylase enzyme capable of generating the α-olefin of interest. ln particular, the method may comprise the steps of:
a) transforming a host cell with a recombinant nucleic acid encoding an enzyme having medium-chain free fatty acid decarboxylase activity as taught herein and optionally one or more recombinant nucleic acids encoding a fatty acid enzyme as taught herein; and
b) selecting a host cell capable of producing an α-olefin of Interest.
In particular embodiments, the method further comprises modifylng said host cell so as 30 to reduce the endogenous production of olefins other than the α-olefin of Interest.
As detailed above, different genetic modifications are envisaged herein which Induce medium-chain α-olefin production and/or increase the yield of (one or more particular)
medium-chain α-olefins In a host cell. Accordingly, the présent invention also relates to the use of the genetically engineered host cells as described herein for the production of α-olefins, more particularly medium-chain a-olefins.
Produclna α-olefins usina recombinant host cells
In a further aspect, the Invention provides methods for the production of medium-chain α-olefins, more particularly Cn α-olefins, which method comprises providing a genetically engineered host cell as described above and culturing said genetically engineered host cell In a culture medium so as to allow the production of medium-chain α-olefins. More particularly, the host cell is cultured under conditions suitable to ensure 10 expression or overexpression of the enzyme having medium-chain free fatty acid decarboxylase activity envlsaged herein and optionally one or more fatty acid enzyme(s) Involved in the synthesis of the substrate of the enzyme having mediumchain free fatty acid decarboxylase activity.
in particular embodiments, the host cells ensure a rate of α-olefin production, more 15 particularly medium-chain α-olefin production, which is sufficiently high to be Industrially valuable. Indeed, the recombinant host cells disclosed herein may be capable of ensuring a high yield at limited production costs. Furthermore, they are capable of producing α-olefins of desired carbon chain length. Indeed, the decarboxylase enzymes envlsaged herein preferably hâve substrate preference for a 20 medium-chain free fatty acid, in particular a Ce-Cn free fatty acid, more particularly Cn free fatty acid. Also advantageously, the production of unwanted co-products such as hydroxyl fatty acids is minimal. Indeed, the polypetides envlsaged herein preferably hâve spécifie decarboxylase activity.
The recombinant host cells are cultured under conditions suitable for the production of 25 medium-chain α-olefins by the host cells. More particularly this implies conditions sufficient to allow (overexpression of the recombinant nucieic acid encoding a decarboxylase enzyme, which means any condition that allows a host cell to (over)produce a medium-chain free fatty acid decarboxylase or a fatty acid enzyme as described herein. Suitable conditions include, for example, fermentation conditions. 30 Fermentation conditions can comprise many parameters, such as température ranges, levels of aération, and media composition. Each of these conditions, individualiy and ln combination, allows the host cell to grow. To détermine if conditions are sufficient to
allow (over)expression, a host cell can be cultured, for example, for about 4, 8,12,18, 24, 36, or 48 hours. During and/or after culturing, samples can be obtained and analyzed to détermine if the conditions allow (overjexpresslon. For example, the host cells in the sample or the culture medium in which the host cells were grown can be 5 tested for the presence of a desired product. When testing for the presence of a desired product, assays, such as, but not limited to, sodium dodecyl sulfate poiyacrylamide gel electrophoresis (SDS-PAGE), TLC, HPLC, GC/FID, GC/MS, LC/MS, MS, can be used.
Exempiary culture media Include broths or gels. Mlcro-organisms are typically grown in 10 a culture medium comprising a carbon source to be used for growth of the microorganism. Exempiary carbon sources Include carbohydrates, such as glucose, fructose, cellulose, or the like, that can be directly metabolized by a micro-organlsm. ln addition, enzymes can be added to the culture medium to facilitate the mobllizatlon (e.g., the depolymerization of starch or cellulose to fermentabie sugars) and 15 subséquent metabolism of the carbon source. A culture medium may optîonally contain further nutrients as requîred by the particular strain, including Inorganic nitrogen sources such as ammonia or ammonium salts, and the like, and minerais and the like.
Other growth conditions, such as température, cell density, and the like are generally selected to provide an economlcal process. Températures during each of the growth 20 phase and the production phase may range from above the freezlng température of the medium to about 50*C.
The culturing step of the methods of the invention may be conducted aeroblcally, anaerobically, or substantially anaerobically. Briefly, anaérobie conditions refer to an environment devoid of oxygen. Substantially anaérobie conditions include, for example, 25 a culture, batch fermentation or continuous fermentation such that the dissolved oxygen concentration in the medium remains between 0 and 10% of saturation. Substantially anaérobie conditions also Includes growing or resting cells in liquid medium or on solid agar inside a sealed chamber maintalned with an atmosphère of less than 1% oxygen. The percent of oxygen can be maintained by, for example, 30 sparglng the culture with an N2/CO2 mixture or other suitable non-oxygen gas or gasses.
The cultivation step of the methods described herein can be conducted contlnuously, batch-wise, or some combination thereof.
In particular embodiments, wherein phototrophic algae are used as host cells, the method for the production of medium-chain α-olefins may comprise providing algae genetically engineered to (overjproduce medium-chain α-olefins as taught herein, and culturing said algae in photobioreactors or an open pond System using CO2 and 5 suniight as feedstock.
In certain embodiments, the conditions suitable for the production of medium-chain aoiefins may further imply cultivating the host cells In a culture medium which comprises at least one fatty acid substrate, which is converted into an α-olefin by the decarboxylase encoded by the recombinant nucleic acid comprised in the host cell.
Preferably, the fatty acid substrate is a saturated free fatty add substrate. Also preferably, the fatty acid substrate Is a straight chain free fatty acid substrate. Also preferably, the fatty acid substrate Is an even-numbered CrCu free fatty acid substrate (i.e. a free Ce, Cio, C12, or Cu free fatty acid substrate or any combination thereof), preferably an even-numbered Ce-Cu free fatty add substrate, more preferably a C12 15 free fatty acid substrate.
Particularly Intended herein Is the production of medium-chain α-olefins, more particularly Cu α-olefîns. Medium-chain α-olefins, more particularly Cu α-olefins can be obtained using a recombinant host cell described herein specifically modified for the production of medium-chain α-olefins, more particularly Cu a-olefins.
In further embodiments, methods are provided for producing medium-chain a-olefins, more particularly Cu α-olefins, which, In addition to the steps detailed above, further comprise the step of recovering the α-olefins from the host cell or the culture medium. Suitable purification can be carried out by methods known to the person skilled In the art such as by using lysls methods, extraction, Ion exchange resins, electrodialysls, 25 nanofiltration, etc.
Accordingly, methods are provided for the production of medium-chain α-olefins, more particularly Cu α-olefins, which methods comprise the steps of:
(I) providing a genetically engineered host cell as described herein;
(il) culturing the host cells in a culture medium under conditions suitable for the production of medium-chain α-olefins, and (iii) recovering the α-olefins from the host cell or the culture medium.
ln particular embodiments, wherein oleaginous yeasts are used as host cells, the method for the production of medium-chain α-olefins may comprise the following steps:
(i) providing oleaginous yeasts genetically engineered to (over)produce medium-chain α-olefins as taught herein;
(ii) cuit u ring said oleaginous yeasts ln fermentera; and (iii) recovering the α-olefins from the oleaginous yeast or the culture medium.
ln particular embodiments, wherein phototrophic algae are used as host cells, the method for the production of medium-chain α-olefins may comprise the following steps:
(i) providing algae genetically engineered to (over)produce medium-chain α-olefins as 10 taught herein;
(ii) culturing said algae In photobloreactors or an open pond system using COj and sunlight as feedstock; and (iii) recovering the α-olefins from the algae or the culture medium.
ln particular embodiments the host cells are cultivated under conditions which allow 15 sécrétion of α-olefins into the environment.
Typically, ln the methods for the production of medium chain α-olefins envisaged herein, the decarboxylase expressed by the host cell is not secreted by said host cell and the α-olefin is produced inside the host cell. However, in particular embodiments or for particular applications, it Is of interest to ençure sécrétion of the decarboxylase by 20 the host cells provided herein. This can be of interest where the enzyme Is envisaged to be active upon sécrétion into Its environment. A sécrétion signal sequence can be operably linked to the nucleic acid encoding the free fatty acid decarboxylase to this end. ln this connection, “operably linked’ dénotés that the sequence encoding the sécrétion signal peptide and the sequence encoding the polypeptide to be secreted are 25 connected in frame or in phase, such that upon expression the signai peptide facilitâtes the sécrétion of the polypeptide so-linked thereto.
Polypeptides
Also disclosed herein are enzymes having medium-chain free fatty add decarboxylase activity. ln particular embodiments, these polypeptides are encoded by nucleic acids and nucléotide sequences described herein. Accordingly, disclosed herein are polypeptides encoded by a nucleic acid comprising a nucléotide sequence having at least about 70% or 75%, preferably at least about 80%, more preferably at least about
85%, 90% or 95%, even more preferably at least about 96%, 97%, 98% or 99% sequence Identity to SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO:9 or SEQ ID NO:10 having medium-chain free fatty acid decarboxylase activity, in particular Ce-Cn free fatty acid decarboxylase activity, preferably Cn free fatty acid 5 decarboxylase activity, Indeed, in particular embodiments, the enzymes having medium-chain free fatty acid decarboxylase activity, Ce-Cn free fatty acid decarboxylase activity, preferably C12 free fatty acid decarboxylase activity can be obtained by recombinant expression of said nucleic acids in a host cell. Methods for recombinant production of polypeptides are known in the art.
in particular embodiments, the polypeptides which are enzymes having medium-chain free fatty acid decarboxylase activity, in particular Ce-Cn free fatty acid decarboxylase activity, more particularly Cn free fatty acid decarboxylase activity, include polypeptides having an amino acid sequence comprising, consisting essentially of or consisting of, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 or SEQ ID NO:13, and 15 variants, including active fragments, of these polypeptides having medium-chain free fatty acid decarboxylase activity, in particular Ce-Cn free fatty acid decarboxylase activity, more particularly C12 free fatty acid decarboxylase activity. Preferred polypeptides are those consisting of the amino acid sequence of SEQ ID NO:4 (I.e. Bs168), SEQ ID NO:6 (Le. Aa162), SEQ ID NO:8 (i.e. Sm46), or SEQ ID NO:13 (Le.
Sm46-del29) and functional variants of these polypeptides. Indeed, SEQ ID NO:8, and its active fragment of SEQ ID NO:13, are novel polypeptides provided herein having medium-chain free fatty acid decarboxylase activity, in particular Ce-Cn free fatty acid decarboxylase activity, more particularly C12 free fatty acid decarboxylase activity.
In particular embodiments, the polypeptides hâve decarboxylase activity with a 25 preferred activity on medium-chain free fatty acids, in particular on a Ce-Cn free fatty acid, more particularly on Cn free fatty acids, I.e. the preferred substrate for these decarboxylase enzymes are medium-chain free fatty acids, in particular Ce-Cn free fatty acids, more particularly Cn free fatty acids. In particular embodiments, these enzymes hâve at least about 80%, 85%, 90% or 95%, even more preferably at least 30 about 96%, 97%, 98% or 99% sequence identity to SEQ ID NO:8, SEQ ID NO:13, SEQ
ID NO:4 or to SEQ ID NO: 6. Preferred enzymes comprise or consist of SEQ ID NO:8,
SEQ ID NO:13, SEQ ID NO:4 or SEQ ID NO: 6 and functional variants thereof such as but not limited to an active fragment or variant thereof which maintains the activity of
the enzyme comprising or consisting of SEQ ID N0:8, SEQ ID NO: 13, SEQ ID NO:4 or SEQ ID NO: 6.
As used herein In connection to a decarboxylase enzyme, a ‘preferred substrate* refers to the free fatty acid for which the decarboxylase enzyme has the highest activity, I.e.
when reacting with a number of free fatty acids, the decarboxylase enzyme has the highest activity when the substrate Is the preferred free fatty acid substrate. The activity of a decarboxylase enzyme can be determined by measuring the concentration of the corresponding α-olefin product when the decarboxylase acid Is reacted with a free fatty acid substrate. The α-olefin concentration in a reaction or culture medium can be 10 measured by methods well known in the art, such as by GC/MS analysis. The substrate preference for a decarboxylase enzyme can hence be determined by calculating the conversion ratio of each free fatty acid substrate tested Into corresponding α-olefin product, wherein the preferred substrate Is the free fatty acid substrate with the highest conversion ratio. The substrate preference for a 15 decarboxylase enzyme can also be determined by calculating the conversion ratio of each free fatty acid substrate tested, wherein the preferred substrate is the free fatty acid substrate with the highest conversion ratio. The “conversion ratio or ‘substrate conversion ratio* means herein the ratio between consumed substrate (for instance fatty acid substrate concentration (e.g. in mM)) and produced reaction product(s) (for 20 instance α-olefin concentration (e.g. ln mM) and also potential co-product(s) is quantified (for instance hydroxyl fatty acid concentration (e.g. ln mM)). The conversion ratio or conversion percentage for a spécifie reaction product can be calculated as the ratio between consumed substrate (for Instance fatty acid substrate concentration (e.g. ln mM)) and the spécifie reaction product produced (for Instance α-olefin concentration 25 (e.g. ln mM). ·
Also envlsaged herein are variant polypeptides ofthe polypeptides described herein. It is understood that the variant polypeptides described herein may hâve conservative or non-essential amino acid substitutions, which do not hâve a substantial effect on the polypeptide fùnctions. Whether or not a particular substitution will be tolerated (i.e., will 30 not adversely affect desired biological properties) can be determined as described in Bowle et al. (1990) (Science 247:1306 1310). A conservative amino acid substitution is one ln which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains hâve
been defined ln the art. These families Include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartîc acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, 5 proline, phenylalanlne, méthionine, tryptophan), beta-branched side chains (e.g., threonine, valine, Isoleucine), and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine).
Additional polypeptide variants are those ln which additional amino acids are fused to the polypeptide, such as a sécrétion signal sequence, or a sequence which facilitâtes 10 purification of the polypeptide.
Yet other polypeptide variants include functional or active fragments comprising at least about 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100 or 150 consecutive amino acids, and which retain the same biological function as Bs168, Aa162, or Sm46 (e.g. retain olefinproducing activity, more particularly improved decarboxylase activity towards medium15 chain free fatty acids, more preferably on Cu FFA substrate). Exemplary functional or active fragments include without limitation truncated forms of the polypeptides described herein, which retain the decarboxylase activity of Bs168, Aa162, or Sm46. These functional or active fragments hence retain at least the decarboxylase catalytlc domain of the polypeptide, i.e. the part of the polypeptide that is involved ln the 20 decarboxylase reaction. A particular example of such truncated form is the polypeptide having an amino acid sequence of SEQ ID NO:13 (encoded by the nucléotide sequence of SEQ ID NO:10), which polypeptide Is a functional fragment of Sm46 wherein the 29 N-terminal amino acids were deleted.
Functional variants of the polypeptides described herein retain the decarboxylase 25 activity of the polypeptides. Accordingly, the functional variants may comprise or consist of an amino acid sequence having at least about 70%, preferably at least about 80%, more preferably at least about 85%, 90% or 95%, even more preferably at least about 96%, 97%, 98% or 99% sequence Identity ln the decarboxylase catalytic domain of SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, or SEQ ID NO: 13. The polypeptide 30 variants may hâve an amino acid sequence substantially Identical to SEQ ID NO:4,
SEQ ID NO:6, SEQ ID NO:8, or SEQ ID NO:13 or they may hâve an amino acid sequence having at least about 70%, preferably at least about 80%, more preferably at least about 85%, 90% or 95%, even more preferably at least about 96%, 97%, 98% or
99% sequence Identity to SEQ ID NO:4, SEQ ID N0:6, SEQ ID N0:8, or SEQ ID NO:13. The polypeptide variants, In particular the active or functional fragments, may hâve an amino acid sequence wherein the 40 or less, preferably 35 or less, more preferably 30 or less such as 29 or less, N-terminal amino acids of SEQ ID NO:4, SEQ 5 ID NO.6, SEQ ID NO:8 or SEQ ID NO: 13 hâve been deleted.
The polypeptides disclosed herein hâve free fatty acid decarboxylase activity, in particular decarboxylase activity on Ce-Cu free fatty acids, more particularly on evennumbered Ce-Cu free fatty acids such as on Ce, Cio, Cu and/or C« free fatty acids, even more particulariy Ce-Cu free fatty acid decarboxylase activity such as Cu free 10 fatty acid decarboxylase activity, Indeed, as shown In the experimental section, decarboxylase activity has been shown for OleTjE, Bs168, Aa162, Sm46 or Sm46del29, more particulariy decarboxylase activity on Cu, Cu and Cu free fatty acids. The Sm46, Sm46-del29, Aa162 and Bs168 enzymes showed higher activity on Cu free fatty acid, resulting in higher production of Cu a-olefins.
Decarboxylase activity of a polypeptide can be assayed using routine methods. For example, the polypeptide can be contacted with a substrate, In particular a free fatty acid substrate, under conditions that allow the polypeptide to function. A decrease In the level of the substrate or an Increase In the level of an α-olefin can be measured to détermine decarboxylase or olefin-producing activity.
The polypeptides described herein may further catalyze the hydroxylation of fatty acids, in particular the a- and β-hydroxylation of fatty acids, as side reactions. Preferred polypeptides are those for which the décarboxylation activity is the dominant activity,
i.e. which hâve spécifie decarboxylase activity. In preferred embodiments, the conversion of a free fatty substrate into hydroxy fatty acid, in particular a- and β25 hydroxy fatty acid by the polypeptide Is only marginal (such as less than 25%, preferably less than 20%, more preferably less than 10% of total products) as compared to the conversion of the fatty acid substrate into the corresponding a-olefin. In embodiments, the conversion percentage for hydroxy fatty acids is less than 25%, preferably less than 20%, more preferably less than 10%. The conversion percentage for hydroxy fatty acids can be calculated as the ratio between consumed substrate (for instance fatty acid substrate concentration (e.g. in mM)) and produced hydroxyl fatty acids (for instance hydroxyl fatty acid concentration (e.g. in mM)). Particular examples of polypeptides with spécifie decarboxylase activity are Sm46 or functional variants
thereof, including active fragments thereof, such as the truncated form of Sm46 wherein the 29 N-termlnal amino acids were deleted (i.e. the polypeptide encoded by the nucléotide sequence of SEQ ID NO:10 or the polypeptide of SEQ ID N0:13).
The polypeptides envisaged herein can be produced by recombinant expression ln a host cell. ln particular embodiments, the polypeptide is secreted by the host cell.
Production of g-olefins
Also provided herein is the use of the medium-chain free fatty acid decarboxylase enzymes disclosed herein for the production of medium-chain α-olefins, more particularly Cn a-olefins.
Some methods described herein relate to the production of medium-chain a-olefins, more particularly Cu α-olefins, using a (purified) medium-chain free fatty acid decarboxylase enzyme disclosed herein and a free fatty acid substrate. Accordingly, disclosed herein Is a method for the production of medium-chain α-olefins, more particularly Cu α-oiefins, comprising contacting a medium-chain free fatty acid 15 decarboxylase enzyme with a suitable free fatty acid substrate so as to produce medium-chain α-olefins, more particularly Cu a-olefins.
In particular embodiments, the medium-chain free fatty acid decarboxylase is a polypeptide having an amino acid sequence of SEQ ID NO:4, SEQ ID NO:6 SEQ ID NO:8 or SEQ ID NO:13 or a functional variant, including a functional or active fragment, 20 of said polypeptide, ln particular embodiments, the medium-chain free fatty acid decarboxylase is a polypeptide having an amino acid sequence having at least about 70%, preferably at least about 80%, more preferably at least about 85%, 90% or 95%, even more preferably at least about 96%, 97%, 98% or 99% sequence Identity to SEQ
ID NO:4, SEQ ID NO:6 SEQ ID NO:8 or SEQ ID NO:13. ln particular embodiments, the 25 enzyme has a preferred activity on medium-chain free fatty acids, more preferably on
Cb-Cw free fatty acids. Most particularly, the enzyme Is a decarboxylase having preferred activity on Cn free fatty acids and having an amino acid sequence having at least about 85%, 90% or 95%, even more preferably at ieast about 96%, 97%, 98% or 99% sequence identity to SEQ ID NO:8, SEQ ID NO:13, SEQ ID NO: 4 or SEQ ID NO: 30 6. ln particular embodiments, the enzyme has 95% amino acid sequence Identity to
SEQ ID NO:8, SEQ ID NO:13, SEQ ID NO: 4 or SEQ ID NO: 6.
For example, a host cell can be genetically engineered to overexpress a medium-chain free fatty acid decarboxylase as disclosed herein. The recombinant host cell can be cultured under conditions sufficient to allow (over)express!on of the decarboxylase. Cell-free extracts can then be generated using known methods. For example, the host 5 cells can be lysed using détergents or by sonication. The overexpressed polypeptides can be purified using known methods, or the cell-free extracts can be used as such for the production of olefins. The host cells can also be genetically engineered to overexpress a medium-chain free fatty acid decarboxylase as disclosed herein and to secrete said polypeptide Into the growth medium as described elsewhere. The secreted 10 polypeptides can then be separated from the growth medium and optionally purified using known methods without the need for obtaining cell-free extracts,
Next, free fatty acid substrates can be added to the cell-free extracts or (purified) enzymes and maintained under conditions to allow conversion of the free fatty acid substrates to α-olefins. The α-olefins can then be separated and purified using known 15 techniques.
Olefins having particular branchlng patterns, levels of saturation, and carbon chain length can be produced from free fatty acid substrates having those particular characteristics using the described methods. For example, the fatty acid substrate may be an unsaturated free fatty acid substrate (e.g. a monounsaturated free fatty acid 20 substrate), or a saturated free fatty acid substrate. The fatty acid substrate may be a straight chain free fatty acid substrate, a branched chain free fatty acid substrate, or a free fatty acid substrate that Includes a cyclic moiety.
Preferably, the fatty acid substrate Is a saturated free fatty acid substrate. Also preferably, the fatty acid substrate Is a straight chain free fatty acid substrate. Also 25 preferably, the fatty acid substrate is an even-numbered CrCu free fatty acid substrate (i.e. a Ca, Cio, Cw and/or Cu free fatty acid substrate), more preferably an evennumbered Ca-Ci2 free fatty acid substrate, most preferably a C12 fatty acid substrate.
Particularly intended herein Is the production of Ch α-olefins. Ch α-olefins can be obtained from a C« free fatty acid substrate using a Ct2 free fatty acid decarboxylase enzyme described herein. As shown ln the experimental section, Sm46, Sm46-del29,
Bs168 and Aa162 speciflcally show decarboxylase activity on C» free fatty acid substrates, i.e. Ci2 free fatty acid substrate is the preferred substrate for a-olefin
production for Sm46, Sm46-del29, Bs166 and Aa162. Hence, in particular embodiments, the présent invention relates to a method for the production of Cn aolefins, which method comprises contacting a C12 free fatty acid decarboxylase encoded by a nucleic acid comprising a nucléotide sequence having at least about 5 70% or 75%, preferably at least about 80%, more preferably at least about 85%, 90% or 95%, even more preferably at least about 96%, 97%, 96% or 99% sequence identity to SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:3 or SEQ !D NO:5. ln embodiments, the method comprises contacting a a polypeptide having an amino acid sequence comprising or consisting of SEQ ID NO:8t SEQ ID NO: 13, SEQ ID NO:4 or 10 SEQ ID NO:6 or a functional variant of said polypeptide, preferably a polypeptide consisting of SEQ !D NO:8, SEQ ID NO: 13, SEQ ID NO:4 or SEQ ID NO:6, with a C12 free fatty acid substrate, preferably dodecanoic acid (or lauric acid).
Also provided herein are medium-chain α-olefins, more particularly the Cn a-olefins, obtainable by the methods disclosed herein.
Medium-chain a-olefins
Also disclosed herein are the medium-chain α-olefins, and compositions comprising medium-chain α-oiefins more particularly the Cn α-olefins, produced by the methods as herein described. The methods described herein advantageously resuit ln the production of homogenous α-olefins, wherein the a-olefins produced hâve a uniform 20 carbon chain length. These processes are hence more efficient than conventional processes which resuit in the production of mixture of α-olefins with different carbon chain length and which require séparation of the different α-olefins for subséquent reactions.
Post-production processing
The produced α-olefins, more particularly medium-chain α-olefins such as Cn aoiefins, can be used as or converted Into a fuel, in particular a biofuel. These a-olefins, more particularly medium-chain α-olefins such as Cn α-olefins, can also be used as starting material for the production of chemicals or Personal care additives (e.g. polymers, surfactants, plastics, textiles, solvents, adhesives, etc.). They can also be 30 used as feedstock for subséquent réactions, such as hydrogénation and/or oligomérization reactions, to make other products.
A further aspect of the Invention relates to a method for the production of poly-a-olefins (PAO), said method comprising:
a) producing α-olefins, more particularly medium-chain α-olefins, according to a method disclosed herein;
b) olîgomerizing the α-olefins produced ln step a); and optionally
c) hydrogenating the oligomer produced In step b).
ln particular embodiments, a method is provided for the production of C33 PAOs, which comprises:
a) producing Cu α-olefins according to a method disclosed herein;
b) trimerizing the Cu α-olefins produced in step a); and optionally
c) hydrogenating the trimer produced ln step b).
Oligomérization of medium-chain α-olefins In the presence of a catalyst Is well known in the art. Catalysts that can be used for the oligomérization step are for example, but not limited to, AlCb, BF3, BFs complexes for cationic oligomérization, and métal based 15 catalysts like metallocenes.
Following the oligomérization step, residual unsaturation that Is potentially présent In the oligomers Is saturated by catalytic hydrogénation resulting ln saturated alîphatic hydrocarbons with one or more side branches.
The oligomers obtained by methods as described herein are known under the generic 20 name of poly-a-olefins (PAO). The PAO production methods described herein advantageously resuit in the homogenous production of PAOs of a well-defined carbon chain length. Accordingly, the application also provides a composition of PAO's obtainable by a PAO production method described herein, characterized in that at least 50%, preferably at least 85% or 90%, more preferably at least 95% such as 96%, 97%, 25 98% or even 99% of the PAOs hâve a well-defined carbon chain length such as C33
PAOs. Such a homogenous composition of poly-a-oiefins has not previously been disclosed.
The methods provided herein allow obtaining a base oil with a well-defined viscosity.
The PAOs, more particularly the C33 PAOs, obtainable by a method as described herein can be used as base oils, which display very attractive viscosity indices, with the viscosity increasing with the number of carbons. These base oils can be used,
together with additives and optionally other base oils, to formulate lubricants. In particular, PAOs with a number of carbons of about 30, more particularly 33 carbons, are preferred for automotive lubricants.
Accordingly, In a further aspect, the présent Invention relates to the use of the medium5 chain free fatty acid decarboxylase enzymes and the recombinant host cells described herein for the industrial production of lubricants.
Also provided herein are lubricants comprising poly-a-olefins, more particularly lubricants comprising poly-a-olefins which contain a more homogenous composition of poly-a-olefins, more particularly a high concentration of poly-olefins of a well-defined tO length, such as those obtainable by a method as described herein. Indeed, the Invention allows for the production of lubricants which are produced based on blosourced medium-chain alpha-olefins. More particularly, the Invention allows for the provision of lubricants comprising poly-a-olefins, whereby at least 50%, preferably at least 85% or 90%, more preferably at least 95% such as 96%, 97%, 98% or even 99% 15 of said poly-a-olefins are poly-a-olefins of a well defined carbon chain length, such as
C33 poly-a-olefins. Lubricants comprising such a homogenous composition of poly-aolefins hâve not previously been disclosed.
Further disclosed herein are methods for the production of alkanes, said methods comprising: (a) production of α-olefins more particularly medium-chain a-olefins, 20 according to a method disclosed herein; and (b) hydrogénation of the a-olefins obtained in step (a) to produce alkanes.
The présent invention will now be further Illustrated by means of the following non limiting examples.
EXAMPLES
Example 1: Production of Cn α-oleflns by OleTje. Sm46, Bs168 and Aa162
Material and methods
Molecular cloning
The genes encoding OleTJE (SEQ ID N0:1), Sm46 (SEQ ID N0:3), Bs168 (SEQ ID N0:5) and Aa162 (SEQ ID N0:7) were cloned Into the Ncfel/BamHI sites of the pET28a plasmid (Novagen, Fig. 2) by standard molecular biology techniques.
Protein overexpression and purification
E. coli BL21(DE3) cells (Novagen) carrying a recombinant plasmid or an empty PET28a plasmid (control) were cultured for several cycles to ensure best growth state at 37*C in 10 mL LB medium supplemented with 50 pg/ml of kanamycin, followed by inoculation (1:100 ratio) Into 50 mL fresh Terrifie Broth medium containing 50 pg/ml of kanamycin, 1 mM thlamine, 10% glycerol and a rare sait solution (6750 mg/l FeClj, 500 mg/l ZnCk, 500 mg/l CoCla, 500 mg/l Na2MoO4, 250 mg/l CaCb, 465 mg/l CuSO4, and 125 mg/l H3BO3) at 37’C. Cells were grown at 37'C for 3 to 4 h until the optical density at 600 nm (ODeoo) reached 0.6 to 0.8, at which 0.2 or 0.4 mM lsopropyl-β-οthiogalactopyranoside (IPTG) and 0.5 mM δ-amlnolevulinic acid were added, followed by 18 h of cultivation at 16°C.
Then, the cell cultures were recovered by centrifugation (at 4’C) and re-suspended in
1.5 mL 50 mM Tris buffer, pH 7.5, containing 0.1 M NaCI. The re-suspended cell cultures were disrupted by sonicatlon and centrifuged at 13,000 x g for 30 min (4 °C). The supernatant was collected and the pellet was re-suspended in 1.5 mL 50 mM Tris buffer, pH 7.5, containing 0.1 M NaCI for further analysis. An SDS gel was run to 20 check the expression of the genes using 15 pL of supernatant or pellet solution mixed with 5 pL of 4x loading buffer (Fig. 3).
The amount of total proteins within the supernatant was quantified using Pierce BCA kit according to the manufacturer^ instructions.
• ln vitro enzymatic assays ln vitro olefin biosynthesis was evaluated by adding to the supernatant a fatty acid substrate in DMSO to a final concentration of 0.2 mM (1:100 dilution) or 1 mM (1:10 dilution), and 500 pM H2O2. The reaction mixture was incubated at 28’C for 2-3 h, or stopped using 50 pL of 10 M HCl at t=0, 15, 30, 60 and 120 min for a dynamic test. Then, Cie fatty acid was added at a final concentration of 0.2 mM (1:100 dilution) as an internai standard. 300 pL réaction mixture of each reaction condition was extracted with equal volume of ethyl acetate, and the organic phase was analyzed for hydrocarbons by gas chromatography/mass spectrometry (GC/MS). In the meanwhile, undecene of
different concentrations (0, 0.05 mM, 0.1 mM, 0.2 mM and 0.5 mM) were tested to produce a standard curve.
Results
Figure 4 shows fatty acid decarboxylase activity for ali tested enzymes OleTjE, Sm46, 5 Bs168 and Aa162. Bs168 and Aa162 show preference for Cu fatty acid substrate, thereby specrfically producing Cu α-olefins. ln later experiments, Sm46 was also shown to hâve substrate preference for Cu fatty acid substrate (data not shown).
Example 2: Production of α-oleflns by a truncated Sm46 from Staphylococcus 10 Masslllensls
The construct pET28b-Sm46 of Example 1 did show poor expression in E. coli BL21(DE3), which did not Improve upon subcloning into another expression vector pCWori or optimization of the expression conditions, ln order to maximize expression 15 level and to assess potential of Sm46 as a decarboxylase enzyme with Cu as preferred substrate, a truncated version of codon-optlmlzed (for expression in E. coli) Sm46, Sm46-de!29, was designed, which has the N-termlnal 29 amino acids deleted, and the activity ofthe recombinant truncated Sm46-del29 was tested.
Material and methods
Matériels
Fatty acid substrates and terminal alkene authentic standards were purchased from TCI (Shanghaï, China). Antibiotics were obtained from Solar-Blo (Beijing, China). Other chemicals were purchased from Sigma Aldrich (St Louis, MO, USA) or Ameresco (Solon, OH, USA). Molecular cloning kits, such as E.Z.N.A.TM Plasmid Miniprep Kit 25 and Wizard SV Gel and PCR Clean-up System, were purchased from OMEGA Bio-Tek (Jlnan, China) and Promega (Madison, Wl, USA) respectively. Oligonucleotides and codon-optlmlzed gene were synthesized by Genewiz (Suzhou, China). The Pfu DNA polymerases and ail restriction endonucleases were obtained from Takara (Dalian, China). Ni-NTA resin used for protein purification was from Qlagen (Valencla, CA, 30 USA); Millipore Amlcon Ultra centrifugal fliters (Bilierica, MA, USA) and PD-10 desalting columns were from GE Healthcare (Piscataway, NJ, USA).
Molecular cloning Sm46-del29 gene
The 1,275 bp fragment of the truncated gene, Sm46-de!29, was PCR-amplified from the Initial construct of pET28b-Srri46 (see Example 1) using Pfu DNA polymerase and subsequently subcloned Into pET28b vector at the Nde l/Xho I restriction sites. The sequences of primera used are as follows:
GTCCATATGGCAAAAAAGCTGCCTAAAGTG (Sm46-del29-F, forward primer, SEQ ID N0:11) and GTACTCGAGTTATTTGCGGGCAACACGCGG (Sm46-del29-R, reverse primer, SEQ ID NO: 12). Ail recombinant plasmid constructs were confirmed by DNA sequencing (Sangon Biotech, Shanghai, China). Upon sequence vérification, plasmids were used to transform E. coli strain BL21 (DE3) for protein expression.
Heterologous expression and purification ofSm46-de!29
The £ coil BL21 (DE3) cells transformed with the recombinant pET28b-Sm46-de/29 plasmid were grown ovemlght at 37 ’C with shaking at 220 rpm in LB medium containîng 50 pg/ml kanamycin. The ovemight culture was used as seed culture to 15 Inoculate (1:100 dilution) 1 to 3 litres of modified Terrifie Broth contalning 4 % glycerol, mM thiamine, trace métal and the corresponding antibiotics. Cells were then grown at 37 ’C for 3 to 4 h until the optical density at 600 nm (QD600) reached - 0.6, at which point δ-aminolevulinic acid (0.5 mM final concentration) was supplemented and the expression of Sm46-del29 was Induced by the addition of 0.2 mM isopropyl-B-D20 thiogalactopyranoside (IPTG). Cells were then further cultured for 24 h at a reduced température of 18’C before being harvested by centrifugation at 6000 rpm, 4 ’C. The cell pellet was frozen at-80 ’C until requlred.
Purification of the His-tagged protein was carried out as described by Liu et al. (2014 25 Biotechnol. Biofuels 7:28) with minor modifications. Ail protein purification steps were performed at 4 ’C. Specifically, the cell pellets were thawed and re-suspended ln 40 ml Tysis buffer (50 mM NaH2PO4, 300 mM NaCI, 10% glycerol and 10 mM imidazole, pH 8.0) through vortexing. After cell disruption by ultrasonication, the cell lysate was centrifuged at 12,000 x g for 30 min to remove cellular débris. To the clarified cell 30 lysate, 1 ml of Ni-NTA resin was added and gently mixed at 4’C for 1 h. The slurry was then loaded onto an empty column, and washed with approximately 100 ml of wash buffer (50 mM NaH2PO4, 300 mM NaCI, 10% glycerol and 20 mM imidazole, pH 8.0) until no proteins were détectable in flow-through. The bound target proteins were eluted with elutîon buffer (50 mM NaH2PO4, 300 mM NaCI, 10% glycerol and 250 mM
Imidazole, pH 8.0). The eluates were pooled and concentrated with an Amicon Ultra centrifugal filter, and buffer exchanged on a PD-10 column into storage buffer (50 mM NahhPOx, 300 mM NaCI 10% glycerol, pH 7.4). The final purified protein was flashfrozen in liquid nitrogen and stored at -80’C for later use.
UV-visibia Spectroscopy—Spectroscoplc characterization ofSm46-dei29
Analysis of the UV-visible spectroscopic properties of Sm46-del29 was performed on a Cary 60 UV-visible spectrophotometer (Varian, UK). The ferrous-CO complex of Sm46del29 was prepared by slow bubbling of carbon monoxide gas into a solution of ferrie 10 P450, and subséquent réduction of the protein with sodium dithionite. The CO-bound reduced différence spectrum of the enzyme was obtained through recording the respective spectrum before and after the addition of sodium dithionite (Omura and Sato 1964. J Biol Chem 239:2379-2385, Amaya et al. 2018 J Inorg Biochem. 158:11-16).
ln vitro enzymatic assays
The fatty acid décarboxylation assays containing 1.0 pM Sm46-del29, 200 pM fatty acid substrate (from C« to C2o), 220 pM HjOj in 200 pl of storage buffer were carried out at 28*C for 2 h. Reactions were quenched by the addition of 20 pl of 10 M HCl. Heptadecanoic acid was added as Internai standard and the mixture was extracted by 20 200 pl ethyl acetate. Following extraction, the organic phase was collected and analyzed by gas chromatography as described below.
Gas chromatography (GC)
The GC analytical method for hydrocarbon and fatty acid samples was adapted from 25 Guan et al. (2011 J Chromatogr A 1218:8289-8293). The analyses were performed on an Agitent 7Θ90Β gas chromatograph equipped with a capillary column HP-INNOWAX (Agitent Technologies, Santa Clara, CA, USA; cross-linked polyethylene glycerol, l.d.
0,25 pm film thickness, 30 m by 0.25 mm). The hélium flow rate was set to 1 ml per minute. The oven température was controlled initially at 40’C for 4 min, then Increased 30 at the rate of 10*C per min to 250’C, and held for 5 min. The Injecting température was set to 280’C with the injection volume of 1 pl under splitless injection conditions. The response factors between fatty acids and alkenes were determined by analyzing known authentic fatty acids (C10-C20), 1-alkenes (C9-C19) and 1-heptadecanoic acid standards as described in Liu et al. (2014 Biotechnol. Biofuels 7:28).
Results
The truncated variant of Sm46 (i.e. Sm46-del29) had decarboxylase activity and showed substrate préférence for C12 free fatty acid for α-olefin production among the 5 tested fatty acid substrates (Fig. 6). Sm46-del29 was also able to decarboxylate Cw free fatty acid, but technical problems (due to volatility) hlndered détection of the formed C# a-olefin.
Example 3: Production of α-oleflns and hydroxy fatty acids by Sm46-del29, 10 Bs168and Aa162
Recomblnantly expressed and purified Bs168 and Aa162 from Example 1 and Sm46del29 of Example 2 were reacted with Cm fatty acid and the products of décarboxylation (Cu α-olefin) and hydroxylation (α-ΟΗ-Cu fatty acid and β-ΟΗ-Cu fatty acid) reactions were analyzed.
All enzymes tested were able to decarboxylate myristic acid (Ct4), but also catalyzed aand β-hydroxylation of myristic acid as side reactions. Fatty acid décarboxylation was the dominant reaction for all tested enzymes, but Sm46-del29 formed less hydroxy fatty acids, Indicating that these enzymes hâve spécifie decarboxylase activity.
Claims (20)
1. A method for the production of C7-C11 α-olefins comprising culturing a recombinant host cell comprising a recombinant nucleic acid encoding a decarboxylase enzyme under conditions suitable for the production of C7 to Cn α-olefins by said host cell, wherein said decarboxylase enzyme has the highest activity on an C6-C12free fatty acid substrate, wherein said recombinant nucleic acid encoding a decarboxylase enzyme comprises a nucléotide sequence having at least 75% sequence identity to SEQ ID NO:10, SEQ ID N0:7, or SEQ ID NO: 5 and wherein said recombinant nucleic acid ensures expression or overexpression of said decarboxylase enzyme.
2. The method according to claim 1, wherein said recombinant nucleic acid encodes a polypeptide comprising an amino add sequence having at least 80% identity to SEQ ID NO:13, SEQ ID NO: 8, or SEQ ID NO: 6, or an active fragment of said polypeptide.
3. The method according to any one of daims 1 or 2, wherein said recombinant nucleic acid encodes a polypeptide comprising the amino acid sequence of SEQ ID NO:13.
4. The method according to any one of claims 1 to 3, wherein said decarboxylase enzyme has the highest activity on C12 free fatty acids, and wherein said a-olefins are Cu a-olefins.
5. The method according to any one of claims 1 to 4, wherein said host cell is cultivated in a medium comprising Ca-C12free fatty acids.
6. The method according to daim 5, wherein said host cell is cultivated in a medium comprising Ci2free fatty acids.
7. The method according to any one of claims 1 to 6, wherein the host cell further comprises a recombinant nucleic add encoding a thioesterase having activity on Ca to C12 acyl-ACP.
8. The method according to claim 7, wherein said thioesterase has activity on C12 acylACP.
9. The method according to any one of claims 1 to 8, wherein the host cell is an oleaginous host cell.
10. The method according to any one of claims 1 to 9, wherein the host cell is selected from the group consisting of bacteria, yeasts, fungi, plants and algae.
11. The method according to any one of claims 1 to 10, further comprising the step of recovering the α-olefins from the host cell or the culture medium.
12. A method for the production of poly-a-olefins comprising the following steps:
i) producing Cr-Cn a-olefins according to a method according to any one of claims 1 to 11 ; and ii) performing an oligomérization reaction using the α-olefins obtained in step i) as monomer to produce an oligomer.
13. The method according to claim 12, further comprising the step of hydrogenating the oligomer produced in step ii).
14. The method according to claim 12 or 13, wherein the poly-a-olefins are C33 poly-aolefins, wherein step i) comprises the production of Cn a-olefins, and wherein the oligomérization reaction in step ii) is a trimerization reaction.
15. A composition comprising poly-a-olefins obtainable by the method according to claim 14, wherein at least 85% of the poly-a-olefins are C33 poly-a-olefins.
16. Use of a host cell comprising a nucleic acid comprising a nucleic acid sequence having at least 80% sequence identity to SEQ ID NO:7 or SEQ ID NO:5 integrated into its genome, or comprising a vector comprising at least one regulatory sequence operatively coupled to a nucleic acid sequence having at least 80% sequence identity to DEQ ID NO:7 or SEQ ID NO:5 for the industrial production of lubricants.
17. Use according to claim 16, wherein the host cell is an oleaginous eukaryotic microalga or oleaginous yeast.
18. Lubricant comprising poly-a-olefins obtainable by a method according to any one of claims 12 to 14.
19. Lubricant according to claim 18, wherein said poly-a-olefins are biosourced polya-olefins.
20. Lubricant according to claim 18 or 19, wherein at least 50% of said poly-a-olefins consist of C33 poly-a-olefins.
1/10
A) SEQ ID NO:1: Jeotgalicoccus sp. ATCC 8456 terminal olefin-forming fatty acid decarboxylase gene, complété cds
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP15174554.4 | 2015-06-30 |
Publications (1)
Publication Number | Publication Date |
---|---|
OA18507A true OA18507A (en) | 2018-12-04 |
Family
ID=
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7068362B2 (en) | Enzyme variants with improved ester synthase properties | |
US10407638B2 (en) | Production of alpha-olefins | |
JP2014526262A (en) | Production of odd-chain fatty acid derivatives in recombinant microbial cells. | |
JP6553610B2 (en) | Microbial production method of fatty amines | |
MX2013007288A (en) | Genetically engineered microorganisms comprising 4-hydroxybenzoly-coa thioesterases and methods of using the same for producing free fatty acids and fatty acid derivatives. | |
US20210292733A1 (en) | Improved cytochrome p450 fatty acid decarboxylases | |
JP2022043100A (en) | Production of odd chain fatty acid derivatives in recombinant microbial cells | |
KR102677413B1 (en) | Thioesterase variants with improved activity for the production of medium chain fatty acid derivatives | |
JP2019103507A (en) | Improved acetyl-CoA carboxylase variants | |
WO2019069969A1 (en) | Lipid manufacturing method | |
OA18507A (en) | Production of α-olefins. | |
WO2017022587A1 (en) | Lipid production method |