WO2020005495A1 - Methods and cells for producing fatty acids and fatty acid chain products - Google Patents

Methods and cells for producing fatty acids and fatty acid chain products Download PDF

Info

Publication number
WO2020005495A1
WO2020005495A1 PCT/US2019/035801 US2019035801W WO2020005495A1 WO 2020005495 A1 WO2020005495 A1 WO 2020005495A1 US 2019035801 W US2019035801 W US 2019035801W WO 2020005495 A1 WO2020005495 A1 WO 2020005495A1
Authority
WO
WIPO (PCT)
Prior art keywords
coa
fatty acid
recombinant cell
seq
ketoacyl
Prior art date
Application number
PCT/US2019/035801
Other languages
French (fr)
Inventor
David A. Hogsett
Hans H. Liao
Michael T.M. LOUIE
Ana NEGRETE-RAYMOND
Catherine B. Poor
Travis R. WOLTER
Original Assignee
Cargill, Incorporated
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cargill, Incorporated filed Critical Cargill, Incorporated
Publication of WO2020005495A1 publication Critical patent/WO2020005495A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1025Acyltransferases (2.3)
    • C12N9/1029Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)

Definitions

  • Fatty acids and fatty acid chain products have a number of industrial uses.
  • fatty acids and fatty acid chain products can be utilized as solvents for lacquers, paints, varnishes and other compositions; as plasticizers for organic resins, as fragrances and flavorings, as fuels for jet and other internal combustion engines, and as raw materials for making a variety of downstream products.
  • fatty acids and fatty acid chain products are produced industrially from non-renewable fossil fuels or from plant oils such as palm and coconut oil.
  • these conventional methods of producing fatty acids and fatty acid chain products have a number of drawbacks.
  • a particular drawback is poor selectivity of a specific carbon chain length.
  • the products of these conventional methods tend to be a mixture of compounds having a range of carbon chain lengths. Separating these mixtures of compounds can be difficult and can often lead to poor yields with much of the non-renewable fossil fuels or plant oils converted to lower-value byproducts that do not have the desired carbon chain lengths.
  • Some biological cells can naturally produce some types of fatty acid chain products. For example, almost all living cells produce triglycerides of fatty acids as well as other fatty acid esters. These triglycerides and other esters play a role in the metabolism, cellular structure, and other biological processes of the cells, and can perform other useful functions such as storing energy. Therefore, some biological cells can potentially be used to produce fatty acid chain products industrially. Among other potential advantages, biological production of fatty acids and fatty acid products in some cases can rely on annually renewable carbon sources such as sugars, rather than on fossil fuels.
  • Biological cells tend to produce fatty acid groups with chain lengths of 12 carbon atoms or greater.
  • naturally-occurring biological cells are a good source for C12 and higher fatty acids and their chain products.
  • triglycerides produced naturally by such cells can be hydrolyzed to produce C12 or higher fatty acids, which can in turn be converted to other derivatives such as esters or alcohols.
  • few cells naturally produce fatty acids of C6 - C10 chain length in significant quantities.
  • Some biological cells can produce fatty acid chain products through a native metabolic pathway that starts with acetyl-CoA and malonyl-ACP (malonyl-acyl-carrier protein).
  • Acetyl-CoA condenses with malonyl-ACP with loss of carbon dioxide and CoA to produce 3- ketobutyryl-ACP.
  • Subsequent enzymatic reactions convert the 3-ketobutyryl-ACP successively to 3-hydroxybutyryl-ACP, then to trans-2-butenoyl-ACP (with loss of water) and finally to butyryl-ACP.
  • the butyryl-ACP can re-enter this reaction cycle in place of acetyl-CoA to produce hexanoyl-ACP.
  • This cycle repeats itself, producing in each iteration a longer carbon atom chain by adding two carbon atoms at a time, until terminated by some other cellular process.
  • this native metabolic pathway has low termination rates for C6-C10 chain lengths, resulting in low production of C6-C10 fatty acids and fatty acid chain products.
  • Some biological cells can be genetically modified to increase production of C6-
  • C10 chain lengths are C10 chain lengths.
  • These genetic modifications can include insertions of heterologous genes and modification and/or deletions of endogenous genes to increase production of C6-C10 chain lengths.
  • heterologous genes can be inserted to provide enzymes to catalyze specific reactions between chemical intermediates along a pathway that results in production of C6-C10 chain lengths.
  • Okamura et ak in PNAS vol. 107, no. 25, pp. 11265-11270 (2010), reported that the enzyme produced by the nphT7 gene of a soil-isolated Streptomyces sp. strain can catalyze a single condensation of acetyl-CoA and malonyl-CoA to produce acetoacetyl- CoA.
  • U.S. Application No. 2014/0051136 disclosed that fatty acid chain products can be produced by a cell modified to include the nphT7 gene and additional heterologous genes.
  • the amounts of C6 or longer chain fatty acid chain products produced were limited and selectivity of C6-C10 chain lengths was limited.
  • WO 2015/010103 described mutants of the nphT7 gene that code for NphT7 enzymes that more efficiently catalyze the condensation of longer-chain acyl-CoA compounds with malonyl-CoA.
  • Cells modified to include both the wild-type nphT7 gene and the nphT7 mutant gene produced greater relative quantities of longer fatty acids, but selectivity of C6-C10 chain lengths remained poor.
  • C6-C10 fatty acid chain products in particular were made in only small amounts.
  • nucleic acid sequences and proteins as well as biological processes to produce fatty acids at higher yield and with selectivity to C6-C10 fatty acids or fatty acid chain products.
  • One aspect provides an expression system comprising at least one expression cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a polypeptide with 3-ketoacyl-CoA synthase activity and having at least 80% sequence identity to an amino acid sequence set forth in SEQ ID NO: 1-15, wherein the polypeptide catalyzes the condensation of acyl-CoA with malonyl-CoA to form 3-ketoacyl-CoA.
  • the polypeptide with 3-ketoacyl-CoA synthase activity has at least 80% sequence identity to an amino acid sequence set forth in SEQ ID NO: 1-4.
  • the polypeptide catalyzes one or more condensations selected from the group consisting of: a) C2-CoA with malonyl-CoA to form 3-keto-C4-CoA; b) C4-CoA with malonyl-CoA to form 3-keto-C6-CoA; c) C6-C0A with malonyl-CoA to form 3-keto-C8-CoA; and d) C8-C0A with malonyl-CoA to form 3-keto- ClO-CoA.
  • an amino acid residue of the polypeptide with 3-ketoacyl-CoA synthase activity that aligns with amino acid residue 147 of SEQ ID NO: 16 is serine and an amino acid residue that aligns with amino acid residue 217 of SEQ ID NO: 16 is valine.
  • the promoter is an inducible promoter, such as an inducible promoter sensitive to lowering phosphate concentration.
  • the promoter is a PpstSIH promoter or a PphoE promoter.
  • One aspect provides a recombinant cell comprising the expression system described herein, wherein the recombinant cell produces a fatty acid and/or fatty acid chain product having a chain length of one or more of C4, C6, C8 and/or C10.
  • the fatty acid chain product comprises one or more of fatty acid ester, fatty alcohol, fatty acid amide and fatty acid amine.
  • the fatty acid ester comprises one or more of fatty acid methyl ester, fatty acid ethyl ester, fatty acid propyl ester, fatty acid butyl ester, fatty acid pentyl ester and fatty acid hexyl ester.
  • the recombinant cell produces a fatty acid ester of C4, C6, C8, and/or C10 chain length. In another aspect, the recombinant cell produces a fatty acid ester of C8 and/or C10 chain length.
  • the alkoxy group of the fatty acid ester is derived from a Cl, C2, C3, or C4 monoalcohol. In another aspect, the alkoxy group is derived from a Cl or C2 monoalcohol. In one aspect, the monoalcohol is methanol.
  • the recombinant cell described herein further comprises one or more of: a heterologous nucleic acid segment encoding a polypeptide with enoyl-CoA reductase activity; a heterologous nucleic acid segment encoding a polypeptide with bifunctional 3- hydroxyacyl-CoA dehydrogenase/dehydratase activity; a heterologous nucleic acid segment encoding a polypeptide with ester synthase activity; a heterologous nucleic acid segment encoding a polypeptide with acetyl-CoA carboxylase activity; and a heterologous nucleic acid segment encoding a mutant polypeptide with ketoacyl-CoA synthase activity.
  • the heterologous nucleic acid segment encoding a mutant polypeptide with ketoacyl-CoA synthase activity is an Asch variant selected from the group consisting of Asch(Tl84I, F236L, V268A, V296A, V317A, S328G), Asch(Tl84I, V296A, V268A), and Asch(V296A).
  • the recombinant cell is a fungal cell, a bacterial cell, or a plant cell.
  • the recombinant cell comprises an Escherichia coli. In another aspect, the recombinant cell comprises a yeast cell.
  • One aspect provides a cell culture comprising: a) a recombinant cell described herein; and b) one or more fatty acids or fatty acid chain products produced by the recombinant cell.
  • One aspect provides a method of producing fatty acids and/or fatty acid chain products comprising growing the recombinant cell described herein in a culture medium, wherein the recombinant cell is grown under conditions in which the heterologous nucleic acid segment encoding the polypeptide with 3-ketoacy-CoA synthase activity is expressed; and the recombinant cell produces a fatty acid and/or a fatty acid chain product.
  • One aspect provides one or more fatty acids or fatty acid chain products produced herein for use in products.
  • One aspect provides an isolated 3-ketoacyl-CoA synthase having at least 80% sequence identity to an amino acid sequence set forth in SEQ ID NO: 1-15 for use in catalyzing the condensation of acyl-CoA with malonyl-CoA to form 3-ketoacyl-CoA.
  • the 3-ketoacyl-CoA synthase has activity for one or more of C2-CoA, C4-CoA and C6-C0A intermediates.
  • the 3-ketoacyl-CoA synthase has activity for C2-CoA and C4-CoA intermediates.
  • the 3-ketoacyl-CoA synthase has activity for C2-CoA, C4-CoA and C6-C0A intermediates.
  • One aspect provides a recombinant cell described herein for use in producing fatty acid and/or fatty acid chain products having a chain length of one or more of C4, C6, C8 and/or C10.
  • One aspect provides a 3-ketoacyl-CoA synthase having at least 80% sequence identity to an amino acid sequence set forth in SEQ ID NO: 1-4, wherein an amino acid residue of the 3-ketoacyl-CoA synthase that aligns with amino acid residue 147 of SEQ ID NO: 16 is serine and an amino acid residue that aligns with amino acid residue 217 of SEQ ID NO: 16 is valine.
  • the recombinant cell produces a fatty acid and/or fatty acid chain product having a chain length of one or more of C6, C8 and/or C10. In another aspect, the recombinant cell produces a fatty acid ester of C6, C8, and/or C10 chain length. In one aspect, the recombinant cell produces a fatty acid ester of C8 and/or C10 chain length.
  • the alkoxy group is derived from a Cl, C2, C3, or C4 monoalcohol. In one aspect, the alkoxy group is derived from a Cl or C2 monoalcohol. In some aspects, the monoalcohol is methanol.
  • FIG. 1 shows graphical views of the sequence alignment. Each line corresponds to a 3-ketoacyl-CoA synthase enzyme as indicated by the designation. The amino acid position of the first residue of each row is indicated. The SV mutations are marked with the boxes.
  • FIGS. 2A-2D show graphical views of the sequence alignment. Each line corresponds to a 3-ketoacyl-CoA synthase enzyme as indicated by the designation. The amino acid position of the first residue of each row is indicated. The SV mutations are marked with the boxes. Using the NphT7 numbering, it is I147S and F217V. Sequence alignment was done using Clone Manager alignment software with a Scoring Matrix: Standard Linear and default settings.
  • FIG. 3 shows the FAME production by NphT7 and its mutants SV and LSVA in a C10 FAME production pathway with or without chromosomal expression of wild type nphT7.
  • FIG. 4 shows FAME production by wild type Nbra, Sten, Sko3, Scin in three different FAME production pathways (C8 FAME, C10 FAME, C8/C10 mix).
  • FIG. 5 shows FAME production by wild- type Nbra and its SV mutants vs.
  • NphT7(LSVA) in three different FAME production pathways C8 FAME, C10 FAME, C8/C10 mix
  • C8 FAME C10 FAME
  • C8/C10 mix C8/C10 mix
  • FIG. 6 shows FAME production by wild- type Sten and its SV mutants vs.
  • NphT7(LSVA) in three different FAME production pathways C8 FAME, C10 FAME, C8/C10 mix
  • C8 FAME C10 FAME
  • C8/C10 mix C8/C10 mix
  • FIG. 7 shows FAME production by wild-type Sko3 and its SV mutants vs.
  • NphT7(LSVA) in three different FAME production pathways C8 FAME, C10 FAME, C8/C10 mix
  • C8 FAME C10 FAME
  • C8/C10 mix C8/C10 mix
  • FIG. 8 shows FAME production by wild- type Scin and its SV mutants vs.
  • NphT7(LSVA) in three different FAME production pathways C8 FAME, C10 FAME, C8/C10 mix
  • C8 FAME C10 FAME
  • C8/C10 mix C8/C10 mix
  • recombinant cells can be genetically modified to express heterologous polypeptides for the biosynthesis of C6-C10 fatty acids and/or fatty acid chain products.
  • These recombinant cells can be genetically modified to express identified enzymes having 3-ketoacyl-CoA synthase activity to produce C6-C10 fatty acids and/or fatty acid chain products.
  • these identified enzymes having 3-ketoacyl-CoA synthase activity can catalyze the stepwise addition of malonyl-CoA to an acyl-CoA substrate that can be further converted to a C6-C10 fatty acid and/or fatty acid chain product.
  • an enzyme having 3-ketoacyl-CoA synthase activity can catalyze the addition of malonyl-CoA to a C2-CoA substrate to lead to a C4-CoA product that can be further converted to its fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C4-CoA substrate to lead to a C6-C0A product that can be further converted to its fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C6-C0A substrate to lead to a C8-C0A product that can be further converted to its fatty acid and/or fatty acid chain product, and/or the addition of malonyl- CoA to a C8-C0A substrate to lead to a ClO-CoA product that can be further converted to its fatty acid and/or fatty acid chain product.
  • an enzyme having 3-ketoacyl-CoA synthase activity can catalyze the addition of malonyl-CoA to a C2-CoA substrate (acetyl-CoA) to lead to a 3-keto-C4-CoA, the addition of malonyl-CoA to a C4-CoA substrate to lead to a 3- keto-C6-CoA, the addition of malonyl-CoA to a C6-C0A substrate to lead to a 3-keto-C8-CoA, and/or the addition of malonyl-CoA to a C8-C0A substrate to lead to a 3-keto-ClO-CoA.
  • the recombinant cells can be genetically modified to express other heterologous polypeptides for the biosynthesis of C6-C10 fatty acids and/or fatty acid chain products.
  • the recombinant cells can be genetically modified to express one or more heterologous polypeptides capable of converting a 3-keto-CX-CoA substrate (e.g., 3-keto-C4-CoA, 3-keto-C6-CoA, 3-keto-C8-CoA, and/or 3- keto-ClO-CoA) to a 3-OH CX-CoA substrate (e.g., 3-OH-C4-CoA, 3- OH-C6-C0A, 3-OH-C8- CoA, and/or 3-OH-ClO-CoA) (where X represents the carbon chain length).
  • a 3-keto-CX-CoA substrate e.g., 3-keto-C4-CoA, 3-keto-C6-CoA, 3-keto-C8-CoA, and/or 3- keto-ClO-CoA
  • 3-OH CX-CoA substrate e.g., 3-OH-C4-CoA, 3- OH-C6-C0
  • the recombinant cells can be genetically modified to express one or more heterologous polypeptides capable of converting a 3-OH-CX-CoA substrate (e.g., 3-OH-C4-CoA, 3-OH-C6-CoA, 3-OH-C8-CoA, and/or 3-OH-ClO-CoA) to a trans-enoyl-CX-CoA substrate (e.g., enoyl-C4-CoA, enoyl-C6- CoA, enoyl-C8-CoA, and/or enoyl-ClO-CoA) (where X represents the carbon chain length).
  • a 3-OH-CX-CoA substrate e.g., 3-OH-C4-CoA, 3-OH-C6-CoA, 3-OH-C8-CoA, and/or 3-OH-ClO-CoA
  • trans-enoyl-CX-CoA substrate e.g., enoyl-
  • the recombinant cells can be genetically modified to express one or more heterologous polypeptides capable of converting a trans-enoyl-CX-CoA substrate (e.g., enoyl-C4-CoA, enoyl- C6-C0A, enoyl-C8-CoA, and/or enoyl-ClO-CoA) to a CX-CoA substrate (e.g., C4-CoA, C6- CoA, C8-C0A, and/or ClO-CoA) (where X represents the carbon chain length).
  • a trans-enoyl-CX-CoA substrate e.g., enoyl-C4-CoA, enoyl- C6-C0A, enoyl-C8-CoA, and/or enoyl-ClO-CoA
  • CX-CoA substrate e.g., C4-CoA, C6- Co
  • the recombinant cells can be genetically modified to express one or more heterologous polypeptides capable of converting a CX-CoA substrate (e.g., C4-CoA, C6-C0A, C8-C0A, and/or ClO-CoA) to a fatty acid ester.
  • a CX-CoA substrate e.g., C4-CoA, C6-C0A, C8-C0A, and/or ClO-CoA
  • nucleic acid means one or more nucleic acids.
  • Methods available to those skilled in the art can be used to construct genetic expression constructs and recombinant cells as described. These methods include, but are not limited to, in vitro recombinant DNA techniques, synthetic DNA techniques, in vivo
  • nucleic acid can be used interchangeably to refer to nucleic acid comprising DNA, RNA, derivatives thereof, or combinations thereof.
  • Amino acid residues in all amino acid sequences described herein are ordered in the N-terminus to C-terminus direction. “Upstream” means in the direction toward the N- terminus, and“downstream” means toward the C-terminus direction.
  • The“start” of an amino acid sequence is the first amino acid residue in the N-terminus direction.
  • the first amino acid residue (amino acid residue 1) for any sequence or sub-sequence described herein is the amino acid residue at its N-terminus.
  • A“sub-sequence” is a sequence of amino acid residues contained within a larger amino acid sequence.
  • Polypeptides refer to polymeric chains of amino acids connected by peptide bonds.
  • polypeptides can fold into unique three dimensional structures that allow the polypeptides to function as enzymes that catalyze distinct biochemical reactions.
  • a 3-ketoacyl-CoA synthase polypeptide catalyzes the condensation reaction of an acyl- CoA with malonyl-CoA to form a 3-ketoacyl-CoA.
  • Identity is used herein to indicate the extent to which two (nucleotide or amino acid) sequences have the same residues at the same positions in side-by-side alignment of their sequences. The identity is expressed herein as a % identity as determined using BLAST (National Center for Biological Information (NCBI) Basic Local Alignment Search Tool) version 2.2.31 software (National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, Maryland, USA), using default parameters unless indicated otherwise in this paragraph.
  • NCBI National Center for Biological Information
  • NCBI National Center for Biological Information
  • Nucleic acid % sequence identity between nucleic acid sequences is determined using standard nucleotide BLAST with the following default parameters: Max target sequences: 100; Short queries: Automatically adjust parameters for short input sequences; Expect threshold: 10; Word size: 28; Max matches in a query range: 0; Match/Mismatch Scores: 1, -2; Gap costs: Linear; Filter: Low complexity regions; Mask: Mask for lookup table only.
  • a sequence having a % identity score of XX% (for example, 80% or 95%) to a reference sequence as determined in this manner is considered to be XX% identical to or, equivalently, have XX% sequence identity to, the reference sequence.
  • the sequence containing the sub-sequence under investigation is aligned with the reference sequence, and the amino acid residue of the sub-sequence under investigation occupies the same position in the alignment as does the amino acid residue of the reference sub-sequence.
  • the enzymes described herein can be expressed from expression cassettes.
  • Such an expression cassette (also referred to as a transgene) can include a promoter operably linked to a nucleic acid segment that encodes any of the enzymes described herein.
  • expression vector and "expression cassette” refer to a recombinant nucleic acid molecule containing a desired nucleic acid coding segment and appropriate nucleic acid sequences for the expression of the operably linked nucleic acid coding segment in a particular host cell/recombinant cells or organism.
  • Nucleic acid sequences for expression of proteins in prokaryotes usually include a promoter, an operator (optional), and a ribosome binding site, often along with other sequences.
  • Eukaryotic cells can utilize promoters, enhancers, and termination and polyadenylation signals.
  • Nucleic acid sequences for expression of proteins can be heterologous to the nucleic acid coding segment.
  • a “heterologous nucleic acid,” “heterologous gene” and “heterologous protein” indicate a nucleic acid, gene or protein, respectively, which has come from a source other than its native source.
  • the terms “heterologous” and “exogenous” are sometimes used interchangeably with “recombinant.”
  • Such heterologous nucleic acid, heterologous genes, and/or heterologous proteins may have been artificially supplied to the biological system or host/recombinant cell.
  • Promoters provide for expression of mRNA.
  • the promoter can be a native promoter.
  • the promoter can in some cases be heterologous to the nucleic acid segment.
  • such a heterologous promoter may not be naturally linked to such nucleic acid segment.
  • some expression cassettes and expression vectors have been recombinantly engineered to include a nucleic acid coding segment (such as one encoding an enzyme described herein) operably linked to a heterologous promoter.
  • a nucleic acid coding segment is operably linked to the promoter, for example, when it is located downstream from the promoter.
  • a variety of promoters can be included in the expression cassettes and/or expression vectors.
  • an endogenous promoter or an endogenous promoter that expresses the native enzyme, can be employed.
  • Promoter regions are typically found in the flanking DNA upstream from the coding sequence in both prokaryotic and eukaryotic cells.
  • a promoter sequence provides for regulation of transcription of the downstream gene sequence and typically includes from about 50 to about 2,000 nucleotide base pairs.
  • Promoter sequences can also contain regulatory sequences such as enhancer sequences that can influence the level of gene expression.
  • Some isolated promoter sequences can provide for gene expression of heterologous DNAs, that is a DNA different from the native or homologous DNA.
  • Promoters can be strong or weak, or inducible.
  • a strong promoter provides for a high level of gene expression, whereas a weak promoter provides for a low level of gene expression.
  • An inducible promoter is a promoter that provides for the turning on and off gene expression in response to an exogenously added agent, or to an environmental or developmental stimulus. Promoters can also provide for cell specific, tissue specific or developmental regulation.
  • a strong promoter for heterologous DNAs can be advantageous because it provides for a sufficient level of gene expression for easy detection and selection of transformed cells and provides for a high level of gene expression when desired.
  • the promoter within such expression cassettes / vectors can be functional in a fungal cell.
  • An enzyme-encoding nucleic acid segment can be combined with a promoter by standard methods to yield an expression cassette or transgene, for example, as described in Sambrook et al. (MOLECULAR CLONING: A LABORATORY MANUAL. Second Edition (Cold Spring Harbor, NY: Cold Spring Harbor Press (1989); MOLECULAR CLONING: A LABORATORY MANUAL. Third Edition (Cold Spring Harbor, NY: Cold Spring Harbor Press (2000)). Briefly, a plasmid containing a promoter can be constructed. Typically, these plasmids are constructed to have multiple cloning sites having specificity for different restriction enzymes downstream from the promoter.
  • the enzyme encoding nucleic acids can be subcloned downstream from the promoter region using restriction enzymes and positioned to ensure that the nucleic acid segment is inserted in proper orientation with respect to the promoter so that the coding segment can be expressed as sense or antisense RNA.
  • the enzyme nucleic acid is operably linked to a promoter, the expression cassette so formed can be subcloned into a plasmid or other vector.
  • A“recombinant cell” is a cell whose genetic material has been altered by a human using genetic engineering techniques including molecular cloning.
  • Recombinant cells can comprise cells whose genetic material has been altered by the addition of genetic material from a cell of a different organism by genetic engineering.
  • Recombinant cells can also comprise cells whose genetic material has been altered by the deletion of genetic material by genetic engineering.
  • Recombinant cells can also comprise cells altered by the addition of extra copies of genetic material that is normally native to the recombinant cell by genetic engineering.
  • genetic material such as genes, promoters and terminators is“heterologous” if it is (i) non-native to the wild-type cell before it was converted to a recombinant cell and/or (ii) is native to the cell, but is present at a location different than where that genetic material is present in the wild-type cell and/or (iii) is under the regulatory control of a non-native promoter and/or non-native terminator ⁇ Extra copies of native genetic material are considered as“heterologous” for purposes of this application, even if such extra copies are present at the same locus as that genetic material is present in the wild-type cell.
  • a gene (such as a gene coding for a 3-ketoacyl-CoA synthase) is“heterologous” if it is not natively expressed by the native cell before it is converted to a recombinant cell, if it is native to the wild-type cell, but is present at a location different than where that genetic material is present in the wild-type cell, if it is under the regulatory control of a non-native promoter and/or non-native terminator, and/or if extra copies are present at the same locus where that genetic material is normally present in the wild-type cell.
  • a heterologous gene can produce a heterologous polypeptide.
  • A“3-ketoacyl-CoA synthase” is an enzyme that catalyzes the condensation reaction of an acyl-CoA with malonyl-CoA to form a 3-ketoacyl-CoA (which is 2 carbon longer in chain length than the acyl-CoA primer), CoA-SH, and C0 2 .
  • One method for evaluating the ability of an enzyme to catalyze this reaction is by measuring the release of free CoA-SH using 5,5'-dithiobis-(2-nitrobenzoic acid) (DTNB), malonyl-CoA as the donor substrate, and a C2-C8 acyl-CoA as the primer substrate, in the presence of the enzyme.
  • DTNB 5,5'-dithiobis-(2-nitrobenzoic acid)
  • malonyl-CoA as the donor substrate
  • C2-C8 acyl-CoA as the primer substrate
  • the ability of an enzyme to catalyze this condensation reaction can also be evaluated by using an acyl-CoA as the primer substrate and malonyl-CoA as the donor substrate in the presence of 5 mM Mg ++ salt by measuring the increase in absorbance at 303 nm as a function of the increase in the formation of a Mg ++ -complex with the 3-ketoacyl-CoA product.
  • 3-Ketoacyl-CoA synthase will produce acetoacetyl-CoA from acetyl-CoA primer, 3- ketohexanoyl-CoA from butyryl-CoA primer, 3-ketooctanoyl-CoA from hexanoyl-CoA primer, and 3-ketodecanoyl-CoA from octanoyl-CoA primer.
  • NC-IUBMB Enzyme Nomenclature
  • A“CoA” or“CoA-SH”, as described here refers to Coenzyme A.
  • a CoA substrate can refer to a straight chain carbon chain covalently linked to Coenzyme A.
  • C4-CoA refers to a C4 chain covalently linked to Coenzyme A via a thioester bond between the thiol of the Coenzyme A and a terminal carbon of the C4 chain.
  • A“fatty acid,” as described here, is a straight-chain or branched-chain monoalkanoic acid having at least four carbon atoms.
  • A“fatty acid chain product” is a compound having a straight chain alkyl group formed in a series of one or more reactions at the site of the terminal carboxyl group of a fatty acid (or a thioester bond of a corresponding -CoA compound).
  • a fatty acid chain product can comprise a straight carbon chain with a different end group such as, for example, an ester, an alcohol group, an amino group, an aldehyde group, a ketone, a methyl group, or an alkenyl group.
  • Fatty acid chain product can include fatty acid ester, fatty alcohol, fatty acid amide, fatty acid imide, and fatty acid amine.
  • A“fatty acid ester” is an ester compound corresponding to the reaction product of a fatty acid or a fatty acyl-CoA and an alcohol (with loss of a water or a CoA-SH).
  • a fatty acid ester can also correspond to the reaction product of a fatty acyl CoA and an alcohol.
  • Fatty acid ester can include fatty acid methyl ester, fatty acid ethyl ester, fatty acid propyl ester, fatty acid butyl ester, fatty acid pentyl ester, and fatty acid hexyl ester.
  • Fatty acid methyl ester (FAME) is an ester compound corresponding to the reaction product of a fatty acid or a fatty acyl-CoA and methanol.
  • Chain lengths of fatty acids and fatty acid chain products are sometimes indicated herein by the shorthand“CX,” wherein X is a number designating the number of carbon atoms.
  • the number of carbon atoms designated in each case represents the carbon length of the straight- chain compound (after removal of CoA or ACP coenzymes) formed by the recombinant cell through one or more iterations of the reaction cycle: acyl-CoA (or acyl- ACP) + malonyl-CoA to form a 3-ketoacyl compound; reduction of the 3-ketoacyl compounds to form a 3-hydroxyacyl compound; dehydration of the 3-hydroxyacyl-CoA to form a 3-enoylacyl compound; and reduction of the 3-enoylacyl compound to the corresponding acyl compound.
  • CX is a number designating the number of carbon atoms.
  • the number of carbon atoms designated in each case represents the carbon length of the straight- chain compound (after removal of CoA or ACP coenzy
  • Each iteration of this reaction cycle adds two carbon atoms to the starting acyl-CoA or acyl-ACP.
  • the number of carbon atoms does not include additional carbon atoms that may be added during the formation of any fatty acid chain products of the fatty acid, such as, for example, carbons included in an ester group following an esterification.
  • hexanoic acid methyl ester is considered as a“C6” fatty acid ester compound, the carbon of the methyl ester group not being counted.
  • The“CX” designation can also apply to fatty acid chain products.
  • The“CX” designation can also apply to fatty acid chain products. For example,
  • C4 FAME refers to methyl butanoate
  • C6 FAME refers to methyl hexanoate
  • C8 FAME refers to methyl octanoate
  • C10 FAME refers to methyl decanoate
  • C12 FAME refers to methyl dodecanoate.
  • The“CX” designation can also be used for fatty acid intermediates.
  • C2-CoA refers to acetyl-CoA
  • C4-CoA refers to butyryl-CoA
  • C6-C0A refers to hexanoyl-CoA
  • C8-C0A refers to octanoyl-CoA
  • ClO-CoA refers to decanoyl-CoA
  • Cl2-CoA refers to dodecanoyl-CoA.
  • engineered biosynthetic pathway refers to a biosynthetic pathway that is at least partially genetically engineered into a recombinant cell and comprises one or more enzymes to carry out a sequence of steps to produce a desired product, such as a fatty acid or fatty acid chain product.
  • one or more of the enzymes of the engineered biosynthetic pathway do not naturally occur in the unmodified cell. These enzymes can be introduced into the cell by inserting a heterologous enzyme by genetic engineering.
  • one or more of the enzymes of the engineered biosynthetic pathway do not naturally occur in sufficient copy number in an unmodified cell and additional copies of the enzymes are inserted by genetic engineering.
  • the engineered biosynthetic pathway also comprises genetic modifications to the cell to reduce or eliminate competing metabolic pathways and/or to reduce interfering activities, such as degradation of desired products or necessary intermediates.
  • the term“endogenous” gene refers to a gene that is naturally found in a particular cell.
  • the term“overexpress” is used to refer to the expression of a heterologous gene in a recombinant cell at levels higher than the level of gene expression in a wild type cell.
  • an endogenous gene is deleted.
  • the terms“deletion,”“deleted,”“knockout,” and “knocked out” can be used interchangeably to refer to an endogenous gene that has been engineered to no longer be expressed in a recombinant cell.
  • a deleted/knocked out gene is an endogenous gene that is deleted to increase production of a desired product, such as a fatty acid or fatty acid chain product.
  • the terms“variant” and“mutant” protein are used to describe a protein sequence that has been modified at one or more amino acids, compared to the wild type sequence of a particular protein. Mutations to the amino acid residues encoded by the wild-type genes are designated herein by the shorthand designation for the wild-type enzyme, followed in parenthesis by a 3-, 4- or 5 character code consisting of a first letter designating the amino acid residue in the native enzyme, a 1-, 2- or 3-digit number indicating the position of that amino acid residue in the native enzyme, and a final letter designating the amino acid residue in that position in the mutated enzyme.
  • NphT7(Il47S, F217V) indicates that an isoleucine (I) at amino acid residue position 147 in the wild type NphT7 enzyme has been replaced with a serine (S) and that a phenylalanine at amino acid residue position 217 in the wild type NphT7 enzyme has been replaced with a valine (V).
  • Mutations to the amino acid residues encoded by the wild-type genes can also be designated herein by the shorthand designation for the wild-type enzyme, followed in parenthesis by the final letter or letters designating the amino acid residue in that was mutated. For example the designation
  • NphT7(SV) indicates that two amino acids have been mutated to a serine (S) and valine (V) respectively.
  • S serine
  • V valine
  • the single-letter designations are IUPAC amino acid abbreviations as reported, for example, at Eur. J. Biochem. 138:9-37(1984).
  • the recombinant cell is a prokaryotic cell. In some aspects, the recombinant cell is a eukaryotic cell.
  • the recombinant cell is a microorganism, and may be a single- celled microorganism.
  • the recombinant cell may be a plant cell, including a cell from a plant within any of the Chlorophyta, Charophyta, Marchantiophyta, Anthocerotophyta, Bryophyta,
  • Lycopodiophyta Pteridophyta, Cycadophyta, Ginkgophyta, Pinophyta, Gnetophyta or
  • Such a plant cell may be, for example, a cell from a plant within any of the genera Arabidopsis, Beta, Glycine, Jatropha, Miscanthus, Panicum, Phalaris, Populus, Saccharum, Salix, Simmondsia, and Zea.
  • the recombinant cell may be a fungal, microalgae, algae or red algae
  • the recombinant cell may be a yeast cell.
  • a yeast or fungal cell may be an oleaginous yeast or fungus, and/or may be a Crabtree negative yeast or fungus.
  • oleaginous fungi refers to yeasts or filamentous fungi, which accumulate at least 10%, 12.5%, 15%, 17.5%, preferably at least 20% or even at least 25%
  • biomass (w/w) of their biomass as lipid. They may even accumulate at least 30%, 40%, 50%, 60%, 70%, 80% (w/w) or more of their biomass as lipids.
  • the biomass is usually measured as cell dry weight (CDW).
  • A“Crabtree -positive” organism is one that is capable of producing ethanol in the presence of oxygen, whereas a "Crabtree-negative” organism is not.
  • a yeast cell having a Crabtree-negative phenotype is any yeast cell that does not exhibit the Crabtree effect.
  • the term “Crabtree-negative” refers to both naturally occurring and genetically modified organisms. Briefly, the Crabtree effect is defined as the inhibition of oxygen consumption by a microorganism when cultured under aerobic conditions due to the presence of a high concentration of glucose (e.g., 10 g-glucose L-l).
  • a yeast cell having a Crabtree positive phenotype continues to ferment irrespective of oxygen availability due to the presence of glucose, while a yeast cell having a Crabtree-negative phenotype does not exhibit glucose mediated inhibition of oxygen consumption.
  • Crabtree-positive yeast produce an excess of alcohol rather than biomass production.
  • yeast cells include, Pichia, Candida, Klebsiella, Hansenula,
  • Saccharomyces Schizosaccharomyces, Zygosaccharomyces, Debaryomyces, Cryptoococcus, Rhodotorula, Rhodosporidium, Lipomyces and Yarrowia.
  • yeast cells include C. sonorensis, K. marxianus, K. thermotolerans, C. methane sorbosa, Saccharomyces bulderi (S. bulderi), I. orientalis, C. lambica, C. sorboxylosa, C. zemplinina, C. geochares, P. membranifaciens, Z. kombuchaensis, C. sorbosivorans, C. vanderwaltii, C. sorbophila, Z.
  • K. marxianus and C. sonorensis include those described in WO 00/71738 Al, WO 02/42471 A2, WO 03/049525 A2, WO 03/102152 A2 and WO
  • Suitable strains of I. orientalis are ATCC strain 32196 and ATCC strain PTA- 6648.
  • the recombinant cell is a bacteria cell.
  • the bacteria may be a gram-positive or gram-negative bacteria. It may be a cell within any of the Chlamydiae, green nonsulfur, actinobacteria, planctomycetes, spirochaetes, fusobacteria, cyanobacteria, thermophilic sulphate-reducer, acidobacteria or proteobacteria classifications of bacteria (Ciccarelli et al, Science 311 (5765): 1283-7 (2006).
  • suitable bacterial cells include, for example, those within any of the genera Clostridium, Zymomonas, Escherichia, Salmonella, Rhodococcus, Pseudomonas, Streptomyces, Bacillus, Lactobacillus, Enterococcus, Alcaligenes, Klebsiella, Paenibacillus, Arthrobacter, Corynebacterium, Bacteriophage, Brevibacterium, Acanthoceras, Acanthococcus, Acarvochloris, Achnanthes, Achnanthidiun, Actinastrum, Actinochloris, Actinocyclus,
  • Batrarhospermum Binurlearia, Bitrichia, Blidingia, Botrdiopsis, Botrydium, Botryococcus, Botryosphaerella, Brachiomonas, Brachysira, Brachytrichia, Brebissonia, Bulbochaete, Bumnilleria, Buinilleriopsis, Caloneis, Calothrix, Campylodiscus, Capsosiphon, Carteria, Catena, Cavinula, Cenritractus, Centroniella, Ceratiunt, Chaetoceros, Chaetochloris,
  • Chaetomorpha Chaetonella, Chaetonemna, Chaetopeltis, Chaetophora, Chaetosphaeridium, Chamaesiphion, Chara, Characiochloris, Characiopsis, Characium, Chorales, Chilomonas, Chlainomonas, Chlamydoblepharis, Chlamydocapsa, Chlamydomonas, Chlamydomonopsis, Chlamydomnyxa, Chlamydonephris, Chlorangiella, Chlorangiopsis, Chlorella, Chlorobotrys, Chlorobrachis, Chlorochytrium, Chloroccun, Chlorogloea, Chlorogloeopsis, Chlorogonium, Chlorolobion, Chloromonas, Chlorophysema, Cholorphyta, Cholorosaccus, Cholorosarcina, Choricystis, Chromophyton, Ch
  • Chrysocapsella Chrysochaete, Chrysohromulina, Chrysococcus, Chrysocrinus,
  • Chrynsolepidomonas Chrysolykos, Chrysonebula, Chrysophyta, Chrysopyxis, Chrysosaccus, Chrysophaerella, Chrysotephanosphaera, Clodophora, Clastidium, Closteriopsis, Closterium, Coccomyxa, Cocconeis, Coelastrella, Coelastrum, Coelosphaerium, Coenochloris,
  • Coenococcus Coenocystis, Colacium, Coleochaete, Collodictyon, Compsogonopsis,
  • Compsopogon Conjugatophyta, Conoehaete, Coronastrum, Cosmarium, Cosmnioneis, Cosmocladium, Crateriportula, Craticula, Crinalium, Crucigenia, Crucigeniella, Cryptoaulax, Cryptomonas, Cryptophyta, Ctenophora, Cyanodictyon, Cyanonephron, Cyanophora,
  • Cyanophyta Cyanothece, Cyanothomonas, Cyclonexis, Cyclostephanos, Cyclotella,
  • Cylindrocapsa Cylindrocystis, Cylindrospermum, Cylindrotheca, Cymatopleura, Cymbella, Cymbeilonitzschia, Cystodinium Dactylococcopsis, Debarya, Denticula, Dermatochrysis, Dermorarpa, Dermocarpella, Desmatractum, Desmidium, Desmococcus, Desmonema, Desmosiphon, Diacanthos, Diacronema, Diadesmis, Diatoma, Diatomella, Dicellula,
  • Dichothrix Dichtotomococcrus, Dicranochaete, Dictyochloris, Dictyococcus, Dictyosphaerium, Didymocystis, Didymogenes, Didymosphenia, Dilabifilum, Dimorphoccus, Dinobryon,
  • Entomorpha Entocladia, Entomoeis, Entophysalis, Ephichrysis, Epipyxis, Epithemia, Eremosphaura, Euastropsis, Euatstrum, Eucapsis, Eucocconeis, Eudorina, Euglena,
  • Euglenophyta Eunotia, Eustigmatophyta, Eutreptia, Fallcia, Ficherella, Fragilaria,
  • Glaucophyta Glenodiniopsis, Glenodinium, Gloeomonas, Gloeoplax, Gloeothece, Geloeotila, Gloeotrichia, Gloiodictyon, Golenkinia, Golenkiniopsis, Gomontia, Gomphocymbella,
  • Gomphonema Gomphosphaeria, Gonatozygon, Gongrosia, Gongrosira, Goniochloris, Gonium, Gonyostomum, Granulochloris, Granulocystopsis, Groenbladia, Gymnodiunium, Gymnozyga, Gyrosignma, Haematocuccus, Hafniomonas, Hallassia, Hammatoidea, Hannaea, Hantzchia, Hapalosiphon, Haplotaenium, Haptophyta, Haslea, Hemidinuim, Hemitonia, Heribaudiella, Heteromastix, Heterothrix, Hibberdia, Hildenbrandia, Hillea, Holopedium, Homoeothrix, Hormanthonema, Hormotila, Hyalobrachion, Hyalocardium, Hyalodiscus, Hyalogonium, Hyalothe
  • mice Micractinium, Micrasterias, Microchaete, Microcoleus, Microcystis, Microglena, Micromonas, Microspora, Microthamnion, Mischococcus, Monocrysis, Monodus, Monomastix,
  • Nephroclamys Nephrocytium, Nephrodiella, Nephroselmis, Netrium, Nitella, Nitellopsis, Nitzschia, Nodularia, Nostoc, Ochromonas, Oedogonium, Oligochaetophora, Onychonema, Oocadrium, Oocrystis, Opephora, Ophiocytium, Orthoseira, Oscillartoria, Oxyneis,
  • Pachycladella Palmella, Palmodictyon, Pnadorina, Pannus, Paralia, Pascherina, Paulshulzia, Pediastrum, Pedinella, Pedinomonas, Pedinopera, Pelagodictyon, Penium, Peranema,
  • Peridiniopsis Peridinium, Peronia, Petroneis, Phacotus, Phacus, Phaeaster, Phaeodermatium, Phaeophyta, Phaeoshaera, Phaeothamnion, Phormidium, Phycopeltis, Phyllariochloris, Phyllocadium, Phyllomitas, Pinnilaria, Pitophora, Placoneis, Planctonema, Planktophaeria, Planothidium, Plectonema, Pleodorina, Pleurastrum, Pleurocapsa, Pleurocladia, Pleurodiscus, Pleurosigma, Pleurosira, Pleurotaenium, Pocillomanas, Podohedea, Polyblepharides,
  • Pseudophaerocystis Pseudostaurastrum, Pseudostraurosira, Pyrrophyta, Quadrichloris, Quadricoccus, Quadrigula, Radiocuccus, Radiobetalum, Raphidiopsis, Raphidocelis,
  • Raphidonema Raphidophyta
  • Peimeria Rhadorderma, Rhabomonas, Rhizoclonium
  • Rhodomonas Rhodiphyta, Rhoicosenia, Rhopalodia, Rivularia, Rosenvingiella, Rossithidium, Roya, Scenedesmus, Scherffelia, Schizochlamydella, Schizochlamys, Schizomeris, Schizothrix, Schroederia, Scolioneis, Scotiella, Scotiellopsis, Scourfieldia, Scytonema, Slenastrum,
  • Sphaerellopsis Sphaerodinium, Sphaeroplea, Sphaerozosma, Spiniferomonas, Spirogyra, Spirotaenia, Spirulina, Spondylomorum, Spondylosium, Sporotetras, Spumella, Staurastrum, Stauerodesmus, Stauroneis, Staurosira, Staurrosiella, Stenopterobia, Stephanocostis,
  • bacteria cells include Escherichia coli, Oligotropha carboxidovorans, Pseudomononas sp. Alcaligenes eutrophus ( Cupriavidus necator), Bacillus licheniformis, Paenibacillus macerans, Rhodococcus erythropolis, Pseudomonas putida, Lactobacillus plantarum, Enterococcus faecium, Enterococcus gallinarium, Enterococcus faecalis, Bacillus subtilis, Cupriavidus basilensis, Cupriavidus campinensis, Cupriavidus gilardi, Cupriavidus laharsis, Cupriavidus metallidurans, Cupriavidus oxalaticus, Cupriavidus pauculus, Cupriavidus pinatubonensis, Cupriavidus respiraculi and Cupriavidus taiwanensis.
  • the bacterial cells include Escherichia
  • the recombinant cell may be a synthetic cell or a cell produced by a synthetic genome, as described in U.S. Patent Publication 2007/0264688, or 2007/0269862.
  • the cell may be a CHO cell, a COS cell, a VERO cell, a BHK cell, a HeLa cell, a Cvl cell, an MDCK cell, a 293 cell, a 3T3 cell, or a PC 12 cell.
  • this application discloses a recombinant cell genetically modified to express one or more heterologous enzymes having 3-ketoacyl-CoA synthase activity.
  • the heterologous enzymes having 3-ketoacyl-CoA synthase activity can comprise enzymes naturally found in Nocardia brasiliensis ATCC 700358 (SEQ ID NO: 1) (Designation: Nbra), Streptomyces sp. KO-3988 (SEQ ID NO: 2) (Designation: SK03), Streptomyces cinnamonensis (SEQ ID NO: 3) (Designation: Scin), Streptomyces tendae (SEQ ID NO: 4) (Designation: Sten), Streptomyces sp.
  • KO-3988 (SEQ ID NO: 5) (Designation: SK03-2), Austwickia chelonae NBRC 105200 (SEQ ID NO: 6) (Designation: Ache), Streptomyces sp. Mgl (SEQ ID NO: 7) (Designation: Smgl), Spirochaeta africana DSM 8902 (SEQ ID NO: 8) (Designation: Safr), Streptomyces sp. HK10576 (SEQ ID NO: 9) (Designation: SHK1), Streptomyces violaceusniger Tu 4113 (SEQ ID NO: 10) (Designation: Svio), Amycolatopsis sp.
  • the heterologous enzyme having 3-ketoacyl-CoA synthase activity comprises the amino acid sequence of SEQ ID NO: 1 and corresponds to the polypeptide designated as Nbra.
  • the heterologous enzyme having 3-ketoacyl- CoA synthase activity comprises a polypeptide having more than 50%, more than 55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, more than 85%, more than 90%, more than 91%, more than 92%, more than 93%, more than 94%, more than 95%, more than 96%, more than 97%, more than 98%, or more than 99% sequence identity to the amino acid sequence of SEQ ID NO: 1.
  • a recombinant cell genetically modified to express the polypeptide designated as Nbra produces fatty acids and/or fatty acid chain products.
  • the recombinant cell genetically modified to express the polypeptide designated as Nbra further comprises an engineered biosynthetic pathway to produce fatty acids and/or fatty acid chain products.
  • the recombinant cell genetically modified to express the polypeptide designated as Nbra further comprises an engineered biosynthetic pathway to produce FAME.
  • the recombinant cell expressing heterologous Nbra and comprising an engineered biosynthetic pathway can also produce a mixture of FAMEs with different carbon chain lengths.
  • the recombinant cell expressing heterologous Nbra can produce a FAME mixture comprising C6 FAME, C8 FAME, C10 FAME, and/or C12 FAME.
  • the recombinant cell expressing heterologous Nbra and comprising an engineered biosynthetic pathway can catalyze the addition of malonyl-CoA to a C2-CoA substrate to lead to a C4 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C4-CoA substrate to lead to a C6 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C6-C0A substrate to lead to a C8 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C8-C0A substrate to lead to a C10 fatty acid and/or fatty acid chain product, and/or the addition of malonyl-CoA to a ClO-CoA substrate to lead to a C 12 fatty acid and/or fatty acid chain product.
  • the recombinant cell expressing heterologous Nbra can also comprise one or more additional heterologous polypeptides with 3-ketoacyl-CoA synthase activity (e.g., a NphT7 polypeptide and/or a NphT7 polypeptide mutant as described herein) to increase production of fatty acids and/or fatty acid chain products.
  • additional heterologous polypeptides with 3-ketoacyl-CoA synthase activity e.g., a NphT7 polypeptide and/or a NphT7 polypeptide mutant as described herein
  • the heterologous enzyme having 3-ketoacyl-CoA synthase activity comprises the amino acid sequence of SEQ ID NO: 2 and corresponds to the polypeptide designated as SK03.
  • the heterologous enzyme having 3-ketoacyl- CoA synthase activity comprises a polypeptide having more than 50%, more than 55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, more than 85%, more than 90%, more than 91%, more than 92%, more than 93%, more than 94%, more than 95%, more than 96%, more than 97%, more than 98%, or more than 99% sequence identity to the amino acid sequence of SEQ ID NO: 2.
  • a recombinant cell genetically modified to express the polypeptide designated as SK03 produces fatty acids and/or fatty acid chain products.
  • the recombinant cell genetically modified to express the polypeptide designated as SK03 further comprises an engineered biosynthetic pathway to produce fatty acids and/or fatty acid chain products.
  • the recombinant cell genetically modified to express the polypeptide designated as SK03 further comprises an engineered biosynthetic pathway to produce FAME.
  • the recombinant cell expressing heterologous SK03 and comprising an engineered biosynthetic pathway can also produce a mixture of FAMEs with different carbon chain lengths.
  • the recombinant cell expressing heterologous SK03 can produce a FAME mixture comprising C6 FAME, C8 FAME, C10 FAME, and/or C12 FAME.
  • the recombinant cell expressing heterologous SK03 and comprising an engineered biosynthetic pathway can catalyze the addition of malonyl-CoA to a C2-CoA substrate to lead to a C4 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C4-CoA substrate to lead to a C6 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C6-C0A substrate to lead to a C8 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C8-C0A substrate to lead to a C10 fatty acid and/or fatty acid chain product, and/or the addition of malonyl-CoA to a ClO-CoA substrate to lead to a C 12 fatty acid and/or fatty acid chain product.
  • the recombinant cell expressing heterologous SK03 can also comprise one or more additional heterologous polypeptides with 3-ketoacyl-CoA synthase activity (e.g., a NphT7 polypeptide and/or a NphT7 polypeptide mutant as described herein) to increase production of fatty acids and/or fatty acid chain products.
  • additional heterologous polypeptides with 3-ketoacyl-CoA synthase activity e.g., a NphT7 polypeptide and/or a NphT7 polypeptide mutant as described herein
  • the heterologous enzyme having 3-ketoacyl-CoA synthase activity comprises the amino acid sequence of SEQ ID NO: 3 and corresponds to the polypeptide designated as Scin.
  • the heterologous enzyme having 3-ketoacyl- CoA synthase activity comprises a polypeptide having more than 50%, more than 55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, more than 85%, more than 90%, more than 91%, more than 92%, more than 93%, more than 94%, more than 95%, more than 96%, more than 97%, more than 98%, or more than 99% sequence identity to the amino acid sequence of SEQ ID NO: 3.
  • a recombinant cell genetically modified to express the polypeptide designated as Scin produces fatty acids and/or fatty acid chain products.
  • the recombinant cell genetically modified to express the polypeptide designated as Scin further comprises an engineered biosynthetic pathway to produce fatty acids and/or fatty acid chain products.
  • the recombinant cell genetically modified to express the polypeptide designated as Scin further comprises an engineered biosynthetic pathway to produce FAME.
  • the recombinant cell expressing heterologous Scin and comprising an engineered biosynthetic pathway can also produce a mixture of FAMEs with different carbon chain lengths.
  • the recombinant cell expressing heterologous Scin can produce a FAME mixture comprising C6 FAME, C8 FAME, C10 FAME, and C12 FAME.
  • the recombinant cell expressing heterologous Scin and comprising an engineered biosynthetic pathway can catalyze the addition of malonyl-CoA to a C2-CoA substrate to lead to a C4 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C4-CoA substrate to lead to a C6 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C6-C0A substrate to lead to a C8 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C8-C0A substrate to lead to a C 10 fatty acid and/or fatty acid chain product, and/or the addition of malonyl-CoA to a ClO-CoA substrate to lead to a C12 fatty acid and/or fatty acid chain product.
  • the recombinant cell expressing heterologous Scin and comprising an engineered biosynthetic pathway can
  • heterologous Scin can also comprise one or more additional heterologous polypeptides with 3- ketoacyl-CoA synthase activity (e.g., a NphT7 polypeptide and/or a NphT7 polypeptide mutant as described herein) to increase production of fatty acids and/or fatty acid chain products.
  • additional heterologous polypeptides with 3- ketoacyl-CoA synthase activity e.g., a NphT7 polypeptide and/or a NphT7 polypeptide mutant as described herein
  • the heterologous enzyme having 3-ketoacyl-CoA synthase activity comprises the amino acid sequence of SEQ ID NO: 4 and corresponds to the polypeptide designated as Sten.
  • the heterologous enzyme having 3-ketoacyl- CoA synthase activity comprises a polypeptide having more than 50%, more than 55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, more than 85%, more than 90%, more than 91%, more than 92%, more than 93%, more than 94%, more than 95%, more than 96%, more than 97%, more than 98%, or more than 99% sequence identity to the amino acid sequence of SEQ ID NO: 4.
  • a recombinant cell genetically modified to express the polypeptide designated as Sten produces fatty acids and/or fatty acid chain products.
  • the recombinant cell genetically modified to express the polypeptide designated as Sten further comprises an engineered biosynthetic pathway to produce fatty acids and/or fatty acid chain products.
  • the recombinant cell genetically modified to express the polypeptide designated as Sten further comprises an engineered biosynthetic pathway to produce FAME.
  • the recombinant cell expressing heterologous Sten and comprising an engineered biosynthetic pathway can also produce a mixture of FAMEs with different carbon chain lengths.
  • the recombinant cell expressing heterologous Sten can produce a FAME mixture comprising C6 FAME, C8 FAME, C10 FAME, and C12 FAME.
  • the recombinant cell expressing heterologous Sten and comprising an engineered biosynthetic pathway can catalyze the addition of malonyl-CoA to a C2-CoA substrate to lead to a C4 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C4-CoA substrate to lead to a C6 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C6-C0A substrate to lead to a C8 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C8-C0A substrate to lead to a C 10 fatty acid and/or fatty acid chain product, and/or the addition of malonyl-CoA to a ClO-CoA substrate to lead to a C12 fatty acid and/or fatty acid chain product.
  • the recombinant cell expressing heterologous Sten and comprising an engineered biosynthetic pathway can
  • heterologous Sten can also comprise one or more additional heterologous polypeptides with 3- ketoacyl-CoA synthase activity (e.g., a NphT7 polypeptide and/or a NphT7 polypeptide mutant as described herein) to increase production of fatty acids and/or fatty acid chain products.
  • additional heterologous polypeptides with 3- ketoacyl-CoA synthase activity e.g., a NphT7 polypeptide and/or a NphT7 polypeptide mutant as described herein
  • the heterologous enzyme having 3-ketoacyl-CoA synthase activity comprises the amino acid sequence of any one of SEQ ID NOs: 5-15 and corresponds to the polypeptides designated as SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 and Ctet, respectively.
  • the heterologous enzyme having 3-ketoacyl- CoA synthase activity comprises a polypeptide having more than 50%, more than 55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, more than 85%, more than 90%, more than 91%, more than 92%, more than 93%, more than 94%, more than 95%, more than 96%, more than 97%, more than 98%, or more than 99% sequence identity to the amino acid sequence of any one of SEQ ID NOs: 5-15.
  • a recombinant cell genetically modified to express any one of the polypeptides designated as SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 or Ctet produces fatty acids and/or fatty acid chain products.
  • the recombinant cell genetically modified to express the polypeptide designated as SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 or Ctet further comprises an engineered biosynthetic pathway to produce fatty acids and/or fatty acid chain products.
  • the recombinant cell genetically modified to express the polypeptide designated as SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 or Ctet further comprises an engineered biosynthetic pathway to produce FAME.
  • each of the heterologous polypeptides with 3-ketoacyl-CoA synthase activity identified above can comprise one or more identified amino acid substitutions within its amino acid sequence to alter a specific activity against a specific CoA substrate when compared to the respective unsubstituted heterologous polypeptide.
  • These heterologous polypeptides with 3-ketoacyl-CoA synthase activity with identified amino acid substitutions can be considered variants or mutants.
  • one or more identified amino acid substitutions in the variant can alter the specific activity of the respective heterologous polypeptide with 3- ketoacyl-CoA synthase against a C4-CoA substrate.
  • a variant can comprise increased specific activity for certain CoA substrate and can concomitantly produce a certain 3-keto-CoA intermediate.
  • a variant with increased specific activity for a C6-C0A substrate will produce a 3-keto-C8-CoA product leading to C8 fatty acids and/or C8 fatty acid chain products.
  • a variant with decreased specific activity against certain CoA may also be desirable to reduce production of certain unwanted fatty acids and/or fatty acid chain products.
  • a variant with decreased specific activity for a C8-C0A substrate will produce less 3-keto-Cl0- CoA product leading to reduced amounts of C10 fatty acids and/or C10 fatty acid chain products.
  • a recombinant cell with a variant with one or more identified amino acid substitutions can comprise increased total FAME production; in other cases, the variant results in decreased total FAME production compared to the wild-type enzyme.
  • a recombinant cell with a variant with one or more identified amino acid substitutions can also comprise altered ratios of FAME production by carbon chain length when compared to the respective polypeptide with unaltered sequence.
  • a recombinant cell with a variant with one or more identified amino acid substitutions can comprise altered ratios of FAME production by carbon chain length of one or more of C6-FAME (methyl hexanoate), C8-FAME (methyl octanoate), C10-FAME (methyl decanoate), and C12-FAME (methyl dodecanoate).
  • the identified amino acid substitutions can also comprise one or more amino acid substitutions at amino acid positions of Nbra, SK03, Scin, Sten, SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 and Ctet that align to amino acid positions 147 and/or 217 of Npht7 (see, for example, the sequence alignments in Figs. 1 and 2).
  • a Nbra (SV) variant with more than one identified amino acid substitutions can comprise an altered specific activity for C2-CoA, C4-CoA, C6-C0A, and C8- CoA substrates when compared to the respective polypeptide with unaltered sequence.
  • a recombinant cell with a Nbra (SV) variant and an engineered biosynthetic pathway can comprise an increased total FAME production of a mixture of C6-FAME (methyl hexanoate), C8-FAME (methyl octanoate), C10-FAME (methyl decanoate), and C12-FAME (methyl dodecanoate) when compared to the respective polypeptide with unaltered sequence.
  • the recombinant cell with a Nbra (SV) variant and an engineered biosynthetic pathway can comprise an increased total FAME production when compared to the respective polypeptide with unaltered sequence. It is noted that the wild-type Nbra with an engineered biosynthetic pathway showed little to no ability to produce FAME.
  • a Sten (SV) variant with more than one identified amino acid substitution can comprise an altered specific activity for C2-CoA, C4-CoA, C6-C0A, and C8- CoA substrates when compared to the respective polypeptide with unaltered sequence.
  • a recombinant cell with a Sten (SV) variant can comprise a decrease in total FAME production of a mixture of C6-FAME (methyl hexanoate), C8-FAME (methyl octanoate), C10- FAME (methyl decanoate), and C12-FAME (methyl dodecanoate) when compared to the respective polypeptide with unaltered sequence.
  • the recombinant cell with a Sten (SV) variant can comprise a decrease in total FAME production when compared to the respective polypeptide with unaltered sequence. It is noted that a recombinant cell with a Sten (SV) variant produces little to no FAME.
  • a SK03 (SV) variant with more than one identified amino acid substitution can comprise an altered specific activity for C2-CoA, C4-CoA, C6-C0A, and C8- CoA substrates when compared to the respective polypeptide with unaltered sequence.
  • a recombinant cell with a SK03 (SV) variant and an engineered biosynthetic pathway can comprise an increased total FAME production of a mixture of C6-FAME (methyl hexanoate), C8-FAME (methyl octanoate), C10-FAME (methyl decanoate), and C12-FAME (methyl dodecanoate) when compared to the respective polypeptide with unaltered sequence.
  • the recombinant cell with a SK03 (SV) variant can comprise an increased total FAME production when compared to the respective polypeptide with unaltered sequence.
  • a Scin (SV) variant with more than one identified amino acid substitution can comprise an altered specific activity for C2-CoA, C4-CoA, C6-C0A, and C8- CoA substrates when compared to the respective polypeptide with unaltered sequence.
  • a recombinant cell with a Scin (SV) variant and an engineered biosynthetic pathway can comprise a decreased total FAME production of a mixture of C6-FAME (methyl hexanoate), C8-FAME (methyl octanoate), C10-FAME (methyl decanoate), and C12-FAME (methyl dodecanoate) when compared to the respective polypeptide with unaltered sequence.
  • the recombinant cell with a Scin (SV) variant can comprise a decreased total FAME production when compared to the respective polypeptide with unaltered sequence.
  • the recombinant cell can comprise more than one heterologous enzyme having 3-ketoacyl-CoA synthase activity.
  • the recombinant cell can comprise one or more of Nbra, SK03, Scin, Sten, SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 and Ctet, variant(s) corresponding to I147S and/or F217V (NphT7 numbering) of Nbra, SK03, Scin, Sten, SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 and Ctet, and additional enzymes having 3-ketoacyl-CoA synthase activity (e.g., NphT7 (SEQ ID NO: 16), NphT7(SV) variant (SEQ ID NO: 17), and NphT7
  • NphT7 SEQ
  • the recombinant cell can comprise heterologous 3-ketoacyl- CoA synthase enzymes having complementary specific activities.
  • a heterologous NphT7 comprising high specific activity for C2-CoA substrates can be combined with another heterologous 3-ketoacyl-CoA synthase enzyme.
  • a heterologous NphT7(LSVA) variant comprising high specific activity for C4-CoA substrates can be combined with another heterologous 3-ketoacyl-CoA synthase enzyme.
  • a recombinant cell comprising heterologous Nbra (SV) variant, with and without heterologous NphT7 comprises an increase in total FAME production compared to a recombinant cell comprising heterologous wild-type Nbra, with or without heterologous NphT7.
  • a recombinant cell comprising heterologous Sten and heterologous NphT7 comprises increased total FAME production compared to a recombinant cell comprising heterologous Sten and no heterologous NphT7.
  • a recombinant cell comprising heterologous Sten (SV) variant and heterologous NphT7 comprises a decrease in total FAME production compared to a recombinant cell comprising heterologous wild-type Sten and heterologous NphT7. In the absence of NphT7, no FAME was produced from a recombinant cell comprising the heterologous Sten (SV) mutant.
  • a recombinant cell comprising heterologous SK03, with and without NphT7 comprises FAME production.
  • a recombinant cell comprising heterologous SK03 (SV) variant, with and without heterologous NphT7 comprises FAME production.
  • a recombinant cell comprising heterologous Scin, with and wthout NphT7 comprises FAME production.
  • a recombinant cell comprising heterologous Scin (SV) variant, with and without heterologous NphT7 comprises FAME production.
  • this application discloses a recombinant cell with an engineered biosynthetic pathway comprising one or more heterologous enzymes that convert chemical precursors and/or substrates into desired chemical products.
  • the engineered biosynthetic pathway comprises a pathway of heterologous enzymes to synthesize the desired chemical product.
  • individual heterologous enzymes work in a stepwise fashion to convert a precursor into the desired chemical product.
  • the engineered biosynthetic pathway can comprise one or more identified heterologous enzymes having 3-ketoacyl-CoA synthase activity to produce fatty acids and fatty acid chain products of C4-C10 chain length.
  • the engineered biosynthetic pathway can comprise one or more of Nbra, SK03, Scin, Sten, SK03-2, Ache, Smgl, Safr, SHK1, Svio AATC, Ubac, Gsul, Abay-2, Ctet, and/or their respective variants to produce fatty acids and fatty acid chain products of C4-C10 chain length.
  • the engineered biosynthetic pathway can also comprise additional heterologous enzyme(s) that work in combination with one or more of Nbra, SK03, Scin, Sten, SK03-2, Ache, Smgl, Safr, SHK1, Svio AATC, Ubac, Gsul, Abay-2, Ctet, and/or their respective variants to produce fatty acids and fatty acid chain products of C4-C10 chain length.
  • additional heterologous enzyme(s) that work in combination with one or more of Nbra, SK03, Scin, Sten, SK03-2, Ache, Smgl, Safr, SHK1, Svio AATC, Ubac, Gsul, Abay-2, Ctet, and/or their respective variants to produce fatty acids and fatty acid chain products of C4-C10 chain length.
  • the recombinant cell comprises one or more of Nbra, SK03,
  • the reduction takes place in three steps, the first being the reduction of the 3-ketoacyl group to the corresponding 3-hydroxyacyl group.
  • the second reaction is a dehydration to the corresponding trans-2-enoylacyl compound, which is reduced in a third step to the corresponding acyl-CoA.
  • the first reaction step is enzymatically catalyzed by a keto-CoA reductase (KCR) enzyme (EC 1.1.1.35).
  • KCR keto-CoA reductase
  • the second step is enzymatically catalyzed by a 3-hydroxy-acyl-CoA dehydratase (3HDh) enzyme (EC 4.2.1.17).
  • Some bifunctional enzymes catalyze both of the first and second step reactions (EC 1.1.1.35 and EC 4.2.1.55).
  • the third reaction step is enzymatically catalyzed by an enoyl-CoA reductase (ECR) enzyme (EC 1.1.1.32).
  • the engineered biosynthetic pathway can further comprises at least one of (1) a heterologous KCR gene that encodes for a KCR enzyme; (2) a heterologous 3HDh gene that encodes for a 3HDh enzyme; (3) a heterologous gene that encodes for a bifunctional enzyme that catalyzes both of the first and second reaction steps (EC 1.1.1.35 and 4.1.2.55) and (4) a heterologous ECR gene that encodes for an ECR enzyme.
  • the recombinant cell contains at least (1), (2) and (4) or at least (3) and (4).
  • the recombinant cell further comprises (5) at least one heterologous 3-ketoacy-CoA synthase gene, different from the 3-ketoacyl-CoA synthases described above, which encodes for a 3-ketoacyl- CoA synthase.
  • the gene can be under the control of a promoter and/or terminator sequences active in the recombinant cell.
  • the KCR enzyme may be, for example, one encoded by a P. aeruginosa fadB gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 36, one encoded by a P. aeruginosa fadG gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 102, one encoded by a C. beijerinckii hbd gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 101, and others as described in WO 2015/010103.
  • the 3HDh enzyme may be, for example, one encoded by a C. acetobutylicum crt
  • short-chain-enoyl-CoA hydratase gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 99, one encoded by a P. putida ech (enoyl-CoA hydratase/aldolase) gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 100, and others as described in WO 2015/010103.
  • Suitable bifunctional enzymes that catalyze both the first and second reaction steps include, for example, one encoded by an E. colifadB gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 98; one encoded by an R. novegicus ech2 gene, and others as described in WO 2015/010103.
  • Suitable ECR enzymes include, for example, one encoded by a T denticola ter gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 84.
  • the recombinant cell can further include at least one heterologous 3-ketobutyryl-
  • CoA synthase gene different from the modified 3-ketoacyl-CoA synthases described above, which encodes for a 3-ketobutyryl-CoA synthase.
  • the heterologous 3-ketobutyryl-CoA synthase gene may encode for a 3-ketobutyryl-CoA synthase enzyme that is at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to any of those identified as SEQ ID NOs: 1-120 of WO 2015/10103.
  • heterologous 3-ketobutyryl-CoA synthase gene is a
  • the recombinant cell includes at least one gene that encodes for a modified NphT7 enzyme as described in WO 2015/10103.
  • the modified NphT7 enzyme comprises an amino acid sequence having at least 70% but less than 100% sequence identity to SEQ ID NO: 16.
  • the modified NphT7 enzyme may have, for example, one or more amino acid substitutions selected from the group consisting of H100L, I147T, F217V, Y 144L, V157F, G309S, G288S, a PDRP to HFLQ substitution for amino acid residues 86-89, 1147F, I147M, I147Q, I147S, I147C, 1147E, I147N, I147W, I147D, I147R, I147P, I147L, V196G, I147G, I147H, I147K, I147V, 1147 A, I147Y, F217G, F217A, F217L, F217I, F217M, F217T, F217P, F217S, F217E, F217L, F217V, F217W, S323A and S323V, and any combination of any two or more thereof.
  • amino acid substitutions selected from the group consisting of H100L,
  • the modified NphT7 enzyme comprises at least one amino acid substitution selected from the group consisting of I147V, I147S, I147T, and at least one additional amino acid substitution selected from H100L, F217V, S323A and S323V.
  • the modified NphT7 enzyme corresponds to SEQ ID NO: 33.
  • the modified NphT7 enzyme comprises an I147V, I147S or I147T amino acid substitution and an S323A amino acid substitution (corresponding to SEQ ID NO: 33 in which amino acid 100 is H, amino acid 147 is V, S or T, amino acid 217 is F and amino acid 323 is A).
  • the modified NphT7 enzyme comprises an H100L substitution, an I147V, I147S or I147T amino acid substitution, an F217V substitution and an S323A amino acid substitution (corresponding to SEQ ID NO: 33 in which amino acid residue 100 is L, amino acid residue 147 is V, S or T, amino acid residue 217 is V and amino acid residue 323 is A).
  • the recombinant cell includes both of (1) a Streptomyces Sp
  • the recombinant cell includes a gene that encodes for an enzyme having SEQ ID NO: 16 and another gene that encodes for an enzyme having SEQ ID NO: 33. In some aspects, the recombinant cell includes a gene that encodes for an enzyme having SEQ ID NO: 16 and another gene that encodes for an enzyme having SEQ ID NO: 33 in which amino acid residue 100 is H or L, amino acid residue 147 is S, T or V, amino acid residue 271 is F or V and amino acid residue 323 is A.
  • the recombinant cell can further include at least one heterologous 3-ketoacy-
  • the heterologous 3-ketoacyl-CoA synthase gene may encode for a 3-ketoacyl-CoA synthase enzyme that is at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to any of those identified as SEQ ID NO: 1-81, 86-93, 97, 105-116, and 119-171 in PCT/US 18/16394.
  • the 3-ketoacyl-CoA synthase gene is an Acinetobacter schindleri
  • CIP 107287 gene Designation: Asch
  • a gene that encodes for a 3-ketoacyl-CoA synthase enzyme that is at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 18-20, 22 and/or 26.
  • the recombinant cell includes at least one gene that encodes for a modified Asch enzyme as described in PCT/US18/16394.
  • the modified Asch enzyme comprises an amino acid sequence having at least 70% but less than 100% sequence identity to SEQ ID NO: 26.
  • the modified Asch enzyme may have, for example, one or more amino acid substitutions selected from the group consisting of T184I, F236L, V268A, V296A, V317A, and S328G, and any combination of any two or more thereof.
  • the modified Asch enzyme may have altered specific activities relative to an unmodified Asch enzyme.
  • a modified Asch enzyme may have amino acid substitutions comprising T184I, F236L, V268A, V296A, V317A, and S328G (SEQ ID NO: 19) and may result in C8 FAME as the most abundant product.
  • a modified Asch enzyme may have amino acid substitutions comprising T184I, V296A, and V268A (SEQ ID NO: 22) and may result in a mixture of C8 FAME and C10 FAME as the most abundant products.
  • a modified Asch enzyme may have amino acid substitutions comprising V296A (SEQ ID NO: 18) and may result in C10 FAME as the most abundant product.
  • the recombinant cell can produce one or more enzymes that terminate the acyl elongation cycle and produce a product having the desired chain length.
  • a termination enzyme may or may not be heterologous.
  • the selection of termination enzyme may depend on whether the desired product is a fatty acid or a derivative thereof such as a fatty alcohol, a fatty aldehyde, a fatty alkene, a fatty amide, a fatty ester or a fatty alkane.
  • the recombinant cell in some aspects includes a heterologous thioesterase gene that encodes for a thioesterase such as an acyl-CoA esterase, in which case the product will be a fatty acid.
  • a thioesterase such as an acyl-CoA esterase
  • Suitable thioesterases include those described in Table 11 of WO 2015/101013.
  • the recombinant cell includes a gene that encodes for an ester synthase, in which case the product typically is a fatty acid ester.
  • Suitable ester synthases have amino acid sequences at least 80%, at least 90%, at least 95%, at least 99% or at least 100% identical to any of the Marinobacter aquacolei Maql enzyme (SEQ ID NO: 289 of WO
  • the ester synthase may have an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or at least 100% identical to the Hahella chejuensis Hche ester synthase (SEQ ID NO: 37).
  • the recombinant cell may also include one or more genes that encode for one or more of a fatty acyl-CoA reductase (alcohol or aldehyde forming), a fatty aldehyde reductase, an acyl-ACP reductase, an acyl-CoA:ACP acyltransferase, an acyl-CoA hydrolase, a carboxylic acid reductase, an aldehyde dehydrogenase and/or an acyl-ACP reductase.
  • a fatty acyl-CoA reductase alcohol or aldehyde forming
  • a fatty aldehyde reductase an acyl-ACP reductase
  • an acyl-CoA:ACP acyltransferase an acyl-CoA hydrolase
  • carboxylic acid reductase an aldehyde dehydrogenase
  • the recombinant cell also may include (A) one or more genes that encode for a carboxyl transferase subunit a enzyme, (EC 6.3.1.2) such as an E. coli AccA enzyme or an enzyme that is at least 80%, at least 90%, at least 95% or at least 99% identical thereto; (B) one or more genes that encode for a biotin carboxyl carrier protein, (EC 6.4.1.2) such as an E. coli AccB enzyme (SEQ ID NO: 29) or an enzyme that is at least 80%, at least 90%, at least 95% or at least 99% identical thereto; (C) one or more genes that encode for a biotin carboxylase subunit enzyme, (EC 6.3.4.14) such as an E.
  • coli AccC enzyme SEQ ID NO: 30 or an enzyme that is at least 80%, at least 90%, at least 95% or at least 99% identical thereto;
  • D a carboxyl transferase subunit b (EC 6.4.1.2), such as an E. coli AccD enzyme or an enzyme that is at least 80%, at least 90%, at least 95% or at least 99% identical thereto,
  • E a fused E. coli AccD subunit and AccA subunit enzyme (SEQ ID NO: 28); or a combination of any two or more thereof.
  • all of (A) - (E) are present.
  • the recombinant cell described herein further comprises one or more additional genetic modifications to reduce or eliminate the expression of certain endogenous enzymes in the recombinant cell. Reducing or eliminating the expression of these certain endogenous enzymes in the recombinant cell can increase the production of desired products, such as fatty acids and/or fatty acid chain products.
  • These reduced or eliminated endogenous enzymes include one or more of the following enzymes:
  • Methylglyoxal synthase (EC 4.2.3.3), for example that encoded by the E. coli mgsA gene.
  • Lactate dehydrogenase (EC 1.1.1.27), for example that encoded by the E. coli IdhA gene.
  • Phosphotransacetylase (EC 2.3.1.8), for example that encoded by the E. coli pta gene.
  • Acetate kinase (EC 2.7.2.1), for example that encoded by E. coli ackA gene.
  • Acyl-CoA synthase (EC 6.2.1.3), for example that encoded by the E. colifadD gene.
  • Pyruvate formate lyase (EC 2.3.1.54), for example that encoded by the E.coli pflB gene.
  • Pyruvate oxidase (EC 1.2.2.2), for example that encoded by the E. coli poxB gene.
  • Fused acetaldehyde-CoA dehydrogense (EC 1.2.1.10), for example that encoded by the E. coli adhE gene.
  • Trigger factor (EC 5.2.1.8), for example that encoded by the E. coli tig gene.
  • Restriction endonuclease (EC 3.1.21.3), for example that encoded by the E. coli hsdr514 gene.
  • the atoDAEB operon The atoDAEB operon.
  • Acyl-CoA thioesterase (EC 3.1.2.-), for example that encoded by the E. coli tesB or yciA gene.
  • Acyl-coenzyme A dehydrogenase (EC 1.3.8.7), for example that encoded by the E. coli fadE gene.
  • 3-ketoacyl-CoA thiolase (EC 2.3.1.16), for example that encoded by the E. colifadA gene.
  • L-ribulokinase (EC2.7.1.16), for example that encoded by the E. coli araB gene.
  • L-ribulose-5-phosphate-4-epimerase (EC 5.1.3.4), for example that encoded by the E. coli araD gene.
  • Beta-D-galactosidase (EC 3.2.1.23), for example that encoded by the E.coli lacZ gene. Lambda phage lysogen.
  • Rhamnulose-l -phosphate aldolase (EC 4.1.2.19), for example that encoded by the E. coli rhaD gene.
  • Rhamnulokinase (EC 2.7.1.5), for example that encoded by the E. coli rhaB gene.
  • Truncated RNase PH (EC 2.7.7.56) Rph-1 gene.
  • Any heterologous gene may be operatively linked to a promoter and/or terminator sequence that is functional in the recombinant strain.
  • the promoter may be an inducible promoter that functions only under certain conditions.
  • a low phosphate inducible promoter such as the promoter of the wild-type E. coli phoE gene (PphoE) is a useful promoter for the 3-ketoacyl synthase gene.
  • PphoE wild-type E. coli phoE gene
  • Such a promoter is active in a low phosphate environment. Accordingly, a recombinant cell in which the 3-ketoacyl synthase gene is under the control of an E.
  • the coli phoE promoter or another low phosphate inducible promoter may be cultivated in a fermentation medium containing no more than 25 mM phosphate, especially no more than 20 mM, no more than 2 mM, no more than lmM, no more than 0.5 mM, or no more than 0.25 mM phosphate.
  • the promoter that is a low phosphate inducible promoter is the promoter for the pstS gene ( PpstS ). This promoter may be constructed to include a binding site for Integration Host Factor and is thus designated PpstSIH (Lyzen et ak, Plasmid 60:125 (2008)).
  • Any heterologous gene may be integrated into the genome of the recombinant strain and/or present in one or more plasmids. If integrated into the genome, the heterologous gene may be inserted at a targeted or random location. Transformation methods such as electroporation and chemical methods (including calcium chloride and/or lithium acetate methods) known in the art are suitable. Examples of suitable transformation methods are described, for example, in Molecular Cloning: A Laboratory Manual, 4 th Ed. Spring Harbor Press 2012. In general, no special transformation methods are necessary to produce the recombinant cells.
  • Deletions and/or disruptions of native genes can be performed by transformation methods, by mutagenesis and/or by forced evolution methods.
  • mutagenesis methods cells are exposed to ultraviolet radiation or a mutagenic substance, under conditions sufficient to achieve a high kill rate (60-99.9%, preferably 90-99.9%) of the cells.
  • Surviving cells are then plated and selected or screened for cells having the deleted or disrupted metabolic activity.
  • Disruption or deletion of the desired native gene(s) can be confirmed through PCR or Southern analysis methods.
  • the recombinant cells described herein are used to produce compounds having a straight-chain alkyl group.
  • the recombinant cells are grown under conditions such that they produce such compounds, and the compounds may be recovered.
  • the recombinant cell is a plant cell
  • the plant can be grown and the compound having the straight-chain alkyl group can be recovered from the plant or any portion thereof, such as roots, stems, leaves, flowers, seeds, seed pods and the like, in which the compound accumulates during the growth of the plant.
  • Single-cell and other microcells can be used in a culturing process to produce such compounds.
  • Culturing is performed generally by forming a culture medium that includes at least one carbon source that is capable of being metabolized by the recombinant cell to produce the product compounds and nutrients as may be required by the specific recombinant cell.
  • the nutrients may include, for example, at least one nitrogen source such as yeast extract, peptone, tryptone, soy flour, com steep liquor, or casein, at least one phosphorus source, one or more vitamins such as biotin, vitamin B12 and derivatives of vitamin B 12, thiamin, pantothenate, one or more trace metals and the like.
  • the fermentation medium may also contain additional materials such as anti-foam agents, biocides, buffers and the like.
  • the culture medium may also include a reagent that reacts with the straight-chain compound to produce the desired product.
  • the culture medium preferably contains an alkanol such as methanol, ethanol or a C3-C8 alkanol. The alkanol reacts to produce the corresponding ester.
  • a native or heterologous ester synthase, or other appropriate enzyme, may be expressed by the recombinant cell to catalyze such a reaction.
  • the culture medium is inoculated with the recombinant cell, and the inoculum is cultured in the medium so that the cell density increases to a cell density suitable for production.
  • the culture medium is then maintained at conditions sufficient for the recombinant cells to produce the desired product.
  • Suitable culture conditions will of course depend on the requirements of the particular recombinant strain.
  • the temperature of the culture medium may be, for example from 20°C to 70°C, with a temperature of 25 to 40°C being preferred for most recombinant cells.
  • the pH of the culture medium may be, for example, from 2.0 to 10.0, from 3.0 to
  • the culturing may be performed under aerobic, microaerobic, or anaerobic conditions, as required or can be tolerated by the particular recombinant cell.
  • the equipment may include, for example, a tank suitable for holding the recombinant cell and the culture medium; a line for discharging contents from the culture tank to an extraction and/or separation vessel; and an extraction and/or separation vessel suitable for removal of the chemical product from cell culture waste.
  • the carbon source is one or more carbon-containing compounds that can be metabolized by the recombinant cell as a source of carbon.
  • suitable carbon sources include sugars such as glucose, sucrose, fructose, lactose, C-5 sugars such as xylose and arabinose, glycerol and polysaccharides such as starch and cellulose.
  • Other suitable carbon sources include fermentable sugars as may be obtained from cellulosic and lignocellulosic biomass through processes of pretreatment and saccharification, as described, for example, in U.S. Patent Publication No. 2007/0031918A1, hemicellulose, lignin, starch, oligosaccharides and/or monosaccharides.
  • suitable carbon sources include high-fructose com syrup, cheese whey permeate, cornsteep liquor, sugar beet molasses, and barley malt. Still other suitable carbon sources include carbon dioxide, carbon monoxide, methanol, methylamine and glucosamine.
  • the culturing process may be continued until a titer of the desired product reaches at least 0.01, at least 0.05, at least 0.1, at least 0.25, at least 0.5 or at least 1 g per liter of culture medium (g/L).
  • the fermentation process may be continued until the titer reaches, for example, up to 40, up to 45, up to 50, up to 80, up to 100, or up to 120 g/L.
  • the specific productivity may be, for example, from 0.01 and 0.60 grams of the desired product per gram of cells on a dry weight basis per hour (g chemical product/g DCW-hr).
  • the volumetric productivity achieved may be at least 0.005 g of the desired product per liter per hour (g/L-hr), at least 0.01 g/L-hr, at least 0.1 g/L-hr or at least 0.5 g/L-hr, and may be up to, for example, 10 g/L-hr, up to 5 g/L-hr or up to 1 g/L-hr.
  • (culture) period may be greater than about 0.01, 0.05, 0.10, 0.20, 0.5, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0, 11.0 or 12.0 grams of chemical product per gram DCW of cells (based on the final DCW at the end of the 24-hour period).
  • NphT7 acetoacetyl-CoA via the enzyme NphT7.
  • the saturated C4-CoA is then reacted with malonyl-CoA to create a C6-C0A species that also undergoes iterative keto reduction, dehydration and enoyl reduction to make a saturated C6-C0A.
  • the enzyme catalyzing this second reaction is also NphT7, but a variant that has been engineered to accept the saturated C4-CoA by mutagenesis, called NphT7(LSVA). Together these two enzymes can initiate FAME biosynthesis, however only the LSVA variant is is needed for FAME production.
  • NphT7(LSVA) NphT7(LSVA)
  • this variant can extend from a C2- CoA to a C6-C0A molecule
  • the wild-type enzyme can only extend from C2-CoA to C4-CoA molecule.
  • the pathway typically benefits from expression of wild type NphT7, however it is not required.
  • the current FAME pathway benefits from an additional 3-ketoacyl-CoA synthase, provided by heterologous Asch or one of its variants engineered for increased chain length specificity compared to wildtype Asch.
  • enzymes which can natively catalyze the C2-CoA to
  • NphT7 A subset of the same mutations introduced into NphT7 that broaden the substrate specificity and extend the catalytic activity from C2-CoA to C6-C0A can also be introduced into the enzmyes described herein.
  • These two mutations (called SV) enable some enzymes that otherwise show no ablity to produce FAME to now produce FAME in the context of the pathway, and in some cases, behave similarly to NphT7(LSVA), an example of such is provided by Nbra.
  • LSVA an example of such is provided by Nbra.
  • the SV mutations are what allow the enzymes to function in the FAME pathway. However, not all of the enzymes behave in this fashion, for example, wild type Sten produces some FAME; however, as a SV mutant it does not.
  • enzymes that have the highest activity on C2-CoA as well as specific enzymes that have the highest activity on C4-CoA.
  • SV mutations to change the substrate specificity of the enzymes. Although enzymes can have activity on both substrates, C2-CoA and C4-CoA, the mutations can change the relative activity on each substrate and as a consequence influence total FAME production (and the amounts of each individual FAME that is produced).
  • Recombinant Strain 1 is a mutant of the E. coli strain designated BW25113, available from the E. coli Genetic Strain Center (CGSC#7636; Dept of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, Connecticut), having the following additional genetic modifications:
  • Recombinant Strain 2 is a mutant of the BW25113 E. coli strain with the following genetic modifications:
  • Recombinant Strain 3 is a mutant of the BW25113 E. coli strain with the following genetic modifications:
  • Recombinant Strain 4 is a mutant of the BW25113 E. coli strain with the following genetic modifications:
  • Mutant E. coli strains are prepared using standard electroporation methods. In each case, the host strain is transformed with a“Type 1” plasmid and a“Type 2” plasmid as described below.
  • Type 1 plasmids are pACYC plasmids containing the pl5a origin of replication and a chloramphenicol resistance marker:
  • Type 1A This plasmid includes a gene encoding for a 3-ketoacyl-CoA synthase or a mutated variant.
  • the 3-ketoacyl-CoA synthase genes are as indicated in the specific examples below.
  • This plasmid includes an E. coli bifunctional 3-hydroxyacyl-CoA
  • dehydrogenase/dehydratase (fadB) gene (SEQ. ID NO. 36) and a T. denticola enoyl-CoA (ter) gene (SEQ. ID NO. 34) cassette, all under a native E. coli pstSIH promoter (SEQ. ID NO. 39) and a native E. coli terminator.
  • This plasmid also contains a Hahella chejuensis ester synthase gene (SEQ. ID NO. 37) fused to a DNA sequence encoding a protein fragment containing 6 histidine residues and a protease recognition site under an E. coli phoE promoter (SEQ. ID NO.
  • an ACC (acetyl-CoA carboxylase) cassette including fused E. coli accA and accD genes with a E. coli tpiA promoter (SEQ ID NO: 31) and a cassette including the E. coli accB and E. coli accC genes under an E. coli rpiA promoter (SEQ ID NO: 32).
  • Type 1B This plasmid includes a gene encoding for a 3-ketoacyl-CoA synthase or a mutated variant.
  • the 3-ketoacyl-CoA synthase genes are as indicated in the specific examples below.
  • This plasmid also includes an E. coli bifunctional 3-hydroxyacyl-CoA dehydrogenase/dehydratase (fadB) gene (SEQ ID NO: 36) and a T denticola enoyl-CoA (ter) gene (SEQ ID NO: 34) cassette, all under a native E. coli pstSIH promoter (SEQ ID NO: 39) and a native E. coli terminator.
  • Type 2 plasmids are pET Plasmids containing a ColEl origin of replication and a kanamycin resistance marker:
  • Type 2A This plasmid includes a gene encoding Asch or a variant of Asch with various ratios of C8 and C10 FAME production under an E. coli phoE promoter (SEQ ID NO: 38) and an E. coli terminator. The Asch genes are as indicated in the specific examples below.
  • Type 2B This plasmid includes a gene encoding Asch or a variant of Asch with various ratios of C8 and C10 FAME production fused to a DNA sequence encoding a protein fragment containing 6 histidine residues and a protease recognition site. The Asch genes are as indicated in the specific examples below.
  • this plasmid also contains a Hahella chejuensis ester synthase gene (SEQ ID NO: 37) fused to a DNA sequence encoding a protein fragment containing 6 histidine residues and a protease recognition site. Both genes are under an E. coli phoE promoter (SEQ ID NO: 38).
  • 3-ketoacyl-CoA synthase genes are synthesized based on published polypeptide sequence information for various wild type enzymes, and using codons optimized for expression in E. coli. Site-specific mutants of the synthesized 3-ketoacyl-CoA synthase genes are generated by oligonucleotide-directed mutagenesis.
  • the sources of the wild-type genes, the short-hand designations used herein for each of them, and the amino acid sequence of the native enzyme produced by the wild-type genes are as follows:
  • GeneInfo Identifier refers to sequence identification numbers used by the GenBank genetic sequence database at the National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, Maryland, USA.
  • Mutations to the amino acid residues encoded by the wild- type genes are designated herein by the shorthand designation for the wild-type enzyme, followed in parentheses by a 3-, 4- or 5 character code consisting of a first letter designating the amino acid residue in the native enzyme, a 1-, 2- or 3-digit number indicating the position of that amino acid residue in the native enzyme, and a final letter designating the amino acid residue in that position in the mutated enzyme.
  • mutations to the amino acid residues encoded by the wild-type genes can also be designated herein by the shorthand designation for the wild-type enzyme, followed in parenthesis by the final letter or letters designating the amino acid residue in that was mutated.
  • the single-letter designations are IUPAC amino acid abbreviations as reported, for example, at Eur. J. Biochem. 138:9-37(1984).
  • one of the above described recombinant strains (e.g. Recombinant Strain 1, Recombinant Strain 2, Recombinant Strain 3 or Recombinant Strain 4) is transformed with a “Type 1” plasmid (e.g. Type 1A or Type 1B) and an above-described“Type 2” plasmid (e.g. Type 2A or Type 2B).
  • a “Type 1” plasmid e.g. Type 1A or Type 1B
  • an above-described“Type 2” plasmid e.g. Type 2A or Type 2B
  • the 3-ketoacyl-CoA synthase and the promoter for the 3-ketoacyl-CoA synthase gene are as indicated in the specific examples below.
  • Seed 2 flasks contain 25-30 ml synthetic medium containing salts, glucose, NH 4 Cl, and is supplemented with vitamins, yeast extract, 35 pg/ml kanamycin, 20 pg/ml chloramphenicol, and 0-2% methanol. Seed 2 flasks are incubated at 32°C for 6-7 hours.
  • the ODeoo of a 1: 10 dilution of this culture is determined in order to inoculate a production flask to a final ODeoo of 0.01-0.025.
  • Production flasks contain 25 ml synthetic medium containing salts, glucose, NH 4 Cl, and supplemented with vitamins, yeast extract, 35 pg/ml kanamycin, 20 pg/ml chloramphenicol, 1.25-2.5 mM phosphate, 0-2% methanol, 3 - 6g/l glucose, and 15-40 g/l glycerol.
  • This synthetic medium functions as a limited phosphate medium that promotes the activity of a low phosphate inducible promoter such as the E.
  • in vitro assays are used to determine a specific activity of a 3-ketoacyl-CoA synthase for different chain length primers.
  • the assay can be used to determine the specific activity of a particular 3-ketoacyl-CoA synthase for a particular chain length primer.
  • the assay can be carried out to test for specific activity for a C6-C0A primer by reacting a C6-C0A primer and malonyl-CoA with a particular 3-ketoacyl-CoA synthase.
  • Magnesium chloride is added to the reaction mixture and complexes between the resulting 3- ketoC8-CoA and magnesium ion are formed.
  • the reaction progress can be monitored by detecting these 3-ketoC8-CoA and magnesium ion complexes at a UV absorbance of 303 nm.
  • These in vitro assays can be performed with cell lysates of cells expressing a 3-ketoacyl-CoA synthase and/or an isolated 3-ketoacyl-CoA synthase.
  • CoA synthase has specific activity for C6-C0A primers and that the assayed 3-ketoacyl-CoA synthase has 3-ketoC8-CoA synthase activity.
  • the specific activity for other chain length primers can be assayed in similar fashion by substituting a different chain length CoA primer.
  • a C2-CoA primer the specific activity for C2-CoA primers can be assayed and the 3-ketoC4-CoA synthase activity can be determined.
  • the specific activity for C4-CoA primers can be assayed and the 3-ketoC6-CoA synthase activity can be determined.
  • C6-C0A primer By substituting a C6-C0A primer, the specific activity for C6-C0A primers can be assayed and the 3-ketoC8-CoA synthase activity can be determined.
  • C8-C0A primer By substituting a C8-C0A primer, the specific activity for C8-C0A primers can be assayed and the 3-ketoClO-CoA synthase activity can be determined.
  • the in vitro assay is carried out on isolated 3-ketoacyl-CoA synthase.
  • the 3-ketoacyl-CoA synthase can be isolated by methods known in the art.
  • the 3-ketoacyl-CoA synthase can be overexpressed as a fusion polypeptide with an affinity tag such as a poly-histidine tag and then purified by immobilized metal affinity chromatography.
  • the in vitro assay is then carried out on this isolated 3-ketoacyl-CoA synthase by preparing a reaction mixture of CoA primer (0.3 mM), malonyl-CoA (0.3 mM) and 5 mM MgCF in a buffer of 50 mM Tris, pH 8.0.
  • the reaction is initiated by adding the isolated 3- ketoacyl-CoA synthase at various concentrations. As described above, the progress of the reaction can be monitored by detecting 3-ketoCX-CoA and magnesium ion complexes at a UV absorbance of 303 nm, where X represents the chain length of the resulting product.
  • the CoA primer is selected from C2-CoA, C4-CoA, C6-C0A, or C8-C0A.
  • the reactions can be carried out in 96-well microtiter plates.
  • Threshold for activity was 0.05 U/mg.
  • NphT7 (SEQ ID NO: 16), Sko3 (SEQ ID NO: 2), Scin (SEQ ID NO: 3), Sten (SEQ ID NO: 4), Nbra (SEQ ID NO: 1), FABH_ECOLI (SEQ ID NO: 21), was carried out with the Clone Manager software package (Scientific & Educational Software, Denver, Colorado). The sequence alignment was performed as a global protein sequence alignment with the Scoring Matrix BLOSUM62 and using NphT7 (SEQ ID NO: 16) as the reference sequence (FIG. 1). In the alignment in Figure 1, the SV mutations are marked with the boxes; using the NphT7 numbering, the SV mutations are I147S and F217V.
  • sequence alignment was performed as a global protein sequence alignment with the Scoring Matrix BLOSUM62.
  • the SV mutations are marked with the boxes; using the NphT7 numbering, the SV mutations are I147S and F217V.
  • FIGS. 1 and 2 show graphical views of the sequence alignment. Each line corresponds to a 3-ketoacyl-CoA synthase enzyme as indicated by the designation. The amino acid position of the first residue of each row is indicated.
  • Example 3 shows graphical views of the sequence alignment. Each line corresponds to a 3-ketoacyl-CoA synthase enzyme as indicated by the designation. The amino acid position of the first residue of each row is indicated.
  • FIG. 3 show the FAME production by NphT7 and its mutants SV and LSVA in a C10 FAME production pathway (the production strain has an Asch gene that produces mainly C10 FAME).
  • This data was generated using recombinant strains 3 and 4 (for + or - chromosomal integration of nphT7).
  • These recombinant strains contain plasmid pair type 1B and 2B, where plasmids 1B have nphT7 variants as indicated in the table and plasmid 2B is the same for all strains (this plasmid 2B contains Asch wild type with a His tag).
  • NphT7 wild type in combination with an additional 3- ketoacyl-CoA synthase such as an Asch variant is not sufficient to produce FAME of C6 or longer chain length.
  • a mutant of NphT7, such as SV or LSVA, in combination with an Asch variant produces FAME.
  • NphT7 This data shows the wild type homologs in a clean background using recombinant strain 2 which does not have a chromosomal integration of nphT7.
  • the recombinant strain contains plasmid pairs 1A and 2A. Plasmids 2A contain the Asch variants that will determine the main FAME species as indicated in the table. For C10 FAME production Asch(V296A) (SEQ ID NO: 20) was used; for C8 FAME production
  • Plasmids 1A vary based on the NphT7 homologs evaluated, as indicated in the table. As indicated in example 3, similar recombinant strains containing the wild type NphT7 do not produce FAME.
  • the following examples show FAME production by strains expressing the wild- type genes or their corresponding SV mutant variants. Variants were evaluated in recombinant strains 1 (with a chromosomal nphT7 integration) and 2 (no nphT7 integration in the chromosome). The homologs or the control NphT7(LSVA) were expressed in a plasmid type 1A and were paired with plasmids type 2A containing variant Asch genes that preferentially produce C10 FAME, C8 FAME, or mixed C10 and C8 FAME as indicated in the table, all these without a His tag.
  • Nbra(SV) showed the ability to produce FAME in both backgrounds.
  • Nbra(SV) performed similarly to NphT7(LSVA).
  • the wild type Sten showed the ability to produce FAME in both backgrounds with and without NphT7 when combined with different Asch variants that preferentially produce C10 FAME, C8 FAME, or mixed C10 and C8 FAME.
  • the amount of FAME was reduced compared to the NphT7(LSVA), but the result pointed out a clear distinction from NphT7 in that wild type Sten can support FAME production.
  • FAME production by the Sten(SV) variant was lower than by the wild-type Sten in both backgrounds, indicating that the S and/or V mutations are context- specific.
  • Sko3 wild- type demonstrated FAME production in both backgrounds with and without NphT7 when combined with different Asch variants that preferentially produce C10 FAME, C8 FAME, or mixed C10 and C8 FAME.
  • FAME titers were comparable to those found with NphT7(LSVA) in a background without NphT7.
  • Sko3(SV) also showed FAME production in both backgrounds with and without NphT7.
  • the wild- type Scin showed ability to support FAME production in both backgrounds with and without NphT7 when combined with different Asch variants that preferentially produce C10 FAME, C8 FAME, or mixed C10 and C8 FAME.
  • Scin (SV) also showed the ability to support FAME production in both backgrounds with and without NphT7.
  • the SV mutation caused a reduction in total FAME produced, similar to Sko3(SV), again indicating that the benefits conferred by one or both of these changes is context- specific.
  • Ache (SEQ ID NO: 6) :
  • Gsul (SEQ ID NO: 13) : Geobacter sulfurreducens PCA
  • Abay-2 (SEQ ID NO: 14) :
  • Acinetobacter baylyi DSM 14961 CIP 107474
  • NphT7 (SEQ ID NO: 16) :
  • FGEWHVDGGFSIAAMNELELK fadB (SEQ ID NO: 36)
  • KKEEDAAVEDLLAEVSQPKRDFSEEEI IARMMIPMVNEWRCLEEGI IATPAEADMALVY GLGFPPFHGGAFRWLDTLGSAKYLDMAQQYQHLGPLYEVPEGLRNKARHNEPYYPPVEPA RPVGDLKTA

Abstract

A recombinant cell comprising a heterologous gene encoding a polypeptide with 3-ketoacyl-CoA synthase activity produces fatty acid or fatty acid chain product. Methods for producing fatty acid or fatty acid chain product with the recombinant cell are also disclosed.

Description

METHODS AND CELLS FOR PRODUCING FATTY ACIDS AND
FATTY ACID CHAIN PRODUCTS
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefit of U.S. Provisional Application No.
62/691,274, filed June 28, 2018, and entitled“METHODS AND CELLS FOR PRODUCING FATTY ACIDS AND FATTY ACID CHAIN PRODUCTS”, which application is hereby incorporated by reference herein in its entirety. This application claims the benefit of U.S. Provisional Applicationl No. 62/693,714, filed July 3, 2018, and entitled“METHODS AND CELLS FOR PRODUCING FATTY ACIDS AND FATTY ACID CHAIN PRODUCTS”, which application is hereby incorporated by reference herein in its entirety.
BACKGROUND
[0002] Fatty acids and fatty acid chain products have a number of industrial uses. For instance, fatty acids and fatty acid chain products can be utilized as solvents for lacquers, paints, varnishes and other compositions; as plasticizers for organic resins, as fragrances and flavorings, as fuels for jet and other internal combustion engines, and as raw materials for making a variety of downstream products. Currently, fatty acids and fatty acid chain products are produced industrially from non-renewable fossil fuels or from plant oils such as palm and coconut oil. However, these conventional methods of producing fatty acids and fatty acid chain products have a number of drawbacks. A particular drawback is poor selectivity of a specific carbon chain length. The products of these conventional methods tend to be a mixture of compounds having a range of carbon chain lengths. Separating these mixtures of compounds can be difficult and can often lead to poor yields with much of the non-renewable fossil fuels or plant oils converted to lower-value byproducts that do not have the desired carbon chain lengths.
[0003] Some biological cells can naturally produce some types of fatty acid chain products. For example, almost all living cells produce triglycerides of fatty acids as well as other fatty acid esters. These triglycerides and other esters play a role in the metabolism, cellular structure, and other biological processes of the cells, and can perform other useful functions such as storing energy. Therefore, some biological cells can potentially be used to produce fatty acid chain products industrially. Among other potential advantages, biological production of fatty acids and fatty acid products in some cases can rely on annually renewable carbon sources such as sugars, rather than on fossil fuels.
[0004] Biological cells tend to produce fatty acid groups with chain lengths of 12 carbon atoms or greater. Thus, naturally-occurring biological cells are a good source for C12 and higher fatty acids and their chain products. For example, triglycerides produced naturally by such cells can be hydrolyzed to produce C12 or higher fatty acids, which can in turn be converted to other derivatives such as esters or alcohols. On the other hand, few cells naturally produce fatty acids of C6 - C10 chain length in significant quantities.
[0005] Some biological cells can produce fatty acid chain products through a native metabolic pathway that starts with acetyl-CoA and malonyl-ACP (malonyl-acyl-carrier protein). Acetyl-CoA condenses with malonyl-ACP with loss of carbon dioxide and CoA to produce 3- ketobutyryl-ACP. Subsequent enzymatic reactions convert the 3-ketobutyryl-ACP successively to 3-hydroxybutyryl-ACP, then to trans-2-butenoyl-ACP (with loss of water) and finally to butyryl-ACP. The butyryl-ACP can re-enter this reaction cycle in place of acetyl-CoA to produce hexanoyl-ACP. This cycle repeats itself, producing in each iteration a longer carbon atom chain by adding two carbon atoms at a time, until terminated by some other cellular process. However, this native metabolic pathway has low termination rates for C6-C10 chain lengths, resulting in low production of C6-C10 fatty acids and fatty acid chain products.
[0006] Some biological cells can be genetically modified to increase production of C6-
C10 chain lengths. These genetic modifications can include insertions of heterologous genes and modification and/or deletions of endogenous genes to increase production of C6-C10 chain lengths. Often, heterologous genes can be inserted to provide enzymes to catalyze specific reactions between chemical intermediates along a pathway that results in production of C6-C10 chain lengths. For example, Okamura et ak, in PNAS vol. 107, no. 25, pp. 11265-11270 (2010), reported that the enzyme produced by the nphT7 gene of a soil-isolated Streptomyces sp. strain can catalyze a single condensation of acetyl-CoA and malonyl-CoA to produce acetoacetyl- CoA. Additionally, U.S. Application No. 2014/0051136 disclosed that fatty acid chain products can be produced by a cell modified to include the nphT7 gene and additional heterologous genes. However, the amounts of C6 or longer chain fatty acid chain products produced were limited and selectivity of C6-C10 chain lengths was limited. WO 2015/010103 described mutants of the nphT7 gene that code for NphT7 enzymes that more efficiently catalyze the condensation of longer-chain acyl-CoA compounds with malonyl-CoA. Cells modified to include both the wild-type nphT7 gene and the nphT7 mutant gene produced greater relative quantities of longer fatty acids, but selectivity of C6-C10 chain lengths remained poor. C6-C10 fatty acid chain products in particular were made in only small amounts.
SUMMARY
[0007] Provided herein are nucleic acid sequences and proteins, as well as biological processes to produce fatty acids at higher yield and with selectivity to C6-C10 fatty acids or fatty acid chain products.
[0008] One aspect provides an expression system comprising at least one expression cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a polypeptide with 3-ketoacyl-CoA synthase activity and having at least 80% sequence identity to an amino acid sequence set forth in SEQ ID NO: 1-15, wherein the polypeptide catalyzes the condensation of acyl-CoA with malonyl-CoA to form 3-ketoacyl-CoA. In one aspect, the polypeptide with 3-ketoacyl-CoA synthase activity has at least 80% sequence identity to an amino acid sequence set forth in SEQ ID NO: 1-4. In another aspect, the polypeptide catalyzes one or more condensations selected from the group consisting of: a) C2-CoA with malonyl-CoA to form 3-keto-C4-CoA; b) C4-CoA with malonyl-CoA to form 3-keto-C6-CoA; c) C6-C0A with malonyl-CoA to form 3-keto-C8-CoA; and d) C8-C0A with malonyl-CoA to form 3-keto- ClO-CoA. In one aspect, an amino acid residue of the polypeptide with 3-ketoacyl-CoA synthase activity that aligns with amino acid residue 147 of SEQ ID NO: 16 is serine and an amino acid residue that aligns with amino acid residue 217 of SEQ ID NO: 16 is valine. In one aspect, the promoter is an inducible promoter, such as an inducible promoter sensitive to lowering phosphate concentration. In one aspect, the promoter is a PpstSIH promoter or a PphoE promoter.
[0009] One aspect provides a recombinant cell comprising the expression system described herein, wherein the recombinant cell produces a fatty acid and/or fatty acid chain product having a chain length of one or more of C4, C6, C8 and/or C10. In one aspect, the fatty acid chain product comprises one or more of fatty acid ester, fatty alcohol, fatty acid amide and fatty acid amine. In another aspect, the fatty acid ester comprises one or more of fatty acid methyl ester, fatty acid ethyl ester, fatty acid propyl ester, fatty acid butyl ester, fatty acid pentyl ester and fatty acid hexyl ester. In one aspect, the recombinant cell produces a fatty acid ester of C4, C6, C8, and/or C10 chain length. In another aspect, the recombinant cell produces a fatty acid ester of C8 and/or C10 chain length. In one aspect, the alkoxy group of the fatty acid ester is derived from a Cl, C2, C3, or C4 monoalcohol. In another aspect, the alkoxy group is derived from a Cl or C2 monoalcohol. In one aspect, the monoalcohol is methanol.
[0010] In one aspect, the recombinant cell described herein further comprises one or more of: a heterologous nucleic acid segment encoding a polypeptide with enoyl-CoA reductase activity; a heterologous nucleic acid segment encoding a polypeptide with bifunctional 3- hydroxyacyl-CoA dehydrogenase/dehydratase activity; a heterologous nucleic acid segment encoding a polypeptide with ester synthase activity; a heterologous nucleic acid segment encoding a polypeptide with acetyl-CoA carboxylase activity; and a heterologous nucleic acid segment encoding a mutant polypeptide with ketoacyl-CoA synthase activity. In one aspect, the heterologous nucleic acid segment encoding a mutant polypeptide with ketoacyl-CoA synthase activity is an Asch variant selected from the group consisting of Asch(Tl84I, F236L, V268A, V296A, V317A, S328G), Asch(Tl84I, V296A, V268A), and Asch(V296A).
[0011] In one aspect, the recombinant cell is a fungal cell, a bacterial cell, or a plant cell.
In one aspect, the recombinant cell comprises an Escherichia coli. In another aspect, the recombinant cell comprises a yeast cell.
[0012] One aspect provides a cell culture comprising: a) a recombinant cell described herein; and b) one or more fatty acids or fatty acid chain products produced by the recombinant cell.
[0013] One aspect provides a method of producing fatty acids and/or fatty acid chain products comprising growing the recombinant cell described herein in a culture medium, wherein the recombinant cell is grown under conditions in which the heterologous nucleic acid segment encoding the polypeptide with 3-ketoacy-CoA synthase activity is expressed; and the recombinant cell produces a fatty acid and/or a fatty acid chain product.
[0014] One aspect provides one or more fatty acids or fatty acid chain products produced herein for use in products.
[0015] One aspect provides an isolated 3-ketoacyl-CoA synthase having at least 80% sequence identity to an amino acid sequence set forth in SEQ ID NO: 1-15 for use in catalyzing the condensation of acyl-CoA with malonyl-CoA to form 3-ketoacyl-CoA. In another aspect, the 3-ketoacyl-CoA synthase has activity for one or more of C2-CoA, C4-CoA and C6-C0A intermediates. In one aspect, the 3-ketoacyl-CoA synthase has activity for C2-CoA and C4-CoA intermediates. In one aspect, the 3-ketoacyl-CoA synthase has activity for C2-CoA, C4-CoA and C6-C0A intermediates.
[0016] One aspect provides a recombinant cell described herein for use in producing fatty acid and/or fatty acid chain products having a chain length of one or more of C4, C6, C8 and/or C10.
[0017] One aspect provides a 3-ketoacyl-CoA synthase having at least 80% sequence identity to an amino acid sequence set forth in SEQ ID NO: 1-4, wherein an amino acid residue of the 3-ketoacyl-CoA synthase that aligns with amino acid residue 147 of SEQ ID NO: 16 is serine and an amino acid residue that aligns with amino acid residue 217 of SEQ ID NO: 16 is valine.
[0018] In one aspect, the recombinant cell produces a fatty acid and/or fatty acid chain product having a chain length of one or more of C6, C8 and/or C10. In another aspect, the recombinant cell produces a fatty acid ester of C6, C8, and/or C10 chain length. In one aspect, the recombinant cell produces a fatty acid ester of C8 and/or C10 chain length. In another aspect, the alkoxy group is derived from a Cl, C2, C3, or C4 monoalcohol. In one aspect, the alkoxy group is derived from a Cl or C2 monoalcohol. In some aspects, the monoalcohol is methanol.
BRIEF DESCRIPTION OF THE FIGURES
[0019] FIG. 1 shows graphical views of the sequence alignment. Each line corresponds to a 3-ketoacyl-CoA synthase enzyme as indicated by the designation. The amino acid position of the first residue of each row is indicated. The SV mutations are marked with the boxes.
Using the NphT7 numbering, it is I147S and F217V. Sequence alignment was done using Clone Manager alignment software with a Scoring Matrix: Standard Linear and default settings.
[0020] FIGS. 2A-2D show graphical views of the sequence alignment. Each line corresponds to a 3-ketoacyl-CoA synthase enzyme as indicated by the designation. The amino acid position of the first residue of each row is indicated. The SV mutations are marked with the boxes. Using the NphT7 numbering, it is I147S and F217V. Sequence alignment was done using Clone Manager alignment software with a Scoring Matrix: Standard Linear and default settings.
[0021] FIG. 3 shows the FAME production by NphT7 and its mutants SV and LSVA in a C10 FAME production pathway with or without chromosomal expression of wild type nphT7. [0022] FIG. 4 shows FAME production by wild type Nbra, Sten, Sko3, Scin in three different FAME production pathways (C8 FAME, C10 FAME, C8/C10 mix).
[0023] FIG. 5 shows FAME production by wild- type Nbra and its SV mutants vs.
NphT7(LSVA) in three different FAME production pathways (C8 FAME, C10 FAME, C8/C10 mix) with or without chromosomal expression of wild type nphT7.
[0024] FIG. 6 shows FAME production by wild- type Sten and its SV mutants vs.
NphT7(LSVA) in three different FAME production pathways (C8 FAME, C10 FAME, C8/C10 mix) with or without chromosomal expression of wild type nphT7.
[0025] FIG. 7 shows FAME production by wild-type Sko3 and its SV mutants vs.
NphT7(LSVA) in three different FAME production pathways (C8 FAME, C10 FAME, C8/C10 mix) with or without chromosomal expression of wild type nphT7.
[0026] FIG. 8 shows FAME production by wild- type Scin and its SV mutants vs.
NphT7(LSVA) in three different FAME production pathways (C8 FAME, C10 FAME, C8/C10 mix) with or without chromosomal expression of wild type nphT7.
DETAILED DESCRIPTION
[0027] Although some wild type biological cells can naturally produce some types of fatty acid chain products, these biological cells tend to produce fatty acids and fatty acid chain products with chain lengths of C12 or higher. Unfortunately, few cells naturally produce fatty acids and/or fatty acid chain products of C6-C10 chain length in significant quantities.
[0028] Applicants have shown that recombinant cells can be genetically modified to express heterologous polypeptides for the biosynthesis of C6-C10 fatty acids and/or fatty acid chain products. These recombinant cells can be genetically modified to express identified enzymes having 3-ketoacyl-CoA synthase activity to produce C6-C10 fatty acids and/or fatty acid chain products. In particular, these identified enzymes having 3-ketoacyl-CoA synthase activity can catalyze the stepwise addition of malonyl-CoA to an acyl-CoA substrate that can be further converted to a C6-C10 fatty acid and/or fatty acid chain product. For example, an enzyme having 3-ketoacyl-CoA synthase activity can catalyze the addition of malonyl-CoA to a C2-CoA substrate to lead to a C4-CoA product that can be further converted to its fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C4-CoA substrate to lead to a C6-C0A product that can be further converted to its fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C6-C0A substrate to lead to a C8-C0A product that can be further converted to its fatty acid and/or fatty acid chain product, and/or the addition of malonyl- CoA to a C8-C0A substrate to lead to a ClO-CoA product that can be further converted to its fatty acid and/or fatty acid chain product. In some aspects, an enzyme having 3-ketoacyl-CoA synthase activity can catalyze the addition of malonyl-CoA to a C2-CoA substrate (acetyl-CoA) to lead to a 3-keto-C4-CoA, the addition of malonyl-CoA to a C4-CoA substrate to lead to a 3- keto-C6-CoA, the addition of malonyl-CoA to a C6-C0A substrate to lead to a 3-keto-C8-CoA, and/or the addition of malonyl-CoA to a C8-C0A substrate to lead to a 3-keto-ClO-CoA.
[0029] In addition to enzymes having 3-ketoacyl-CoA synthase activity, the recombinant cells can be genetically modified to express other heterologous polypeptides for the biosynthesis of C6-C10 fatty acids and/or fatty acid chain products. For example, the recombinant cells can be genetically modified to express one or more heterologous polypeptides capable of converting a 3-keto-CX-CoA substrate (e.g., 3-keto-C4-CoA, 3-keto-C6-CoA, 3-keto-C8-CoA, and/or 3- keto-ClO-CoA) to a 3-OH CX-CoA substrate (e.g., 3-OH-C4-CoA, 3- OH-C6-C0A, 3-OH-C8- CoA, and/or 3-OH-ClO-CoA) (where X represents the carbon chain length). The recombinant cells can be genetically modified to express one or more heterologous polypeptides capable of converting a 3-OH-CX-CoA substrate (e.g., 3-OH-C4-CoA, 3-OH-C6-CoA, 3-OH-C8-CoA, and/or 3-OH-ClO-CoA) to a trans-enoyl-CX-CoA substrate (e.g., enoyl-C4-CoA, enoyl-C6- CoA, enoyl-C8-CoA, and/or enoyl-ClO-CoA) (where X represents the carbon chain length).
The recombinant cells can be genetically modified to express one or more heterologous polypeptides capable of converting a trans-enoyl-CX-CoA substrate (e.g., enoyl-C4-CoA, enoyl- C6-C0A, enoyl-C8-CoA, and/or enoyl-ClO-CoA) to a CX-CoA substrate (e.g., C4-CoA, C6- CoA, C8-C0A, and/or ClO-CoA) (where X represents the carbon chain length). Additionally, the recombinant cells can be genetically modified to express one or more heterologous polypeptides capable of converting a CX-CoA substrate (e.g., C4-CoA, C6-C0A, C8-C0A, and/or ClO-CoA) to a fatty acid ester.
[0030] As used herein, the singular forms“a,”“an,” and“the” include plural referents unless the context clearly dictates otherwise. For example, reference to“a nucleic acid” means one or more nucleic acids.
[0031] Methods available to those skilled in the art can be used to construct genetic expression constructs and recombinant cells as described. These methods include, but are not limited to, in vitro recombinant DNA techniques, synthetic DNA techniques, in vivo
recombination techniques, and polymerase chain reaction (PCR) techniques. For example, techniques as described in Green & Sambrook, 2012, MOLECULAR CLONING: A
LABORATORY MANUAL, Fourth Edition, Cold Spring Harbor Laboratory, New York;
Ausubel et ak, 1989, CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Greene Publishing Associates and Wiley Interscience, New York, and PCR Protocols: A Guide to Methods and Applications (Innis et ak, 1990, Academic Press, San Diego, CA) can be used to construct the genetic expression constructs and the recombinant cells described in this application.
[0032] The terms“polynucleotide,”“nucleotide,”“oligonucleotide,” and“nucleic acid” can be used interchangeably to refer to nucleic acid comprising DNA, RNA, derivatives thereof, or combinations thereof.
[0033] Amino acid residues in all amino acid sequences described herein are ordered in the N-terminus to C-terminus direction. “Upstream” means in the direction toward the N- terminus, and“downstream” means toward the C-terminus direction. The“start” of an amino acid sequence is the first amino acid residue in the N-terminus direction. The first amino acid residue (amino acid residue 1) for any sequence or sub-sequence described herein is the amino acid residue at its N-terminus.
[0034] A“sub-sequence” is a sequence of amino acid residues contained within a larger amino acid sequence.
[0035] Polypeptides refer to polymeric chains of amino acids connected by peptide bonds. In some aspects, polypeptides can fold into unique three dimensional structures that allow the polypeptides to function as enzymes that catalyze distinct biochemical reactions. For example, a 3-ketoacyl-CoA synthase polypeptide catalyzes the condensation reaction of an acyl- CoA with malonyl-CoA to form a 3-ketoacyl-CoA.
[0036] “Identity” is used herein to indicate the extent to which two (nucleotide or amino acid) sequences have the same residues at the same positions in side-by-side alignment of their sequences. The identity is expressed herein as a % identity as determined using BLAST (National Center for Biological Information (NCBI) Basic Local Alignment Search Tool) version 2.2.31 software (National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, Maryland, USA), using default parameters unless indicated otherwise in this paragraph. Identity between amino acid sequences is determined using protein BLAST with the following parameters: Max target sequences: 100; Short queries: Automatically adjust parameters for short input sequences; Expect threshold: 10; Word size: 6; Max matches in a query range: 0; Matrix: BLOSUM62; Gap Costs: (Existence: 11, Extension: 1); Compositional adjustments: Conditional compositional score matrix adjustment; Filter: none selected; Mask: none selected. Nucleic acid % sequence identity between nucleic acid sequences is determined using standard nucleotide BLAST with the following default parameters: Max target sequences: 100; Short queries: Automatically adjust parameters for short input sequences; Expect threshold: 10; Word size: 28; Max matches in a query range: 0; Match/Mismatch Scores: 1, -2; Gap costs: Linear; Filter: Low complexity regions; Mask: Mask for lookup table only. A sequence having a % identity score of XX% (for example, 80% or 95%) to a reference sequence as determined in this manner is considered to be XX% identical to or, equivalently, have XX% sequence identity to, the reference sequence.
[0037] An amino acid residue of a sequence or sub-sequence under investigation
“aligns” to an amino acid residue of a reference sequence or sub-sequence when:
i) in the case of an entire sequence, the sequences are aligned using the BLAST version 2.2.31 software in the manner described above, and the amino acid residue of the sequence under investigation occupies the same position in the alignment as does the amino acid residue of the reference sequence;
ii) in the case of a sub-sequence, the sequence containing the sub-sequence under investigation is aligned with the reference sequence, and the amino acid residue of the sub-sequence under investigation occupies the same position in the alignment as does the amino acid residue of the reference sub-sequence.
[0038] The enzymes described herein can be expressed from expression cassettes. Such an expression cassette (also referred to as a transgene) can include a promoter operably linked to a nucleic acid segment that encodes any of the enzymes described herein.
[0039] The terms "expression vector" and "expression cassette" refer to a recombinant nucleic acid molecule containing a desired nucleic acid coding segment and appropriate nucleic acid sequences for the expression of the operably linked nucleic acid coding segment in a particular host cell/recombinant cells or organism. Nucleic acid sequences for expression of proteins in prokaryotes usually include a promoter, an operator (optional), and a ribosome binding site, often along with other sequences. Eukaryotic cells can utilize promoters, enhancers, and termination and polyadenylation signals.
[0040] Nucleic acid sequences for expression of proteins can be heterologous to the nucleic acid coding segment. A "heterologous nucleic acid," "heterologous gene" and "heterologous protein" indicate a nucleic acid, gene or protein, respectively, which has come from a source other than its native source. The terms "heterologous" and "exogenous" are sometimes used interchangeably with "recombinant." Such heterologous nucleic acid, heterologous genes, and/or heterologous proteins may have been artificially supplied to the biological system or host/recombinant cell.
[0041] Promoters provide for expression of mRNA. In some cases, the promoter can be a native promoter. However, the promoter can in some cases be heterologous to the nucleic acid segment. In other words, such a heterologous promoter may not be naturally linked to such nucleic acid segment. Instead, some expression cassettes and expression vectors have been recombinantly engineered to include a nucleic acid coding segment (such as one encoding an enzyme described herein) operably linked to a heterologous promoter. A nucleic acid coding segment is operably linked to the promoter, for example, when it is located downstream from the promoter.
[0042] A variety of promoters can be included in the expression cassettes and/or expression vectors. In some cases, an endogenous promoter, or an endogenous promoter that expresses the native enzyme, can be employed. Promoter regions are typically found in the flanking DNA upstream from the coding sequence in both prokaryotic and eukaryotic cells. A promoter sequence provides for regulation of transcription of the downstream gene sequence and typically includes from about 50 to about 2,000 nucleotide base pairs. Promoter sequences can also contain regulatory sequences such as enhancer sequences that can influence the level of gene expression. Some isolated promoter sequences can provide for gene expression of heterologous DNAs, that is a DNA different from the native or homologous DNA.
[0043] Promoters can be strong or weak, or inducible. A strong promoter provides for a high level of gene expression, whereas a weak promoter provides for a low level of gene expression. An inducible promoter is a promoter that provides for the turning on and off gene expression in response to an exogenously added agent, or to an environmental or developmental stimulus. Promoters can also provide for cell specific, tissue specific or developmental regulation. A strong promoter for heterologous DNAs can be advantageous because it provides for a sufficient level of gene expression for easy detection and selection of transformed cells and provides for a high level of gene expression when desired. In some cases, the promoter within such expression cassettes / vectors can be functional in a fungal cell. [0044] An enzyme-encoding nucleic acid segment can be combined with a promoter by standard methods to yield an expression cassette or transgene, for example, as described in Sambrook et al. (MOLECULAR CLONING: A LABORATORY MANUAL. Second Edition (Cold Spring Harbor, NY: Cold Spring Harbor Press (1989); MOLECULAR CLONING: A LABORATORY MANUAL. Third Edition (Cold Spring Harbor, NY: Cold Spring Harbor Press (2000)). Briefly, a plasmid containing a promoter can be constructed. Typically, these plasmids are constructed to have multiple cloning sites having specificity for different restriction enzymes downstream from the promoter. The enzyme encoding nucleic acids can be subcloned downstream from the promoter region using restriction enzymes and positioned to ensure that the nucleic acid segment is inserted in proper orientation with respect to the promoter so that the coding segment can be expressed as sense or antisense RNA. Once the enzyme nucleic acid is operably linked to a promoter, the expression cassette so formed can be subcloned into a plasmid or other vector.
[0045] A“recombinant cell” is a cell whose genetic material has been altered by a human using genetic engineering techniques including molecular cloning. Recombinant cells can comprise cells whose genetic material has been altered by the addition of genetic material from a cell of a different organism by genetic engineering. Recombinant cells can also comprise cells whose genetic material has been altered by the deletion of genetic material by genetic engineering. Recombinant cells can also comprise cells altered by the addition of extra copies of genetic material that is normally native to the recombinant cell by genetic engineering.
[0046] For purposes of this application, genetic material such as genes, promoters and terminators is“heterologous” if it is (i) non-native to the wild-type cell before it was converted to a recombinant cell and/or (ii) is native to the cell, but is present at a location different than where that genetic material is present in the wild-type cell and/or (iii) is under the regulatory control of a non-native promoter and/or non-native terminator· Extra copies of native genetic material are considered as“heterologous” for purposes of this application, even if such extra copies are present at the same locus as that genetic material is present in the wild-type cell.
[0047] A gene (such as a gene coding for a 3-ketoacyl-CoA synthase) is“heterologous” if it is not natively expressed by the native cell before it is converted to a recombinant cell, if it is native to the wild-type cell, but is present at a location different than where that genetic material is present in the wild-type cell, if it is under the regulatory control of a non-native promoter and/or non-native terminator, and/or if extra copies are present at the same locus where that genetic material is normally present in the wild-type cell. A heterologous gene can produce a heterologous polypeptide.
[0048] A“3-ketoacyl-CoA synthase” is an enzyme that catalyzes the condensation reaction of an acyl-CoA with malonyl-CoA to form a 3-ketoacyl-CoA (which is 2 carbon longer in chain length than the acyl-CoA primer), CoA-SH, and C02. One method for evaluating the ability of an enzyme to catalyze this reaction is by measuring the release of free CoA-SH using 5,5'-dithiobis-(2-nitrobenzoic acid) (DTNB), malonyl-CoA as the donor substrate, and a C2-C8 acyl-CoA as the primer substrate, in the presence of the enzyme. The formation of the corresponding 3-ketoacyl-CoA from any of these primer substrates indicates the enzyme is a 3- ketoacyl-CoA synthase.
[0049] The ability of an enzyme to catalyze this condensation reaction can also be evaluated by using an acyl-CoA as the primer substrate and malonyl-CoA as the donor substrate in the presence of 5 mM Mg++ salt by measuring the increase in absorbance at 303 nm as a function of the increase in the formation of a Mg++-complex with the 3-ketoacyl-CoA product. 3-Ketoacyl-CoA synthase will produce acetoacetyl-CoA from acetyl-CoA primer, 3- ketohexanoyl-CoA from butyryl-CoA primer, 3-ketooctanoyl-CoA from hexanoyl-CoA primer, and 3-ketodecanoyl-CoA from octanoyl-CoA primer.
[0050] EC numbers are established by the Nomenclature Committee of the International
Union of Biochemistry and Molecular Biology (NC-IUBMB) ( Enzyme Nomenclature
1992 (Academic Press, San Diego, California, ISBN 0-12-227164-5 (hardback), 0-12-227165-3 (paperback)) with Supplement 1 (1993), Supplement 2 (1994), Supplement 3 (1995),
Supplement 4 (1997) and Supplement 5 (in Eur. J. Biochem. 1994, 223, 1-5; Eur. J. Biochem. 1995, 232, 1-6; Eur. J. Biochem. 1996, 237, 1-5; Eur. J. Biochem. 1997, 250; 1-6, and Eur. J. Biochem. 1999, 264, 610-650). The EC numbers referenced herein are derived from the KEGG Ligand database, maintained by the Kyoto Encyclopedia of Genes and Genomics, sponsored in part by the University of Tokyo. Unless otherwise indicated, the EC numbers are as provided in the database as of April 2009.
[0051] A“CoA” or“CoA-SH”, as described here refers to Coenzyme A. A CoA substrate can refer to a straight chain carbon chain covalently linked to Coenzyme A. For example, C4-CoA refers to a C4 chain covalently linked to Coenzyme A via a thioester bond between the thiol of the Coenzyme A and a terminal carbon of the C4 chain. [0052] A“fatty acid,” as described here, is a straight-chain or branched-chain monoalkanoic acid having at least four carbon atoms.
[0053] A“fatty acid chain product” is a compound having a straight chain alkyl group formed in a series of one or more reactions at the site of the terminal carboxyl group of a fatty acid (or a thioester bond of a corresponding -CoA compound). A fatty acid chain product can comprise a straight carbon chain with a different end group such as, for example, an ester, an alcohol group, an amino group, an aldehyde group, a ketone, a methyl group, or an alkenyl group. Fatty acid chain product can include fatty acid ester, fatty alcohol, fatty acid amide, fatty acid imide, and fatty acid amine.
[0054] A“fatty acid ester” is an ester compound corresponding to the reaction product of a fatty acid or a fatty acyl-CoA and an alcohol (with loss of a water or a CoA-SH). A fatty acid ester can also correspond to the reaction product of a fatty acyl CoA and an alcohol. Fatty acid ester can include fatty acid methyl ester, fatty acid ethyl ester, fatty acid propyl ester, fatty acid butyl ester, fatty acid pentyl ester, and fatty acid hexyl ester. Fatty acid methyl ester (FAME) is an ester compound corresponding to the reaction product of a fatty acid or a fatty acyl-CoA and methanol.
[0055] Chain lengths of fatty acids and fatty acid chain products are sometimes indicated herein by the shorthand“CX,” wherein X is a number designating the number of carbon atoms. The number of carbon atoms designated in each case represents the carbon length of the straight- chain compound (after removal of CoA or ACP coenzymes) formed by the recombinant cell through one or more iterations of the reaction cycle: acyl-CoA (or acyl- ACP) + malonyl-CoA to form a 3-ketoacyl compound; reduction of the 3-ketoacyl compounds to form a 3-hydroxyacyl compound; dehydration of the 3-hydroxyacyl-CoA to form a 3-enoylacyl compound; and reduction of the 3-enoylacyl compound to the corresponding acyl compound.
[0056] Each iteration of this reaction cycle adds two carbon atoms to the starting acyl-CoA or acyl-ACP. The number of carbon atoms does not include additional carbon atoms that may be added during the formation of any fatty acid chain products of the fatty acid, such as, for example, carbons included in an ester group following an esterification. Thus, hexanoic acid methyl ester is considered as a“C6” fatty acid ester compound, the carbon of the methyl ester group not being counted.
[0057] The“CX” designation can also apply to fatty acid chain products. For example,
C4 FAME refers to methyl butanoate, C6 FAME refers to methyl hexanoate, C8 FAME refers to methyl octanoate, C10 FAME refers to methyl decanoate, and C12 FAME refers to methyl dodecanoate. The“CX” designation can also be used for fatty acid intermediates. For example, C2-CoA refers to acetyl-CoA, C4-CoA refers to butyryl-CoA, C6-C0A refers to hexanoyl-CoA, C8-C0A refers to octanoyl-CoA, ClO-CoA refers to decanoyl-CoA, and Cl2-CoA refers to dodecanoyl-CoA.
[0058] The term“engineered biosynthetic pathway” refers to a biosynthetic pathway that is at least partially genetically engineered into a recombinant cell and comprises one or more enzymes to carry out a sequence of steps to produce a desired product, such as a fatty acid or fatty acid chain product. In some aspects, one or more of the enzymes of the engineered biosynthetic pathway do not naturally occur in the unmodified cell. These enzymes can be introduced into the cell by inserting a heterologous enzyme by genetic engineering. In some aspects, one or more of the enzymes of the engineered biosynthetic pathway do not naturally occur in sufficient copy number in an unmodified cell and additional copies of the enzymes are inserted by genetic engineering. In some cases, the engineered biosynthetic pathway also comprises genetic modifications to the cell to reduce or eliminate competing metabolic pathways and/or to reduce interfering activities, such as degradation of desired products or necessary intermediates.
[0059] The term“endogenous” gene refers to a gene that is naturally found in a particular cell. The term“overexpress” is used to refer to the expression of a heterologous gene in a recombinant cell at levels higher than the level of gene expression in a wild type cell. In some aspects, an endogenous gene is deleted. The terms“deletion,”“deleted,”“knockout,” and “knocked out” can be used interchangeably to refer to an endogenous gene that has been engineered to no longer be expressed in a recombinant cell. In some aspects, a deleted/knocked out gene is an endogenous gene that is deleted to increase production of a desired product, such as a fatty acid or fatty acid chain product.
[0060] The terms“variant” and“mutant” protein are used to describe a protein sequence that has been modified at one or more amino acids, compared to the wild type sequence of a particular protein. Mutations to the amino acid residues encoded by the wild-type genes are designated herein by the shorthand designation for the wild-type enzyme, followed in parenthesis by a 3-, 4- or 5 character code consisting of a first letter designating the amino acid residue in the native enzyme, a 1-, 2- or 3-digit number indicating the position of that amino acid residue in the native enzyme, and a final letter designating the amino acid residue in that position in the mutated enzyme. For example the designation NphT7(Il47S, F217V) indicates that an isoleucine (I) at amino acid residue position 147 in the wild type NphT7 enzyme has been replaced with a serine (S) and that a phenylalanine at amino acid residue position 217 in the wild type NphT7 enzyme has been replaced with a valine (V). Mutations to the amino acid residues encoded by the wild-type genes can also be designated herein by the shorthand designation for the wild-type enzyme, followed in parenthesis by the final letter or letters designating the amino acid residue in that was mutated. For example the designation
NphT7(SV) indicates that two amino acids have been mutated to a serine (S) and valine (V) respectively. The single-letter designations are IUPAC amino acid abbreviations as reported, for example, at Eur. J. Biochem. 138:9-37(1984).
[0061] In some aspects, the recombinant cell is a prokaryotic cell. In some aspects, the recombinant cell is a eukaryotic cell.
[0062] In some aspects, the recombinant cell is a microorganism, and may be a single- celled microorganism.
[0063] The recombinant cell may be a plant cell, including a cell from a plant within any of the Chlorophyta, Charophyta, Marchantiophyta, Anthocerotophyta, Bryophyta,
Lycopodiophyta, Pteridophyta, Cycadophyta, Ginkgophyta, Pinophyta, Gnetophyta or
Magnoliophyta plants. Such a plant cell may be, for example, a cell from a plant within any of the genera Arabidopsis, Beta, Glycine, Jatropha, Miscanthus, Panicum, Phalaris, Populus, Saccharum, Salix, Simmondsia, and Zea.
[0064] The recombinant cell may be a fungal, microalgae, algae or red algae
(heterokont) cell. The recombinant cell may be a yeast cell. A yeast or fungal cell may be an oleaginous yeast or fungus, and/or may be a Crabtree negative yeast or fungus.
[0065] The term“oleaginous fungi” refers to yeasts or filamentous fungi, which accumulate at least 10%, 12.5%, 15%, 17.5%, preferably at least 20% or even at least 25%
(w/w) of their biomass as lipid. They may even accumulate at least 30%, 40%, 50%, 60%, 70%, 80% (w/w) or more of their biomass as lipids. The biomass is usually measured as cell dry weight (CDW).
[0066] A“Crabtree -positive” organism is one that is capable of producing ethanol in the presence of oxygen, whereas a "Crabtree-negative" organism is not. A yeast cell having a Crabtree-negative phenotype is any yeast cell that does not exhibit the Crabtree effect. The term “Crabtree-negative” refers to both naturally occurring and genetically modified organisms. Briefly, the Crabtree effect is defined as the inhibition of oxygen consumption by a microorganism when cultured under aerobic conditions due to the presence of a high concentration of glucose (e.g., 10 g-glucose L-l). In other words, a yeast cell having a Crabtree positive phenotype continues to ferment irrespective of oxygen availability due to the presence of glucose, while a yeast cell having a Crabtree-negative phenotype does not exhibit glucose mediated inhibition of oxygen consumption. Crabtree-positive yeast produce an excess of alcohol rather than biomass production.
[0067] Examples of suitable yeast cells include, Pichia, Candida, Klebsiella, Hansenula,
Kluyveromyces, Trichosporon, Brettanomyces, Pachysolen, Issatchenkia, Yamadazyma
Saccharomyces, Schizosaccharomyces, Zygosaccharomyces, Debaryomyces, Cryptoococcus, Rhodotorula, Rhodosporidium, Lipomyces and Yarrowia. Examples of specific yeast cells include C. sonorensis, K. marxianus, K. thermotolerans, C. methane sorbosa, Saccharomyces bulderi (S. bulderi), I. orientalis, C. lambica, C. sorboxylosa, C. zemplinina, C. geochares, P. membranifaciens, Z. kombuchaensis, C. sorbosivorans, C. vanderwaltii, C. sorbophila, Z.
bisporus, Z. lentus, Saccharomyces bayanus (S. bayanus), D. caste llii, C, boidinii, C. etchellsii,
K. lactis, P. jadinii, P. anomala, Saccharomyces cerevisiae ( S. cerevisiae ) Pichia galeiformis, Pichia sp. YB-4149 (NRRL designation ), Candida ethanolica, P. deserticola, P.
membranifaciens, P. fermentans, Rhodosporidium toruloide, Lipomyces starkeyii, L. lipoferus, Candida revkaufi, C. pulcherrima, C. tropicalis, C. utilis, Trichosporon pullas, T. cutaneum, Rhodotorula glutinous, R. garminis, Yarrowia lipolytica and Saccharomycopsis crataegensis ( S. crataegensis). Suitable strains of K. marxianus and C. sonorensis include those described in WO 00/71738 Al, WO 02/42471 A2, WO 03/049525 A2, WO 03/102152 A2 and WO
03/102201A2. Suitable strains of I. orientalis are ATCC strain 32196 and ATCC strain PTA- 6648.
[0068] In some aspects, the recombinant cell is a bacteria cell. The bacteria may be a gram-positive or gram-negative bacteria. It may be a cell within any of the Chlamydiae, green nonsulfur, actinobacteria, planctomycetes, spirochaetes, fusobacteria, cyanobacteria, thermophilic sulphate-reducer, acidobacteria or proteobacteria classifications of bacteria (Ciccarelli et al, Science 311 (5765): 1283-7 (2006).
[0069] Examples of suitable bacterial cells include, for example, those within any of the genera Clostridium, Zymomonas, Escherichia, Salmonella, Rhodococcus, Pseudomonas, Streptomyces, Bacillus, Lactobacillus, Enterococcus, Alcaligenes, Klebsiella, Paenibacillus, Arthrobacter, Corynebacterium, Bacteriophage, Brevibacterium, Acanthoceras, Acanthococcus, Acarvochloris, Achnanthes, Achnanthidiun, Actinastrum, Actinochloris, Actinocyclus,
Actinotaenium, Amphichrsis, Amphidiniunm, Amphikrikos, Amplhipleura, Amphiprora, Amphithrix, Amphora, Anabaena, Anabaenopsis, Aneumnastus, Ankistrodesmius, Ankyra, Anomoeoneis, Apatococcus, Aphanizomenon, Aphanocapsa, Aphanochaete, Aphanothece, Apiocystis, Apistonema, Arthrodesmus, Artherospira, Ascochloris, Asterionella, Asterococcus, Audouinella, Aularoseira, Bacillaria, Balbiania, Bambiusina, Bangia, Basichlamys,
Batrarhospermum, Binurlearia, Bitrichia, Blidingia, Botrdiopsis, Botrydium, Botryococcus, Botryosphaerella, Brachiomonas, Brachysira, Brachytrichia, Brebissonia, Bulbochaete, Bumnilleria, Buinilleriopsis, Caloneis, Calothrix, Campylodiscus, Capsosiphon, Carteria, Catena, Cavinula, Cenritractus, Centroniella, Ceratiunt, Chaetoceros, Chaetochloris,
Chaetomorpha, Chaetonella, Chaetonemna, Chaetopeltis, Chaetophora, Chaetosphaeridium, Chamaesiphion, Chara, Characiochloris, Characiopsis, Characium, Chorales, Chilomonas, Chlainomonas, Chlamydoblepharis, Chlamydocapsa, Chlamydomonas, Chlamydomonopsis, Chlamydomnyxa, Chlamydonephris, Chlorangiella, Chlorangiopsis, Chlorella, Chlorobotrys, Chlorobrachis, Chlorochytrium, Chloroccun, Chlorogloea, Chlorogloeopsis, Chlorogonium, Chlorolobion, Chloromonas, Chlorophysema, Cholorphyta, Cholorosaccus, Cholorosarcina, Choricystis, Chromophyton, Chromulina, Chroococcidiopsis, Chrococcus, Chroodactylon, Chroomonas, Chroothece, Chrysamoeba, Chysapsis, Chrysidiastrum, Chrysocapsa,
Chrysocapsella, Chrysochaete, Chrysohromulina, Chrysococcus, Chrysocrinus,
Chrynsolepidomonas, Chrysolykos, Chrysonebula, Chrysophyta, Chrysopyxis, Chrysosaccus, Chrysophaerella, Chrysotephanosphaera, Clodophora, Clastidium, Closteriopsis, Closterium, Coccomyxa, Cocconeis, Coelastrella, Coelastrum, Coelosphaerium, Coenochloris,
Coenococcus, Coenocystis, Colacium, Coleochaete, Collodictyon, Compsogonopsis,
Compsopogon, Conjugatophyta, Conoehaete, Coronastrum, Cosmarium, Cosmnioneis, Cosmocladium, Crateriportula, Craticula, Crinalium, Crucigenia, Crucigeniella, Cryptoaulax, Cryptomonas, Cryptophyta, Ctenophora, Cyanodictyon, Cyanonephron, Cyanophora,
Cyanophyta, Cyanothece, Cyanothomonas, Cyclonexis, Cyclostephanos, Cyclotella,
Cylindrocapsa, Cylindrocystis, Cylindrospermum, Cylindrotheca, Cymatopleura, Cymbella, Cymbeilonitzschia, Cystodinium Dactylococcopsis, Debarya, Denticula, Dermatochrysis, Dermorarpa, Dermocarpella, Desmatractum, Desmidium, Desmococcus, Desmonema, Desmosiphon, Diacanthos, Diacronema, Diadesmis, Diatoma, Diatomella, Dicellula,
Dichothrix, Dichtotomococcrus, Dicranochaete, Dictyochloris, Dictyococcus, Dictyosphaerium, Didymocystis, Didymogenes, Didymosphenia, Dilabifilum, Dimorphoccus, Dinobryon,
Dinocuccus, Diplochloris, Diploneis, Diplostauron, Distrionella, Docidium, Draparnaldia, Dunaliella, Dysmorphaocuccus, Ecballocystis, Elakatothrix, Ellerbeckia, Encyonema,
Enteromorpha, Entocladia, Entomoeis, Entophysalis, Ephichrysis, Epipyxis, Epithemia, Eremosphaura, Euastropsis, Euatstrum, Eucapsis, Eucocconeis, Eudorina, Euglena,
Euglenophyta, Eunotia, Eustigmatophyta, Eutreptia, Fallcia, Ficherella, Fragilaria,
Fragilariforma, Franceia, Frustulia, Curcilla, Geminella, Genicularia, Glaucocystis,
Glaucophyta, Glenodiniopsis, Glenodinium, Gloeomonas, Gloeoplax, Gloeothece, Geloeotila, Gloeotrichia, Gloiodictyon, Golenkinia, Golenkiniopsis, Gomontia, Gomphocymbella,
Gomphonema, Gomphosphaeria, Gonatozygon, Gongrosia, Gongrosira, Goniochloris, Gonium, Gonyostomum, Granulochloris, Granulocystopsis, Groenbladia, Gymnodiunium, Gymnozyga, Gyrosignma, Haematocuccus, Hafniomonas, Hallassia, Hammatoidea, Hannaea, Hantzchia, Hapalosiphon, Haplotaenium, Haptophyta, Haslea, Hemidinuim, Hemitonia, Heribaudiella, Heteromastix, Heterothrix, Hibberdia, Hildenbrandia, Hillea, Holopedium, Homoeothrix, Hormanthonema, Hormotila, Hyalobrachion, Hyalocardium, Hyalodiscus, Hyalogonium, Hyalotheca, Hydrianum, Hydrococcus, Hydrocoleum, Hydrocoryne, Hydrodictyon, Hydrosera, Hydrurus, Hyella, Hymenomonas, Isthmochloron, Johannesbaptistia, Juranyiella, Karayevia, Kathablepharis, Katodinium, Kaphyrion, Keratococcus, Kirchneriella, Klebsormidium, Kolbesia, Koliella, Komarekia, Korshikoviella, Kraskella, Lagerheimia, Lagynion,
Lamprothamnium, Lemanea, Lepocinclis, Leptosira, Lobococcus, Lobocystis, Lobomonas, Luticola, Lynbya, Malleochloris, Mallomonas, Mantoniella, Marssoniella, Martyana,
Mastigocloleus, Gastogloia, Melosira, Merismopedia, Mesostigma, Mesotaenium,
Micractinium, Micrasterias, Microchaete, Microcoleus, Microcystis, Microglena, Micromonas, Microspora, Microthamnion, Mischococcus, Monocrysis, Monodus, Monomastix,
Monoraphidium, Monostroma, Mougeotia, Mougeotiopsis, Myochloris, Myromecia,
Myxocarcina, Naegeliella, Nannochloris, Nautoccus, Navicula, Neglectella, Neidium,
Nephroclamys, Nephrocytium, Nephrodiella, Nephroselmis, Netrium, Nitella, Nitellopsis, Nitzschia, Nodularia, Nostoc, Ochromonas, Oedogonium, Oligochaetophora, Onychonema, Oocadrium, Oocrystis, Opephora, Ophiocytium, Orthoseira, Oscillartoria, Oxyneis,
Pachycladella, Palmella, Palmodictyon, Pnadorina, Pannus, Paralia, Pascherina, Paulshulzia, Pediastrum, Pedinella, Pedinomonas, Pedinopera, Pelagodictyon, Penium, Peranema,
Peridiniopsis, Peridinium, Peronia, Petroneis, Phacotus, Phacus, Phaeaster, Phaeodermatium, Phaeophyta, Phaeoshaera, Phaeothamnion, Phormidium, Phycopeltis, Phyllariochloris, Phyllocadium, Phyllomitas, Pinnilaria, Pitophora, Placoneis, Planctonema, Planktophaeria, Planothidium, Plectonema, Pleodorina, Pleurastrum, Pleurocapsa, Pleurocladia, Pleurodiscus, Pleurosigma, Pleurosira, Pleurotaenium, Pocillomanas, Podohedea, Polyblepharides,
Polychaetophora, Polyedriella, Polyedriopsis, Polygoniochloris, Polyepidomanas, Polytaenia, Polytoma, Polytomella, Porphyridium, Posteriochromonas, Prasinochloris, Prasinocladus, Prasinophyta, Praisola, Prochlorphyta, Psammodictyon, Psammothidium, Pseudanabaena, Pseudenoclonium, Psuedocarteria, Pseudochate, Pseaudocharacium, Pseudococcomyxa, Pseudodictyosphaerium, Pseudokephyrion, Pseudoncobrysa, Pseudoquadrigula,
Pseudophaerocystis, Pseudostaurastrum, Pseudostraurosira, Pyrrophyta, Quadrichloris, Quadricoccus, Quadrigula, Radiocuccus, Radiobetalum, Raphidiopsis, Raphidocelis,
Raphidonema, Raphidophyta, Peimeria, Rhadorderma, Rhabomonas, Rhizoclonium,
Rhodomonas, Rhodiphyta, Rhoicosenia, Rhopalodia, Rivularia, Rosenvingiella, Rossithidium, Roya, Scenedesmus, Scherffelia, Schizochlamydella, Schizochlamys, Schizomeris, Schizothrix, Schroederia, Scolioneis, Scotiella, Scotiellopsis, Scourfieldia, Scytonema, Slenastrum,
Selenochloris, Sellaphora, Semiorbis, Siderocelis, Diderocystopsis, Dimonsenia, Siphononema, Sirocladium, Sirogonium, Skeletonema, Sorastrum, Spermatozopis, Sphaerellocystis,
Sphaerellopsis, Sphaerodinium, Sphaeroplea, Sphaerozosma, Spiniferomonas, Spirogyra, Spirotaenia, Spirulina, Spondylomorum, Spondylosium, Sporotetras, Spumella, Staurastrum, Stauerodesmus, Stauroneis, Staurosira, Staurrosiella, Stenopterobia, Stephanocostis,
Stephanodiscus, Stephanoporos, Stephanoshaera, Stichoccus, Stichogloea, Sigeoclonium, Stigonema, Stipitocuccus, Stokesiella, Stombomonas, Stylochrysalis, Stylodinium, Styloyxis, Stylosphaeridium, Surirella, Sykidion, Symploca, Synechococcus, Synechocystis, Synedra, Synochromonas, Synura, Tabellaria, Tabularia, Teilingia, Temnogametum, Tetmemorus, Tetrachlorella, Tetracyclus, Tetrademus, Tetraedriella, tetraedron, Tetraselmis, Tetraspora, Tetrastrum, Thalassiosira, Thanmiochaete, Thoakochloris, Thorea, Tolypella, Tolypothrix, Trachelomonas, Trachydiscus, Trebouxia, Trentepholia, Treubaria, Tribonema, Trichodesmium, Tricodiscus, Trochiscia, Tryblionella, Ulothrix, Uroglena, Uronema, Urosolenia, Urospora,
Uva, Vacuolaria, Vaucheria, Volvox, Volvulina, Westella, Woloszynskia, Xanthidium,
Xanthophyta, Exencoccus, Zygenema, Zygnemopsis, and Zygonium.
[0070] Specific examples of bacteria cells include Escherichia coli, Oligotropha carboxidovorans, Pseudomononas sp. Alcaligenes eutrophus ( Cupriavidus necator), Bacillus licheniformis, Paenibacillus macerans, Rhodococcus erythropolis, Pseudomonas putida, Lactobacillus plantarum, Enterococcus faecium, Enterococcus gallinarium, Enterococcus faecalis, Bacillus subtilis, Cupriavidus basilensis, Cupriavidus campinensis, Cupriavidus gilardi, Cupriavidus laharsis, Cupriavidus metallidurans, Cupriavidus oxalaticus, Cupriavidus pauculus, Cupriavidus pinatubonensis, Cupriavidus respiraculi and Cupriavidus taiwanensis. In some aspects, the bacterium is Nocardia sp. NRRL 5646, Nocardiafarcinica, Streptomyces griseus, Salinispora arenicola, or Clavibacter michiganenesis.
[0071] The recombinant cell may be a synthetic cell or a cell produced by a synthetic genome, as described in U.S. Patent Publication 2007/0264688, or 2007/0269862. The cell may be a CHO cell, a COS cell, a VERO cell, a BHK cell, a HeLa cell, a Cvl cell, an MDCK cell, a 293 cell, a 3T3 cell, or a PC 12 cell.
[0072] In some aspects, this application discloses a recombinant cell genetically modified to express one or more heterologous enzymes having 3-ketoacyl-CoA synthase activity. The heterologous enzymes having 3-ketoacyl-CoA synthase activity can comprise enzymes naturally found in Nocardia brasiliensis ATCC 700358 (SEQ ID NO: 1) (Designation: Nbra), Streptomyces sp. KO-3988 (SEQ ID NO: 2) (Designation: SK03), Streptomyces cinnamonensis (SEQ ID NO: 3) (Designation: Scin), Streptomyces tendae (SEQ ID NO: 4) (Designation: Sten), Streptomyces sp. KO-3988 (SEQ ID NO: 5) (Designation: SK03-2), Austwickia chelonae NBRC 105200 (SEQ ID NO: 6) (Designation: Ache), Streptomyces sp. Mgl (SEQ ID NO: 7) (Designation: Smgl), Spirochaeta africana DSM 8902 (SEQ ID NO: 8) (Designation: Safr), Streptomyces sp. HK10576 (SEQ ID NO: 9) (Designation: SHK1), Streptomyces violaceusniger Tu 4113 (SEQ ID NO: 10) (Designation: Svio), Amycolatopsis sp. ATCC 39116 (SEQ ID NO: 11) (Designation: AATC), uncultured bacterium (SEQ ID NO: 12) (Designated Ubac), Geobacter sulfurreducens PCA (SEQ ID NO: 13) (Designated Gsul), Acinetobacter baylyi DSM 14961 = CIP 107474 (SEQ ID NO: 14) (Designated Abay-2), and/or Clostridium tetani E88 (SEQ ID NO: 15) Designation: Ctet).
[0073] In some aspects, the heterologous enzyme having 3-ketoacyl-CoA synthase activity comprises the amino acid sequence of SEQ ID NO: 1 and corresponds to the polypeptide designated as Nbra. In other aspects, the heterologous enzyme having 3-ketoacyl- CoA synthase activity comprises a polypeptide having more than 50%, more than 55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, more than 85%, more than 90%, more than 91%, more than 92%, more than 93%, more than 94%, more than 95%, more than 96%, more than 97%, more than 98%, or more than 99% sequence identity to the amino acid sequence of SEQ ID NO: 1.
[0074] In some aspects, a recombinant cell genetically modified to express the polypeptide designated as Nbra produces fatty acids and/or fatty acid chain products. In some aspects, the recombinant cell genetically modified to express the polypeptide designated as Nbra further comprises an engineered biosynthetic pathway to produce fatty acids and/or fatty acid chain products. In some aspects, the recombinant cell genetically modified to express the polypeptide designated as Nbra further comprises an engineered biosynthetic pathway to produce FAME.
[0075] In other cases, the recombinant cell expressing heterologous Nbra and comprising an engineered biosynthetic pathway can also produce a mixture of FAMEs with different carbon chain lengths. For example, the recombinant cell expressing heterologous Nbra can produce a FAME mixture comprising C6 FAME, C8 FAME, C10 FAME, and/or C12 FAME.
[0076] In some cases, the recombinant cell expressing heterologous Nbra and comprising an engineered biosynthetic pathway can catalyze the addition of malonyl-CoA to a C2-CoA substrate to lead to a C4 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C4-CoA substrate to lead to a C6 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C6-C0A substrate to lead to a C8 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C8-C0A substrate to lead to a C10 fatty acid and/or fatty acid chain product, and/or the addition of malonyl-CoA to a ClO-CoA substrate to lead to a C 12 fatty acid and/or fatty acid chain product. In other cases, the recombinant cell expressing heterologous Nbra can also comprise one or more additional heterologous polypeptides with 3-ketoacyl-CoA synthase activity (e.g., a NphT7 polypeptide and/or a NphT7 polypeptide mutant as described herein) to increase production of fatty acids and/or fatty acid chain products.
[0077] In some aspects, the heterologous enzyme having 3-ketoacyl-CoA synthase activity comprises the amino acid sequence of SEQ ID NO: 2 and corresponds to the polypeptide designated as SK03. In other aspects, the heterologous enzyme having 3-ketoacyl- CoA synthase activity comprises a polypeptide having more than 50%, more than 55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, more than 85%, more than 90%, more than 91%, more than 92%, more than 93%, more than 94%, more than 95%, more than 96%, more than 97%, more than 98%, or more than 99% sequence identity to the amino acid sequence of SEQ ID NO: 2.
[0078] In some aspects, a recombinant cell genetically modified to express the polypeptide designated as SK03 produces fatty acids and/or fatty acid chain products. In some aspects, the recombinant cell genetically modified to express the polypeptide designated as SK03 further comprises an engineered biosynthetic pathway to produce fatty acids and/or fatty acid chain products. In some aspects, the recombinant cell genetically modified to express the polypeptide designated as SK03 further comprises an engineered biosynthetic pathway to produce FAME.
[0079] In other cases, the recombinant cell expressing heterologous SK03 and comprising an engineered biosynthetic pathway can also produce a mixture of FAMEs with different carbon chain lengths. For example, the recombinant cell expressing heterologous SK03 can produce a FAME mixture comprising C6 FAME, C8 FAME, C10 FAME, and/or C12 FAME.
[0080] In some cases, the recombinant cell expressing heterologous SK03 and comprising an engineered biosynthetic pathway can catalyze the addition of malonyl-CoA to a C2-CoA substrate to lead to a C4 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C4-CoA substrate to lead to a C6 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C6-C0A substrate to lead to a C8 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C8-C0A substrate to lead to a C10 fatty acid and/or fatty acid chain product, and/or the addition of malonyl-CoA to a ClO-CoA substrate to lead to a C 12 fatty acid and/or fatty acid chain product. In other cases, the recombinant cell expressing heterologous SK03 can also comprise one or more additional heterologous polypeptides with 3-ketoacyl-CoA synthase activity (e.g., a NphT7 polypeptide and/or a NphT7 polypeptide mutant as described herein) to increase production of fatty acids and/or fatty acid chain products.
[0081] In some aspects, the heterologous enzyme having 3-ketoacyl-CoA synthase activity comprises the amino acid sequence of SEQ ID NO: 3 and corresponds to the polypeptide designated as Scin. In other aspects, the heterologous enzyme having 3-ketoacyl- CoA synthase activity comprises a polypeptide having more than 50%, more than 55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, more than 85%, more than 90%, more than 91%, more than 92%, more than 93%, more than 94%, more than 95%, more than 96%, more than 97%, more than 98%, or more than 99% sequence identity to the amino acid sequence of SEQ ID NO: 3.
[0082] In some aspects, a recombinant cell genetically modified to express the polypeptide designated as Scin produces fatty acids and/or fatty acid chain products. In some aspects, the recombinant cell genetically modified to express the polypeptide designated as Scin further comprises an engineered biosynthetic pathway to produce fatty acids and/or fatty acid chain products. In some aspects, the recombinant cell genetically modified to express the polypeptide designated as Scin further comprises an engineered biosynthetic pathway to produce FAME.
[0083] In other cases, the recombinant cell expressing heterologous Scin and comprising an engineered biosynthetic pathway can also produce a mixture of FAMEs with different carbon chain lengths. For example, the recombinant cell expressing heterologous Scin can produce a FAME mixture comprising C6 FAME, C8 FAME, C10 FAME, and C12 FAME.
[0084] In some cases, the recombinant cell expressing heterologous Scin and comprising an engineered biosynthetic pathway can catalyze the addition of malonyl-CoA to a C2-CoA substrate to lead to a C4 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C4-CoA substrate to lead to a C6 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C6-C0A substrate to lead to a C8 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C8-C0A substrate to lead to a C 10 fatty acid and/or fatty acid chain product, and/or the addition of malonyl-CoA to a ClO-CoA substrate to lead to a C12 fatty acid and/or fatty acid chain product. In other cases, the recombinant cell expressing
heterologous Scin can also comprise one or more additional heterologous polypeptides with 3- ketoacyl-CoA synthase activity (e.g., a NphT7 polypeptide and/or a NphT7 polypeptide mutant as described herein) to increase production of fatty acids and/or fatty acid chain products.
[0085] In some aspects, the heterologous enzyme having 3-ketoacyl-CoA synthase activity comprises the amino acid sequence of SEQ ID NO: 4 and corresponds to the polypeptide designated as Sten. In other aspects, the heterologous enzyme having 3-ketoacyl- CoA synthase activity comprises a polypeptide having more than 50%, more than 55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, more than 85%, more than 90%, more than 91%, more than 92%, more than 93%, more than 94%, more than 95%, more than 96%, more than 97%, more than 98%, or more than 99% sequence identity to the amino acid sequence of SEQ ID NO: 4. [0086] In some aspects, a recombinant cell genetically modified to express the polypeptide designated as Sten produces fatty acids and/or fatty acid chain products. In some aspects, the recombinant cell genetically modified to express the polypeptide designated as Sten further comprises an engineered biosynthetic pathway to produce fatty acids and/or fatty acid chain products. In some aspects, the recombinant cell genetically modified to express the polypeptide designated as Sten further comprises an engineered biosynthetic pathway to produce FAME.
[0087] In other cases, the recombinant cell expressing heterologous Sten and comprising an engineered biosynthetic pathway can also produce a mixture of FAMEs with different carbon chain lengths. For example, the recombinant cell expressing heterologous Sten can produce a FAME mixture comprising C6 FAME, C8 FAME, C10 FAME, and C12 FAME.
[0088] In some cases, the recombinant cell expressing heterologous Sten and comprising an engineered biosynthetic pathway can catalyze the addition of malonyl-CoA to a C2-CoA substrate to lead to a C4 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C4-CoA substrate to lead to a C6 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C6-C0A substrate to lead to a C8 fatty acid and/or fatty acid chain product, the addition of malonyl-CoA to a C8-C0A substrate to lead to a C 10 fatty acid and/or fatty acid chain product, and/or the addition of malonyl-CoA to a ClO-CoA substrate to lead to a C12 fatty acid and/or fatty acid chain product. In other cases, the recombinant cell expressing
heterologous Sten can also comprise one or more additional heterologous polypeptides with 3- ketoacyl-CoA synthase activity (e.g., a NphT7 polypeptide and/or a NphT7 polypeptide mutant as described herein) to increase production of fatty acids and/or fatty acid chain products.
[0089] In some aspects, the heterologous enzyme having 3-ketoacyl-CoA synthase activity comprises the amino acid sequence of any one of SEQ ID NOs: 5-15 and corresponds to the polypeptides designated as SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 and Ctet, respectively. In other aspects, the heterologous enzyme having 3-ketoacyl- CoA synthase activity comprises a polypeptide having more than 50%, more than 55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, more than 85%, more than 90%, more than 91%, more than 92%, more than 93%, more than 94%, more than 95%, more than 96%, more than 97%, more than 98%, or more than 99% sequence identity to the amino acid sequence of any one of SEQ ID NOs: 5-15. [0090] In some aspects, a recombinant cell genetically modified to express any one of the polypeptides designated as SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 or Ctet produces fatty acids and/or fatty acid chain products. In some aspects, the recombinant cell genetically modified to express the polypeptide designated as SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 or Ctet further comprises an engineered biosynthetic pathway to produce fatty acids and/or fatty acid chain products. In some aspects, the recombinant cell genetically modified to express the polypeptide designated as SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 or Ctet further comprises an engineered biosynthetic pathway to produce FAME.
[0091] In some aspects, each of the heterologous polypeptides with 3-ketoacyl-CoA synthase activity identified above can comprise one or more identified amino acid substitutions within its amino acid sequence to alter a specific activity against a specific CoA substrate when compared to the respective unsubstituted heterologous polypeptide. These heterologous polypeptides with 3-ketoacyl-CoA synthase activity with identified amino acid substitutions can be considered variants or mutants. For instance, one or more identified amino acid substitutions in the variant can alter the specific activity of the respective heterologous polypeptide with 3- ketoacyl-CoA synthase against a C4-CoA substrate. In other words, the specific activity of the respective variant against a C4-CoA substrate can be increased or decreased when compared to the respective polypeptide with unaltered sequence. Therefore, a variant can comprise increased specific activity for certain CoA substrate and can concomitantly produce a certain 3-keto-CoA intermediate. For example, a variant with increased specific activity for a C6-C0A substrate will produce a 3-keto-C8-CoA product leading to C8 fatty acids and/or C8 fatty acid chain products. Likewise, a variant with decreased specific activity against certain CoA may also be desirable to reduce production of certain unwanted fatty acids and/or fatty acid chain products. For example, a variant with decreased specific activity for a C8-C0A substrate will produce less 3-keto-Cl0- CoA product leading to reduced amounts of C10 fatty acids and/or C10 fatty acid chain products.
[0092] In some aspects, a recombinant cell with a variant with one or more identified amino acid substitutions can comprise increased total FAME production; in other cases, the variant results in decreased total FAME production compared to the wild-type enzyme. A recombinant cell with a variant with one or more identified amino acid substitutions can also comprise altered ratios of FAME production by carbon chain length when compared to the respective polypeptide with unaltered sequence. For example, a recombinant cell with a variant with one or more identified amino acid substitutions can comprise altered ratios of FAME production by carbon chain length of one or more of C6-FAME (methyl hexanoate), C8-FAME (methyl octanoate), C10-FAME (methyl decanoate), and C12-FAME (methyl dodecanoate).
[0093] In some aspects, the identified amino acid substitutions can also comprise one or more amino acid substitutions at amino acid positions of Nbra, SK03, Scin, Sten, SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 and Ctet that align to amino acid positions 147 and/or 217 of Npht7 (see, for example, the sequence alignments in Figs. 1 and 2).
[0094] In some aspects, a Nbra (SV) variant with more than one identified amino acid substitutions can comprise an altered specific activity for C2-CoA, C4-CoA, C6-C0A, and C8- CoA substrates when compared to the respective polypeptide with unaltered sequence. In some aspects, a recombinant cell with a Nbra (SV) variant and an engineered biosynthetic pathway can comprise an increased total FAME production of a mixture of C6-FAME (methyl hexanoate), C8-FAME (methyl octanoate), C10-FAME (methyl decanoate), and C12-FAME (methyl dodecanoate) when compared to the respective polypeptide with unaltered sequence. In some cases, the recombinant cell with a Nbra (SV) variant and an engineered biosynthetic pathway can comprise an increased total FAME production when compared to the respective polypeptide with unaltered sequence. It is noted that the wild-type Nbra with an engineered biosynthetic pathway showed little to no ability to produce FAME.
[0095] In some aspects, a Sten (SV) variant with more than one identified amino acid substitution can comprise an altered specific activity for C2-CoA, C4-CoA, C6-C0A, and C8- CoA substrates when compared to the respective polypeptide with unaltered sequence. In some aspects, a recombinant cell with a Sten (SV) variant can comprise a decrease in total FAME production of a mixture of C6-FAME (methyl hexanoate), C8-FAME (methyl octanoate), C10- FAME (methyl decanoate), and C12-FAME (methyl dodecanoate) when compared to the respective polypeptide with unaltered sequence. In some cases, the recombinant cell with a Sten (SV) variant can comprise a decrease in total FAME production when compared to the respective polypeptide with unaltered sequence. It is noted that a recombinant cell with a Sten (SV) variant produces little to no FAME.
[0096] In some aspects, a SK03 (SV) variant with more than one identified amino acid substitution can comprise an altered specific activity for C2-CoA, C4-CoA, C6-C0A, and C8- CoA substrates when compared to the respective polypeptide with unaltered sequence. In some aspects, a recombinant cell with a SK03 (SV) variant and an engineered biosynthetic pathway can comprise an increased total FAME production of a mixture of C6-FAME (methyl hexanoate), C8-FAME (methyl octanoate), C10-FAME (methyl decanoate), and C12-FAME (methyl dodecanoate) when compared to the respective polypeptide with unaltered sequence. In some cases, the recombinant cell with a SK03 (SV) variant can comprise an increased total FAME production when compared to the respective polypeptide with unaltered sequence.
[0097] In some aspects, a Scin (SV) variant with more than one identified amino acid substitution can comprise an altered specific activity for C2-CoA, C4-CoA, C6-C0A, and C8- CoA substrates when compared to the respective polypeptide with unaltered sequence. In some aspects, a recombinant cell with a Scin (SV) variant and an engineered biosynthetic pathway can comprise a decreased total FAME production of a mixture of C6-FAME (methyl hexanoate), C8-FAME (methyl octanoate), C10-FAME (methyl decanoate), and C12-FAME (methyl dodecanoate) when compared to the respective polypeptide with unaltered sequence. In some cases, the recombinant cell with a Scin (SV) variant can comprise a decreased total FAME production when compared to the respective polypeptide with unaltered sequence.
[0098] In some aspects, the recombinant cell can comprise more than one heterologous enzyme having 3-ketoacyl-CoA synthase activity. In other aspects, the recombinant cell can comprise one or more of Nbra, SK03, Scin, Sten, SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 and Ctet, variant(s) corresponding to I147S and/or F217V (NphT7 numbering) of Nbra, SK03, Scin, Sten, SK03-2, Ache, Smgl, Safr, SHK1, Svio, ATTC, Ubac, Gsul, Abay-2 and Ctet, and additional enzymes having 3-ketoacyl-CoA synthase activity (e.g., NphT7 (SEQ ID NO: 16), NphT7(SV) variant (SEQ ID NO: 17), and NphT7(LSVA) variant (SEQ ID NO: 33)). In some cases, the recombinant cell can comprise heterologous 3-ketoacyl- CoA synthase enzymes having complementary specific activities. For example, a heterologous NphT7 comprising high specific activity for C2-CoA substrates can be combined with another heterologous 3-ketoacyl-CoA synthase enzyme. Likewise, a heterologous NphT7(LSVA) variant comprising high specific activity for C4-CoA substrates can be combined with another heterologous 3-ketoacyl-CoA synthase enzyme. In other words, heterologous enzymes with certain high specific activities (e.g., for C2-CoA substrates and/or C4-CoA substrates) can be paired with heterologous enzymes having other high specific activities (e.g., for C6-C0A substrates and/or C8-C0A substrates). [0099] In some aspects, a recombinant cell comprising heterologous Nbra (SV) variant, with and without heterologous NphT7 comprises an increase in total FAME production compared to a recombinant cell comprising heterologous wild-type Nbra, with or without heterologous NphT7.
[0100] In some aspects, a recombinant cell comprising heterologous Sten and heterologous NphT7 comprises increased total FAME production compared to a recombinant cell comprising heterologous Sten and no heterologous NphT7.
[0101] In some aspects, a recombinant cell comprising heterologous Sten (SV) variant and heterologous NphT7 comprises a decrease in total FAME production compared to a recombinant cell comprising heterologous wild-type Sten and heterologous NphT7. In the absence of NphT7, no FAME was produced from a recombinant cell comprising the heterologous Sten (SV) mutant.
[0102] In some aspects, a recombinant cell comprising heterologous SK03, with and without NphT7, comprises FAME production. In some aspects, a recombinant cell comprising heterologous SK03 (SV) variant, with and without heterologous NphT7, comprises FAME production.
[0103] In some aspects, a recombinant cell comprising heterologous Scin, with and wthout NphT7, comprises FAME production. In some aspects, a recombinant cell comprising heterologous Scin (SV) variant, with and without heterologous NphT7, comprises FAME production.
Engineered Biosynthetic Pathway Reaction Steps
[0104] In some aspects, this application discloses a recombinant cell with an engineered biosynthetic pathway comprising one or more heterologous enzymes that convert chemical precursors and/or substrates into desired chemical products. In some cases, the engineered biosynthetic pathway comprises a pathway of heterologous enzymes to synthesize the desired chemical product. In some instances, individual heterologous enzymes work in a stepwise fashion to convert a precursor into the desired chemical product. The engineered biosynthetic pathway can comprise one or more identified heterologous enzymes having 3-ketoacyl-CoA synthase activity to produce fatty acids and fatty acid chain products of C4-C10 chain length. In other words, the engineered biosynthetic pathway can comprise one or more of Nbra, SK03, Scin, Sten, SK03-2, Ache, Smgl, Safr, SHK1, Svio AATC, Ubac, Gsul, Abay-2, Ctet, and/or their respective variants to produce fatty acids and fatty acid chain products of C4-C10 chain length. The engineered biosynthetic pathway can also comprise additional heterologous enzyme(s) that work in combination with one or more of Nbra, SK03, Scin, Sten, SK03-2, Ache, Smgl, Safr, SHK1, Svio AATC, Ubac, Gsul, Abay-2, Ctet, and/or their respective variants to produce fatty acids and fatty acid chain products of C4-C10 chain length.
[0105] In some aspects, the recombinant cell comprises one or more of Nbra, SK03,
Scin, Sten, SK03-2, Ache, Smgl, Safr, SHK1, Svio AATC, Ubac, Gsul, Abay-2, Ctet, and/or their respective variants to catalyze the reaction of acyl-CoA with malonyl-CoA to produce intermediates to produce fatty acids and fatty acid chain products of C4-C10 chain length. However, the reaction of acyl-CoA with malonyl-CoA produces a 3-ketoacyl-CoA compound that must be reduced to the corresponding acyl compound before it can condense with another molecule of malonyl-CoA to extend the chain. The reduction takes place in three steps, the first being the reduction of the 3-ketoacyl group to the corresponding 3-hydroxyacyl group. The second reaction is a dehydration to the corresponding trans-2-enoylacyl compound, which is reduced in a third step to the corresponding acyl-CoA. The first reaction step is enzymatically catalyzed by a keto-CoA reductase (KCR) enzyme (EC 1.1.1.35). The second step is enzymatically catalyzed by a 3-hydroxy-acyl-CoA dehydratase (3HDh) enzyme (EC 4.2.1.17). Some bifunctional enzymes catalyze both of the first and second step reactions (EC 1.1.1.35 and EC 4.2.1.55). The third reaction step is enzymatically catalyzed by an enoyl-CoA reductase (ECR) enzyme (EC 1.1.1.32).
[0106] Accordingly, the engineered biosynthetic pathway can further comprises at least one of (1) a heterologous KCR gene that encodes for a KCR enzyme; (2) a heterologous 3HDh gene that encodes for a 3HDh enzyme; (3) a heterologous gene that encodes for a bifunctional enzyme that catalyzes both of the first and second reaction steps (EC 1.1.1.35 and 4.1.2.55) and (4) a heterologous ECR gene that encodes for an ECR enzyme. In some aspects, the recombinant cell contains at least (1), (2) and (4) or at least (3) and (4). In some aspects, the recombinant cell further comprises (5) at least one heterologous 3-ketoacy-CoA synthase gene, different from the 3-ketoacyl-CoA synthases described above, which encodes for a 3-ketoacyl- CoA synthase. In each case, the gene can be under the control of a promoter and/or terminator sequences active in the recombinant cell.
[0107] The KCR enzyme may be, for example, one encoded by a P. aeruginosa fadB gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 36, one encoded by a P. aeruginosa fadG gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 102, one encoded by a C. beijerinckii hbd gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 101, and others as described in WO 2015/010103.
[0108] The 3HDh enzyme may be, for example, one encoded by a C. acetobutylicum crt
(short-chain-enoyl-CoA hydratase) gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 99, one encoded by a P. putida ech (enoyl-CoA hydratase/aldolase) gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 100, and others as described in WO 2015/010103.
[0109] Suitable bifunctional enzymes that catalyze both the first and second reaction steps include, for example, one encoded by an E. colifadB gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 98; one encoded by an R. novegicus ech2 gene, and others as described in WO 2015/010103.
[0110] Suitable ECR enzymes include, for example, one encoded by a T denticola ter gene and/or having an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 84.
[0111] The recombinant cell can further include at least one heterologous 3-ketobutyryl-
CoA synthase gene, different from the modified 3-ketoacyl-CoA synthases described above, which encodes for a 3-ketobutyryl-CoA synthase. The heterologous 3-ketobutyryl-CoA synthase gene may encode for a 3-ketobutyryl-CoA synthase enzyme that is at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to any of those identified as SEQ ID NOs: 1-120 of WO 2015/10103.
[0112] In some aspects, the heterologous 3-ketobutyryl-CoA synthase gene is a
Streptomyces Sp CL190 gene and/or a gene that encodes for an NphT7 enzyme that is at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 16.
[0113] In some aspects, the recombinant cell includes at least one gene that encodes for a modified NphT7 enzyme as described in WO 2015/10103. The modified NphT7 enzyme comprises an amino acid sequence having at least 70% but less than 100% sequence identity to SEQ ID NO: 16. The modified NphT7 enzyme may have, for example, one or more amino acid substitutions selected from the group consisting of H100L, I147T, F217V, Y 144L, V157F, G309S, G288S, a PDRP to HFLQ substitution for amino acid residues 86-89, 1147F, I147M, I147Q, I147S, I147C, 1147E, I147N, I147W, I147D, I147R, I147P, I147L, V196G, I147G, I147H, I147K, I147V, 1147 A, I147Y, F217G, F217A, F217L, F217I, F217M, F217T, F217P, F217S, F217E, F217L, F217V, F217W, S323A and S323V, and any combination of any two or more thereof.
[0114] In some aspects, the modified NphT7 enzyme comprises at least one amino acid substitution selected from the group consisting of I147V, I147S, I147T, and at least one additional amino acid substitution selected from H100L, F217V, S323A and S323V. In some aspects, the modified NphT7 enzyme corresponds to SEQ ID NO: 33. In some aspects, the modified NphT7 enzyme comprises an I147V, I147S or I147T amino acid substitution and an S323A amino acid substitution (corresponding to SEQ ID NO: 33 in which amino acid 100 is H, amino acid 147 is V, S or T, amino acid 217 is F and amino acid 323 is A). In some aspects, the modified NphT7 enzyme comprises an H100L substitution, an I147V, I147S or I147T amino acid substitution, an F217V substitution and an S323A amino acid substitution (corresponding to SEQ ID NO: 33 in which amino acid residue 100 is L, amino acid residue 147 is V, S or T, amino acid residue 217 is V and amino acid residue 323 is A).
[0115] In some aspects, the recombinant cell includes both of (1) a Streptomyces Sp
CL190 nphT7 gene and/or a gene that encodes for an NphT7 enzyme that is at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 16 and (2) a modified NphT7 enzyme having one or more amino acid substitutions selected from the group consisting of H100L, I147T, F217V, Y144L, V157F, G309S, G288S, a PDRP to HFLQ substitution for amino acid residues 86-89, 1147F, I147M, I147Q, I147S, I147C, 1147E, I147N, I147W, I147D, I147R, I147P, I147L, V196G, I147G, I147H, I147K, I147V, I147A, I147Y, F217G, F217A, F217L, F217I, F217M, F217T, F217P, F217S, F217E, F217L, F217V, F217W, S323A and S323V, and any combination of any two or more thereof. In other aspects, the recombinant cell includes a gene that encodes for an enzyme having SEQ ID NO: 16 and another gene that encodes for an enzyme having SEQ ID NO: 33. In some aspects, the recombinant cell includes a gene that encodes for an enzyme having SEQ ID NO: 16 and another gene that encodes for an enzyme having SEQ ID NO: 33 in which amino acid residue 100 is H or L, amino acid residue 147 is S, T or V, amino acid residue 271 is F or V and amino acid residue 323 is A.
[0116] The recombinant cell can further include at least one heterologous 3-ketoacy-
CoA synthase gene, different from the 3-ketoacyl-CoA synthases described above, which encodes for a 3-ketoacyl-CoA synthase. The heterologous 3-ketoacyl-CoA synthase gene may encode for a 3-ketoacyl-CoA synthase enzyme that is at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to any of those identified as SEQ ID NO: 1-81, 86-93, 97, 105-116, and 119-171 in PCT/US 18/16394.
[0117] In some aspects, the 3-ketoacyl-CoA synthase gene is an Acinetobacter schindleri
CIP 107287 gene (Designation: Asch) and/or a gene that encodes for a 3-ketoacyl-CoA synthase enzyme that is at least 80%, at least 90%, at least 95%, at least 99% or 100% identical to SEQ ID NO: 18-20, 22 and/or 26.
[0118] In some aspects, the recombinant cell includes at least one gene that encodes for a modified Asch enzyme as described in PCT/US18/16394. The modified Asch enzyme comprises an amino acid sequence having at least 70% but less than 100% sequence identity to SEQ ID NO: 26. The modified Asch enzyme may have, for example, one or more amino acid substitutions selected from the group consisting of T184I, F236L, V268A, V296A, V317A, and S328G, and any combination of any two or more thereof. The modified Asch enzyme may have altered specific activities relative to an unmodified Asch enzyme. For example, a modified Asch enzyme may have amino acid substitutions comprising T184I, F236L, V268A, V296A, V317A, and S328G (SEQ ID NO: 19) and may result in C8 FAME as the most abundant product. A modified Asch enzyme may have amino acid substitutions comprising T184I, V296A, and V268A (SEQ ID NO: 22) and may result in a mixture of C8 FAME and C10 FAME as the most abundant products. A modified Asch enzyme may have amino acid substitutions comprising V296A (SEQ ID NO: 18) and may result in C10 FAME as the most abundant product.
[0119] The recombinant cell can produce one or more enzymes that terminate the acyl elongation cycle and produce a product having the desired chain length. Such a termination enzyme may or may not be heterologous. The selection of termination enzyme may depend on whether the desired product is a fatty acid or a derivative thereof such as a fatty alcohol, a fatty aldehyde, a fatty alkene, a fatty amide, a fatty ester or a fatty alkane.
[0120] The recombinant cell in some aspects includes a heterologous thioesterase gene that encodes for a thioesterase such as an acyl-CoA esterase, in which case the product will be a fatty acid. Suitable thioesterases include those described in Table 11 of WO 2015/101013.
[0121] In some aspects the recombinant cell includes a gene that encodes for an ester synthase, in which case the product typically is a fatty acid ester. Suitable ester synthases have amino acid sequences at least 80%, at least 90%, at least 95%, at least 99% or at least 100% identical to any of the Marinobacter aquacolei Maql enzyme (SEQ ID NO: 289 of WO
2015/10103), the Psychrobacter cryohaloentis Pcryl enzyme (SEQ ID NO: 290 of WO
2015/10103), the Rhodococcus jostii Rjosl enzyme (SEQ ID NO: 291 of WO 2015/10103), the Alcanivorax borkumensis strain SK2 Aborkl enzyme (SEQ ID NO: 292 of WO 2015/10103) and the Hahella chejuensis hche gene (SEQ ID NO: 37). The ester synthase may have an amino acid sequence at least 80%, at least 90%, at least 95%, at least 99% or at least 100% identical to the Hahella chejuensis Hche ester synthase (SEQ ID NO: 37).
[0122] The recombinant cell may also include one or more genes that encode for one or more of a fatty acyl-CoA reductase (alcohol or aldehyde forming), a fatty aldehyde reductase, an acyl-ACP reductase, an acyl-CoA:ACP acyltransferase, an acyl-CoA hydrolase, a carboxylic acid reductase, an aldehyde dehydrogenase and/or an acyl-ACP reductase.
[0123] The recombinant cell also may include (A) one or more genes that encode for a carboxyl transferase subunit a enzyme, (EC 6.3.1.2) such as an E. coli AccA enzyme or an enzyme that is at least 80%, at least 90%, at least 95% or at least 99% identical thereto; (B) one or more genes that encode for a biotin carboxyl carrier protein, (EC 6.4.1.2) such as an E. coli AccB enzyme (SEQ ID NO: 29) or an enzyme that is at least 80%, at least 90%, at least 95% or at least 99% identical thereto; (C) one or more genes that encode for a biotin carboxylase subunit enzyme, (EC 6.3.4.14) such as an E. coli AccC enzyme (SEQ ID NO: 30) or an enzyme that is at least 80%, at least 90%, at least 95% or at least 99% identical thereto; (D) a carboxyl transferase subunit b (EC 6.4.1.2), such as an E. coli AccD enzyme or an enzyme that is at least 80%, at least 90%, at least 95% or at least 99% identical thereto, (E) a fused E. coli AccD subunit and AccA subunit enzyme (SEQ ID NO: 28); or a combination of any two or more thereof. In some aspects, all of (A) - (E) are present.
[0124] In some aspects, the recombinant cell described herein further comprises one or more additional genetic modifications to reduce or eliminate the expression of certain endogenous enzymes in the recombinant cell. Reducing or eliminating the expression of these certain endogenous enzymes in the recombinant cell can increase the production of desired products, such as fatty acids and/or fatty acid chain products. These reduced or eliminated endogenous enzymes include one or more of the following enzymes:
Methylglyoxal synthase (EC 4.2.3.3), for example that encoded by the E. coli mgsA gene.
Lactate dehydrogenase (EC 1.1.1.27), for example that encoded by the E. coli IdhA gene. Phosphotransacetylase (EC 2.3.1.8), for example that encoded by the E. coli pta gene.
Acetate kinase (EC 2.7.2.1), for example that encoded by E. coli ackA gene.
Acyl-CoA synthase (EC 6.2.1.3), for example that encoded by the E. colifadD gene.
Pyruvate formate lyase (EC 2.3.1.54), for example that encoded by the E.coli pflB gene.
Pyruvate oxidase (EC 1.2.2.2), for example that encoded by the E. coli poxB gene.
Fused acetaldehyde-CoA dehydrogense (EC 1.2.1.10), for example that encoded by the E. coli adhE gene.
Trigger factor (EC 5.2.1.8), for example that encoded by the E. coli tig gene.
Restriction endonuclease (EC 3.1.21.3), for example that encoded by the E. coli hsdr514 gene.
The atoDAEB operon.
Acyl-CoA thioesterase (EC 3.1.2.-), for example that encoded by the E. coli tesB or yciA gene.
Acyl-coenzyme A dehydrogenase (EC 1.3.8.7), for example that encoded by the E. coli fadE gene.
3-ketoacyl-CoA thiolase (EC 2.3.1.16), for example that encoded by the E. colifadA gene.
L-ribulokinase (EC2.7.1.16), for example that encoded by the E. coli araB gene.
L-ribulose-5-phosphate-4-epimerase (EC 5.1.3.4), for example that encoded by the E. coli araD gene.
Beta-D-galactosidase (EC 3.2.1.23), for example that encoded by the E.coli lacZ gene. Lambda phage lysogen.
Rhamnulose-l -phosphate aldolase (EC 4.1.2.19), for example that encoded by the E. coli rhaD gene.
Rhamnulokinase (EC 2.7.1.5), for example that encoded by the E. coli rhaB gene.
F mating factor.
Truncated RNase PH (EC 2.7.7.56) Rph-1 gene.
[0125] Other genetic modifications may be present in the recombinant cell, including any of those described in WO 2015/10103.
[0126] Any heterologous gene may be operatively linked to a promoter and/or terminator sequence that is functional in the recombinant strain. The promoter may be an inducible promoter that functions only under certain conditions. For example, a low phosphate inducible promoter such as the promoter of the wild-type E. coli phoE gene (PphoE) is a useful promoter for the 3-ketoacyl synthase gene. Such a promoter is active in a low phosphate environment. Accordingly, a recombinant cell in which the 3-ketoacyl synthase gene is under the control of an E. coli phoE promoter or another low phosphate inducible promoter may be cultivated in a fermentation medium containing no more than 25 mM phosphate, especially no more than 20 mM, no more than 2 mM, no more than lmM, no more than 0.5 mM, or no more than 0.25 mM phosphate. In some aspects, the promoter that is a low phosphate inducible promoter is the promoter for the pstS gene ( PpstS ). This promoter may be constructed to include a binding site for Integration Host Factor and is thus designated PpstSIH (Lyzen et ak, Plasmid 60:125 (2008)).
[0127] Any heterologous gene may be integrated into the genome of the recombinant strain and/or present in one or more plasmids. If integrated into the genome, the heterologous gene may be inserted at a targeted or random location. Transformation methods such as electroporation and chemical methods (including calcium chloride and/or lithium acetate methods) known in the art are suitable. Examples of suitable transformation methods are described, for example, in Molecular Cloning: A Laboratory Manual, 4th Ed. Spring Harbor Press 2012. In general, no special transformation methods are necessary to produce the recombinant cells.
[0128] Deletions and/or disruptions of native genes can be performed by transformation methods, by mutagenesis and/or by forced evolution methods. In mutagenesis methods, cells are exposed to ultraviolet radiation or a mutagenic substance, under conditions sufficient to achieve a high kill rate (60-99.9%, preferably 90-99.9%) of the cells. Surviving cells are then plated and selected or screened for cells having the deleted or disrupted metabolic activity. Disruption or deletion of the desired native gene(s) can be confirmed through PCR or Southern analysis methods.
[0129] The recombinant cells described herein are used to produce compounds having a straight-chain alkyl group. The recombinant cells are grown under conditions such that they produce such compounds, and the compounds may be recovered.
[0130] When the recombinant cell is a plant cell, the plant can be grown and the compound having the straight-chain alkyl group can be recovered from the plant or any portion thereof, such as roots, stems, leaves, flowers, seeds, seed pods and the like, in which the compound accumulates during the growth of the plant. [0131] Single-cell and other microcells can be used in a culturing process to produce such compounds.
[0132] Culturing is performed generally by forming a culture medium that includes at least one carbon source that is capable of being metabolized by the recombinant cell to produce the product compounds and nutrients as may be required by the specific recombinant cell. The nutrients may include, for example, at least one nitrogen source such as yeast extract, peptone, tryptone, soy flour, com steep liquor, or casein, at least one phosphorus source, one or more vitamins such as biotin, vitamin B12 and derivatives of vitamin B 12, thiamin, pantothenate, one or more trace metals and the like. The fermentation medium may also contain additional materials such as anti-foam agents, biocides, buffers and the like.
[0133] In some cases, such as the production of fatty acid esters, the culture medium may also include a reagent that reacts with the straight-chain compound to produce the desired product. In the specific case of fatty acid esters, for example the culture medium preferably contains an alkanol such as methanol, ethanol or a C3-C8 alkanol. The alkanol reacts to produce the corresponding ester. A native or heterologous ester synthase, or other appropriate enzyme, may be expressed by the recombinant cell to catalyze such a reaction.
[0134] Generally, the culture medium is inoculated with the recombinant cell, and the inoculum is cultured in the medium so that the cell density increases to a cell density suitable for production. The culture medium is then maintained at conditions sufficient for the recombinant cells to produce the desired product.
[0135] Suitable culture conditions will of course depend on the requirements of the particular recombinant strain. The temperature of the culture medium may be, for example from 20°C to 70°C, with a temperature of 25 to 40°C being preferred for most recombinant cells.
[0136] The pH of the culture medium may be, for example, from 2.0 to 10.0, from 3.0 to
9.0 or from 6.0 to pH 8.5.
[0137] It is contemplated that the described aspects may be practiced using either batch, fed-batch or continuous processes and that any known mode of bio-production would be suitable.
[0138] The culturing may be performed under aerobic, microaerobic, or anaerobic conditions, as required or can be tolerated by the particular recombinant cell. Generally, no special culturing equipment is needed to perform the fermentation. The equipment may include, for example, a tank suitable for holding the recombinant cell and the culture medium; a line for discharging contents from the culture tank to an extraction and/or separation vessel; and an extraction and/or separation vessel suitable for removal of the chemical product from cell culture waste.
[0139] The carbon source is one or more carbon-containing compounds that can be metabolized by the recombinant cell as a source of carbon. Examples of suitable carbon sources include sugars such as glucose, sucrose, fructose, lactose, C-5 sugars such as xylose and arabinose, glycerol and polysaccharides such as starch and cellulose. Other suitable carbon sources include fermentable sugars as may be obtained from cellulosic and lignocellulosic biomass through processes of pretreatment and saccharification, as described, for example, in U.S. Patent Publication No. 2007/0031918A1, hemicellulose, lignin, starch, oligosaccharides and/or monosaccharides. Other suitable carbon sources include high-fructose com syrup, cheese whey permeate, cornsteep liquor, sugar beet molasses, and barley malt. Still other suitable carbon sources include carbon dioxide, carbon monoxide, methanol, methylamine and glucosamine.
[0140] The culturing process may be continued until a titer of the desired product reaches at least 0.01, at least 0.05, at least 0.1, at least 0.25, at least 0.5 or at least 1 g per liter of culture medium (g/L). The fermentation process may be continued until the titer reaches, for example, up to 40, up to 45, up to 50, up to 80, up to 100, or up to 120 g/L. The specific productivity may be, for example, from 0.01 and 0.60 grams of the desired product per gram of cells on a dry weight basis per hour (g chemical product/g DCW-hr). The volumetric productivity achieved may be at least 0.005 g of the desired product per liter per hour (g/L-hr), at least 0.01 g/L-hr, at least 0.1 g/L-hr or at least 0.5 g/L-hr, and may be up to, for example, 10 g/L-hr, up to 5 g/L-hr or up to 1 g/L-hr.
[0141] In some aspects, specific productivity as measured over a 24-hour fermentation
(culture) period may be greater than about 0.01, 0.05, 0.10, 0.20, 0.5, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0, 11.0 or 12.0 grams of chemical product per gram DCW of cells (based on the final DCW at the end of the 24-hour period).
EXAMPLES
[0142] The following examples are provided to illustrate the disclosure, but are not intended to limit the scope thereof. All parts and percentages are by weight unless otherwise indicated. Overview
[0143] The current FAME pathway takes acetyl-CoA and condenses it with a malonyl-
CoA to create acetoacetyl-CoA via the enzyme NphT7. After subsequent keto reduction, dehydration and enoyl reduction, the saturated C4-CoA is then reacted with malonyl-CoA to create a C6-C0A species that also undergoes iterative keto reduction, dehydration and enoyl reduction to make a saturated C6-C0A. The enzyme catalyzing this second reaction is also NphT7, but a variant that has been engineered to accept the saturated C4-CoA by mutagenesis, called NphT7(LSVA). Together these two enzymes can initiate FAME biosynthesis, however only the LSVA variant is is needed for FAME production. In this sense, it is necessary for only one version of NphT7 to be present, NphT7(LSVA), since this variant can extend from a C2- CoA to a C6-C0A molecule, whereas the wild-type enzyme can only extend from C2-CoA to C4-CoA molecule. The pathway typically benefits from expression of wild type NphT7, however it is not required. In addition to NphT7 and/or NphT7(LSVA), the current FAME pathway benefits from an additional 3-ketoacyl-CoA synthase, provided by heterologous Asch or one of its variants engineered for increased chain length specificity compared to wildtype Asch.
[0144] Provided herein are enzymes with similar activity to NphT7, but with different substrate specificities.
[0145] Also provided herein are enzymes which can natively catalyze the C2-CoA to
C6-C0A reaction without modification (Sten, Sko3, Scin, plus others that show in vitro activity on C2-CoA and C4-CoA), which is a function not seen in wild type NphT7.
[0146] A subset of the same mutations introduced into NphT7 that broaden the substrate specificity and extend the catalytic activity from C2-CoA to C6-C0A can also be introduced into the enzmyes described herein. These two mutations (called SV) enable some enzymes that otherwise show no ablity to produce FAME to now produce FAME in the context of the pathway, and in some cases, behave similarly to NphT7(LSVA), an example of such is provided by Nbra. Thus, the SV mutations are what allow the enzymes to function in the FAME pathway. However, not all of the enzymes behave in this fashion, for example, wild type Sten produces some FAME; however, as a SV mutant it does not.
[0147] Also described herein are enzymes that have the highest activity on C2-CoA, as well as specific enzymes that have the highest activity on C4-CoA. [0148] Further described are the SV mutations to change the substrate specificity of the enzymes. Although enzymes can have activity on both substrates, C2-CoA and C4-CoA, the mutations can change the relative activity on each substrate and as a consequence influence total FAME production (and the amounts of each individual FAME that is produced).
Example 1.
[0149] The following recombinant E. coli strains were used in the following examples as indicated.
[0150] Recombinant Strain 1 is a mutant of the E. coli strain designated BW25113, available from the E. coli Genetic Strain Center (CGSC#7636; Dept of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, Connecticut), having the following additional genetic modifications:
Figure imgf000041_0001
[0151] Recombinant Strain 2 is a mutant of the BW25113 E. coli strain with the following genetic modifications:
Figure imgf000041_0002
Figure imgf000042_0001
[0152] Recombinant Strain 3 is a mutant of the BW25113 E. coli strain with the following genetic modifications:
Figure imgf000042_0002
[0153] Recombinant Strain 4 is a mutant of the BW25113 E. coli strain with the following genetic modifications:
Figure imgf000042_0003
Figure imgf000043_0001
[0154] The J ollowing recombinant plasmids were used in the following examples as indicated.
[0155] Mutant E. coli strains are prepared using standard electroporation methods. In each case, the host strain is transformed with a“Type 1” plasmid and a“Type 2” plasmid as described below.
[0156] Type 1 plasmids are pACYC plasmids containing the pl5a origin of replication and a chloramphenicol resistance marker:
[0157] Type 1A: This plasmid includes a gene encoding for a 3-ketoacyl-CoA synthase or a mutated variant. The 3-ketoacyl-CoA synthase genes are as indicated in the specific examples below. This plasmid includes an E. coli bifunctional 3-hydroxyacyl-CoA
dehydrogenase/dehydratase (fadB) gene (SEQ. ID NO. 36) and a T. denticola enoyl-CoA (ter) gene (SEQ. ID NO. 34) cassette, all under a native E. coli pstSIH promoter (SEQ. ID NO. 39) and a native E. coli terminator. This plasmid also contains a Hahella chejuensis ester synthase gene (SEQ. ID NO. 37) fused to a DNA sequence encoding a protein fragment containing 6 histidine residues and a protease recognition site under an E. coli phoE promoter (SEQ. ID NO. 38), and an ACC (acetyl-CoA carboxylase) cassette including fused E. coli accA and accD genes with a E. coli tpiA promoter (SEQ ID NO: 31) and a cassette including the E. coli accB and E. coli accC genes under an E. coli rpiA promoter (SEQ ID NO: 32).
[0158] Type 1B: This plasmid includes a gene encoding for a 3-ketoacyl-CoA synthase or a mutated variant. The 3-ketoacyl-CoA synthase genes are as indicated in the specific examples below. This plasmid also includes an E. coli bifunctional 3-hydroxyacyl-CoA dehydrogenase/dehydratase (fadB) gene (SEQ ID NO: 36) and a T denticola enoyl-CoA (ter) gene (SEQ ID NO: 34) cassette, all under a native E. coli pstSIH promoter (SEQ ID NO: 39) and a native E. coli terminator.
[0159] Type 2 plasmids are pET Plasmids containing a ColEl origin of replication and a kanamycin resistance marker:
[0160] Type 2A: This plasmid includes a gene encoding Asch or a variant of Asch with various ratios of C8 and C10 FAME production under an E. coli phoE promoter (SEQ ID NO: 38) and an E. coli terminator. The Asch genes are as indicated in the specific examples below. [0161] Type 2B: This plasmid includes a gene encoding Asch or a variant of Asch with various ratios of C8 and C10 FAME production fused to a DNA sequence encoding a protein fragment containing 6 histidine residues and a protease recognition site. The Asch genes are as indicated in the specific examples below. Additionally, this plasmid also contains a Hahella chejuensis ester synthase gene (SEQ ID NO: 37) fused to a DNA sequence encoding a protein fragment containing 6 histidine residues and a protease recognition site. Both genes are under an E. coli phoE promoter (SEQ ID NO: 38).
[0162] 3-ketoacyl-CoA synthase genes are synthesized based on published polypeptide sequence information for various wild type enzymes, and using codons optimized for expression in E. coli. Site-specific mutants of the synthesized 3-ketoacyl-CoA synthase genes are generated by oligonucleotide-directed mutagenesis. The sources of the wild-type genes, the short-hand designations used herein for each of them, and the amino acid sequence of the native enzyme produced by the wild-type genes are as follows:
Figure imgf000044_0001
Figure imgf000045_0001
*GenInfo Identifier (GI) refers to sequence identification numbers used by the GenBank genetic sequence database at the National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, Maryland, USA.
[0163] Mutations to the amino acid residues encoded by the wild- type genes are designated herein by the shorthand designation for the wild-type enzyme, followed in parentheses by a 3-, 4- or 5 character code consisting of a first letter designating the amino acid residue in the native enzyme, a 1-, 2- or 3-digit number indicating the position of that amino acid residue in the native enzyme, and a final letter designating the amino acid residue in that position in the mutated enzyme. Alternatively, mutations to the amino acid residues encoded by the wild-type genes can also be designated herein by the shorthand designation for the wild-type enzyme, followed in parenthesis by the final letter or letters designating the amino acid residue in that was mutated. The single-letter designations are IUPAC amino acid abbreviations as reported, for example, at Eur. J. Biochem. 138:9-37(1984).
[0164] Production of recombinant E. coli strains:
[0165] Recombinant E. coli strains are prepared using standard electroporation methods.
In each case, one of the above described recombinant strains (e.g. Recombinant Strain 1, Recombinant Strain 2, Recombinant Strain 3 or Recombinant Strain 4) is transformed with a “Type 1” plasmid (e.g. Type 1A or Type 1B) and an above-described“Type 2” plasmid (e.g. Type 2A or Type 2B). For each Type 2 plasmid, the 3-ketoacyl-CoA synthase and the promoter for the 3-ketoacyl-CoA synthase gene are as indicated in the specific examples below.
[0166] Small scale fermentation method - shake flask protocol
[0167] A culture of synthetic medium containing salts, glucose, NFLCl, and
supplemented with vitamins, yeast extract, 35 pg/ml kanamycin and 20 pg/ml chloramphenicol is inoculated with the strain to be tested and grown overnight at 32°C. The ODeoo of a 1: 10 dilution of this culture is determined in order to inoculate a Seed 2 flask to a final ODeoo of 0.3. Seed 2 flasks contain 25-30 ml synthetic medium containing salts, glucose, NH4Cl, and is supplemented with vitamins, yeast extract, 35 pg/ml kanamycin, 20 pg/ml chloramphenicol, and 0-2% methanol. Seed 2 flasks are incubated at 32°C for 6-7 hours.
[0168] The ODeoo of a 1: 10 dilution of this culture is determined in order to inoculate a production flask to a final ODeoo of 0.01-0.025. Production flasks contain 25 ml synthetic medium containing salts, glucose, NH4Cl, and supplemented with vitamins, yeast extract, 35 pg/ml kanamycin, 20 pg/ml chloramphenicol, 1.25-2.5 mM phosphate, 0-2% methanol, 3 - 6g/l glucose, and 15-40 g/l glycerol. This synthetic medium functions as a limited phosphate medium that promotes the activity of a low phosphate inducible promoter such as the E. coli phoE promoter (SEQ. ID NO. 38) or E. coli pstSIH promoter (SEQ ID NO: 39). Production flasks are incubated at 32°C. When phosphate is depleted the following additions are made: 2 ml methyl tetradecanoate, 1 ml 12.5% Polysorbate 80 (sold under the trademark name Tween 80), 1-1.25 ml 50% glycerol. Methanol (0-0.5 ml) is added at the beginning of the production assay or right after phosphate depletion. Additional methanol can be added after phosphate depletion to compensate for methanol evaporation. Flasks are incubated at 35 °C for a further 24 hours. Samples are taken at 24 hours and extracted with 0.1 % HC1 in MTBE (Methyl tert butyl ether). The extracts are analyzed for fatty acid esters by gas chromatography.
[0169] In vitro assay for specific activity against different chain length primers
[0170] In vitro assays are used to determine a specific activity of a 3-ketoacyl-CoA synthase for different chain length primers. In some aspects, the assay can be used to determine the specific activity of a particular 3-ketoacyl-CoA synthase for a particular chain length primer. For example, the assay can be carried out to test for specific activity for a C6-C0A primer by reacting a C6-C0A primer and malonyl-CoA with a particular 3-ketoacyl-CoA synthase.
Magnesium chloride is added to the reaction mixture and complexes between the resulting 3- ketoC8-CoA and magnesium ion are formed. The reaction progress can be monitored by detecting these 3-ketoC8-CoA and magnesium ion complexes at a UV absorbance of 303 nm. These in vitro assays can be performed with cell lysates of cells expressing a 3-ketoacyl-CoA synthase and/or an isolated 3-ketoacyl-CoA synthase.
[0171] The presence of 3-ketoC8-CoA products indicates that the assayed 3-ketoacyl-
CoA synthase has specific activity for C6-C0A primers and that the assayed 3-ketoacyl-CoA synthase has 3-ketoC8-CoA synthase activity. The specific activity for other chain length primers can be assayed in similar fashion by substituting a different chain length CoA primer. For example, by substituting a C2-CoA primer, the specific activity for C2-CoA primers can be assayed and the 3-ketoC4-CoA synthase activity can be determined. By substituting a C4-CoA primer, the specific activity for C4-CoA primers can be assayed and the 3-ketoC6-CoA synthase activity can be determined. By substituting a C6-C0A primer, the specific activity for C6-C0A primers can be assayed and the 3-ketoC8-CoA synthase activity can be determined. By substituting a C8-C0A primer, the specific activity for C8-C0A primers can be assayed and the 3-ketoClO-CoA synthase activity can be determined.
[0172] In some aspects, the in vitro assay is carried out on isolated 3-ketoacyl-CoA synthase. The 3-ketoacyl-CoA synthase can be isolated by methods known in the art. For example, the 3-ketoacyl-CoA synthase can be overexpressed as a fusion polypeptide with an affinity tag such as a poly-histidine tag and then purified by immobilized metal affinity chromatography. The in vitro assay is then carried out on this isolated 3-ketoacyl-CoA synthase by preparing a reaction mixture of CoA primer (0.3 mM), malonyl-CoA (0.3 mM) and 5 mM MgCF in a buffer of 50 mM Tris, pH 8.0. The reaction is initiated by adding the isolated 3- ketoacyl-CoA synthase at various concentrations. As described above, the progress of the reaction can be monitored by detecting 3-ketoCX-CoA and magnesium ion complexes at a UV absorbance of 303 nm, where X represents the chain length of the resulting product. The CoA primer is selected from C2-CoA, C4-CoA, C6-C0A, or C8-C0A. The reactions can be carried out in 96-well microtiter plates.
[0173] The following table shows the in vitro specific activity of purified enzymes.
Threshold for activity was 0.05 U/mg.
Figure imgf000047_0001
Figure imgf000048_0001
[0174] These in vitro assay results indicate there exists a wide diversity of specific activities and acyl-CoA substrate preferences in these enzymes annotated as 3-ketoacyl-CoA synthases.
Example 2.
[0175] An amino acid sequence alignment of the 3-ketoacyl-CoA synthase enzymes,
NphT7 (SEQ ID NO: 16), Sko3 (SEQ ID NO: 2), Scin (SEQ ID NO: 3), Sten (SEQ ID NO: 4), Nbra (SEQ ID NO: 1), FABH_ECOLI (SEQ ID NO: 21), was carried out with the Clone Manager software package (Scientific & Educational Software, Denver, Colorado). The sequence alignment was performed as a global protein sequence alignment with the Scoring Matrix BLOSUM62 and using NphT7 (SEQ ID NO: 16) as the reference sequence (FIG. 1). In the alignment in Figure 1, the SV mutations are marked with the boxes; using the NphT7 numbering, the SV mutations are I147S and F217V.
[0176] An amino acid sequence alignment of the 3-ketoacyl-CoA synthase enzymes,
Gsul (SEQ ID NO: 13), SHK1 (SEQ ID NO: 9), Svio (SEQ ID NO: 10), Nbra (SEQ ID NO: 1), Sten (SEQ ID NO: 4), SK03-2 (SEQ ID NO: 5), Scin (SEQ ID NO: 3), Sko3 (SEQ ID NO: 2), NphT7 (SEQ ID NO: 16), Ache (SEQ ID NO: 6), Smgl (SEQ ID NO: 7), Safr (SEQ ID NO: 8), FABH_ECOLI (SEQ ID NO: 21), Ctet (SEQ ID NO: 15), Ubac (SEQ ID NO: 12), Abay-2 (SEQ ID NO: 14) was carried out with the Clone Manager software package (Scientific & Educational Software, Denver, Colorado). The sequence alignment was performed as a global protein sequence alignment with the Scoring Matrix BLOSUM62. In the alignment in Figure 2, the SV mutations are marked with the boxes; using the NphT7 numbering, the SV mutations are I147S and F217V.
[0177] FIGS. 1 and 2 show graphical views of the sequence alignment. Each line corresponds to a 3-ketoacyl-CoA synthase enzyme as indicated by the designation. The amino acid position of the first residue of each row is indicated. Example 3
[0178] The following table and FIG. 3 show the FAME production by NphT7 and its mutants SV and LSVA in a C10 FAME production pathway (the production strain has an Asch gene that produces mainly C10 FAME). This data was generated using recombinant strains 3 and 4 (for + or - chromosomal integration of nphT7). These recombinant strains contain plasmid pair type 1B and 2B, where plasmids 1B have nphT7 variants as indicated in the table and plasmid 2B is the same for all strains (this plasmid 2B contains Asch wild type with a His tag).
Figure imgf000049_0001
[0179] This data shows that NphT7 wild type in combination with an additional 3- ketoacyl-CoA synthase such as an Asch variant is not sufficient to produce FAME of C6 or longer chain length. However, a mutant of NphT7, such as SV or LSVA, in combination with an Asch variant, produces FAME.
Example 4.
[0180] The table below and FIG. 4 show how the wild type enzymes are different than
NphT7. This data shows the wild type homologs in a clean background using recombinant strain 2 which does not have a chromosomal integration of nphT7. The recombinant strain contains plasmid pairs 1A and 2A. Plasmids 2A contain the Asch variants that will determine the main FAME species as indicated in the table. For C10 FAME production Asch(V296A) (SEQ ID NO: 20) was used; for C8 FAME production
Asch(Tl84I,F236L,V268A,V296A,V3l7A,S328G) (SEQ ID NO: 19) was used; for mixed C8/C10 FAME products Asch(Tl84I) (SEQ ID NO: 20) was used. Plasmids 1A vary based on the NphT7 homologs evaluated, as indicated in the table. As indicated in example 3, similar recombinant strains containing the wild type NphT7 do not produce FAME.
Figure imgf000050_0001
[0181] This data shows that unlike wild type NphT7, three of these wild type enzymes are able to support FAME production (Sten, Sko3, Scin) in combination with an additional 3- ketoacyl-CoA synthase such as an Asch variant. Unexpectedly, wildtype Nbra does not support production of significant amounts of FAME even though the in vitro assay results shown in Example 1 would indicate that, unlike wild- type NphT7, wildtype Nbra should be capable of supporting FAME production. Example 5.
[0182] The following examples show FAME production by strains expressing the wild- type genes or their corresponding SV mutant variants. Variants were evaluated in recombinant strains 1 (with a chromosomal nphT7 integration) and 2 (no nphT7 integration in the chromosome). The homologs or the control NphT7(LSVA) were expressed in a plasmid type 1A and were paired with plasmids type 2A containing variant Asch genes that preferentially produce C10 FAME, C8 FAME, or mixed C10 and C8 FAME as indicated in the table, all these without a His tag. For C10 FAME production Asch(V296A) (SEQ ID NO: 18) was used; for C8 FAME production Asch(Tl84I,F236L,V268A,V296A,V3l7A,S328G) (SEQ ID NO: 19) was used; for mixed C8/C10 FAME products Asch(Tl84I) (SEQ ID NO: 20) was used.
a. Nbra/Nbra(SV) (FIG. 5)
Figure imgf000051_0001
Figure imgf000052_0001
[0183] The wild type Nbra showed no ability to support FAME production in either background with or without NphT7; however the Nbra (SV) showed the ability to produce FAME in both backgrounds. When combined with different Asch variants that preferentially produce C10 FAME, C8 FAME, or mixed C10 and C8 FAME, Nbra(SV) performed similarly to NphT7(LSVA). These results show that the SV mutations in Nbra enable the production of C6- C10 FAME, similar to the NphT7(LSVA).
b. Sten/Sten(SV) (FIG. 6)
Figure imgf000052_0002
Figure imgf000053_0001
[0184] Unlike Nbra, the wild type Sten showed the ability to produce FAME in both backgrounds with and without NphT7 when combined with different Asch variants that preferentially produce C10 FAME, C8 FAME, or mixed C10 and C8 FAME. For all Asch contexts the amount of FAME was reduced compared to the NphT7(LSVA), but the result pointed out a clear distinction from NphT7 in that wild type Sten can support FAME production. FAME production by the Sten(SV) variant was lower than by the wild-type Sten in both backgrounds, indicating that the S and/or V mutations are context- specific.
c. Sko3/Sko3(SV) (FIG. 7)
Figure imgf000053_0002
Figure imgf000054_0001
[0185] Sko3 wild- type demonstrated FAME production in both backgrounds with and without NphT7 when combined with different Asch variants that preferentially produce C10 FAME, C8 FAME, or mixed C10 and C8 FAME. FAME titers were comparable to those found with NphT7(LSVA) in a background without NphT7. Sko3(SV) also showed FAME production in both backgrounds with and without NphT7.
d. Scin/Scin(SV) (FIG. 8)
Figure imgf000055_0001
Figure imgf000056_0001
[0186] The wild- type Scin showed ability to support FAME production in both backgrounds with and without NphT7 when combined with different Asch variants that preferentially produce C10 FAME, C8 FAME, or mixed C10 and C8 FAME. Scin (SV) also showed the ability to support FAME production in both backgrounds with and without NphT7. In the background without NphT7, the SV mutation caused a reduction in total FAME produced, similar to Sko3(SV), again indicating that the benefits conferred by one or both of these changes is context- specific.
SEQUENCES
Nbra (SEQ ID NO: 1) :
Nocardia brasiliensis ATCC 700358
MNNIAVLGTGSYLPDRIVSNSEVGSGADVDSEWI IRKTAIRERRWALPDQATSDLATHAA SAALDAAGISADEVSAIWATSTPDHPQPPTAAFVQHNLGARGASAFDVNAVCSGFVFAL SAVEAAIARAGGGYGLWGADVYSRILNPADRRTWLFGDGAGAWLGPSASGGLRRFGL HTFGDLTSLIRVPAGGSRQPYDPAAHELGAQYFTMDGRGVRAFVNGSLPVLVKQFLHDSG VAPDDITHLIPHQANGVMLAELAEELGLVNATMHTTVRYYGNTGAASIPITLDNAARTGG IRPGDTVLLVGFGGGMAVGLTLVEW
Sko3 (SEQ ID NO: 2) :
Streptomyces sp . KO-3988
MTDLRILGTGAYVPERIASNDEVAAAAGVDDAWITAKTGIRERRWAADDQATSDLATAAG RAALRSAGITAEDLSVIWATSTPDRPQPPTAAYVQQRLGASDAAAFDVNAVCSGMVFAL AAAEGVLTRTGGHALVIGADLYSRILNRADRRTVILFGDGAGAVILGSSVDHGPRIRHLA LHSFGELSGLIEVPAGGSRMPVDQTVLDAGLQYFAMDGRGVRNFVCDHLPQLVKGFLHKC GWPDDIDHFVPHQANGTMLDSLFADLELPRATMHRTLTHYANTGAASIPITLDAAARAG AFNPGDLILMAGFGGGMSAGFALVEW
Scin (SEQ ID NO: 3) :
Streptomyces cinnamonensis
MTGIAADIAVLGTGAYVPDRIVSNDEVGAPAGVDGEWIVRKTAIRERRWAAPGQATSDLA
VAAGRAALESAGITVGQLSLIWATSTPDRPQPPTAVYVQQQLGALGAAAFDVNAVCSGS VFALSWEGMLARQGGHALVIGADVYSRILNPADRKTWLFGDGAGAMVLGPAGGGRAAG
ARVRHVTLHTVGELAGLIQVPAGGSRQPADQAVLDAGLQYFAMDGREVRRFWEQLPQLT
KQFLHEAGWPDDVAHFVPHQANGVMLDTVIEDLALSRATTHLTLERYGNTGAASIPITL
DAAAREGAFRPGDLILLAGFGGGMAAGLALLEW
Sten (SEQ ID NO: 4) :
Streptomyces tendae
MSDVRILGTGAYVPDRIVSNDEVGAPAGVDDAWITAKTAIRERRWAATHQATSDLATQAA RNALDCAGITAAELTVIIVATSTPDRPQPPTAAYVQHNLAATHAAAFDINAVCSGEIFAL SAAEGTLARKGGHALI IGADLYSRILNPTDRKTTVLFGDGAGAMILGPATTGPRIRHLTL HTYGEHSDLIQVPAGGSRLPTTTQTLQQDQHYFTMDGRGVRRFVADHLPRLTKEFLHEAG TVPDDIRHFIPHQANGVMLDDLFTQLALPHATMHQTLTTYANTGAASIPLTLDTAHRNNA LHPGDLILLAGFGGGMAAGLALIEW
SK03-2 (SEQ ID NO: 5) :
Streptomyces sp . KO-3988
MTEVGILGTGAQVPDRWSNDEAGAAAGVDDAWWRKTAIRERRWAAPEQATSDLAAAAG SAALRSAGITPDQLSVIWATSTPDRPQPPTAAYVQQLLGAVGAAAFDVNAVCSGEVFAL AAAEGILTRRGGHALVIGADVYSRILNPADRRTWLFGDGAGAMVLGPGRGARVRNLALH TFGELAGLIEVPAGGSRRPADQAALDAGLQYFTMDGRAVRSFVEDRLPLLTKQFLHDSGV GPDDIRHFVPHQANGVMLDALFAELALNRAVMHRTLTHYGNTGAAS IP ITLAEAARSGAF RPGDLILLAGFGGGMAVGFALVEW
Ache (SEQ ID NO: 6) :
Austwickia chelonae NBRC 105200
MGVAVWGTGAYVPERIVTNDEVGAPAGVDDAWIRSKTMIRERRWVAENEATSDMATEAGR
RALEAAGITADELSYIWATSTPDRPQPPTAAYVQHNLQAHHAAGFDMNAVCSGFVFAGS
TVARMIAATGGYGLVIGADVYSRILNPRDRRTVILFGDGAGAWLGPSDGSNHGVLATSL
HTFGALNDKIMVPAGGSRLPTEDTHYETGLAYFTMEGRAVKEFVLQELPPLVDSFFAETG
ISPQEVDHFIPHQANGMMLDELVPTLGLSGARTHRTLEKYGNTGAASVGITLDQAAREGE
LHSGDRIFLAGFGGGMAAGLALIVW
SMgl (SEQ ID NO: 7) :
Streptomyces sp . Mgl
MSARGIVGTGSYLPEREVSNKEVAEWCGITAEWIEERTEILTRRYAAPDEATSDLAAHAL
TSALRSAGITADRLDYLIVSTSTGDSPQPPTANLVQHAVGATGAACFDVNWCSGFVFGL
EMAHRLLAARPDGYAAVIGADVYSRILDHADHRTTVLLGDGAGAAIVGPVRDGHGYLGAE
LASRGDAHRLIRVEAGGSRIPTTPQTLADGGHYFRMEGRGVRDFVMDHVPPVLDRLLRGA
GLTAADVDHFVPHQPNGVLLRGLVDKIGMSHAHTHRTLEKYGNVGSASVPVALDDGCRSG
LIRDGDLWMAGFGGGMSVGATLMRWQERS
Safr (SEQ ID NO: 8) :
Spirochaeta africana DSM 8902 MHSKHTELQEAVITGAGAYVPPRRMSNDELAGFLDTSDEWIFSKTGIHNRHIAAEDQATS DLGAAAARQALADAGLTPLDIDLILVATSSPDYNGLPSTACWQDLLGATNAAAMDVAAV CSGFVYALETARAFARSGSARRILVIGAEIYSRWDWSDRSTCVLFGDGAGAVWESLPE SAGNTAPRFPRARVLDSVLKSRGSDAQALVRPAGGTRRPVNPGDNFGADQFLQMDGRKVY NFAVGAIGEVIQELLTRNGLSLAELDWVIPHQANARILEAAARRLKYPVEQMYSNIAEYA NTSAASIPIAMNEMAAAGMLKPGQNI ITVGFGAGLTYGGNLLRTW
SHK1 (SEQ ID NO: 9) :
Streptomyces sp . HKI0576
MTSAIGILGTGAYLPAREVGNDELAALIPDTTAEWILRKTGIRSRRYAAPDEAASDLAAG AARAALADAGLTADRVDHI IVCTSTGDQPLPPTASWQQLLGAGSAACFDLNAVCAGFVY GLEAARGLIAVRPGAHVLLIGADVYSRFLDFTDRRSAVLLGDGAGAVWGAVEDGNGILG IDLSTRGDAQDLLRIPAGGSRSPASAGTVAEGGHHLRMQGRAVSEFVLENIPPGVDKLLA GAGVRAEEIDCFVPHQANAVLIRELAARCGLADTPLVQWETYGNTGAASVPIGLDHAVR EGTVEPGGLVLLSAFGAGMSVGNGLLRWGGGPA
Svio (SEQ ID NO: 10) :
Streptomyces violaceusniger Tu 4113
MTDHPVRTPGGRPVGILGTGSYLPAEAVSNELVAERAGVTAEWIAAKTGIRRRRYAADHE ATSDLAVEAARAALADAGIRAGQLGWIWATSTPDHPQPATACLVQHRIGATGAAAFDIN AVCSGFVFALVTAAGLLAGSGAPAPYALVIGADVYSRI IDRTDRRTAVLFGDGAGAWLG PVRHGYGLTGSLLTSDGALHELIQVPAGGSRAPASEKTLADGGHFFRMRGRAVGEYVLAE LPRAIRALLAAHRTDAAGVDHFIPHQANGVLLAKALPDLGLPRARTHLTVAEHGNTSAAS IPLALDDARRQGVFTDGELLLLAGFGGGMSLGAALVRWQDGHGGS
AATC (SEQ ID NO: 11) :
Amycolatopsis sp . ATCC 39116
MMKTWLGTGSYLPPTVLTSTELGDRLGTGGQWIVDKTHIRERRVADGSEATSDLATAAA RRALQSSGTSPDELDLVIVATSTPDQP IPATAALVQANI SADRAAAFDVDAVCSGFVYAL WAHGMLLADDSCRKALVIGADIYSRILDYDDRRTAVLFGDGAGAWLGKTNADRGIHTT LLGCDGKRADLVQVPAGGSRRPASPQTLADGLHYFKMIGRPVRELAGQVMPEWEQLLKR AGLTLDDVQHLVPHQANGVMLAELDSALGLAPGVMCRTVEHFGNTGAASVPITLDHAVRG GQIGEDDRWLVTFGGGMSWGGALLTWAYTHGEETSHD
Ubac (SEQ ID NO: 12) :
hypothetical protein ACD_77C00103G0010 [uncultured bacterium]
MKQLIRNVKI IGTGSYVPEKIYTNEYLETFISTTSSWIFENIGIKERHIAAPNQATSDLA TIAGQRAIDDAGIKNEDIDLI IVATATPDRKAPSTAAFVQHKLNAVNAAAFDMNAVCSGF LFGMSVASQYIASGVYNNILVIGADTFSRITDWTKRDAVFFGDGAGAWITSANITEGFL AYRIYTDTQNEMLGFTIPGGGSEIPLTENNLNDQYFQMNGKSVFASATQALPKAINQVLA DTGLTIGDIDIMIPHQPSIRILQKTAELIGLPFEKVMTNMDRYANTSGGTIPILLDEVKK SGKLKRGNIVLFAAVGSGWTYGASI IKWA
Gsul (SEQ ID NO: 13) : Geobacter sulfurreducens PCA
MMRARIVGTGSAVPSKVLTNFDLEKMVDTSDEWVTTRTGIKERRIAVDGEYTSTFATLAA ERALEMAGVKASDLDLLIVATITPDFPFPATACWQSNLKATKAAAYDISAACSGFIYAL AQASNAIRSGSARKALVIGAEVLSRIIDWTDRNTCLLFGDGAGAWLEACDDGHGVLSTH LHSDGSYWELLYQPGCGNRNPAVQKTLDDRRIYLMMQGNEVFKLAVRAMEDAALEALDAN GLTPADISLFIPHQANRRI IDAIGKRLGLPGEKVYVNLDRFGNTSAASIPLALDEANRSG RIKPNDVWFDAFGGGLTWGSALVRW
Abay-2 (SEQ ID NO: 14) :
Acinetobacter baylyi DSM 14961 = CIP 107474
MGIRITGTGLYHPAESISNEELVESLNAYVEYYNAENAEKIAAGELPERLGSSAEFIEKA SGIKSRYVIEKSGVLDPQRLRPRLQERTDDELSIQAEWGVLAAKQAMENAKVTAEDIDW ILACSNMQRPYPAVAVEIQSALGIQGYAYDMNVACSAATFGLKQAYDAIQSGARRVLLVN VEITSGHTDYRSRDCHFIFGDVATASI IEQTDTKTGFEILNIHLFTQFSNNIRNNFGFLN RSEDATRDDKQFRQDGRKVFKDVCPLVAKI ITAQLEKNQVNPQDIKRFWLHQANLNMNQL IAKLVLGKEAEAERTPI ILDEFANTSSAGVI IALHRTGEEVSEGEFGVICSFGAGYSVGS LLVQKRVA
Ctet (SEQ ID NO: 15) :
Clostridium tetani E88
MKNKEVRILSTGKYLPPISISNHDLSKI IDTNDNWIKTRTGIEKRRITKGENTSDLGTKA ALDALRKGGISPEELDLI IVATITPDYFTPSTACI IQRNIKAYNAFAFDISAACSGFTYG ISIASQFIRNGVAKKVLVIGVETLSKLVDWKDRNTCILFGDGSGAAILTESNEKGIMNVY LGSDGRGADLLKCKSSSLTVNSDELKELLNSKEEDLENKFIEMDGKEIFKFAVKVMIKGI EKVLKDSNLELKDINYI IPHQANLRI IEHVAKKLGIDENKFYININHYGNTSAASIPIAL AEVDEKGLLKKGDNVILVGFGAGLTWAASLIKWI
NphT7 (SEQ ID NO: 16) :
Streptomyces sp . CL190
MTDVRFRI IGTGAYVPERIVSNDEVGAPAGVDDDWITRKTGIRQRRWAADDQATSDLATA AGRAALKAAGITPEQLTVIAVATSTPDRPQPPTAAYVQHHLGATGTAAFDVNAVCSGTVF ALSSVAGTLVYRGGYALVIGADLYSRILNPADRKTWLFGDGAGAMVLGPTSTGTGPIVR RVALHTFGGLTDLIRVPAGGSRQPLDTDGLDAGLQYFAMDGREVRRFVTEHLPQLIKGFL HEAGVDAADISHFVPHQANGVMLDEVFGELHLPRATMHRTVETYGNTGAASIPITMDAAV RAGSFRPGELVLLAGFGGGMAASFALIEW nphT7 (SV) (SEQ ID NO: 17)
Artificial sequence
MTDVRFRI IGTGAYVPERIVSNDEVGAPAGVDDDWITRKTGIRQRRWAADDQATSDLATA AGRAALKAAGITPEQLTVIAVATSTPDRPQPPTAAYVQHHLGATGTAAFDVNAVCSGTVF ALSSVAGTLVYRGGYALVIGADLYSRSLNPADRKTWLFGDGAGAMVLGPTSTGTGPIVR RVALHTFGGLTDLIRVPAGGSRQPLDTDGLDAGLQYVAMDGREVRRFVTEHLPQLIKGFL HEAGVDAADISHFVPHQANGVMLDEVFGELHLPRATMHRTVETYGNTGAASIPITMDAAV RAGSFRPGELVLLAGFGGGMAASFALIEW Asch (V296A) (SEQ ID NO: 18)
Artificial sequence
MGIRITGTGLFHPTDFITNEELVESLNAYVEQYNLENADKIAAGEIEELRGSSAEFIEKA SGIKRRYVAEKTGILDPKRLRPLLHERSNDELSIQAEWGVAAAKQAMENAGVTAEDIDW ILSCSNIQRAYPALAIEIQTALGIQGYAYDMNVACSAATFGIKQAADAIKSGARRVLMVN VEITSGHTDFRSRDCHFIFGDVATASI IEETETKTGFEIEDIELFTQFSNNIRNNFGYLN LSEVDADIDNNRFRQDGRKVFKEVCPLVAKMITKQLEKNQIEPTNVKRFWLHQANANMNE LILKLIVGKEHAKSELVPLILDEFANTSSAGVI IALHRTANEVNDGEYGVLCSFGAGYSV GSILVKKHVA
Asch (T184I, F236L, V268A, V296A, V317A, S328G) (SEQ ID NO: 19) Artificial sequence
MGIRITGTGLFHPTDFITNEELVESLNAYVEQYNLENADKIAAGEIEELRGSSAEFIEKA SGIKRRYVAEKTGILDPKRLRPLLHERSNDELSIQAEWGVAAAKQAMENAGVTAEDIDW ILSCSNIQRAYPALAIEIQTALGIQGYAYDMNVACSAATFGIKQAADAIKSGARRVLMVN VEIISGHTDFRSRDCHFIFGDVATASI IEETETKTGFEIEDIELFTQFSNNIRNNLGYLN LSEVDADIDNNRFRQDGRKVFKEVCPLAAKMITKQLEKNQIEPTNVKRFWLHQANANMNE LILKLIVGKEHAKSELAPLILDEFANTGSAGVI IALHRTANEVNDGEYGVLCSFGAGYSV GSILVKKHVA
Asch (T184I) (SEQ ID NO: 20)
Artificial sequence
MGIRITGTGLFHPTDFITNEELVESLNAYVEQYNLENADKIAAGEIEELRGSSAEFIEKA SGIKRRYVAEKTGILDPKRLRPLLHERSNDELSIQAEWGVAAAKQAMENAGVTAEDIDW ILSCSNIQRAYPALAIEIQTALGIQGYAYDMNVACSAATFGIKQAADAIKSGARRVLMVN VEIISGHTDFRSRDCHFIFGDVATASI IEETETKTGFEIEDIELFTQFSNNIRNNFGYLN LSEVDADIDNNRFRQDGRKVFKEVCPLVAKMITKQLEKNQIEPTNVKRFWLHQANVNMNE LILKLIVGKEHAKSELVPLILDEFANTSSAGVI IALHRTANEVNDGEYGVLCSFGAGYSV GSILVKKHVA
FabH E. coli (SEQ ID NO: 21)
E. coli
MYTKI IGTGSYLPEQVRTNADLEKMVDTSDEWIVTRTGIRERHIAAPNETVSTMGFEAAT RAIEMAGIEKDQIGLIWATTSATHAFPSAACQIQSMLGIKGCPAFDVAAACAGFTYALS VADQYVKSGAVKYALWGSDVLARTCDPTDRGTI I IFGDGAGAAVLAASEEPGI ISTHLH ADGSYGELLTLPNADRVNPENSIHLTMAGNEVFKVAVTELAHIVDETLAANNLDRSQLDW LVPHQANLRI ISATAKKLGMSMDNVWTLDRHGNTSAASVPCALDEAVRDGRIKPGQLVL LEAFGGGFTWGSALVRF
Asch (T184I, V296A, V268A) (SEQ ID NO: 22)
Artificial sequence
MGIRITGTGLFHPTDFITNEELVESLNAYVEQYNLENADKIAAGEIEELRGSSAEFIEKA SGIKRRYVAEKTGILDPKRLRPLLHERSNDELSIQAEWGVAAAKQAMENAGVTAEDIDW ILSCSNIQRAYPALAIEIQTALGIQGYAYDMNVACSAATFGIKQAADAIKSGARRVLMVN VEIISGHTDFRSRDCHFIFGDVATASI IEETETKTGFEIEDIELFTQFSNNIRNNFGYLN LSEVDADIDNNRFRQDGRKVFKEVCPLAAKMITKQLEKNQIEPTNVKRFWLHQANANMNE LILKLIVGKEHAKSELVPLILDEFANTSSAGVI IALHRTANEVNDGEYGVLCSFGAGYSV GSILVKKHVA
Sko3 (SV) (SEQ ID NO: 23)
Artificial sequence
MTDLRILGTGAYVPERIASNDEVAAAAGVDDAWITAKTGIRERRWAADDQATSDLATAAG RAALRSAGITAEDLSVIWATSTPDRPQPPTAAYVQQRLGASDAAAFDVNAVCSGMVFAL AAAEGVLTRTGGHALVIGADLYSRSLNRADRRTVILFGDGAGAVILGSSVDHGPRIRHLA LHSFGELSGLIEVPAGGSRMPVDQTVLDAGLQYVAMDGRGVRNFVCDHLPQLVKGFLHKC GWPDDIDHFVPHQANGTMLDSLFADLELPRATMHRTLTHYANTGAASIPITLDAAARAG AFNPGDLILMAGFGGGMSAGFALVEW
Sten(SV) (SEQ ID NO: 24)
Artificial sequence
MSDVRILGTGAYVPDRIVSNDEVGAPAGVDDAWITAKTAIRERRWAATHQATSDLATQAA RNALDCAGITAAELTVIIVATSTPDRPQPPTAAYVQHNLAATHAAAFDINAVCSGEIFAL SAAEGTLARKGGHALI IGADLYSRSLNPTDRKTTVLFGDGAGAMILGPATTGPRIRHLTL HTYGEHSDLIQVPAGGSRLPTTTQTLQQDQHYVTMDGRGVRRFVADHLPRLTKEFLHEAG TVPDDIRHFIPHQANGVMLDDLFTQLALPHATMHQTLTTYANTGAASIPLTLDTAHRNNA LHPGDLILLAGFGGGMAAGLALIEW
Nbra(SV) (SEQ ID NO: 25)
Artificial sequence
MNNIAVLGTGSYLPDRIVSNSEVGSGADVDSEWI IRKTAIRERRWALPDQATSDLATHAA SAALDAAGISADEVSAIWATSTPDHPQPPTAAFVQHNLGARGASAFDVNAVCSGFVFAL SAVEAAIARAGGGYGLWGADVYSRSLNPADRRTWLFGDGAGAWLGPSASGGLRRFGL HTFGDLTSLIRVPAGGSRQPYDPAAHELGAQYVTMDGRGVRAFVNGSLPVLVKQFLHDSG VAPDDITHLIPHQANGVMLAELAEELGLVNATMHTTVRYYGNTGAASIPITLDNAARTGG IRPGDTVLLVGFGGGMAVGLTLVEW
Asch (SEQ ID NO: 26)
Acinetobacter schindleri CIP 107287
MGIRITGTGLFHPTDFITNEELVESLNAYVEQYNLENADKIAAGEIEELRGSSAEFIEKA SGIKRRYVAEKTGILDPKRLRPLLHERSNDELSIQAEWGVAAAKQAMENAGVTAEDIDW ILSCSNIQRAYPALAIEIQTALGIQGYAYDMNVACSAATFGIKQAADAIKSGARRVLMVN VEITSGHTDFRSRDCHFIFGDVATASI IEETETKTGFEIEDIELFTQFSNNIRNNFGYLN LSEVDADIDNNRFRQDGRKVFKEVCPLVAKMITKQLEKNQIEPTNVKRFWLHQANVNMNE LILKLIVGKEHAKSELVPLILDEFANTSSAGVI IALHRTANEVNDGEYGVLCSFGAGYSV GSILVKKHVA Scin (SV) (SEQ ID NO: 27)
Artificial sequence
MTGIAADIAVLGTGAYVPDRIVSNDEVGAPAGVDGEWIVRKTAIRERRWAAPGQATSDLA
VAAGRAALESAGITVGQLSLIWATSTPDRPQPPTAVYVQQQLGALGAAAFDVNAVCSGS
VFALSWEGMLARQGGHALVIGADVYSRSLNPADRKTWLFGDGAGAMVLGPAGGGRAAG
ARVRHVTLHTVGELAGLIQVPAGGSRQPADQAVLDAGLQYVAMDGREVRRFWEQLPQLT
KQFLHEAGWPDDVAHFVPHQANGVMLDTVIEDLALSRATTHLTLERYGNTGAASIPITL
DAAAREGAFRPGDLILLAGFGGGMAAGLALLEW
Fused E. coli accd and acca genes:
accDA fusion (SEQ ID NO: 28)
Artificial sequence
MSWIERIKSNITPTRKASIPEGVWTKCDSCGQVLYRAELERNLEVCPKCDHHMRMTARNR LHSLLDEGSLVELGSELEPKDVLKFRDSKKYKDRLASAQKETGEKDALWMKGTLYGMPV VAAAFEFAFMGGSMGSWGARFVRAVEQALEDNCPLICFSASGGARMQEALMSLMQMAKT SAALAKMQERGLPYISVLTDPTMGGVSASFAMLGDLNIAEPKALIGFAGPRVIEQTVREK LPPGFQRSEFLIEKGAIDMIVRRPEMRLKLASILAKLMNLPAPNPEAPREGVWPPVPDQ EPEALSGGGGSGGGGSGGGGSGGGGSAAASLNFLDFEQPIAELEAKIDSLTAVSRQDEKL DINIDEEVHRLREKSVELTRKIFADLGAWQIAQLARHPQRPYTLDYVRLAFDEFDELAGD RAYADDKAIVGGIARLDGRPVMI IGHQKGRETKEKIRRNFGMPAPEGYRKALRLMQMAER FKMPIITFIDTPGAYPGVGAEERGQSEAIARNLREMSRLGVPWCTVIGEGGSGGALAIG VGDKVNMLQYSTYSVISPEGCASILWKSADKAPLAAEAMGIIAPRLKELKLIDSI IPEPL GGAHRNPEAMAASLKAQLLADLADLDVLSTEDLKNRRYQRLMSYGYA accB protein (SEQ ID NO: 29)
E. coli
MDIRKIKKLIELVEESGISELEISEGEESVRISRAAPAASFPVMQQAYAAPMMQQPAQSN AAAPATVPSMEAPAAAEISGHIVRSPMVGTFYRTPSPDAKAFIEVGQKVNVGDTLCIVEA MKMMNQIEADKSGTVKAILVESGQPVEFDEPLI/VIE accC protein (SEQ ID NO: 30)
E. coli
MLDKIVIANRGEIALRILRACKELGIKTVAVHSSADRDLKHVLLADETVCIGPAPSVKSY LNIPAI ISAAEITGAVAIHPGYGFLSENANFAEQVERSGFIFIGPKAETIRLMGDKVSAI AAMKKAGVPCVPGSDGPLGDDMDKNRAIAKRIGYPVI IKASGGGGGRGMRWRGDAELAQ SISMTRAEAKAAFSNDMVYMEKYLENPRHVEIQVLADGQGNAIYLAERDCSMQRRHQKW EEAPAPGITPELRRYIGERCAKACVDIGYRGAGTFEFLFENGEFYFIEMNTRIQVEHPVT EMITGVDLIKEQLRIAAGQPLSIKQEEVHVRGHAVECRINAEDPNTFLPSPGKITRFHAP GGFGVRWESHIYAGYTVPPYYDSMIGKLICYGENRDVAIARMKNALQELI IDGIKTNVDL QIRIMNDENFQHGGTNIHYLEKKLGLQEK tpiA Promoter (SEQ ID NO: 31)
E. coli GGTTTGAATAAATGACAAAAAGCAAAGCCTTTGTGCCGATGAATCTCTATACTGTTTCACA rpiA promoter (SEQ ID NO: 32)
E. coli
GAGTTAACCACGCGGCTTGCCAACGGGGTCTGAATCGCTTTTTTTGTATATAATGCGTGT nphT7 (LSVA) (SEQ ID NO: 33)
Artificial sequence
MTDVRFRI IGTGAYVPERIVSNDEVGAPAGVDDDWITRKTGIRQRRWAADDQATSDLATA AGRAALKAAGITPEQLTVIAVATSTPDRPQPPTAAYVQHLLGATGTAAFDVNAVCSGTVF ALSSVAGTLVYRGGYALVIGADLYSRSLNPADRKTWLFGDGAGAMVLGPTSTGTGPIVR RVALHTFGGLTDLIRVPAGGSRQPLDTDGLDAGLQYVAMDGREVRRFVTEHLPQLIKGFL HEAGVDAADISHFVPHQANGVMLDEVFGELHLPRATMHRTVETYGNTGAASIPITMDAAV RAGSFRPGELVLLAGFGGGMAAAFALIEW
Ter (SEQ ID NO: 34)
Treponema denticola
MIVKPMVRNNICLNAHPQGCKKGVEDQIEYTKKRITAEVKAGAKAPKNVLVLGCSNGYGL ASRITAAFGYGAATIGVSFEKAGSETKYGTPGWYNNLAFDEAAKREGLYSVTIDGDAFSD EIKAQVIEEAKKKGIKFDLIVYSLASPVRTDPDTGIMHKSVLKPFGKTFTGKTVDPFTGE LKEISAEPANDEEAAATVKVMGGEDWEVGSNS fabl S241F (SEQ ID NO: 35)
Artificial sequence
MGFLSGKRILVTGVASKLSIAYGIAQAMHREGAELAFTYQNDKLKGRVEEFAAQLGSDIV
LQCDVAEDASIDTMFAELGKVWPKFDGFVHSIGFAPGDQLDGDYVNAVTREGFKIAHDIS
SYSFVAMAKACRSMLNPGSALLTLSYLGAERAIPNYNVMGLAKASLEANVRYMANAMGPE
GVRVNAISAGPIRTLAASGIKDFRKMLAHCEAVTPIRRTVTIEDVGNSAAFLCSDLSAGI
FGEWHVDGGFSIAAMNELELK fadB (SEQ ID NO: 36)
E. coli
MVYKGDTLYLDWLEDGIAELVFDAPGSVNKLDTATVASLGEAIGVLEQQSDLKGLLLRSN KAAFIVGADITEFLSLFLVPEEQLSQWLHFANSVFNRLEDLPVPTIAAVNGYALGGGCEC VLATDYRLATPDLRIGLPETKLGIMPGFGGSVRMPRMLGADSALEI IAAGKDVGADQALK IGLVDGWKAEKLVEGAKAVLRQAINGDLDWKAKRQPKLEPLKLSKIEATMSFTIAKGMV AQTAGKHYPAPITAVKTIEAAARFGREEALNLENKSFVPLAHTNEARALVGIFLNDQYVK GKAKKLTKDVETPKQAAVLGAGIMGGGIAYQSAWKGVPWMKDINDKSLTLGMTEAAKLL NKQLERGKIDGLKLAGVISTIHPTLDYAGFDRVDIWEAWENPKVKKAVLAETEQKVRQ DTVLASNTSTIPISELANALERPENFCGMHFFNPVHRMPLVEIIRGEKSSDETIAKWAW ASKMGKTPIWNDCPGFFVNRVLFPYFAGFSQLLRDGADFRKIDKVMEKQFGWPMGPAYL LDWGIDTAHHAQAVMAAGFPQRMQKDYRDAIDALFDANRFGQKNGLGFWRYKEDSKGKP
KKEEDAAVEDLLAEVSQPKRDFSEEEI IARMMIPMVNEWRCLEEGI IATPAEADMALVY GLGFPPFHGGAFRWLDTLGSAKYLDMAQQYQHLGPLYEVPEGLRNKARHNEPYYPPVEPA RPVGDLKTA
Hche WES (SEQ ID NO: 37)
Hahela chejuensis
MTPLSPVDQIFLWLEKRQQPMHVGGLHIFSFPDDADAKYMTELAQQLRAYATPQAPFNRR
LRQRWGRYYWDTDAQFDLEHHFRHEALPKPGRIRELLAHVSAEHSNLMDRERPMWECHLI
EGIRGRRFAVYYKAHHCMLDGVAAMRMCVKSYSFDPTATEMPPIWAISKDVTPARETQAP
AAGDLVHSLSQLVEGAGRQLATVPTLIRELGKNLLKARDDSDAGLIFRAPPSILNQRITG
SRRFAAQSYALERFKAIGKAFQATVNDWLAVCGSALRNYLLSRQALPDQPLIAMAPMSI
RQDDSDSGNQIAMILANLGTHIADPVRRLELTQASARESKERFRQMTPEEAVNYTALTLA
PSGLNLLTGLAPKWQAFNWISNVPGPNKPLYWNGARLEGMYPVSIPVDYAALNITLVSY
RDQLEFGFTACRRTLPSMQRLLDYIEQGIAELEKAAGV phoE promoter (SEQ ID NO: 38)
E . coli
gatcttgata tcaaacgaac gttttagcag gactgtcgtc ggttgccaac catctgcgag caaagcatgg cgttttgttg cgcgggatca gcaagcctag cggcagttgt ttacgctttt attacagatt taataaatta ccacatttta agaatattat taatctgtaa tatatcttta acaatctcag gttaaaaact ttcctgtttt caacg pstsIH promoter (SEQ ID NO: 39)
E. coli
aaatcagact gaagacttta tctctctgtc ataaaactgt catattcctt acatataact gtcacctgtt tgtcctattt tgcttctcgt agccaacaaa caatgcttta tgaatcctcc c
[0187] All publications, patents and patent applications, as well accession numbers and the nucleotide and/or protein sequences that can be found therein, are incorporated herein by reference. While in the foregoing specification this invention has been described in relation to certain embodiments thereof, and many details have been set forth for purposes of illustration, it will be apparent to those skilled in the art that the invention is susceptible to additional embodiments and that certain of the details described herein may be varied considerably without departing from the basic principles of the invention.

Claims

WHAT IS CLAIMED IS:
1. An expression system comprising at least one expression cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a polypeptide with 3- ketoacyl-CoA synthase activity and having at least 80% sequence identity to an amino acid sequence set forth in SEQ ID NO: 1-15, wherein the polypeptide catalyzes condensation of acyl-CoA with malonyl-CoA to form 3-ketoacyl-CoA.
2. The expression system of claim 1, wherein the polypeptide with 3-ketoacyl-CoA
synthase activity has at least 80% sequence identity to an amino acid sequence set forth in SEQ ID NO: 1-4.
3. The expression system of claim 1 or 2, wherein the polypeptide catalyzes one or more condensations selected from the group consisting of:
a) C2-CoA with malonyl-CoA to form 3-keto-C4-CoA;
b) C4-CoA with malonyl-CoA to form 3-keto-C6-CoA;
c) C6-C0A with malonyl-CoA to form 3-keto-C8-CoA; and
d) C8-C0A with malonyl-CoA to form 3-keto-Cl0-CoA.
4. The expression system of claim 2, wherein an amino acid residue of the polypeptide with 3-ketoacyl-CoA synthase activity that aligns with amino acid residue 147 of SEQ ID NO: 16 is serine and an amino acid residue that aligns with amino acid residue 217 of SEQ ID NO: 16 is valine.
5. The expression system of any one of claims 1-4, wherein the promoter is an inducible promoter.
6. The expression system of any one of claims 1-5, wherein the promoter is an inducible promoter sensitive to lowering phosphate concentration.
7. The expression system of any one of claims 1-6, wherein the promoter is a PpstSIH promoter or a PphoE promoter.
8. A recombinant cell comprising the expression system of any one of claims 1-7, wherein the recombinant cell produces a fatty acid and/or fatty acid chain product having a chain length of one or more of C4, C6, C8 and/or C10.
9. The recombinant cell of claim 8, wherein the fatty acid chain product comprises one or more of fatty acid ester, fatty alcohol, fatty acid amide and fatty acid amine.
10. The recombinant cell of claim 9, where in the fatty acid ester comprises one or more of fatty acid methyl ester, fatty acid ethyl ester, fatty acid propyl ester, fatty acid butyl ester, fatty acid pentyl ester and fatty acid hexyl ester.
11. The recombinant cell of claim 8, wherein the recombinant cell produces a fatty acid ester of C4, C6, C8, and/or C10 chain length.
12. The recombinant cell of claim 8, wherein the recombinant cell produces a fatty acid ester of C8 and/or C10 chain length.
13. The recombinant cell of any one of claims 11-12, wherein the alkoxy group of the fatty acid ester is derived from a Cl, C2, C3, or C4 monoalcohol.
14. The recombinant cell of any one of claims 11-12, wherein the alkoxy group is derived from a Cl or C2 monoalcohol.
15. The recombinant cell of claim 14, wherein the monoalcohol is methanol.
16. The recombinant cell of any one of claims 11-15, further comprising one or more of:
a heterologous nucleic acid segment encoding a polypeptide with enoyl-CoA reductase activity;
a heterologous nucleic acid segment encoding a polypeptide with bifunctional 3- hydroxyacyl-CoA dehydrogenase/dehydratase activity; a heterologous nucleic acid segment encoding a polypeptide with ester synthase activity;
a heterologous nucleic acid segment encoding a polypeptide with acetyl-CoA carboxylase activity; and
a heterologous nucleic acid segment encoding a mutant polypeptide with ketoacyl-CoA synthase activity.
17. The recombinant cells of claim 16, wherein the heterologous nucleic acid segment encoding a mutant polypeptide with ketoacyl-CoA synthase activity is an Asch variant selected from the group consisting of Asch (T184I, F236L, V268A, V296A, V317A, S328G); Asch (T184I, V296A, V268A); and Asch (V296A).
18. The recombinant cell of any one of claims 11-17, wherein the recombinant cell is a fungal cell, a bacterial cell, or a plant cell.
19. The recombinant cell of claim 18, wherein the recombinant cell comprises an
Escherichia coli.
20. The recombinant cell of claim 18, wherein the recombinant cell comprises a yeast cell.
21. A cell culture comprising:
a) the recombinant cell of any one of claims 11-20; and
b) one or more fatty acids or fatty acid chain products produced by the recombinant cell.
22. A method of producing fatty acids and/or fatty acid chain products comprising growing the recombinant cell of any one of claims 11-20 in a culture medium,
wherein the recombinant cell is grown under conditions in which the heterologous nucleic acid segment encoding the polypeptide with 3-ketoacy-CoA synthase activity is expressed; and
the recombinant cell produces a fatty acid and/or a fatty acid chain product.
23. One or more fatty acids or fatty acid chain products of claim 21 for use in products.
24. An isolated 3-ketoacyl-CoA synthase having at least 80% sequence identity to an amino acid sequence set forth in SEQ ID NO: 1-16 for use in catalyzing the condensation of acyl-CoA with malonyl-CoA to form 3-ketoacyl-CoA.
25. The 3-ketoacyl-CoA synthase of claim 24, wherein said synthase has activity for one or more of C2-CoA, C4-CoA and C6-C0A intermediates.
26. The 3-ketoacyl-CoA synthase of claim 25, wherein said synthase has activity for C2- CoA and C4-CoA intermediates.
27. The 3-ketoacyl-CoA synthase of claim 24, wherein said synthase has activity for C2- CoA, C4-CoA and C6-C0A intermediates.
28. A recombinant cell of claim 8 for use in producing fatty acid and/or fatty acid chain products having a chain length of one or more of C4, C6, C8 and/or C10.
29. An 3-ketoacyl-CoA synthase having at least 80% sequence identity to an amino acid sequence set forth in SEQ ID NO: 1-4, wherein an amino acid residue of the synthase that aligns with amino acid residue 147 of SEQ ID NO: 16 is serine and an amino acid residue that aligns with amino acid residue 217 of SEQ ID NO: 16 is valine.
30. A recombinant cell comprising the 3-ketoacyl-CoA synthase of claim 29, wherein the recombinant cell produces a fatty acid and/or fatty acid chain product having a chain length of one or more of C4, C6, C8 and/or C10.
31. The recombinant cell of claim 30, wherein the recombinant cell produces a fatty acid ester of C4, C6, C8, and/or C10 chain length.
32. The recombinant cell of claim 30, wherein the recombinant cell produces a fatty acid ester of C8 and/or C10 chain length.
33. The recombinant cell of any one of claims 31-32, wherein the alkoxy group is derived from a Cl, C2, C3, or C4 monoalcohol.
34. The recombinant cell of any one of claims 31-32, wherein the alkoxy group is derived from a Cl or C2 monoalcohol.
35. The recombinant cell of claim 34, wherein the monoalcohol is methanol.
PCT/US2019/035801 2018-06-28 2019-06-06 Methods and cells for producing fatty acids and fatty acid chain products WO2020005495A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201862691274P 2018-06-28 2018-06-28
US62/691,274 2018-06-28
US201862693714P 2018-07-03 2018-07-03
US62/693,714 2018-07-03

Publications (1)

Publication Number Publication Date
WO2020005495A1 true WO2020005495A1 (en) 2020-01-02

Family

ID=68985180

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2019/035801 WO2020005495A1 (en) 2018-06-28 2019-06-06 Methods and cells for producing fatty acids and fatty acid chain products

Country Status (1)

Country Link
WO (1) WO2020005495A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015010103A2 (en) * 2013-07-19 2015-01-22 Opx Biotechnologies, Inc. Microorganisms and methods for the production of fatty acids and fatty acid derived products
US20150275242A1 (en) * 2012-10-15 2015-10-01 Genomatica, Inc. Microorganisms and methods for production of specific length fatty alcohols and related compounds
US20170211102A1 (en) * 2014-05-15 2017-07-27 Calysta, Inc. Methods for biological production of very long carbon chain compounds
WO2017214133A2 (en) * 2016-06-06 2017-12-14 Provivi, Inc. Semi-biosynthetic production of fatty alcohols and fatty aldehydes

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150275242A1 (en) * 2012-10-15 2015-10-01 Genomatica, Inc. Microorganisms and methods for production of specific length fatty alcohols and related compounds
WO2015010103A2 (en) * 2013-07-19 2015-01-22 Opx Biotechnologies, Inc. Microorganisms and methods for the production of fatty acids and fatty acid derived products
US20170211102A1 (en) * 2014-05-15 2017-07-27 Calysta, Inc. Methods for biological production of very long carbon chain compounds
WO2017214133A2 (en) * 2016-06-06 2017-12-14 Provivi, Inc. Semi-biosynthetic production of fatty alcohols and fatty aldehydes

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
KAWASAKI ET AL.: "Presence or opay Diphosphate synthase ene in an Actinomycete Possessing the Mevalonate Pathway", THE JOURNAL OF ANTIBIOTICS, vol. 57, no. 11, 25 November 2004 (2004-11-25), pages 739 - 747, XP055666789 *
SEEGER ET AL.: "The biosynthetic genes for prenylated phenazines are located at two different chromosomal loci of Streptomyces cinnamonensis DSM 1042", MICROBIAL BIOTECHNOLOGY, vol. 4, no. 2, 8 December 2010 (2010-12-08), pages 252 - 262, XP055666792 *

Similar Documents

Publication Publication Date Title
US20200308613A1 (en) Methods and genetically modified cells for production of fatty acid esters
EP2344652B1 (en) Ethanol production by microorganisms
US8906665B2 (en) Methods and compositions for the recombinant biosynthesis of fatty acids and esters
US9163264B2 (en) Ethanol production in microorganisms
EP2787061A1 (en) Fuel composition comprising a mixture of n-alkanes
US20110020867A1 (en) Constructs And Methods For Efficient Transformation Of Micro-Organisms For Production Of Carbon-Based Products Of Interest
CN104312966A (en) CO2 fixing engineered microorganisms producing carbon-based target products
WO2012015949A2 (en) Methods and compositions for improving yields of reduced products of photosynthetic microorganisms
US20220403425A1 (en) Genetically modified cells that produce c6-c10 fatty acid derivatives
WO2020005495A1 (en) Methods and cells for producing fatty acids and fatty acid chain products
US9029124B2 (en) Photoalkanogens with increased productivity
US20120164705A1 (en) Metabolic Switch
WO2019236819A1 (en) Methods and genetically modified cells for producing c8 fatty acids and fatty acid chain products
WO2015200335A1 (en) Engineered photosynthetic microbes and recombinant synthesis of carbon-based products

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19825442

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19825442

Country of ref document: EP

Kind code of ref document: A1