CA2904395A1 - Thioesterases and cells for production of tailored oils - Google Patents


Publication number
CA2904395A1
CA2904395A1 CA2904395A CA2904395A CA2904395A1 CA 2904395 A1 CA2904395 A1 CA 2904395A1 CA 2904395 A CA2904395 A CA 2904395A CA 2904395 A CA2904395 A CA 2904395A CA 2904395 A1 CA2904395 A1 CA 2904395A1
Authority
CA
Canada
Prior art keywords
seq
sequence
oil
cuphea
amino acid
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA2904395A
Other languages
French (fr)
Inventor
George N. RUDENKO
Jason Casolari
Scott Franklin
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Corbion Biotech Inc
Original Assignee
Solazyme Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US13/837,996 (US9290749B2)
Application filed by Solazyme Inc
Publication of CA2904395A1

Classifications

    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00 Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14 Hydrolases (3)
    • C12N9/16 Hydrolases (3) acting on ester bonds (3.1)
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/63 Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79 Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82 Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241 Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242 Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8243 Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
    • C12N15/8247 Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified lipid metabolism, e.g. seed oil composition
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12P FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00 Preparation of oxygen-containing organic compounds
    • C12P7/64 Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
    • C12P7/6409 Fatty acids
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Y ENZYMES
    • C12Y301/00 Hydrolases acting on ester bonds (3.1)
    • C12Y301/02 Thioester hydrolases (3.1.2)
    • C12Y301/02014 Oleoyl-[acyl-carrier-protein] hydrolase (3.1.2.14), i.e. ACP-thioesterase
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02E REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
    • Y02E50/00 Technologies for the production of fuel of non-fossil origin
    • Y02E50/10 Biofuels, e.g. bio-diesel

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Oil, Petroleum & Natural Gas (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Biophysics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Cell Biology (AREA)
  • Nutrition Science (AREA)
  • General Chemical & Material Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Edible Oils And Fats (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Peptides Or Proteins (AREA)
  • Fats And Perfumes (AREA)

Abstract

The invention features plant acyl-ACP thioesterase genes of the FatB class and proteins encoded by these genes. The genes are useful for constructing recombinant host cells having altered fatty acid profiles. Oleaginous microalga host cells with the new genes or previously identified FatB genes are disclosed. The microalgae cells produce triglycerides with useful fatty acid profiles.

Description

THIOESTERASES AND CELLS FOR PRODUCTION OF TAILORED OILS
Cross Reference to Related Applications
[0001] This application is a continuation-in-part of United States Patent Application No. 13/837,996, filed March 15, 2013, and claims the benefit of United States Provisional Patent Application Serial No. 61/791,861, filed March 15, 2013, and United States Provisional Patent Application Serial No. 61/917,217, filed December 17, 2013, each of which is hereby incorporated by reference herein in its entirety.
Background
[0002] Certain organisms, including plants and some microalgae, use a type II fatty acid biosynthetic pathway, characterized by the use of discrete, monofunctional enzymes for fatty acid synthesis. In contrast, mammals and fungi use a single, large, multifunctional protein.
[0003] Type II fatty acid biosynthesis typically involves extension of a growing acyl-ACP (acyl-carrier protein) chain by two carbon units followed by cleavage by an acyl-ACP thioesterase. In plants, two main classes of acyl-ACP thioesterases have been identified: (i) those encoded by genes of the FatA class, which tend to hydrolyze oleoyl-ACP into oleate (an 18:1 fatty acid) and ACP, and (ii) those encoded by genes of the FatB class, which liberate C8-C16 fatty acids from corresponding acyl-ACP molecules.
[0004] Different FatB genes from various plants have specificities for different acyl chain lengths. As a result, different gene products will produce different fatty acid profiles in plant seeds. See US Patent Nos. 5,850,022; 5,723,761; 5,639,790; 5,807,893; 5,455,167; 5,654,495; 5,512,482; 5,298,421; 5,667,997; 5,344,771; and 5,304,481. Recently, FatB genes have been cloned into oleaginous microalgae to produce triglycerides with altered fatty acid profiles. See WO2010/063032, WO2011/150411, WO2012/106560, and WO2013/158938.

Summary
[0005] In various aspects, the invention(s) contemplated herein may include, but need not be limited to, any one or more of the following embodiments:
[0006] Embodiment 1: A nucleic acid construct including a regulatory element and a FatB gene expressing an active acyl-ACP thioesterase operable to produce an altered fatty acid profile in an oil produced by a cell expressing the nucleic acid construct, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 5 of Table 1a, the sequence having at least 94.6% sequence identity with each of SEQ ID NOs: 88, 82, 85, and 103, and optionally wherein the fatty acid of the oil is enriched in C8 and C10 fatty acids.
[0007] Embodiment 2: A nucleic acid construct including a regulatory element and a FatB gene expressing an active acyl-ACP thioesterase operable to produce an altered fatty acid profile in an oil produced by a cell expressing the nucleic acid construct, wherein the FatB gene expresses a protein having an amino acid sequence falling within one of clades 1-12 of Table 1a.
[0008] Embodiment 3: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 1 of Table 1a, the sequence having at least 85.9% sequence identity with each of SEQ ID NOs: 19, 161, 22, and 160, and optionally wherein the fatty acid of the oil is enriched in C14 and C16 fatty acids.
[0009] Embodiment 4: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 2 of Table 1a, the sequence having at least 89.5% sequence identity with each of SEQ ID NOs: 134-136, 132, 133, 137, 124, 122, 123, 125, and optionally wherein the fatty acid of the oil is enriched in C12 and C14 fatty acids.
[0010] Embodiment 5: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 3 of Table 1a, the sequence having at least 92.5% sequence identity with each of SEQ ID NOs: 126 and 127, and optionally wherein the fatty acid of the oil is enriched in C12 and C14 fatty acids.
[0011] Embodiment 6: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 4 of Table 1a, the sequence having at least 83.8% sequence identity with SEQ ID NO: 79, and optionally wherein the fatty acid of the oil is enriched in C12 and C14 fatty acids.
[0012] Embodiment 7: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 6 of Table 1a, the sequence having at least 99.9% sequence identity with each of SEQ ID NOs: 111 and 110, and optionally wherein the fatty acid of the oil is enriched in C10 fatty acids.
[0013] Embodiment 8: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 7 of Table 1a, the sequence having at least 89.5% sequence identity with each of SEQ ID NOs: 73, 106, 185, 172, 171, 173, 174, and optionally wherein the fatty acid of the oil is enriched in C10 and C12 fatty acids.
[0014] Embodiment 9: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 8 of Table 1a, the sequence having at least 85.9% sequence identity with each of SEQ ID NOs: 112, 113, 142, 145, 143, 144, 139, 140, 138, 141, and optionally wherein the fatty acid of the oil is enriched in C12 and C14 fatty acids.
[0015] Embodiment 10: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 9 of Table 1a, the sequence having at least 83.8% sequence identity with each of SEQ ID NOs: 187-189, and optionally wherein the fatty acid of the oil is enriched in C12 and C14 fatty acids.
[0016] Embodiment 11: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 10 of Table 1a, the sequence having at least 95.9% sequence identity with each of SEQ ID NOs: 147, 149, 146, 150, 152, 151, 148, 154, 156, 155, 157, 108, 75, 190, 191, and 192, and optionally wherein the fatty acid of the oil is enriched in C14 and C16 fatty acids.
[0017] Embodiment 12: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 11 of Table 1a, the sequence having at least 88.7% sequence identity with SEQ ID NO: 121, and optionally wherein the fatty acid of the oil is enriched in C14 and C16 fatty acids.
[0018] Embodiment 13: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 12 of Table 1a, the sequence having at least 72.8% sequence identity with each of SEQ ID NOs: 129 and 186, and optionally wherein the fatty acid of the oil is enriched in C16 fatty acids.
[0019] Embodiment 14: An isolated nucleic acid or recombinant DNA construct including a nucleic acid, wherein the nucleic acid has at least 80% sequence identity to any of SEQ ID NOS: 2, 3, 5, 6, 8, 9, 11, 12, 14, 15, 17, 18, 20, 21, 23, 24, 26, 27, 29, 30, 32, 33, 35, 36, 38, 39, 41, 42, 44, 45, 47, 48, 50, 51, 53, 54, 56, 57, 59, 60, 62, 63, 65, 66, 68, 69, 71, 72, 74, 76, 78, 80, 81, 83, 84, 86, 87, 89, 90, 92, 93, 95, 96, 98, 99, 101, 102, 104, 105, 107, 109 or any equivalent sequences by virtue of the degeneracy of the genetic code.
[0020] Embodiment 15: An isolated nucleic acid sequence encoding a protein or a host cell expressing a protein having at least 80% sequence identity to any of SEQ ID NOS: 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 37, 40, 43, 46, 49, 52, 55, 58, 61, 64, 67, 70, 73, 75, 77, 79, 82, 85, 88, 91, 94, 97, 100, 103, 106, 108, 110-192 or a fragment thereof having acyl-ACP thioesterase activity.
[0021] Embodiment 16: The isolated nucleic acid of embodiment 15, wherein, the protein has acyl-ACP thioesterase activity operable to alter the fatty acid profile of an oil produced by a recombinant cell including that sequence.
[0022] Embodiment 17: A method of producing a recombinant cell that produces an altered fatty acid profile, the method including transforming the cell with a nucleic acid according to any of embodiments 1-3.
[0023] Embodiment 18: A host cell produced by the method of embodiment 17.
[0024] Embodiment 19: The host cell of embodiment 18, wherein the host cell is selected from a plant cell, a microbial cell, and a microalgal cell.
[0025] Embodiment 20: A method for producing an oil or oil-derived product, the method including cultivating a host cell of embodiment 5 or 6, and extracting oil produced thereby, optionally wherein the cultivation is heterotrophic growth on sugar.
[0026] Embodiment 21: The method of embodiment 20, further including producing a fatty acid, fuel, chemical, or other oil-derived product from the oil.
[0027] Embodiment 22: An oil produced by the method of embodiment 20, optionally having a fatty acid profile including at least 20% C8, C10, C12, C14 or C16 fatty acids.
[0028] Embodiment 23: An oil-derived product produced by the method of embodiment 21.
[0029] Embodiment 24: The oil of embodiment 23, wherein the oil is produced by a microalga and, optionally, lacks C24-alpha sterols.
Description of Illustrative Embodiments of the Invention
Definitions
[0030] As used with respect to nucleic acids, the term "isolated" refers to a nucleic acid that is free of at least one other component that is typically present with the naturally occurring nucleic acid. Thus, a naturally occurring nucleic acid is isolated if it has been purified away from at least one other component that occurs naturally with the nucleic acid.
[0031] A "natural oil" or "natural fat" shall mean a predominantly triglyceride oil obtained from an organism, where the oil has not undergone blending with another natural or synthetic oil, or fractionation so as to substantially alter the fatty acid profile of the triglyceride. In connection with an oil comprising triglycerides of a particular regiospecificity, the natural oil or natural fat has not been subjected to interesterification or other synthetic process to obtain that regiospecific triglyceride profile, rather the regiospecificity is produced naturally, by a cell or population of cells. In connection with a natural oil or natural fat, and as used generally throughout the present disclosure, the terms oil and fat are used interchangeably, except where otherwise noted. Thus, an "oil" or a "fat" can be liquid, solid, or partially solid at room temperature, depending on the makeup of the substance and other conditions.
Here, the term "fractionation" means removing material from the oil in a way that changes its fatty acid profile relative to the profile produced by the organism, however accomplished. The terms "natural oil" and "natural fat" encompass such oils obtained from an organism, where the oil has undergone minimal processing, including refining, bleaching and/or degumming, which does not substantially change its triglyceride profile. A natural oil can also be a "noninteresterified natural oil", which means that the natural oil has not undergone a process in which fatty acids have been redistributed in their acyl linkages to glycerol and remain essentially in the same configuration as when recovered from the organism.
[0032] "Exogenous gene" shall mean a nucleic acid that codes for the expression of an RNA and/or protein that has been introduced into a cell (e.g., by transformation/transfection), and is also referred to as a "transgene". A cell comprising an exogenous gene may be referred to as a recombinant cell, into which additional exogenous gene(s) may be introduced. The exogenous gene may be from a different species (and so heterologous), or from the same species (and so homologous), relative to the cell being transformed. Thus, an exogenous gene can include a homologous gene that occupies a different location in the genome of the cell or is under different control, relative to the endogenous copy of the gene. An exogenous gene may be present in more than one copy in the cell. An exogenous gene may be maintained in a cell, for example, as an insertion into the genome (nuclear or plastid) or as an episomal molecule.
[0033] "Fatty acids" shall mean free fatty acids, fatty acid salts, or fatty acyl moieties in a glycerolipid. It will be understood that fatty acyl groups of glycerolipids can be described in terms of the carboxylic acid or anion of a carboxylic acid that is produced when the triglyceride is hydrolyzed or saponified.
[0034] "Microalgae" are microbial organisms that contain a chloroplast or other plastid, and optionally that are capable of performing photosynthesis, or a prokaryotic microbial organism capable of performing photosynthesis.
Microalgae include obligate photoautotrophs, which cannot metabolize a fixed carbon source as energy, as well as heterotrophs, which can live solely off of a fixed carbon source.
Microalgae include unicellular organisms that separate from sister cells shortly after cell division, such as Chlamydomonas, as well as microbes such as, for example, Volvox, which is a simple multicellular photosynthetic microbe of two distinct cell types. Microalgae include cells such as Chlorella, Dunaliella, and Prototheca.

Microalgae also include other microbial photosynthetic organisms that exhibit cell-cell adhesion, such as Agmenellum, Anabaena, and Pyrobotrys. Microalgae also include obligate heterotrophic microorganisms that have lost the ability to perform photosynthesis, such as certain dinoflagellate algae species and species of the genus Prototheca.
[0035] An "oleaginous" cell is a cell capable of producing at least 20% lipid by dry cell weight, naturally or through recombinant or classical strain improvement.
An "oleaginous microbe" or "oleaginous microorganism" is a microbe, including a microalga, that is oleaginous.
[0036] The term "percent sequence identity," in the context of two or more amino acid or nucleic acid sequences, refers to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same, when compared and aligned for maximum correspondence, as measured using a sequence comparison algorithm or by visual inspection. For sequence comparison to determine percent nucleotide or amino acid identity, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are input into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters. Optimal alignment of sequences for comparison can be conducted using the NCBI BLAST software (ncbi.nlm.nih.gov/BLAST/) set to default parameters.
For example, to compare two nucleic acid sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 2.0.12 (Apr. 21, 2000) set at the following default parameters: Matrix: BLOSUM62; Reward for match: 1; Penalty for mismatch: -2; Open Gap: 5 and Extension Gap: 2 penalties; Gap x drop-off: 50; Expect: 10; Word Size: 11; Filter: on. For a pairwise comparison of two amino acid sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.12 (Apr. 21, 2000) with blastp set, for example, at the following default parameters: Matrix: BLOSUM62; Open Gap: 11 and Extension Gap: 1 penalties; Gap x drop-off: 50; Expect: 10; Word Size: 3; Filter: on.
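As an illustration only, the identity calculation described above can be sketched in Python. The sketch assumes the two sequences have already been aligned (for example by BLAST) and are supplied as equal-length gapped strings. The function name `percent_identity` and the use of the alignment length as the denominator are assumptions made for this example; the specification itself only refers to the BLAST tool and its parameters.

```python
def percent_identity(aligned_ref: str, aligned_test: str) -> float:
    """Percent identity between two pre-aligned sequences.

    Both inputs are gapped strings of equal length ('-' marks a gap).
    Assumption: identity = identical columns / aligned columns * 100,
    where columns that are gaps in both sequences are ignored.
    """
    if len(aligned_ref) != len(aligned_test):
        raise ValueError("aligned sequences must have the same length")

    # Keep every column except those that are gaps in both sequences.
    columns = [
        (r, t)
        for r, t in zip(aligned_ref.upper(), aligned_test.upper())
        if not (r == "-" and t == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")

    # A column counts as a match only if both residues are identical non-gaps.
    matches = sum(1 for r, t in columns if r == t and r != "-")
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Toy sequences, not taken from the sequence listing.
    print(round(percent_identity("MKT-LV", "MKTALV"), 1))
```

A value computed this way could then be compared with a threshold such as the "at least 80%" recited in the embodiments, bearing in mind that a different denominator convention would give slightly different numbers.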
[0037] In connection with a natural oil, a "profile" is the distribution of particular species or triglycerides or fatty acyl groups within the oil. A "fatty acid profile" is the distribution of fatty acyl groups in the triglycerides of the oil without reference to attachment to a glycerol backbone. Fatty acid profiles are typically determined by conversion to a fatty acid methyl ester (FAME), followed by gas chromatography (GC) analysis with flame ionization detection (FID). The fatty acid profile can be expressed as one or more percent of a fatty acid in the total fatty acid signal determined from the area under the curve for that fatty acid. FAME-GC-FID measurements approximate weight percentages of the fatty acids.
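The area-percent calculation described in this paragraph can be sketched as follows. This is a minimal example under stated assumptions: the peak areas are invented for illustration, and the function name `fatty_acid_profile` is hypothetical. It simply expresses each fatty acid's peak area as a percentage of the total fatty acid signal.

```python
from typing import Dict


def fatty_acid_profile(peak_areas: Dict[str, float]) -> Dict[str, float]:
    """Convert FAME-GC-FID peak areas into area percentages.

    peak_areas maps a fatty acid label (e.g. "C12:0") to the integrated
    area under its chromatogram peak, in arbitrary but consistent units.
    """
    total = sum(peak_areas.values())
    if total <= 0:
        raise ValueError("total peak area must be positive")
    return {name: 100.0 * area / total for name, area in peak_areas.items()}


if __name__ == "__main__":
    # Hypothetical peak areas, not measured data.
    areas = {"C10:0": 50.0, "C12:0": 150.0, "C14:0": 100.0, "C16:0": 200.0}
    for name, pct in fatty_acid_profile(areas).items():
        print(f"{name}: {pct:.1f}%")
```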
[0038] As used herein, an oil is said to be "enriched" in one or more particular fatty acids if there is at least a 10% increase in the mass of that fatty acid in the oil relative to the non-enriched oil. For example, in the case of a cell expressing a heterologous FatB gene described herein, the oil produced by the cell is said to be enriched in, e.g., C8 and C16 fatty acids if the mass of these fatty acids in the oil is at least 10% greater than in oil produced by a cell of the same type that does not express the heterologous FatB gene (e.g., wild type oil).
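The enrichment test defined above can be expressed as a short Python sketch. It assumes that "at least a 10% increase" means a relative increase in the mass of the fatty acid compared with the control oil (rather than an increase of ten percentage points); that reading, the function name `is_enriched`, and the example masses are assumptions for illustration.

```python
def is_enriched(mass_in_test_oil: float,
                mass_in_control_oil: float,
                min_relative_increase: float = 0.10) -> bool:
    """Return True if the fatty acid mass in the test oil exceeds the
    mass in the control (non-enriched) oil by at least the given
    relative increase (default 10%).

    Both masses must be expressed in the same units and refer to the
    same quantity of oil.
    """
    if mass_in_control_oil <= 0:
        raise ValueError("control mass must be positive to compute a relative increase")
    relative_increase = (mass_in_test_oil - mass_in_control_oil) / mass_in_control_oil
    return relative_increase >= min_relative_increase


if __name__ == "__main__":
    # Hypothetical masses (e.g. grams of C12 fatty acid per 100 g oil).
    print(is_enriched(12.0, 10.0))  # 20% relative increase
    print(is_enriched(10.5, 10.0))  # 5% relative increase
```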
[0039] "Recombinant" is a cell, nucleic acid, protein or vector that has been modified due to the introduction of an exogenous nucleic acid or the alteration of a native nucleic acid. Thus, e.g., recombinant (host) cells can express genes that are not found within the native (non-recombinant) form of the cell or express native genes differently than those genes are expressed by a non-recombinant cell.
Recombinant cells can, without limitation, include recombinant nucleic acids that encode a gene product or suppression elements such as mutations, knockouts, antisense, interfering RNA (RNAi) or dsRNA that reduce the levels of active gene product in a cell. A "recombinant nucleic acid" is a nucleic acid originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using polymerases, ligases, exonucleases, and endonucleases, using chemical synthesis, or otherwise is in a form not normally found in nature. Recombinant nucleic acids may be produced, for example, to place two or more nucleic acids in operable linkage. Thus, an isolated nucleic acid and an expression vector formed in vitro by ligating DNA molecules that are not normally joined in nature are both considered recombinant for the purposes of this invention. Recombinant nucleic acids can also be produced in other ways; e.g., using chemical DNA synthesis. Once a recombinant nucleic acid is made and introduced into a host cell or organism, it may replicate using the in vivo cellular machinery of the host cell; however, such nucleic acids, once produced recombinantly, although subsequently replicated intracellularly, are still considered recombinant for purposes of this invention. Similarly, a "recombinant protein" is a protein made using recombinant techniques, i.e., through the expression of a recombinant nucleic acid.
[0040] Embodiments of the present invention relate to the use of FatB genes isolated from plants, which can be expressed in a host cell in order to alter the fatty acid profile of an oil produced by the recombinant cell. Although the microalga Prototheca moriformis was used to screen the genes for the ability to alter the fatty acid profile, the genes are useful in a wide variety of host cells. For example, the genes can be expressed in bacteria, other microalgae, or higher plants. The genes can be expressed in higher plants according to the methods of US Patent Nos. 5,850,022; 5,723,761; 5,639,790; 5,807,893; 5,455,167; 5,654,495; 5,512,482; 5,298,421; 5,667,997; 5,344,771; and 5,304,481. The fatty acids can be further converted to triglycerides, fatty aldehydes, fatty alcohols and other oleochemicals either synthetically or biosynthetically.
[0041] In specific embodiments, triglycerides are produced by a host cell expressing a novel FatB gene. A triglyceride-containing natural oil can be recovered from the host cell. The natural oil can be refined, degummed, bleached and/or deodorized. The oil, in its natural or processed form, can be used for foods, chemicals, fuels, cosmetics, plastics, and other uses. In other embodiments, the FatB gene may not be novel, but the expression of the gene in a microalga is novel.
[0042] The genes can be used in a variety of genetic constructs including plasmids or other vectors for expression or recombination in a host cell. The genes can be codon optimized for expression in a target host cell. The proteins produced by the genes can be used in vivo or in purified form.
[0043] For example, the gene can be prepared in an expression vector comprising an operably linked promoter and 5'UTR. Where a plastidic cell is used as the host, a suitably active plastid targeting peptide can be fused to the FATB gene, as in the examples below. Generally, for the newly identified FATB genes, there are roughly 50 amino acids at the N-terminal that constitute a plastid transit peptide, which are responsible for transporting the enzyme to the chloroplast. In the examples below, this transit peptide is replaced with a 38 amino acid sequence that is effective in the Prototheca moriformis host cell for transporting the enzyme to the plastids of those cells. Thus, the invention contemplates deletions and fusion proteins in order to optimize enzyme activity in a given host cell. For example, a transit peptide from the host or related species may be used instead of that of the newly discovered plant genes described here.
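The transit peptide replacement described above can be pictured, at the level of the amino acid sequence, with a small Python sketch. Everything concrete in it is a placeholder: the sequences are toy strings, not entries from the sequence listing, and the function name `swap_transit_peptide` is hypothetical. The length of the native transit peptide is passed in explicitly because the text gives it only as "roughly 50 amino acids".

```python
def swap_transit_peptide(native_protein: str,
                         native_tp_length: int,
                         replacement_tp: str) -> str:
    """Replace the N-terminal transit peptide of a protein sequence.

    native_protein   : full-length amino acid sequence (one-letter code)
    native_tp_length : number of N-terminal residues treated as the
                       native plastid transit peptide
    replacement_tp   : transit peptide suited to the intended host
    """
    if not 0 <= native_tp_length < len(native_protein):
        raise ValueError("transit peptide length must be shorter than the protein")
    # Drop the native transit peptide and keep the remainder of the enzyme.
    mature_region = native_protein[native_tp_length:]
    # Fuse the host-appropriate transit peptide to the N-terminus.
    return replacement_tp + mature_region


if __name__ == "__main__":
    # Toy sequences for illustration only.
    native = "M" + "A" * 49 + "GHIKLMNPQR"   # 50-residue placeholder peptide + toy enzyme
    host_tp = "M" + "S" * 37                 # 38-residue placeholder peptide
    fusion = swap_transit_peptide(native, 50, host_tp)
    print(len(fusion), fusion)
```

In practice the boundary of the native transit peptide would have to be determined for each gene; the fixed length used here is only a convenience for the example.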
[0044] A selectable marker gene may be included in the vector to assist in isolating a transformed cell. Examples of selectable markers useful in microalgae include sucrose invertase and antibiotic resistance genes.
[0045] The gene sequences disclosed can also be used to prepare antisense, or inhibitory RNA (e.g., RNAi or hairpin RNA) to inhibit complementary genes in a plant or other organism.
[0046] FatB genes found to be useful in producing desired fatty acid profiles in a cell are summarized below in Table 1. Nucleic acids or proteins having the sequence of SEQ ID NOS: 1-109 can be used to alter the fatty acid profile of a recombinant cell. Variant nucleic acids can also be used; e.g., variants having at least 70, 80, 85, 90, 95, 96, 97, 98, or 99% sequence identity to SEQ ID NOS: 2, 3, 5, 6, 8, 9, 11, 12, 14, 15, 17, 18, 20, 21, 23, 24, 26, 27, 29, 30, 32, 33, 35, 36, 38, 39, 41, 42, 44, 45, 47, 48, 50, 51, 53, 54, 56, 57, 59, 60, 62, 63, 65, 66, 68, 69, 71, 72, 74, 76, 78, 80, 81, 83, 84, 86, 87, 89, 90, 92, 93, 95, 96, 98, 99, 101, 102, 104, 105, 107 or 109.
Codon optimization of the genes for a variety of host organisms is contemplated, as is the use of gene fragments. Preferred codons for Prototheca strains and for Chlorella protothecoides are shown below in Tables 2 and 3, respectively. Codon usage for Cuphea wrightii is shown in Table 3a. Codon usage for Arabidopsis is shown in Table 3b; for example, the most preferred codon for each amino acid can be selected. Codon tables for other organisms including microalgae and higher plants are known in the art. In some embodiments, the first and/or second most preferred Prototheca codons are employed for codon optimization. In specific embodiments, the novel amino acid sequences contained in the sequence listings below are converted into nucleic acid sequences according to the most preferred codon usage in Prototheca, Chlorella, Cuphea wrightii, or Arabidopsis as set forth in Tables 2 through 3b or nucleic acid sequences having at least 70, 80, 85, 90, 95, 96, 97, 98, or 99% sequence identity to these derived nucleic acid sequences.
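Converting an amino acid sequence into a nucleic acid sequence using the most preferred codon for each amino acid can be sketched as below. This is a minimal illustration, not the procedure actually used for the disclosed sequences: the `PLACEHOLDER_PREFERRED_CODONS` mapping covers only a few amino acids and its entries are valid codons chosen arbitrarily for the example, not values taken from Tables 2 through 3b. A real use would substitute the most preferred codon per amino acid from the relevant table.

```python
from typing import Dict

# Placeholder mapping: each codon does encode the listed amino acid in the
# standard genetic code, but the choice of codon is NOT taken from the
# codon usage tables of this document.
PLACEHOLDER_PREFERRED_CODONS: Dict[str, str] = {
    "M": "ATG",
    "A": "GCC",
    "L": "CTG",
    "K": "AAG",
    "*": "TGA",  # stop
}


def back_translate(protein: str, preferred_codons: Dict[str, str]) -> str:
    """Build a coding sequence by using one preferred codon per residue."""
    codons = []
    for position, residue in enumerate(protein.upper(), start=1):
        if residue not in preferred_codons:
            raise KeyError(f"no preferred codon for residue {residue!r} at position {position}")
        codons.append(preferred_codons[residue])
    return "".join(codons)


if __name__ == "__main__":
    # Toy peptide, not from the sequence listing.
    print(back_translate("MALK*", PLACEHOLDER_PREFERRED_CODONS))
```

Using a single codon per amino acid keeps the example short; the text also contemplates using the first and/or second most preferred codons, which would require a richer table and a selection rule.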
[0047] In embodiments of the invention, there is a protein or a nucleic acid encoding a protein having any of SEQ ID NOS: 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 37, 40, 43, 46, 49, 52, 55, 58, 61, 64, 67, 70, 73, 75, 77, 79, 82, 85, 88, 91, 94, 97, 100, 103, 106, 108, or 110-192. In an embodiment, there is a protein or a nucleic acid encoding a protein having at least 70, 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% sequence identity with any of SEQ ID NOS: 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 37, 40, 43, 46, 49, 52, 55, 58, 61, 64, 67, 70, 73, 75, 77, 79, 82, 85, 88, 91, 94, 97, 100, 103, 106, 108, or 110-192. In certain embodiments, the invention encompasses a fragment of any of the above-described proteins or nucleic acids (including fragments of protein or nucleic acid variants), wherein the protein fragment has acyl-ACP thioesterase activity or the nucleic acid fragment encodes such a protein fragment. In other embodiments, the fragment includes a domain of an acyl-ACP thioesterase that mediates a particular function, e.g., a specificity-determining domain.
Illustrative fragments can be produced by C-terminal and/or N-terminal truncations and include at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% of the full-length sequences disclosed herein.
[0048] In certain embodiments, percent sequence identity for variants of the nucleic acids or proteins discussed above can be calculated by using the full-length nucleic acid sequence (e.g., one of SEQ ID NOS: 2, 3, 5, 6, 8,9, 11, 12, 14, 15, 17, 18, 20, 21, 23, 24, 26, 27, 29, 30, 32, 33, 35, 36, 38, 39, 41, 42, 44, 45, 47, 48, 50, 51, 53, 54, 56, 57, 59, 60, 62, 63, 65, 66, 68, 69, 71, 72, 74, 76, 78, 80, 81, 83, 84, 86, 87, 89, 90, 92, 93, 95, 96, 98, 99, 101, 102, 104, 105, 107 or 109) or full-length amino acid sequence (e.g., one of SEQ ID NOS: 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 37, 40, 43, 46, 49, 52, 55, 58, 61, 64, 67, 70, 73, 75, 77, 79, 82, 85, 88, 91, 94, 97, 100, 103, 106, 108, or 110-192) as the reference sequence and comparing the full-length test sequence to this reference sequence. In some embodiments relating to fragments, percent sequence identity for variants of nucleic acid or protein fragments can be calculated over the entire length of the fragment.
[0049] The nucleic acids can be in isolated form, or part of a vector or other construct, chromosome or host cell. It has been found that in many cases the full-length gene (and protein) is not needed; for example, deletion of some or all of the N-terminal hydrophobic domain (typically an 18 amino acid domain starting with LPDW) yields a still-functional gene. In addition, fusions of the specificity determining regions of the genes in Table 1 with catalytic domains of other acyl-ACP thioesterases can yield functional genes. Thus, in certain embodiments, the invention encompasses functional fragments (e.g., specificity determining regions) of the disclosed nucleic acid or amino acid sequences fused to heterologous acyl-ACP thioesterase nucleic acid or amino acid sequences, respectively.
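The deletion of the N-terminal hydrophobic domain can be illustrated with a short sketch. It assumes, for illustration only, that the domain is exactly the 18 residues beginning at the first LPDW motif; the text says "some or all" of the domain may be deleted and does not fix the boundaries. The function name and the input fragment (taken from the ChsFATB3j sequence listed in Table 4) are choices made for this example.

```python
def delete_hydrophobic_domain(protein, motif="LPDW", length=18):
    """Remove `length` residues starting at the first occurrence of `motif`.

    Returns the sequence unchanged when the motif is absent.
    The 18-residue window is an assumption based on the "typical" domain size.
    """
    start = protein.find(motif)
    if start == -1:
        return protein
    return protein[:start] + protein[start + length:]


fragment = "INQLPDWSMLLSAITTVFVAAEKQWTMLDRK"
print(delete_hydrophobic_domain(fragment))  # INQEKQWTMLDRK
```

Whether a given truncation retains thioesterase activity is an experimental question; the sketch only shows the sequence manipulation.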
Table 1: FatB genes according to embodiments of the present invention
Species | Gene Name | Sequence Variant (relative to dominant transcript identified) | Amino Acid Sequence of CDS (no additional tags) | Native CDS nucleotide sequence (not codon-optimized, no additional cloning sites) | Prototheca moriformis codon-optimized nucleotide sequence of CDS
Cinnamomum camphora | CcFATB1b | M25L, M322R, ΔT367-D368 | SEQ ID NO: 1 | SEQ ID NO: 2 | SEQ ID NO: 3
Cinnamomum camphora | CcFATB4 | "wild-type" | SEQ ID NO: 4 | SEQ ID NO: 5 | SEQ ID NO: 6
Cinnamomum camphora | CcFATB3 | "wild-type" | SEQ ID NO: 7 | SEQ ID NO: 8 | SEQ ID NO: 9
Cuphea hyssopifolia | ChsFATB1 | "wild-type" | SEQ ID NO: 10 | SEQ ID NO: 11 | SEQ ID NO: 12
Cuphea hyssopifolia | ChsFATB2 | "wild-type" | SEQ ID NO: 13 | SEQ ID NO: 14 | SEQ ID NO: 15
Cuphea hyssopifolia | ChsFATB2b | +a.a.248-259 | SEQ ID NO: 16 | SEQ ID NO: 17 | SEQ ID NO: 18
Cuphea hyssopifolia | ChsFATB3 | "wild-type" | SEQ ID NO: 19 | SEQ ID NO: 20 | SEQ ID NO: 21
Cuphea hyssopifolia | ChsFATB3b | V204I, C239F, E243D, M251V | SEQ ID NO: 22 | SEQ ID NO: 23 | SEQ ID NO: 24
Cuphea PSR23 | CuPSR23FATB3 | "wild-type" | SEQ ID NO: 25 | SEQ ID NO: 26 | SEQ ID NO: 27
Cuphea wrightii | CwFATB3 | "wild-type" | SEQ ID NO: 28 | SEQ ID NO: 29 | SEQ ID NO: 30
Cuphea wrightii | CwFATB4a | "wild-type" | SEQ ID NO: 31 | SEQ ID NO: 32 | SEQ ID NO: 33
Cuphea wrightii | CwFATB4b | "wild-type" | SEQ ID NO: 34 | SEQ ID NO: 35 | SEQ ID NO: 36
Cuphea wrightii | CwFATB5 | "wild-type" | SEQ ID NO: 37 | SEQ ID NO: 38 | SEQ ID NO: 39
Cuphea heterophylla | ChtFATB1a | "wild-type" | SEQ ID NO: 40 | SEQ ID NO: 41 | SEQ ID NO: 42
Cuphea heterophylla | ChtFATB1b | P16S, T20P, G94S, G105W, S293F, L305F | SEQ ID NO: 43 | SEQ ID NO: 44 | SEQ ID NO: 45
Cuphea heterophylla | ChtFATB2b | "wild-type" | SEQ ID NO: 46 | SEQ ID NO: 47 | SEQ ID NO: 48
Cuphea heterophylla | ChtFATB2a | S17P, P21S, T28N, L30P, S33L, G76D, S78P, G137W | SEQ ID NO: 49 | SEQ ID NO: 50 | SEQ ID NO: 51
Cuphea heterophylla | ChtFATB2c | G76D, S78P | SEQ ID NO: 52 | SEQ ID NO: 53 | SEQ ID NO: 54
Cuphea heterophylla | ChtFATB2d | S21P, T28N, L30P, S33L, G76D, R97L, H124L, W127L, I132S, K258N, C303R, E309G, K334T, T386A | SEQ ID NO: 55 | SEQ ID NO: 56 | SEQ ID NO: 57
Cuphea heterophylla | ChtFATB2e | G76D, R97L, H124L, I132S, G152S, H165L, T211N, K258N, C303R, E309G, K334T, T386A | SEQ ID NO: 58 | SEQ ID NO: 59 | SEQ ID NO: 60
Cuphea heterophylla | ChtFATB2f | R97L, H124L, I132S, G152S, H165L, T211N | SEQ ID NO: 61 | SEQ ID NO: 62 | SEQ ID NO: 63
Cuphea heterophylla | ChtFATB2g | A6T, A16V, S17P, G76D, R97L, H124L, I132S, S143I, G152S, A157T, H165L, T211N, G414A | SEQ ID NO: 64 | SEQ ID NO: 65 | SEQ ID NO: 66
Cuphea heterophylla | ChtFATB3a | "wild-type" | SEQ ID NO: 67 | SEQ ID NO: 68 | SEQ ID NO: 69
Cuphea heterophylla | ChtFATB3b | C67G, H72Q, L128F, N179I | SEQ ID NO: 70 | SEQ ID NO: 71 | SEQ ID NO: 72
Cuphea viscosissima | CvisFATB1 | published | SEQ ID NO: 73 | N/A | SEQ ID NO: 74
Cuphea viscosissima | CvisFATB2 | published | SEQ ID NO: 75 | N/A | SEQ ID NO: 76
Cuphea viscosissima | CvisFATB3 | published | SEQ ID NO: 77 | N/A | SEQ ID NO: 78
Cuphea calcarata | CcalcFATB1 | "wild-type" | SEQ ID NO: 79 | SEQ ID NO: 80 | SEQ ID NO: 81
Cuphea painteri | CpaiFATB1 | "wild-type" | SEQ ID NO: 82 | SEQ ID NO: 83 | SEQ ID NO: 84
Cuphea hookeriana | ChookFATB4 | "wild-type" | SEQ ID NO: 85 | SEQ ID NO: 86 | SEQ ID NO: 87
Cuphea avigera var. pulcherrima | CaFATB1 | "wild-type" | SEQ ID NO: 88 | SEQ ID NO: 89 | SEQ ID NO: 90
Cuphea paucipetala | CpauFATB1 | "wild-type" | SEQ ID NO: 91 | SEQ ID NO: 92 | SEQ ID NO: 93
Cuphea procumbens | CprocFATB1 | "wild-type" | SEQ ID NO: 94 | SEQ ID NO: 95 | SEQ ID NO: 96
Cuphea procumbens | CprocFATB2 | "wild-type" | SEQ ID NO: 97 | SEQ ID NO: 98 | SEQ ID NO: 99
Cuphea procumbens | CprocFATB3 | "wild-type" | SEQ ID NO: 100 | SEQ ID NO: 101 | SEQ ID NO: 102
Cuphea ignea | CigneaFATB1 | "wild-type"; partial (missing N-terminal portion of native transit peptide, fused to CpSAD1tp_trimmed transit peptide) | SEQ ID NO: 103 | SEQ ID NO: 104 | SEQ ID NO: 105
Consensus sequence | JcFATB1 | Consensus | SEQ ID NO: 106 | None, can be codon optimized for a given host | SEQ ID NO: 107
Consensus sequence | JcFATB2 | Consensus | SEQ ID NO: 108 | None, can be codon optimized for a given host | SEQ ID NO:

In certain embodiments, a host cell (e.g., plant or microalgal cell) is transformed to produce a recombinant FATB protein falling into one of clades 1-12 of Table 1a. These clades were determined by sequence alignment and observation of changes in fatty acid profile when expressed in Prototheca. See Example 5. The FATB amino acid sequence can fall within x% amino acid sequence identity of each sequence in that clade listed in Table 1a, where x is a first, second, or third cutoff value, also listed in Table 1a.
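The clade test described above can be expressed as a simple check. This sketch reads "within x% identity of each sequence in that clade" as "at least x% identity to every listed member", which is an interpretation rather than a definition given in the text. The cutoff triples are copied from the clade table for clades 1, 4, and 12; the identity values in the example are hypothetical inputs, not measured results.

```python
# (first, second, third) cutoff values for a few clades, as listed in the clade table.
CLADE_CUTOFFS = {
    1: (85.9, 97.4, 98.0),
    4: (83.8, 93.0, 95.0),
    12: (72.8, 85.0, 90.0),
}


def cutoffs_met(clade, identities):
    """Return [first_ok, second_ok, third_ok] for a candidate sequence.

    `identities` maps each clade member's name to the candidate's percent
    identity with that member; every member must meet the cutoff.
    """
    lowest = min(identities.values())
    return [lowest >= cutoff for cutoff in CLADE_CUTOFFS[clade]]


# Hypothetical identities of a candidate protein to the four clade 1 members.
candidate = {"ChsFATB3": 97.9, "ChsFATB3d": 98.4, "ChsFATB3b": 97.5, "ChsFATB3c": 99.0}
print(cutoffs_met(1, candidate))  # [True, True, False]
```

Because every member must be satisfied, the weakest pairwise identity decides the outcome, which is why the function only needs the minimum.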

Table 1a: Groupings of novel FatB genes into clades.
Clade No. | Amino Acid SEQ ID Nos. in Clade | Example Function (see Table 6) | First Cutoff Value (minimum % amino acid identity to members of clade) | Second Cutoff Value | Third Cutoff Value
1 | ChsFATB3 (SEQ ID NO: 19); ChsFATB3d (SEQ ID NO: 161); ChsFATB3b (SEQ ID NO: 22); ChsFATB3c (SEQ ID NO: 160) | Increase C14/C16 fatty acids | 85.9 | 97.4 | 98
2 | ChtFATB1a.2 (SEQ ID NO: 134); ChtFATB1a.3 (SEQ ID NO: 135); ChtFATB1a.4 (SEQ ID NO: 136); ChtFATB1a (SEQ ID NO: 132); ChtFATB1a.1 (SEQ ID NO: 133); ChtFATB1b (SEQ ID NO: 137); CwFATB5b (SEQ ID NO: 124); CwFATB5 (SEQ ID NO: 122); CwFATB5a (SEQ ID NO: 123); CwFATB5c (SEQ ID NO: 125) | Increase C12/C14 fatty acids | 89.5 | 95 | 98
3 | CwFATB5.1 (SEQ ID NO: 126); CwFATB5.1a (SEQ ID NO: 127) | Increase C12/C14 fatty acids | 92.5 | 95 | 98
4 | CcalcFATB1 (SEQ ID NO: 79) | Increase C12/C14 fatty acids | 83.8 | 93 | 95
5 | CaFATB1 (SEQ ID NO: 88); CpaiFATB1 (SEQ ID NO: 82); ChookFATB4 (SEQ ID NO: 85); CigneaFATB1 (SEQ ID NO: 103) | Increase C8/C10 fatty acids | 94.6 | 96 | 98
6 | CuPSR23FATB3b (SEQ ID NO: 111); CuPSR23FATB3 (SEQ ID NO: 110) | Increase C10 fatty acids | 99.9 | |
7 | CvisFATB1 (SEQ ID NO: 73); JcFATB1/SzFATB1 (SEQ ID NO: 106); CgFATB1b (SEQ ID NO: 185); CprocFATB1 (SEQ ID NO: 172); CpauFATB1 (SEQ ID NO: 171); CprocFATB2 (SEQ ID NO: 173); CprocFATB3 (SEQ ID NO: 174) | Increase C10/C12 fatty acids | 89.5 | 93 | 96
8 | CwFATB3 (SEQ ID NO: 112); CwFATB3a (SEQ ID NO: 113); ChtFATB2e (SEQ ID NO: 142); ChtFATB2h (SEQ ID NO: 145); ChtFATB2f (SEQ ID NO: 143); ChtFATB2g (SEQ ID NO: 144); ChtFATB2a (SEQ ID NO: 139); ChtFATB2c (SEQ ID NO: 140); ChtFATB2b (SEQ ID NO: 138); ChtFATB2d (SEQ ID NO: 141) | Increase C12/C14 fatty acids | 85.9 | 98.9 | 99.5
9 | CcrFATB2c (SEQ ID NO: 187); CcrFATB2 (SEQ ID NO: 188); CcrFATB2b (SEQ ID NO: 189) | Increase C12/C14 fatty acids | 83.8 | 90 | 95
10 | ChtFATB3b (SEQ ID NO: 147); ChtFATB3d (SEQ ID NO: 149); ChtFATB3a (SEQ ID NO: 146); ChtFATB3e (SEQ ID NO: 150); ChtFATB3g (SEQ ID NO: 152); ChtFATB3f (SEQ ID NO: 151); ChtFATB3c (SEQ ID NO: 148); ChsFATB2 (SEQ ID NO: 154); ChsFATB2c (SEQ ID NO: 156); ChsFATB2b (SEQ ID NO: 155); ChsFATB2d (SEQ ID NO: 157); JcFATB2/SzFATB2 (SEQ ID NO: 108); CvisFATB2 (SEQ ID NO: 75); CcrFATB1 (SEQ ID NO: 190); CcrFATB1b (SEQ ID NO: 191); CcrFATB1c (SEQ ID NO: 192) | Increase C14/C16 fatty acids | 95.9 | 98 | 99
11 | CwFATB4b.1 (SEQ ID NO: 121) | Increase C14/C16 fatty acids | 88.7 | 94.5 |
12 | CcFATB3 (SEQ ID NO: 129); UcFATB3 (SEQ ID NO: 186) | Increase C16 fatty acids (predicted) | 72.8 | 85 | 90

Table 2: Preferred codon usage in Prototheca strains
Ala | GCG 345 (0.36); GCA 66 (0.07); GCT 101 (0.11); GCC 442 (0.46)
Arg | AGG 33 (0.06); AGA 14 (0.02); CGG 102 (0.18); CGA 49 (0.08); CGT 51 (0.09); CGC 331 (0.57)
Asn | AAT 8 (0.04); AAC 201 (0.96)
Asp | GAT 43 (0.12); GAC 316 (0.88)
Cys | TGT 12 (0.10); TGC 105 (0.90)
Gln | CAG 226 (0.82); CAA 48 (0.18)
Glu | GAG 377 (0.96); GAA 14 (0.04)
Gly | GGG 92 (0.12); GGA 56 (0.07); GGT 76 (0.10); GGC 559 (0.71)
His | CAT 42 (0.21); CAC 154 (0.79)
Ile | ATA 4 (0.01); ATT 30 (0.08); ATC 338 (0.91)
Leu | TTG 26 (0.04); TTA 3 (0.00); CTG 447 (0.61); CTA 20 (0.03); CTT 45 (0.06); CTC 190 (0.26)
Lys | AAG 284 (0.98); AAA 7 (0.02)
Met | ATG 191 (1.00)
Phe | TTT 89 (0.29); TTC 216 (0.71)
Pro | CCG 161 (0.29); CCA 49 (0.09); CCT 71 (0.13); CCC 267 (0.49)
Ser | AGT 16 (0.03); AGC 123 (0.22); TCG 152 (0.28); TCA 31 (0.06); TCT 55 (0.10); TCC 173 (0.31)
Thr | ACG 184 (0.38); ACA 24 (0.05); ACT 21 (0.05); ACC 249 (0.52)
Trp | TGG 107 (1.00)
Tyr | TAT 10 (0.05); TAC 180 (0.95)
Val | GTG 308 (0.50); GTA 9 (0.01); GTT 35 (0.06); GTC 262 (0.43)
Stop | TGA/TAG/TAA

Table 3: Preferred codon usage in Chlorella protothecoides
TTC (Phe) | TAC (Tyr) | TGC (Cys) | TGA (Stop)
TGG (Trp) | CCC (Pro) | CAC (His) | CGC (Arg)
CTG (Leu) | CAG (Gln) | ATC (Ile) | ACC (Thr)
GAC (Asp) | TCC (Ser) | ATG (Met) | AAG (Lys)
GCC (Ala) | AAC (Asn) | GGC (Gly) | GTG (Val)
GAG (Glu)

Table 3a: Codon usage for Cuphea wrightii
UUU F 0.48 19.5 (52) | UCU S 0.21 19.5 (52) | UAU Y 0.45 6.4 (17) | UGU C 0.41 10.5 (28)
UUC F 0.52 21.3 (57) | UCC S 0.26 23.6 (63) | UAC Y 0.55 7.9 (21) | UGC C 0.59 15.0 (40)
UUA L 0.07 5.2 (14) | UCA S 0.18 16.8 (45) | UAA * 0.33 0.7 (2) | UGA * 0.33 0.7 (2)
UUG L 0.19 14.6 (39) | UCG S 0.11 9.7 (26) | UAG * 0.33 0.7 (2) | UGG W 1.00 15.4 (41)
CUU L 0.27 21.0 (56) | CCU P 0.48 21.7 (58) | CAU H 0.60 11.2 (30) | CGU R 0.09 5.6 (15)
CUC L 0.22 17.2 (46) | CCC P 0.16 7.1 (19) | CAC H 0.40 7.5 (20) | CGC R 0.13 7.9 (21)
CUA L 0.13 10.1 (27) | CCA P 0.21 9.7 (26) | CAA Q 0.31 8.6 (23) | CGA R 0.11 6.7 (18)
CUG L 0.12 9.7 (26) | CCG P 0.16 7.1 (19) | CAG Q 0.69 19.5 (52) | CGG R 0.16 9.4 (25)
AUU I 0.44 22.8 (61) | ACU T 0.33 16.8 (45) | AAU N 0.66 31.4 (84) | AGU S 0.18 16.1 (43)
AUC I 0.29 15.4 (41) | ACC T 0.27 13.9 (37) | AAC N 0.34 16.5 (44) | AGC S 0.07 6.0 (16)
AUA I 0.27 13.9 (37) | ACA T 0.26 13.5 (36) | AAA K 0.42 21.0 (56) | AGA R 0.24 14.2 (38)
AUG M 1.00 28.1 (75) | ACG T 0.14 7.1 (19) | AAG K 0.58 29.2 (78) | AGG R 0.27 16.1 (43)
GUU V 0.28 19.8 (53) | GCU A 0.35 31.4 (84) | GAU D 0.63 35.9 (96) | GGU G 0.29 26.6 (71)
GUC V 0.21 15.0 (40) | GCC A 0.20 18.0 (48) | GAC D 0.37 21.0 (56) | GGC G 0.20 18.0 (48)
GUA V 0.14 10.1 (27) | GCA A 0.33 29.6 (79) | GAA E 0.41 18.3 (49) | GGA G 0.35 31.4 (84)
GUG V 0.36 25.1 (67) | GCG A 0.11 9.7 (26) | GAG E 0.59 26.2 (70) | GGG G 0.16 14.2 (38)

Table 3b: Codon usage for Arabidopsis
UUU F 0.51 21.8 (678320) | UCU S 0.28 25.2 (782818) | UAU Y 0.52 14.6 (455089) | UGU C 0.60 10.5 (327640)
UUC F 0.49 20.7 (642407) | UCC S 0.13 11.2 (348173) | UAC Y 0.48 13.7 (427132) | UGC C 0.40 7.2 (222769)
UUA L 0.14 12.7 (394867) | UCA S 0.20 18.3 (568570) | UAA * 0.36 0.9 (29405) | UGA * 0.44 1.2 (36260)
UUG L 0.22 20.9 (649150) | UCG S 0.10 9.3 (290158) | UAG * 0.20 0.5 (16417) | UGG W 1.00 12.5 (388049)
CUU L 0.26 24.1 (750114) | CCU P 0.38 18.7 (580962) | CAU H 0.61 13.8 (428694) | CGU R 0.17 9.0 (280392)
CUC L 0.17 16.1 (500524) | CCC P 0.11 5.3 (165252) | CAC H 0.39 8.7 (271155) | CGC R 0.07 3.8 (117543)
CUA L 0.11 9.9 (307000) | CCA P 0.33 16.1 (502101) | CAA Q 0.56 19.4 (604800) | CGA R 0.12 6.3 (195736)
CUG L 0.11 9.8 (305822) | CCG P 0.18 8.6 (268115) | CAG Q 0.44 15.2 (473809) | CGG R 0.09 4.9 (151572)
AUU I 0.41 21.5 (668227) | ACU T 0.34 17.5 (544807) | AAU N 0.52 22.3 (693344) | AGU S 0.16 14.0 (435738)
AUC I 0.35 18.5 (576287) | ACC T 0.20 10.3 (321640) | AAC N 0.48 20.9 (650826) | AGC S 0.13 11.3 (352568)
AUA I 0.24 12.6 (391867) | ACA T 0.31 15.7 (487161) | AAA K 0.49 30.8 (957374) | AGA R 0.35 19.0 (589788)
AUG M 1.00 24.5 (762852) | ACG T 0.15 7.7 (240652) | AAG K 0.51 32.7 (1016176) | AGG R 0.20 11.0 (340922)
GUU V 0.40 27.2 (847061) | GCU A 0.43 28.3 (880808) | GAU D 0.68 36.6 (1139637) | GGU G 0.34 22.2 (689891)
GUC V 0.19 12.8 (397008) | GCC A 0.16 10.3 (321500) | GAC D 0.32 17.2 (535668) | GGC G 0.14 9.2 (284681)
GUA V 0.15 9.9 (308605) | GCA A 0.27 17.5 (543180) | GAA E 0.52 34.3 (1068012) | GGA G 0.37 24.2 (751489)
GUG V 0.26 17.4 (539873) | GCG A 0.14 9.0 (280804) | GAG E 0.48 32.2 (1002594) | GGG G 0.16 10.2 (316620)

Host Cells
[0050] The host cell can be a single cell (e.g., microalga, bacteria, yeast) or part of a multicellular organism such as a plant or fungus. Methods for expressing FatB genes in a plant are given in 5,850,022; 5,723,761; 5,639,790; 5,807,893; 5,455,167; 5,654,495; 5,512,482; 5,298,421; 5,667,997; 5,344,771; and 5,304,481, or can be accomplished using other techniques generally known in plant biotechnology. Engineering of oleaginous microbes including those of Chlorophyta is disclosed in WO2010/063032, WO2011/150411, and WO2012/106560 and in the examples below.
[0051] Examples of oleaginous host cells include plant cells and microbial cells having a type II fatty acid biosynthetic pathway, including plastidic oleaginous cells such as those of oleaginous algae. Specific examples of microalgal cells include heterotrophic or obligate heterotrophic microalgae of the phylum Chlorophyta, the class Trebouxiophyceae, the order Chlorellales, or the family Chlorellaceae. Examples of oleaginous microalgae are provided in Published PCT Patent Applications WO2008/151149, WO2010/06032, WO2011/150410, and WO2011/150411, including species of Chlorella and Prototheca, a genus comprising obligate heterotrophs. The oleaginous cells can be, for example, capable of producing 25, 30, 40, 50, 60, 70, 80, 85, or about 90% oil by cell weight, ±5%. Optionally, the oils produced can be low in DHA or EPA fatty acids. For example, the oils can comprise less than 5%, 2%, or 1% DHA and/or EPA. The above-mentioned publications also disclose methods for cultivating such cells and extracting oil, especially from microalgal cells; such methods are applicable to the cells disclosed herein and are incorporated by reference for these teachings. When microalgal cells are used, they can be cultivated autotrophically (unless an obligate heterotroph) or in the dark using a sugar (e.g., glucose, fructose and/or sucrose). In any of the embodiments described herein, the cells can be heterotrophic cells comprising an exogenous invertase gene so as to allow the cells to produce oil from a sucrose feedstock. Alternately, or in addition, the cells can metabolize xylose from cellulosic feedstocks. For example, the cells can be genetically engineered to express one or more xylose metabolism genes such as those encoding an active xylose transporter, a xylulose-5-phosphate transporter, a xylose isomerase, a xylulokinase, a xylitol dehydrogenase and a xylose reductase. See WO2012/154626, "GENETICALLY ENGINEERED MICROORGANISMS THAT METABOLIZE XYLOSE", published Nov 15, 2012.
Oils and Related Products
[0052] The oleaginous cells express one or more exogenous genes encoding fatty acid biosynthesis enzymes. As a result, some embodiments feature natural oils that were not obtainable from a non-plant or non-seed oil, or not obtainable at all.
[0053] The oleaginous cells produce a storage oil, which is primarily triacylglyceride and may be stored in storage bodies of the cell. A raw oil may be obtained from the cells by disrupting the cells and isolating the oil.
WO2008/151149, WO2010/06032, WO2011/150410, and WO2011/1504 disclose heterotrophic cultivation and oil isolation techniques. For example, oil may be obtained by cultivating, drying and pressing the cells. The oils produced may be refined, bleached and deodorized (RBD) as known in the art or as described in WO2010/120939. The raw or RBD oils may be used in a variety of food, chemical, and industrial products or processes. After recovery of the oil, a valuable residual biomass remains.
Uses for the residual biomass include the production of paper, plastics, absorbents, adsorbents, as animal feed, for human nutrition, or for fertilizer.
[0054] Where a fatty acid profile of a triglyceride (also referred to as a "triacylglyceride" or "TAG") cell oil is given here, it will be understood that this refers to a nonfractionated sample of the storage oil extracted from the cell analyzed under conditions in which phospholipids have been removed or with an analysis method that is substantially insensitive to the fatty acids of the phospholipids (e.g.
using chromatography and mass spectrometry). The oil may be subjected to an RBD
process to remove phospholipids, free fatty acids and odors yet have only minor or negligible changes to the fatty acid profile of the triglycerides in the oil.
Because the cells are oleaginous, in some cases the storage oil will constitute the bulk of all the TAGs in the cell.
[0055] The stable carbon isotope value δ13C is an expression of the ratio of 13C/12C relative to a standard (e.g., PDB, carbonate of fossil skeleton of Belemnite americana from the Peedee formation of South Carolina). The stable carbon isotope value δ13C (‰) of the oils can be related to the δ13C value of the feedstock used. In some embodiments, the oils are derived from oleaginous organisms heterotrophically grown on sugar derived from a C4 plant such as corn or sugarcane. In some embodiments the δ13C (‰) of the oil is from -10 to -17 ‰ or from -13 to -16 ‰.
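As a worked illustration of the isotope value described above, the sketch below uses the conventional delta notation, in which the sample's 13C/12C ratio is divided by the standard's ratio, reduced by one, and multiplied by 1000 to give per mil. The reference ratio used for PDB is a commonly cited figure and is an assumption here, as are the function names and the sample ratio, which is constructed for the example rather than measured.

```python
R_PDB = 0.0112372  # assumed 13C/12C ratio of the PDB standard (commonly cited value)


def delta13c(r_sample, r_standard=R_PDB):
    """Delta notation in per mil: (R_sample / R_standard - 1) * 1000."""
    return (r_sample / r_standard - 1.0) * 1000.0


def within(delta, most_negative, least_negative):
    """True when the delta value lies in the closed range given in per mil."""
    return most_negative <= delta <= least_negative


# Hypothetical oil whose 13C/12C ratio is 1.4% below the standard.
sample_ratio = R_PDB * (1.0 - 0.014)
d = delta13c(sample_ratio)
print(round(d, 2), within(d, -17.0, -10.0), within(d, -16.0, -13.0))
```

A value near -14 per mil falls inside both ranges recited above, which is the region expected for carbon derived from C4 plant sugars.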
[0056] The oils produced according to the above methods in some cases are made using a microalgal host cell. As described above, the microalga can, without limitation, fall within the classification of Chlorophyta, Trebouxiophyceae, Chlorellales, Chlorellaceae, or Chlorophyceae. It has been found that oils from microalgae of Trebouxiophyceae can be distinguished from vegetable oils based on their sterol profiles. Oil produced by Chlorella protothecoides was found to contain sterols that appeared to be brassicasterol, ergosterol, campesterol, stigmasterol, and β-sitosterol, when detected by GC-MS. However, it is believed that all sterols produced by Chlorella have C24β stereochemistry. Thus, it is believed that the molecules detected as campesterol, stigmasterol, and β-sitosterol are actually 22,23-dihydrobrassicasterol, poriferasterol and clionasterol, respectively. Thus, the oils produced by the microalgae described above can be distinguished from plant oils by the presence of sterols with C24β stereochemistry and the absence of C24α stereochemistry in the sterols present. For example, the oils produced may contain 22,23-dihydrobrassicasterol while lacking campesterol; contain clionasterol while lacking β-sitosterol; and/or contain poriferasterol while lacking stigmasterol. Alternately, or in addition, the oils may contain significant amounts of Δ7-poriferasterol.
[0057] In one embodiment, the oils provided herein are not vegetable oils.
Vegetable oils are oils extracted from plants and plant seeds. Vegetable oils can be distinguished from the non-plant oils provided herein on the basis of their oil content.
A variety of methods for analyzing the oil content can be employed to determine the source of the oil or whether adulteration of an oil provided herein with an oil of a different (e.g., plant) origin has occurred. The determination can be made on the basis of one or a combination of the analytical methods. These tests include but are not limited to analysis of one or more of free fatty acids, fatty acid profile, total triacylglycerol content, diacylglycerol content, peroxide values, spectroscopic properties (e.g., UV absorption), sterol profile, sterol degradation products, antioxidants (e.g., tocopherols), pigments (e.g., chlorophyll), δ13C values and sensory analysis (e.g., taste, odor, and mouth feel). Many such tests have been standardized for commercial oils such as the Codex Alimentarius standards for edible fats and oils.
[0058] Sterol profile analysis is a particularly well-known method for determining the biological source of organic matter. Campesterol, β-sitosterol, and stigmasterol are common plant sterols, with β-sitosterol being a principal plant sterol. For example, β-sitosterol was found to be in greatest abundance in an analysis of certain seed oils: approximately 64% in corn, 29% in rapeseed, 64% in sunflower, 74% in cottonseed, 26% in soybean, and 79% in olive oil (Gul et al. J. Cell and Molecular Biology 5:71-79, 2006).
[0059] Oils isolated from Prototheca moriformis strain UTEX1435 were separately clarified (CL), refined and bleached (RB), or refined, bleached and deodorized (RBD) and were tested for sterol content according to the procedure described in JAOCS vol. 60, no. 8, August 1983. Results of the analysis are shown below (units in mg/100g):
No. | Sterol | Crude | Clarified | Refined & bleached | Refined, bleached, & deodorized
1 | Ergosterol | (56%) | (55%) | (50%) | (50%)
2 | 5,22-cholestadien-24-methyl-3-ol (Brassicasterol) | 14.6 (2.1%) | 18.8 (2.6%) | 14 (2.4%) | 15.2 (2.5%)
3 | 24-methylcholest-5-en-3-ol (Campesterol or 22,23-dihydrobrassicasterol) | 10.7 (1.6%) | 11.9 (1.6%) | 10.9 (1.8%) | 10.8 (1.8%)
4 | 5,22-cholestadien-24-ethyl-3-ol (Stigmasterol or poriferasterol) | 57.7 (8.4%) | 59.2 (8.2%) | 46.8 (7.9%) | 49.9 (8.3%)
5 | 24-ethylcholest-5-en-3-ol (β-Sitosterol or clionasterol) | 9.64 (1.4%) | 9.92 (1.4%) | 9.26 (1.6%) | 10.2 (1.7%)
6 | Other sterols | 209 | 221 | 216 | 213
 | Total sterols | 685.64 | 718.82 | 589.96 | 601.1
[0060] These results show three striking features. First, ergosterol was found to be the most abundant of all the sterols, accounting for about 50% or more of the total sterols. The amount of ergosterol is greater than that of campesterol, β-sitosterol, and stigmasterol combined. Ergosterol is a steroid commonly found in fungi and not commonly found in plants, and its presence, particularly in significant amounts, serves as a useful marker for non-plant oils. Secondly, the oil was found to contain brassicasterol. With the exception of rapeseed oil, brassicasterol is not commonly found in plant-based oils. Thirdly, less than 2% β-sitosterol was found to be present. β-sitosterol is a prominent plant sterol not commonly found in microalgae, and its presence, particularly in significant amounts, serves as a useful marker for oils of plant origin. In summary, Prototheca moriformis strain UTEX1435 has been found to contain both significant amounts of ergosterol and only trace amounts of β-sitosterol as a percentage of total sterol content. Accordingly, the ratio of ergosterol to β-sitosterol, alone or in combination with the presence of brassicasterol, can be used to distinguish this oil from plant oils.
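The sterol figures above lend themselves to a short arithmetic check. In the table as reproduced here only the percentages survive for ergosterol, so this sketch infers the ergosterol amounts as the total minus all other rows; that inference, and the variable names, are assumptions made for illustration. The remaining numbers are copied from the table (mg/100g).

```python
SAMPLES = ("crude", "clarified", "refined_bleached", "rbd")

# mg/100g for every row except ergosterol, in the column order of SAMPLES.
OTHER_ROWS_MG = {
    "brassicasterol": (14.6, 18.8, 14.0, 15.2),
    "campesterol_or_22,23-dihydrobrassicasterol": (10.7, 11.9, 10.9, 10.8),
    "stigmasterol_or_poriferasterol": (57.7, 59.2, 46.8, 49.9),
    "beta-sitosterol_or_clionasterol": (9.64, 9.92, 9.26, 10.2),
    "other_sterols": (209.0, 221.0, 216.0, 213.0),
}
TOTAL_MG = (685.64, 718.82, 589.96, 601.1)

# Ergosterol inferred by difference (assumption: the rows sum to the stated total).
ergosterol_mg = [
    total - sum(row[i] for row in OTHER_ROWS_MG.values())
    for i, total in enumerate(TOTAL_MG)
]
ergosterol_pct = [100.0 * e / t for e, t in zip(ergosterol_mg, TOTAL_MG)]
ergosterol_to_sitosterol = [
    e / s for e, s in zip(ergosterol_mg, OTHER_ROWS_MG["beta-sitosterol_or_clionasterol"])
]

for name, mg, pct, ratio in zip(SAMPLES, ergosterol_mg, ergosterol_pct, ergosterol_to_sitosterol):
    print(f"{name}: ergosterol ~{mg:.0f} mg/100g ({pct:.0f}%), ergosterol:24-ethylcholest-5-en-3-ol ~{ratio:.0f}:1")
```

The inferred percentages round to the 56%, 55%, 50%, and 50% shown in the ergosterol row, which supports the by-difference reading, and the ergosterol amount exceeds the 24-ethylcholest-5-en-3-ol amount by well over an order of magnitude in every sample.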
[0061] In some embodiments, the oil content of an oil provided herein contains, as a percentage of total sterols, less than 20%, 15%, 10%, 5%, 4%, 3%, 2%, or 1% β-sitosterol. In other embodiments the oil is free from β-sitosterol.
[0062] In some embodiments, the oil is free from one or more of β-sitosterol, campesterol, or stigmasterol. In some embodiments the oil is free from β-sitosterol, campesterol, and stigmasterol. In some embodiments the oil is free from campesterol. In some embodiments the oil is free from stigmasterol.
[0063] In some embodiments, the oil content of an oil provided herein comprises, as a percentage of total sterols, less than 20%, 15%, 10%, 5%, 4%, 3%, 2%, or 1% 24-ethylcholest-5-en-3-ol. In some embodiments, the 24-ethylcholest-5-en-3-ol is clionasterol. In some embodiments, the oil content of an oil provided herein comprises, as a percentage of total sterols, at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, or 10% clionasterol.
[0064] In some embodiments, the oil content of an oil provided herein contains, as a percentage of total sterols, less than 20%, 15%, 10%, 5%, 4%, 3%, 2%, or 1% 24-methylcholest-5-en-3-ol. In some embodiments, the 24-methylcholest-5-en-3-ol is 22,23-dihydrobrassicasterol. In some embodiments, the oil content of an oil provided herein comprises, as a percentage of total sterols, at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, or 10% 22,23-dihydrobrassicasterol.
[0065] In some embodiments, the oil content of an oil provided herein contains, as a percentage of total sterols, less than 20%, 15%, 10%, 5%, 4%, 3%, 2%, or 1% 5,22-cholestadien-24-ethyl-3-ol. In some embodiments, the 5,22-cholestadien-24-ethyl-3-ol is poriferasterol. In some embodiments, the oil content of an oil provided herein comprises, as a percentage of total sterols, at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, or 10% poriferasterol.
[0066] In some embodiments, the oil content of an oil provided herein contains ergosterol or brassicasterol or a combination of the two. In some embodiments, the oil content contains, as a percentage of total sterols, at least 5%, 10%, 20%, 25%, 35%, 40%, 45%, 50%, 55%, 60%, or 65% ergosterol. In some embodiments, the oil content contains, as a percentage of total sterols, at least 25% ergosterol.
In some embodiments, the oil content contains, as a percentage of total sterols, at least 40%
ergosterol. In some embodiments, the oil content contains, as a percentage of total sterols, at least 5%, 10%, 20%, 25%, 35%, 40%, 45%, 50%, 55%, 60%, or 65% of a combination of ergosterol and brassicasterol.
[0067] In some embodiments, the oil content contains, as a percentage of total sterols, at least 1%, 2%, 3%, 4% or 5% brassicasterol. In some embodiments, the oil content contains, as a percentage of total sterols less than 10%, 9%, 8%, 7%, 6%, or 5%
brassicasterol.
[0068] In some embodiments the ratio of ergosterol to brassicasterol is at least 5:1, 10:1, 15:1, or 20:1.
[0069] In some embodiments, the oil content contains, as a percentage of total sterols, at least 5%, 10%, 20%, 25%, 35%, 40%, 45%, 50%, 55%, 60%, or 65% ergosterol and less than 20%, 15%, 10%, 5%, 4%, 3%, 2%, or 1% β-sitosterol. In some embodiments, the oil content contains, as a percentage of total sterols, at least 25% ergosterol and less than 5% β-sitosterol. In some embodiments, the oil content further comprises brassicasterol.
[0070] Sterols contain from 27 to 29 carbon atoms (C27 to C29) and are found in all eukaryotes. Animals exclusively make C27 sterols as they lack the ability to further modify the C27 sterols to produce C28 and C29 sterols. Plants, however, are able to synthesize C28 and C29 sterols, and C28/C29 plant sterols are often referred to as phytosterols. The sterol profile of a given plant is high in C29 sterols, and the primary sterols in plants are typically the C29 sterols β-sitosterol and stigmasterol. In contrast, the sterol profile of non-plant organisms contains greater percentages of C27 and C28 sterols. For example, the sterols in fungi and in many microalgae are principally C28 sterols. The sterol profile, and particularly the striking predominance of C29 sterols over C28 sterols in plants, has been exploited for determining the proportion of plant and marine matter in soil samples (Huang, Wen-Yen, Meinschein W. G., "Sterols as ecological indicators"; Geochimica et Cosmochimica Acta, Vol. 43, pp. 739-745).
[0071] In some embodiments the primary sterols in the microalgal oils provided herein are sterols other than β-sitosterol and stigmasterol. In some embodiments of the microalgal oils, C29 sterols make up less than 50%, 40%, 30%, 20%, 10%, or 5% by weight of the total sterol content.
[0072] In some embodiments the microalgal oils provided herein contain C28 sterols in excess of C29 sterols. In some embodiments of the microalgal oils, C28 sterols make up greater than 50%, 60%, 70%, 80%, 90%, or 95% by weight of the total sterol content. In some embodiments the C28 sterol is ergosterol. In some embodiments the C28 sterol is brassicasterol.
[0073] In embodiments of the present invention, oleaginous cells expressing one or more of the genes of Table 1 can produce an oil with at least 20, 40, 60 or 70%
of C8, C10, C12, C14 or C16 fatty acids. In a specific embodiment, the level of myristate (C14:0) in the oil is greater than 30%.
[0074] Thus, in embodiments of the invention, there is a process for producing an oil, triglyceride, fatty acid, or derivative of any of these, comprising transforming a cell with any of the nucleic acids discussed herein. In another embodiment, the transformed cell is cultivated to produce an oil and, optionally, the oil is extracted.
Oil extracted in this way can be used to produce food, oleochemicals or other products.
[0075] The oils discussed above, alone or in combination, are useful in the production of foods, fuels and chemicals (including plastics, foams, films, etc.). The oils, triglycerides, or fatty acids from the oils may be subjected to C-H activation, hydroamino methylation, methoxy-carbonation, ozonolysis, enzymatic transformations, epoxidation, methylation, dimerization, thiolation, metathesis, hydro-alkylation, lactonization, or other chemical processes.
[0076] After extracting the oil, a residual biomass may be left, which may have use as a fuel, as an animal feed, or as an ingredient in paper, plastic, or other product. For example, residual biomass from heterotrophic algae can be used in such products.
Example 1. Discovery of Novel FATB sequences
[0077] Sequences of novel plant acyl-ACP thioesterases involved in seed-specific mid-chain (C8-C16) fatty acid biosynthesis in higher plants were isolated. Seed-specific lipid production genes were isolated through direct interrogation of RNA pools accumulating in oilseeds. Based on phylogenetic analysis, the novel enzymes can be classified as members of the FatB family of acyl-ACP thioesterases.
[0078] Seeds of oleaginous plants were obtained from local grocery stores or requested through the USDA ARS National Plant Germplasm System (NPGS) from the North Central Regional Plant Introduction Station (NCRIS) or the USDA ARS North Central Soil Conservation Research Laboratory (Morris, MN). Dry seeds were homogenized in liquid nitrogen to a powder, resuspended in cold extraction buffer containing 6-8 M urea and 3 M LiCl, and left on ice for a few hours to overnight at 4 °C. The seed homogenate was passed through NucleoSpin Filters (Macherey-Nagel) by centrifugation at 20,000 × g for 20 minutes in a refrigerated microcentrifuge (4 °C). The resulting RNA pellets were resuspended in buffer (20 mM Tris HCl, pH 7.5, 0.5% SDS, 100 mM NaCl, 25 mM EDTA, 2% PVPP), and RNA was subsequently extracted once with phenol-chloroform-isoamyl alcohol (25:24:1, v/v) and once with chloroform. RNA was finally precipitated with isopropyl alcohol (0.7 vol.) in the presence of 150 mM sodium acetate, pH 5.2, washed with 80% ethanol by centrifugation, and dried. RNA samples were treated with Turbo DNAse (Lifetech) and purified further using RNeasy kits (Qiagen) following the manufacturers' protocols.

The resulting purified RNA samples were converted to paired-end cDNA libraries and subjected to next-generation sequencing (2x100bp) using the Illumina Hiseq 2000 platform. RNA sequence reads were assembled into corresponding seed transcriptomes using Trinity or Oases packages. Putative thioesterase-containing cDNA contigs were identified by mining transcriptomes for sequences with homology to known thioesterases. These in silico identified putative thioesterase cDNAs were further verified by direct reverse transcription PCR analysis using seed RNA and primer pairs targeting full-length thioesterase cDNAs. The resulting amplified products were cloned and sequenced de novo to confirm authenticity of the identified thioesterase genes.
[0079] To interrogate the evolutionary and functional relationships between the novel acyl-ACP thioesterases and the members of the two existing thioesterase classes (FatA and FatB), we performed a phylogenetic analysis using published full-length (Mayer and Shanklin, 2007) and truncated (THYME database) amino acid thioesterase sequences. The novel proteins appear to group with known acyl-ACP FatB thioesterases involved in biosynthesis of C8-C16 fatty acids. Moreover, the novel thioesterases appear to cluster into 3 predominant out-groups, suggesting distinct functional similarity and evolutionary relatedness among members of each cluster.
[0080] The amino acid sequences of the FatB genes are shown in Table 4.
Table 4: Amino acid sequences of FatB genes:
CuPSR23 FATB3 | SEQ ID NO: 110
CuPSR23 FATB3b | SEQ ID NO: 111
CwFATB3 | SEQ ID NO: 112
CwFATB3a | SEQ ID NO: 113
CwFATB3b | SEQ ID NO: 114
CwFATB3c | SEQ ID NO: 115
CwFATB4a | SEQ ID NO: 116
CwFATB4a.1 | SEQ ID NO: 117
CwFATB4a.2 | SEQ ID NO: 118
CwFATB4a.3 | SEQ ID NO: 119
CwFATB4b | SEQ ID NO: 120
CwFATB4b.1 | SEQ ID NO: 121
CwFATB5 | SEQ ID NO: 122
CwFATB5a | SEQ ID NO: 123
CwFATB5b | SEQ ID NO: 124
CwFATB5c | SEQ ID NO: 125
CwFATB5.1 | SEQ ID NO: 126
CwFATB5.1a | SEQ ID NO: 127
CcFATB2b | SEQ ID NO: 128
CcFATB3 | SEQ ID NO: 129
CcFATB3b | SEQ ID NO: 130
CcFATB3c | SEQ ID NO: 131
ChtFATB1a | SEQ ID NO: 132
ChtFATB1a.1 | SEQ ID NO: 133
ChtFATB1a.2 | SEQ ID NO: 134
ChtFATB1a.3 | SEQ ID NO: 135
ChtFATB1a.4 | SEQ ID NO: 136
ChtFATB1b | SEQ ID NO: 137
ChtFATB2b | SEQ ID NO: 138
ChtFATB2a | SEQ ID NO: 139
ChtFATB2c | SEQ ID NO: 140
ChtFATB2d | SEQ ID NO: 141
ChtFATB2e | SEQ ID NO: 142
ChtFATB2f | SEQ ID NO: 143
ChtFATB2g | SEQ ID NO: 144
ChtFATB2h | SEQ ID NO: 145
ChtFATB3a | SEQ ID NO: 146
ChtFATB3b | SEQ ID NO: 147
ChtFATB3c | SEQ ID NO: 148
ChtFATB3d | SEQ ID NO: 149
ChtFATB3e | SEQ ID NO: 150
ChtFATB3f | SEQ ID NO: 151
ChtFATB3g | SEQ ID NO: 152
ChsFATB1 | SEQ ID NO: 153
ChsFATB2 | SEQ ID NO: 154
ChsFatB2b | SEQ ID NO: 155
ChsFatB2c | SEQ ID NO: 156
ChsFatB2d | SEQ ID NO: 157
ChsFATB3 | SEQ ID NO: 158
ChsFatb3b | SEQ ID NO: 159
ChsFatB3c | SEQ ID NO: 160
ChsFATB3d | SEQ ID NO: 161
ChsFATB3e | SEQ ID NO: 162
ChsFATB3f | SEQ ID NO: 163
ChsFATB3g | SEQ ID NO: 164
ChsFATB3h | SEQ ID NO: 165
ChsFATB3i | SEQ ID NO: 166
ChsFATB3j | SEQ ID NO: 167
ChsFATB3j:
MVAAEASSALFSVRTPGTSPKPGKFGNWPTSLSVPFKSKSNHNGGFQV
KANASARPKANGSAVSLKSGSLDTQEDTSSSSSPPRTFINQLPDWSMLLSAITT
VFVAAEKQWTMLDRKSKRPDMLMDPFGVDRVVQDGAVFRQSFSIRSYEIGA
DRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCKRDLIWVVTKMHIEV
NRYPTWGDTIEVNTWVSESGKTGMGRDWLISDFHTGDILIRATSVCAMMNQ
KTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRW
NDLDVNQHVNNVKYIGWILESVPTEVFETQELCGLTLEYRQECGRDSVLESV
TAMDPSKEGDRSLYQHLLRLEDGTDIAKGRTKWRPKNAGKTSNGNSIS

Example 2. Cloning and fatty acid analysis of cells transformed with novel FATB genes
[0081] In the example below, we detail the effect of expressing plant oilseed transcriptome-derived, heterologous thioesterases in the UTEX1435 (web.biosci.utexas.edu/utex/) strain, Strain A.
[0082] As in Example 1, RNA was extracted from dried plant seeds and submitted for paired-end sequencing using the Illumina Hiseq 2000 platform.
RNA sequence reads were assembled into corresponding seed transcriptomes using Trinity or Oases packages and putative thioesterase-containing cDNA contigs were identified by mining transcriptomes for sequences with homology to known thioesterases.
These in silico identified putative thioesterase cDNAs were verified by direct reverse transcription PCR analysis using seed RNA and primer pairs targeting full-length thioesterase cDNAs. The resulting amplified products were cloned and sequenced de novo to confirm authenticity of identified thioesterase genes and to identify sequence variants arising from expression of different gene alleles or diversity of sequences within a population of seeds. The resulting amino acid sequences were subjected to phylogenetic analysis using published full-length (Mayer and Shanklin, 2007) and truncated (THYME database) FatB sequences. The thioesterases that clustered with acyl-ACP FatB thioesterases, which are involved in biosynthesis of C8-C16 fatty acids, were pursued.
Construction of Transforming Vectors Expressing Acyl-ACP FatB Thioesterases
[0083] 27 putative acyl-ACP FatB thioesterases from the species Cinnamomum camphora, Cuphea hyssopifolia, Cuphea PSR23, Cuphea wrightii, Cuphea heterophylla, and Cuphea viscosissima were synthesized in a codon-optimized form to reflect Prototheca moriformis (UTEX 1435) codon usage. Of the 27 genes synthesized, 24 were identified by our transcriptome sequencing efforts and the 3 genes from Cuphea viscosissima were taken from published sequences in GenBank.
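The codon-optimization step described above can be illustrated with a minimal Python sketch. It assumes a single preferred codon per amino acid, with the codons read off the codon-optimized sequence SEQ ID NO: 6; this table and the helper name back_translate are illustrative assumptions, and the optimization actually used for the constructs may have followed a fuller codon-frequency table.

```python
# Preferred codon per amino acid, as observed in the codon-optimized
# sequence SEQ ID NO: 6 (assumption: one codon per residue).
PREFERRED_CODON = {
    "A": "GCC", "C": "TGC", "D": "GAC", "E": "GAG", "F": "TTC",
    "G": "GGC", "H": "CAC", "I": "ATC", "K": "AAG", "L": "CTG",
    "M": "ATG", "N": "AAC", "P": "CCC", "Q": "CAG", "R": "CGC",
    "S": "TCC", "T": "ACC", "V": "GTG", "W": "TGG", "Y": "TAC",
    "*": "TGA",  # stop codon used at the end of SEQ ID NO: 6
}


def back_translate(protein: str) -> str:
    """Return a coding DNA sequence using one preferred codon per residue."""
    residues = "".join(protein.split()).upper()  # drop whitespace, normalize case
    try:
        return "".join(PREFERRED_CODON[aa] for aa in residues)
    except KeyError as exc:
        raise ValueError(f"unknown residue: {exc.args[0]!r}") from None


if __name__ == "__main__":
    # First 15 residues of SEQ ID NO: 4 (CcFATB4)
    print(back_translate("MVTTSLASAYFSMKA"))
```

Applied to the first fifteen residues of SEQ ID NO: 4, the sketch reproduces the first 45 nucleotides of SEQ ID NO: 6.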
[0084] Transgenic strains were generated via transformation of the base strain Strain A (Prototheca moriformis, derived from UTEX 1435 by classical mutation and screening for high oil production) with a construct encoding 1 of the 27 FatB thioesterases. The construct pSZ2760 encoding Cinnamomum camphora (Cc) FATB1b is shown as an example, but identical methods were used to generate each of the remaining 26 constructs encoding the different respective thioesterases.
Construct pSZ2760 can be written as 6S::CrTUB2:ScSUC2:CvNR::PmAMT3:CcFATB1b:CvNR::6S. The sequence of the transforming DNA is provided in Table 5 (pSZ2760). The relevant restriction sites in the construct, from 5' to 3', are BspQI, KpnI, AscI, MfeI, EcoRI, SpeI, XhoI, SacI, and BspQI; they are indicated in lowercase, bold, and underlined. BspQI sites delimit the 5' and 3' ends of the transforming DNA. Bold, lowercase sequences at the 5' and 3' ends of the construct represent genomic DNA from UTEX 1435 that targets integration to the 6S locus via homologous recombination. Proceeding in the 5' to 3' direction, the selection cassette has the C. reinhardtii β-tubulin promoter driving expression of the S. cerevisiae gene SUC2 (conferring the ability to grow on sucrose) and the Chlorella vulgaris Nitrate Reductase (NR) gene 3' UTR. The promoter is indicated by lowercase, boxed text. The initiator ATG and terminator TGA for ScSUC2 are indicated by bold, uppercase italics, while the coding region is indicated with lowercase italics. The 3' UTR is indicated by lowercase underlined text. The spacer region between the two cassettes is indicated by uppercase text. The second cassette, containing the codon-optimized CcFATB1b gene (Table 5; pSZ2760) from Cinnamomum camphora, is driven by the Prototheca moriformis endogenous AMT3 promoter and has the Chlorella vulgaris Nitrate Reductase (NR) gene 3' UTR. In this cassette, the AMT3 promoter is indicated by lowercase, boxed text. The initiator ATG and terminator TGA for the CcFATB1b gene are indicated in bold, uppercase italics, while the coding region is indicated by lowercase italics and the spacer region is indicated by uppercase text. The 3' UTR is indicated by lowercase underlined text. The final construct was sequenced to ensure correct reading frame and targeting sequences.
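The shorthand used for pSZ2760 can be read mechanically, as in the following Python sketch. It assumes that "::" separates the flanking homology regions and the cassettes, and that ":" separates promoter, gene and 3' UTR within a cassette; the Cassette type and the parse_construct helper are hypothetical names introduced only for illustration.

```python
from typing import List, NamedTuple, Tuple


class Cassette(NamedTuple):
    promoter: str
    gene: str
    utr3: str  # 3' UTR element


def parse_construct(notation: str) -> Tuple[str, List[Cassette], str]:
    """Split a construct string into 5' flank, cassettes and 3' flank."""
    parts = notation.split("::")
    if len(parts) < 3:
        raise ValueError("expected 5' flank, at least one cassette, 3' flank")
    five_flank, *cassette_strings, three_flank = parts
    cassettes = []
    for text in cassette_strings:
        fields = text.split(":")
        if len(fields) != 3:
            raise ValueError(f"cassette {text!r} is not promoter:gene:3'UTR")
        cassettes.append(Cassette(*fields))
    return five_flank, cassettes, three_flank


if __name__ == "__main__":
    flank5, cassettes, flank3 = parse_construct(
        "6S::CrTUB2:ScSUC2:CvNR::PmAMT3:CcFATB1b:CvNR::6S"
    )
    print(flank5, flank3)
    for cassette in cassettes:
        print(cassette)
```

Read this way, pSZ2760 consists of two cassettes (the SUC2 selection cassette and the CcFATB1b expression cassette) between 6S targeting sequences.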
Table 5: pSZ2760 Transforming construct gctcttcgccgccgccactcctgctcgagcgcgcccgcgcgtgcgccgccagcgccttggccttttcgccgcgctcgtg c gcgtcgctgatgtccatcaccaggtccatgaggtctgccttgcgccggctgagccactgcttcgtccgggcggccaaga g gagcatgagggaggactcctggtccagggtcctgacgtggtcgcggctctgggagcgggccagcatcatctggctctgc cgcaccgaggccgcctccaactggtcctccagcagccgcagtcgccgccgaccctggcagaggaagacaggtgaggg gggtatgaattgtacagaacaaccacgagccttgtctaggcagaatccctaccagtcatggctttacctggatgacggc ctg cgaacagctgtccagcgaccctcgctgccgccgcttctcccgcacgcttctttccagcaccgtgatggcgcgagccagc g ccgcacgctggcgctgcgcttcgccgatctgaggacagtcggggaactctgatcagtctaaacccccttgcgcgttagt gtt gccatcctttgcagaccggtgagagccgacttgttgtgcgccaccccccacaccacctcctcccagaccaattctgtca cct ttttggcgaaggcatcggcctcggcctgcagagaggacagcagtgcccagccgctgggggttggcggatgcacgctca ggtacc ctttcttgcgctatgacacttccagcaaaaggtagggcgggctgcgagacggcttcccggcgctgcatgcaaca ccgatgatgettcgaccccccgaagctccttcggggctgcatgggcgctccgatgccgctccagggcgagcgctgttta a atagccaggcccccgattgcaaagacattatagcgagctaccaaagccatattcaaacacctagatcactaccacttct aca caggccactcgagcttgtgatcgcactccgctaagggggcgcctcttcctcttcgtttcagtcacaacccgcaaac ggcgc gccA TGctgctgcaggccttcctgttcctgctggccggcttcgccgccaagatcagcgcctccatgacgaacgagac gtccgaccgccccctggtgcacttcacccccaacaagggctggatgaacgaccccaacggcctgtggtacgacgag aaggacgccaagtggcacctgtacttccagtacaacccgaacgacaccgtctgggggacgcccttgttctggggcca cgccacgtccgacgacctgaccaactgggaggaccagcccatcgccatcgccccgaagcgcaacgactccggcgc cttctccggctccatggtggtggactacaacaacacctccggcttcttcaacgacaccatcgacccgcgccagcgctgc gtggccatctggacctacaacaccccggagtccgaggagcagtacatctcctacagcctggacggcggctacaccttc accgagtaccagaagaaccccgtgctggccgccaactccacccagttccgcgacccgaaggtcttctggtacgagcc ctcccagaagtggatcatgaccgcggccaagtcccaggactacaagatcgagatctactcctccgacgacctgaagt cctggaagctggagtccgcgttcgccaacgagggcttcctcggctaccagtacgagtgccccggcctgatcgaggtcc ccaccgagcaggaccccagcaagtcctactgggtgatgttcatctccatcaaccccggcgccccggccggcggctcct tcaaccagtacttcgtcggcagcttcaacggcacccacttcgaggccttcgacaaccagtcccgcgtggtggacttcgg 
caaggactactacgccctgcagaccttcttcaacaccgacccgacctacgggagcgccctgggcatcgcgtgggcctc caactgggagtactccgccttcgtgcccaccaacccctggcgctcctccatgtccctcgtgcgcaagttctccctcaac a ccgagtaccaggccaacccggagacggagctgatcaacctgaaggccgagccgatcctgaacatcagcaacgccg gcccctggagccggttcgccaccaacaccacgttgacgaaggccaacagctacaacgtcgacctgtccaacagcac cggcaccctggagttcgagctggtgtacgccgtcaacaccacccagacgatctccaagtccgtgttcgcggacctctcc ctctggttcaagggcctggaggaccccgaggagtacctccgcatgggcttcgaggtgtccgcgtcctccttcttcctgg a ccgcgggaacagcaaggtgaagttcgtgaaggagaacccctacttcaccaaccgcatgagcgtgaacaaccagcc cttcaagagcgagaacgacctgtcctactacaaggtgtacggcttgctggaccagaacatcctggagctgtacttcaac gacggcgacgtcgtgtccaccaacacctacttcatgaccaccgggaacgccctgggctccgtgaacatgacgacggg ggtggacaacctgttctacatcgacaagttccaggtgcgcgaggtcaagTGA
caattggcagcagcagctcggatag tatcgacacactctggacgctggtcgtgtgatggactgttgccgccacacttgctgccttgacctgtgaatatccctgc cgctt ttatcaaacagcctcagtgtgtttgatcttgtgtgtacgcgcttttgcgagttgctagctgcttgtgctatttgcgaat accacccc cagcatccccttccctcgtttcatatcgcttgcatcccaaccgcaacttatctacgctgtcctgctatccctcagcgct gctcct gctectgctcactgcccctcgcacagccttggifigggctccgcctgtattctcctggtactgcaacctgtaaaccagc actg caatgctgatgcacgggaagtagtgggatgggaacacaaatggaAAGCTGTATAGGGATAAgaattcgg ccgacaggacgcgcgtcaaaggtgctggtcgtgtatgccctggccggcaggtcgttgctgctgctggttagtgattccg ca accctgattttggcgtcttattttggcgtggcaaacgctggcgcccgcgagccgggccggcggcgatgcggtgccccac g gctgccggaatccaagggaggcaagagcgcccgggtcagttgaagggctttacgcgcaaggtacagccgctectgcaa ggctgcgtggtggaattggacgtgcaggtcctgctgaagttcctccaccgcctcaccagcggacaaagcaccggtgtat c aggtccgtgtcatccactctaaagaactcgactacgacctactgatggccctagattcttcatcaaaaacgcctgagac actt gcccaggattgaaactccctgaagggaccaccaggggccctgagttgttccttccccccgtggcgagctgccagccagg ctgtacctgtgatcgaggctggcgggaaaataggcttcgtgtgctcaggtcatgggaggtgcaggacagctcatgaaac g ccaacaatcgcacaattcatgtcaagctaatcagctatttcctettcacgagctgtaattgteccaaaattctggtcta ccgggg gtgatccttcgtgtacgggcccttccctcaaccctaggtatgcgcgcatgcggtcgccgcgcaactcgcgcgagggccg a gggtttgggacgggccgteccgaaatgcagttgcacccggatgcgtggcacatttttgcgataatttatgcaatggact gct ctgcaaaattctggctctgtcgccaaccctaggatcagcggcgtaggatttcgtaatcattcgtcctgatggggagcta ccg actaccctaatatcagcccgactgcctgacgccagcgtccactifigtgcacacattccattcgtgcccaagacatttc attgt ggtgcgaagcgtecccagttacgctcacctgtttcccgacctccttactgttctgtcgacagagegggcccacaggccg gt cgcagccactagtATGgccaccacctccctggcctccgccttctgctccatgaaggccgtgatgctggcccgcgacg gccgcggcctgaagccccgctcctccgacctgcagctgcgcgccggcaacgcccagacctccctgaagatgatcaac ggcaccaagttctcctacaccgagtccctgaagaagctgcccgactggtccatgctgttcgccgtgatcaccaccatct t ctccgccgccgagaagcagtggaccaacctggagtggaagcccaagcccaaccccccccagctgctggacgacca cttcggcccccacggcctggtgttccgccgcaccttcgccatccgctcctacgaggtgggccccgaccgctccacctcc atcgtggccgtgatgaaccacctgcaggaggccgccctgaaccacgccaagtccgtgggcatcctgggcgacggctt 
cggcaccaccctggagatgtccaagcgcgacctgatctgggtggtgaagcgcacccacgtggccgtggagcgctacc ccgcctggggcgacaccgtggaggtggagtgctgggtgggcgcctccggcaacaacggccgccgccacgacttcct ggtgcgcgactgcaagaccggcgagatcctgacccgctgcacctccctgtccgtgatgatgaacacccgcacccgcc gcctgtccaagatccccgaggaggtgcgcggcgagatcggccccgccttcatcgacaacgtggccgtgaaggacga ggagatcaagaagccccagaagctgaacgactccaccgccgactacatccagggcggcctgaccccccgctggaa cgacctggacatcaaccagcacgtgaacaacatcaagtacgtggactggatcctggagaccgtgcccgactccatctt cgagtcccaccacatctcctccttcaccatcgagtaccgccgcgagtgcacccgcgactccgtgctgcagtccctgacc accgtgtccggcggctcctccgaggccggcctggtgtgcgagcacctgctgcagctggagggcggctccgaggtgct gcgcgccaagaccgagtggcgccccaagctgtccttccgcggcatctccgtgatccccgccgagtcctccgtgatgga ctacaaggaccacgacggcgactacaaggaccacgacatcgactacaaggacgacgacgacaagTGActcga ggcagcagcagctcggatagtatcgacacactctggacgctggtcgtgtgatggactgttgccgccacacttgctgcct tg acctgtgaatatccctgccgcttttatcaaacagcctcagtgtgtttgatcttgtgtgtacgcgcttttgcgagttgct agctgctt gtgctatttgcgaataccacccccagcatccccttccctcgificatatcgcttgcatcccaaccgcaacttatctacg ctgtcc tgctatccctcagcgctgctcctgctcctgctcactgcccctcgcacagccttggtttgggctccgcctgtattctcct ggtact gcaacctgtaaaccagcactgcaatgctgatgcacgggaagtagtgggatgggaacacaaatggaAAGCTGTAT
AGGGATAACAGGGTAATgagctcttgifitccagaaggagttgctccttgagccificattctcagcctcgata acctccaaagccgctctaattgtggagggggttcgaatttaaaagettggaatgttggttcgtgcgtctggaacaagcc cag acttgttgctcactgggaaaaggaccatcagctccaaaaaacttgccgctcaaaccgcgtacctctgctttcgcgcaat ctgc cctgttgaaatcgccaccacattcatattgtgacgcttgagcagtctgtaattgcctcagaatgtggaatcatctgccc cctgtg cgagcccatgccaggcatgtcgcgggcgaggacacccgccactcgtacagcagaccattatgctacctcacaatagttc a taacagtgaccatatttctcgaagctccccaacgagcacctccatgctctgagtggccaccccccggccctggtgcttg cg gagggcaggtcaaccggcatggggctaccgaaatccccgaccggatcccaccacccccgcgatgggaagaatctctcc ccgggatgtgggcccaccaccagcacaacctgctggcccaggcgagcgtcaaaccataccacacaaatatccttggcat cggccctgaattccttctgccgctctgctacccggtgcttctgtccgaagcaggggttgctagggatcgctccgagtcc gca aacccttgtcgcgtggcggggcttgttcgagcttgaagagc
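The statement that BspQI sites delimit the transforming DNA can be checked against Table 5 with a short Python sketch. BspQI recognizes GCTCTTC, so a fragment bounded by inward-facing sites starts with that motif and ends with its reverse complement. The function names are illustrative, and the example string joins only the first and last nucleotides of Table 5 with a hypothetical filler.

```python
BSPQI_SITE = "GCTCTTC"  # BspQI recognition sequence (top strand)
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def is_flanked_by_bspqi(seq: str) -> bool:
    """True if seq starts with a BspQI site and ends with its reverse complement."""
    cleaned = "".join(seq.split()).upper()  # table text contains stray spaces
    return cleaned.startswith(BSPQI_SITE) and cleaned.endswith(
        reverse_complement(BSPQI_SITE)
    )


if __name__ == "__main__":
    # 5' and 3' ends copied from Table 5; "acgtacgt" is a hypothetical filler.
    fragment = "gctcttcgccgcc" + "acgtacgt" + "gagcttgaagagc"
    print(reverse_complement(BSPQI_SITE), is_flanked_by_bspqi(fragment))
```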
[0085] Constructs encoding the identified heterologous FatB genes, such as CcFATB1b from pSZ2760 in Table 6, were transformed into Strain A, and transformants were selected for the ability to grow on sucrose. Transformations, cell culture, lipid production and fatty acid analysis were all carried out as previously described. After cultivating on sucrose under low nitrogen conditions to accumulate oil, fatty acid profiles were determined by FAME-GC. The top performer from each transformation, as judged by the ability to produce the highest level of midchain fatty acids, is shown in Table 6.

Table 6: Alteration of Fatty Acid Profiles in S3150 upon Expression of Heterologous FatB Thioesterases
FA profile of top performer from each transformation (%; primary lipid in Strain A background)
Species | Gene Name | SZ Plasmid | Strain | C8:0 | C10:0 | C12:0 | C14:0 | C16:0 | C18:0 | C18:1 | C18:2 | C18:3α
Cinnamomum camphora | CcFATB1b | pSZ2760 | A; T526; D1670-13 |
Cinnamomum camphora | CcFATB4 | pSZ2756 | A; T525; D1666 | 0 | 1 | 33 |
Cinnamomum camphora | CcFATB3 | pSZ2755 | A; T525; D1665 |
Cuphea hyssopifolia | ChsFATB1 | pSZ2778 | A; T535; D1689 |
Cuphea hyssopifolia | ChsFATB2 | pSZ2796 | A; T537; D1700 |
Cuphea hyssopifolia | ChsFATB2b | pSZ2792 | A; T537; D1696 |
Cuphea hyssopifolia | ChsFATB3 | pSZ2797 | A; T537; D1701 |
Cuphea hyssopifolia | ChsFATB3b | pSZ2795 | A; T537; D1699 |
Cuphea PSR23 | CuPSR23FATB3 | pSZ2793 | A; T537; D1697 |
Cuphea wrightii | CwFATB3 | pSZ2751 | A; T525; D1661 |
Cuphea wrightii | CwFATB4a | pSZ2752 | A; T525; D1662 |
Cuphea wrightii | CwFATB4b | pSZ2753 | A; T525; D1663 |
Cuphea wrightii | CwFATB5 | pSZ2754 | A; T525; D1664 |
Cuphea heterophylla | ChtFATB1a | pSZ2757 | A; T525; D1667 |
Cuphea heterophylla | ChtFATB1b | pSZ2773 | A; T535; D1685 |
Cuphea heterophylla | ChtFATB2b | pSZ2780 | A; T535; D1691 |
Cuphea heterophylla | ChtFATB2a | pSZ2774 | A; T537; D1702 |
Cuphea heterophylla | ChtFATB2c | pSZ2758 | A; T525; D1668 | 0 | 0 | 3 | 2 | 23 | 3 | 58 | 7 | 1
Cuphea heterophylla | ChtFATB2d | pSZ2759 | A; T526; D1669 |
Cuphea heterophylla | ChtFATB2e | pSZ2775 | A; T535; D1686 |
Cuphea heterophylla | ChtFATB2f | pSZ2777 | A; T535; D1688 |
Cuphea heterophylla | ChtFATB2g | pSZ2794 | A; T537; D1698 |
Cuphea heterophylla | ChtFATB3a | pSZ2776 | A; T535; D1687 |
Cuphea heterophylla | ChtFATB3b | pSZ2779 | A; T535; D1690 |
Cuphea viscosissima | CvisFATB1 | pSZ2810 | A; T540; D1711 |
Cuphea viscosissima | CvisFATB2 | pSZ2817 | A; T547; D1718 |
Cuphea viscosissima | CvisFATB3 | pSZ2791 | A; T537; D1695 | 0 | 0 | 0 | 8 | 28 | 2 | 52 | 8 | 1
A (parent strain) |
[0086] Many of the acyl-ACP FatB thioesterases were found to exhibit midchain activity when expressed in Prototheca moriformis. For example, expression of CcFATB1b causes an increase in myristate levels from 2% of total fatty acids in the parent, Strain A, to ~15% in the D1670-13 primary transformant. Other examples include CcFATB4, which exhibits an increase in laurate levels from 0% in Strain A to ~33%, and ChsFATB3, which exhibits an increase in myristate levels to ~34%. Although some of the acyl-ACP thioesterases did not exhibit dramatic effects on midchain levels in their current form, further efforts may optimize some of these constructs.
[0087] Sequences of the Heterologous Acyl-ACP Thioesterases Identified and Transformed into P. moriformis (UTEX 1435)
[0088] A complete listing of relevant sequences for the transforming constructs, such as the deduced amino acid sequence of the encoded acyl-ACP thioesterase, the native CDS coding sequence, the Prototheca moriformis codon-optimized coding sequence, and the nature of the sequence variants examined, is provided as SEQ ID NOS: 1-78.
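The relationship between a native coding sequence, its codon-optimized counterpart and the deduced amino acid sequence can be checked with a short Python sketch. It uses the standard genetic code and the first 24 nucleotides of SEQ ID NO: 5 and SEQ ID NO: 6; the helper name translate is illustrative, and the check is a sketch rather than the verification procedure used for the constructs.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding DNA string into one-letter amino acids."""
    cleaned = "".join(dna.split()).upper()
    if len(cleaned) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cleaned[i:i + 3]] for i in range(0, len(cleaned), 3))


if __name__ == "__main__":
    native = "ATGGTCACCACCTCTTTAGCTTCC"     # start of SEQ ID NO: 5
    optimized = "ATGGTGACCACCTCCCTGGCCTCC"  # start of SEQ ID NO: 6
    print(translate(native), translate(optimized))
```

Both fragments translate to the same eight residues that open SEQ ID NO: 4, which is the expected outcome when codon optimization changes only synonymous codons.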
Example 3. Discovery and Cloning of Additional FATB genes
Additional FATB genes were obtained from seeds as described above. The species and number of FatB genes identified were:
Species | Accession Number | Novel FatB Thioesterase Genes
Cuphea calcarata | 534665 | 1
Cuphea painteri | 288248 | 1
Cuphea hookeriana | 534896 | 1
Cuphea avigera var. pulcherrima | Ames 17868 | 1
Cuphea paucipetala | 534877 | 1
Cuphea procumbens | 534881 | 3
Cuphea ignea | 534773 | 1
[0089] The thioesterases that clustered with acyl-ACP FatB thioesterases, which are involved in biosynthesis of C8-C16 fatty acids, were pursued. The native, putative plastid-targeting transit peptide sequence is indicated by underlining.
[0090] Construction of Transforming Vectors Expressing Acyl-ACP FatB Thioesterases. The nine putative acyl-ACP FatB thioesterases from the species Cuphea calcarata, Cuphea painteri, Cuphea hookeriana, Cuphea avigera var. pulcherrima, Cuphea paucipetala, Cuphea procumbens, and Cuphea ignea were synthesized in a codon-optimized form to reflect UTEX 1435 codon usage. In contrast to the previous example, the new acyl-ACP FatB thioesterases were synthesized with a modified transit peptide from Chlorella protothecoides (Cp) in place of the native transit peptide. The modified transit peptide, derived from the CpSAD1 gene and termed "CpSAD1tp_trimmed", was synthesized as an in-frame, N-terminal fusion to the FatB acyl-ACP thioesterases; the resulting sequences are listed below. The novel FatB genes were cloned into Prototheca moriformis as described above. Constructs encoding heterologous FatB genes were transformed into strain S6165 (a descendant of S3150/Strain A), and transformants were selected for the ability to grow on sucrose. Transformations, cell culture, lipid production and fatty acid analysis were all carried out as previously described. The results for the nine novel FatB acyl-ACP thioesterases are displayed in the table immediately below.

FA profile of top performer from each transformation (%; primary lipid)
Species | Gene Name | SZ Plasmid | Strain | C8:0 | C10:0 | C12:0 | C14:0 | C16:0 | C18:0 | C18:1 | C18:2 | C18:3α
Cuphea calcarata | CcalcFATB1 | pSZ3764 | S6165; T778; D2508-26 | 0 | 1 | 12 | 18 | 29 | 2 | 29 | 5 | 1
Cuphea painteri | CpaiFATB1 | pSZ3838 | S6165; T841 |
Cuphea hookeriana | ChookFATB4 | pSZ3837 | S6165; T788; D2552-18 | 0 | 0 | 0 | 2 | 32 | 2 | 54 | 7 | 1
Cuphea avigera var. pulcherrima | CaFATB1 | pSZ4084 | S6165; T841; D2800 |
Cuphea paucipetala | CpauFATB1 | pSZ3762 | S6165; T778; D2506-46 | 0 | 9 | 1 | 3 | 28 | 2 | 47 | 7 | 1
Cuphea procumbens | CprocFATB1 | pSZ3929 | S6165; T814; D2675 |
Cuphea procumbens | CprocFATB2 | pSZ3839 | S6165; T788; D2553 | 0 | 0 | 0 | 2 | 32 | 3 | 55 | 6 |
Cuphea procumbens | CprocFATB3 | pSZ3763 | S6165; T778; D2507-29 | 0 | 3 | 1 | 2 | 28 | 3 | 54 | 6 | 1
Cuphea ignea | CigneaFATB1 | pSZ3930 | S6165; T814; D2676-34 | 0 | 8 | 1 | 4 | 24 | 2 | 51 | 8 | 1
S6165 (parent strain) |
c,:' acid levels; CigneaFATB1, which exhibits 8% C10:0 and 1% C12:0 fatty acid levels;
Cca1cFATB1, which exhibits 18% C14:0 and 12% C12:0 levels; and CaFATB1, which exhibits 22% C8:0 and 9% C10:0 fatty acid levels.
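The fatty acid levels quoted above can be compared by summing the midchain fractions of each profile, as in the following Python sketch. The profiles are the rows that are legible in the table of this example; treating C8:0 through C14:0 as "midchain" is an assumption made for illustration, and the function names are hypothetical.

```python
# Column order of the fatty acid profile (% of total fatty acids)
FATTY_ACIDS = ("C8:0", "C10:0", "C12:0", "C14:0", "C16:0",
               "C18:0", "C18:1", "C18:2", "C18:3a")
MIDCHAIN = ("C8:0", "C10:0", "C12:0", "C14:0")  # assumed definition of midchain

# Profiles of top performers in S6165, taken from the legible rows of the table
PROFILES = {
    "CcalcFATB1": (0, 1, 12, 18, 29, 2, 29, 5, 1),
    "CpauFATB1": (0, 9, 1, 3, 28, 2, 47, 7, 1),
    "CprocFATB3": (0, 3, 1, 2, 28, 3, 54, 6, 1),
    "CigneaFATB1": (0, 8, 1, 4, 24, 2, 51, 8, 1),
}


def midchain_total(profile):
    """Sum of the midchain fatty acid percentages in one profile."""
    by_name = dict(zip(FATTY_ACIDS, profile))
    return sum(by_name[name] for name in MIDCHAIN)


def rank_by_midchain(profiles):
    """Return (gene, midchain total) pairs, highest total first."""
    totals = [(gene, midchain_total(p)) for gene, p in profiles.items()]
    return sorted(totals, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for gene, total in rank_by_midchain(PROFILES):
        print(f"{gene}: {total}% midchain")
```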
[0092] CaFATB1, which exhibits high C8:0 and C10:0 levels, is of particular interest. CaFATB1 arose from two separate contigs that were assembled from the Cuphea avigera var. pulcherrima transcriptome, S17 Cavig trinity 7406 and S17 Cavig trinity 7407. Although the two partial contigs exhibit only 17 nucleotides of overlap, we were able to assemble a putative full-length transcript encoding CaFATB1 from the two contigs and then subsequently confirm the existence of the full-length transcript by direct reverse transcription PCR analysis using seed RNA and primer pairs targeting the full-length CaFATB1 thioesterase cDNA. Tjellstrom et al. (2013) disclose the expression of a newly identified fatty acyl-ACP thioesterase from Cuphea pulcherrima that they named "CpuFATB3" (Genbank accession number KC675178). The coding sequence of CpuFATB3 is 100% identical to the CaFATB1 gene we identified and contains one nucleotide difference in the RNA sequence outside the predicted coding region. Tjellstrom et al. (2013) showed that CpuFATB3 produces an average of 4.8% C8:0 when expressed in Arabidopsis, and further requires deletion of two acyl-ACP synthetases, AAE15/16, to produce an average of 9.2% C8:0 with a maximum level of ~12% C8:0. The CaFATB1 gene we identified was codon-optimized for expression in UTEX1435 and generated as a CpSAD1tp_trimmed transit peptide fusion before introduction into S6165. The CpSAD1tp_trimmed:CaFATB1 gene produces an average C8:0 level of 14% and a maximum level of 22% C8:0 without requiring the deletion of endogenous acyl-ACP synthetases.
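Joining two contigs that share a short terminal overlap, as described for CaFATB1, can be sketched in Python as follows. The sequences in the usage example are hypothetical toy strings, not the actual contigs, and merge_by_overlap is an illustrative helper rather than the assembler that was used; the default minimum overlap of 17 simply mirrors the overlap reported above.

```python
def merge_by_overlap(left: str, right: str, min_overlap: int = 17) -> str:
    """Join two sequences on the longest suffix of `left` that is a prefix of `right`.

    Raises ValueError if no overlap of at least `min_overlap` bases exists.
    """
    left, right = left.upper(), right.upper()
    longest_possible = min(len(left), len(right))
    # Try the longest candidate overlap first and shrink until one matches.
    for k in range(longest_possible, min_overlap - 1, -1):
        if left.endswith(right[:k]):
            return left + right[k:]
    raise ValueError(f"no overlap of at least {min_overlap} bases found")


if __name__ == "__main__":
    # Hypothetical toy contigs sharing a 4-base overlap ("ACGT")
    print(merge_by_overlap("ACGTACGT", "ACGTTT", min_overlap=4))
```

A short overlap such as this one can arise by chance, which is consistent with the authors confirming the joined transcript independently by reverse transcription PCR.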
[0093] Table 7. Amino Acid Sequences of Additional Novel FatB Acyl-ACP Thioesterases. In the appended sequence listings, the native, putative plastid-targeting transit peptide sequence is underlined:
FatB    Sequence ID NO:
CcalcFATB1 (Cuphea calcarata FATB1)    SEQ ID NO: 168
ChookFATB4 (Cuphea hookeriana FATB4)    SEQ ID NO: 169
CaFATB1 (Cuphea avigera var. pulcherrima FATB1)    SEQ ID NO: 170
CpauFATB1 (Cuphea paucipetala FATB1)    SEQ ID NO: 171
CprocFATB1 (Cuphea procumbens FATB1)    SEQ ID NO: 172
CprocFATB2 (Cuphea procumbens FATB2)    SEQ ID NO: 173
CprocFATB3 (Cuphea procumbens FATB3)    SEQ ID NO: 174
CigneaFATB1 (Cuphea ignea FATB1)    SEQ ID NO: 175
CcalcFATB1 (Cuphea calcarata FATB1)    SEQ ID NO: 176
ChookFATB4 (Cuphea hookeriana FATB4)    SEQ ID NO: 177
CaFATB1 (Cuphea avigera var. pulcherrima FATB1)    SEQ ID NO: 178
CpauFATB1 (Cuphea paucipetala FATB1)    SEQ ID NO: 179
CprocFATB1 (Cuphea procumbens FATB1)    SEQ ID NO: 180
CprocFATB2 (Cuphea procumbens FATB2)    SEQ ID NO: 181
CprocFATB3 (Cuphea procumbens FATB3)    SEQ ID NO: 182
CigneaFATB1 (Cuphea ignea FATB1)    SEQ ID NO: 183
Example 4. FATB consensus sequences: Discovery, cloning and fatty acid profiles
[0094] In the course of testing several new putative midchain FatB thioesterases in UTEX1435, S3150 (Strain A above), we identified several thioesterases with increased C10:0 and C16:0 activity above the background midchain levels found in the strain. We reasoned that a consensus sequence could be obtained for an idealized C10:0 thioesterase and C16:0 thioesterase by aligning the best-performing C10:0 and C16:0 thioesterases. A consensus C10:0-specific thioesterase sequence was generated using the C. palustris FatB1 (CpFATB1), C. PSR23 FatB3 (CuPSR23FATB3), C. viscosissima FatB1 (CvisFATB1), C. glossostoma FatB1 (CgFATB1), and C. carthagenensis FatB2 (CcrFATB2) sequences as inputs, resulting in a C10:0-specific consensus sequence termed JcFATB1/SzFATB1. A consensus C16:0-specific thioesterase sequence was generated using the C. heterophylla FatB3a (ChtFATB3a), C. carthagenensis FatB1 (CcrFATB1), C. viscosissima FatB2 (CvisFATB2), C. hookeriana FatB1 (ChFATB1; AAC48990), C. hyssopifolia FatB2 (ChsFATB2), C. calophylla FatB2 (CcalFATB2; ABB71581), C. hookeriana FatB1-1 (ChFATB1-1; AAC72882), C. lanceolata FatB1 (ClFATB1; CAC19933), and C. wrightii FatB4a (CwFATB4a) sequences as inputs, resulting in a C16:0-specific consensus sequence termed JcFATB2/SzFATB2. The resulting consensus sequences were synthesized, cloned into a vector identical to that used to test other FatB thioesterases, and introduced into S3150 as described above. The consensus amino acid sequences are given as SEQ ID NOs: 106 and 107; the nucleic acid sequences were based on these amino acid sequences using codon optimization for Prototheca moriformis. The transformants were selected and cultivated, and the oil was extracted and analyzed by FAME-GC-FID. The fatty acid profiles obtained are given in the table below.
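Deriving a consensus from aligned thioesterase sequences can be illustrated with a simple majority-rule sketch in Python. It assumes the input sequences are already aligned to equal length with "-" marking gaps; the toy sequences in the usage example are hypothetical, and the majority rule shown is an assumption, since the procedure actually used to derive JcFATB1 and JcFATB2 is not specified beyond alignment of the input sequences.

```python
from collections import Counter
from typing import Sequence


def consensus(aligned: Sequence[str], gap: str = "-") -> str:
    """Majority-rule consensus of equal-length, pre-aligned sequences.

    Columns whose most common symbol is the gap character are omitted.
    """
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("sequences must be non-empty and of equal aligned length")
    residues = []
    for column in zip(*aligned):
        symbol, _count = Counter(column).most_common(1)[0]
        if symbol != gap:
            residues.append(symbol)
    return "".join(residues)


if __name__ == "__main__":
    # Hypothetical toy alignment, not actual FatB sequences
    print(consensus(["MKAV", "MKAL", "MRAV"]))
```

A plain majority rule ignores residue similarity and breaks ties arbitrarily; a scoring matrix or manual curation would be needed where the inputs disagree evenly.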
FA profile of top performer from each transformation (%; primary lipid)
Species | Gene Name | SZ Plasmid | Strain | C8:0 | C10:0 | C12:0 | C14:0 | C16:0 | C18:0 | C18:1 | C18:2 | C18:3α
Consensus Sequence 1 | JcFATB1 | pSZ3187 | S3150; T617 | 0 | 2 | 0 | 2 | 26 | 3 | 57 | 8 | 1
Consensus Sequence 2 | JcFATB2 | pSZ3… | S3150; T600 |
S3150 (parent strain) |
Example 5: Clade analysis
Various novel FATB thioesterases were clustered according to a neighbor joining algorithm. These were found to form twelve clades as listed in Table 1a. Putative function was assigned based on expression in Prototheca as described above.
[0095] The described embodiments of the invention are intended to be merely exemplary and numerous variations and modifications will be apparent to those skilled in the art. All such variations and modifications are intended to be within the scope of the present invention.

Sequence Listing
SEQ ID NO 1:
Cinnamomum camphora (Cc) FATB1b variant M25L, M322R, ΔT367-D368 amino acid sequence
MATTSLASAFCSMKAVMLARDGRGLKPRSSDLQLRAGNAQTSLKMINGTKFSYTESLKKLPD
WSMLFAVITTIFSAAEKQWTNLEWKPKPNPPQLLDDHFGPHGLVFRRTFAIRSYEVGPDRSTSI
VAVMNHLQEAALNHAKSVGILGDGFGTTLEMSKRDLIWVVKRTHVAVERYPAWGDTVEVE
CWVGASGNNGRRHDFLVRDCKTGEILTRCTSLSVMMNTRTRRLSKIPEEVRGEIGPAFIDNVA
VKDEEIKKPQKLNDSTADYIQGGLTPRWNDLDINQHVNNIKYVDWILETVPDSIFESHHISSFTI
EYRRECTRDSVLQSLTTVSGGSSEAGLVCEHLLQLEGGSEVLRAKTEWRPKLSFRGISVIPAES
SV*
SEQ ID NO 2:
Cinnamomum camphora (Cc) FATB1b variant M25L, M322R, ΔT367-D368 coding DNA sequence
TTAGCTTCTGCTTTCTGCTCGATGAAAGCTGTAATGTTGGCTCGTGATGGCAGGGGCTTGA
AACCCAGGAGCAGTGATTTGCAGCTGAGGGCGGGAAATGCACAAACCTCTTTGAAGATGA
TCAATGGGACCAAGTTCAGTTACACAGAGAGCTTGAAAAAGTTGCCTGACTGGAGCATGC
TCTTTGCAGTGATCACGACCATCTTTTCGGCTGCTGAGAAGCAGTGGACCAATCTAGAGTG
GAAGCCGAAGCCGAATCCACCCCAGTTGCTTGATGACCATTTTGGGCCGCATGGGTTAGTT
TTCAGGCGCACCTTTGCCATCAGATCGTATGAGGTGGGACCTGACCGCTCCACATCTATAG
TGGCTGTTATGAATCACTTGCAGGAGGCTGCACTTAATCATGCGAAGAGTGTGGGAATTCT
AGGAGATGGATTCGGTACGACGCTAGAGATGAGTAAGAGAGATCTGATATGGGTTGTGAA
ACGCACGCATGTTGCTGTGGAACGGTACCCTGCTTGGGGTGATACTGTTGAAGTAGAGTG
CTGGGTTGGTGCATCGGGAAATAATGGCAGGCGCCATGATTTCCTTGTCCGGGACTGCAA
AACAGGCGAAATTCTTACAAGATGTACCAGTCTTTCGGTGATGATGAATACAAGGACAAG
GAGGTTGTCCAAAATCCCTGAAGAAGTTAGAGGGGAGATAGGGCCTGCATTCATTGATAA
TGTGGCTGTCAAGGACGAGGAAATTAAGAAACCACAGAAGCTCAATGACAGCACTGCAG
ATTACATCCAAGGAGGATTGACTCCTCGATGGAATGATTTGGATATCAATCAGCACGTTA
ACAACATCAAATACGTTGACTGGATTCTTGAGACTGTCCCAGACTCAATCTTTGAGAGTCA
TCATATTTCCAGCTTCACTATTGAATACAGGAGAGAGTGCACGAGGGATAGCGTGCTGCA
GTCCCTGACCACTGTCTCCGGTGGCTCGTCGGAAGCTGGGTTAGTGTGCGAGCACTTGCTC
CAGCTTGAAGGTGGGTCTGAGGTATTGAGGGCAAAAACAGAGTGGAGGCCTAAGCTTAGT
TTCAGAGGGATTAGTGTGATACCCGCAGAATCGAGTGTCTAA
SEQ ID NO 3:
Cinnamomum camphora (Cc) FATB1b variant M25L, M322R, ΔT367-D368 coding DNA sequence codon optimized for Prototheca moriformis
TTAGCTTCTGCTTTCTGCTCGATGAAAGCTGTAATGTTGGCTCGTGATGGCAGGGGCTTGA
AACCCAGGAGCAGTGATTTGCAGCTGAGGGCGGGAAATGCACAAACCTCTTTGAAGATGA
TCAATGGGACCAAGTTCAGTTACACAGAGAGCTTGAAAAAGTTGCCTGACTGGAGCATGC
TCTTTGCAGTGATCACGACCATCTTTTCGGCTGCTGAGAAGCAGTGGACCAATCTAGAGTG
GAAGCCGAAGCCGAATCCACCCCAGTTGCTTGATGACCATTTTGGGCCGCATGGGTTAGTT
TTCAGGCGCACCTTTGCCATCAGATCGTATGAGGTGGGACCTGACCGCTCCACATCTATAG
TGGCTGTTATGAATCACTTGCAGGAGGCTGCACTTAATCATGCGAAGAGTGTGGGAATTCT
AGGAGATGGATTCGGTACGACGCTAGAGATGAGTAAGAGAGATCTGATATGGGTTGTGAA
ACGCACGCATGTTGCTGTGGAACGGTACCCTGCTTGGGGTGATACTGTTGAAGTAGAGTG
CTGGGTTGGTGCATCGGGAAATAATGGCAGGCGCCATGATTTCCTTGTCCGGGACTGCAA
AACAGGCGAAATTCTTACAAGATGTACCAGTCTTTCGGTGATGATGAATACAAGGACAAG
GAGGTTGTCCAAAATCCCTGAAGAAGTTAGAGGGGAGATAGGGCCTGCATTCATTGATAA
TGTGGCTGTCAAGGACGAGGAAATTAAGAAACCACAGAAGCTCAATGACAGCACTGCAG
ATTACATCCAAGGAGGATTGACTCCTCGATGGAATGATTTGGATATCAATCAGCACGTTA
ACAACATCAAATACGTTGACTGGATTCTTGAGACTGTCCCAGACTCAATCTTTGAGAGTCA
TCATATTTCCAGCTTCACTATTGAATACAGGAGAGAGTGCACGAGGGATAGCGTGCTGCA
GTCCCTGACCACTGTCTCCGGTGGCTCGTCGGAAGCTGGGTTAGTGTGCGAGCACTTGCTC
CAGCTTGAAGGTGGGTCTGAGGTATTGAGGGCAAAAACAGAGTGGAGGCCTAAGCTTAGT
TTCAGAGGGATTAGTGTGATACCCGCAGAATCGAGTGTCTAA
SEQ ID NO: 4
Cinnamomum camphora (Cc) FATB4 amino acid sequence
MVTTSLASAYFSMKAVMLAPDGRGIKPRSSGLQVRAGNERNSCKVINGTKVKDTEGLKGCST
LQGQSMLDDHFGLHGLVFRRTFAIRCYEVGPDRSTSIMAVMNHLQEAARNHAESLGLLGDGF
GETLEMSKRDLIWVVRRTHVAVERYPAWGDTVEVEAWVGASGNTGMRRDFLVRDCKTGHI
LTRCTSVSVMMNMRTRRLSKIPQEVRAEIDPLFIEKVAVKEGEIKKLQKLNDSTADYIQGGWT
PRWNDLDVNQHVNNIIYVGWIFKSVPDSISENHHLSSITLEYRRECTRGNKLQSLTTVCGGSSE
AGIICEHLLQLEDGSEVLRARTEWRPKHTDSFQGISERFPQQEPHK
SEQ ID NO: 5
Cinnamomum camphora (Cc) FATB4 coding DNA sequence
ATGGTCACCACCTCTTTAGCTTCCGCTTACTTCTCGATGAAAGCTGTAATGTTGGCTCCTGA
CGGCAGGGGCATAAAGCCCAGGAGCAGTGGTTTGCAGGTGAGGGCGGGAAATGAACGAA
ACTCTTGCAAGGTGATCAATGGGACCAAGGTCAAAGACACGGAGGGCTTGAAAGGGTGC
AGCACGTTGCAAGGCCAGAGCATGCTTGATGACCATTTTGGTCTGCATGGGCTAGTTTTCA
GGCGCACCTTTGCAATCAGATGCTATGAGGTTGGACCTGACCGCTCCACATCCATAATGGC
TGTTATGAATCACTTGCAGGAAGCTGCACGTAATCATGCGGAGAGTCTGGGACTTCTAGG
AGATGGATTCGGTGAGACACTGGAGATGAGTAAGAGAGATCTGATATGGGTTGTGAGACG
CACGCATGTTGCTGTGGAACGGTACCCTGCTTGGGGCGATACTGTTGAAGTCGAGGCCTG
GGTGGGTGCATCAGGTAACACTGGCATGCGCCGCGATTTCCTTGTCCGCGACTGCAAAAC
TGGCCACATTCTTACAAGATGTACCAGTGTTTCAGTGATGATGAATATGAGGACAAGGAG
ATTGTCCAAAATTCCCCAAGAAGTTAGAGCGGAGATTGACCCTCTTTTCATTGAAAAGGTT
GCTGTCAAGGAAGGGGAAATTAAAAAATTACAGAAGTTGAATGATAGCACTGCAGATTAC
ATTCAAGGGGGTTGGACTCCTCGATGGAATGATTTGGATGTCAATCAGCACGTGAACAAT
ATCATATACGTTGGCTGGATTTTTAAGAGCGTCCCAGACTCTATCTCTGAGAATCATCATC
TTTCTAGCATCACTCTCGAATACAGGAGAGAGTGCACAAGGGGCAACAAGCTGCAGTCCC
TGACCACTGTTTGTGGTGGCTCGTCGGAAGCTGGGATCATATGTGAGCACCTACTCCAGCT
TGAGGATGGGTCTGAGGTTTTGAGGGCAAGAACAGAGTGGAGGCCCAAGCACACCGATA
GTTTCCAAGGCATTAGTGAGAGATTCCCGCAGCAAGAACCGCATAAGTAA
SEQ ID NO: 6
Cinnamomum camphora (Cc) FATB4 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGACCACCTCCCTGGCCTCCGCCTACTTCTCCATGAAGGCCGTGATGCTGGCCCCCG
ACGGCCGCGGCATCAAGCCCCGCTCCTCCGGCCTGCAGGTGCGCGCCGGCAACGAGCGCA
ACTCCTGCAAGGTGATCAACGGCACCAAGGTGAAGGACACCGAGGGCCTGAAGGGCTGC
TCCACCCTGCAGGGCCAGTCCATGCTGGACGACCACTTCGGCCTGCACGGCCTGGTGTTCC
GCCGCACCTTCGCCATCCGCTGCTACGAGGTGGGCCCCGACCGCTCCACCTCCATCATGGC
CGTGATGAACCACCTGCAGGAGGCCGCCCGCAACCACGCCGAGTCCCTGGGCCTGCTGGG
CGACGGCTTCGGCGAGACCCTGGAGATGTCCAAGCGCGACCTGATCTGGGTGGTGCGCCG
CACCCACGTGGCCGTGGAGCGCTACCCCGCCTGGGGCGACACCGTGGAGGTGGAGGCCTG
GGTGGGCGCCTCCGGCAACACCGGCATGCGCCGCGACTTCCTGGTGCGCGACTGCAAGAC
CGGCCACATCCTGACCCGCTGCACCTCCGTGTCCGTGATGATGAACATGCGCACCCGCCGC
CTGTCCAAGATCCCCCAGGAGGTGCGCGCCGAGATCGACCCCCTGTTCATCGAGAAGGTG
GCCGTGAAGGAGGGCGAGATCAAGAAGCTGCAGAAGCTGAACGACTCCACCGCCGACTA
CATCCAGGGCGGCTGGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAA
CATCATCTACGTGGGCTGGATCTTCAAGTCCGTGCCCGACTCCATCTCCGAGAACCACCAC
CTGTCCTCCATCACCCTGGAGTACCGCCGCGAGTGCACCCGCGGCAACAAGCTGCAGTCC
CTGACCACCGTGTGCGGCGGCTCCTCCGAGGCCGGCATCATCTGCGAGCACCTGCTGCAG
CTGGAGGACGGCTCCGAGGTGCTGCGCGCCCGCACCGAGTGGCGCCCCAAGCACACCGAC
TCCTTCCAGGGCATCTCCGAGCGCTTCCCCCAGCAGGAGCCCCACAAGTGA

SEQ ID NO: 7
Cinnamomum camphora (Cc) FATB3 amino acid sequence
MVATAAASAFFPVGAPATSSATSAKASMMPDNLDARGIKPKPASSSGLQVKANAHASPKING
SKVSTDTLKGEDTLTSSPAPRTFINQLPDWSMFLAAITTIFLAAEKQWTNLDWKPRRPDMLAD
PFGIGRFMQDGLIFRQHFAIRSYEIGADRTASIETLMNHLQETALNHVRSAGLLGDGFGATPEM
SRRDLIWVVTRMQVLVDRYPAWGDIVEVETWVGASGKNGMRRDWLVRDSQTGEILTRATSV
WVMMNKRTRRLSKLPEEVRGEIGPYFIEDVAIIEEDNRKLQKLNENTADNVRRGLTPRWSDLD
VNQHVNNVKYIGWILESAPGSILESHELSCMTLEYRRECGKDSVLQSMTAVSGGGSAAGGSPE
SSVECDHLLQLESGPEVVRGRTEWRPKSANNSRSILEMPAESL
SEQ ID NO: 8
Cinnamomum camphora (Cc) FATB3 coding DNA sequence
ATGGTTGCCACCGCTGCTGCTTCTGCTTTCTTCCCGGTCGGTGCTCCGGCTACGTCATCTGC
AACTTCAGCCAAAGCGTCGATGATGCCTGATAATTTGGATGCCAGAGGCATCAAACCGAA
GCCGGCTTCGTCCAGCGGCTTGCAGGTTAAGGCAAATGCCCATGCCTCTCCCAAGATTAAT
GGTTCCAAGGTGAGCACGGATACCTTGAAGGGGGAAGACACCTTAACTTCCTCGCCCGCC
CCACGGACCTTTATCAACCAATTGCCTGACTGGAGCATGTTCCTTGCTGCCATCACAACTA
TTTTCTTGGCTGCCGAGAAGCAGTGGACGAATCTCGACTGGAAGCCCAGAAGACCCGACA
TGCTTGCTGACCCGTTTGGCATCGGGAGGTTTATGCAGGATGGGCTGATTTTCAGGCAGCA
CTTTGCAATCAGATCTTATGAGATTGGGGCTGATAGAACGGCGTCTATAGAGACTTTAATG
AATCACTTGCAGGAGACTGCACTTAATCATGTGAGGAGTGCTGGACTCCTAGGTGATGGA
TTTGGTGCGACACCTGAGATGAGTAGAAGAGATCTGATATGGGTTGTAACACGTATGCAG
GTTCTTGTGGACCGCTACCCTGCTTGGGGTGATATTGTTGAAGTAGAGACCTGGGTTGGTG
CATCTGGAAAAAATGGTATGCGCCGTGATTGGCTTGTTCGGGACAGCCAAACTGGTGAAA
TTCTCACACGAGCTACCAGTGTTTGGGTGATGATGAATAAACGGACAAGGCGATTGTCCA
AACTTCCTGAAGAAGTTAGAGGGGAAATAGGGCCTTATTTTATAGAAGATGTTGCTATCA
TAGAGGAGGACAACAGGAAACTACAGAAGCTCAATGAAAACACTGCTGATAATGTTCGA
AGGGGTTTGACTCCTCGCTGGAGTGATCTGGATGTTAATCAGCATGTGAACAATGTCAAAT
ACATTGGTTGGATTCTTGAGAGTGCACCAGGATCCATCTTGGAGAGTCATGAGCTTTCCTG
CATGACCCTTGAATACAGGAGAGAATGTGGGAAGGACAGTGTGCTGCAGTCAATGACTGC
TGTCTCTGGTGGAGGCAGTGCAGCAGGTGGCTCACCAGAATCTAGCGTTGAGTGTGACCA
CTTGCTCCAGCTAGAGAGTGGGCCTGAAGTTGTGAGGGGAAGAACCGAGTGGAGGCCCA
AGAGTGCTAATAACTCGAGGAGCATCCTGGAGATGCCGGCCGAGAGC
SEQ ID NO: 9
Cinnamomum camphora (Cc) FATB3 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCACCGCCGCCGCCTCCGCCTTCTTCCCCGTGGGCGCCCCCGCCACCTCCTCCG
CCACCTCCGCCAAGGCCTCCATGATGCCCGACAACCTGGACGCCCGCGGCATCAAGCCCA
AGCCCGCCTCCTCCTCCGGCCTGCAGGTGAAGGCCAACGCCCACGCCTCCCCCAAGATCA
ACGGCTCCAAGGTGTCCACCGACACCCTGAAGGGCGAGGACACCCTGACCTCCTCCCCCG
CCCCCCGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGTTCCTGGCCGCCATCACCAC
CATCTTCCTGGCCGCCGAGAAGCAGTGGACCAACCTGGACTGGAAGCCCCGCCGCCCCGA
CATGCTGGCCGACCCCTTCGGCATCGGCCGCTTCATGCAGGACGGCCTGATCTTCCGCCAG
CACTTCGCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGA
TGAACCACCTGCAGGAGACCGCCCTGAACCACGTGCGCTCCGCCGGCCTGCTGGGCGACG
GCTTCGGCGCCACCCCCGAGATGTCCCGCCGCGACCTGATCTGGGTGGTGACCCGCATGC
AGGTGCTGGTGGACCGCTACCCCGCCTGGGGCGACATCGTGGAGGTGGAGACCTGGGTGG
GCGCCTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGGTGCGCGACTCCCAGACCGGCG
AGATCCTGACCCGCGCCACCTCCGTGTGGGTGATGATGAACAAGCGCACCCGCCGCCTGT
CCAAGCTGCCCGAGGAGGTGCGCGGCGAGATCGGCCCCTACTTCATCGAGGACGTGGCCA
TCATCGAGGAGGACAACCGCAAGCTGCAGAAGCTGAACGAGAACACCGCCGACAACGTG
CGCCGCGGCCTGACCCCCCGCTGGTCCGACCTGGACGTGAACCAGCACGTGAACAACGTG
AAGTACATCGGCTGGATCCTGGAGTCCGCCCCCGGCTCCATCCTGGAGTCCCACGAGCTGT
CCTGCATGACCCTGGAGTACCGCCGCGAGTGCGGCAAGGACTCCGTGCTGCAGTCCATGA
CCGCCGTGTCCGGCGGCGGCTCCGCCGCCGGCGGCTCCCCCGAGTCCTCCGTGGAGTGCG
ACCACCTGCTGCAGCTGGAGTCCGGCCCCGAGGTGGTGCGCGGCCGCACCGAGTGGCGCC
CCAAGTCCGCCAACAACTCCCGCTCCATCCTGGAGATGCCCGCCGAGTCCCTGTGA
SEQ ID NO: 10 Cuphea hyssopifolia (Chs) FATB1 amino acid sequence
MVATNAAAFSAYTFFLTSPTHGYSSKRLADTQNGYPGTSLKSKSTPPPAAAAARNGALPLLAS
ICKCPKKADGSMQLDSSLVFGFQFYIRSYEVGADQTVSIQTVLNYLQEAAINHVQSAGYFGDS
FGATPEMTKRNLIWVITKMQVLVDRYPAWGDVVQVDTWTCSSGKNSMQRDWFVRDLKTGD
IITRASSVWVLMNRLTRKLSKIPEAVLEEAKLFVMNTAPTVDDNRKLPKLDGSSADYVLSGLT
PRWSDLDMNQHVNNVKYIAWILESVPQSIPETHKLSAITVEYRRECGKNSVLQSLTNVSGDGI
TCGNSIIECHHLLQLETGPEILLARTEWISKEPGFRGAPIQAEKVYNNK*
SEQ ID NO: 11 Cuphea hyssopifolia (Chs) FATB1 coding DNA sequence
ATGGTTGCCACTAATGCTGCTGCCTTTTCTGCTTATACTTTCTTCCTTACTTCACCAACTCA
TGGTTACTCTTCCAAACGTCTCGCCGATACTCAAAATGGTTATCCGGGTACCTCCTTGAAA
TCGAAATCCACTCCTCCACCAGCTGCTGCTGCTGCTCGTAACGGTGCATTGCCACTGCTGG
CCTCCATCTGCAAATGCCCCAAAAAGGCTGATGGGAGTATGCAACTAGACAGCTCCTTGG
TCTTCGGGTTTCAATTTTACATTAGATCATATGAAGTGGGTGCGGATCAAACCGTGTCAAT
ACAGACAGTACTCAATTACTTACAGGAGGCAGCCATCAATCATGTTCAGAGTGCTGGCTA
TTTTGGTGATAGTTTTGGCGCCACCCCGGAAATGACCAAGAGGAACCTCATCTGGGTTATC
ACTAAGATGCAGGTTTTGGTGGATCGCTATCCCGCTTGGGGCGATGTTGTTCAAGTTGATA
CATGGACCTGTAGTTCTGGTAAAAACAGCATGCAGCGTGATTGGTTCGTACGGGATCTCA
AAACTGGAGATATTATAACAAGAGCCTCGAGCGTGTGGGTGCTGATGAATAGACTCACCA
GAAAATTATCAAAAATTCCTGAAGCAGTTCTGGAAGAAGCAAAACTTTTTGTGATGAACA
CTGCCCCCACCGTAGATGACAACAGGAAGCTACCAAAGCTGGATGGCAGCAGTGCTGATT
ATGTCCTCTCTGGCTTAACTCCTAGATGGAGCGACTTAGATATGAACCAGCATGTCAACAA
TGTGAAGTACATAGCCTGGATCCTTGAGAGTGTCCCTCAGAGCATACCGGAGACACACAA
GCTGTCAGCGATAACCGTGGAGTACAGGAGAGAATGTGGCAAGAACAGCGTCCTCCAGTC
TCTGACCAACGTCTCCGGGGATGGAATCACATGTGGAAACAGTATTATCGAGTGCCACCA
TTTGCTTCAACTTGAGACTGGCCCAGAGATTCTACTAGCGCGGACGGAGTGGATATCCAA
GGAACCTGGGTTCAGGGGAGCTCCAATCCAGGCAGAGAAAGTCTACAACAACAAATAA
SEQ ID NO: 12 Cuphea hyssopifolia (Chs) FATB1 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCACCAACGCCGCCGCCTTCTCCGCCTACACCTTCTTCCTGACCTCCCCCACCC
ACGGCTACTCCTCCAAGCGCCTGGCCGACACCCAGAACGGCTACCCCGGCACCTCCCTGA
AGTCCAAGTCCACCCCCCCCCCCGCCGCCGCCGCCGCCCGCAACGGCGCCCTGCCCCTGCT
GGCCTCCATCTGCAAGTGCCCCAAGAAGGCCGACGGCTCCATGCAGCTGGACTCCTCCCT
GGTGTTCGGCTTCCAGTTCTACATCCGCTCCTACGAGGTGGGCGCCGACCAGACCGTGTCC
ATCCAGACCGTGCTGAACTACCTGCAGGAGGCCGCCATCAACCACGTGCAGTCCGCCGGC
TACTTCGGCGACTCCTTCGGCGCCACCCCCGAGATGACCAAGCGCAACCTGATCTGGGTG
ATCACCAAGATGCAGGTGCTGGTGGACCGCTACCCCGCCTGGGGCGACGTGGTGCAGGTG
GACACCTGGACCTGCTCCTCCGGCAAGAACTCCATGCAGCGCGACTGGTTCGTGCGCGAC
CTGAAGACCGGCGACATCATCACCCGCGCCTCCTCCGTGTGGGTGCTGATGAACCGCCTG
ACCCGCAAGCTGTCCAAGATCCCCGAGGCCGTGCTGGAGGAGGCCAAGCTGTTCGTGATG
AACACCGCCCCCACCGTGGACGACAACCGCAAGCTGCCCAAGCTGGACGGCTCCTCCGCC
GACTACGTGCTGTCCGGCCTGACCCCCCGCTGGTCCGACCTGGACATGAACCAGCACGTG
AACAACGTGAAGTACATCGCCTGGATCCTGGAGTCCGTGCCCCAGTCCATCCCCGAGACC
CACAAGCTGTCCGCCATCACCGTGGAGTACCGCCGCGAGTGCGGCAAGAACTCCGTGCTG
CAGTCCCTGACCAACGTGTCCGGCGACGGCATCACCTGCGGCAACTCCATCATCGAGTGC
CACCACCTGCTGCAGCTGGAGACCGGCCCCGAGATCCTGCTGGCCCGCACCGAGTGGATC
TCCAAGGAGCCCGGCTTCCGCGGCGCCCCCATCCAGGCCGAGAAGGTGTACAACAACAAG
TGA

SEQ ID NO: 13 Cuphea hyssopifolia (Chs) FATB2 amino acid sequence
MVATAASSAFFPVPSPDASSRPGKLGNGSSSLSPLKPKLMANGGLQVKANASAPPKINGSSVG
LKSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDP
FGLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFGRTLEM
YKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASS
VWVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTPKWNDL
DVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKGSGSQFQ
HLLRLEDGGEIVKGRTEWRPKTAGINGPIASGETSPGDSS*
SEQ ID NO: 14 Cuphea hyssopifolia (Chs) FATB2 coding DNA sequence
ATGGTGGCTACCGCTGCAAGTTCAGCATTCTTCCCTGTGCCGTCCCCCGACGCCTCCTCTA
GACCTGGAAAGCTCGGCAATGGGTCATCGAGCTTGAGCCCCCTCAAGCCCAAATTGATGG
CCAATGGCGGGTTGCAGGTTAAGGCAAACGCCAGTGCCCCTCCTAAGATCAATGGTTCTT
CGGTCGGTCTAAAGTCCGGCAGTCTCAAGACTCAGGAAGACACTCCTTCGGCGCCTCCTCC
CCGGACTTTTATTAACCAGCTGCCTGATTGGAGTATGCTTCTTGCTGCAATCACTACTGTCT
TCTTGGCAGCAGAGAAGCAGTGGATGATGCTTGATTGGAAACCCAAGAGGCCTGACATGC
TTGTGGACCCGTTCGGATTGGGAAGGATTGTTCAAGATGGGCTTGTGTTCAGGCAGAATTT
TTCGATTAGGTCCTATGAAATAGGCGCTGATCGCACTGCGTCTATAGAGACGGTGATGAA
CCACTTGCAGGAAACAGCTCTCAATCATGTTAAGAGTGCTGGGCTTCTTAATGACGGCTTT
GGTCGTACTCTTGAGATGTATAAAAGGGACCTTATTTGGGTTGTTGCAAAAATGCAGGTCA
TGGTTAACCGCTATCCTACTTGGGGCGACACGGTTGAAGTGAATACTTGGGTTGCCAAGTC
AGGGAAAAATGGTATGCGTCGTGATTGGCTCATAAGTGATTGCAATACAGGAGAAATTCT
TACTAGAGCATCAAGTGTGTGGGTCATGATGAATCAAAAGACAAGAAGATTGTCAAAAAT
TCCAGATGAGGTTCGACATGAGATAGAGCCTCATTTCGTGGACTCTGCTCCCGTCATTGAA
GATGATGACCGGAAACTTCCCAAGCTGGATGAGAAGACTGCTGACTCCATCCGCAAGGGT
CTAACTCCGAAGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTACATT
GGGTGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGTTCCCTTA
CCCTGGAATATAGGCGGGAATGCGGAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTGTGG
ACCCCTCTGGAAAGGGCTCTGGGTCTCAGTTCCAGCACCTTCTGCGGCTTGAGGATGGAG
GTGAGATTGTGAAGGGGAGAACTGAGTGGCGACCCAAGACTGCAGGAATCAATGGGCCA
ATAGCATCCGGGGAGACCTCACCTGGAGACTCTTCTTAG
SEQ ID NO: 15 Cuphea hyssopifolia (Chs) FATB2 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCCCCGACGCCTCCTCCCG
CCCCGGCAAGCTGGGCAACGGCTCCTCCTCCCTGTCCCCCCTGAAGCCCAAGCTGATGGCC
AACGGCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCCTCC
GTGGGCCTGAAGTCCGGCTCCCTGAAGACCCAGGAGGACACCCCCTCCGCCCCCCCCCCC
CGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGTGT
TCCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACATGC
TGGTGGACCCCTTCGGCCTGGGCCGCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAACTT
CTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATGAA
CCACCTGCAGGAGACCGCCCTGAACCACGTGAAGTCCGCCGGCCTGCTGAACGACGGCTT
CGGCCGCACCCTGGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAGGT
GATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCCAA
GTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGAT
CCTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCCGCCTGTCCAA
GATCCCCGACGAGGTGCGCCACGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTGAT
CGAGGACGACGACCGCAAGCTGCCCAAGCTGGACGAGAAGACCGCCGACTCCATCCGCA
AGGGCCTGACCCCCAAGTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGC
TCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACC
GCCGTGGACCCCTCCGGCAAGGGCTCCGGCTCCCAGTTCCAGCACCTGCTGCGCCTGGAG
GACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGACCGCCGGCATCAA
CGGCCCCATCGCCTCCGGCGAGACCTCCCCCGGCGACTCCTCCTGA
SEQ ID NO: 16 Cuphea hyssopifolia (Chs) FATB2b +a.a.248-259 variant amino acid sequence
MVATAASSAFFPVPSPDASSRPGKLGNGSSSLSPLKPKLMANGGLQVKANASAPPKINGSSVG
LKSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDP
FGLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFGRTLEM
YKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASS
KSQIMLPLHYCSVWVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADS
IRKGLTPKWNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTA
VDPSGKGSGSQFQHLLRLEDGGEIVKGRTEWRPKTAGINGPIASGETSPGDSS*
SEQ ID NO: 17 Cuphea hyssopifolia (Chs) FATB2b +a.a.248-259 variant coding DNA sequence
ATGGTGGCTACCGCTGCAAGTTCAGCATTCTTCCCTGTGCCGTCCCCCGACGCCTCCTCTA
GACCTGGAAAGCTCGGCAATGGGTCATCGAGCTTGAGCCCCCTCAAGCCCAAATTGATGG
CCAATGGCGGGTTGCAGGTTAAGGCAAACGCCAGTGCCCCTCCTAAGATCAATGGTTCTT
CGGTCGGTCTAAAGTCCGGCAGTCTCAAGACTCAGGAAGACACTCCTTCGGCGCCTCCTCC
CCGGACTTTTATTAACCAGCTGCCTGATTGGAGTATGCTTCTTGCTGCAATCACTACTGTCT
TCTTGGCAGCAGAGAAGCAGTGGATGATGCTTGATTGGAAACCCAAGAGGCCTGACATGC
TTGTGGACCCGTTCGGATTGGGAAGGATTGTTCAAGATGGGCTTGTGTTCAGGCAGAATTT
TTCGATTAGGTCCTATGAAATAGGCGCTGATCGCACTGCGTCTATAGAGACGGTGATGAA
CCACTTGCAGGAAACAGCTCTCAATCATGTTAAGAGTGCTGGGCTTCTTAATGACGGCTTT
GGTCGTACTCTTGAGATGTATAAAAGGGACCTTATTTGGGTTGTTGCAAAAATGCAGGTCA
TGGTTAACCGCTATCCTACTTGGGGCGACACGGTTGAAGTGAATACTTGGGTTGCCAAGTC
AGGGAAAAATGGTATGCGTCGTGATTGGCTCATAAGTGATTGCAATACAGGAGAAATTCT
TACTAGAGCATCAAGTAAAAGCCAAATTATGTTACCCTTACATTATTGCAGTGTGTGGGTC
ATGATGAATCAAAAGACAAGAAGATTGTCAAAAATTCCAGATGAGGTTCGACATGAGATA
GAGCCTCATTTCGTGGACTCTGCTCCCGTCATTGAAGATGATGACCGGAAACTTCCCAAGC
TGGATGAGAAGACTGCTGACTCCATCCGCAAGGGTCTAACTCCGAAGTGGAATGACTTGG
ATGTCAATCAGCACGTCAACAACGTGAAGTACATTGGGTGGATTCTTGAGAGTACTCCAC
CAGAAGTTCTGGAGACCCAGGAGTTATGTTCCCTTACCCTGGAATATAGGCGGGAATGCG
GAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTGTGGACCCCTCTGGAAAGGGCTCTGGGT
CTCAGTTCCAGCACCTTCTGCGGCTTGAGGATGGAGGTGAGATTGTGAAGGGGAGAACTG
AGTGGCGACCCAAGACTGCAGGAATCAATGGGCCAATAGCATCCGGGGAGACCTCACCTG
GAGACTCTTCTTAG
SEQ ID NO: 18 Cuphea hyssopifolia (Chs) FATB2b +a.a.248-259 variant coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCCCCGACGCCTCCTCCCG
CCCCGGCAAGCTGGGCAACGGCTCCTCCTCCCTGTCCCCCCTGAAGCCCAAGCTGATGGCC
AACGGCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCCTCC
GTGGGCCTGAAGTCCGGCTCCCTGAAGACCCAGGAGGACACCCCCTCCGCCCCCCCCCCC
CGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGTGT
TCCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACATGC
TGGTGGACCCCTTCGGCCTGGGCCGCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAACTT
CTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATGAA
CCACCTGCAGGAGACCGCCCTGAACCACGTGAAGTCCGCCGGCCTGCTGAACGACGGCTT
CGGCCGCACCCTGGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAGGT
GATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCCAA
GTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGAT
CCTGACCCGCGCCTCCTCCAAGTCCCAGATCATGCTGCCCCTGCACTACTGCTCCGTGTGG
GTGATGATGAACCAGAAGACCCGCCGCCTGTCCAAGATCCCCGACGAGGTGCGCCACGAG
ATCGAGCCCCACTTCGTGGACTCCGCCCCCGTGATCGAGGACGACGACCGCAAGCTGCCC
AAGCTGGACGAGAAGACCGCCGACTCCATCCGCAAGGGCCTGACCCCCAAGTGGAACGA
CCTGGACGTGAACCAGCACGTGAACAACGTGAAGTACATCGGCTGGATCCTGGAGTCCAC
CCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGCTCCCTGACCCTGGAGTACCGCCGCGA
GTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACCGCCGTGGACCCCTCCGGCAAGGGCTC
CGGCTCCCAGTTCCAGCACCTGCTGCGCCTGGAGGACGGCGGCGAGATCGTGAAGGGCCG
CACCGAGTGGCGCCCCAAGACCGCCGGCATCAACGGCCCCATCGCCTCCGGCGAGACCTC
CCCCGGCGACTCCTCCTGA
SEQ ID NO: 19 Cuphea hyssopifolia (Chs) FATB3 amino acid sequence
MVAAEASSALFSVRTPGTSPKPGKFGNWPTSLSVPFKSKSNHNGGFQVKANASARPKANGSA
VSLKSGSLDTQEDTSSSSSPPRTFINQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLM
DPFGVDRVVQDGAVFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEM
CKRDLIWVVTKMHVEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDCHTGEILIRATSMC
AMMNQKTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRWNDLDV
NQHVNNVKYIGWILESVPTEVFETQELCGLTLEYRRECGRDSVLESVTAMDPSKEGDRSLYQH
LLRLEDGADIAKGRTKWRPKNAGTNGAISTGKTSNGNSIS*
SEQ ID NO: 20 Cuphea hyssopifolia (Chs) FATB3 coding DNA sequence
ATGGTGGCTGCCGAAGCAAGTTCTGCACTCTTCTCCGTTCGAACCCCGGGAACCTCCCCTA
AACCCGGGAAGTTCGGGAATTGGCCAACGAGCTTGAGCGTCCCCTTCAAGTCCAAATCAA
ACCACAATGGCGGCTTTCAGGTTAAGGCAAACGCCAGTGCCCGTCCTAAGGCTAACGGTT
CTGCAGTAAGTCTAAAGTCTGGCAGCCTCGACACTCAGGAGGACACTTCATCGTCGTCCTC
TCCTCCTCGGACTTTCATTAACCAGTTGCCCGACTGGAGTATGCTGCTGTCCGCGATCACG
ACCGTCTTCGTGGCGGCTGAGAAGCAGTGGACGATGCTTGATCGGAAATCTAAGAGGCCC
GACATGCTCATGGACCCGTTTGGGGTTGACAGGGTTGTTCAGGATGGGGCTGTGTTCAGA
CAGAGTTTTTCGATTAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAGACG
CTGATGAACATCTTCCAGGAAACATCTCTCAATCATTGTAAGAGTATCGGTCTTCTCAATG
ACGGCTTTGGTCGTACTCCTGAGATGTGTAAGAGGGACCTCATTTGGGTGGTTACAAAAAT
GCACGTCGAGGTTAATCGCTATCCTACTTGGGGTGATACTATCGAGGTCAATACTTGGGTC
TCCGAGTCGGGGAAAACCGGTATGGGTCGTGATTGGCTGATAAGTGATTGTCATACAGGA
GAAATTCTAATAAGAGCAACGAGCATGTGTGCTATGATGAATCAAAAGACGAGAAGATTC
TCAAAATTTCCATATGAGGTTCGACAGGAGTTGGCGCCTCATTTTGTGGACTCTGCTCCTG
TCATTGAAGACTATCAAAAATTGCACAAGCTTGATGTGAAGACGGGTGATTCCATTTGCA
ATGGCCTAACTCCAAGGTGGAATGACTTGGATGTCAATCAGCACGTTAACAATGTGAAGT
ACATTGGGTGGATTCTCGAGAGTGTTCCAACGGAAGTTTTCGAGACCCAGGAGCTATGTG
GCCTCACCCTTGAGTATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCCGTGACCG
CTATGGATCCATCAAAAGAGGGAGACAGATCTCTGTACCAGCACCTTCTTCGGCTTGAGG
ATGGGGCTGATATCGCGAAGGGCAGAACCAAGTGGCGGCCGAAGAATGCAGGAACCAAT
GGGGCAATATCAACAGGAAAGACTTCAAATGGAAACTCGATCTCTTAG
SEQ ID NO: 21 Cuphea hyssopifolia (Chs) FATB3 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGAGGCCTCCTCCGCCCTGTTCTCCGTGCGCACCCCCGGCACCTCCCCCA
AGCCCGGCAAGTTCGGCAACTGGCCCACCTCCCTGTCCGTGCCCTTCAAGTCCAAGTCCAA
CCACAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCGCCCCAAGGCCAACGGCTC
CGCCGTGTCCCTGAAGTCCGGCTCCCTGGACACCCAGGAGGACACCTCCTCCTCCTCCTCC
CCCCCCCGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGTCCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGATGGACCCCTTCGGCGTGGACCGCGTGGTGCAGGACGGCGCCGTGTTCCGCC
AGTCCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACATCTTCCAGGAGACCTCCCTGAACCACTGCAAGTCCATCGGCCTGCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGAT
GCACGTGGAGGTGAACCGCTACCCCACCTGGGGCGACACCATCGAGGTGAACACCTGGGT
GTCCGAGTCCGGCAAGACCGGCATGGGCCGCGACTGGCTGATCTCCGACTGCCACACCGG
CGAGATCCTGATCCGCGCCACCTCCATGTGCGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGTTCCCCTACGAGGTGCGCCAGGAGCTGGCCCCCCACTTCGTGGACTCCGCCCCC
GTGATCGAGGACTACCAGAAGCTGCACAAGCTGGACGTGAAGACCGGCGACTCCATCTGC
AACGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAA
GTACATCGGCTGGATCCTGGAGTCCGTGCCCACCGAGGTGTTCGAGACCCAGGAGCTGTG
CGGCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTGAC
CGCCATGGACCCCTCCAAGGAGGGCGACCGCTCCCTGTACCAGCACCTGCTGCGCCTGGA
GGACGGCGCCGACATCGCCAAGGGCCGCACCAAGTGGCGCCCCAAGAACGCCGGCACCA
ACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCATCTCCTGA
SEQ ID NO: 22 Cuphea hyssopifolia (Chs) FATB3b (V204I, C239F, E243D, M251V variant) amino acid sequence
MVAAEASSALFSVRTPGTSPKPGKFGNWPTSLSVPFKSKSNHNGGFQVKANASARPKANGSA
VSLKSGSLDTQEDTSSSSSPPRTFINQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLM
DPFGVDRVVQDGAVFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEM
CKRDLIWVVTKMHIEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDFHTGDILIRATSVC
AMMNQKTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRWNDLDV
NQHVNNVKYIGWILESVPTEVFETQELCGLTLEYRRECGRDSVLESVTAMDPSKEGDRSLYQH
LLRLEDGADIAKGRTKWRPKNAGTNGAISTGKTSNGNSIS*
SEQ ID NO: 23 Cuphea hyssopifolia (Chs) FATB3b (V204I, C239F, E243D, M251V variant) coding DNA sequence
ATGGTGGCTGCCGAAGCAAGTTCTGCACTCTTCTCCGTTCGAACCCCGGGAACCTCCCCTA
AACCCGGGAAGTTCGGGAATTGGCCAACGAGCTTGAGCGTCCCCTTCAAGTCCAAATCAA
ACCACAATGGCGGCTTTCAGGTTAAGGCAAACGCCAGTGCCCGTCCTAAGGCTAACGGTT
CTGCAGTAAGTCTAAAGTCTGGCAGCCTCGACACTCAGGAGGACACTTCATCGTCGTCCTC
TCCTCCTCGGACTTTCATTAACCAGTTGCCCGACTGGAGTATGCTGCTGTCCGCGATCACG
ACCGTCTTCGTGGCGGCTGAGAAGCAGTGGACGATGCTTGATCGGAAATCTAAGAGGCCC
GACATGCTCATGGACCCGTTTGGGGTTGACAGGGTTGTTCAGGATGGGGCTGTGTTCAGA
CAGAGTTTTTCGATTAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAGACG
CTGATGAACATCTTCCAGGAAACATCTCTCAATCATTGTAAGAGTATCGGTCTTCTCAATG
ACGGCTTTGGTCGTACTCCTGAGATGTGTAAGAGGGACCTCATTTGGGTGGTTACAAAAAT
GCACATCGAGGTTAATCGCTATCCTACTTGGGGTGATACTATCGAGGTCAATACTTGGGTC
TCCGAGTCGGGGAAAACCGGTATGGGTCGTGATTGGCTGATAAGTGATTTTCATACAGGA
GACATTCTAATAAGAGCAACGAGCGTGTGTGCTATGATGAATCAAAAGACGAGAAGATTC
TCAAAATTTCCATATGAGGTTCGACAGGAGTTAGCGCCTCATTTTGTGGACTCTGCTCCAG
TCATTGAAGACTATCAAAAATTGCACAAGCTTGATGTGAAGACGGGTGATTCCATTTGCA
ATGGCCTAACTCCAAGGTGGAATGACTTGGATGTCAATCAGCACGTTAACAATGTGAAGT
ACATTGGGTGGATTCTCGAGAGTGTTCCAACGGAAGTTTTCGAGACCCAGGAGCTATGTG
GCCTCACCCTTGAGTATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCCGTGACCG
CTATGGATCCCTCAAAAGAGGGAGACAGATCTCTGTACCAGCACCTTCTTCGGCTTGAGG
ATGGGGCTGATATCGCGAAGGGCAGAACCAAGTGGCGGCCGAAGAATGCAGGAACCAAT
GGGGCAATATCAACAGGAAAGACTTCAAATGGAAACTCGATCTCTTAG
SEQ ID NO: 24 Cuphea hyssopifolia (Chs) FATB3b (V204I, C239F, E243D, M251V variant) coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGAGGCCTCCTCCGCCCTGTTCTCCGTGCGCACCCCCGGCACCTCCCCCA
AGCCCGGCAAGTTCGGCAACTGGCCCACCTCCCTGTCCGTGCCCTTCAAGTCCAAGTCCAA
CCACAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCGCCCCAAGGCCAACGGCTC
CGCCGTGTCCCTGAAGTCCGGCTCCCTGGACACCCAGGAGGACACCTCCTCCTCCTCCTCC
CCCCCCCGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGTCCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGATGGACCCCTTCGGCGTGGACCGCGTGGTGCAGGACGGCGCCGTGTTCCGCC
AGTCCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACATCTTCCAGGAGACCTCCCTGAACCACTGCAAGTCCATCGGCCTGCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGAT
GCACATCGAGGTGAACCGCTACCCCACCTGGGGCGACACCATCGAGGTGAACACCTGGGT
GTCCGAGTCCGGCAAGACCGGCATGGGCCGCGACTGGCTGATCTCCGACTTCCACACCGG
CGACATCCTGATCCGCGCCACCTCCGTGTGCGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGTTCCCCTACGAGGTGCGCCAGGAGCTGGCCCCCCACTTCGTGGACTCCGCCCCC
GTGATCGAGGACTACCAGAAGCTGCACAAGCTGGACGTGAAGACCGGCGACTCCATCTGC
AACGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAA
GTACATCGGCTGGATCCTGGAGTCCGTGCCCACCGAGGTGTTCGAGACCCAGGAGCTGTG
CGGCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTGAC
CGCCATGGACCCCTCCAAGGAGGGCGACCGCTCCCTGTACCAGCACCTGCTGCGCCTGGA
GGACGGCGCCGACATCGCCAAGGGCCGCACCAAGTGGCGCCCCAAGAACGCCGGCACCA
ACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCATCTCCTGA
SEQ ID NO: 25 Cuphea PSR23 (Cu) FATB3 amino acid sequence
MVVAAATSAFFPVPAPGTSPKPGKSGNWPSSLSPTFKPKSIPNAGFQVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKCIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPTWGDTVEINTWFSQSGKIGMASDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDQKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVSNVKYIGWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAVDPSENGGRSQYK
HLLRLEDGTDIVKSRTEWRPKNAGTNGAISTSTAKTSNGNSVS*
SEQ ID NO: 26 Cuphea PSR23 (Cu) FATB3 coding DNA sequence
ATGGTGGTGGCTGCAGCAACTTCTGCATTCTTCCCCGTTCCAGCCCCGGGAACCTCCCCTA
AACCCGGGAAGTCCGGCAACTGGCCATCGAGCTTGAGCCCTACCTTCAAGCCCAAGTCAA
TCCCCAATGCCGGATTTCAGGTTAAGGCAAATGCCAGTGCCCATCCTAAGGCTAACGGTTC
TGCAGTAAATCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCTCCT
CCCCGGGCTTTCCTTAACCAGTTGCCTGATTGGAGTATGCTTCTGACTGCAATCACGACCG
TCTTCGTGGCGGCAGAGAAGCAGTGGACTATGCTTGATAGGAAATCTAAGAGGCCTGACA
TGCTCGTGGACTCGGTTGGGTTGAAGTGTATTGTTCGGGATGGGCTCGTGTCCAGACAGAG
TTTTTTGATTAGATCTTATGAAATAGGCGCTGATCGAACAGCCTCTATAGAGACGCTGATG
AACCACTTGCAGGAAACATCTATCAATCATTGTAAGAGTTTGGGTCTTCTCAATGACGGCT
TTGGTCGTACTCCTGGGATGTGTAAAAACGACCTCATTTGGGTGCTTACAAAAATGCAGAT
CATGGTGAATCGCTACCCAACTTGGGGCGATACTGTTGAGATCAATACCTGGTTCTCTCAG
TCGGGGAAAATCGGTATGGCTAGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATT
CTTATAAGAGCAACGAGCGTGTGGGCTATGATGAATCAAAAGACGAGAAGATTCTCAAGA
CTTCCATACGAGGTTCGCCAGGAGTTAACGCCTCATTTTGTGGACTCTCCTCATGTCATTG
AAGACAATGATCAGAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTCGCAAGG
GTCTAACTCCGAGGTGGAACGACTTGGATGTGAATCAGCACGTAAGCAACGTGAAGTACA
TTGGGTGGATTCTCGAGAGTATGCCAATAGAAGTTTTGGAGACACAGGAGCTATGCTCTCT
CACCGTAGAATATAGGCGGGAATGCGGAATGGACAGTGTGCTGGAGTCCGTGACTGCTGT
GGATCCCTCAGAAAATGGAGGCCGGTCTCAGTACAAGCACCTTCTGCGGCTTGAGGATGG
GACTGATATCGTGAAGAGCAGAACTGAGTGGCGACCGAAGAATGCAGGAACTAACGGGG
CGATATCAACATCAACAGCAAAGACTTCAAATGGAAACTCGGTCTCTTAG
SEQ ID NO: 27 Cuphea PSR23 (Cu) FATB3 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGTGGCCGCCGCCACCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCTCCCCCA
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCACCTTCAAGCCCAAGTCCAT
CCCCAACGCCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTTCCTGAACCAGCTGCCCGACTGGTCCATGCTGCTGACCGCCATCACCACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGTGCATCGTGCGCGACGGCCTGGTGTCCCGCCAGT
CCTTCCTGATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGAT
GAACCACCTGCAGGAGACCTCCATCAACCACTGCAAGTCCCTGGGCCTGCTGAACGACGG
CTTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCA
GATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTC
CCAGTCCGGCAAGATCGGCATGGCCTCCGACTGGCTGATCTCCGACTGCAACACCGGCGA
GATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCC
CGCCTGCCCTACGAGGTGCGCCAGGAGCTGACCCCCCACTTCGTGGACTCCCCCCACGTG
ATCGAGGACAACGACCAGAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCATGCCCATCGAGGTGCTGGAGACCCAGGAGCTGTGC
TCCCTGACCGTGGAGTACCGCCGCGAGTGCGGCATGGACTCCGTGCTGGAGTCCGTGACC
GCCGTGGACCCCTCCGAGAACGGCGGCCGCTCCCAGTACAAGCACCTGCTGCGCCTGGAG
GACGGCACCGACATCGTGAAGTCCCGCACCGAGTGGCGCCCCAAGAACGCCGGCACCAA
CGGCGCCATCTCCACCTCCACCGCCAAGACCTCCAACGGCAACTCCGTGTCCTGA
SEQ ID NO: 28 Cuphea wrightii (Cw) FATB3 amino acid sequence
MVVAAAASSAFFPVPAPRTTPKPGKFGNWPSSLSPPFKPKSNPNGRFQVKANVSPHPKANGSA
VSLKSGSLNTLEDPPSSPPPRTFLNQLPDWSRLRTAITTVFVAAEKQFTRLDRKSKRPDMLVDW
FGSETIVQDGLVFRERFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFGRTSEMC
TRDLIWVLTKMQIVVNRYPTWGDTVEINSWFSQSGKIGMGRDWLISDCNTGEILVRATSAWA
MMNQKTRRFSKLPCEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPGWNDLDV
NQHVSNVKYIGWILESMPTEVLETQELCSLTLEYRRECGRESVVESVTSMNPSKVGDRSQYQH
LLRLEDGADIMKGRTEWRPKNAGTNRAIST*
SEQ ID NO: 29 Cuphea wrightii (Cw) FATB3 coding DNA sequence
ATGGTGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGTTCCAGCACCTAGAACCACGC
CTAAACCCGGGAAGTTCGGCAATTGGCCATCGAGCTTGAGCCCGCCCTTCAAGCCCAAGT
CAAACCCCAATGGTAGATTTCAGGTTAAGGCAAATGTCAGTCCTCATCCTAAGGCTAACG
GTTCTGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTCTGGAGGACCCTCCGTCGTCCCC
TCCTCCTCGGACTTTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCGGACTGCAATCACG
ACCGTCTTCGTGGCGGCAGAGAAGCAGTTCACTAGGCTCGATCGAAAATCTAAGAGGCCT
GACATGCTAGTGGACTGGTTTGGGTCAGAGACTATTGTTCAGGATGGGCTCGTGTTCAGA
GAGAGATTTTCGATCAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAGACG
CTGATGAACCACTTGCAGGACACATCTCTGAATCATTGTAAGAGTGTGGGTCTTCTCAATG
ACGGCTTTGGTCGTACCTCGGAGATGTGTACAAGAGACCTCATTTGGGTGCTTACAAAAAT
GCAGATCGTGGTGAATCGCTATCCAACTTGGGGCGATACTGTCGAGATCAATAGCTGGTT
CTCCCAGTCGGGGAAAATCGGTATGGGTCGCGATTGGCTAATAAGTGATTGCAACACAGG
AGAAATTCTTGTAAGAGCAACGAGCGCTTGGGCCATGATGAATCAAAAGACGAGAAGATT
CTCAAAACTTCCATGCGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTGGACGCTCCTCCT
GTCATTGAAGACAATGATCGGAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTT
GCAAGGGTCTAACTCCGGGGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGA
AGTACATTGGGTGGATTCTCGAGAGTATGCCTACAGAAGTTTTGGAGACCCAGGAGCTAT
GCTCTCTCACCCTTGAATATAGGCGGGAATGTGGAAGGGAAAGTGTGGTAGAGTCCGTGA
CCTCTATGAATCCCTCAAAAGTTGGAGACCGGTCTCAGTACCAACACCTTCTGCGGCTTGA
GGATGGGGCTGATATCATGAAGGGCAGAACTGAGTGGAGACCAAAGAATGCAGGAACCA
ACCGGGCGATATCAACATGA
SEQ ID NO: 30 Cuphea wrightii (Cw) FATB3 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCCCCCGCACCACCC
CCAAGCCCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCCCCCCCTTCAAGCCCAAGTC
CAACCCCAACGGCCGCTTCCAGGTGAAGGCCAACGTGTCCCCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCTGGAGGACCCCCCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCGCACCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGTTCACCCGCCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGGTGGACTGGTTCGGCTCCGAGACCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGCGCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACCACCTGCAGGACACCTCCCTGAACCACTGCAAGTCCGTGGGCCTGCTGAACGA
CGGCTTCGGCCGCACCTCCGAGATGTGCACCCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCGTGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCGACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGGTGCGCGCCACCTCCGCCTGGGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGCTGCCCTGCGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CTGCAAGGGCCTGACCCCCGGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCACCGAGGTGCTGGAGACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGGTGGAGTCCGT
GACCTCCATGAACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCGCCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACCGCGCCATCTCCACCTGA
SEQ ID NO: 31 Cuphea wrightii (Cw) FATB4a amino acid sequence
MVATAASSAFFPVPSADTSSSRPGKLGSGPSSLSPLKPKSIPNGGLQVKANASAPPKINGSSVGL
KSGGFKTQEDSPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDPF
GLGSIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKIAGLSNDGFGRTPEMYK
RDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSVW
VMMNQKTRRLSKIPDEVRNEIEPHFVDSAPVVEDDDRKLPKLDENTADSIRKGLTPRWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSAEGYASRFQH
LLRLEDGGEIVKARTEWRPKNAGINGVVPSEESSPGDFF*
SEQ ID NO: 32 Cuphea wrightii (Cw) FATB4a coding DNA sequence
TTGGTGGCTACCGCTGCAAGTTCTGCATTTTTCCCCGTGCCATCCGCCGACACCTCCTCCTC
GAGACCCGGAAAGCTCGGCAGTGGACCATCGAGCTTGAGCCCCCTCAAGCCCAAATCGAT
CCCCAATGGCGGCTTGCAGGTTAAGGCAAACGCCAGTGCCCCTCCTAAGATCAATGGTTC
CTCGGTCGGTCTAAAGTCGGGCGGTTTCAAGACTCAGGAAGACTCTCCTTCGGCCCCTCCT
CCGCGGACTTTTATCAACCAGTTGCCTGATTGGAGTATGCTTCTTGCTGCAATCACTACTG
TCTTCTTGGCTGCAGAGAAGCAGTGGATGATGCTTGATTGGAAACCTAAGAGGCCTGACA
TGCTCGTGGACCCGTTCGGATTGGGAAGTATTGTTCAGGATGGGCTTGTGTTCAGGCAGAA
TTTTTCAATTAGGTCCTACGAAATAGGCGCCGATCGAACTGCGTCTATAGAGACGGTGATG
AACCATTTGCAGGAAACAGCTCTCAATCATGTCAAGATTGCTGGGCTTTCTAATGACGGCT
TTGGTCGTACTCCTGAGATGTATAAAAGAGACCTTATTTGGGTTGTTGCAAAAATGCAGGT
CATGGTTAACCGCTATCCTACTTGGGGTGACACGGTTGAAGTGAATACTTGGGTTGCCAAG
TCAGGGAAAAATGGTATGCGTCGTGACTGGCTCATAAGTGATTGCAATACTGGAGAGATT
CTTACAAGAGCATCAAGCGTGTGGGTCATGATGAATCAAAAGACAAGAAGATTGTCAAAA
ATTCCAGATGAGGTTCGAAATGAGATAGAGCCTCATTTTGTGGACTCTGCTCCCGTCGTTG
AAGATGATGATCGGAAACTTCCCAAGCTGGATGAGAACACTGCTGACTCCATCCGCAAGG
GTCTAACTCCGAGGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTACA
TCGGATGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGCTCCCT
GACCCTGGAATACAGGCGGGAATGTGGAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTGT
CGACCCGTCTGCAGAGGGCTATGCGTCCCGGTTTCAGCACCTTCTGCGGCTTGAGGATGGA
GGTGAGATCGTGAAGGCGAGAACTGAGTGGCGACCCAAGAATGCTGGAATCAATGGGGT
GGTACCATCCGAGGAGTCCTCACCTGGAGACTTCTTTTAG

SEQ ID NO: 33 Cuphea wrightii (Cw) FATB4a coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCGCCGACACCTCCTCCTC
CCGCCCCGGCAAGCTGGGCTCCGGCCCCTCCTCCCTGTCCCCCCTGAAGCCCAAGTCCATC
CCCAACGGCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCC
TCCGTGGGCCTGAAGTCCGGCGGCTTCAAGACCCAGGAGGACTCCCCCTCCGCCCCCCCC
CCCCGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCG
TGTTCCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACA
TGCTGGTGGACCCCTTCGGCCTGGGCTCCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAA
CTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATG
AACCACCTGCAGGAGACCGCCCTGAACCACGTGAAGATCGCCGGCCTGTCCAACGACGGC
TTCGGCCGCACCCCCGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAG
GTGATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCC
AAGTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAG
ATCCTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCCGCCTGTCC
AAGATCCCCGACGAGGTGCGCAACGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTG
GTGGAGGACGACGACCGCAAGCTGCCCAAGCTGGACGAGAACACCGCCGACTCCATCCG
CAAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGA
AGTACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGT
GCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGA
CCGCCGTGGACCCCTCCGCCGAGGGCTACGCCTCCCGCTTCCAGCACCTGCTGCGCCTGGA
GGACGGCGGCGAGATCGTGAAGGCCCGCACCGAGTGGCGCCCCAAGAACGCCGGCATCA
ACGGCGTGGTGCCCTCCGAGGAGTCCTCCCCCGGCGACTTCTTCTGA
SEQ ID NO: 34 Cuphea wrightii (Cw) FATB4b amino acid sequence
MVATAASSAFFPVPSADTSSSRPGKLGNGPSSLSPLKPKSIPNGGLQVKANASAPPKINGSSVGL
KSGSFKTQEDAPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDPF
GLGSIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKIAGLSSDGFGRTPAMSK
RDLIWVVAKMQVMVNRYPAWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSV
WVMMNQKTRRLSKIPDEVRNEIEPHFVDSAPVVEDDDRKLPKLDENTADSIRKGLTPRWNDL
DVNQHVNNVKYIGWILESTPAEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGEGDGSKF
QHLLRLEDGGEIVKARTEWRPKNAGINGVVPSEESSPGGDFF*
SEQ ID NO: 35 Cuphea wrightii (Cw) FATB4b coding DNA sequence
TTGGTGGCTACCGCTGCAAGTTCTGCATTTTTCCCCGTACCATCCGCCGACACCTCCTCATC
GAGACCCGGAAAGCTCGGCAATGGGCCATCGAGCTTGAGCCCCCTCAAGCCGAAATCGAT
CCCCAATGGCGGGTTGCAGGTTAAGGCAAACGCCAGTGCCCCTCCTAAGATCAATGGTTC
CTCGGTCGGTCTGAAGTCGGGCAGTTTCAAGACTCAGGAAGACGCTCCTTCGGCCCCTCCT
CCTCGGACTTTTATCAACCAGTTGCCTGATTGGAGTATGCTTCTTGCTGCAATCACTACTGT
CTTCTTGGCTGCAGAGAAGCAGTGGATGATGCTTGATTGGAAACCTAAGAGGCCTGACAT
GCTTGTCGACCCGTTCGGATTGGGAAGTATTGTTCAGGATGGGCTTGTTTTCAGGCAGAAT
TTCTCGATTAGGTCCTACGAAATAGGCGCTGATCGCACTGCGTCTATAGAGACGGTGATG
AACCATTTGCAGGAAACAGCTCTCAATCATGTTAAGATTGCTGGGCTTTCTAGTGATGGCT
TTGGTCGTACTCCTGCGATGTCTAAACGGGACCTCATTTGGGTTGTTGCGAAAATGCAGGT
CATGGTTAACCGCTACCCTGCTTGGGGTGACACGGTTGAAGTGAATACTTGGGTTGCCAA
GTCAGGGAAAAATGGTATGCGTCGTGACTGGCTCATAAGTGATTGCAACACTGGAGAGAT
TCTTACAAGAGCATCAAGCGTGTGGGTCATGATGAATCAAAAGACAAGAAGATTGTCAAA
AATTCCAGATGAGGTTCGAAATGAGATAGAGCCTCATTTTGTGGACTCTGCGCCCGTCGTT
GAAGACGATGACCGGAAACTTCCCAAGCTGGATGAGAACACTGCTGACTCCATCCGCAAG
GGTCTAACTCCGAGGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTAC
ATTGGGTGGATTCTTGAGAGTACTCCAGCAGAAGTTCTGGAGACCCAGGAATTATGTTCCC
TGACCCTGGAATACAGGCGGGAATGTGGAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTG
TAGATCCGTCTGGAGAGGGCGATGGGTCCAAGTTCCAGCACCTTCTGCGGCTTGAGGATG
GAGGTGAGATCGTGAAGGCGAGAACTGAGTGGCGACCAAAGAATGCTGGAATCAATGGG
GTGGTACCATCCGAGGAGTCCTCACCTGGTGGAGACTTCTTTTAA
SEQ ID NO: 36 Cuphea wrightii (Cw) FATB4b coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCGCCGACACCTCCTCCTC
CCGCCCCGGCAAGCTGGGCAACGGCCCCTCCTCCCTGTCCCCCCTGAAGCCCAAGTCCATC
CCCAACGGCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCC
TCCGTGGGCCTGAAGTCCGGCTCCTTCAAGACCCAGGAGGACGCCCCCTCCGCCCCCCCCC
CCCGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGT
GTTCCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACAT
GCTGGTGGACCCCTTCGGCCTGGGCTCCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAA
CTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATG
AACCACCTGCAGGAGACCGCCCTGAACCACGTGAAGATCGCCGGCCTGTCCTCCGACGGC
TTCGGCCGCACCCCCGCCATGTCCAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAG
GTGATGGTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCC
AAGTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAG
ATCCTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCCGCCTGTCC
AAGATCCCCGACGAGGTGCGCAACGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTG
GTGGAGGACGACGACCGCAAGCTGCCCAAGCTGGACGAGAACACCGCCGACTCCATCCG
CAAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGA
AGTACATCGGCTGGATCCTGGAGTCCACCCCCGCCGAGGTGCTGGAGACCCAGGAGCTGT
GCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGA
CCGCCGTGGACCCCTCCGGCGAGGGCGACGGCTCCAAGTTCCAGCACCTGCTGCGCCTGG
AGGACGGCGGCGAGATCGTGAAGGCCCGCACCGAGTGGCGCCCCAAGAACGCCGGCATC
AACGGCGTGGTGCCCTCCGAGGAGTCCTCCCCCGGCGGCGACTTCTTCTGA
SEQ ID NO: 37 Cuphea wrightii (Cw) FATB5 amino acid sequence
MVAAAASSAFFSVPTPGTPPKPGKFGNWPSSLSVPFKPDNGGFHVKANASAHPKANGSAVNL
KSGSLETPPRSFINQLPDLSVLLSKITTVFGAAEKQWKRPGMLVEPFGVDRIFQDGVFFRQSFSI
RSYEIGVDRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCKRDLIWVVTKIQVEVNRYP
TWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMMNQNTRRLSKFPYEVRQE
IAPHFVDSAPVIEDDQKLQKLDVKTGDSIRDGLTPRWNDLDVNQHVNNVKYIGWILKSVPIEV
FETQELCGVTLEYRRECGRDSVLESVTAMDPAKEGDRCVYQHLLRLEDGADITIGRTEWRPK
NAGANGAMSSGKTSNGNCLIEGRGWQPFRVVRLIF*
SEQ ID NO: 38 Cuphea wrightii (Cw) FATB5 coding DNA sequence
ATGGTGGCTGCCGCAGCAAGTTCTGCATTCTTCTCTGTTCCAACCCCGGGAACGCCCCCTA
AACCCGGGAAGTTCGGTAACTGGCCATCGAGCTTGAGCGTCCCCTTCAAGCCCGACAATG
GTGGCTTTCATGTCAAGGCAAACGCCAGTGCCCATCCTAAGGCTAATGGTTCTGCGGTAA
ATCTAAAGTCTGGCAGCCTCGAGACTCCTCCTCGGAGTTTCATTAACCAGCTGCCGGACTT
GAGTGTGCTTCTGTCCAAAATCACGACTGTCTTCGGGGCGGCTGAGAAGCAGTGGAAGAG
GCCCGGCATGCTCGTGGAACCGTTTGGGGTTGACAGGATTTTTCAGGATGGTGTTTTTTTC
AGACAGAGTTTTTCTATCAGGTCTTACGAAATAGGCGTTGATCGAACAGCCTCGATAGAG
ACACTGATGAACATCTTCCAGGAAACATCTTTGAATCATTGCAAGAGTATCGGTCTTCTCA
ACGATGGCTTTGGTCGTACTCCTGAGATGTGTAAGAGGGACCTCATTTGGGTGGTTACGAA
AATTCAGGTCGAGGTGAATCGCTATCCTACTTGGGGTGACACTATCGAAGTCAATACTTGG
GTCTCGGAGTCGGGGAAAAACGGTATGGGTCGGGATTGGCTGATAAGTGATTGCCGTACT
GGAGAGATTCTTATAAGAGCAACGAGCGTGTGGGCGATGATGAATCAAAACACGAGAAG
ATTGTCAAAATTTCCATATGAGGTTCGACAGGAGATAGCGCCTCATTTTGTGGACTCTGCT
CCTGTCATTGAAGACGATCAAAAGTTGCAGAAGCTTGATGTGAAGACAGGTGATTCCATT
CGCGATGGTCTAACTCCGAGATGGAATGACTTGGATGTCAATCAACACGTTAACAATGTG
AAGTACATTGGATGGATTCTCAAGAGTGTTCCAATAGAAGTTTTCGAGACACAGGAGCTA
TGCGGCGTCACACTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCAGTG
ACCGCTATGGATCCAGCAAAAGAGGGAGACCGGTGTGTGTACCAGCACCTTCTTCGGCTT
GAGGATGGAGCTGATATCACTATAGGCAGAACCGAGTGGCGGCCGAAGAATGCAGGAGC
CAATGGTGCAATGTCATCAGGAAAGACTTCAAATGGAAACTGTCTCATAGAAGGAAGGGG
TTGGCAACCTTTCCGAGTTGTGCGTTTAATTTTCTGA
SEQ ID NO: 39 Cuphea wrightii (Cw) FATB5 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCTCCGTGCCCACCCCCGGCACCCCCCCCA
AGCCCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCGTGCCCTTCAAGCCCGACAACGG
CGGCTTCCACGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTCCGCCGTGAA
CCTGAAGTCCGGCTCCCTGGAGACCCCCCCCCGCTCCTTCATCAACCAGCTGCCCGACCTG
TCCGTGCTGCTGTCCAAGATCACCACCGTGTTCGGCGCCGCCGAGAAGCAGTGGAAGCGC
CCCGGCATGCTGGTGGAGCCCTTCGGCGTGGACCGCATCTTCCAGGACGGCGTGTTCTTCC
GCCAGTCCTTCTCCATCCGCTCCTACGAGATCGGCGTGGACCGCACCGCCTCCATCGAGAC
CCTGATGAACATCTTCCAGGAGACCTCCCTGAACCACTGCAAGTCCATCGGCCTGCTGAAC
GACGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAG
ATCCAGGTGGAGGTGAACCGCTACCCCACCTGGGGCGACACCATCGAGGTGAACACCTGG
GTGTCCGAGTCCGGCAAGAACGGCATGGGCCGCGACTGGCTGATCTCCGACTGCCGCACC
GGCGAGATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAACACCCGCCGC
CTGTCCAAGTTCCCCTACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACTCCGCCC
CCGTGATCGAGGACGACCAGAAGCTGCAGAAGCTGGACGTGAAGACCGGCGACTCCATC
CGCGACGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTG
AAGTACATCGGCTGGATCCTGAAGTCCGTGCCCATCGAGGTGTTCGAGACCCAGGAGCTG
TGCGGCGTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTG
ACCGCCATGGACCCCGCCAAGGAGGGCGACCGCTGCGTGTACCAGCACCTGCTGCGCCTG
GAGGACGGCGCCGACATCACCATCGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCGC
CAACGGCGCCATGTCCTCCGGCAAGACCTCCAACGGCAACTGCCTGATCGAGGGCCGCGG
CTGGCAGCCCTTCCGCGTGGTGCGCCTGATCTTCTGA
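A codon-optimized coding sequence is expected to encode the same protein as the native coding sequence, differing only in synonymous codon choice. The following is a minimal sketch, not part of the original listing, of how that could be checked by translating both sequences with the standard genetic code. The function names `translate` and `encodes_same_protein` are hypothetical, and the two short inputs are the first 30 nucleotides of SEQ ID NO: 38 and SEQ ID NO: 39 as listed above.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order.
# The third codon position varies fastest, matching itertools.product.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA sequence; stop codons are rendered as '*'."""
    # Drop whitespace so sequences pasted across several lines still work.
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def encodes_same_protein(dna_a, dna_b):
    """Return True if both coding sequences translate to the same protein."""
    return translate(dna_a) == translate(dna_b)


# First 30 nucleotides of SEQ ID NO: 38 (native) and SEQ ID NO: 39
# (codon optimized for Prototheca moriformis), copied from the listing.
native_start = "ATGGTGGCTGCCGCAGCAAGTTCTGCATTC"
optimized_start = "ATGGTGGCCGCCGCCGCCTCCTCCGCCTTC"

print(translate(native_start))
print(encodes_same_protein(native_start, optimized_start))
```

Passing the full sequences instead of the 30-nucleotide fragments would apply the same check to the complete open reading frames, provided the text is copied without extraction damage.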
SEQ ID NO: 40 Cuphea heterophylla (Cht) FATB1a amino acid sequence MVAAAASSAFFSVPTPGTSTKPGNFGNWPSSLSVPFKPESNHNGGFRVKANASAHPKANGSA
VNLKSGSLETQEDTSSSSPPPRTFIKQLPDWGMLLSKITTVFGAAERQWKRPGMLVEPFGVDRI
FQDGVFFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCKRDLIWV
VTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMMNRKT
RRLSKFPYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDSIRKGLTPRWNDLDVNQHVNNV
KYIGWILKSVPAEVFETQELCGVTLEYRRECGRDSVLESVTAMDTAKEGDRSLYQHLLRLEDG
ADITIGRTEWRPKNAGANGAISTGKTSNENSVS*
SEQ ID NO: 41 Cuphea heterophylla (Cht) FATB1a coding DNA sequence ATGGTGGCTGCCGCAGCAAGTTCTGCATTCTTCTCCGTTCCAACCCCGGGAACCTCCACTA
AACCCGGGAACTTCGGCAATTGGCCATCGAGCTTGAGCGTCCCCTTCAAGCCCGAATCAA
ACCACAATGGTGGCTTTCGGGTCAAGGCAAACGCCAGTGCTCATCCTAAGGCTAACGGTT
CTGCAGTAAATCTAAAGTCTGGCAGCCTCGAGACTCAGGAGGACACTTCATCGTCGTCCC
CTCCTCCTCGGACTTTTATTAAGCAGTTGCCCGACTGGGGTATGCTTCTGTCCAAAATCAC
GACTGTCTTCGGGGCGGCTGAGAGGCAGTGGAAGAGGCCCGGCATGCTTGTGGAACCGTT
TGGGGTTGACAGGATTTTTCAGGATGGGGTTTTTTTCAGACAGAGTTTTTCGATCAGGTCT
TACGAAATAGGCGCTGATCGAACAGCCTCAATAGAGACGCTGATGAACATCTTCCAGGAA
ACATCTCTGAATCATTGTAAGAGTATCGGTCTTCTCAATGACGGCTTTGGTCGTACTCCTG
AGATGTGTAAGAGGGACCTCATTTGGGTGGTTACGAAAATTCAGGTCGAGGTGAATCGCT
ATCCTACTTGGGGTGATACTATTGAGGTCAATACTTGGGTCTCAGAGTCGGGGAAAAACG
GTATGGGTCGTGATTGGCTGATAAGCGATTGCCGTACCGGAGAAATTCTTATAAGAGCAA
CGAGCGTGTGGGCTATGATGAATCGAAAGACGAGAAGATTGTCAAAATTTCCATATGAGG
TTCGACAGGAGATAGCGCCTCATTTTGTGGACTCTGCTCCTGTCATTGAAGACGATAAAAA
ATTGCACAAGCTTGATGTTAAGACGGGTGATTCCATTCGCAAGGGTCTAACTCCAAGGTG
GAATGACTTGGATGTCAATCAGCACGTTAACAATGTGAAGTACATTGGGTGGATTCTCAA
GAGTGTTCCAGCAGAAGTTTTCGAGACCCAGGAGCTATGCGGAGTCACCCTTGAGTACAG
GCGGGAATGTGGAAGGGACAGTGTGCTGGAGTCCGTGACCGCTATGGATACCGCAAAAG
AGGGAGACCGGTCTCTGTACCAGCACCTTCTTCGGCTTGAGGATGGGGCTGATATCACCAT
AGGCAGAACCGAGTGGCGGCCGAAGAATGCAGGAGCCAATGGGGCAATATCAACAGGAA
AGACTTCAAATGAAAACTCTGTCTCTTAG
SEQ ID NO: 42 Cuphea heterophylla (Cht) FATB1a coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCTCCGTGCCCACCCCCGGCACCTCCACCA
AGCCCGGCAACTTCGGCAACTGGCCCTCCTCCCTGTCCGTGCCCTTCAAGCCCGAGTCCAA
CCACAACGGCGGCTTCCGCGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGGAGACCCAGGAGGACACCTCCTCCTCCTCCCCC
CCCCCCCGCACCTTCATCAAGCAGCTGCCCGACTGGGGCATGCTGCTGTCCAAGATCACCA
CCGTGTTCGGCGCCGCCGAGCGCCAGTGGAAGCGCCCCGGCATGCTGGTGGAGCCCTTCG
GCGTGGACCGCATCTTCCAGGACGGCGTGTTCTTCCGCCAGTCCTTCTCCATCCGCTCCTA
CGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGATGAACATCTTCCAGGAGAC
CTCCCTGAACCACTGCAAGTCCATCGGCCTGCTGAACGACGGCTTCGGCCGCACCCCCGA
GATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGATCCAGGTGGAGGTGAACCGCTA
CCCCACCTGGGGCGACACCATCGAGGTGAACACCTGGGTGTCCGAGTCCGGCAAGAACGG
CATGGGCCGCGACTGGCTGATCTCCGACTGCCGCACCGGCGAGATCCTGATCCGCGCCAC
CTCCGTGTGGGCCATGATGAACCGCAAGACCCGCCGCCTGTCCAAGTTCCCCTACGAGGT
GCGCCAGGAGATCGCCCCCCACTTCGTGGACTCCGCCCCCGTGATCGAGGACGACAAGAA
GCTGCACAAGCTGGACGTGAAGACCGGCGACTCCATCCGCAAGGGCCTGACCCCCCGCTG
GAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAAGTACATCGGCTGGATCCTGAA
GTCCGTGCCCGCCGAGGTGTTCGAGACCCAGGAGCTGTGCGGCGTGACCCTGGAGTACCG
CCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTGACCGCCATGGACACCGCCAAGGA
GGGCGACCGCTCCCTGTACCAGCACCTGCTGCGCCTGGAGGACGGCGCCGACATCACCAT
CGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCGCCAACGGCGCCATCTCCACCGGCAA
GACCTCCAACGAGAACTCCGTGTCCTGA
SEQ ID NO: 43 Cuphea heterophylla (Cht) FATB1b (P16S, T20P, G94S, G105W, S293F, L305F variant) amino acid sequence MVAAAASSAFFSVPTSGTSPKPGNFGNWPSSLSVPFKPESSHNGGFQVKANASAHPKANGSAV
NLKSGSLETQEDTSSSSPPPRTFIKQLPDWSMLLSKITTVFWAAERQWKRPGMLVEPFGVDRIF
QDGVFFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCKRDLIWVV
TKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMMNRKTRR
LSKFPYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDFIRKGLTPRWNDFDVNQHVNNVKYI
GWILKSVPAEVFETQELCGVTLEYRRECGRDSVLESVTAMDTAKEGDRSLYQHLLRLEDGADI
TIGRTEWRPKNAGANGAISTGKTSNENSVS*
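Variant headers in this listing use residue-position-residue labels such as P16S, meaning that the residue at position 16 of the parent sequence is replaced by serine. The following is a minimal sketch, not part of the original listing, of how such labels could be parsed and applied. It assumes 1-based positions counted on the parent sequence (here the first 21 residues of FATB1a, SEQ ID NO: 40), and it does not assume that a header's list is exhaustive. The names `parse_substitution` and `apply_substitutions` are hypothetical.

```python
import re

# One-letter reference residue, 1-based position, one-letter replacement.
SUBSTITUTION = re.compile(r"([A-Z])(\d+)([A-Z])")


def parse_substitution(label):
    """Split a label such as 'P16S' into ('P', 16, 'S')."""
    match = SUBSTITUTION.fullmatch(label.strip())
    if match is None:
        raise ValueError(f"not a substitution label: {label!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def apply_substitutions(reference, labels):
    """Apply each substitution to the reference, checking the parent residue."""
    residues = list(reference)
    for label in labels:
        ref, position, new = parse_substitution(label)
        if not 1 <= position <= len(residues):
            raise ValueError(f"{label}: position outside the sequence")
        if residues[position - 1] != ref:
            raise ValueError(
                f"{label}: reference has {residues[position - 1]} at {position}"
            )
        residues[position - 1] = new
    return "".join(residues)


# First 21 residues of FATB1a (SEQ ID NO: 40), spaces removed.
fatb1a_start = "MVAAAASSAFFSVPTPGTSTK"
print(apply_substitutions(fatb1a_start, ["P16S", "T20P"]))
```

The reference-residue check is deliberate: a mismatch usually signals a numbering offset or a character misread during text extraction rather than a genuine sequence difference.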
SEQ ID NO: 44 Cuphea heterophylla (Cht) FATB1b (P16S, T20P, G94S, G105W, S293F, L305F variant) coding DNA sequence ATGGTGGCTGCCGCAGCAAGTTCTGCATTCTTCTCCGTTCCAACCTCGGGAACCTCCCCTA
AACCCGGGAACTTCGGCAATTGGCCATCGAGCTTGAGCGTCCCCTTCAAGCCCGAATCAA
GCCACAATGGTGGCTTTCAGGTCAAGGCAAACGCCAGTGCCCATCCTAAGGCTAACGGTT
CTGCAGTAAATCTAAAGTCTGGCAGCCTCGAGACTCAGGAGGACACTTCATCGTCGTCCC
CTCCTCCTCGGACTTTTATTAAGCAGTTGCCCGACTGGAGTATGCTTCTGTCCAAAATCAC
GACTGTCTTCTGGGCGGCTGAGAGGCAGTGGAAGAGGCCCGGCATGCTTGTGGAACCGTT
TGGGGTTGACAGGATTTTTCAGGATGGGGTTTTTTTCAGACAGAGTTTTTCGATCAGGTCT
TACGAAATAGGCGCTGATCGAACAGCCTCAATAGAGACGCTGATGAACATCTTCCAGGAA
ACATCTCTGAATCATTGTAAGAGTATCGGTCTTCTCAATGACGGCTTTGGTCGTACTCCTG
AGATGTGTAAGAGGGACCTCATTTGGGTGGTTACGAAAATTCAGGTCGAGGTGAATCGCT
ATCCTACTTGGGGTGATACTATTGAGGTCAATACTTGGGTCTCAGAGTCGGGGAAAAACG
GTATGGGTCGTGATTGGCTGATAAGCGATTGCCGTACCGGAGAAATTCTTATAAGAGCAA
CGAGCGTGTGGGCTATGATGAATCGAAAGACGAGAAGATTGTCAAAATTTCCATATGAGG
TTCGACAGGAGATAGCGCCTCATTTTGTGGACTCTGCTCCTGTCATTGAAGACGATAAAAA
ATTGCACAAGCTTGATGTTAAGACGGGTGATTTCATTCGCAAGGGTCTAACTCCAAGGTG
GAATGACTTTGATGTCAATCAGCACGTTAACAATGTGAAGTACATTGGGTGGATTCTCAA
GAGTGTTCCAGCAGAAGTTTTCGAGACCCAGGAGCTATGCGGAGTCACCCTTGAGTATAG
GCGGGAATGTGGAAGGGACAGTGTGCTGGAGTCCGTGACCGCTATGGATACCGCAAAAG
AGGGAGACCGGTCTCTGTACCAGCACCTTCTTCGGCTTGAGGATGGGGCTGATATCACCAT
AGGCAGAACCGAGTGGCGGCCGAAGAATGCAGGAGCCAATGGGGCAATATCAACAGGAA
AGACTTCAAATGAAAACTCTGTCTCTTAG
SEQ ID NO: 45 Cuphea heterophylla (Cht) FATB1b (P16S, T20P, G94S, G105W, S293F, L305F variant) coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCTCCGTGCCCACCTCCGGCACCTCCCCCA
AGCCCGGCAACTTCGGCAACTGGCCCTCCTCCCTGTCCGTGCCCTTCAAGCCCGAGTCCTC
CCACAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGGAGACCCAGGAGGACACCTCCTCCTCCTCCCCC
CCCCCCCGCACCTTCATCAAGCAGCTGCCCGACTGGTCCATGCTGCTGTCCAAGATCACCA
CCGTGTTCTGGGCCGCCGAGCGCCAGTGGAAGCGCCCCGGCATGCTGGTGGAGCCCTTCG
GCGTGGACCGCATCTTCCAGGACGGCGTGTTCTTCCGCCAGTCCTTCTCCATCCGCTCCTA
CGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGATGAACATCTTCCAGGAGAC
CTCCCTGAACCACTGCAAGTCCATCGGCCTGCTGAACGACGGCTTCGGCCGCACCCCCGA
GATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGATCCAGGTGGAGGTGAACCGCTA
CCCCACCTGGGGCGACACCATCGAGGTGAACACCTGGGTGTCCGAGTCCGGCAAGAACGG
CATGGGCCGCGACTGGCTGATCTCCGACTGCCGCACCGGCGAGATCCTGATCCGCGCCAC
CTCCGTGTGGGCCATGATGAACCGCAAGACCCGCCGCCTGTCCAAGTTCCCCTACGAGGT
GCGCCAGGAGATCGCCCCCCACTTCGTGGACTCCGCCCCCGTGATCGAGGACGACAAGAA
GCTGCACAAGCTGGACGTGAAGACCGGCGACTTCATCCGCAAGGGCCTGACCCCCCGCTG
GAACGACTTCGACGTGAACCAGCACGTGAACAACGTGAAGTACATCGGCTGGATCCTGAA
GTCCGTGCCCGCCGAGGTGTTCGAGACCCAGGAGCTGTGCGGCGTGACCCTGGAGTACCG
CCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTGACCGCCATGGACACCGCCAAGGA
GGGCGACCGCTCCCTGTACCAGCACCTGCTGCGCCTGGAGGACGGCGCCGACATCACCAT
CGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCGCCAACGGCGCCATCTCCACCGGCAA
GACCTCCAACGAGAACTCCGTGTCCTGA
SEQ ID NO: 46 Cuphea heterophylla (Cht) FATB2b amino acid sequence MVVAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHPKANGSA
VSLKSGSLNTQEGTSSSPPPRTFLNQLPDWSRLRTAITTVFVAAEKQLTMLDRKSKKPDMHVD
WFGLEIIVQDGLVFRESFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFGRTPEM
CKRDLIWVLTKMQIMVNRYPTWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIWA
MMNQKTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEWNDLDV
NQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGDRSQYQ
HLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS*
SEQ ID NO: 47 Cuphea heterophylla (Cht) FATB2b coding DNA sequence ATGGTGGTGGCTGCTGCAGCAAGCTCTGCATTCTTCCCTGTTCCGGCATCTGGAACCTCCC
CTAAACCCGGGAAGTTCGGGACTTGGCTATCGAGCTCGAGCCCTTCCTACAAGCCCAAGT
CAAACCCCAGTGGTGGATTTCAGGTTAAGGCAAATGCCAGTGCTCATCCTAAGGCTAACG
GTTCCGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGGGCACTTCGTCGTCCCC
TCCTCCTCGGACTTTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCGGACTGCAATCACG
ACCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAGTCTAAGAAGCCT
GACATGCACGTGGACTGGTTTGGGTTGGAGATTATTGTTCAGGATGGGCTCGTGTTCAGAG
AGAGTTTTTCGATCAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAAACGTT
GATGAACCATTTGCAGGACACATCTTTGAACCATTGTAAGAGTGTGGGTCTTCTCAATGAC
GGCTTTGGTCGTACCCCGGAGATGTGTAAAAGGGACCTCATTTGGGTGCTTACAAAAATG
CAGATCATGGTGAATCGCTATCCAACTTGGGGCGATACTGTCGAGATCAATAGCTGGTTCT
CCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGGAG
AAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAAGACGAGAAGATTCT
CAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTGGACGCCCCTCCTGT
CATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTTGC
AAGGGTCTAACACCGGAGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAAG
TACATTGGGTGGATTCTCGAGAGTATGCCAAAAGAAGTTTTGGACACCCAGGAGCTATGC
TCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGATAGTGTGCTGGAGTCTGTGACC
GCTATGGATCCCTCAAAAGTTGGAGACCGATCTCAGTACCAGCACCTTCTGCGGCTTGAA
GATGGGACTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCAA
CGGGGCTATATCAACAGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG
SEQ ID NO: 48 Cuphea heterophylla (Cht) FATB2b coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCTCCGGCACCTCCC
CCAAGCCCGGCAAGTTCGGCACCTGGCTGTCCTCCTCCTCCCCCTCCTACAAGCCCAAGTC
CAACCCCTCCGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGGGCACCTCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCGCACCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGAAGCCCG
ACATGCACGTGGACTGGTTCGGCCTGGAGATCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGTCCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACCACCTGCAGGACACCTCCCTGAACCACTGCAAGTCCGTGGGCCTGCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CTGCAAGGGCCTGACCCCCGAGTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCAAGGAGGTGCTGGACACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGT
GACCGCCATGGACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCACCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA
SEQ ID NO: 49 Cuphea heterophylla (Cht) FATB2a (S17P, P21S, T28N, L30P, S33L, G76D, S78P, G137W variant) amino acid sequence MVVAAAASSAFFPVPAPGTTSKPGKFGNWPSSLSPSFKPKSNPNGGFQVKANASAHPKANGS
AVSLKSGSLNTKEDTPSSPPPRTFLNQLPDWSRLRTAITTVFVAAEKQLTMLDRKSKKPDMHV
DWFGLEIIVQDWLVFRESFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFGRTPE
MCKRDLIWVLTKMQIMVNRYPTWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIW
AMMNQKTRRFSKLPNEVRQEIAPHFVDAPPLIEDNDRKLHKFDVKTGDSICKGLTPEWNDLD
VNQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGDRSQY
QHLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS*

SEQ ID NO: 50 Cuphea heterophylla (Cht) FATB2a (S17P, P21S, T28N, L30P, S33L, G76D, S78P, G137W variant) coding DNA sequence ATGGTGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGTTCCAGCACCTGGAACCACGT
CTAAACCCGGGAAGTTCGGCAATTGGCCATCGAGCTTGAGCCCTTCCTTCAAGCCCAAGTC
AAACCCCAATGGTGGATTTCAGGTTAAGGCAAATGCCAGCGCTCATCCTAAGGCTAACGG
GTCTGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTAAGGAGGACACTCCGTCGTCCCC
TCCTCCTCGGACTTTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCGGACTGCAATCACG
ACCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAGTCTAAGAAGCCT
GACATGCACGTGGACTGGTTTGGGTTGGAGATTATTGTTCAGGATTGGCTCGTGTTCAGAG
AGAGTTTTTCGATCAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAAACGTT
GATGAACCATTTGCAGGACACATCTTTGAACCATTGTAAGAGTGTGGGTCTTCTCAATGAC
GGCTTTGGTCGTACCCCGGAGATGTGTAAAAGGGACCTCATTTGGGTGCTTACAAAAATG
CAGATCATGGTGAATCGCTATCCAACTTGGGGCGATACTGTCGAGATCAATAGCTGGTTCT
CCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGGAG
AAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAAGACGAGAAGATTCT
CAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCTCCTCATTTTGTGGACGCCCCTCCTCT
CATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTTGC
AAGGGTCTAACACCGGAGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAAG
TACATTGGGTGGATTCTCGAGAGTATGCCAAAAGAAGTTTTGGACACCCAGGAGCTATGC
TCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCTGTGACC
GCTATGGATCCCTCAAAAGTTGGAGACCGATCTCAGTACCAGCACCTTCTGCGGCTTGAA
GATGGGACTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCAA
CGGGGCGATATCAACAGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG
SEQ ID NO: 51 Cuphea heterophylla (Cht) FATB2a (S17P, P21S, T28N, L30P, S33L, G76D, S78P, G137W variant) coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCACCT
CCAAGCCCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGCCCAAGTC
CAACCCCAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCAAGGAGGACACCCCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCGCACCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGAAGCCCG
ACATGCACGTGGACTGGTTCGGCCTGGAGATCATCGTGCAGGACTGGCTGGTGTTCCGCG
AGTCCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACCACCTGCAGGACACCTCCCTGAACCACTGCAAGTCCGTGGGCCTGCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CCTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CTGCAAGGGCCTGACCCCCGAGTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCAAGGAGGTGCTGGACACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGT
GACCGCCATGGACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCACCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA
SEQ ID NO: 52 Cuphea heterophylla (Cht) FATB2c (G76D, S78P variant) amino acid sequence MVVAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHPKANGSA
VSLKSGSLNTKEDTPSSPPPRTFLNQLPDWNRLRTAITTVFVAAEKQLTMLDRKSKKPDMHVD
WFGLEIIVQDGLVFRESFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFGRTPEM
CKRDLIWVLTKMQIMVNRYPTWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIWA
MMNQKTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEWNDLDV
NQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGDRSQYQ
HLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS*
SEQ ID NO: 53 Cuphea heterophylla (Cht) FATB2c (G76D, S78P variant) coding DNA sequence ATGGTGGTGGCTGCTGCAGCAAGCTCTGCATTCTTCCCTGTTCCGGCATCTGGAACCTCCC
CTAAACCCGGGAAGTTCGGGACTTGGCTATCGAGCTCGAGCCCTTCCTACAAGCCCAAGT
CAAACCCCAGTGGTGGATTTCAGGTTAAGGCAAATGCCAGTGCTCATCCTAAGGCTAACG
GTTCCGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTAAGGAGGACACTCCGTCGTCCC
CTCCTCCTCGGACTTTCCTTAACCAGTTGCCTGATTGGAATAGGCTTCGGACTGCAATCAC
GACCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAGTCTAAGAAGCC
TGACATGCACGTGGACTGGTTTGGGTTGGAGATTATTGTTCAGGATGGGCTCGTGTTCAGA
GAGAGTTTTTCGATCAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAAACG
TTGATGAACCATTTGCAGGACACATCTTTGAACCATTGTAAGAGTGTGGGTCTTCTCAATG
ACGGCTTTGGTCGTACCCCGGAGATGTGTAAAAGGGACCTCATTTGGGTGCTTACAAAAA
TGCAGATCATGGTGAATCGCTATCCAACTTGGGGCGATACTGTCGAGATCAATAGCTGGTT
CTCCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGG
AGAAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAAGACGAGAAGATT
CTCAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTGGACGCCCCTCCT
GTCATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTT
GCAAGGGTCTAACACCGGAGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGA
AGTACATTGGGTGGATTCTCGAGAGTATGCCAAAAGAAGTTTTGGACACCCAGGAGCTAT
GCTCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCTGTGA
CCGCTATGGATCCCTCAAAAGTTGGGGACCGATCTCAGTACCAGCACCTTCTGCGGCTTGA
AGATGGGACTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCA
ACGGGGCTATATCAACAGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG
SEQ ID NO: 54 Cuphea heterophylla (Cht) FATB2c (G76D, S78P variant) coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCTCCGGCACCTCCC
CCAAGCCCGGCAAGTTCGGCACCTGGCTGTCCTCCTCCTCCCCCTCCTACAAGCCCAAGTC
CAACCCCTCCGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCAAGGAGGACACCCCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGAACCGCCTGCGCACCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGAAGCCCG
ACATGCACGTGGACTGGTTCGGCCTGGAGATCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGTCCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACCACCTGCAGGACACCTCCCTGAACCACTGCAAGTCCGTGGGCCTGCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CTGCAAGGGCCTGACCCCCGAGTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCAAGGAGGTGCTGGACACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGT
GACCGCCATGGACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCACCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA

SEQ ID NO: 55 Cuphea heterophylla (Cht) FATB2d (S21P, T28N, L30P, S33L, G76D, R97L, H124L, W127L, I132S, K258N, C303R, E309G, K334T, T386A variant) amino acid sequence MVVAAAASSAFFPVPAPGTTSKPGKFGNWPSSLSPSFKPKSNPNGGFQVKANASAHPKANGS
AVSLKSGSLNTQEDTSSSPPPRTFLNQLPDWSRLLTAISTVFVAAEKQLTMLDRKSKRPDMLV
DLFGLESIVQDGLVFRESYSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFGRTPE
MCKRDLIWVLTKMQIMVNRYPTWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIW
AMMNQNTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSIRKGLTPGWNDLD
VNQHVSNVKYIGWILESMPTEVLETQELCSLTLEYRRECGRESVLESVTAMNPSKVGDRSQYQ
HLLRLEDGADIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS*
SEQ ID NO: 56 Cuphea heterophylla (Cht) FATB2d (S21P, T28N, L30P, S33L, G76D, R97L, H124L, W127L, I132S, K258N, C303R, E309G, K334T, T386A variant) coding DNA sequence ATGGTGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGTTCCAGCACCTGGAACCACGT
CTAAACCCGGGAAGTTCGGCAATTGGCCATCGAGCTTGAGCCCTTCCTTCAAGCCCAAGTC
AAACCCCAATGGTGGATTTCAGGTTAAGGCAAATGCCAGTGCTCATCCTAAGGCTAACGG
TTCTGCGGTAAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCT
CCTCCTCGGACATTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCTGACTGCAATCTCGA
CCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAATCTAAGAGGCCTG
ACATGCTCGTGGACTTGTTTGGGTTGGAGAGTATTGTTCAGGATGGGCTCGTGTTCAGAGA
GAGTTATTCGATCAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAAACGTT
GATGAACCATTTGCAGGACACATCTTTGAACCATTGTAAGAGTGTGGGTCTTCTCAATGAC
GGCTTTGGTCGTACCCCGGAGATGTGTAAAAGGGACCTCATTTGGGTGCTTACAAAAATG
CAGATCATGGTGAATCGCTATCCAACTTGGGGCGATACTGTCGAGATCAATAGCTGGTTCT
CCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGGAG
AAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAATACGAGAAGATTCT
CAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTTGACGCTCCTCCTGT
CATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTCG
CAAGGGTCTAACTCCGGGGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAA
GTACATTGGGTGGATTCTCGAGAGTATGCCAACAGAAGTTTTGGAGACCCAGGAGCTATG
CTCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGAAAGTGTGCTGGAGTCCGTGAC
CGCTATGAATCCCTCAAAAGTTGGAGACCGGTCTCAGTACCAGCACCTTCTACGGCTTGAG
GATGGGGCTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCAA
CGGGGCGATATCAACAGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG
SEQ ID NO: 57 Cuphea heterophylla (Cht) FATB2d (S21P, T28N, L30P, S33L, G76D, R97L, H124L, W127L, I132S, K258N, C303R, E309G, K334T, T386A variant) coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCACCT
CCAAGCCCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGCCCAAGTC
CAACCCCAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCTCCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGGTGGACCTGTTCGGCCTGGAGTCCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGTCCTACTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACCACCTGCAGGACACCTCCCTGAACCACTGCAAGTCCGTGGGCCTGCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAACACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CCGCAAGGGCCTGACCCCCGGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCACCGAGGTGCTGGAGACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCGT
GACCGCCATGAACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCGCCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA
SEQ ID NO: 58 Cuphea heterophylla (Cht) FATB2e (G76D, R97L, H124L, I132S, G152S, H165L, T211N, K258N, C303R, E309G, K334T, T386A variant) amino acid sequence MVVAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHPKANGSA
VSLKSGSLNTQEDTSSSPPPQTFLNQLPDWSRLLTAISTVFVAAEKQLTMLDRKSKRPDMLVD
WFGLESIVQDGLVFRESYSIRSYEISADRTASIETVMNLLQETSLNHCKSMGILNDGFGRTPEM
CKRDLIWVLTKMQILVNRYPNWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIWA
MMNQNTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSIRKGLTPGWNDLDV
NQHVSNVKYIGWILESMPTEVLETQELCSLTLEYRRECGRDSVLESVTAMNPSKVGDRSQYQ
HLLRLEDGADIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS*
SEQ ID NO: 59 Cuphea heterophylla (Cht) FATB2e (G76D, R97L, H124L, I132S, G152S, H165L, T211N, K258N, C303R, E309G, K334T, T386A variant) coding DNA sequence ATGGTGGTGGCTGCTGCAGCAAGCTCTGCATTCTTCCCTGTTCCGGCATCTGGAACCTCCC
CTAAACCCGGGAAGTTCGGGACTTGGCTATCGAGCTCGAGCCCTTCCTACAAGCCCAAGT
CAAACCCCAGTGGTGGATTTCAGGTTAAGGCAAATGCCAGTGCTCATCCTAAGGCTAACG
GTTCTGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCC
TCCTCCTCAGACATTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCTGACAGCAATCTCG
ACCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAATCTAAAAGGCCT
GACATGCTCGTGGACTGGTTTGGGTTGGAGAGTATTGTTCAGGATGGGCTCGTGTTCAGAG
AGAGTTATTCGATCAGGTCTTACGAAATAAGCGCTGATCGAACAGCCTCTATAGAGACGG
TGATGAACCTCTTGCAGGAAACATCTCTCAATCATTGTAAGAGTATGGGTATTCTCAATGA
CGGCTTTGGTCGTACCCCGGAGATGTGCAAAAGGGACCTCATTTGGGTGCTTACAAAAAT
GCAGATCTTGGTGAATCGCTATCCAAATTGGGGTGATACTGTCGAGATCAATAGCTGGTTC
TCCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGGA
GAAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAATACGAGAAGATTC
TCAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTTGACGCTCCTCCTG
TCATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTCG
CAAGGGTCTAACTCCGGGGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAA
GTACATTGGGTGGATTCTCGAGAGTATGCCAACAGAAGTTTTGGAGACCCAGGAGCTATG
CTCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCCGTGAC
CGCTATGAATCCCTCAAAAGTTGGAGACCGGTCTCAGTACCAGCACCTTCTACGGCTTGAG
GATGGGGCTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCAA
CGGGGCGATATCAACAGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG
SEQ ID NO: 60 Cuphea heterophylla (Cht) FATB2e (G76D, R97L, H124L, I132S, G152S, H165L, T211N, K258N, C303R, E309G, K334T, T386A variant) coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCTCCGGCACCTCCC
CCAAGCCCGGCAAGTTCGGCACCTGGCTGTCCTCCTCCTCCCCCTCCTACAAGCCCAAGTC
CAACCCCTCCGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCC
CCCCCCCAGACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCTCCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGGTGGACTGGTTCGGCCTGGAGTCCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGTCCTACTCCATCCGCTCCTACGAGATCTCCGCCGACCGCACCGCCTCCATCGAGACCGT
GATGAACCTGCTGCAGGAGACCTCCCTGAACCACTGCAAGTCCATGGGCATCCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCCTGGTGAACCGCTACCCCAACTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAACACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CCGCAAGGGCCTGACCCCCGGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCACCGAGGTGCTGGAGACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGT
GACCGCCATGAACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCGCCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA
SEQ ID NO: 61 Cuphea heterophylla (Cht) FATB2f (R97L, H124L, I132S, G152S, H165L, T211N variant) amino acid sequence MVVAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHPKANGSA
VSLKSGSLNTQEGTSSSPPPRTFLNQLPDWSRLLTAISTVFVAAEKQLTMLDRKSKRPDMLVD
WFGLESIVQDGLVFRESYSIRSYEISADRTASIETVMNLLQETSLNHCKSMGILNDGFGRTPEM
CKRDLIWVLTKMQILVNRYPNWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIWA
MMNQKTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEWNDLDV
NQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGDRSQYQ
HLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS*
SEQ ID NO: 62 Cuphea heterophylla (Cht) FATB2f (R97L, H124L, I132S, G152S, H165L, T211N variant) coding DNA sequence ATGGTGGTGGCTGCTGCAGCAAGCTCTGCATTCTTCCCTGTTCCGGCATCTGGAACCTCCC
CTAAACCCGGGAAGTTCGGGACTTGGCTATCGAGCTCGAGCCCTTCCTACAAGCCCAAGT
CAAACCCCAGTGGTGGATTTCAGGTTAAAGCAAATGCCAGTGCTCATCCTAAGGCTAACG
GTTCCGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGGGCACTTCGTCGTCCCC
TCCTCCTCGGACATTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCTGACTGCAATCTCG
ACCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAATCTAAGAGGCCT
GACATGCTCGTGGACTGGTTTGGGTTGGAGAGTATTGTTCAGGATGGGCTCGTGTTCAGAG
AGAGTTATTCGATCAGGTCTTACGAAATAAGCGCTGATCGAACAGCCTCTATAGAGACGG
TGATGAACCTCTTGCAGGAAACATCTCTCAATCATTGTAAGAGTATGGGTATTCTCAATGA
CGGCTTTGGTCGTACCCCGGAGATGTGCAAAAGGGACCTCATTTGGGTGCTTACAAAAAT
GCAGATCTTGGTGAATCGCTATCCAAATTGGGGTGATACTGTCGAGATCAATAGCTGGTTC
TCCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGGA
GAAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAAGACGAGAAGATTC
TCAAAACTTCCAAATGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTGGACGCCCCTCCTG
TCATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTTG
CAAGGGTCTAACACCGGAGTGGAACGACTTGGATGTCAATCAGCACGTAAGCAACGTGAA
GTACATTGGGTGGATTCTCGAGAGTATGCCAAAAGAAGTTTTGGACACCCAGGAGCTATG
CTCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCTGTGAC
CGCTATGGATCCCTCAAAAGTTGGAGACCGATCTCAGTACCAGCACCTTCTGCGGCTTGAA
GATGGGACTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCAA
CGGGGCGATATCAACAGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG
SEQ ID NO: 63 Cuphea heterophylla (Cht) FATB2f (R97L, H124L, I132S, G152S, H165L, T211N variant) coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCTCCGGCACCTCCC
CCAAGCCCGGCAAGTTCGGCACCTGGCTGTCCTCCTCCTCCCCCTCCTACAAGCCCAAGTC
CAACCCCTCCGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGGGCACCTCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCTCCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGGTGGACTGGTTCGGCCTGGAGTCCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGTCCTACTCCATCCGCTCCTACGAGATCTCCGCCGACCGCACCGCCTCCATCGAGACCGT
GATGAACCTGCTGCAGGAGACCTCCCTGAACCACTGCAAGTCCATGGGCATCCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCCTGGTGAACCGCTACCCCAACTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CTGCAAGGGCCTGACCCCCGAGTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCAAGGAGGTGCTGGACACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGT
GACCGCCATGGACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCACCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA
SEQ ID NO: 64 Cuphea heterophylla (Cht) FATB2g (A6T, A16V, S17P, G76D, R97L, H124L, I132S, S143I, G152S, A157T, H165L, T211N, G414A variant) amino acid sequence MVVAATASSAFFPVPVPGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHPKANGSA
VSLKSGSLNTQEDTSSSPPPRTFLNQLPDWSRLLTAISTVFVAAEKQLTMLDRKSKRPDMLVD
WFGLESIVQDGLVFREIYSIRSYEISADRTTSIETVMNLLQETSLNHCKSMGILNDGFGRTPEMC
KRDLIWVLTKMQILVNRYPNWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIWAM
MNQKTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEWNDLDVN
QHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGDRSQYQH
LLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNANSVS*
SEQ ID NO: 65 Cuphea heterophylla (Cht) FATB2g (A6T, A16V, S17P, G76D, R97L, H124L, I132S, S143I, G152S, A157T, H165L, T211N, G414A variant) coding DNA sequence ATGGTGGTGGCTGCTACAGCAAGTTCTGCATTCTTCCCTGTTCCTGTACCTGGAACCTCCC
CTAAACCCGGAAAGTTCGGGACTTGGCTATCGAGCTCGAGCCCTTCCTACAAGCCCAAGT
CAAACCCCAGTGGTGGATTTCAGGTTAAGGCAAATGCCAGTGCTCATCCTAAGGCTAACG
GTTCTGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCC
TCCTCCTCGGACATTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCTGACTGCAATCTCG
ACCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAATCTAAGAGGCCT
GACATGCTCGTGGACTGGTTTGGGTTGGAGAGTATTGTTCAGGATGGGCTCGTGTTCAGAG
AGATTTATTCGATCAGGTCTTACGAAATAAGCGCTGATCGAACAACCTCTATAGAGACGG
TGATGAACCTCTTGCAGGAAACATCTCTCAATCATTGTAAGAGTATGGGTATTCTCAATGA
CGGCTTTGGTCGTACCCCGGAGATGTGCAAAAGGGACCTCATTTGGGTGCTTACAAAAAT
GCAGATCTTGGTGAATCGCTATCCAAATTGGGGTGATACTGTCGAGATCAATAGCTGGTTC
TCCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGGA
GAAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAAGACGAGAAGATTC
TCAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTGGACGCCCCTCCTG
TCATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTTG
CAAGGGTCTAACACCGGAGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAA
GTACATTGGGTGGATTCTCGAGAGTATGCCAAAAGAAGTTTTGGACACCCAGGAGCTATG
CTCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCTGTGAC
CGCTATGGATCCCTCAAAAGTTGGAGACCGATCTCAGTACCAGCACCTTCTGCGGCTTGAA
GATGGGACTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCAA
CGGGGCGATATCAACAGGAAAGACTTCAAATGCAAACTCGGTCTCTTAG

SEQ ID NO: 66 Cuphea heterophylla (Cht) FATB2g (A6T, A16V, S17P, G76D, R97L, H124L, I132S, S143I, G152S, A157T, H165L, T211N, G414A variant) coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGTGGCCGCCACCGCCTCCTCCGCCTTCTTCCCCGTGCCCGTGCCCGGCACCTCCC
CCAAGCCCGGCAAGTTCGGCACCTGGCTGTCCTCCTCCTCCCCCTCCTACAAGCCCAAGTC
CAACCCCTCCGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCTCCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGGTGGACTGGTTCGGCCTGGAGTCCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGATCTACTCCATCCGCTCCTACGAGATCTCCGCCGACCGCACCACCTCCATCGAGACCGT
GATGAACCTGCTGCAGGAGACCTCCCTGAACCACTGCAAGTCCATGGGCATCCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCCTGGTGAACCGCTACCCCAACTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CTGCAAGGGCCTGACCCCCGAGTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCAAGGAGGTGCTGGACACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGT
GACCGCCATGGACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCACCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGCCAACTCCGTGTCCTGA
SEQ ID NO: 67 Cuphea heterophylla (Cht) FATB3a amino acid sequence MVATAASSAFFPVPSPDTSSRPGKLGNGSSSLRPLKPKFVANAGLQVKANASAPPKINGSSVSL
KSCSLKTHEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDPF
GLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNEGFGRTPEMY
KRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSV
WVMMNQKTRKLSKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTPKWNDL
DVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKGFGPQFQ
HLLRLEDGGEIVKGRTEWRPKTAGINGTIASGETSPGNS*
SEQ ID NO: 68 Cuphea heterophylla (Cht) FATB3a coding DNA sequence ATGGTGGCCACCGCTGCAAGTTCTGCATTCTTCCCGGTGCCGTCCCCGGACACCTCCTCTA
GACCGGGAAAGCTCGGAAATGGGTCATCAAGCTTGAGGCCCCTCAAGCCCAAATTTGTTG
CCAATGCTGGGCTGCAGGTTAAGGCAAACGCCAGTGCCCCTCCTAAGATCAATGGTTCCT
CGGTCAGTCTAAAGTCTTGCAGTCTCAAGACTCATGAAGACACTCCTTCAGCTCCTCCTCC
GCGGACTTTTATCAACCAGTTGCCTGATTGGAGCATGCTTCTTGCTGCAATCACTACTGTC
TTCTTGGCAGCAGAGAAGCAGTGGATGATGCTTGATTGGAAACCAAAGAGGCCTGACATG
CTTGTGGACCCGTTCGGATTGGGAAGGATTGTTCAGGATGGGCTTGTGTTCAGGCAGAATT
TTTCGATTAGGTCCTATGAAATAGGCGCTGATCGCACTGCATCCATAGAGACGGTGATGA
ACCACTTGCAGGAAACGGCTCTCAATCATGTTAAGAGTGCGGGGCTTCTTAATGAAGGCT
TTGGTCGTACTCCTGAGATGTATAAAAGGGACCTTATTTGGGTTGTCGCGAAAATGCAGGT
CATGGTTAACCGCTATCCTACTTGGGGTGACACGGTTGAAGTGAATACTTGGGTTGCCAAG
TCAGGGAAAAATGGTATGCGTCGTGATTGGCTCATAAGTGATTGCAATACAGGAGAAATT
CTTACAAGGGCATCAAGTGTGTGGGTCATGATGAATCAAAAGACAAGAAAATTGTCAAAG
ATTCCAGATGAGGTTCGGCATGAGATAGAGCCTCATTTTGTGGACTCTGCTCCCGTCATTG
AAGACGATGACTGGAAACTTCCCAAGCTGGATGAGAAAACTGCTGACTCCATCCGCAAGG
GTCTAACTCCGAAGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTACA
TTGGGTGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGTTCCCT
TACCCTGGAATACAGGCGGGAATGCGGAAGGGAGAGTGTGCTGGAGTCCCTCACTGCTGT
GGACCCCTCTGGAAAGGGCTTTGGGCCCCAGTTTCAGCACCTTCTGAGGCTTGAGGATGG
AGGTGAGATCGTAAAGGGGAGAACTGAGTGGCGACCCAAGACTGCAGGTATCAATGGGA
CGATTGCATCTGGGGAGACCTCACCTGGAAACTCTTAG
SEQ ID NO: 69 Cuphea heterophylla (Cht) FATB3a coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCCCCGACACCTCCTCCCG
CCCCGGCAAGCTGGGCAACGGCTCCTCCTCCCTGCGCCCCCTGAAGCCCAAGTTCGTGGCC
AACGCCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCCTCC
GTGTCCCTGAAGTCCTGCTCCCTGAAGACCCACGAGGACACCCCCTCCGCCCCCCCCCCCC
GCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGTGTT
CCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACATGCT
GGTGGACCCCTTCGGCCTGGGCCGCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAACTT
CTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATGAA
CCACCTGCAGGAGACCGCCCTGAACCACGTGAAGTCCGCCGGCCTGCTGAACGAGGGCTT
CGGCCGCACCCCCGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAGGT
GATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCCAA
GTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGAT
CCTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCAAGCTGTCCAA
GATCCCCGACGAGGTGCGCCACGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTGAT
CGAGGACGACGACTGGAAGCTGCCCAAGCTGGACGAGAAGACCGCCGACTCCATCCGCA
AGGGCCTGACCCCCAAGTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGC
TCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACC
GCCGTGGACCCCTCCGGCAAGGGCTTCGGCCCCCAGTTCCAGCACCTGCTGCGCCTGGAG
GACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGACCGCCGGCATCAA
CGGCACCATCGCCTCCGGCGAGACCTCCCCCGGCAACTCCTGA
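A codon-optimized coding sequence such as SEQ ID NO: 69 is expected to encode the same polypeptide as the corresponding amino acid sequence (here SEQ ID NO: 67). The following is a minimal sketch of that consistency check, assuming the standard genetic code; the function name `translate` is hypothetical and is not part of the original listing.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence; '*' marks a stop codon.

    Whitespace (for example, line breaks in the listing) is ignored, and any
    trailing bases that do not form a complete codon are dropped.
    """
    cleaned = "".join(dna.split()).upper()
    usable = len(cleaned) - len(cleaned) % 3
    return "".join(CODON_TABLE[cleaned[i:i + 3]] for i in range(0, usable, 3))


# First 42 bases and final 12 bases of SEQ ID NO: 69.
print(translate("ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCC"))  # MVATAASSAFFPVP
print(translate("GGCAACTCCTGA"))  # GNS*
```

Translating a full coding sequence this way and comparing the result with the listed amino acid sequence is one way to spot transcription errors in either entry.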
SEQ ID NO: 70 Cuphea heterophylla (Cht) FATB3b (C67G, H72Q, L128F, N179I variant) amino acid sequence MVATAASSAFFPVPSPDTSSRPGKLGNGSSSLRPLKPKFVANAGLQVKANASAPPKINGSSVSL
KSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDPF
GFGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLIEGFGRTPEMYK
RDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSVW
VMMNQKTRKLSKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTPKWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKGFGPQFQH
LLRLEDGGEIVKGRTEWRPKTAGINGTIASGETSPGNS*
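Variant labels such as C67G and H72Q follow the usual substitution shorthand: original residue, 1-based position, replacement residue. The sketch below applies such labels to a parent sequence and checks that the stated original residue is actually present; `apply_substitutions` is a hypothetical helper, and the example uses only the first 72 residues of SEQ ID NO: 67.

```python
import re

# One-letter residue, 1-based position, one-letter replacement (e.g. "C67G").
SUBSTITUTION = re.compile(r"([A-Z])(\d+)([A-Z])")


def apply_substitutions(parent: str, labels: list[str]) -> str:
    """Return `parent` with each substitution applied, validating the original residue."""
    residues = list(parent)
    for label in labels:
        match = SUBSTITUTION.fullmatch(label)
        if match is None:
            raise ValueError(f"unrecognized substitution label: {label!r}")
        original, replacement = match.group(1), match.group(3)
        index = int(match.group(2)) - 1  # convert 1-based position to 0-based index
        if index < 0 or index >= len(residues):
            raise ValueError(f"{label}: position is outside the sequence")
        if residues[index] != original:
            raise ValueError(
                f"{label}: expected {original} but found {residues[index]}"
            )
        residues[index] = replacement
    return "".join(residues)


# First 72 residues of SEQ ID NO: 67 (spaces from the listing removed).
parent_prefix = (
    "MVATAASSAFFPVPSPDTSSRPGKLGNGSSSLRPLKPKFVANAGLQVKANASAPPKINGSSVSL"
    "KSCSLKTH"
)
variant_prefix = apply_substitutions(parent_prefix, ["C67G", "H72Q"])
print(variant_prefix[-8:])  # KSGSLKTQ
```

The validation step is the useful part of the design: a label whose original residue does not match the parent at that position usually signals a numbering offset or a transcription error.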
SEQ ID NO: 71 Cuphea heterophylla (Cht) FATB3b (C67G, H72Q, L128F, N179I variant) coding DNA
sequence ATGGTGGCCACCGCTGCAAGTTCTGCATTCTTCCCGGTGCCATCCCCGGACACCTCCTCTA
GACCGGGAAAGCTCGGAAATGGGTCATCAAGCTTGAGGCCCCTCAAGCCCAAATTTGTTG
CCAATGCTGGGCTGCAGGTTAAGGCAAACGCCAGTGCCCCTCCTAAGATCAATGGTTCCT
CGGTCAGTCTAAAGTCTGGCAGTCTCAAGACTCAGGAAGACACTCCTTCGGCTCCTCCTCC
GCGGACTTTTATCAACCAGTTGCCTGATTGGAGCATGCTTCTTGCTGCAATCACTACTGTC
TTCTTGGCAGCAGAGAAGCAGTGGATGATGCTTGATTGGAAACCAAAGAGGCCTGACATG
CTTGTGGACCCGTTCGGATTTGGAAGGATTGTTCAGGATGGGCTTGTGTTCAGGCAGAATT
TTTCGATTAGGTCCTATGAAATAGGCGCTGATCGCACTGCATCTATAGAGACGGTGATGA
ACCACTTGCAGGAAACGGCTCTCAATCATGTTAAGAGTGCGGGGCTTCTTATTGAAGGCTT
TGGTCGTACTCCTGAGATGTATAAAAGGGACCTTATTTGGGTTGTCGCGAAAATGCAGGTC
ATGGTTAACCGCTATCCTACTTGGGGTGACACGGTTGAAGTGAATACTTGGGTTGCCAAGT
CAGGGAAAAATGGTATGCGTCGTGATTGGCTCATAAGTGATTGCAATACAGGAGAAATTC
TTACTAGAGCATCAAGTGTGTGGGTCATGATGAATCAAAAGACAAGAAAATTGTCAAAGA
TTCCAGATGAGGTTCGGCATGAGATAGAGCCTCATTTTGTGGACTCTGCTCCCGTCATTGA
AGACGATGACTGGAAACTTCCCAAGCTGGATGAGAAAACTGCTGACTCCATCCGCAAGGG
TCTAACTCCGAAGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTACAT
TGGGTGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGTTCCCTT
ACCCTGGAATACAGGCGGGAATGCGGAAGGGAGAGTGTGCTGGAGTCCCTCACTGCTGTG
GACCCCTCTGGAAAGGGCTTTGGGCCCCAGTTTCAGCACCTTCTGAGGCTTGAGGATGGA
GGTGAGATCGTAAAGGGGAGAACTGAGTGGCGACCCAAGACTGCAGGTATCAATGGGAC
GATTGCATCTGGGGAGACCTCACCTGGAAACTCTTAG
SEQ ID NO: 72 Cuphea heterophylla (Cht) FATB3b (C67G, H72Q, L128F, N179I variant) coding DNA
sequence codon optimized for Prototheca moriformis ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCCCCGACACCTCCTCCCG
CCCCGGCAAGCTGGGCAACGGCTCCTCCTCCCTGCGCCCCCTGAAGCCCAAGTTCGTGGCC
AACGCCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCCTCC
GTGTCCCTGAAGTCCGGCTCCCTGAAGACCCAGGAGGACACCCCCTCCGCCCCCCCCCCCC
GCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGTGTT
CCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACATGCT
GGTGGACCCCTTCGGCTTCGGCCGCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAACTTC
TCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATGAAC
CACCTGCAGGAGACCGCCCTGAACCACGTGAAGTCCGCCGGCCTGCTGATCGAGGGCTTC
GGCCGCACCCCCGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAGGTG
ATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCCAAG
TCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGATC
CTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCAAGCTGTCCAAG
ATCCCCGACGAGGTGCGCCACGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTGATC
GAGGACGACGACTGGAAGCTGCCCAAGCTGGACGAGAAGACCGCCGACTCCATCCGCAA
GGGCCTGACCCCCAAGTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAAGT
ACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGCT
CCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACCG
CCGTGGACCCCTCCGGCAAGGGCTTCGGCCCCCAGTTCCAGCACCTGCTGCGCCTGGAGG
ACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGACCGCCGGCATCAAC
GGCACCATCGCCTCCGGCGAGACCTCCCCCGGCAACTCCTGA
SEQ ID NO: 73 Cuphea viscosissima (Cvis) FATB1 amino acid sequence MVAAAATSAFFPVPAPGTSPKPGKSGNWPSSLSPTFKPKSIPNGGFQVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKSIVRDGLVSRHSFSIRSYEIGADRTASIETLMNHLQETTINHCKSLGLHNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPTWGDTVEINTWFSQSGKIGMASDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDQKLRKFDVKTGDSIRKGLTPRWNDLD
VNQHVSNVKYIGWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAVDPSENGGRSQYK
HLLRLEDGTDIVKSRTEWRPKNAGTNGAISTSTAKTSNGNSVS
SEQ ID NO: 74 Cuphea viscosissima (Cvis) FATB1 coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCGCCGCCGCCACCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCTCCCCCA
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCACCTTCAAGCCCAAGTCCAT
CCCCAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTTCCTGAACCAGCTGCCCGACTGGTCCATGCTGCTGACCGCCATCACCACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGTCCATCGTGCGCGACGGCCTGGTGTCCCGCCACTC
CTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGATG
AACCACCTGCAGGAGACCACCATCAACCACTGCAAGTCCCTGGGCCTGCACAACGACGGC
TTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCAG
ATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTCC
CAGTCCGGCAAGATCGGCATGGCCTCCGACTGGCTGATCTCCGACTGCAACACCGGCGAG
ATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCCC
GCCTGCCCTACGAGGTGCGCCAGGAGCTGACCCCCCACTTCGTGGACTCCCCCCACGTGAT
CGAGGACAACGACCAGAAGCTGCGCAAGTTCGACGTGAAGACCGGCGACTCCATCCGCA
AGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAGT
ACATCGGCTGGATCCTGGAGTCCATGCCCATCGAGGTGCTGGAGACCCAGGAGCTGTGCT
CCCTGACCGTGGAGTACCGCCGCGAGTGCGGCATGGACTCCGTGCTGGAGTCCGTGACCG
CCGTGGACCCCTCCGAGAACGGCGGCCGCTCCCAGTACAAGCACCTGCTGCGCCTGGAGG
ACGGCACCGACATCGTGAAGTCCCGCACCGAGTGGCGCCCCAAGAACGCCGGCACCAAC
GGCGCCATCTCCACCTCCACCGCCAAGACCTCCAACGGCAACTCCGTGTCCTGA
SEQ ID NO: 75 Cuphea viscosissima (Cvis) FATB2 amino acid sequence MVATAASSAFFPVPSADTSSRPGKLGNGPSSFSPLKPKSIPNGGLQVKASASAPPKINGSSVGLK
SGGLKTHDDAPSAPPPRTFINQLPDWSMLLAAITTAFLAAEKQWMMLDRKPKRLDMLEDPFG
LGRVVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKTAGLSNDGFGRTPEMYK
RDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSVW
VMMNQKTRKLSKIPDEVRREIEPHFVDSAPVIEDDDRKLPKLDEKSADSIRKGLTPRWNDLDV
NQHVNNAKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGEGYGSQFQHL
LRLEDGGEIVKGRTEWRPKNAGINGVVPSEESSPGDYS
SEQ ID NO: 76 Cuphea viscosissima (Cvis) FATB2 coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCGCCGACACCTCCTCCC
GCCCCGGCAAGCTGGGCAACGGCCCCTCCTCCTTCTCCCCCCTGAAGCCCAAGTCCATCCC
CAACGGCGGCCTGCAGGTGAAGGCCTCCGCCTCCGCCCCCCCCAAGATCAACGGCTCCTC
CGTGGGCCTGAAGTCCGGCGGCCTGAAGACCCACGACGACGCCCCCTCCGCCCCCCCCCC
CCGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGCC
TTCCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACCGCAAGCCCAAGCGCCTGGACATG
CTGGAGGACCCCTTCGGCCTGGGCCGCGTGGTGCAGGACGGCCTGGTGTTCCGCCAGAAC
TTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATGA
ACCACCTGCAGGAGACCGCCCTGAACCACGTGAAGACCGCCGGCCTGTCCAACGACGGCT
TCGGCCGCACCCCCGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAGG
TGATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCCA
AGTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGA
TCCTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCAAGCTGTCCA
AGATCCCCGACGAGGTGCGCCGCGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTGA
TCGAGGACGACGACCGCAAGCTGCCCAAGCTGGACGAGAAGTCCGCCGACTCCATCCGCA
AGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGCCAAGT
ACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGCT
CCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACCG
CCGTGGACCCCTCCGGCGAGGGCTACGGCTCCCAGTTCCAGCACCTGCTGCGCCTGGAGG
ACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCATCAAC
GGCGTGGTGCCCTCCGAGGAGTCCTCCCCCGGCGACTACTCCTGA
SEQ ID NO: 77 Cuphea viscosissima (Cvis) FATB3 amino acid sequence MVAAAASSAFFSFPTPGTSPKPGKFGNWPSSLSIPFNPKSNHNGGIQVKANASAHPKANGSAVS
LKAGSLETQEDTSSPSPPPRTFISQLPDWSMLVSAITTVFVAAEKQWTMLDRKSKRPDVLVEPF
VQDGVSFRQSFSIRSYEIGVDRTASIETLMNIFQETSLNHCKSLGLLNDGFGRTPEMCKRDLIW
VVTKMQIEVNRYPTWGDTIEVTTWVSESGKNGMSRDWLISDCHSGEILIRATSVWAMMNQK
TRRLSKIPDEVRQEIVPYFVDSAPVIEDDRKLHKLDVKTGDSIRNGLTPRWNDFDVNQHVNNV
KYIAWLLKSVPTEVFETQELCGLTLEYRRECRRDSVLESVTAMDPSKEGDRSLYQHLLRLENG
ADIALGRTEWRPKNAGATGAVSTGKTSNGNSVS
SEQ ID NO: 78 Cuphea viscosissima (Cvis) FATB3 coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCTCCTTCCCCACCCCCGGCACCTCCCCCAA
GCCCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCATCCCCTTCAACCCCAAGTCCAAC
CACAACGGCGGCATCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTCC
GCCGTGTCCCTGAAGGCCGGCTCCCTGGAGACCCAGGAGGACACCTCCTCCCCCTCCCCCC
CCCCCCGCACCTTCATCTCCCAGCTGCCCGACTGGTCCATGCTGGTGTCCGCCATCACCAC
CGTGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGA
CGTGCTGGTGGAGCCCTTCGTGCAGGACGGCGTGTCCTTCCGCCAGTCCTTCTCCATCCGC
TCCTACGAGATCGGCGTGGACCGCACCGCCTCCATCGAGACCCTGATGAACATCTTCCAG
GAGACCTCCCTGAACCACTGCAAGTCCCTGGGCCTGCTGAACGACGGCTTCGGCCGCACC
CCCGAGATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGATGCAGATCGAGGTGAAC
CGCTACCCCACCTGGGGCGACACCATCGAGGTGACCACCTGGGTGTCCGAGTCCGGCAAG
AACGGCATGTCCCGCGACTGGCTGATCTCCGACTGCCACTCCGGCGAGATCCTGATCCGC
GCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCCTGTCCAAGATCCCCGAC
GAGGTGCGCCAGGAGATCGTGCCCTACTTCGTGGACTCCGCCCCCGTGATCGAGGACGAC
CGCAAGCTGCACAAGCTGGACGTGAAGACCGGCGACTCCATCCGCAACGGCCTGACCCCC
CGCTGGAACGACTTCGACGTGAACCAGCACGTGAACAACGTGAAGTACATCGCCTGGCTG
CTGAAGTCCGTGCCCACCGAGGTGTTCGAGACCCAGGAGCTGTGCGGCCTGACCCTGGAG
TACCGCCGCGAGTGCCGCCGCGACTCCGTGCTGGAGTCCGTGACCGCCATGGACCCCTCC
AAGGAGGGCGACCGCTCCCTGTACCAGCACCTGCTGCGCCTGGAGAACGGCGCCGACATC
GCCCTGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCGCCACCGGCGCCGTGTCCACC
GGCAAGACCTCCAACGGCAACTCCGTGTCCTGA
SEQ ID NO: 79 Cuphea calcarata (Ccalc) FATB1 amino acid sequence MVAASASSAFFSVPTPGTSPKPGKFGNWPSSLSVPFKPRSNNSGGFQVKANASAHPKANGSAV
SLKSGSLETQEDNSSSSRPPRTFIKQLPDWSMLLSAITTVFVAAEKQWTMFDRKSKRSDMLVD
PFVVDRIVQDGVLFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSMGLLYEGFGRTPEMC
KRDLIWVVTKIHIKVNRYPTWGDTIEVTTWVSESGKNGMGRDWLISDCHTGEILIRATSVWA
MMNQTTRRLSKFPYELRQEIAPHFVDSDPVIEDNRRLLNFDVKTGDSIRKGLTPRWNDLDVNQ
HVNNVKYIGWILESVPTEVFDTRELCGLTLEYRQECGRGSVLESVTAMDPSKEGDRSLYQHLL
RLEDGTDIVKGRTEWRPKNAGTNGPVSTRKTTNGSSVS
SEQ ID NO: 80 Cuphea calcarata (Ccalc) FATB1 coding DNA sequence ATGGTGGCTGCTTCAGCAAGTTCTGCATTCTTCTCCGTCCCAACCCCGGGAACCTCTCCTA
AACCCGGGAAGTTCGGCAATTGGCCATCGAGCTTGAGCGTCCCATTCAAGCCCAGATCAA
ACAACAGTGGCGGCTTTCAGGTTAAGGCAAACGCCAGTGCTCATCCTAAGGCTAACGGTT
CTGCAGTAAGTCTAAAGTCTGGGAGCCTCGAGACTCAGGAGGACAATTCGTCGTCGTCTC
GTCCTCCTCGGACTTTCATTAAACAGTTGCCGGACTGGAGTATGCTTCTTTCCGCGATCAC
AACCGTCTTCGTGGCGGCTGAGAAGCAGTGGACGATGTTTGATCGGAAATCTAAGAGGTC
TGACATGCTCGTGGACCCGTTTGTGGTTGACAGGATTGTTCAGGATGGGGTTCTGTTCAGA
CAGAGTTTTTCGATTAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATTGAGACGC
TGATGAACATCTTCCAGGAAACATCTCTCAATCATTGTAAGAGTATGGGTCTTCTCTATGA
AGGCTTTGGTCGTACTCCTGAGATGTGTAAGAGGGACCTCATTTGGGTGGTTACGAAAAT
ACATATCAAGGTGAATCGCTATCCGACTTGGGGTGATACTATCGAGGTCACTACTTGGGTC
TCCGAGTCGGGCAAAAACGGTATGGGTCGCGATTGGCTGATAAGTGATTGCCATACAGGA
GAAATTCTTATAAGAGCAACGAGTGTGTGGGCTATGATGAATCAAACGACGAGAAGATTG
TCGAAATTTCCATATGAGCTTCGACAGGAGATAGCGCCACATTTTGTGGACTCGGATCCTG
TCATTGAAGACAATCGAAGATTGCTCAACTTTGATGTGAAGACGGGTGATTCCATTCGCA
AGGGTCTAACTCCAAGGTGGAATGACTTGGATGTCAATCAGCACGTTAACAATGTGAAGT
ACATTGGGTGGATTCTCGAGAGTGTTCCAACAGAAGTTTTCGATACCCGGGAGCTATGCG
GCCTCACCCTTGAGTATAGGCAGGAATGCGGAAGAGGAAGTGTGCTGGAGTCCGTGACCG
CTATGGATCCCTCAAAAGAGGGAGACCGGTCTCTGTACCAGCACCTTCTTCGGCTTGAGG
ATGGGACTGATATCGTGAAGGGCAGAACCGAGTGGCGGCCAAAGAATGCAGGAACCAAT
GGGCCAGTATCAACAAGAAAGACTACAAATGGAAGCTCAGTCTCTTAG
SEQ ID NO: 81 Cuphea calcarata (Ccalc) FATB1 coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCGCCTCCGCCTCCTCCGCCTTCTTCTCCGTGCCCACCCCCGGCACCTCCCCCA
AGCCCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCGTGCCCTTCAAGCCCCGCTCCAA
CAACTCCGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGTCCCTGAAGTCCGGCTCCCTGGAGACCCAGGAGGACAACTCCTCCTCCTCCCGC
CCCCCCCGCACCTTCATCAAGCAGCTGCCCGACTGGTCCATGCTGCTGTCCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGTGGACCATGTTCGACCGCAAGTCCAAGCGCTCCG
ACATGCTGGTGGACCCCTTCGTGGTGGACCGCATCGTGCAGGACGGCGTGCTGTTCCGCC
AGTCCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACATCTTCCAGGAGACCTCCCTGAACCACTGCAAGTCCATGGGCCTGCTGTACGA
GGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGAT
CCACATCAAGGTGAACCGCTACCCCACCTGGGGCGACACCATCGAGGTGACCACCTGGGT
GTCCGAGTCCGGCAAGAACGGCATGGGCCGCGACTGGCTGATCTCCGACTGCCACACCGG
CGAGATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGACCACCCGCCGCCT
GTCCAAGTTCCCCTACGAGCTGCGCCAGGAGATCGCCCCCCACTTCGTGGACTCCGACCCC
GTGATCGAGGACAACCGCCGCCTGCTGAACTTCGACGTGAAGACCGGCGACTCCATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAA
GTACATCGGCTGGATCCTGGAGTCCGTGCCCACCGAGGTGTTCGACACCCGCGAGCTGTG
CGGCCTGACCCTGGAGTACCGCCAGGAGTGCGGCCGCGGCTCCGTGCTGGAGTCCGTGAC
CGCCATGGACCCCTCCAAGGAGGGCGACCGCTCCCTGTACCAGCACCTGCTGCGCCTGGA
GGACGGCACCGACATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCACCA
ACGGCCCCGTGTCCACCCGCAAGACCACCAACGGCTCCTCCGTGTCCTGA
SEQ ID NO: 82 Cuphea painteri (Cpai) FATB1 amino acid sequence MVAAAATSAFFPVPAPGTSPNPRKFGSWPSSLSPSLPKSIPNGGFQVKANASAHPKANGSAVSL
KSGSLNTQENTSSSPPPRTFLHQLPDWSRLLTAITTVFVKSKRPDMHDRKSKRPDMLVDLFGLE
SSVQDALVFRQSFSIRSYEIGTDRTASIETLMNHLQETSLNHCKSTGILLDGFGRTLEMCKRELI
WVVIKMQIQVNRYPAWGDTVEINTRFSRLGKIGMGRDWLISDCNTGEILIRATSEYAMMNQK
TRRLSKLPYEVHQEIAPLFVDSPPVIEDNDLKVHKFEVKTGDSIQKGLSPGWNDLDVNQHVSN
VKYIGWILESMPTEVLETQELCSLALEYRRECGRDSVLESVTAMDPSKVGGRSQYQHLLRLED
GTAIVNGITEWRPKNAGANGAISTGKTSNGNSVS
SEQ ID NO: 83 Cuphea painteri (Cpai) FATB1 coding DNA sequence ATGGTGGCTGCTGCAGCAACTTCTGCATTCTTCCCTGTTCCAGCCCCGGGAACCTCCCCAA
ATCCCAGGAAATTCGGAAGTTGGCCATCGAGCTTGAGCCCTTCCTTGCCCAAGTCAATCCC
CAATGGCGGATTTCAGGTAAAGGCAAATGCCAGTGCCCATCCGAAGGCTAACGGTTCTGC
AGTTAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGAACACTTCGTCGTCCCCTCCTCCT
CGGACTTTCCTTCACCAGTTGCCTGATTGGAGTAGGCTTCTGACTGCAATCACGACCGTGT
TCGTGAAATCTAAGAGGCCTGACATGCATGATCGGAAATCTAAGAGGCCTGACATGCTGG
TGGACTTGTTTGGGTTGGAAAGTAGTGTTCAGGATGCGCTCGTGTTCAGACAGAGTTTTTC
GATTAGGTCTTATGAAATAGGCACTGATCGAACAGCCTCTATAGAGACGCTGATGAACCA
CTTGCAGGAAACATCTCTCAATCATTGTAAAAGTACCGGTATTCTCCTTGACGGCTTCGGT
CGTACTCTTGAGATGTGTAAAAGGGAACTCATTTGGGTGGTAATAAAAATGCAAATTCAG
GTGAATCGCTATCCAGCATGGGGCGATACTGTCGAGATCAATACCCGGTTCTCCCGGTTGG
GGAAAATTGGTATGGGTCGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATTCTAA
TAAGAGCAACGAGCGAGTATGCCATGATGAATCAAAAGACGAGAAGACTCTCAAAACTT
CCATACGAGGTTCACCAGGAGATAGCGCCTCTTTTTGTCGACTCTCCTCCTGTGATTGAAG
ACAATGATCTGAAAGTGCATAAATTTGAAGTGAAGACTGGTGATTCCATTCAAAAGGGTC
TATCCCCGGGGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAAGTACATTG
GGTGGATTCTCGAGAGTATGCCAACAGAAGTTTTGGAGACCCAGGAGCTATGCTCTCTCG
CCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCCGTGACCGCAATGG
ATCCCTCAAAAGTTGGAGGCCGTTCTCAGTACCAGCACCTTCTGCGGCTTGAGGATGGGA
CTGCTATCGTGAACGGCATAACTGAGTGGCGGCCGAAGAATGCAGGAGCTAATGGGGCG
ATATCAACGGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG
SEQ ID NO: 84 Cuphea painteri (Cpai) FATB1 coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCGCCGCCGCCACCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCTCCCCCA
ACCCCCGCAAGTTCGGCTCCTGGCCCTCCTCCCTGTCCCCCTCCCTGCCCAAGTCCATCCCC
AACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTCCGCC
GTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGAACACCTCCTCCTCCCCCCCCCCCC
GCACCTTCCTGCACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCACCACCGTGTT
CGTGAAGTCCAAGCGCCCCGACATGCACGACCGCAAGTCCAAGCGCCCCGACATGCTGGT
GGACCTGTTCGGCCTGGAGTCCTCCGTGCAGGACGCCCTGGTGTTCCGCCAGTCCTTCTCC
ATCCGCTCCTACGAGATCGGCACCGACCGCACCGCCTCCATCGAGACCCTGATGAACCAC
CTGCAGGAGACCTCCCTGAACCACTGCAAGTCCACCGGCATCCTGCTGGACGGCTTCGGC
CGCACCCTGGAGATGTGCAAGCGCGAGCTGATCTGGGTGGTGATCAAGATGCAGATCCAG
GTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGATCAACACCCGCTTCTCCCGCCTG
GGCAAGATCGGCATGGGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGATCCTG
ATCCGCGCCACCTCCGAGTACGCCATGATGAACCAGAAGACCCGCCGCCTGTCCAAGCTG
CCCTACGAGGTGCACCAGGAGATCGCCCCCCTGTTCGTGGACTCCCCCCCCGTGATCGAG
GACAACGACCTGAAGGTGCACAAGTTCGAGGTGAAGACCGGCGACTCCATCCAGAAGGG
CCTGTCCCCCGGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAGTACAT
CGGCTGGATCCTGGAGTCCATGCCCACCGAGGTGCTGGAGACCCAGGAGCTGTGCTCCCT
GGCCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTGACCGCCAT
GGACCCCTCCAAGGTGGGCGGCCGCTCCCAGTACCAGCACCTGCTGCGCCTGGAGGACGG
CACCGCCATCGTGAACGGCATCACCGAGTGGCGCCCCAAGAACGCCGGCGCCAACGGCGC
CATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA
SEQ ID NO: 85 Cuphea hookeriana (Chook) FATB4 amino acid sequence MVAAAATSAFFPVPAPGTSPNPRKFGSWPSSLSPSLPNSIPNGGFQVKANASAHPKANGSAVSL
KSGSLNTQENTSSSPPPRTFLHQLPDWSRLLTAITTVFVKSKRPDMHDRKSKRPDMLVDLFGLE
SSVQDALVFRQRFSIRSYEIGTDRTASMETLMNHLQETSLNHCKSTGILLDGFGRTLEMCKREL
IWVVIKMQIQVNRYPAWGDTVEINTRFSRLGKIGMGRDWLISDCNTGEILIRATSEYAMMNQK
TRRLSKLPYEVRQEIAPLFVDSPPVIEDNDLKVHKFEVKTGDSIHKGLTPGWNDLDVNQHVNN
VKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAMDPSGGGYGSQFQHLLRLEDG
GEIVKGRTEWRPKNGVINGVVPTGESSPGDYS
SEQ ID NO: 86 Cuphea hookeriana (Chook) FATB4 coding DNA sequence ATGGTGGCTGCTGCAGCAACTTCTGCATTCTTCCCTGTTCCAGCCCCGGGAACCTCCCCTA
ATCCCAGGAAATTCGGAAGTTGGCCATCGAGCTTGAGCCCTTCCTTGCCCAACTCAATCCC
CAATGGCGGATTTCAGGTAAAGGCAAATGCCAGTGCCCATCCGAAGGCTAACGGTTCTGC
AGTTAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGAACACTTCGTCGTCCCCTCCTCCT
CGGACTTTCCTTCACCAGTTGCCTGATTGGAGTAGGCTTCTGACTGCAATCACGACCGTGT
TCGTGAAATCTAAGAGGCCTGACATGCATGATCGGAAATCTAAGAGGCCTGACATGCTGG
TGGACTTGTTTGGGTTGGAGAGTAGTGTTCAGGATGCGCTCGTGTTCAGACAGAGATTTTC
GATTAGGTCTTATGAAATAGGCACTGATCGAACAGCCTCTATGGAGACGCTGATGAACCA
CTTGCAGGAAACATCTCTCAATCATTGTAAAAGTACCGGTATTCTCCTTGACGGCTTCGGT
CGTACTCTTGAGATGTGTAAAAGGGAACTCATTTGGGTGGTAATAAAAATGCAGATTCAG
GTGAATCGCTATCCAGCATGGGGCGATACTGTCGAGATCAATACCCGGTTCTCCCGGTTGG
GGAAAATTGGTATGGGTCGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATTCTTA
TAAGAGCAACGAGCGAGTATGCCATGATGAATCAAAAGACGAGAAGACTCTCAAAACTT
CCATACGAGGTTCGCCAGGAGATAGCGCCTCTTTTTGTCGACTCTCCTCCTGTGATTGAAG
ACAATGATCTGAAAGTGCATAAATTTGAAGTGAAGACTGGTGATTCCATTCACAAGGGTC
TAACTCCGGGGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTACATCG
GGTGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGTTCCCTTAC
TCTGGAATACAGGCGGGAATGTGGAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTATGGA
TCCCTCTGGAGGGGGTTATGGGTCCCAGTTTCAGCACCTTCTGCGGCTTGAGGATGGAGGT
GAGATCGTGAAGGGGAGAACCGAGTGGCGACCCAAGAATGGTGTAATCAATGGGGTGGT
ACCAACCGGGGAGTCCTCACCTGGAGACTACTCTTAG
SEQ ID NO: 87 Cuphea hookeriana (Chook) FATB4 coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCGCCGCCGCCACCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCTCCCCCA
ACCCCCGCAAGTTCGGCTCCTGGCCCTCCTCCCTGTCCCCCTCCCTGCCCAACTCCATCCCC
AACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTCCGCC
GTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGAACACCTCCTCCTCCCCCCCCCCCC
GCACCTTCCTGCACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCACCACCGTGTT
CGTGAAGTCCAAGCGCCCCGACATGCACGACCGCAAGTCCAAGCGCCCCGACATGCTGGT
GGACCTGTTCGGCCTGGAGTCCTCCGTGCAGGACGCCCTGGTGTTCCGCCAGCGCTTCTCC
ATCCGCTCCTACGAGATCGGCACCGACCGCACCGCCTCCATGGAGACCCTGATGAACCAC
CTGCAGGAGACCTCCCTGAACCACTGCAAGTCCACCGGCATCCTGCTGGACGGCTTCGGC
CGCACCCTGGAGATGTGCAAGCGCGAGCTGATCTGGGTGGTGATCAAGATGCAGATCCAG
GTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGATCAACACCCGCTTCTCCCGCCTG
GGCAAGATCGGCATGGGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGATCCTG
ATCCGCGCCACCTCCGAGTACGCCATGATGAACCAGAAGACCCGCCGCCTGTCCAAGCTG
CCCTACGAGGTGCGCCAGGAGATCGCCCCCCTGTTCGTGGACTCCCCCCCCGTGATCGAG
GACAACGACCTGAAGGTGCACAAGTTCGAGGTGAAGACCGGCGACTCCATCCACAAGGG
CCTGACCCCCGGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAAGTACAT
CGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGCTCCCT
GACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACCGCCAT
GGACCCCTCCGGCGGCGGCTACGGCTCCCAGTTCCAGCACCTGCTGCGCCTGGAGGACGG
CGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGGCGTGATCAACGGCG
TGGTGCCCACCGGCGAGTCCTCCCCCGGCGACTACTCCTGA
SEQ ID NO: 88 Cuphea avigera var. pulcherrima (Ca) FATB1 amino acid sequence MVAAAASSAFFSVPVPGTSPKPGKFRIWPSSLSPSFKPKPIPNGGLQVKANSRAHPKANGSAVS
LKSGSLNTQEDTSSSPPPRTFLHQLPDWSRLLTAITTVFVKSKRPDMHDRKSKRPDMLMDSFG
LESIVQEGLEFRQSFSIRSYEIGTDRTASIETLMNYLQETSLNHCKSTGILLDGFGRTPEMCKRDL
IWVVTKMKIKVNRYPAWGDTVEINTWFSRLGKIGKGRDWLISDCNTGEILIRATSAYATMNQ
KTRRLSKLPYEVHQEIAPLFVDSPPVIEDNDLKLHKFEVKTGDSIHKGLTPGWNDLDVNQHVS
NVKYIGWILESMPTEVLETQELCSLALEYRRECGRDSVLESVTAMDPTKVGGRSQYQHLLRLE
DGTDIVKCRTEWRPKNPGANGAISTGKTSNGNSVS
SEQ ID NO: 89 Cuphea avigera var. pulcherrima (Ca) FATB1 coding DNA sequence ATGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCTCTGTTCCAGTCCCGGGAACCTCTCCTA
AACCCGGGAAGTTCAGAATTTGGCCATCGAGCTTGAGCCCTTCCTTCAAGCCCAAGCCGA
TCCCCAATGGTGGATTGCAGGTTAAGGCAAATTCCAGGGCACATCCGAAGGCTAACGGTT
CTGCAGTTAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCTCC
TCCTCGGACTTTCCTTCACCAGTTGCCTGATTGGAGTAGGCTTCTGACTGCAATCACGACC
GTGTTCGTGAAATCTAAGAGGCCTGACATGCATGATCGGAAATCTAAGAGGCCTGACATG
CTGATGGACTCGTTTGGGTTGGAGAGTATTGTTCAAGAAGGGCTCGAGTTCAGACAGAGT
TTTTCGATTAGGTCTTATGAAATAGGCACTGATCGAACAGCCTCTATAGAGACGCTGATGA
ACTACTTGCAGGAAACATCTCTCAATCATTGTAAGAGTACCGGTATTCTCCTTGACGGCTT
TGGTCGTACTCCTGAGATGTGTAAAAGGGACCTCATTTGGGTGGTAACAAAAATGAAGAT
CAAGGTGAATCGCTATCCAGCTTGGGGCGATACTGTCGAGATCAATACCTGGTTCTCCCGG
TTGGGGAAAATCGGAAAGGGTCGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATT
CTTATAAGAGCAACGAGCGCGTATGCCACGATGAATCAAAAGACGAGAAGACTCTCAAA
ACTTCCATACGAGGTTCACCAGGAGATAGCGCCTCTCTTTGTCGACTCTCCTCCTGTCATT
GAAGACAATGATCTGAAATTGCATAAGTTTGAAGTGAAGACTGGTGATTCCATTCACAAG
GGTCTAACTCCGGGGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAAGTAC
ATTGGGTGGATTCTCGAGAGTATGCCAACAGAAGTTTTGGAGACCCAGGAGCTATGCTCT
CTCGCCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTAGAGTCCGTGACAGCT
ATGGATCCCACAAAAGTTGGAGGCCGGTCTCAGTACCAGCACCTTCTGCGACTTGAGGAT
GGGACTGATATCGTGAAGTGCAGAACTGAGTGGCGGCCGAAGAATCCAGGAGCTAATGG
GGCAATATCAACGGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG
SEQ ID NO: 90 Cuphea avigera var. pulcherrima (Ca) FATB1 coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCTCCGTGCCCGTGCCCGGCACCTCCCCCA
AGCCCGGCAAGTTCCGCATCTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGCCCAAGCCCAT
CCCCAACGGCGGCCTGCAGGTGAAGGCCAACTCCCGCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCACCTTCCTGCACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCACCACCG
TGTTCGTGAAGTCCAAGCGCCCCGACATGCACGACCGCAAGTCCAAGCGCCCCGACATGC
TGATGGACTCCTTCGGCCTGGAGTCCATCGTGCAGGAGGGCCTGGAGTTCCGCCAGTCCTT
CTCCATCCGCTCCTACGAGATCGGCACCGACCGCACCGCCTCCATCGAGACCCTGATGAA
CTACCTGCAGGAGACCTCCCTGAACCACTGCAAGTCCACCGGCATCCTGCTGGACGGCTTC
GGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGATGAAGATC
AAGGTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTCCCGC
CTGGGCAAGATCGGCAAGGGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGATC
CTGATCCGCGCCACCTCCGCCTACGCCACCATGAACCAGAAGACCCGCCGCCTGTCCAAG
CTGCCCTACGAGGTGCACCAGGAGATCGCCCCCCTGTTCGTGGACTCCCCCCCCGTGATCG
AGGACAACGACCTGAAGCTGCACAAGTTCGAGGTGAAGACCGGCGACTCCATCCACAAG
GGCCTGACCCCCGGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAGTAC
ATCGGCTGGATCCTGGAGTCCATGCCCACCGAGGTGCTGGAGACCCAGGAGCTGTGCTCC
CTGGCCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTGACCGCC
ATGGACCCCACCAAGGTGGGCGGCCGCTCCCAGTACCAGCACCTGCTGCGCCTGGAGGAC
GGCACCGACATCGTGAAGTGCCGCACCGAGTGGCGCCCCAAGAACCCCGGCGCCAACGG
CGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCC
SEQ ID NO: 91 Cuphea paucipetala (Cpau) FATB1 amino acid sequence MVAAAASSAFFPVPAPGTSPKPGKSGNWPSSLSPSIKPMSIPNGGFQVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLTAITTVFVAAEKQWTMRDRKSKRPDMLVD
SVGLKSVVLDGLVSRQIFSIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGMC
KNDLIWVLTKMQIMVNRYPTWGDTVEINTWFSHSGKIGMASDWLITDCNTGEILIRATSVWA
MMNQKTRRFSRLPYEVRQELTPHYVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLDV
NQHVSNVKYIGWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAMDPSEDEGRSQYKH
LLRLEDGTDIVKGRTEWRPKNAGTNGAISTAKPSNGNSVS
SEQ ID NO: 92 Cuphea paucipetala (Cpau) FATB1 coding DNA sequence ATGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGTTCCAGCCCCCGGAACCTCCCCTA
AACCCGGGAAGTCCGGCAACTGGCCATCAAGCTTGAGCCCTTCCATCAAGCCCATGTCAA
TCCCCAATGGCGGATTTCAGGTTAAGGCAAATGCCAGTGCCCATCCTAAGGCTAACGGTT
CTGCAGTAAATCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCTCC
TCCTCGGGCTTTCCTTAACCAGTTGCCTGATTGGAGTATGCTTCTGACTGCAATCACGACC
GTCTTCGTGGCGGCAGAGAAGCAGTGGACTATGCGTGATCGGAAATCTAAGAGGCCTGAC
ATGCTCGTGGACTCGGTTGGGTTGAAGAGTGTTGTTCTGGATGGGCTCGTGTCCAGACAGA
TTTTTTCGATTAGGTCTTATGAAATAGGCGCTGATCGAACTGCCTCTATAGAGACGCTGAT
GAACCACTTGCAGGAAACATCTATCAATCATTGTAAGAGTTTGGGTCTTCTCAATGACGGC
TTTGGTCGTACTCCTGGGATGTGTAAAAATGACCTCATTTGGGTGCTTACAAAAATGCAGA
TCATGGTGAATCGCTACCCAACTTGGGGCGATACTGTTGAGATCAATACCTGGTTCTCCCA
TTCGGGGAAAATTGGTATGGCTAGCGATTGGCTAATAACTGATTGCAACACAGGAGAAAT
TCTTATAAGAGCAACGAGCGTGTGGGCCATGATGAATCAAAAGACGAGAAGATTCTCAAG
ACTTCCATACGAGGTTCGCCAGGAGTTAACGCCTCATTATGTGGACTCTCCTCATGTCATT
GAAGATAATGATCGGAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTCGTAAG
GGTCTAACTCCGAGGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAAGTAC
ATTGGGTGGATTCTCGAGAGTATGCCAATAGAAGTTTTGGAGACCCAGGAGCTATGCTCT
CTCACCGTTGAATATAGGCGGGAATGCGGAATGGACAGTGTGCTGGAGTCCGTGACTGCT
ATGGATCCCTCAGAAGATGAAGGCCGGTCTCAGTACAAGCACCTTCTGCGGCTTGAGGAT
GGGACTGACATCGTGAAGGGCAGAACTGAGTGGCGACCGAAGAATGCAGGAACTAACGG
GGCGATATCAACAGCAAAGCCTTCAAATGGAAACTCGGTCTCTTAG
SEQ ID NO: 93 Cuphea paucipetala (Cpau) FATB1 coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCTCCCCCA
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCATCAAGCCCATGTCCAT
CCCCAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTTCCTGAACCAGCTGCCCGACTGGTCCATGCTGCTGACCGCCATCACCACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCGCGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGTCCGTGGTGCTGGACGGCCTGGTGTCCCGCCAGA
TCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGAT
GAACCACCTGCAGGAGACCTCCATCAACCACTGCAAGTCCCTGGGCCTGCTGAACGACGG
CTTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCA
GATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTC
CCACTCCGGCAAGATCGGCATGGCCTCCGACTGGCTGATCACCGACTGCAACACCGGCGA
GATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCC
CGCCTGCCCTACGAGGTGCGCCAGGAGCTGACCCCCCACTACGTGGACTCCCCCCACGTG
ATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCATGCCCATCGAGGTGCTGGAGACCCAGGAGCTGTGC
TCCCTGACCGTGGAGTACCGCCGCGAGTGCGGCATGGACTCCGTGCTGGAGTCCGTGACC
GCCATGGACCCCTCCGAGGACGAGGGCCGCTCCCAGTACAAGCACCTGCTGCGCCTGGAG
GACGGCACCGACATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCACCAA
CGGCGCCATCTCCACCGCCAAGCCCTCCAACGGCAACTCCGTGTCCTGA
SEQ ID NO: 94 Cuphea procumbens (Cproc) FATB1 amino acid sequence MVAAAASSAFFPAPAPGSSPKPGKSGNWPSSLSPSFKSKSIPYGRFQVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKNIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPAWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVSNVKYIGWILESMPIEVLEAQELCSLTVEYRRECGMDSVLESVTAVDPSEDGGRSQYN
HLLRLEDGTDVVKGRTEWRPKNAETNGAISPGNTSNGNSIS

SEQ ID NO: 95 Cuphea procumbens (Cproc) FATB1 coding DNA sequence ATGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGCTCCAGCCCCGGGATCCTCACCTA
AACCCGGGAAGTCCGGTAATTGGCCATCGAGCTTGAGCCCTTCCTTCAAGTCCAAGTCAAT
CCCCTATGGCCGATTTCAGGTTAAGGCAAATGCCAGTGCCCATCCTAAGGCTAACGGTTCT
GCAGTAAATCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCTCCTC
CTCGGGCTTTCCTTAACCAGTTGCCTGATTGGAGTATGCTTCTGTCTGCAATCACGACTGT
ATTCGTGGCGGCAGAGAAGCAGTGGACTATGCTTGATCGGAAATCTAAGAGGCCTGACAT
GCTTGTGGACTCGGTTGGGTTGAAGAATATTGTTCGGGATGGGCTCGTGTCCAGACAGAG
TTTTTTGATTAGATCTTATGAAATAGGCGCTGATCGAACAGCTTCTATAGAGACACTGATG
AACCACTTGCAGGAAACATCTATCAATCATTGTAAGAGTTTGGGTCTTCTCAATGACGGCT
TTGGTCGTACTCCTGGGATGTGTAAAAACGACCTCATTTGGGTGCTTACTAAAATGCAGAT
CATGGTGAATCGCTACCCAGCTTGGGGCGATACTGTTGAGATCAATACCTGGTTCTCCCAG
TCGGGGAAAATCGGTATGGGTAGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATT
CTTATAAGAGCAACGAGCGTGTGGGCCATGATGAATCAAAAAACGAGAAGATTCTCAAG
ACTTCCATACGAGGTTCGCCAGGAGTTAACGCCTCATTTTGTGGACTCTCCTCATGTCATT
GAAGACAATGATCGGAAATTGCATAAGTTCGATGTGAAGACTGGTGATTCTATTCGCAAG
GGTCTAACTCCGAGGTGGAATGACTTGGATGTCAATCAGCACGTGAGCAACGTGAAGTAC
ATTGGGTGGATTCTCGAGAGTATGCCAATAGAAGTTTTGGAGGCCCAGGAACTATGCTCT
CTCACCGTTGAATATAGGCGGGAATGCGGAATGGACAGTGTGCTGGAGTCCGTGACTGCT
GTAGATCCCTCAGAAGATGGAGGCCGGTCTCAGTACAATCACCTTCTGCGGCTTGAGGAT
GGGACTGATGTCGTGAAGGGCAGAACTGAGTGGCGACCGAAGAATGCAGAAACTAACGG
GGCGATATCACCAGGAAACACTTCAAATGGAAACTCGATCTCCTAG
SEQ ID NO: 96 Cuphea procumbens (Cproc) FATB1 coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGCCCCCGCCCCCGGCTCCTCCCCCA
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGTCCAAGTCCAT
CCCCTACGGCCGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTTCCTGAACCAGCTGCCCGACTGGTCCATGCTGCTGTCCGCCATCACCACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGAACATCGTGCGCGACGGCCTGGTGTCCCGCCAGT
CCTTCCTGATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGAT
GAACCACCTGCAGGAGACCTCCATCAACCACTGCAAGTCCCTGGGCCTGCTGAACGACGG
CTTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCA
GATCATGGTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTC
CCAGTCCGGCAAGATCGGCATGGGCTCCGACTGGCTGATCTCCGACTGCAACACCGGCGA
GATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCC
CGCCTGCCCTACGAGGTGCGCCAGGAGCTGACCCCCCACTTCGTGGACTCCCCCCACGTG
ATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCATGCCCATCGAGGTGCTGGAGGCCCAGGAGCTGTGC
TCCCTGACCGTGGAGTACCGCCGCGAGTGCGGCATGGACTCCGTGCTGGAGTCCGTGACC
GCCGTGGACCCCTCCGAGGACGGCGGCCGCTCCCAGTACAACCACCTGCTGCGCCTGGAG
GACGGCACCGACGTGGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGAGACCAA
CGGCGCCATCTCCCCCGGCAACACCTCCAACGGCAACTCCATCTCCTGA
SEQ ID NO: 97 Cuphea procumbens (Cproc) FATB2 amino acid sequence MVAAAASSAFFPAPAPGSSPKPGKSGNWPSSLSPSFKSKSIPYGRFQVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKNIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPAWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRQECGRESVLESLTAVDPSGKGFGSQFQH
LLRLEDGGEIVKGRTEWRPKTAGINGAIASGETSPGDF
SEQ ID NO: 98 Cuphea procumbens (Cproc) FATB2 coding DNA sequence ATGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGCTCCAGCCCCGGGATCCTCACCTA
AACCCGGGAAGTCCGGTAATTGGCCATCGAGCTTGAGCCCTTCCTTCAAGTCCAAGTCAAT
CCCCTATGGCCGATTTCAGGTTAAGGCAAATGCCAGTGCCCATCCTAAGGCTAACGGTTCT
GCAGTAAATCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCTCCTC
CTCGGGCTTTCCTTAACCAGTTGCCTGATTGGAGTATGCTTCTGTCTGCAATCACGACTGT
ATTCGTGGCGGCAGAGAAGCAGTGGACTATGCTTGATCGGAAATCTAAGAGGCCTGACAT
GCTTGTGGACTCGGTTGGGTTGAAGAATATTGTTCGGGATGGGCTCGTGTCCAGACAGAG
TTTTTTGATTAGATCTTATGAAATAGGCGCTGATCGAACAGCTTCTATAGAGACACTGATG
AACCACTTGCAGGAAACATCTATCAATCATTGTAAGAGTTTGGGTCTTCTCAATGACGGCT
TTGGTCGTACTCCTGGGATGTGTAAAAACGACCTCATTTGGGTGCTTACTAAAATGCAGAT
CATGGTGAATCGCTACCCAGCTTGGGGCGATACTGTTGAGATCAATACCTGGTTCTCCCAG
TCGGGGAAAATCGGTATGGGTAGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATT
CTTATAAGAGCAACGAGCGTGTGGGCCATGATGAATCAAAAAACGAGAAGATTCTCAAG
ACTTCCATACGAGGTTCGCCAGGAGTTAACGCCTCATTTTGTGGACTCTCCTCATGTCATT
GAAGACAATGATCGGAAATTGCATAAGTTCGATGTGAAGACTGGTGATTCTATTCGCAAG
GGTCTAACTCCGAGGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTAC
ATCGGGTGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGTTCCC
TTACCCTGGAATACAGGCAGGAATGCGGAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTG
TGGACCCCTCTGGAAAGGGCTTTGGGTCCCAGTTCCAACACCTTCTGAGGCTTGAGGATGG
AGGTGAGATCGTGAAGGGGAGAACTGAGTGGCGACCCAAGACTGCAGGTATCAATGGGG
CGATAGCATCCGGGGAGACCTCACCTGGAGACTTTTAG
SEQ ID NO: 99 Cuphea procumbens (Cproc) FATB2 coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGCCCCCGCCCCCGGCTCCTCCCCCA
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGTCCAAGTCCAT
CCCCTACGGCCGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTTCCTGAACCAGCTGCCCGACTGGTCCATGCTGCTGTCCGCCATCACCACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGAACATCGTGCGCGACGGCCTGGTGTCCCGCCAGT
CCTTCCTGATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGAT
GAACCACCTGCAGGAGACCTCCATCAACCACTGCAAGTCCCTGGGCCTGCTGAACGACGG
CTTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCA
GATCATGGTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTC
CCAGTCCGGCAAGATCGGCATGGGCTCCGACTGGCTGATCTCCGACTGCAACACCGGCGA
GATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCC
CGCCTGCCCTACGAGGTGCGCCAGGAGCTGACCCCCCACTTCGTGGACTCCCCCCACGTG
ATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAA
GTACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTG
CTCCCTGACCCTGGAGTACCGCCAGGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGAC
CGCCGTGGACCCCTCCGGCAAGGGCTTCGGCTCCCAGTTCCAGCACCTGCTGCGCCTGGA
GGACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGACCGCCGGCATCA
ACGGCGCCATCGCCTCCGGCGAGACCTCCCCCGGCGACTTCTGA
SEQ ID NO: 100 Cuphea procumbens (Cproc) FATB3 amino acid sequence MVAAAAS SAFFPAPAPGS SPKPGKSGNWPS SL SP SFK SKSIPYGRFQVKANASAHPKANGSAV
NLKSGSLNTQEDTS S SPPPRAFLNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKNIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQET SINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPAWGDTVEINTWF SQ SGKIGMGSDWLI SDCNTGEILIRAT S VW
AMMNQKTRRF SRLPYEVRQELTPHFVD SPHVIEDNDRKLHKFDVKTGD SIRKGLTPRWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRESVLE SLTAVDPSGEGGYGSQFQ
HLLRLEDGGEIVKGRTEWRPKNAGINGVLPTGE*
SEQ ID NO: 101 Cuphea procumbens (Cproc) FATB3 coding DNA sequence ATGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGCTCCAGCCCCGGGATCCTCACCTA
AACCCGGGAAGTCCGGTAATTGGCCATCGAGCTTGAGCCCTTCCTTCAAGTCCAAGTCAAT
CCCCTATGGCCGATTTCAGGTTAAGGCAAATGCCAGTGCCCATCCTAAGGCTAACGGTTCT
GCAGTAAATCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCTCCTC
CTCGGGCTTTCCTTAACCAGTTGCCTGATTGGAGTATGCTTCTGTCTGCAATCACGACTGT
ATTCGTGGCGGCAGAGAAGCAGTGGACTATGCTTGATCGGAAATCTAAGAGGCCTGACAT
GCTTGTGGACTCGGTTGGGTTGAAGAATATTGTTCGGGATGGGCTCGTGTCCAGACAGAG
TTTTTTGATTAGATCTTATGAAATAGGCGCTGATCGAACAGCTTCTATAGAGACACTGATG
AACCACTTGCAGGAAACATCTATCAATCATTGTAAGAGTTTGGGTCTTCTCAATGACGGCT
TTGGTCGTACTCCTGGGATGTGTAAAAACGACCTCATTTGGGTGCTTACTAAAATGCAGAT
CATGGTGAATCGCTACCCAGCTTGGGGCGATACTGTTGAGATCAATACCTGGTTCTCCCAG
TCGGGGAAAATCGGTATGGGTAGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATT
CTTATAAGAGCAACGAGCGTGTGGGCCATGATGAATCAAAAAACGAGAAGATTCTCAAG
ACTTCCATACGAGGTTCGCCAGGAGTTAACGCCTCATTTTGTGGACTCTCCTCATGTCATT
GAAGACAATGATCGGAAATTGCATAAGTTCGATGTGAAGACTGGTGATTCTATTCGCAAG
GGTCTAACTCCGAGGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTAC
ATCGGGTGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGTTCCC
TTACCCTGGAATACAGGCGGGAATGTGGAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTG
TGGACCCCTCTGGAGAGGGGGGCTATGGATCCCAGTTTCAGCACCTTCTGCGGCTTGAGG
ATGGAGGTGAGATCGTGAAGGGGAGAACTGAGTGGCGACCCAAGAATGCTGGAATCAAT
GGGGTGTTACCAACCGGGGAGTAG
SEQ ID NO: 102 Cuphea procumbens (Cproc) FATB3 coding DNA sequence codon optimized for Prototheca moriformis ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGCCCCCGCCCCCGGCTCCTCCCCCA
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGTCCAAGTCCAT
CCCCTACGGCCGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTTCCTGAACCAGCTGCCCGACTGGTCCATGCTGCTGTCCGCCATCACCACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGAACATCGTGCGCGACGGCCTGGTGTCCCGCCAGT
CCTTCCTGATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGAT
GAACCACCTGCAGGAGACCTCCATCAACCACTGCAAGTCCCTGGGCCTGCTGAACGACGG
CTTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCA
GATCATGGTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTC
CCAGTCCGGCAAGATCGGCATGGGCTCCGACTGGCTGATCTCCGACTGCAACACCGGCGA
GATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCC
CGCCTGCCCTACGAGGTGCGCCAGGAGCTGACCCCCCACTTCGTGGACTCCCCCCACGTG
ATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAA
GTACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTG
CTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGAC
CGCCGTGGACCCCTCCGGCGAGGGCGGCTACGGCTCCCAGTTCCAGCACCTGCTGCGCCT
GGAGGACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
TCAACGGCGTGCTGCCCACCGGCGAGTGA

SEQ ID NO: 103 Cuphea ignea (Cignea) FATB1 amino acid sequence PGT SRKTGKFGNWPS SL SP SFKPKSIPNGGF QVKANARAHPKANGSAVSLKSVSLNTQEDT SLS
PPPRAFLNQLPDWRMLRTALTTVFVAAEKQWTMLDRKSKRPDMLVDSFGLESIVQEGLVFRQ
SF SIRSYEIGIDRTASIETLMNHLQETSLNQCKSAGILHDGFGRTLEMCKRDLIWVVTKMQIKV
NRYPAWGDTVEISTRF SRLGKIGMGRDWLICDCNTGEILIRAT SAYAMMNQKTRRL SKLPNEV
RQEIAPLFVD SDPVIEENDMKLHKFEVKTGD SICKGLTPRWSDLDVNQHVSNVKYIGWILE S M
PTEVLETQELC SLALEYRRECGRD SVLE SVTSMDPSKVGGWSQYQHLLRLEDGADIVKGRTE
WRPKNAGANGAISTGKT
SEQ ID NO: 104 Cuphea ignea (Cignea) FATB1 coding DNA sequence CCGGGAACCTCACGTAAAACCGGGAAGTTCGGCAATTGGCCATCAAGCTTGAGCCCTTCC
TTCAAGCCCAAGTCAATCCCCAATGGCGGATTTCAGGTTAAGGCTAATGCCAGAGCCCAT
CCTAAGGCTAACGGTTCTGCAGTAAGTCTAAAGTCTGTCAGCCTCAACACTCAGGAGGAC
ACTTCGTTGTCCCCTCCTCCTCGTGCTTTCCTTAACCAGTTGCCTGATTGGAGGATGCTTCG
GACTGCACTCACGACCGTCTTTGTGGCGGCAGAGAAGCAGTGGACTATGCTTGATCGGAA
ATCTAAGAGGCCTGACATGCTCGTGGACTCGTTTGGGTTGGAGAGTATTGTTCAAGAAGG
GCTCGTGTTCAGACAGAGCTTTTCGATTAGGTCTTATGAAATAGGCATTGATCGAACAGCC
TCTATAGAGACGCTGATGAACCACTTGCAGGAAACATCTCTCAATCAATGTAAGAGTGCT
GGTATTCTCCATGACGGCTTCGGTCGTACTCTTGAGATGTGTAAAAGGGACCTCATTTGGG
TTGTTACGAAAATGCAGATCAAGGTGAATCGCTATCCAGCTTGGGGCGATACTGTCGAGA
TCAGTACCCGGTTCTCCCGGTTGGGGAAAATCGGTATGGGTCGCGATTGGCTAATATGTGA
TTGCAACACAGGAGAAATTCTTATAAGAGCAACGAGCGCGTATGCCATGATGAATCAAAA
GACGAGAAGACTCTCAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCGCCTCTTTTTGT
GGACTCTGATCCTGTCATTGAAGAAAATGATATGAAATTGCATAAGTTTGAAGTGAAGAC
TGGTGATTCCATTTGCAAGGGTCTAACTCCGAGGTGGAGTGACTTGGATGTCAATCAGCAC
GTAAGCAACGTGAAGTACATAGGGTGGATTCTCGAGAGTATGCCAACAGAAGTTTTGGAG
ACACAGGAGCTATGCTCTCTCGCCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTG
CTGGAGTCTGTGACCTCTATGGATCCCTCAAAAGTTGGAGGCTGGTCTCAGTACCAGCACC
TTCTGCGACTTGAGGATGGGGCGGATATCGTGAAGGGCAGAACTGAGTGGCGGCCGAAG
AATGCAGGAGCTAACGGGGCGATATCAACAGGAAAGACTTGA
SEQ ID NO: 105 Cuphea ignea (Cignea) FATB1 coding DNA sequence codon optimized for Prototheca moriformis CCCGGCACCTCCCGCAAGACCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCT
TCAAGCCCAAGTCCATCCCCAACGGCGGCTTCCAGGTGAAGGCCAACGCCCGCGCCCACC
CCAAGGCCAACGGCTCCGCCGTGTCCCTGAAGTCCGTGTCCCTGAACACCCAGGAGGACA
CCTCCCTGTCCCCCCCCCCCCGCGCCTTCCTGAACCAGCTGCCCGACTGGCGCATGCTGCG
CACCGCCCTGACCACCGTGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAA
GTCCAAGCGCCCCGACATGCTGGTGGACTCCTTCGGCCTGGAGTCCATCGTGCAGGAGGG
CCTGGTGTTCCGCCAGTCCTTCTCCATCCGCTCCTACGAGATCGGCATCGACCGCACCGCC
TCCATCGAGACCCTGATGAACCACCTGCAGGAGACCTCCCTGAACCAGTGCAAGTCCGCC
GGCATCCTGCACGACGGCTTCGGCCGCACCCTGGAGATGTGCAAGCGCGACCTGATCTGG
GTGGTGACCAAGATGCAGATCAAGGTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAG
ATCTCCACCCGCTTCTCCCGCCTGGGCAAGATCGGCATGGGCCGCGACTGGCTGATCTGCG
ACTGCAACACCGGCGAGATCCTGATCCGCGCCACCTCCGCCTACGCCATGATGAACCAGA
AGACCCGCCGCCTGTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCTGTTCG
TGGACTCCGACCCCGTGATCGAGGAGAACGACATGAAGCTGCACAAGTTCGAGGTGAAG
ACCGGCGACTCCATCTGCAAGGGCCTGACCCCCCGCTGGTCCGACCTGGACGTGAACCAG
CACGTGTCCAACGTGAAGTACATCGGCTGGATCCTGGAGTCCATGCCCACCGAGGTGCTG
GAGACCCAGGAGCTGTGCTCCCTGGCCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCC
GTGCTGGAGTCCGTGACCTCCATGGACCCCTCCAAGGTGGGCGGCTGGTCCCAGTACCAG
CACCTGCTGCGCCTGGAGGACGGCGCCGACATCGTGAAGGGCCGCACCGAGTGGCGCCCC
AAGAACGCCGGCGCCAACGGCGCCATCTCCACCGGCAAGACCTGA

SEQ ID NO: 106 JcFatB1 consensus amino acid sequence MVAAAAS SAFFPVPAPGTSPKPGKSGNWPS S LSP SF KPKSIPNGGFQVKANASAHPKANGSAV
NLKSGSLNTQEDTS S S PPPRAFLNQLPDW SMLLTAIT TVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKRIVQDGLVS RQ SF SIRSYEIGADRTASIETLMNHLQET SINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPTWGDTVEINTWF SQSGKIGMGSDWLISDCNTGEILIRATSVW
AMMNQKTRRF SRLPYEVRQELTPHFVD SPHVIEDNDRKLHKFDVKTGD SIRKGLTPRWNDLD
VNQHVSNVKYIGWILESMPIEVLETQELC SLTVEYRRECGMDSVLESVTAMDPSENGGRSQYK
HLLRLEDGTDIVKGRTEWRPKNAGTNGAISTGKTSNGNSVS*
SEQ ID NO: 107 JcFatB1 consensus DNA sequence codon optimized for Prototheca ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCTCCCCCA
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGCCCAAGTCCAT
CCCCAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTTCCTGAACCAGCTGCCCGACTGGTCCATGCTGCTGACCGCCATCACCACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGCGCATCGTGCAGGACGGCCTGGTGTCCCGCCAGT
CCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGAT
GAACCACCTGCAGGAGACCTCCATCAACCACTGCAAGTCCCTGGGCCTGCTGAACGACGG
CTTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCA
GATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTC
CCAGTCCGGCAAGATCGGCATGGGCTCCGACTGGCTGATCTCCGACTGCAACACCGGCGA
GATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCC
CGCCTGCCCTACGAGGTGCGCCAGGAGCTGACCCCCCACTTCGTGGACTCCCCCCACGTG
ATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCATGCCCATCGAGGTGCTGGAGACCCAGGAGCTGTGC
TCCCTGACCGTGGAGTACCGCCGCGAGTGCGGCATGGACTCCGTGCTGGAGTCCGTGACC
GCCATGGACCCCTCCGAGAACGGCGGCCGCTCCCAGTACAAGCACCTGCTGCGCCTGGAG
GACGGCACCGACATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCACCAA
CGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA
SEQ ID NO: 108 JcFatB2 consensus amino acid sequence MVATAAS SAFFPVPSPDTS SRPGKLGNGS S SLSPLKPKSVANGGLQVKANASAPPKINGSSVGL
KSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDPF
GLGRIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFGRTPEMY
KRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLI SDCNTGEILTRAS SV
WVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTPKWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRESVLESLTAVDPSGKGYGSQFQ
HLLRLEDGGEIVKGRTEWRPKTAGINGAIASGETSPGDS S*
SEQ ID NO: 109 JcFatB2 consensus DNA sequence codon optimized for Prototheca ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCCCCGACACCTCCTCCCG
CCCCGGCAAGCTGGGCAACGGCTCCTCCTCCCTGTCCCCCCTGAAGCCCAAGTCCGTGGCC
AACGGCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCCTCC
GTGGGCCTGAAGTCCGGCTCCCTGAAGACCCAGGAGGACACCCCCTCCGCCCCCCCCCCC
CGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGTGT
TCCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACATGC
TGGTGGACCCCTTCGGCCTGGGCCGCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAACTT
CTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATGAA
CCACCTGCAGGAGACCGCCCTGAACCACGTGAAGTCCGCCGGCCTGCTGAACGACGGCTT
CGGCCGCACCCCCGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAGGT
GATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCCAA
GTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGAT
CCTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCCGCCTGTCCAA
GATCCCCGACGAGGTGCGCCACGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTGAT
CGAGGACGACGACCGCAAGCTGCCCAAGCTGGACGAGAAGACCGCCGACTCCATCCGCA
AGGGCCTGACCCCCAAGTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGC
TCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACC
GCCGTGGACCCCTCCGGCAAGGGCTACGGCTCCCAGTTCCAGCACCTGCTGCGCCTGGAG
GACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGACCGCCGGCATCAA
CGGCGCCATCGCCTCCGGCGAGACCTCCCCCGGCGACTCCTCCTGA
SEQ ID NO: 110 CuPSR23 FATB3 amino acid sequence MVVAAAT SAFFPVPAPGT SPKPGKSGNWP S SL SP TFKPKSIPNAGF QVKANASAHPKA
NGSAVNLKSGSLNTQEDT S S SPPPRAFLNQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRP
DMLVD SVGLKCIVRDGLVSRQ SF LIRSYEIGADRTAS IETLMNHLQET SINHCKSLGLLNDGFG
RTPGMCKNDLIWVLTKMQIMVNRYPTWGDTVEINTWF SQSGKIGMASDWLISDCNTGEILIR
AT SVWAMMNQKTRRF SRLPYEVRQELTPHFVDSPHVIEDNDQKLHKFDVKTGD SIRKGLTPR
WNDLDVNQHVSNVKYIGWILESMPIEVLETQELC SLTVEYRRECGMD SVLE SVTAVDP SENG
GRSQYKHLLRLEDGTDIVKS RTEWRPKNAGTNGAI ST STAKT SNGNSVS
SEQ ID NO: 111 CuPSR23 FATB3b amino acid sequence MVVAAAT SAFFPVPAPGT SPKPGKSGNWP S SL SP TFKPKSIPNAGF QVKANASAHPKA
NGSAVNLKSGSLNTQEDT S S SPPPRAFLNQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRP
DMLVD SVGLKSIVRDGLVSRQ SF LIRSYEIGADRTASIETLMNHLQET SINHCKSLGLLNDGFG
RTPGMCKNDLIWVLTKMQIMVNRYPTWGDTVEINTWF SQSGKIGMASDWLISDCNTGEILIR
AT SVWAMMNQKTRRF SRLPYEVRQELTPHFVDSPHVIEDNDQKLHKFDVKTGD SIRKGLTPR
WNDLDVNQHVSNVKYIGWILESMPIEVLETQELC SLTVEYRRECGMD SVLE SVTAVDP SENG
GRSQYKHLLRLEDGTDIVK SRTEWRPKNAGTNGAI ST STAKT SNGN SAS
SEQ ID NO: 112 CwFATB3 amino acid sequence:
MVVAAAAS SAFFPVPAPRTTPKPGKFGNWP S SL SPPFKPKSNPNGRFQVKANVSPHPK
ANGSAVSLKSGSLNTLEDPP S SPPPRTFLNQLPDW SRLRTAITTVFVAAEKQFTRLDRKSKRPD
MLVDWFGSETIVQDGLVFRERF SIRSYEIGADRTASIETLMNHLQDT SLNHCKSVGLLNDGFG
RT SEMCTRDLIWVLTKMQIVVNRYPTWGDTVEINSWF SQSGKIGMGRDWLISDCNTGEILVR
AT SAWAMMNQKTRRF SKLPCEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGD SICKGLTPG
WNDLDVNQHVSNVKYIGWILESMPTEVLETQELC SLTLEYRRECGRESVVESVT SMNP SKVG
DRSQYQHLLRLEDGADIMKGRTEWRPKNAGTNRAIST
SEQ ID NO: 113 CwFATB3a amino acid sequence:
MVVAAAAS SAFFPVPAPRTTPKPGKFGNWP S SL SPPFKPKSNPNGRFQVKANVSPHPK
ANGSAVSLKSGSLNTLEDPP S SPPPRTFLNQLPDW SRLRTAITTVFVAAEKQFTRLDRKSKRPD
MLVDWFGSETIVQDGLVFRERF SIRSYEIGADRTASIETLMNHLQDT SLNHCKSVGLLNDGFG
RT SEMCTRDLIWVLTKMQIVVNRYPTWGDTVEINSWF SQSGKIGMGRDWLISDCNTGEILVR
AT SAWAMMNQKTRRF SKLPCEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGD SICKGLTPG
WNDLDVNQHVSNVKYIGWILESMPTEVLETQELC SLTLEYRRECGRESVVESVT SMNP SKVG
DRSQYQHLLRLEDGADIMKGRTEWRPKNAGTNRAIST

SEQ ID NO: 114 CwFATB3b amino acid sequence MVVAAAAS SAFFPVPAPRTTPKPGKFGNWP S SL SPPFKPK SNPNGRFQVKANVSPHPK
ANGSAVSLK SG SLNTLEDLP S SPPPRTFLNQLPDWSRLRTAITTVFVAAEKQFTRLDRKSKRPD
MLVDWFGSETIVQDGLVFRERF SIRSYEIGADRTASIETLMNHLQDT SLNHCKSVGLLNDGFG
RT SEMCTRDLIWVLTKMQIVVNRYPTWGDTVEINSWF SQ SGKIGMGRDWLISDCNTGEILVR
AT SAWAMMNQKTRRF SKLPCEVRQEIAPHFVDAPPVIEDNDRKLHKF DVKTGD SICKGLT PG
WNDLDVNQHVSNVKYIGWILEKFWRPRSYALSPLNIGGNVEGKVW
SEQ ID NO: 115 CwFATB3c amino acid sequence MVVAAAAS SAFFPVPAPRTTPKPGKFGNWP S SL SPPFKPK SNPNGRFQVKANVSPHPK
ANGSAVSLKSGSLNTLEDLP S SPPPRTFLNQLPDWSRLRTAITTVFVATEKQFTRLDRK SKRPD
MLVDWFGSETIVQDGLVFRERF SIRSYEIGADRTASIETLMNHLQDT SLNHCKSVGLLNDGFG
RT SEMCTRDLIWVLTKMQIVVNRYPTWGDTVEINSWF SQ SGKIGMGRDWLISDCNTGEILVR
AT SAWAMMNQKTRRF SKLPCEVRQEIAPHFVDAPPVIEDNDRKLHKF DVKTGD SICKGLT PG
WNDLDVNQHVSNVKYIGWILEKFWRPRSYALSPLNIGGNVEGKVW
SEQ ID NO: 116 CwFATB4a amino acid sequence MVATAAS SAF FPVP SADT S S SRPGKLGS GP S SL SPLKPKSIPNGGLQVKANASAPPKIN
GS SVGLK SGGFKTQED SP SAPPPRTFINQLPDW SMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGSIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKIAGL SNDGFGR
TPEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILT
RAS SVWVMMNQKTRRL SKIPDEVRNEIEPHFVDSAPVVEDDDRKLPKLDENTADSIRKGLTPR
WNDLDVNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRESVLE SLTAVDP SAEGY
A SRF QHLLRLEDGGEIVKARTEWRPKNAGINGVVP SEE S SPGDFF
SEQ ID NO: 117 CwFATB4a.1 amino acid sequence MVATAAS SAFF PVP SADT S S SRPGKLGSGP S SLSPLKPKSIPNGGLQVKANASAPPKIN
GS SVGLK SGGFKTQED SP SAPPPRTFINQLPDW SMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGSIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKIAGL SNDGFGR
TPEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILT
RAS SVWVMMNQKTRRL SKIPDEVRNEIEPHFVDSAPVVEDDDRKLPKLDENTADSIRKGLTPR
WNDLDVNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRESVLE SLTAVDP SAEGY
A SRF QHLLRLEDGGEIVKARTEWRPKNAGINWVVP SEES SPGDFF
SEQ ID NO: 118 CwFATB4a.2 amino acid sequence:
MVATAAS SAFF PVP SADT S S SRPGKLGNGP S SLSPLKPK SIPNGGLQVKANASAPPKIN
GS SVGLKSGSFKTQEDAP SAPP PRTF INQ LPDW SMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGSIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKIAGL SNDGFGR
TPEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILT
RAS SVWVMMNQKTRRL SKIPDEVRNEIEPHFVDSAPVVEDDDRKLPKLDENTADSIRKGLTPR
WNDLDVNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRESVLE SLTAVDP SAEGY
A SRF QHLLRLEDGGEIVKARTEWRPKNAGINGVVP SEE S SPGDFF
SEQ ID NO: 119 CwFATB4a.3 amino acid sequence MVATAAS SAF FPVP SADT S S SRPGKLGS GP S SL SPLKPKSIPNGGLQVKANASAPPKIN
GS SVGLK SGGFKTQED SP SAPPPRTFINQLPDW SMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGSIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKIAGL SNDGFGR
TPEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILT
RAS SVWVMMNQKTRRL SKIPDEVRNEIEPHFVDSAPVVEDDDRKLPKLDENTADSIRKGLTPR
WNDLDVNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRESVLE SLTAVDP SAEGY
VSRFQHLLRLEDGGEIVKARTEWRPKNAGINGVVP SEE S SPGDFF
SEQ ID NO: 120 CwFATB4b amino acid sequence MVATAAS SAFF PVP SADT S S SRPGKLGNGP S SLSPLKPK SIPNGGLQVKANA SAPP KIN
GS SVGLKSGSFKTQEDAP SAPP PRTF INQ LPDW SMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGSIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKIAGL S SDGFGR
T PAM SKRDLIWVVAKMQVMVNRYPAWGDTVEVNTWVAK SGKNGMRRDWLI SDCNTGEILT
RAS SVWVMMNQKTRRL SKIPDEVRNEIEPHFVDSAPVVEDDDRKLPKLDENTADSIRKGLTPR
WNDLDVNQHVNNVKYIGWILESTPAEVLETQELC SLTLEYRRECGRESVLESLTAVDP SGEGD
GSKFQHLLRLEDGGEIVKARTEWRPKNAGINGVVP SEES SPGGDFF
SEQ ID NO: 121 CwFATB4b.1 amino acid sequence MVATAAS SAF FPVP SADT S S SRPGKLGS GP S SL SPLKPKSIPNGGLQVKANASAPPKIN
GS SVGLKSGSFKTQEDAP SAPP PRTF INQ LPDW SMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGSIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKIAGL S SDGFGR
T PAM SKRDLIWVVAKMQVMVNRYPAWGDTVEVNTWVAK SGKNGMRRDWLI SDCNTGEILT
RAS SVWVMMNQKTRRL SKIPDEVRNEIEPHFVDSAPVVEDDDRKLPKLDENTADSIRKGLTPR
WNDLDVNQHVNNVKYIGWILESTPAEVLETQELC SLTLEYRRECGRESVLESLTAVDP SGEGD
GSKFQHLLRLEDGGEIVKARTEWRPKNAGINGVVP SEES SPGGDFF
SEQ ID NO: 122 CwFATB5 amino acid sequence MVAAAAS SAFF SVPTPGTPPKPGKFGNWP S SLSVPFKPDNGGFHVKANASAHPKANG
SAVNLKSGSLETPPRSF INQLPDL SVLL SKITTVFGAAEKQWKRPGMLVEPFGVDRIFQDGVFF
RQ SF SIRSYEIGVDRTASIETLMNIFQET SLNHCKSIGLLNDGFGRTPEMCKRDLIWVVTKIQVE
VNRYPTWGDTIEVNTWVSE SGKNGMGRDWLISDCRTGEILIRAT SVWAMMNQNTRRLSKFP
YEVRQEIAPHFVDSAPVIEDDQKLQKLDVKTGDSIRDGLTPRWNDLDVNQHVNNVKYIGWIL
KSVPIEVFETQELCGVTLEYRRECGRDSVLE SVTAMDPAKEGDRCVYQHLLRLEDGADITIGR
TEWRPKNAGANGAMS SGKT SNGNCLIEGRGWQPFRVVRLIF
SEQ ID NO: 123 CwFATB5a amino acid sequence MVAAAAS SAFF SVPTPGTPPKPGKFGNWP S SLSVPFKPDNGGFHVKANASAHPKANG
SAVNLKSGSLETPPRSFINQLPDL SVLL SKITTVFGAAEKQWKRPGMLVEPFGVDRIFQDGFFFR
Q SF SIRSYEIGVDRTASIETLMNIFQET SLNHCKSIGLLNDGFGRTPEMCKRDLIWVVTKIQVEV
NRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRAT SVWAMMNQNTRRLSKFPYE
VRQEIAPHFVDSAPVIEDDQKLQKLDVKTGD SIRDGLTPRWNDLDVNQHVNNVKYIGWILKS
VPIEVFETQELCGVTLEYRRECGRDSVLESVTAMDPAKEGDRCVYQHLLRLEDGADITIGRTE
WRPKNAGANGAMS SGKT SNGNCLIEGRGWQPFRVVRLIF
SEQ ID NO: 124 CwFATB5b amino acid sequence MVAAAAS SAFF SVPTPGTPPKPGKFGNWP S SLSVPFKPDNGGFHVKANASAHPKANG
SAVNLKSGSLETPPRSF INQLPDL SVLL SKITTVFGAAEKQWKRPGMLVEPFGVDRIFQDGVFF
RQ SF SIRSYEIGVDRTASIETLMNIFQET SLNHCKSIGLLNDGFGRTPEMCKRDLIWVVTKIQVE
VNRYPTWGDTIEVNTWVSE SGKNGMGRDWLISDCRTGEILIRAT SVWAMMNQNTRRLSKFP
YEVRQEIAPHFVDSAPVIEDDQKLQKLDVKTGDSIRDGLTPRWNDLDVNQHVNNVKYIGWIL
KSVPIEVFETQELCGVTLEYRRECGRDSVLE SVTAMDPAKEGDRCVYQHLLWLEDGADITIGR
TEWRPKNAGANGAMS SGKT SNGNCLIEGRGWQPFRVVRLIF
SEQ ID NO: 125 CwFATB5c amino acid sequence MVAAAAS SAFF SVPTPGTPPKPGKFGNWP S SLSVPFKPDNGGFHVKANASAHPKANG
SAVNLKSGSLETPPRSF INQLPDL SVLL SKITTVFGAAEKQWKRPGMLVEPFGVDRIFQDGVFF
RQ SF SIRSYEIGVDRTASIETLMNIFQET SLNHCKSIGLLNDGFGRTPEMCKRDLIWVVTKIQVE
VNRYPIWGDTIEVNTWVSE SGKNGMGRDWLISDCRTGEILIRAT SVWAMMNQNTRRL SKFPY
EVRQEIAPHFVDSAPVIEDDQKLQKLDVKTGDSIRDGLTPRWNDLDVNQHVNNVKYIGWILK
SVPIEVFETQELCGVTLEYRRECGRDSVLESVTAMDPAKEGDRCVYQHLLRLEDGADITIGRTE
WRPKNAGANGAMS SGKT SNGNCLIEGMGWQPFRVVRLIF
SEQ ID NO: 126 CwFATB5.1 amino acid sequence MVAAAAS SAFF SVPTPGT SPKPGKFRNWP S SLSVPFKPETNHNGGFHIKANASAHPKA
NGSALNLK SGSLETQ EDT SL S SPPRTFIKQLPDWSMLLSKITTVFGAAEKQLKRPGMLVEPFGV
DRIFQDGVFF RQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFGRTPEMCKRDLI
WVVTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRAT SVWAMMNQ
NT RRL SKFPYEVRQEIAPHFVDSAPVIEDDRKLYKLNVKTGDSIRDGLTPRWNDLDVNQHVNN
VKFIGWILKSVPTKVFETQELCGVTLEYRRECGKDSVLE SVTAMDPAKEGDRSVYQHLLRLED
GADITIGRTEWRPKNAGANEAIS SGKT SNGNSAS
SEQ ID NO: 127 CwFATB5.1a amino acid sequence MVAAAAS SAFF SVPTPGT SPKPGKFRNWPLSL SVPFKPETNHNGGFHIKANASAHPKA
NGSALNLK SGSLETQ EDT SL S SPPRTFIKQLPDWSMLLSKITTVFGAAEKQLKRPGMLVEPFGV
DRIFQDGVFF RQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFGRTPEMCKRDLI
WVVTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRAT SVWAMMNQ
NT RRL SKFPYEVRQEIAPHFVDSAPVIEDDRKLYKLNVKTGDSIRDGLTPRWNDLDVNQHVNN
VKFIGWILKSVPTKVFETQELCGVTLEYRRECGKDSVLE SVTAMDPAKEGDRSVYQHLLRLED
GADITIGRTEWRPKNAGANEAIS SGKT SNGNSAS
SEQ ID NO: 128 CcFATB2b amino acid sequence MVTT S LA SAYF SMKAVMLAPDGRGIKPRS SGLQVRAGNERNSCKVINGTKVKDTEG
LKGCS TLQGQ SMLDDHFGLHGLVF RRTFAIRCYEVGPDRST SIMAVMNHLQEAARNHAESLG
LLGDGF GETLEMSKRDLIWVVRRTHVAVERYPAWGDTVEVEAWVGA SGNTGMRRDF LVRD
CKTGHILTRCT SVSVMMNMRTRRLSKIPQEVRAEIDPLFIEKVAVKEGEIKKLQKLNDSTADYI
QGGWTPRWNDLDVNQHVNNIIYVGWIFK S VPD SI SENHHL S SIT LEYRRECIRGNKLQ SLTTVC
GGS S EAGIICEHLLQLEDGSEVLRARTE WRPKHT D SF QGI SERF PQQEPHK
SEQ ID NO: 129 CcFATB3 amino acid sequence MVATAAASAF FPVGAPAT S SAT SAKASMMPDNLDARGIKPKPAS S SGLQVKANAHA
SPKINGSKVS T DTLKGEDT LT S SPAPRTF INQLPDW SMFLAAITTIFLAAEKQWTNLDWKPRRP
DMLADPFGIGRFMQDGLIFRQHFAIRSYEIGADRTASIETLMNHLQETALNHVRSAGLLGDGF
GAT PEM S RRDLIWVVTRMQVLVDRYPAWGDIVEVETWVGA SGKNGMRRDWLVRDSQTGEI
LT RAT SVWVMMNKRTRRL SKLPEEVRGEIGPYF IEDVAIIEEDNRKLQKLNENTADNVRRGLT
PRWSDLDVNQHVNNVKYIGWILESAPGSILE SHEL SCMTLEYRRECGKDSVLQ SMTAVSGGG
SAAGG SPE S SVECDHLLQLE SGPEVVRGRTEWRPKSANNSRSILEMPAESL
SEQ ID NO: 130 CcFATB3b amino acid sequence MVATAAASAF FPVGAPAT S SAT SAKASMMPDNLDARGIKPKLAS S SGLQVKANAHA
SPKINGSKVS T DTLKGEDT LT S SPAPRTF INQLPDW SMFLAAITTIFLAAEKQWTNLDWKPRRP
DMLADPFGIGRFMQDGLIFRQHFAIRSYEIGADRTASIETLMNHLQETALNHVRSAGLLGDGF
GAT PEM S RRDLIWVVTRMQVLVDRYPAWGDIVEVETWVGA SGKNGMRRDWLVRDSQTGEI
LT RAT SVWVMMNKRTRRL SKLPEEVRGEIGPYF IEDVAIIEEDNRKLQKLNENTADNVRRGLT
PRWSDLDVNQHVNNVKYIGWILESAPGSILE SHEL SCMTLEYRRECGKDSVLQ SMTAVSGGG
SAAGG SPE S SVECDHLLQLE SGPEVVRGRTEWRPKSANNSRSILEMPAESL

SEQ ID NO: 131 CcFATB3c amino acid sequence MVATAAASAF FPVGAPAT S SAT SAKASMMPDNLDARGIKPKPAS S SGLQVKANAHA
SPKINGSKVST DTLKGEDT LT S SPAPRTF INQLPDW SMFLAAITTIFLAAEKQWTNLDWKPRRP
DMLADPFGIGRFMQDGLIFRQHFAIRSYEIGADRTASIETLMNHLQETALNHVRSAGLLGDGF
GAT PEM S RRDLIWVVTRMQVLVDRYPAWGDIVEVETWVGA SGKNGMRRDWLVRDSQTGEI
LT RAT SVWVMMNKRTRRL SKLPEEVRGEIGPYFIEDVAIIEEDNRKLQKLNENTADNVRRGLT
PRWSDLDVNQHVNNAKYIGWILESAPGSILE SHEL SCMTLEYRRECGKDSVLQ SMTAVSGGG
SAAGG SPE S SVECDHLLQLE SGPEVVRGRTEWRPKSANNSRSILEMPAESL
SEQ ID NO: 132 ChtFATB1a amino acid sequence MVAAAAS SAFF SVPTPGT S TKPGNFGNWP S SLSVPFKPESNHNGGFRVKANASAHPK
ANGSAVNLK SGS LET QEDT S S S SPPPRTF IKQLPDWGMLL SKITTVFGAAERQWKRPGMLVEP
FGVDRIFQDGVFFRQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFGRTPEMCK
RDLIWVVTKIQVEVNRYPTWGDTIEVNTWV SE SGKNGMGRDWLI S DCRTGEILIRAT SVWAM
MNRKTRRLSKF PYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDSIRKGLTPRWNDLDVNQ
HVNNVKYIGWILKSVPAEVFETQELCGVTLEYRRECGRD SVLE SVTAMDTAKEGDRSLYQHL
LRLEDGADITIGRTEWRPKNAGANGAIS TGKT SNENSVS
SEQ ID NO: 133 ChtFATB1a.1 amino acid sequence MVAAAAS SAFF SVPTPGT SPKPGNFGNWP S SLSVPFKPESNHNGGFRVKANASAHPK
ANGSAVNLK SGS LET QEDT S S S SPPPRTF IKQLPDWGMLL SKITTVFGAAERQWKRPGMLVEP
F GVDRIF QDGVF F RH SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFGRTPEMCK
RDLIWVVTKIQVEVNRYPTWGDTIEVNTWV SE SGKNGMGRDWLIGDCRT GEILIRAT SVWAM
MNRKTRRLSKF PYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDSIRKGLTPRWNDLDVNQ
HVNNVKYIGWILKSVPAEVFETQELCGVTLEYRRECGRD SVLE SVTAMDTAKEGDRSLYQHL
LRLEDGADITIGRTEWRPKNAGANGAL STGKT SNGNSVS
SEQ ID NO: 134 ChtFATB1a.2 amino acid sequence MVAAAAS SAFF SVPTPGT SPKPGNFGNWP SNL SVPFKPESNHNGGFRVKANASAHPK
ANGSAVNLK SGS LET QEDT S S S SPPPRTF IKQLPDWGMLL SKITTVFGAAERQWKRPGMLVEP
FGVDRIFQDGVFFRQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFGRTPEMCK
RDLIWVVTKIQVEVNRYPTWGDTIEVNTWV SE SGKNGMGRDWLI S DCRTGEILIRAT SVWAM
MNRKTRRLSKF PYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDSIRKGLTPRWNDFDVNQ
HVNNVKYIGWILKSVPAEVFETQELCGVTLEYRRECGRD SVLE SVTAMDTAKEGDRSLYQHL
LRLEDGADITIGRTEWRPKNAGANGAIS TGKT SNENSVS
SEQ ID NO: 135 ChtFATB1a.3 amino acid sequence MVAAAAS SAFF SVPTPGT SPKPGNFGNWP S SLSVPFKPESNHNGGFRVKANASAHPK
ANGSAVNLK SGS LET QEDT S S S SPPPRTF IKQLPDWGMLL SKITTVFGAAERQWKRPGMLVEP
FGVDRIFQDGVFFRQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFGRTPEMCK
RDLIWVVTKIQVEVNRYPTWGDTIEVNTWV SE SGKNGMGRDWLI S DCRTGEILIRAT SVWAM
MNRKTRRLSKF PYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDSIRKGLTPRWNDFDVNQ
HVNNVKYIGWILKSVPAEVFETQELCGVTLEYRRECGRD SVLE SVTAMDTAKEGDRSLYQHL
LRLEDGADITIGRTEWRPKNAGVNGAIS TGKT SNENSVS
SEQ ID NO: 136 ChtFATB1a.4 amino acid sequence MVAAAAS SAFF SVPTPGT SPKPGNFGNWP S SLSVPFKPESNHNGGFRVKANASAHPK
ANGSAVNLK SGS LET QEDT S S S SPPPRTF IKQLPDW SMLL SKIT TVF GAAERQWKRPGMLVEPF
GVDRIFQDGVFFRQ SF SIRSYEIGADRTA SIETLMNIF Q ET SLNHCKSIGLLNDGFGRTPEMCKR
DLIWVVTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRAT SVWAMM
NRKTRRL SKFPYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDSIRKGLTPRWNDFDVNQHV
NNVKYIGWILKSVPAEVFETQELCGVTLEYRRECGRDSVLE SVTAMDTAKEGDRSLYQHLLR
LEDGADITIGRTEWRPKNAGANGAIS TGKT SNENSVS
SEQ ID NO: 137 ChtFATB1b amino acid sequence MVAAAAS SAFF SVPT SGT SPKPGNFGNWP S SLSVPFKPES SHNGGFQVKANASAHPK
ANGSAVNLK SGS LET QEDT S S S SPPPRTF IKQLPDW SMLL SKIT TVF WAAERQWKRPGMLVEP
FGVDRIFQDGVFFRQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFGRTPEMCK
RDLIWVVTKIQVEVNRYPTWGDTIEVNTWV SE SGKNGMGRDWLI S DCRTGEILIRAT SVWAM
MNRKTRRLSKF PYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDFIRKGLTPRWNDFDVNQ
HVNNVKYIGWILKSVPAEVFETQELCGVTLEYRRECGRD SVLE SVTAMDTAKEGDRSLYQHL
LRLEDGADITIGRTEWRPKNAGANGAIS TGKT SNENSVS
SEQ ID NO: 138 ChtFATB2b amino acid sequence MVVAAAA S SAFFPVPASGT SPKPGKF GT WL S S S SP SYKPK SNP SGGF QVKANASAHP
KANGSAVSLKSGSLNTQEGT S S SPPPRTFLNQLPDW SRLRTAITTVFVAAEKQLTMLDRKSKK
PDMHVDWF GLEIIVQDGLVF RE S F SIRSYEIGADRTA S IETLMNHLQ DT SLNHCKSVGLLNDGF
GRTPEMCKRDLIWVLTKMQIMVNRYPTWGDTVEINSWF SQ SGKIGMGRNWLISDCNTGEILIR
AT SIWAMMNQKTRRF SKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKF DVKTGD SICKGLT PEW
NDLDVNQHVSNVKYIGWILE SMPKEVLDTQELC SLTLEYRRECGRDSVLE SVTAMDP SKVGD
RSQYQHLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKT SNGNSVS
SEQ ID NO: 139 ChtFATB2a amino acid sequence MVVAAAAS SAFFPVPAPGTT SKPGKFGNWP S SL SP SF KPK SNPNGGF QVKANA SAHP
KANGSAVSLKSGSLNTKEDTP S SPPPRTFLNQLPDW SRLRTAITTVFVAAEKQLTMLDRKSKK
PDMHVDWF GLEIIVQDWLVF RE S F SIRSYEIGADRTASIETLMNHLQDT SLNHCKSVGLLNDGF
GRTPEMCKRDLIWVLTKMQIMVNRYPTWGDTVEINSWF SQ SGKIGMGRNWLISDCNTGEILIR
AT SIWAMMNQKTRRF SKLPNEVRQEIAPHFVDAPPLIEDNDRKLHKFDVKTGD SICKGLTPEW
NDLDVNQHVSNVKYIGWILE SMPKEVLDTQELC SLTLEYRRECGRDSVLE SVTAMDP SKVGD
RSQYQHLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKT SNGNSVS
SEQ ID NO: 140 ChtFATB2c amino acid sequence MVVAAAA S SAFFPVPASGT SPKPGKF GT WL S S S SP SYKPK SNP SGGF QVKANASAHP
KANGSAVSLKSGSLNTKEDTP S SPPPRTFLNQLPDWNRLRTAITTVFVAAEKQLTMLDRKSKK
PDMHVDWF GLEIIVQDGLVF RE S F SIRSYEIGADRTA S IETLMNHLQ DT SLNHCKSVGLLNDGF
GRTPEMCKRDLIWVLTKMQIMVNRYPTWGDTVEINSWF SQ SGKIGMGRNWLISDCNTGEILIR
AT SIWAMMNQKTRRF SKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKF DVKTGD SICKGLT PEW
NDLDVNQHVSNVKYIGWILE SMPKEVLDTQELC SLTLEYRRECGRDSVLE SVTAMDP SKVGD
RSQYQHLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKT SNGNSVS
SEQ ID NO: 141 ChtFATB2d amino acid sequence MVVAAAAS SAFFPVPAPGTT SKPGKFGNWP S SL SP SF KPK SNPNGGF QVKANA SAHP
KANG SAVSLK SG SLNTQ EDT S S SPPPRTFLNQLPDW SRLLTAISTVFVAAEKQLTMLDRKSKRP
DMLVDLFGLESIVQDGLVF RE SY SIRSYEIGADRTA SIETLMNHLQDT SLNHCKSVGLLNDGFG
RTPEMCKRDLIWVLTKMQIMVNRYPTWGDTVEINSWF SQ SGKIGMGRNWLISDCNTGEILIRA
T SIWAMMNQNTRRF SKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKF DVKTGD SIRKGLTPGWN
DLDVNQHVSNVKYIGWILE SMPTEVLETQELC SLTLEYRRECGRESVLESVTAMNP SKVGDRS
QYQHLLRLEDGADIMKGRTEWRPKNAGTNGAISTGKT SNGNSVS

SEQ ID NO: 142 ChtFATB2e amino acid sequence MVVAAAA S SAFFPVPASGT SPKPGKF GT WL S S S SP SYKPK SNP SGGF QVKANASAHP
KANG SAVSLK SG SLNTQ EDT S S SPPPQTFLNQLPDW SRLLTAIS TVFVAAEKQLTMLDRK SKRP
DMLVDWFGLESIVQDGLVF RE SYSIRSYEISADRTASIETVMNLLQET SLNHCKSMGILNDGFG
RTPEMCKRDLIWVLTKMQILVNRYPNWGDTVEINSWF SQ SGKIGMGRNWLISDCNTGEILIRA
T SIWAMMNQNTRRF SKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKF DVKTGD SIRKGLTPGWN
DLDVNQHVSNVKYIGWILE SMPTEVLETQELC SLTLEYRRECGRD SVLE SVTAMNP SKVGDRS
QYQHLLRLEDGADIMKGRTEWRPKNAGTNGAISTGKT SNGNSVS
SEQ ID NO: 143 ChtFATB2f amino acid sequence MVVAAAA S SAFFPVPASGT SPKPGKF GT WL S S S SP SYKPK SNP SGGF QVKANASAHP
KANG SAVSLK SG SLNTQ EGT S S SPPPRTFLNQLPDW SRLLTAISTVFVAAEKQLTMLDRKSKRP
DMLVDWFGLESIVQDGLVF RE SYSIRSYEISADRTASIETVMNLLQET SLNHCKSMGILNDGFG
RTPEMCKRDLIWVLTKMQILVNRYPNWGDTVEINSWF SQ SGKIGMGRNWLISDCNTGEILIRA
T SIWAMMNQKTRRF SKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKF DVKTGD SICKGLTPEWN
DLDVNQHVSNVKYIGWILE SMPKEVLDTQELC SLTLEYRRECGRDSVLESVTAMDP SKVGDR
SQYQHLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKT SNGNSVS
SEQ ID NO: 144 ChtFATB2g amino acid sequence MVVAATAS SAFF PVPVPGT SPKPGKFGTWLS S S SP SYKPK SNP SGGFQVKANA SAHPK
ANGSAVSLKSGSLNTQEDT S S SPPPRTFLNQLPDW SRLLTAIS TVFVAAEKQLTMLDRKSKRPD
MLVDWFGLESIVQDGLVFREIYSIRSYEISADRTT SIETVMNLLQET SLNHCKSMGILNDGFGRT
PEMCKRDLIWVLTKMQILVNRYPNWGDTVEINSWF SQ SGKIGMGRNWLISDCNTGEILIRAT SI
WAMMNQKTRRF SKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEWNDL
DVNQHVSNVKYIGWILE SMPKEVLDTQ ELC S LT LEYRRECGRD SVLE SVTAMDP SKVGDRSQ
YQHLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKT SNANSVS
SEQ ID NO: 145 ChtFATB2h amino acid sequence MVVAAAA S SAFFPVPASGT SPKPGKF GT WL S S S SP SYKPK SNP SGGF QVKANASAHP
KANG SAVSLK SG SLNTQ EGT S S SPPPRTFLNQLPDW SRLLTAISTVFVAAEKQLTMLDRKSKRP
DMLVDWFGLESIVQDGLVF RE SYSIRSYEISADRTASIETVMNLLQET SLNHCKSMGILNDGFG
RTPEMCKRDLIWVLTKMQILVNRYPNWGDTVEINSWF SQ SGKIGMGRNWLISDCNTGEILIRA
T SIWAMMNQNTRRF SKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKF DVKTGD SIRKGLTPGWN
DLDVNQHVSNVKYIGWILE SIPTEVLETQELC SLTLEYRRECGRESVLE SVTAMNP SKVGDRSQ
YQHLLRLEDGADIMKGRTEWRPKNAGTNGAISTGKT SNGNSVS
SEQ ID NO: 146 ChtFATB3a amino acid sequence MVATAAS SAFFPVP SPDT S SRPGKLGNGS S SLRPLKPKFVANAGLQVKANASAPPKIN
GS SV SLK SC SLKTHEDTP SAPPPRTFINQLPDW SMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGRIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNEGFG
RTPEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL
T RA S SVWVMMNQKTRKL SKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTP
KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRE SVLE SLTAVDP SGKG
FGPQFQHLLRLEDGGEIVKGRTEWRPKTAGINGTIASGET SP GNS
SEQ ID NO: 147 ChtFATB3b amino acid sequence MVATAAS SAFFPVP SPDT S SRPGKLGNGS S SLRPLKPKFVANAGLQVKANASAPPKIN
GS SVSLKSGSLKTQEDTP SAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGFGRIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVK SAGLLIEGF GR
TPEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILT
RAS SVWVMMNQKTRKL SKIPDEVRHEIEPHF VD SAPVIEDDDWKLPKLDEKTAD SIRKGLT PK
WNDLDVNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRE SVLE SLTAVDP SGKGF
GPQFQHLLRLEDGGEIVKGRTEWRPKTAGINGTIASGET SPGNS
SEQ ID NO: 148 ChtFATB3c amino acid sequence MVATAAS SAFFPVP SPDT S SRPGKLGNGS S SLRPLKPKFVANAGLQVKANASAPPKIN
GS SV SLK SC SLKTHEDTP SAPPPRTFINQLPDW SMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGRIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNEGFG
RTPEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL
T RA S SVWVMMNQKTRKL SKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTP
KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRE SVLESLTAVDP SEKG
F GPQ F QHLLRLEDGGEIVKGRTE WRPKTAGINGAIAF GET SP GD S
SEQ ID NO: 149 ChtFATB3d amino acid sequence MVATAAS SAFFPVP SPDT S SRPGKLGNGS S SLRPLKPKFVANAGLQVKANASAPPKIN
GS SV SLK SC SLKTHEDTP SAPPPRTFINQLPDW SMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGRIVQDGLVFRQNF SIRSYEIGADRTASIKTVMNHLQETALNHVK SAGLLNEGF G
RTPEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL
T RA S SVWVMMNQKTRKL SKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTP
KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRE SVLE SLTAVDP SGKG
FGPQFQHLLRLEDGGEIVKGRTEWRPKTAGINGTIASGET SP GNS
SEQ ID NO: 150 ChtFATB3e amino acid sequence MVATAAS SAFFPVP SPDT S SRPGKLGNGS S SLRPLKPKFVANAGLQVKANASAPPKIN
GS SVSLKSGSLKTHEDTP SAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGRIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNEGFG
RTPEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL
T RA S SVWVMMNQKTRKL SKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTP
KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRE SVLE SLTAVDP SGKG
FGPQFQHLLRLEDGGEIVKGRTEWRPKTAGINGTIASGET SP GNS
SEQ ID NO: 151 ChtFATB3f amino acid sequence MVATAAS SAF FPVP SPDT S SRLGKLGNGS S SLRPLKPKFVANAGLQVKANASAPPKIN
GS SVSLKSGSLKTQEDTP SAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MPVDPFGLGRIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVK SAGLLNEGF G
RTPEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL
T RA S SVWVMMNQKTRKL SKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTP
KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRE SVLESLTAVDP SEKG
FGPQFQHLLRLEDGGEIVKGRTEWRPKTAGINGTIASGET SP GNS
SEQ ID NO: 152 ChtFATB3g amino acid sequence MVATAAS SAF FPVP SPDT S SRAGKLGNGS S SLRPLKPKFVANAGLQVKANASAPPKIN
GS SVSLKSGSLKTQEDTP SAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGRIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNEGFG
RTPEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL
T RA S SVWVMMNQKTRKL SKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTP
KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGRE SVLE SLTAVDP SGKG
FGPQFQHLLRLEDGGEIVKGRTEWRPKTAGINGTIASGET SP GNS
SEQ ID NO: 153 ChsFATB1 amino acid sequence MVATNAAAF SAYTFF LT SP THGYS SKRLADTQNGYPGT SLKSKSTPPPAAAAARNGA
LPLLASICKCPKKADGSMQLDS SLVFGFQFYIRSYEVGADQTVSIQTVLNYLQEAAINHVQ SAG
YFGDSFGATPEMTKRNLIWVITKMQVLVDRYPAWGDVVQVDTWTC S SGKNSMQRDWFVRD
LKTGDIIT RA S SVWVLMNRLTRKLSKIPEAVLEEAKLFVMNTAPTVDDNRKLPKLDGS SADYV
LSGLTPRW SDLDMNQHVNNVKYIAWILESVPQ SIPETHKL SAITVEYRRECGKNSVLQ SLTNV
SGDGITCGNSIIECHHLLQLETGPEILLARTEWISKEPGFRGAPIQAEKVYNNK
SEQ ID NO: 154 ChsFATB2 amino acid sequence MVATAAS SAF FPVP SP DA S SRPGKLGNGS S SL SPLKPKLMANGGLQVKANASAPPKIN
GS SVGLK SG SLKT QEDT P SAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGRIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFG
RTLEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEI
LTRAS SVWVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTP
KWNDLDVNQHVNNVKYIGWILE ST PPEVLET QELC SLTLEYRRECGRE SVLE SLTAVDP SGKG
SGSQFQHLLRLEDGGEIVKGRTEWRPKTAGINGPIASGET SPGDS S
SEQ ID NO: 155 ChsFatB2b amino acid sequence MVATAAS SAF FPVP SP DA S SRPGKLGNGS S SL SPLKPKLMANGGLQVKANASAPPKIN
GS SVGLK SG SLKT QEDT P SAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGRIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFG
RTLEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEI
LTRAS SK SQIMLPLHYC SVWVMMNQKTRRL SKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLD
EKTAD SIRKGLTPKWNDLDVNQHVNNVKYIGWILE S TPPEVLET QELC SLTLEYRRECGRE SV
LE SLTAVDP SGKGSGSQFQHLLRLEDGGEIVKGRTEWRPKTAGINGPIASGET SPGDS S
SEQ ID NO: 156 ChsFatB2c amino acid sequence MVATAAS SAF FPVP SP DA S SRPGKLGNGS S SL SPLKPKLMANGGLQVKANASAPPKIN
GS SVGLK SG SLKT QEDT P SAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGRIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFG
RTLEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEI
LTRAS SVWVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTP
KWNDLDVNQHVNNVKYIGWILE ST PPEVLET QELC SLTLEYRRECGRE SVLE SLTAVDP SGKG
SGSQFQHLMRLEDGGEIVKGRTEWRPKTAGINGPIASGET SPGDS S
SEQ ID NO: 157 ChsFatB2d amino acid sequence MVATAAS SAF FPVP SP DA S SRPGKLGNGS S SL SPLKPKLMANGGLQVKANASAPPKIN
GS SVGLK SG SLKT QEDT P SAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGRIVQDGLVFRQNF SIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFG
RTPEMYKRDLIWVVAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL
T RA S SVWVMMNQKTRRL SKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTP
KWNDLDVNQHVNNVKYIGWILE ST PPEVLET QELC SLTLEYRRECGRE SVLE SLTAVDP SGKG
SGSQFQHLLRLEDGGEIVKGRTEWRPKTAGINGPIASGET SPGDS S
SEQ ID NO: 158 ChsFATB3 amino acid sequence MVAAEAS SALF SVRTPGT SPKPGKFGNWPT SLSVPFKSKSNHNGGFQVKANASARPK
ANGSAVSLKSGSLDTQEDT SSSSSPPRTFINQLPDW SMLL SAITTVFVAAEKQWTMLDRKSKRP
DMLMDPFGVDRVVQDGAVFRQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFG
RTPEMCKRDLIWVVTKMHVEVNRYPTWGDTIEVNTWVSE SGKTGMGRDWLISDCHTGEILIR
AT SMCAMMNQKTRRF SKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRW
NDLDVNQHVNNVKYIGWILE SVPTEVF ETQELCGLTLEYRRECGRDSVLESVTAMDP SKEGD
RSLYQHLLRLEDGADIAKGRTKWRPKNAGTNGAI ST GKT SNGNSIS
SEQ ID NO: 159 ChsFatb3b amino acid sequence MVAAEAS SALF SVRTPGT SPKPGKFGNWPT SLSVPFKSKSNHNGGFQVKANASARPK
ANGSAVSLKSGSLDTQEDT SSSSSPPRTFINQLPDW SMLL SAITTVFVAAEKQWTMLDRKSKRP
DMLMDPF GVDRVVQDGAVFRQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFG
RTPEMCKRDLIWVVTKMHIEVNRYPTWGDTIEVNTWVSE SGKTGMGRDWLISDFHTGDILIR
AT SVCAMMNQKTRRF SKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRW
NDLDVNQHVNNVKYIGWILE SVPTEVF ETQELCGLTLEYRRECGRDSVLESVTAMDP SKEGD
RSLYQHLLRLEDGADIAKGRTKWRPKNAGTNGAI ST GKT SNGNSIS
SEQ ID NO: 160 ChsFatB3c amino acid sequence MVAAEAS SALF SVRTPGT SPKPGKFGNWPT SLSVPFKSKSNHNGGFQVKANASARPK
ANGSAVSLKSGSLDTQEDT SSSSSPPRTFINQLPDW SMLL SAITTVFVAAEKQWTMLDRKSKRP
DMLMDPF GVDRVVQDGAVFRQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFG
RTPEMCKRDLIWVVTKMHVEVNRYPTWGDTIEVNTWVSE SGKTGMGRDWLISDCHTGEILIR
AT SMCAMMNQKTRRF SKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRW
NDLDVNQHVNNVKYIGWILE SVPTEVFETQELCGLTLEYRQECGRDSVLESVTAMDP SKEGD
RSLYQHLLRLEDGTDIAKGRTKWRPKNAGKT SNGNSIS
SEQ ID NO: 161 ChsFATB3d amino acid sequence MVAAEAS SALF SVRTPGT SPKPGKFGNWP S SL SVPFKSKSNHNGGFQVKANASARPK
ANGSAVSLKSGSLDTQEDASSSS SPPRTFINQLPDW SMLL SAIT TVFVAAEKQWT MLDRK S KR
SDMLMDPFGVDRVVQDGAVFRQ SF SIRSYEIGADRTA SIETLMNIFQET SLNHCK SIGLLNDGF
GRTPEMCKRDLIWVVTKMHVEVNRYPTWGDTIEVNTWVSE SGKTGMGRDWLISDCHTGEILI
RAT SMCAMMNQKTRRF SKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPR
WNDLDVNQHVNNVKYIGWILE SVP TEVF ET QELCGLTLEYRRECGRD SVLE S VTAMDP SKEG
DRSLYQHLLRLEDGADIAKGRTKWRPKNAGTNGAISTGKT SNGNSIS
SEQ ID NO: 162 ChsFATB3e amino acid sequence MVAAEAS SALF SVRTPGT SPKPGKFGNWP S SL SVPFKSKSNHNGGFQVKANASARPK
ANGSAVSLKSGSLDTQEDASSSS SPPRTFINQLPDW SMLL SAIT TVFVAAEKQWT MLDRK S KR
SDMLMDPFGVDRVVQDGVVFRQ SF SIRSYEIGADRTA SIETLMNIFQET SLNHCK SIGLLNDGF
GRTPEMCKRDLIWVVTKMHVEVNRYPTWGDTIEVNTWVSE SGKTGMGRDWLISDCHTGEILI
RAT SMCAMMNQKTRRF SKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPR
WNDLDVNQHVNNVKYIGWILE SVP TEVF ET QELCGLTLEYRRECGRD SVLE S VTAMDP SKEG
DRSLYQHLLRLEDGADIAKGRTKWRPKNAGTNGAISTGKT SNGNSIS
SEQ ID NO: 163 ChsFATB3f amino acid sequence MVAAEAS SALF SVRTPGT SPKPGKFGNWP S SL SVPFKSKSNHNGGFQVKANASARPK
ANGSAVSLKSGSLDTQEDT SSSSSPPRTFINQLPDW SMLL SAITTVFVAAEKQWTMLDRKSKRP
DMLMDPF GVDRVVQDGAVFRQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFG
RTPEMCKRDLIWVVTKMHVEVNRYPTWGDTIEVNTWVSE SGKTGMGRDWLISDCHTGEILIR
AT SMCAMMNQKTRRF SKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRW
NDLDVNQHVNNVKYIGWILE SVPTEVF ETQELCGLTLEYRRECGRDSVLESVTAMDP SKEGD
RSLYQHLLRLEDGADIAKGRTKWRPKNAGTNGAI ST GKT SNGNSIS
SEQ ID NO: 164 ChsFATB3g amino acid sequence MVAAEAS SALF SVRTPGT SPKPGKFGNWPT SLSVPFKSKSNHNGGFQVKANASARPK
ANGSAVSLKSGSLDTQEDT SSSSSPPRTFINQLPDW SMLL SAITTVFVAAEKQWTMLDRKSKRP
DMLMDPFGVDRVVQDGAVFRQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFG
RTPEMCKRDLIWVVTKMHIEVNRYPTWGDTIEVNTWVSE SGKTGMGRDWLISDFHTGDILIR
AT SVCAMMNQKTRRF SKFPYEVRQELAPHFVD SAPVIEDYQKLHKLDVKT GD SICNGLTPRW
NDLDVNQHVNNVKYIGWILE SVPTEVFETQELCGLTLEYRQECGRDSVLESVTAMDP SKEGD
RSLYQHLLRLEDGTDIAKGRTKWRPKNAGKT SNGNSIS
SEQ ID NO: 165 ChsFATB3h amino acid sequence MVAAEAS SALF SVRTPGT SPKPGKFGNWP S SL SVPFKSKSNHNGGFQVKANASARPK
ANGSAVSLKSGSLDTQEDASSSS SPPRTF INQLPDW SMLL SAIT TVFVAAEKQWT MLDRK S KR
SDMLMDPFGVDRVVQDGVVF RQ SF S IRS YEIGADRTA SIETLMNIFQET SLNHCK SIGLLNDGF
GRTPEMCKRDLIWVVTKMHIEVNRYPTWGDTIEVNTWVSE SGKTGMGRDWLISDFHTGDILI
RAT SVCAMMNQKTRRF SKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPR
WNDLDVNQHVNNVKYIGWILE SVP TEVF ET QELCGLTLEYRQECGRD SVLE S VTAMDP SKEG
DRSLYQHLLRLEDGTDIAKGRTKWRPKNAGKT SNGNSIS
SEQ ID NO: 166 ChsFATB3i amino acid sequence MVAAEAS SALF SVRTPGT SPKPGKFGNWPT SLSVPFKSKSNHNGGFQVKANASARPK
ANGSAVSLKSGSLDTQEDT SSSSSPPRTFINQLPDW SMLL SAITTVFVAAEKQWTMLDRKSKRP
DMLMDPFGVDRVVQDGAVFRQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFG
RTPEMCKRDLIWVVTKMHVEVNRYPTWGDTIEVNTWVSE SGKTGMGRDWLISDCHTGEILIR
AT SMCAMMNQKTRRF SKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRW
NDLDVNQHVNNVKYIGWILE SVPTEVF ETQELCGLTLEYRRECGGDSVLESVTAMDP SKEGD
RSLYQHLLRLEDGADIAKGRTKWRPKNAGTNGAI ST GKT SNGNSIS
SEQ ID NO: 167 ChsFATB3j amino acid sequence MVAAEAS SALF SVRTPGT SPKPGKFGNWPT SLSVPFKSKSNHNGGFQVKANASARPK
ANGSAVSLKSGSLDTQEDT SSSSSPPRTFINQLPDW SMLL SAITTVFVAAEKQWTMLDRKSKRP
DMLMDPFGVDRVVQDGAVFRQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSIGLLNDGFG
RTPEMCKRDLIWVVTKMHIEVNRYPTWGDTIEVNTWVSE SGKTGMGRDWLISDFHTGDILIR
AT SVCAMMNQKTRRF SKFPYEVRQELAPHFVD SAPVIEDYQKLHKLDVKT GD SICNGLTPRW
NDLDVNQHVNNVKYIGWILE SVPTEVFETQELCGLTLEYRQECGRDSVLESVTAMDP SKEGD
RSLYQHLLRLEDGTDIAKGRTKWRPKNAGKT SNGNSIS
SEQ ID NO: 168 CcalcFATB1 (Cuphea calcarata FATB1) MVAAAAT SAFFPVPAPGT SPNPRKF GS WP S SL SP SLPKSIPNGGFQVKANASAHPKANGSAVSL
K SGS LNTQ ENT S S SPPPRTFLHQLPDW SRLLTAITTVFVKSKRPDMHDRK SKRPDMLVDLFGLE
S SVQDALVFRQ SF SIRSYEIGTDRTASIETLMNHLQET SLNHCKSTGILLDGFGRTLEMCKRELI
WVVIKMQIQVNRYPAWGDTVEINTRF SRLGKIGMGRDWLISDCNTGEILIRAT SEYAMMNQK
TRRL SKLPYEVHQEIAPLF VD SPPVIEDNDLKVHKF EVKTGD SIQKGL SPGWNDLDVNQHVSN
VKYIGWILESMPTEVLETQELC SLALEYRRECGRDSVLE SVTAMDP SKVGGRSQYQHLLRLED
GTAIVNGITEWRPKNAGANGAIS TGKT SNGNSVS
SEQ ID NO: 169 ChookFATB4 (Cuphea hookeriana FATB4) MVAAAAT SAFFPVPAPGT SPNPRKF GS WP S SL SP SLPNSIPNGGFQVKANASAHPKANGSAVSL
K SGS LNTQ ENT S S SPPPRTFLHQLPDW SRLLTAITTVFVKSKRPDMHDRK SKRPDMLVDLFGLE
S SVQDALVFRQRF SIRSYEIGTDRTA SMETLMNHLQET SLNHCKSTGILLDGFGRTLEMCKREL
IWVVIKMQIQVNRYPAWGDTVEINTRF SRLGKIGMGRDWLISDCNTGEILIRAT SEYAMMNQK
TRRLSKLPYEVRQEIAPLFVDSPPVIEDNDLKVHKFEVKTGDSIHKGLTPGWNDLDVNQHVNN

VKYIGWILE ST PPEVLET QELC SLTLEYRRECGRESVLESLTAMDP SGGGYGSQFQHLLRLEDG
GEIVKGRTEWRPKNGVINGVVPT GE S SPGDYS
SEQ ID NO: 170 CaFATB1 (Cuphea avigera var. pulcherrima FATB1) MVAAAAS SAFF SVPVPGT SPKPGKFRIWP S SLSP SFKPKPIPNGGLQVKANSRAHPKANGSAVS
LK SGSLNTQEDT S S SPP PRT F LHQLPDW SRLLTAIT TVFVK SKRPDMHDRK SKRPDMLMD SF G
LE SIVQEGLEFRQ SF SIRSYEIGTDRTASIETLMNYLQET SLNHCK ST GILLDGF GRT PEMCKRDL
IWVVTKMKIKVNRYPAWGDTVEINTWF SRLGKIGKGRDWLISDCNTGEILIRAT SAYATMNQ
KT RRL SKLPYEVHQEIAPLFVD SPPVIEDNDLKLHKFEVKTGDSIHKGLTPGWNDLDVNQHVS
NVKYIGWILE SMPTEVLETQELC SLALEYRRECGRDSVLE SVTAMDPTKVGGRSQYQHLLRLE
DGTDIVKCRTEWRPKNPGANGAISTGKT SNGNSVS
SEQ ID NO: 171 CpauFATB1 (Cuphea paucipetala FATB1) MVAAAAS SAFFPVPAPGT SPKPGKSGNWP S S L SP SIKPMSIPNGGFQVKANASAHPKANGSAV
NLKSGSLNTQEDT S SSPPPRAFLNQLPDW SMLLTAITTVFVAAEKQWTMRDRK SKRPDMLVD
SVGLKSVVLDGLVSRQIF SIRSYEIGADRTA STET LMNHLQET SINHCKSLGLLNDGFGRTPGMC
KNDLIWVLTKMQIMVNRYPTWGDTVEINTWF SHSGKIGMA SDWLITDCNTGEILIRAT SVWA
MMNQKTRRF S RLPYEVRQ ELT PHYVD SPHVIEDNDRKLHKFDVKTGD SIRKGLTPRWNDLDV
NQHVSNVKYIGWILESMPIEVLETQELC SLTVEYRRECGMDSVLESVTAMDP SEDEGRSQYKH
LLRLEDGTDIVKGRTEWRPKNAGTNGAISTAKP SNGNSVS
SEQ ID NO: 172 CprocFATB1 (Cuphea procumbens FATB1) MVAAAAS SAFFPAPAPGS SPKPGKSGNWP S SL SP SFK SKSIPYGRFQVKANASAHPKANGSAV
NLKSGSLNTQEDT S S SPPPRAFLNQLPDW SMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKNIVRDGLVSRQ SF LIRSYEIGADRTA S IETLMNHLQET SINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPAWGDTVEINTWF SQ SGKIGMGSDWLISDCNTGEILIRAT S VW
AMMNQKTRRF SRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVSNVKYIGWILESMPIEVLEAQELC SLTVEYRRECGMDSVLESVTAVDP SEDGGRSQYN
HLLRLEDGTDVVKGRTEWRPKNAETNGAISPGNT SNGNSIS
SEQ ID NO: 173 CprocFATB2 (Cuphea procumbens FATB2) MVAAAAS SAFFPAPAPGS SPKPGKSGNWP S SL SP SFK SKSIPYGRFQVKANASAHPKANGSAV
NLKSGSLNTQEDT S S SPPPRAFLNQLPDW SMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKNIVRDGLVSRQ SF LIRSYEIGADRTA S IETLMNHLQET SINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPAWGDTVEINTWF SQ SGKIGMGSDWLISDCNTGEILIRAT S VW
AMMNQKTRRF SRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVNNVKYIGWILE ST PPEVLETQELC SLTLEYRQECGRESVLE SLTAVDP SGKGFGSQFQH
LLRLEDGGEIVKGRTEWRPKTAGINGAIASGET SPGDF
SEQ ID NO: 174 CprocFATB3 (Cuphea procumbens FATB3) MVAAAAS SAFFPAPAPGS SPKPGKSGNWP S SL SP SFK SKSIPYGRFQVKANASAHPKANGSAV
NLKSGSLNTQEDT S S SPPPRAFLNQLPDW SMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKNIVRDGLVSRQ SF LIRSYEIGADRTA S IETLMNHLQET SINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPAWGDTVEINTWF SQ SGKIGMGSDWLISDCNTGEILIRAT S VW
AMMNQKTRRF SRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVNNVKYIGWILE ST PPEVLETQELC SLTLEYRRECGRESVLE SLTAVDP SGEGGYGSQFQ
HLLRLEDGGEIVKGRTEWRPKNAGINGVLPTGE

SEQ ID NO: 175 CigneaFATB1 (Cuphea ignea FATB1) PGT SRKTGKF GNWP S SL SP SFKPKSIPNGGF QVKANARAHPKANGSAVSLKSVSLNTQEDT SLS
PPPRAFLNQLPDWRMLRTALTTVFVAAEKQWTMLDRKSKRPDMLVDSFGLESIVQEGLVFRQ
SF SIRSYEIGIDRTA SIETLMNHLQET SLNQCKSAGILHDGFGRTLEMCKRDLIWVVTKMQIKV
NRYPAWGDTVEISTRF SRLGKIGMGRDWLICDCNTGEILIRAT SAYAMMNQKTRRL SKLPNEV
RQEIAPLFVD SDPVIEENDMKLHKFEVKTGDSICKGLTPRWSDLDVNQHVSNVKYIGWILESM
PT EVLETQ ELC SLALEYRRECGRD SVLE SVT SMDP SKVGGW SQYQHLLRLEDGADIVKGRTE
WRPKNAGANGAISTGKT
SEQ ID NO: 176 CcalcFATB1 (Cuphea calcarata FATB1) MATA ST F SAFNARCGDLRRSAGSGPRRPARPLPVRAAINA SAHPKANGSAVSLK SGSLETQ ED
NS S S SRPPRTFIKQLPDWSMLLSAITTVFVAAEKQWTMFDRKSKRSDMLVDPFVVDRIVQDGV
LFRQ SF SIRSYEIGADRTASIETLMNIFQET SLNHCKSMGLLYEGFGRTPEMCKRDLIWVVTKIH
IKVNRYPTWGDTIEVTTWVSE SGKNGMGRDWLISDCHTGEILIRAT SVWAMMNQTTRRL SKF
PYELRQEIAPHFVDSDPVIEDNRRLLNF DVKTGDSIRKGLTPRWNDLDVNQHVNNVKYIGWIL
E SVPTEVF DT RELCGLTLEYRQECGRGSVLE SVTAMDP SKEGDRSLYQHLLRLEDGTDIVKGR
TEWRPKNAGTNGPVSTRKTTNGS SVS
SEQ ID NO: 177 ChookFATB4 (Cuphea hookeriana FATB4) MATA ST F SAFNARCGDLRRSAGSGPRRPARPLPVRAAINASAHPKANGSAVSLKSGSLNTQEN
T S S SPPPRTFLHQLPDW SRLLTAITTVFVK SKRPDMHDRKSKRPDMLVDLFGLES SVQDALVFR
QRF SIRSYEIGTDRTASMETLMNHLQET SLNHCKSTGILLDGFGRTLEMCKRELIWVVIKMQIQ
VNRYPAWGDTVEINTRF SRLGKIGMGRDWLISDCNTGEILIRAT SEYAMMNQKTRRLSKLPYE
VRQEIAPLFVDSPPVIEDNDLKVHKFEVKTGD SIHKGLTPGWNDLDVNQHVNNVKYIGWILE S
TP PEVLETQ ELC S LT LEYRRECGRE SVLESLTAMDP SGGGYGSQFQHLLRLEDGGEIVKGRTE
WRPKNGVINGVVPTGES SPGDYS
SEQ ID NO: 178 CaFATB1 (Cuphea avigera var. pulcherrima FATB1) MATASTF SAFNARCGDLRRSAGSGPRRPARPLPVRAAIN SRAHPKANG SAVSLK SGSLNTQED
T S S SPPPRTFLHQLPDW SRLLTAIT TVFVK SKRP DMHDRK SKRPDMLMD SF GLE SIVQ EGLEF R
Q SF SIRSYEIGTDRTASIETLMNYLQET SLNHCKS TGILLDGFGRTPEMCKRDLIWVVTKMKIK
VNRYPAWGDTVEINTWF SRLGKIGKGRDWLISDCNTGEILIRAT SAYATMNQKTRRLSKLPYE
VHQEIAPLFVDSPPVIEDNDLKLHKFEVKTGDSIHKGLTPGWNDLDVNQHVSNVKYIGWILE S
MPTEVLETQELC SLALEYRRECGRDSVLESVTAMDPTKVGGRSQYQHLLRLEDGTDIVKCRTE
WRPKNPGANGAISTGKT SNGNSVS
SEQ ID NO: 179 CpauFATB1 (Cuphea paucipetala FATB1) MATA ST F SAFNARCGDLRRSAGS GPRRPARPLPVRAAINA SAHPKANGSAVNLK SGSLNTQE
DT S S SPPPRAF LNQ LPDW SMLLTAIT TVFVAAEKQWT MRDRK SKRPDMLVD SVGLK SVVLDG
LVSRQ IF SIRSYEIGADRTASIETLMNHLQET SINHCKSLGLLNDGFGRTPGMCKNDLIWVLTK
MQIMVNRYPTWGDTVEINTWF SHSGKIGMA SDWLITDCNTGEILIRAT SVWAMMNQKTRRF S
RLPYEVRQELTPHYVDSPHVIEDNDRKLHKFDVKTGD SIRKGLTPRWNDLDVNQHVSNVKYI
GWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAMDP SEDEGRSQYKHLLRLEDGTDI
VKGRTEWRPKNAGTNGAISTAKP SNGNSVS

SEQ ID NO: 180 CprocFATB1 (Cuphea procumbens FATB1) MATA ST F SAFNARCGDLRRSAGS GPRRPARPLPVRAAINA SAHPKANGSAVNLK SGSLNTQE
DT S S SPPP RAF LNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVDSVGLKNIVRDG
LVSRQ SF LIRSYEIGADRTA SIETLMNHLQET SINHCKSLGLLNDGFGRTPGMCKNDLIWVLTK
MQIMVNRYPAWGDTVEINTWF SQ SGKIGMGSDWLISDCNTGEILIRAT SVWAMMNQKTRRF S
RLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGD SIRKGLTPRWNDLDVNQHVSNVKYI
GWILESMPIEVLEAQELCSLTVEYRRECGMD SVLE SVTAVDP SEDGGRSQYNHLLRLEDGTDV
VKGRT EWRPKNAETNGAI SP GNT SNGNSIS
SEQ ID NO: 181 CprocFATB2 (Cuphea procumbens FATB2) MATA ST F SAFNARCGDLRRSAGS GPRRPARPLPVRAAINA SAHPKANGSAVNLK SGSLNTQE
DT S S SPPP RAF LNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVDSVGLKNIVRDG
LVSRQ SF LIRSYEIGADRTA SIETLMNHLQET SINHCKSLGLLNDGFGRTPGMCKNDLIWVLTK
MQIMVNRYPAWGDTVEINTWF SQ SGKIGMGSDWLISDCNTGEILIRAT SVWAMMNQKTRRF S
RLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGD SIRKGLTPRWNDLDVNQHVNNVKYI
GWILE ST PPEVLETQELC SLTLEYRQECGRESVLE SLTAVDP SGKGFGSQFQHLLRLEDGGEIV
KGRTEWRPKTAGINGAIASGET SPGDF
SEQ ID NO: 182 CprocFATB3 (Cuphea procumbens FATB3) MATA ST F SAFNARCGDLRRSAGS GPRRPARPLPVRAAINA SAHPKANGSAVNLK SGSLNTQE
DT S S SPPP RAF LNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVDSVGLKNIVRDG
LVSRQ SF LIRSYEIGADRTA SIETLMNHLQET SINHCKSLGLLNDGFGRTPGMCKNDLIWVLTK
MQIMVNRYPAWGDTVEINTWF SQ SGKIGMGSDWLISDCNTGEILIRAT SVWAMMNQKTRRF S
RLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGD SIRKGLTPRWNDLDVNQHVNNVKYI
GWILE ST PPEVLETQELC SLTLEYRRECGRESVLE SLTAVDP SGEGGYGSQFQHLLRLEDGGEI
VKGRTEWRPKNAGINGVLPTGE
SEQ ID NO: 183 CigneaFATB1 (Cuphea ignea FATB1) MATA ST F SAFNARCGDLRRSAGS GPRRPARPLPVRAAINARAHPKANGSAV SLK SV SLNT QED
T S L SPPP RAF LNQLPDWRMLRTALT TVFVAAEKQWTMLDRK SKRPDMLVD SF GLE SIVQ EGL
VFRQ SF SIRSYEIGIDRTA SIET LMNHLQ ET SLNQCKSAGILHDGFGRTLEMCKRDLIWVVTKM
QIKVNRYPAWGDTVEIS TRF SRLGKIGMGRDWLICDCNTGEILIRAT SAYAMMNQKTRRLSKL
PNEVRQEIAPLFVDSDPVIEENDMKLHKFEVKTGDSICKGLTPRWSDLDVNQHVSNVKYIGWI
LE SMPTEVLETQELC SLALEYRRECGRDSVLE SVT SMDP SKVGGW SQYQHLLRLEDGADIVK
GRTEWRPKNAGANGAISTGKT
SEQ ID NO: 184 CgFATB1 (Cuphea glossostoma FATB1) MVAAAA S SAFFP SPAPGS SPKPGNRP S SL SP SFKPKSIPNGAFQVKANA SAHPKANGSAVNLK SI
GSLNTQEDSSSSPSPRAFLNQLPDWSVLLTAITTVFVAAEKQWTMLDRKSKRPDVLVDSVGLK
SIVQDGLVSRQSFSIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGMCKNDLI
WVLTKMQIMVNRYPAWGDTVEINTWF SQSGKIGMGSDWLISDCNTGEILIRAT SVWAMMNQ
KT RRF SRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLDVNQHVS
NVKYIGWILE SMPIEVLETQELC SLTVEYRRECGMD SVLE SVTAMDP SEDGGRSQYNHLLRLE
DGT DVVKGRTEWRPKNAGTNGAI ST TKT SNGNSVS

SEQ ID NO: 185 CgFATB1b (Cuphea glossostoma FATB1 C170F,M198T,T374S variant) MVAAAAS SAFFP SPAPGS SPKPGNRP S SL SP SFKPKSIPNGAFQVKANA SAHPKANGSAVNLKS
GSLNTQED S S S SP SPRAFLNQLPDW SVLLTAITTVFVAAEKQWTMLDRKSKRPDVLVDSVGLK
SIVQDGLVSRQ SF SIRSYEIGADRTASIETLMNHLQET SINHFK SLGLLNDGFGRTPGMCKNDLI
WVLTKTQIMVNRYPAWGDTVEINTWF SQ SGKIGMGSDWLISDCNTGEILIRAT SVWAMMNQ
KT RRF SRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLDVNQHVS
NVKYIGWILE SMPIEVLETQELC SLTVEYRRECGMD SVLE SVSAMDP SEDGGRSQYNHLLRLE
DGTDVVKGRTEWRPKNAGTNGAISTTKT SNGNSVS
SEQ ID NO: 186 Umbellularia californica UcFATB3 amino acid sequence MVATAAASAF FPVGSPAT S SAT SAKASMMPDNLDARGIKPKPAS S SGLQVKANAHASPKINGS
KVS T DT LKGEDT LT S SPAPRTFINQLPDW SMFLAAITTIFLAAEKQWTNLDWKPRRPDMLADP
FGIGRFMQDGLIFRQHFAIRSYEIGADRTA SIETLMNHLQETALNHVRSAGLLGDGFGATPEMS
RRDLIWVVTRMQVLVDRYPAWGDIVEVETWVGASGKNGMRRDWLVRDSQTGEILTRAT SV
WVMMNKRTRRL SKIPEEVRGEIGPYFMENVAIIEED SRKLQKLNENIIEEDSRKLQKLNENTAD
NVRRGLT PRW SDLDVNQHVNNVKYIGWILE SAPGS ILE S HEL SCMTLEYRRECGKDSVLQ SM
TVVSGGGSAAGG SPE S SVECDHLLQLE SGPEVVKARTEWRPKSANNPRSILEMPAES S*
SEQ ID NO: 187 Cuphea carthagenensis CCrFATB2c (V138L variant of FATB2) MVAAAAS SAFF PVTTPGT SRKPGKFGNWL S SL SP PF RPK SIP SGGF QVKANA SAHP KANGSAV
SLKSGSLNTQEDT S S SPPPRAFINQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRSDMLVDS
FGMERIVQDGLLFRQ SF SIRSYEIGADRRASIETLMNHLQET SLNHCKSIRLLNEGFGRTPEMCK
RDLIWVVTRMHIMVNRYPTWGDTVEINTWVSQ SGKNGMGRDWLISDCNTGEILIRAT SAWA
MMNQKTRRL SKLPYEVSQEIAPHFVDSPPVIEDGDRKLHKFDVKTGDSIRKGLTPRWNDLDV
NQHVNNVKYIGWILESMPTEVLETHELCF LT LEYRRECGRD SVLE SVTAMDP SNEGGRSHYQ
HLLRLEDGTDIVKGRTEWRPKNARNIGAISTGKT SNGNPAS *
SEQ ID NO: 188 Cuphea carthagenensis CCrFATB2 MVAAAAS SAFF PVTTPGT SRKPGKFGNWL S SL SP PF RPK SIP SGGF QVKANA SAHP KANGSAV
SLKSGSLNTQEDT S S SPPPRAFINQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRSDMLVDS
FGMERIVQDGLVFRQ SF SIRSYEIGADRRASIETLMNHLQET SLNHCKSIRLLNEGFGRTPEMCK
RDLIWVVTRMHIMVNRYPTWGDTVEINTWVSQ SGKNGMGRDWLISDCNTGEILIRAT SAWA
MMNQKTRRL SKLPYEVSQEIAPHFVDSPPVIEDGDRKLHKFDVKTGDSIRKGLTPRWNDLDV
NQHVNNVKYIGWILESMPTEVLETHELCF LT LEYRRECGRD SVLE SVTAMDP SNEGGRSHYQ
HLLRLEDGTDIVKGRTEWRPKNARNIGAISTGKT SNGNPAS *
SEQ ID NO: 189 CcrFATB2b MVAAAAS SAFF PVTTPGT SRKPGKFGNWL S SL SP PF RPK SIP SGGF QVKANA SAHP KANGSAV
SLKSGSLNTQEDT S S SPPPRAFINQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRSDMLVDS
FGMERIVQDGLVFRQ SF SIRSYEIGADRRASIETLMNHLQET SLNHCKSIRLLNEGFGRTPEMCK
RDLIWVFTRMHIMVNRYPTWGDTVEINTWVSQ SGKNGMGRDWLISDCNTGEILIRAT SAWA
MMNQKTRRL SKLPYEVSQEIAPHFVDSPPVIEDGDRKLHKFDVKTGDSIRKGLTPRWNDLDV
NQHVNNVKYIGWILESMPTEVLETHELCF LT LEYRRECGRD SVLE SVTAMDP SNEGGRSHYQ
HLLRLEDGTDIVKGRTEWRPKNARNIGAIPTGKT SNGNPAS *

SEQ ID NO: 190 CcrFATB1 MVATAAS SAF FPVP SPD S S SRPGKLGNGP S SLSPLKPKSTPNGGLQVKANASAPPKINGS SVGL
KS S S LKT QDDT P SAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLTDPF
GLGRIVQDGLVF RQNF SIRS YEIGADRTA SIETVMNHLQETALNHVKSAGLLNDGFGRTPEMY
KRDLIWVVAKMQVMVNRYPT WGDTVEVNTWVAK SGKNGMRRDWLI SDCNTGEILT RA S SV
WVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTPKWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGKESVLE SLTAVDP SGKGWGSHFQ
HLLRLEDGGEIVKGRTEWRPKNAGINGAVAFEETSPGDS*
SEQ ID NO: 191 CcrFATB1b MVATAAS SAF FPVP SPD S S SRPGKLGNGP S SLSPLKPKSTPNGGLQVKANASAPPKINGS SVGL
KS S S LKT QDDT P SAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLTDPF
GLGRIAQDGLVF RQNF SIRS YEIGADRTA SIETVMNHLQETALNHVKSAGLLNDGFGRTPEMY
KRDLIWVVAKMQVMVNRYPT WGDTVEVNTWVAK SGKNGMRRDWLI SDCNTGEILT RA S SV
WVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTPKWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGKESVLE SLTAVDP SGKGWGSHFQ
HLLRLEDGGEIVKGRTEWRPKNAGINGAVAFEETSPGDS*
SEQ ID NO: 192 CCrFATB1c MVATAAS SAF FPVP SPD S S SRPGKLGNGP S SLSPLKPKSTPNGGLQVKANASAPPKINGS SVGL
KS S S LKT QDDT P SAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLTDPF
GLGRIVQDGLVF RQNF SIRS YEIGADRTA SIETVMNHLQETALNHVKSAGLLNDGFGRTPEMY
KRDLIWVVAKMQVMVNRYPT WGDTVEVNTWVAK SGKNGMRRDWLI SDCNTGEILT RA S SV
WVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTPKWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELC SLTLEYRRECGKESVLE SLTAVDP SGKGWGSHFQ
HLLRLEDGGEIVKGRTEWRPKNA*
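
The listings above carry spacing introduced during text extraction (for example `MVATAAS SAF FPVP`), and several end with a `*` terminator. The following is a minimal sketch, assuming a reader wants one contiguous residue string per SEQ ID NO before comparing sequences. The function name `normalize_sequence` and the decision to reject anything outside the 20 standard one-letter codes are illustrative assumptions, not part of the application.

```python
import re

# The 20 standard amino acid one-letter codes (upper case only).
STANDARD_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")


def normalize_sequence(raw: str) -> str:
    """Return a contiguous residue string from extracted listing text.

    All whitespace is removed and a single trailing '*' (terminator) is
    dropped. Any remaining character that is not a standard upper-case
    residue code raises ValueError, which flags likely extraction debris.
    """
    compact = re.sub(r"\s+", "", raw)
    if compact.endswith("*"):
        compact = compact[:-1]
    bad = sorted(set(compact) - STANDARD_RESIDUES)
    if bad:
        raise ValueError(f"unexpected characters in sequence: {bad}")
    return compact


if __name__ == "__main__":
    # Final line of SEQ ID NO: 192 as printed in the listing.
    print(normalize_sequence("HLLRLEDGGEIVKGRTEWRPKNA*"))
```

Rejecting unknown characters rather than silently dropping them is a deliberate choice here: a stray letter in a listing is more likely a scanning error worth inspecting than something safe to discard.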

Claims (24)

What is claimed is:
1. A nucleic acid construct comprising a regulatory element and a FatB gene expressing an active acyl-ACP thioesterase operable to produce an altered fatty acid profile in an oil produced by a cell expressing the nucleic acid construct, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 5 of Table 1a, the sequence having at least 94.6% sequence identity with each of SEQ ID NOs: 88, 82, 85, and 103, and optionally wherein the fatty acid of the oil is enriched in C8 and C10 fatty acids.
2. A nucleic acid construct comprising a regulatory element and a FatB gene expressing an active acyl-ACP thioesterase operable to produce an altered fatty acid profile in an oil produced by a cell expressing the nucleic acid construct, wherein the FatB gene expresses a protein having an amino acid sequence falling within one of clades 1-12 of Table 1a.
3. The nucleic acid construct of claim 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 1 of Table 1a, the sequence having at least 85.9% sequence identity with each of SEQ ID NOs: 19, 161, 22, and 160, and optionally wherein the fatty acid of the oil is enriched in C14 and C16 fatty acids.
4. The nucleic acid construct of claim 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 2 of Table 1a, the sequence having at least 89.5% sequence identity with each of SEQ ID NOs: 134-136, 132, 133, 137, 124, 122, 123, 125, and optionally wherein the fatty acid of the oil is enriched in C12 and C14 fatty acids.
5. The nucleic acid construct of claim 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 3 of Table 1a, the sequence having at least 92.5% sequence identity with each of SEQ ID NOs: 126 and 127, and optionally wherein the fatty acid of the oil is enriched in and C14 fatty acids.
6. The nucleic acid construct of claim 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 4 of Table 1a, the sequence having at least 83.8% sequence identity with SEQ ID NO: 79, and optionally wherein the fatty acid of the oil is enriched in C12 and C14 fatty acids.
7. The nucleic acid construct of claim 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 6 of Table 1a, the sequence having at least 99.9% sequence identity with each of SEQ ID NOs: 111 and 110, and optionally wherein the fatty acid of the oil is enriched in fatty acids.
8. The nucleic acid construct of claim 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 7 of Table 1a, the sequence having at least 89.5% sequence identity with each of SEQ ID NOs: 73, 106, 185, 172, 171, 173, 174, and optionally wherein the fatty acid of the oil is enriched in C10 and C12 fatty acids.
9. The nucleic acid construct of claim 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 8 of Table 1a, the sequence having at least 85.9% sequence identity with each of SEQ ID NOs: 112, 113, 142, 145, 143, 144, 139, 140, 138, 141, and optionally wherein the fatty acid of the oil is enriched in C12 and C14 fatty acids.
10. The nucleic acid construct of claim 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 9 of Table 1a, the sequence having at least 83.8% sequence identity with each of SEQ ID NOs: 187-189, and optionally wherein the fatty acid of the oil is enriched in C12 and C14 fatty acids.
11. The nucleic acid construct of claim 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 10 of Table 1a, the sequence having at least 95.9% sequence identity with each of SEQ ID NOs: 147, 149, 146, 150, 152, 151, 148, 154, 156, 155, 157, 108, 75, 190, 191, and 192, and optionally wherein the fatty acid of the oil is enriched in C14 and C16 fatty acids.
12. The nucleic acid construct of claim 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 11 of Table 1a, the sequence having at least 88.7% sequence identity with SEQ ID NO: 121, and optionally wherein the fatty acid of the oil is enriched in C14 and C16 fatty acids.
13. The nucleic acid construct of claim 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 12 of Table 1a, the sequence having at least 72.8% sequence identity with each of SEQ ID NOs: 129 and 186, and optionally wherein the fatty acid of the oil is enriched in fatty acids.
14. An isolated nucleic acid or recombinant DNA construct comprising a nucleic acid, wherein the nucleic acid has at least 70% sequence identity to any of SEQ ID NOS: 2, 3, 5, 6, 8, 9, 11, 12, 14, 15, 17, 18, 20, 21, 23, 24, 26, 27, 29, 30, 32, 33, 35, 36, 38, 39, 41, 42, 44, 45, 47, 48, 50, 51, 53, 54, 56, 57, 59, 60, 62, 63, 65, 66, 68, 69, 71, 72, 74, 76, 78, 80, 81, 83, 84, 86, 87, 89, 90, 92, 93, 95, 96, 98, 99, 101, 102, 104, 105, 107, 109 or any equivalent sequences by virtue of the degeneracy of the genetic code.
15. An isolated nucleic acid sequence encoding a protein or a host cell expressing a protein having at least 70% sequence identity to any of SEQ ID NOS: 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 37, 40, 43, 46, 49, 52, 55, 58, 61, 64, 67, 70, 73, 75, 77, 79, 82, 85, 88, 91, 94, 97, 100, 103, 106, 108, 110-192 or a fragment thereof having acyl-ACP thioesterase activity.
16. The isolated nucleic acid of claim 15, wherein, the protein has acyl-ACP thioesterase activity operable to alter the fatty acid profile of an oil produced by a recombinant cell comprising that sequence.
17. A method of producing a recombinant cell that produces an altered fatty acid profile, the method comprising transforming the cell with a nucleic acid according to any of claims 1-3.
18. A host cell produced by the method of claim 17.
19. The host cell of claim 18, wherein the host cell is selected from a plant cell, a microbial cell, and a microalgal cell.
20. A method for producing an oil or oil-derived product, the method comprising cultivating a host cell of claim 5 or 6, and extracting oil produced thereby, optionally wherein the cultivation is heterotrophic growth on sugar.
21. The method of claim 20, further comprising producing a fatty acid, fuel, chemical, or other oil-derived product from the oil.
22. An oil produced by the method of claim 20, optionally having a fatty acid profile comprising at least 20% C8, C10, C12, C14 or C16 fatty acids.
23. An oil-derived product produced by the method of claim 21.
24. The oil of claim 23, wherein the oil is produced by a microalgae and optionally, lacks C24-alpha sterols.
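
Several claims above require a sequence to have at least a stated percent identity "with each of" a set of reference sequences. This section does not say how identity is computed, so the following is only a rough sketch under stated assumptions: a plain Needleman-Wunsch global alignment with match +1, mismatch -1, gap -1, and identity taken as identical columns divided by alignment length. The scoring values, the denominator, and the helper names (`global_align`, `percent_identity`, `meets_threshold`) are all illustrative choices; real analyses normally use an established alignment tool with a substitution matrix.

```python
def global_align(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -1):
    """Needleman-Wunsch global alignment; returns the two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a: str, b: str) -> float:
    """Identical aligned columns as a percentage of alignment length."""
    if not a or not b:
        raise ValueError("both sequences must be non-empty")
    al_a, al_b = global_align(a, b)
    matches = sum(1 for x, y in zip(al_a, al_b) if x == y)
    return 100.0 * matches / len(al_a)


def meets_threshold(candidate: str, references: list, threshold: float) -> bool:
    """True only if the candidate reaches the threshold against every reference."""
    return all(percent_identity(candidate, ref) >= threshold for ref in references)


if __name__ == "__main__":
    # Toy strings for illustration only; these are not sequences from the listing.
    print(percent_identity("ACDE", "ACDF"))
    print(meets_threshold("ACDE", ["ACDE", "ACDF"], 75.0))
```

The `all(...)` in `meets_threshold` mirrors the "with each of" wording: a candidate that clears the threshold against only some of the listed references would not satisfy such a limitation.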
CA2904395A 2013-03-15 2014-03-13 Thioesterases and cells for production of tailored oils Abandoned CA2904395A1 (en)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US201361791861P 2013-03-15 2013-03-15
US61/791,861 2013-03-15
US13/837,996 2013-03-15
US13/837,996 US9290749B2 (en) 2013-03-15 2013-03-15 Thioesterases and cells for production of tailored oils
US201361917217P 2013-12-17 2013-12-17
US61/917,217 2013-12-17
PCT/US2014/026644 WO2014151904A1 (en) 2013-03-15 2014-03-13 Thioesterases and cells for production of tailored oils

Publications (1)

Publication Number Publication Date
CA2904395A1 true CA2904395A1 (en) 2014-09-25

Family

ID=51581068

Family Applications (1)

Application Number Title Priority Date Filing Date
CA2904395A Abandoned CA2904395A1 (en) 2013-03-15 2014-03-13 Thioesterases and cells for production of tailored oils

Country Status (9)

Country Link
EP (1) EP2971024A4 (en)
JP (1) JP2016518112A (en)
KR (1) KR20150128770A (en)
CN (1) CN105143458A (en)
AU (2) AU2014236763B2 (en)
BR (1) BR112015023192A8 (en)
CA (1) CA2904395A1 (en)
MX (1) MX2015011507A (en)
WO (1) WO2014151904A1 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9567615B2 (en) 2013-01-29 2017-02-14 Terravia Holdings, Inc. Variant thioesterases and methods of use
US9816079B2 (en) 2013-01-29 2017-11-14 Terravia Holdings, Inc. Variant thioesterases and methods of use
US9290749B2 (en) 2013-03-15 2016-03-22 Solazyme, Inc. Thioesterases and cells for production of tailored oils
US9783836B2 (en) 2013-03-15 2017-10-10 Terravia Holdings, Inc. Thioesterases and cells for production of tailored oils
WO2016014968A1 (en) 2014-07-24 2016-01-28 Solazyme, Inc. Variant thioesterases and methods of use
US10125382B2 (en) 2014-09-18 2018-11-13 Corbion Biotech, Inc. Acyl-ACP thioesterases and mutants thereof
US20180142218A1 (en) 2016-10-05 2018-05-24 Terravia Holdings, Inc. Novel acyltransferases, variant thioesterases, and uses thereof
EP4090735A1 (en) 2020-01-16 2022-11-23 Corbion Biotech, Inc. Beta-ketoacyl-acp synthase iv variants

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5344771A (en) 1990-04-26 1994-09-06 Calgene, Inc. Plant thiosterases
US5512482A (en) * 1990-04-26 1996-04-30 Calgene, Inc. Plant thioesterases
US5298421A (en) 1990-04-26 1994-03-29 Calgene, Inc. Plant medium-chain-preferring acyl-ACP thioesterases and related methods
US5455167A (en) * 1991-05-21 1995-10-03 Calgene Inc. Medium-chain thioesterases in plants
US5639790A (en) 1991-05-21 1997-06-17 Calgene, Inc. Plant medium-chain thioesterases
US5850022A (en) 1992-10-30 1998-12-15 Calgene, Inc. Production of myristate in plant cells
US5654495A (en) 1992-10-30 1997-08-05 Calgene, Inc. Production of myristate in plant cells
AU688377B2 (en) * 1993-09-03 1998-03-12 Max-Planck-Gesellschaft Zur Forderung Der Wissenschaften E.V. Medium chain-specific thioesterases
JPH09505470A (en) 1993-11-10 1997-06-03 カルジーン,インコーポレイティド Plant acyl ACP thioesterase sequence
US5807893A (en) 1993-11-18 1998-09-15 Voelker; Toni Alois Plant thioesterases and use for modification of fatty acid composition in plant seed oils
MY154965A (en) 2007-06-01 2015-08-28 Solazyme Inc Production of oil in microorganisms
CN101688220B (en) * 2007-07-09 2017-04-05 拜尔作物科学公司 Brassica plants comprising mutation acyl group ACP thioesterase allele
US7982035B2 (en) 2007-08-27 2011-07-19 Duquesne University Of The Holy Spirit Tricyclic compounds having antimitotic and/or antitumor activity and methods of use thereof
EP2297327A1 (en) * 2008-06-20 2011-03-23 BASF Plant Science GmbH Plants having enhanced yield-related traits and a method for making the same
SG171428A1 (en) 2008-11-28 2011-07-28 Solazyme Inc Manufacturing of tailored oils in recombinant heterotrophic microorganisms
EP2417246A4 (en) * 2009-04-10 2015-11-04 Reg Life Sciences Llc Production of fatty acid derivatives
AU2011257983B2 (en) 2010-05-28 2016-02-11 Corbion Biotech, Inc. Food compositions comprising tailored oils
KR101964965B1 (en) 2011-02-02 2019-04-03 테라비아 홀딩스 인코포레이티드 Tailored oils produced from recombinant oleaginous microorganisms
US8951762B2 (en) * 2011-07-27 2015-02-10 Iowa State University Research Foundation, Inc. Materials and methods for using an acyl—acyl carrier protein thioesterase and mutants and chimeras thereof in fatty acid synthesis
CN102586350A (en) * 2012-01-09 2012-07-18 北京化工大学 Production method for C8:0/C10:0/C12:0/C14:0 medium-chain fatty acid and ethyl ester thereof
SG11201406711TA (en) 2012-04-18 2014-11-27 Solazyme Inc Tailored oils

Also Published As

Publication number Publication date
JP2016518112A (en) 2016-06-23
WO2014151904A1 (en) 2014-09-25
EP2971024A1 (en) 2016-01-20
AU2014236763B2 (en) 2018-08-23
BR112015023192A8 (en) 2018-01-02
MX2015011507A (en) 2016-04-07
BR112015023192A2 (en) 2017-11-21
CN105143458A (en) 2015-12-09
AU2014236763A1 (en) 2015-10-01
AU2018267601A1 (en) 2018-12-06
EP2971024A4 (en) 2016-11-16
KR20150128770A (en) 2015-11-18

Legal Events

Date Code Title Description
EEER Examination request

Effective date: 20190312

FZDE Discontinued

Effective date: 20210831