WO2015197075A1 - Methods and materials for production of terpenoids - Google Patents

Methods and materials for production of terpenoids Download PDF

Info

Publication number
WO2015197075A1
WO2015197075A1 PCT/DK2015/050181 DK2015050181W WO2015197075A1 WO 2015197075 A1 WO2015197075 A1 WO 2015197075A1 DK 2015050181 W DK2015050181 W DK 2015050181W WO 2015197075 A1 WO2015197075 A1 WO 2015197075A1
Authority
WO
WIPO (PCT)
Prior art keywords
seq
host organism
ditps
heterologous nucleic
nucleic acid
Prior art date
Application number
PCT/DK2015/050181
Other languages
French (fr)
Inventor
Björn Hamberger
Johan Andersen-Ranberg
Eirini Pateraki
Birger Lindberg MØLLER
Morten Thrane Nielsen
Nikolaj Lervad HANSEN
Original Assignee
University Of Copenhagen
Danmarks Tekniske Universitet
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University Of Copenhagen, Danmarks Tekniske Universitet filed Critical University Of Copenhagen
Publication of WO2015197075A1 publication Critical patent/WO2015197075A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/795Porphyrin- or corrin-ring-containing peptides
    • C07K14/80Cytochromes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1085Transferases (2.) transferring alkyl or aryl groups other than methyl groups (2.5)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P5/00Preparation of hydrocarbons or halogenated hydrocarbons
    • C12P5/007Preparation of hydrocarbons or halogenated hydrocarbons containing one or more isoprene units, i.e. terpenes

Definitions

  • the present invention relates to the field of biosynthetic methods for producing diterpenoids.
  • Terpenes constitute a large and diverse class of organic compounds produced by a variety of plants as well as other species. Terpenes modified by oxidation or rearrangements are generally referred to as terpenoids.
  • Terpenes and terpenoids find multiple uses, for example as flavor compounds, additives for food, as fragrances and in medical treatment
  • Terpenes are derived biosynthetically from units of isoprene, which has the molecular formula C 5 H 8 .
  • Diterpenes are composed of four isoprene units and in nature they are produced from geranylgeranyl pyrophosphate.
  • diterpenes are produced with the aid of specific pairs of diterpene synthases (diTPS) derived from two classes, class I and class II.
  • Diterpenoids may be produced from diterpenes with the action of various enzymes, including CYP enzymes.
  • CYP enzymes have a narrow substrate specificity, which has been a major hurdle in broadening P450 applications in synthetic chemistry (Podust et al., 2012, Nat. Prod. Rep., 29, 1251 ).
  • plant CYPs are known to be recruited to highly specialised functions, and show a high degree of substrate specificity.
  • the present invention discloses that by combining diTPS enzymes of class I and class II, and CYPs from different species, many different diterpenoids may be produced including diterpenoids not identified in nature. Surprisingly, it is revealed that a diTPS enzyme of class I, a diTPS enzyme of class II may be combined with one or more CYPs, wherein the diTPSs and the CYP(s) are from different species, resulting in a high diversity of diterpenoids, which can be produced.
  • the invention features an inventory of functional class II and class I diTPS and CYPs from a range of plants, which are useful for accumulating high-value and bioactive diterpenoids.
  • these diTPS and CYPs are paired into specific modules consisting of new-to-nature combinations, such as using enzymes from different plant species, both the structure and the stereochemistry of the formed diterpenoids can be controlled.
  • This strategy gives access to a novel structural diversity of highly complex diterpenoids, many representing valuable compounds, for example flavours, fragrances, pharmaceuticals and fine chemicals.
  • the diterpenoids may also be potentially bioactive molecules, starting materials for chemical synthesis, and intermediates for further functionalization to flavours, fragrances, pharmaceuticals and fine chemicals.
  • the invention thus in one aspect provides methods of producing a terpenoid, said methods comprising the steps of: providing one or more host organisms, wherein
  • At least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class II,
  • At least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class I,
  • At least one of the host organisms comprises a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species;
  • CYP cytochrome P450
  • the invention also provides a host organism comprising
  • a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species.
  • the invention furthermore provides a collection of host organisms comprising at least two different host organisms, wherein
  • At least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class II,
  • At least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class I,
  • At least one of the host organisms comprises a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species.
  • CYP cytochrome P450
  • the term "said diTPS of class II, said diTPS of class I and said CYP are not all from the same species" means that the diTPS of class II, the diTPS of class I and the CYP are derived from at least two different species. For example the diTPS of class II and the diTPS of class I may be from the same species, whereas the CYP is from a different species.
  • the diTPS of class II and the CYP may be from the same species, whereas the diTPS of class I is from a different species. In another example the diTPS of class I and the CYP may be from the same species, whereas the diTPS of classll is from a different species. In another example the diTPS of class II and the diTPS of class I and the CYP are from three different species. Description of Drawings
  • Figure 1 provides an example of biosynthesis pathways to diterpenes of different stereochemistry.
  • the figure shows biosynthesis of three different isomers of manool by using diTPS enzymes from four different species: Oryza sativa (rice), Zea maize (maize), Coleus forskohlii (Indian coleus, medicinal plant) and Salvia sclarea (Clary sage, medicinal plant).
  • the diTPS from Oryza sativa may for example be the enzyme of SEQ ID NO:1 .
  • the diTPS from Zea maize may for example be the enzyme of SEQ ID NO:3.
  • the diTPS from Coleus forskohlii may for example be the enzyme of SEQ ID NO:5.
  • the diTPS from Salvia sclarea may for example be the enzyme of SEQ ID NO:1 1 .
  • Figures 2A and 2B show "Combinatorial wheels" showing examples of compounds, which can be made by combining different diTPS enzymes.
  • the universal precursor, GGPP is shown in the middle.
  • the next ring shows various examples of diTPS class II enzymes.
  • the next ring shows various examples of diTPS class I enzymes.
  • the outer ring shows the diterpenes produced by the indicated combinations of diTPS class II and diTPS class I enzymes. Each diterpene has been assigned a compound number used to identify said diterpene herein.
  • Table 1 provides a list and structures of the diterpenes.
  • Figures 3A and 3B shows the reactions catalysed by various class II diTPS enzymes as well as the diterpene pyrophosphate intermediates generated by the reactions.
  • Figure 4 shows an alignment of the amino acid sequences of selected diTPS enzymes of class I.
  • Figure 5 shows an alignment of the amino acid sequences of selected diTPS enzymes of class II.
  • Figure 6 shows an alignment of amino acid sequences of selected CYPs. 6 motifs named SRS1 , SRS2, SRS3, SRS4, SRS5 and SRS6 are marked above the alignment with black squares. 4 highly conserved motifs are marked by numbers 1 to 4 and shown in black boxes. Other highly conserved residues are also marked in black boxes. Somewhat conserved residues are marked with grey boxes.
  • Figure 7 shows CYP catalysed bioconversion of Dehydroabietadiene (1 .1 ) and Miltiradiene (1 .2).
  • Cytochrome P450 enzymes CfCYP76AH1 1 (top) and CfCYP71 D381 (bottom) catalyzes oxidation of Dehydroabietadiene (1 .1 ) and Miltiradiene (1 .2) into novel compounds 1 .1 .1 -1 .1 .4 and 1 .2.1 corresponding to hydroxylated or ketonated derivatives of Dehydroabietadiene (1 .1 ) and Miltiradiene (1 .2).
  • Figure 8A shows GC-MS chromatogram of N. benthamiana expressing
  • Figure 8B shows GC-MS chromatogram of N. benthamiana expressing CfTPSI + CfTPS3 + CYP71 D381 and mass spectra of compound 1 .1 .2-1 .1 .4.
  • Figure 9 shows CYP catalysed bioconversion of Manool (1 .3). Cytochrome P450 enzymes CfCYP76AH8 (top), CfCYP76AH1 1 (middle) and PsCYP720B4 (bottom) catalyzes oxidation of Manool (1 .3) into novel compounds 1 .3.1 -1 .3.10 corresponding to hydroxylated or dihydroxylated derivatives of Manool (1 .3).
  • Figure 10A shows GC-MS chromatogram of N. benthamiana expressing
  • Figure 10B shows GC-MS chromatogram of N. benthamiana expressing
  • Figure 10C shows GC-MS chromatogram of N. benthamiana expressing
  • Figure 1 1 shows CYP catalysed bioconversion of syn-manool (1 .4).
  • Cytochrome P450 enzymes CfCYP71 D381 catalyzes oxidation of syn-manool (1 .4) into novel compounds 1 .4.1 -1 .4.3 where 1 .4.2 and 1 .4.3 correspond to hydroxylated derivatives of syn- manool (1 .4).
  • Figure 12 shows GC-MS chromatogram of N. benthamiana expressing
  • Figure 13 shows production of compound 1 .2.1 by a yeast strain expressing the diterpene synthases CfTPSI and CfTPS3 as well as the cytochrome P450
  • Panel A shows extracted ion chromatogram recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material of strain IT-7- T13.
  • the black trace corresponds to a yeast strain expressing the diterpene synthases CfTPSI and CfTPS3 as well as the cytochrome P450 PsCYP720B4, whereas the grey trace corresponds to a yeast strain expressing only the diterpene synthases CfTPSI and CfTPS3.
  • Panel B UV absorption (top), primary mass spectra (middle) as well as MS-n fragmentation mass spectra (bottom).
  • Panel C shows a schematic
  • Figure 14 shows production of oxidized manool.
  • Panel A shows a schematic presentation of formation of manool (1 .3) and proposed cytochrome P450 catalyzed oxidation of manool in both the plant Nicotiana benthamiana and in yeast after heterologous expression of CfTPSI , SsSCS and the cytochrome P450 enzymes CYP76AH8, CYP76AH1 1 and CYP720B4, which catalyse oxidation of manool into novel compounds 1 .3.1 -1 .3.16.
  • Panel B shows extracted ion chromatograms of three strains recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material.
  • Extracted ion trace combine the ions corresponding to the mass of manool (273 m/z), monohydroxy-manool (289 m/z, hydrogen adduct of water loss ion), dihydroxymanool (347 m/z) and keto-hydroxy-manool (345 m/z).
  • Result from yeast strains expressing CfTPSI , SsSCS and CfCYP76AH8 (strain IT7-T50)(black trace, top), CfTPSI and SsSCS (dark grey trace, middle) or an untransformed reference strain (light grey, bottom) are shown.
  • Panels C, D and E display, UV (top), primary mass spectra (middle) as well as MS-n fragmentation mass spectra (bottom) of putative keto-hydroxy mannol (1 .3.1 1 ), dihydroxy manool (1 .3.12) and hydroxyl manool (1 .3.13) in yeast cell expressing CfTPSI , SsSCS, CfCYP76AH8, strain name IT7-T50.
  • Figure 15 shows production of oxidized manool.
  • Panel A shows extracted ion chromatograms of two strains recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material. Extracted ion trace combine the ions corresponding to the mass of monohydroxy-manool (329 m/z, sodium adduct), dihydroxymanool (345 m/z, sodium adduct) and keto-hydroxy-manool or a carboxylic acid of manool (343 m/z, sodium adduct). Result from yeast strains expressing CfTPSI , SsSCS and PsCYP720B4 (black trace, top), CfTPSI and SsSCS (dark grey trace, bottom) are shown.
  • Panels B display, UV (top), primary mass spectra (bottom) of the oxidized compounds 1 .3.12-1 .3.16 in yeast cells expressing CfTPSI , SsSCS, PsCYP720B4, strain name IT7-T37.
  • Figure 16 shows detection of oxidised isopimaradiene.
  • Panel A shows a schematic representation of formation of syn-or iso-pimaradienes (1 .5-group) and proposed cytochrome P450 catalyzed oxidation of syn-or iso-pimaradienes into novel compounds 1 .5.1 -1 .5.6 tentatively identified as hydroxylated, dihydroxylated or ketonated derivatives of either syn- or iso-pimaradiene based on their parental masses.
  • Panel B shows extracted ion chromatogram recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material.
  • Extracted ion correspond to the mass of syn-pimara-9,(1 1 ),15-diene or similar C20H32 diterpene oxidized to contain either one keto- and one hydroxyl group or a single carboxylic acid group (303 m/z) for yeast cells expressing OssynCPP, CfTPS3 and PsCYP720B4 (IT7-T58)(black trace), untransformed reference strain (grey trace).
  • Panel C UV (top), primary mass spectra (middle) as well as MS-n fragmentation mass spectra (bottom) of the five oxidized diterpene derivatives.
  • Figure 17 shows detection of higher oxidised manool through pairs of cytochrome P450s.
  • Panel A shows a schematic representation. Cytochrome P450 enzyme pairs CYP76AH8, CYP71 D381 ; CYP76AH8, CYP76AH1 1 and CYP76AH8, CYP720B4 catalyse oxidation of Manool into novel compounds 1 .3.14-1 .3.18.
  • Panel B shows extracted ion chromatograms of three strains recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material.
  • Yeast cells expressing CfTPSI , SsSCS and CfCYP76AH8 and CfCYP71 D381 (strain IT7-T32)(black trace, top)
  • yeast cells expressing CfTPSI , SsSCS and CfCYP76AH8 (strain IT7-T50)(dark grey trace, middle) or an untransformed control strain (light grey, bottom).
  • Panel C display, UV (top), primary mass spectra (middle) as well as MS-n fragmentation mass spectra (bottom) of oxidized diterpenes.
  • Figure 18 shows detection of oxidized manool.
  • CfTPSI , SsSCS and CfCYP76AH8 and CfCYP76AH1 1 (strain IT7-T31 )(black trace, top), CfTPSI , SsSCS and CfCYP76AH8 (dark grey trace, middle), untransformed control strain (light grey, bottom).
  • Panel B display, UV (top), primary mass spectra (middle) as well as MS-n fragmentation mass spectra (bottom) of oxidized diterpenes.
  • Figure 19 shows detection of oxidized manool.
  • Panel A shows extracted ion chromatograms of three strains recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material.
  • Yeast cells expressing CfTPSI , SsSCS and CfCYP76AH8 and PsCYP720B4 strain IT7-T9(black trace, top)
  • yeast cells expressing CfTPSI , SsSCS and CfCYP76AH8 dark grey trace, middle
  • yeast cells expressing CfTPSI , SsSCS and PsCYP720B4 light grey, bottom.
  • Panel B displays, UV (top), primary mass spectra (middle) as well as MS-n fragmentation mass spectra (bottom) of oxidized diterpenes.
  • Figure 20 shows a schematic pathway for production of ferruginol.
  • the present invention relates to a biosynthetic method for producing diterpenoids.
  • the methods typically involves the steps of a) providing one or more host organisms, wherein
  • At least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class II,
  • At least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class I, III. at least one of the host organisms comprises a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species; b) Incubating said one or more host organisms either simultaneously or
  • GGPP geranylgeranyl pyrophosphate
  • the host organism may be cultivated in a cultivation medium.
  • the host organism is a microorganism
  • typically said host organism is cultivated in a cultivation medium under conditions allowing growth of the particular microorganism.
  • the diterpenoid may be isolated from the host organism(s and/or from the cultivation medium.
  • intermediates may also be obtained and/or isolated from the host organism(s) and/or the cultivation medium.
  • the host organism wherein the host organism is a plant, said host organism may be cultivated in soil.
  • the diterpenoid may typically be isolated from the host organism or parts thereof.
  • intermediates may also be obtained and/or isolated from the host organism or part thereof.
  • step a) comprises providing one host organism comprising:
  • CYP cytochrome P450
  • said host organism may contain one or more additional heterologous nucleic acids.
  • the host organism may comprise at least two heterologous nucleic acids encoding different CYPs. Said CYPs may be from the same or from different species.
  • the host organism may comprise:
  • CYP cytochrome P450
  • one host organism expresses both diTPS of class II, diTPS of class I and one or more CYPs
  • the methods may employ different host organisms expressing one or more of said enzymes.
  • said host organisms may be incubated in cultivation medium together or sequentially. When incubated sequentially one or more host organisms may be incubated together followed by incubation of one or more other host organisms. Alternatively, each host organism may be incubated separately.
  • a host organism comprising a heterologous nucleic acid encoding a diTPS of class II may beincubated in cultivation medium before or simultaneously with any host organism comprising a heterologous nucleic acid encoding a diTPS of class I
  • a host organism comprising a heterologous nucleic acid encoding a diTPS of class I may beincubated in cultivation medium before or simultaneously with any host organism comprising a heterologous nucleic acid encoding a CYP.
  • the methods of the invention may comprise the steps of a) providing
  • a host organism comprising a heterologous nucleic acid encoding a diTPS of class I, III. a host organism comprising one or more heterologous nucleic acids encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species; b) Incubating said host organisms under conditions allowing growth of said host organisms, wherein
  • GGPP geranylgeranyl pyrophosphate
  • GGPP thereby producing a diterpene pyrophosphate intermediate, followed by incubation of the host organism under II. in the presence of said diterpene pyrophosphate intermediate there by producing a diterpene, followed by incubation of the host organism(s) under III. in the presence of said diterpene, c) Optionally isolating diterpenoid from the host organism(s) and/or from the cultivation medium if the host organism has been cultivated in a cultivation medium.
  • the order of incubation may be in the reverse order, i.e. starting with incubation of host organism(s) containing downstream enzymes, such as CYP or diTPS of class II, followed by incubation of host organism(s) containing upstream enzymes, such as diTPS of class I. This may be done in order to prevent accumulation of intermediates and/or undesired side products.
  • the methods of the invention may comprise the steps of a) providing two host organisms, whereon the first host organism comprises
  • the second host organism comprises:
  • a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species; b) Incubating said host organisms under conditions allowing growth of said host organisms, wherein
  • both host organisms are incubated simultaneously in the presence of geranylgeranyl pyrophosphate (GGPP); or ii. the first host organism is incubated in the presence of GGPP thereby producing a diterpene, followed by incubation of the second host organism in the presence of said diterpene; c) Optionally isolating diterpenoid from the host organism(s) and/or from the cultivation medium if the host organism has been cultivated in a cultivation medium.
  • GGPP geranylgeranyl pyrophosphate
  • the methods of the invention may comprise the steps of a) providing two host organisms, wherein the first host organism comprises
  • cytochrome P450 with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species; b) Incubating said host organisms in cultivation medium under conditions allowing growth of said host organisms, wherein
  • both host organisms are incubated simultaneously in the presence of geranylgeranyl pyrophosphate (GGPP); or ii. the first host organism is incubated in the presence of GGPP thereby producing a diterpene, followed by incubation of the second host organism in the presence of said diterpene; c) Optionally isolating diterpenoid from the host organism(s) and/or from the
  • the methods of the invention may comprise the steps of a) providing at least three host organisms, wherein the first host organism
  • a heterologous nucleic acids encoding a cytochrome P450 (CYP) and the third and optionally further host organisms each comprises
  • a heterologous nucleic acids encoding a cytochrome P450 (CYP) different to the CYP under III., with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species;
  • CYP cytochrome P450
  • GGPP geranylgeranyl pyrophosphate
  • the first host organism is incubated in the presence of GGPP thereby producing a diterpene, followed by incubation of the second host organism in the presence of said diterpene, thereby producing an intermediate diterpenoid, followed by incubation of the third host organism in the presence of said intermediate diterpenoid, thereby producing a second intermediate diterpenoid; followed by simultaneous or sequential incubation of said further host organisms in the presence of intermediate diterpenoids; or
  • the first and the second host organism are incubated together in the presence of GGPP thereby producing an intermediate diterpenoid, followed by incubation of the third host organism in the presence of said intermediate diterpenoid;
  • the first and the second host organism are incubated together in the presence of GGPP thereby producing an intermediate diterpenoid, followed by incubation of the third host organism in the presence of said intermediate diterpenoid, thereby producing a second intermediate diterpenoid; followed by simultaneous or sequential incubation of said further host organisms in the presence of intermediate diterpenoids; or
  • diTPS of class I diTPS of class II
  • various CYPs various CYPs
  • the method may be performed in vitro or in vivo. It is generally preferred, that the methods of the invention are performed in vivo.
  • the term "in vivo" as used herein refers to that the method is performed within a host organism, which for example may be any of the host organisms described herein below in the section "Host organism”.
  • the diterpene pyrophosphate intermediate and the diterpene may for example be any of the compounds described herein below in the sections "Diterpene pyrophosphate intermediates" and “Diterpenes”.
  • the diterpenoid intermediates and the diterpenoids may be any of the compounds described herein below in the section "Diterpenoids".
  • the host organism is capable of producing GGPP.
  • the host organism comprising a heterologous nucleic acid encoding a diTPS of class II is capable of producing GGPP.
  • the step of incubating the host organism(s) in the presence of GGPP may simply be performed by cultivating said host organism.
  • Many host organisms produce GGPP endogenously.
  • the host organism may be a host organism, which endogenously produce GGPP.
  • Such host organisms for example include plants and yeast.
  • the host organism may be recombinantly modulated to upregulate production of GGPP. This may be achieved by expressing one or more enzymes involved in the production of GGPP in said host organism. In addition or alternatively, the expression of one or more enzymes involved in reducing the level of GGPP may be reduced or even abolished.
  • the host organism may comprise a heterologous nucleic acid encoding a 1 -deoxy-D-xylulose-5-phosphate synthase.
  • the host organism may comprise a heterologous nucleic encoding DXS, such as DXS Coleus forskohlii.
  • DXS may be a polypeptide encoded by SEQ ID NO:25 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the invention relates to DXS encoded by SEQ ID NO:25 or a functional homologue, as well as to host organisms containing a nucleic acid encoding same.
  • the host organism may comprise a heterologous nucleic acid encoding a GGPP synthase (GGPPS), for example a GGPPS1 , such as GGPPS1 of Coleus forskohlii.
  • GGPPS1 may be a polypeptide encoded by SEQ ID NO:26 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the invention relates to GGPPS1 encoded by SEQ ID NO:26 or a functional homologue, as well as to host organisms containing a nucleic acid encoding same.
  • GGPP is introduced to the host organism. If the host organism is a microorganism, then GGPP may be added to the cultivation medium of said microorganism. If the host organism is a plant, then GGPP may be added to the growing soil of the plant or it may be introduced into the plant by infiltration. Thus, if the heterologous nucleic(s) are introduced into the plant by infiltration, then GGPP may be co-infiltrated together with the heterologous nucleic acid(s).
  • the methods of producing a diterpernoid typically comprises the steps of: providing a host organism or a collection of host organism, wherein
  • At least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class II,
  • At least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class I,
  • At least one of the host organisms comprises a heterologous nucleic acid encoding a cytochrome P450 (CYP) b) preparing an extract of said host organism(s) or preparing individual extracts or each host organism;
  • CYP cytochrome P450
  • GGPP geranylgeranyl pyrophosphate
  • step f) optionally, incubating the product of the incubation under e) with one or more of said extracts, wherein step f) may optionally be repeated; thereby producing a diterpenoid.
  • an extract containing all enzymes may be incubated with GGPP thereby producing a diterpenoid, or extracts containing different enzymes may be used sequentially. If the extracts are used sequentially the order is of importance.
  • steps d) to f) may for example be performed in either of the following ways: i. incubating an extract containing diTPS of class I, diTPS of class II and one or more CYPs in the presence of geranylgeranyl pyrophosphate (GGPP); or ii. incubating an extract containing diTPS of class I and diTPS of class II in the presence of GGPP, thereby producing a diterpene, followed by incubation an extract containing CYP in the presence of said diterpene; or
  • the intermediate products for example the diterpene, the diterpene pyrophosphate intermediate or any of the intermediate diterpenoids, may be purified or partly purified before performing the next step of the method.
  • crude extract or cultivation medium is used for the subsequent steps.
  • a useful combination of a diTPS of class II and a diTPS of class I and one or more CYPs must be employed.
  • Examples of specific combinations of a diTPS of class II and a diTPS of class I, which leads to production of specific diterpenes, are shown in figure 2.
  • the specific combination of a diTPS of class II and a diTPS of class I shown in figure 2 may be employed in combination with one or more CYP.
  • Other combinations of diTPS of class II and diTPS of class I may be used.
  • the diTPS of class II is selected so that it produces a diterpene pyrophosphate intermediate containing a decalin core having the desired stereochemistry at the 9 and 10 substitutions.
  • Useful diTPS of class II are described below and also specific diTPS of class II catalysing formation of diterpene pyrophosphate intermediates with a specific stereochemistry are described.
  • the diTPS of class I is selected so that is catalyses the conversion of the diterpene pyrophosphate intermediate to a diterpene, which can serve as an intermediate in the production of the final diterpenoid.
  • a diterpenoid according to the present invention is a diterpene substituted with one or more hydroxyl, keto and/or carboxyl groups, for simplicity the unsubstitued diterpene may be referred to as the "precursor diterpene" of the diterpenoid.
  • dehydroabietadiene may be referred to as the precursor diterpene of hydroxy dehydroabietadiene.
  • diTPS of class I is selected so that is catalyses the conversion of the diterpene pyrophosphate
  • Useful diTPS of class I are described below. Also specific reactions catalysed by various diTPS of class I are described, enabling the skilled person to select a useful diTPS of class I for production of a precursor diterpene of a desired diterpenoid.
  • the CYP(s) are collectively selected so that they catalyse oxidation of the precursor diterpene to obtain the desired diterpenoid. If the diterpenoid is a hydroxylated diterpene it is generally sufficient to use one CYP. Said one CYP is selected so that it can catalyse hydroxylation of the precursor diterpene. If the diterpenoid contains ketone and/or carboxyl and/or several hydroxyl groups, then one or more CYP may be employed. Details regarding how to select useful CYPs are given herein below in the section "CYP".
  • nucleic acids encoding same may be expressed in the host organism(s) allowing production of the diterpenoid in a host organism.
  • Putative useful combinations of a diTPS of class II and a diTPS of class I and CYP(s) for production of a given diterpenoid may be tested by expressing said diTPS of class II and said diTPS of class I and said CYP(s) in one or more host organism followed by testing for production of the diterpenoid, e.g. by GC-MS analysis and/or NMR analysis.
  • a diTPS of class II and a diTPS of class I and CYP for production of a given diterpenoid may in particular be tested as described in Example 1 herein below.
  • Methods for expression of enzymes in host organisms are well known to skilled person, and may for example include the methods described herein below in the section "Heterologous nucleic acids”.
  • GGPP as used herein refers to geranylgeranyl diphosphate and is a compound of the following structure:
  • PPO- is diphosphate.
  • PPO- and -OPP may be used interchangeably herein.
  • diTPS of class II may be any of the diTPS of class II described herein below in the section "diTPS of class II".
  • diTPS of class I may be any of the diTPS of class I described herein below in the section "diTPS of class I".
  • CYP may be any of the CYPs described herein below in the section "CYP”.
  • the methods of the invention use a diTPS of class II.
  • the invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class II.
  • Said diTPS of class II is an enzyme capable of catalysing protonation-initiated cationic cycloisomerization of GGPP to form a diterpene pyrophosphate intermediate.
  • the class II diTPS reaction may be terminated either by deprotonation or by water capture of the diphosphate carbocation.
  • diTPS of class II may be an enzyme capable of catalysing the reaction I:
  • PPO- is diphosphate and the indicates either a double bond or two single bonds, wherein one is substituted with -OH and the other with -CH 3 .
  • the bond may be in any conformation.
  • diTPS of class II the stereochemistry of the diterpene produced may be controlled. Accordingly, by following the description of the present invention, the skilled person may be able to design the production of a given precursor diterpene by selecting appropriate diTPS enzymes of class II and class I as described herein.
  • the diTPS of class II is generally a polypeptide sharing at least some sequence similarity to at least one of SEQ ID NO:1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7 or SEQ ID NO:8.
  • the diTPS of class II shares at least 30%, preferably at least 40% sequence identity with at least one of SEQ ID NO:1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7 and SEQ ID NO:8.
  • the diTPS of class II shares at least 30%, such as at least 35% sequence identity to the sequence of SsLPPS (SEQ ID NO:6) or to the sequence of AtCPS (see figure 5). Furthermore, it is preferred that the diTPS of class II in addition to above mentioned sequence identity also contains the following motif of four amino acids: D/E-X-D-D,
  • X may be any amino acid, such as any naturally occurring amino acids.
  • X may be an amino acid with a hydrophobic side chain, and thus X may for example be selected from the group consisting of A, I, L, M, F, W, Y and V.
  • X is an amino acid with a small hydrophobic side chain, and thus X may be selected from the group consisting of A, I, L and V.
  • said motif of four amino acids is: D/E-l/V-D-D
  • D/E indicates that said amino acid may be D or E and l/V indicates that said amino acid may be I or V.
  • Amino acids are herein named using the lUPAC nomenclature for amino acids.
  • the diTPS of class II contains above described motif in a position corresponding to position aa 372 to 375 of SsLPPS of SEQ ID NO:6.
  • a position corresponding to position aa 372 to 375 of SsLPPS of SEQ ID NO:6 is identified by aligning the sequence of a diTPS of class II of interest to SEQ ID NO:6 and optionally to additional sequences of diTPS of class II as e.g. shown in figure 5 and identifying the amino acids of said diTPS of class II aligning with aa 372 to 375 of SsLPPS of SEQ ID NO:6.
  • the diTPS of class II when aligned to the sequence of ScLPPS (SEQ ID NO:6), then preferably the diTPS of class II also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids marked by a black box in figure 5.
  • the diTPS of class II when aligned to the sequence of sequence of AtCPS (see figure 5), then preferably the diTPS of class II also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids marked by a black box in figure 5.
  • the diTPS of class II may for example be selected from the group consisting of diTPS of class II of the following types:
  • syn-CPP type such as any of the enzymes described herein below in the
  • ent-CPP type such as any of the enzymes described herein below in the
  • (+)-CPP type such as any of the enzymes described herein below in the
  • LPP type such as any of the such as any of the enzymes described herein below in the section "LPP type diTPS"
  • v. LPP like type such as any of the enzymes described herein below in the
  • diTPS enzymes are bifunctional in the sense that they may be classified as both class II and class I diTPS enzymes.
  • Such bifunctional diTPS enzymes in general contain both the four amino acids motif: D/E-X-D-D, described herein above, as well as the five amino acid motif: D-D-X-X-D/E, described herein below.
  • D-D-X-X-D/E the diTPS of class II is not a bifunctional enzyme of both class II and class I.
  • the diTPS of class I is not a bifunctional enzyme of both class II and class I.
  • the methods of the invention involve use of a diTPS of class II.
  • the invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class II.
  • said diTPS of class II is a syn-CPP type diTPS.
  • Such diTPS of class II are in particular useful in embodiments of the inventions, wherein the diterpenoid to be produced contains a 9S,10R decalin core.
  • the term "syn-CPP type diTPS" refers to any enzyme capable of catalysing the reaction II: wherein PPO- refers to diphosphate.
  • syn-CPP type diTPS may be syn-copalyl pyrophosphate synthase (syn-CPP), such as syn-CPP from Oryza sativa. Syn-CPP may also be referred to as CPSsyn.
  • said syn-CPP type diTPS may be a polypeptide of SEQ ID NO:1 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the sequence identity is preferably calculated as described herein below in the section "Sequence identity".
  • a functional homologue of a syn-CPP is a polypeptide, which is also capable of catalysing reaction II described above.
  • the heterologous nucleic acid encoding said syn-CPP type diTPS may for example be any nucleic acid encoding any of the syn-CPP type diTPS described herein.
  • the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:1 or any of functional homologue thereof described in the previous paragraph.
  • said heterologous nucleic acid may comprise or consist of SEQ ID NO:31 .
  • ent-CPP type The methods of the invention involve use of a diTPS of class II.
  • the invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class II.
  • said diTPS of class II is an ent-CPP type diTPS.
  • Such diTPS of class II are in particular useful in embodiments of the inventions, wherein the diterpenoid to be produced contains a 9R,10R decalin core.
  • the term "ent-CPP type diTPS" refers to any enzyme capable of catalysing the reaction III:
  • PPO- refers to diphosphate
  • the ent-CPP type diTPS may be EpTPS7.
  • said ent- CPP type diTPS may be a polypeptide of SEQ ID NO:2 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said ent-CPP type diTPS may for example be any nucleic acid encoding any of the ent-CPP type diTPS described herein.
  • the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:2 or any of functional homologue thereof described in the previous paragraph.
  • the ent-CPP type diTPS may be ZmAN2.
  • said ent- CPP type diTPS may be a polypeptide of SEQ ID NO:3 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:3 or any of functional homologue thereof described in the previous paragraph.
  • the sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of an ent-CPP is a polypeptide, which is also capable of catalysing reaction III described above.
  • the methods of the invention involve use of a diTPS of class II.
  • the invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class II.
  • said diTPS of class II is a ( ⁇ )-CPP type diTPS.
  • Such diTPS of class II are in particular useful in embodiments of the inventions, wherein the diterpenoid to be produced contains a 9S, 10S decalin core.
  • (+)-CPP type diTPS refers to any enzyme capable of catalysing the reaction IV:
  • PPO- refers to diphosphate
  • the (- -CPP type diTPS may be TwTPS7.
  • said ⁇ ⁇ ⁇ ⁇ )- CPP type diTPS may be a polypeptide of SEQ ID NO:4 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said ( -)-CPP type diTPS may for example be any nucleic acid encoding any of the (+)-CPP type diTPS described herein.
  • the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:4 or any of functional homologue thereof described in the previous paragraph.
  • the ⁇ -CPP type diTPS may be CfTPSI .
  • said (+)- CPP type diTPS may be a polypeptide of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:5 or any of functional homologue thereof described in the previous paragraph.
  • said heterologous nucleic acid may comprise or consist of SEQ ID NO:27.
  • Said heterologous nucleic acid may also comprise or consist of SEQ ID NO:28. This may in particular be the case in embodiments of the invention wherein the host organism is yeast, e.g. S. cerevisiae.
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of a ( ⁇ -CPP is a polypeptide, which is also capable of catalysing reaction IV described above. LPP type diTPS
  • the methods of the invention involve use of a diTPS of class II.
  • the invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class II.
  • said diTPS of class II is a LPP type diTPS.
  • Such diTPS of class II are in particular useful in embodiments of the inventions, wherein the diterpenoid to be produced contains a 8-hydroxy-decalin core.
  • LPP type diTPS may also be useful in other embodiments of the invention.
  • LDP type diTPS refers to any enzyme capable of catalysing the reaction V:
  • PPO- refers to diphosphate
  • the LPP type diTPS may be labda-13-en-8-ol pyrophosphate synthase, such as SsLPPS.
  • said LPP type diTPS may be a polypeptide of SEQ ID NO:6 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the diTPS of class II is SsLPPS or a functional homologue thereof sharing above mentioned sequence identity
  • the diTPS of class I is not SsSCS, CfTPS3, CfTPS4 or EpTPS8 or a functional homologue of any of the aforementioned sharing at least 70% sequence identity therewith.
  • the diTPS of class II is SsLPPS
  • it is preferred that the diTPS of class I is not SsSCS, CfTPS3, CfTPS4 or EpTPS8.
  • the heterologous nucleic acid encoding said LPP type diTPS may for example be any nucleic acid encoding any of the LPP type diTPS described herein.
  • the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:6 or any of functional homologue thereof described in the previous paragraph.
  • the diTPS of class II is SsCPSL, then it is preferred that the diTPS of class I is not SsKSLI or SsKSL2.
  • the LPP type diTPS may be TwTPS21 .
  • said LPP type diTPS may be a polypeptide of SEQ ID NO:7 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:7 or any of functional homologue thereof described in the previous paragraph.
  • the LPP type diTPS may be CfTPS2.
  • said LPP type diTPS may be a polypeptide of SEQ ID NO:17 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the diTPS of class II is CfTPS2 or a functional homologue thereof sharing above mentioned sequence identity
  • the diTPS of class I is not CfTPS3 or CfTPS4 or EpTPS8 or a functional homologue of the aforementioned sharing at least 70% sequence identity therewith.
  • the diTPS of class II is CfTPS2
  • it is preferred that the diTPS of class I is not CfTPS3 or CfTPS4 or EpTPS8.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:17 or any of functional homologue thereof described in the previous paragraph.
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of a LPP is a polypeptide, which is also capable of catalysing reaction V described above.
  • LPP like type diTPS The methods of the invention involve use of a diTPS of class II.
  • the invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class II.
  • said diTPS of class II is a LPP like type diTPS.
  • the LPP like type diTPS may be TwTPS14/28.
  • said LPP like type diTPS may be a polypeptide of SEQ ID NO:8 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said LPP like type diTPS may for exampe be any nucleic acid encoding any of the LPP like type diTPS described herein.
  • the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:8 or any of functional homologue thereof described in the previous paragraph.
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • the methods of the invention involve use of a diTPS of class I.
  • the invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class I.
  • Said diTPS of class I is an enzyme capable of catalyzing cleavage of the diphosphate group of the diterpene pyrophosphate intermediate and additionally preferably also is capable of catalysing cyclization and/or rearrangement reactions on the resulting carbocation.
  • deprotonation or water capture may terminate the class I diTPS reaction leading to hydroxylation of the diterpene pyrophosphate intermediate.
  • the diTPS of class I is generally a polypeptide sharing at least some sequence similarity to at least one of SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:1 1 , SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16 or SEQ ID NO:17.
  • the diTPS of class I shares at least 30%, preferably at least 40%, more preferably at least 45% sequence identity with at least one of SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:1 1 , SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16 and SEQ ID NO:17.
  • the diTPS of class I shares at least 30%, such as at least 35% sequence identity to the sequence of ScSCS (SEQ ID NO:1 1 ) or to the sequence of AtEKS (see figure 4). Furthermore, it is preferred that the diTPS of class I in addition to above mentioned sequence identity also contains the following motif of five amino acids:
  • X may be any amino acid, such as any naturally occurring amino acids.
  • X may be an amino acid with a hydrophobic side chain, and thus X may for example be selected from the group consisting of A, I, L, M, F, W, Y and V.
  • X is an amino acid with a small hydrophobic side chain, and thus X may be selected from the group consisting of A, I, L and V.
  • D/E indicates that said amino acid may be D or E.
  • the diTPS of class I contains said motif in a position corresponding to position aa 329-333 of SsSCS of SEQ ID NO:1 1 .
  • a position corresponding to position aa 329-333 of SsSCS of SEQ ID NO:1 1 is identified by aligning the sequence of a diTPS of class I of interest to SEQ ID NO:1 1 and optionally to additional sequences of diTPS of class I as e.g. shown in figure 4, and identifying the amino acids of said diTPS of class I aligned with aa 329-333 of SsSCS of SEQ ID NO:1 1 .
  • the diTPS of class I when aligned to the sequence of ScSCS (SEQ ID NO:1 1 ), then preferably the diTPS of class I also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids marked by a black box in figure 4.
  • the diTPS of class I when aligned to the sequence of sequence of AtEKS (see figure 4), then preferably the diTPS of class I also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids marked by a black box in figure 4.
  • the diTPS of class I may for example be selected from the group consisting of diTPS of class I of the following types:
  • EpTPS8 like diTPS, such as any of the enzymes described herein below in the section "EpTPS8"
  • EpTPS23 like diTPS, such as any of the enzymes described herein below in the section "EpTPS23"
  • SsSCS like diTPS, such as any of the enzymes described herein below in the section "SsSCS"
  • CfTPS3 like diTPS, such as any of the enzymes described herein below in the section "CfTPS3"
  • TwTPS2 like diTPS, such as any of the enzymes described herein below in the section "TwTPS2"
  • EpTPSI like diTPS such as any of the enzymes described herein below in the section "TwTPSI "
  • CfTPS14 like diTPS, such as any of the enzymes described herein below in the section "CfTPS14"
  • the diTPS of class I may in one embodiment also be MvTPS5 like diTPS, such as any of the enzymes described herein below in the section "MvTPS5".
  • the diTPS of class I may be the polypeptide of SEQ ID NO:41 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said diTPS of Tripterygium Wilfordii may for example be any nucleic acid encoding any of the diTPS of Tripterygium Wilfordii described herein.
  • the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:41 or any of functional homologue thereof described in the previous paragraph.
  • the heterologous nucleic acid may be a nucleic acid of SEQ ID NO:45.
  • the invention involves use of a diTPS of class I.
  • said diTPS of class I may be an EpTPS8 like diTPS.
  • the diTPS of class I is a EpTPS8 like diTPS, then it may be preferred that the diTPS of class II is not CfTPS2 or SsLPPS or a functional homologue of any of the
  • the diTPS of class I is EpTPS8
  • the diTPS of class II is not CfTPS2 or SsLPPS.
  • said diTPS of class I may be an EpTPS8 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure.
  • said diTPS of class I may be and EpTPS8 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas I, II, III, XLI, XLII, XLIII, XLIV or XLV:
  • the waved line " i " as used herein indicates a bond of undefined stereochemistry, i.e. the bond may be either a " I " or " 5 ".
  • the diterpene containing a core of formula I or I I may have different stereochemistry.
  • the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by a EpTPS8 like diTPS.
  • EpTPS8 like diTPS may be any enzyme capable of catalysing the reaction VI I :
  • EpTPS8 like diTPS may be an enzyme catalysing the reaction VI II :
  • reaction VII I the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • EpTPS8 like diTPS may also be an enzyme catalysing the reaction IX:
  • reaction IX the produced diterpene will general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • EpTPS8 like diTPS may also be an enzyme catalysing the reaction X:
  • reaction X the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • EpTPS8 like diTPS may be an enzyme catalysing the reaction XXV:
  • EpTPS8 like diTPS may be a terpene synthase from Euphobia peplus, and in particular it may be TPS8 from Euphobia peplus. TPS8 from Euphobia peplus is also referred to as EpTPS herein.
  • said EpTPS8 like diTPS may be a polypeptide of SEQ ID NO:9 or a functional homologue thereof sharing at least
  • 70% such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said EpTPS8 like diTPS may for example be any nucleic acid encoding any of the EpTPS8 like diTPS described herein.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:9 or any of functional homologue thereof described in the previous paragraph.
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of EpTPS8 is a polypeptide, which is also capable of catalysing at least one of reactions VII, VIII, IX, X and XXV described above.
  • the invention involves use of a diTPS of class I.
  • said diTPS of class I may be an EpTPS23 like diTPS.
  • said diTPS of class I may be an EpTPS23 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure.
  • said diTPS of class I may be an EpTPS23 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas I, II, XLIV or XLI:
  • the diterpene containing a core of formula I or II may have different stereochemistry.
  • the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by an EpTPS23 like diTPS.
  • EpTPS23 like diTPS may in particular be an enzyme capable of catalysing the reaction XI:
  • EpTPS23 like diTPS may be an enzyme catalysing the reaction VIII:
  • reaction VIII the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the EpTPS23 like diTPS may also be an enzyme catalysing the reaction IX:
  • an EpTPS23 like diTPS may be a diterpene synthase from
  • the EpTPS23 like diTPS may be TPS23 of Euphobia peplus.
  • TPS23 of Euphobia peplus may also be referred to as EpTPS23 herein.
  • said EpTPS23 like diTPS may be a polypeptide of SEQ ID NO:10 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said EpTPS23 like diTPS may for example be any nucleic acid encoding any of the EpTPS23 like diTPS described herein.
  • the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:10 or any of functional homologue thereof described in the previous paragraph.
  • the sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of EpTPS23 is a polypeptide, which is also capable of catalysing at least one of reactions VIII or IX described above.
  • SsSCS The invention involves use of a diTPS of class I.
  • said diTPS of class I may be a SsSCS like diTPS.
  • said diTPS of class I may be a SsSCS like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of formula III, XLVI, XXVII, XXVIII, XXIX, XXX, XXXI, XXXII, XXXIII, or XXXIV:
  • the diterpene containing a decalin substituted at the 10 position with said C 5 -alkenyl chain, or the diterpene containing a core of formula III may have different stereochemistry.
  • the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by a SsSCS like diTPS.
  • the SsSCS like diTPS may be any enzyme capable of catalysing the following reaction XII:
  • Diterpene containing a decalin core substituted at the 10 position with C 5 -alkenyl chain, which optionally may be substituted with a hydroxyl and/or a methyl group and/or C OR diterpene containing a core structure of formula III.
  • the SsSCS like diTPS may in particular be an enzyme capable of catalysing the reaction XVI:
  • -GPP is diphosphate
  • the doited lines without star indicates a bond, which optionally is present.
  • ⁇ hus may be ⁇ Ch2 or OH . It is to be understood that in embodiments of the invention, wherein [he doited line
  • a SsSCS like diTPS may in particular be an enzyme capable of catalysing the reaction XVII:
  • reaction XVI I the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the SsSCS like diTPS may be an enzyme catalysing any of the reactions XII I, XIV and XV shown in figure 1 .
  • the SsSCS like diTPS may also be an enzyme catalysing the following reaction XXVI I I:
  • OPP is diphosphate and P is a C 5 -alkenyl substituted with methyl and/or hydroxyl.
  • PM is C 5 -alkenyl containing one or two double bonds.
  • R is alkenyl containing one double bond
  • said alkenyl is preferably substituted with hydroxyl and methyl.
  • R is alkenyl containing two double bonds
  • said alkenyl is preferably substituted with methyl.
  • the SsSCS like diTPS may also be an enzyme catalysing the following reaction XXIX:
  • Xi is either -OH or methyl
  • X 2 is either -H or -OH, wherein one and only one of Xi and X 2 is -OH.
  • R 2 is C 5 -alkenyl containing one or two double bonds.
  • R 2 is alkenyl containing one double bond
  • R 2 is alkenyl containing two double bonds, said alkenyl is preferably substituted with methyl.
  • the SsSCS like diTPS may also be an enzyme catalysing the reaction X:
  • reaction X the produced diterpene will general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the SsSCS like diTPS may also be an enzyme catalysing the reaction LI I:
  • OPP indicates diphosphate
  • a SsSCS like diTPS may be SCIareol Synthase (SCS) from Salvia Sclarea.
  • SCS from Salvia Sclarea may also be referred to as SsSCS herein.
  • said SsSCS like diTPS may be a polypeptide of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said SsSCS like diTPS may for example be any nucleic acid encoding any of the SsSCS like diTPS described herein.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:1 1 or any of functional homologue thereof described in the previous paragraph.
  • said heterologous nucleic acid may comprise or consist of SEQ ID NO:32.
  • Said heterologous nucleic acid may also comprise or consist of SEQ ID NO:33. This may in particular be the case in embodiments of the invention wherein the host organism is yeast, e.g. S. cerevisiae.
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of SsSCS is a polypeptide, which is also capable of catalysing at least one of reactions XII, XIII, XIV, XV, XVI, XVII, XXVIII or XXIX or Lll described above.
  • the invention involves use of a diTPS of class I.
  • said diTPS of class I may be a CfTPS3 like diTPS.
  • the diTPS of class I is a CfTPS3 like diTPS, then it may be preferred that the diTPS of class II is not CfTPS2 or SsLPPS or a functional homologue thereof sharing at least 70% sequence identity therewith.
  • the diTPS of class I is CfTPS3
  • it may be preferred that the diTPS of class II is not CfTPS2 or SsLPPS.
  • said diTPS of class I may be a CfTPS3 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure.
  • said diTPS of class I may be a CFTPS3 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas VI, IX XXXV, XXXVI, II, XXXVII, XXVIII, XXXIX, XL, III or XXXII:
  • the diterpene containing a core of formula VI, IX, XXXV, II or XXXIX may have different stereochemistry.
  • the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by the CfTPS3 like diTPS.
  • the CfTPS3 like diTPS may be any enzyme capable of catalysing the reaction XXIII:
  • Diterpene pyrophosphate intermediate containing a decalin core structure Diterpene containing a core structure of formula VI or formula IX, XXXV, XXXVI, ⁇ , ⁇ ⁇ , XXXVI I I, XXXIX, XL, I I I or XXXI I.
  • the CfTPS3 like diTPS may in particular be an enzyme capable of catalysing the reaction XXIV:
  • reaction XXIV the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the CfTPS3 like diTPS may in particular be an enzyme capable of catalysing the reaction XXI I :
  • reaction XXI I the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the CfTPS3 like diTPS may in particular be an enzyme capable of catalysing the reaction LI N :
  • OPP is diphosphate.
  • the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the CfTPS3 like diTPS may in particular be an enzyme capable of catalysing the reaction LIV:
  • OPP is diphosphate.
  • the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the CfTPS3 like diTPS may also be an enzyme catalysing the reaction X:
  • the CfTPS3 like diTPS may be a diterpene synthase from Coleus forskohlii.
  • the CfTPS3 like diTPS may be a TPS3 from Coleus forskohlii.
  • TPS3 from Coleus forskohlii may also be referred to as CfTPS3.
  • said CfTPS3 like diTPS may be a polypeptide of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said CfTPS3 like diTPS may for example be any nucleic acid encoding any of the CfTPS3 like diTPS described herein.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:12 or any of functional homologue thereof described in the previous paragraph.
  • said heterologous nucleic acid may comprise or consist of SEQ ID NO:29.
  • Said heterologous nucleic acid may also comprise or consist of SEQ ID NO:30. This may in particular be the case in embodiments of the invention wherein the host organism is yeast, e.g. S. cerevisiae.
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of CfTPS3 is a polypeptide, which is also capable of catalysing at least one of reactions XXII, XXIII or XXIV described above.
  • the invention involves use of a diTPS of class I.
  • said diTPS of class I may be a CfTPS4 like diTPS.
  • the diTPS of class I is a CfTPS4 like diTPS, then it may be preferred that the diTPS of class II is not CfTPS2 or SsLPPSor a functional homologue of any of the
  • the diTPS of class I is CfTPS4, then it is preferred that the diTPS of class II is not CfTPS2 or SsLPPS.
  • said diTPS of class I may be a CfTPS4 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure.
  • said diTPS of class I may be a CfTPS4 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas VI, IX, XXXV, XXXVI, II, XXXVII, XXXVIII, XXXIX or XL:
  • the diterpene containing a core of formula VI, IX, XXXV, II, or XXXIX may have different stereochemistry.
  • the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by the CfTPS4 like diTPS.
  • the CfTPS4 like diTPS may be any enzyme capable of catalysing the reaction XXIII:
  • the CfTPS4 like diTPS may in particular be an enzyme capable of catalysing the reaction XXIV:
  • reaction XXIV the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the CfTPS4 like diTPS may in particular be an enzyme capable of catalysing the reaction XXII:
  • reaction XXII the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the CfTPS4 like diTPS may in particular be an enzyme capable of catalysing the reaction LIN:
  • OPP is diphosphate.
  • the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the CfTPS4 like diTPS may in particular be an enzyme capable of catalysing the reaction LIV:
  • OPP is diphosphate.
  • the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the CfTPS4 like diTPS may be a diterpene synthase from Coleus forskohlii.
  • the CfTPS4 like diTPS may be a TPS4 from Coleus forskohlii.
  • TPS4 from Coleus forskohlii may also be referred to as CfTPS4.
  • said CfTPS4 like diTPS may be a polypeptide of SEQ ID NO:13 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said CfTPS4 like diTPS may for example be any nucleic acid encoding any of the CfTPS4 like diTPS described herein.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:13 or any of functional homologue thereof described in the previous paragraph.
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of CfTPS4 is a polypeptide, which is also capable of catalysing at least one of reactions XXII, XXIII or XXIV described above.
  • the invention involves use of a diTPS of class I.
  • said diTPS of class I may be a MvTPS5 like diTPS.
  • said diTPS of class I may be a MvTPS5 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure.
  • said diTPS of class I may be a MvTPS5 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas VI, IX, XXXV, XXXVI, ⁇ ⁇ , ⁇ ⁇ , XXVIII, XXXIX, XL, III or XXXII:
  • the diterpene containing a core of formula VI, IX, XXXV, II, XXXIX or III may have different stereochemistry.
  • the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by the MvTPS5 like diTPS.
  • the MvTPS5 like diTPS may be any enzyme capable of catalysing the reaction XXIII:
  • the MvTPS5 like diTPS may in particular be an enzyme capable of catalysing the reaction LV:
  • reaction LV the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the MvTPS5 like diTPS may in particular be an enzyme capable of catalysing the reaction LVI:
  • the MvTPS5 like diTPS may in particular be an enzyme capable of catalysing the reaction LIN:
  • OPP is diphosphate.
  • the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the MvTPS5 like diTPS may in particular be an enzyme capable of catalysing the reaction LIV:
  • OPP is diphosphate.
  • the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the MvTPS5 like diTPS may also be an enzyme catalysing the reaction X:
  • reaction X the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the MvTPS5 like diTPS may be a diterpene synthase from
  • the MvTPS5 like diTPS may be a TPS5 from
  • MvTPS5 Marrubium vulgare.
  • TPS5 from Marrubium vulgare may also be referred to as MvTPS5.
  • said MvTPS5 like diTPS may be a polypeptide of SEQ ID NO:46 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of MvTPS5 is a polypeptide, which is also capable of catalysing at least one of reactions LIN, LIV, LV or LVI described above.
  • the invention involves use of a diTPS of class I.
  • said diTPS of class I may be a TwTPS2 like diTPS.
  • said diTPS of class I may be a TwTPS2 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure.
  • said diTPS of class I may be a TwTPS2 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas IV, V or X:
  • the diterpene containing a core of formula IV and V may have different stereochemistry.
  • the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by the TwTPS2 like diTPS.
  • the TwTPS2 like diTPS may be any enzyme capable of catalysing the reaction XXVI:
  • the TwTPS2 like diTPS may be any enzyme capable of catalysing conversion of a diterpene pyrophosphate intermediate to a diterpene containing a core of either formula IV or V.
  • the TwTPS2 like diTPS may in particular be an enzyme capable of catalysing the reaction XIX:
  • reaction XIX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the TwTPS2 like diTPS may in particular be an enzyme capable of catalysing the reaction XXVII:
  • reaction XIX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the TwTPS2 like diTPS may in particular be an enzyme capable of catalysing the reaction XX:
  • reaction XX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the TwTPS2 like diTPS may be a diterpene synthase from
  • TwTPS2 like diTPS may be a TPS2 from Tripterygium wilfordii.
  • TPS2 from Tripterygium wilfordii may also be referred to as TwTPS2.
  • said TwTPS2 like diTPS may be a polypeptide of SEQ ID NO:14 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said TwTPS2 like diTPS may for example be any nucleic acid encoding any of the TwTPS2 like diTPS described herein.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:14 or any of functional homologue thereof described in the previous paragraph.
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of TwTPS2 is a polypeptide, which is also capable of catalysing at least one of reactions, XIX, XX, XXVI or XXVII described above.
  • the invention involves use of a diTPS of class I.
  • said diTPS of class I may be an EpTPSI like diTPS.
  • said diTPS of class I may be an EpTPSI like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure
  • said diTPS of class I may be an EpTPSI like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas IV or V:
  • the diterpene containing a core of formula IV and V may have different stereochemistry.
  • the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by the EpTPSI like diTPS.
  • the EpTPSI like diTPS may be any enzyme capable of catalysing the reaction XVIII:
  • the EpTPSI like diTPS may be any enzyme capable of catalysing conversion of a diterpene pyrophosphate intermediate to a diterpene containing a core of either formula IV or V.
  • the EpTPSI like diTPS may in particular be an enzyme capable of catalysing the reaction XIX:
  • OPP is diphosphate.
  • the EpTPSI like diTPS may in particular be an enzyme capable of catalysing the reaction XX:
  • reaction XX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the EpTPSI like diTPS may be a TPS1 from Euphobia peplus.
  • TPS1 from Euphobia peplus may also be referred to as EpTPSI .
  • said EpTPSI like diTPS may be a polypeptide of SEQ ID NO:15 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said EpTPSI like diTPS may for example be any nucleic acid encoding any of the EpTPSI like diTPS described herein.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:15 or any of functional homologue thereof described in the previous paragraph.
  • the sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of EpTPSI is a polypeptide, which is also capable of catalysing at least one of reactions XVIII, XIX or XX described above. CfTPS14
  • the invention involves use of a diTPS of class I.
  • said diTPS of class I may be an CfTPS14 like diTPS.
  • said diTPS of class I may be an CfTPS14 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure.
  • said diTPS of class I may be an CfTPS14 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas IV or V:
  • the diterpene containing a core of formula IV and V may have different stereochemistry.
  • the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by the CfTPS14 like diTPS.
  • the CfTPS14 like diTPS may be any enzyme capable of catalysing the reaction XVIII:
  • the CfTPS14 like diTPS may be any enzyme capable of catalysing conversion of a diterpene pyrophosphate intermediate to a diterpene containing a core of either formula IV or V.
  • the CfTPS14 like diTPS may in particular be an enzyme capable of catalysing the reaction XIX:
  • reaction XIX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
  • the CfTPS14 like diTPS may in particular be an enzyme capable of catalysing the reaction XX:
  • the CfTPSI 4 like diTPS may be a diterpene synthase from Coleus forskohlii.
  • the CfTPSI 4 like diTPS may be a TPS14 from Coleus forskohlii.
  • TPS14 from Coleus forskohlii may also be referred to as CfTPSI 4.
  • said CfTPSI 4 like diTPS may be a polypeptide of SEQ ID NO:16 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said CfTPS14 like diTPS may for example be any nucleic acid encoding any of the CfTPS14 like diTPS described herein.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:16 or any of functional homologue thereof described in the previous paragraph.
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of CfTPS14 is a polypeptide, which is also capable of catalysing at least one of reactions XVIII, XIX or XX described above.
  • the methods of the invention use one of more CYPs.
  • the invention also features host organisms comprising one or more heterologous nucleic acid encoding a CYP.
  • Cytochrome P450 refers to an enzyme of the Cytochrome P450 family. Cytochrome P450s may also be referred to as "CYPs” and one cytochrome P450 may be referred to as a CYP.
  • CYPs according to the present invention are enzymes capable of catalyzing oxidation reactions using NAD(P)H as electron donor. CYPs may for example be enzymes capable of catalyzing one or more of the following reactions:
  • Preferred CYPs according to the present invention are hemoproteins capable of catalyzing oxidation reactions that utilize NADPH and/or NADH to reductively cleave atmospheric dioxygen to produce a functionalized organic substrate and a molecule of water.
  • a preferred CYP according to the invention is an enzyme capable of catalyzing one or more of the following reactions:
  • the methods of the invention may involve use of one or more CYPs.
  • at least one CYP is preferably and enzyme capable of catalyzing the reaction XXX:
  • Diterpene ⁇ hydroxyl-diterpene wherein the diterpene for example may be any of the diterpenes described in the section "Diterpenes" herein below.
  • at least one CYP is an enzyme capable of catalyzing hydroxylation of a diterpene containing a core structure of one of following formulas I, II, III, IV, V, VI, IX, X, XXII, XXIII or XXIV, for example any one of the diterpenes described in Table I.
  • the CYP preferably is an enzyme capable of catalysing reaction XXX.
  • the CYP may also be an enzyme capable of catalysing hydroxylation of a hydroxyl- diterpene to form dihydroxy-diterpene and/or diterpene ketone and/or diterpene aldehyde.
  • the CYP may be an enzyme capable of catalysing:
  • Said hydroxyl-diterpene may in particular be a hydroxyl-diterpene containing a core structure of one of following formulas 1, 11, 111, IV, V, VI, IX, X, XXII, XXIII or XXIV, for example it may be hydroxyl-diterpene, wherein the diterpene is any one of the diterpenes described in Table I.
  • the hydroxyl groups of said dihydroxy- diterpene are attached to different carbon atoms of the diterpene.
  • the ketone group of the diterpene ketone is positioned on the same carbon atom as the hydroxyl group of the hydroxyl-diterpene. Furthermore, it is preferred that said carbon atom is a carbon atom attached to exactly two hydrogen atoms in the precursor diterpene.
  • the aldehyde group of the diterpene aldehyde is positioned on the same carbon atom as the hydroxyl group of the hydroxyl-diterpene.
  • said carbon atom is a carbon atom attached to exactly 3 hydrogen atoms in the precursor diterpene.
  • the methods preferably involve use of a CYP, which is capable of catalysing reaction XXX and a CYP, which is capable of catalysing reaction XXXI.
  • Said CYP may be one CYP capable of catalysing both reaction or it may be two different CYPs.
  • the methods preferably involve use of one or more CYP(s), which can catalyse reactions XXX and XXXII.
  • the methods involve use of one or more CYP(s), which can catalyse reactions XXX and XXXIII.
  • the CYP may also be an enzyme capable of catalysing hydroxylation of a dihydroxy- diterpene to form trihydroxy-diterpene and/or diterpene carboxylic acid.
  • the CYP may be an enzyme capable of catalysing: Reaction XXXIV: Dihydroxy-diterpene trihydroxy-diterpene
  • dihydroxy-diterpene Hydroxy-diterpene carboxylic acid
  • Said dihydroxy-diterpene may in particular be a dihydroxy-diterpene containing a core structure of one of following formulas 1, 11, 111, IV, V, VI, IX, X, XXII, XXIII or XXIV, for example it may be dihydroxy-diterpene, wherein the diterpene is any one of the diterpenes described in Table I.
  • the hydroxyl groups of said dihydroxy- diterpene are attached to different carbon atoms of the diterpene.
  • the hydroxyl groups of the trihydroxy-diterpenene are attached to different carbon atoms.
  • the ketone group of the hydroxy-diterpene ketone is positioned on the same carbon atom as one of the hydroxyl group of the dihydroxy- diterpene.
  • said carbon atom is a carbon atom attached to exactly 2 hydrogen atoms in the precursor diterpene.
  • the carboylix acid group of the hydroxy-diterpene carboxylic acid is positioned on the same carbon atom as one of the hydroxyl group of the dihydroxy-diterpene.
  • said carbon atom is a carbon atom attached to exactly 3 hydrogen atoms in the precursor diterpene.
  • the methods may involve use of a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXI and of a CYP capable of catalysing reaction XXXIV.
  • a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXI and of a CYP capable of catalysing reaction XXXIV.
  • This may be different CYPs, or one CYP can be capable of catalysing two or all three of said reactions.
  • the methods may involve use of a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXI and of a CYP capable of catalysing reaction XXXV.
  • a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXI and of a CYP capable of catalysing reaction XXXV.
  • This may be different CYPs, or one CYP can be capable of catalysing two or all three of said reactions.
  • the CYP may also be an enzyme capable of catalysing hydroxylation of diterpene- ketone to form hydroxyl-diterpene ketone and/or diterpene carboxylic acid.
  • the CYP may be an enzyme capable of catalysing: Reaction XXXVI: diterpene ketone ⁇ hydroxyl-diterpen
  • Said diterpene ketone may in particular be a diterpene substituted with a ketone group, wherein the diterpene contains a core structure of one of following formulas I, II, III, IV, V, VI, IX, X, XXII, XXIII or XXIV, for example it may be any one of the diterpenes described in Table I. It is preferred that the ketone group and the hydroxyl group of the hydroxyl-diterpene ketone are attached to different carbon atoms.
  • the ketone group of the diterpene ketone is positioned on the same carbon atom as the carboxylic acid of the diterpene carboxylic acid.
  • said carbon atom is a carbon atom attached to exactly 3 hydrogen atoms in the precursor diterpene.
  • the methods may involve use of a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXII and of a CYP capable of catalysing reaction XXXVI.
  • a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXII and of a CYP capable of catalysing reaction XXXVI.
  • This may be different CYPs, or one CYP can be capable of catalysing two or all three of said reactions.
  • the methods may involve use of a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXII and of a CYP capable of catalysing reaction XXXVII.
  • a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXII and of a CYP capable of catalysing reaction XXXVII.
  • This may be different CYPs, or one CYP can be capable of catalysing two or all three of said reactions.
  • the CYP may also be an enzyme capable of catalysing hydroxylation of diterpene- carboxylic acid to form hydroxyl-diterpene carboxylic acid.
  • the CYP may be an enzyme capable of catalysing:
  • diterpene carboxylic acid hydroxyl-diterpene carboxylic acid Said diterpene carboxylic acid may in particular be a diterpene substituted with a carboxyl group, wherein the diterpene contains a core structure of one of following formulas 1, 11, 111, IV, V, VI, IX, X, XXII, XXIII or XXIV, for example it may be any one of the diterpenes described in Table I.
  • the carboxylic acid group will generally be attached to a different carbon atom as the hydroxyl group.
  • the methods may involve use of a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXI and of a CYP capable of catalysing reaction XXXV and of a CYP capable of catalysing reaction XXXVIII.
  • This may be different CYPs, or one CYP can be capable of catalysing two, three or even all of said reactions.
  • the methods may involve use of a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXII and of a CYP capable of catalysing reaction XXXVII and of a CYP capable of catalysing reaction XXXVIII.
  • This may be different CYPs, or one CYP can be capable of catalysing two, three or even all of said reactions.
  • the methods may also employ use of one or more CYPs capable of catalysing one or more of the following reactions: Trihydroxy-diterpene ⁇ tetrahydroxy-diterpene
  • CYPs are encoded by gene superfamily, which is divided into families sharing at least 40% sequence identity.
  • the families are divided into subfamilies sharing at least 55% sequence identity.
  • the CYP families is has a number, which generally is written after "CYP".
  • CYPs of family 74 are named CYP74.
  • the subfamilies are indicated by a capital letter after the family number.
  • CYP74A a CYP of family 74 and subfamily A is named CYP74A. Additional description of CYPs, the structural characteristics and the nomenclature thereof may for example be found in Schuler et al., Annu Rev. Plant Biol., 2003, 54:629-67 and in Podust et al., 2012, Nat. Prod. Rep., 29:1251 -1266. Thus, the CYP to be used with the present invention may be a CYP as described in any of these references.
  • the CYP is a polypeptide sharing at least some sequence similarity to at least one of SEQ ID NO:20, SEQ ID NO:21 , SEQ ID NO:22 or SEQ ID NO:23.
  • the CYP shares at least 30%, preferably at least 40%, more preferably at least 45% sequence identity with at least one of SEQ ID NO:20, SEQ ID NO:21 , SEQ ID NO:22, or SEQ ID NO:23.
  • CYP contains the following motif of five amino acids:
  • X may be any amino acid, such as any naturally occurring amino acids.
  • one of the X may be an amino acid with a charged side chain, and in particular an acidic side chain, such as glutamic acid (E).
  • A/G indicates that the amino acid may be A or G.
  • T/S indicates that the amino acid may be T or S.
  • X may be any amino acid, such as any naturally occurring amino acids.
  • Said motif of five amino acids may for example be A/G-G-H/S/G-E-T/S.
  • H/S/G indicates that the amino acid may be H or S or G.
  • the CYP contains said motif in a position corresponding to of position aa 290-294 of PsCYP720B4 SEQ ID NO:23.
  • a position corresponding to position aa 290-294 of PsCYP720B4 SEQ ID NO:23 is identified by aligning the sequence of a CYP of interest to SEQ ID NO:23 and optionally to additional sequences of CYPs as e.g. shown in figure 6, and identifying the amino acids of said CYP aligned with aa 290-294of PsCYP720B4 SEQ ID NO:23. This motif is shown as "1 " in figure 6.
  • CYP contains the following motif 4 amino acids:
  • X may be any amino acid, such as any naturally occurring amino acids.
  • X may be an amino acid with an uncharged side chain, such as an hydrophobic side chain.
  • X may be any amino acid, such as any naturally occurring amino acids.
  • Said motif of five amino acids may for example be
  • T/V/M/A indicates that the amino acid may be T or V or M or A.
  • the CYP contains said motif in a position corresponding to of position aa 350-353 of PsCYP720B4 SEQ ID NO:23.
  • a position corresponding to position aa 350-353 of PsCYP720B4 SEQ ID NO:23 is identified by aligning the sequence of a CYP of interest to SEQ ID NO:23 and optionally to additional sequences of CYPs as e.g. shown in figure 6, and identifying the amino acids of said CYP aligned with aa 350-353 of PsCYP720B4 SEQ ID NO:23.
  • This motif is shown as "2" in figure 6.
  • the CYP contains the following motif following motif of 10 amino acids:
  • X may be any amino acid, such as any naturally occurring amino acids.
  • said motif of 10 amino acids is:
  • X may be any amino acid, such as any naturally occurring amino acids.
  • G/A indicates that the amino acid may be G or A and A/R indicates that the amino acid may be A or R.
  • the CYP contains said motif in a position corresponding to of position aa 422-431 of PsCYP720B4 SEQ ID NO:23.
  • a position corresponding to position aa 422-431 of PsCYP720B4 SEQ ID NO:23 is identified by aligning the sequence of a CYP of interest to SEQ ID NO:23 and optionally to additional sequences of CYPs as e.g. shown in figure 6, and identifying the amino acids of said CYP aligned with aa 422-431 of PsCYP720B4 SEQ ID NO:23. This motif is shown as "3" in figure 6.
  • the CYP contains the following motif following motif of 3 amino acids: P-F-G
  • the CYP contains said motif in a position corresponding to of position aa 421 -423 of PsCYP720B4 SEQ ID NO:23.
  • a position corresponding to position aa 421 -423of PsCYP720B4 SEQ ID NO:23 is identified by aligning the sequence of a CYP of interest to SEQ ID NO:23 and optionally to additional sequences of CYPs as e.g. shown in figure 6, and identifying the amino acids of said CYP aligned with aa 421 -423of PsCYP720B4 SEQ ID NO:23.
  • This motif may be overlapping with the motif of 10 amino acids described above. The motif is shown as "4" in figure 6.
  • the CYP may contain one or more of the following motifs: SRS1 : P/V-X-X-X-K-(X) 7 -9-F
  • the CYP contains SRS1 in a position corresponding to of position aa 100-1 14 of PsCYP720B4 SEQ ID NO:23.
  • the motif is shown as "SRS1 " in figure 6.
  • SRS5 N/P-F/P-G/L-P-X-V/L-X-X-R
  • the CYP contains SRS5 in a position corresponding to of position aa 356-360 of PsCYP720B4 SEQ ID NO:23.
  • the motif is shown as "SRS5" in figure 6.
  • the CYP may also contain one or more of the motifs SRS2, SRS3 and/or SRS6 shown in figure 6.
  • the CYP720B4 contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids shown in a black box in figure 6. It is also preferred that in addition to sharing above mentioned sequence identity and/or containing one or more said motifs, then as many as possible of the amino acids shown in a grey box in figure 6 are retained.
  • the CYP when aligned to the sequence of of PsCYP720B4 (SEQ ID NO:23), then preferably the CYP also contains at least 80%, more preferably at least 90%, for example at least 95%, such as at least 98% of the amino acids shown in a grey box in figure 6.
  • the CYP may be one of the following CYPs:
  • a CYP of family 76 such as any of the CYPs described herein below in the section "CYP76"
  • a CYP of family 71 such as any of the CYPs described herein below in the section “CYP71 " iii.
  • a CYP of family 720 such as any of the CYPs described herein below in the section "CYP720"
  • At least one CYP may be a CYP of family 76.
  • CYP of family 76 may preferably be capable of catalysing reaction XXX outlined above.
  • a CYP of family 76 may be an enzyme capable of catalysing one or more of the following reactions (preferably said CYP of family 76 is capable of catalysing all of the following reactions):
  • reaction XL Rehydroxylation of miltiradiene to form hydroxyl mitradiene iii.
  • Reaction XLI hydroxylation of manool to form hydroxyl manool
  • the CYP of family 76 may also be caoable of catalysing one or more of the following reactions:
  • the CYP of family 76 may be a CYP from Coleus forskohlii.
  • said CYP of family 76 may be a polypeptide sharing at least 40%, such as at least 45%, for example at least 60%, such as at least 55% sequence identity with SEQ ID NO:20 and/or SEQ ID NO:21 .
  • the CYP of family 76 may be a CYP of subfamily AH.
  • said CYP may be a CYP76AH.
  • Said CYP of family 76 and subfamily AH may be a polypeptide sharing at least 55%, such as at least 60%, for example at least 65%, such as at least 70% sequence identity with SEQ ID NO:20 and/or SEQ ID NO:21 .
  • the CYP of family 76 is CYP76AH8 of SEQ ID NO:20, CYP76AH1 1 of SEQ ID NO:21 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the CYP of family 76 is CYP76AH15 of SEQ ID NO:40, CYP76AH17 of SEQ ID NO:41 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • CYP76AH15 of SEQ ID NO:40 and CYP76AH17 of SEQ ID NO:41 have an activity similar to CYP76AH8 of SEQ ID NO:20, and thus CYP76AH15 of SEQ ID NO:40 or CYP76AH17 of SEQ ID NO:41 or functional homologues thereof may be employed in place of CYP76AH8 of SEQ ID NO:20 or functional homologues thereof.
  • the heterologous nucleic acid encoding said CYP of family 76 may for example be any nucleic acid encoding any of the CYPs of family 76 described herein.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:20, or the polypeptide of SEQ ID NO:21 or any of the functional homologue of the aforementioned described in the previous paragraph.
  • said heterologous nucleic acid may comprise or consist of SEQ ID NO:35.
  • Said heterologous nucleic acid may also comprise or consist of SEQ ID NO:36. This may in particular be the case in embodiments of the invention wherein the host organism is yeast, e.g. S. cerevisiae.
  • the CYP of family 76 is CYP76AH9 of SEQ ID NO:24 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:24 or any of functional homologue thereof described in the previous paragraph.
  • the CYP of family 76 is CYP76AH1 , such as CYP76AH1 of SEQ ID NO:42 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • CYP76AH1 such as CYP76AH1 of SEQ ID NO:42 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:42 or any of functional homologue thereof described in the previous paragraph.
  • the heterologous nucleic acid may be the nucleic acid of sequence with the Genbank accession number KP337687 . ⁇ .
  • the CYP of family 76 is CYP76AH4, such as CYP76AH4 of SEQ ID NO:43 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • Said functional homologue may for example be the protein with Genbank accession number AJQ30187.1 or the protein with Genbank accession number AJQ30188.1 .
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:43 or any of functional homologue thereof described in the previous paragraph.
  • the heterologous nucleic acid may be the nucleic acid of sequence with the Genbank accession number KP091843.1 of the Genbank accession number KP091844.1 .
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • sequence identity A functional homologue of CYP76AH8 or CYP76AH1 1 or
  • CYP76AH9 is a polypeptide, which is also capable of catalysing at least one of reactions XXXIX, XL, XLI, XLV, XLVI, XLVII, LI described above. It is furthermore preferred that in addition to sharing above mentioned sequence identity, then as many as possible of the amino acids shown in a black box in figure 6 are retained. Thus, when aligned to the sequence of CYP76AH8 (SEQ ID NO:20) and/or CYP76AH1 1 (SEQ ID NO:21 ), then preferably the CYP of family 76 also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids shown in a black box in figure 6.
  • the CYP of family 76 when aligned to the sequence of CYP76AH8 (SEQ ID NO:20) and/or CYP76AH1 1 (SEQ ID NO:21 ), then preferably the CYP of family 76 also contains at least 80%, more preferably at least 90%, for example at least 95%, such as at least 98% of the amino acids shown in a grey box in figure 6.
  • At least one CYP may be a CYP of family 71 .
  • a CYP of family 71 may preferably be capable of catalysing reaction XXX outlined above.
  • a CYP of family 71 may be an enzyme capable of catalysing one or more of the following reactions (preferably said CYP of family 71 is capable of catalysing all of the following reactions):
  • Reaction XXXIX Oxidation of dehydroabietadiene and/or mitradiene ii.
  • Reaction XLII hydroxylation of syn-manool to form hydroxy syn-manool
  • Reaction LI hydroxylation of hydroxy manool to form dihydroxymanool
  • the CYP of family 71 may be a CYP from Coleus forskohlii.
  • said CYP of family 71 may be a polypeptide sharing at least 40%, such as at least 45%, for example at least 60%, such as at least 55% sequence identity with SEQ ID NO:22.
  • the CYP of family 71 may be a CYP of subfamily D.
  • said CYP may be a CYP71 D.
  • Said CYP of family 71 and subfamily D may be a polypeptide sharing at least 55%, such as at least 60%, for example at least 65%, such as at least 70% sequence identity with SEQ ID NO:22.
  • the CYP of family 71 is CYP71 D381 of SEQ ID NO:22 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said CYP of family 71 may for example be any nucleic acid encoding any of the CYPs of family 71 described herein.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:22 or any of functional homologue thereof described in the previous paragraph.
  • said heterologous nucleic acid may comprise or consist of SEQ ID NO:34.
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of CYP71 D381 is a polypeptide, which is also capable of catalysing at least one of reactions XXXIX, XL, XLII or LI described above.
  • the CYP of family 71 when aligned to the sequence of CYP71 D381 (SEQ ID NO:22), then preferably the CYP of family 71 also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids shown in a black box in figure 6. Furthermore, when aligned to the sequence of CYP71 D381 (SEQ ID NO:22), then preferably the CYP of family 71 also contains at least 80%, more preferably at least 90%, for example at least 95%, such as at least 98% of the amino acids shown in a grey box in figure 6.
  • At least one CYP may be a CYP of family 720.
  • a CYP of family 720 may preferably be capable of catalysing reaction XXX outlined above.
  • a CYP of family 720 may be an enzyme capable of catalysing the following reaction:
  • the CYP of family 720 may also be an enzyme capable of catalysing one of both of the following reactions: Reaction XLIII: dehydroabietadiene -> dehydroabietic acid
  • Reaction XLIX addition of keto group to syn-pimara-9,(1 1 ),15-diene OR syn-isoprimara-7,15-diene OR debydroabietadiene to form keto syn-pimara-9,(1 1 ),15-diene OR keto syn-isoprimara- 7,15-diene OR keto debydroabietadiene, respectively
  • Reaction L carboxylation of syn-pimara-9,(1 1 ),15-diene OR syn- isoprimara-7,15-diene OR debydroabietadiene to form carboxy syn-pimara-9,(1 1 ),15-diene OR carboxy syn- isoprimara-7,15-diene OR carboxy debydroabietadiene, respectively
  • Reaction LI hydroxylation of hydroxy manool to form dihydroxymanool Reaction Lll: miltiradiene OR abietatriene -> dehydroabietic acid
  • the CYP of family 720 may be a CYP from Picea sitchensis.
  • said CYP of family 720 may be a polypeptide sharing at least 40%, such as at least 45%, for example at least 60%, such as at least 55% sequence identity with SEQ ID NO:23.
  • the CYP of family 720 may be a CYP of subfamily B.
  • said CYP may be a CYP720B.
  • Said CYP of family 720 and subfamily B may be a polypeptide sharing at least 55%, such as at least 60%, for example at least 65%, such as at least 70% sequence identity with SEQ ID NO:23.
  • the CYP of family 720 is CYP720B4 of SEQ ID NO:23 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
  • the heterologous nucleic acid encoding said CYP of family 720 may for example be any nucleic acid encoding any of the CYPs of family 720 described herein.
  • the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:23 or any of functional homologue thereof described in the previous paragraph.
  • said heterologous nucleic acid may comprise or consist of SEQ ID NO:39.
  • sequence identity is preferably calculated as described herein below in the section "Sequence identity”.
  • a functional homologue of CYP720B4 is a polypeptide, which is also capable of catalysing one, two or all of reactions XLI, XLIII, XLIV, XLVIII, XLIX, L, LI or LI I described above.
  • the CYP of family 720 when aligned to the sequence of CYP720B4 (SEQ ID NO:23), then preferably the CYP of family 720 also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids marked by a black box in figure 6. Furthermore, when aligned to the sequence of CYP720B4 (SEQ ID NO:23), then preferably the CYP of family 720 also contains at least 80%, more preferably at least 90%, for example at least 95%, such as at least 98% of the amino acids shown in a grey box in figure 6.
  • the host organisms according to the present invention may also be recombinantly modified in addition to comprising the heterologous nucleic acids encoding a diTPS of class I and a diTPS of class II and one or more CYPs as described herein.
  • the host organism may be modified to increase the pool of GGPP.
  • GGPP is the starting compound for production of diterpenes.
  • the host organism will be capable of producing increased amounts of diterpene.
  • GGPP Various methods for increasing the pool of GGPP are well known in the art. These includes methods of reducing the activity of enzymes reducing the level of GGPP.
  • the pool of GGPP is increased by expression of one or more enzymes involved in synthesis of GGPP.
  • the host organism comprises a heterologous nucleic acid encoding GGPP synthase (GGPPS).
  • GGPPS may be any GGPPS, e.g. BTS1 of S. cerevisiae.
  • the GGPPS may be the GGPPS described by Zhou, Y. J., W. Gao, Q. Rong, G. Jin, H. Chu, W. Liu, W. Yang, Z. Zhu, G. Li, G. Zhu, L. Huang and Z. K. Zhao (2012). "Modular Pathway Engineering of Diterpenoid Synthases and the Mevalonic Acid Pathway for Miltiradiene Production.” Journal of the American Chemical Society 134(6): 3234-3241 .
  • the host organism may express a fusion of SmCPS and SmKSL, and/or a fusion of BTS1 (GGPP synthase) and ERG20 (fa nesyl diphosphate synthase) as described in Zhou et al., 2012.
  • the host organism may also comprise a heterologous nucleic acid encoding a GGPPS from a plant, e.g. from Coleus forskohlii.
  • the host organism comprises:
  • CfGGPPs geranylgeranylpyrophosphate synthase
  • SEQ ID NO:26 or a functional homologue of any of the aforementioned sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
  • the host organism may also comprise a heterologous nucleic acid encoding Coleus fors/co ' / ' geranylgeranylpyrophosphate synthase (CfGGPPs) of SEQ ID NO:26 or a functional homologue of any of the aforementioned sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
  • CfGGPPs heterologous nucleic acid encoding Coleus fors/co ' / ' geranylgeranylpyrophosphate synthase
  • the host organism may also comprise a a heterologous nucleic acid encoding GGPP7 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
  • the sequence of GGPPS7 of Synechococcus sp. is available under Gene ID:86553638.
  • the host organism may also comprise a heterologous nucleic acid encoding a cytochrome P450 oxidoreductases (POR).
  • PORs are known in the art. Said POR may for example be ATR1 from A.
  • the sequence of ATR1 from A. thaliana is available under Gene ID:3150037.
  • Said POR may for example be POR from Stevia rebaudiana or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
  • the sequence of POR from S. rebaudiana is available under Gene ID: 9321 1213.
  • Said POR may also be a CPR, such as CPR from Coleus forskohlii.
  • the host organism may in addition to the heterologous nucleic acids encoding diTPS and CYPs also contain a heterologous nucleic acic encoding CfCPR of SEQ ID NO:44 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
  • Said heterologous nucleic acid may be any heterologous nucleic acid encoding CPR, for example the nucleic acid may comprise or consist of SEQ ID NO:45. Sequence identity
  • a high level of sequence identity indicates likelihood that the first sequence is derived from the second sequence.
  • Amino acid sequence identity requires identical amino acid sequences between two aligned sequences.
  • a candidate sequence sharing 80% amino acid identity with a reference sequence requires that, following alignment, 80% of the amino acids in the candidate sequence are identical to the corresponding amino acids in the reference sequence.
  • Identity according to the present invention is determined by aid of computer analysis, such as, without limitations, the ClustalW computer alignment program (Higgins D., Thompson J., Gibson T., Thompson J.D., Higgins D.G., Gibson T.J., 1994.
  • CLUSTAL W improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.
  • the ClustalW software is available from as a ClustalW WWW Service at the European Bioinformatics Institute h ttp :// www. eb i . a.c . u k/ci u stal w o r via the software BioEdit.
  • This program with its default settings, the mature (bioactive) part of a query and a reference polypeptide are aligned. The number of fully conserved residues are counted and divided by the length of the reference polypeptide. Thus if nothing else is indicated a sequence identity is given over the entire length of the reference polypeptide.
  • sequence identity is calculated over the entire length of the reference polypeptide.
  • the ClustalW algorithm may similarly be used to align nucleotide sequences. Sequence identities may be calculated in a similar way as indicated for amino acid sequences
  • heterologous nucleic acid refers to a nucleic acid sequence, which has been introduced into the host organism, wherein said host does not endogenously comprise said nucleic acid.
  • said heterologous nucleic acid may be introduced into the host organism by recombinant methods.
  • the genome of the host organism has been augmented by at least one incorporated heterologous nucleic acid sequence.
  • Suitable host organisms include microorganisms, plant cells, and plants, and may for example be any of the host organisms described herein below in the section "Host organism”.
  • heterologous nucleic acid encoding a polypeptide is operably linked in sense orientation to one or more regulatory regions suitable for expressing the polypeptide. Because many microorganisms are capable of expressing multiple gene products from a polycistronic mPiNA, multiple polypeptides can be expressed under the control of a single regulatory region for those microorganisms, if desired.
  • a coding sequence and a regulatory region are considered to be operably linked when the regulatory region and coding sequence are positioned so that the regulatory region is effective for regulating transcription or translation of the sequence.
  • the translation initiation site of the translational reading frame of the coding sequence is positioned between one and about fifty nucleotides downstream of the regulatory region for a monocistronic gene.
  • regulatory region refers to a nucleic acid having nucleotide sequences that influence transcription or translation initiation and rate, and stability and/or mobility of a transcription or translation product. Regulatory regions include, without limitation, promoter sequences, enhancer sequences, response elements, protein recognition sites, inducible elements, protein binding sequences, 5 ' and 3 ' untranslated regions (UTRs), transcriptional start sites, termination sequences, polyadenylation sequences, introns, and combinations thereof.
  • a regulatory region typically comprises at least a core (basal) promoter.
  • a regulatory region also may include at least one control element, such as an enhancer sequence, an upstream element or an upstream activation region (UAR).
  • a regulatory region is operably linked to a coding sequence by positioning the regulatory region and the coding sequence so that the regulatory region is effective for regulating transcription or translation of the sequence.
  • the translation initiation site of the translational reading frame of the coding sequence is typically positioned between one and about fifty nucleotides downstream of the promoter.
  • a regulatory region can, however, be positioned at further distance, for example as much as about 5,000 nucleotides upstream of the translation initiation site, or about 2,000 nucleotides upstream of the transcription start site. The choice of regulatory regions to be included depends upon several factors, including the type of host organism.
  • regulatory regions it is a routine matter for one of skill in the art to modulate the expression of a coding sequence by appropriately selecting and positioning regulatory regions relative to the coding sequence. It will be understood that more than one regulatory region may be present, e.g., introns, enhancers, upstream activation regions, transcription terminators, and inducible elements.
  • nucleic acids can encode a particular polypeptide; i.e., for many amino acids, there is more than one nucleotide triplet that serves as the codon for the amino acid.
  • codons in the coding sequence for a given polypeptide can be modified such that optimal expression in a particular host organisms obtained, using appropriate codon bias tables for that host (e.g., microorganism).
  • Nucleic acids may also be optimized to a GC-content preferable to a particular host, and/or to reduce the number of repeat sequences. Such nucleic acids may be referred to as "codon optimized".
  • these modified sequences can exist as purified molecules and can be incorporated into a vector or a virus for use in constructing modules for recombinant nucleic acid constructs.
  • Software for preparing nucleic acid sequences codon optimized for expression in a given host organism is commercial available. For example codon optimization may be performed using the Geneart service from
  • a compound containing or comprising a " decalin core” as used herein refers to a compound comprising above mentioned structure of formula VII, wherein each of the carbon atoms numbered 1 to 10 may be substituted with one or two substituents. It is possible that two of said substituents are fused to form a ring, and thus compound containing or comprising decalin may contain 3 or more rings.
  • the term "diterpene pyrophosphate intermediate” as used herein refers to a compound, which is the product of bicyclisation of GGPP in a reaction catalysed by a diTPS class II enzyme.
  • the diterpene pyrophosphate intermediate according to the invention contains a decalin core, and comprises a pyrophosphate group.
  • the diterpene pyrophosphate intermediate of the invention is a compound containing a decalin core, which is substituted at one of more positions with substituents selected from the group consisting of alkyl, alkenyl and hydroxyl, wherein one of said alkyl or alkenyl is substituted with O-pyrophosphate.
  • diphosphate and "pyrophosphate” are used interchangeably herein.
  • OPP organic radical
  • -OPP -OPP
  • PPO- polyphosphate
  • alkyl as used herein refers to a saturated, straight or branched hydrocarbon chain.
  • the hydrocarbon chain preferably contains of from one to eighteen carbon atoms (Ci-i 8 -alkyl), more preferred of from one to six carbon atoms (Ci_ 6 -alkyl), including methyl, ethyl, propyl, isopropyl, butyl, isobutyl, secondary butyl, tertiary butyl, pentyl, isopentyl, neopentyl, tertiary pentyl, hexyl and isohexyl.
  • alkenyl refers to a saturated, straight or branched
  • Alkenyl may preferably be any of the alkyls described above containing one or more double bonds.
  • the diterpene pyrophosphate intermediate of the invention is a compound containing a decalin core, wherein said decalin is
  • alkyl i. substituted at the 4 position with one or two alkyl, such as with two alkyl, wherein said alkyl for example may be C 1-3 , alkyl, for example said alkyl may be methyl;
  • alkenyl-O-PP substituted at the 9 position with alkenyl-O-PP, wherein said alkenyl for example may be branched C4-8-alkenyl, such as branched C5-7-alkenyl, for example branched C6-alkenyl; and
  • alkyl for example may be Ci- 3 , alkyl, for example said alkyl may be methyl.
  • the substituent at the 9 position may be alkenyl of formula VIII:
  • said diterpene pyrophosphate intermediate may contain a decalin core substituted as indicated above, wherein the substitutions at the 9 and 10 positions are (9R, 10R), (9S.10S), (9S, 10R) or (9R, 10S), for example the substitutions at the 9 and 10 positions are (9R, 10R), (9S.10S) or (9S, 10R).
  • the diterpene pyrophosphate intermediate may be any of the diterpene pyrophosphate intermediates shown in figure 3, i.e. the diterpene pyrophosphate intermediate may be selected from the group consisting of (9R,10R)- copalyl diphosphate, (9S,10S)-copalyl diphosphate, labda-13-en-8-ol diphosphate and (9S, 10R)-copalyl diphosphate.
  • the term "diterpene” as used herein refers to a compound derived or prepared from four isoprene units.
  • a diterpene according to the invention is a C 20 - molecule consisting of 20 carbon atoms, up to three oxygen atoms and hydrogen atoms.
  • the term "precursor diterpene” as used herein refers to a diterpene, which may serve as the precursor for formation of a diterpenoid.
  • a precursor diterpene may be a diterpene, which after one or more oxidation reactions becomes a diterpenoid.
  • the precursor diterpene may for example be any of the diterpenes described herein in this section.
  • the diterpene typically contains one or more ring structures, such as one or more monocyclic, bicyclic, tricyclic or tetracyclic ring structure(s).
  • the diterpene may contain one or more double bonds.
  • a diterpene according to the invention contains at least one double bond and often they contain in the range of 1 to 3 double bonds.
  • the diterpene may comprise up to three oxygen atom, although it is also possible that the diterpene contains no oxygen and consists solely of carbon and hydrogen atoms.
  • the oxygen atom are generally present in the form of hydroxyl groups, or part of a ring structure.
  • the methods of the invention can be used to produce any diterpenoid by selecting an appropriate combination of diTPS of class II and diTPS of class I and CYP.
  • a suitable precursor diterpene may be produced by selecting an appropriate combination of diTPS of class II and diTPS of class I.
  • a diterpene according to the present invention is a C 20 - molecule containing a decalin core structure.
  • containing a core structure of formula or the term “containing a core of formula” refers to a molecule containing a structure of the indicated formula, wherein said structure may be substituted at one or more positions.
  • substituted as used herein in relation to organic compounds refer to one hydrogen being substituted with another group or atom.
  • Said decalin may be substituted at one or more positions, and it is also contained within the invention that two substituents are fused, thus leading to a tricyclic or higher cyclic structure.
  • the diterpene may be a C 20 -molecule containing a core structure of one of following formulas XI, XII, XIII, XIV, XV, XVI, XVII, XVIII or XIX:
  • the diterpene containing a core structure of any of formulas XI, XII, XIII, XIV, XV, XVI, XVII, XVIII or XIX may be a C 20 -molecule consisting of the formulas XI, XII, XIII, XIV, XV, XVI, XVII, XVIII or XIX substituted at one or more positions.
  • said diterpene may be a C 20 -molecule substituted at the position marked by * with one or two alkyl, such as one or two d-3-alkyl, such as with one or two methyl groups.
  • said diterpene may be substituted at the position marked by ** with one or two groups individually selected from alkyi and alkenyl.
  • Said alkyi may for example be Ci -6 - alkyl, such as d-3-alkyl, for example isopropyl or methyl.
  • Said alkenyl may me Ci -6 alkenyl, such as C 2 - 4 -alkenyl, such as C 2 -3-alkenyl.
  • the diterpene to be produced may be a C 20 - molecule containing a core structure of one of following formulas I, II, III, IV, V, VI, IX,
  • the diterpene containing a core structure of any of formulas I, II, III, IV, V, VI, IX, X, XXII, XXIII or XXIV may be a C 20 -molecule consisting of the formulas I, II, III, IV, V, VI, IX, X, XXII, XXIII or XXIV substituted at one or more positions, for example by one or more groups selected from the group consisting of:
  • alkyi such as C ⁇ -alkyl, for example C 1-3 , wherein said alkyi may be linear or branched, for example alkyi may be isopropyl or methyl
  • alkenyl such as Ci -6 alkenyl, such as C 2 -4-alkenyl, such as C 2 - 3-alkenyl
  • diterpene containing a core structure of any of formulas formulas I, II, III, IV, V, VI, IX or X may be a C 20 -molecule substituted
  • alkyl such as one or two C 1-3 -alkyl, such as with one or two methyl groups, for example with two methyl;
  • alkyl may for example be C 1-6 -alkyl, such as C 1-3 -alkyl, for example isopropyl or methyl.
  • alkenyl may me C 1-6 alkenyl, such as C 2 . 4 - alkenyl, such as C 2 . 3 -alkenyl; and/or
  • the diterpene according to the invention may also be a C 20 -molecule consisting of 20 carbon atoms, up to three oxygen atoms and hydrogen atoms, and which contains a core structure of any of formulas I, II, III, IV, VI, X, XII, XLII, XLIII, XLIV, XLV, XLVI, XXVII, XXVIII, XXIX, XX, XXI, XXII, XXIII, XXIV, XXXV, XXXVI, XXXVIII, XXXIX, XL and/or XLI.
  • the diterpene is a C 20 -molecule containing a core of formula XXXIII: Said diterpene may in particular contain a core of formula
  • the diterpene is a C 2 o-molecule containing a core of any of formulas II, XXXV, XXXVI and/or XXXVII:
  • the position marked by asterisk may be substituted with one or two substituents selected from the group consisting of C ⁇ -alkyl and C 1-2 -alkenyl, preferably the position marked by asterisk may be substituted with one methyl group and ethenyl group.
  • said diterpene may be a C 20 -molecule of the formula XX:
  • Ri is a C 5 -alkenyl substituted with methyl and/or hydroxyl.
  • Ri is C 5 - alkenyl containing one or two double bonds.
  • alkenyl containing one double bond said alkenyl is preferably substituted with hydroxyl and methyl.
  • alkenyl containing two double bonds said alkenyl is preferably substituted with methyl.
  • said diterpene may be a C 20 -molecule of the formula XXI:
  • X 2 is either -H or -OH, wherein one and only one of and X 2 is -OH.
  • R 2 is C 5 -alkenyl containing one or two double bonds.
  • R 2 is alkenyl containing one double bond
  • R 2 is alkenyl containing two double bonds
  • said alkenyl is preferably substituted with methyl.
  • diterpene is the product of any of the reactions VII to XIX described herein above.
  • the diterpene may be any of the compounds 1 to 47 shown in figure 2 and/or Table 1 .
  • the diterpene is not 13R-manoyl oxide.
  • Table 1 shows preferred diterpenes, which may be precursor diterpenes according to the present invention.
  • the left hand column provides a number of the diterpene used herein, the middle column provides the retention index (RI) and the right hand column the structure.
  • the retention index (RI) is determined by GC-MS and may also be referred to as Kovat's retention index.
  • the term "diterpenoid” refers to a diterpene, which has been functionalised by addition of one or more functional groups.
  • Said diterpene may be any of the diterpenes described herein above in the section "Diterpene”.
  • the diterpenoid is a C 20 -molecule containing a decalin core and one or more
  • the diterpenoid is a C 2 o-molecule containing or XXIV:
  • the diterpenoid may be a C 20 -molecule containing a cores structure of formulas 1, 11, 111, IV, V, VI, IX, X, XXII, XXIII or XXIV substituted at one or more positions by one or more groups selected from the group consisting of:
  • the diterpenoid according to the invention may also be a C 20 -molecule, which contains a core structure of any of formulas I, I I, I II, IV, VI, X, XII, XLII, XLII I, XLIV, XLV, XLVI,
  • the diterpenoid according to the invention may also be a C 20 -molecule which contains a core structure of any of formulas I, I I, IV, VI, X, XI I, XLI I, XLI I I, XLIV, XLVI, XXVI I,
  • the diterpenoid is selected from the group consisting of hydroxy dehydroabietadiene, hydroxy miltiradiene and hydroxy syn- manool.
  • the diterpenoid is carboxyl miltradiene, for example miltiradienic acid. In one embodiment of the invention the diterpenoid is selected from the group consisting of monohydroxy-manool, dihydroxymanool and keto-hydroxy-manool.
  • the diterpenoid is selected from the group consisting of syn-pimara-9,(1 1 ),15-diene oxidized to contain either one keto- and one hydroxyl group or a single carboxylic acid group.
  • the diterpenoid is ferruginol.
  • the diterpenoid intermediate may be any of the diterpenoids described herein in this section. In one embodiment, the diterpenoid is not forskolin.
  • the host organism to be used with the methods of the invention may be any suitable host organism containing
  • a heterologous nucleic acid encoding a diTPS of class II which may be any of diTPS of class II described herein in any of the sections "diTPS of class II", “syn-CPP type diTPS”, “ent-CPP type diTPS”, “(+)-CPP type diTPS”, “LPP type diTPS”, and “LPP like type diTPS”; and/or
  • a heterologous nucleic acid encoding a diTPS of class I which may be any of diTPS of class I described herein in any of the sections "diTPS of class I", "EpTPS8",
  • a heterologous nucleic acid encoding a CYP which may be any of CYPs described herein in any of the sections "CYP”, “CYP76”, “CYP71 “ and “CYP720".
  • Suitable host organisms include microorganisms, plant cells, and plants.
  • the microorganism can be any microorganism suitable for expression of heterologous nucleic acids.
  • the host organism of the invention is a eukaryotic cell. In another embodiment the host organism is a prokaryotic cell.
  • the host organism is a fungal cell such as a yeast or filamentous fungus.
  • the host organism may be a yeast cell.
  • the yeast cell is selected from the group consisting of
  • Saccharomyces cerevisiae Schizosaccharomyces pombe, Yarrowia lipolytica, Candida glabrata, Ashbya gossypii, Cyberlindnera jadinii, and Candida albicans.
  • yeasts and fungi are excellent microorganism to be used with the present invention. They offer a desired ease of genetic manipulation and rapid growth to high cell densities on inexpensive media. For instance yeasts grow on a wide range of carbon sources and are not restricted to glucose.
  • the microorganism to be used with the present invention may be selected from the group of yeasts described below:
  • Arxula adeninivorans is a dimorphic yeast (it grows as a budding yeast like the baker's yeast up to a temperature of 42 °C, above this threshold it grows in a filamentous form) with unusual biochemical characteristics. It can grow on a wide range of substrates and can assimilate nitrate. It has successfully been applied to the generation of strains that can produce natural plastics or the development of a biosensor for estrogens in environmental samples.
  • Candida boidinii is a methylotrophic yeast (it can grow on methanol).
  • Hansenula polymorpha is another methylotrophic yeast (see Candida boidinii). It can furthermore grow on a wide range of other substrates; it is thermo- tolerant and can assimilate nitrate (see also Kluyveromyces lactis). It has been applied to the production of hepatitis B vaccines, insulin and interferon alpha-2a for the treatment of hepatitis C, furthermore to a range of technical enzymes.
  • Kluyveromyces lactis is a yeast regularly applied to the production of kefir. It can grow on several sugars, most importantly on lactose which is present in milk and whey. It has successfully been applied among others to the production of chymosin (an enzyme that is usually present in the stomach of calves) for the production of cheese.
  • Pichia pastoris is a methylotrophic yeast (see Candida boidinii and Hansenula polymorpha). It provides an efficient platform for the production of foreign proteins. Platform elements are available as a kit and it is worldwide used in academia for the production of proteins. Strains have been engineered that can produce complex human N-glycan (yeast glycans are similar but not identical to those found in humans).
  • Saccharomyces cerevisiae is the traditional baker's yeast known for its use in brewing and baking and for the production of alcohol. As protein factory it has successfully been applied to the production of technical enzymes and of pharmaceuticals like insulin and hepatitis B vaccines. Also it has been useful for production of terpenoids.
  • Yarrowia lipolytica is a dimorphic yeast (see Arxula adeninivorans) that can grow on a wide range of substrates. It has a high potential for industrial applications.
  • the host organism is a microalgae such as Chlorella and Prototheca.
  • the host organism is a filamentous fungus, for example Aspergillus.
  • the host organism is a plant cell.
  • the host organism may be a cell of a higher plant, but the host organism may also be cells from organisms not belonging to higher plants for example cells from the moss Physcomitrella patens.
  • the host organism is a mammalian cell, such as a human, feline, porcine, simian, canine, murine, rat, mouse or rabbit cell.
  • the host organism can also be a prokaryotic cell such as a bacterial cell. If the host organism is a prokaryotic cell the cell may be selected from, but not limited to E. coli, Corynebacterium, Bacillus, Pseudomonas and Streptomyces cells.
  • the host organism may also be a plant.
  • a plant or plant cell can be transformed by having a heterologous nucleic acid integrated into its genome, i.e., it can be stably transformed.
  • Stably transformed cells typically retain the introduced nucleic acid with each cell division.
  • a plant or plant cell can also be transiently transformed such that the recombinant gene is not integrated into its genome.
  • Transiently transformed cells typically lose all or some portion of the introduced nucleic acid with each cell division such that the introduced nucleic acid cannot be detected in daughter cells after a certain number of cell divisions. Both transiently transformed and stably transformed transgenic plants and plant cells can be useful in the methods described herein.
  • Plant cells comprising a heterologous nucleic acid used in methods described herein can constitute part or all of a whole plant. Such plants can be grown in a manner suitable for the species under consideration, either in a growth chamber, a greenhouse, or in a field. Plants may also be progeny of an initial plant comprising a heterologous nucleic acid provided the progeny inherits the heterologous nucleic acid. Seeds produced by a transgenic plant can be grown and then selfed (or outcrossed and selfed) to obtain seeds homozygous for the nucleic acid construct.
  • the plants to be used with the invention can be grown in suspension culture, or tissue or organ culture.
  • solid and/or liquid tissue culture techniques can be used.
  • plant cells can be placed directly onto the medium or can be placed onto a filter that is then placed in contact with the medium.
  • transgenic plant cells can be placed onto a flotation device, e.g., a porous membrane that contacts the liquid medium.
  • a reporter sequence encoding a reporter polypeptide having a reporter activity can be included in the transformation procedure and an assay for reporter activity or expression can be performed at a suitable time after transformation.
  • a suitable time for conducting the assay typically is about 1 -21 days after transformation, e.g., about 1 -14 days, about 1 -7 days, or about 1 - 3 days.
  • the use of transient assays is particularly convenient for rapid analysis in different species, or to confirm expression of a heterologous polypeptide whose expression has not previously been confirmed in particular recipient cells.
  • nucleic acids into monocotyledonous and dicotyledonous plants are known in the art, and include, without limitation, Agrobacterium- mediated transformation, viral vector-mediated transformation, electroporation and particle gun transformation, U.S. Patent Nos 5,538,880; 5,204,253; 6,329,571 ; and 6,013,863. If a cell or cultured tissue is used as the recipient tissue for transformation, plants can be regenerated from transformed cultures if desired, by techniques known to those skilled in the art.
  • the plant comprising a heterologous nucleic acid to be used with the present invention may for example be selected from: corn (Zea. mays), canola (Brassica napus, Brassica rapa ssp.), alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cerale), sorghum (Sorghum bicolor, Sorghum vulgare), sunflower (Helianthus annuas), wheat (Tritium aestivum and other species), Triticale, Rye (Secale) soybean (Glycine max), tobacco (Nicotiana tabacum or Nicothiana Benthamiana), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium hirsutum), sweet potato (Impomoea batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nucifera
  • plants of the present invention are crop plants (for example, cereals and pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea, sugar beets, sugar cane, soybean, oilseed rape, sunflower and other root, tuber or seed crops.
  • crop plants for example, cereals and pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea, sugar beets, sugar cane, soybean, oilseed rape, sunflower and other root, tuber or seed crops.
  • Other important plants maybe fruit trees, crop trees, forest trees or plants grown for their use as spices or pharmaceutical products (Mentha spp, clove,
  • Horticultural plants which may be used with the present invention may include lettuce, endive, and vegetable brassicas including cabbage, broccoli, and cauliflower, carrots, and carnations and geraniums.
  • the plant may also be selected from the group consisting of tobacco, cucurbits, carrot, strawberry, sunflower, tomato, pepper and Chrysanthemum.
  • the plant may also be a grain plants for example oil-seed plants or leguminous plants.
  • Seeds of interest include grain seeds, such as corn, wheat, barley, sorghum, rye, etc.
  • Oil-seed plants include cotton soybean, saff lower, sunflower, Brassica, maize, alfalfa, palm, coconut, etc.
  • Leguminous plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mung bean, lima bean, fava bean, lentils, chickpea.
  • said plant is selected from the following group: maize, rice, wheat, sugar beet, sugar cane, tobacco, oil seed rape, potato and soybean.
  • the plant may for example be rice.
  • one plant which may be used with the present invention is an Arabidopsis and in particular an Arabidopsis thaliana.
  • the diterpenoid may be isolated from said plant by conventional methods.
  • the diterpenoid may be isolated from the entire plant or from parts thereof. Said parts may for example be selected from the group consisting of leaves, fruits, flowers, stems, seeds and roots of plants. Intermediates may also be isolated from said plants or parts thereof.
  • the host organism does not naturally produce the diterpenoid to be produced by the methods of the invention.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI, IX, XXXVIII, XXXIX or XL, for example for production of oxidised dehydroabietadien and/or production of oxidised miltiradiene, e.g hydroxyl dehydroabietadiene or hydroxyl miltiradiene.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI, IX, XXXVIII, XXXIX or XL, for example for production of oxidised dehydroabietadiene and/or production of oxidised miltiradiene, e.g hydroxyl dehydroabietadiene or hydroxyl miltiradiene.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI, IX, XXXVIII, XXXIX or XL, for example for production of oxidised dehydroabietadiene and/or production of oxidised miltiradiene, e.g hydroxyl dehydroabietadiene or hydroxyl miltiradiene.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI, IX, XXXVIII, XXXIX or XL, for example for production of oxidised dehydroabietadiene and/or production of oxidised miltiradiene, e.g hydroxyl dehydroabietadiene or hydroxyl miltiradiene.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI, IX, XXXVIII, XXXIX or XL, for example for production of oxidised dehydroabietadien and/or production of oxidised miltiradiene, e.g hydroxyl dehydroabietadiene or hydroxyl miltiradiene.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas II, VI, XXXV, XXXVI, XXXVII or XXXVIII, and in particular for for production of diterpenoids having a core of formulas II, VI, XXXV, XXXVI, XXXVII or XXXVIII and comprising at least one keto group, hydroxyl group and/or carboxylic acid group.
  • said host organism may be useful for production of oxidised syn-pimara- 9,(1 1 ),15-diene, oxidised syn-isopimara-7,15-diene and/or oxidised dehydroabietadiene, e.g keto syn-pimara-9,(1 1 ),15-diene, keto syn-isopimara-7,15- diene, keto dehydroabietadiene, hydroxyl-syn-pimara-9,(1 1 ),15-diene, hydroxyl-syn- isopimara-7,15-diene, hydroxy-dehydroabietadiene, carboxyl-syn-pimara-9,(1 1 ),15- diene, carboxyl-syn-isopimara-7,15-diene and/or carboxyl-dehydroabietadiene.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXIX or XLVI, for example for production of oxidised manool, such as hydroxy manool or dihydroxy manool or keto-hydroxy-manool.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXIX or XLVI, for example for production of oxidised manool, such as hydroxy manool or dihydroxy manool or keto-hydroxy-manool.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXIX or XLVI, for example for production of oxidised manool, such as hydroxy manool or dihydroxy manool or keto-hydroxy-manool or a carboxylic acid of manool.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXVII or XLVI, for example for production of oxidised syn-manool, such as hydroxylated syn-manool.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXIX or XLVI, for example for production of oxidised manool, and in particular for production of highly oxidised manool. Thus, such host organism may be particularly useful for production of dihydroxy manool.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXIX or XLVI, for example for production of oxidised manool, and in particular for production of highly oxidised manool. Thus, such host organism may be particularly useful for production of dihydroxy manool.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXIX or XLVI, for example for production of oxidised manool, and in particular for production of highly oxidised manool.
  • such host organism may be particularly useful for production of dihydroxy manool or a dioxidized manool derivative with three additional degrees of desaturation, i.e. double bonds.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI or XII, for example for production of oxidised miltradiene, for example for production of ferruginol.
  • the host organism may comprise at least the following heterologous nucleic acids:
  • Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI or XII, for example for production of oxidised miltradiene, for example for production of ferruginol.
  • SEQ ID NO:1 Amino acid sequence of syn-CPP from Oryza sativa
  • SEQ ID NO:1 1 Amino acid sequence of SCS of Salvia scarea
  • SEQ ID NO: 19 Amino acid sequence of AtEKS of Arabidopsis thaliana
  • SEQ ID NO: 21 Amino acid sequence of CYP76AH1 1 of Coleus forskohlii
  • SEQ ID NO: 22 Amino acid sequence of CYP71 D381 of Coleus forskohlii
  • Example 1 Various full length cDNAs were prepared and expressed in a Nicotiana
  • Nicotiana benthamianai Agrobacterium system are expressed in a Nicotiana benthamianai Agrobacterium system.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Microbiology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Biophysics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Biomedical Technology (AREA)
  • Botany (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

The invention provide novel methods for production of terpenoids. The methods take advantage of combinations of enzymes, which are not present in nature. Thus the methods involves use of host organisms expressing a diTPS of class II, diTPS of class I, and one or more CYPs.

Description

Methods and materials for production of terpenoids Field of invention The present invention relates to the field of biosynthetic methods for producing diterpenoids.
Background of invention
Terpenes constitute a large and diverse class of organic compounds produced by a variety of plants as well as other species. Terpenes modified by oxidation or rearrangements are generally referred to as terpenoids.
Terpenes and terpenoids find multiple uses, for example as flavor compounds, additives for food, as fragrances and in medical treatment
Terpenes are derived biosynthetically from units of isoprene, which has the molecular formula C5H8. Diterpenes are composed of four isoprene units and in nature they are produced from geranylgeranyl pyrophosphate.
Summary of invention
In nature diterpenes are produced with the aid of specific pairs of diterpene synthases (diTPS) derived from two classes, class I and class II. Diterpenoids may be produced from diterpenes with the action of various enzymes, including CYP enzymes.
Most CYP enzymes have a narrow substrate specificity, which has been a major hurdle in broadening P450 applications in synthetic chemistry (Podust et al., 2012, Nat. Prod. Rep., 29, 1251 ). In particular, plant CYPs are known to be recruited to highly specialised functions, and show a high degree of substrate specificity.
The present invention discloses that by combining diTPS enzymes of class I and class II, and CYPs from different species, many different diterpenoids may be produced including diterpenoids not identified in nature. Surprisingly, it is revealed that a diTPS enzyme of class I, a diTPS enzyme of class II may be combined with one or more CYPs, wherein the diTPSs and the CYP(s) are from different species, resulting in a high diversity of diterpenoids, which can be produced.
Thus, the invention features an inventory of functional class II and class I diTPS and CYPs from a range of plants, which are useful for accumulating high-value and bioactive diterpenoids. When these diTPS and CYPs are paired into specific modules consisting of new-to-nature combinations, such as using enzymes from different plant species, both the structure and the stereochemistry of the formed diterpenoids can be controlled. This strategy gives access to a novel structural diversity of highly complex diterpenoids, many representing valuable compounds, for example flavours, fragrances, pharmaceuticals and fine chemicals. The diterpenoids may also be potentially bioactive molecules, starting materials for chemical synthesis, and intermediates for further functionalization to flavours, fragrances, pharmaceuticals and fine chemicals.
The invention thus in one aspect provides methods of producing a terpenoid, said methods comprising the steps of: providing one or more host organisms, wherein
I. at least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class II,
II. at least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class I,
III. at least one of the host organisms comprises a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species;
Incubating said one or more host organisms either simultaneously or sequentially under conditions allowing growth of said host organisms, wherein at least the host organism comprising a heterologous nucleic acid encoding a diTPS of class II is incubated in the presence of geranylgeranyl pyrophosphate (GGPP); c) Optionally isolating diterpenoid from the host organism(s) and/or from cultivation medium in which said host organism has been incubated.
The invention also provides a host organism comprising
I. a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
III. a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species. The invention furthermore provides a collection of host organisms comprising at least two different host organisms, wherein
I. at least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class II,
II. at least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class I,
III. at least one of the host organisms comprises a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species. The term "said diTPS of class II, said diTPS of class I and said CYP are not all from the same species" means that the diTPS of class II, the diTPS of class I and the CYP are derived from at least two different species. For example the diTPS of class II and the diTPS of class I may be from the same species, whereas the CYP is from a different species. In another example the diTPS of class II and the CYP may be from the same species, whereas the diTPS of class I is from a different species. In another example the diTPS of class I and the CYP may be from the same species, whereas the diTPS of classll is from a different species. In another example the diTPS of class II and the diTPS of class I and the CYP are from three different species. Description of Drawings
Figure 1 provides an example of biosynthesis pathways to diterpenes of different stereochemistry. The figure shows biosynthesis of three different isomers of manool by using diTPS enzymes from four different species: Oryza sativa (rice), Zea maize (maize), Coleus forskohlii (Indian coleus, medicinal plant) and Salvia sclarea (Clary sage, medicinal plant). The diTPS from Oryza sativa may for example be the enzyme of SEQ ID NO:1 . The diTPS from Zea maize may for example be the enzyme of SEQ ID NO:3. The diTPS from Coleus forskohlii may for example be the enzyme of SEQ ID NO:5. The diTPS from Salvia sclarea may for example be the enzyme of SEQ ID NO:1 1 .
Figures 2A and 2B show "Combinatorial wheels" showing examples of compounds, which can be made by combining different diTPS enzymes. The universal precursor, GGPP is shown in the middle. The next ring shows various examples of diTPS class II enzymes. The next ring shows various examples of diTPS class I enzymes. The outer ring shows the diterpenes produced by the indicated combinations of diTPS class II and diTPS class I enzymes. Each diterpene has been assigned a compound number used to identify said diterpene herein. The sequences of all of diTPS class II and diTPS class I enzymes are provided herein in the sequence listing. Table 1 provides a list and structures of the diterpenes.
Figures 3A and 3B shows the reactions catalysed by various class II diTPS enzymes as well as the diterpene pyrophosphate intermediates generated by the reactions.
Figure 4 shows an alignment of the amino acid sequences of selected diTPS enzymes of class I. Figure 5 shows an alignment of the amino acid sequences of selected diTPS enzymes of class II.
Figure 6 shows an alignment of amino acid sequences of selected CYPs. 6 motifs named SRS1 , SRS2, SRS3, SRS4, SRS5 and SRS6 are marked above the alignment with black squares. 4 highly conserved motifs are marked by numbers 1 to 4 and shown in black boxes. Other highly conserved residues are also marked in black boxes. Somewhat conserved residues are marked with grey boxes.
Figure 7 shows CYP catalysed bioconversion of Dehydroabietadiene (1 .1 ) and Miltiradiene (1 .2). Cytochrome P450 enzymes CfCYP76AH1 1 (top) and CfCYP71 D381 (bottom) catalyzes oxidation of Dehydroabietadiene (1 .1 ) and Miltiradiene (1 .2) into novel compounds 1 .1 .1 -1 .1 .4 and 1 .2.1 corresponding to hydroxylated or ketonated derivatives of Dehydroabietadiene (1 .1 ) and Miltiradiene (1 .2).
Figure 8A shows GC-MS chromatogram of N. benthamiana expressing
CfTPS1 +CfTPS3+CYP76AH1 1 and mass spectra of compound 1 .1 .1 & 1 .2.1
Figure 8B shows GC-MS chromatogram of N. benthamiana expressing CfTPSI + CfTPS3 + CYP71 D381 and mass spectra of compound 1 .1 .2-1 .1 .4. Figure 9 shows CYP catalysed bioconversion of Manool (1 .3). Cytochrome P450 enzymes CfCYP76AH8 (top), CfCYP76AH1 1 (middle) and PsCYP720B4 (bottom) catalyzes oxidation of Manool (1 .3) into novel compounds 1 .3.1 -1 .3.10 corresponding to hydroxylated or dihydroxylated derivatives of Manool (1 .3). Figure 10A shows GC-MS chromatogram of N. benthamiana expressing
CfTPSI +SsSCS+CYP76AH8 and mass spectra of compound 1 .3.1 & 1 .3.5.
Figure 10B shows GC-MS chromatogram of N. benthamiana expressing
CfTPSI +SsSCS+CYP76AH1 1 and mass spectra of compound 1 .3.6 & 1 .3.9.
Figure 10C shows GC-MS chromatogram of N. benthamiana expressing
CfTPSI +SsSCS+CYP720B4 and mass spectra of compound 1 .3.10.
Figure 1 1 shows CYP catalysed bioconversion of syn-manool (1 .4). Cytochrome P450 enzymes CfCYP71 D381 catalyzes oxidation of syn-manool (1 .4) into novel compounds 1 .4.1 -1 .4.3 where 1 .4.2 and 1 .4.3 correspond to hydroxylated derivatives of syn- manool (1 .4).
Figure 12 shows GC-MS chromatogram of N. benthamiana expressing
sCPSsyn+SsSCS+CYP71 D381 and mass spectra of compound 1 .4.1 -1 .4.3.
Figure 13 shows production of compound 1 .2.1 by a yeast strain expressing the diterpene synthases CfTPSI and CfTPS3 as well as the cytochrome P450
PsCYP720B4, Panel A shows extracted ion chromatogram recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material of strain IT-7- T13. The black trace corresponds to a yeast strain expressing the diterpene synthases CfTPSI and CfTPS3 as well as the cytochrome P450 PsCYP720B4, whereas the grey trace corresponds to a yeast strain expressing only the diterpene synthases CfTPSI and CfTPS3. Panel B, UV absorption (top), primary mass spectra (middle) as well as MS-n fragmentation mass spectra (bottom). Panel C shows a schematic
representation offormation of miltiradiene (1 .2) and dehydroabietadiene by CfTPSI and CfTPS3 and proposed cytochrome P450 catalysed oxidation of miltiradiene into the corresponding carboxylic acid, based on the parental mass. Figure 14 shows production of oxidized manool. Panel A, shows a schematic presentation of formation of manool (1 .3) and proposed cytochrome P450 catalyzed oxidation of manool in both the plant Nicotiana benthamiana and in yeast after heterologous expression of CfTPSI , SsSCS and the cytochrome P450 enzymes CYP76AH8, CYP76AH1 1 and CYP720B4, which catalyse oxidation of manool into novel compounds 1 .3.1 -1 .3.16. Panel B shows extracted ion chromatograms of three strains recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material. Extracted ion trace combine the ions corresponding to the mass of manool (273 m/z), monohydroxy-manool (289 m/z, hydrogen adduct of water loss ion), dihydroxymanool (347 m/z) and keto-hydroxy-manool (345 m/z). Result from yeast strains expressing CfTPSI , SsSCS and CfCYP76AH8 (strain IT7-T50)(black trace, top), CfTPSI and SsSCS (dark grey trace, middle) or an untransformed reference strain (light grey, bottom) are shown. Panels C, D and E display, UV (top), primary mass spectra (middle) as well as MS-n fragmentation mass spectra (bottom) of putative keto-hydroxy mannol (1 .3.1 1 ), dihydroxy manool (1 .3.12) and hydroxyl manool (1 .3.13) in yeast cell expressing CfTPSI , SsSCS, CfCYP76AH8, strain name IT7-T50.
Figure 15 shows production of oxidized manool. Panel A shows extracted ion chromatograms of two strains recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material. Extracted ion trace combine the ions corresponding to the mass of monohydroxy-manool (329 m/z, sodium adduct), dihydroxymanool (345 m/z, sodium adduct) and keto-hydroxy-manool or a carboxylic acid of manool (343 m/z, sodium adduct). Result from yeast strains expressing CfTPSI , SsSCS and PsCYP720B4 (black trace, top), CfTPSI and SsSCS (dark grey trace, bottom) are shown. Panels B display, UV (top), primary mass spectra (bottom) of the oxidized compounds 1 .3.12-1 .3.16 in yeast cells expressing CfTPSI , SsSCS, PsCYP720B4, strain name IT7-T37.
Figure 16 shows detection of oxidised isopimaradiene. Panel A shows a schematic representation of formation of syn-or iso-pimaradienes (1 .5-group) and proposed cytochrome P450 catalyzed oxidation of syn-or iso-pimaradienes into novel compounds 1 .5.1 -1 .5.6 tentatively identified as hydroxylated, dihydroxylated or ketonated derivatives of either syn- or iso-pimaradiene based on their parental masses. Panel B shows extracted ion chromatogram recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material. Extracted ion correspond to the mass of syn-pimara-9,(1 1 ),15-diene or similar C20H32 diterpene oxidized to contain either one keto- and one hydroxyl group or a single carboxylic acid group (303 m/z) for yeast cells expressing OssynCPP, CfTPS3 and PsCYP720B4 (IT7-T58)(black trace), untransformed reference strain (grey trace). Panel C, UV (top), primary mass spectra (middle) as well as MS-n fragmentation mass spectra (bottom) of the five oxidized diterpene derivatives. The 257 m/z fragment observed in all compounds 1 .5.1 -1 .5.5 is a signature ion of diterpenes. This ion is observed as 253 m/z in 1 .5.6 possibly indicative of introduction of two double bonds in the structure. (OssynCPP, CfTPS3, PsCYP720B4, strain name IT7-T58)
Figure 17 shows detection of higher oxidised manool through pairs of cytochrome P450s. Panel A shows a schematic representation. Cytochrome P450 enzyme pairs CYP76AH8, CYP71 D381 ; CYP76AH8, CYP76AH1 1 and CYP76AH8, CYP720B4 catalyse oxidation of Manool into novel compounds 1 .3.14-1 .3.18. Panel B shows extracted ion chromatograms of three strains recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material. Extracted ion trace combine the ions corresponding to the mass of monohydroxy-manool (289 m/z = hydrogen adduct of water loss ion) and dihydroxymanool (345 m/z = sodium adduct, 305 = hydrogen adduct of water loss ion). Yeast cells expressing CfTPSI , SsSCS and CfCYP76AH8 and CfCYP71 D381 (strain IT7-T32)(black trace, top), yeast cells expressing CfTPSI , SsSCS and CfCYP76AH8 (strain IT7-T50)(dark grey trace, middle) or an untransformed control strain (light grey, bottom). Panel C display, UV (top), primary mass spectra (middle) as well as MS-n fragmentation mass spectra (bottom) of oxidized diterpenes. Figure 18 shows detection of oxidized manool. Panel A shows extracted ion chromatograms of three strains recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material. Extracted ion trace combine the ions corresponding to the mass of monohydroxy-manool (289 m/z = hydrogen adduct of water loss ion) and dihydroxymanool (345 m/z = sodium adduct, 305 = hydrogen adduct of water loss ion). CfTPSI , SsSCS and CfCYP76AH8 and CfCYP76AH1 1 (strain IT7-T31 )(black trace, top), CfTPSI , SsSCS and CfCYP76AH8 (dark grey trace, middle), untransformed control strain (light grey, bottom). Panel B display, UV (top), primary mass spectra (middle) as well as MS-n fragmentation mass spectra (bottom) of oxidized diterpenes.
Figure 19 shows detection of oxidized manool. Panel A shows extracted ion chromatograms of three strains recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material. Yeast cells expressing CfTPSI , SsSCS and CfCYP76AH8 and PsCYP720B4 (strain IT7-T9)(black trace, top), yeast cells expressing CfTPSI , SsSCS and CfCYP76AH8 (dark grey trace, middle), yeast cells expressing CfTPSI , SsSCS and PsCYP720B4 (light grey, bottom). Panel B displays, UV (top), primary mass spectra (middle) as well as MS-n fragmentation mass spectra (bottom) of oxidized diterpenes.
Figure 20 shows a schematic pathway for production of ferruginol. Thus, co-expression of TwTPS7, CfTPS3 and CYP76AH4 resulted in the formation of ferruginol.
Detailed description of the invention
Method for producing diterpenoids
The present invention relates to a biosynthetic method for producing diterpenoids. The methods typically involves the steps of a) providing one or more host organisms, wherein
I. at least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class II,
II. at least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class I, III. at least one of the host organisms comprises a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species; b) Incubating said one or more host organisms either simultaneously or
sequentially under conditions allowing growth of said host organisms, wherein at least the host organism comprising a heterologous nucleic acid encoding a diTPS of class II is incubated in the presence of geranylgeranyl pyrophosphate (GGPP); c) Optionally isolating diterpenoid(s) from the host organism(s) and/or from its surroundings.
The host organism may be cultivated in a cultivation medium. For example, in embodiments of the invention, wherein the host organism is a microorganism, then typically said host organism is cultivated in a cultivation medium under conditions allowing growth of the particular microorganism. When the host organism is cultivated in a cultivation medium, then the diterpenoid may be isolated from the host organism(s and/or from the cultivation medium. Similarly, intermediates may also be obtained and/or isolated from the host organism(s) and/or the cultivation medium.
In embodiments of the invention, wherein the host organism is a plant, said host organism may be cultivated in soil. In such embodiments, then the diterpenoid may typically be isolated from the host organism or parts thereof. Similarly, intermediates may also be obtained and/or isolated from the host organism or part thereof.
In one preferred embodiment of the invention step a) comprises providing one host organism comprising:
I. a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
III. a heterologous nucleic acid encoding a cytochrome P450 (CYP).
In addition to the aforementioned heterologous nucleic acids said host organism may contain one or more additional heterologous nucleic acids. For example the host organism may comprise at least two heterologous nucleic acids encoding different CYPs. Said CYPs may be from the same or from different species. Thus, the host organism may comprise:
I. a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
III. at least two heterologous nucleic acids each encoding a different
cytochrome P450 (CYP).
Even though it is preferred that one host organism expresses both diTPS of class II, diTPS of class I and one or more CYPs, it is also comprised with the invention that the methods may employ different host organisms expressing one or more of said enzymes.
In embodiments of the invention employing use of more than one host organism, said host organisms may be incubated in cultivation medium together or sequentially. When incubated sequentially one or more host organisms may be incubated together followed by incubation of one or more other host organisms. Alternatively, each host organism may be incubated separately.
When the host organisms are incubated sequentially, the order of incubation is of importance. Thus, for example:
I. a host organism comprising a heterologous nucleic acid encoding a diTPS of class II may beincubated in cultivation medium before or simultaneously with any host organism comprising a heterologous nucleic acid encoding a diTPS of class I
II. a host organism comprising a heterologous nucleic acid encoding a diTPS of class I may beincubated in cultivation medium before or simultaneously with any host organism comprising a heterologous nucleic acid encoding a CYP.
Thus, in one embodiment the methods of the invention may comprise the steps of a) providing
I. a host organism comprising a heterologous nucleic acid encoding a diTPS of class II,
II. a host organism comprising a heterologous nucleic acid encoding a diTPS of class I, III. a host organism comprising one or more heterologous nucleic acids encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species; b) Incubating said host organisms under conditions allowing growth of said host organisms, wherein
i. all host organisms are incubated together in the presence of geranylgeranyl pyrophosphate (GGPP); or ii. the host organisms under I. and II are incubated together in the presence of GGPP, thereby producing a diterpene, followed by incubation of the host organism(s) under III in the presence of said diterpene; or
iii. the host organism under I. is incubated in the presence of
GGPP thereby producing a diterpene pyrophosphate intermediate, followed by incubation of the host organisms under II. and III. in the presence of said diterpene
pyrophosphate intermediate; or
iv. the host organism under I. is incubated in the presence of
GGPP thereby producing a diterpene pyrophosphate intermediate, followed by incubation of the host organism under II. in the presence of said diterpene pyrophosphate intermediate there by producing a diterpene, followed by incubation of the host organism(s) under III. in the presence of said diterpene, c) Optionally isolating diterpenoid from the host organism(s) and/or from the cultivation medium if the host organism has been cultivated in a cultivation medium.
It is also comprised within the invention that the order of incubation may be in the reverse order, i.e. starting with incubation of host organism(s) containing downstream enzymes, such as CYP or diTPS of class II, followed by incubation of host organism(s) containing upstream enzymes, such as diTPS of class I. This may be done in order to prevent accumulation of intermediates and/or undesired side products. In another embodiment the methods of the invention may comprise the steps of a) providing two host organisms, whereon the first host organism comprises
I. a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
and the second host organism comprises:
III. a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species; b) Incubating said host organisms under conditions allowing growth of said host organisms, wherein
i. both host organisms are incubated simultaneously in the presence of geranylgeranyl pyrophosphate (GGPP); or ii. the first host organism is incubated in the presence of GGPP thereby producing a diterpene, followed by incubation of the second host organism in the presence of said diterpene; c) Optionally isolating diterpenoid from the host organism(s) and/or from the cultivation medium if the host organism has been cultivated in a cultivation medium.
In yet another embodiment the methods of the invention may comprise the steps of a) providing two host organisms, wherein the first host organism comprises
I. a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
and the second host organism comprises
III. at least two heterologous nucleic acids each encoding a different
cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species; b) Incubating said host organisms in cultivation medium under conditions allowing growth of said host organisms, wherein
i. both host organisms are incubated simultaneously in the presence of geranylgeranyl pyrophosphate (GGPP); or ii. the first host organism is incubated in the presence of GGPP thereby producing a diterpene, followed by incubation of the second host organism in the presence of said diterpene; c) Optionally isolating diterpenoid from the host organism(s) and/or from the
cultivation medium if the host organism has been cultivated in a cultivation medium. t another embodiment the methods of the invention may comprise the steps of a) providing at least three host organisms, wherein the first host organism
comprises
I. a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
and the second host organism comprises
III. a heterologous nucleic acids encoding a cytochrome P450 (CYP) and the third and optionally further host organisms each comprises
IV. a heterologous nucleic acids encoding a cytochrome P450 (CYP) different to the CYP under III., with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species;
b) Incubating said host organisms in cultivation medium under conditions allowing growth of said host organisms, wherein
i. all host organisms are incubated simultaneously in the presence of geranylgeranyl pyrophosphate (GGPP); or ii. the first host organism is incubated in the presence of GGPP thereby producing a diterpene, followed by incubation of the second host organism in the presence of said diterpene, thereby producing an intermediate diterpenoid, followed by incubation of the third host organism in the presence of said intermediate diterpenoid; or
iii. the first host organism is incubated in the presence of GGPP thereby producing a diterpene, followed by incubation of the second host organism in the presence of said diterpene, thereby producing an intermediate diterpenoid, followed by incubation of the third host organism in the presence of said intermediate diterpenoid, thereby producing a second intermediate diterpenoid; followed by simultaneous or sequential incubation of said further host organisms in the presence of intermediate diterpenoids; or
iv. the first and the second host organism are incubated together in the presence of GGPP thereby producing an intermediate diterpenoid, followed by incubation of the third host organism in the presence of said intermediate diterpenoid; or
v. the first and the second host organism are incubated together in the presence of GGPP thereby producing an intermediate diterpenoid, followed by incubation of the third host organism in the presence of said intermediate diterpenoid, thereby producing a second intermediate diterpenoid; followed by simultaneous or sequential incubation of said further host organisms in the presence of intermediate diterpenoids; or
Optionally isolating diterpenoid from the host organism(s) and/or from the cultivation medium if the host organism has been cultivated in a cultivation medium.
In addition to the methods specifically outlined herein above, additional combinations of host organisms comprising diTPS of class I, diTPS of class II and various CYPs may be envisaged. Such combinations may also be employed with the present invention. The method may be performed in vitro or in vivo. It is generally preferred, that the methods of the invention are performed in vivo. The term "in vivo" as used herein refers to that the method is performed within a host organism, which for example may be any of the host organisms described herein below in the section "Host organism". The diterpene pyrophosphate intermediate and the diterpene may for example be any of the compounds described herein below in the sections "Diterpene pyrophosphate intermediates" and "Diterpenes".
The diterpenoid intermediates and the diterpenoids may be any of the compounds described herein below in the section "Diterpenoids". In preferred embodiments of the invention the host organism is capable of producing GGPP. In particular, it is preferred that the host organism comprising a heterologous nucleic acid encoding a diTPS of class II is capable of producing GGPP. Thus, the step of incubating the host organism(s) in the presence of GGPP may simply be performed by cultivating said host organism. Many host organisms produce GGPP endogenously. Thus, the host organism may be a host organism, which endogenously produce GGPP. Such host organisms for example include plants and yeast. Even if the host organism produces GGPP endogenously, the host organism may be recombinantly modulated to upregulate production of GGPP. This may be achieved by expressing one or more enzymes involved in the production of GGPP in said host organism. In addition or alternatively, the expression of one or more enzymes involved in reducing the level of GGPP may be reduced or even abolished. For example, the host organism may comprise a heterologous nucleic acid encoding a 1 -deoxy-D-xylulose-5-phosphate synthase. The host organism may comprise a heterologous nucleic encoding DXS, such as DXS Coleus forskohlii. DXS may be a polypeptide encoded by SEQ ID NO:25 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. In one aspect the invention relates to DXS encoded by SEQ ID NO:25 or a functional homologue, as well as to host organisms containing a nucleic acid encoding same.
The host organism may comprise a heterologous nucleic acid encoding a GGPP synthase (GGPPS), for example a GGPPS1 , such as GGPPS1 of Coleus forskohlii. The GGPPS1 may be a polypeptide encoded by SEQ ID NO:26 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. In one aspect the invention relates to GGPPS1 encoded by SEQ ID NO:26 or a functional homologue, as well as to host organisms containing a nucleic acid encoding same.
It is also comprised within the invention that GGPP is introduced to the host organism. If the host organism is a microorganism, then GGPP may be added to the cultivation medium of said microorganism. If the host organism is a plant, then GGPP may be added to the growing soil of the plant or it may be introduced into the plant by infiltration. Thus, if the heterologous nucleic(s) are introduced into the plant by infiltration, then GGPP may be co-infiltrated together with the heterologous nucleic acid(s).
When the methods of producing a diterpernoid are performed in vitro, the methods typically comprises the steps of: providing a host organism or a collection of host organism, wherein
I. at least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class II,
II. at least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class I,
III. at least one of the host organisms comprises a heterologous nucleic acid encoding a cytochrome P450 (CYP) b) preparing an extract of said host organism(s) or preparing individual extracts or each host organism;
c) providing GGPP
d) incubating one or more of said extracts with GGPP, wherein at least the extract of the host organism comprising a heterologous nucleic acid encoding a diTPS of class II is incubated in the presence of geranylgeranyl pyrophosphate (GGPP);
e) optionally, incubating the product of the incubation under d) with one or more of said extracts,
f) optionally, incubating the product of the incubation under e) with one or more of said extracts, wherein step f) may optionally be repeated; thereby producing a diterpenoid.
Thus, when the methods are performed in vitro, an extract containing all enzymes may be incubated with GGPP thereby producing a diterpenoid, or extracts containing different enzymes may be used sequentially. If the extracts are used sequentially the order is of importance.
Thus, steps d) to f) may for example be performed in either of the following ways: i. incubating an extract containing diTPS of class I, diTPS of class II and one or more CYPs in the presence of geranylgeranyl pyrophosphate (GGPP); or ii. incubating an extract containing diTPS of class I and diTPS of class II in the presence of GGPP, thereby producing a diterpene, followed by incubation an extract containing CYP in the presence of said diterpene; or
iii. incubating an extract containing diTPS of class II in the presence of GGPP thereby producing a diterpene pyrophosphate intermediate, followed by incubation of an extract containing diTPS of class I and CYP in the presence of said diterpene pyrophosphate intermediate; or
iv. incubating an extract containing diTPS of class II in the presence of GGPP thereby producing a diterpene pyrophosphate intermediate, followed by incubation of an extract containing diTPS of class I in the presence of said diterpene pyrophosphate intermediate there by producing a diterpene, followed by incubation of an extract containing a CYP in the presence of said diterpene,
v. incubating an extract containing diTPS of class I and diTPS of class II in the presence of GGPP, thereby producing a diterpene, followed by incubation of an extract comprising a CYP in the presence of said diterpene, thereby producing an intermediate diterpenoid, followed by incubation of an extract comprising at least one other CYP in the presence of said intermediate diterpenoid,
vi. incubating an extract containing diTPS of class I and diTPS of class II in the presence of GGPP, thereby producing a diterpene, followed by incubation of an extract comprising a CYP in the presence of said diterpene, thereby producing an intermediate diterpenoid, followed by incubation of an extract comprising at least one other CYP in the presence of said intermediate diterpenoid, thereby producing a second intermediate diterpenoid; followed by simultaneous or sequential incubation of extract(s) containing additional CYP(s) in the presence of intermediate diterpenoids; or
vii. incubating an extract containing diTPS of class I and diTPS of class II and
CYP(s) in the presence of GGPP, thereby producing an intermediate diterpenoid, followed by incubation of an extract comprising at least one other CYP in the presence of said intermediate diterpenoid; or
viii. incubating an extract containing diTPS of class I and diTPS of class II and CYP(s) in the presence of GGPP, thereby producing an intermediate diterpenoid, followed by incubation of an extract comprising at least one other CYP in the presence of said intermediate diterpenoid, followed by incubation of an extract comprising at least one other CYP in the presence of said intermediate diterpenoid, thereby producing a second intermediate diterpenoid; followed by simultaneous or sequential incubation of extract(s) containing additional CYP(s) in the presence of intermediate diterpenoids.
Alternative combinations of extracts comprising diTPSs and CYPS may also be envisaged and are comprised within the scope of the present invention.
When the methods are performed sequentially, the intermediate products, for example the diterpene, the diterpene pyrophosphate intermediate or any of the intermediate diterpenoids, may be purified or partly purified before performing the next step of the method. However, it is also comprised within the invention that crude extract or cultivation medium is used for the subsequent steps.
In order to produce a specific diterpenoid according to the present invention, a useful combination of a diTPS of class II and a diTPS of class I and one or more CYPs must be employed. Examples of specific combinations of a diTPS of class II and a diTPS of class I, which leads to production of specific diterpenes, are shown in figure 2. Thus, for production of a diterpenoid having any of the diterpenes shown in figure 2 as intermediate, then the specific combination of a diTPS of class II and a diTPS of class I shown in figure 2 may be employed in combination with one or more CYP. Other combinations of diTPS of class II and diTPS of class I may be used. In general, the diTPS of class II is selected so that it produces a diterpene pyrophosphate intermediate containing a decalin core having the desired stereochemistry at the 9 and 10 substitutions. Useful diTPS of class II are described below and also specific diTPS of class II catalysing formation of diterpene pyrophosphate intermediates with a specific stereochemistry are described.
The diTPS of class I is selected so that is catalyses the conversion of the diterpene pyrophosphate intermediate to a diterpene, which can serve as an intermediate in the production of the final diterpenoid. Since a diterpenoid according to the present invention is a diterpene substituted with one or more hydroxyl, keto and/or carboxyl groups, for simplicity the unsubstitued diterpene may be referred to as the "precursor diterpene" of the diterpenoid. By way of example, dehydroabietadiene may be referred to as the precursor diterpene of hydroxy dehydroabietadiene. Thus, diTPS of class I is selected so that is catalyses the conversion of the diterpene pyrophosphate
intermediate to a precursor diterpene of the diterpenoid of interest. Useful diTPS of class I are described below. Also specific reactions catalysed by various diTPS of class I are described, enabling the skilled person to select a useful diTPS of class I for production of a precursor diterpene of a desired diterpenoid.
The CYP(s) are collectively selected so that they catalyse oxidation of the precursor diterpene to obtain the desired diterpenoid. If the diterpenoid is a hydroxylated diterpene it is generally sufficient to use one CYP. Said one CYP is selected so that it can catalyse hydroxylation of the precursor diterpene. If the diterpenoid contains ketone and/or carboxyl and/or several hydroxyl groups, then one or more CYP may be employed. Details regarding how to select useful CYPs are given herein below in the section "CYP".
Once a useful diTPS of class II and diTPS of class I and one or more CYPs have been selected, nucleic acids encoding same may be expressed in the host organism(s) allowing production of the diterpenoid in a host organism. Putative useful combinations of a diTPS of class II and a diTPS of class I and CYP(s) for production of a given diterpenoid may be tested by expressing said diTPS of class II and said diTPS of class I and said CYP(s) in one or more host organism followed by testing for production of the diterpenoid, e.g. by GC-MS analysis and/or NMR analysis. Putative useful combinations of a diTPS of class II and a diTPS of class I and CYP for production of a given diterpenoid may in particular be tested as described in Example 1 herein below. Methods for expression of enzymes in host organisms are well known to skilled person, and may for example include the methods described herein below in the section "Heterologous nucleic acids".
The term GGPP as used herein refers to geranylgeranyl diphosphate and is a compound of the following structure:
Figure imgf000021_0001
wherein PPO- is diphosphate. PPO- and -OPP may be used interchangeably herein.
As outlined above the methods of the invention involves use of diTPS of class II or of heterologous nucleic acids encoding diTPS of class II. The diTPS of class II may be any of the diTPS of class II described herein below in the section "diTPS of class II".
As outlined above the methods of the invention involves use of diTPS of class I or of heterologous nucleic acids encoding diTPS of class I. The diTPS of class I may be any of the diTPS of class I described herein below in the section "diTPS of class I".
As outlined above the methods of the invention involves use of CYP or of heterologous nucleic acids encoding CYP. The CYP may be any of the CYPs described herein below in the section "CYP". diTPS of class II
The methods of the invention use a diTPS of class II. The invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class II. Said diTPS of class II is an enzyme capable of catalysing protonation-initiated cationic cycloisomerization of GGPP to form a diterpene pyrophosphate intermediate. The class II diTPS reaction, may be terminated either by deprotonation or by water capture of the diphosphate carbocation.
In particular the diTPS of class II may be an enzyme capable of catalysing the reaction I:
Figure imgf000022_0001
wherein PPO- is diphosphate and the indicates either a double bond or two single bonds, wherein one is substituted with -OH and the other with -CH3.
Thus, may be ^Ch2 or OH .
When no stereochemistry is indicated, the bond may be in any conformation. By selecting appropriate diTPS of class II the stereochemistry of the diterpene produced may be controlled. Accordingly, by following the description of the present invention, the skilled person may be able to design the production of a given precursor diterpene by selecting appropriate diTPS enzymes of class II and class I as described herein.
The diTPS of class II is generally a polypeptide sharing at least some sequence similarity to at least one of SEQ ID NO:1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7 or SEQ ID NO:8. In particular, it is preferred that the diTPS of class II shares at least 30%, preferably at least 40% sequence identity with at least one of SEQ ID NO:1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7 and SEQ ID NO:8. In particular, it is preferred that the diTPS of class II shares at least 30%, such as at least 35% sequence identity to the sequence of SsLPPS (SEQ ID NO:6) or to the sequence of AtCPS (see figure 5). Furthermore, it is preferred that the diTPS of class II in addition to above mentioned sequence identity also contains the following motif of four amino acids: D/E-X-D-D,
wherein X may be any amino acid, such as any naturally occurring amino acids. In particular, X may be an amino acid with a hydrophobic side chain, and thus X may for example be selected from the group consisting of A, I, L, M, F, W, Y and V. Even more preferably X is an amino acid with a small hydrophobic side chain, and thus X may be selected from the group consisting of A, I, L and V.
In one embodiment of the invention said motif of four amino acids is: D/E-l/V-D-D
D/E indicates that said amino acid may be D or E and l/V indicates that said amino acid may be I or V. Amino acids are herein named using the lUPAC nomenclature for amino acids.
In particular, it is preferred that the diTPS of class II contains above described motif in a position corresponding to position aa 372 to 375 of SsLPPS of SEQ ID NO:6. A position corresponding to position aa 372 to 375 of SsLPPS of SEQ ID NO:6 is identified by aligning the sequence of a diTPS of class II of interest to SEQ ID NO:6 and optionally to additional sequences of diTPS of class II as e.g. shown in figure 5 and identifying the amino acids of said diTPS of class II aligning with aa 372 to 375 of SsLPPS of SEQ ID NO:6. It is furthermore preferred that in addition to sharing above mentioned sequence identity and containing said motif, then as many as possible of the amino acids marked with a black box in figure 5 are retained. Thus, when aligned to the sequence of ScLPPS (SEQ ID NO:6), then preferably the diTPS of class II also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids marked by a black box in figure 5. Alternatively, when aligned to the sequence of sequence of AtCPS (see figure 5), then preferably the diTPS of class II also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids marked by a black box in figure 5. Thus, the diTPS of class II may for example be selected from the group consisting of diTPS of class II of the following types:
i. syn-CPP type, such as any of the enzymes described herein below in the
section "syn-CPP type diTPS"
ii. ent-CPP type, such as any of the enzymes described herein below in the
section "ent-CPP type diTPS"
iii. (+)-CPP type, such as any of the enzymes described herein below in the
section "(+)-CPP type diTPS"
iv. LPP type, such as any of the such as any of the enzymes described herein below in the section "LPP type diTPS"
v. LPP like type, such as any of the enzymes described herein below in the
section "LPP like type diTPS"
Certain diTPS enzymes are bifunctional in the sense that they may be classified as both class II and class I diTPS enzymes. Such bifunctional diTPS enzymes in general contain both the four amino acids motif: D/E-X-D-D, described herein above, as well as the five amino acid motif: D-D-X-X-D/E, described herein below. It is preferred that the diTPS of class II is not a bifunctional enzyme of both class II and class I. It is also preferred that the diTPS of class I is not a bifunctional enzyme of both class II and class I. syn-CPP type diTPS
The methods of the invention involve use of a diTPS of class II. The invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class II. In one embodiment said diTPS of class II is a syn-CPP type diTPS. Such diTPS of class II are in particular useful in embodiments of the inventions, wherein the diterpenoid to be produced contains a 9S,10R decalin core. As used herein the term "syn-CPP type diTPS" refers to any enzyme capable of catalysing the reaction II:
Figure imgf000025_0001
wherein PPO- refers to diphosphate. In one embodiment the syn-CPP type diTPS may be syn-copalyl pyrophosphate synthase (syn-CPP), such as syn-CPP from Oryza sativa. Syn-CPP may also be referred to as CPSsyn. In particular, said syn-CPP type diTPS may be a polypeptide of SEQ ID NO:1 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of a syn-CPP is a polypeptide, which is also capable of catalysing reaction II described above.
The heterologous nucleic acid encoding said syn-CPP type diTPS may for example be any nucleic acid encoding any of the syn-CPP type diTPS described herein. Thus, the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:1 or any of functional homologue thereof described in the previous paragraph. For example, said heterologous nucleic acid may comprise or consist of SEQ ID NO:31 . ent-CPP type The methods of the invention involve use of a diTPS of class II. The invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class II. In one embodiment said diTPS of class II is an ent-CPP type diTPS. Such diTPS of class II are in particular useful in embodiments of the inventions, wherein the diterpenoid to be produced contains a 9R,10R decalin core. As used herein the term "ent-CPP type diTPS" refers to any enzyme capable of catalysing the reaction III:
Figure imgf000026_0001
wherein PPO- refers to diphosphate.
In one embodiment the ent-CPP type diTPS may be EpTPS7. In particular, said ent- CPP type diTPS may be a polypeptide of SEQ ID NO:2 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
The heterologous nucleic acid encoding said ent-CPP type diTPS, may for example be any nucleic acid encoding any of the ent-CPP type diTPS described herein. Thus, the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:2 or any of functional homologue thereof described in the previous paragraph.
In another embodiment the ent-CPP type diTPS may be ZmAN2. In particular, said ent- CPP type diTPS may be a polypeptide of SEQ ID NO:3 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. The heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:3 or any of functional homologue thereof described in the previous paragraph. The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of an ent-CPP is a polypeptide, which is also capable of catalysing reaction III described above.
(+)-CPP type diTPS
The methods of the invention involve use of a diTPS of class II. The invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class II. In one embodiment said diTPS of class II is a (÷)-CPP type diTPS. Such diTPS of class II are in particular useful in embodiments of the inventions, wherein the diterpenoid to be produced contains a 9S, 10S decalin core.
As used herein the term "(+)-CPP type diTPS" refers to any enzyme capable of catalysing the reaction IV:
Figure imgf000027_0001
wherein PPO- refers to diphosphate.
In one embodiment the (- -CPP type diTPS may be TwTPS7. In particular, said {·)- CPP type diTPS may be a polypeptide of SEQ ID NO:4 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. The heterologous nucleic acid encoding said ( -)-CPP type diTPS may for example be any nucleic acid encoding any of the (+)-CPP type diTPS described herein. Thus, the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:4 or any of functional homologue thereof described in the previous paragraph.
In another embodiment the {÷}-CPP type diTPS may be CfTPSI . In particular, said (+)- CPP type diTPS may be a polypeptide of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
The heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:5 or any of functional homologue thereof described in the previous paragraph. For example, said heterologous nucleic acid may comprise or consist of SEQ ID NO:27. Said heterologous nucleic acid may also comprise or consist of SEQ ID NO:28. This may in particular be the case in embodiments of the invention wherein the host organism is yeast, e.g. S. cerevisiae.
The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of a ( }-CPP is a polypeptide, which is also capable of catalysing reaction IV described above. LPP type diTPS
The methods of the invention involve use of a diTPS of class II. The invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class II. In one embodiment said diTPS of class II is a LPP type diTPS. Such diTPS of class II are in particular useful in embodiments of the inventions, wherein the diterpenoid to be produced contains a 8-hydroxy-decalin core. However, LPP type diTPS may also be useful in other embodiments of the invention.
As used herein the term "LPP type diTPS" refers to any enzyme capable of catalysing the reaction V:
Figure imgf000029_0001
wherein PPO- refers to diphosphate.
In one embodiment the LPP type diTPS may be labda-13-en-8-ol pyrophosphate synthase, such as SsLPPS. In particular, said LPP type diTPS may be a polypeptide of SEQ ID NO:6 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. In embodiments of the invention, wherein the diTPS of class II is SsLPPS or a functional homologue thereof sharing above mentioned sequence identity, then it may be preferred that the diTPS of class I is not SsSCS, CfTPS3, CfTPS4 or EpTPS8 or a functional homologue of any of the aforementioned sharing at least 70% sequence identity therewith. Thus, in embodiments of the invention, wherein the diTPS of class II is SsLPPS, then it is preferred that the diTPS of class I is not SsSCS, CfTPS3, CfTPS4 or EpTPS8.
The heterologous nucleic acid encoding said LPP type diTPS may for example be any nucleic acid encoding any of the LPP type diTPS described herein. Thus, the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:6 or any of functional homologue thereof described in the previous paragraph.
It is also preferred that if the diTPS of class II is SsCPSL, then it is preferred that the diTPS of class I is not SsKSLI or SsKSL2.
In another embodiment the LPP type diTPS may be TwTPS21 . In particular, said LPP type diTPS may be a polypeptide of SEQ ID NO:7 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
The heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:7 or any of functional homologue thereof described in the previous paragraph.
In another embodiment the LPP type diTPS may be CfTPS2. In particular, said LPP type diTPS may be a polypeptide of SEQ ID NO:17 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. In embodiments of the invention, wherein the diTPS of class II is CfTPS2 or a functional homologue thereof sharing above mentioned sequence identity, then it is preferred that the diTPS of class I is not CfTPS3 or CfTPS4 or EpTPS8 or a functional homologue of the aforementioned sharing at least 70% sequence identity therewith. Thus, in embodiments of the invention, wherein the diTPS of class II is CfTPS2, then it is preferred that the diTPS of class I is not CfTPS3 or CfTPS4 or EpTPS8. The heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:17 or any of functional homologue thereof described in the previous paragraph.
The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of a LPP is a polypeptide, which is also capable of catalysing reaction V described above.
LPP like type diTPS The methods of the invention involve use of a diTPS of class II. The invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class II. In one embodiment said diTPS of class II is a LPP like type diTPS. In one embodiment the LPP like type diTPS may be TwTPS14/28. In particular, said LPP like type diTPS may be a polypeptide of SEQ ID NO:8 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
The heterologous nucleic acid encoding said LPP like type diTPS may for exampe be any nucleic acid encoding any of the LPP like type diTPS described herein. Thus, the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:8 or any of functional homologue thereof described in the previous paragraph.
The sequence identity is preferably calculated as described herein below in the section "Sequence identity". diTPS of class I
The methods of the invention involve use of a diTPS of class I. The invention also features host organisms comprising a heterologous nucleic acid encoding a diTPS of class I.
Said diTPS of class I is an enzyme capable of catalyzing cleavage of the diphosphate group of the diterpene pyrophosphate intermediate and additionally preferably also is capable of catalysing cyclization and/or rearrangement reactions on the resulting carbocation. As with the class II diTPSs, deprotonation or water capture may terminate the class I diTPS reaction leading to hydroxylation of the diterpene pyrophosphate intermediate.
The diTPS of class I is generally a polypeptide sharing at least some sequence similarity to at least one of SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:1 1 , SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16 or SEQ ID NO:17. In particular, it is preferred that the diTPS of class I shares at least 30%, preferably at least 40%, more preferably at least 45% sequence identity with at least one of SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:1 1 , SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16 and SEQ ID NO:17. In particular, it is preferred that the diTPS of class I shares at least 30%, such as at least 35% sequence identity to the sequence of ScSCS (SEQ ID NO:1 1 ) or to the sequence of AtEKS (see figure 4). Furthermore, it is preferred that the diTPS of class I in addition to above mentioned sequence identity also contains the following motif of five amino acids:
D-D-X-X-D/E,
wherein X may be any amino acid, such as any naturally occurring amino acids. In particular, X may be an amino acid with a hydrophobic side chain, and thus X may for example be selected from the group consisting of A, I, L, M, F, W, Y and V. Even more preferably X is an amino acid with a small hydrophobic side chain, and thus X may be selected from the group consisting of A, I, L and V.
In one embodiment of the invention said motif of five amino acids is:
D-D-F-F-D/E
D/E indicates that said amino acid may be D or E.
In particular, it is preferred that the diTPS of class I contains said motif in a position corresponding to position aa 329-333 of SsSCS of SEQ ID NO:1 1 . A position corresponding to position aa 329-333 of SsSCS of SEQ ID NO:1 1 is identified by aligning the sequence of a diTPS of class I of interest to SEQ ID NO:1 1 and optionally to additional sequences of diTPS of class I as e.g. shown in figure 4, and identifying the amino acids of said diTPS of class I aligned with aa 329-333 of SsSCS of SEQ ID NO:1 1 .
It is furthermore preferred that in addition to sharing above mentioned sequence identity and containing said motif, then as many as possible of the amino acids marked with a black box in figure 4 are retained. Thus, when aligned to the sequence of ScSCS (SEQ ID NO:1 1 ), then preferably the diTPS of class I also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids marked by a black box in figure 4. Alternatively, when aligned to the sequence of sequence of AtEKS (see figure 4), then preferably the diTPS of class I also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids marked by a black box in figure 4.
Thus, the diTPS of class I may for example be selected from the group consisting of diTPS of class I of the following types:
i. EpTPS8 like diTPS, such as any of the enzymes described herein below in the section "EpTPS8"
ii. EpTPS23 like diTPS, such as any of the enzymes described herein below in the section "EpTPS23"
iii. SsSCS like diTPS, such as any of the enzymes described herein below in the section "SsSCS"
iv. CfTPS3 like diTPS, such as any of the enzymes described herein below in the section "CfTPS3"
v. CfTPS4 like diTPS, such as any of the enzymes described herein below in the section "CfTPS4"
vi. TwTPS2 like diTPS, such as any of the enzymes described herein below in the section "TwTPS2"
vii. EpTPSI like diTPS, such as any of the enzymes described herein below in the section "TwTPSI "
viii. CfTPS14 like diTPS, such as any of the enzymes described herein below in the section "CfTPS14"
The diTPS of class I may in one embodiment also be MvTPS5 like diTPS, such as any of the enzymes described herein below in the section "MvTPS5".
In one embodiment of the invention the diTPS of class I may be a TPS from
Tripterygium Wilfordii. For example, the diTPS of class I may be the polypeptide of SEQ ID NO:41 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
The heterologous nucleic acid encoding said diTPS of Tripterygium Wilfordii, may for example be any nucleic acid encoding any of the diTPS of Tripterygium Wilfordii described herein. Thus, the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:41 or any of functional homologue thereof described in the previous paragraph. For example the heterologous nucleic acid may be a nucleic acid of SEQ ID NO:45.
EpTPS8
The invention involves use of a diTPS of class I. In one embodiment said diTPS of class I may be an EpTPS8 like diTPS. In embodiments of the invention, wherein the diTPS of class I is a EpTPS8 like diTPS, then it may be preferred that the diTPS of class II is not CfTPS2 or SsLPPS or a functional homologue of any of the
aforementioned sharing at least 70% sequence identity therewith. Thus, in embodiments of the invention, wherein the diTPS of class I is EpTPS8, then it may be preferred that the diTPS of class II is not CfTPS2 or SsLPPS.
In particular, said diTPS of class I may be an EpTPS8 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure. For example said diTPS of class I may be and EpTPS8 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas I, II, III, XLI, XLII, XLIII, XLIV or XLV:
Figure imgf000034_0001
Figure imgf000035_0001
The waved line " i " as used herein indicates a bond of undefined stereochemistry, i.e. the bond may be either a " I " or " 5 ".
Dependent on the structure of the diterpene pyrophosphate intermediate then the diterpene containing a core of formula I or I I may have different stereochemistry. In general the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by a EpTPS8 like diTPS.
The EpTPS8 like diTPS may be any enzyme capable of catalysing the reaction VI I :
Diterpene pyrophosphate intermediate containing a decalin core structure ►
Diterpene containing a core structure of formula I or formula I I or formula II I or formula VI.
In particular EpTPS8 like diTPS may be an enzyme catalysing the reaction VI II :
Figure imgf000035_0002
wherein -OPP indicates diphosphate. During reaction VII I the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The EpTPS8 like diTPS may also be an enzyme catalysing the reaction IX:
Figure imgf000036_0001
wherein OPP indicated diphosphate. During reaction IX the produced diterpene will general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The EpTPS8 like diTPS may also be an enzyme catalysing the reaction X:
Figure imgf000036_0002
wherein -OPP indicated diphosphate. During reaction X the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
In particular, the EpTPS8 like diTPS may be an enzyme catalysing the reaction XXV:
Figure imgf000036_0003
wherein -OPP indicates diphosphate. During reaction XXV the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate. In one embodiment EpTPS8 like diTPS may be a terpene synthase from Euphobia peplus, and in particular it may be TPS8 from Euphobia peplus. TPS8 from Euphobia peplus is also referred to as EpTPS herein. In particular, said EpTPS8 like diTPS may be a polypeptide of SEQ ID NO:9 or a functional homologue thereof sharing at least
70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
The heterologous nucleic acid encoding said EpTPS8 like diTPS may for example be any nucleic acid encoding any of the EpTPS8 like diTPS described herein. Thus, the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:9 or any of functional homologue thereof described in the previous paragraph.
The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of EpTPS8 is a polypeptide, which is also capable of catalysing at least one of reactions VII, VIII, IX, X and XXV described above.
EpTPS23
The invention involves use of a diTPS of class I. In one embodiment said diTPS of class I may be an EpTPS23 like diTPS.
In particular, said diTPS of class I may be an EpTPS23 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure. For example said diTPS of class I may be an EpTPS23 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas I, II, XLIV or XLI:
Figure imgf000038_0001
Dependent on the structure of the diterpene pyrophosphate intermediate then the diterpene containing a core of formula I or II may have different stereochemistry. In general the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by an EpTPS23 like diTPS.
The EpTPS23 like diTPS may in particular be an enzyme capable of catalysing the reaction XI:
Diterpene pyrophosphate intermediate containing a decalin core structure ►
Diterpene containing a core structure of formula I or formula II
In particular an EpTPS23 like diTPS may be an enzyme catalysing the reaction VIII:
Figure imgf000038_0002
wherein -OPP indicated diphosphate. During reaction VIII the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate. The EpTPS23 like diTPS may also be an enzyme catalysing the reaction IX:
Figure imgf000039_0001
wherein -OPP indicated diphosphate. During reaction IX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate. In one embodiment an EpTPS23 like diTPS may be a diterpene synthase from
Euphobia peplus. In particular, the EpTPS23 like diTPS may be TPS23 of Euphobia peplus. TPS23 of Euphobia peplus may also be referred to as EpTPS23 herein. In particular, said EpTPS23 like diTPS may be a polypeptide of SEQ ID NO:10 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. The heterologous nucleic acid encoding said EpTPS23 like diTPS may for example be any nucleic acid encoding any of the EpTPS23 like diTPS described herein. Thus, the heterologous nucleic acid may be any nucleic acid encoding the polypeptide of SEQ ID NO:10 or any of functional homologue thereof described in the previous paragraph. The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of EpTPS23 is a polypeptide, which is also capable of catalysing at least one of reactions VIII or IX described above.
SsSCS The invention involves use of a diTPS of class I. In one embodiment said diTPS of class I may be a SsSCS like diTPS.
In particular, said diTPS of class I may be a SsSCS like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a decalin substituted at the 10 position with C5-alkenyl chain, which optionally may be substituted with a hydroxyl and/or a methyl group and/or =C.
Furthermore, said diTPS of class I may be a SsSCS like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of formula III, XLVI, XXVII, XXVIII, XXIX, XXX, XXXI, XXXII, XXXIII, or XXXIV:
Figure imgf000040_0001
Dependent on the structure of the diterpene pyrophosphate intermediate then the diterpene containing a decalin substituted at the 10 position with said C5-alkenyl chain, or the diterpene containing a core of formula III may have different stereochemistry. In general the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by a SsSCS like diTPS. The SsSCS like diTPS may be any enzyme capable of catalysing the following reaction XII:
Diterpene pyrophosphate intermediate containing a decalin core structure ►
Diterpene containing a decalin core substituted at the 10 position with C5-alkenyl chain, which optionally may be substituted with a hydroxyl and/or a methyl group and/or =C OR diterpene containing a core structure of formula III.
The SsSCS like diTPS may in particular be an enzyme capable of catalysing the reaction XVI:
Figure imgf000041_0001
wherein -GPP is diphosphate; and
indicates either a double bond or two single bonds, wherein one is substituted with -OH and the other with -GH3; and
the doited lines without star indicates a bond, which optionally is present.
^^CH3
Γ hus, may be ^Ch2 or OH . it is to be understood that in embodiments of the invention, wherein [he doited line
OH
shown as I is not present, then also the hydroxyl group is not present. It is preferred that one and only one of the dotted lines without star indicates a bond.
A SsSCS like diTPS may in particular be an enzyme capable of catalysing the reaction XVII:
Figure imgf000042_0001
wherein OPP indicated diphosphate. During reaction XVI I the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate. Thus, the SsSCS like diTPS may be an enzyme catalysing any of the reactions XII I, XIV and XV shown in figure 1 .
The SsSCS like diTPS may also be an enzyme catalysing the following reaction XXVI I I:
Figure imgf000042_0002
wherein OPP is diphosphate and P is a C5-alkenyl substituted with methyl and/or hydroxyl. Preferably, PM is C5-alkenyl containing one or two double bonds. When R is alkenyl containing one double bond, said alkenyl is preferably substituted with hydroxyl and methyl. When R is alkenyl containing two double bonds, said alkenyl is preferably substituted with methyl.
The SsSCS like diTPS may also be an enzyme catalysing the following reaction XXIX:
Figure imgf000042_0003
wherein -OPP is diphosphate and R2 is a C5-alkenyl substituted with methyl and/or hydroxyl or with =C, and Xi is either -OH or methyl, and X2 is either -H or -OH, wherein one and only one of Xi and X2 is -OH. Preferably, R2 is C5-alkenyl containing one or two double bonds. When R2 is alkenyl containing one double bond, said alkenyl is preferably substituted with hydroxyl and methyl or with =C. When R2 is alkenyl containing two double bonds, said alkenyl is preferably substituted with methyl.
The SsSCS like diTPS may also be an enzyme catalysing the reaction X:
Figure imgf000043_0001
wherein OPP indicates diphosphate. During reaction X the produced diterpene will general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The SsSCS like diTPS may also be an enzyme catalysing the reaction LI I:
Figure imgf000043_0002
wherein OPP indicates diphosphate.
In one embodiment a SsSCS like diTPS may be SCIareol Synthase (SCS) from Salvia Sclarea. SCS from Salvia Sclarea may also be referred to as SsSCS herein. In particular, said SsSCS like diTPS may be a polypeptide of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
The heterologous nucleic acid encoding said SsSCS like diTPS may for example be any nucleic acid encoding any of the SsSCS like diTPS described herein. Thus, the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:1 1 or any of functional homologue thereof described in the previous paragraph. For example, said heterologous nucleic acid may comprise or consist of SEQ ID NO:32. Said heterologous nucleic acid may also comprise or consist of SEQ ID NO:33. This may in particular be the case in embodiments of the invention wherein the host organism is yeast, e.g. S. cerevisiae. The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of SsSCS is a polypeptide, which is also capable of catalysing at least one of reactions XII, XIII, XIV, XV, XVI, XVII, XXVIII or XXIX or Lll described above. CfTPS3
The invention involves use of a diTPS of class I. In one embodiment said diTPS of class I may be a CfTPS3 like diTPS. In embodiments of the invention, wherein the diTPS of class I is a CfTPS3 like diTPS, then it may be preferred that the diTPS of class II is not CfTPS2 or SsLPPS or a functional homologue thereof sharing at least 70% sequence identity therewith. Thus, in embodiments of the invention, wherein the diTPS of class I is CfTPS3, then it may be preferred that the diTPS of class II is not CfTPS2 or SsLPPS. In particular, said diTPS of class I may be a CfTPS3 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure. For example said diTPS of class I may be a CFTPS3 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas VI, IX XXXV, XXXVI, II, XXXVII, XXXVIII, XXXIX, XL, III or XXXII:
Figure imgf000045_0001
Dependent on the structure of the diterpene pyrophosphate intermediate then the diterpene containing a core of formula VI, IX, XXXV, II or XXXIX may have different stereochemistry. In general the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by the CfTPS3 like diTPS.
The CfTPS3 like diTPS may be any enzyme capable of catalysing the reaction XXIII:
Diterpene pyrophosphate intermediate containing a decalin core structure Diterpene containing a core structure of formula VI or formula IX, XXXV, XXXVI, ΙΙ,ΧΧΧνΐ Ι, XXXVI I I, XXXIX, XL, I I I or XXXI I.
The CfTPS3 like diTPS may in particular be an enzyme capable of catalysing the reaction XXIV:
Figure imgf000046_0001
wherein OPP indicates diphosphate. During reaction XXIV the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The CfTPS3 like diTPS may in particular be an enzyme capable of catalysing the reaction XXI I :
Figure imgf000046_0002
wherein OPP is diphosphate. During reaction XXI I the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate. The CfTPS3 like diTPS may in particular be an enzyme capable of catalysing the reaction LI N :
Figure imgf000047_0001
wherein OPP is diphosphate. During reaction LIN the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The CfTPS3 like diTPS may in particular be an enzyme capable of catalysing the reaction LIV:
Figure imgf000047_0002
wherein OPP is diphosphate. During reaction LIV the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The CfTPS3 like diTPS may also be an enzyme catalysing the reaction X:
Figure imgf000047_0003
wherein OPP indicates diphosphate. During reaction X the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate. In one embodiment the CfTPS3 like diTPS may be a diterpene synthase from Coleus forskohlii. In particular, the CfTPS3 like diTPS may be a TPS3 from Coleus forskohlii. TPS3 from Coleus forskohlii may also be referred to as CfTPS3. In particular, said CfTPS3 like diTPS may be a polypeptide of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. The heterologous nucleic acid encoding said CfTPS3 like diTPS may for example be any nucleic acid encoding any of the CfTPS3 like diTPS described herein. Thus, the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:12 or any of functional homologue thereof described in the previous paragraph. For example, said heterologous nucleic acid may comprise or consist of SEQ ID NO:29. Said heterologous nucleic acid may also comprise or consist of SEQ ID NO:30. This may in particular be the case in embodiments of the invention wherein the host organism is yeast, e.g. S. cerevisiae.
The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of CfTPS3 is a polypeptide, which is also capable of catalysing at least one of reactions XXII, XXIII or XXIV described above.
CfTPS4
The invention involves use of a diTPS of class I. In one embodiment said diTPS of class I may be a CfTPS4 like diTPS. In embodiments of the invention, wherein the diTPS of class I is a CfTPS4 like diTPS, then it may be preferred that the diTPS of class II is not CfTPS2 or SsLPPSor a functional homologue of any of the
aforementioned sharing at least 70% sequence identity therewith. Thus, in
embodiments of the invention, wherein the diTPS of class I is CfTPS4, then it is preferred that the diTPS of class II is not CfTPS2 or SsLPPS. In particular, said diTPS of class I may be a CfTPS4 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure. For example said diTPS of class I may be a CfTPS4 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas VI, IX, XXXV, XXXVI, II, XXXVII, XXXVIII, XXXIX or XL:
Figure imgf000049_0001
Dependent on the structure of the diterpene pyrophosphate intermediate then the diterpene containing a core of formula VI, IX, XXXV, II, or XXXIX may have different stereochemistry. In general the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by the CfTPS4 like diTPS. The CfTPS4 like diTPS may be any enzyme capable of catalysing the reaction XXIII:
Diterpene pyrophosphate intermediate containing a decalin core structure ►
Diterpene containing a core structure of formula VI or formula IX, XXXV, XXXVI, II, XXXVII, XXXVIII, XXXIX or XL
The CfTPS4 like diTPS may in particular be an enzyme capable of catalysing the reaction XXIV:
Figure imgf000050_0001
wherein OPP indicates diphosphate. During reaction XXIV the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The CfTPS4 like diTPS may in particular be an enzyme capable of catalysing the reaction XXII:
Figure imgf000050_0002
wherein OPP is diphosphate. During reaction XXII the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The CfTPS4 like diTPS may in particular be an enzyme capable of catalysing the reaction LIN:
Figure imgf000051_0001
wherein OPP is diphosphate. During reaction LIN the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The CfTPS4 like diTPS may in particular be an enzyme capable of catalysing the reaction LIV:
Figure imgf000051_0002
wherein OPP is diphosphate. During reaction LIV the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
In one embodiment the CfTPS4 like diTPS may be a diterpene synthase from Coleus forskohlii. In particular, the CfTPS4 like diTPS may be a TPS4 from Coleus forskohlii. TPS4 from Coleus forskohlii may also be referred to as CfTPS4. In particular, said CfTPS4 like diTPS may be a polypeptide of SEQ ID NO:13 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. The heterologous nucleic acid encoding said CfTPS4 like diTPS may for example be any nucleic acid encoding any of the CfTPS4 like diTPS described herein. Thus, the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:13 or any of functional homologue thereof described in the previous paragraph.
The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of CfTPS4 is a polypeptide, which is also capable of catalysing at least one of reactions XXII, XXIII or XXIV described above.
MvTPS5
The invention involves use of a diTPS of class I. In one embodiment said diTPS of class I may be a MvTPS5 like diTPS.
In particular, said diTPS of class I may be a MvTPS5 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure. For example said diTPS of class I may be a MvTPS5 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas VI, IX, XXXV, XXXVI, Ι Ι,ΧΧΧνΐ Ι, XXXVIII, XXXIX, XL, III or XXXII:
or
Figure imgf000052_0001
Figure imgf000053_0001
Dependent on the structure of the diterpene pyrophosphate intermediate then the diterpene containing a core of formula VI, IX, XXXV, II, XXXIX or III, may have different stereochemistry. In general the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by the MvTPS5 like diTPS.
The MvTPS5 like diTPS may be any enzyme capable of catalysing the reaction XXIII:
Diterpene pyrophosphate intermediate containing a decalin core structure ►
Diterpene containing a core structure of formula VI, IX, XXXV, XXXVI, Ι Ι ,ΧΧΧνΐ Ι,
XXXVIII, XXXIX, XL, III or XXXII.
The MvTPS5 like diTPS may in particular be an enzyme capable of catalysing the reaction LV:
Figure imgf000053_0002
wherein OPP indicates diphosphate. During reaction LV the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate. The MvTPS5 like diTPS may in particular be an enzyme capable of catalysing the reaction LVI:
Figure imgf000054_0001
wherein OPP is diphosphate. During reaction LVI the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate. The MvTPS5 like diTPS may in particular be an enzyme capable of catalysing the reaction LIN:
Figure imgf000054_0002
wherein OPP is diphosphate. During reaction LIN the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The MvTPS5 like diTPS may in particular be an enzyme capable of catalysing the reaction LIV:
Figure imgf000055_0001
wherein OPP is diphosphate. During reaction LIV the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The MvTPS5 like diTPS may also be an enzyme catalysing the reaction X:
Figure imgf000055_0002
wherein OPP indicates diphosphate. During reaction X the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
In one embodiment the MvTPS5 like diTPS may be a diterpene synthase from
Marrubium vulgare. In particular, the MvTPS5 like diTPS may be a TPS5 from
Marrubium vulgare. TPS5 from Marrubium vulgare may also be referred to as MvTPS5. In particular, said MvTPS5 like diTPS may be a polypeptide of SEQ ID NO:46 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of MvTPS5 is a polypeptide, which is also capable of catalysing at least one of reactions LIN, LIV, LV or LVI described above. TwTPS2
The invention involves use of a diTPS of class I. In one embodiment said diTPS of class I may be a TwTPS2 like diTPS. In particular, said diTPS of class I may be a TwTPS2 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure. For example said diTPS of class I may be a TwTPS2 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas IV, V or X:
Figure imgf000056_0001
Dependent on the structure of the diterpene pyrophosphate intermediate then the diterpene containing a core of formula IV and V, may have different stereochemistry. In general the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by the TwTPS2 like diTPS.
The TwTPS2 like diTPS may be any enzyme capable of catalysing the reaction XXVI:
Diterpene pyrophosphate intermediate containing a decalin core structure ►
Diterpene containing a core structure of formula IV or formula V or formula X
The TwTPS2 like diTPS may be any enzyme capable of catalysing conversion of a diterpene pyrophosphate intermediate to a diterpene containing a core of either formula IV or V. The TwTPS2 like diTPS may in particular be an enzyme capable of catalysing the reaction XIX:
Figure imgf000057_0001
wherein OPP is diphosphate. During reaction XIX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The TwTPS2 like diTPS may in particular be an enzyme capable of catalysing the reaction XXVII:
Figure imgf000057_0002
wherein OPP is diphosphate. During reaction XIX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The TwTPS2 like diTPS may in particular be an enzyme capable of catalysing the reaction XX:
Figure imgf000058_0001
wherein OPP indicated diphosphate. During reaction XX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
In one embodiment the TwTPS2 like diTPS may be a diterpene synthase from
Tripterygium wilfordii. In particular, the TwTPS2 like diTPS may be a TPS2 from Tripterygium wilfordii. TPS2 from Tripterygium wilfordii may also be referred to as TwTPS2. In particular, said TwTPS2 like diTPS may be a polypeptide of SEQ ID NO:14 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
The heterologous nucleic acid encoding said TwTPS2 like diTPS may for example be any nucleic acid encoding any of the TwTPS2 like diTPS described herein. Thus, the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:14 or any of functional homologue thereof described in the previous paragraph.
The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of TwTPS2 is a polypeptide, which is also capable of catalysing at least one of reactions, XIX, XX, XXVI or XXVII described above.
EpTPSI
The invention involves use of a diTPS of class I. In one embodiment said diTPS of class I may be an EpTPSI like diTPS. In particular, said diTPS of class I may be an EpTPSI like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure For example said diTPS of class I may be an EpTPSI like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas IV or V:
Figure imgf000059_0001
Dependent on the structure of the diterpene pyrophosphate intermediate then the diterpene containing a core of formula IV and V, may have different stereochemistry. In general the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by the EpTPSI like diTPS. The EpTPSI like diTPS may be any enzyme capable of catalysing the reaction XVIII:
Diterpene pyrophosphate intermediate containing a decalin core structure ►
Diterpene containing a core structure of formula IV or formula V The EpTPSI like diTPS may be any enzyme capable of catalysing conversion of a diterpene pyrophosphate intermediate to a diterpene containing a core of either formula IV or V. The EpTPSI like diTPS may in particular be an enzyme capable of catalysing the reaction XIX:
Figure imgf000059_0002
wherein OPP is diphosphate. During reaction XIX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate. The EpTPSI like diTPS may in particular be an enzyme capable of catalysing the reaction XX:
Figure imgf000060_0001
wherein OPP indicated diphosphate. During reaction XX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
In one embodiment the EpTPSI like diTPS may be a diterpene synthase from
Euphobia peplus. In particular, the EpTPSI like diTPS may be a TPS1 from Euphobia peplus. TPS1 from Euphobia peplus may also be referred to as EpTPSI . In particular, said EpTPSI like diTPS may be a polypeptide of SEQ ID NO:15 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
The heterologous nucleic acid encoding said EpTPSI like diTPS may for example be any nucleic acid encoding any of the EpTPSI like diTPS described herein. Thus, the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:15 or any of functional homologue thereof described in the previous paragraph. The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of EpTPSI is a polypeptide, which is also capable of catalysing at least one of reactions XVIII, XIX or XX described above. CfTPS14
The invention involves use of a diTPS of class I. In one embodiment said diTPS of class I may be an CfTPS14 like diTPS. In particular, said diTPS of class I may be an CfTPS14 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a tricyclic ring structure. For example said diTPS of class I may be an CfTPS14 like diTPS in embodiments of the invention, wherein the diterpene to be produced contains a core of any of the formulas IV or V:
Figure imgf000061_0001
Dependent on the structure of the diterpene pyrophosphate intermediate then the diterpene containing a core of formula IV and V, may have different stereochemistry. In general the stereochemistry of the decalin core present in the diterpene pyrophosphate intermediate is maintained after the reaction catalysed by the CfTPS14 like diTPS.
The CfTPS14 like diTPS may be any enzyme capable of catalysing the reaction XVIII:
Diterpene pyrophosphate intermediate containing a decalin core structure ►
Diterpene containing a core structure of formula IV or formula V
The CfTPS14 like diTPS may be any enzyme capable of catalysing conversion of a diterpene pyrophosphate intermediate to a diterpene containing a core of either formula IV or V. The CfTPS14 like diTPS may in particular be an enzyme capable of catalysing the reaction XIX:
Figure imgf000062_0001
wherein OPP is diphosphate. During reaction XIX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate.
The CfTPS14 like diTPS may in particular be an enzyme capable of catalysing the reaction XX:
Figure imgf000062_0002
wherein OPP indicated diphosphate. During reaction XX the produced diterpene will in general maintain the stereochemistry around the decalin core found in the starting diterpene pyrophosphate intermediate. In one embodiment the CfTPSI 4 like diTPS may be a diterpene synthase from Coleus forskohlii. In particular, the CfTPSI 4 like diTPS may be a TPS14 from Coleus forskohlii. TPS14 from Coleus forskohlii may also be referred to as CfTPSI 4. In particular, said CfTPSI 4 like diTPS may be a polypeptide of SEQ ID NO:16 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. The heterologous nucleic acid encoding said CfTPS14 like diTPS may for example be any nucleic acid encoding any of the CfTPS14 like diTPS described herein. Thus, the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:16 or any of functional homologue thereof described in the previous paragraph.
The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of CfTPS14 is a polypeptide, which is also capable of catalysing at least one of reactions XVIII, XIX or XX described above.
CYP
The methods of the invention use one of more CYPs. The invention also features host organisms comprising one or more heterologous nucleic acid encoding a CYP. The term "Cytochrome P450" as herein refers to an enzyme of the Cytochrome P450 family. Cytochrome P450s may also be referred to as "CYPs" and one cytochrome P450 may be referred to as a CYP. CYPs according to the present invention are enzymes capable of catalyzing oxidation reactions using NAD(P)H as electron donor. CYPs may for example be enzymes capable of catalyzing one or more of the following reactions:
Hydroxylation;
Epoxidation;
Dealkylation;
Isomerization;
Dimerization;
Dehydration;
Carbon-carbon cleavage;
Decarboxylations;
Nitrogen and sulfur oxidation
Dehalogenations
Deaminations
Alkyl- and aryl-shifts, ,
Heteroatom oxidation
Heteroatom release Ring coupling formation and contraction; and
Dehalogenation.
Preferred CYPs according to the present invention are hemoproteins capable of catalyzing oxidation reactions that utilize NADPH and/or NADH to reductively cleave atmospheric dioxygen to produce a functionalized organic substrate and a molecule of water.
Thus, a preferred CYP according to the invention is an enzyme capable of catalyzing one or more of the following reactions:
Hydroxylation
Oxidation leading to a formation of a carbonyl grops
Oxidation leading to formation of a carboxyl group. As stated herein above the methods of the invention may involve use of one or more CYPs. However, at least one CYP is preferably and enzyme capable of catalyzing the reaction XXX:
Diterpene ► hydroxyl-diterpene, wherein the diterpene for example may be any of the diterpenes described in the section "Diterpenes" herein below. In particular, at least one CYP is an enzyme capable of catalyzing hydroxylation of a diterpene containing a core structure of one of following formulas I, II, III, IV, V, VI, IX, X, XXII, XXIII or XXIV, for example any one of the diterpenes described in Table I.
Thus in embodiments of the invention, wherein the diterpenoid is a hydroxyl-diterpene, then the CYP preferably is an enzyme capable of catalysing reaction XXX. The CYP may also be an enzyme capable of catalysing hydroxylation of a hydroxyl- diterpene to form dihydroxy-diterpene and/or diterpene ketone and/or diterpene aldehyde. Thus, the CYP may be an enzyme capable of catalysing:
Reaction XXXI: Hydroxyl-diterpene ► dihydroxy-diterpene
Reaction XXXII: Hydroxyl-diterpene ► diterpene ketone Reaction XXXIII: Hydroxyl-diterpene ► diterpene aldehyde
Said hydroxyl-diterpene may in particular be a hydroxyl-diterpene containing a core structure of one of following formulas 1, 11, 111, IV, V, VI, IX, X, XXII, XXIII or XXIV, for example it may be hydroxyl-diterpene, wherein the diterpene is any one of the diterpenes described in Table I. Preferably, the hydroxyl groups of said dihydroxy- diterpene are attached to different carbon atoms of the diterpene.
In reaction XXXII, then preferably, the ketone group of the diterpene ketone is positioned on the same carbon atom as the hydroxyl group of the hydroxyl-diterpene. Furthermore, it is preferred that said carbon atom is a carbon atom attached to exactly two hydrogen atoms in the precursor diterpene.
In reaction XXXIII, then preferably, the aldehyde group of the diterpene aldehyde is positioned on the same carbon atom as the hydroxyl group of the hydroxyl-diterpene. Furthermore, it is preferred that said carbon atom, is a carbon atom attached to exactly 3 hydrogen atoms in the precursor diterpene.
In embodiments of the invention wherein the diterpenoid is dihydroxy-diterpene, then the methods preferably involve use of a CYP, which is capable of catalysing reaction XXX and a CYP, which is capable of catalysing reaction XXXI. Said CYP may be one CYP capable of catalysing both reaction or it may be two different CYPs.
In embodiments of the invention wherein the diterpenoid is diterpene ketone, then the methods preferably involve use of one or more CYP(s), which can catalyse reactions XXX and XXXII.
In embodiments of the invention wherein the diterpenoid is diterpene aldehyde, then the methods involve use of one or more CYP(s), which can catalyse reactions XXX and XXXIII.
The CYP may also be an enzyme capable of catalysing hydroxylation of a dihydroxy- diterpene to form trihydroxy-diterpene and/or diterpene carboxylic acid. Thus, the CYP may be an enzyme capable of catalysing: Reaction XXXIV: Dihydroxy-diterpene trihydroxy-diterpene
Reaction XXXVa: Dihydroxy-diterpene Hydroxy-diterpene ketone
Reaction XXXVb: Dihydroxy-diterpene Hydroxy-diterpene carboxylic acid Said dihydroxy-diterpene may in particular be a dihydroxy-diterpene containing a core structure of one of following formulas 1, 11, 111, IV, V, VI, IX, X, XXII, XXIII or XXIV, for example it may be dihydroxy-diterpene, wherein the diterpene is any one of the diterpenes described in Table I. Preferably, the hydroxyl groups of said dihydroxy- diterpene are attached to different carbon atoms of the diterpene. Similarly, it is preferred that the hydroxyl groups of the trihydroxy-diterpenene are attached to different carbon atoms.
In reaction XXXVa, then preferably, the ketone group of the hydroxy-diterpene ketone is positioned on the same carbon atom as one of the hydroxyl group of the dihydroxy- diterpene. Furthermore, it is preferred that said carbon atom, is a carbon atom attached to exactly 2 hydrogen atoms in the precursor diterpene.
In reaction XXXVb, then preferably, the carboylix acid group of the hydroxy-diterpene carboxylic acid is positioned on the same carbon atom as one of the hydroxyl group of the dihydroxy-diterpene. Furthermore, it is preferred that said carbon atom, is a carbon atom attached to exactly 3 hydrogen atoms in the precursor diterpene.
In embodiments of the invention, wherein the diterpenoid is trihydroxy-diterpene, then the methods may involve use of a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXI and of a CYP capable of catalysing reaction XXXIV. This may be different CYPs, or one CYP can be capable of catalysing two or all three of said reactions. In embodiments of the invention wherein the diterpenoid is a hydroxyl-diterpene carboxylic acid, then the methods may involve use of a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXI and of a CYP capable of catalysing reaction XXXV. This may be different CYPs, or one CYP can be capable of catalysing two or all three of said reactions. The CYP may also be an enzyme capable of catalysing hydroxylation of diterpene- ketone to form hydroxyl-diterpene ketone and/or diterpene carboxylic acid. Thus, the CYP may be an enzyme capable of catalysing: Reaction XXXVI: diterpene ketone ► hydroxyl-diterpene ketone
Reaction XXXVII: diterpene ketone ► diterpene carboxylic acid
Said diterpene ketone may in particular be a diterpene substituted with a ketone group, wherein the diterpene contains a core structure of one of following formulas I, II, III, IV, V, VI, IX, X, XXII, XXIII or XXIV, for example it may be any one of the diterpenes described in Table I. It is preferred that the ketone group and the hydroxyl group of the hydroxyl-diterpene ketone are attached to different carbon atoms.
In reaction XXXVII, then preferably, the ketone group of the diterpene ketone is positioned on the same carbon atom as the carboxylic acid of the diterpene carboxylic acid. Furthermore, it is preferred that said carbon atom, is a carbon atom attached to exactly 3 hydrogen atoms in the precursor diterpene.
In embodiments of the invention wherein the diterpenoid is hydroxy-diterpene ketone, then the methods may involve use of a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXII and of a CYP capable of catalysing reaction XXXVI. This may be different CYPs, or one CYP can be capable of catalysing two or all three of said reactions. In embodiments of the invention wherein the diterpenoid is diterpene carboxylic acid, then the methods may involve use of a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXII and of a CYP capable of catalysing reaction XXXVII. This may be different CYPs, or one CYP can be capable of catalysing two or all three of said reactions.
The CYP may also be an enzyme capable of catalysing hydroxylation of diterpene- carboxylic acid to form hydroxyl-diterpene carboxylic acid. Thus, the CYP may be an enzyme capable of catalysing:
Reaction XXXVIII: diterpene carboxylic acid hydroxyl-diterpene carboxylic acid Said diterpene carboxylic acid may in particular be a diterpene substituted with a carboxyl group, wherein the diterpene contains a core structure of one of following formulas 1, 11, 111, IV, V, VI, IX, X, XXII, XXIII or XXIV, for example it may be any one of the diterpenes described in Table I.The carboxylic acid group will generally be attached to a different carbon atom as the hydroxyl group.
In embodiments of the invention wherein the diterpenoid is hydroxyl-diterpene carboxylic acid, then the methods may involve use of a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXI and of a CYP capable of catalysing reaction XXXV and of a CYP capable of catalysing reaction XXXVIII. This may be different CYPs, or one CYP can be capable of catalysing two, three or even all of said reactions. In embodiments of the invention wherein the diterpenoid is hydroxyl-diterpene carboxylic acid, then the methods may involve use of a CYP capable of catalysing reaction XXX and of a CYP capable of catalysing reaction XXXII and of a CYP capable of catalysing reaction XXXVII and of a CYP capable of catalysing reaction XXXVIII. This may be different CYPs, or one CYP can be capable of catalysing two, three or even all of said reactions.
In addition to above mentioned reactions the methods may also employ use of one or more CYPs capable of catalysing one or more of the following reactions: Trihydroxy-diterpene ► tetrahydroxy-diterpene
Trihydroxy-diterpene ► dihydroxy-diterpene ketone
Trihydroxy-diterpene ► dihydroxy-diterpene carboxylic acid
hydroxyl-diterpene ketone ► dihydroxy -diterpene ketone
hydroxyl-diterpene ketone ► diterpene-di-ketone
hydroxyl-diterpene carboxylic acid ► dihydroxy-diterpene carboxylic acid hydroxyl-diterpene carboxylic acid ► diterpene-ketone-carboxylic acid tetrahydroxy-diterpene ► pentahydroxy-diterpene
tetrahydroxy-diterpene ► trihydroxy-diterpene ketone
tetrahydroxy-diterpene ► tri-hydroxy-diterpene carboxylic acid
dihydroxy-diterpene ketone ► trihydroxy-diterpene ketone dihydroxy-diterpene ketone ► hydroxy-diterpene di-ketone dihydroxy-diterpene ketone ► hydroxyl-diterpene ketone carboxylic acid CYPs are encoded by gene superfamily, which is divided into families sharing at least 40% sequence identity. The families are divided into subfamilies sharing at least 55% sequence identity. The CYP families is has a number, which generally is written after "CYP". Thus, by way of example CYPs of family 74 are named CYP74. The subfamilies are indicated by a capital letter after the family number. Thus by way of example a CYP of family 74 and subfamily A is named CYP74A. Additional description of CYPs, the structural characteristics and the nomenclature thereof may for example be found in Schuler et al., Annu Rev. Plant Biol., 2003, 54:629-67 and in Podust et al., 2012, Nat. Prod. Rep., 29:1251 -1266. Thus, the CYP to be used with the present invention may be a CYP as described in any of these references.
In one embodiment of the invention the CYP is a polypeptide sharing at least some sequence similarity to at least one of SEQ ID NO:20, SEQ ID NO:21 , SEQ ID NO:22 or SEQ ID NO:23. In particular, it is preferred that the CYP shares at least 30%, preferably at least 40%, more preferably at least 45% sequence identity with at least one of SEQ ID NO:20, SEQ ID NO:21 , SEQ ID NO:22, or SEQ ID NO:23.
Furthermore, it is preferred that the CYP contains the following motif of five amino acids:
A/G-G-X-X-T/S,
wherein X may be any amino acid, such as any naturally occurring amino acids. In particular, one of the X may be an amino acid with a charged side chain, and in particular an acidic side chain, such as glutamic acid (E).
A/G indicates that the amino acid may be A or G. Similarly, T/S indicates that the amino acid may be T or S.
In one embodiment of the invention said motif of five amino acids is:
A/G-X-E-T/S
wherein X may be any amino acid, such as any naturally occurring amino acids.
Said motif of five amino acids may for example be A/G-G-H/S/G-E-T/S.
H/S/G indicates that the amino acid may be H or S or G. In particular, it is preferred that the CYP contains said motif in a position corresponding to of position aa 290-294 of PsCYP720B4 SEQ ID NO:23. A position corresponding to position aa 290-294 of PsCYP720B4 SEQ ID NO:23 is identified by aligning the sequence of a CYP of interest to SEQ ID NO:23 and optionally to additional sequences of CYPs as e.g. shown in figure 6, and identifying the amino acids of said CYP aligned with aa 290-294of PsCYP720B4 SEQ ID NO:23. This motif is shown as "1 " in figure 6.
Furthermore, it is preferred that the CYP contains the following motif 4 amino acids:
E-X-X-R,
wherein X may be any amino acid, such as any naturally occurring amino acids. In particular, X may be an amino acid with an uncharged side chain, such as an hydrophobic side chain.
In one embodiment of the invention said motif of four amino acids is:
E-X-L-R
wherein X may be any amino acid, such as any naturally occurring amino acids.
Said motif of five amino acids may for example be
E-T/V/M/A-L-R.
T/V/M/A indicates that the amino acid may be T or V or M or A.
In particular, it is preferred that the CYP contains said motif in a position corresponding to of position aa 350-353 of PsCYP720B4 SEQ ID NO:23. A position corresponding to position aa 350-353 of PsCYP720B4 SEQ ID NO:23 is identified by aligning the sequence of a CYP of interest to SEQ ID NO:23 and optionally to additional sequences of CYPs as e.g. shown in figure 6, and identifying the amino acids of said CYP aligned with aa 350-353 of PsCYP720B4 SEQ ID NO:23. This motif is shown as "2" in figure 6. Furthermore, it is preferred that the CYP contains the following motif following motif of 10 amino acids:
F-X-X-G-X-X-X-C-X-G,
wherein X may be any amino acid, such as any naturally occurring amino acids.
In one embodiment of the invention said motif of 10 amino acids is:
F-G-G/A-G-A/R-R-X-C-P-G,
wherein X may be any amino acid, such as any naturally occurring amino acids.
G/A indicates that the amino acid may be G or A and A/R indicates that the amino acid may be A or R. In particular, it is preferred that the CYP contains said motif in a position corresponding to of position aa 422-431 of PsCYP720B4 SEQ ID NO:23. A position corresponding to position aa 422-431 of PsCYP720B4 SEQ ID NO:23 is identified by aligning the sequence of a CYP of interest to SEQ ID NO:23 and optionally to additional sequences of CYPs as e.g. shown in figure 6, and identifying the amino acids of said CYP aligned with aa 422-431 of PsCYP720B4 SEQ ID NO:23. This motif is shown as "3" in figure 6.
Furthermore, it is preferred that the CYP contains the following motif following motif of 3 amino acids: P-F-G
In particular, it is preferred that the CYP contains said motif in a position corresponding to of position aa 421 -423 of PsCYP720B4 SEQ ID NO:23. A position corresponding to position aa 421 -423of PsCYP720B4 SEQ ID NO:23 is identified by aligning the sequence of a CYP of interest to SEQ ID NO:23 and optionally to additional sequences of CYPs as e.g. shown in figure 6, and identifying the amino acids of said CYP aligned with aa 421 -423of PsCYP720B4 SEQ ID NO:23. This motif may be overlapping with the motif of 10 amino acids described above. The motif is shown as "4" in figure 6. In addition to the motifs described above, the CYP may contain one or more of the following motifs: SRS1 : P/V-X-X-X-K-(X)7-9-F
In particular, it is preferred that the CYP contains SRS1 in a position corresponding to of position aa 100-1 14 of PsCYP720B4 SEQ ID NO:23. The motif is shown as "SRS1 " in figure 6.
SRS5: N/P-F/P-G/L-P-X-V/L-X-X-R In particular, it is preferred that the CYP contains SRS5 in a position corresponding to of position aa 356-360 of PsCYP720B4 SEQ ID NO:23. The motif is shown as "SRS5" in figure 6.
The CYP may also contain one or more of the motifs SRS2, SRS3 and/or SRS6 shown in figure 6.
It is furthermore preferred that in addition to sharing above mentioned sequence identity and/or containing one or more said motifs (preferably containing all of said motifs), then as many as possible of the amino acids shown in a black box in figure 6 are retained. Thus, when aligned to the sequence of of PsCYP720B4 (SEQ ID NO:23), then preferably the CYP also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids shown in a black box in figure 6. It is also preferred that in addition to sharing above mentioned sequence identity and/or containing one or more said motifs, then as many as possible of the amino acids shown in a grey box in figure 6 are retained. Thus, when aligned to the sequence of of PsCYP720B4 (SEQ ID NO:23), then preferably the CYP also contains at least 80%, more preferably at least 90%, for example at least 95%, such as at least 98% of the amino acids shown in a grey box in figure 6. In one embodiment of the invention the CYP may be one of the following CYPs:
i. A CYP of family 76, such as any of the CYPs described herein below in the section "CYP76"
ii. A CYP of family 71 , such as any of the CYPs described herein below in the section "CYP71 " iii. A CYP of family 720, such as any of the CYPs described herein below in the section "CYP720"
CYP76
In one embodiment of the invention at least one CYP may be a CYP of family 76. A
CYP of family 76 may preferably be capable of catalysing reaction XXX outlined above. In particular a CYP of family 76 may be an enzyme capable of catalysing one or more of the following reactions (preferably said CYP of family 76 is capable of catalysing all of the following reactions):
i. Reaction XXXIX: hydroxylation of dehydroabietadiene to form hydroxyl dehydroabientadiene
ii. Reaction XL: Rehydroxylation of miltiradiene to form hydroxyl mitradiene iii. Reaction XLI: hydroxylation of manool to form hydroxyl manool The CYP of family 76 may also be caoable of catalysing one or more of the following reactions:
Reaction XLV: manool -> keto-hydroxy manool
Reaction XLVI: manool -> dihydroxy manool
Reaction XLVII: manool -> carboboxy manool
Reaction LI: hydroxylation of hydroxy manool to form dihydroxymanool
In one embodiment the CYP of family 76 may be a CYP from Coleus forskohlii. In particular, said CYP of family 76 may be a polypeptide sharing at least 40%, such as at least 45%, for example at least 60%, such as at least 55% sequence identity with SEQ ID NO:20 and/or SEQ ID NO:21 .
In particular the CYP of family 76 may be a CYP of subfamily AH. Thus said CYP may be a CYP76AH. Said CYP of family 76 and subfamily AH may be a polypeptide sharing at least 55%, such as at least 60%, for example at least 65%, such as at least 70% sequence identity with SEQ ID NO:20 and/or SEQ ID NO:21 .
In a preferred embodiment the CYP of family 76 is CYP76AH8 of SEQ ID NO:20, CYP76AH1 1 of SEQ ID NO:21 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
In another embodiment the CYP of family 76 is CYP76AH15 of SEQ ID NO:40, CYP76AH17 of SEQ ID NO:41 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. CYP76AH15 of SEQ ID NO:40 and CYP76AH17 of SEQ ID NO:41 have an activity similar to CYP76AH8 of SEQ ID NO:20, and thus CYP76AH15 of SEQ ID NO:40 or CYP76AH17 of SEQ ID NO:41 or functional homologues thereof may be employed in place of CYP76AH8 of SEQ ID NO:20 or functional homologues thereof.
The heterologous nucleic acid encoding said CYP of family 76 may for example be any nucleic acid encoding any of the CYPs of family 76 described herein. Thus, the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:20, or the polypeptide of SEQ ID NO:21 or any of the functional homologue of the aforementioned described in the previous paragraph. For example, said heterologous nucleic acid may comprise or consist of SEQ ID NO:35. Said heterologous nucleic acid may also comprise or consist of SEQ ID NO:36. This may in particular be the case in embodiments of the invention wherein the host organism is yeast, e.g. S. cerevisiae.
In another embodiment the CYP of family 76 is CYP76AH9 of SEQ ID NO:24 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. The heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:24 or any of functional homologue thereof described in the previous paragraph.
In another embodiment the CYP of family 76 is CYP76AH1 , such as CYP76AH1 of SEQ ID NO:42 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
The heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:42 or any of functional homologue thereof described in the previous paragraph. For example the heterologous nucleic acid may be the nucleic acid of sequence with the Genbank accession number KP337687 . ι .
In another embodiment the CYP of family 76 is CYP76AH4, such as CYP76AH4 of SEQ ID NO:43 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. Said functional homologue may for example be the protein with Genbank accession number AJQ30187.1 or the protein with Genbank accession number AJQ30188.1 . The heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:43 or any of functional homologue thereof described in the previous paragraph. For example the heterologous nucleic acid may be the nucleic acid of sequence with the Genbank accession number KP091843.1 of the Genbank accession number KP091844.1 .
The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of CYP76AH8 or CYP76AH1 1 or
CYP76AH9 is a polypeptide, which is also capable of catalysing at least one of reactions XXXIX, XL, XLI, XLV, XLVI, XLVII, LI described above. It is furthermore preferred that in addition to sharing above mentioned sequence identity, then as many as possible of the amino acids shown in a black box in figure 6 are retained. Thus, when aligned to the sequence of CYP76AH8 (SEQ ID NO:20) and/or CYP76AH1 1 (SEQ ID NO:21 ), then preferably the CYP of family 76 also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids shown in a black box in figure 6. Furthermore, when aligned to the sequence of CYP76AH8 (SEQ ID NO:20) and/or CYP76AH1 1 (SEQ ID NO:21 ), then preferably the CYP of family 76 also contains at least 80%, more preferably at least 90%, for example at least 95%, such as at least 98% of the amino acids shown in a grey box in figure 6.
CYP71
In one embodiment of the invention at least one CYP may be a CYP of family 71 . A CYP of family 71 may preferably be capable of catalysing reaction XXX outlined above. In particular a CYP of family 71 may be an enzyme capable of catalysing one or more of the following reactions (preferably said CYP of family 71 is capable of catalysing all of the following reactions):
i. Reaction XXXIX: Oxidation of dehydroabietadiene and/or mitradiene ii. Reaction XLII: hydroxylation of syn-manool to form hydroxy syn-manool Reaction LI: hydroxylation of hydroxy manool to form dihydroxymanool
iii.
In one embodiment the CYP of family 71 may be a CYP from Coleus forskohlii. In particular, said CYP of family 71 may be a polypeptide sharing at least 40%, such as at least 45%, for example at least 60%, such as at least 55% sequence identity with SEQ ID NO:22.
In particular the CYP of family 71 may be a CYP of subfamily D. Thus said CYP may be a CYP71 D. Said CYP of family 71 and subfamily D may be a polypeptide sharing at least 55%, such as at least 60%, for example at least 65%, such as at least 70% sequence identity with SEQ ID NO:22.
In a preferred embodiment the CYP of family 71 is CYP71 D381 of SEQ ID NO:22 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith. The heterologous nucleic acid encoding said CYP of family 71 may for example be any nucleic acid encoding any of the CYPs of family 71 described herein. Thus, the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:22 or any of functional homologue thereof described in the previous paragraph. For example, said heterologous nucleic acid may comprise or consist of SEQ ID NO:34.
The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of CYP71 D381 is a polypeptide, which is also capable of catalysing at least one of reactions XXXIX, XL, XLII or LI described above.
It is furthermore preferred that in addition to sharing above mentioned sequence identity, then as many as possible of the amino acids shown in a black box in figure 6 are retained. Thus, when aligned to the sequence of CYP71 D381 (SEQ ID NO:22), then preferably the CYP of family 71 also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids shown in a black box in figure 6. Furthermore, when aligned to the sequence of CYP71 D381 (SEQ ID NO:22), then preferably the CYP of family 71 also contains at least 80%, more preferably at least 90%, for example at least 95%, such as at least 98% of the amino acids shown in a grey box in figure 6.
CYP720
In one embodiment of the invention at least one CYP may be a CYP of family 720. A CYP of family 720 may preferably be capable of catalysing reaction XXX outlined above. In particular a CYP of family 720 may be an enzyme capable of catalysing the following reaction:
i. Reaction XLI: hydroxylation of manool to form hydroxyl manool
The CYP of family 720 may also be an enzyme capable of catalysing one of both of the following reactions: Reaction XLIII: dehydroabietadiene -> dehydroabietic acid
Reaction XLIV: miltradiene -> miltiradienic acid
Reaction XLVII hydroxylation of syn-pimara-9,(1 1 ),15-diene OR syn- isoprimara-7,15-diene OR debydroabietadiene to form hydroxy syn-pimara-9,(1 1 ),15-diene OR hydroxy syn- isoprimara-7,15-diene OR hydroxy debydroabietadiene, respectively
Reaction XLIX: addition of keto group to syn-pimara-9,(1 1 ),15-diene OR syn-isoprimara-7,15-diene OR debydroabietadiene to form keto syn-pimara-9,(1 1 ),15-diene OR keto syn-isoprimara- 7,15-diene OR keto debydroabietadiene, respectively
Reaction L: carboxylation of syn-pimara-9,(1 1 ),15-diene OR syn- isoprimara-7,15-diene OR debydroabietadiene to form carboxy syn-pimara-9,(1 1 ),15-diene OR carboxy syn- isoprimara-7,15-diene OR carboxy debydroabietadiene, respectively
Reaction LI: hydroxylation of hydroxy manool to form dihydroxymanool Reaction Lll: miltiradiene OR abietatriene -> dehydroabietic acid
In one embodiment the CYP of family 720 may be a CYP from Picea sitchensis. In particular, said CYP of family 720 may be a polypeptide sharing at least 40%, such as at least 45%, for example at least 60%, such as at least 55% sequence identity with SEQ ID NO:23.
In particular the CYP of family 720 may be a CYP of subfamily B. Thus said CYP may be a CYP720B. Said CYP of family 720 and subfamily B may be a polypeptide sharing at least 55%, such as at least 60%, for example at least 65%, such as at least 70% sequence identity with SEQ ID NO:23.
In a preferred embodiment the CYP of family 720 is CYP720B4 of SEQ ID NO:23 or a functional homologue thereof sharing at least 70%, such as at least 80%, for example at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 91 %, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, such as 100% sequence identity therewith.
The heterologous nucleic acid encoding said CYP of family 720 may for example be any nucleic acid encoding any of the CYPs of family 720 described herein. Thus, the heterologous nucleic acid may for example be any nucleic acid encoding the polypeptide of SEQ ID NO:23 or any of functional homologue thereof described in the previous paragraph. For example, said heterologous nucleic acid may comprise or consist of SEQ ID NO:39.
The sequence identity is preferably calculated as described herein below in the section "Sequence identity". A functional homologue of CYP720B4 is a polypeptide, which is also capable of catalysing one, two or all of reactions XLI, XLIII, XLIV, XLVIII, XLIX, L, LI or LI I described above.
It is furthermore preferred that in addition to sharing above mentioned sequence identity, then as many as possible of the amino acids marked with a black box in figure 6 are retained. Thus, when aligned to the sequence of CYP720B4 (SEQ ID NO:23), then preferably the CYP of family 720 also contains at least 80%, more preferably at least 90%, for example at least 95%, such as all of the amino acids marked by a black box in figure 6. Furthermore, when aligned to the sequence of CYP720B4 (SEQ ID NO:23), then preferably the CYP of family 720 also contains at least 80%, more preferably at least 90%, for example at least 95%, such as at least 98% of the amino acids shown in a grey box in figure 6.
Additional recombinant modifications
The host organisms according to the present invention may also be recombinantly modified in addition to comprising the heterologous nucleic acids encoding a diTPS of class I and a diTPS of class II and one or more CYPs as described herein.
For example the host organism may be modified to increase the pool of GGPP. As described herein elsewhere, GGPP is the starting compound for production of diterpenes. Thus, if the host organism is modified to increase the pool of GGPP, then frequently, the host organism will be capable of producing increased amounts of diterpene.
Various methods for increasing the pool of GGPP are well known in the art. These includes methods of reducing the activity of enzymes reducing the level of GGPP.
In one embodiment the pool of GGPP is increased by expression of one or more enzymes involved in synthesis of GGPP. Thus, it may be preferred that the host organism comprises a heterologous nucleic acid encoding GGPP synthase (GGPPS). Said GGPPS may be any GGPPS, e.g. BTS1 of S. cerevisiae.
In particular, the GGPPS may be the GGPPS described by Zhou, Y. J., W. Gao, Q. Rong, G. Jin, H. Chu, W. Liu, W. Yang, Z. Zhu, G. Li, G. Zhu, L. Huang and Z. K. Zhao (2012). "Modular Pathway Engineering of Diterpenoid Synthases and the Mevalonic Acid Pathway for Miltiradiene Production." Journal of the American Chemical Society 134(6): 3234-3241 .
Accordingly, the host organism may express a fusion of SmCPS and SmKSL, and/or a fusion of BTS1 (GGPP synthase) and ERG20 (fa nesyl diphosphate synthase) as described in Zhou et al., 2012.
The host organism may also comprise a heterologous nucleic acid encoding a GGPPS from a plant, e.g. from Coleus forskohlii. Thus, in one embodiment the host organism comprises:
a) a heterologous nucleic acid encoding Coleus forskohlii deoxyxylulose 5- phosphate synthase (CfDXS) of SEQ ID NO:25 or a functional homologue of any of the aforementioned sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith and/or b) a heterologous nucleic acid encoding Coleus forskohlii
geranylgeranylpyrophosphate synthase (CfGGPPs) of SEQ ID NO:26 or a functional homologue of any of the aforementioned sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
The host organism may also comprise a heterologous nucleic acid encoding Coleus fors/co '/' geranylgeranylpyrophosphate synthase (CfGGPPs) of SEQ ID NO:26 or a functional homologue of any of the aforementioned sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith. The host organism may also comprise a a heterologous nucleic acid encoding GGPP7 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith. The sequence of GGPPS7 of Synechococcus sp. is available under Gene ID:86553638. The host organism may also comprise a heterologous nucleic acid encoding a cytochrome P450 oxidoreductases (POR). PORs are known in the art. Said POR may for example be ATR1 from A. thaliana or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith. The sequence of ATR1 from A. thaliana is available under Gene ID:3150037. Said POR may for example be POR from Stevia rebaudiana or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith. The sequence of POR from S. rebaudiana is available under Gene ID: 9321 1213.
Said POR may also be a CPR, such as CPR from Coleus forskohlii. Thus, the host organism may in addition to the heterologous nucleic acids encoding diTPS and CYPs also contain a heterologous nucleic acic encoding CfCPR of SEQ ID NO:44 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith. Said heterologous nucleic acid may be any heterologous nucleic acid encoding CPR, for example the nucleic acid may comprise or consist of SEQ ID NO:45. Sequence identity
A high level of sequence identity indicates likelihood that the first sequence is derived from the second sequence. Amino acid sequence identity requires identical amino acid sequences between two aligned sequences. Thus, a candidate sequence sharing 80% amino acid identity with a reference sequence, requires that, following alignment, 80% of the amino acids in the candidate sequence are identical to the corresponding amino acids in the reference sequence. Identity according to the present invention is determined by aid of computer analysis, such as, without limitations, the ClustalW computer alignment program (Higgins D., Thompson J., Gibson T., Thompson J.D., Higgins D.G., Gibson T.J., 1994. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680), and the default parameters suggested therein. The ClustalW software is available from as a ClustalW WWW Service at the European Bioinformatics Institute h ttp :// www. eb i . a.c . u k/ci u stal w o r via the software BioEdit. Using this program with its default settings, the mature (bioactive) part of a query and a reference polypeptide are aligned. The number of fully conserved residues are counted and divided by the length of the reference polypeptide. Thus if nothing else is indicated a sequence identity is given over the entire length of the reference polypeptide.
Thus, sequence identity is calculated over the entire length of the reference polypeptide. The ClustalW algorithm may similarly be used to align nucleotide sequences. Sequence identities may be calculated in a similar way as indicated for amino acid sequences
Heterologous nucleic acid
The term "heterologous nucleic acid" as used herein refers to a nucleic acid sequence, which has been introduced into the host organism, wherein said host does not endogenously comprise said nucleic acid. For example, said heterologous nucleic acid may be introduced into the host organism by recombinant methods. Thus, the genome of the host organism has been augmented by at least one incorporated heterologous nucleic acid sequence. It will be appreciated that typically the genome of a recombinant host described herein is augmented through the stable introduction of one or more heterologous nucleic acids encoding one or more diTPS's and/or CYPs. Suitable host organisms include microorganisms, plant cells, and plants, and may for example be any of the host organisms described herein below in the section "Host organism".
In general the heterologous nucleic acid encoding a polypeptide (also referred to as "coding sequence" in the following) is operably linked in sense orientation to one or more regulatory regions suitable for expressing the polypeptide. Because many microorganisms are capable of expressing multiple gene products from a polycistronic mPiNA, multiple polypeptides can be expressed under the control of a single regulatory region for those microorganisms, if desired. A coding sequence and a regulatory region are considered to be operably linked when the regulatory region and coding sequence are positioned so that the regulatory region is effective for regulating transcription or translation of the sequence. Typically, the translation initiation site of the translational reading frame of the coding sequence is positioned between one and about fifty nucleotides downstream of the regulatory region for a monocistronic gene.
"Regulatory region" refers to a nucleic acid having nucleotide sequences that influence transcription or translation initiation and rate, and stability and/or mobility of a transcription or translation product. Regulatory regions include, without limitation, promoter sequences, enhancer sequences, response elements, protein recognition sites, inducible elements, protein binding sequences, 5' and 3' untranslated regions (UTRs), transcriptional start sites, termination sequences, polyadenylation sequences, introns, and combinations thereof. A regulatory region typically comprises at least a core (basal) promoter. A regulatory region also may include at least one control element, such as an enhancer sequence, an upstream element or an upstream activation region (UAR). A regulatory region is operably linked to a coding sequence by positioning the regulatory region and the coding sequence so that the regulatory region is effective for regulating transcription or translation of the sequence. For example, to operably link a coding sequence and a promoter sequence, the translation initiation site of the translational reading frame of the coding sequence is typically positioned between one and about fifty nucleotides downstream of the promoter. A regulatory region can, however, be positioned at further distance, for example as much as about 5,000 nucleotides upstream of the translation initiation site, or about 2,000 nucleotides upstream of the transcription start site. The choice of regulatory regions to be included depends upon several factors, including the type of host organism. It is a routine matter for one of skill in the art to modulate the expression of a coding sequence by appropriately selecting and positioning regulatory regions relative to the coding sequence. It will be understood that more than one regulatory region may be present, e.g., introns, enhancers, upstream activation regions, transcription terminators, and inducible elements.
It will be appreciated that because of the degeneracy of the genetic code, a number of nucleic acids can encode a particular polypeptide; i.e., for many amino acids, there is more than one nucleotide triplet that serves as the codon for the amino acid. Thus, codons in the coding sequence for a given polypeptide can be modified such that optimal expression in a particular host organisms obtained, using appropriate codon bias tables for that host (e.g., microorganism). Nucleic acids may also be optimized to a GC-content preferable to a particular host, and/or to reduce the number of repeat sequences. Such nucleic acids may be referred to as "codon optimized". As isolated nucleic acids, these modified sequences can exist as purified molecules and can be incorporated into a vector or a virus for use in constructing modules for recombinant nucleic acid constructs. Software for preparing nucleic acid sequences codon optimized for expression in a given host organism is commercial available. For example codon optimization may be performed using the Geneart service from
LifeTechnologies or GptimumGene™ Codon Optimization available from GenScript, United States, or GeneArt® Invitrogen.
Diterpene pyrophosphate intermediate The term "decalin" as used herein refers to a compound of the formula VII:
Figure imgf000085_0001
The numbering of carbon atoms provided in formula VII is adhered to throughout this description.
A compound containing or comprising a " decalin core" as used herein refers to a compound comprising above mentioned structure of formula VII, wherein each of the carbon atoms numbered 1 to 10 may be substituted with one or two substituents. It is possible that two of said substituents are fused to form a ring, and thus compound containing or comprising decalin may contain 3 or more rings.
The term "diterpene pyrophosphate intermediate" as used herein refers to a compound, which is the product of bicyclisation of GGPP in a reaction catalysed by a diTPS class II enzyme. The diterpene pyrophosphate intermediate according to the invention contains a decalin core, and comprises a pyrophosphate group.
It is preferred that the diterpene pyrophosphate intermediate of the invention is a compound containing a decalin core, which is substituted at one of more positions with substituents selected from the group consisting of alkyl, alkenyl and hydroxyl, wherein one of said alkyl or alkenyl is substituted with O-pyrophosphate.
The terms "diphosphate" and "pyrophosphate" are used interchangeably herein. The abbreviation "OPP", "-OPP" or "PPO-" as used herein refers to diphosphate. The term "alkyl" as used herein refers to a saturated, straight or branched hydrocarbon chain. The hydrocarbon chain preferably contains of from one to eighteen carbon atoms (Ci-i8-alkyl), more preferred of from one to six carbon atoms (Ci_6-alkyl), including methyl, ethyl, propyl, isopropyl, butyl, isobutyl, secondary butyl, tertiary butyl, pentyl, isopentyl, neopentyl, tertiary pentyl, hexyl and isohexyl. The term "alkenyl" as used herein refers to a saturated, straight or branched
hydrocarbon chain containing at least one double bond. Alkenyl may preferably be any of the alkyls described above containing one or more double bonds.
In particular, the diterpene pyrophosphate intermediate of the invention is a compound containing a decalin core, wherein said decalin is
i. substituted at the 4 position with one or two alkyl, such as with two alkyl, wherein said alkyl for example may be C1-3, alkyl, for example said alkyl may be methyl;
ii. substituted at the 8 position with one or two substituents individually
selected from the group consisting of alkyl, hydroxyl and alkenyl, wherein said alkyl for example may be C1-3 alkyl, for example said alkyl may be methyl, and said alkenyl may be C1-3 alkenyl, for example said alkenyl may be =C;
iii. substituted at the 9 position with alkenyl-O-PP, wherein said alkenyl for example may be branched C4-8-alkenyl, such as branched C5-7-alkenyl, for example branched C6-alkenyl; and
iv. substituted at the 10 position with alkyl, wherein said alkyl for example may be Ci-3, alkyl, for example said alkyl may be methyl.
In particular, the substituent at the 9 position may be alkenyl of formula VIII:
Figure imgf000086_0001
wherein the asterisk indicates the point of attachment to the decalin core. It is also preferred that the stereochemistry around substituents 9 and 10 is
predetermined. Thus, said diterpene pyrophosphate intermediate may contain a decalin core substituted as indicated above, wherein the substitutions at the 9 and 10 positions are (9R, 10R), (9S.10S), (9S, 10R) or (9R, 10S), for example the substitutions at the 9 and 10 positions are (9R, 10R), (9S.10S) or (9S, 10R).
In preferred embodiments, the diterpene pyrophosphate intermediate may be any of the diterpene pyrophosphate intermediates shown in figure 3, i.e. the diterpene pyrophosphate intermediate may be selected from the group consisting of (9R,10R)- copalyl diphosphate, (9S,10S)-copalyl diphosphate, labda-13-en-8-ol diphosphate and (9S, 10R)-copalyl diphosphate.
Diterpenes
The term "diterpene" as used herein refers to a compound derived or prepared from four isoprene units. A diterpene according to the invention is a C20- molecule consisting of 20 carbon atoms, up to three oxygen atoms and hydrogen atoms. The term "precursor diterpene" as used herein refers to a diterpene, which may serve as the precursor for formation of a diterpenoid. Thus, a precursor diterpene may be a diterpene, which after one or more oxidation reactions becomes a diterpenoid. The precursor diterpene may for example be any of the diterpenes described herein in this section.
The diterpene typically contains one or more ring structures, such as one or more monocyclic, bicyclic, tricyclic or tetracyclic ring structure(s). The diterpene may contain one or more double bonds. Frequently, a diterpene according to the invention contains at least one double bond and often they contain in the range of 1 to 3 double bonds.
The diterpene may comprise up to three oxygen atom, although it is also possible that the diterpene contains no oxygen and consists solely of carbon and hydrogen atoms. The oxygen atom are generally present in the form of hydroxyl groups, or part of a ring structure.
In principle, the methods of the invention can be used to produce any diterpenoid by selecting an appropriate combination of diTPS of class II and diTPS of class I and CYP. A suitable precursor diterpene may be produced by selecting an appropriate combination of diTPS of class II and diTPS of class I.
In one preferred embodiment a diterpene according to the present invention is a C20- molecule containing a decalin core structure.
As used herein the term "containing a core structure of formula " or the term "containing a core of formula" refers to a molecule containing a structure of the indicated formula, wherein said structure may be substituted at one or more positions. The term
"substituted" as used herein in relation to organic compounds refer to one hydrogen being substituted with another group or atom.
Said decalin may be substituted at one or more positions, and it is also contained within the invention that two substituents are fused, thus leading to a tricyclic or higher cyclic structure.
In particular, the diterpene may be a C20-molecule containing a core structure of one of following formulas XI, XII, XIII, XIV, XV, XVI, XVII, XVIII or XIX:
Figure imgf000088_0001
The diterpene containing a core structure of any of formulas XI, XII, XIII, XIV, XV, XVI, XVII, XVIII or XIX, may be a C20-molecule consisting of the formulas XI, XII, XIII, XIV, XV, XVI, XVII, XVIII or XIX substituted at one or more positions. In particular, said diterpene may be a C20-molecule substituted at the position marked by * with one or two alkyl, such as one or two d-3-alkyl, such as with one or two methyl groups. In addition said diterpene may be substituted at the position marked by ** with one or two groups individually selected from alkyi and alkenyl. Said alkyi may for example be Ci-6- alkyl, such as d-3-alkyl, for example isopropyl or methyl. Said alkenyl may me Ci-6 alkenyl, such as C2-4-alkenyl, such as C2-3-alkenyl.
In preferred embodiments of the invention the diterpene to be produced may be a C20- molecule containing a core structure of one of following formulas I, II, III, IV, V, VI, IX,
Figure imgf000089_0001
The diterpene containing a core structure of any of formulas I, II, III, IV, V, VI, IX, X, XXII, XXIII or XXIV, may be a C20-molecule consisting of the formulas I, II, III, IV, V, VI, IX, X, XXII, XXIII or XXIV substituted at one or more positions, for example by one or more groups selected from the group consisting of:
a) alkyi, such as C^-alkyl, for example C1-3, wherein said alkyi may be linear or branched, for example alkyi may be isopropyl or methyl b) alkenyl, such as Ci-6 alkenyl, such as C2-4-alkenyl, such as C2- 3-alkenyl
c) hydroxyl
In particular said diterpene containing a core structure of any of formulas formulas I, II, III, IV, V, VI, IX or X, may be a C20-molecule substituted
a) at the position corresponding to the 4 position of decalin with one or two
alkyl, such as one or two C1-3-alkyl, such as with one or two methyl groups, for example with two methyl; and/or
b) at the position corresponding to the 10 position of decalin with alkyl, such as with d-3-alkyl, such as with methyl; and/or
c) at the position corresponding to the position marked by ** in relations to
formulas XI-XIX, with one or two groups individually selected from alkyl and alkenyl. Said alkyl may for example be C1-6-alkyl, such as C1-3-alkyl, for example isopropyl or methyl. Said alkenyl may me C1-6 alkenyl, such as C2.4- alkenyl, such as C2.3-alkenyl; and/or
d) hydroxyl.
The diterpene according to the invention may also be a C20-molecule consisting of 20 carbon atoms, up to three oxygen atoms and hydrogen atoms, and which contains a core structure of any of formulas I, II, III, IV, VI, X, XII, XLII, XLIII, XLIV, XLV, XLVI, XXVII, XXVIII, XXIX, XXX, XXXI, XXXII, XXXIII, XXXIV, XXXV, XXXVI, XXXVIII, XXXIX, XL and/or XLI.
The structure of the formulas I, II, III, IV, VI, X, XII, XLII, XLIII, XLIV, XLV, XLVI, XXVII, XXVIII, XXIX, XXX, XXXI, XXXII, XXXIII, XXXIV, XXXV, XXXVI, XXXVII, XXXVIII, XXXIX, XL and XLI are as indicated herein above.
In one embodiment the diterpene is a C20-molecule containing a core of formula XXXIII:
Figure imgf000091_0001
Said diterpene may in particular contain a core of formula
XXXIII substituted with alkyl, alkenyl and/or hydroxyl, preferably substituted with methyl, =CH2 and hydroxyl.
In another embodiment the diterpene is a C2o-molecule containing a core of any of formulas II, XXXV, XXXVI and/or XXXVII:
Figure imgf000091_0002
substituted with one or more alkyl or alkenyl . In particular, the position marked by asterisk may be substituted with one or two substituents selected from the group consisting of C^-alkyl and C1-2-alkenyl, preferably the position marked by asterisk may be substituted with one methyl group and ethenyl group.
In one embodiment, said diterpene to be produced is a C2o-molecule containing a decalin substituted at the 10 position with C5-alkenyl chain, which optionally may be substituted with a hydroxyl and/or a methyl group and/or =C. For example, said diterpene may be a C20-molecule of the formula XX:
Figure imgf000091_0003
wherein Ri is a C5-alkenyl substituted with methyl and/or hydroxyl. Preferably, Ri is C5- alkenyl containing one or two double bonds. When is alkenyl containing one double bond, said alkenyl is preferably substituted with hydroxyl and methyl. When is alkenyl containing two double bonds, said alkenyl is preferably substituted with methyl.
For example, said diterpene may be a C20-molecule of the formula XXI:
Figure imgf000092_0001
wherein R2 is a C5-alkenyl substituted with methyl and/or hydroxyl or with =C, and is either -OH or methyl, and X2 is either -H or -OH, wherein one and only one of and X2 is -OH. Preferably, R2 is C5-alkenyl containing one or two double bonds. When R2 is alkenyl containing one double bond, said alkenyl is preferably substituted with hydroxyl and methyl or with =C. When R2 is alkenyl containing two double bonds, said alkenyl is preferably substituted with methyl.
It is also comprised within the invention that the diterpene is the product of any of the reactions VII to XIX described herein above.
In particular, the diterpene may be any of the compounds 1 to 47 shown in figure 2 and/or Table 1 .
It is preferred that the diterpene is not 13R-manoyl oxide.
Table 1 shows preferred diterpenes, which may be precursor diterpenes according to the present invention. The left hand column provides a number of the diterpene used herein, the middle column provides the retention index (RI) and the right hand column the structure. The retention index (RI) is determined by GC-MS and may also be referred to as Kovat's retention index.
Table 1
Compound RI structure
Figure imgf000093_0001
93
Figure imgf000094_0001
Figure imgf000095_0001
(36) 2186
(37) 2192
(38) 2194
(39) 2199
(40) 2202
(41 ) 2215
(42) 2227
(43) 2230
(44) 2236
(45) 2244
(46) 2246
(47) 2268
Diterpenoid
The term "diterpenoid" refers to a diterpene, which has been functionalised by addition of one or more functional groups. Said diterpene may be any of the diterpenes described herein above in the section "Diterpene". Preferably, the diterpenoid according to the present invention is any of the diterpenes described herein above in the section "Diterpene" substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH Thus, the diterpenoid may be a C2o-molecule substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH. Preferably, the diterpenoid is a C20-molecule containing a decalin core and one or more
substitutents selected from the group consisting of -OH, =0 and/or -COOH.
In a preferred embodiment of the invention the diterpenoid is a C2o-molecule containing or XXIV:
Figure imgf000097_0001
In particular, the diterpenoid may be a C20-molecule containing a cores structure of formulas 1, 11, 111, IV, V, VI, IX, X, XXII, XXIII or XXIV substituted at one or more positions by one or more groups selected from the group consisting of:
a) alkyi, such as d-e-alkyl, for example C1-3, wherein said alkyi may be linear or branched, for example alkyi may be isopropyl or methyl, wherein said alkyi may be substituted with one or more groups selected from the group consisting of -OH, =0 and -COOH; b) alkenyl, such as Ci-6 alkenyl, such as C2-4-alkenyl, such as C2-3-alkenyl, wherein said alkyl may be substituted with one or more groups selected from the group consisting of -OH, =0 and -COOH;
c) -OH;
d) =0; and
e) -COOH.
The diterpenoid according to the invention may also be a C20-molecule, which contains a core structure of any of formulas I, I I, I II, IV, VI, X, XII, XLII, XLII I, XLIV, XLV, XLVI,
XXVI I, XXVI II, XXIX, XXX, XXXI, XXXI I, XXXI II, XXXIV, XXXV, XXXVI, XXXVII I, XXXIX, XL and/or XLI, and substituted at one or more positions by one or more groups selected from the group consisting of:
a) alkyl, such as C^-alkyl, for example C1 -3, wherein said alkyl may be linear or branched, for example alkyl may be isopropyl or methyl, wherein said alkyl may be substituted with one or more groups selected from the group consisting of -OH, =0 and -COOH;
b) alkenyl, such as C1-6 alkenyl, such as C2-4-alkenyl, such as C2-3-alkenyl, wherein said alkyl may be substituted with one or more groups selected from the group consisting of -OH, =0 and -COOH;
c) -OH;
d) =0; and
e) -COOH.
The diterpenoid according to the invention may also be a C20-molecule which contains a core structure of any of formulas I, I I, IV, VI, X, XI I, XLI I, XLI I I, XLIV, XLVI, XXVI I,
XXVI I I, XXIX, XXX, XXXI, XXXII I, XXXIV, XXXV, XXXVI, XXXVI I, XXXVII I, XXXIX, XL and/or XLI, and which is substituted at one or more positions by one or more groups selected from the group consisting of:
a) alkyl, such as C^-alkyl, for example C1 -3, wherein said alkyl may be linear or branched, for example alkyl may be isopropyl or methyl, wherein said alkyl may be substituted with one or more groups selected from the group consisting of -OH, =0 and -COOH; b) alkenyl, such as Ci-6 alkenyl, such as C2-4-alkenyl, such as C2-3-alkenyl, wherein said alkyl may be substituted with one or more groups selected from the group consisting of -OH, =0 and -COOH;
c) -OH;
d) =0; and
e) -COOH.
In one embodiment of the invention the diterpenoid is a diterpene, which has been substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH, wherein said diterpene is the product of any of the reactions VII to XIX described herein above.
In another embodiment of the invention the diterpenoid is any of the dierpenes
describes herein above in the section "Diterpenes", which has been substituted at one or more positions with substituent(s) selected from the group consisting of -
OH, =0 and -COOH .
In a preferred embodiment of the invention the diterpenoid is a diterpene, which has been substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH, wherein the diterpene is any of the compounds 1 to 47 of Table 1 .
In one embodiment the diterpenoid is dehydroabietadiene substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and - COOH.
In another embodiment of the invention the diterpenoid is miltiradiene substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH.
In another embodiment of the invention the diterpenoid is manool substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and - COOH. In another embodiment of the invention the diterpenoid is syn-manool substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH. In another embodiment of the invention the diterpenoid is syn-pimara-9,(1 1 ),15-diene substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH.
In another embodiment of the invention the diterpenoid is syn-isopimara-7,15-diene substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH.
In one embodiment of the invention the diterpenoid is selected from the group consisting of hydroxy dehydroabietadiene, hydroxy miltiradiene and hydroxy syn- manool..
In one embodiment of the invention the diterpenoid is carboxyl miltradiene, for example miltiradienic acid. In one embodiment of the invention the diterpenoid is selected from the group consisting of monohydroxy-manool, dihydroxymanool and keto-hydroxy-manool.
In one embodiment of the invention, the diterpenoid is selected from the group consisting of syn-pimara-9,(1 1 ),15-diene oxidized to contain either one keto- and one hydroxyl group or a single carboxylic acid group.
In one embodiment the diterpenoid is ferruginol.
A diterpenoid intermediate according to the present invention is a diterpene, which has been substituted with one or more groups selected from the group consisting of -OH, =0 and -COOH, wherein oxidation of said diterpenoid intermediate leads to formation of a desired diterpenoid. Accordingly, the diterpenoid intermediate may be any of the diterpenoids described herein in this section. In one embodiment, the diterpenoid is not forskolin. Host organism
The host organism to be used with the methods of the invention, may be any suitable host organism containing
a heterologous nucleic acid encoding a diTPS of class II , which may be any of diTPS of class II described herein in any of the sections "diTPS of class II", "syn-CPP type diTPS", "ent-CPP type diTPS", "(+)-CPP type diTPS", "LPP type diTPS", and "LPP like type diTPS"; and/or
a heterologous nucleic acid encoding a diTPS of class I, which may be any of diTPS of class I described herein in any of the sections "diTPS of class I", "EpTPS8",
"EpTPS23", "SsSCS", "CfTPS3", "CfTPS4", "MvTPS5", "TwTPS2", "EpTPSI " , and "CfTPS14" and/or
a heterologous nucleic acid encoding a CYP, which may be any of CYPs described herein in any of the sections "CYP", "CYP76", "CYP71 " and "CYP720".
Suitable host organisms include microorganisms, plant cells, and plants.
The microorganism can be any microorganism suitable for expression of heterologous nucleic acids. In one embodiment the host organism of the invention is a eukaryotic cell. In another embodiment the host organism is a prokaryotic cell.
In a preferred embodiment, the host organism is a fungal cell such as a yeast or filamentous fungus. In particular the host organism may be a yeast cell. In a further embodiment the yeast cell is selected from the group consisting of
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Yarrowia lipolytica, Candida glabrata, Ashbya gossypii, Cyberlindnera jadinii, and Candida albicans.
In general, yeasts and fungi are excellent microorganism to be used with the present invention. They offer a desired ease of genetic manipulation and rapid growth to high cell densities on inexpensive media. For instance yeasts grow on a wide range of carbon sources and are not restricted to glucose. Thus, the microorganism to be used with the present invention may be selected from the group of yeasts described below:
Arxula adeninivorans (Blastobotrys adeninivorans) is a dimorphic yeast (it grows as a budding yeast like the baker's yeast up to a temperature of 42 °C, above this threshold it grows in a filamentous form) with unusual biochemical characteristics. It can grow on a wide range of substrates and can assimilate nitrate. It has successfully been applied to the generation of strains that can produce natural plastics or the development of a biosensor for estrogens in environmental samples. Candida boidinii is a methylotrophic yeast (it can grow on methanol). Like other methylotrophic species such as Hansenula polymorpha and Pichia pastoris, it provides an excellent platform for the production of heterologous proteins. Yields in a multigram range of a secreted foreign protein have been reported. A computational method, IPRO, recently predicted mutations that experimentally switched the cofactor specificity of Candida boidinii xylose reductase from NADPH to NADH. Details on how to download the software implemented in Python and experimental testing of predictions are outlined in the following paper.
Hansenula polymorpha (Pichia angusta) is another methylotrophic yeast (see Candida boidinii). It can furthermore grow on a wide range of other substrates; it is thermo- tolerant and can assimilate nitrate (see also Kluyveromyces lactis). It has been applied to the production of hepatitis B vaccines, insulin and interferon alpha-2a for the treatment of hepatitis C, furthermore to a range of technical enzymes. Kluyveromyces lactis is a yeast regularly applied to the production of kefir. It can grow on several sugars, most importantly on lactose which is present in milk and whey. It has successfully been applied among others to the production of chymosin (an enzyme that is usually present in the stomach of calves) for the production of cheese.
Production takes place in fermenters on a 40,000 L scale.
Pichia pastoris is a methylotrophic yeast (see Candida boidinii and Hansenula polymorpha). It provides an efficient platform for the production of foreign proteins. Platform elements are available as a kit and it is worldwide used in academia for the production of proteins. Strains have been engineered that can produce complex human N-glycan (yeast glycans are similar but not identical to those found in humans).
Saccharomyces cerevisiae is the traditional baker's yeast known for its use in brewing and baking and for the production of alcohol. As protein factory it has successfully been applied to the production of technical enzymes and of pharmaceuticals like insulin and hepatitis B vaccines. Also it has been useful for production of terpenoids. Yarrowia lipolytica is a dimorphic yeast (see Arxula adeninivorans) that can grow on a wide range of substrates. It has a high potential for industrial applications. In another embodiment the host organism is a microalgae such as Chlorella and Prototheca.
In another embodiment of the invention the host organism is a filamentous fungus, for example Aspergillus.
In further yet another embodiment the host organism is a plant cell. The host organism may be a cell of a higher plant, but the host organism may also be cells from organisms not belonging to higher plants for example cells from the moss Physcomitrella patens. In another embodiment the host organism is a mammalian cell, such as a human, feline, porcine, simian, canine, murine, rat, mouse or rabbit cell.
As mentioned, the host organism can also be a prokaryotic cell such as a bacterial cell. If the host organism is a prokaryotic cell the cell may be selected from, but not limited to E. coli, Corynebacterium, Bacillus, Pseudomonas and Streptomyces cells.
The host organism may also be a plant.
A plant or plant cell can be transformed by having a heterologous nucleic acid integrated into its genome, i.e., it can be stably transformed. Stably transformed cells typically retain the introduced nucleic acid with each cell division. A plant or plant cell can also be transiently transformed such that the recombinant gene is not integrated into its genome. Transiently transformed cells typically lose all or some portion of the introduced nucleic acid with each cell division such that the introduced nucleic acid cannot be detected in daughter cells after a certain number of cell divisions. Both transiently transformed and stably transformed transgenic plants and plant cells can be useful in the methods described herein.
Plant cells comprising a heterologous nucleic acid used in methods described herein can constitute part or all of a whole plant. Such plants can be grown in a manner suitable for the species under consideration, either in a growth chamber, a greenhouse, or in a field. Plants may also be progeny of an initial plant comprising a heterologous nucleic acid provided the progeny inherits the heterologous nucleic acid. Seeds produced by a transgenic plant can be grown and then selfed (or outcrossed and selfed) to obtain seeds homozygous for the nucleic acid construct.
The plants to be used with the invention can be grown in suspension culture, or tissue or organ culture. For the purposes of this invention, solid and/or liquid tissue culture techniques can be used. When using solid medium, plant cells can be placed directly onto the medium or can be placed onto a filter that is then placed in contact with the medium. When using liquid medium, transgenic plant cells can be placed onto a flotation device, e.g., a porous membrane that contacts the liquid medium.
When transiently transformed plant cells are used, a reporter sequence encoding a reporter polypeptide having a reporter activity can be included in the transformation procedure and an assay for reporter activity or expression can be performed at a suitable time after transformation. A suitable time for conducting the assay typically is about 1 -21 days after transformation, e.g., about 1 -14 days, about 1 -7 days, or about 1 - 3 days. The use of transient assays is particularly convenient for rapid analysis in different species, or to confirm expression of a heterologous polypeptide whose expression has not previously been confirmed in particular recipient cells.
Techniques for introducing nucleic acids into monocotyledonous and dicotyledonous plants are known in the art, and include, without limitation, Agrobacterium- mediated transformation, viral vector-mediated transformation, electroporation and particle gun transformation, U.S. Patent Nos 5,538,880; 5,204,253; 6,329,571 ; and 6,013,863. If a cell or cultured tissue is used as the recipient tissue for transformation, plants can be regenerated from transformed cultures if desired, by techniques known to those skilled in the art.
The plant comprising a heterologous nucleic acid to be used with the present invention may for example be selected from: corn (Zea. mays), canola (Brassica napus, Brassica rapa ssp.), alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cerale), sorghum (Sorghum bicolor, Sorghum vulgare), sunflower (Helianthus annuas), wheat (Tritium aestivum and other species), Triticale, Rye (Secale) soybean (Glycine max), tobacco (Nicotiana tabacum or Nicothiana Benthamiana), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium hirsutum), sweet potato (Impomoea batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nucifera), pineapple (Anana comosus), citrus (Citrus spp.) cocoa (Theobroma cacao), tea (Camellia senensis), banana (Musa spp.), avacado (Persea americana), fig (Ficus casica), guava (Psidium guajava), mango (Mangifer indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia intergrifolia), almond (Primus amygdalus), apple (Malus spp), Pear (Pyrus spp), plum and cherry tree (Prunus spp), Ribes (currant etc.), Vitis, Jerusalem artichoke (Helianthemum spp), non-cereal grasses (Grass family), sugar and fodder beets (Beta vulgaris), chicory, oats, barley, vegetables, and ornamentals.
For example, plants of the present invention are crop plants (for example, cereals and pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea, sugar beets, sugar cane, soybean, oilseed rape, sunflower and other root, tuber or seed crops. Other important plants maybe fruit trees, crop trees, forest trees or plants grown for their use as spices or pharmaceutical products (Mentha spp, clove,
Artemesia spp, Thymus spp, Lavendula spp, Allium spp., Hypericum, Catharanthus spp, Vinca spp, Papaver spp., Digitalis spp, Rawolfia spp., Vanilla spp., Petrusilium spp., Eucalyptus, tea tree, Picea spp, Pinus spp, Abies spp, Juniperus spp,.
Horticultural plants which may be used with the present invention may include lettuce, endive, and vegetable brassicas including cabbage, broccoli, and cauliflower, carrots, and carnations and geraniums. The plant may also be selected from the group consisting of tobacco, cucurbits, carrot, strawberry, sunflower, tomato, pepper and Chrysanthemum.
The plant may also be a grain plants for example oil-seed plants or leguminous plants. Seeds of interest include grain seeds, such as corn, wheat, barley, sorghum, rye, etc. Oil-seed plants include cotton soybean, saff lower, sunflower, Brassica, maize, alfalfa, palm, coconut, etc. Leguminous plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mung bean, lima bean, fava bean, lentils, chickpea. In a further embodiment of the invention said plant is selected from the following group: maize, rice, wheat, sugar beet, sugar cane, tobacco, oil seed rape, potato and soybean. Thus, the plant may for example be rice.
The whole genome of Arabidopsis thaliana plant has been sequenced (The
Arabidopsis Genome Initiative (2000). "Analysis of the genome sequence of the flowering plant Arabidopsis thaliana". Nature 408 (6814): 796-815.
doi:10.1038/35048692. PMID 1 1 13071 1 ). Consequently, very detailed knowledge is available for this plant and it may therefore be a useful plant to work with.
Accordingly, one plant, which may be used with the present invention is an Arabidopsis and in particular an Arabidopsis thaliana.
In embodiments of the invention, wherein one or more host organism(s) are plants, then the diterpenoid may be isolated from said plant by conventional methods. Thus the diterpenoid may be isolated from the entire plant or from parts thereof. Said parts may for example be selected from the group consisting of leaves, fruits, flowers, stems, seeds and roots of plants. Intermediates may also be isolated from said plants or parts thereof.
It may be preferred that the host organism does not naturally produce the diterpenoid to be produced by the methods of the invention.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH1 1 of SEQ ID NO:21 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI, IX, XXXVIII, XXXIX or XL, for example for production of oxidised dehydroabietadien and/or production of oxidised miltiradiene, e.g hydroxyl dehydroabietadiene or hydroxyl miltiradiene.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding TwTPS7 of SEQ ID NO:4 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH1 1 of SEQ ID NO:21 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI, IX, XXXVIII, XXXIX or XL, for example for production of oxidised dehydroabietadiene and/or production of oxidised miltiradiene, e.g hydroxyl dehydroabietadiene or hydroxyl miltiradiene.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least
85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP71 D381 of SEQ ID NO:22 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least
98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI, IX, XXXVIII, XXXIX or XL, for example for production of oxidised dehydroabietadiene and/or production of oxidised miltiradiene, e.g hydroxyl dehydroabietadiene or hydroxyl miltiradiene.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding TwTPS7 of SEQ ID NO:4 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP71 D381 of SEQ ID NO:22 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI, IX, XXXVIII, XXXIX or XL, for example for production of oxidised dehydroabietadiene and/or production of oxidised miltiradiene, e.g hydroxyl dehydroabietadiene or hydroxyl miltiradiene.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP720B4 of SEQ ID NO:23 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI, IX, XXXVIII, XXXIX or XL, for example for production of oxidised dehydroabietadien and/or production of oxidised miltiradiene, e.g hydroxyl dehydroabietadiene or hydroxyl miltiradiene.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding Ossyn-CPP of SEQ ID NO:1 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding PsCYP720B4 of SEQ ID NO:23 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas II, VI, XXXV, XXXVI, XXXVII or XXXVIII, and in particular for for production of diterpenoids having a core of formulas II, VI, XXXV, XXXVI, XXXVII or XXXVIII and comprising at least one keto group, hydroxyl group and/or carboxylic acid group. Thus, for example said host organism may be useful for production of oxidised syn-pimara- 9,(1 1 ),15-diene, oxidised syn-isopimara-7,15-diene and/or oxidised dehydroabietadiene, e.g keto syn-pimara-9,(1 1 ),15-diene, keto syn-isopimara-7,15- diene, keto dehydroabietadiene, hydroxyl-syn-pimara-9,(1 1 ),15-diene, hydroxyl-syn- isopimara-7,15-diene, hydroxy-dehydroabietadiene, carboxyl-syn-pimara-9,(1 1 ),15- diene, carboxyl-syn-isopimara-7,15-diene and/or carboxyl-dehydroabietadiene.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH8 of SEQ ID NO:20 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXIX or XLVI, for example for production of oxidised manool, such as hydroxy manool or dihydroxy manool or keto-hydroxy-manool.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH1 1 of SEQ ID NO:21 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least
98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXIX or XLVI, for example for production of oxidised manool, such as hydroxy manool or dihydroxy manool or keto-hydroxy-manool.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP720B4 of SEQ ID NO:23 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXIX or XLVI, for example for production of oxidised manool, such as hydroxy manool or dihydroxy manool or keto-hydroxy-manool or a carboxylic acid of manool.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding Ossyn-CPP of SEQ ID NO:1 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; c) a heterologous nucleic acid encoding CYP71 D381 of SEQ ID NO:22 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXVII or XLVI, for example for production of oxidised syn-manool, such as hydroxylated syn-manool.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH8 of SEQ ID NO:20 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
d) a heterologous nucleic acid encoding CYP71 D381 of SEQ ID NO:22 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXIX or XLVI, for example for production of oxidised manool, and in particular for production of highly oxidised manool. Thus, such host organism may be particularly useful for production of dihydroxy manool.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH8 of SEQ ID NO:20 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
d) a heterologous nucleic acid encoding CYP76AH1 1 of SEQ ID NO:21 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXIX or XLVI, for example for production of oxidised manool, and in particular for production of highly oxidised manool. Thus, such host organism may be particularly useful for production of dihydroxy manool.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH8 of SEQ ID NO:20 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
d) a heterologous nucleic acid encoding CYP720B4 of SEQ ID NO:23 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least
98%, such as at least 99% sequence identity therewith.
Such a host organism is in particular useful for production of diterpenoids having a core of formulas XXIX or XLVI, for example for production of oxidised manool, and in particular for production of highly oxidised manool. Thus, such host organism may be particularly useful for production of dihydroxy manool or a dioxidized manool derivative with three additional degrees of desaturation, i.e. double bonds.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding TwTPS7 of SEQ ID NO:4 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH1 of SEQ ID NO:42 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI or XII, for example for production of oxidised miltradiene, for example for production of ferruginol.
In one embodiment of the invention, the host organism may comprise at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding TwTPS7 of SEQ ID NO:4 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least
85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH4 of SEQ ID NO:43 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
Such a host organism is in particular useful for production of diterpenoids having a core of formulas VI or XII, for example for production of oxidised miltradiene, for example for production of ferruginol.
Sequence listing
SEQ ID NO:1 Amino acid sequence of syn-CPP from Oryza sativa
SEQ ID NO:2 Amino acid sequence of TPS7 from Euphobia peplus
SEQ ID NO:3 Amino acid sequence of AN2 from Zea Maiz
SEQ ID NO:4 Amino acid sequence of TPS7 from Tripterygium Wilfordii
SEQ ID NO:5 Amino acid sequence of TPS1 from Coleus forskohlii
SEQ ID NO:6 Amino acid sequence of LPPS from Salvia scarea
SEQ ID NO:7 Amino acid sequence of TPS21 from Tripterygium Wilfordii
SEQ ID NO:8 Amino acid sequence of TPS 14/28 from Tripterygium Wilfordii
SEQ ID NO:9 Amino acid sequence of TPS8 of Euphobia peplus
SEQ ID NO:10 Amino acid sequence of TPS23 of Euphobia peplus
SEQ ID NO:1 1 Amino acid sequence of SCS of Salvia scarea
SEQ ID NO:12 Amino acid sequence of TPS3 of Coleus forskohlii
SEQ ID NO:13 Amino acid sequence of TPS4 of Coleus forskohlii
SEQ ID NO:14 Amino acid sequence of TPS2 of Tripterygium Wilfordii
SEQ ID NO:15 Amino acid sequence of TPS1 of Euphobia peplus
SEQ ID NO:16 Amino acid sequence of TPS14 of Coleus forskohlii
SEQ ID NO:17 Amino acid sequence of TPS2 of Coleus forskohlii
SEQ ID NO:18 Amino acid sequence of AtCPS of Arabidopsis thaliana
SEQ ID NO: 19 Amino acid sequence of AtEKS of Arabidopsis thaliana
SEQ ID NO: 20 Amino acid sequence of CYP76AH8 of Coleus forskohlii
SEQ ID NO: 21 Amino acid sequence of CYP76AH1 1 of Coleus forskohlii
SEQ ID NO: 22 Amino acid sequence of CYP71 D381 of Coleus forskohlii
Figure imgf000116_0001
SEQ ID NO:46 Amino acid sequence of TPS5 of Marrubium vulgare
SEQ ID NO:l- Os syn-CPP
MPVFTASFQCVTLFGQPASAADAQPLLQGQRPFLHLHARRRRPCGPMLI SKSPPYPASEETREW EAEGQHEHTDELRETTTTMIDGIRTALRS IGEGEI SI SAYDTSLVALLKRLDGGDGPQFPSTID WIVQNQLPDGSWGDASFFMMGDRIMSTLACVVALKSWNIHTDKCERGLLFIQENMWRLAHEEED WMLVGFEIALPSLLDMAKDLDLDIPYDEPALKAIYAERERKLAKIPRDVLHAMPTTLLHSLEGM VDLDWEKLLKLRCLDGSFHCSPASTATAFQQTGDQKCFEYLDGIVKKFNGGVPCIYPLDVYERL WAVDRLTRLGI SRHFTSEIEDCLDYIFRNWTPDGLAHTKNCPVKDIDDTAMGFRLLRLYGYQVD PCVLKKFEKDGKFFCLHGESNPSSVTPMYNTYRASQLKFPGDDGVLGRAEVFCRSFLQDRRGSN RMKDKWAIAKDIPGEVEYAMDYPWKASLPRIETRLYLDQYGGSGDVWIGKVLHRMTLFCNDLYL KAAKADFSNFQKECRVELNGLRRWYLRSNLERFGGTDPQTTLMTSYFLASANIFEPNRAAERLG WARVALLADAVSSHFRRIGGPKNLTSNLEELISLVPFDDAYSGSLREAWKQWLMAWTAKESSQE SIEGDTAILLVRAIEIFGGRHVLTGQRPDLWEYSQLEQLTSSICRKLYRRVLAQENGKSTEKVE EIDQQLDLEMQELTRRVLQGCSAINRLTRETFLHVVKSFCYVAYCSPETIDNHIDKVIFQDVI *
SEQ ID NO: 2 - EpTPS7
MAAAANPSNSILNHHLLSSAAARSVSTSQLLFHSRPLVLSGAKDKRDSFVFRIKCSAVSN PRIQEQTDVFQKNGLPVIKWHEFVETDIDHEQVSKVSVSNEIKKRVESIKAILESMEDGD ITISAYDTAWVALVEDINGSGAPQFPASLQWIANNQLPDGSWGDAEIFTAHDRILNTLSC VVALKSWNIHPDMCERGMKYFRENLCKLEDENIEHMPIGFEVAFPSLLELAKKLEIQVPE DSPVLKDVYDSRNLKLKKIPKDIMHKVPTTLLHSLEGMPGLEWEKLLKLQSKDGSFLFSP SSTAYALMQTKDQNCLEYLTKIVHKFNGGVPNVYPVDLFEHIWAVDRLQRLGI SRYFQPQ LKDSVDYVARYWEEDGICWARNSSVHDVDDTAMGFRVLRSFGHHVSADVFKHFKKGDTFF CFAGQSTQAVTGMYNLLRASQLMFPGEKILEEAKQFSSAFLKVKQDANEVLDKWI ITKDL PGEVKYALDIPWYASLPRVESRFYIEQYGGSDDVWIGKTLYRMPIVNNDEYLKLAKLDYN NCQAVHRSEWDNIQKWYEESDLAEFGVSRREILMAYYLAAASIFEPEKSRERIAWAKTSV LLNTIQAYFHENNSTIHEKAAFVQLFKSGFAINARKLEGKTMEKLGRI IVGTLNDVSLDT AMAYGKDISRDLRHAWDICLQKWEESGDMHQGEAQLIVNTINLTSDAWNFNDLSSHYHQF FQLVNEICYKLRKYKKNKVNDKKKTTTPEIESHMQELVKLVLESSDDLDSNLKQIFLTVA RSFYYPAVCDAGTINYHIARVLFERVY*
SEQ ID NO: 3 - ZmAN2
MVLSSSCTTVPHLSSLAWQLGPWSSRIKKKTDTVAVPAAAGRWRRALARAQHTSESAAV AKGSSLTPIVRTDAESRRTRWPTDDDDAEPLVDEIRAMLTSMSDGDI SVSAYDTAWVGLV PRLDGGEGPQFPAAVRWIRNNQLPDGSWGDAALFSAYDRLINTLACVVTLTRWSLEPEMR GRGLSFLGRNMWKLATEDEESMPIGFELAFPSLIELAKSLGVHDFPYDHQALQGIYSSRE IKMKRIPKEVMHTVPTS ILHSLEGMPGLDWAKLLKLQSSDGSFLFSPAATAYALMNTGDD RCFSYIDRTVKKFNGGVPNVYPVDLFEHIWAVDRLERLGISRYFQKEIEQCMDYVNRHWT EDGICWARNSDVKEVDDTAMAFRLLRLHGYSVSPDVFKNFEKDGEFFAFVGQSNQAVTGM YNLNRASQI SFPGEDVLHRAGAFSYEFLRRKEAEGALRDKWI I SKDLPGEVVYTLDFPWY GNLPRVEARDYLEQYGGGDDVWIGKTLYRMPLVNNDVYLELARMDFNHCQALHQLEWQGL KRWYTENRLMDFGVAQEDALRAYFLAAASVYEPCRAAERLAWARAAILANAVSTHLRNSP SFRERLEHSLRCRPSEETDGSWFNSSSGSDAVLVKAVLRLTDSLAREAQPIHGGDPEDI I HKLLRSAWAEWVREKADAADSVCNGSSAVEQEGSRMVHDKQTCLLLARMIEISAGRAAGE AASEDGDRRI IQLTGSICDSLKQKMLVSQDPEKNEEMMSHVDDELKLRIREFVQYLLRLG EKKTGSSETRQTFLS IVKSCYYAAHCPPHVVDRHI SRVIFEPVSAAK*
SEQ ID NO: 4 - T TPS7
MHSLLMKKVIMYSSQTTHVFPSPLHCTIPKSSSFFLDAPWRLHCLSGHGAKKKRLHFDI QQGRNAI SKTHTPEDLYAKQEYSVPEIVKDDDKEEEVVKIKEHVDI IKSMLSSMEDGEI S I SAYDTAWVALIQDIHNNGAPQFPSSLLWIAENQLPDGSWGDSRVFLAFDRI INTLACVV ALKSWNVHPDKCERGISFLKENI SMLEKDDSEHMLVGFEFGFPVLLDMARRLGIDVPDDS PFLQEIYVQRDLKLKRIPKDILHNAPTTLLHSLEAIPDLDWTKLLKLQCQDGSLLFSPSS TAMAFINTKDENCLRYLNYWQRFNGGAPTVYPYDLFEHNWAVDRLQRLGI SRFFQPEIR ECMSYVYRYWTKDGIFCTRNSRVHDVDDTAMGFRLLRLHGYEVHPDAFRQFKKGCEFICY EGQSHPTVTVMYNLYRASQLMFPEEKILDEAKQFTEKFLGEKRSANKLLDKWI ITKDLPG EVGFALDVPWYASLPRVEARFFIQHYGGEDDVWLDKALYRMPYVNNNVYLELAKLDYNYC QALHRTEWGHIQKWYEECKPRDFGI SRECLLRAYFMAAASIFEPERSMERLAWAKTAILL EI IVSYFNEVGNSTEQRIAFTTEFS IRASPMGGYINGRKLDKIGTTQELIQMLLATIDQF SQDAFAAYGHDITRHLHNSWKMWLLKWQEEGDRWLGEAELLIQTINLMADHKIAEKLFMG HTNYEQLFSLTNKVCYSLGHHELQNNKELEHDMQRLVQLVLTNSSDGIDSDIKKTFLAVA KRFYYTAFVDPETVNVHIAKVLFERVD* SEQ ID NO:5 - CfTPSl
MGSLSTMNLNHSPMSYSGILPSSSAKAKLLLPGCFSI SAWMNNGKNLNCQLTHKKISKVA EIRVATVNAPPVHDQDDSTENQCHDAVN IEDPIEYIRTLLRTTGDGRI SVSPYDTAWVA LIKDLQGRDAPEFPSSLEWI IQNQLADGSWGDAKFFCVYDRLVNTIACVVALRSWDVHAE KVERGVRYINENVEKLRDGNEEHMTCGFEWFPALLQRAKSLGIQDLPYDAPVIQEIYHS REQKSKRIPLEMMHKVPTSLLFSLEGLENLEWDKLLKLQSADGSFLTSPSSTAFAFMQTR DPKCYQFIKNTIQTFNGGAPHTYPVDVFGRLWAIDRLQRLGISRFFESEIADCIAHIHRF WTEKGVFSGRESEFCDIDDTSMGVRLMRMHGYDVDPNVLKNFKKDDKFSCYGGQMIESPS PIYNLYRASQLRFPGEQILEDANKFAYDFLQEKLAHNQILDKWVI SKHLPDEIKLGLEMP WYATLPRVEARYYIQYYAGSGDVWIGKTLYRMPEI SNDTYHELAKTDFKRCQAQHQFEWI YMQEWYESCNMEEFGISRKELLVAYFLATAS IFELERANERIAWAKSQI ISTI IASFFNN QNTSPEDKLAFLTDFKNGNSTNMALVTLTQFLEGFDRYTSHQLKNAWSVWLRKLQQGEGN GGADAELLVNTLNICAGHIAFREEILAHNDYKTLSNLTSKICRQLSQIQNEKELETEGQK TSIKNKELEEDMQRLVKLVLEKSRVGINRDMKKTFLAWKTYYYKAYHSAQAIDNHMFKV LFEPVA*
SEQ ID NO: 6 - SsLPPS
MTSVNLSRAPAAITRRRLQLQPEFHAECSWLKSSSKHAPLTLSCQIRPKQLSQIAELRVT SLDASQASEKDISLVQTPHKVEVNEKIEESIEYVQNLLMTSGDGRISVSPYDTAVIALIK DLKGRDAPQFPSCLEWIAHHQLADGSWGDEFFCIYDRILNTLACVVALKSWNLHSDIIEK GVTYIKENVHKLKGANVEHRTAGFELVVPTFMQMATDLGIQDLPYDHPLIKEIADTKQQR LKEIPKDLVYQMPTNLLYSLEGLGDLEWERLLKLQSGNGSFLTSPSSTAAVLMHTKDEKC LKYIENALKNCDGGAPHTYPVDIFSRLWAIDRLQRLGISRFFQHEIKYFLDHIESVWEET GVFSGRYTKFSDIDDTSMGVRLLKMHGYDVDPNVLKHFKQQDGKFSCYIGQSVESASPMY NLYRAAQLRFPGEEVLEEATKFAFNFLQEMLVKDRLQERWVISDHLFDEIKLGLKMPWYA TLPRVEAAYYLDHYAGSGDVWIGKSFYRMPEISNDTYKELAILDFNRCQTQHQLEWIHMQ EWYDRCSLSEFGI SKRELLRSYFLAAATIFEPERTQERLLWAKTRILSKMITSFVNI SGT TLSLDYNFNGLDEI I SSANEDQGLAGTLLATFHQLLDGFDIYTLHQLKHVWSQWFMKVQQ GEGSGGEDAVLLANTLNICAGLNEDVLSNNEYTALSTLTNKICNRLAQIQDNKILQVVDG SIKDKELEQDMQALVKLVLQENGGAVDRNIRHTFLSVSKTFYYDAYHDDETTDLHIFKVL FRPW*
SEQ ID NO: 7 - T TPS21
MFMSSSSSSHARRPQLSSFSYLHPPLPFPGLSFFNTRDKRVNFDSTRI ICIAKSKPARTT PEYSDVLQTGLPLIVEDDIQEQEEPLEVSLENQIRQGVDIVKSMLGSMEDGETSI SAYDT AWVALVENIHHPGSPQFPSSLQWIANNQLPDGSWGDPDVFLAHDRLINTLACVIALKKWN IHPHKCKRGLSFVKENI SKLEKENEEHMLIGFEIAFPSLLEMAKKLGIEIPDDSPALQDI YTKRDLKLTRIPKDKMHNVPTTLLHSLEGLPDLDWEKLVKLQFQNGSFLFSPSSTAFAFM HTKDGNCLSYLNDLVHKFNGGVPTAYPVDLFEHIWSVDRLQRLGI SRFFHPEIKECLGYV HRYWTKDGICWARNSRVQDIDDTAMGFRLLRLHGYEVSPDVFKQFRKGDEFVCFMGQSNQ AITGIYNLYRASQMMFPEETILEEAKKFSVNFLREKRAASELLDKWI ITKDLPNEVGFAL DVPWYACLPRVETRLYIEQYGGQDDVWIGKTLYRMPYVNNNVYLELAKLDYNNCQSLHRI EWDNIQKWYEGYNLGGFGVNKRSLLRTYFLATSNIFEPERSVERLTWAKTAILVQAIASY FENSREERIEFANEFQKFPNTRGYINGRRLDVKQATKGLIEMVFATLNQFSLDALWHGE DITHHLYQSWEKWVLTWQEGGDRREGEAELLVQTINLMAGHTHSQEEELYERLFKLTNTV CHQLGHYHHLNKDKQPQQVEDNGGYNNSNPESI SKLQIESDMRELVQLVLNSSDGMDSNI KQTFLAVTKSFYYTAFTHPGTVNYHIAKVLFERW*
SEQ ID NO:8 - T TPS14/28
MFMSSSSSSHARRPQLSSFSYLHPPLPFPGLSFFNTRDKRVNFDSTRI ICIAKSKPARTT PEYSDVLQTGLPLIVEDDIQEQEEPLEVSLENQIRQGVDIVKSMLGSMEDGETSI SAYDT AWVALVENIHHPGSPQFPSSLQWIANNQLPDGSWGDPDVFLAHDRLINTLACVIALKKWN IHPHKCKRGLSFVKENI SKLEKENEEHMLIGFEIAFPSLLEMAKKLGIEIPDDSPALQDI YTKRDLKLTRIPKDIMHNVPTTLLYSLEGLPSLDWEKLVKLQCTDGSFLFSPSSTACALM HTKDGNCFSYI NLVHKFNGGVPTVYPVDLFEHIWCVDRLQRLGI SRFFHPEIKECLGYV HRYWTKDGICWARNSRVQDIDDTAMGFRLLRLHGYEVSPDVFKQFRKGDEFVCFMGQSNQ AITGIYNLYRASQMMFPEETILEEAKKFSVNFLREKRAASELLDKWI ITKDLPNEVGFAL DVPWYACLPRVETRLYIEQYGGQDDVWIGKTLYRMPYVNNNVYLELAKLDYNNCQSLHRI EWDNIQKWYEGYNLGGFGVNKRSLLRTYFLATSNIFEPERSVERLTWAKTAILVQAIASY FENSREERIEFANEFQKFPNTRGYINGRRLDVKQATKGLIEMVFATLNQFSLDALWHGE DITHHLYQSWEKWVLTWQEGGDRREGEAELLVQTINLMAGHTHSQEEELYERLFKLTNTV CHQLGHYHHLNKDKQPQQVEDNGGYNNSNPESI SKLQIESDMRELVQLVLNSSDGMDSNI KQTFLAVTKSFYYTAFTHPGTVNYHIAKVLFERW*
SEQ ID NO: 9 - EpTPS8
MQVSLSLTTGSEPCITRIHAPSDAPLKQRNNEREKGTLELNGKVSLKKMGEMLRTIENVP IVGSTSSYDTAWVGMVPCSSNSSKPLFPESLKWIMENQNPEGNWAVDHAHHPLLLKDSLS STLACVLALHKWNLAPQLVHSGLDFIGSNLWAAMDFRQRSPLGFDVIFPGMIHQAIDLGI NLPFNNSSIENMLTNPLLDIQSFEAGKTSHIAYFAEGLGSRLKDWEQLLQYQTSNGSLFN SPSTTAAAAIHLRDEKCLNYLHSLTKQFDNGAVPTLYPLDARTRI SI IDSLEKFGIHSHF IQEMTILLDQIYSFWKEGNEEIFKDPGCCATAFRLLRKHGYDVSSDSLAEFEKKEIFYHS SAASAHEIDTKSILELFRASQMKILQNEPILDRIYDWTS IFLRDQLVKGLIENKSLYEEV NFALGHPFANLDRLEARSYIDNYDPYDVPLLKTSYRSSNIDNKDLWTIAFQDFNKCQALH RVELDYLEKWVKEYKLDTLKWARQKTEYALFTIGAILSEPEYADARI SWSQNTVFVTIVD DFFDYGGSLDECRNLINLMHKWDDHLTVGFLSEKVEIVFYSMYGTLNDLAAKAEVRQGRC VRSHLVNLWIWVMENMLKEREWADYNLVPTFYEYVAAGHITIGLGPVLLIALYFMGYPLS EDVVQSQEYKGVYLNVS I IARLLNDRVTVKRESAQGKLNGVSLFVEHGRGAVDEETSMKE VERLVESHKRELLRLIVQKTEGSWPQSCKDLAWRVSKVLHLLYMDDDGFTCPVKMLNAT NAIVNEPLLLTS*
SEQ ID NO: 10 - EpTPS23
MLLASSTSSRFFTKEWEPSNKTFSGSVRAQLSQRVKNIVVTPDQVKESESSGTSLRLKEM LKKVEMPISSYDTAWVAMVPSMEHSRNKPLFPNSLKWVMENQQPDGSWCFDDSNHPWLIK DSLSSTLASVLALKKWNVGQQLIDKGLEYIGSNMWAATDMHQYSPIGFNI IFPSMVEHAN KLGLSLSLDHSLFQSMLRNRDMETKSLNGRNMAYVAEGLNGSNNWKEVMKYQRRNGS ILN SPATTAAALIHLNDVKCFEYLDSLLTKFQHAVPTLYPFDIYARLCILDELEKLGVDRFVE IEKMLLLDYIYRCWLEGSEEILEDPTCCAMAFRFLRMNGYVVSPDVLQGFEEEEKLFHVK DTKSVLELLKASQLKVSEKEGILDRIYSWATSYLKHQLFNASI SDKSLQNEVDYVVKHPH AILRRIENRNYIENYNTKNVSLRKTSFRFVNVDKRSDLLAHSRQDFNKCQIQFKKELAYL SRWEKKYGLDKLKYARQRLEVVYFS IASNLFEPEFSDARLAWTQYAILTTVVDDFFEYAA SMDELVNLTNLIERWDEHGSEEFKSKEVEILFYAIYDLVNEDAEKAKKYQGRCIKSHLVH IWIDILKAMLKESEYVRYNIVPTLDEYISNGCTSI SFGAILLIPLYFLGKMSEEVVTSKE YQKLYMHISMLGRLLNDRVTSQKDMAQGKLNSVSLRVLHSNGTLTEEEAKEEVDKI IEKH RRELLRMWQTEGSVVPKACKKLFWMTSKELHLFYMTEDCFTCPTKLLSAVNSTLKDPLL MP*
SEQ ID NO:ll SsSCS
MSLAFNVGVTPFSGQRVGSRKEKFPVQGFPVTTPNRSRLIVNCSLTTIDFMAKMKENFKREDDK FPTTTTLRSEDIPSNLCI IDTLQRLGVDQFFQYEINTILDNTFRLWQEKHKVIYGNVTTHAMAF RLLRVKGYEVSSEELAPYGNQEAVSQQTNDLPMI IELYRAANERIYEEERSLEKILAWTTIFLN KQVQDNS IPDKKLHKLVEFYLRNYKGITIRLGARRNLELYDMTYYQALKSTNRFSNLCNEDFLV FAKQDFDIHEAQNQKGLQQLQRWYADCRLDTLNFGRDWI IANYLASLI IGDHAFDYVRLAFAK TSVLVTIMDDFFDCHGSSQECDKI IELVKEWKENPDAEYGSEELEILFMALYNTVNELAERARV EQGRSVKEFLVKLWVEILSAFKIELDTWSNGTQQSFDEYISSSWLSNGSRLTGLLTMQFVGVKL SDEMLMSEECTDLARHVCMVGRLLNDVCSSEREREENIAGKSYSILLATEKDGRKVSEDEAIAE INEMVEYHWRKVLQIVYKKES ILPRRCKDVFLEMAKGTFYAYGINDELTSPQQSKEDMKSFVF*
SEQ ID NO: 12 CfTPS3
MSSLAGNLRVIPFSGNRVQTRTGILPVHQTPMITSKSSAAVKCSLTTPTDLMGKIKEVFN REVDTSPAAMTTHSTDIPSNLCI IDTLQRLGIDQYFQSEIDAVLHDTYRLWQLKKKDIFS DITTHAMAFRLLRVKGYEVASDELAPYADQERINLQTIDVPTVVELYRAAQERLTEEDST LEKLYVWTSAFLKQQLLTDAIPDKKLHKQVEYYLKNYHGILDRMGVRRNLDLYDI SHYKS LKAAHRFYNLSNEDILAFARQDFNI SQAQHQKELQQLQRWYADCRLDTLKFGRDVVRIGN FLTSAMIGDPELSDLRLAFAKHIVLVTRIDDFFDHGGPKEESYEILELVKEWKEKPAGEY VSEEVEILFTAVYNTVNELAEMAHIEQGRSVKDLLVKLWVEILSVFRIELDTWTNDTALT LEEYLSQSWVS IGCRICILISMQFQGVKLSDEMLQSEECTDLCRYVSMVDRLLNDVQTFE KERKENTGNSVSLLQAAHKDERVINEEEACIKVKELAEYNRRKLMQIVYKTGTIFPRKCK DLFLKACRIGCYLYSSGDEFTSPQQMMEDMKSLVYEPLPISPPEANNASGEKMSCVSN*
SEQ ID NO: 13 - CfTPS4
MSITINLRVIAFPGHGVQSRQGIFAVMEFPRNKNTFKSSFAVKCSLSTPTDLMGKIKEKL
SEKVDNSVAAMATDSADMPTNLCIVDSLQRLGVEKYFQSEIDTVLDDAYRLWQLKQK
DIFSDITTHAMAFRLLRVKGYDVSSEELAPYADQEGMNLQTIDLAAVIELYRAAQERVA
EEDSTLEKLYVWTSTFLKQQLLAGAIPDQKLHKQVEYYLKNYHGILDRMGVRKGLDLYD
AGYYKALKAADRLVDLCNEDLLAFARQDFNINQAQHRKELEQLQRWYADCRLDKLEF
GRDWRVSNFLTSAILGDPELSEVRLVFAKHIVLVTRIDDFFDHGGPREESHKILELIKEW
KEKPAGEYVSKEVEILYTAVYNTVNELAERANVEQGRNVEPFLRTLWVQILSIFKIELDT
WSDDTALTLDDYLNNSWVS IGCRICILMSMQFIGMKLPEEMLLSEECVDLCRHVSMV
DRLLNDVQTFEKERKENTGNAVSLLLAAHKGERAFSEEEAIAKAKYLADCNRRSLMQI
VYKTGTIFPRKCKDMFLKVCRIGCYLYASGDEFTSPQQMMEDMKSLVYEPLQIHPPAAA*
SEQ ID NO: 14 - T TPS2
MFDKTQLSVSAYDTAWVAMVSSPNSRQAPWFPECVNWLLDNQLSDGSWGLPPHHPSLVKD ALSSTLACLLALKRWGLGEQQMTKGLQFIESNFTS INDEEQHTPIGFNI IFPGMIETAID MNLNLPLRSEDINVMLHNRDLELRRNKLEGREAYLAYVSEGMGKLQDWEMVMKYQRKNGS LFNSPSTTAAALSHLGNAGCFHYINSLVAKFGNAVPTVYPSDKYALLCMIESLERLGIDR HFSKEIRDVLEETYRCWLQGDEEIFSDADTCAMAFRILRVHGYEVSSDPLTQCAEHHFSR SFGGHLKDFSTALELFKASQFVIFPEESGLEKQMSWTNQFLKQEFSNGTTRADRFSKYFS IEVHDTLKFPFHANVERLAHRRNIEHHHVDNTRILKTSYCFSNISNADFLQLAVEDFNRC QSIHREELKHLERWVVETKLDRLKFARQKMAYCYFSAAGTCFSPELSDARI SWARNSVLT TVADDFFDIVGSEEELANLVHLLENWDANGSPHYCSEPVEI IFSALRSTICEIGDKALAW QGRSVTHHVIEMWLDLLKSALREAEWARNKVVPTFDEYVENGYVSMALGPIVLPAVYLIG PKVSEEVVRSPEFHNLFKLMS ICGRLINDTRTFKRESEAGKLNSVLLHMIHSGSGTTEEE AVEKIRGMIADGRRELLRLVLQEKDSVVPRACKDLFWKMVQVLHLFYMDGDGFSSPDMML NAVNALIREPISL*
SEQ ID NO: 15 - EpTPSl
MSATPNSFFTSPI SAKLGHPKSQSVAESNTRIQQLDGTREKIKKMFDKVELSVSPYDTAW VAMVPSPNSLEAPYFPECSKWIVDNQLNDGSWGVYHRDPLLVKDS ISSTLACVLALKRWG IGEKQVNKGLEFIELNSASLNDLKQYKPVGFDITFPRMLEHAKDFGLNLPLDPKYVEAVI FSRDLDLKSGCDSTTEGRKAYLAYI SEGIGNLQDWNMVMKYQRRNGS IFDSPSATAAAS I HLHDASCLRYLRCALKKFGNAVPTIYPFNIYVRLSMVDAIESLGIARHFQEEIKTVLDET YRYWLQGNEEIFQDCTTCAMAFRILRANGYNVSSEKLNQFTEDHFSNSLGGYLEDMRPVL ELYKASQLIFPDELFLEKQFSWTSQCLKQKI SSGLRHTDGINKHITEEVNDVLKFASYAD LERLTNWRRIAVYRANETKMLKTSYRCSNIANEHFLELAVEDFNVCQSMHREELKHLGRW WEKRLDKLKFARQKLGYCYFSSAASLFAPEMSDARI SWAKNAVLTTWDDFFDVGGSEE ELINLVQLIERWDVDGSSHFCSEHVEIVFSALHSTICEIGEKAFAYQGRRMTSHVIKIWL DLLKSMLTETLWSKSKATPTLNEYMTNGNTSFALGPIVLPALFFVGPKLTDEDLKSHELH DLFKTMSTCGRLLNDWRSYERESEEGKLNAVSLHMIYGNGSVAATEEEATQKIKGLIESE RRELMRLVLQEKDSKIPRPCKDLFWKMLKVLHMFYLKDDGFTSNQMMKTANSLINQPISL HER*
SEQ ID NO:16 - CfTPS14
MSLPLSTCVLFVPKGSQFWSSRFSYASASLEVGFQRATSAQIAPLSKSFEETKGRIAKLF HKDELSI STYDTAWVAMVPSPTSSEEPCFPACLNWLLENQCLDGSWARPHHHPMLKKDVL SSTLACILALKKWGVGEEQINRGLHFIELNFASATEKCQITPMGFDIVFPAMLDRARALS LNIRLEPTTLNDLMNKRDLELNRCYQSSSTEREVYRAYIAEGMGKLQNWESVMKYQRKNG TLFNCPSTTAAAFTALRNSDCLNYLHLALNKFGDAVPAVFPLDIYSQLCIVDNLERVGIS RHFLTEIQSVLDGTYRSWLQGDEQIFMDASTCALAFRTLRMNGYNVSSDPITKLIQEGSF SRNTMDINTTLELYRASELILYPDERDLEEHNLRLKTILDQELSGGGFILSRQLGRNINA EVKQALESPFYAIMDRMAKRRSIEHYHIDNTRILKTSYCSPNFGNEDFLSLSVEDFNRCQ VIHREELRELERWVIENRLDELKFARSKSAYCYFSAAATIFSPELSDARMSWAKNGVLTT VVDDFFDVGGSVEELKNLIQLVELWDVDVSRECISPSVQI IFSALKHTIREIGDKGFKLQ GRS ITDHI IAIWLDLLYSMMKESEWGREKAVPTIDEYISNAYVSFALGPIVLPALYLVGP KLSEEMVNHADYHNLFKSMSTCGRLLNDIRGYERELKDGKLNTLSLYMVNNEGEI SWEAA ILEVKSWIERERRELLRSVLEEEKSWPKACKELFWHMCTVVHLFYSKDDGFTSQDLLSA VNAI IYQPLVLE*
SEQ ID NO:17 - CfTPS2
MKMLMIKSQFRVHSIVSAWANNSNKRQSLGHQIRRKQRSQVTECRVASLDALNGIQKVGP ATIGTPEEENKKIEDSIEYVKELLKTMGDGRISVSPYDTAIVALIKDLEGGDGPEFPSCL EWIAQNQLADGSWGDHFFCIYDRWNTAACVVALKSWNVHADKIEKGAVYLKENVHKLKD GKIEHMPAGFEFVVPATLERAKALGIKGLPYDDPFIREIYSAKQTRLTKIPKGMIYESPT SLLYSLDGLEGLEWDKILKLQSADGSFITSVSSTAFVFMHTNDLKCHAFIKNALTNCNGG VPHTYPVDIFARLWAVDRLQRLGISRFFEPEIKYLMDHINNVWREKGVFSSRHSQFADID DTSMGIRLLKMHGYNVNPNALEHFKQKDGKFTCYADQHIESPSPMYNLYRAAQLRFPGEE ILQQALQFAYNFLHENLASNHFQEKWVISDHLIDEVRIGLKMPWYATLPRVEASYYLQHY GGSSDVWIGKTLYRMPEISNDTYKILAQLDFNKCQAQHQLEWMSMKEWYQSNNVKEFGIS KKELLLAYFLAAATMFEPERTQERIMWAKTQWSRMITSFLNKENTMSFDLKIALLTQPQ HQINGSEMKNGLAQTLPAAFRQLLKEFDKYTRHQLRNTWNKWLMKLKQGDDNGGADAELL ANTLNICAGHNEDILSHYEYTALSSLTNKICQRLSQIQDKKMLEIEEGS IKDKEMELEIQ TLVKLVLQETSGGIDRNIKQTFLSVFKTFYYRAYHDAKTIDAHIFQVLFEPVV*
SEQ ID NO: 18 - AtCPS
MSLQYHVLNSIPSTTFLSSTKTTISSSFLTI SGSPLNVARDKSRSGS IHCSKLRTQEYINSQEV QHDLPLIHEWQQLQGEDAPQI SVGSNSNAFKEAVKSVKTILRNLTDGEITI SAYDTAWVALIDA GDKTPAFPSAVKWIAENQLSDGSWGDAYLFSYHDRLINTLACVVALRSWNLFPHQCNKGITFFR ENIGKLEDENDEHMPIGFEVAFPSLLEIARGINIDVPYDSPVLKDIYAKKELKLTRIPKEIMHK IPTTLLHSLEGMRDLDWEKLLKLQSQDGSFLFSPSSTAFAFMQTRDSNCLEYLRNAVKRFNGGV PNVFPVDLFEHIWIVDRLQRLGI SRYFEEEIKECLDYVHRYWTDNGICWARCSHVQDIDDTAMA FRLLRQHGYQVSADVFKNFEKEGEFFCFVGQSNQAVTGMFNLYRASQLAFPREEILKNAKEFSY NYLLEKREREELIDKWI IMKDLPGEIGFALEIPWYASLPRVETRFYIDQYGGENDVWIGKTLYR MPYVNNNGYLELAKQDYNNCQAQHQLEWDIFQKWYEENRLSEWGVRRSELLECYYLAAATIFES ERSHERMVWAKSSVLVKAI SSSFGESSDSRRSFSDQFHEYIA ARRSDHHFNDRNMRLDRPGSV QASRLAGVLIGTLNQMSFDLFMSHGRDVNNLLYLSWGDWMEKWKLYGDEGEGELMVKMI ILMKN NDLTNFFTHTHFVRLAEI INRICLPRQYLKARRNDEKEKTIKSMEKEMGKMVELALSESDTFRD VSITFLDVAKAFYYFALCGDHLQTHISKVLFQKV SEQ ID NO; 19 - AtEKS
MSINLRSSGCSSPISATLERGLDSEVQTRANNVSFEQTKEKIRKMLEKVELSVSAYDTSWVAMV PSPSSQNAPLFPQCVKWLLDNQHEDGSWGLDNHDHQSLKKDVLSSTLAS ILALKKWGIGERQI KGLQFIELNSALVTDETIQKPTGFDI IFPGMIKYARDLNLTIPLGSEWDDMIRKRDLDLKCDS EKFSKGREAYLAYVLEGTRNLKDWDLIVKYQRKNGSLFDSPATTAAAFTQFGNDGCLRYLCSLL QKFEAAVPSVYPFDQYARLSI IVTLESLGIDRDFKTEIKSILDETYRYWLRGDEEICLDLATCA LAFRLLLAHGYDVSYDPLKPFAEESGFSDTLEGYVKNTFSVLELFKAAQSYPHESALKKQCCWT KQYLEMELSSWVKTSVRDKYLKKEVEDALAFPSYASLERSDHRRKILNGSAVENTRVTKTSYRL HNICTSDILKLAVDDFNFCQS IHREEMERLDRWIVENRLQELKFARQKLAYCYFSGAATLFSPE LSDARISWAKGGVLTTVVDDFFDVGGSKEELENLIHLVEKWDLNGVPEYSSEHVEI IFSVLRDT ILETGDKAFTYQGRNVTHHIVKIWLDLLKSMLREAEWSSDKSTPSLEDYMENAYI SFALGPIVL PATYLIGPPLPEKTVDSHQYNQLYKLVSTMGRLLNDIQGFKRESAEGKLNAVSLHMKHERDNRS KEVI IESMKGLAERKREELHKLVLEEKGSWPRECKEAFLKMSKVLNLFYRKDDGFTSNDLMSL VKSVIYEPVSLQKESLT SEQ ID NO: 20 CfCYP76AH8
METITLLLALFFIALTYFI SSRRRRNLPPGPFPLPI IGNMLQLGSKPHQSFAQLSKKYGPLMS I HLGSLYTVIVSSPEMAKEILQKHGQVFSGRTIAQAVHACDHDKISMGFLPVANTWRDMRKICKE QMFSHHSLEASEELRHQKLQQLLDYAQKCCEAGRAVDIREASFITTLNLMSATMFSTQATEFDS EATKEFKEI IEGVATIVGVANFADYFPILKPFDLQGIKRRADGYFGRLLKLIEGYLNERLESRR LNPDAPRKKDFLETLVDI IEANEYKLTTEHLTHLMLDLFVGGSETNTTSLEWIMSELVINPDKM
AKVKEELKSWGDEKLVNESDMPRLPYLQAVIKEVLRIHPPGPLLLPRKAESDQVVNGYLIPKG TQILFNAWAMGRDPTIWKDPESFEPERFLNQSIDFKGQDFELIPFGSGRRICPGMPLANRILHM TTATLVHNFDWKLEEGTADADHKGELFGLAVRRATPLRI IPLKP SEQ ID NO:21 CfCYP76AHll
MELVQVIAVVAWWLWSQLKRKGRKLPPGPSPLPIVGNIFQLSGKNINESFAKLSKIYGPVMS LRLGSLLTVI I SSPEMAKEVLTSKDFANRPLTEAAHAHGHSKFSVGFVPVSDPKWKQMRRVCQE EMFASRILENSQQRRHQKLQELIDHVQESRDAGRAVTIRDPVFATTLNIMSLTLFSADATEFSS SATAELRDIMAGVVSVLGAANLADFFPILKYFDPQGMRRKADLHYGRLIDHIKSRMDKRSELKK ANPNHPKHDDFLEKI IDITIQRNYDLTINEITHLLVDLYLAGSESTVMTIEWTMAELMLRPESL
AKLKAELRSVMGERKMIQESDDI SRLPYLNGAIKEALRLHPPGPLLFARKSEIDVELSGYFIPK GTQILVNEWGMGRDPSVWPNPECFQPERFLDKNIDYKGQDPQLIPFGAGRRICPGIPIAHRWH SWAALVHNFDWEFAPGGSQCNNEFFTGAALVREVPLKLIPLNPPSI SEQ ID NO:22 CfCYP71D381
MEFDFPSALIFPAVSLLLLLWLTKTRKPKSDLDRIPGPRRLPLIGNLHHLI SLTPPPRLFREMA AKYGPLMRLQLGGVPFLIVSSVDVAKHWKTNDVPFANRPPMHAARAITYNYTDIGFAPYGEYW RNLRKICTLELLSARRVRSFRHIREEENAGVAKWIASKEGSPANLSERVYLSSFDITSRAS IGK ATEEKQTLTSS IKDAMKLGGFNVADLYPSSKLLLLITGLNFRIQRVFRKTDRILDDLLSQHRST SATTERPEDLVDVLLKYQKEETEVHLNNDKIKAVIMDMFLAGGETSATAVDWAMAEMIRNPTTL KKAQEEVRRVFDGKGYVDEEEFHELKYLKLVIKEMLRMHPPLPFLVPRMNSERCEINGYEIPAN TRLLINAWAIGRPKYWNDAEKFIPERFENSS IDFKGNNLEYIPFGAGRRMCPGMTFGLASVEFT LAMLLYHFDWKMPQGIKLDMTESFGASLKRKHDLLMIPTLKRPLRLAP SEQ ID NO:23 CYP720B4
MAPMADQISLLLVVFTVAVALLHLIHRWWNIQRGPKMSNKEVHLPPGSTGWPLIGETFSYYRSM TSNHPRKFIDDREKRYDSDIFISHLFGGRTVVSADPQFNKFVLQNEGRFFQAQYPKALKALIGN YGLLSVHGDLQRKLHGIAVNLLRFERLKVDFMEEIQNLVHSTLDRWADMKEISLQNECHQMVLN LMAKQLLDLSPSKETSDICELFVDYTNAVIAIPIKIPGSTYAKGLKARELLIKKI SEMIKERRN HPEWHNDLLTKLVEEGLI SDEI ICDFILFLLFAGHETSSRAMTFAIKFLTYCPKALKQMKEEH DAILKSKGGHKKLNWDDYKSMAFTQCVINETLRLGNFGPGVFREAKEDTKVKDCLIPKGWVVFA FLTATHLHEKFHNEALTFNPWRWQLDKDVPDDSLFSPFGGGARLCPGSHLAKLELSLFLHIFIT RFSWEARADDRTSYFPLPYLTKGFPISLHGRVENE
SEQ ID NO: 24 CfCYP76AH9
MDFFTLLAALFLITLTFFLFFKSESKRRGGANLPPGPYPLPIVGNIFQLGKKPHQSLAQLAKIH GPLMSLHFGSVYTVIVTSPEMAKEIFVKNDQAFLNRTWEAVHAHDHDKISMAFMDVGTEWRTL RRICKEQMFSTQSLETSQGLRQEKLQQLHDFVQRCCDSGRVVDIREASFVTTLNLMSATLFSIQ ATEFDSNATEEFREIMEGVAS IVGDPNFADYFPILKRFDPQGVKRKAELYFGKMLVLVEDLLQK RQEERRRSPSYAKKDDLLERLVDVLNEKNEYKLTTKHITHLLLDLFVGGSETTTTSVEWIMSEL LINPEKLAKLKEELKTVVGEKKQVQESDIPQLPYFEAVLKEVFRLHPPGPLLLPRKAECDVQVG SYTIPKETQILVNAWAIGRDPAIWPNPEAFEPERFLSQKMDYKGQDFELIPFGSGRRICPGLSF ANRMLPMTVATLIHNFDWKLEVEA AEDVHKGEMFGIAVRRAVPLRAYPIQP
SEQ ID NO:25 CfDXS2
ATGGCGTCTTGTGGAGCTATCGGGAGTAGTTTCTTGCCACTGCTCCATTCCGACGAGTCAAGCT TGTTATCTCGGCCCACTGCTGCTCTTCACATCAAGAAGCAGAAGTTTTCTGTGGGAGCTGCTCT GTACCAGGATAACACGAACGATGTCGTTCCGAGTGGAGAGGGTCTGACGAGGCAGAAACCAAGA ACTCTGAGTTTCACGGGAGAGAAGCCTTCAACTCCAATTTTGGATACCATCAACTATCCAATCC ACATGAAGAATCTGTCCGTGGAGGAACTGGAGAGATTGGCCGATGAACTGAGGGAGGAGATAGT TTACACGGTGTCGAAAACGGGAGGGCATTTGAGCTCAAGCTTGGGTGTATCAGAGCTCACCGTT GCACTGCATCATGTATTCAACACACCCGATGACAAAATCATCTGGGATGTTGGACATCAGGCGT ATCCACACAAAATCTTGACAGGGAGGAGGTCCAGAATGCACACCATCCGAAAAACTTTCGGGCT TGCAGGGTTCCCCAAGAGGGATGAGAGCCCGCACGACGCGTTCGGAGCTGGTCACAGCTCCACT AGTATTTCAGCTGGTCTAGGGATGGCGGTGGGGAGGGACTTGCTACAGAAGAACAACCACGTGA TCTCGGTGATCGGAGACGGAGCCATGACAGCGGGGCAGGCATACGAGGCCTTGAACAATGCAGG ATTTCTTGATTCCAATCTGATCATCGTGTTGAACGACAACAAACAAGTGTCCCTGCCTACAGCC ACCGTCGACGGCCCTGCTCCTCCCGTCGGAGCCTTGAGCAAAGCCCTCACCAAGCTGCAAGCAA GCAGGAAGTTCCGGCAGCTACGAGAAGCAGCAAAAGGCATGACTAAGCAGATGGGAAACCAAGC ACACGAAATTGCATCCAAGGTAGACACTTACGTTAAAGGAATGATGGGGAAACCAGGCGCCTCC CTCTTCGAGGAGCTCGGGATTTATTACATCGGCCCTGTAGATGGACATAACATCGAAGATCTTG TCTATATTTTCAAGAAAGTTAAGGAGATGCCTGCGCCCGGCCCTGTTCTTATTCACATCATCAC CGAGAAGGGCAAAGGCTACCCTCCAGCTGAAGTTGCTGCTGACAAAATGCATGGTGTGGTGAAG TTTGATCCAACAACGGGGAAACAGATGAAGGTGAAAACGAAGACTCAATCATACACCCAATACT TCGCGGAGTCTCTGGTTGCAGAAGCAGAGCAGGACGAGAAAGTGGTGGCCATCCACGCGGCGAT GGGAGGCGGAACGGGGCTGAACATCTTCCAGAAACGGTTTCCCGACCGATGTTTCGATGTCGGG ATAGCCGAGCAGCATGCAGTCACCTTCGCCGCGGGTCTTGCAACGGAAGGCCTCAAGCCCTTCT GCACAATCTACTCTTCCTTCCTGCAGCGAGGTTATGATCAGGTGGTGCACGATGTGGATCTTCA GAAACTCCCGGTGAGATTCATGATGGACAGAGCTGGACTTGTGGGAGCTGACGGCCCAACCCAT TGCGGCGCCTTCGACACCACCTACATGGCCTGCCTGCCCAACATGGTCGTCATGGCTCCCTCCG ATGAGGCTGAGCTCATGCACATGGTCGCCACTGCCGCTGTCATTGATGATCGCCCTAGCTGCGT TAGGTACCCTAGAGGAAACGGTATAGGGGTGCCCCTCCCTCCAAACAATAAAGGAATTCCATTA GAGGTTGGGAAGGGAAGGATTTTGAAAGAGGGTAACCGAGTTGCCATTCTAGGCTTCGGAACTA TCGTGCAAAACTGTCTAGCAGCAGCCCAACTTCTTCAAGAACACGGCATATCCGTGAGCGTAGC CGAT
GCGAGATTCTGCAAGCCTCTGGATGGAGATCTGATCAAGAATCTTGTGAAGGAGCACGAAGTTC TCATCACTGTGGAAGAGGGATCCATTGGAGGATTCAGTGCACATGTCTCTCATTTCTTGTCCCT CAATGGACTCCTCGACGGCAATCTTAAGTGGAGGCCTATGGTGCTCCCAGATAGGTACATTGAT CATGGAGCATACCCTGATCAGATTGAGGAAGCAGGGCTGAGCTCAAAGCATATTGCAGGAACTG TTTTGTCACTTATTGGTGGAGGGAAAGACAGTCTTCATTTGATCAACATGTAA SEQ ID NO: 26 CfGGPPsl
CGCCCTCAACTGTTATGCAACAACATTTATATGGTAATGAGATGATAAATCCCAACTCCATAAC CAAGTTTAAACAAAGTTTTAACCATCCAACATTAGTTTATTCAAGAATTAAGAATTCTTACAGT TAAATATACAAATTCTACATCATAATGAAGAAAATGTTGCTTAGCAGAAGACTAAACACCAAGT TTTAGATCATCATCATCATCATCACCAAAAAAAAGAATTGCAATGCTAATTTTGCAAGAATCTA GTTTTGTATATATAACAGACACCAGACCTCATCAAGCAGAGCAAACTAGTTCTGCCTGTGAGCA ATGTAATCGGCCAGCGCCACCAGCGGCGCCGCCTTCCGGGGGTCAAAATCCAGCAGCTGCTGCT TGGCCTCCTCATTCAGCCTCTCAGCAAACTCCATAGCTTTCTCCAATCCCAGAAGCTTTGGGTA GGTGGTCTTGTCGACGGCCAAGTCTTTGCCGGCCGTCTTCCCCAACTCCTCCGAGGATTTTGTG ACATCTAAAATGTCATCCACCACTTGGAAGAGAAGCCCAATTTTTCTAGCAAAAGTTCTCAGTT TCTCAACTTGATCACTGCTTCCTCCTCCCAAAATGGCCCCCAAAACTACAGAGGCCTCAAGCAA TGCTGCAGTTTTGTGTATGTGTATGAATTCCAATGTGTCCAGTCCTACATTGGGATTGCCGGTG CAATGCAAATCCACCACCTGCCCCGCCACCAGCCCCTCCGTCCCGATCGCCTTCGCCAACTCCG CCACCGCCGCAAGAATCCTCTCAGGGGCCACCCCCGTGGTGGCAGTGGCGATGAATTCAAACGC GAAGGCCAATAAAGCATCACCGGCGAGCACGGCGACGTTCTCGCCGAAGACTTTGTGATTGGTG GGCTTGCCGCGGCGGAGGTCATCATTGTCCATACAGGGAAGATCATCGTGGATGAGAGACATGG TGTGGATCATCTCCACCGCGCAGGCGGCGGGGATCGCCGCCGCTTGGGGGCCGCCCACCACCTC GCAGGCGGCGATGCAGAGCATGGGGCGGACCCTCTTTCCGCCGGCGAGCAAGGAGTACCTCATG GCCTCGTGGATCATCGGAGGGTTCTTCACCGCCACCGCGTCGTCGAGAGCCTTGTTCACGTGGG TGGCTTTCTCGACGACGTAGGCGTTGAAATTGAAGGGCGCTTCATCTCCTCGGGTGAAGATTCT TGCTTCCTCGCCGGTGAGGACGGCGGAGACGGCGAAGGAGGAAATGGGCCTCCGCGATTTGAGG AAAGCGGGCTCGAATCTGGGATGGTGGATGAATTTGGAGGGGTGTGGTTGCTTGAAAATGGGGA GGTTTTGAACCCAAGCATCGACCAGATTCATAGACCTCATTTTTTCTCTTAGGGGCTTTGTGTA GTGGAAATGGTGAAGGCAAGAAACAGTTGTGGTTGTTTGCTGAGATC
SEQ ID NO:27 CfTPSl-wt
ATGTCATGGATGAACAACGGTAAAAACCTTAACTGCCAACTTACTCACAAGAAAATATCGAAAG TAGCCGAGATTCGAGTTGCCACGGTGAACGCGCCGCCGGTGCACGATCAAGACGATTCCACAGA AAATCAGTGCCATGACGCGGTGAATAATATTGAGGATCCGATCGAATACATAAGAACGCTGCTG AGGACGACAGGGGACGGCCGAATAAGTGTGTCGCCGTATGACACTGCGTGGGTCGCTCTGATCA AGGACTTGCAAGGACGCGATGCCCCCGAGTTTCCGTCGAGCCTGGAGTGGATCATACAGAATCA GCTGGCCGATGGGTCGTGGGGCGATGCCAAGTTCTTCTGTGTGTATGATCGCCTCGTGAATACG ATAGCATGCGTGGTGGCCTTGAGATCATGGGATGTTCATGCTGAAAAGGTGGAAAGAGGAGTGA GATACATCAATGAAAATGTGGAAAAGCTTAGAGATGGAAATGAGGAACACATGACTTGTGGGTT CGAAGTGGTGTTTCCTGCGCTTCTGCAGAGAGCTAAGAGCTTAGGGATCCAAGATCTTCCCTAT GATGCTCCCGTCATTCAAGAGATATATCACTCCAGGGAACAAAAGTTGAAAAGGATTCCACTGG AGATGATGCACAAAGTGCCAACTTCTTTATTATTTAGTCTGGAAGGGCTGGAGAATTTGGAGTG GGATAAGCTTTTGAAACTGCAGTCAGCTGATGGCTCTTTCCTCACTTCTCCCTCCTCCACTGCC TTCGCTTTTATGCAAACTCGTGATCCTAAATGCTACCAATTCATCAAAAACACTATTCAAACTT TCAACGGAGGAGCACCACACACTTATCCTGTCGATGTTTTTGGAAGACTTTGGGCAATCGACAG GCTGCAGCGCCTCGGGATTTCTCGCTTCTTTGAGTCCGAGATTGCTGATTGCATCGCCCACATC CACAGGTTTTGGACAGAGAAGGGAGTTTTCAGTGGAAGAGAATCAGAGTTTTGCGACATTGATG ATACATCCATGGGAGTCCGACTCATGAGAATGCATGGATACGATGTTGATCCAAATGTATTGAA GAACTTCAAAAAGGATGACAAGTTTTCATGCTACGGTGGACAGATGATTGAGTCTCCGTCTCCC ATTTACAATCTCTACAGGGCTTCCCAACTCCGCTTCCCCGGTGAGCAAATTCTCGAAGATGCCA ACAAATTTGCCTACGATTTCTTACAAGAAAAGCTTGCCCACAACCAGATTCTTGATAAATGGGT TATATCTAAGCACTTGCCTGATGAGATAAAACTGGGACTGGAGATGCCGTGGTACGCCACCCTA CCCCGCGTGGAGGCAAGATACTACATACAGTACTATGCTGGTTCAGGCGATGTATGGATCGGAA AGACTCTCTACAGGATGCCCGAGATCAGCAACGATACATATCATGAGCTTGCAAAAACAGACTT CAAGAGATGCCAAGCTCAGCATCAGTTTGAGTGGATTTACATGCAAGAATGGTACGAGAGTTGC AACATGGAAGAATTCGGGATAAGCAGAAAGGAGCTTCTGGTTGCTTACTTCTTGGCGACTGCAA GCATATTCGAGCTGGAGAGGGCTAATGAGAGAATCGCCTGGGCCAAATCCCAAATCATTTCCAC CATCATTGCATCTTTCTTCAATAACCAAAACACTTCACCGGAGGATAAACTTGCATTTTTAACA GATTTCAAAAATGGCAACTCCACAAACATGGCTCTGGTGACCCTCACTCAATTCCTAGAGGGAT TCGACAGATACACTAGCCATCAGTTGAAGAATGCCTGGAGCGTATGGCTGAGAAAGCTGCAGCA AGGAGAAGGCAACGGCGGCGCAGACGCAGAGCTCCTAGTAAACACATTGAACATTTGTGCCGGC CACATTGCCTTTAGGGAAGAAATACTCGCACACAACGACTACAAGACTCTCTCCAACCTGACTA GCAAAATCTGTCGACAACTTTCTCAAATTCAAAATGAAAAGGAGTTGGAGACAGAGGGACAGAA AACAAGCATAAAAAACAAGGAACTGGAAGAAGATATGCAAAGACTGGTGAAGTTGGTGTTGGAG AAATCAAGGGTTGGAATCAACAGAGATATGAAGAAAACATTTCTTGCAGTGGTAAAAACTTATT ACTACAAAGCATATCATTCTGCTCAGGCCATCGACAACCATATGTTCAAAGTACTTTTCGAACC AGTCGCCctcgagTGCTG
SEQ ID NO:28 CfTPSl-codon optimised
AAAAATGGCTTGGATGAACAACGGTAAGAATTTGAATTGCCAATTGACCCACAAGAAGATCTCT AAGGTTGCCGAAATTAGAGTTGCTACTGTTAATGCTCCACCAGTTCATGATCAAGATGACTCTA CTGAAAATCAATGCCATGATGCCGTTAACAACATCGAAGATCCAATCGAATATATCAGAACCTT GTTGAGAACTACCGGTGATGGTAGAATTTCTGTTTCTCCATATGATACTGCTTGGGTCGCTTTG ATTAAGGACTTGCAAGGTAGAGATGCTCCAGAATTTCCATCTTCATTGGAATGGATCATCCAAA ATCAATTGGCTGATGGTTCTTGGGGTGATGCTAAGTTTTTTTGCGTTTACGATAGATTGGTCAA CACCATTGCTTGTGTTGTTGCTTTGAGATCTTGGGATGTTCATGCTGAAAAAGTTGAAAGAGGT GTCAGATATATCAACGAAAACGTCGAAAAGTTGAGAGATGGTAACGAAGAACATATGACCTGTG GTTTCGAAGTTGTTTTCCCAGCTTTGTTGCAAAGAGCTAAGTCTTTGGGTATTCAAGATTTGCC ATATGATGCCCCAGTTATCCAAGAAATCTATCACTCTAGAGAACAAAAGTCCAAGAGAATCCCA TTGGAAATGATGCATAAGGTCCCAACTAGTTTGTTGTTCTCTTTGGAAGGTTTGGAAAACTTGG AATGGGACAAGTTGTTGAAGTTGCAATCAGCAGATGGTTCCTTTTTGACTTCTCCATCTTCTAC TGCTTTCGCTTTCATGCAAACTAGAGATCCAAAGTGCTACCAATTCATCAAGAACACCATTCAA ACTTTCAACGGTGGTGCTCCACATACTTATCCAGTTGATGTTTTTGGTAGATTGTGGGCCATTG ACAGATTGCAAAGATTGGGTATTTCCAGATTCTTCGAATCCGAAATTGCTGACTGCATTGCCCA TATTCATAGATTCTGGACTGAAAAGGGTGTTTTCTCTGGTAGAGAATCTGAATTCTGCGATATC GATGATACCTCTATGGGTGTTAGATTGATGAGAATGCATGGTTACGATGTTGATCCAAACGTCT TGAAGAATTTCAAGAAGGACGATAAGTTCTCTTGCTACGGTGGTCAAATGATTGAATCTCCATC TCCAATCTACAACTTGTACAGAGCTTCCCAATTGAGATTTCCAGGTGAACAAATTTTGGAAGAT GCCAACAAGTTCGCCTACGACTTTTTACAAGAAAAGTTGGCCCATAATCAAATCTTGGACAAGT GGGTTATTTCCAAACATTTGCCAGACGAAATCAAGTTGGGTTTAGAAATGCCATGGTATGCTAC TTTGCCAAGAGTTGAAGCCAGATATTACATCCAATATTACGCTGGTTCTGGTGATGTTTGGATT GGTAAAACCTTGTATAGAATGCCAGAAATCTCCAACGATACCTATCATGAATTGGCTAAGACCG ATTTCAAGAGATGTCAAGCTCAACATCAATTTGAATGGATCTACATGCAAGAATGGTACGAATC TTGCAACATGGAAGAATTCGGTATCTCCAGAAAAGAATTATTGGTCGCTTACTTCTTGGCTACC GCTTCTATTTTTGAATTGGAAAGAGCCAACGAAAGAATTGCTTGGGCTAAGTCTCAAATCATCT CTACTATTATCGCCTCCTTCTTCAACAATCAAAACACCTCTCCAGAAGATAAGTTGGCTTTCTT GACTGACTTTAAGAACGGTAACTCTACCAACATGGCTTTGGTTACTTTGACCCAATTCTTAGAA GGTTTCGACAGATACACTTCCCACCAATTGAAAAATGCTTGGTCTGTTTGGTTGAGAAAGTTGC AACAAGGTGAAGGTAATGGTGGTGCTGATGCTGAATTATTAGTTAACACCTTGAACATTTGCGC CGGTCATATTGCTTTCAGAGAAGAAATTTTGGCTCACAACGATTACAAGACCTTGTCTAACTTG ACCTCTAAGATCTGCAGACAATTGAGTCAAATCCAAAACGAAAAAGAATTGGAAACCGAAGGTC AAAAGACCTCCATTAAGAACAAAGAATTAGAAGAAGATATGCAAAGATTAGTCAAGTTGGTCTT GGAAAAGTCCAGAGTTGGTATCAACAGAGACATGAAGAAAACTTTCTTGGCCGTTGTTAAGACC TACTACTACAAAGCTTATCATTCCGCTCAAGCCATCGATAACCATATGTTTAAGGTTTTGTTCG AACCAGTCGCCTGA
SEQ ID NO:29 CfTPS3-wt ATGATCACCTCTAAATCATCTGCAGCTGTTAAATGCAGCCTCACCACGCCAACAGATTTGATGG GGAAAATAAAAGAGGTCTTCAACAGGGAAGTCGATACTTCTCCGGCAGCCATGACTACTCATTC TACAGATATACCCTCTAATCTCTGCATAATCGACACCCTCCAGAGGCTGGGAATCGACCAATAC TTCCAATCCGAAATCGACGCTGTTCTACATGATACATACAGGTTATGGCAACTGAAAAAGAAAG ATATATTTTCGGATATTACTACTCATGCAATGGCGTTCAGACTTTTGCGAGTCAAAGGATATGA AGTTGCATCAGACGAACTGGCTCCATACGCTGATCAAGAGCGCATTAACCTGCAAACCATTGAT GTGCCGACAGTTGTTGAGCTATACAGAGCAGCACAGGAGAGATTAACTGAAGAAGATAGCACTC TTGAGAAACTGTATGTTTGGACCAGCGCCTTTCTGAAGCAGCAGTTGCTCACTGATGCCATTCC TGACAAGAAATTGCACAAACAAGTGGAATACTACTTGAAGAACTACCATGGCATATTAGATAGA ATGGGAGTGAGACGAAACCTCGACCTATATGACATAAGCCATTATAAAAGTCTCAAAGCTGCTC ACAGGTTCTATAATCTGAGTAATGAAGATATCCTAGCATTTGCGAGGCAAGATTTTAATATTAG CCAAGCCCAACACCAGAAAGAACTTCAGCAGCTGCAAAGGTGGTATGCAGATTGTAGGTTGGAC ACGTTGAAATTTGGAAGAGATGTAGTGCGTATAGGAAATTTTCTGACTTCAGCAATGATTGGTG ATCCTGAATTGTCTGACCTCCGTCTAGCGTTTGCCAAACATATAGTGCTCGTAACACGTATTGA TGATTTTTTCGATCACGGTGGGCCTAAAGAAGAATCATACGAGATCCTTGAATTAGTAAAAGAA TGGAAAGAGAAGCCAGCAGGAGAATATGTTTCTGAAGAAGTTGAAATCCTATTTACAGCAGTAT ACAATACAGTAAACGAGTTGGCAGAAATGGCTCATATCGAACAAGGACGAAGCGTTAAAGACCT TCTAGTTAAACTGTGGGTTGAAATACTATCAGTTTTCAGAATAGAATTGGATACATGGACCAAC GACACAGCACTTACCTTAGAAGAGTACTTGTCACAATCCTGGGTGTCCATTGGCTGCAGAATCT GCATTCTCATATCAATGCAATTCCAAGGTGTAAAATTATCTGATGAAATGCTTCAGAGTGAAGA ATGCACTGATTTGTGTCGGTATGTTTCAATGGTTGACCGGCTGCTCAACGATGTGCAAACTTTT GAGAAGGAACGCAAGGAAAATACAGGAAATAGTGTGAGCCTTCTGCAAGCAGCTCACAAAGATG AAAGAGTCATTAATGAAGAGGAAGCTTGTATAAAGGTAAAAGAATTGGCTGAATATAACAGGAG AAAACTGATGCAGATTGTCTACAAAACAGGAACCATTTTCCCAAGAAAATGCAAAGATCTGTTT TTGAAGGCATGCAGAATTGGTTGTTATTTGTACTCAAGTGGCGACGAATTTACTTCGCCTCAAC AAATGATGGAAGATATGAAGTCACTGGTTTATGAACCCCTACCAATTTCTCCTCCTGAAGCTAA TAATGCAAGTGGAGAAAAAATGAGTTGTGTCAGCAACTAG
SEQ ID NO:30 CfTPS3-codon optimized for expression in yeast
ATGATCACCTCCAAATCTTCCGCTGCTGTTAAGTGTTCTTTGACTACTCCAACTGATTTGATGG
GTAAGATCAAAGAAGTTTTCAACAGAGAAGTTGATACCTCTCCAGCTGCTATGACTACTCATTC
TACTGATATTCCATCCAACTTGTGCATCATCGATACCTTGCAAAGATTGGGTATCGACCAATAC
TTCCAATCCGAAATTGATGCTGTCTTGCATGATACTTACAGATTGTGGCAATTGAAGAAGAAGG
ACATCTTCTCTGATATTACCACTCATGCTATGGCCTTCAGATTATTGAGAGTTAAGGGTTACGA
AGTTGCCTCTGATGAATTGGCTCCATATGCTGATCAAGAAAGAATCAACTTGCAAACCATTGAT
GTTCCAACCGTCGTCGAATTATACAGAGCTGCACAAGAAAGATTGACCGAAGAAGATTCTACCT
TGGAAAAGTTGTACGTTTGGACTTCTGCTTTCTTGAAGCAACAATTATTGACCGATGCCATCCC
AGATAAGAAGTTGCATAAGCAAGTCGAATATTACTTGAAGAACTACCACGGTATCTTGGATAGA
ATGGGTGTTAGAAGAAACTTGGACTTGTACGATATCTCCCACTACAAATCTTTGAAGGCTGCTC
ATAGATTCTACAACTTGTCTAACGAAGATATTTTGGCCTTCGCCAGACAAGATTTCAACATTTC
TCAAGCCCAACACCAAAAAGAATTGCAACAATTGCAAAGATGGTACGCCGATTGCAGATTGGAT
ACTTTGAAATTCGGTAGAGATGTCGTCAGAATCGGTAACTTTTTAACCTCTGCTATGATCGGTG
ATCCAGAATTGTCTGATTTGAGATTGGCTTTTGCTAAGCACATCGTTTTGGTTACCAGAATCGA
TGATTTCTTCGATCATGGTGGTCCAAAAGAAGAATCCTACGAAATTTTGGAATTGGTCAAAGAA
TGGAAAGAAAAGCCAGCTGGTGAATACGTTTCTGAAGAAGTCGAAATCTTATTCACCGCTGTTT
ACAACACCGTTAACGAATTGGCTGAAATGGCCCATATTGAACAAGGTAGATCTGTTAAGGATTT
GTTGGTTAAGTTGTGGGTCGAAATATTGTCCGTTTTCAGAATCGAATTGGATACCTGGACTAAC
GATACTGCTTTGACTTTGGAAGAATACTTGTCCCAATCCTGGGTTTCTATTGGTTGCAGAATCT
GCATTTTGATCTCCATGCAATTCCAAGGTGTTAAGTTGAGTGACGAAATGTTGCAAAGTGAAGA
ATGTACCGATTTGTGCAGATACGTTTCCATGGTCGATAGATTATTGAACGATGTCCAAACCTTC
GAAAAAGAAAGAAAAGAAAACACCGGTAACTCCGTTTCTTTGTTGCAAGCTGCTCACAAAGACG
AAAGAGTTATCAACGAAGAAGAAGCCTGCATCAAGGTAAAAGAATTAGCCGAATACAATAGAAG
AAAGTTGATGCAAATCGTCTACAAGACCGGTACTATTTTCCCAAGAAAATGCAAGGACTTGTTC TTGAAGGCTTGTAGAATTGGTTGCTACTTGTACTCTTCTGGTGATGAATTCACTTCCCCACAAC AAATGATGGAAGATATGAAGTCCTTGGTCTATGAACCATTGCCAATTTCTCCACCTGAAGCTAA CAATGCATCTGGTGAAAAAATGTCCTGCGTCAGTAACTGA
SEQ ID NO:31 OsCPSsyn
ATGCCGGCCTCGGAAGAAACACGCGAATGGGAAGCAGAAGGACAACATGAACACACGGATGAAC TAAGAGAGACGACGACAACCATGATCGATGGCATCAGGACAGCACTGAGATCAATCGGAGAGGG AGAGATTAGCATCTCAGCCTATGACACTTCGTTGGTTGCCCTTCTGAAGAGGCTAGATGGGGGT GATGGTCCTCAGTTCCCATCAACCATCGACTGGATCGTTCAGAATCAGCTACCGGATGGTTCAT GGGGTGATGCCTCCTTCTTCATGATGGGAGACCGGATCATGAGCACCCTCGCTTGTGTTGTAGC GTTGAAGTCATGGAACATCCACACCGATAAATGCGAGAGAGGTTTGTTGTTTATCCAAGAAAAT ATGTGGAGGTTGGCCCATGAGGAAGAAGACTGGATGCTAGTTGGATTTGAGATTGCCTTGCCCT CGCTCCTAGACATGGCTAAGGACCTGGATCTTGACATCCCTTACGATGAGCCAGCCTTGAAAGC AATATATGCCGAGAGAGAAAGGAAGCTTGCCAAGATTCCAAGAGACGTGCTACACGCTATGCCA ACAACTTTACTTCATAGCCTTGAGGGAATGGTTGACTTGGACTGGGAAAAGCTCCTCAAGCTCC GGTGTCTCGATGGGTCCTTCCATTGCTCCCCTGCTTCGACGGCTACTGCTTTCCAGCAAACAGG AGACCAGAAATGCTTTGAATACCTCGATGGAATCGTCAAAAAGTTCAATGGAGGAGTTCCCTGT ATCTACCCTTTGGATGTGTACGAACGCTTATGGGCCGTCGATAGGCTGACGAGGCTGGGCATAT CAAGGCACTTCACAAGTGAAATTGAGGATTGCTTAGACTACATTTTCAGGAACTGGACTCCAGA TGGATTAGCTCACACAAAGAACTGCCCGGTAAAAGATATCGATGACACGGCCATGGGTTTCCGT CTCCTCCGACTTTACGGCTACCAAGTCGACCCATGCGTGTTGAAGAAGTTCGAAAAGGATGGCA AGTTCTTCTGCTTGCACGGGGAGTCCAACCCATCCTCTGTCACCCCAATGTACAACACTTACCG GGCCTCCCAGCTCAAATTTCCTGGCGATGACGGTGTCCTTGGGCGAGCTGAGGTGTTTTGCCGC TCATTCCTCCAAGACAGGAGAGGCTCAAACAGAATGAAGGACAAGTGGGCCATCGCCAAGGATA TCCCAGGCGAGGTTGAGTATGCTATGGACTACCCATGGAAAGCAAGTTTACCGCGTATTGAAAC AAGGTTGTACTTGGATCAATATGGAGGTAGTGGCGATGTATGGATTGGGAAGGTCCTGCACAGG ATGACTCTTTTCTGCAACGACCTGTACCTCAAGGCAGCTAAAGCTGACTTCAGTAATTTCCAGA AAGAGTGCCGAGTTGAGTTGAATGGCCTTAGAAGGTGGTATTTGAGGAGTAATCTGGAGAGGTT TGGAGGGACTGATCCACAGACTACACTGATGACATCCTACTTCTTAGCTTCAGCGAACATCTTC GAGCCAAACCGAGCAGCGGAACGTCTTGGATGGGCTCGCGTAGCGTTGCTTGCCGATGCCGTCT CCTCTCACTTCAGGAGAATCGGGGGACCAAAAAATTTGACCAGTAATCTTGAAGAGCTTATCAG CCTTGTTCCATTTGACGACGCTTATTCTGGCAGTCTTCGTGAAGCTTGGAAGCAGTGGCTCATG GCATGGACTGCAAAGGAGAGCAGCCAGGAGTCAATTGAAGGGGACACGGCAATATTGTTGGTTC GTGCCATCGAGATTTTTGGAGGACGGCATGTTTTGACTGGGCAAAGACCGGACCTTTGGGAGTA TTCCCAGCTCGAGCAGCTCACCTCCTCCATCTGCCGCAAACTGTACAGGAGGGTTCTTGCCCAG GAGAATGGGAAAAGTACGGAGAAAGTTGAGGAGATAGACCAGCAATTGGATTTGGAGATGCAGG AATTGACTCGGCGCGTTCTTCAGGGCTGCAGCGCTATTAACAGACTAACCCGGGAGACGTTTCT CCATGTGGTGAAGAGCTTCTGCTATGTCGCCTACTGCTCACCTGAGACAATTGATAACCACATC GACAAGGTCATATTCCAAGATGTGATTTAG
SEQ ID NO:32 SsSCS-wt
ATGGCGAAAATGAAAGAGAATTTCAAGAGGGAAGACGATAAATTTCCAACGACAACGACTCTTC GATCCGAAGATATACCCTCTAATTTGTGTATAATCGACACCCTTCAAAGGTTGGGGGTCGATCA ATTCTTCCAATATGAAATCAACACTATTCTAGATAACACATTCAGGTTGTGGCAAGAAAAACAC AAAGTTATATATGGCAATGTTACTACTCATGCAATGGCATTTAGGCTTTTGCGAGTGAAAGGAT ACGAAGTTTCATCAGAGGAGTTGGCTCCATATGGTAACCAAGAGGCTGTTAGCCAGCAAACAAA TGACCTGCCGATGATTATTGAGCTTTATAGAGCAGCAAATGAGAGAATATATGAAGAAGAGAGG AGTCTTGAAAAAATTCTTGCTTGGACTACCATCTTTCTCAATAAGCAAGTGCAAGATAACTCAA TTCCCGACAAAAAACTGCACAAACTGGTGGAATTCTACTTGAGGAATTACAAAGGCATAACCAT AAGATTGGGAGCTAGACGAAACCTCGAGCTATATGACATGACCTACTATCAAGCTCTGAAATCT ACAAACAGGTTCTCTAATTTATGCAACGAAGATTTTCTAGTTTTCGCAAAGCAAGATTTCGATA TACATGAAGCCCAGAACCAGAAAGGACTTCAACAACTGCAAAGGTGGTATGCAGATTGTAGGTT GGACACCTTAAACTTTGGAAGAGATGTAGTTATTATTGCTAATTATTTGGCTTCATTAATTATT GGTGATCATGCGTTTGACTATGTTCGTCTCGCATTTGCCAAAACATCTGTGCTTGTAACAATTA TGGATGATTTTTTCGACTGTCATGGCTCTAGTCAAGAGTGTGACAAGATCATTGAATTAGTAAA AGAATGGAAGGAGAATCCGGATGCAGAGTACGGATCTGAGGAGCTTGAGATCCTTTTTATGGCG TTGTACAATACAGTAAATGAGTTGGCGGAGAGGGCTCGTGTTGAACAGGGGCGTAGTGTCAAAG AGTTTCTAGTCAAACTGTGGGTTGAAATACTCTCAGCTTTCAAGATAGAATTAGATACATGGAG CAATGGCACGCAGCAAAGCTTCGATGAATACATTTCTTCGTCGTGGTTGTCGAACGGTTCCCGG CTGACAGGTCTCCTGACGATGCAATTCGTCGGAGTAAAATTGTCCGATGAAATGCTTATGAGTG AAGAGTGCACTGATTTGGCTAGGCATGTCTGTATGGTCGGCCGGCTGCTCAACGACGTGTGCAG TTCTGAGAGGGAGCGCGAGGAAAATATTGCAGGAAAAAGTTATAGCATTCTACTAGCAACTGAG AAAGATGGAAGAAAAGTTAGTGAAGATGAAGCCATTGCAGAGATCAATGAAATGGTTGAATATC ACTGGAGAAAAGTGTTGCAGATTGTGTATAAAAAAGAAAGCATTTTGCCAAGAAGATGCAAAGA TGTATTTTTGGAGATGGCTAAGGGTACGTTTTATGCTTATGGGATCAACGATGAATTGACTTCT CCTCAGCAATCCAAGGAAGATATGAAATCCTTTG
SEQ ID NO:33 SsSCS-codon optimized for expression in yeast
AAAAATGGCCAAGATGAAGGAAAACTTCAAGAGAGAAGATGACAAGTTCCCAACTACTACTACC
TTGAGATCTGAAGATATCCCATCCAACTTGTGCATTATCGATACCTTGCAAAGATTGGGTGTTG
ACCAATTCTTCCAATACGAAATCAACACCATCTTGGACAACACTTTCAGATTGTGGCAAGAAAA
GCACAAGGTTATCTACGGTAATGTTACTACACATGCTATGGCCTTCAGATTATTGAGAGTTAAG
GGTTACGAAGTTTCCTCCGAAGAATTAGCTCCATACGGTAATCAAGAAGCCGTTTCTCAACAAA
CTAACGACTTGCCAATGATCATCGAATTATACAGAGCTGCCAACGAAAGAATCTACGAAGAAGA
AAGATCCTTGGAAAAGATTTTGGCTTGGACCACCATTTTCTTGAACAAGCAAGTTCAAGACAAC
TCCATCCCAGATAAGAAGTTGCATAAGTTGGTCGAATTCTACTTGAGAAACTACAAGGGTATCA
CCATTAGATTAGGTGCCAGAAGAAACTTGGAATTATACGACATGACTTACTACCAAGCCTTGAA
GTCTACCAACAGATTCTCTAACTTGTGTAACGAAGATTTCTTGGTTTTCGCCAAGCAAGATTTC
GATATTCACGAAGCCCAAAATCAAAAGGGTTTACAACAATTACAAAGATGGTACGCCGATTGCA
GATTGGATACTTTGAATTTCGGTAGAGATGTCGTCATTATCGCTAACTATTTGGCCTCCTTGAT
TATTGGTGATCATGCCTTTGATTACGTCAGATTGGCTTTTGCTAAGACCTCTGTTTTGGTTACC
ATCATGGATGATTTCTTCGATTGCCATGGTTCTTCTCAAGAATGCGACAAGATAATCGAATTGG
TAAAAGAATGGAAAGAAAACCCAGATGCCGAATACGGTTCTGAAGAATTGGAAATTTTGTTCAT
GGCCTTGTACAACACCGTTAACGAATTGGCTGAAAGAGCTAGAGTTGAACAAGGTAGATCTGTC
AAAGAATTTTTGGTCAAGTTGTGGGTTGAAATCTTGTCCGCTTTCAAGATTGAATTGGATACCT
GGTCTAACGGTACTCAACAATCTTTCGACGAATATATCTCCTCCTCTTGGTTGTCTAATGGTTC
TAGATTGACTGGTTTGTTGACCATGCAATTTGTTGGTGTCAAATTGTCCGACGAAATGTTGATG
TCAGAAGAATGTACTGATTTGGCTAGACACGTATGTATGGTCGGTAGATTATTGAACGATGTCT
GCTCATCTGAAAGAGAAAGAGAAGAAAACATTGCCGGTAAGTCCTACTCTATTTTGTTGGCTAC
TGAAAAGGACGGTAGAAAGGTTTCTGAAGATGAAGCTATTGCTGAAATCAACGAAATGGTCGAA
TACCATTGGAGAAAGGTCTTGCAAATCGTCTACAAGAAAGAATCCATCTTGCCTAGAAGATGCA
AGGACGTTTTTTTGGAAATGGCTAAGGGTACTTTTTACGCCTACGGTATTAACGATGAATTGAC
CTCTCCACAACAATCCAAAGAAGATATGAAGTCCTTCGTTTTTTAA
SEQ ID NO:34 CfCYP71D381
ATGGAGTTCGATTTCCCGTCAGCACTCATCTTTCCCGCAGTTTCATTGCTTCTTCTTCTGTGGC TAACAAAAACCAGGAAACCCAAGAGCGATCTGGATCGCATCCCAGGCCCGAGGAGGCTCCCGCT GATCGGAAACCTGCACCATCTCATCAGCCTGACTCCGCCGCCGCGTCTCTTCCGAGAAATGGCC GCGAAATACGGGCCCCTGATGCGGCTGCAGCTGGGCGGAGTCCCCTTCCTCATCGTCTCCTCTG TCGACGTGGCGAAGCATGTGGTCAAAACCAACGACGTGCCCTTCGCCAATCGTCCCCCCATGCA CGCGGCGCGCGCCATAACCTACAACTACACCGACATCGGCTTCGCCCCCTACGGCGAGTACTGG CGGAACCTCAGAAAGATCTGCACGCTCGAGCTGCTCAGCGCGCGGCGCGTGAGGTCATTCCGCC ACATAAGGGAGGAGGAGAACGCCGGCGTGGCGAAGTGGATCGCTTCCAAGGAAGGGTCGCCGGC GAATCTGTCGGAGAGGGTTTATCTCTCTTCATTCGACATCACAAGCAGGGCGTCCATCGGGAAG GCGACGGAGGAGAAGCAAACGCTAACTTCCTCCATCAAGGATGCTATGAAGCTGGGCGGATTCA ACGTCGCCGATCTTTATCCCTCCAGCAAGCTGCTGCTTCTCATCACCGGGCTGAATTTCAGGAT ACAGAGAGTTTTCCGGAAAACAGATCGGATTCTGGATGATTTACTCAGCCAGCACCGGTCCACC AGCGCCACCACGGAAAGGCCCGAAGATCTCGTCGACGTCCTCCTCAAGTATCAAAAGGAAGAAA CCGAAGTCCATCTCAATAACGATAAAATCAAAGCAGTGATAATGGATATGTTTCTGGCCGGCGG CGAGACATCAGCAACAGCGGTGGACTGGGCCATGGCGGAGATGATAAGAAACCCCACCACACTC AAAAAGGCGCAAGAAGAAGTGAGACGTGTCTTCGACGGCAAGGGATACGTAGACGAAGAAGAGT TCCACGAGCTAAAATACCTGAAACTGGTGATCAAAGAGATGCTGAGGATGCACCCGCCGCTGCC ATTTCTGGTCCCACGCATGAACAGCGAGAGGTGCGAAATCAATGGATACGAAATCCCAGCCAAC ACGAGACTGCTGATCAACGCATGGGCCATCGGAAGAGACCCCAAGTACTGGAATGACGCAGAGA AGTTCATACCAGAGAGATTCGAGAACAGCTCCATCGATTTCAAAGGGAACAATCTGGAATACAT ACCGTTTGGCGCAGGAAGGAGAATGTGCCCCGGGATGACGTTCGGGCTGGCGAGCGTGGAGTTC ACGCTGGCCATGCTTCTGTATCACTTCGACTGGAAGATGCCCCAAGGGATTAAGCTGGACATGA CGGAGTCCTTCGGCGCATCTCTCAAGAGGAAGCATGATTTGCTCATGATTCCCACTCTCAAGAG GCCTTTGCGACTTGCTCCATGA
SEQ ID NO: 35 CfCYP76AH8 wt
ATGGAAACCATCACTCTTCTCCTTGCTCTTTTCTTCATCGCTCTCACATATTTCATCTCCTCGA GGCGCCGGAGAAACCTTCCTCCGGGGCCTTTTCCTCTTCCGATCATCGGAAACATGCTGCAGCT CGGCTCCAAACCCCACCAGTCATTCGCCCAACTCTCAAAGAAATATGGGCCTCTCATGTCCATC CACCTCGGGAGTTTATACACCGTGATCGTCTCCTCCCCGGAGATGGCAAAAGAGATACTGCAAA AACACGGCCAAGTATTTTCGGGGCGTACCATCGCACAGGCGGTGCACGCGTGCGACCACGACAA GATCTCCATGGGATTTCTGCCGGTGGCAAACACATGGCGTGATATGCGCAAAATATGCAAAGAG CAGATGTTCTCGCATCACAGTCTGGAAGCCAGCGAGGAGCTCCGGCACCAGAAGCTGCAGCAGC TGCTGGACTACGCCCAGAAATGCTGCGAAGCCGGTCGTGCCGTCGATATTCGTGAGGCCTCCTT CATTACAACGCTCAACCTCATGTCTGCCACCATGTTTTCGACTCAAGCTACCGAGTTCGACTCA GAAGCTACCAAGGAGTTCAAGGAGATCATCGAGGGGGTGGCCACCATTGTGGGTGTGGCCAATT TCGCTGACTACTTCCCAATCTTGAAGCCTTTCGATCTGCAGGGGATCAAGCGCAGGGCAGATGG CTACTTTGGAAGATTGCTTAAATTGATAGAGGGATATCTGAATGAAAGGCTGGAATCGAGAAGG TTGAACCCAGATGCCCCCAGGAAGAAGGACTTTTTGGAAACACTCGTGGACATCATCGAGGCTA ATGAATACAAGTTGACGACCGAGCACCTCACGCACCTCATGCTTGACTTGTTTGTTGGAGGATC GGAGACAAACACGACCTCACTGGAGTGGATAATGTCGGAATTAGTGATCAACCCGGACAAAATG GCGAAGGTGAAAGAGGAGCTGAAGAGCGTGGTAGGAGACGAGAAACTGGTGAACGAGTCAGACA TGCCGAGGCTGCCATACTTGCAAGCAGTGATCAAAGAAGTGCTGCGAATTCACCCTCCCGGCCC TCTTCTGCTTCCTCGCAAGGCAGAGAGTGATCAAGTCGTGAATGGGTACTTGATCCCAAAGGGG ACTCAAATACTCTTCAATGCATGGGCAATGGGCAGAGACCCCACTATCTGGAAGGACCCTGAAT CTTTTGAGCCCGAACGCTTCCTCAATCAAAGCATAGACTTCAAAGGCCAAGATTTCGAGCTGAT CCCATTCGGCTCGGGGAGAAGAATCTGCCCCGGGATGCCGTTGGCGAATCGGATTCTGCACATG ACGACGGCCACTCTGGTTCACAACTTCGACTGGAAATTGGAAGAAGGAACAGCTGATGCGGATC ACAAAGGAGAGCTCTTTGGGCTGGCCGTTCGCCGTGCAACTCCTCTCAGGATCATTCCACTTAA GCCATGA
SEQ ID NO:36 CfCYP76AH8 codon optimized for expression in yeast
ATGGAAACCATCACCTTGTTGTTGGCCTTGTTTTTCATTGCTTTGACCTACTTCATCTCCTCCA
GAAGAAGAAGAAATTTGCCACCAGGTCCATTTCCATTGCCAATTATTGGTAACATGTTGCAATT
GGGTTCCAAGCCACATCAATCTTTTGCTCAATTGTCCAAAAAGTACGGTCCATTGATGTCCATT
CATTTGGGTTCCTTGTACACCGTTATAGTCTCTTCACCAGAAATGGCCAAAGAAATCTTGCAAA
AACACGGTCAAGTTTTCTCCGGTAGAACTATTGCTCAAGCTGTTCATGCTTGTGATCACGATAA
GATTTCTATGGGTTTTTTGCCAGTTGCCAACACTTGGAGAGATATGAGAAAGATCTGCAAAGAA
CAAATGTTCTCCCACCATTCTTTGGAAGCTAGTGAAGAATTGAGACACCAAAAGTTGCAACAAT
TATTAGACTACGCTCAAAAGTGTTGCGAAGCTGGTAGAGCTGTTGATATTAGAGAAGCCTCTTT
CATTACCACCTTGAACTTGATGTCTGCTACTATGTTTTCTACCCAAGCTACCGAATTTGATTCC GAAGCTACAAAAGAATTCAAAGAAATTATCGAAGGTGTCGCCACTATAGTTGGTGTTGCTAATT TTGCTGATTACTTCCCAATCTTGAAGCCATTTGACTTGCAAGGTATTAAGAGAAGAGCTGATGG TTACTTCGGTAGATTATTGAAGTTGATCGAAGGTTACTTGAACGAAAGATTGGAATCTAGAAGA TTGAACCCAGATGCTCCAAGAAAGAAGGATTTCTTGGAAACCTTGGTTGATATCATCGAAGCCA ACGAATACAAGTTGACTACTGAACATTTGACCCACTTGATGTTGGATTTGTTTGTTGGTGGTTC TGAAACTAACACCACATCCTTGGAATGGATCATGTCTGAATTGGTTATCAACCCAGATAAGATG GCCAAGGTCAAAGAAGAATTGAAGTCTGTTGTTGGTGACGAAAAGTTGGTTAACGAATCTGATA TGCCAAGATTGCCATACTTGCAAGCCGTTATCAAAGAAGTTTTGAGAATTCATCCACCTGGTCC TTTGTTGTTGCCAAGAAAAGCTGAATCTGATCAAGTTGTTAACGGTTATTTGATCCCAAAGGGT ACTCAAATTTTGTTCAATGCTTGGGCTATGGGTAGAGATCCAACTATTTGGAAAGATCCAGAAT CCTTCGAACCAGAAAGATTCTTGAATCAATCCATCGACTTCAAGGGTCAAGACTTCGAATTGAT TCCATTTGGTTCTGGTAGAAGAATCTGTCCAGGTATGCCATTGGCTAATAGAATCTTGCATATG ACTACCGCCACTTTGGTTCATAATTTCGATTGGAAATTGGAAGAAGGTACTGCTGACGCTGATC ATAAGGGTGAATTATTTGGTTTGGCTGTTAGAAGAGCTACCCCATTGAGAATCATTCCATTGAA ACCATAA
SEQ ID NO:37 CfCYP76AHll wt
ATGGAGTTGGTGCAAGTGATTGCGGTGGTGGCGGTGGTGGTTGTGTTGTGGTCTCAGTTGAAGC GAAAGGGCAGAAAGCTTCCACCAGGGCCTTCCCCTCTGCCCATCGTGGGAAACATTTTCCAGCT GTCGGGTAAAAACATTAATGAATCATTTGCGAAGCTCTCCAAGATATACGGCCCCGTCATGTCC CTACGCCTCGGCAGCTTACTCACGGTCATCATTTCGTCGCCGGAAATGGCGAAAGAAGTACTGA CAAGCAAAGATTTTGCAAACCGGCCCCTCACGGAGGCGGCCCATGCGCACGGCCACTCCAAGTT CTCCGTCGGGTTTGTGCCCGTCAGCGACCCAAAATGGAAACAAATGCGCCGCGTTTGCCAGGAG GAAATGTTTGCGAGCCGGATCTTGGAAAACAGCCAGCAACGCCGCCACCAGAAGCTGCAGGAGC TGATCGACCACGTGCAGGAATCCCGCGACGCTGGGCGAGCAGTCACTATCCGCGACCCCGTGTT CGCAACCACGCTCAACATTATGTCGCTCACGTTGTTTTCGGCTGATGCCACTGAGTTCAGTTCA AGCGCTACCGCTGAGTTGAGGGATATCATGGCGGGCGTCGTTTCTGTTCTTGGTGCTGCAAATT TGGCCGACTTCTTCCCCATCCTCAAATATTTCGATCCGCAAGGGATGAGGCGCAAAGCAGATCT CCACTACGGGAGACTCATCGACCACATCAAGAGCCGGATGGACAAGCGATCTGAGTTGAAAAAG GCAAACCCGAATCATCCCAAGCACGACGACTTCTTGGAGAAGATCATAGACATCACTATACAGA GGAACTACGACTTGACCATCAATGAGATCACGCATTTACTGGTGGACTTGTATCTAGCTGGAAG CGAATCAACGGTGATGACAATTGAATGGACAATGGCGGAACTGATGCTGCGCCCGGAGTCACTA GCGAAGCTGAAAGCCGAGCTGAGAAGCGTGATGGGAGAGAGAAAGATGATCCAGGAATCAGATG ACATATCAAGGCTTCCATATCTGAATGGGGCCATCAAAGAAGCCCTCCGCCTCCATCCGCCTGG CCCCCTCCTTTTTGCCCGCAAAAGCGAAATCGATGTGGAACTCAGCGGCTATTTCATCCCAAAA GGCACTCAGATACTTGTGAATGAATGGGGCATGGGCAGAGACCCCAGCGTTTGGCCTAATCCAG AGTGTTTCCAGCCGGAACGCTTCTTGGACAAAAACATTGATTACAAGGGCCAAGATCCCCAACT CATCCCATTCGGAGCCGGAAGAAGGATCTGCCCGGGGATACCCATCGCGCATCGGGTTGTGCAT TCAGTAGTGGCCGCATTAGTCCACAATTTCGACTGGGAGTTTGCCCCAGGCGGCAGTCAATGTA ACAACGAGTTCTTCACCGGGGCAGCACTCGTCAGGGAAGTTCCTCTCAAACTCATCCCACTCAA CCCACCTTCTATTTGA
SEQ ID NO:38 CfCYP76AHll codon optimized for expression in yeast ATGGAATTGGTCCAAGTTATCGCTGTTGTTGCAGTTGTTGTTGTTTTGTGGTCCCAATTGAAAA GAAAGGGTAGAAAATTGCCACCAGGTCCATCTCCATTGCCAATAGTTGGTAATATCTTCCAATT GTCCGGTAAGAACATCAACGAATCTTTCGCTAAGTTGTCCAAAATCTACGGTCCAGTTATGTCT TTGAGATTGGGTTCTTTGTTGACCGTCATTATCTCTTCACCAGAAATGGCCAAAGAAGTCTTGA CTTCTAAGGATTTTGCTAACAGACCATTGACTGAAGCTGCTCATGCTCATGGTCATTCTAAATT TTCTGTTGGTTTCGTTCCAGTCTCTGATCCAAAATGGAAACAAATGAGAAGAGTCTGCCAAGAA GAAATGTTCGCCTCTAGAATTTTGGAAAACTCCCAACAAAGAAGACACCAAAAGTTGCAAGAAT TGATCGACCACGTTCAAGAATCTAGAGATGCTGGTAGAGCTGTTACTATTAGAGATCCAGTTTT CGCTACCACCTTGAACATTATGTCCTTGACTTTGTTTTCTGCCGATGCTACTGAATTCTCTTCT TCTGCTACTGCTGAATTGAGAGATATTATGGCTGGTGTTGTTTCTGTTTTGGGTGCTGCTAATT TGGCTGATTTCTTCCCAATCTTGAAATACTTCGATCCACAAGGTATGAGAAGAAAGGCTGACTT GCATTACGGTAGATTGATTGACCATATCAAGTCCAGAATGGACAAGAGATCTGAATTGAAGAAG GCTAATCCAAACCATCCAAAGCACGATGATTTCTTGGAAAAGATCATCGACATCACCATTCAAA GAAACTACGACTTGACCATTAACGAAATCACCCATTTGTTGGTCGACTTGTATTTGGCTGGTTC TGAATCTACTGTTATGACCATTGAATGGACCATGGCCGAATTGATGTTAAGACCAGAATCATTG GCTAAATTGAAGGCAGAATTGAGATCCGTTATGGGTGAAAGAAAGATGATCCAAGAATCCGACG ACATTTCTAGATTGCCATACTTAAACGGTGCTATCAAAGAAGCCTTAAGATTGCATCCACCTGG TCCTTTGTTGTTTGCTAGAAAGTCTGAAATCGATGTTGAATTGTCTGGTTACTTCATCCCAAAG GGTACTCAAATCTTGGTTAATGAATGGGGTATGGGTAGAGATCCTTCTGTTTGGCCTAATCCAG AATGTTTTCAACCAGAAAGATTTTTGGATAAGAACATTGACTACAAGGGTCAAGACCCACAATT GATTCCATTTGGTGCAGGTAGAAGAATTTGTCCAGGTATTCCAATTGCCCATAGAGTTGTTCAT TCAGTTGTTGCTGCTTTGGTTCATAACTTCGATTGGGAATTTGCTCCTGGTGGTTCTCAATGTA ACAACGAATTTTTCACTGGTGCTGCCTTGGTTAGAGAAGTTCCATTGAAGTTGATTCCTTTGAA CCCACCATCCATCTGA
SEQ ID NO:39 PsCYP720B4
ATGGCGCCCATGGCAGACCAAATATCATTACTGTTGGTGGTGTTCACGGTAGCGGTGGCGCTCC TCCACCTTATTCACAGGTGGTGGAATATCCAGAGAGGCCCAAAAATGAGTAATAAGGAGGTTCA TCTGCCTCCTGGGTCGACTGGATGGCCGCTTATTGGCGAAACCTTCAGTTATTATCGCTCCATG ACCAGCAATCATCCCAGGAAATTCATCGACGACAGAGAGAAAAGATATGATTCGGACATTTTCA TATCTCATCTATTTGGAGGCCGGACGGTTGTATCAGCGGATCCCCAGTTCAACAAGTTTGTTCT ACAAAACGAGGGGAGATTCTTTCAAGCCCAATACCCAAAGGCACTGAAGGCTTTGATAGGCAAC TACGGGCTGCTCTCTGTGCATGGAGATCTCCAGAGAAAGCTCCACGGAATAGCTGTGAATTTGC TGAGGTTTGAGAGACTGAAAGTCGATTTCATGGAGGAGATACAGAATCTCGTGCACTCCACGTT GGATAGATGGGCAGATATGAAGGAAATTTCTCTGCAGAATGAATGTCACCAGATGGTTCTCAAC TTGATGGCCAAACAACTGCTGGATTTATCTCCTTCCAAAGAGACGAGTGATATTTGCGAGCTAT TCGTTGACTATACCAATGCAGTGATTGCCATTCCCATCAAAATCCCAGGTTCCACCTATGCAAA GGGGCTTAAGGCAAGGGAGCTTCTCATAAAAAAGATTTCAGAAATGATAAAAGAGAGAAGGAAT CATCCTGAAGTTGTTCATAATGATTTGTTAACTAAACTTGTGGAAGAGGGGCTCATTTCAGATG AAATTATTTGTGATTTTATTTTATTTTTACTTTTTGCTGGACATGAGACTTCCTCTAGAGCCAT GACATTTGCTATCAAGTTTCTTACCTATTGCCCCAAGGCATTGAAGCAAATGAAGGAAGAGCAT GATGCTATATTAAAATCAAAGGGAGGTCATAAGAAACTTAATTGGGATGACTACAAATCAATGG CATTCACTCAATGTGTTATAAATGAAACACTTCGATTAGGTAACTTTGGTCCAGGGGTGTTTAG AGAAGCTAAAGAAGACACTAAAGTAAAAGATTGTCTCATTCCAAAAGGATGGGTGGTATTTGCT TTTCTGACTGCAACACATCTACATGAAAAGTTTCATAATGAAGCTCTTACTTTTAACCCATGGC GATGGCAATTGGATAAAGATGTACCAGATGATAGTTTGTTTTCACCTTTTGGAGGTGGAGCTAG GCTTTGTCCAGGATCTCATCTAGCTAAACTTGAATTGTCACTTTTTCTTCACATATTTATCACA AGATTCAGTTGGGAAGCGCGTGCAGATGATCGTACCTCATATTTTCCATTACCTTATTTAACTA AAGGCTTTCCCATTAGCCTTCATGGTAGAGTAGAGAATGAATAA
SEQ ID NO: 40 amino acid sequence of CfCYP76AH15:
METMTLLLPLFFIALTYFLSWRRRRNLPPGPFPLPI IGNLLQIGSKPHQSFAQLSKKYGPLMSV QLGSVYTVIASSPEMAKEILQKHGQVFSGRTIAQAAQACGHDQIS IGFLPVATTWRDMRKICKE QMFSHHSLESSKELRHEKLQKLLDYAQKCCEAGRAVDIREAAFITTLNLMSATLFSTQATEFDS EATKEFKEVIEGVAVIVGEPNFADYFPILKPFDLQGIKRRANSYFGRLLKLMERYLNERLESRR LNPDAPKKNDFLETLVDI IQADEYKLTTDHVTHLMLDLFVGGSETSATSLEWIMSELVSNPSKL AKVKAELKSWGEKKWSESEMARLPYLQAVIKEVLRLHPPGPLLLPRKAGSDQVVNGYLIPKG TQLLFNVWAMGRDPS IWKNPESFEPERFLNQNIDYKGQDFELIPFGSGRRICPGMPLADRIMHM TTATLVHNFDWKLEDGAGDADHKGDDPFGLAIRRATPLRI IPLKP SEQ ID NO:41 CfCYP76AH17 (aa) :
MESMNALWGLLLIALTILFSLRRRRNLAPGPYPFPI IGNMLQLGTKPHQSFAQLSKKYGPLMS IHLGSLYTVIVSSPEMAKEILQKHGQVFSGRTIAQAVHACDHDKI SMGFLPVSNTWRDMRKICK EQMFSHHSLEGSQGLRQQKLLQLLDYAQKCCETGRAVDIREASFITTLNLMSATMFSTQATEFE SKSTQEFKEI IEGVATIVGVANFGDYFPILKPFDLQGIKRKADGYFGRLLKLIEGYLNERLESR KSNPNAPRKNDFLETWDILEANEYKLSVDHLTHLMLDLFVGGSETNTTSLEWTMSELVNNPDK MAKLKQELKSVVGERKLVDESEMPRLPYLQAVIKESLRIHPPGPLLLPRKAETDQEVNGYLIPK GTQILFNVWAMGRDPSIWKDPESFEPERFLNQNIDFKGQDFELIPFGSGRRICPGMPLANRILH MATATMVHNFDWKLEQGTDEADAKGELFGLAVRRAVPLRI IPLQP
SEQ ID NO: 42 cytochrome P450 CYP76AH1 from Salvia miltiorrhiza_ MDSSPLLAALFFIAATI IFLSFRRRRNLPPGPFPYPIVGNMLQLGANPHQVFAKLSKRYGPLMS IHLGSLYTVIVSSPEMAKEILHRHGQVFSGRTIAQAVHACDHDKI SMGFLPVASEWRDMRKICK EQMFSNQSMEASQGLRRQKLQQLLDHVQKCSDSGRAVDIREAAFITTLNLMSATLFSSQATEFD SKATMEFKEI IEGVATIVGVPNFADYFPILRPFDPQGVKRRADVFFGKLLAKIEGYLNERLESK RANPNAPKKDDFLEIWDI IQANEFKLKTHHFTHLMLDLFVGGSDTNTTSIEWAMSELVMNPDK MARLKAELKSVAGDEKIVDESAMPKLPYLQAVIKEVMRIHPPGPLLLPRKAESDQEVNGYLIPK GTQILINAYAIGRDPSIWTDPETFDPERFLDNKIDFKGQDYELLPFGSGRRVCPGMPLATRILH MATATLVHNFDWKLEDDSTAAADHAGELFGVAVRRAVPLRI IPIVKS
SEQ ID NO: 43 CYP76AH4 from Rosmarinus officinalis
MDSFPLLVALFFILAATWLISFRRRRNLPPGPFPLPIVGNMLQLGAKPHQTFAQLSKKYGPLMS IHLGSLYTVIVSSPEMAKEILHKHGQVFSGRTIAQAVHACDHDNI SMGFLPVASEWRDMRKICK EQMFSHQSMEASQGLRKQKLQQLLEFAQKCSDSGRAVDIRDAAFITTLNLMSATLFSMQATEFD SEVTMEFKEI IEGVATIVGVPNFADYFPILKPFDPQGVKRRADAYFGKLLAKIEGYFNERIASR KANPNAPKKDDFLETLVDI IEANEYKLKTDHLTHLMLDLFVGGSETNTTSIEWTMSELVMNPEK MAKLKAELKSVAGDEKIVDESEIAKLPYLQAVIKEVMRIHPPGPLLLPRKAESDQEVNGYLIPK GTQVLINAWAIGRDPSVWKNPDSFEPERFLEQKIDFKGQDFELLPFGSGRRVCPGLPLASRILH MTAATLVHNFDWKLEDEATAEADHAGELFGLAVRRAVPLRI IPIVKS
SEQ ID NO: 44 CfCPR
MESTIEKLSPFDLMTAILKGVKLDNSNGSAGVEHPAVIAMLMENKDLVMMLTTSVAVLLGLAVY LVWRRGAGSAKRVVEPPKLVIPKGPVDAEEEDDGKKKVTIFFGTQTGTAEGFAKALAEEAKARY PLTNFKVVDLDDYAADDEEYEEKMKKETFAFFFLATYGDGEPTDNAARFYKWFSEGKERGEIFK NLNYGVFGLGNRQYEHFNKIAIVVDDILLEQGGNRLVPVGLGDDDQCIEDDFSAWRDNVWPELD KLLRDEDDATVATPYTAAVLEYRWFHDQSDELHSENNLANGHANGNASYDAQHPCKVNVAVKR ELHTPLSDRSCTHLEFDISGTGLEYETGDHVGVYCENLIETVEEAERLLGLSPQTFFSVHTDKA DGTPLGGSALPPPFPPCTLRTALSRYADLLNAPKKSALTALAAYASDPSEADRLKHLASPDGKE EYAQYWSGQRSLLEVMADFPSAKPPLGVFFAAIAPRLQPRFYSI SSSPKIAPSRIHVTCALVY EKMPTGRIHKGVCSTWMKNAVPLEESPNCSSAPVFVRTSNFRLPADPKVPVIMIGPGTGLAPFR GFLQERLALKESGAELGPAILFFGCRNSKMDFIYQDELDNFVKAGWSELVLAFSREGPAKEYV QHKMAQKASDVWNMI SEGGYVYVCGDAKGMARDVHRTLHTIVQEQGSLDSSKTESFVKNLQMTG RYLRDVW
SEQ ID NO: 45 cDNA encoding CfCPR
ATGGAATCGACTATTGAGAAGCTTTCGCCCTTCGATTTGATGACTGCGATTCTCAAAGGAGTCA AACTTGATAATTCGAACGGGTCTGCTGGGGTGGAGCATCCGGCTGTGATCGCGATGCTGATGGA GAACAAGGATCTCGTGATGATGCTCACCACCTCCGTCGCGGTGCTTCTAGGACTTGCTGTGTAT CTCGTGTGGCGGCGCGGAGCCGGATCGGCGAAGAGGGTGGTGGAGCCGCCGAAGCTGGTGATTC CCAAGGGCCCGGTGGATGCGGAGGAAGAGGATGATGGGAAGAAGAAGGTTACCATCTTTTTTGG GACGCAGACTGGAACTGCTGAAGGCTTTGCTAAGGCACTTGCCGAAGAAGCTAAAGCAAGATAT CCGCTGACCAACTTTAAAGTAGTTGACTTGGATGATTATGCTGCCGATGATGAAGAGTATGAAG AGAAGATGAAGAAGGAGACCTTTGCATTCTTCTTCTTGGCGACATATGGAGATGGTGAGCCTAC CGACAATGCTGCGAGATTTTACAAGTGGTTTTCCGAGGGGAAAGAGAGAGGTGAGATATTCAAG AATCTCAACTATGGTGTATTTGGTCTTGGAAACAGGCAGTATGAGCATTTCAACAAGATTGCTA TAGTGGTGGATGACATTCTTCTTGAGCAAGGTGGAAATCGGCTTGTCCCTGTGGGTCTTGGAGA TGACGATCAATGTATCGAAGATGATTTCTCAGCATGGCGTGATAATGTGTGGCCTGAGCTGGAT AAGTTGCTCCGTGATGAGGATGATGCAACTGTTGCAACTCCATATACTGCAGCCGTTTTGGAGT ATCGTGTTGTGTTCCATGACCAGTCAGATGAACTGCACTCGGAAAACAACTTAGCCAATGGTCA TGCAAaTGGAAATGCTTCTTATGATGCTCAaCACCCCTGCAAAGTGAATGTTGCTGTAAAAaGG GAGCTACATACTCCTCTATCCGATCGTTCTTGCACTCACTTGGAATTCGACATATCTGGCACTG GATTAGAGTATGAAACAGGGGACCACGTTGGTGTTTACTGTGAGAACTTGATTGAAACTGTAGA GGAAGCAGAAAGGCTTCTTGGTCTTTCTCCACAAACATTCTTTTCAGTTCACACTGATAAAGCG GACGGCACACCACTTGGTGGAAGTGCCTTGCCTCCTCCCTTCCCGCCGTGCACTTTGAGGACAG CGCTAAGTCGATATGCTGATCTTTTGAATGCTCCCAAAAAGTCTGCTTTGACTGCATTGGCTGC TTATGCCTCTGACCCTAGTGAAGCTGATCGGCTCAAGCACCTTGCTTCCCCTGATGGAAAGGAG GAATATGCTCAATATGTGGTTTCTGGTCAGAGAAGCCTACTTGAGGTGATGGCTGACTTCCCAT CTGCCAAGCCTCCTCTTGGTGTTTTCTTTGCTGCAATTGCTCCTCGCTTGCAGCCTCGATTTTA TTCAATCTCATCCTCACCAAAGATTGCACCTTCAAGAATTCACGTCACTTGTGCGTTGGTGTAT GAGAAAATGCCCACTGGACGAATCCACAAGGGTGTCTGCTCAACATGGATGAAGAATGCTGTGC CATTGGAGGAAAGCCCCAACTGCTCTTCAGCACCAGTTTTTGTACGGACCTCAAACTTCAGACT CCCTGCTGATCCTAAAGTACCAGTTATAATGATTGGCCCTGGAACCGGTTTGGCTCCATTCAGG GGTTTTCTTCAGGAAAGATTAGCCCTCAAGGAATCTGGAGCAGAACTTGGTCCTGCTATATTAT TCTTCGGGTGCAGAAACAGTAAAATGGATTTCATTTACCAAGATGAACTGGATAACTTTGTTAA AGCTGGAGTGGTTTCTGAGCTTGTCCTTGCGTTTTCACGCGAGGGTCCTGCTAAGGAATACGTG CAGCATAAGATGGCACAGAAGGCCTCGGATGTGTGGAATATGATATCAGAAGGGGGCTACGTTT ATGTATGTGGTGATGCTAAGGGCATGGCACGTGACGTTCACCGGACTCTTCACACCATTGTTCA AGAACAGGGATCTCTGGACAGCTCGAAAACCGAGAGCTTCGTCAAGAATCTGCAGATGACCGGC CGGTACCTGCGTGACGTGTGGTGA
SEQ ID NO: 46 - MvTPS5
MSITFNLKIAPFSGPGIQRSKETFPATEIQITASTKSTMTTKCSFNASTDFMGKLREKVGGKAD KPPWIHPVDI SSNLCMIDTLQSLGVDRYFQSEINTLLEHTYRLWKEKKK I IFKDVSCCAIAF RLLREKGYQVSSDKLAPFADYRIRDVATILELYRASQARLYEDEHTLEKLHDWSSNLLKQHLLN GSIPDHKLHKQVEYFLKNYHGILDRVAVRRSLDLYNINHHHRIPDVADGFPKEDFLEYSMQDFN ICQAQQQEELHQLQRWYADCRLDTLNYGRDVVRIANFLTSAIFGEPEFSDARLAFAKHI ILVTR IDDFFDHGGSREESYKILDLVQEWKEKPAEEYGSKEVEILFTAVYNTVNDLAEKAHIEQGRCVK PLLIKLWVEILTSFKKELDSWTEETALTLDEYLSSSWVSIGCRICILNSLQYLGIKLSEEMLSS QECTDLCRHVSSVDRLLNDVQTFKKERLENTINSVGLQLAAHKGERAMTEEDAMSKIKEMADYH RRKLMQIVYKEGTVFPRECKDVFLRVCRIGYYLYSSGDEFTSPQQMKEDMKSLVYQPVKIHPLE AINV
Examples
The invention is further illustrated by the following examples, which however, should not be construed as limiting for the invention.
Example 1 Various full length cDNAs were prepared and expressed in a Nicotiana
benthamianal Agxobac\ex\um system as described below. cDNA encoding p19, Cfdxs cDNA of SEQ ID NO:25 and CfGGPPs cDNA of SEQ ID NO:26 were expressed together with the different combinations listed below. The following combinations of cDNAs were expressed:
Experiment 1
cDNA encoding the following enzymes
1 ) CfTPSI of SEQ ID NO:5
2) CfTPS3 of SEQ ID NO:12
3) CfCYP76AH1 1 of SEQ ID:21
were expressed in a Nicotiana benthamianal Agxobac\ex\um system.
Experiment 1 .1
cDNA encoding the following enzymes
1 ) TwTPS7 of SEQ ID NO:4
2) CfTPS3 of SEQ ID NO:12
3) CfCYP76AH1 1 of SEQ ID:21
are expressed in a Nicotiana benthamianai Agrobacterium system.
Experiment 2
cDNA encoding the following enzymes
1 ) CfTPSI of SEQ ID NO:5
2) CfTPS3 of SEQ ID NO:12
3) CfCYP71 D381 of SEQ ID:22
were expressed in a Nicotiana benthamianal Agxobac\ex\um system.
Experiment 2.1
cDNA encoding the following enzymes
1 ) TwTPS7 of SEQ ID NO:4
2) CfTPS3 of SEQ ID NO:12
3) CfCYP71 D381 of SEQ ID:22
are expressed in a Nicotiana benthamianal Agxobac \um system. Experiment 3 cDNA encoding the following enzymes
1 ) CfTPSI of SEQ ID NO:5
2) SsSCS of SEQ ID NO:1 1
3) CfCYP76AH8 of SEQ ID:20
were expressed in a Nicotiana benthamianal Agxobac\ex\um system.
Experiment 4
cDNA encoding the following enzymes
1 ) CfTPSI of SEQ ID NO:5
2) SsSCS of SEQ ID NO:1 1
3) CfCYP76AH1 1 of SEQ ID:21
were expressed in a Nicotiana benthamianal Agxobac\ex\um system.
Experiment 5
cDNA encoding the following enzymes
1 ) CfTPSI of SEQ ID NO:5
2) SsSCS of SEQ ID NO:1 1
3) PsCYP720B4 of SEQ ID:23
were expressed in a Nicotiana benthamianal Agxobac\ex\um system.
Experiment 6
cDNA encoding the following enzymes
1 ) Ossyn-CPP of SEQ ID NO:1
2) SsSCS of SEQ ID NO:1 1
3) CfCYP71 D381 of SEQ ID:22
were expressed in a Nicotiana benthamianal Agxobac\ex\um system.
Experiment 7
cDNA encoding the following enzymes
4) TwTPS7 of SEQ ID NO:4
5) CfTPS3 of SEQ ID NO:12
6) CYP76AH1 of SEQ ID:42
were expressed in a Nicotiana benthamianal Agxobac\ex\um system. Experiment 8
cDNA encoding the following enzymes
1 ) TwTPS7 of SEQ ID NO:4
2) CfTPS3 of SEQ ID NO:12
3) CYP76AH4 of SEQ ID:43
were expressed in a Nicotiana benthamiana/Agrobac\er um system.
The cDNAs described above were cloned into the pCAMBIA130035Su vector.
pCAMBIAI 30035Su containing cDNA encoding the enzymes described above and T- DNA expression plasmid containing the anti-post transcriptional gene silencing protein p19 (35S:p19)(Voinnet, Rivas et al. 2003), were transformed into the AGL-1 - GV3850 Agrobacterium strain by electroporation using a 2mm electroporation cuvette in a Gene Pulser (Bio-Rad; Capacity 25 μΡ; 2.5 kV; 400 Ω). The transformed agrobacteria were subsequently transferred to 1 mL YEP (yeast extract peptone) media and grown for 2-3 hours at 30 °C in YEP media. 200μΙ_ were transferred to YEP-agar solid media containing 35μg/mL rifampicillin, 50μg/mL carbencillin and 50μg/mL kanamycin and grown for 2 days. Multiple colonies were transferred from the plate to 20ml_ YEP media in falcon tube containing 17.5 μg/mL rifampicillin, 25 μg/mL carbencillin and 25 μg/mL kanamycin and grown at 30 °C over night (ON) at 225 rpm. Agrobacteria were spun down by centriguation at 3500xg for 10 min and resuspended in 5ml_ H20. OD600 was measured and H20 was added to reach an OD600=1 . 3ml_ of agrobacteria culture containing the plasmids encoding the enzymes outlined above for each experiment and p19 gene respectively were mixed. Controls only containing cDNAs encoding
CfCYP76AH8 of SEQ ID:20, CfCYP76AH1 1 of SEQ ID:21 , CfCYP71 D381 of SEQ ID:22, or PsCYP720B4 of SEQ ID:23 was mixed similarly. Each mix of agrobacteria cultures were infiltrated into independent 4-6 weeks old N. benthamiana plants. Plants were grown for 7 days in greenhouse before metabolite extraction. Extraction and GC-MS analysis
2 or 3 infiltrated leafs from each N. benthamiana line were chosen and from each of these 1 or 2 leaf disc's (0 = 3cm) were carved out and added to 1 mL n-hexane with 1 ppm 1 -eicosene as internal standard (IS). The 2 or 3 replicates served as
experimental replicates. Extraction was done at RT for 1 hour in an orbital shaker set at 220 rpm. Plant material was spun down and extracts were transferred to new vials. Extracts were analyzed on a Shimadzu GCMS-QP2010 Ultra using an Agilent HP-5MS column (30 m x 0.250 mm i.d., 0.25 μηι film thickness). Injection volume and
temperature was set at 1 μΙ_ and 250 °C. GC program: 50 °C for 2 min, ramp at rate 4 °C min-1 to 1 10 °C, ramp at rate 8 °C min-1 to 250 °C, ramp at rate 10 °C min-1 to 310 Ό and hold for 5 min. Both He and H2 were used as carrier gas and hence the retentions times were normalized with Kovat's retention index using 1 ppm C7 - C30 Saturated Alkanes as reference. Electron impact (Ei) was used as ionization method in the mass spectrometer (MS) with the ion source temperature set to 230 °C and 70 eV. MS spectra's was recorded from 50m/z to 350m/z. Compound identification was done by comparison to authentic standards and comparison to reference spectra databases (Wiley Registry of Mass Spectral Data, 8th Edition, July 2006, John Wiley & Sons, ISBN: 978-0-470-04785-9).
Figure 7 summarises the outcome of experiments 1 and 2. Cytochrome P450 enzymes CfCYP76AH1 1 (top) and CfCYP71 D381 (bottom) catalyzes oxidation of
Dehydroabietadiene (1 .1 ) and Miltiradiene (1 .2) into novel compounds 1 .1 .1 -1 .1 .4 and 1 .2.1 corresponding to hydroxylated or ketonated derivatives of Dehydroabietadiene (1 .1 ) and Miltiradiene (1 .2). As shown in Danish patent application PA 2014 00056 then expression of CfTPSI and CfTPS3 in the presence of GGPP leads to formation of Dehydroabietadiene and Miltiradiene in N. benthamiana. Arrows indicate enzymatic reactions. Compound class identifications given in parenthesis are only based on GC- MS mass spectral analysis and comparison with the Wiley compound library.
Experiment 1 .1 will give the same results as experiment 1 , and experiment 2.1 will give the same results as experiment 2.
Figure 8A shows GC-MS chromatogram of N. benthamiana of experiment 1 as well as mass spectra of compounds 1 .1 .1 & 1 .2.1 shown in figure 7.
Figure 8B shows GC-MS chromatogram of N. benthamiana of experiment 2 as well as mass spectra of compounds 1 .1 .2-1 .1 .4 shown in figure 7.
Figure 9 summarises the outcome of experiments 3, 4 and 5. Cytochrome P450 enzymes CfCYP76AH8 (top panel), CfCYPAH1 1 (middle panel) and PsCYP720B4 (lower panel) catalyzes oxidation of Manool (1 .3) into novel compounds 1 .3.1 -1 .3.10 corresponding to hydroxylated, dihydroxylated or ketonated derivatives of Manool (1 .3). As shown in Danish patent application PA 2014 00056 then expression of CfTPSI and SsSCS in the presence of GGPP leads to formation of Manool in N. benthamiana. Arrows indicate enzymatic reactions. Compound class identifications given in parenthesis are only based on GC-MS mass spectral analysis and comparison with the Wiley compound library.
Figure 10A shows GC-MS chromatogram of N. benthamiana of experiment 3 as well as mass spectra of compounds 1 .3.1 & 1 .3.5 shown in figure 9.
Figure 10B shows GC-MS chromatogram of N. benthamiana of experiment 4 as well as mass spectra of compounds 1 .3.6 & 1 .3.9 shown in figure 9.
Figure 10C shows GC-MS chromatogram of N. benthamiana of experiment 5 as well as mass spectra of compound 1 .3.10 shown in in figure 9.
Figure 1 1 summarises the outcome of experiment 6. Cytochrome P450 enzyme CfCYP71 D381 catalyzes oxidation of syn-manool (1 .4) into novel compounds 1 .4.1 - 1 .4.3 corresponding to a hydroxylated derivatives of syn-manool (1 .4). As shown in Danish patent application PA 2014 00056 then expression of Ossyn-CPP and SsSCS in the presence of GGPP leads to formation of syn-Manool in N. benthamiana. Arrows indicate enzymatic reactions. Compound class identification given in parenthesis is a tentative identification based on GC-MS mass spectral analysis and comparison with the Wiley compound library. Figure 12 shows GC-MS chromatogram of N. benthamiana of experiment 6 as well as mass spectra of compounds 1 .4.1 -1 .4.3 shown in in figure 1 1 .
Co-expression of TwTPS7, CfTPS3 and CYP76AH1 resulted in the formation of ferruginol (experiment 7)(see fig. 20).
Co-expression of TwTPS7, CfTPS3 and CYP76AH4 resulted in the formation of ferruginol (experiment 8)(see fig. 20).
Example 2 Heterologous nucleic acids encoding the proteins described in Table 2 were expressed in S. cerevisiae.
Table 2. CDSs used in this Examp
Figure imgf000139_0001
In this study GGPPS7 from Synechococcus was used. However, another GGPPS
could also have been expressed in yeast, e.g. CfGGPPS of SEQ ID NO:26. The
sequence of GGPPS7 of Synechococcus sp. is available under Gene ID:86553638.
Table 3. List of plasmids used in this Example.
Plasmid Scaffold plasmid CDS and promoter
name
pCYPCC-l p OP196 XI-5 assembler 1 Rv #205 GGPPs7<-pTPIl #219
pCYPCC-3 pROP196 XI-5 assembler 1 Rv #205 GGPPs7<-pPGKl lc
pCYPCC-7 pROP197 XI-5 assembler 3 #-3 C1TPS3 <-#161pTDH3
pCYPCC-9 pVAN858 assembler 2 2c pTEFl->#-5 CfTPSI
pCYPCC-10 pVAN858 assembler 2 2c pTEFl->#-6 OsCPssyn
pCYPCC-18 pROP197 XI-5 assembler 3 #-8 SsSCS <-#161pTDH3
pCYPCC-21 pROP197 XI-5 assembler 3 Res# 236 C1TPS3 co<-#161pTDH3
pCYPCC-25 EPSB585 Rv #238 CfCYP76AH8 co <- 5p-pTDH3/pTEF2 ->Res# 233_Cf(
CO pCYPCC-26 EPSB585 Rv #237 CfCYP76AHll co <- 5p-pTDH3/pTEF2 ->Res#
233_CfCPR_tefU co
pCYPCC-27 EPSB585 #10- CfCYP71D381 <- 5p-pTDH3/pTEF2 ->Res# 233_CfCPR_t pCYPCC-28 EPSB585 #11- PsCYP720B4 <- 5p-pTDH3/pTEF2 ->Res# 233_CfCPR_te pCYPCC-29 EPSB585 Res# 239 CfCYP76AH6 (16) co <- 5p-pTDH3/pTEF2 ->Res#
233_CfCPR_tefU co
pCYPCC-30 p OP196 XI-5 assembler 1 Rv #205 GGPPs7 <- 6p-pTPIl/pPDCl ->Res# 233_CfCPR_tefl pCYPCC-31 pVAN858 assembler 2 Rv #237 CfCYP76AHll co <-3c pgk-pTEF-1 ->Rv #234 CfCYP pCYPCC-34 pVAN858 assembler 2 #10- CfCYP71D381 <-3c pgk-pTEF-1 ->Rv #234 CfCYP76AH8 pCYPCC-35 pVAN858 assembler 2 #11- PsCYP720B4 <-3c pgk-pTEF-1 ->Rv #234 CfCYP76AH8 a pCYPCC-37 pVAN858 assembler 2 Rv #238 CfCYP76AH8 co<-pPGKl lc
pCYPCC-38 pVAN858 assembler 2 Rv #237 CfCYP76AHll co<-pPGKl lc
pCYPCC-39 pVAN858 assembler 2 #10- CfCYP71D381<-pPGKl lc
pCYPCC-40 pVAN858 assembler 2 #11- PsCYP720B4<-pPGKl lc
pCYPCC-41 pVAN858 assembler 2 Res# 239 CfCYP76AH6 (16) co<-pPGKl lc
pCYPCC-42 pVAN858 assembler 2 Resl60 pTEF-2 ->CfTPSl, co
pCYPCC-44 pVAN858 assembler 2 Resl60 pTEF-2 ->OsCPssyn
pCYPCC-45 pROP197 XI-5 assembler 3 Res# 236 CfTPS3 truncated co <- 5p-pTDH3/pTEF2 ->CfTPSl pCYPCC-46 pROP197 XI-5 assembler 3 Res# 236 CfTPS3 truncated co <- 5p-pTDH3/pTEF2 ->Twl4, i pCYPCC-47 pROP197 XI-5 assembler 3 Res# 236 CfTPS3 truncated co <- 5p-pTDH3/pTEF2 ->OssynC pCYPCC-48 pROP197 XI-5 assembler 3 SsSCS, trun, co <- 5p-pTDH3/pTEF2 ->CfTPSl, trun co pCYPCC-50 pROP197 XI-5 assembler 3 SsSCS, trun, co <- 5p-pTDH3/pTEF2 ->OssynCPP
pCYPCC-51 pROP197 XI-5 assembler 3 SsSCS, co<-#161pTDH3
All diTPS enzymes were truncated to remove putative plastid targeting sequence (see
CDS sequences referred to in Table 2)
DNA sequences encoding enzymes of interest were in general codon optimized for
expression in S. cerevisiae. Codon optimization for expression in Saccharomyzes
cerevisae was performed using the Geneart service from LifeTechnologies.
DNA fragments encoding the enzymes of interest were USER cloned into the pre- digested plasmid backbones. All plasmids constructed and used in this study are
summarized in table 3. DNA fragments of interest were liberated from plasmids by NotI enzyme-digestion as linear DNA fragments suitable for yeast transformation. The
plasmids are designed to accommodate integration of up to three Notl-digested
fragments at the same site in the genome.
Table 4. Strains used and generated in this Example
Name Genotype EFSC4416 MATa ΗΟ::ΚΟ HIS3 LEU2 Aura3
IT-4, T-5 MATa ΗΟ::ΚΟ HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI, pTEF1 ->CfTPS1 trun, SsSCS trun <- TDH3, DR-KIURA3-DR
ΙΤ-4, Τ-5, MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI, pTEF1 ->CfTPS1 uracil loop trun, SsSCS trun <- TDH3,
out
ΙΤ-5, Τ-6 MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pPGK, pTEF1 - >OssynCPP trun, CfTPS3 trun Codon Optimized <- TDH3, DR-KIURA3- DR
ΙΤ-5, Τ-6, MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pPGK, pTEF1 - uracil loop >OssynCPP trun, CfTPS3 trun Codon Optimized <- TDH3,
out
ΙΤ-7, Τ-1 MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI, pTEF1 ->CfTPS1 trun codon optimized, CfTPS3 trun codon optimized <- TDH3,DR- KIURA3-DR
ΙΤ-7, Τ-2 MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI, pTEF1 - >OsCPSsyn, CfTPS3 trun codon optimized <- TDH3,DR-KIURA3-DR
ΙΤ-7, Τ-4 MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI, pTEF1 ->CfTPS1 trun codon optimized, SsSCS trun codon optimized <- TDH3,DR- KIURA3-DR
ΙΤ-7, Τ-9 MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI-pPDC -> CfCPR;
CfTPS3 truncated co <- 5p-pTDH3/pTEF2 ->CfTPS1 , trun co; PsCYP720B4 <-3c pgk-pTEF-1 ->Rv #234 CfCYP76AH8 co,DR-KIURA3- DR
ΙΤ-7, Τ-13 MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI-pPDC -> CfCPR;
CfTPS3 truncated co <- 5p-pTDH3/pTEF2 ->CfTPS1 , trun co; PSCYP720B4 <- pPGK1 ,DR-KIURA3-DR
ΙΤ-7, Τ-31 MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI-pPDC -> CfCPR;
SsSCS truncated co <- 5p-pTDH3/pTEF2 ->CfTPS1 , trun co; Rv #237 CfCYP76AH1 1 co <-3c pgk-pTEF-1 ->Rv #234 CfCYP76AH8 co,DR- KIURA3-DR
ΙΤ-7, Τ-32 MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI-pPDC -> CfCPR;
SsSCS truncated co <- 5p-pTDH3/pTEF2 ->CfTPS1 , trun co; CfCYP71 D381 <-3c pgk-pTEF-1 ->Rv #234 CfCYP76AH8 co,DR- KIURA3-DR
ΙΤ-7, Τ-33 MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI-pPDC -> CfCPR;
SsSCS truncated co <- 5p-pTDH3/pTEF2 ->CfTPS1 , trun co; PsCYP720B4 <-3c pgk-pTEF-1 ->Rv #234 CfCYP76AH8 co,DR-KIURA3- DR
ΙΤ-7, Τ-34 MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI-pPDC -> CfCPR;
SsSCS truncated co <- 5p-pTDH3/pTEF2 ->CfTPS1 , trun co; Rv #238 CfCYP76AH8 co<-pPGK1 1 c ,DR-KIURA3-DR
ΙΤ-7, Τ-37 MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI-pPDC -> CfCPR;
SsSCS truncated co <- 5p-pTDH3/pTEF2 ->CfTPS1 , trun
co;PsCYP720B4 <- pPGK1 ,DR-KIURA3-DR
ΙΤ-7, Τ-50 MATa HO::KO HIS3 LEU2 Chr XI-5::GGPPS7<-pTPI, pTEF1 ->CfTPS1 trun,SsSCS trun <- TDH3, Chr X-3:: CfCYP76AH8 co <- 5p- pTDH3/pTEF2 -> CfCPR tefU co,DR-KIURA3-DR IT-7, T-58 MATa ΗΟ::ΚΟ HIS3 LEU2 Chr XI-5::GGPPS7<-pPGK, pTEF1 - >OssynCPP trun, CfTPS3 trun Codon Optimized <- TDH3. Chr X-3:: PsCYP720B4 <- 5p-pTDH3/pTEF2 ->Res# 233_CfCPFt_tefU co; DR- KIURA3-DR
All strains were grown in 96 deep well plates as follows. Single colonies were inoculated in 500 μΙ selective (no Uracil) Yeast-Synthetic Complete Medium in 2.2 ml 96 deep well plates and grown o/n @ 30°C, 400 RPM. The following day 50 μΙ of the o/n culture was used as inoculum in 500 μΙ Yeast Synthetic Complete-Medium. These cultures were grown for 72 hours @ 30°C, 400 RPM.
Metabolite extraction GC-MS analysis
Metabolites were extracted from broth (500 ul) by addition of 500 μΙ 99 % Ethanol UV- grade followed by 20 minutes incubation @ 60 degrees. Samples were cleared by centrifugation @ 3.000 g, 10 min and 800 μΙ of the supernatant was transferred to clean GC/LC vial. For GC analysis, non-polar metabolites were extracted from the ethanol extract by partitioning with 500 ul hexane. After extraction, the solvent was transferred into new 1 .5-mL glass vials and stored at -20 °C until GC-MS analysis. For LC-analysis, the cleared ethanol extract were stored at -20 °C until analysis and applied without further modification.
GC-MS analysis
One microliter of hexane extract was injected into a Shimadzu GC-MS-QP2010 Ultra. Separation was carried out using an Agilent HP-5MS column (20m 0.180mm i.d.,
0.18μηι film thickness) with purge flow of 4 mL min"1 for 1 min, using H2 as carrier gas. The GC temperature program was 60 °C for 1 min, ramp at rate 30 °C min"1 to 180°C, ramp at rate 10°C min"1 to 250 °C, ramp at rate 30 °C min"1 to 320 °C, and hold for 3 min. Injection temperature was set at 250 °C in splitless mode. Column flow and pressure was set to 5. mL min"1 and 66.7 kPa yielding a linear velocity of 66.5 cm s~1. Ion source and transfer line for mass spectrometer (MS) was set to 300 °C and 280 °C respectively. MS was set in scan mode from m/z 50 to m/z 350 with a scan width of 0.5s. Solvent cutoff was 4 min.
LC-MS analysis
Analytical LC-MS was carried out using an Agilent 1 100 Series LC (Agilent
Technologies, Germany) coupled to a Bruker HCT-Ultra ion trap mass spectrometer (Bruker Daltonics, Bremen, Germany). A Zorbax SB-C18 column (Agilent; 1 .8 μηι, 2.1 x 50 mm) maintained at 35°C was used for separation. The mobile phases were: A, water with 0.1 % (v/v) HCOOH and 50mM NaCI; B, acetonitrile with 0.1 % (v/v) HCOOH. The gradient program was: 0 to 1 min, isocratic 50% B; 1 to 10 min, linear gradient 50 to 95% B; 10 to 1 1 .4 min, isocratic 98% B; 1 1 .4 to 17 min, isocratic 50% B. The flow rate was 0.2 mL min"1. The mass spectrometer was run in alternating positive/negative mode and the range m/z 100-800 was acquired
Results
The yeast strain IT- 7 T13 expressing the diterpene synthases CfTPSI and CfTPS3 as well as the cytochrome P450 PsCYP720B4 produces oxidized miltiradiene , which tentatively was identified as miltiradienic acid (see figure 13). LC-MS analysis of ethanol extracts generated from media as well as cell material of yeast strain IT-7 T13 shows that the extracted ion m/z matches the predicted mass of miltiradiene carrying a single carboxylic acid group (303 m/z), compound 1 .2.1 . Compound 1 .2.1 was not produced by a control yeast strain expressing CfTPSI and CfTPS, but not PsCYP720B4. Formation of a carboxylic acid group is consistent with previously reported activity of PsCYP720B4 (Hamberger et al, 201 1 ). 1 .2.1 was tentatively identified as miltiradienic acid based on the mass to charge ratio, fragmentation pattern and retention time. The 257 fragment is a signature ion of labdane-type diterpenes.
The yeast strain IT7-T50 expressing CfTPSI , SsSCS and CfCYP76AH8 produces various oxidised manool as shown in figure 14.
Extracted ion chromatograms recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material of yeast strain IT7-T50, shows that the extracted ion trace combine the ions corresponding to the mass of manool (273 m/z), monohydroxy-manool (289 m/z, hydrogen adduct of water loss ion), dihydroxymanool (347 m/z) and keto-hydroxy-manool (345 m/z). Thus, IT7-T50 produces keto-hydroxy mannol (1 .3.1 1 ), dihydroxy manool (1 .3.12) and hydroxyl manool (1 .3.13).
The yeast strain IT7-T37 expressing CfTPSI , SsSCS, PsCYP720B4 produces various oxidised manool as shown in figure 15.
Extracted ion chromatograms recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material of yeast strain IT7-T37, shows that the extracted ion trace combine the ions corresponding to the mass of monohydroxy- manool (329 m/z, sodium adduct), dihydroxymanool (345 m/z, sodium adduct) and keto-hydroxy-manool or a carboxylic acid of manool (343 m/z, sodium adduct). Thus, IT7-T37 produces the oxidized compounds 1 .3.12-1 .3.16.
The yeast strain IT7-T58 expressing OssynCPP, CfTPS3, and PsCYP720B4 produces various oxidised isopimaradienes as shown in figure 16.
Extracted ion chromatograms recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material of IT7-T58, shows that the extracted ion trace combine the ions corresponding to the mass of syn-pimara-9,(1 1 ),15-diene or similar C2oH32 diterpene oxidized to contain either one keto- and one hydroxyl group or a single carboxylic acid group (303 m/z). Formation of a carboxylic acid group would be consistent with previously reported activity of PsCYP720B4 (Hamberger et al, 201 1 ). The 257 m/z fragment observed in all compounds 1 .5.1 -1 .5.5 is a signature ion of diterpenes. This ion is observed as 253 m/z in 1 .5.6 possibly indicative of introduction of two double bonds in the structure. (OssynCPP, CfTPS3, PsCYP720B4, strain name IT7-T58)
Cytochrome P450 enzyme pairs can catalyse production of higher oxidised diterpenes. CYP76AH8, CYP76AH1 1 and CYP76AH8, CYP720B4 catalyse oxidation of Manool into novel compounds 1 .3.14-1 .3.18 tentatively identified as hydroxylated,
dihydroxylated or hydroxylated and ketonated derivatives of manool based on their parental masses.
The yeast strain IT7-T32 expressing CfTPSI , SsSCS, CfCYP76AH8, CfCYP71 D381 produces various oxidised manool as shown in figure 17.
Extracted ion chromatograms recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material of IT7-T32, shows that the extracted ion trace combine the ions corresponding to the mass of monohydroxy-manool (289 m/z = hydrogen adduct of water loss ion) and dihydroxymanool (345 m/z = sodium adduct, 305 = hydrogen adduct of water loss ion). Metabolites 1 .3.14 and 1 .3.15 were tentatively identified as dihydroxy manool derivatives arising from oxidation of monohydroxy manool by CfCYP71 D381 . 1 .3.13 has a characteristic parental mass matching a monohydroxylated manool.
The yeast strain IT7-T31 expressing CfTPSI , SsSCS, CfCYP76AH8, CfCYP76AH1 1 produces oxidised manool as shown in figure 18. Extracted ion chromatograms recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material of IT7-T31 , shows that the extracted ion trace combine the ions corresponding to the mass of monohydroxy-manool (289 m/z = hydrogen adduct of water loss ion) and dihydroxymanool (345 m/z = sodium adduct, 305 = hydrogen adduct of water loss ion). Metabolite 1 .3.16 was tentavely identified as a unique dihydroxy manool variant. 1 .3.17 and 1 .3.1 1 correspond to monohydroxylated manool derivatives as evidenced by the identical mass spectra. However, 1 .3.17 and 1 .3.13 are indeed different metabolites as evidenced by the shift in retention time. The yeast strain IT7-T9 expressing CfTPSI , SsSCS, CfCYP76AH8, PsCYP720B4 produces oxidized manool as shown in figure 19. Extracted ion chromatograms recorded by LC-MS analysis of ethanol extracts generated from media as well as cell material of IT7-T9 shows that the extracted ion traces combine the ions, 299 and 339 m/z, corresponding to an unknown derivative, observed only in the strain expressing CfCYP76AH8 and PsCYP720B4. Based on the fragmentation pattern, the metabolite 1 .3.18 could potentially represent a dioxidized manool derivative with three additional degrees of desaturation, i.e. double bonds.

Claims

Claims
A method of producing a diterpenoid, said method comprising the steps of: a) providing one or more host organisms, wherein
I. at least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class II,
II. at least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class I,
III. at least one of the host organisms comprises a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species; b) Incubating said one or more host organisms either simultaneously or
sequentially under conditions allowing growth of said host organisms, wherein at least the host organism comprising a heterologous nucleic acid encoding a diTPS of class II is incubated in the presence of geranylgeranyl pyrophosphate (GGPP); c) Optionally isolating diterpenoid.
The method according to claim 1 , wherein step a) comprises providing one host organism comprising:
I. a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
III. a heterologous nucleic acid encoding a cytochrome P450 (CYP)
The method according to claim 1 , wherein step a) comprises providing one host organism comprising:
a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
III. at least two heterologous nucleic acids each encoding a different
cytochrome P450 (CYP) The method according to claim 1 , wherein step a) comprises providing two host organism, wherein the first host organism comprises:
I. a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
and the second host organism comprises:
III. a heterologous nucleic acid encoding a cytochrome P450 (CYP)
The method according to claim 1 , wherein step a) comprises providing two host organism, wherein the first host organism comprises:
IV. a heterologous nucleic acid encoding a diTPS of class II,
V. a heterologous nucleic acid encoding a diTPS of class I,
and the second host organism comprises
VI. at least two heterologous nucleic acids each encoding a different
cytochrome P450 (CYP)
The method according to any one of claims 4 to 5, wherein step b) comprises incubating said two host organisms simultaneously in cultivation medium under conditions allowing growth of said host organisms in the presence of
geranylgeranyl pyrophosphate.
The method according to any one of claims 4 to 5, wherein step b) comprises the steps of b1 ) incubating the first host organism under conditions allowing growth of said host organism in the presence of geranylgeranyl pyrophosphate, thereby producing a diterpene
b2) incubating the second host organism under conditions allowing growth of said host organism in the presence of the diterpene produced in step b1 )
The method according to claim 1 , wherein step a) comprises providing three more host organisms, wherein the first host organism comprises:
a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
and the second host organism comprises
III. a heterologous nucleic acids encoding a cytochrome P450 (CYP) and the third and optionally further host organisms each comprises IV. a heterologous nucleic acids encoding a cytochrome P450 (CYP) different to the CYP under III .
9. The method according to claim 8, wherein step b) comprises incubating said three or more host organisms simultaneously in cultivation medium under conditions allowing growth of said host organisms in the presence of geranylgeranyl pyrophosphate.
10. The method according to claim 8, wherein step b) comprises the steps of b1 ) incubating the first host organism under conditions allowing growth of said host organism in the presence of geranylgeranyl pyrophosphate, thereby producing a diterpene
b2) incubating the second host organism under conditions allowing growth of said host organism in the presence of the diterpene produced in step b1 ), thereby producing an intermediate diterpenoid
b3) incubating the third host organism under conditions allowing growth of said host organism in the presence of intermediate diterpenoid produced in step b2), thereby producing a diterpenoid.
The method according to any one of the preceding claims, wherein the diterpenoid is a C20-molecule substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH.
The method according to any one of the preceding claims, wherein the diterpenoid is a C20-molecule containing a decalin core and one or more substitutents selected from the group consisting of -OH, =0 and/or -COOH.
The method according to any one of the preceding claims, wherein the diterpenoid is a C20-molecule containing a core structure of one of following formulas I, II, III,
, X
Figure imgf000148_0001
Figure imgf000149_0001
or (XXIV)
14. The method according to claim 13, wherein diterpenoid is a C20-molecule
containing a cores structure of formulas I, II, III, IV, V, VI, IX, X, XXII, XXIII or XXIV substituted at one or more positions by one or more groups selected from the group consisting of:
a) alkyl, such as d-e-alkyl, for example Ci-3, wherein said alkyl may be linear or branched, for example alkyl may be isopropyl or methyl, wherein said alkyl may be substituted with one or more groups selected from the group consisting of -OH, =0 and -COOH;
b) alkenyl, such as Ci-6 alkenyl, such as C2-4-alkenyl, such as C2-3-alkenyl, wherein said alkyl may be substituted with one or more groups selected from the group consisting of -OH, =0 and -COOH;
c) -OH;
d) =0; and
e) -COOH.
The method according to any one of claims 1 to 10, wherein the diterpenoid is a diterpene, which has been substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH, wherein said diterpene is the product of any of the reactions VII to XIX described herein above. The method according to any one of claims 1 to 10, wherein the diterpenoid is a diterpene, which has been substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH, wherein said diterpene is the product of any of the reactions XX, XXII, XXIII, XXIV, XXV, XXVII, XXIX, Lll, LIN, LIV, LV or LVI described herein above.
17. The method according to any one of claims 1 to 10, wherein the diterpenoid is a diterpene, which has been substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH, wherein the diterpene is any of the compounds 1 to 47 of Table 1 .
18. The method according to any one of claims 1 to 10, wherein the diterpenoid is a diterpene substituted at one or more positions with -OH, =0 and/or -COOH, wherein the diterpene is a C20-molecule consisting of 20 carbon atoms, up to three oxygen atoms and hydrogen atoms, and which contains a core structure of any of formulas I, II, III, IV, VI, IX, X, XII, XLII, XLIII, XLIV, XLV, XLVI, XXVII, XXVIII, XXIX, XXX, XXXI, XXXII, XXXIII, XXXIV, XXXV, XXXVI, XXXVII, XXXVIII, XXXIX, XL XLI and/or XLVI.
19. The method according to any one of claims 1 to 10, wherein the diterpenoid is
dehydroabietadiene substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH.
20. The method according to any one of claims 1 to 10, wherein the diterpenoid is
miltiradiene substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH.
21 . The method according to any one of claims 1 to 10, wherein the diterpenoid is
manool substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH.
22. The method according to any one of claims 1 to 10, wherein the diterpenoid is syn- manool substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH.
23. The method according to any one of claims 1 to 10, wherein the diterpenoid is syn- pimara-9,(1 1 ),15-diene substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH.
24. The method according to any one of claims 1 to 10, wherein the diterpenoid is syn- isopimara-7,15-diene substituted at one or more positions with substituent(s) selected from the group consisting of -OH, =0 and -COOH.
25. The method according to any one of claims 1 to 10, wherein the diterpenoid is ferruginol.
26. A host organism comprising
I. a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
III. a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species.
27. The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH1 1 of SEQ ID NO:21 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
28. The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids: a heterologous nucleic acid encoding TwTPS7 of SEQ ID NO:4 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
a heterologous nucleic acid encoding CYP76AH1 1 of SEQ ID NO:21 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
29. The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP71 D381 of SEQ ID NO:22 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
30. The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding TwTPS7 of SEQ ID NO:4 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
a heterologous nucleic acid encoding CYP71 D381 of SEQ ID NO:22 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
31 . The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP720B4 of SEQ ID NO:23 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
32. The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding Ossyn-CPP of SEQ ID NO:1 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding CfTPS3 of SEQ ID NO:12 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; c) a heterologous nucleic acid encoding PsCYP720B4 of SEQ ID NO:23 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
33. The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH8 of SEQ ID NO:20 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
34. The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH1 1 of SEQ ID NO:21 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
35. The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP720B4 of SEQ ID NO:23 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
36. The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding Ossyn-CPP of SEQ ID NO:1 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP71 D381 of SEQ ID NO:22 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
37. The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH8 of SEQ ID NO:20 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
d) a heterologous nucleic acid encoding CYP71 D381 of SEQ ID NO:22 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
38. The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids:
a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH8 of SEQ ID NO:20 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
d) a heterologous nucleic acid encoding CYP76AH1 1 of SEQ ID NO:21 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith.
39. The host organism according to claim 26, wherein the host organism comprises at least the following heterologous nucleic acids: a) a heterologous nucleic acid encoding CfTPSI of SEQ ID NO:5 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
b) a heterologous nucleic acid encoding SsSCS of SEQ ID NO:1 1 or a functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith;
c) a heterologous nucleic acid encoding CYP76AH8 of SEQ ID NO:20 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% sequence identity therewith; and
d) a heterologous nucleic acid encoding CYP720B4 of SEQ ID NO:23 or a
functional homologue thereof sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least
98%, such as at least 99% sequence identity therewith.
40. The method according to any one of claims 1 to 25, wherein the host organism is a host organism according to any one of claims 26 to 39.
41 . A collection of host organisms comprising at least two different host organisms, wherein
I. at least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class II,
II. at least one of the host organisms comprises a heterologous nucleic acid encoding a diTPS of class I,
III. at least one of the host organisms comprises a heterologous nucleic acid encoding a cytochrome P450 (CYP) with the proviso that said diTPS of class II, said diTPS of class I and said CYP are not all from the same species.
42. The collection of host organisms according to claim 41 , said collection comprising two host organisms, wherein the first host organism comprises:
I. a heterologous nucleic acid encoding a diTPS of class II, II. a heterologous nucleic acid encoding a diTPS of class I,
and the second host organism comprises:
III. a heterologous nucleic acid encoding a cytochrome P450 (CYP)
The collection of host organisms according to claim 41 , said collection comprising two host organism, wherein the first host organism comprises:
I. a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
and the second host organism comprises
III. at least two heterologous nucleic acids each encoding a different
cytochrome P450 (CYP)
The collection of host organisms according to claim 41 , said collection comprising three or more host organisms, wherein the first host organism comprises:
I. a heterologous nucleic acid encoding a diTPS of class II,
II. a heterologous nucleic acid encoding a diTPS of class I,
and the second host organism comprises
III. a heterologous nucleic acids encoding a cytochrome P450 (CYP) and the third and optionally further host organisms each comprises
IV. a heterologous nucleic acids encoding a cytochrome P450 (CYP)
different to the CYP under III.
The method according to any one of claims 1 to 25, the host organism according to claim 26 or the collection of host organism according to any one of claims 40 to 44, wherein the diTPS of class II is a polypeptide sharing at least 30%, such as at least 35% sequence identity to the sequence of SsLPPS (SEQ ID NO:6) or to the sequence of AtCPS (SEQ ID NO:18).
The method, the host organism or the collection of host organisms according to claim 45, wherein the said diTPS of class II contains the following motif of four amino acids:
D/E-X-D-D, wherein X may be any amino acid .
47. The method according to any one of claims 1 to 25, the host organism according to claim 26 or the collection of host organism according to any one of claims 40 to 44, wherein the diTPS of class II is selected from the group consisting of syn-CPP type diTPS, ent-CPP type diTPS, (+)-CPP type diTPS, LPP type diTPS and LPP like type diTPS.
48. The method according to any one of claims 1 to 25, the host organism according to claim 26 or the collection of host organism according to any one of claims 40 to 44, wherein the diTPS of class I is a polypeptide sharing at least 30%, such as at least 35% sequence identity to the sequence of ScSCS (SEQ ID NO:1 1 ) or to the sequence of AtEKS (SEQ ID NO: 20).
49. The method, the host organism or the collection of host organisms according to claim 48, wherein the said diTPS of class I contains the following motif of five amino acids: D-D-X-X-D/E, wherein X indicates any amino acids.
50. The method according to any one of claims 1 to 25, the host organism according to claim 26 or the collection of host organism according to any one of claims 40 to 44, wherein the diTPS of class I is selected from the group consisting of EpTPS8 like diTPS, EpTPS23 like diTPS, SsSCS like diTPS, CfTPS3 like diTPS, CfTPS4 like diTPS, TwTPS2 like diTPS, EpTPSI like diTPS and CfTPS14 like diTPS.
51 . The method according to any one of claims 1 to 25, or the host organism
according to claim 26 or the collection of host organisms according to any one of claims 40 to 44, wherein the diTPS of class II is a polypeptide selected from the group consisting of:
a) EpTPS7 of SEQ ID NO:2
b) TwTPS7 of SEQ ID NO:4;
c) CfTPSI of SEQ ID NO:5;
d) TwTPS21 of SEQ ID NO:7;
e) TwTPS14/28 of SEQ ID NO:8;
f) syn-CPP of SEQ ID NO:1 ;
g) ZmAN2 of SEQ ID NO:3;
h) SsLPPS of SEQ ID NO:6;
i) CfTPS2 of SEQ ID NO:17;and j) functional homologues of the polypeptides of a) to i) sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95% sequence identity therewith.
52. The method according to any one of claims 1 to 25, the host organism according to claim 26 or the collection of host organism according to any one of claims 40 to 44, wherein the diTPS of class II is an enzyme capable of catalysing any of the reactions I to V.
53. The method according to any one of claims 1 to 25, the host organism according to claim 26 or the collection of host organism according to any one of claims 40 to 44, wherein the diTPS of class II is an enzyme capable of catalysing any of the reactions shown in figures 3A or 3B.
54. The method according to any one of claims 1 to 25, the host organism according to claim 26 or the collection of host organisms according to any one of claims 40 to 44, wherein the diTPS of class I is a polypeptide selected from the group consisting of:
a) EpTPS8 of SEQ ID NO:9;
b) EpTPS23 of SEQ ID NO:10;
c) TwTPS2 of SEQ ID NO:14;
d) EpTPSI of SEQ ID NO:15;
e) CfTPS14 of SEQ ID NO:16;
f) SsSCS of SEQ ID NO:1 1 ;
g) CfTPS12 of SEQ ID NO:12;
h) CfTPS4 of SEQ ID NO:13;
i) MvTPS5 of SEQ ID NO:r46;
j) functional homologues of the polypeptides of a) to i) sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95% sequence identity therewith.
55. The method according to any one of claims 1 to 25, or the host organism according to claim 26 or the collection of host organisms according to any one of claims 40 to 44, wherein the diTPS of class I is an enzyme capable of catalysing any of the reactions VII to XIX.
56. The method according to any one of claims 1 to 25, or the host organism according to claim 26 or the collection of host organisms according to any one of claims 40 to 44, wherein the diTPS of class I is an enzyme capable of catalyzing at least one of the reactions Lll, LIN, LIV, LV, LVI and X.
57. The method, the host organism or the collection of host organisms according to any one of the preceding claims, wherein the CYP is an enzyme capable of catalyzing one or more of the following reactions:
Hydroxylation
Oxidation leading to a formation of a carbonyl groups
Oxidation leading to formation of a carboxyl group.
58. The method, the host organism or the collection of host organisms according to any one of claims 1 to 57, wherein the CYP is a polypeptide comprising the following motif of 5 amino acids:
A/G-G-X-X-T/S, wherein X indicates any amino acids.
59. The method, the host organism or the collection of host organisms according to any one of claims 1 to 58, wherein the CYP is a polypeptide comprising the following motif of 4 amino acids:
E-X-X-R.
The method, the host organism or the collection of host organisms according to any one of claims 1 to 59, wherein the CYP is a polypeptide comprising the following motif of 10 amino acids: F-X-X-G-X-X-X-C-X-G.
61 . The method, the host organism or the collection of host organisms according to any one of claims 1 to 60, wherein the CYP is a polypeptide comprising the following motif of 3 amino acids: P-F-G.
62. The method, the host organism or the collection of host organisms according to any one of the preceding claims, wherein the CYP is an enzyme capable of catalysing hydroxylation of a diterpene to form hydroxyl-diterpene.
63. The method, the host organism or the collection of host organisms according to any one of the preceding claims, wherein the CYP is an enzyme capable of catalysing hydroxylation of a hydroxyl-diterpene to form dihydroxy-diterpene and/or diterpene ketone and/or diterpene aldehyde.
64. The method, the host organism or the collection of host organisms according to any one of the preceding claims, wherein the CYP is an enzyme capable of catalysing hydroxylation of a dihydroxy-diterpene to form trihydroxy-diterpene and/or diterpene carboxylic acid.
65. The method, the host organism or the collection of host organisms according to any one of the preceding claims, wherein the CYP is an enzyme capable of catalysing hydroxylation of a diterpene ketone to form hydroxyl diterpene ketone or diterpene carboxylic acid.
66. The method, the host organism or the collection of host organisms according to any one of the preceding claims, wherein the CYP is an enzyme capable of catalysing hydroxylation of a diterpene carboxylic acid to form hydroxy-diterpene carboxylic acid.
67. The method, the host organism or the collection of host organisms according to any one of the claims 62 to 66, wherein the diterpene is a C20-molecule consisting of 20 carbon atoms, up to three oxygen atoms and hydrogen atoms containing a core structure of one of following formulas I, II, III, IV, V, VI, IX, X, XXII, XXIII or XXIV.
68. The method, the host organism or the collection of host organisms according to any one of the claims 62 to 66, wherein the diterpene is a be a C20-molecule consisting of 20 carbon atoms, up to three oxygen atoms and hydrogen atoms, and which contains a core structure of any of formulas I, II, III, IV, VI, IX, X, XII, XLII, XLIII, XLIV, XLV, XLVI, XXVII, XXVIII, XXIX, XXX, XXXI, XXXII, XXXIII, XXXIV, XXXV, XXXVI, XXXVII, XXXVIII, XXXIX, XL, XLI and/or XLVI.
69. The method, the host organism or the collection of host organisms according to any one of the claims 62 to 66, wherein diterpene is selected from the group of diterpene listed in Table 1 .
70. The method, the host organism or the collection of host organisms according to any one of claims 1 to 69, wherein the CYP is an enzyme capable of catalysing hydroxylation of dehydroabietadiene.
71 . The method, the host organism or the collection of host organisms according to any one of claims 1 to 69, wherein the CYP is an enzyme capable of catalysing hydroxylation of miltiradiene.
72. The method, the host organism or the collection of host organisms according to any one of claims 1 to 69, wherein the CYP is an enzyme capable of catalysing hydroxylation of manool to form hydroxyl manool. 73. The method, the host organism or the collection of host organisms according to any one of claims 1 to 69, wherein the CYP is an enzyme capable of catalysing hydroxylation of hydroxyl manool, to form dihydoxy manool.
74. The method, the host organism or the collection of host organisms according to any one of claims 1 to 69, wherein the CYP is an enzyme capable of catalysing hydroxylation of 1 ,3 manool.
75. The method, the host organism or the collection of host organisms according to any one of claims 1 to 69, wherein the CYP is an enzyme capable of catalysing carboxylation of a diterpene.
76. The method, the host organism or the collection of host organisms according to any one of claims 1 to 75, wherein the CYP is a CYP of the family 76. The method, the host organism or the collection of host organisms according to claim 76, wherein the CYP is a polypeptide sharing at least 40% sequence identity with the polypeptide of SEQ ID NO:20 and/or the polypeptide of SEQ ID NO:21 . 78 The method, the host organism or the collection of host organisms according to claim 76, wherein the CYP is a polypeptide sharing at least 40% sequence identity with the polypeptide of SEQ ID NO:24.
79
The method, the host organism or the collection of host organisms according to any one of claims 76 to 78, wherein the CYP is an enzyme capable of catalysing at least one of reactions XXXIX, XL, XLI, XLV, XLVI, XLVII or LI.
80. The method, the host organism or the collection of host organisms according to any one of claims 1 to 75, wherein the CYP is a CYP of the family 71 .
81 . The method, the host organism or the collection of host organisms according to claim 80, wherein the CYP is a polypeptide sharing at least 40% sequence identity with the polypeptide of SEQ ID NO:22.
82. The method, the host organism or the collection of host organisms according to any one of claims 80 to 81 , wherein the CYP is an enzyme capable of catalysing at least one of reactions XXXIX, XLII orLI.
83. The method, the host organism or the collection of host organisms according to any one of claims 1 to 75 wherein the CYP is a CYP of the family 720. 84. The method, the host organism or the collection of host organisms according to claim 83, wherein the CYP is a polypeptide sharing at least 40% sequence identity with the polypeptide of SEQ ID NO:23.
85. The method, the host organism or the collection of host organisms according to any one of claims 83 to 84, wherein the CYP is an enzyme capable of catalysing at least one of reactions XLI, XLIII, XLIV, XLVIII, XLIX, L, LI, or Lll.
86. The method, the host organism or the collection of host organisms according to any one of claims 1 to 85, wherein the CYP is a polypeptide selected from the group consisting of:
a) CYP76AH8 of SEQ ID NO:20;
b) CYP76AH1 1 of SEQ ID NO:21 ;
c) CYP71 D381 of SEQ ID NO:22;
d) CYP720B4 of SEQ ID NO:23;
e) CYP76AH1 of SEQ ID NO:42;
?5 CYP76AH4 of SEQ ID NO:43;
g) CYP76AH15 of SEQ ID NO:40;
h) CYP76AH17 of SEQ ID NO:41 ;
and
i) functional homologues of the polypeptides of a) to h) sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95% sequence identity therewith.
87. The method according to any one of the preceding claims, wherein step b) involves incubating the host organism in a cultivation medium.
88. The method, the host organism or the collection of host organisms according to any one of the preceding claims, wherein the host organism further comprises one or more heterologous nucleic acids encoding enzymes involved in the biosynthesis of GGPP.
89. The method, the host organism or the collection of host organisms according to any one of the preceding claims, wherein said enzymes is selected from the group consisting of CfDXS of SEQ ID NO:25, CfGGPPS of SEQ ID NO:26 and functional homolgoues of any of the aforementioned sharing at least 70% sequence identity therewith.
90. The method, the host organism or the collection of host organisms according to any one of the preceding claims, wherein host organism further comprises a
heterologous nucleic acid encoding a cytochrome P450 reductase (POR).
91 . The method, the host organism or the collection of host organisms according to claim 90, wherein the POR is selected from the group consisting of:
a. ATR1 from A thaliana encoded by Gene ID:3150037
b. POR from Stevia rebaudiana encoded by Gene ID: 9321 1213 c. CfCPR of SEQ ID NO:44
d. functional homologues of the polypeptides of a) to c) sharing at least 70%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95% sequence identity therewith.
92. The method, the host organism or the collection of host organisms according to any one of the preceding claims, wherein the host organism is a microorganism.
93. The method, the host organism or the collection of host organisms according to claim 92, wherein the microorganism is yeast.
94. The method according to any one of the preceding claims, wherein the host
organism is a microorganism, and step b) involves incubating said microorganism in a cultivation medium.
95. The method according to claim 940, wherein the method comprises a step of
isolating said diterpenoid from the microorganism and/or the cultivation medium.
96. The method, the host organism or the collection of host organisms according to any one of claims 1 to 91 , wherein the host organisms is a plant.
97. The method according to claim 96, wherein the method comprises isolating the diterpenoid from said plant or parts thereof, for example from leaves, fruits, flowers, stems, seeds and/or roots of plants.
98. A method of producing a diterpenoid, said method comprising the steps of
a) providing a host organism or a collection of host organism according to any one of claims 26 to 44;
b) preparing an extract of said host organism(s) or preparing individual extracts or each host organism;
c) providing GGPP d) incubating one or more of said extracts with GGPP, wherein at least the extract of the host organism comprising a heterologous nucleic acid encoding a diTPS of class II is incubated in the presence of geranylgeranyl pyrophosphate (GGPP);
e) optionally, incubating the product of the incubation under d) with one or more of said extracts,
f) optionally, incubating the product of the incubation under e) with one or more of said extracts, wherein step f) may optionally be repeated; thereby producing a diterpeneoid.
PCT/DK2015/050181 2014-06-23 2015-06-23 Methods and materials for production of terpenoids WO2015197075A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DKPA201470381 2014-06-23
DKPA201470381 2014-06-23

Publications (1)

Publication Number Publication Date
WO2015197075A1 true WO2015197075A1 (en) 2015-12-30

Family

ID=53716276

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/DK2015/050181 WO2015197075A1 (en) 2014-06-23 2015-06-23 Methods and materials for production of terpenoids

Country Status (1)

Country Link
WO (1) WO2015197075A1 (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105647879A (en) * 2016-03-16 2016-06-08 武汉大学 Diterpene compound variediene synthesizing gene Au13192 and application thereof
WO2018015512A1 (en) * 2016-07-20 2018-01-25 Evolva Sa Biosynthesis of 13r-manoyl oxide derivatives
US10053717B2 (en) 2014-01-31 2018-08-21 University Of Copenhagen Biosynthesis of forskolin and related compounds
WO2018114839A3 (en) * 2016-12-22 2018-08-30 Firmenich Sa Production of manool
EP3404107A1 (en) * 2017-05-19 2018-11-21 Jowat SE Method for the fermentative de novo synthesis of resin acids
WO2018229306A1 (en) * 2017-06-16 2018-12-20 River Stone Biotech Aps Demethylation of reticuline and derivatives thereof with fungal cytochrome p450
US10208326B2 (en) 2014-11-13 2019-02-19 Evolva Sa Methods and materials for biosynthesis of manoyl oxide
WO2022043461A1 (en) * 2020-08-27 2022-03-03 Københavns Universitet Production of oxygenated diterpenoid compounds
CN114507657A (en) * 2022-03-23 2022-05-17 湖北中医药大学 Chrysanthemum nankingense monoterpene synthase gene CnTPS2 and application thereof
CN116730961A (en) * 2023-06-12 2023-09-12 湖北省妇幼保健院(湖北省妇女儿童医院) Bisabolane sesquiterpenoids and application thereof as MptpB inhibitor

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5204253A (en) 1990-05-29 1993-04-20 E. I. Du Pont De Nemours And Company Method and apparatus for introducing biological substances into living cells
US5538880A (en) 1990-01-22 1996-07-23 Dekalb Genetics Corporation Method for preparing fertile transgenic corn plants
US6013863A (en) 1990-01-22 2000-01-11 Dekalb Genetics Corporation Fertile transgenic corn plants
US6329571B1 (en) 1996-10-22 2001-12-11 Japan Tobacco, Inc. Method for transforming indica rice
DE102009025996A1 (en) * 2009-06-18 2010-12-23 Universität des Saarlandes Campus Saarbrücken Hydroxylation of terpenoids e.g. triterpenes, comprises adding terpenoids and cytochrome P450 CYP106A2 from Bacillus megaterium as biocatalyst to reaction medium, optionally adding electron transfer partner, incubating and extracting
WO2015113570A1 (en) * 2014-01-31 2015-08-06 University Of Copenhagen Methods for producing diterpenes

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5538880A (en) 1990-01-22 1996-07-23 Dekalb Genetics Corporation Method for preparing fertile transgenic corn plants
US6013863A (en) 1990-01-22 2000-01-11 Dekalb Genetics Corporation Fertile transgenic corn plants
US5204253A (en) 1990-05-29 1993-04-20 E. I. Du Pont De Nemours And Company Method and apparatus for introducing biological substances into living cells
US6329571B1 (en) 1996-10-22 2001-12-11 Japan Tobacco, Inc. Method for transforming indica rice
DE102009025996A1 (en) * 2009-06-18 2010-12-23 Universität des Saarlandes Campus Saarbrücken Hydroxylation of terpenoids e.g. triterpenes, comprises adding terpenoids and cytochrome P450 CYP106A2 from Bacillus megaterium as biocatalyst to reaction medium, optionally adding electron transfer partner, incubating and extracting
WO2015113570A1 (en) * 2014-01-31 2015-08-06 University Of Copenhagen Methods for producing diterpenes

Non-Patent Citations (15)

* Cited by examiner, † Cited by third party
Title
"Analysis of the genome sequence of the flowering plant Arabidopsis thaliana", NATURE, vol. 408, no. 6814, 2000, pages 796 - 815
"Modular Pathway Engineering of Diterpenoid Synthases and the Mevalonic Acid Pathway for Miltiradiene Production", JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, vol. 134, no. 6, 2012, pages 3234 - 3241
"Wiley Registry of Mass Spectral Data, 8th ed.", July 2006, JOHN WILEY & SONS, ISBN: 978-0-470-047
DRAGANA BOZIC ET AL: "Towards Elucidating Carnosic Acid Biosynthesis in Lamiaceae: Functional Characterization of the Three First Steps of the Pathway in Salvia fruticosa and Rosmarinus officinalis", PLOS ONE, vol. 10, no. 5, 28 May 2015 (2015-05-28), pages e0124106, XP055215828, DOI: 10.1371/journal.pone.0124106 *
GONG HAI-YAN ET AL: "Diterpene synthases and their responsible cyclic natural products", vol. 4, no. 2, 1 April 2014 (2014-04-01), pages 59 - 72, XP002739034, ISSN: 2192-2209, Retrieved from the Internet <URL:http://rd.springer.com/article/10.1007%2Fs13659-014-0012-8> [retrieved on 20140418], DOI: 10.1007/S13659-014-0012-8 *
HIGGINS D.; THOMPSON J.; GIBSON T.; THOMPSON J.D.; HIGGINS D.G.; GIBSON T.J.: "CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice", NUCLEIC ACIDS RES., vol. 22, 1994, pages 4673 - 4680
JOHAN ANDERSEN-RANBERG: "Identification and Characterization of Biosynthetic Parts Involved in Plant Diterpenoid Biosyntheses - Employees", 24 June 2014 (2014-06-24), University of Copenhagen, XP055183341, Retrieved from the Internet <URL:http://plen.ku.dk/english/employees/?pure=en%2Fpublications%2Fidentification-and-characterization-of-biosynthetic-parts-involved-in-plant-diterpenoid-biosyntheses(9b672fe8-15a1-4685-8204-ce711d76b041).html> [retrieved on 20150415] *
LARISSA M. PODUST ET AL: "Diversity of P450 enzymes in the biosynthesis of natural products", NATURAL PRODUCT REPORTS, vol. 29, no. 10, 1 January 2012 (2012-01-01), pages 1251, XP055215367, ISSN: 0265-0568, DOI: 10.1039/c2np20020a *
MITCHELL ROD: "Identificati0n and characterization of diterpene synthases in the salvinorin A biosynthetic pathway", INTERNET CITATION, 1 January 2012 (2012-01-01), pages 1 - 2, XP002738984, Retrieved from the Internet <URL:http://theses.ucalgary.ca/bitstream/11023/171/5/ucalgary_2012_mitchell_rod.pdf> [retrieved on 20150427] *
PATERAKI IRINI ET AL: "Manoyl oxide (13R), the biosynthetic precursor of forskolin, is synthesized in specialized root cork cells in Coleus forskohlii", PLANT PHYSIOLOGY, AMERICAN SOCIETY OF PLANT PHYSIOLOGISTS, ROCKVILLE, MD, US, vol. 164, 1 March 2014 (2014-03-01), pages 1222 - 1236, XP002724033, ISSN: 0032-0889 *
PHILIPP ZERBE ET AL: "Gene discovery of modular diterpene metabolism in nonmodel systems", vol. 162, no. 2, 1 June 2013 (2013-06-01), pages 1073 - 1091, XP002724041, ISSN: 0032-0889, Retrieved from the Internet <URL:http://www.plantphysiol.org/content/162/2/1073> [retrieved on 20130423], DOI: 10.1104/PP.113.218347 *
PODUST ET AL., NAT. PROD. REP., vol. 29, 2012, pages 1251
PODUST ET AL., NAT. PROD. REP., vol. 29, 2012, pages 1251 - 1266
SCHULER ET AL., ANNU REV. PLANT BIOL., vol. 54, 2003, pages 629 - 67
YONGJIN J. ZHOU ET AL: "Modular Pathway Engineering of Diterpenoid Synthases and the Mevalonic Acid Pathway for Miltiradiene Production", JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, vol. 134, no. 6, 15 February 2012 (2012-02-15), pages 3234 - 3241, XP055058678, ISSN: 0002-7863, DOI: 10.1021/ja2114486 *

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10053717B2 (en) 2014-01-31 2018-08-21 University Of Copenhagen Biosynthesis of forskolin and related compounds
US10208326B2 (en) 2014-11-13 2019-02-19 Evolva Sa Methods and materials for biosynthesis of manoyl oxide
CN105647879A (en) * 2016-03-16 2016-06-08 武汉大学 Diterpene compound variediene synthesizing gene Au13192 and application thereof
WO2018015512A1 (en) * 2016-07-20 2018-01-25 Evolva Sa Biosynthesis of 13r-manoyl oxide derivatives
JP7160811B2 (en) 2016-12-22 2022-10-25 フイルメニツヒ ソシエテ アノニム Manufacturing of manool
WO2018114839A3 (en) * 2016-12-22 2018-08-30 Firmenich Sa Production of manool
CN110100003B (en) * 2016-12-22 2023-11-17 弗门尼舍有限公司 Production of Minol
CN110100003A (en) * 2016-12-22 2019-08-06 弗门尼舍有限公司 The production of manool
US20190352673A1 (en) * 2016-12-22 2019-11-21 Firmenich Sa Production of manool
JP2020513755A (en) * 2016-12-22 2020-05-21 フイルメニツヒ ソシエテ アノニムFirmenich Sa Manoole manufacturing
US10752922B2 (en) 2016-12-22 2020-08-25 Firmenich Sa Production of manool
EP3404107A1 (en) * 2017-05-19 2018-11-21 Jowat SE Method for the fermentative de novo synthesis of resin acids
WO2018211002A1 (en) * 2017-05-19 2018-11-22 Jowat Se Method for the fermentative, de novo synthesis of resin acids
US11781164B2 (en) 2017-06-16 2023-10-10 River Stone Biotech, LLC Demethylation of reticuline and derivatives thereof with fungal cytochrome P450
US20210230655A1 (en) * 2017-06-16 2021-07-29 River Stone Biotech Aps Demethylation of Reticuline and Derivatives Thereof with Fungal Cytochrome P450
WO2018229306A1 (en) * 2017-06-16 2018-12-20 River Stone Biotech Aps Demethylation of reticuline and derivatives thereof with fungal cytochrome p450
WO2022043461A1 (en) * 2020-08-27 2022-03-03 Københavns Universitet Production of oxygenated diterpenoid compounds
CN114507657A (en) * 2022-03-23 2022-05-17 湖北中医药大学 Chrysanthemum nankingense monoterpene synthase gene CnTPS2 and application thereof
CN114507657B (en) * 2022-03-23 2023-11-03 湖北中医药大学 Chrysanthemum brain monoterpene synthase gene CnTPS2 and application thereof
CN116730961A (en) * 2023-06-12 2023-09-12 湖北省妇幼保健院(湖北省妇女儿童医院) Bisabolane sesquiterpenoids and application thereof as MptpB inhibitor
CN116730961B (en) * 2023-06-12 2024-01-23 湖北省妇幼保健院(湖北省妇女儿童医院) Bisabolane sesquiterpenoids and application thereof as MptpB inhibitor

Similar Documents

Publication Publication Date Title
WO2015197075A1 (en) Methods and materials for production of terpenoids
Kitaoka et al. The application of synthetic biology to elucidation of plant mono-, sesqui-, and diterpenoid metabolism
US20180037912A1 (en) Methods for Producing Diterpenes
CN104769121B (en) Vanillin synthase
US20180265897A1 (en) Production of macrocyclic diterpenes in recombinant hosts
US10240173B2 (en) Biosynthesis of forskolin and related compounds
RU2654670C2 (en) Method for producing fragrant alcohols
Shi et al. Promotion of artemisinin content in Artemisia annua by overexpression of multiple artemisinin biosynthetic pathway genes
Ricigliano et al. Regulation of sesquiterpenoid metabolism in recombinant and elicited Valeriana officinalis hairy roots
CN107429223A (en) From the beginning the method for Microbe synthesis terpene
Song et al. Potential role of two cytochrome P450s obtained from Lithospermum erythrorhizon in catalyzing the oxidation of geranylhydroquinone during Shikonin biosynthesis
Gorina et al. Oxylipin biosynthesis in spikemoss Selaginella moellendorffii: molecular cloning and identification of divinyl ether synthases CYP74M1 and CYP74M3
Majdi et al. Sesquiterpene lactone engineering in microbial and plant platforms: parthenolide and artemisinin as case studies
US20190194706A1 (en) Biosynthesis of Oxidised 13R-MO and Related Compounds
Luo et al. Characterization of a sesquiterpene cyclase from the glandular trichomes of Leucosceptrum canum for sole production of cedrol in Escherichia coli and Nicotiana benthamiana
US10053703B2 (en) Heterologous production of patchoulol, β-santalene, and sclareol in moss cells
Ichinose et al. Latent potentials of the white-rot basidiomycete Phanerochaete chrysosporium responsible for sesquiterpene metabolism: CYP5158A1 and CYP5144C8 decorate (E)-α-bisabolene
Yin et al. Cloning, expression characteristics of a new FPS gene from birch (Betula platyphylla suk.) and functional identification in triterpenoid synthesis
Huang et al. Side products of recombinant amorpha-4, 11-diene synthase and their effect on microbial artemisinin production
Zhuang Engineering novel terpene production platforms in the yeast saccharomyces cerevisiae
Tong et al. Eudesmane-type sesquiterpene diols directly synthesized by a sesquiterpene cyclase in Tripterygium wilfordii
Toporkova et al. The CYP74B and CYP74D divinyl ether synthases possess a side hydroperoxide lyase and epoxyalcohol synthase activities that are enhanced by the site-directed mutagenesis
Keeling et al. Plant terpenoids
EP3821023A1 (en) Method for biocatalytic production of terpene compounds
CN111527203A (en) Cytochrome P450 monooxygenase catalyzed oxidation of sesquiterpenes

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15739509

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 15739509

Country of ref document: EP

Kind code of ref document: A1