WO1996010075A1 - Production of hydroxylated fatty acids in genetically modified plants - Google Patents

Production of hydroxylated fatty acids in genetically modified plants

Info

Publication number
WO1996010075A1
Authority
WO
WIPO (PCT)
Prior art keywords
leu
val
tyr
ala
hydroxylase
Prior art date
Application number
PCT/US1995/011855
Other languages
French (fr)
Inventor
Frank J. Van De Loo
Chris Somerville
Pierre Broun
Original Assignee
Carnegie Institution Of Washington
Monsanto Company,Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US08/314,596 external-priority patent/US5668292A/en
Application filed by Carnegie Institution Of Washington, Monsanto Company,Inc filed Critical Carnegie Institution Of Washington
Priority to EP95934442A priority Critical patent/EP0781327B1/en
Priority to AU36778/95A priority patent/AU718512B2/en
Priority to CA2200202A priority patent/CA2200202C/en
Priority to DE69534849T priority patent/DE69534849T2/en
Priority to JP8511856A priority patent/JPH10506783A/en
Publication of WO1996010075A1 publication Critical patent/WO1996010075A1/en
Priority to US09/885,189 priority patent/US6936728B2/en
Priority to US11/058,746 priority patent/US20060101543A1/en

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0071Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8216Methods for controlling, regulating or enhancing expression of transgenes in plant cells
    • C12N15/8222Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8243Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
    • C12N15/8247Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified lipid metabolism, e.g. seed oil composition
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0071Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
    • C12N9/0073Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14) with NADH or NADPH as one donor, and incorporation of one atom of oxygen 1.14.13

Definitions

  • the present invention concerns the identification of nucleic acid sequences and constructs, and methods related thereto, and the use of these sequences and constructs to produce genetically modified plants for the purpose of altering the fatty acid composition of plant oils, waxes and related compounds.
  • the subject of this invention is a class of enzymes that introduce a hydroxyl group into several different fatty acids resulting in the production of several different kinds of hydroxylated fatty acids.
  • these enzymes catalyze hydroxylation of oleic acid to 12-hydroxy oleic acid and icosenoic acid to 14-hydroxy icosenoic acid.
  • Other fatty acids such as palmitoleic and erucic acids may also be substrates.
  • Since it is not possible to refer to the enzyme by reference to a unique substrate or product, we refer to the enzyme throughout as kappa hydroxylase to indicate that the enzyme introduces the hydroxyl three carbons distal (i.e., away from the carboxyl carbon of the acyl chain) from a double bond located near the center of the acyl chain.
  • ricinoleic acid, 12-hydroxyoctadec-cis-9-enoic acid (12OH-18:1cisΔ9); lesquerolic acid, 14-hydroxy-cis-11-icosenoic acid (14OH-20:1cisΔ11); densipolic acid, 12-hydroxyoctadec-cis-9,15-dienoic acid (12OH-18:2cisΔ9,15); auricolic acid, 14-hydroxy-cis-11,17-icosadienoic acid (14OH-20:2cisΔ11,17); hydroxyerucic, 16-hydroxydocos-cis-13-enoic acid (16OH-22:1cisΔ13); hydroxypalmitoleic, 12-hydroxyhexadec-cis-9-enoic (12OH-16:1cisΔ9); icosenoic acid (20:1
  • castor hydroxylase gene to also produce other hydroxylated fatty acids such as lesquerolic acid, densipolic acid, hydroxypalmitoleic, hydroxyerucic and auricolic acid in transgenic plants is the subject of this invention.
  • identification of a gene encoding a homologous hydroxylase from Lesquerella fendleri and the use of this gene to produce these hydroxylated fatty acids in transgenic plants is the subject of this invention.
  • Castor is a minor oilseed crop. Approximately 50% of the seed weight is oil (triacylglycerol) in which 85-90% of total fatty acids are the hydroxylated fatty acid, ricinoleic acid. Oil pressed or extracted from castor seeds has many industrial uses based upon the properties endowed by the hydroxylated fatty acid. The most important uses are production of paints and varnishes, nylon-type synthetic polymers, resins, lubricants, and cosmetics (Atsmon 1989) . In addition to oil, the castor seed contains the extremely toxic protein ricin, allergenic proteins, and the alkaloid ricinine. These constituents preclude the use of the untreated seed meal (following oil extraction) as a livestock feed, normally an important economic aspect of oilseed utilization.
  • castor has few favorable agronomic characteristics. For a combination of these reasons, castor is no longer grown in the United States and the development of an alternative domestic source of hydroxylated fatty acids would be attractive.
  • the production of ricinoleic acid, the important constituent of castor oil, in an established oilseed crop through genetic engineering would be a particularly effective means of creating a domestic source.
  • a feature of hydroxylated or other unusual fatty acids is that they are generally confined to seed triacylglycerols, being largely excluded from the polar lipids by unknown mechanisms (Battey and Ohlrogge 1989; Prasad et al., 1987). This is particularly interesting since diacylglycerol is a precursor of both triacylglycerol and polar lipid. With castor microsomes, there is some evidence that the pool of ricinoleoyl-containing polar lipid is minimized by a preference of diacylglycerol acyltransferase for ricinoleate-containing diacylglycerols (Bafor et al. 1991).
  • Analyses of vegetative tissues have generated few reports of unusual fatty acids, other than those occurring in the cuticle.
  • the cuticle contains various hydroxylated fatty acids which are interesterified to produce a high molecular weight polyester which serves a structural role.
  • Oleic acid can replace oleoyl-CoA as a precursor, but only in the presence of CoA, Mg2+ and ATP (Galliard and Stumpf, 1966), indicating that activation to the acyl-CoA is necessary. However, no radioactivity could be detected in ricinoleoyl-CoA (Moreau and Stumpf, 1981). These and more recent observations (Bafor et al., 1991) have been interpreted as evidence that the substrate for the castor oleate hydroxylase is oleic acid esterified to phosphatidylcholine or another phospholipid.
  • the hydroxylase is sensitive to cyanide and azide, and dialysis against metal chelators reduces activity, which could be restored by addition of FeSO4, suggesting iron involvement in enzyme activity (Galliard and Stumpf, 1966).
  • Ricinoleic acid synthesis requires molecular oxygen (Galliard and Stumpf, 1966; Moreau and Stumpf 1981) and requires NAD(P)H to reduce cytochrome b5 which is thought to be the intermediate electron donor for the hydroxylase reaction (Smith et al., 1992). Carbon monoxide does not inhibit hydroxylation, indicating that a cytochrome P450 is not involved (Galliard and Stumpf, 1966; Moreau and Stumpf 1981) .
  • the castor kappa hydroxylase has many superficial similarities to the microsomal fatty acyl desaturases (Browse and Somerville, 1991) .
  • plants have a microsomal oleate desaturase active at the Δ12 position.
  • the substrate of this enzyme (Schmidt et al., 1993) and of the hydroxylase (Bafor et al., 1991) appears to be a fatty acid esterified to the sn-2 position of phosphatidylcholine.
  • With oleate as the substrate, the modification occurs at the same position (Δ12) in the carbon chain, and requires the same cofactors, namely electrons from NADH via cytochrome b5 and molecular oxygen.
  • Neither enzyme is inhibited by carbon monoxide (Moreau and Stumpf, 1981) , the characteristic inhibitor of cytochrome P450 enzymes.
  • the two iron atoms of the FeOFe cluster are liganded by protein-derived nitrogen or oxygen atoms, and are tightly redox-coupled by the covalently-bridging oxygen atom.
  • the FeOFe cluster accepts two electrons, reducing it to the diferrous state, before oxygen binding. Upon oxygen binding, it is likely that heterolytic cleavage also occurs, leading to a high valent oxoiron reactive species that is stabilized by resonance rearrangements possible within the tightly coupled FeOFe cluster.
  • the stabilized high-valent oxoiron state of methane monooxygenase is capable of proton extraction from methane, followed by oxygen transfer, giving methanol.
  • the FeOFe cofactor has been shown to be directly relevant to plant fatty acid modifications by the demonstration that castor stearoyl-ACP desaturase contains this type of cofactor (Fox et al., 1993).
  • the castor oleate hydroxylase is a structurally modified fatty acyl desaturase, based upon three arguments.
  • the first argument involves the taxonomic distribution of plants containing ricinoleic acid. Ricinoleic acid has been found in 12 genera of 10 families of higher plants (reviewed in van de Loo et al., 1993). Thus, plants in which ricinoleic acid occurs are found throughout the plant kingdom, yet close relatives of these plants do not contain the unusual fatty acid. This pattern suggests that the ability to synthesize ricinoleic acid has arisen (and been lost) several times independently, and is therefore a quite recent divergence.
  • Figures 1A-D show the mass spectra of hydroxy fatty acid standards (Figure 1A, O-TMS-methylricinoleate; Figure 1B, O-TMS-methyldensipoleate; Figure 1C, O-TMS-methyllesqueroleate; and Figure 1D, O-TMS-methylauricoleate).
  • Figure 2 shows the fragmentation pattern of trimethylsilylated methyl esters of hydroxy fatty acids.
  • Figure 3A shows the gas chromatogram of fatty acids extracted from seeds of wild type Arabidopsis plants.
  • Figure 3B shows the gas chromatogram of fatty acids extracted from seeds of transgenic Arabidopsis plants containing the fah12 hydroxylase gene.
  • the numbers indicate the following fatty acids: [1] 16:0; [2] 18:0; [3] 18:1cisΔ9; [4] 18:2cisΔ9,12; [5] 20:0; [6] 20:1cisΔ11; [7] 18:3cisΔ9,12,15; [8] 22:1cisΔ13; [9] 24:1cisΔ13; [10] ricinoleic acid; [11] densipolic acid; [12] lesquerolic acid; [13] auricolic acid.
  • Figures 4A-D show the mass spectra of novel fatty acids found in seeds of transgenic plants.
  • Figure 4A shows the mass spectrum of peak 10 from Figure 3B.
  • Figure 4B shows the mass spectrum of peak 11 from Figure 3B.
  • Figure 4C shows the mass spectrum of peak 12 from Figure 3B.
  • Figure 4D shows the mass spectrum of peak 13 from Figure 3B.
  • Figure 5 shows the nucleotide sequence of pLesq2 (SEQ ID NO:1).
  • Figure 6 shows the nucleotide sequence of pLesq3 (SEQ ID NO:2).
  • Figure 7 shows a Northern blot of total RNA from seeds of L. fendleri probed with pLesq2 or pLesq3. S indicates RNA is from seeds; L indicates RNA is from leaves.
  • Figures 8A-B show the nucleotide sequence of genomic clone encoding pLesq-HYD (SEQ ID NO:3), and the deduced amino acid sequence of hydroxylase enzyme encoded by the gene (SEQ ID NO:4) .
  • Figures 9A-B show multiple sequence alignment of deduced amino acid sequences for kappa hydroxylases and microsomal Δ12 desaturases.
  • Abbreviations are: Rcfah12, fah12 hydroxylase gene from R. communis (van de Loo et al., 1995); Lffah12, kappa hydroxylase gene from L. fendleri; Atfad2, fad2 desaturase from Arabidopsis thaliana (Okuley et al., 1994);
  • Gmfad2-1, fad2 desaturase from Glycine max (GenBank accession number L43920); Gmfad2-2, fad2 desaturase from Glycine max (GenBank accession number L43921); Zmfad2, fad2 desaturase from Zea mays (WO 94/11516); Rcfad2, fragment of fad2 desaturase from R.
  • Figure 10 shows a Southern blot of genomic DNA from L. fendleri probed with pLesq-HYD.
  • E, EcoRI
  • H, HindIII
  • X, XbaI.
  • Figure 11 shows a map of binary Ti plasmid pSLJ44024.
  • This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants.
  • this invention is directed to recombinant DNA constructs which can provide for the transcription or transcription and translation (expression) of the plant kappa hydroxylase sequence.
  • constructs which are capable of transcription or transcription and translation in plant host cells are preferred.
  • Such constructs may contain a variety of regulatory regions including transcriptional initiation regions obtained from genes preferentially expressed in plant seed tissue.
  • this invention relates to the presence of such constructs in host cells, especially plant host cells which have an expressed plant kappa hydroxylase therein.
  • this invention relates to a method for producing a plant kappa hydroxylase in a host cell or progeny thereof via the expression of a construct in the cell.
  • Cells containing a plant kappa hydroxylase as a result of the production of the plant kappa hydroxylase encoding sequence are also contemplated herein.
  • this invention relates to methods of using a DNA sequence encoding a plant kappa hydroxylase for the modification of the proportion of hydroxylated fatty acids produced within a cell, especially plant cells. Plant cells having such a modified hydroxylated fatty acid composition are also contemplated herein.
  • plant kappa hydroxylase proteins and sequences which are related thereto, including amino acid and nucleic acid sequences are contemplated.
  • Plant kappa hydroxylase exemplified herein includes a Lesquerella fendleri fatty acid hydroxylase. This exemplified fatty acid hydroxylase may be used to obtain other plant fatty acid hydroxylases of this invention.
  • a nucleic acid sequence which directs the seed specific expression of an associated polypeptide coding sequence is described. The use of this nucleic acid sequence or fragments derived thereof, to obtain seed-specific expression in higher plants of any coding sequence is contemplated herein.
  • a genetically transformed plant of the present invention which accumulates hydroxylated fatty acids can be obtained by expressing the double-stranded DNA molecules described in this application.
  • a plant fatty acid hydroxylase of this invention includes any sequence of amino acids, such as a protein, polypeptide or peptide fragment, or nucleic acid sequences encoding such polypeptides, obtainable from a plant source which demonstrates the ability to catalyze the production of ricinoleic, lesquerolic, hydroxyerucic (16-hydroxydocos-cis-13-enoic acid) or hydroxypalmitoleic (12-hydroxyhexadec-cis-9-enoic) from CoA, ACP or lipid-linked monoenoic fatty acid substrates under plant enzyme reactive conditions.
  • By "enzyme reactive conditions" is meant that any necessary conditions are available in an environment (i.e., such factors as temperature, pH, lack of inhibiting substances) which will permit the enzyme to function.
  • Preferential activity of a plant fatty acid hydroxylase toward a particular fatty acyl substrate is determined upon comparison of hydroxylated fatty acid product amounts obtained per different fatty acyl substrates.
  • By "oleate preferring" is meant that the hydroxylase activity of the enzyme preparation demonstrates a preference for oleate-containing substrates over other substrates.
  • Although the precise substrate of the castor fatty acid hydroxylase is not known, it is thought to be a monounsaturated fatty acid moiety which is esterified to a phospholipid such as phosphatidylcholine.
  • monounsaturated fatty acids esterified to phosphatidylethanolamine, phosphatidic acid or a neutral lipid such as diacylglycerol or a Coenzyme-A thioester may also be substrates.
  • significant activity has been observed in radioactive labelling studies using fatty acyl substrates other than oleate (Howling et al., 1972) indicating that the substrate specificity is for a family of related fatty acyl compounds. Because the castor hydroxylase introduces hydroxy groups three carbons from a double bond, proximal to the methyl carbon of the fatty acid we term the enzyme a kappa hydroxylase for convenience.
  • the castor kappa hydroxylase may be used for production of 12-hydroxy-9-octadecenoic acid (ricinoleate), 12-hydroxy-9-hexadecenoic acid, 14-hydroxy-11-eicosenoic acid, 16-hydroxy-13-docosenoic acid, and 9-hydroxy-6-octadecenoic acid by expression in plant species which produce the non-hydroxylated precursors.
  • We also envision production of additionally modified fatty acids such as 12-hydroxy-9,15-octadecadienoic acid that result from desaturation of hydroxylated fatty acids (e.g., 12-hydroxy-9-octadecenoic acid in this example).
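The positional pattern in the products listed above can be illustrated with a short sketch. It assumes, based only on the examples given in the text, that the hydroxyl group lands three carbons beyond the double bond when counting from the carboxyl carbon. The function name, the constant, and the dictionary are hypothetical and are not part of the disclosure.

```python
# Illustrative sketch only. The "+3" offset is inferred from the examples in
# the text; all names below are hypothetical, not part of the disclosure.

KAPPA_OFFSET = 3  # carbons between the double bond and the hydroxylated carbon


def kappa_hydroxyl_position(double_bond_position: int, chain_length: int) -> int:
    """Return the carbon expected to carry the hydroxyl group.

    Positions are counted from the carboxyl carbon (carbon 1).
    """
    if not 1 <= double_bond_position < chain_length:
        raise ValueError("double bond position must lie within the acyl chain")
    position = double_bond_position + KAPPA_OFFSET
    if position > chain_length:
        raise ValueError("chain is too short to be hydroxylated at this offset")
    return position


# (chain length, double bond position) for the precursors named in the text
SUBSTRATES = {
    "9-octadecenoic (oleic)": (18, 9),
    "9-hexadecenoic (palmitoleic)": (16, 9),
    "11-eicosenoic (icosenoic)": (20, 11),
    "13-docosenoic (erucic)": (22, 13),
    "6-octadecenoic": (18, 6),
}

for name, (length, double_bond) in SUBSTRATES.items():
    hydroxyl = kappa_hydroxyl_position(double_bond, length)
    print(f"{name}: hydroxyl expected at carbon {hydroxyl}")
```

Each precursor maps to the hydroxylated product named in the text (for example, a double bond at carbon 9 gives a hydroxyl at carbon 12).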
  • a plant kappa hydroxylase of this invention will display activity towards various fatty acyl substrates.
  • fatty acids are typically covalently bound to acyl carrier protein (ACP) , coenzyme A (CoA) or various cellular lipids.
  • Plant kappa hydroxylases which display preferential activity toward lipid-linked acyl substrate are especially preferred because they are likely to be closely associated with normal pathway of storage lipid synthesis in immature embryos.
  • activity toward acyl-CoA substrates or other synthetic substrates, for example is also contemplated herein.
  • plant kappa hydroxylases are obtainable from the specific exemplified sequences provided herein. Furthermore, it will be apparent that one can obtain natural and synthetic plant kappa hydroxylases including modified amino acid sequences and starting materials for synthetic-protein modeling from the exemplified plant kappa hydroxylase and from plant kappa hydroxylases which are obtained through the use of such exemplified sequences. Modified amino acid sequences include sequences which have been mutated, truncated, increased and the like, whether such sequences were partially or wholly synthesized. Sequences which are actually purified from plant preparations or are identical or encode identical proteins thereto, regardless of the method used to obtain the protein or sequence, are equally considered naturally derived.
  • nucleic acid probes DNA and RNA
  • nucleic acid probes are labeled to allow detection, preferably with radioactivity although enzymes or other methods may also be used.
  • antibody preparations either monoclonal or polyclonal are utilized. Polyclonal antibodies, although less specific, typically are more useful in gene isolation.
  • detection the antibody is labeled using radioactivity or any one of a variety of second antibody/enzyme conjugate systems that are commercially available.
  • Homologous sequences are found when there is an identity of sequence and may be determined upon comparison of sequence information, nucleic acid or amino acid, or through hybridization reactions between a known kappa hydroxylase and a candidate source. Conservative changes, such as Glu/Asp, Val/Ile, Ser/Thr, Arg/Lys and Gln/Asn may also be considered in determining sequence homology.
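As a rough illustration of how the conservative changes listed above might enter a homology comparison, the sketch below scores two pre-aligned amino acid sequences and counts the five named pairs as similar. The scoring scheme, the gap character, and every name in the code are assumptions made for illustration; the text does not prescribe a particular algorithm.

```python
# Illustrative sketch only: a simple similarity score over a pre-computed
# alignment. The pair list comes from the text; everything else is assumed.

CONSERVATIVE_PAIRS = {
    frozenset("ED"),  # Glu/Asp
    frozenset("VI"),  # Val/Ile
    frozenset("ST"),  # Ser/Thr
    frozenset("RK"),  # Arg/Lys
    frozenset("QN"),  # Gln/Asn
}


def residues_similar(a: str, b: str) -> bool:
    """True if the residues are identical or form a listed conservative pair."""
    a, b = a.upper(), b.upper()
    return a == b or frozenset((a, b)) in CONSERVATIVE_PAIRS


def percent_similarity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of non-gap alignment columns holding identical or similar residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    similar = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # columns with a gap are not scored
        compared += 1
        if residues_similar(a, b):
            similar += 1
    return 100.0 * similar / compared if compared else 0.0
```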
  • a lengthy nucleic acid sequence may show as little as 50-60% sequence identity, and more preferably at least about 70% sequence identity, between the target sequence and the given plant kappa hydroxylase of interest excluding any deletions which may be present, and still be considered related.
  • Amino acid sequences are considered homologous by as little as 25% sequence identity between the two complete mature proteins. (See generally, Doolittle, R.F., OF URFS and ORFS, University Science Books, CA, 1986.)
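The identity figures quoted above can be made concrete with a small calculation. The sketch below computes percent identity over a pre-aligned pair of sequences, skipping gapped columns as one reading of "excluding any deletions", and compares the result with lower bounds taken from the text (50% for nucleic acids, 25% for amino acids). The function names, the gap character, and the use of the lower bound as a hard cutoff are assumptions for illustration only.

```python
# Illustrative sketch only: thresholds are the lower bounds quoted in the
# text; treating them as hard cutoffs is an assumption of this example.

LOWER_BOUND_PERCENT = {"nucleic": 50.0, "protein": 25.0}


def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of non-gap alignment columns with identical symbols."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # deletions are excluded from the comparison
        compared += 1
        if a.upper() == b.upper():
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def possibly_related(aligned_a: str, aligned_b: str, kind: str) -> bool:
    """True if identity reaches the lower bound for the given sequence kind."""
    return percent_identity(aligned_a, aligned_b) >= LOWER_BOUND_PERCENT[kind]
```

In practice the alignment itself would come from a dedicated alignment tool; this sketch only scores an alignment that already exists.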
  • a genomic or other appropriate library prepared from the candidate plant source of interest may be probed with conserved sequences from the plant kappa hydroxylase to identify homologously related sequences. Use of an entire cDNA or other sequence may be employed if shorter probe sequences are not identified. Positive clones are then analyzed by restriction enzyme digestion and/or sequencing. When a genomic library is used, one or more sequences may be identified providing both the coding region, as well as the transcriptional regulatory elements of the kappa hydroxylase gene from such plant source. Probes can also be considerably shorter than the entire sequence. Oligonucleotides may be used, for example, but should be at least about 10, preferably at least about 15, more preferably at least 20 nucleotides in length.
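The oligonucleotide length guidance above can be expressed as a simple check. This is a minimal sketch; the tier labels and the function name are hypothetical, and only the three length figures (10, 15, 20) come from the text.

```python
# Illustrative sketch only: classifies an oligonucleotide probe by the
# length tiers quoted in the text. Tier labels are hypothetical.

def probe_length_tier(oligo: str) -> str:
    """Return the length tier of a candidate oligonucleotide probe."""
    length = len(oligo)
    if length >= 20:
        return "more preferred"
    if length >= 15:
        return "preferred"
    if length >= 10:
        return "minimum"
    return "too short"
```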
  • a plant kappa hydroxylase of this invention will have at least 60% overall amino acid sequence similarity with the exemplified plant kappa hydroxylase.
  • kappa hydroxylases which are obtainable from an amino acid or nucleic acid sequence of a castor or lesquerella kappa hydroxylase are especially preferred.
  • the plant kappa hydroxylases may have preferential activity toward longer or shorter chain fatty acyl substrates.
  • Plant fatty acyl hydroxylases having oleate-12-hydroxylase activity and eicosenoate-14-hydroxylase activity are both considered homologously related proteins because of in vitro evidence (Howling et al., 1972), and evidence disclosed herein, that the castor kappa hydroxylase will act on both substrates.
  • Hydroxylated fatty acids may be subject to further enzymatic modification by other enzymes which are normally present or are introduced by genetic engineering methods. For example, 14-hydroxy-11,17-eicosadienoic acid, which is present in some Lesquerella species (Smith 1985), is thought to be produced by desaturation of 14-hydroxy-11-eicosenoic acid.
  • PCR may be a useful technique to obtain related plant fatty acyl hydroxylases from sequence data provided herein.
  • One skilled in the art will be able to design oligonucleotide probes based upon sequence comparisons or regions of typically highly conserved sequence.
  • polymerase chain reaction primers based on the conserved regions of amino acid sequence between the castor kappa hydroxylase and the L. fendleri hydroxylase (SEQ ID NO:4). Details relating to the design and methods for a PCR reaction using these probes are described more fully in the examples.
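When primers are designed from a conserved stretch of amino acids, one practical consideration is how many distinct nucleotide sequences could encode that stretch. The sketch below estimates this degeneracy from the codon counts of the standard genetic code. The example peptides are hypothetical and are not taken from the disclosed sequences; the function name is also an assumption.

```python
# Illustrative sketch only: estimates primer degeneracy for a peptide using
# codon counts from the standard genetic code. Example peptides are made up.
from math import prod

# Number of codons encoding each amino acid in the standard genetic code
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def primer_degeneracy(peptide: str) -> int:
    """Number of distinct nucleotide sequences that can encode the peptide."""
    try:
        return prod(CODONS_PER_RESIDUE[residue] for residue in peptide.upper())
    except KeyError as error:
        raise ValueError(f"unknown amino acid: {error.args[0]}") from None
```

Conserved regions rich in Met, Trp, and two-codon residues give lower degeneracy, which is one common reason to favor them when choosing primer sites.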
  • fatty acyl hydroxylases of a variety of sources can be used to investigate fatty acid hydroxylation events in a wide variety of plant and in vivo applications. Because all plants synthesize fatty acids via a common metabolic pathway, the study and/or application of one plant fatty acid hydroxylase to a heterologous plant host may be readily achieved in a variety of species.
  • the transcription, or transcription and translation (expression) , of the plant fatty acyl hydroxylases in a host cell is desired to produce a ready source of the enzyme and/or modify the composition of fatty acids found therein in the form of free fatty acids, esters (particularly esterified to glycerolipids or as components of wax esters) , estolides, or ethers.
  • Other useful applications may be found when the host cell is a plant host cell, in vitro and in vivo .
  • an increased percentage of ricinoleate or lesqueroleate (14-hydroxy-11-eicosenoic acid) may be provided.
  • ricinoleate or ricinoleic acid is intended to include the free acids, the ACP and CoA esters, the salts of these acids, the glycerolipid esters (particularly the triacylglycerol esters) , the wax esters, the estolides and the ether derivatives of these acids.
  • hydroxylated fatty acids are found in some natural plant species in abundance. For example, three hydroxy fatty acids related to ricinoleate occur in major amounts in seed oils from various Lesquerella species. Of particular interest, lesquerolic acid is a 20 carbon homolog of ricinoleate with two additional carbons at the carboxyl end of the chain (Smith 1985) .
  • other natural plant sources of hydroxylated fatty acids include but are not limited to seeds of the Linum genus, seeds of Wrightia species, Lycopodium species, Strophanthus species, and Convolvulaceae species.
  • a comparison between kappa hydroxylases and between plant fatty acyl hydroxylases which introduce hydroxyl groups at positions other than the 12-carbon of oleate or the 14-carbon of lesqueroleate or on substrates other than oleic acid and icosenoic acid may yield insights for protein modeling or other modifications to create synthetic hydroxylases as discussed above.
  • Δ12 desaturases and the kappa hydroxylase we envision making genetic modifications in the structural genes for Δ12 desaturases that convert these desaturases to kappa hydroxylases.
  • fatty acyl hydroxylases which demonstrate activity toward fatty acyl substrates other than oleate, or which introduce the hydroxyl group at a location other than the C12 carbon.
  • other plant sources may also provide sources for these enzymes through the use of protein purification, nucleic acid probes, antibody preparations, protein modeling, or sequence comparisons, for example, and of special interest are the respective amino acid and nucleic acid sequences corresponding to such plant fatty acyl hydroxylases.
  • nucleic acid sequence is obtained for the given plant hydroxylase, further plant sequences may be compared and/or probed to obtain homologously related DNA sequences thereto and so on.
  • a cDNA clone encoding a plant kappa hydroxylase may be used to obtain its corresponding genomic nucleic acid sequences thereto.
  • the nucleic acid sequences which encode plant kappa hydroxylases may be used in various constructs, for example, as probes to obtain further sequences from the same or other species.
  • these sequences may be used in conjunction with appropriate regulatory sequences to increase levels of the respective hydroxylase of interest in a host cell for the production of hydroxylated fatty acids or study of the enzyme in vitro or in vivo or to decrease or increase levels of the respective hydroxylase of interest for some applications when the host cell is a plant entity, including plant cells, plant parts (including but not limited to seeds, cuttings or tissues) and plants.
  • a nucleic acid sequence encoding a plant kappa hydroxylase of this invention may include genomic, cDNA or mRNA sequence.
  • By "encoding" is meant that the sequence corresponds to a particular amino acid sequence either in a sense or anti-sense orientation.
  • By "recombinant" is meant that the sequence contains a genetically engineered modification through manipulation via mutagenesis, restriction enzymes, and the like.
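The relationship between sense and anti-sense orientations can be shown with a reverse-complement helper. This is a minimal sketch for plain DNA strings containing only A, C, G, and T; the function name is hypothetical and the example sequence is arbitrary.

```python
# Illustrative sketch only: converts a DNA strand to its reverse complement,
# i.e. the opposite (anti-sense) orientation of the same duplex.

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return sequence.translate(_COMPLEMENT)[::-1]
```

Applying the function twice returns the original strand, which is a quick sanity check on any sequence handled this way.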
  • a cDNA sequence may or may not encode pre-processing sequences, such as transit or signal peptide sequences. Transit or signal peptide sequences facilitate the delivery of the protein to a given organelle and are frequently cleaved from the polypeptide upon entry into the organelle, releasing the "mature" sequence.
  • the use of the precursor DNA sequence is preferred in plant cell expression cassettes.
  • the complete genomic sequence of the plant kappa hydroxylase may be obtained by the screening of a genomic library with a probe, such as a cDNA probe, and isolating those sequences which regulate expression in seed tissue.
  • the transcription and translation initiation regions, introns, and/or transcript termination regions of the plant kappa hydroxylase may be obtained for use in a variety of DNA constructs, with or without the kappa hydroxylase structural gene.
  • nucleic acid sequences corresponding to the plant kappa hydroxylase of this invention may also provide signal sequences useful to direct transport into an organelle, 5' upstream non-coding regulatory regions (promoters) having useful tissue and timing profiles, and 3' downstream non-coding regulatory regions useful as transcriptional and translational regulatory regions, and may lend insight into other features of the gene.
  • the desired plant kappa hydroxylase nucleic acid sequence may be manipulated in a variety of ways. Where the sequence involves non-coding flanking regions, the flanking regions may be subjected to resection, mutagenesis, etc. Thus, transitions, transversions, deletions, and insertions may be performed on the naturally occurring sequence. In addition, all or part of the sequence may be synthesized.
  • one or more codons may be modified to provide for a modified amino acid sequence, or one or more codon mutations may be introduced to provide for a convenient restriction site or other purpose involved with construction or expression.
  • the structural gene may be further modified by employing synthetic adapters, linkers to introduce one or more convenient restriction sites, or the like.
  • nucleic acid or amino acid sequences encoding a plant kappa hydroxylase of this invention may be combined with other non-native, or "heterologous", sequences in a variety of ways.
  • heterologous sequences is meant any sequence which is not naturally found joined to the plant kappa hydroxylase, including, for example, combination of nucleic acid sequences from the same plant which are not naturally found joined together.
  • the DNA sequence encoding a plant kappa hydroxylase of this invention may be employed in conjunction with all or part of the gene sequences normally associated with the kappa hydroxylase.
  • a DNA sequence encoding kappa hydroxylase is combined in a DNA construct having, in the 5' to 3' direction of transcription, a transcription initiation control region capable of promoting transcription and translation in a host cell, the DNA sequence encoding plant kappa hydroxylase and a transcription and translation termination region.
  • Potential host cells include both prokaryotic and eukaryotic cells.
  • a host cell may be unicellular or found in a multicellular differentiated or undifferentiated organism depending upon the intended use.
  • Cells of this invention may be distinguished by having a plant kappa hydroxylase foreign to the wild-type cell present therein, for example, by having a recombinant nucleic acid construct encoding a plant kappa hydroxylase therein.
  • the regulatory regions will vary, including regions from viral, plasmid or chromosomal genes, or the like.
  • a wide variety of constitutive or regulatable promoters may be employed. Expression in a microorganism can provide a ready source of the plant enzyme.
  • transcriptional initiation regions which have been described are regions from bacterial and yeast hosts, such as E. coli, B. subtilis, Saccharomyces cerevisiae, including genes such as beta-galactosidase, T7 polymerase, tryptophan E and the like.
  • the constructs will involve regulatory regions functional in plants which provide for modified production of plant kappa hydroxylase with resulting modification of the fatty acid composition.
  • the open reading frame, coding for the plant kappa hydroxylase or functional fragment thereof, will be joined at its 5' end to a transcription initiation regulatory region such as the wild-type sequence naturally found 5' upstream to the kappa hydroxylase structural gene.
  • Numerous other transcription initiation regions are available which provide for a wide variety of constitutive or regulatable, e.g., inducible, transcription of the structural gene functions.
  • transcriptional initiation regions used for plants are such regions associated with the structural genes such as for nopaline and mannopine synthases, or with napin, soybean β-conglycinin, oleosin, 12S storage protein, the cauliflower mosaic virus 35S promoters and the like.
  • the transcription/translation initiation regions corresponding to such structural genes are found immediately 5' upstream to the respective start codons.
  • the use of all or part of the complete plant kappa hydroxylase gene is desired; namely all or part of the 5' upstream non-coding regions (promoter) together with the structural gene sequence and 3' downstream non-coding regions may be employed.
  • a different promoter such as a promoter native to the plant host of interest or a modified promoter, i.e., having transcription initiation regions derived from one gene source and translation initiation regions derived from a different gene source, including the sequence encoding the plant kappa hydroxylase of interest, or enhanced promoters, such as double 35S CaMV promoters, the sequences may be joined together using standard techniques.
  • transcription initiation control regions from the B. napus napin gene, or the Arabidopsis 12S storage protein, or soybean β-conglycinin (Bray et al., 1987), or the L. fendleri kappa hydroxylase promoter described herein are desired.
  • Transcription initiation regions which are preferentially expressed in seed tissue, i.e., which are undetectable in other plant parts, are considered desirable for fatty acid modifications in order to minimize any disruptive or adverse effects of the gene product.
  • Transcript termination regions may be provided by the DNA sequence encoding the plant kappa hydroxylase or a convenient transcription termination region derived from a different gene source, for example, the transcript termination region which is naturally associated with the transcript initiation region. Where the transcript termination region is from a different gene source, it will contain at least about 0.5 kb, preferably about 1-3 kb of sequence 3' to the structural gene from which the termination region is derived.
  • Plant expression or transcription constructs having a plant kappa hydroxylase as the DNA sequence of interest for increased or decreased expression thereof may be employed with a wide variety of plant life, particularly, plant life involved in the production of vegetable oils for edible and industrial uses. Most especially preferred are temperate oilseed crops. Plants of interest include, but are not limited to, rapeseed (Canola and high erucic acid varieties), Crambe, Brassica juncea, Brassica nigra, meadowfoam, flax, sunflower, safflower, cotton, Cuphea, soybean, peanut, coconut and oil palms, and corn.
  • An important criterion in the selection of suitable plants for the introduction of the kappa hydroxylase is the presence in the host plant of a suitable substrate for the hydroxylase.
  • production of ricinoleic acid will be best accomplished in plants that normally have high levels of oleic acid in seed lipids.
  • production of lesquerolic acid will best be accomplished in plants that have high levels of icosenoic acid in seed lipids.
  • this invention is applicable to dicotyledonous and monocotyledonous species alike and will be readily applicable to new and/or improved transformation and regulation techniques.
  • the method of transformation is not critical to the current invention; various methods of plant transformation are currently available. As newer methods are available to transform crops, they may be directly applied hereunder. For example, many plant species naturally susceptible to Agrobacterium infection may be successfully transformed via tripartite or binary vector methods of Agrobacterium-mediated transformation. In addition, techniques of microinjection, DNA particle bombardment, and electroporation have been developed which allow for the transformation of various monocot and dicot plant species.
  • the various components of the construct or fragments thereof will normally be inserted into a convenient cloning vector which is capable of replication in a bacterial host, e.g., E. coli.
  • the plasmid may be isolated and subjected to further manipulation, such as restriction, insertion of new fragments, ligation, deletion, insertion, resection, etc., so as to tailor the components of the desired sequence.
  • Once the construct has been completed, it may then be transferred to an appropriate vector for further manipulation in accordance with the manner of transformation of the host cell.
  • included with the DNA construct will be a structural gene having the necessary regulatory regions for expression in a host and providing for selection of transformant cells.
  • the gene may provide for resistance to a cytotoxic agent, e.g., antibiotic, heavy metal, toxin, etc., complementation providing prototrophy to an auxotrophic host, viral immunity or the like.
  • one or more markers may be employed, where different conditions for selection are used for the different hosts.
  • the manner in which the DNA construct is introduced into the plant host is not critical to this invention. Any method which provides for efficient transformation may be employed.
  • Various methods for plant cell transformation include the use of Ti- or Ri-plasmids, microinjection, electroporation, infiltration, imbibition, DNA particle bombardment, liposome fusion, DNA bombardment or the like.
  • a vector may be used which may be introduced into the Agrobacterium host for homologous recombination with T-DNA or the Ti- or Ri-plasmid present in the Agrobacterium host.
  • the Ti- or Ri-plasmid containing the T-DNA for recombination may be armed (capable of causing gall formation) or disarmed (incapable of causing gall), the latter being permissible, so long as the vir genes are present in the transformed Agrobacterium host.
  • the armed plasmid can give a mixture of normal plant cells and gall.
  • the expression construct bordered by the T-DNA border(s) will be inserted into a broad host spectrum vector, there being broad host spectrum vectors described in the literature. Commonly used is pRK2 or derivatives thereof. See, for example, Ditta et al. (1980). Included with the expression construct and the T-DNA will be one or more markers, which allow for selection of transformed Agrobacterium and transformed plant cells. A number of markers have been developed for use with plant cells, such as resistance to kanamycin, the aminoglycoside G418, hygromycin, or the like. The particular marker employed is not essential to this invention, one or another marker being preferred depending on the particular host and the manner of construction.
  • For transformation of plant cells using Agrobacterium, explants may be combined and incubated with the transformed Agrobacterium for sufficient time for transformation, the bacteria killed, and the plant cells cultured in an appropriate selective medium. Once callus forms, shoot formation can be encouraged by employing the appropriate plant hormones in accordance with known methods and the shoots transferred to rooting medium for regeneration of plants. The plants may then be grown to seed and the seed used to establish repetitive generations and for isolation of vegetable oils.
  • the kappa hydroxylase encoded by the previously described fah12 gene from Castor (U.S. Patent application 08/320,982) was used to produce ricinoleic acid, lesquerolic acid, densipolic acid and auricolic acid in transgenic Arabidopsis plants. This example reduces to practice the method taught in Example 2 of the foregoing application.
  • Arabidopsis plants were transformed, by Agrobacterium-mediated transformation, with the kappa hydroxylase encoded by the Castor fah12 gene on binary Ti plasmid pB6. This plasmid was previously used to transform Nicotiana tabacum for the production of ricinoleic acid (U.S. Patent application 08/320,982).
  • Inoculums of Agrobacterium tumefaciens strain GV3101 containing binary Ti plasmid pB6 were plated on L-broth plates containing 50 μg/ml kanamycin and incubated for 2 days at 30°C. Single colonies were used to inoculate large liquid cultures (L-broth medium with 50 mg/l rifampicin, 110 mg/l gentamycin and 200 mg/l kanamycin) to be used for the transformation of Arabidopsis plants.
  • Arabidopsis plants were transformed by the in planta transformation procedure essentially as described by Bechtold et al. (1993).
  • Batches of 12-15 plants were grown for 3 to 4 weeks in natural light at a mean daily temperature of approximately 25°C in 3.5 inch pots containing soil. The intact plants were immersed in the bacterial suspension then transferred to a vacuum chamber and placed under 600 mm of vacuum produced by a laboratory vacuum pump until tissues appeared uniformly water-soaked (approximately
  • the plants were grown at 25°C under continuous light (100 μmol m⁻² s⁻¹ irradiation in the 400 to 700 nm range) for four weeks.
  • the seeds obtained from all the plants in a pot were harvested as one batch.
  • the seeds were sterilized by sequential treatment for 2 min with ethanol followed by 10 min in a mixture of household bleach (Chlorox), water and Tween-80 (50%, 50%, 0.05%), then rinsed thoroughly with sterile water.
  • the seeds were plated at high density (2000 to 4000 per plate) onto agar-solidified medium in 100 mm petri plates containing 1/2 X Murashige and Skoog salts medium enriched with B5 vitamins (Sigma Chemical Co., St.
  • DNA was extracted from young leaves from transformants to verify the presence of an intact fah12 gene.
  • the presence of the transgene in a number of the putative transgenic lines was verified by using the polymerase chain reaction to amplify the insert from pB6.
  • genomic DNA was added to a solution containing 25 pmol of each primer, 1.5 U Taq polymerase (Boehringer Mannheim), 200 μM of dNTPs, 50 mM KCl, 10 mM Tris.Cl (pH 9), 0.1% (v/v) Triton X-100, 1.5 mM MgCl2, 3% (v/v) formamide, to a final volume of 50 μl.
  • Amplification conditions were: 4 min denaturation step at 94°C, followed by 30 cycles of 92°C for 1 min, 55°C for 1 min, 72°C for 2 min. A final extension step closed the program at 72°C for 5 min.
  • Transformants could be positively identified after visualization of a characteristic 1 kb amplified fragment on an ethidium bromide stained agarose gel. All transgenic lines tested gave a PCR product of a size consistent with the expected genotype, confirming that the lines were, indeed, transgenic. All further experiments were done with three representative transgenic lines of the wild type designated as 1-3, 4D, 7-4 and one transgenic line of the fad2 mutant line JB12. The transgenic JB12 line was included in order to test whether the increased accumulation of oleic acid in this mutant would have an effect on the amount of ricinoleic acid that accumulated in the transgenic plants.
  • Fatty acid methyl esters were prepared by placing tissue in 1.5 ml of 1.0 M methanolic HCl (Supelco Co.) in a 13 x 100 mm glass screw-cap tube capped with a teflon-lined cap and heated to 80°C for 2 hours. Upon cooling, 1 ml petroleum ether was added and the FAMES removed by aspirating off the ether phase which was then dried under a nitrogen stream in a glass tube.
  • the samples were not split, the temperature program was 195°C for 18 min, increased to 230°C at 25°C/min, held at 230°C for 5 min then down to 195°C at 25°C/min, and flame ionization detectors were used.
  • the chromatographic elution time of methyl esters and O-TMS derivatives of ricinoleic acid, lesquerolic acid and auricolic acid was established by GC-MS of lipid samples from seeds of L. fendleri and comparison to published chromatograms of fatty acids from this species (Carlson et al., 1990).
  • an O-TMS-methyl-ricinoleate standard was prepared from ricinoleic acid obtained from Sigma Chemical Co. (St. Louis, MO).
  • O-TMS-methyl-lesqueroleate and O-TMS-methyl-auricoleate standards were prepared from triacylglycerols purified from seeds of L. fendleri.
  • Lipids extracted from transgenic tissues were analyzed by gas chromatography and mass spectrometry for the presence of hydroxylated fatty acids.
  • the average fatty acid composition of leaves in Arabidopsis wild type and fad2 mutant lines was reported by Miquel and Browse (1992).
  • Gas chromatograms of methylated and silylated fatty acids from seeds of wild type and a fah12 transgenic wild type plant are shown in Figures 3A and 3B, respectively. The profiles are very similar except for the presence of three small but distinct peaks at 14.3, 15.9 and 18.9 minutes. A very small peak at 20.15 min was also evident.
  • Table 1 Fatty acid composition of lipids from transgenic and wild type Arabidopsis. The values are the means obtained from analysis of samples from three independent transgenic lines, or three independent samples of wild type and fad2 lines.
  • Hewlett-Packard 5971 series mass selective detector was used in place of the flame ionization detector used in the previous experiment.
  • the spectra of the four new peaks in Figure 3B (peak numbers 10, 11, 12 and 13) are shown in Figures 4A-D, respectively. Comparison of the spectrum obtained for the standards with that obtained for the four peaks from the transgenic lines confirms the identity of the four new peaks.
  • On the basis of the three characteristic peaks at m/z 187, 270 and 299, peak 10 is unambiguously identified as O-TMS-methylricinoleate.
  • peak 11 is unambiguously identified as O-TMS-methyldensipoleate.
  • peak 12 is unambiguously identified as O-TMS-methyllesqueroleate.
  • peak 13 is unambiguously identified as O-TMS-methylauricoleate.
  • densipolic acid is produced by the action of an n-3 desaturase on ricinoleic acid.
  • Auricolic acid is produced by the action of an n-3 desaturase on lesquerolic acid. Because it is located in the endoplasmic reticulum, the fad3 desaturase is almost certainly responsible. This can be tested in the future by producing fah12-containing transgenic plants of the fad3-deficient mutant of Arabidopsis (similar experiments can be done with fad7 and fad8).
  • hydroxylated fatty acids produced in this example is less than desired for commercial production of ricinoleate and other hydroxylated fatty acids from plants
  • Additional improvements are envisioned to involve modification of the enzymes which cleave hydroxylated fatty acids from phosphatidylcholine, reduction in the activities of enzymes which degrade hydroxylated fatty acids and replacement of acyltransferases which transfer hydroxylated fatty acids to the sn-1, sn-2 and sn-3 positions of glycerolipids.
  • genes for these enzymes have not been described in the scientific literature, their utility in improving the level of production of hydroxylated fatty acids can be readily envisioned based on the results of biochemical investigations of ricinoleate synthesis.
  • Arabidopsis is not an economically important plant species, it is widely accepted by plant biologists as a model for higher plants. Therefore, the inclusion of this example is intended to demonstrate the general utility of the invention described here to the modification of oil composition in higher plants.
  • One advantage of studying the expression of this novel gene in Arabidopsis is the existence in this system of a large body of knowledge on lipid metabolism, as well as the availability of a collection of mutants which can be used to provide useful information on the biochemistry of fatty acid hydroxylation in plant species.
  • Another advantage is the ease of transposing any of the information obtained on metabolism of ricinoleate in Arabidopsis to closely related species such as the crop plants Brassica napus, Brassica juncea or Crambe abyssinica in order to mass produce ricinoleate, lesqueroleate or other hydroxylated fatty acids for industrial use.
  • the kappa hydroxylase is useful for the production of ricinoleate or lesqueroleate in any plant species that accumulates significant levels of the precursors, oleic acid and icosenoic acid.
  • Of particular interest are genetically modified varieties that accumulate high levels of oleic acid. Such varieties are currently available for sunflower and Canola.
  • Regions of nucleotide sequence that were conserved in both the Castor kappa hydroxylase and the Arabidopsis fad2 Δ12 fatty acid desaturase were used to design oligonucleotide primers. These were used with genomic DNA from Lesquerella fendleri to amplify fragments of several homologous genes. These amplified fragments were then used as hybridization probes to identify full length genomic clones from a genomic library of L. fendleri. Hydroxylated fatty acids are specific to the seed tissue of Lesquerella sp., and are not found to any appreciable extent in vegetative tissues.
  • One of the two genes identified by this method was expressed in both leaves and developing seeds and is therefore thought to correspond to the Δ12 fatty acid desaturase.
  • the other gene was expressed at high levels in developing seeds but was not expressed or was expressed at very low levels in leaves and is the kappa hydroxylase from this species.
  • the identity of this gene will be established by introducing the gene into transgenic Arabidopsis plants and showing that it causes the accumulation of ricinoleic acid, lesquerolic acid, densipolic acid and auricolic acid in seed lipids.
  • the promoter of this gene is also of utility because it is able to direct expression of a gene specifically in developing seeds at a time when storage lipids are accumulating. This promoter is, therefore, of great utility for many applications in the genetic engineering of seeds, particularly in members of the Brassicaceae.
  • Oligonucleotide primers for the amplification of the L. fendleri kappa hydroxylase were designed by choosing regions of high deduced amino acid sequence homology between the Castor kappa hydroxylase and the Arabidopsis Δ12 desaturase (fad2). Because most amino acids are encoded by several different codons, these oligonucleotides were designed to include all possible codons that could encode the corresponding amino acids.
  • Oligo 1 TAYWSNCAYMGNMGNCAYCA (SEQ ID NO:14)
  • oligonucleotides were used to amplify a fragment of DNA from L. fendleri genomic DNA by the polymerase chain reaction (PCR) using the following conditions: Approximately 100 ng of genomic DNA was added to a solution containing 25 pmol of each primer, 1.5 U Taq polymerase (Boehringer
  • PCR products of approximately 540 bp were observed following electrophoretic separation of the products of the PCR reaction in agarose gels. Two of these fragments were cloned into pBluescript (Stratagene) to give rise to plasmids pLesq2 and pLesq3. The sequence of the inserts in these two plasmids was determined by the chain termination method. The sequence of the insert in pLesq2 is presented as Figure 5 (SEQ ID NO:1) and the sequence of the insert in pLesq3 is presented as Figure 6 (SEQ ID NO:2). The high degree of sequence identity between the two clones indicated that they were both potential candidates to be either a Δ12 desaturase or a kappa hydroxylase.
  • hydroxylated fatty acids are found in large amounts in seed oils but are not found in appreciable amounts in leaves. Therefore, an important criterion in discriminating between a fatty acyl desaturase and kappa hydroxylase is that the kappa hydroxylase gene is expected to be expressed more highly in tissues which have high levels of hydroxylated fatty acids than in other tissues, whereas all plant tissues should contain mRNA for an ω6 fatty acyl desaturase since diunsaturated fatty acids are found in the lipids of all tissues in most or all plants. Therefore, it was of great interest to determine whether the gene corresponding to pLesq2 was expressed only in seeds, or is also expressed in other tissues.
  • RNA prepared as described above from leaves and developing seeds was electrophoresed through an agarose gel containing formaldehyde (Iba et al., 1993). An equal quantity (10 μg) of RNA was loaded in both lanes, and RNA standards (0.16-1.77 kb ladder, Gibco-BRL) were loaded in a third lane. Following electrophoresis, RNA was transferred from the gel to a nylon membrane (Hybond N+, Amersham) and fixed to the filter by exposure to UV light. A 32P-labelled probe was prepared from insert DNA of clone pLesq2 by random priming and hybridized to the membrane overnight at 52°C, after it had been prehybridized for 2 h.
  • the prehybridization solution contained 5X SSC, 10X Denhardt's solution, 0.1% SDS, 0.1 M KPO4 pH 6.8, 100 μg/ml salmon sperm DNA.
  • the hybridization solution had the same basic composition, but no SDS, and it contained 10% dextran sulfate and 30% formamide.
  • the blot was washed once in 2X SSC, 0.5% SDS at 65°C then in 1X SSC at the same temperature.
  • Genomic DNA was prepared from young leaves of L. fendleri as described by Murray and Thompson (1980).
  • a Sau3AI-partial digest genomic library constructed in the vector λDashII (Stratagene, 11011 North Torrey Pines Road, La Jolla CA 92037) was prepared by partially digesting 500 μg of DNA, size-selecting the DNA on a sucrose gradient (Sambrook et al., 1989), and ligating the DNA (12 kb average size) to the BamHI-digested arms of λDashII. The entire ligation was packaged according to the manufacturer's conditions and plated on E. coli strain XL1-Blue MRA-P2 (Stratagene).
  • the library was then amplified according to the manufacturer's conditions. A fraction of the genomic library was plated on E. coli XL1-Blue and resulting plaques (150,000) were lifted to charged nylon membranes (Hybond N+, Amersham), according to the manufacturer's conditions. DNA was crosslinked to the filters under UV in a Stratalinker (Stratagene).
  • the sequence of the insert in clone pLesq-Hyd is shown in Figures 8A-B.
  • the sequence entails 1855 bp of contiguous DNA sequence (SEQ ID NO:3).
  • the clone encodes a 401 bp 5' untranslated region (i.e., nucleotides preceding the first ATG codon), an 1152 bp open reading frame, and a 302 bp 3' untranslated region.
  • the open reading frame encodes a 384 amino acid protein with a predicted molecular weight of 44,370 (SEQ ID NO:4).
  • the amino terminus lacks features of a typical signal peptide (von Heijne, 1985).
  • the Arabidopsis fad2 cDNA which encodes an endoplasmic reticulum-localized Δ12 desaturase (called fad2) (Okuley et al., 1994), two soybean fad2 desaturase clones, a Brassica napus fad2 clone, a Zea mays fad2 clone and partial sequence of a R. communis fad2 clone.
  • the high degree of sequence homology indicates that the gene products are of similar function. For instance, the overall homology between the Lesquerella hydroxylase and the Arabidopsis fad2 desaturase was 92.2% similarity and 84.8% identity and the two sequences differed in length by only one amino acid.
  • Genomic DNA (5 μg) was digested with EcoR I, Hind III and Xba I and separated on a 0.9% agarose gel. DNA was alkali-blotted to a charged nylon membrane (Hybond N+, Amersham), according to the manufacturer's protocol.
  • the blot was prehybridized for 2 hours at 65°C in 7% SDS, 1 mM EDTA, 0.25 M Na2HPO4 (pH 7.2), 1% BSA and hybridized to the probe for 16 hours in the same solution with pLesq-Hyd insert PCR-amplified with internal primers and labelled with 32P by random priming.
  • the filters were sequentially washed at 65°C in solutions containing 2 X SSC, 1 X SSC, 0.5 X SSC in addition to 0.1 % SDS, then exposed to X-ray film.
  • the probe hybridized with a single band in each digest of L. fendleri DNA (Figure 10), indicating that the gene from which pLesq-Hyd was transcribed is present in a single copy in the L. fendleri genome.
  • Electroporations employed a Bio-Rad Gene Pulser instrument using cold 2 mm-gap cuvettes containing 40 μl cells and 1 μl of DNA in water, at a voltage of 2.5 kV, and 200 Ohms resistance.
  • the electroporated cells were diluted with 1 ml SOC medium (Sambrook et al., 1989, page A2) and incubated at 28°C for 2-4 h before plating on medium containing kanamycin (50 mg/l).
  • Arabidopsis thaliana can be transformed with the Agrobacterium cells containing pTi-Hyd as described in Example 1 above. Similarly, the presence of hydroxylated fatty acids in the transgenic Arabidopsis plants can be demonstrated by the methods described in Example 1 above.
  • a 1.5 kb EcoR I fragment from pLesq-Hyd comprising the entire coding region of the hydroxylase was gel purified, then cloned into the corresponding site of pBluescript KS (Stratagene). Plasmid DNA from a number of recombinant clones was then restricted with Pst I, which should cut only once in the insert and once in the vector polylinker sequence. Release of a 920 bp fragment with Pst I indicated the right orientation of the insert for further manipulations. DNA from one such clone was further restricted with Sal I, the 5' overhangs filled-in with the Klenow fragment of DNA polymerase I, then cut with Sac I.
  • the insert fragment was gel purified, and cloned between the Sma I and Sac I sites of pBI121 (Clontech) behind the Cauliflower Mosaic Virus 35S promoter. After checking that the sequence of the junction between insert and vector DNA was appropriate, plasmid DNA from a recombinant clone was used to transform A. tumefaciens (GV3101). Kanamycin resistant colonies were then used for in planta transformation of A. thaliana as previously described. DNA was extracted from kanamycin resistant seedlings and used to PCR-amplify selected fragments from the hydroxylase using nested primers.
  • Figures 9A-B show a sequence alignment of the castor and L. fendleri hydroxylase sequences with the castor hydroxylase sequence and all publicly available sequences for all plant microsomal Δ12 fatty acid desaturases.
  • Of the 384 amino acid residues in the castor hydroxylase sequence, more than 95% are identical to the corresponding residue in at least one of the desaturase sequences. Therefore, none of these residues are responsible for the catalytic differences between the hydroxylase and the desaturases.
  • the numbering may vary from protein to protein but the intent of the number system will be evident if the protein in question is aligned with the castor hydroxylase using the numbering system shown herein.
  • the structural criterion disclosed here teaches how to isolate and identify plant kappa hydroxylase genes for the purpose of genetically modifying fatty acid composition.
  • TL-DNA gene 5 controls the tissue-specific expression of chimeric genes carried by a novel type of Agrobacterium binary vector. Mol. Gen. Genet. 204, 383-396.
  • Arabidopsis FAD2 gene encodes the enzyme that is essential for polyunsaturated lipid
  • GANCCTTCCA TTTAAACCCT CTCTCGTGCT ATTCACCAGA 24
  • MOLECULE TYPE DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: CGGTACCAGA AAACGCCTTG 20
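
The degenerate primer listed above as Oligo 1 (SEQ ID NO:14) is described as covering all codons that could encode the chosen amino acids. As a minimal sketch, assuming the primer is written in standard IUPAC ambiguity codes (the table and the function name `degeneracy` are illustrative and not part of the patent), the number of distinct sequences in the primer pool can be counted as follows:

```python
# Number of nucleotides each IUPAC code can stand for
# (assumption: the primer uses the standard IUPAC ambiguity alphabet).
IUPAC_DEGENERACY = {
    "A": 1, "C": 1, "G": 1, "T": 1,
    "R": 2, "Y": 2, "S": 2, "W": 2, "K": 2, "M": 2,
    "B": 3, "D": 3, "H": 3, "V": 3,
    "N": 4,
}


def degeneracy(primer: str) -> int:
    """Return how many distinct sequences a degenerate primer represents."""
    total = 1
    for base in primer.upper():
        total *= IUPAC_DEGENERACY[base]
    return total


OLIGO_1 = "TAYWSNCAYMGNMGNCAYCA"  # Oligo 1 as printed in the text

print(f"length: {len(OLIGO_1)} nt, pool size: {degeneracy(OLIGO_1)} sequences")
```

A count like this is only a rough guide to primer design; it says nothing about annealing behaviour under the PCR conditions described in the text.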


Abstract

This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants.

Description

PRODUCTION OF HYDROXYLATED FATTY ACIDS IN GENETICALLY MODIFIED PLANTS
TECHNICAL FIELD
The present invention concerns the identification of nucleic acid sequences and constructs, and methods related thereto, and the use of these sequences and constructs to produce genetically modified plants for the purpose of altering the fatty acid composition of plant oils, waxes and related compounds.
DEFINITIONS
The subject of this invention is a class of enzymes that introduce a hydroxyl group into several different fatty acids resulting in the production of several different kinds of hydroxylated fatty acids. In particular, these enzymes catalyze hydroxylation of oleic acid to 12-hydroxy oleic acid and icosenoic acid to 14-hydroxy icosenoic acid. Other fatty acids such as palmitoleic and erucic acids may also be substrates. Since it is not possible to refer to the enzyme by reference to a unique substrate or product, we refer to the enzyme throughout as kappa hydroxylase to indicate that the enzyme introduces the hydroxyl three carbons distal (i.e., away from the carboxyl carbon of the acyl chain) from a double bond located near the center of the acyl chain.
The following fatty acids are also the subject of this invention: ricinoleic acid, 12-hydroxyoctadec-cis-9-enoic acid (12OH-18:1cisΔ9); lesquerolic acid, 14-hydroxy-cis-11-icosenoic acid (14OH-20:1cisΔ11); densipolic acid, 12-hydroxyoctadec-cis-9,15-dienoic acid (12OH-18:2cisΔ9,15); auricolic acid, 14-hydroxy-cis-11,17-icosadienoic acid (14OH-20:2cisΔ11,17); hydroxyerucic acid, 16-hydroxydocos-cis-13-enoic acid (16OH-22:1cisΔ13); hydroxypalmitoleic acid, 12-hydroxyhexadec-cis-9-enoic acid (12OH-16:1cisΔ9); and icosenoic acid (20:1cisΔ11). It will be noted that icosenoic acid is spelled eicosenoic acid in some countries.
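The shorthand used for these fatty acids (optional hydroxyl position, chain length, number of double bonds, and cis double-bond positions counted from the carboxyl carbon) can be read mechanically. The following Python sketch is illustrative only and is not part of the disclosure; the plain-text form `14OH-20:1cisΔ11`, the `FattyAcid` record, and the function name `parse_shorthand` are assumptions made for this example.

```python
import re
from typing import NamedTuple, Optional, Tuple


class FattyAcid(NamedTuple):
    hydroxyl_position: Optional[int]  # carbon bearing the OH group, or None
    chain_length: int                 # number of carbons in the acyl chain
    double_bonds: int                 # number of double bonds
    bond_positions: Tuple[int, ...]   # cis double-bond positions from the carboxyl end


# Hypothetical plain-text rendering of the shorthand, e.g. "12OH-18:2cisΔ9,15".
_PATTERN = re.compile(r"^(?:(\d+)OH-)?(\d+):(\d+)(?:cisΔ(\d+(?:,\d+)*))?$")


def parse_shorthand(text: str) -> FattyAcid:
    """Split a shorthand fatty acid name into its numeric components."""
    match = _PATTERN.match(text)
    if match is None:
        raise ValueError(f"unrecognized shorthand: {text!r}")
    hydroxyl, chain, bonds, positions = match.groups()
    bond_positions = tuple(int(p) for p in positions.split(",")) if positions else ()
    if len(bond_positions) != int(bonds):
        raise ValueError("number of double bonds does not match listed positions")
    return FattyAcid(
        int(hydroxyl) if hydroxyl else None,
        int(chain),
        int(bonds),
        bond_positions,
    )


if __name__ == "__main__":
    for name in ("12OH-18:1cisΔ9", "14OH-20:2cisΔ11,17", "20:1cisΔ11"):
        print(name, parse_shorthand(name))
```

A saturated fatty acid such as `16:0` parses with an empty tuple of double-bond positions.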
BACKGROUND
Extensive surveys of the fatty acid composition of seed oils from different species of higher plants have resulted in the identification of at least 33 structurally distinct monohydroxylated plant fatty acids, and 12 different polyhydroxylated fatty acids that are accumulated by one or more plant species (reviewed by van de Loo et al., 1993). Ricinoleic acid, the principal constituent of the seed oil from the castor plant Ricinus communis (L.), is of commercial importance. We have previously described the cloning of a gene from this species that encodes a fatty acid hydroxylase, and the use of this gene to produce ricinoleic acid in transgenic plants of other species. The scientific evidence supporting this claim was published in 1995. The use of the castor hydroxylase gene to also produce other hydroxylated fatty acids such as lesquerolic acid, densipolic acid, hydroxypalmitoleic, hydroxyerucic and auricolic acid in transgenic plants is the subject of this invention. In addition, the identification of a gene encoding a homologous hydroxylase from Lesquerella fendleri, and the use of this gene to produce these hydroxylated fatty acids in transgenic plants is the subject of this invention.
Castor is a minor oilseed crop. Approximately 50% of the seed weight is oil (triacylglycerol) in which 85-90% of total fatty acids are the hydroxylated fatty acid, ricinoleic acid. Oil pressed or extracted from castor seeds has many industrial uses based upon the properties endowed by the hydroxylated fatty acid. The most important uses are production of paints and varnishes, nylon-type synthetic polymers, resins, lubricants, and cosmetics (Atsmon 1989) . In addition to oil, the castor seed contains the extremely toxic protein ricin, allergenic proteins, and the alkaloid ricinine. These constituents preclude the use of the untreated seed meal (following oil extraction) as a livestock feed, normally an important economic aspect of oilseed utilization.
Furthermore, with the variable nature of castor plants and a lack of investment in breeding, castor has few favorable agronomic characteristics. For a combination of these reasons, castor is no longer grown in the United States and the development of an alternative domestic source of hydroxylated fatty acids would be attractive. The production of ricinoleic acid, the important constituent of castor oil, in an established oilseed crop through genetic engineering would be a particularly effective means of creating a domestic source.
Because there is no practical source of lesquerolic, densipolic and auricolic acids from plants that are adapted to modern agricultural practices, there is currently no large-scale use of these fatty acids by industry. However, the fatty acids would have uses similar to those of ricinoleic acid if they could be produced in large quantities at comparable cost to other plant-derived fatty acids (Smith 1985). Plant species, such as certain species in the genus Lesquerella, that accumulate a high proportion of these fatty acids, have not been domesticated and are not currently considered a practical source of fatty acids (Hirsinger, 1989). This invention represents a useful step toward the eventual production of these and other hydroxylated fatty acids in transgenic plants of agricultural importance.
The taxonomic relationships between plants having similar or identical kinds of unusual fatty acids have been examined (van de Loo et al., 1993). In some cases, particular fatty acids occur mostly or solely in related taxa. In other cases there does not appear to be a direct link between taxonomic relationships and the occurrence of unusual fatty acids. In this respect, ricinoleic acid has now been identified in 12 genera from 10 families (reviewed in van de Loo et al., 1993). Thus, it appears that the ability to synthesize hydroxylated fatty acids has evolved several times independently during the radiation of the angiosperms. This suggested to us that the enzymes which introduce hydroxyl groups into fatty acids arose by minor modifications of a related enzyme. Indeed, as shown herein, the sequence similarity between Δ12 fatty acid desaturases and the kappa hydroxylase from castor is so high that it is not possible to unambiguously determine whether a particular enzyme is a desaturase or a hydroxylase on the basis of evidence in the scientific literature. Similarly, a patent application (WO 94/11516) that purports to teach the isolation and use of Δ12 fatty acid desaturases does not teach how to distinguish a hydroxylase from a desaturase. In view of the importance of being able to distinguish between these activities for the purpose of genetic engineering of plant oils, the utility of that application is limited to the several instances where direct experimental evidence (e.g., altered fatty acid composition in transgenic plants) was presented to support the assignment of function. A method for distinguishing between fatty acid desaturases and fatty acid hydroxylases on the basis of amino acid sequence of the enzyme is also a subject of this invention.
A feature of hydroxylated or other unusual fatty acids is that they are generally confined to seed triacylglycerols, being largely excluded from the polar lipids by unknown mechanisms (Battey and Ohlrogge, 1989; Prasad et al., 1987). This is particularly intriguing since diacylglycerol is a precursor of both triacylglycerol and polar lipid. With castor microsomes, there is some evidence that the pool of ricinoleoyl-containing polar lipid is minimized by a preference of diacylglycerol acyltransferase for ricinoleate-containing diacylglycerols (Bafor et al., 1991). Analyses of vegetative tissues have generated few reports of unusual fatty acids, other than those occurring in the cuticle. The cuticle contains various hydroxylated fatty acids which are interesterified to produce a high molecular weight polyester which serves a structural role. A small number of other exceptions exist in which unusual fatty acids are found in tissues other than the seed.
The biosynthesis of ricinoleic acid from oleic acid in the developing endosperm of castor (Ricinus communis) has been studied by a variety of methods. Morris (1967) established in double-labeling studies that hydroxylation occurs directly by hydroxyl substitution rather than via an unsaturated-, keto- or epoxy-intermediate. Hydroxylation using oleoyl-CoA as precursor can be demonstrated in crude preparations or microsomes, but activity in microsomes is unstable and variable, and isolation of the microsomes involved a considerable, or sometimes complete, loss of activity (Galliard and Stumpf, 1966; Moreau and Stumpf, 1981). Oleic acid can replace oleoyl-CoA as a precursor, but only in the presence of CoA, Mg2+ and ATP (Galliard and Stumpf, 1966), indicating that activation to the acyl-CoA is necessary. However, no radioactivity could be detected in ricinoleoyl-CoA (Moreau and Stumpf, 1981). These and more recent observations (Bafor et al., 1991) have been interpreted as evidence that the substrate for the castor oleate hydroxylase is oleic acid esterified to phosphatidylcholine or another phospholipid.
The hydroxylase is sensitive to cyanide and azide, and dialysis against metal chelators reduces activity, which could be restored by addition of FeSO4, suggesting iron involvement in enzyme activity (Galliard and Stumpf, 1966). Ricinoleic acid synthesis requires molecular oxygen (Galliard and Stumpf, 1966; Moreau and Stumpf, 1981) and requires NAD(P)H to reduce cytochrome b5, which is thought to be the intermediate electron donor for the hydroxylase reaction (Smith et al., 1992). Carbon monoxide does not inhibit hydroxylation, indicating that a cytochrome P450 is not involved (Galliard and Stumpf, 1966; Moreau and Stumpf, 1981). Data from a study of the substrate specificity of the hydroxylase show that all substrate parameters (i.e., chain length and double bond position with respect to both ends) are important; deviations in these parameters caused reduced activity relative to oleic acid (Howling et al., 1972). The position at which the hydroxyl was introduced, however, was determined by the position of the double bond, always being three carbons distal. Thus, the castor acyl hydroxylase enzyme can produce a family of different hydroxylated fatty acids depending on the availability of substrates. As a matter of convenience, we therefore refer to the enzyme throughout as a kappa hydroxylase (rather than an oleate hydroxylase) to indicate the broad substrate specificity.
The castor kappa hydroxylase has many superficial similarities to the microsomal fatty acyl desaturases (Browse and Somerville, 1991). In particular, plants have a microsomal oleate desaturase active at the Δ12 position. The substrate of this enzyme (Schmidt et al., 1993) and of the hydroxylase (Bafor et al., 1991) appears to be a fatty acid esterified to the sn-2 position of phosphatidylcholine. When oleate is the substrate, the modification occurs at the same position (Δ12) in the carbon chain, and requires the same cofactors, namely electrons from NADH via cytochrome b5 and molecular oxygen. Neither enzyme is inhibited by carbon monoxide (Moreau and Stumpf, 1981), the characteristic inhibitor of cytochrome P450 enzymes.
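The positional rule stated above (the hydroxyl is introduced three carbons distal to the double bond, i.e., away from the carboxyl carbon) can be illustrated with a short Python sketch. This is a minimal illustration under stated assumptions, not a model of enzyme activity: the function name `kappa_hydroxyl_position` and the `SUBSTRATES` table are hypothetical, and the chain lengths and double-bond positions are those of the monoenoic substrates named in this document.

```python
def kappa_hydroxyl_position(chain_length: int, double_bond_position: int) -> int:
    """Return the carbon expected to carry the hydroxyl group.

    Positions are counted from the carboxyl carbon (carbon 1). The rule applied
    is the one described in the text: three carbons distal to the double bond.
    """
    if not 1 <= double_bond_position < chain_length:
        raise ValueError("double bond position must lie within the acyl chain")
    position = double_bond_position + 3
    if position > chain_length:
        raise ValueError("chain is too short to be hydroxylated three carbons distal")
    return position


# Hypothetical lookup table: substrate name -> (chain length, cis double-bond position).
SUBSTRATES = {
    "oleic acid": (18, 9),
    "icosenoic acid": (20, 11),
    "erucic acid": (22, 13),
    "palmitoleic acid": (16, 9),
}

if __name__ == "__main__":
    for name, (chain, bond) in SUBSTRATES.items():
        print(f"{name}: hydroxyl at carbon {kappa_hydroxyl_position(chain, bond)}")
```

For the four substrates listed, the sketch reproduces the hydroxyl positions of ricinoleic (12), lesquerolic (14), hydroxyerucic (16) and hydroxypalmitoleic (12) acids.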
There do not appear to have been any published biochemical studies of the properties of the hydroxylase enzyme(s) in Lesquerella.
Conceptual basis of the invention
An earlier application has described the use of a cDNA clone from castor for the production of ricinoleic acid in transgenic plants. As noted above, biochemical studies by others had suggested that the castor hydroxylase may not have strict specificity for oleic acid but would also catalyze hydroxylation of other fatty acids such as icosenoic acid (20:1cisΔ11) (Howling et al., 1972). Based on these studies, the expression of the castor hydroxylase in transgenic plants of species such as Brassica napus and Arabidopsis thaliana that accumulate fatty acids such as icosenoic acid (20:1cisΔ11) and erucic acid (13-docosenoic acid; 22:1cisΔ13) would be expected to accumulate some of the hydroxylated derivatives of these fatty acids due to the activity of the hydroxylase on these fatty acids. We have now obtained additional direct evidence for such a claim based on the production of ricinoleic, lesquerolic, densipolic and auricolic fatty acids in transgenic Arabidopsis plants and have included such evidence herein as Example 1.
We have previously disclosed the various methods by which the castor hydroxylase clone and sequences derived thereof could be used to identify other hydroxylase clones from plant species such as Lesquerella fendleri that are known to accumulate hydroxylated fatty acids in seed oils. In this continuation we have provided an example of the use of that aspect of the invention for the isolation of a novel hydroxylase gene from Lesquerella fendleri .
In view of the high degree of sequence similarity between Δ12 fatty acid desaturases and the castor hydroxylase (van de Loo et al., 1995), the validity of claims for the use of desaturase or hydroxylase genes or sequences derived therefrom for the identification of genes of identical function from other species must be viewed with skepticism. In this application, we teach a method by which hydroxylase genes can be distinguished from desaturases and describe methods by which Δ12 desaturases can be converted to hydroxylases by the modification of the gene encoding the desaturases. A mechanistic basis for the similar reaction mechanisms of desaturases and hydroxylases was presented in an earlier patent application. Briefly, the available evidence suggests that fatty acid desaturases have a similar reaction mechanism to the bacterial enzyme methane monooxygenase which catalyses a reaction involving oxygen-atom transfer (CH4 → CH3OH) (van de Loo et al., 1993). The cofactor in the hydroxylase component of methane monooxygenase is termed a μ-oxo bridged diiron cluster (FeOFe). The two iron atoms of the FeOFe cluster are liganded by protein-derived nitrogen or oxygen atoms, and are tightly redox-coupled by the covalently-bridging oxygen atom. The FeOFe cluster accepts two electrons, reducing it to the diferrous state, before oxygen binding. Upon oxygen binding, it is likely that heterolytic cleavage also occurs, leading to a high-valent oxoiron reactive species that is stabilized by resonance rearrangements possible within the tightly coupled FeOFe cluster. The stabilized high-valent oxoiron state of methane monooxygenase is capable of proton extraction from methane, followed by oxygen transfer, giving methanol. The FeOFe cofactor has been shown to be directly relevant to plant fatty acid modifications by the demonstration that castor stearoyl-ACP desaturase contains this type of cofactor (Fox et al., 1993).
On the basis of the foregoing considerations, we hypothesized that the castor oleate hydroxylase is a structurally modified fatty acyl desaturase, based upon three arguments. The first argument involves the taxonomic distribution of plants containing ricinoleic acid. Ricinoleic acid has been found in 12 genera of 10 families of higher plants (reviewed in van de Loo et al., 1993). Thus, plants in which ricinoleic acid occurs are found throughout the plant kingdom, yet close relatives of these plants do not contain the unusual fatty acid. This pattern suggests that the ability to synthesize ricinoleic acid has arisen (and been lost) several times independently, and is therefore a quite recent divergence. In other words, the ability to synthesize ricinoleic acid has evolved rapidly, suggesting that a relatively minor genetic change in the structure of the ancestral enzyme was necessary to accomplish it. The second argument is that many biochemical properties of castor kappa hydroxylase are similar to those of the microsomal desaturases, as discussed above (e.g., both preferentially act on fatty acids esterified to the sn-2 position of phosphatidylcholine, both use cytochrome b5 as an intermediate electron donor, both are inhibited by cyanide, both require molecular oxygen as a substrate, both are thought to be located in the endoplasmic reticulum) .
The third argument stems from the discussion of oxygenase cofactors above, in which it is suggested that the plant membrane bound fatty acid desaturases may have a μ-oxo bridged diiron cluster-type cofactor, and that such cofactors are capable of catalyzing both fatty acid desaturations and hydroxylations, depending upon the electronic and structural properties of the protein active site.
Taking these three arguments together, it was hypothesized that kappa hydroxylase of castor endosperm is homologous to the microsomal oleate Δ12 desaturase found in all plants. The evidence supporting this hypothesis has been disclosed.
A number of genes encoding microsomal Δ12 desaturases from various species have recently been cloned (Okuley et al., 1994) and substantial information about the structure of these enzymes is now known. Hence, in the following invention we teach how to use structural information about fatty acyl desaturases to isolate kappa hydroxylase genes of this invention. This example teaches the method by which any carbon-monoxide insensitive plant fatty acyl hydroxylase gene can be identified by one skilled in the art.
BRIEF DESCRIPTION OF THE DRAWINGS
Figures 1A-D show the mass spectra of hydroxy fatty acid standards (Figure 1A, O-TMS-methylricinoleate; Figure 1B, O-TMS-methyldensipoleate; Figure 1C, O-TMS-methyllesqueroleate; and Figure 1D, O-TMS-methylauricoleate).
Figure 2 shows the fragmentation pattern of trimethylsilylated methyl esters of hydroxy fatty acids.
Figure 3A shows the gas chromatogram of fatty acids extracted from seeds of wild type Arabidopsis plants. Figure 3B shows the gas chromatogram of fatty acids extracted from seeds of transgenic Arabidopsis plants containing the fah12 hydroxylase gene. The numbers indicate the following fatty acids: [1] 16:0; [2] 18:0; [3] 18:1cisΔ9; [4] 18:2cisΔ9,12; [5] 20:0; [6] 20:1cisΔ11; [7] 18:3cisΔ9,12,15; [8] 22:1cisΔ13; [9] 24:1cisΔ13; [10] ricinoleic acid; [11] densipolic acid; [12] lesquerolic acid; [13] auricolic acid.
Figures 4A-D show the mass spectra of novel fatty acids found in seeds of transgenic plants. Figure 4A shows the mass spectrum of peak 10 from Figure 3B. Figure 4B shows the mass spectrum of peak 11 from Figure 3B. Figure 4C shows the mass spectrum of peak 12 from Figure 3B. Figure 4D shows the mass spectrum of peak 13 from Figure 3B.
Figure 5 shows the nucleotide sequence of pLesq2 (SEQ ID NO:1).
Figure 6 shows the nucleotide sequence of pLesq3 (SEQ ID NO:2).
Figure 7 shows a Northern blot of total RNA from seeds of L. fendleri probed with pLesq2 or pLesq3. S indicates RNA is from seeds; L indicates RNA is from leaves.
Figures 8A-B show the nucleotide sequence of genomic clone encoding pLesq-HYD (SEQ ID NO:3), and the deduced amino acid sequence of hydroxylase enzyme encoded by the gene (SEQ ID NO:4) .
Figures 9A-B show multiple sequence alignment of deduced amino acid sequences for kappa hydroxylases and microsomal Δ12 desaturases. Abbreviations are: Rcfah12, fah12 hydroxylase gene from R. communis (van de Loo et al., 1995); Lffah12, kappa hydroxylase gene from L. fendleri; Atfad2, fad2 desaturase from Arabidopsis thaliana (Okuley et al., 1994); Gmfad2-1, fad2 desaturase from Glycine max (GenBank accession number L43920); Gmfad2-2, fad2 desaturase from Glycine max (GenBank accession number L43921); Zmfad2, fad2 desaturase from Zea mays (WO 94/11516); Rcfad2, fragment of fad2 desaturase from R. communis (WO 94/11516); Bnfad2, fad2 desaturase from Brassica napus (WO 94/11516); LFFAH12.AMI, SEQ ID NO:4; FAH12.AMI, SEQ ID NO:5; ATFAD2.AMI, SEQ ID NO:6; BNFAD2.AMI, SEQ ID NO:7; GMFAD2-1.AMI, SEQ ID NO:8; GMFAD2-2.AMI, SEQ ID NO:9; ZMFAD2.AMI, SEQ ID NO:10; and RCFAD2.AMI, SEQ ID NO:11.
Figure 10 shows a Southern blot of genomic DNA from L. fendleri probed with pLesq-HYD. E = EcoRI, H = HindIII, X = XbaI.
Figure 11 shows a map of binary Ti plasmid pSLJ44024.
SUMMARY OF THE INVENTION
This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants.
In a first embodiment, this invention is directed to recombinant DNA constructs which can provide for the transcription or transcription and translation (expression) of the plant kappa hydroxylase sequence. In particular, constructs which are capable of transcription or transcription and translation in plant host cells are preferred. Such constructs may contain a variety of regulatory regions including transcriptional initiation regions obtained from genes preferentially expressed in plant seed tissue. In a second aspect, this invention relates to the presence of such constructs in host cells, especially plant host cells which have an expressed plant kappa hydroxylase therein.
In yet another aspect, this invention relates to a method for producing a plant kappa hydroxylase in a host cell or progeny thereof via the expression of a construct in the cell. Cells containing a plant kappa hydroxylase as a result of the production of the plant kappa hydroxylase encoding sequence are also contemplated herein. In another embodiment, this invention relates to methods of using a DNA sequence encoding a plant kappa hydroxylase for the modification of the proportion of hydroxylated fatty acids produced within a cell, especially plant cells. Plant cells having such a modified hydroxylated fatty acid composition are also contemplated herein.
In a further aspect of this invention, plant kappa hydroxylase proteins and sequences which are related thereto, including amino acid and nucleic acid sequences, are contemplated. Plant kappa hydroxylase exemplified herein includes a Lesquerella fendleri fatty acid hydroxylase. This exemplified fatty acid hydroxylase may be used to obtain other plant fatty acid hydroxylases of this invention. In a further aspect of this invention, a nucleic acid sequence which directs the seed specific expression of an associated polypeptide coding sequence is described. The use of this nucleic acid sequence or fragments derived thereof, to obtain seed-specific expression in higher plants of any coding sequence is contemplated herein.
DETAILED DESCRIPTION OF THE INVENTION
A genetically transformed plant of the present invention which accumulates hydroxylated fatty acids can be obtained by expressing the double-stranded DNA molecules described in this application.
A plant fatty acid hydroxylase of this invention includes any sequence of amino acids, such as a protein, polypeptide or peptide fragment, or nucleic acid sequences encoding such polypeptides, obtainable from a plant source which demonstrates the ability to catalyze the production of ricinoleic, lesquerolic, hydroxyerucic (16-hydroxydocos-cis-13-enoic acid) or hydroxypalmitoleic (12-hydroxyhexadec-cis-9-enoic) acid from CoA, ACP or lipid-linked monoenoic fatty acid substrates under plant enzyme reactive conditions. By "enzyme reactive conditions" is meant that any necessary conditions are available in an environment (i.e., such factors as temperature, pH, lack of inhibiting substances) which will permit the enzyme to function.
Preferential activity of a plant fatty acid hydroxylase toward a particular fatty acyl substrate is determined upon comparison of hydroxylated fatty acid product amounts obtained per different fatty acyl substrates. For example, by "oleate preferring" is meant that the hydroxylase activity of the enzyme preparation demonstrates a preference for oleate-containing substrates over other substrates. Although the precise substrate of the castor fatty acid hydroxylase is not known, it is thought to be a monounsaturated fatty acid moiety which is esterified to a phospholipid such as phosphatidylcholine. However, it is also possible that monounsaturated fatty acids esterified to phosphatidylethanolamine, phosphatidic acid or a neutral lipid such as diacylglycerol or a Coenzyme-A thioester may also be substrates. As noted above, significant activity has been observed in radioactive labelling studies using fatty acyl substrates other than oleate (Howling et al., 1972), indicating that the substrate specificity is for a family of related fatty acyl compounds. Because the castor hydroxylase introduces hydroxy groups three carbons from a double bond, proximal to the methyl carbon of the fatty acid, we term the enzyme a kappa hydroxylase for convenience. Of particular interest, we envision that the castor kappa hydroxylase may be used for production of 12-hydroxy-9-octadecenoic acid (ricinoleate), 12-hydroxy-9-hexadecenoic acid, 14-hydroxy-11-eicosenoic acid, 16-hydroxy-13-docosenoic acid, and 9-hydroxy-6-octadecenoic acid by expression in plant species which produce the non-hydroxylated precursors. We also envision production of additionally modified fatty acids such as 12-hydroxy-9,15-octadecadienoic acid that result from desaturation of hydroxylated fatty acids (e.g., 12-hydroxy-9-octadecenoic acid in this example).
We also envision that future advances in the genetic engineering of plants will lead to production of substrate fatty acids, such as icosenoic acid esters, and palmitoleic acid esters in plants that do not normally accumulate such fatty acids. We envision that the invention described herein may be used in conjunction with such future improvements to produce hydroxylated fatty acids of this invention in any plant species that is amenable to directed genetic modification. Thus, the applicability of this invention is not limited in our conception only to those species that currently accumulate suitable substrates.
As noted above, a plant kappa hydroxylase of this invention will display activity towards various fatty acyl substrates. During biosynthesis of lipids in a plant cell, fatty acids are typically covalently bound to acyl carrier protein (ACP), coenzyme A (CoA) or various cellular lipids. Plant kappa hydroxylases which display preferential activity toward lipid-linked acyl substrate are especially preferred because they are likely to be closely associated with the normal pathway of storage lipid synthesis in immature embryos. However, activity toward acyl-CoA substrates or other synthetic substrates, for example, is also contemplated herein.
Other plant kappa hydroxylases are obtainable from the specific exemplified sequences provided herein. Furthermore, it will be apparent that one can obtain natural and synthetic plant kappa hydroxylases including modified amino acid sequences and starting materials for synthetic-protein modeling from the exemplified plant kappa hydroxylase and from plant kappa hydroxylases which are obtained through the use of such exemplified sequences. Modified amino acid sequences include sequences which have been mutated, truncated, increased and the like, whether such sequences were partially or wholly synthesized. Sequences which are actually purified from plant preparations or are identical or encode identical proteins thereto, regardless of the method used to obtain the protein or sequence, are equally considered naturally derived. Thus, one skilled in the art will readily recognize that antibody preparations, nucleic acid probes (DNA and RNA) and the like may be prepared and used to screen and recover "homologous" or "related" kappa hydroxylases from a variety of plant sources. Typically, nucleic acid probes are labeled to allow detection, preferably with radioactivity although enzymes or other methods may also be used. For immunological screening methods, antibody preparations either monoclonal or polyclonal are utilized. Polyclonal antibodies, although less specific, typically are more useful in gene isolation. For detection, the antibody is labeled using radioactivity or any one of a variety of second antibody/enzyme conjugate systems that are commercially available. Homologous sequences are found when there is an identity of sequence and may be determined upon comparison of sequence information, nucleic acid or amino acid, or through hybridization reactions between a known kappa hydroxylase and a candidate source. Conservative changes, such as Glu/Asp, Val/Ile, Ser/Thr, Arg/Lys and Gln/Asn may also be considered in determining sequence homology. 
Typically, a lengthy nucleic acid sequence may show as little as 50-60% sequence identity, and more preferably at least about 70% sequence identity, between the target sequence and the given plant kappa hydroxylase of interest excluding any deletions which may be present, and still be considered related. Amino acid sequences are considered homologous by as little as 25% sequence identity between the two complete mature proteins. (See generally, Doolittle, R.F., Of URFs and ORFs, University Science Books, CA, 1986.)
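The comparison described above can be sketched in Python for two sequences that have already been aligned. This is a minimal illustration, not the method used to produce the alignments in this application: the function name, the use of `-` as the gap character, and the example sequences are assumptions. Positions with a deletion in either sequence are excluded, and the conservative pairs are the five named above (Glu/Asp, Val/Ile, Ser/Thr, Arg/Lys, Gln/Asn).

```python
from typing import Tuple

# Conservative amino acid pairs named in the text, in one-letter code.
CONSERVATIVE_PAIRS = {
    frozenset(pair)
    for pair in (("E", "D"), ("V", "I"), ("S", "T"), ("R", "K"), ("Q", "N"))
}


def identity_and_similarity(
    aligned_a: str, aligned_b: str, gap: str = "-"
) -> Tuple[float, float]:
    """Return (percent identity, percent similarity) for two aligned sequences.

    Columns in which either sequence has a gap are skipped, so deletions do not
    count against the score. Similarity counts identical residues plus the
    conservative pairs listed in CONSERVATIVE_PAIRS.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = identical = similar = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            identical += 1
            similar += 1
        elif frozenset((a, b)) in CONSERVATIVE_PAIRS:
            similar += 1
    if compared == 0:
        raise ValueError("no ungapped positions to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared


if __name__ == "__main__":
    # Hypothetical aligned peptide fragments, for illustration only.
    identity, similarity = identity_and_similarity("MEVSARQ-L", "MDITGKNWL")
    print(f"identity {identity:.1f}%, similarity {similarity:.1f}%")
```

In practice the alignment itself would come from a dedicated alignment program; the sketch only covers the scoring step.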
A genomic or other appropriate library prepared from the candidate plant source of interest may be probed with conserved sequences from the plant kappa hydroxylase to identify homologously related sequences. Use of an entire cDNA or other sequence may be employed if shorter probe sequences are not identified. Positive clones are then analyzed by restriction enzyme digestion and/or sequencing. When a genomic library is used, one or more sequences may be identified providing both the coding region, as well as the transcriptional regulatory elements of the kappa hydroxylase gene from such plant source. Probes can also be considerably shorter than the entire sequence. Oligonucleotides may be used, for example, but should be at least about 10, preferably at least about 15, more preferably at least 20 nucleotides in length. When shorter length regions are used for comparison, a higher degree of sequence identity is required than for longer sequences. Shorter probes are often particularly useful for polymerase chain reactions (PCR), especially when highly conserved sequences can be identified (see Gould et al., 1989 for examples of the use of PCR to isolate homologous genes from taxonomically diverse species). When longer nucleic acid fragments are employed (>100 bp) as probes, especially when using complete or large cDNA sequences, one would screen with low stringencies (for example, 40-50°C below the melting temperature of the probe) in order to obtain signal from the target sample with 20-50% deviation, i.e., homologous sequences (Beltz et al., 1983). In a preferred embodiment, a plant kappa hydroxylase of this invention will have at least 60% overall amino acid sequence similarity with the exemplified plant kappa hydroxylase. In particular, kappa hydroxylases which are obtainable from an amino acid or nucleic acid sequence of a castor or Lesquerella kappa hydroxylase are especially preferred.
The plant kappa hydroxylases may have preferential activity toward longer or shorter chain fatty acyl substrates. Plant fatty acyl hydroxylases having oleate-12-hydroxylase activity and eicosenoate-14-hydroxylase activity are both considered homologously related proteins because of in vitro evidence (Howling et al., 1972), and evidence disclosed herein, that the castor kappa hydroxylase will act on both substrates. Hydroxylated fatty acids may be subject to further enzymatic modification by other enzymes which are normally present or are introduced by genetic engineering methods. For example, 14-hydroxy-11,17-eicosadienoic acid, which is present in some Lesquerella species (Smith 1985), is thought to be produced by desaturation of 14-hydroxy-11-eicosenoic acid.
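The probe guidance above reduces to two small pieces of arithmetic, sketched here in Python for illustration. The function names and category labels are assumptions of this example; the melting temperature must be supplied by the user (measured or estimated separately), since no formula for it is given in this description.

```python
from typing import Tuple


def classify_oligonucleotide_length(length_nt: int) -> str:
    """Grade an oligonucleotide probe length against the thresholds in the text."""
    if length_nt < 10:
        return "too short"
    if length_nt < 15:
        return "acceptable"
    if length_nt < 20:
        return "preferred"
    return "more preferred"


def low_stringency_window(melting_temp_c: float) -> Tuple[float, float]:
    """Return the (lower, upper) screening temperatures in degrees Celsius.

    Applies the rule for long probes (>100 bp): screen 40-50 degrees below the
    melting temperature of the probe.
    """
    return melting_temp_c - 50.0, melting_temp_c - 40.0


if __name__ == "__main__":
    print(classify_oligonucleotide_length(20))
    # 85.0 is a hypothetical melting temperature chosen only for illustration.
    print(low_stringency_window(85.0))
```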
Again, not only can gene clones and materials derived therefrom be used to identify homologous plant fatty acyl hydroxylases, but the resulting sequences obtained therefrom may also provide a further method to obtain plant fatty acyl hydroxylases from other plant sources. In particular, PCR may be a useful technique to obtain related plant fatty acyl hydroxylases from sequence data provided herein. One skilled in the art will be able to design oligonucleotide probes based upon sequence comparisons or regions of typically highly conserved sequence. Of special interest are polymerase chain reaction primers based on the conserved regions of amino acid sequence between the castor kappa hydroxylase and the L. fendleri hydroxylase (SEQ ID NO:4). Details relating to the design and methods for a PCR reaction using these probes are described more fully in the examples.
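When primers are designed from a conserved stretch of amino acid sequence, one common consideration is how many different nucleotide sequences could encode that stretch (its degeneracy). The Python sketch below illustrates that calculation under stated assumptions: it uses the codon counts of the standard genetic code, the function names are hypothetical, and the peptide `AWMKYDLS` is an invented example, not a conserved region taken from the hydroxylase sequences of this application.

```python
from typing import Tuple

# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def degeneracy(peptide: str) -> int:
    """Count the distinct nucleotide sequences that could encode the peptide."""
    total = 1
    for residue in peptide:
        total *= CODONS_PER_RESIDUE[residue]
    return total


def least_degenerate_window(peptide: str, width: int) -> Tuple[int, str, int]:
    """Return (start index, sub-peptide, degeneracy) of the best primer window.

    The window with the lowest degeneracy is chosen; on a tie the first such
    window is returned. A window of `width` residues corresponds to a primer of
    3 * width nucleotides.
    """
    if not 0 < width <= len(peptide):
        raise ValueError("window width must be between 1 and the peptide length")
    candidates = [
        (degeneracy(peptide[i:i + width]), i, peptide[i:i + width])
        for i in range(len(peptide) - width + 1)
    ]
    best_degeneracy, start, window = min(candidates)
    return start, window, best_degeneracy


if __name__ == "__main__":
    # Invented peptide used purely to illustrate the calculation.
    print(least_degenerate_window("AWMKYDLS", 5))
```

Windows rich in Met and Trp, which have a single codon each, give the lowest degeneracy, which is why such residues are often favored when choosing primer sites.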
It should also be noted that the fatty acyl hydroxylases of a variety of sources can be used to investigate fatty acid hydroxylation events in a wide variety of plant and in vivo applications. Because all plants synthesize fatty acids via a common metabolic pathway, the study and/or application of one plant fatty acid hydroxylase to a heterologous plant host may be readily achieved in a variety of species.
Once the nucleic acid sequence is obtained, the transcription, or transcription and translation (expression), of the plant fatty acyl hydroxylases in a host cell is desired to produce a ready source of the enzyme and/or modify the composition of fatty acids found therein in the form of free fatty acids, esters (particularly esterified to glycerolipids or as components of wax esters), estolides, or ethers. Other useful applications may be found when the host cell is a plant host cell, in vitro and in vivo.
For example, by increasing the amount of a kappa hydroxylase available to the plant, an increased percentage of ricinoleate or lesqueroleate (14-hydroxy-11-eicosenoic acid) may be provided.
Kappa hydroxylase
By this invention, a mechanism for the biosynthesis of ricinoleic acid in plants is demonstrated. Namely, a specific plant kappa hydroxylase having preferential activity toward fatty acyl substrates is involved in the accumulation of hydroxylated fatty acids in at least some plant species. The use of the terms ricinoleate or ricinoleic acid (or lesqueroleate or lesquerolic acid, densipoleate etc.) is intended to include the free acids, the ACP and CoA esters, the salts of these acids, the glycerolipid esters (particularly the triacylglycerol esters), the wax esters, the estolides and the ether derivatives of these acids. The determination that plant fatty acyl hydroxylases are active in the in vivo production of hydroxylated fatty acids suggests several possibilities for plant enzyme sources. In fact, hydroxylated fatty acids are found in abundance in some natural plant species. For example, three hydroxy fatty acids related to ricinoleate occur in major amounts in seed oils from various Lesquerella species. Of particular interest, lesquerolic acid is a 20 carbon homolog of ricinoleate with two additional carbons at the carboxyl end of the chain (Smith 1985). Other natural plant sources of hydroxylated fatty acids include but are not limited to seeds of the Linum genus, seeds of Wrightia species, Lycopodium species, Strophanthus species, Convolvulaceae species, Calendula species and many others (van de Loo et al., 1993). Plants having significant presence of ricinoleate or lesqueroleate or desaturated or other modified derivatives of these fatty acids are preferred candidates to obtain naturally-derived kappa hydroxylases. For example, Lesquerella densipila contains a diunsaturated 18 carbon fatty acid with a hydroxyl group (van de Loo et al., 1993) that is thought to be produced by an enzyme that is closely related to the castor kappa hydroxylase, according to the theory on which this invention is based.
In addition, a comparison between kappa hydroxylases and between plant fatty acyl hydroxylases which introduce hydroxyl groups at positions other than the 12-carbon of oleate or the 14-carbon of lesqueroleate or on substrates other than oleic acid and icosenoic acid may yield insights for protein modeling or other modifications to create synthetic hydroxylases as discussed above. For example, on the basis of information gained from structural comparisons of the Δ12 desaturases and the kappa hydroxylase, we envision making genetic modifications in the structural genes for Δ12 desaturases that convert these desaturases to kappa hydroxylases. We also envision making changes in Δ15 desaturases that convert these to hydroxylases with comparable substrate specificity to the desaturases (e.g., conversion of 18:2Δ9,12 to 15OH-18:2Δ9,12). Since the difference between a hydroxylase and a desaturase concerns the disposition of one proton, we envision that by systematically changing the charged groups in the region of the enzyme near the active site, we can effect this change.
Especially of interest are fatty acyl hydroxylases which demonstrate activity toward fatty acyl substrates other than oleate, or which introduce the hydroxyl group at a location other than the C12 carbon. As described above, other plant sources may also provide sources for these enzymes through the use of protein purification, nucleic acid probes, antibody preparations, protein modeling, or sequence comparisons, for example, and of special interest are the respective amino acid and nucleic acid sequences corresponding to such plant fatty acyl hydroxylases. Also as previously described, once a nucleic acid sequence is obtained for the given plant hydroxylase, further plant sequences may be compared and/or probed to obtain homologously related DNA sequences thereto and so on.
Genetic Engineering Applications
As is well known in the art, once a cDNA clone encoding a plant kappa hydroxylase is obtained, it may be used to obtain its corresponding genomic nucleic acid sequences thereto. The nucleic acid sequences which encode plant kappa hydroxylases may be used in various constructs, for example, as probes to obtain further sequences from the same or other species. Alternatively, these sequences may be used in conjunction with appropriate regulatory sequences to increase levels of the respective hydroxylase of interest in a host cell for the production of hydroxylated fatty acids or study of the enzyme in vitro or in vivo or to decrease or increase levels of the respective hydroxylase of interest for some applications when the host cell is a plant entity, including plant cells, plant parts (including but not limited to seeds, cuttings or tissues) and plants.
A nucleic acid sequence encoding a plant kappa hydroxylase of this invention may include genomic, cDNA or mRNA sequence. By "encoding" is meant that the sequence corresponds to a particular amino acid sequence either in a sense or anti-sense orientation. By "recombinant" is meant that the sequence contains a genetically engineered modification through manipulation via mutagenesis, restriction enzymes, and the like. A cDNA sequence may or may not encode pre-processing sequences, such as transit or signal peptide sequences. Transit or signal peptide sequences facilitate the delivery of the protein to a given organelle and are frequently cleaved from the polypeptide upon entry into the organelle, releasing the "mature" sequence. The use of the precursor DNA sequence is preferred in plant cell expression cassettes. Furthermore, as discussed above the complete genomic sequence of the plant kappa hydroxylase may be obtained by the screening of a genomic library with a probe, such as a cDNA probe, and isolating those sequences which regulate expression in seed tissue. In this manner, the transcription and translation initiation regions, introns, and/or transcript termination regions of the plant kappa hydroxylase may be obtained for use in a variety of DNA constructs, with or without the kappa hydroxylase structural gene. Thus, nucleic acid sequences corresponding to the plant kappa hydroxylase of this invention may also provide signal sequences useful to direct transport into an organelle, 5' upstream non-coding regulatory regions (promoters) having useful tissue and timing profiles, and 3' downstream non-coding regulatory regions useful as transcriptional and translational regulatory regions, and may lend insight into other features of the gene.
Once the desired plant kappa hydroxylase nucleic acid sequence is obtained, it may be manipulated in a variety of ways. Where the sequence involves non-coding flanking regions, the flanking regions may be subjected to resection, mutagenesis, etc. Thus, transitions, transversions, deletions, and insertions may be performed on the naturally occurring sequence. In addition, all or part of the sequence may be synthesized. In the structural gene, one or more codons may be modified to provide for a modified amino acid sequence, or one or more codon mutations may be introduced to provide for a convenient restriction site or other purpose involved with construction or expression. The structural gene may be further modified by employing synthetic adapters, linkers to introduce one or more convenient restriction sites, or the like.
The nucleic acid or amino acid sequences encoding a plant kappa hydroxylase of this invention may be combined with other non-native, or "heterologous", sequences in a variety of ways. By "heterologous" sequences is meant any sequence which is not naturally found joined to the plant kappa hydroxylase, including, for example, combination of nucleic acid sequences from the same plant which are not naturally found joined together.
The DNA sequence encoding a plant kappa hydroxylase of this invention may be employed in conjunction with all or part of the gene sequences normally associated with the kappa hydroxylase. In its component parts, a DNA sequence encoding kappa hydroxylase is combined in a DNA construct having, in the 5' to 3' direction of transcription, a transcription initiation control region capable of promoting transcription and translation in a host cell, the DNA sequence encoding plant kappa hydroxylase and a transcription and translation termination region.
Potential host cells include both prokaryotic and eukaryotic cells. A host cell may be unicellular or found in a multicellular differentiated or undifferentiated organism depending upon the intended use. Cells of this invention may be distinguished by having a plant kappa hydroxylase foreign to the wild-type cell present therein, for example, by having a recombinant nucleic acid construct encoding a plant kappa hydroxylase therein.
Depending upon the host, the regulatory regions will vary, including regions from viral, plasmid or chromosomal genes, or the like. For expression in prokaryotic or eukaryotic microorganisms, particularly unicellular hosts, a wide variety of constitutive or regulatable promoters may be employed. Expression in a microorganism can provide a ready source of the plant enzyme. Among transcriptional initiation regions which have been described are regions from bacterial and yeast hosts, such as E. coli, B. subtilis, Saccharomyces cerevisiae, including genes such as beta-galactosidase, T7 polymerase, tryptophan E and the like.
For the most part, the constructs will involve regulatory regions functional in plants which provide for modified production of plant kappa hydroxylase with resulting modification of the fatty acid composition. The open reading frame, coding for the plant kappa hydroxylase or functional fragment thereof, will be joined at its 5' end to a transcription initiation regulatory region such as the wild-type sequence naturally found 5' upstream to the kappa hydroxylase structural gene. Numerous other transcription initiation regions are available which provide for a wide variety of constitutive or regulatable, e.g., inducible, transcription of the structural gene functions. Among transcriptional initiation regions used for plants are such regions associated with the structural genes such as for nopaline and mannopine synthases, or with napin, soybean β-conglycinin, oleosin, 12S storage protein, the cauliflower mosaic virus 35S promoters and the like. The transcription/translation initiation regions corresponding to such structural genes are found immediately 5' upstream to the respective start codons. In embodiments wherein the expression of the kappa hydroxylase protein is desired in a plant host, the use of all or part of the complete plant kappa hydroxylase gene is desired; namely all or part of the 5' upstream non-coding regions (promoter) together with the structural gene sequence and 3' downstream non-coding regions may be employed. If a different promoter is desired, such as a promoter native to the plant host of interest or a modified promoter, i.e., having transcription initiation regions derived from one gene source and translation initiation regions derived from a different gene source, including the sequence encoding the plant kappa hydroxylase of interest, or enhanced promoters, such as double 35S CaMV promoters, the sequences may be joined together using standard techniques.
For such applications when 5' upstream non-coding regions are obtained from other genes regulated during seed maturation, those preferentially expressed in plant embryo tissue, such as transcription initiation control regions from the B. napus napin gene, or the Arabidopsis 12S storage protein, or soybean β-conglycinin (Bray et al., 1987), or the L. fendleri kappa hydroxylase promoter described herein are desired. Transcription initiation regions which are preferentially expressed in seed tissue, i.e., which are undetectable in other plant parts, are considered desirable for fatty acid modifications in order to minimize any disruptive or adverse effects of the gene product.
Regulatory transcript termination regions may be provided in DNA constructs of this invention as well. Transcript termination regions may be provided by the DNA sequence encoding the plant kappa hydroxylase or a convenient transcription termination region derived from a different gene source, for example, the transcript termination region which is naturally associated with the transcript initiation region. Where the transcript termination region is from a different gene source, it will contain at least about 0.5 kb, preferably about 1-3 kb of sequence 3' to the structural gene from which the termination region is derived.
Plant expression or transcription constructs having a plant kappa hydroxylase as the DNA sequence of interest for increased or decreased expression thereof may be employed with a wide variety of plant life, particularly plant life involved in the production of vegetable oils for edible and industrial uses. Most especially preferred are temperate oilseed crops. Plants of interest include, but are not limited to, rapeseed (Canola and high erucic acid varieties), Crambe, Brassica juncea, Brassica nigra, meadowfoam, flax, sunflower, safflower, cotton, Cuphea, soybean, peanut, coconut and oil palms and corn. An important criterion in the selection of suitable plants for the introduction of the kappa hydroxylase is the presence in the host plant of a suitable substrate for the hydroxylase. Thus, for example, production of ricinoleic acid will be best accomplished in plants that normally have high levels of oleic acid in seed lipids. Similarly, production of lesquerolic acid will best be accomplished in plants that have high levels of icosenoic acid in seed lipids.
Depending on the method for introducing the recombinant constructs into the host cell, other DNA sequences may be required. Importantly, this invention is applicable to dicotyledonous and monocotyledonous species alike and will be readily applicable to new and/or improved transformation and regulation techniques. The method of transformation is not critical to the current invention; various methods of plant transformation are currently available. As newer methods become available to transform crops, they may be directly applied hereunder. For example, many plant species naturally susceptible to Agrobacterium infection may be successfully transformed via tripartite or binary vector methods of Agrobacterium mediated transformation. In addition, techniques of microinjection, DNA particle bombardment and electroporation have been developed which allow for the transformation of various monocot and dicot plant species.
In developing the DNA construct, the various components of the construct or fragments thereof will normally be inserted into a convenient cloning vector which is capable of replication in a bacterial host, e.g., E. coli. Numerous vectors exist that have been described in the literature. After each cloning, the plasmid may be isolated and subjected to further manipulation, such as restriction, insertion of new fragments, ligation, deletion, insertion, resection, etc., so as to tailor the components of the desired sequence. Once the construct has been completed, it may then be transferred to an appropriate vector for further manipulation in accordance with the manner of transformation of the host cell.
Normally, included with the DNA construct will be a structural gene having the necessary regulatory regions for expression in a host and providing for selection of transformant cells. The gene may provide for resistance to a cytotoxic agent, e.g., antibiotic, heavy metal, toxin, etc., complementation providing prototrophy to an auxotrophic host, viral immunity or the like. Depending upon the number of different host species into which the expression construct or components thereof are introduced, one or more markers may be employed, where different conditions for selection are used for the different hosts.
It is noted that the degeneracy of the DNA code provides that some codon substitutions are permissible in DNA sequences without any corresponding modification of the amino acid sequence.
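The point about codon degeneracy can be illustrated with a short Python sketch. It builds the standard genetic code table and shows that two different DNA sequences translate to the same peptide. The two nine-base sequences are invented for illustration and are not taken from the hydroxylase gene; the function name `translate` is likewise an assumption of this sketch.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence (reading frame 1) into one-letter amino acids.
    Trailing bases that do not form a complete codon are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    original = "ATGGCTCTG"    # hypothetical sequence: Met-Ala-Leu
    substituted = "ATGGCACTT"  # synonymous codons for Ala and Leu
    print(translate(original), translate(substituted))
```

Both sequences encode Met-Ala-Leu, although they differ at two nucleotide positions.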
As mentioned above, the manner in which the DNA construct is introduced into the plant host is not critical to this invention. Any method which provides for efficient transformation may be employed. Various methods for plant cell transformation include the use of Ti- or Ri-plasmids, microinjection, electroporation, infiltration, imbibition, DNA particle bombardment, liposome fusion, DNA bombardment or the like. In many instances, it will be desirable to have the construct bordered on one or both sides by T-DNA, particularly having the left and right borders, more particularly the right border. This is particularly useful when the construct uses A. tumefaciens or A. rhizogenes as a mode for transformation, although the T-DNA borders may find use with other modes of transformation.
Where Agrobacterium is used for plant cell transformation, a vector may be used which may be introduced into the Agrobacterium host for homologous recombination with T-DNA or the Ti- or Ri-plasmid present in the Agrobacterium host. The Ti- or Ri-plasmid containing the T-DNA for recombination may be armed (capable of causing gall formation) or disarmed (incapable of causing gall), the latter being permissible, so long as the vir genes are present in the transformed Agrobacterium host. The armed plasmid can give a mixture of normal plant cells and gall.
In some instances where Agrobacterium is used as the vehicle for transforming plant cells, the expression construct bordered by the T-DNA border(s) will be inserted into a broad host spectrum vector, there being broad host spectrum vectors described in the literature. Commonly used is pRK2 or derivatives thereof. See, for example, Ditta et al. (1980). Included with the expression construct and the T-DNA will be one or more markers, which allow for selection of transformed Agrobacterium and transformed plant cells. A number of markers have been developed for use with plant cells, such as resistance to kanamycin, the aminoglycoside G418, hygromycin, or the like. The particular marker employed is not essential to this invention, one or another marker being preferred depending on the particular host and the manner of construction.
For transformation of plant cells using Agrobacterium, explants may be combined and incubated with the transformed Agrobacterium for sufficient time for transformation, the bacteria killed, and the plant cells cultured in an appropriate selective medium. Once callus forms, shoot formation can be encouraged by employing the appropriate plant hormones in accordance with known methods and the shoots transferred to rooting medium for regeneration of plants. The plants may then be grown to seed and the seed used to establish repetitive generations and for isolation of vegetable oils.
The invention now being generally described, it will be more readily understood by reference to the following examples which are included for purposes of illustration only and are not intended to limit the present invention.
EXAMPLES
In the experimental disclosure which follows, all temperatures are given in degrees centigrade (°), weights are given in grams (g), milligrams (mg) or micrograms (μg), concentrations are given as molar (M), millimolar (mM) or micromolar (μM) and all volumes are given in liters (l), microliters (μl) or milliliters (ml), unless otherwise indicated.
EXAMPLE 1 - PRODUCTION OF NOVEL HYDROXYLATED FATTY ACIDS IN ARABIDOPSIS THALIANA
Overview
The kappa hydroxylase encoded by the previously described fah12 gene from Castor (U.S. Patent application 08/320,982) was used to produce ricinoleic acid, lesquerolic acid, densipolic acid and auricolic acid in transgenic Arabidopsis plants. This example reduces to practice the method taught in Example 2 of the foregoing application.
Production of transgenic plants
A variety of methods have been developed to insert a DNA sequence of interest into the genome of a plant host to obtain the transcription and translation of the sequence to effect phenotypic changes. The following methods represent only one of many equivalent means of producing transgenic plants and causing expression of the hydroxylase gene.
Arabidopsis plants were transformed, by Agrobacterium-mediated transformation, with the kappa hydroxylase encoded by the Castor fah12 gene on binary Ti plasmid pB6. This plasmid was previously used to transform Nicotiana tabacum for the production of ricinoleic acid (U.S. Patent application 08/320,982).
Inoculums of Agrobacterium tumefaciens strain GV3101 containing binary Ti plasmid pB6 were plated on L-broth plates containing 50 μg/ml kanamycin and incubated for 2 days at 30°C. Single colonies were used to inoculate large liquid cultures (L-broth medium with 50 mg/l rifampicin, 110 mg/l gentamycin and 200 mg/l kanamycin) to be used for the transformation of Arabidopsis plants.
Arabidopsis plants were transformed by the in planta transformation procedure essentially as described by Bechtold et al. (1993). Cells of A. tumefaciens GV3101(pB6) were harvested from liquid cultures by centrifugation, then resuspended in infiltration medium at OD600 = 0.8 (infiltration medium was Murashige and Skoog macro and micronutrient medium (Sigma Chemical Co., St. Louis, MO) containing 10 mg/l 6-benzylaminopurine and 5% glucose). Batches of 12-15 plants were grown for 3 to 4 weeks in natural light at a mean daily temperature of approximately 25°C in 3.5 inch pots containing soil. The intact plants were immersed in the bacterial suspension then transferred to a vacuum chamber and placed under 600 mm of vacuum produced by a laboratory vacuum pump until tissues appeared uniformly water-soaked (approximately 10 min). The plants were grown at 25°C under continuous light (100 μmol m⁻² s⁻¹ irradiation in the 400 to 700 nm range) for four weeks. The seeds obtained from all the plants in a pot were harvested as one batch. The seeds were sterilized by sequential treatment for 2 min with ethanol followed by 10 min in a mixture of household bleach (Clorox), water and Tween-80 (50%, 50%, 0.05%) then rinsed thoroughly with sterile water. The seeds were plated at high density (2000 to 4000 per plate) onto agar-solidified medium in 100 mm petri plates containing 1/2 X Murashige and Skoog salts medium enriched with B5 vitamins (Sigma Chemical Co., St. Louis, MO) and containing kanamycin at 50 mg/l. After incubation for 48 h at 4°C to stimulate germination, seedlings were grown for a period of seven days until transformants were clearly identifiable as healthy green seedlings against a background of chlorotic kanamycin-sensitive seedlings. The transformants were transferred to soil for two weeks before leaf tissue could be used for DNA and lipid analysis. More than 20 transformants were obtained.
DNA was extracted from young leaves from transformants to verify the presence of an intact fah12 gene. The presence of the transgene in a number of the putative transgenic lines was verified by using the polymerase chain reaction to amplify the insert from pB6. The primers used were HF2 = GCTCTTTTGTGCGCTCATTC (SEQ ID NO:12) and HR1 = CGGTACCAGAAAACGCCTTG (SEQ ID NO:13), which were designed to allow the amplification of a 700 bp fragment. Approximately 100 ng of genomic DNA was added to a solution containing 25 pmol of each primer, 1.5 U Taq polymerase (Boehringer Mannheim), 200 μM of dNTPs, 50 mM KCl, 10 mM Tris.Cl (pH 9), 0.1% (v/v) Triton X-100, 1.5 mM MgCl2, 3% (v/v) formamide, to a final volume of 50 μl. Amplification conditions were: 4 min denaturation step at 94°C, followed by 30 cycles of 92°C for 1 min, 55°C for 1 min, 72°C for 2 min. A final extension step closed the program at 72°C for 5 min. Transformants could be positively identified after visualization of a characteristic 1 kb amplified fragment on an ethidium bromide stained agarose gel. All transgenic lines tested gave a PCR product of a size consistent with the expected genotype, confirming that the lines were, indeed, transgenic. All further experiments were done with three representative transgenic lines of the wild type designated as 1-3, 4D, 7-4 and one transgenic line of the fad2 mutant line JB12. The transgenic JB12 line was included in order to test whether the increased accumulation of oleic acid in this mutant would have an effect on the amount of ricinoleic acid that accumulated in the transgenic plants.
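A few quantities implied by the PCR protocol above can be checked with a short Python sketch: primer length and GC content, the final primer concentration in the 50 μl reaction, and the nominal program duration. The helper names are hypothetical, and the duration ignores the thermocycler's ramp times, which the text does not give.

```python
HF2 = "GCTCTTTTGTGCGCTCATTC"  # SEQ ID NO:12, as given in the text
HR1 = "CGGTACCAGAAAACGCCTTG"  # SEQ ID NO:13, as given in the text


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a primer."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def primer_concentration_uM(pmol: float, volume_ul: float) -> float:
    """Final primer concentration; pmol per microliter is numerically equal to micromolar."""
    return pmol / volume_ul


def program_minutes(initial=4, cycles=30, steps=(1, 1, 2), final=5) -> int:
    """Nominal duration of the cycling program, excluding temperature ramps."""
    return initial + cycles * sum(steps) + final


if __name__ == "__main__":
    for name, primer in (("HF2", HF2), ("HR1", HR1)):
        print(name, len(primer), "nt", gc_percent(primer), "% GC")
    print(primer_concentration_uM(25, 50), "uM each primer")
    print(program_minutes(), "min nominal run time")
```

Under these assumptions each primer is present at 0.5 μM and the program runs for a nominal 129 minutes.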
Analysis of transgenic plants
Leaves and seeds from fah12 transgenic Arabidopsis plants were analyzed for the presence of hydroxylated fatty acids using gas chromatography. Lipids were extracted from 100-200 mg leaf tissue or 50 seeds. Fatty acid methyl esters (FAMES) were prepared by placing tissue in 1.5 ml of 1.0 M methanolic HCl (Supelco Co.) in a 13 x 100 mm glass screw-cap tube capped with a teflon-lined cap and heated to 80°C for 2 hours. Upon cooling, 1 ml petroleum ether was added and the FAMES removed by aspirating off the ether phase, which was then dried under a nitrogen stream in a glass tube. One hundred μl of N,O-bis(trimethylsilyl)trifluoroacetamide (BSTFA; Pierce Chemical Co) and 200 μl acetonitrile were added to derivatize the hydroxyl groups. The reaction was carried out at 70°C for 15 min. The products were dried under nitrogen, redissolved in 100 μl chloroform and transferred to a gas chromatograph vial. Two μl of each sample were analyzed on a SP2340 fused silica capillary column (30 m, 0.75 mm ID, 0.20 μm film, Supelco), using a Hewlett-Packard 5890 II series Gas Chromatograph. The samples were not split; the temperature program was 195°C for 18 min, increased to 230°C at 25°C/min, held at 230°C for 5 min, then returned to 195°C at 25°C/min; and flame ionization detectors were used.
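The oven temperature program just described can be expressed as a piecewise function of run time, which makes it easy to see the oven temperature at any retention time. The following Python sketch is an illustration only; the function and constant names are hypothetical, and it assumes ideal linear ramps with no instrument lag.

```python
T_LOW, T_HIGH = 195.0, 230.0      # degrees C
HOLD_LOW, HOLD_HIGH = 18.0, 5.0   # minutes
RAMP_RATE = 25.0                  # degrees C per minute, up and down
RAMP_TIME = (T_HIGH - T_LOW) / RAMP_RATE  # 1.4 min for each ramp


def oven_temperature(t_min: float) -> float:
    """Nominal oven temperature (degrees C) at t_min minutes after injection."""
    if t_min <= HOLD_LOW:
        return T_LOW                                   # initial isothermal hold
    if t_min <= HOLD_LOW + RAMP_TIME:
        return T_LOW + RAMP_RATE * (t_min - HOLD_LOW)  # ramp up
    if t_min <= HOLD_LOW + RAMP_TIME + HOLD_HIGH:
        return T_HIGH                                  # upper hold
    cool_start = HOLD_LOW + RAMP_TIME + HOLD_HIGH
    if t_min <= cool_start + RAMP_TIME:
        return T_HIGH - RAMP_RATE * (t_min - cool_start)  # ramp back down
    return T_LOW


def total_program_minutes() -> float:
    """Total program length: both holds plus both ramps."""
    return HOLD_LOW + HOLD_HIGH + 2 * RAMP_TIME


if __name__ == "__main__":
    for t in (10.0, 19.0, 21.0, 25.0):
        print(t, "min ->", round(oven_temperature(t), 1), "C")
    print("program length:", round(total_program_minutes(), 1), "min")
```

With these assumptions the whole program takes about 25.8 minutes, and anything eluting before 18 minutes does so under isothermal conditions at 195°C.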
The chromatographic elution time of methyl esters and O-TMS derivatives of ricinoleic acid, lesquerolic acid and auricolic acid was established by GC-MS of lipid samples from seeds of L. fendleri and comparison to published chromatograms of fatty acids from this species (Carlson et al., 1990). An O-TMS-methyl-ricinoleate standard was prepared from ricinoleic acid obtained from Sigma Chemical Co (St. Louis, MO). O-TMS-methyl-lesqueroleate and O-TMS-methyl-auricoleate standards were prepared from triacylglycerols purified from seeds of L. fendleri. The mass spectra of O-TMS-methyl-ricinoleate, O-TMS-methyl-densipoleate, O-TMS-methyl-lesqueroleate, and O-TMS-methyl-auricoleate are shown in Figures 1A-D, respectively. The structures of the characteristic ions produced during mass spectrometry of these derivatives are shown in Figure 2.
Lipids extracted from transgenic tissues were analyzed by gas chromatography and mass spectrometry for the presence of hydroxylated fatty acids. As a matter of reference, the average fatty acid composition of leaves in Arabidopsis wild type and fad2 mutant lines was reported by Miquel and Browse (1992). Gas chromatograms of methylated and silylated fatty acids from seeds of wild type and a fah12 transgenic wild type plant are shown in Figures 3A and 3B, respectively. The profiles are very similar except for the presence of three small but distinct peaks at 14.3, 15.9 and 18.9 minutes. A very small peak at 20.15 min was also evident. The elution time of the peaks at 14.3 and 18.9 min corresponded precisely to that of comparably prepared ricinoleic and lesquerolic standards, respectively. No significant differences were observed in lipid extracts from leaves or roots of the wild type and the fah12 transgenic wild type lines (Table 1). Thus, in spite of the fact that the fah12 gene is expressed throughout the plant, we observed effects on fatty acid composition only in seed tissue. A similar observation was described previously for transgenic fah12 tobacco in patent application No. 08/320,982.
Table 1: Fatty acid composition of lipids from transgenic and wild type Arabidopsis. The values are the means obtained from analysis of samples from three independent transgenic lines, or three independent samples of wild type and fad2 lines.

Fatty     Seed                                   Leaf              Root
acid      WT     FAH12/WT  FAH12/fad2  JB12      WT     FAH12/WT   WT     FAH12/WT
16:0      8.5    8.2       6.4         6.1       16.5   17.5       23.9   24.9
16:3      0      0         0           0         10.1   9.8        0      0
18:0      3.2    3.5       2.9         3.5       1.3    1.2        2.0    1.9
18:1      15.4   26.3      43.4        47.8      2.4    3.4        5.4    3.2
18:2      27.0   21.4      10.2        7.2       15.1   14.0       32.2   29.4
18:3      22.0   16.6      -           9.7       36.7   36.0       26.7   30.6
20:1      14.0   14.3      -           13.1      0      0          0      0
22:1      2.0    1.0       0.5         0.5       0      0          0      0
24:1      2.5    1.7       2.0         1.6       0      0          0      0
18:1-OH   0      0.4       0.3         0         0      0          0      0
18:2-OH   0      0.4       0.3         0         0      0          0      0
20:1-OH   0      0.2       0.1         0         0      0          0      0
20:2-OH   0      0.1       0.1         0         0      0          0      0
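As a small worked example, the hydroxylated fatty acid entries for seed lipids in Table 1 can be summed per transgenic line. The Python sketch below copies the four hydroxy fatty acid rows from the table; the dictionary and function names are hypothetical, and the values are in the table's own units (the table does not state the unit; percent of total fatty acids is presumed here).

```python
# Seed values for the hydroxylated fatty acids, copied from Table 1.
SEED_HYDROXY = {
    "FAH12/WT":   {"18:1-OH": 0.4, "18:2-OH": 0.4, "20:1-OH": 0.2, "20:2-OH": 0.1},
    "FAH12/fad2": {"18:1-OH": 0.3, "18:2-OH": 0.3, "20:1-OH": 0.1, "20:2-OH": 0.1},
}


def total_hydroxy(line: str) -> float:
    """Sum of the four hydroxylated fatty acids for one line, rounded to 0.1."""
    return round(sum(SEED_HYDROXY[line].values()), 1)


if __name__ == "__main__":
    for line in SEED_HYDROXY:
        print(line, total_hydroxy(line))
```

The totals come to 1.1 for FAH12/WT seeds and 0.8 for FAH12/fad2 seeds, against 0 in the untransformed controls.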
In order to confirm that the observed new peaks in the transgenic lines corresponded to derivatives of ricinoleic, lesquerolic, densipolic and auricolic acids, mass spectrometry was used. The fatty acid derivatives were resolved by gas chromatography as described above except that a Hewlett-Packard 5971 series mass selective detector was used in place of the flame ionization detector used in the previous experiment. The spectra of the four new peaks in Figure 3B (peak numbers 10, 11, 12 and 13) are shown in Figures 4A-D, respectively. Comparison of the spectra obtained for the standards with those obtained for the four peaks from the transgenic lines confirms the identity of the four new peaks. On the basis of the three characteristic peaks at M/Z 187, 270 and 299, peak 10 is unambiguously identified as O-TMS-methylricinoleate. On the basis of the three characteristic peaks at M/Z 185, 270 and 299, peak 11 is unambiguously identified as O-TMS-methyldensipoleate. On the basis of the three characteristic peaks at M/Z 187, 298 and 327, peak 12 is unambiguously identified as O-TMS-methyllesqueroleate. On the basis of the three characteristic peaks at M/Z 185, 298 and 327, peak 13 is unambiguously identified as O-TMS-methylauricoleate. These results unequivocally demonstrate the identity of the fah12 cDNA as encoding a hydroxylase that hydroxylates both oleic acid to produce ricinoleic acid and icosenoic acid to produce lesquerolic acid. These results also provide additional evidence that the hydroxylase can be functionally expressed in a heterologous plant species in such a way that the enzyme is catalytically functional. These results also demonstrate that expression of this hydroxylase gene leads to accumulation of ricinoleic, lesquerolic, densipolic and auricolic acids in a plant species that does not normally accumulate hydroxylated fatty acids in extractable lipids.
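The peak assignments above amount to a lookup from a set of three characteristic ions to a derivative. A minimal Python sketch of that lookup follows; the ion sets are those stated in the text, while the function name and the matching rule (all three characteristic ions must be present among the observed M/Z values) are assumptions of this illustration, not the authors' procedure.

```python
# Characteristic M/Z triplets as stated in the text.
CHARACTERISTIC_IONS = {
    frozenset({187, 270, 299}): "O-TMS-methylricinoleate",
    frozenset({185, 270, 299}): "O-TMS-methyldensipoleate",
    frozenset({187, 298, 327}): "O-TMS-methyllesqueroleate",
    frozenset({185, 298, 327}): "O-TMS-methylauricoleate",
}


def identify(observed_mz):
    """Return the derivative whose three characteristic ions all appear in
    observed_mz, or None if no triplet is fully present."""
    observed = set(observed_mz)
    for ions, name in CHARACTERISTIC_IONS.items():
        if ions <= observed:
            return name
    return None


if __name__ == "__main__":
    # The extra value 73 is an illustrative additional ion, not taken from the text.
    print(identify([73, 187, 270, 299]))
    print(identify([185, 298, 327]))
```

Note that the 18-carbon and 20-carbon derivatives differ by 28 mass units in two of the three ions (270/299 versus 298/327), which is consistent with two additional methylene groups, while 187 versus 185 distinguishes the monounsaturated from the diunsaturated members of each pair.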
The presence of lesquerolic acid in the transgenic plants was anticipated in the previous patent application (No. 08/320,982) based on the biochemical evidence suggesting broad substrate specificity of the kappa hydroxylase. By contrast, the accumulation of densipolic and auricolic acids was less predictable. Since Arabidopsis does not normally contain significant quantities of the non-hydroxylated precursors of these fatty acids which could serve as substrates for the hydroxylase, it appears that one or more of the three n-3 fatty acid desaturases known in Arabidopsis (e.g., fad3, fad7, fad8; reviewed in Gibson et al., 1995) are capable of desaturating the hydroxylated compounds at the n-3 position. That is, densipolic acid is produced by the action of an n-3 desaturase on ricinoleic acid. Auricolic acid is produced by the action of an n-3 desaturase on lesquerolic acid. Because it is located in the endoplasmic reticulum, the fad3 desaturase is almost certainly responsible. This can be tested in the future by producing fah12-containing transgenic plants of the fad3-deficient mutant of Arabidopsis (similar experiments can be done with fad7 and fad8). It is also formally possible that the enzymes that normally elongate 18:1cisΔ9 to 20:1cisΔ11 may elongate 12OH-18:1cisΔ9 to 14OH-20:1cisΔ11 and 12OH-18:2cisΔ9,15 to 14OH-20:2cisΔ11,17. The amount of the various fatty acids in seed, leaf and root lipids of the control and transgenic plants is presented in Table 1. Although the amount of hydroxylated fatty acids produced in this example is less than desired for commercial production of ricinoleate and other hydroxylated fatty acids from plants, we envision numerous improvements of this invention that will increase the level of accumulation of hydroxylated fatty acids in plants that express the fah12 or related hydroxylase genes. Improvements in the level and tissue specificity of expression of the hydroxylase gene are envisioned.
Methods to accomplish this by the use of strong, seed-specific promoters such as the B . napus napin promoter or the native promoters of the castor fahl2 gene or the corresponding hydroxylase gene from L . fendleri will be obvious to one skilled in the art. Additional improvements are envisioned to involve modification of the enzymes which cleave hydroxylated fatty acids from phosphatidylcholine, reduction in the activities of enzymes which degrade hydroxylated fatty acids and replacement of acyltransferases which transfer hydroxylated fatty acids to the sn-1, sn-2 and sn-3 positions of glycerolipids. Although genes for these enzymes have not been described in the scientific literature, their utility in improving the level of production of hydroxylated fatty acids can be readily envisioned based on the results of biochemical investigations of ricinoleate synthesis.
Although Arabidopsis is not an economically important plant species, it is widely accepted by plant biologists as a model for higher plants. Therefore, the inclusion of this example is intended to demonstrate the general utility of the invention described here to the modification of oil composition in higher plants. One advantage of studying the expression of this novel gene in Arabidopsis is the existence in this system of a large body of knowledge on lipid metabolism, as well as the availability of a collection of mutants which can be used to provide useful information on the biochemistry of fatty acid hydroxylation in plant species. Another advantage is the ease of transposing any of the information obtained on metabolism of ricinoleate in Arabidopsis to closely related species such as the crop plants Brassica napus, Brassica juncea or Crambe abyssinica in order to mass produce ricinoleate, lesqueroleate or other hydroxylated fatty acids for industrial use. The kappa hydroxylase is useful for the production of ricinoleate or lesqueroleate in any plant species that accumulates significant levels of the precursors, oleic acid and icosenoic acid. Of particular interest are genetically modified varieties that accumulate high levels of oleic acid. Such varieties are currently available for sunflower and Canola. Production of lesquerolic acid and related hydroxy fatty acids can be achieved in species that accumulate high levels of icosenoic acid or other long chain monoenoic acids. Such plants may in the future be produced by genetic engineering of plants that do not normally make such precursors. Thus, we envision that the use of the kappa hydroxylase is of general utility.
EXAMPLE 2. ISOLATION OF LESQUERELLA KAPPA HYDROXYLASE GENOMIC CLONE
Overview
Regions of nucleotide sequence that were conserved in both the Castor kappa hydroxylase and the Arabidopsis fad2 Δ12 fatty acid desaturase were used to design oligonucleotide primers. These were used with genomic DNA from Lesquerella fendleri to amplify fragments of several homologous genes. These amplified fragments were then used as hybridization probes to identify full length genomic clones from a genomic library of L. fendleri. Hydroxylated fatty acids are specific to the seed tissue of Lesquerella sp., and are not found to any appreciable extent in vegetative tissues. One of the two genes identified by this method was expressed in both leaves and developing seeds and is therefore thought to correspond to the Δ12 fatty acid desaturase. The other gene was expressed at high levels in developing seeds but was not expressed or was expressed at very low levels in leaves and is the kappa hydroxylase from this species. The identity of this gene will be established by introducing the gene into transgenic Arabidopsis plants and showing that it causes the accumulation of ricinoleic acid, lesquerolic acid, densipolic acid and auricolic acid in seed lipids. The promoter of this gene is also of utility because it is able to direct expression of a gene specifically in developing seeds at a time when storage lipids are accumulating. This promoter is, therefore, of great utility for many applications in the genetic engineering of seeds, particularly in members of the Brassicaceae.
The various steps involved in this process are described in detail below. Unless otherwise indicated, routine methods for manipulating nucleic acids, bacteria and phage were as described by Sambrook et al. (1989) .
Isolation of a fragment of the Lesquerella kappa hydroxylase
Oligonucleotide primers for the amplification of the L. fendleri kappa hydroxylase were designed by choosing regions of high deduced amino acid sequence homology between the Castor kappa hydroxylase and the Arabidopsis Δ12 desaturase (fad2). Because most amino acids are encoded by several different codons, these oligonucleotides were designed to include all possible codons that could encode the corresponding amino acids. The sequences of these mixed oligonucleotides were:
Oligo 1: TAYWSNCAYMGNMGNCAYCA (SEQ ID NO:14)
Oligo 2: RTGRTGNGCNACRTGNGTRTC (SEQ ID NO:15)
(Where: Y = C+T; W = A+T; S = G+C; N = A+G+C+T; M = A+C; R = A+G)
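As an illustration of what a "mixed" oligonucleotide means in practice, the following minimal sketch counts how many distinct sequences each degenerate primer represents, using only the ambiguity codes defined in the key above. The function names are hypothetical and are not part of the original disclosure; the reverse complement of Oligo 2 is included only to show the sense-strand codons that the antisense primer targets.

```python
from itertools import product

# IUPAC ambiguity codes from the primer key, plus the four plain bases.
# K (G+T) is included only because it is the complement of M.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "Y": "CT", "W": "AT", "S": "GC", "M": "AC",
    "R": "AG", "K": "GT", "N": "ACGT",
}
COMPLEMENT = {
    "A": "T", "C": "G", "G": "C", "T": "A",
    "Y": "R", "R": "Y", "W": "W", "S": "S",
    "M": "K", "K": "M", "N": "N",
}

OLIGO_1 = "TAYWSNCAYMGNMGNCAYCA"   # SEQ ID NO:14
OLIGO_2 = "RTGRTGNGCNACRTGNGTRTC"  # SEQ ID NO:15


def degeneracy(primer):
    """Number of distinct sequences encoded by a degenerate primer."""
    count = 1
    for base in primer:
        count *= len(IUPAC[base])
    return count


def expand(primer):
    """List every concrete sequence in the pool (use on short primers only)."""
    return ["".join(p) for p in product(*(IUPAC[b] for b in primer))]


def reverse_complement(primer):
    """Reverse complement that preserves ambiguity codes."""
    return "".join(COMPLEMENT[b] for b in reversed(primer))


print(degeneracy(OLIGO_1))          # 8192
print(degeneracy(OLIGO_2))          # 1024
print(reverse_complement(OLIGO_2))  # GAYACNCAYGTNGCNCAYCAY
```

Read in triplets, Oligo 1 (TAY WSN CAY MGN MGN CAY CA) and the reverse complement of Oligo 2 (GAY ACN CAY GTN GCN CAY CAY) correspond to the conserved Tyr-Ser-His-Arg-Arg-His-His and Asp-Thr-His-Val-Ala-His-His motifs that appear in SEQ ID NO:4.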
These oligonucleotides were used to amplify a fragment of DNA from L. fendleri genomic DNA by the polymerase chain reaction (PCR) using the following conditions: Approximately 100 ng of genomic DNA was added to a solution containing 25 pmol of each primer, 1.5 U Taq polymerase (Boehringer Mannheim), 200 μM of dNTPs, 50 mM KCl, 10 mM Tris.Cl (pH 9), 0.1% (v/v) Triton X-100, 1.5 mM MgCl2, 3% (v/v) formamide, to a final volume of 50 μl. Amplification conditions were: a 4 min denaturation step at 94°C, followed by 30 cycles of 92°C for 1 min, 55°C for 1 min, 72°C for 2 min. A final extension step at 72°C for 5 min closed the program.
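The reaction set-up above can be checked with some simple arithmetic. The sketch below, with hypothetical function names, converts the primer amount to a final concentration and sums the programmed hold times; it ignores temperature ramp times, which depend on the thermal cycler and are not stated in the text.

```python
def primer_concentration_uM(pmol, volume_ul):
    """Final primer concentration; pmol per microlitre is numerically equal to micromolar."""
    return pmol / volume_ul


def program_minutes(initial=4, cycles=30, steps=(1, 1, 2), final=5):
    """Total programmed hold time in minutes (ramping between temperatures not included)."""
    return initial + cycles * sum(steps) + final


print(primer_concentration_uM(25, 50))  # 0.5 (uM of each primer pool)
print(program_minutes())                # 129 (minutes)
```

Because each primer is a degenerate pool, the concentration of any single sequence within the pool is much lower than 0.5 μM.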
PCR products of approximately 540 bp were observed following electrophoretic separation of the products of the PCR reaction in agarose gels. Two of these fragments were cloned into pBluescript (Stratagene) to give rise to plasmids pLesq2 and pLesq3. The sequence of the inserts in these two plasmids was determined by the chain termination method. The sequence of the insert in pLesq2 is presented as Figure 5 (SEQ ID NO:1) and the sequence of the insert in pLesq3 is presented as Figure 6 (SEQ ID NO:2). The high degree of sequence identity between the two clones indicated that they were both potential candidates to be either a Δ12 desaturase or a kappa hydroxylase.
Northern analysis
In L. fendleri, hydroxylated fatty acids are found in large amounts in seed oils but are not found in appreciable amounts in leaves. Therefore, an important criterion in discriminating between a fatty acyl desaturase and a kappa hydroxylase is that the kappa hydroxylase gene is expected to be expressed more highly in tissues which have high levels of hydroxylated fatty acids than in other tissues, whereas all plant tissues should contain mRNA for an ω6 fatty acyl desaturase since diunsaturated fatty acids are found in the lipids of all tissues in most or all plants. Therefore, it was of great interest to determine whether the gene corresponding to pLesq2 was expressed only in seeds or was also expressed in other tissues. This question was addressed by testing for hybridization of pLesq2 to RNA purified from developing seeds and from leaves. Total RNA was purified from developing seeds and young leaves of L. fendleri using an RNeasy RNA extraction kit (Qiagen), according to the manufacturer's instructions. RNA concentrations were quantified by UV spectrophotometry at λ=260 and 280 nm. In order to ensure even loading of the gel to be used for Northern blotting, RNA concentrations were further adjusted after recording fluorescence under UV light of RNA samples stained with ethidium bromide and run on a test denaturing gel.
Total RNA prepared as described above from leaves and developing seeds was electrophoresed through an agarose gel containing formaldehyde (Iba et al., 1993). An equal quantity (10 μg) of RNA was loaded in both lanes, and RNA standards (0.16-1.77 kb ladder, Gibco-BRL) were loaded in a third lane. Following electrophoresis, RNA was transferred from the gel to a nylon membrane (Hybond N+, Amersham) and fixed to the filter by exposure to UV light. A 32P-labelled probe was prepared from insert DNA of clone pLesq2 by random priming and hybridized to the membrane overnight at 52°C, after it had been prehybridized for 2 h. The prehybridization solution contained 5X SSC, 10X Denhardt's solution, 0.1% SDS, 0.1 M KPO4 pH 6.8, 100 μg/ml salmon sperm DNA. The hybridization solution had the same basic composition, but no SDS, and it contained 10% dextran sulfate and 30% formamide. The blot was washed once in 2X SSC, 0.5% SDS at 65°C then in 1X SSC at the same temperature.
Brief (30 min) exposure of the blot to X-ray film revealed that the probe pLesq2 hybridized to a single band only in the seed RNA lane (Figure 7) . The blot was re-probed with the insert from pLesq3 gene, which gave bands of similar intensity in the seed and leaf lanes (Figure 7) .
These results show that the gene corresponding to the clone pLesq2 is highly and specifically expressed in seed of L. fendleri. In conjunction with knowledge of the nucleotide and deduced amino acid sequence, strong seed-specific expression of the gene corresponding to the insert in pLesq2 is a convincing indicator of the role of the enzyme in synthesis of hydroxylated fatty acids in the seed oil.
Characterization of a genomic clone of the kappa hydroxylase
Genomic DNA was prepared from young leaves of L. fendleri as described by Murray and Thompson (1980). A Sau3AI-partial digest genomic library constructed in the vector λDashII (Stratagene, 11011 North Torrey Pines Road, La Jolla CA 92037) was prepared by partially digesting 500 μg of DNA, size-selecting the DNA on a sucrose gradient (Sambrook et al., 1989), and ligating the DNA (12 kb average size) to the BamHI-digested arms of λDashII. The entire ligation was packaged according to the manufacturer's conditions and plated on E. coli strain XL1-Blue MRA-P2 (Stratagene). This yielded 5 × 10^5 primary recombinant clones. The library was then amplified according to the manufacturer's conditions. A fraction of the genomic library was plated on E. coli XL1-Blue and resulting plaques (150,000) were lifted to charged nylon membranes (Hybond N+, Amersham), according to the manufacturer's conditions. DNA was crosslinked to the filters under UV in a Stratalinker (Stratagene).
Several clones carrying genomic sequences corresponding to the L. fendleri hydroxylase were isolated by probing the membranes with the insert from pLesq2 that was PCR-amplified with internal primers and labelled with 32P by random priming. The filters were prehybridized for 2 hours at 65°C in 7% SDS, 1 mM EDTA, 0.25 M Na2HPO4 (pH 7.2), 1% BSA and hybridized to the probe for 16 hours in the same solution. The filters were sequentially washed at 65°C in solutions containing 2X SSC, 1X SSC, 0.5X SSC in addition to 0.1% SDS. A 2.6 kb Xba I fragment containing the complete coding sequence for the kappa hydroxylase and approximately 1 kb of the 5' upstream region was subcloned into the corresponding site of pBluescript KS to produce plasmid pLesq-Hyd, and the sequence was determined completely using an automatic sequencer by the dideoxy chain termination method. Sequence data were analyzed using the program DNASIS (Hitachi company).
The sequence of the insert in clone pLesq-Hyd is shown in Figures 8A-B. The sequence entails 1855 bp of contiguous DNA sequence (SEQ ID NO:3). The clone encodes a 401 bp 5' untranslated region (i.e., nucleotides preceding the first ATG codon) , an 1152 bp open reading frame, and a 302 bp 3' untranslated region. The open reading frame encodes a 384 amino acid protein with a predicted molecular weight of 44,370 (SEQ ID NO:4) . The amino terminus lacks features of a typical signal peptide (von Heijne, 1985) .
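The segment lengths quoted above can be cross-checked against each other with a few lines of arithmetic. This is a minimal sketch with hypothetical variable names; the 110 Da average residue mass is a common rule of thumb and an assumption here, not a value from the text, so the mass estimate is only a rough sanity check against the stated 44,370.

```python
# Lengths stated in the text, in nucleotides.
FIVE_PRIME_REGION = 401
OPEN_READING_FRAME = 1152
THREE_PRIME_REGION = 302

total_length = FIVE_PRIME_REGION + OPEN_READING_FRAME + THREE_PRIME_REGION
orf_start = FIVE_PRIME_REGION + 1                 # first base of the initiating ATG
orf_end = FIVE_PRIME_REGION + OPEN_READING_FRAME  # last base of the final sense codon
codon_count = OPEN_READING_FRAME // 3

# Assumed average residue mass (rule of thumb), used only for a rough estimate.
AVERAGE_RESIDUE_MASS_DA = 110
approx_mass = codon_count * AVERAGE_RESIDUE_MASS_DA

print(total_length)   # 1855
print(orf_start)      # 402
print(orf_end)        # 1553
print(codon_count)    # 384
print(approx_mass)    # 42240
```

The totals agree with the 1855 bp of contiguous sequence, the ATG at nucleotide 402 and the 384 amino acid protein; on this reading the 1152 bp reading frame does not include the stop codon, which falls in the 302 bp 3' region.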
The exact translation-initiation methionine has not been experimentally determined, but on the basis of deduced amino acid sequence homology to the Castor kappa hydroxylase (noted below) is thought to be the methionine encoded by the first ATG codon at nucleotide 402.
Comparison of the pLesq-Hyd deduced amino acid sequence with sequences of membrane-bound desaturases and the castor hydroxylase (Figures 9A-B) indicates that pLesq-Hyd is homologous to these genes. This figure shows an alignment of the L. fendleri hydroxylase (SEQ ID NO:4) with the castor hydroxylase (van de Loo et al. 1995), the Arabidopsis fad2 cDNA which encodes an endoplasmic reticulum-localized Δ12 desaturase (called fad2) (Okuley et al., 1994), two soybean fad2 desaturase clones, a Brassica napus fad2 clone, a Zea mays fad2 clone and partial sequence of a R. communis fad2 clone. The high degree of sequence homology indicates that the gene products are of similar function. For instance, the overall homology between the Lesquerella hydroxylase and the Arabidopsis fad2 desaturase was 92.2% similarity and 84.8% identity and the two sequences differed in length by only one amino acid.
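Percent identity figures such as those quoted above are computed from a pairwise alignment. The sketch below shows one common way to do this, assuming the two sequences are already aligned with "-" as the gap character. The function name and the short test strings are hypothetical, and alignment programs differ in how they treat gapped columns, so the result need not match the values reported by the software used in the original work.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of ungapped alignment columns in which both residues are identical."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Toy aligned fragments for illustration only (not taken from Figure 9).
print(percent_identity("MGAG-R", "MGSGGR"))  # 80.0
```

Similarity, as opposed to identity, would additionally count columns containing different but functionally related residues, which requires a substitution grouping or scoring matrix.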
Southern hybridization
Southern analysis was used to examine the copy number of the genes in the L. fendleri genome corresponding to the clone pLesq-Hyd. Genomic DNA (5 μg) was digested with EcoR I, Hind III and Xba I and separated on a 0.9% agarose gel. DNA was alkali-blotted to a charged nylon membrane (Hybond N+, Amersham), according to the manufacturer's protocol. The blot was prehybridized for 2 hours at 65°C in 7% SDS, 1 mM EDTA, 0.25 M Na2HPO4 (pH 7.2), 1% BSA and hybridized to the probe for 16 hours in the same solution with pLesq-Hyd insert PCR-amplified with internal primers and labelled with 32P by random priming. The filters were sequentially washed at 65°C in solutions containing 2X SSC, 1X SSC, 0.5X SSC in addition to 0.1% SDS, then exposed to X-ray film.
The probe hybridized with a single band in each digest of L. fendleri DNA (Figure 10), indicating that the gene from which pLesq-Hyd was transcribed is present in a single copy in the L. fendleri genome.
Expression of pLesq-Hyd in Transgenic Plants
There is a wide variety of plant promoter sequences which may be used to cause tissue-specific expression of cloned genes in transgenic plants. For instance, the napin promoter and the acyl carrier protein promoters have previously been used in the modification of seed oil composition by expression of an antisense form of a desaturase (Knutson et al. 1992). Similarly, the promoter for the β-subunit of soybean β-conglycinin has been shown to be highly active and to result in tissue-specific expression in transgenic plants of species other than soybean (Bray et al., 1987). Thus, although we describe the use of the L. fendleri kappa hydroxylase promoter in the examples described here, other promoters which lead to seed-specific expression may also be employed for the production of modified seed oil composition. Such modifications of the invention described here will be obvious to one skilled in the art.
Constructs for expression of L. fendleri kappa hydroxylase in plant cells are prepared as follows: A 13 kb SalI fragment containing the pLesq-Hyg gene was ligated into the XhoI site of binary Ti plasmid vector pSLJ44026 (Jones et al., 1992) (Figure 11) to produce plasmid pTi-Hyd and transformed into Agrobacterium tumefaciens strain GV3101 by electroporation. Strain GV3101 (Koncz and Schell, 1986) contains a disarmed Ti plasmid. Cells for electroporation were prepared as follows. GV3101 was grown in LB medium with reduced NaCl (5 g/l). A 250 ml culture was grown to OD600 = 0.6, then centrifuged at 4000 rpm (Sorvall GS-A rotor) for 15 min. The supernatant was aspirated immediately from the loose pellet, which was gently resuspended in 500 ml ice-cold water. The cells were centrifuged as before, resuspended in 30 ml ice-cold water, transferred to a 30 ml tube and centrifuged at 5000 rpm (Sorvall SS-34 rotor) for 5 min. This was repeated three times, resuspending the cells consecutively in 30 ml ice-cold water, 30 ml ice-cold 10% glycerol, and finally in 0.75 ml ice-cold 10% glycerol. These cells were aliquoted, frozen in liquid nitrogen, and stored at -80°C.
Electroporations employed a Bio-Rad Gene Pulser instrument using cold 2 mm-gap cuvettes containing 40 μl cells and 1 μl of DNA in water, at a voltage of 2.5 kV and 200 Ohms resistance. The electroporated cells were diluted with 1 ml SOC medium (Sambrook et al., 1989, page A2) and incubated at 28°C for 2-4 h before plating on medium containing kanamycin (50 mg/l).
Arabidopsis thaliana can be transformed with the Agrobacterium cells containing pTi-Hyd as described in Example 1 above. Similarly, the presence of hydroxylated fatty acids in the transgenic Arabidopsis plants can be demonstrated by the methods described in Example 1 above.
Constitutive expression of the L. fendleri hydroxylase in transgenic plants
A 1.5 kb EcoR I fragment from pLesq-Hyg comprising the entire coding region of the hydroxylase was gel purified, then cloned into the corresponding site of pBluescript KS (Stratagene). Plasmid DNA from a number of recombinant clones was then restricted with Pst I, which should cut only once in the insert and once in the vector polylinker sequence. Release of a 920 bp fragment with Pst I indicated the right orientation of the insert for further manipulations. DNA from one such clone was further restricted with Sal I, the 5' overhangs filled in with the Klenow fragment of DNA polymerase I, then cut with Sac I. The insert fragment was gel purified and cloned between the Sma I and Sac I sites of pBI121 (Clontech) behind the Cauliflower Mosaic Virus 35S promoter. After checking that the sequence of the junction between insert and vector DNA was appropriate, plasmid DNA from a recombinant clone was used to transform A. tumefaciens (GV3101). Kanamycin resistant colonies were then used for in planta transformation of A. thaliana as previously described. DNA was extracted from kanamycin resistant seedlings and used to PCR-amplify selected fragments from the hydroxylase using nested primers. When fragments of the expected size could be amplified, corresponding plants were grown in the greenhouse or on agar plates, and fatty acids were extracted from fully expanded leaves, roots and dry seeds. GC-MS analysis was then performed as previously described to characterize the different fatty acid species and detect accumulation of hydroxy fatty acids in transgenic tissues.
EXAMPLE 3 - OBTAINING OTHER PLANT FATTY ACYL HYDROXYLASES
In a previous patent application, we described the ways in which the castor fah12 sequence could be used to identify other kappa hydroxylases by methods such as PCR and heterologous hybridization. However, because of the high degree of sequence similarity between Δ12 desaturases and kappa hydroxylases, prior art does not teach how to distinguish between the two kinds of enzymes without a functional test such as demonstrating activity in transgenic plants or another suitable host (e.g., transgenic microbial or animal cells). The identification of the L. fendleri hydroxylase provided for the development of criteria by which a hydroxylase and a desaturase may be distinguished solely on the basis of deduced amino acid sequence information.
Figures 9A-B show a sequence alignment of the castor and L. fendleri hydroxylase sequences with all publicly available sequences for plant microsomal Δ12 fatty acid desaturases. Of the 384 amino acid residues in the castor hydroxylase sequence, more than 95% are identical to the corresponding residue in at least one of the desaturase sequences. Therefore, none of these residues is responsible for the catalytic differences between the hydroxylase and the desaturases. Of the remaining 16 residues in the castor hydroxylase and 14 residues in the Lesquerella hydroxylase, all but six represent instances where the hydroxylase sequence has a conservative substitution compared with one or more of the desaturase sequences, or there is wide variability in the amino acid at that position in the various desaturases. By conservative, we mean that the following amino acids are functionally equivalent: Ser/Thr, Ile/Leu/Val/Met, Asp/Glu. Thus, these structural differences also cannot account for the catalytic differences between the desaturases and hydroxylases. This leaves just six amino acid residues where both the castor hydroxylase and the Lesquerella hydroxylase differ from all of the known desaturases and where all of the known microsomal Δ12 desaturases have the identical amino acid residue. These residues occur at positions 69, 111, 155, 226, 304 and 331 of the alignment in Figure 9. Therefore, these six sites distinguish hydroxylases from desaturases. Based on this analysis, we claim that any enzyme with greater than 60% sequence identity to one of the enzymes listed in Figure 9 can be classified as a hydroxylase if it differs from the sequence of the desaturases at these six positions.
Because of slight differences in the number of residues in a particular protein, the numbering may vary from protein to protein but the intent of the number system will be evident if the protein in question is aligned with the castor hydroxylase using the numbering system shown herein. Thus, in conjunction with the methods for using the Lesquerella hydroxylase gene to isolate homologous genes, the structural criterion disclosed here teaches how to isolate and identify plant kappa hydroxylase genes for the purpose of genetically modifying fatty acid composition.
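The structural criterion described above can be expressed as a simple rule. The following is a minimal sketch, not a validated classifier: the function name is hypothetical, and the residue letters in the example dictionaries are made-up placeholders, because the actual residues at the six positions must be read from the alignment in Figure 9. Inputs are assumed to be keyed by the alignment numbering of that figure.

```python
# Alignment positions named in the text as distinguishing hydroxylases from desaturases.
DIAGNOSTIC_POSITIONS = (69, 111, 155, 226, 304, 331)


def meets_hydroxylase_criterion(candidate, desaturase_consensus, identity_percent):
    """Apply the two-part rule: >60% identity and a difference at all six positions.

    candidate and desaturase_consensus map alignment position -> one-letter residue.
    """
    if identity_percent <= 60.0:
        return False
    return all(
        candidate[pos] != desaturase_consensus[pos]
        for pos in DIAGNOSTIC_POSITIONS
    )


# Placeholder residues for illustration only; they are NOT taken from Figure 9.
consensus = {69: "A", 111: "T", 155: "G", 226: "Y", 304: "M", 331: "S"}
candidate_1 = {69: "V", 111: "I", 155: "N", 226: "F", 304: "I", 331: "A"}
candidate_2 = dict(consensus)  # identical to the desaturases at every site

print(meets_hydroxylase_criterion(candidate_1, consensus, 85.0))  # True
print(meets_hydroxylase_criterion(candidate_2, consensus, 85.0))  # False
```

A candidate that differs at only some of the six positions is rejected by this rule as written; whether such intermediate cases are hydroxylases is a question the text leaves to functional testing.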
In considering which of the six substitutions are solely or primarily responsible for the difference in catalytic activity of the hydroxylases of this invention and the desaturases, we consider it likely that the substitution of a Phe for a Tyr at position 226 may be solely responsible for this difference in catalytic activity because of the known participation of tyrosine radicals in enzyme catalysis. Other substitutions, such as the Ala for Ser at position 331 may have effects at modulating the overall rate of the reaction.
On this basis we envision creating novel kappa hydroxylases by site directed mutagenesis of Δ12 desaturases. We also envision converting Δl5 desaturases and Δ9 desaturases to hydroxylases by similar use of site-directed mutagenesis.
CONCLUDING REMARKS
By the above examples, demonstration of critical factors in the production of novel hydroxylated fatty acids by expression of a kappa hydroxylase gene from Castor in transgenic plants is described. In addition, a complete cDNA sequence of the Lesquerella fendleri kappa hydroxylase is also provided. A full sequence of the castor hydroxylase is also given with various constructs for use in host cells. Through this invention, one can obtain the amino acid and nucleic acid sequences which encode plant fatty acyl hydroxylases from a variety of sources and for a variety of applications.
Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be obvious that certain changes and modifications may be practiced within the scope of the appended claims.
REFERENCES:
Beltz, G.A., Jacobs, K.A., Eickbush, T.H., Cherbas, P.T., Kafatos, F.C. (1983) Isolation of multigene families and determination of homologies by filter hybridization methods. Methods in Enzymology 100, 266-285.
Bray, E.A., Naito, S., Pan, N.S., Anderson, E., Dube, P., Beachy, R.N. (1987) Expression of the β-subunit of β-conglycinin in seeds of transgenic plants. Planta 172, 364-370.
Carlson, K.D., Chaudhry, A., Bagby, M.O. (1990) Analysis of oil and meal from Lesquerella fendleri seed. J. Am. Oil Chem. Soc. 67, 438-442.
Ditta, G., Stanfield, S., Corbin, D. , Helinski, D.R. (1980) Broad host range DNA cloning system for gram-negative bacteria: Construction of a gene bank of Rhizobium meliloti. Proc. Natl. Acad. Sci. USA 77,7347-7351.
Gould, S.J., Subramani, S., Scheffler, I.E. (1989) Use of the DNA polymerase chain reaction for homology probing. Proc. Natl. Acad. Sci. USA 86, 1934-1938.
Hirsinger, F. (1989) New oil crops, in Oil Crops of the World, Robbelen, G., Downey, K.R., and Ashri, A., Eds., McGraw-Hill, New York, pp. 518-533.
Howling, D., Morris, L.J., Gurr, M.I., James, A.T. (1972) The specificity of fatty acid desaturases and hydroxylases. The dehydrogenation and hydroxylation of monoenoic acids. Biochim. Biophys. Acta 260, 10.
Huynh, T.V., Young, R.A., Davis, R.W. (1985) Constructing and screening cDNA libraries in λgt10 and λgt11. In DNA Cloning, Vol. 1: A Practical Approach, (ed) D.M. Glover. IRL Press, Washington DC, pp. 49-77.
Iba, K. , Gibson, S., Nishiuchi, T. , Fuse, T. , Nishimura, M. , Arondel, V., Hugly, S., and Somerville, C. (1993) A gene encoding a chloroplast omega-3 fatty acid desaturase complements alterations in fatty acid desaturation and chloroplast copy number of the fad7 mutant of Arabidopsis thaliana . J. Biol. Chem. 268, 24099-24105.
Jones, J.D.G., Shlumukov, L. , Carland, F. , English, J. , Scofield, S., Bishop, G.J., Harrison, K. (1992) Effective vectors for transformation, expression of heterologous genes, and assaying transposon excision in transgenic plants. Transgenic Res. 1,285-297.
Knutson, D.S., Thompson, G.A. , Radke, S.E., Johnson, W.B., Knauf, V.C., Kridl, J.C. (1992) Proc. Natl. Acad. Sci. USA 89, 2624-2628.
Koncz, C, Schell, J. (1986) The promoter of TL-DNA gene 5 controls the tissue-specific expression of chimeric genes carried by a novel type of Agrobacterium binary vector. Mol. Gen. Genet. 204, 383-396.
Miquel, M. Browse, J. (1992) Arabidopsis mutants deficient in polyunsaturated fatty acid synthesis. J. Biol. Chem. 267, 1502-1509.
Murray, M.G., Thompson, W.F. (1980) Rapid isolation of high molecular weight plant DNA. Nucl. Acid Res. 8,4321-4325.
Okuley, J., Lightner, J., Feldman, K., Yadav, N., Lark, E., Browse, J. (1994) Arabidopsis FAD2 gene encodes the enzyme that is essential for polyunsaturated lipid
Sambrook, J. , Fritsch, E.F., and Maniatis, T. , Molecular Cloning: a Laboratory Manual , 2nd ed. , Cold Spring Harbor Laboratory Press, 1989.
Smith C.R., Jr. (1985) Unusual seed oils and their fatty acids, in Fatty Acids , Pryde E.H., Ed., American Oil Chemists' Society, Champaign, Second edition, pp 29-47.
van de Loo, F.J., Fox, B.G., Somerville, C. (1993) Unusual fatty acids, in Lipid Metabolism in Plants, T.S. Moore Jr., Ed., CRC Press, Boca Raton, pp. 91-126.
van de Loo, F.J., Turner, S., Somerville, C.R. (1995) An oleate 12-hydroxylase from castor (Ricinus communis L.) is a fatty acyl desaturase homolog. Proc. Natl. Acad. Sci. USA 92, 6743-6747.
von Heijne, G. (1985) Signal sequences. J. Mol. Biol. 184, 99-105.
Atsmon, D. (1989) Castor, in Oil Crops of the World, Robbelen, G., Downey, K.R., and Ashri, A., Eds., McGraw-Hill, New York, pp. 438-447.
Bechtold, N., Ellis, J. and Pelletier, G. (1993) In planta Agrobacterium mediated gene transfer by infiltration of adult Arabidopsis thaliana plants. C.R. Acad. Sci. Paris 316, 1194-1199.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Somerville, Chris Broun, Pierre
(ii) TITLE OF INVENTION: PRODUCTION OF HYDROXYLATED FATTY ACIDS IN GENETICALLY MODIFIED PLANTS
(iii) NUMBER OF SEQUENCES: 15
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: CUSHMAN DARBY & CUSHMAN, LLP
(B) STREET: 1100 NEW YORK AVENUE, NW
(C) CITY: WASHINGTON
(D) STATE: DC
(E) COUNTRY: USA
(F) ZIP: 20005-3918
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: PCT/US95/11855
(B) FILING DATE: 25-SEP-1995
(C) CLASSIFICATION:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Kokulis, Paul N.
(B) REGISTRATION NUMBER: 16,773
(C) REFERENCE/DOCKET NUMBER: 1220/213781
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 202-861-3000
(B) TELEFAX: 202-822-0944
(2) INFORMATION FOR SEQ ID NO:1
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 543 nucleotides
(B) TYPE: nucleotide
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
TATTGGCACC GGCGGCACCA TTCCAACAAT GGATCCCTAG AAAAAGATGA AGTCTTTGTC 60
CCACCTAAGA AAGCTGCAGT CANATGGTAT GTCAAATACC TCAACAACCC TCTTGGACGC 120
ATTCTGGTGT TAACAGTTCA GTTTATCCTC GGGTGGCCTT TGTATCTAGC CTTTAATGTA 180
TCAGGTAGAC CTTATGATGG TTTCGCTTCA CATTTCTTCC CTCATGCACC TATCTTTAAG 240
GACCGTGAAC GTCTCCAGAT ATACATCTCA GATGCTGGTA TTCTAGCTGT CTGTTATGGT 300
CTTTACCGTT ACGCTGCTTC ACAAGGATTG ACTGCTATGA TCTGCGTCTA CGGAGTACCG 360
CTTTTGATAG TGAACTTTTT CCTTGTCTTG GTCACTTTCT TGCAGCACAC TCATCCTTCA 420
TTACCTCACT ATGATTCAAC CGAGTGGGAA TGGATTAGAG GAGCTTTGGT TACGGTAGAC 480
AGAGACTATG GAATCTTGAA CAAGGTGTTT CACAACATAA CAGACACCCA CGTAGCACAC 540
CAC 543
(2) INFORMATION FOR SEQ ID NO:2
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 544 nucleotides
(B) TYPE: nucleotide
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
TATAGGCACC GGAGGCACCA TTCCAACACA GGATCCCTCG AAAGAGATGA AGTATTTGTC 60
CCAAAGCAGA AATCCGCAAT CAAGTGGTAC GGCGAATACC TCAACAACCC TCCTGGTCGC 120
ATCATGATGT TAACTGTCCA GTTCGTCCTC GGATGGCCCT TGTACTTAGC CTTCAACGTT 180
TCTGGCAGAC CCTACAATGG TTTCGCTTCC CATTTCTTCC CCAATGCTCC TATCTACAAC 240
GACCGTGAAC GCCTCCAGAT TTACATCTCT GATGCTGGTA TTCTAGCCGT CTGTTATGGT 300
CTTTACCGTT ACGCTGTTGC ACAAGGACTA GCCTCAATGA TCTGTCTAAA CGGAGTTCCG 360
CTTCTGATAG TTAACTTTTT CCTCGTCTTG ATCACTTACT TACAACACAC TCACCCTGCG 420
TTGCCTCACT ATGATTCATC AGAGTGGGAT TGGCTTAGAG GAGCTTTAGC TACTGTAGAC 480
AGAGACTATG GAATCTTGAA CAAGGTGTTC CATAACATCA CAGACACCCA CGTCGCACAC 540
CACT 544
(2) INFORMATION FOR SEQ ID NO:3
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1855 nucleotides
(B) TYPE: nucleotide
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: genomic
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
ATGAAGCTTT ATAAGAAGTT AGTTTTCTCT GGTGACAGAG AAATTNTGTC AATTGGTAGT 60
GACAGTTGAA GCAACAGGAA CAACAAGGAT GGTTGGTGNT GATGCTGATG TGGTGATGTG 120
TTATTCATCA AATACTAAAT ACTACATTAC TTGTTGCTGC CTACTTCTCC TATTTCCTCC 180
GCCACCCATT TTGGACCCAC GANCCTTCCA TTTAAACCCT CTCTCGTGCT ATTCACCAGA 240
AGAGAAGCCA AGAGAGAGAG AGAGAGAATG TTCTGAGGAT CATTGTCTTC TTCATCGTTA 300
TTAACGTAAG TTTTTTTTGA CCACTCATAT CTAAAATCTA GTACATGCAA TAGATTAATG 360
ACTGTTCCTT CTTTTGATAT TTTCAGCTTC TTGAATTCAA GATGGGTGCT GGTGGAAGAA 420
TAATGGTTAC CCCCTCTTCC AAGAAATCAG AAACTGAAGC CCTAAAACGT GGACCATGTG 480
AGAAACCACC ATTCACTGTT AAAGATCTGA AGAAAGCAAT CCCACAGCAT TGTTTCAAGC 540
GCTCTATCCC TCGTTCTTTC TCCTACCTTC TCACAGATAT CACTTTAGTT TCTTGCTTCT 600
ACTACGTTGC CACAAATTAC TTCTCTCTTC TTCCTCAGCC TCTCTCTACT TACCTAGCTT 660
GGCCTCTCTA TTGGGTATGT CAAGGCTGTG TCTTAACCGG TATCTGGGTC ATTGGCCATG 720
AATGTGGTCA CCATGCATTC AGTGACTATC AATGGGTAGA TGACACTGTT GGTTTTATCT 780
TCCATTCCTT CCTTCTCGTC CCTTACTTCT CCTGGAAATA CAGTCATCGT CGTCACCATT 840
CCAACAATGG ATCTCTCGAG AAAGATGAAG TCTTTGTCCC ACCGAAGAAA GCTGCAGTCA 900
AATGGTATGT TAAATACCTC AACAACCCTC TTGGACGCAT TCTGGTGTTA ACAGTTCAGT 960
TTATCCTCGG GTGGCCTTTG TATCTAGCCT TTAATGTATC AGGTAGACCT TATGATGGTT 1020
TCGCTTCACA TTTCTTCCCT CATGCACCTA TCTTTAAAGA CCGAGAACGC CTCCAGATAT 1080
ACATCTCAGA TGCTGGTATT CTAGCTGTCT GTTATGGTCT TTACCGTTAC GCTGCTTCAC 1140
AAGGATTGAC TGCTATGATC TGCGTCTATG GAGTACCGCT TTTGATAGTG AACTTTTTCC 1200
TTGTCTTGGT AACTTTCTTG CAGCACACTC ATCCTTCGTT ACCTCATTAT GATTCAACCG 1260
AGTGGGAATG GATTAGAGGA GCTTTGGTTA CGGTAGACAG AGACTATGGA ATATTGAACA 1320
AGGTGTTCCA TAACATAACA GACACACATG TGGCTCATCA TCTCTTTGCA ACTATACCGC 1380
ATTATAACGC AATGGAAGCT ACAGAGGCGA TAAAGCCAAT ACTTGGTGAT TACTACCACT 1440
TCGATGGAAC ACCGTGGTAT GTGGCCATGT ATAGGGAAGC AAAGGAGTGT CTCTATGTAG 1500
AACCGGATAC GGAACGTGGG AAGAAAGGTG TCTACTATTA CAACAATAAG TTATGAGGCT 1560
GATAGGGCGA GAGAAGTGCA ATTATCAATC TTCATTTCCA TGTTTTAGGT GTCTTGTTTA 1620
AGAAGCTATG CTTTGTTTCA ATAATCTCAG AGTCCATNTA GTTGTGTTCT GGTGCATTTT 1680
GCCTAGTTAT GTGGTGTCGG AAGTTAGTGT TCAAACTGCT TCCTGCTGTG CTGCCCAGTG 1740
AAGAACAAGT TTACGTGTTT AAAATACTCG GAACGAATTG ACCACAANAT ATCCAAAACC 1800
GGCTATCCGA ATTCCATATC CGAAAACCGG ATATCCAAAT TTCCAGAGTA CTTAG 1855
(2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 384 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
Met Gly Ala Gly Gly Arg Ile Met Val Thr Pro Ser Ser Lys Lys Ser 1 5 10 15
Glu Thr Glu Ala Leu Lys Arg Gly Pro Cys Glu Lys Pro Pro Phe Thr 20 25 30
Val Lys Asp Leu Lys Lys Ala Ile Pro Gln His Cys Phe Lys Arg Ser 35 40 45
Ile Pro Arg Ser Phe Ser Tyr Leu Leu Thr Asp Ile Thr Leu Val Ser 50 55 60
Cys Phe Tyr Tyr Val Ala Thr Asn Tyr Phe Ser Leu Leu Pro Gln Pro 65 70 75 80
Leu Ser Thr Tyr Leu Ala Trp Pro Leu Tyr Trp Val Cys Gln Gly Cys 85 90 95
Val Leu Thr Gly Ile Trp Val Ile Gly His Glu Cys Gly His His Ala 100 105 110
Phe Ser Asp Tyr Gln Trp Val Asp Asp Thr Val Gly Phe Ile Phe His 115 120 125
Ser Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg 130 135 140
His His Ser Asn Asn Gly Ser Leu Glu Lys Asp Glu Val Phe Val Pro 145 150 155 160
Pro Lys Lys Ala Ala Val Lys Trp Tyr Val Lys Tyr Leu Asn Asn Pro 165 170 175
Leu Gly Arg Ile Leu Val Leu Thr Val Gln Phe Ile Leu Gly Trp Pro 180 185 190
Leu Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Phe Ala 195 200 205
Ser His Phe Phe Pro His Ala Pro Ile Phe Lys Asp Arg Glu Arg Leu 210 215 220
Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu 225 230 235 240
Tyr Arg Tyr Ala Ala Ser Gln Gly Leu Thr Ala Met Ile Cys Val Tyr 245 250 255
Gly Val Pro Leu Leu Ile Val Asn Phe Phe Leu Val Leu Val Thr Phe 260 265 270
Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Thr Glu Trp 275 280 285
Glu Trp Ile Arg Gly Ala Leu Val Thr Val Asp Arg Asp Tyr Gly Ile 290 295 300
Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His 305 310 315 320
Leu Phe Ala Thr Ile Pro His Tyr Asn Ala Met Glu Ala Thr Glu Ala 325 330 335
Ile Lys Pro Ile Leu Gly Asp Tyr Tyr His Phe Asp Gly Thr Pro Trp 340 345 350
Tyr Val Ala Met Tyr Arg Glu Ala Lys Glu Cys Leu Tyr Val Glu Pro 355 360 365
Asp Thr Glu Arg Gly Lys Lys Gly Val Tyr Tyr Tyr Asn Asn Lys Leu 370 375 380
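As an illustrative cross-check that is not part of the original disclosure, the first sixteen residues of SEQ ID NO:4 can be recovered by translating the nucleotide listing immediately preceding it, starting at the ATG that precedes GGTGCTGGTGGAAGA. The Python sketch below assumes the standard genetic code (the source does not name a code table); the names `CODON_TABLE`, `translate`, and `ORF_START` are hypothetical and chosen only for illustration.

```python
# Standard genetic code laid out in the conventional T, C, A, G order.
# Assumption: the standard table applies; the source does not say so explicitly.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 to one-letter amino acids, stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        # Codons containing ambiguous bases such as N are reported as X.
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Bases copied from the nucleotide listing above, beginning at the ATG.
ORF_START = "ATGGGTGCT GGTGGAAGAA TAATGGTTAC CCCCTCTTCC AAGAAATCAG"

if __name__ == "__main__":
    print(translate(ORF_START))
```

The printed string MGAGGRIMVTPSSKKS is the one-letter form of Met Gly Ala Gly Gly Arg Ile Met Val Thr Pro Ser Ser Lys Lys Ser, the first row of SEQ ID NO:4.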
(2) INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 387 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
Met Gly Gly Gly Gly Arg Met Ser Thr Val Ile Thr Ser Asn Asn Ser 1 5 10 15
Glu Lys Lys Gly Gly Ser Ser His Leu Lys Arg Ala Pro His Thr Lys 20 25 30
Pro Pro Phe Thr Leu Gly Asp Leu Lys Arg Ala Ile Pro Pro His Cys 35 40 45
Phe Glu Arg Ser Phe Val Arg Ser Phe Ser Tyr Val Ala Tyr Asp Val 50 55 60
Cys Leu Ser Phe Leu Phe Tyr Ser Ile Ala Thr Asn Phe Phe Pro Tyr 65 70 75 80
Ile Ser Ser Pro Leu Ser Tyr Val Ala Trp Leu Val Tyr Trp Leu Phe 85 90 95
Gln Gly Cys Ile Leu Thr Gly Leu Trp Val Ile Gly His Glu Cys Gly 100 105 110
His His Ala Phe Ser Glu Tyr Gln Leu Ala Asp Asp Ile Val Gly Leu 115 120 125
Ile Val His Ser Ala Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser 130 135 140
His Arg Arg His His Ser Asn Ile Gly Ser Leu Glu Arg Asp Glu Val 145 150 155 160
Phe Val Pro Lys Ser Lys Ser Lys Ile Ser Trp Tyr Ser Lys Tyr Ser 165 170 175
Asn Asn Pro Pro Gly Arg Val Leu Thr Leu Ala Ala Thr Leu Leu Leu 180 185 190
Gly Trp Pro Leu Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp 195 200 205
Arg Phe Ala Cys His Tyr Asp Pro Tyr Gly Pro Ile Phe Ser Glu Arg 210 215 220
Glu Arg Leu Gln Ile Tyr Ile Ala Asp Leu Gly Ile Phe Ala Thr Thr 225 230 235 240
Phe Val Leu Tyr Gln Ala Thr Met Ala Lys Gly Leu Ala Trp Val Met 245 250 255
Arg Ile Tyr Gly Val Pro Leu Leu Ile Val Asn Cys Phe Leu Val Met 260 265 270
Ile Thr Tyr Leu Gln His Thr His Pro Ala Ile Pro Arg Tyr Gly Ser 275 280 285
Ser Glu Trp Asp Trp Leu Arg Gly Ala Met Val Thr Val Asp Arg Asp 290 295 300
Tyr Gly Val Leu Asn Lys Val Phe His Asn Ile Ala Asp Thr His Val 305 310 315 320
Ala His His Leu Phe Ala Thr Val Pro His Tyr His Ala Met Glu Ala 325 330 335
Thr Lys Ala Ile Lys Pro Ile Met Gly Glu Tyr Tyr Arg Tyr Asp Gly 340 345 350
Thr Pro Phe Tyr Lys Ala Leu Trp Arg Glu Ala Lys Glu Cys Leu Phe 355 360 365
Val Glu Pro Asp Glu Gly Ala Pro Thr Gln Gly Val Phe Trp Tyr Arg 370 375 380
Asn Lys Tyr 385
(2) INFORMATION FOR SEQ ID NO:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 383 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
Met Gly Ala Gly Gly Arg Met Pro Val Pro Thr Ser Ser Lys Lys Ser 1 5 10 15
Glu Thr Asp Thr Thr Lys Arg Val Pro Cys Glu Lys Pro Pro Phe Ser 20 25 30
Val Gly Asp Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser 35 40 45
Ile Pro Arg Ser Phe Ser Tyr Leu Ile Ser Asp Ile Ile Ile Ala Ser 50 55 60
Cys Phe Tyr Tyr Val Ala Thr Asn Tyr Phe Ser Leu Leu Pro Gln Pro 65 70 75 80
Leu Ser Tyr Leu Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val 85 90 95
Leu Thr Gly Ile Trp Val Ile Ala His Glu Cys Gly His His Ala Phe 100 105 110
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser 115 120 125
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His 130 135 140
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys 145 150 155 160
Gln Lys Ser Ala Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu 165 170 175
Gly Arg Ile Met Met Leu Thr Val Gln Phe Val Leu Gly Trp Pro Leu 180 185 190
Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Phe Ala Cys 195 200 205
His Phe Phe Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu Gln 210 215 220
Ile Tyr Leu Ser Asp Ala Gly Ile Leu Ala Val Cys Phe Gly Leu Tyr 225 230 235 240
Arg Tyr Ala Ala Ala Gln Gly Met Ala Ser Met Ile Cys Leu Tyr Gly 245 250 255
Val Pro Leu Leu Ile Val Asn Ala Phe Leu Val Leu Ile Thr Tyr Leu 260 265 270
Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp Asp 275 280 285
Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile Leu 290 295 300
Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His Leu 305 310 315 320
Phe Ser Thr Met Pro His Tyr Asn Ala Met Glu Ala Thr Lys Ala Ile 325 330 335
Lys Pro Ile Leu Gly Asp Tyr Tyr Gln Phe Asp Gly Thr Pro Trp Tyr 340 345 350
Val Ala Met Tyr Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro Asp 355 360 365
Arg Glu Gly Asp Lys Lys Gly Val Tyr Trp Tyr Asn Asn Lys Leu 370 375 380
(2) INFORMATION FOR SEQ ID NO:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 384 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser 1 5 10 15
Glu Thr Asp Asn Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr 20 25 30
Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser 35 40 45
Ile Pro Arg Ser Phe Ser His Leu Ile Trp Asp Ile Ile Ile Ala Ser 50 55 60
Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro Asn Pro 65 70 75 80
Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val 85 90 95
Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His Ala Ala Phe 100 105 110
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser 115 120 125
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His 130 135 140
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Arg 145 150 155 160
Arg Ser Gln Thr Ser Ser Gly Thr Ala Ser Thr Ser Thr Thr Phe Gly 165 170 175
Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu Tyr 180 185 190
Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala Cys 195 200 205
His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu Gln 210 215 220
Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu Leu 225 230 235 240
Pro Tyr Ala Ala Val Gln Gly Val Ala Ser Met Val Cys Phe Leu Arg 245 250 255
Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr Leu 260 265 270
Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp Asp 275 280 285
Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile Leu 290 295 300
Asn Gln Gly Phe His Asn Ile Thr Asp Thr His Glu Ala His His Leu 305 310 315 320
Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala Ile 325 330 335
Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val Val 340 345 350
Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro Asp 355 360 365
Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu Xaa 370 375 380
(2) INFORMATION FOR SEQ ID NO:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 309 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
Ser Leu Leu Thr Ser Phe Ser Tyr Val Val Tyr Asp Leu Ser Phe Ala 1 5 10 15
Phe Ile Phe Tyr Ile Ala Thr Thr Tyr Phe His Leu Leu Pro Gln Pro 20 25 30
Phe Ser Leu Ile Ala Trp Pro Ile Tyr Trp Val Leu Gln Gly Cys Leu 35 40 45
Leu Thr Arg Val Cys Gly His His Ala Phe Ser Lys Tyr Gln Trp Val 50 55 60
Asp Asp Val Val Gly Leu Thr Leu His Ser Thr Leu Leu Val Pro Tyr 65 70 75 80
Phe Ser Trp Lys Ile Ser His Arg Arg His His Ser Asn Thr Gly Ser 85 90 95
Leu Asp Arg Asp Glu Arg Val Lys Val Ala Trp Phe Ser Lys Tyr Leu 100 105 110
Asn Asn Pro Leu Gly Arg Ala Val Ser Leu Leu Val Thr Leu Thr Ile 115 120 125
Gly Trp Pro Met Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp 130 135 140
Ser Phe Ala Ser His Tyr His Pro Tyr Arg Val Arg Leu Leu Ile Tyr 145 150 155 160
Val Ser Asp Val Ala Leu Phe Ser Val Thr Tyr Ser Leu Tyr Arg Val 165 170 175
Ala Thr Leu Lys Gly Leu Val Trp Leu Leu Cys Val Tyr Gly Val Pro 180 185 190
Leu Leu Ile Val Asn Gly Phe Leu Val Thr Ile Thr Tyr Leu Arg Val 195 200 205
His Tyr Asp Ser Ser Glu Trp Asp Trp Leu Lys Gly Ala Leu Ala Thr 210 215 220
Met Asp Arg Asp Tyr Gly Ile Leu Asn Lys Val Phe His His Ile Thr 225 230 235 240
Asp Thr His Val Ala His His Leu Phe Ser Thr Met Pro His Tyr His 245 250 255
Leu Arg Val Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Asp Thr 260 265 270
Pro Phe Tyr Lys Ala Leu Trp Arg Glu Ala Arg Glu Cys Leu Tyr Val 275 280 285
Glu Pro Asp Glu Gly Thr Ser Glu Lys Gly Val Tyr Trp Tyr Arg Asn 290 295 300
Lys Tyr Leu Arg Val 305
(2) INFORMATION FOR SEQ ID NO:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 302 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:
Phe Ser Tyr Val Val Tyr Asp Leu Thr Ile Ala Phe Cys Leu Tyr Tyr 1 5 10 15
Val Ala Thr His Tyr Phe His Leu Leu Pro Gly Pro Leu Ser Phe Arg 20 25 30
Gly Met Ala Ile Tyr Trp Ala Val Gln Gly Cys Ile Leu Thr Gly Val 35 40 45
Trp Val Val Ala Phe Ser Asp Tyr Gln Leu Leu Asp Asp Ile Val Gly 50 55 60
Leu Ile Leu His Ser Ala Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr 65 70 75 80
Ser His Arg Arg His His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu 85 90 95
Val Phe Val Pro Lys Val Ser Lys Tyr Leu Asn Asn Pro Pro Gly Arg 100 105 110
Val Leu Thr Leu Ala Val Thr Leu Thr Leu Gly Trp Pro Leu Tyr Leu 115 120 125
Ala Leu Asn Val Ser Gly Arg Pro Tyr Asp Arg Phe Ala Cys His Tyr 130 135 140
Asp Pro Tyr Gly Pro Ile Tyr Ser Val Ile Ser Asp Ala Gly Val Leu 145 150 155 160
Ala Val Val Tyr Gly Leu Phe Arg Leu Ala Met Ala Lys Gly Leu Ala 165 170 175
Trp Val Val Cys Val Tyr Gly Val Pro Leu Leu Val Val Asn Gly Phe 180 185 190
Leu Val Leu Ile Thr Phe Leu Gln His Thr His Val Ser Glu Trp Asp 195 200 205
Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile Leu 210 215 220
Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His Leu 225 230 235 240
Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Val Glu Tyr 245 250 255
Tyr Arg Phe Asp Glu Thr Pro Phe Val Lys Ala Met Trp Arg Glu Ala 260 265 270
Arg Glu Cys Ile Tyr Val Glu Pro Asp Gln Ser Thr Glu Ser Lys Gly 275 280 285
Val Phe Trp Tyr Asn Asn Lys Leu Ala Met Glu Ala Thr Val 290 295 300
(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 372 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
Met Gly Ala Gly Gly Arg Met Thr Glu Lys Glu Arg Glu Lys Gln Glu 1 5 10 15
Gln Leu Ala Arg Ala Thr Gly Gly Ala Ala Met Gln Arg Ser Pro Val 20 25 30
Glu Lys Pro Pro Phe Thr Leu Gly Gln Ile Lys Lys Ala Ile Pro Pro 35 40 45
His Cys Phe Glu Arg Ser Val Leu Lys Ser Phe Ser Tyr Val Val His 50 55 60
Asp Leu Val Ile Ala Ala Ala Leu Leu Tyr Phe Ala Leu Ala Ile Ile 65 70 75 80
Pro Ala Leu Pro Ser Pro Leu Arg Tyr Ala Ala Trp Pro Leu Tyr Trp 85 90 95
Ile Ala Gln Gly Ala Phe Ser Asp Tyr Ser Leu Leu Asp Asp Val Val 100 105 110
Gly Leu Val Leu His Ser Ser Leu Met Val Pro Tyr Phe Ser Trp Lys 115 120 125
Tyr Ser His Arg Arg His His Ser Asn Thr Gly Ser Leu Glu Arg Asp 130 135 140
Glu Val Phe Val Pro Lys Lys Lys Glu Ala Leu Pro Trp Tyr Thr Pro 145 150 155 160
Tyr Val Tyr Asn Asn Pro Val Gly Arg Val Val His Ile Val Val Gln 165 170 175
Leu Thr Leu Gly Trp Pro Leu Tyr Leu Ala Thr Asn Ala Ser Gly Arg 180 185 190
Pro Tyr Pro Arg Phe Ala Cys His Phe Asp Pro Tyr Gly Pro Ile Tyr 195 200 205
Asn Asp Arg Glu Arg Ala Gln Ile Phe Val Ser Asp Ala Gly Val Val 210 215 220
Ala Val Ala Phe Gly Leu Tyr Lys Leu Ala Ala Ala Phe Gly Val Trp 225 230 235 240
Trp Val Val Arg Val Tyr Ala Val Pro Leu Leu Ile Val Asn Ala Trp 245 250 255
Leu Val Leu Ile Thr Tyr Leu Gln His Thr His Pro Ser Leu Pro His 260 265 270
Tyr Asp Ser Ser Glu Trp Asp Trp Leu Arg Gly Ala Leu Ala Thr Met 275 280 285
Asp Arg Asp Tyr Gly Ile Leu Asn Arg Val Phe His Asn Ile Thr Asp 290 295 300
Thr His Val Ala His His Leu Phe Ser Thr Met Pro His Tyr His Ala 305 310 315 320
Met Glu Ala Thr Lys Ala Ile Arg Pro Ile Leu Gly Asp Tyr Tyr His 325 330 335
Phe Asp Pro Thr Pro Val Ala Lys Ala Thr Trp Arg Glu Ala Gly Glu 340 345 350
Cys Ile Tyr Val Glu Pro Glu Asp Arg Lys Gly Val Phe Trp Tyr Asn 355 360 365
Lys Lys Phe Xaa 370
(2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 224 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
Trp Val Met Ala His Asp Cys Gly His His Ala Phe Ser Asp Tyr Gln 1 5 10 15
Leu Leu Asp Asp Val Val Gly Leu Ile Leu His Ser Cys Leu Leu Val 20 25 30
Pro Tyr Phe Ser Trp Lys His Ser His Arg Arg His His Ser Asn Thr 35 40 45
Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys Lys Lys Ser Ser 50 55 60
Ile Arg Trp Tyr Ser Lys Tyr Leu Asn Asn Pro Pro Gly Arg Ile Met 65 70 75 80
Thr Ile Ala Val Thr Leu Ser Leu Gly Trp Pro Leu Tyr Leu Ala Phe 85 90 95
Asn Val Ser Gly Arg Pro Tyr Asp Arg Phe Ala Cys His Tyr Asp Pro 100 105 110
Tyr Gly Pro Ile Tyr Asn Asp Arg Glu Arg Ile Glu Ile Phe Ile Ser 115 120 125
Asp Ala Gly Val Leu Ala Val Thr Phe Gly Leu Tyr Gln Leu Ala Ile 130 135 140
Ala Lys Gly Leu Ala Trp Val Val Cys Val Tyr Gly Val Pro Leu Leu 145 150 155 160
Val Val Asn Ser Phe Leu Val Leu Ile Thr Phe Leu Gln His Thr His 165 170 175
Pro Ala Leu Pro His Tyr Asp Ser Ser Glu Trp Asp Trp Leu Arg Gly 180 185 190
Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile Leu Asn Lys Val Phe 195 200 205
His Asn Ile Thr Asp Thr Gln Val Ala His His Leu Phe Thr Met Pro 210 215 220
(2) INFORMATION FOR SEQ ID NO:12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 nucleotides
(B) TYPE: nucleotide
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:
GCTCTTTTGT GCGCTCATTC 20
(2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 nucleotides
(B) TYPE: nucleotide
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
CGGTACCAGA AAACGCCTTG 20
(2) INFORMATION FOR SEQ ID NO:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 nucleotides
(B) TYPE: nucleotide
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:
TAYWSNCAYM GNMGNCAYCA 20
(2) INFORMATION FOR SEQ ID NO:15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 nucleotides
(B) TYPE: nucleotide
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
RTGRTGNGCN ACRTGNGTRT C 21
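SEQ ID NO:14 and SEQ ID NO:15 are written with IUPAC ambiguity codes (Y, W, S, M, R, N), so each listing stands for a pool of oligonucleotides rather than a single molecule. The following Python sketch, offered only as an illustration, counts the members of each pool and forms the reverse complement of SEQ ID NO:15. The helper names `degeneracy` and `reverse_complement` are hypothetical and do not come from the source.

```python
from math import prod

# IUPAC nucleotide ambiguity codes and the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Complement of every IUPAC symbol (R pairs with Y, M with K, B with V, D with H).
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")


def degeneracy(oligo: str) -> int:
    """Number of distinct sequences represented by a degenerate oligonucleotide."""
    return prod(len(IUPAC[base]) for base in oligo.replace(" ", "").upper())


def reverse_complement(oligo: str) -> str:
    """Reverse complement that keeps ambiguity codes meaningful."""
    return oligo.replace(" ", "").upper().translate(COMPLEMENT)[::-1]


SEQ_ID_14 = "TAYWSNCAYM GNMGNCAYCA"
SEQ_ID_15 = "RTGRTGNGCN ACRTGNGTRT C"

if __name__ == "__main__":
    print(degeneracy(SEQ_ID_14), degeneracy(SEQ_ID_15))
    print(reverse_complement(SEQ_ID_15))
```

Read in codons, SEQ ID NO:14 (TAY WSN CAY MGN MGN CAY CA) is consistent with the Tyr Ser His Arg Arg His His stretch found in the protein listings above, and the reverse complement of SEQ ID NO:15 (GAY ACN CAY GTN GCN CAY CAY) is consistent with Asp Thr His Val Ala His His. This reading is an inference from the listings, not a statement made in this section.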

Claims

What is claimed is:
1. An isolated nucleic acid fragment comprising a nucleic acid sequence encoding a fatty acid hydroxylase with an amino acid identity of 60% or greater to the polypeptide encoded by SEQ ID NO:4.
2. The isolated nucleic acid fragment of Claim 1, wherein the amino acid identity is 90% or greater to the polypeptide encoded by SEQ ID NO:4.
3. The isolated nucleic acid fragment of Claim 1, wherein the amino acid identity is 100% of the polypeptide encoded by SEQ ID NO:4.
4. An isolated nucleic acid fragment having a nucleic acid identity of 90% or greater of a nucleotide sequence of SEQ ID NO:1, 2, or 3.
5. An isolated nucleic acid having a nucleotide sequence of SEQ ID NO:1, SEQ ID NO:2 or SEQ ID NO:3.
6. The isolated nucleic acid fragment of Claim 1, wherein said fragment is isolated from an oil-producing plant species.
7. A chimeric gene capable of causing altered levels of ricinoleic acid in a transformed plant cell, said chimeric gene comprising a nucleic acid fragment of Claim 1, said fragment operably linked to suitable regulatory sequences.
8. A chimeric gene capable of causing altered levels of lesquerolic acid in a transformed plant cell, said chimeric gene comprising a nucleic acid fragment of Claim 1, said fragment operably linked to suitable regulatory sequences.
9. A chimeric gene capable of causing altered levels of fatty acids in a transformed plant cell, said chimeric gene comprising a nucleic acid fragment of Claim 1, said fragment operably linked to suitable regulatory sequences.
10. A chimeric gene capable of causing altered levels of fatty acids in a transformed plant cell, said chimeric gene comprising a nucleic acid fragment of Claim 2, said fragment operably linked to suitable regulatory sequences.
11. A chimeric gene capable of causing altered levels of fatty acids in a transformed plant cell, said chimeric gene comprising a nucleic acid fragment of Claim 4, said fragment operably linked to suitable regulatory sequences.
12. Plants containing the chimeric gene of any one of claims 7, 8, 9, 10 or 11.
13. Oil obtained from seeds of the plants of claim 12.
14. The isolated nucleic acid fragment of Claim 1, wherein said fragment is obtainable from Ricinus communis (L.) (Castor).
15. The isolated nucleic acid fragment of Claim 1, wherein said fragment is obtainable from Lesquerella fendleri.
16. A method of producing seed oil containing altered levels of hydroxylated fatty acids comprising:
(a) transforming a plant cell of an oil-producing species with a chimeric gene containing an isolated nucleic acid of Claim 1;
(b) growing fertile plants from the transformed plant cells of step (a);
(c) screening progeny seeds from the fertile plants of step (b) for the desired levels of hydroxylated fatty acids; and
(d) processing the progeny seed of step (c) to obtain seed oil containing altered levels of unsaturated fatty acids.
17. The method of Claim 16, wherein said crop plant is selected from the group consisting of rapeseed, Crambe, Brassica juncea, Canola, flax, sunflower, safflower, cotton, cuphea, soybean, peanut, coconut, oil palm and corn.
18. A method of producing seed oil containing altered levels of hydroxylated fatty acids comprising:
(a) transforming a plant cell of an oil-producing species with a chimeric gene containing the nucleotide sequence of SEQ ID NO:1, SEQ ID NO:2 or SEQ ID NO:3;
(b) growing fertile plants from the transformed plant cells of step (a);
(c) screening progeny seeds from the fertile plants of step (b) for the desired levels of hydroxylated fatty acids; and
(d) processing the progeny seed of step (c) to obtain seed oil containing altered levels of unsaturated fatty acids.
19. The method of Claim 18, wherein said crop plant is selected from the group consisting of rapeseed, Crambe, Brassica juncea, Canola, flax, sunflower, safflower, cotton, cuphea, soybean, peanut, coconut, oil palm and corn.
20. A triglyceride oil from a plant selected from the group consisting of rapeseed, Crambe, Brassica juncea, Canola, flax, sunflower, cotton, cuphea, soybean, peanut, coconut, oil palm and corn, wherein the fatty acid composition of the oil has been modified to contain hydroxylated fatty acids by a method comprising growing a plant cell having integrated in its genome a DNA construct containing a plant hydroxylase encoding sequence of Claim 1, under conditions which will permit the transcription and translation of said plant hydroxylase in the plant cells.
21. A method to isolate nucleic acid fragments encoding fatty acid hydroxylases comprising:
(a) comparing SEQ ID NO:4 and other fatty acid hydroxylase sequences and fatty acid desaturases;
(b) identifying conserved sequences of 4 or more amino acids obtained in step (a);
(c) designing degenerate oligomers based on the conserved sequences identified in step (b);
(d) using the degenerate oligomers of step (c) to isolate sequences encoding fatty acid hydroxylases by sequence dependent protocols;
(e) obtaining the deduced amino acid sequence of the encoded gene product from the nucleotide sequence of the gene; and
(f) distinguishing hydroxylase genes from desaturase genes by analyzing amino acid sequence differences between fatty acid desaturases and fatty acid hydroxylases.
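Steps (a) and (b) of claim 21 can be sketched as a scan over aligned sequences for runs of four or more identical residues. The Python example below is a minimal illustration under stated assumptions: the inputs are already aligned and of equal length, the three fragments are ungapped 30-residue windows taken from SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6 and rewritten in one-letter code, and the function name `conserved_runs` is hypothetical. It is not the alignment procedure used by the inventors, which this section does not describe.

```python
def conserved_runs(aligned, min_len=4):
    """Return (start, motif) for every run of >= min_len columns that are
    identical in all aligned sequences. Start positions are 0-based."""
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")
    runs = []
    start = None
    # Iterate one position past the end so a run touching the end is closed.
    for i in range(length + 1):
        column = {seq[i] for seq in aligned} if i < length else set()
        identical = len(column) == 1 and "-" not in column
        if identical and start is None:
            start = i
        elif not identical and start is not None:
            if i - start >= min_len:
                runs.append((start, aligned[0][start:i]))
            start = None
    return runs


# Windows beginning at "Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr" in each listing.
FRAGMENTS = [
    "LLVPYFSWKYSHRRHHSNNGSLEKDEVFVP",  # from SEQ ID NO:4
    "LLVPYFSWKYSHRRHHSNIGSLERDEVFVP",  # from SEQ ID NO:5
    "LLVPYFSWKYSHRRHHSNTGSLERDEVFVP",  # from SEQ ID NO:6
]

if __name__ == "__main__":
    for start, motif in conserved_runs(FRAGMENTS):
        print(start, motif)
```

For these three windows the scan reports LLVPYFSWKYSHRRHHSN, GSLE and DEVFVP. Stretches of this kind are candidates for the degenerate oligomers of step (c); a real comparison would first need a gapped alignment of the full-length sequences.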
PCT/US1995/011855 1994-09-26 1995-09-25 Production of hydroxylated fatty acids in genetically modified plants WO1996010075A1 (en)

Priority Applications (7)

Application Number Priority Date Filing Date Title
EP95934442A EP0781327B1 (en) 1994-09-26 1995-09-25 Production of hydroxylated fatty acids in genetically modified plants
AU36778/95A AU718512B2 (en) 1994-09-26 1995-09-25 Production of hydroxylated fatty acids in genetically modified plants
CA2200202A CA2200202C (en) 1994-09-26 1995-09-25 Production of hydroxylated fatty acids in genetically modified plants
DE69534849T DE69534849T2 (en) 1994-09-26 1995-09-25 PREPARATION OF HYDROXYLATED FATTY ACIDS IN GENETICALLY MODIFIED PLANTS
JP8511856A JPH10506783A (en) 1994-09-26 1995-09-25 Production of hydroxylated fatty acids in genetically modified plants
US09/885,189 US6936728B2 (en) 1994-09-26 2001-06-21 Production of hydroxylated fatty acids in genetically modified plants
US11/058,746 US20060101543A1 (en) 1994-09-26 2005-02-16 Production of hydroxylated fatty acids in genetically modified plants

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US08/314,596 1994-09-26
US08/314,596 US5668292A (en) 1994-09-26 1994-09-26 Use of plant fatty acyl hydroxylases to produce hydroxylated fatty acids and derivatives in plants
US08/320,982 US5801026A (en) 1994-09-26 1994-10-11 Use of plant fatty acyl hydroxylases to produce hydroxylated fatty acids and derivatives in plants
US08/320,982 1994-10-11
US08/530,862 1995-09-20

Publications (1)

Publication Number Publication Date
WO1996010075A1 (en) 1996-04-04

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1995/011855 WO1996010075A1 (en) 1994-09-26 1995-09-25 Production of hydroxylated fatty acids in genetically modified plants

Country Status (2)

Country Link
US (3) US5801026A (en)
WO (1) WO1996010075A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999053073A2 (en) * 1998-04-16 1999-10-21 Pierre Broun Interconversion of plant fatty acid desaturases and hydroxylases
EP1002067A1 (en) * 1997-07-18 2000-05-24 Carnegie Institution Of Washington Strong early seed-specific gene regulatory region
EP1009220A4 (en) * 1996-02-06 2000-06-21 Carnegie Inst Of Washington Production of hydroxylated fatty acids in genetically modified plants
WO2000070052A1 (en) * 1999-05-18 2000-11-23 Metapontum Agrobios S.C.R.L. Gene isolated from ricinus communis encoding a new protein that interacts with the oleate 12-hydroxylase enzyme
US6291742B1 (en) 1994-09-26 2001-09-18 Carnegie Institution Of Washington Production of hydroxylated fatty acids in genetically modified plants
US6433250B1 (en) 1994-09-26 2002-08-13 Carnegie Institution Of Washington Use of plant fatty acyl hydroxylases to produce hydroxylated fatty acids and derivatives in plants
JP2003525030A (en) * 1999-08-27 2003-08-26 セムバイオシス ジェネティックス インコーポレイテッド Flax seed-specific promoter
WO2007147256A1 (en) * 2006-06-22 2007-12-27 Bioriginal Food & Science Corp. Fatty acid hydroxylases and uses thereof

Families Citing this family (14)

Publication number Priority date Publication date Assignee Title
US6872872B1 (en) * 1992-11-17 2005-03-29 E. I. Du Pont De Nemours And Company Genes for microsomal delta-12 fatty acid desaturases and related enzymes from plants
US6372965B1 (en) 1992-11-17 2002-04-16 E.I. Du Pont De Nemours And Company Genes for microsomal delta-12 fatty acid desaturases and hydroxylases from plants
US5656312A (en) * 1994-09-02 1997-08-12 Erasmus; Udo Dietary food supplement and method of preparing
US6007860A (en) * 1994-09-02 1999-12-28 Designing Health, Inc. Preparing food supplement
US6060101A (en) * 1994-09-02 2000-05-09 Designing Health, Inc. Dietary food supplement
US6924109B2 (en) * 1999-07-30 2005-08-02 Agy Therapeutics, Inc. High-throughput transcriptome and functional validation analysis
US6974893B2 (en) * 2001-06-29 2005-12-13 Brookhaven Science Associates, Llc Isoform of castor oleate hydroxylase
MY146503A (en) * 2001-08-13 2012-08-15 Malaysian Palm Oil Board Method and compositions for the production of transgenic plants
US20040157803A1 (en) * 2002-03-04 2004-08-12 Williams Deryck J. Nematicidal fatty acid and fatty acid ester related compounds
US6887900B2 (en) * 2002-03-04 2005-05-03 Divergence, Inc. Nematicidal compositions and methods
WO2004071168A2 (en) * 2003-02-05 2004-08-26 Divergence, Inc. Nucleic acids encoding anthelmintic agents and plants made therefrom
US7368629B2 (en) * 2004-02-04 2008-05-06 Divergence, Inc. Nucleic acids encoding anthelmintic agents and plants made therefrom
CA2800359A1 (en) * 2005-03-16 2006-09-28 Metabolix, Inc. Chemically inducible expression of biosynthetic pathways
US20110054199A1 (en) * 2009-08-31 2011-03-03 Certo Labs, Inc. Methods for extracting nutrients, drugs and toxins from a sample, and apparati for same

Family Cites Families (4)

Publication number Priority date Publication date Assignee Title
ES2198408T3 (en) * 1992-11-17 2004-02-01 E.I. Du Pont De Nemours And Company GENES FOR DELTA 12 DESATURASAS OF MICROSOMAL FATTY ACIDS AND RELATED PLANT ENZYMES.
JPH08506490A (en) 1993-02-05 1996-07-16 モンサント・カンパニー Modified linolenic and linoleic acid contents in plants
US5801026A (en) 1994-09-26 1998-09-01 Carnegie Institution Of Washington Use of plant fatty acyl hydroxylases to produce hydroxylated fatty acids and derivatives in plants
ATE319296T1 (en) 1995-12-14 2006-03-15 Cargill Inc PLANTS WITH MUTATED SEQUENCES WHICH PROVIDE ALTERED FATTY ACID CONTENT

Non-Patent Citations (4)

Title
BIOCHEMICAL JOURNAL, Volume 287, issued 1992, SMITH et al., "Evidence for Cytochrome b5 as an Electron Donor in Ricinoleic Acid Biosynthesis in Microsomal Preparations from Developing Castor Bean (Ricinus Communis L.)", pages 141-144. *
PLANTA, Volume 172, issued 1987, BRAY et al., "Expression of the Beta-Subunit of Beta-Conglycinin in Seeds of Transgenic Plants", pages 364-370. *
PROC. NATL. ACAD. SCI. U.S.A., Volume 86, issued March 1989, GOULD et al., "Use of the DNA Polymerase Chain Reaction for Homology Probing: Isolation of Partial cDNA or Genomic Clones Encoding Iron-Sulfur Protein of Succinate Dehydrogenase from Several Species", pages 1934-1938. *
SCIENCE, Volume 258, issued 20 November 1992, ARONDEL et al., "Map-Based Cloning of a Gene Controlling Omega-3 Fatty Acid Desaturation in Arabidopsis", pages 1353-1355. *

Cited By (16)

Publication number Priority date Publication date Assignee Title
US6936728B2 (en) 1994-09-26 2005-08-30 Carnegie Institution Of Washington Production of hydroxylated fatty acids in genetically modified plants
US6291742B1 (en) 1994-09-26 2001-09-18 Carnegie Institution Of Washington Production of hydroxylated fatty acids in genetically modified plants
US8003855B2 (en) 1994-09-26 2011-08-23 Carnegie Institution Of Washington Production of hydroxylated fatty acids in genetically modified plants
US6310194B1 (en) 1994-09-26 2001-10-30 Carnegie Institution Of Washington Plant fatty acid hydroxylases
US6433250B1 (en) 1994-09-26 2002-08-13 Carnegie Institution Of Washington Use of plant fatty acyl hydroxylases to produce hydroxylated fatty acids and derivatives in plants
EP1009220A4 (en) * 1996-02-06 2000-06-21 Carnegie Inst Of Washington Production of hydroxylated fatty acids in genetically modified plants
EP1009220A1 (en) * 1996-02-06 2000-06-21 Carnegie Institution Of Washington Production of hydroxylated fatty acids in genetically modified plants
AU737823B2 (en) * 1997-07-18 2001-08-30 Carnegie Institution Of Washington Strong early seed-specific gene regulatory region
EP1002067A4 (en) * 1997-07-18 2004-10-27 Carnegie Inst Of Washington Strong early seed-specific gene regulatory region
EP1002067A1 (en) * 1997-07-18 2000-05-24 Carnegie Institution Of Washington Strong early seed-specific gene regulatory region
WO1999053073A3 (en) * 1998-04-16 2000-02-10 Pierre Broun Interconversion of plant fatty acid desaturases and hydroxylases
WO1999053073A2 (en) * 1998-04-16 1999-10-21 Pierre Broun Interconversion of plant fatty acid desaturases and hydroxylases
WO2000070052A1 (en) * 1999-05-18 2000-11-23 Metapontum Agrobios S.C.R.L. Gene isolated from ricinus communis encoding a new protein that interacts with the oleate 12-hydroxylase enzyme
JP2003525030A (en) * 1999-08-27 2003-08-26 セムバイオシス ジェネティックス インコーポレイテッド Flax seed-specific promoter
WO2007147256A1 (en) * 2006-06-22 2007-12-27 Bioriginal Food & Science Corp. Fatty acid hydroxylases and uses thereof
US7923598B2 (en) 2006-06-22 2011-04-12 Bioriginal Food & Science Corp. Fatty acid hydroxylases and uses thereof

Also Published As

Publication number Publication date
US5801026A (en) 1998-09-01
US6433250B1 (en) 2002-08-13
US6028248A (en) 2000-02-22

Similar Documents

Publication Publication Date Title
US6310194B1 (en) Plant fatty acid hydroxylases
WO1996010075A1 (en) Production of hydroxylated fatty acids in genetically modified plants
US7148336B2 (en) Nucleic acid sequences and methods of use for the production of plants with modified polyunsaturated fatty acid levels
US5668292A (en) Use of plant fatty acyl hydroxylases to produce hydroxylated fatty acids and derivatives in plants
EP0495096A4 (en) Plant fatty acid synthases
CA2200202C (en) Production of hydroxylated fatty acids in genetically modified plants
US6501004B1 (en) Transgenic reduction of sinapine in crucifera
WO1999053073A9 (en) Interconversion of plant fatty acid desaturases and hydroxylases
MXPA98006317A (en) Production of hydroxyled fatty acids in plants genetically modifies
CA2816177C (en) Desaturase introns and method of use for the production of plants with modified polyunsaturated fatty acids
AU2004200281A1 (en) Interconversion of plant fatty acid desaturases and hydroxylases
EP1908843A1 (en) Nucleic acid sequences and methods of use for the production of plants with modified polyunsaturated fatty acids

Legal Events

Date Code Title Description
AK Designated states: Kind code of ref document: A1; Designated state(s): AU CA JP
AL Designated countries for regional patents: Kind code of ref document: A1; Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101)
121 EP: the EPO has been informed by WIPO that EP was designated in this application
ENP Entry into the national phase: Ref document number: 2200202; Country of ref document: CA; Ref country code: CA; Ref document number: 2200202; Kind code of ref document: A; Format of ref document f/p: F
WWE WIPO information: entry into national phase: Ref document number: 1995934442; Country of ref document: EP
WWP WIPO information: published in national office: Ref document number: 1995934442; Country of ref document: EP
WWG WIPO information: grant in national office: Ref document number: 1995934442; Country of ref document: EP