Desaturase Nucleic Acids and Polypeptides Field of the Invention
This disclosure relates to the identification of delta-12 and delta-15 desaturase genes involved in the desaturation of oleic acid to linoleic acid and further to alpha linolenic acid; plants comprising mutations in the delta-12 and/or delta-15 desaturase genes that have seed with altered fatty acid content are also disclosed.
Background to the Invention
Edible oils containing lower levels of saturated fatty acids and elevated concentrations of oleic acids and poly unsaturated fatty acids such a linoleic acid are highly desirable due to the perceived dietary health benefits and possibly preventing diseases such as arteriosclerosis or diabetes. Moreover, monounsaturated oils, such as oleic acid are suitable replacements to petroleum-based feedstocks in the manufacture of plastic, lubricants and cosmetics and are known to enhance the combustion of biodiesel.
Vegetable oils extracted from plants comprise various amounts of saturated, mono- and polyunsaturated fatty acids and although mono-and polyunsaturated oils both have their use, polyunsaturated oils are considered contaminants in oils for industrial use as they are prone to oxidation and difficult to remove during oil processing. Therefore, plants with high concentrations of oleic acid (OA), a monounsaturated fatty acid, and low amounts of polyunsaturated fatty acids such as linoleic acid (LA) or alpha-linolenic acid (ALA) are highly desirable.
Two multifunctional classes of desaturases have been found in plants, one soluble and the other membrane bound. In plants C16- and C18-fatty acids are synthesized in the stroma of plastids and with desaturation of 18:0 to 18:1 by a soluble delta-9 stearoyl ACP desaturase also occurring in plastids. Further desaturation of fatty acids in membrane lipids of the chloroplast and endoplasmic reticulum (ER) is carried out by the membrane bound desaturases, a number of which have been designated FAD2 to FAD8.
The seeds of Cannabis sativa L. (hemp, marijuana) are an important source of oil and protein in human nutrition dating back to Neolithic times in ancient China. C. sativa has an annual life cycle and is mostly dioecious with male and female flowers borne on separate plants. Selective breeding has produced marijuana strains accumulating high levels of psychoactive cannabinoids in the female flowers and hemp cultivars typically having low levels of cannabinoids but good fibre and/or seed oil traits. Hemp has modest agrochemical requirements, is an excellent break crop and is suited to warm-to- temperate growing l
conditions. At over 80% in polyunsaturated fatty acids (PUFAs), hemp seed oil rivals most of the commonly used vegetable oils. At 56% LA and 22% ALA hemp oil is a rich source of these essential fatty acids. In addition, hemp oil also contains gamma linolenic acid (GLA) and stearidonic acid (SDA) which occur at about 4% and 2% respectively. This disclosure relates to two novel desaturase genes in hemp catalysing desaturation of oleic acid (OA) to LA and LA to ALA. Homozygous plants lacking the delta-12 desaturase [also known as FAD2 desaturase] show increased amounts of OA, whereas plants lacking the delta-15 desaturase [also known as FAD3 desaturase] show increased amounts of LA and near zero levels of ALA. Plants carrying a specific point mutation in the delta-12 desaturase show increased amounts of GLA and when this delta-12 desaturase point mutation is crossed into plants lacking the delta-15 desaturase there is a further increase in the amounts of GLA, a profile desired to efficiently purify GLA from plants. Plants carrying single delta-12 desaturase or delta-15 desaturase mutations or both mutations are also disclosed, as are plants engineered to down-regulate or ablate expression of delta-12 desaturase and/or delta-15 desaturase.
Statement of the Invention
According to an aspect of the invention there is provided a modified Cannabis spp plant wherein said plant is modified in a gene encoding a delta-12 desaturase polypeptide wherein the modification reduces or abrogates the expression or activity of said delta-12 desaturase and said modified plant has enhanced oleic acid content when compared to a wild-type Cannabis spp plant comprising an unmodified delta-12 desaturase gene.
In a preferred embodiment of the invention said modified plant has an increased oleic acid content of between 70-85% of total oil content of the modified plant when compared to the wild-type plant comprising a wild-type copy of said delta-12 desaturase gene. In a further embodiment of the invention said modified plant has reduced linoleic acid content when compared to a wild-type plant.
In a preferred embodiment of the invention the linoleic acid content is between 1-5% of the total oil content of the modified plant when compared to a wild-type plant.
In a preferred embodiment of the invention said modified plant has reduced alpha linolenic acid content when compared to a wild-type plant.
In a preferred embodiment of the invention said modified plant has an alpha linolenic acid content between 2.5-7.5% of the total oil content of the modified plant when compared to a wild-type plant.
In a preferred embodiment of the invention said modified plant has reduced gamma linolenic acid content when compared to a wild-type plant.
In a preferred embodiment of the invention the gamma linolenic acid content of said modified plant is between 0.5-1.5% of the total oil content of the modified plant when compared to a wild-type plant.
In a preferred embodiment of the invention said modified plant includes a modification to a delta-12 desaturase genomic sequence comprising the nucleotide sequence as set forth in SEQ ID NO: 1 , or a polymorphic sequence variant thereof.
According to a further aspect of the invention there is provided a modified Cannabis spp plant wherein said plant is modified in a gene encoding a delta-12 desaturase polypeptide wherein the modification modulates said delta-12 desaturase expression and/or activity relative to other membrane bound desaturases and said modified plant has enhanced gamma linolenic acid content when compared to a wild-type Cannabis spp plant comprising an unmodified delta-12 desaturase gene.
In a preferred embodiment of the invention said delta-12 desaturase is modified at or around amino acid residue proline 341. In a preferred embodiment of the invention said delta-12 desaturase is modified at amino acid residue proline 341 by amino acid substitution. Preferably said substitution is replacement of amino acid residue proline 341 with leucine.
In a preferred embodiment of the invention said modified Cannabis spp plant has a gamma linolenic acid content 5-15% of the total oil content of the modified plant when compared to a wild-type plant.
According to a further aspect of the invention there is provided a modified Cannabis spp plant wherein said plant is modified in a gene encoding a delta-15 desaturase polypeptide wherein the modification reduces or abrogates the expression or activity of said delta-15 desaturase and said modified plant has enhanced linoleic acid content when compared to a wild-type Cannabis spp plant comprising an unmodified delta-15 desaturase gene.
In a preferred embodiment of the invention said modified Cannabis spp plant has low or undetectable alpha linolenic acid content when compared to a wild-type plant.
In a preferred embodiment of the invention said modified Cannabis spp plant has a linoleic acid content between 60% to 70% of the total oil content of the modified plant when compared to a wild-type plant.
In a preferred embodiment of the invention said modified plant comprising a modification to a delta-12 desaturase genomic sequence and further comprising a modification to a delta-15 desaturase genomic sequence has increased gamma linolenic acid when compared to the wild type plant.
In a further preferred embodiment of the invention the gamma linolenic acid content of said modified plant is 10 to 15%; for example 10.9 to 11.7%.
In a further preferred embodiment of the invention the oleic acid content of said modified plant is 7.5% to 10%; for example 8.5 to 8.9%.
In a further preferred embodiment of the invention the linoleic acid content of said modified plant is 50% to 75%; for example 63 to 70.5 %.
In a further preferred embodiment of the invention the alpha linolenic acid content of said modified plant is 0.1 to 1 %; for example 0.4 to 0.6 %.
In a preferred embodiment of the invention said modified plant includes a modification to a delta-15 desaturase genomic sequence comprising the nucleotide sequence as set forth in SEQ ID NO: 2, or a polymorphic sequence variant thereof.
In a preferred embodiment of the invention said delta-12 and/or delta-15 desaturase gene[s] are modified in the nucleotide coding sequence to introduce one or more termination or nonsense codons thereby preventing expression of said desaturase[s].
According to an aspect of the invention there is provided an isolated nucleic acid molecule that encodes a Cannabis spp desaturase polypeptide wherein said nucleic acid molecule comprises or consists of a nucleotide sequence selected from the group consisting of: i) a nucleotide sequence as represented by the sequence in SEQ ID NO: 1 or 2; ii) a nucleotide sequence wherein said sequence is degenerate as a result of the genetic code to the nucleotide sequence defined in (i);
iii) a nucleic acid molecule the complementary strand of which hybridizes under stringent hybridization conditions to the sequence in SEQ ID NO: 1 or 2 wherein said nucleic acid molecule encodes a desaturase;
iv) a nucleotide sequence that encodes a polypeptide comprising an amino acid sequence as represented in SEQ ID NO: 3 or 4 ;
V) a nucleotide sequence that encodes a polypeptide comprising an amino acid sequence wherein said amino acid sequence is modified by addition deletion or substitution of at least one amino acid residue as represented in iv) above and which has retained or enhanced desaturase activity.
Hybridization of a nucleic acid molecule occurs when two complementary nucleic acid molecules undergo an amount of hydrogen bonding to each other. The stringency of hybridization can vary according to the environmental conditions surrounding the nucleic acids, the nature of the hybridization method, and the composition and length of the nucleic acid molecules used. Calculations regarding hybridization conditions required for attaining particular degrees of stringency are discussed in Sambrook et al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 2001); and Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology— Hybridization with Nucleic Acid Probes Part I, Chapter 2 (Elsevier, New York, 1993). The Tm is the temperature at which 50% of a given strand of a nucleic acid molecule is hybridized to its complementary strand. The following is an exemplary set of hybridization conditions and is not limiting:
Very High Stringency (allows sequences that share at least 90% identity to hybridize) Hybridization: 5x SSC at 65°C for 16 hours Wash twice: 2x SSC at room temperature (RT) for 15 minutes each
Wash twice: 0.5x SSC at 65°C for 20 minutes each
High Stringency (allows sequences that share at least 80% identity to hybridize) Hybridization: 5x-6x SSC at 65°C-70°C for 16-20 hours
Wash twice: 2x SSC at RT for 5-20 minutes each Wash twice: 1x SSC at 55°C-70°C for 30 minutes each
Low Stringency (allows seguences that share at least 50% identity to hybridize)
Hybridization: 6x SSC at RT to 55°C for 16-20 hours
Wash at least twice: 2x-3x SSC at RT to 55°C for 20-30 minutes each.
In a preferred embodiment of the invention said nucleic acid molecule comprises or consists of a nucleotide sequence as set forth in SEQ ID NO: 1 or 2.
In a preferred embodiment of the invention said nucleic acid molecule comprises of a nucleotide sequence set forth in SEQ ID NO: 1 and encodes a delta-12 desaturase. In a preferred embodiment of the invention said nucleic acid molecule comprises of a nucleotide sequence set forth in SEQ ID NO: 2 and encodes a delta-15 desaturase.
In a preferred embodiment of the invention said nucleotide sequence is a cDNA sequence.
In an alternative embodiment of the invention said nucleotide sequence is a genomic sequence. According to a further aspect of the invention there is provided an isolated polypeptide selected from the group consisting of: i) a polypeptide comprising or consisting of an amino acid sequence as represented in SEQ ID NO: 3 or 4; or
ii) a modified polypeptide comprising or consisting of a modified amino acid sequence wherein said polypeptide is modified by addition, deletion or substitution of at least one amino acid residue of the sequence presented in SEQ ID NO: 3 or 4 and which has retained or enhanced desaturase activity.
A modified polypeptide as herein disclosed may differ in amino acid sequence by one or more substitutions, additions, deletions, truncations that may be present in any combination. Among preferred variants are those that vary from a reference polypeptide by conservative amino acid substitutions. Such substitutions are those that substitute a given amino acid by another amino acid of like characteristics. The following non-limiting list of amino acids are considered conservative replacements (similar): a) alanine, serine, and threonine; b) glutamic acid and aspartic acid; c) asparagine and glutamine d) arginine and lysine; e) isoleucine, leucine, methionine and valine and f) phenylalanine, tyrosine and tryptophan. Most highly preferred are variants that retain or enhance the same biological function and activity as the reference polypeptide from which it varies.
In one embodiment, the variant polypeptides have at least 50% identity, even more preferably at least 55% identity, still more preferably at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% identity, and at least 99% identity with most or the full length amino acid sequence illustrated herein.
In a further preferred embodiment of the invention the variant polypeptides have at least 84 % identity with reference to the full length amino acid sequence set forth in SEQ ID NO: 3.
In a further preferred embodiment of the invention the variant polypeptides have at least 78 % identity with reference to the amino acid sequence set forth in SEQ ID NO: 4. .
In a preferred embodiment of the invention said polypeptide according to the invention or said variant polypeptide comprises or consists of the amino acid sequence set forth in SEQ ID NO: 3 wherein said polypeptide is a delta-12 desaturase.
In a preferred embodiment of the invention said polypeptide according to the invention or said variant polypeptide comprises or consists of the amino acid sequence set forth in SEQ ID NO: 4 wherein said polypeptide is a delta-15 desaturase.
According to a further aspect of the invention there is provided a vector comprising a nucleic acid molecule encoding a desaturase polypeptide according to the invention wherein said nucleic acid molecule is operably linked to a nucleic acid molecule comprising a promoter sequence.
In a preferred embodiment of the invention said nucleic acid sequence comprising a promoter confers constitutive expression on said desaturase.
In an alternative preferred embodiment of the invention said nucleic acid molecule comprising a promoter confers regulated expression on said desaturase.
In a preferred embodiment of the invention said regulated expression is tissue or developmental^ regulated expression. In a further alternative embodiment of the invention said regulated expression is inducible expression.
Preferably the nucleic acid molecule in the vector is under the control of, and operably linked to, an appropriate promoter or other regulatory elements for transcription in a host cell such as a microbial, (e.g. bacterial, yeast), or plant cell. The vector may be a bi-functional expression vector which functions in multiple hosts. In the case of desaturase genomic DNA this may contain its own promoter or other regulatory elements and in the case of cDNA this may be under the control of an appropriate promoter or other regulatory elements for expression in the host cell.
By "promoter" is meant a nucleotide sequence upstream from the transcriptional initiation site and which contains all the regulatory regions required for transcription. Suitable promoters include constitutive, tissue-specific, inducible, developmental or other promoters for expression in plant cells comprised in plants depending on design. Such promoters include viral, fungal, bacterial, animal and plant-derived promoters capable of functioning in plant cells.
Constitutive promoters include, for example CaMV 35S promoter (Odell et al. (1985) Nature 313, 9810-812); rice actin (McElroy et al. (1990) Plant Cell 2: 163-171); ubiquitin (Christian et al. (1989) Plant Mol. Biol. 18 (675-689); pEMU (Last et al. (1991) Theor Appl. Genet. 81 : 581-588); MAS (Velten et al. (1984) EMBO J. 3. 2723-2730); ALS promoter (U.S. Application Seriel No. 08/409,297), and the like. Other constitutive promoters include those in U.S. Patent Nos. 5,608, 149; 5,608, 144; 5,604, 121 ; 5,569,597; 5,466,785; 5,399,680, 5,268,463; and 5,608, 142, each of which is incorporated by reference. Chemical-regulated promoters can be used to modulate the expression of a gene in a plant through the application of an exogenous chemical regulator. Depending upon the objective, the promoter may be a chemical-inducible promoter, where application of the chemical induced gene expression, or a chemical-repressible promoter, where application of the chemical represses gene expression. Chemical-inducible promoters are known in the art and include, but are not limited to, the maize ln2-2 promoter, which is activated by benzenesulfonamide herbicide safeners, the maize GST promoter, which is activated by hydrophobic electrophilic compounds that are used as pre-emergent herbicides, and the tobacco PR-1a promoter, which is activated by salicylic acid. Other chemical-regulated promoters of interest include steroid-responsive promoters (see, for example, the glucocorticoid-inducible promoter in Schena et al. (1991) Proc. Natl. Acad. Sci. USA 88: 10421-10425 and McNellis et al. (1998) Plant J. 14(2): 247-257) and tetracycline-inducible and tetracycline-repressible promoters (see, for example, Gatz et al. (1991) Mol. Gen. Genet. 227: 229-237, and US Patent Nos. 5,814,618 and 5,789,156, herein incorporated by reference). Where enhanced expression in particular tissues is desired, tissue-specific promoters can be utilised. Tissue-specific promoters include those described by Yamamoto et al. (1997) Plant J. 12(2): 255-265; Kawamata et al. (1997) Plant Cell Physiol. 38(7): 792-803; Hansen et al. (1997) Mol. Gen. Genet. 254(3): 337-343; Russell et al. (1997) Transgenic Res. 6(2): 157- 168; Rinehart et al. (1996) Plant Physiol. 112(3): 1331-1341 ; Van Camp et al. (1996) Plant Physiol. 112(2): 525-535; Canevascni et al. (1996) Plant Physiol. 1 12(2): 513-524;
Yamamoto et al. (1994) Plant Cell Physiol. 35(5): 773-778; Lam (1994) Results Probl. Cell Differ. 20: 181-196; Orozco et al. (1993) Plant Mol. Biol. 23(6): 1129-1138; Mutsuoka et al. (1993) Proc. Natl. Acad. Sci. USA 90 (20): 9586-9590; and Guevara-Garcia et al (1993) Plant J. 4(3): 495-50. "Operably linked" means joined as part of the same nucleic acid molecule, suitably positioned and oriented for transcription to be initiated from the promoter. DNA operably linked to a promoter is "under transcriptional initiation regulation" of the promoter. In a preferred aspect, the promoter is a tissue specific promoter, an inducible promoter or a developmental^ regulated promoter. Particular of interest in the present context are nucleic acid constructs which operate as plant vectors. Specific procedures and vectors previously used with wide success in plants are described by Guerineau and Mullineaux (1993) (Plant transformation and expression vectors. In: Plant Molecular Biology Labfax (Croy RRD ed) Oxford, BIOS Scientific Publishers, pp 121-148. Suitable vectors may include plant viral-derived vectors (see e.g. EP194809). If desired, selectable genetic markers may be included in the construct, such as those that confer selectable phenotypes such as resistance to herbicides (e.g. kanamycin, hygromycin, phosphinotricin, chlorsulfuron, methotrexate, gentamycin, spectinomycin, imidazolinones and glyphosate).
According to a further aspect of the invention there is provided a transgenic cell transformed or transfected with a nucleic acid molecule or vector according to the invention.
In a preferred embodiment of the invention said cell is a plant cell.
In a preferred embodiment of the invention said plant cell is from the genus Cannabis spp.
In a preferred embodiment of the invention said plant cell is a Cannabis sativa cell.
According to a further aspect of the invention there is provided a plant comprising a plant cell according to the invention.
In a preferred embodiment of the invention said plant is from the genus Cannabis; preferably Cannabis sativa.
According to a further aspect of the invention there is provided a seed obtained from the plant according to the invention. In an alternative preferred embodiment of the invention said cell is a microbial cell; preferably a bacterial or fungal cell [e.g. yeast, Saccharomyces cerevisae].
In a preferred embodiment of the invention said cell is adapted such that the nucleic acid molecule encoding the desaturase is over-expressed when compared to a non-transgenic cell of the same species.
According to a further aspect of the invention there is provided a nucleic acid molecule comprising a transcription cassette wherein said cassette includes a nucleotide sequence designed with reference to part of SEQ ID NO: 1 or 2 and is adapted for expression by provision of at least one promoter operably linked to said nucleotide sequence such that both sense and antisense molecules are transcribed from said cassette.
In a preferred embodiment of the invention said cassette is adapted such that both sense and antisense ribonucleic acid molecules are transcribed from said cassette wherein said sense and antisense nucleic acid molecules are adapted to anneal over at least part or all of their length to form a inhibitory RNA or short hairpin RNA.
In a preferred embodiment of the invention said cassette is provided with at least two promoters adapted to transcribe both sense and antisense strands of said ribonucleic acid molecule.
In an alternative preferred embodiment of the invention said cassette comprises a nucleic acid molecule wherein said molecule comprises a first part linked to a second part wherein said first and second parts are complementary over at least part of their sequence and further wherein transcription of said nucleic acid molecule produces an ribonucleic acid molecule which forms a double stranded region by complementary base pairing of said first and second parts thereby forming an short hairpin RNA.
A technique to specifically ablate gene function is through the introduction of double stranded RNA, also referred to as small inhibitory/interfering RNA (siRNA) or short hairpin RNA [shRNA], into a cell which results in the destruction of mRNA complementary to the sequence included in the siRNA/shRNA molecule. The siRNA molecule comprises two complementary strands of RNA (a sense strand and an antisense strand) annealed to each other to form a double stranded RNA molecule. The siRNA molecule is typically derived from exons of the gene which is to be ablated. The mechanism of RNA interference is being elucidated. Many organisms respond to the presence of double stranded RNA by activating a cascade that leads to the formation of siRNA. The presence of double stranded RNA activates a protein complex comprising RNase III which processes the double stranded RNA into smaller fragments (siRNAs, approximately 21-29 nucleotides in length) which become part of a ribonucleoprotein complex. The siRNA acts as a guide for the RNase complex to
cleave mRNA complementary to the antisense strand of the siRNA thereby resulting in destruction of the mRNA.
In a preferred embodiment of the invention said nucleic acid molecule is part of a vector adapted for expression in a plant cell. According to a further aspect of the invention there is provided a plant cell transfected with a nucleic acid molecule or vector according to the invention wherein said cell has reduced expression of one or more desaturase[s] according to the invention.
According to a further aspect of the invention there is provided a plant wherein said plant comprises a transfected plant cell according to the invention. According to a further aspect of the invention there is provided the use of a gene encoded by a nucleic acid molecule comprising the nucleic acid sequence in SEQ ID NO: 1 or 2, or a nucleic acid molecule the complementary strand of which hybridizes under stringent hybridization conditions to the nucleotide sequence in SEQ ID NO: 1 or 2 wherein said nucleic acid molecule encodes a polypeptide with desaturase activity as a means to identify a locus wherein said locus is associated with altered expression or activity of said desaturase.
Mutagenesis as a means to induce phenotypic changes in organisms is well known in the art and includes but is not limited to the use of mutagenic agents such as chemical mutagens [e.g. base analogues, deaminating agents, DNA intercalating agents, alkylating agents, transposons, bromine, sodium azide] and physical mutagens [e.g. ionizing radiation, psoralen exposure combined with UV irradiation].
According to a further aspect of the invention there is provided a method to produce a Cannabis spp plant that has altered expression of a desaturase polypeptide according to the invention comprising the steps of: i) mutagenesis of wild-type seed from a Cannabis spp plant that does express said desaturase;
ii) cultivation of the seed in i) to produce first and subsequent generations of plants;
iii) obtaining seed from the first generation plant and subsequent generations of plants;
iv) determining if the seed from said first and subsequent generations of plants has altered nucleotide sequence and/or altered expression of said desaturase polypeptide;
v) obtaining a sample and analysing the nucleic acid sequence of a nucleic acid molecule selected from the group consisting of:
a) a nucleic acid molecule comprising a nucleotide sequence as represented in SEQ ID NO: 1 or 2;
b) a nucleic acid molecule that hybridises to the nucleic acid molecule in a) under stringent hybridisation conditions and that encodes a polypeptide with desaturase polypeptide activity; and optionally
vi) comparing the nucleotide sequence of the nucleic acid molecule in said sample to a nucleotide sequence of a nucleic acid molecule of the original wild-type plant.
In a preferred method of the invention said nucleic acid molecule is analysed by a method comprising the steps of: i) extracting nucleic acid from said mutated plants;
ii) amplification of a part of said nucleic acid molecule by a polymerase chain reaction;
iii) forming a preparation comprising the amplified nucleic acid and nucleic acid extracted from wild-type seed to form heteroduplex nucleic acid; iv) incubating said preparation with a single stranded nuclease that cuts at a region of heteroduplex nucleic acid to identify the mismatch in said heteroduplex; and
v) determining the site of the mismatch in said nucleic acid heteroduplex.
In a preferred method of the invention said Cannabis spp plant has enhanced desaturase polypeptide expression and/or activity.
In an alternative preferred method of the invention said Cannabis spp plant has reduced or abrogated desaturase polypeptide expression and/or activity.
According to a further aspect of the invention there is provided a plant obtained by the method according to the invention.
According to an aspect of the invention there is provided a plant wherein said plant comprises a viral vector that includes all or part of a gene comprising a nucleic acid molecule according to the invention.
In a preferred embodiment of the invention said gene is encoded by a nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of: a nucleic acid molecule comprising a nucleotide sequence as represented in SEQ ID NO: 1 or 2;
a nucleic acid molecule comprising a nucleotide sequence that hybridises under stringent hybridisation conditions to a nucleic acid molecule in (i) and which encodes a desaturase polypeptide;
a nucleic acid molecule that encodes a variant polypeptide that varies from a polypeptide comprising the amino acid sequence as represented in SEQ ID NO: 3 or 4.
According to a further aspect of the invention there is provided a viral vector comprising all or part of a nucleic acid molecule according to the invention.
According to an aspect of the invention there is provided the use of a viral vector according to the invention in viral induced gene silencing in a plant.
In a preferred embodiment of the invention_said plant is from the genus Cannabis spp.
Virus induced gene silencing [VIGS] is known in the art and exploits a RNA mediated antiviral defence mechanism. Plants that are infected with an unmodified virus induce a mechanism that specifically targets the viral genome. However, viral vectors which are engineered to include nucleic acid molecules derived from host plant genes also induce specific inhibition of viral vector expression and additionally target host mRNA. This allows gene specific gene silencing without genetic modification of the plant genome and is essentially a non-transgenic modification.
According to a further aspect of the invention there is provided a process for the preparation of oil from a Cannabis spp plant comprising the steps: i) obtaining seed according to the invention;
ii) extracting from the seed a fraction comprising plant oil; and optionally iii) isolating said oil fraction to provide an enriched oil fraction.
In a preferred embodiment of the invention said material is cold press extracted. According to a further aspect of the invention there is provided an oleic acid-enriched oil preparation obtained or obtainable by the process according to the invention.
In a preferred embodiment of the invention said preparation comprises 70-85% oleic acid.
In a preferred embodiment of the invention said preparation comprises 2.5-7.5% alpha linolenic acid.
In a preferred embodiment of the invention said preparation comprises 0.5-1.5% gamma linolenic acid. In a preferred embodiment of the invention said oleic acid-enriched oil preparation is at least 7 fold more stable than oil preparation extracted from seed of a wild type Cannabis spp plant.
According to an alternative aspect of the invention there is provided a gamma linolenic acid- enriched oil preparation obtained or obtainable by the process according to the invention.
In a preferred embodiment of the invention said preparation comprises 10 to 15% gamma linolenic acid, for example 10.9-11 % gamma linolenic acid.
In a preferred embodiment of the invention said preparation comprises 7.5% to 10% oleic acid, for example 8.5-8.9 % oleic acid.
In a preferred embodiment of the invention said preparation comprises 50-75% linoleic acid, for example 63-70.5% linoleic acid. In a preferred embodiment of the invention said preparation comprises 0.1-1 % alpha linolenic acid, for example 0.4-0.6% alpha linolenic acid.
According to a further aspect of the invention there is provided a gamma linolenic acid- enriched oil preparation obtained or obtainable by the process according to the invention.
In a preferred embodiment of the invention said preparation comprises 5-15% gamma linolenic acid.
According to a further aspect of the invention there is provided a linoleic acid-enriched oil preparation obtained or obtainable by the process according to the invention.
In a preferred embodiment of the invention said preparation has low or undetectable alpha linolenic acid content. In a preferred embodiment of the invention said preparation comprises 60-70% linoleic acid.
Throughout the description and claims of this specification, the words "comprise" and "contain" and variations of the words, for example "comprising" and "comprises", means "including but not limited to", and is not intended to (and does not) exclude other moieties, additives, components, integers or steps. "Consisting essentially" means having the essential integers but including integers which do not materially affect the function of the essential integers.
Throughout the description and claims of this specification, the singular encompasses the plural unless the context otherwise requires. In particular, where the indefinite article is used, the specification is to be understood as contemplating plurality as well as singularity, unless the context requires otherwise.
Features, integers, characteristics, compounds, chemical moieties or groups described in conjunction with a particular aspect, embodiment or example of the invention are to be understood to be applicable to any other aspect, embodiment or example described herein unless incompatible therewith.
An embodiment of the invention will now be described by example only and with reference to the following figures: Figure 1 Expression of putative desaturase genes in developing embryos of the hemp cultivar Finola and metabolic context. A, Embryos representative of each developmental stage used for RNA isolation are shown (Scale bar = 1 mm). EST libraries from torpedo (T), upturned (U) and filled not dessicated (FND) stages of embryo development were generated by deep sequencing and read counts analysed in silico. Raw reads were mapped to reference sequence, which consisted of the open reading frames of 17 putative desaturase genes as detailed in Table 1 with BWA mapping software (Li, 2009). The raw read counts were retrieved from the resulting output file for each gene in the EST libraries and the counts were then normalized to RPKM (Reads Per Kilobase per Million) values which is considered representative of transcript abundance. Gene expression is depicted in a heat map format with RPKM values included. B, Schematic presentation of the biosynthetic pathway giving rise to the major fatty acids in hemp seed oil. SACPD - stearoyl ACP desaturase, DS - desaturase. Enzymatic steps are shown in bold and those steps compromised by mutation in specific CSFAD2 and CSFAD3 genes as detailed in Figures 2 and 3 are indicated;
Figure 2 Characterisation of CSFAD2A gene function. A, Expression of CSFAD2A and CSFAD2B in developing embryo and mature leaf tissue compared to levels in young hemp
leaves. Raw quantitative PCR data were normalised to hemp ACT2 transcript level in each tissue and expressed on a logarithmic scale as (1+E)-AACt where E is the amplification efficiency. Mean values represent the average of three biological replicas each consisting of three technical replicates. (YL, young leaves; ML, mature leaves; TORP, torpedo stage of hemp embryo; UPT, U-upturned stage of hemp embryo; FND, filled-not-desiccated stage of hemp embryo; MAT mature seed embryo). B, Fatty acid composition of S. cerevisiae transformed with either CSFAD2A cDNA or an empty vector (pESC-TRP) control. Each value is the mean ± SD from three independent experiments. C, Fatty acid composition of seed oil from homozygous csfad2a-1 (BC2Fi) and (D) homozygous csfad2a-2 ( C^F^) plants compared to respective segregating heterozygous and wild type plants from the same generation as detailed in Table 2. Each value is the mean ± SD from 8 to 28 seeds from the same line and generation;
Figure 3 Characterisation of CSFAD3A gene function. A, Expression of CSFAD3A in developing embryo and mature leaf tissue compared to levels in young hemp leaves. Raw quantitative PCR data were normalised to hemp ACT2 transcript level in each tissue and expressed as (1 +E)-AACt where E is the amplification efficiency. Mean values represent the average of three biological replicates each consisting of three technical replicates. YL, young leaves; ML, mature leaves; TORP, torpedo stage of hemp embryo; UPT, U-upturned stage of hemp embryo; FND, filled-not-desiccated stage of hemp embryo; MAT mature seed embryo. B, Fatty acid composition of S. cerevisiae transformed with either CSFAD3A cDNA or an empty vector (pESC-TRP) control. Both transformants were subjected to similar induction and feeding with LA and GLA. Each value is the mean ± SD from three independent experiments. (C) Fatty acid composition of seed oil from homozygous csfad3a- 1 (ΒΟ^) and (D) homozygous csfad3a-2 (BC2F!) plants compared to respective segregating heterozygous and wild type plants from the same generation as detailed in Table 2. Each value is the mean ± SD from 4 to 20 seeds from the same line and generation;
Figure 4. A, Fatty acid composition of seed oil from homozygous csfad2a-3 (BC4F1) compared to respective segregating wild type plants from the same generation as detailed in Table 2. Each value is the mean ± SD from 20 (WT) and 60 seeds (csfad2a-3) from the same line and generation; B, Fatty acid composition of seed oil from homozygous csfad2a-3 x csfad3a-1 (BC4F2) double mutant compared to respective segregating homozygous csfad2a-3, csfad3a-1 and wild type plants from the same generation as detailed in Table 2. Each value is the mean ± SD from 6 to 40 seeds from the same line and generation;
Figure 5 Cold-pressed oil analyses from standard (std) or high oleic (HO) hempseed and rapeseed. Small batches of seed harvested from field plots (-150 g) were cold-pressed and analysed for total oil content in the cake and seed (A), relative distribution of fatty acids (B) and rancimat-assayed stability at three different temperatures (C). Tocopherol assays are shown for hemp seed only (D). All data are representative assay values taken from the second or third pressed oil batches after the press had been preconditioned with appropriate seed and reached uniform operating temperatures. For tocopherol analyses (D), values are means ± 1 standard error from five analyses from the same oil batches with letters above bars indicating significantly different groups (ANOVA and Tukey's HSD; P < 0.05); Figure 6: Hexadecadienoic acid double bond localisation; Hexadecadienoic acid double bond localisation. S. cerevisiae pellets from cultures expressing CSFAD2A or CSFAD3A were harvested 28h after gal induction and transmethylated to FAMEs. An aliquot of the isolated FAME fraction was transesterified to 3-pyridylcarbinol esters, which were then chromatographically separated and detected by GCMS on a polar BPX70 column. Extracted ion chromatograms for the expected molecular ion of m/z 343 consistent with 16:2 indicated two resolved peaks (A). Mass spectra from the first eluting peak (from the CSFAD2A expressing sample) identified this peak as 16: 2Δ9.12 (B). Mass spectra from the second eluting peak (from the CSFAD3A expressing sample) identified this peak as 16:2Δ9, 15 (C). Insets show the molecular structure of the relevant 3-pyridylcarbinol fatty acid esters with predicted abundant mass spectral fragments containing the pyridyl headgroup; i.e. fragments arising from cumulative methyl-end losses. Mass spectra are labelled with all predicted mass spectral fragments that were actually found in the mass range shown;
Figure 7. Octadecadienoic acid double bond localisation; Oil samples extracted from field- grown high-oleic acid hemp were analysed and plotted as described in Figure 6 Extracted ion chromatograms for the expected molecular ion of m/z 371 consistent with 18:2 indicated three resolved peaks (A). Mass spectra from the first eluting peak identified this peak as 18:2Δ6,9 (B), the second as 18:2Δ9, 12 (C), and the third as 18:2Δ9, 15 (D). This third peak was also found in yeast extracts expressing CSFAD3A; Figure 8. Eicosadienoic acid double bond localisation; The standard 37-FAME mix containing 20:2Δ1 1 , 14 as the only 20:2 FAME and FAMEs from yeast expressing CSFAD2A fed with 20:1Δ11 were transesterified to their 3-pyridylcarbinol esters and analysed as described in Figure 6. Although the expected molecular ion of m/z 399 could be detected in both samples, the signal was weak in the yeast extracts; therefore for clarity the 3- pyridylcarbinol ester common base ion of m/z 92 was used to identify co-elution of candidate
20:2 peaks (A). Mass spectra from the standard (A) and the yeast extract (B) confirmed a single peak with the identity 20:2Δ11 ,14;
Figure 9. Comparison of tocopherol content in standard and high oleic rapeseed and hemp seed oil; Oil from second and third press runs (B and C, respectively) were analysed as per Fig. 5;
Figure 10. Comparison of small molecule volatiles emitted from standard and high oleic rapeseed and hemp seed oil; Volatiles were analysed by SPME of headspace above cold- pressed oil samples followed by separation and detection by GCMS. Representative traces are shown with major peaks identified by reference to authentic standards (hexanal, heptanal, alpha pinene) or to the NIST 05 mass spectral library (butenyl isothiocyanate);
Figure 11. .Triacylglycerol analysis reveals that triolein accumulates in High Oleic Hemp seed and is absent in WT seed. Single seeds were harvested from mutant or wild type plants grown under glass or in the field over different seasons. These were analyzed for triacylglycerol content by LCMS. Results are means ± 1 standard error for n = 5 analyses; Figure 12 Nucleotide sequence encoding CSFAD2A (SEQ ID NO: 1);
Figure 13 Amino acid sequence of CSFAD2A (SEQ ID NO: 3);
Figure 14 Nucleotide sequence encoding CSFAD3A (SEQ ID NO: 2);
Figure 15 Amino acid sequence of CSFAD3A (SEQ ID NO: 4); and
Figure 16: A boxplot depiction of molar percent GLA in oil from individual seeds from a wild type (WT), heterozygous and homozygous csfad2a-3 mutant. This analysis demonstrates a significant increase in GLA content in seed oil from the homozygous mutant seed material compared to wild type with the heterozygote showing a GLA content inter diary between WT and homozygous mutant.
Table 1. Nucleotide sequences, exon number and source of soluble and membrane bound desaturases from C. sativa;
genomic exon
gene name class nucleotide (ORF) derived from genome sequence sequences number
ATGGCTCTCAAACTCAACCCCACCATCGCTCAATCTCCAA AGTTACCAGCTTTTGCTCTTCCACCAATGGCTAGCCTCAG ATCTCCCAAGTTCTTCATGGCCTCCACCCTCCGTTCTGGC T C C AAAG AGGT TG AT AAT AT C AAG AAG CCTTTCACTCCTC CT AGAGAGGT C C ATGTT C AAGT AAC AC ATT C C ATGC C AC C T C AG AAG AT TG AG AT C T T T AAGT C AT TGG AAG AT TGGG C T GATCAGAACCTTTTGGTTCACCTTAAGCCAGTTGAGAAGT GCTGGCAACCTCAGGATTTTCTCCCTGAACCATCATCTGA TGGATTTCATGAGCAGGTGATGGAACTTAGGGAGAGGGCT AGGGAGCTTCCTGATGATTACTTTGTTGTTCTGGTTGGTG ATATGATCACAGAAGAAGCACTCCCAACTTATCAAACTAT GCTTAATACATTGGATGGAGTTAGGGATGAAACTGGTGCC AGCCCAACTTCTTGGGCTATTTGGACTAGAGCATGGACTG
CSS AC PD- A CTGAAGAGAACAGGCATGGTGACCTCCTCAACAAGTATCT
TTACCTCAGTGGACGAGTCGATATGAGGCAAATTGAGAAG
(SEQ ID No delta9 FN1_14572637 3 AC C ATT C AGT AT CTGAT CGGTT CTGGAATGGAT C C C CGGA 34) CAGAGAACAATCCTTATCTTGGTTTCATCTACACTTCATT
CCAAGAAAGAGCCACCTTTATCTCACATGGTAACACTGCC AGG C TAG C AAAGG AG C ATGGGG AC T T AAAAT TGG C AC AAA TATGTGGTACCATAGCTGCAGACGAGAAGCGCCACGAGAC AGCCTACACTAAGATAGTTGAGAAGCTATTTGAGATTGAT CCTGATGGGACTGTGTTAGCATTTGCTGACATGATGAGGA AGAAGATAGCCATGCCAGCACACTTGATGTACGATGGCCG AGATGACAATCTTTTCGATAACTTTTCTGCTGTTGCACAA CGGCTTGGAGTGTACACGGCCAAGGATTACGCGGACATAT TGGAGTTCTTGGTTGGGAGGTGGAAGGTGGAGAAGCTAAG TGGACTTTCCGGGGAGGGGCTTAAGGCTCAGGAGTATGTT TGCGGGTTACCTCCAAGAATCAGAAGGCTGGAGGAAAGAG CTCAAGGAAGGGTGAAACAAGCTAGGAGTGTACCCTTCAG TTGGGTATATGATAGACAAGTGAGTCTCTAA
ATGGCTCTCAGACTCAGCTCAACGATCAACTTCCCAACTC ACAACGTCTCTTCTAAGCCTCACACTCTCAGATCTCCAAG GCTCTGCATGGCCTCCACTCTCCACTCCATTTCTAAAGAG ACTGAAAATGGAAAAAAGCCTTATTCGCCTCCGAAGGAGG TACATCTTCAAGTGACTCATTCACTACCACCTCAAAAGGT TGAGATCTTCAAGTCATTAGAAGGCTGGGCTGAAGATAAC ATTTTGGTGCACTTGAAACCTGTGGAGAAATGTTGGCAGC CACAAGATTTTCTACCCGAGCCGGAATCTGAAGGGTTTTA TGATCAAGTCAGGGAGTTAAGGGAAAGGGCGAAAGAAATT CCCGATGACTATTTTGTTGCGTTGGTCGGTGATATGATCA CTGAAGAAGCTCTACCGACATACCAGACAATGCTTAATAC TTTAGACGGGGTTAGAGATGAGACCGGTGCAAGCCCTACT TCTTGGGGAATATGGACCAGGGCGTGGACTGCTGAGGAGA
CSSACPD-B ATAGGCATGGAGACCTTCTCAACAAGTATCTGTATCTCTC
TGGAAGGGTTGATATGAAGCAAGTTGAGAAGACCATCCAA
(SEQ ID No delta9 FN1_14548126 3 T AT CT C ATTGGCT C AGGAATGGAT C C C AAAACGGAAAAC A 35) ACCCGTATTTGGGTTTCATCTACACCTCCTTTCAAGAGAG
GGCTACATTCATCTCC C ATGG AAAT AC TG C C AGG C AAG C C AAAGAGCACGGTGACCTGAAACTGGCGCAGATATGCGGCA CAATTGCTGCCGATGAGAAACGCCATGAAACTGCCTACAC AAAGATTGTGGAGAAGCTCTTTGAGATTGACCCGAATGGC ACTGTTATGGCTTTTGCTGACATGATGAAGAAGAAGATAT CGATGCCTGCCCACTTGATGTACGACGGGAAGGATGACAA TCTTTTCGATCACTTTGCAGCAGTTACACAGAAGCTTGAA GTTTACACTGCCAAGGATTATGCTGATATCATGGAGTTTC TGGTTGGAAGATGGAAGATTGAGAAATTGAGTGGTCTTTC GAGTGAGGGCCACAGAGCACAAGATTATGTGTGTAAATTG C C C C AG AGG AT AAG AAAGT TGG AGG AG AG AG C T CAGGGAA GG AC C AAG C AAG CAT C AATGGT T C C T T T C AG C TGG AT AT T TGGTAGAGAAATCAAGATTTGA
ATGCACGCAGGAGCCTCCTCTTCTTACCTTAGAAATCTTC
AATGGGC C C AAC C C AACGGC C C AAT AAGC C C AAAAAC ACT CCCACTGAACCCCTACGTCAGTTTCCGAGTCTCCGCCGTG GCAGCCCCACCGCCGCAGCTAAAGTTTCAGAGAACGCATT CGATGCCGCCAGAGAAAGTTGAAATCTTTAAGTCGTTAGA AGGTTGGGCCTCCAAATCTGTTCTGCCATTGTTGAAGCCC GTGGACCAATGCTGGCAGCCTCAAGATTTTTTACCCGACC CGGCTAAGACTAGAGAAGAGTTCTTTGATCAGGTCCGTCA ATTGCGTGATCGGACGGTCGAGCTTCCAGATGAGTTTTTC GTTGTGTTAGTTGGGGATATGATCACGGAGGACGCATTGC CTACGTACCAGACCATGATAAATACCCTGGACGGCGTTAA GGACGAGACCGGAGCTAGCTCAAGCCCATGGGCCCAATGG ACTCGGGCCTGGACCGCCGAAGAGAATCGCCACGGTGATT
CSSACPD-C TGCTTCGAACTTATCTGTATTTAACGGGTCGGGTCGATAT
GAC C ATGAT CGAAAGAAC CGT C C AGT AC CTGATTGGAGCT
(SEQ ID No delta9 FN1_14565533 2 GGC ATGGAT C CGGGAAC AGAGAAC AGT C CGT ACTTGGGAT 36) TTGTGTACACGTCATTCCAAGAACGTGCCACGTTTGTGTC
ACATGGCAACACAGCACGCATGGCAAAGGAGAGCGGGGAT CCAGTGTTGGCGCGTATATGCGGGACCATTGCATCCGACG AGAAACGCCACGAGAATGCCTATTCCAAGATCGTTGAGAA GCTTCTAGAAGTGGACCCCAACAATGCCATGCTGGCAATC GCTGATATGATGAGGAAGAAAATAACAATGCCGGCTCACC TTATGTACGATGGACGGGACCCTATGATATTTGAGCACTT CTCGGCTGTGGCTCAGCGGCTCGGTGTGTATACGGCTGAT GATTACGCTGATATCTTGGAATTCTTGATCGGGCGGTGGA GGTTGGAGAAGATGGTGGGGTTGTCGGCTGAGGCTCAGCG AGCTCAGAACTATGTTTGTGGGTTGGCGCCCAGGATTAGG AAGCTGCAGGAGCGAGCCGATGATCGGGCGCGTAAGATGG AGCCACAAAGTGTCAAGTTTAGCTGGATATTTAATAAGGA AGTCCTCTTGTAA
ATGCAAGTACAGGTCAGTAATATTTCATTGTGGGCCTTAA ATGGCCACCAAAGCCCAAACAAGCTCCAACTGAGAAGCCC ATCACCAAAACCCAGATTCCAAGTCTCAGCCGTGGCCTCA CCACCGCGGCCGATGAAACTCCATCAGCCAACGATGCCGC CGGCTAAAGAAGAAGTGTTCAAGTCGCTAGAAGGGTGGGC CACCCAATCGATTCTTCCACTGCTAAAGCCGGTGGAGGAA TGTTGGCAGCCCCAAGACTTTTTACCCAACCCATCAAATT CTGACGAAGAATTCTTCGATGAGATCCGTTTGATTCAAGA TCGTGCGGCTGAGATTCCAGATGAGTACTTTGTTGTCTTG GTTGGAGATATGATCACGGAGGAAGCATTGCCTACTTACC AGACCATTATGAACTCTATTGATGCCGTTAAGGATAGGAC TGGAGTTTGCTCCAGTCCATGGGCTCGGTGGACCCGCGAG TGGTCCGCCGAGGAGAATCGGCACGGTGACTTGCTTCGTA
CSSACPD-D FN1_14539699 CGTATTTATACTTATCGGGTCGGGTCGATATGACCATGGT
TGAACGGACTATCCAACATTTGATTGGAGCTGGCATGAAT
(SEQ ID No delta9 + 2 GGAAATTTCAACAACAATCCATACTTGGGTTATGTGTACA 37) FN1_13882113 CATCATTTCAAGAGCGAGCAACATTTGTGTCGCATGGCAA
CACAGCTCACTTAGCGAAAAAGAGCGGAGATCCACTCTTG GCGCGCATTTGCGGAACTATTGCAGCCGATGAGAAGCGAC ACGAGATTGCTTACGTAAAAATGACCGAGAAGCTCTTAGA AGTTGAC C C AAAT AATGT C ATGCT CGC AAT CGAGGAAATG ATGAGGAGAAAGATCACAATGCCGGCCGCCCTTATGTACG ATGGATGCGACCCCATGTTATTCCACCACTTCTCGGCTGT GGCTCAGCGCCTCGGCATCTACACAGCTGATGACTACGCC GACATCTTGGAGTTCTTAATCAAAAGGTGGAGGTTAGAGA AGATGGAAGGGTTGAATCCCGAGGCTCAAAGGGCGCAAGA CTTTGTTTGTGGCCTTGCGCCGAGAATTAGGAAGCTTCAA GAGCGAGCTGATGAGCGTGCACAAAAGATGGAGCCTCTTA GTGTCAAGTTTAGTTGGATTTTTAACAAAGAGGTTCTTGT GTAG
ATGCAAGTACTACAAGTTTCATGGCAGGCCTTAAGTGGCT
T C C AAAG C C C AAAAAAT C T C C AAC TG AG AAG C C C AT C AC C AAAGCCCAGATTCCGAGTCTCCGCCGTGGCCTTACCACCA C C AC CGATGT CGC CGGAT AT AGAAGAAGTTTT C AAGT C AC TAG AG AG CTGGGCCACC C AAT C AAT TATCCCACTGC T AAA GCCGGTGGAGGAATCTTGGCAGCCCCAAGATTTGTTACCA AGCCCAACCTATAATAATGTCGAGGAAGAATTCTTCGATC AGATCCGTTCGATTCAAGATCGTGCGGCTGAGATTCCAGA TGAGTACTTTGTTGTCTTGGTTGGAGATATGATCACGGAG GAAGCATTGCCTACATACCAGACCATTATGAATTCTATTG ATGCCATTAAGGATAAGACTGGAGTTTGCTCCAGTCCATG GGCTCGGTGGACCCGCGCATGGTCCGCCGAGGAGAATCGC CATGGTGACTTGCTTCGTACCTATTTATATTTAACGGGTC
CSSACPD-E FN1_14576833 GGGTAGATATGACCATGGTTGAACGTACTATCCAACACTT (SEQ ID No delta9 + 2 GATTGGAGCTGGCATGGATGCAAGATTCAACAACAATCCA
TACTTGTTTTACGTGTACACATCATTTCAAGAACGAGCCA
38) FN1_13944518 CGTTTGTGTCCCACGGCAACACGGCCCGCTTAGCGAAAAA
CAACGGAAACCCACTCTTGGCGCGCATTTGTGGGACTATT GCGGCCGATGAGAAGCGTCACGAGATTGCGTACGTAAAAG TGACCGAGAAGCTCTTAGAAGTTGACCCAAATAATGCCAT GCTAGCAATTGAAGAAATGATGAGGAGAAAGATCACAATG CCGGCCTTCCTTATGTACGATGGATGCGACCCCATGCTAT TCCACCATTTCTCGGCTGTGGCTCAGCGCCTCGGCGTCTA CACAACTGATGACTACGCCAACATCTTGGAGTTCTTAATC GGACAGTGGAAGTTAGAGAAGATGGAAGGGTTGAAACCTG AAGCTCAAAGAGCGCAAGACTATGTTTGTGGCCTTGCACC GAGAATTAGGAGGCTGCAAGAGCGAGCTGATGAGCGCGCA CGGAAGATGGGGCCTCTTAGTGTCAAGTTTAGTTGGGTTT T T AAC AAGG AGGT T C T T C T C TAG
ATGGGAGC CGGTGGC CGAATGC C CGAGGCGAAAT C CGAGT TGAATGGTAGTAAGAATAATAATAGGCTAATTGAGAGAGT AC C AC AC AC CAAAC C AC C ATT C AC ATT AAGCGAAAT C AAG AAAGCAATTCCGCCCCATTGCTTTAAACGCTCTCTAATTC GCTCTTTTGCTTGTGTCTTTCACGACCTTTTTTTCGCGTC ATTGTTTTACTATGTTGCAACCTCTTACTTTCACCTTATC CCGAAACCAATTTCATACATTGCTTGGCCAATTTATTGGA TTTTCCAAGGTTGTATTTTGACCGGGGTTTGGGTCATCGC TCATGAGTGTGGTCACCATGCTTTTAGTGACCACCAGTGG GTGGATGACACCGTTGGTCTCATCCTCCACTCTGCTCTTC TTGTCCCATATTTTTCATGGAAGTATAGTCATCGTCGCCA CCACTCAAACACGGGGTCCATTGATCGCGACGAAGTGTTT GT AC CAAAAC CAAAAT C AC AAGTGT C AC C ATT CGC C AAAT
CSFAD2A ACTTAAACAATCCACCCGGGAGAGTCTTAAGCCTTTTTGT
TACCCTAACACTTGGTTGGCCTTTGTACTTAGCTTTCAAT
(SEQ ID No deltal2 FN1_7054927 1 GTATCAGGCAGACCATATGACCGTTTCGCTTGTCATTATG 39) ATCCCTATGGCCCAATCTACTCAAACCGCGAAAGGTTACA
AAT AT T C AT C T CGG AC AT AGGG AT TTTCATTGCCACATTC GCGCTATACCACCTTGTCTCGGCCAAAGGGTTAGGTTGGG TTGTGTTAGTGTATGGTGTGCCTTTGTTAATAGTAAATGG CTTCCTTGTTTTGATCACTTACTTGCAACACACTCACCCT GCATTGCCTCATTATGACTCGTCCGAATGGGATTGGTTGA GAGGAGCATTGTCAACCGTTGATCGAGACTATGGAATTCT CAATAGGGTTTTTCACAACATTACTGACACTCATGTTGTG CACCATTTATTCTCAACAATGCCACATTACAATGCAATGG AAGCAACCAAAGCTGTGAAGCCGATATTAGGCGAGTACTA CCGTTTAGATGACACTCCAATTGTTAAGGCTATGTGGAGA GAAGCTAAAGAGTGTCTCTATGTTGAGCAAGATGATGATT CTCCATCTAACAAAGGTGTTTTTTGGTACAAAAACAAGTT TTAG
ATGGGAGTT AAAAGT CGAATGCT CGAGC C AAAAT C CGAGT
TGAAAGATAGTAAGAACAATAATAATAGCCCAATTGAGAG AGCACCACACACTAAACCACCATTCACACTAAGCCAAATC AAGAAAGCCATTCCACCCCATTGCTTCCAACGCTCTCTTC TTCGCTCCTTCTTTTATGTCTTTCGAGACCTTTTCTATGT CACTTTGTTCTACTACTTAGCAACCTCTTACTTCCACCTT CTCCCCCATCCACTCCCATACCTAGCTTGGCCACTTTATT GGATCTTCCAAGGTTGTGCTTTGTTTGCTTTCGGGCTCAT TGGTCATGAATGTGGTCACCATGCCTTTAGTGACTACAAA TGGATTGATGACATGGTTGGTTTTGTTATCCACTCTGCAA TTCTTCTCCCATACTTCTCATTTAAGTATAGTCACCGTCG CCACCATTCAAACACTGGATCCATTGATCGCGATGAAGCT TTTGTTCCAAAGACGAAATCTCAAATGCCATGGTTCTCCA
CSFAD2B AAT AC T T AAAC AAT C C AT T AGG AAG AGT C C T AAC C C T AGG
TTTTCTATTAACCGTCGGTTTTCCTTCATACTTAACTTTC
(SEQ ID No deltal2 FN1_4063495 1 AATATATTAGGCAGACGATATGACCGTTTCGCTTCTCATT 40) ATGATCCTTACTCTCCTATATACTCCAACAATGAAAGGCT
TCAAATATTAATTTCCGATGTGGGGGTTTTCATCACCACA TTCGTGTTGTACCAACTCGCCTTAGCAAGAGGGTTGAGTT GGGTTATGTTAGTGTATGGGGTGCCAATGGTATTAGTGAG TGGTTGGCTTGTTTTGGTCACTTACTTACAACACACTCAC CCTGCATTGCCTCACTATGATTCTTCCGAATGGGATTGGT TGAGAGGTGCTTTGTCGACAGTTGATCGAGACTTTGGAGT GCTCAATAGTATTTTTCATAACATTTCAAACACTCATGTT GTGCACCATTTATTCCCCACAATACCATATTACAATGCAG TGGAAGCAACTAAAGCTGTGAAGCCAATATTAGGAGAGTA CTACCGTTTAGATGAGACTCCAATAATTAAAGCTGTGTGG AGAGAGGCAAAAGAGTGTCTCTATGTTGAGAGTGATGATG AGTCTCCTCTTTACAAAGGTGTTTTTTGGTATAAGAACAA GTAA
ATGGGAGTCAATGGTGAAAATAGTAGACTTGATCGAGCAC CACACACCACGCCATCATTCACACTAAGCCAACTCAAGAA AGCCATTCCACCCCATTGCTTCAACCGTTCTCTTCTCCGA TCCTTCTCTTATCTCCTTCGAGACCTTTTTTTCGCCTCTT TGTTCTACTACGTAGCAACTTCTTACTACCACCTTTTCCC TCAACCACTCTTATACTTTGCTTGGCCACTTTATTGGGTC TCCCAAGGCTGCATTTTATTCGGCTTAGGGCTCATTGGTC ATGAGTGTGGTCACCATGCCTTTAGTGACTACAAATGGGT TGATGACATGGTTGGTTTCGTTATTCACTCTGCTTTTCTT CTCCCATACTTTTCGTTTAAGTATAGTCACCGACGCCACC ATTCAAACACTGGCTCCATTGACCGCGATGAAGCCTTTGT TCCAAAGACGAAATCTCAAATGCCATGGTTCTCTAAATAC
CSFAD2C TTGAACAATCCACTAGGGAGAGTCCTAACACTTGGTTTCT
TTTTAACCATTGGTTGGCCTTTGTACTTAGCTTGCAATAT
(SEQ ID No deltal2 FN1_14553312 1 ATTAGGTAGACCATATGACCGTTTCGCTTGTCATTACGAT 41) CCTTACTCTC C AAT AT AC T C AAAAAATG AAAGG C T T C AAA
TATTGATTTCAGATATTGGTGTTTTCATCACCACATTGGT GTTACACCAACTTGTCTTAGCCAAAGGATTGAGTTGGGTT TTGTTCGTGCATGGGATACCATTGCTAATAGTAGGTGTCT TGCTAGTTTTGACCACTTATTTACAACACACTCACCCTGC ATTGCCACACTATGACTCGTCCGAATGGGATTGGTTGAGA GGTGCTTTGTCAACCGTTGATCGAGATTTTGGAGTTCTCA ATAGTATTTTTCATAACGTTTCAAACACTCATGTGTTGCA TCATTTATTCCC C AAAAT AC C AC AT T AC AATG C AAT AG AA GCAACAAAAGCTGTGAAGCCAATATTAGGAGAGTACTATT GTTTAGATGAGACTTCAATAATTAAGGCTATGTGGCGAGA GGCCAAAGAATGTCTTTACGTTGAATCAGATGATGAATCT TCGAAAAAAGGTGTTCTTTGGTACAAGAACAAACTTTGA
ATGGAAGTTGTAGATGACCAATATAGTAACCTTGTTAGGC
GAGCACCACACACCGAACCACCATTCACGCTAAGCGAAAT CAAGAAAGCCATTCCACCCCATTGCTTCAAACGCTCTCTT CTCCGCTCCTTCTCTTATCTCCTTCAAGACCTTTTCTTAG TCTCTTTACTCTACTACATAGCAACATCTTACTTCCACCT TCTTCCTCATTGCCCATTTTCATACTTAGCTTGGCCCCTT TATTGGATCTCCCAAGGCTGCATCTCATTTGGTATTTGGG TCATTGCTCATGAGTGTGGCCACCATGCTTTTAGTGATCA CCAATGGGTGGATGACACCGTTGGTTTCGTCCTTCATTCC GCTCTTCTCTTCCCATATTTCTCTTGGAAGTATAGTCACC GTCGCCACCACACCAACACTGGCTCCATGGAGCGCGATGA AGTGTGTGTCCCAAAGCCGAAATCTCAAATGTCATGGCTC TACAAATACTTGAACAATCCATTAGGGAGAGTCCTAAGAC
CSFAD2D TTAGTGTTACATTGTTCCTTGGTTGGCCTCTTTACTTAGG (SEQ ID No deltal2 FN1_ .14570259 1 GTTCAATGTATCAGGTAGATCATATAACCGTTTCGCTTGT
CATTTTGATCCTTACTCCCCAATCTTCACAAAAAGGGAAA
42) GGCTTCAAGTATTAATTTCAAATTTTGGTGTTTTAATTAC
TATATTTGTATTGTACCAACTCAGCTCAACCAGAGGGTTG AGCTGGGTTGTATTCGTGTACGGGGTGCCATTGCTTATAG TCAATGGCACCATTTCTTTGATGACATATTTGCATCACAC TCACCTTGCATTGCCTCACTATGACTCGTCCGAATGGGAT TGGTTAAGGGGTGCTTTGTCAACAGTCGATCGAGACTATG GAGTTTTCAATAGAATTTTTCATAATGTTACAGACACTCA TGTATTGCACCATTTATTCTCAACAATACCTCATTACAAT G C AATGG AAG C C AC C AAAG C T AT T AAG C C AAT AT TGGG AG AGTACTATTGTTTCGATGAGACTTCGATAATTAAAGCTAT GTGGAGAGAGATTAAGGAGTGTGTCTATGTTGAACCAGAT GATGAATCTTCTTCTAATAAAGGTGTTTTAATGGTATAAG AACAAGTTCTAA
ATGGGAACTGAAGGTGGCCAATATAGTAGAGTTGTGAGAG C AC C AC AC AC C AAAC C AC C ATT C AC ACT AAGC C AAAT C AA GAAATCCATTCCGCCCCATTGCTTCAACCGCTCTCTTCTC CGTTCCTTCTCTTATCTCCTTCGAGACCTTTTTTTCGCCT CTTTATTCTACTACGTAGCAACCTCTTACTTACACCTTCT CCCACACCCACTTTTGTACATGGCTTGGCCACTTTACTGG ATCTCCCAAGGCTGCATTTGTTTCGGTATTTGGATCATTG CTCACGAGTGCGGTCACCATGCTTTTAGTGACCACCAATG GGTGGATGACACTCTTGGCTTTATCTTCCACTCTGCTCTT CTCGTCCCATACTTCTCATGGAAGTATAGTCACCGTCGCC ACCATTCCAACACCGGCTCTATTGAGCGCGATGAAGTGAT TGTTCCAAAGAGAAAATCACAAATGCCATGGCATTACAAA T AC C T C AAC AAT T C AT T AGGG AG AT T C T T AAGG C T TGGT C
CSFAD2E TTACCGTGATTTTCGGTTGGCCTTTGTATGTGTGTTTCAA (SEQ ID No deltal2 FN1_ .14530645 1 TGCATTAGGTAGACCATATGATCGTTTCGCTTGTCATTTT
GATCCTTACTCTCCAATCTACTCAAAAAGCGAAAGGCTTC
43) ATATACTAATTTCAGATATTGGTGTTTTAATTACCATATT
TTTATTGTACCAACTCAGCTCAGTTAAAGGGTTGAGTTGG GTTGTGATCACGTACGGGATGCCATTACTAGTAGTAAATA GCATCCTTGCGGTGATCACATACTTGAATCACACTCACCT TGCATTGCCACATTATGACTCGTCGGAATGGGATTGGTTT AGGGGTGCTTTGTCAACGGTTGATCGAGATTTCGGAGTTC TCAATGGGGTTTTTCATAACATCACAAACACTCATGTGGT GCACCATTTATTCTCAACAATGCCACATTACAATGCAGTG GAAGCAACCAAAGCTGTGAAGCCAATATTGGGAGAGTATT ATTGTTTTGATGACACTCCGGTAATTAAAGCTATGTGGAG AGAGGTTAAGGAGTGTGTCTATGTTGAGTCAGATGATGAA TCTTCTAATAAAGGTGTTTTATGGTATAAGAACAAGTTCT AG
ATGGGAGCCGGTGGCAAAAATAGTAGACTTGAGCGAGCAC
C AC AC AC C AC AC C AC C ATT C AC ACT AAGC C AACT C AAGAA AGCCATTCCACCCCATTGCTTCAACCGTTCTCTTCTTCGT TCCTTCTCTCATGTCCTTCAAGACCTTTTTTTCGTCTTTT TGTTCTACTACATAGCAACCTCTTACTTCCATCTTCTCCC ACACCCGCTCCAATACTTAGCTTGGCCACTTTATTGGATC TTCCAAGGCAGCATTTTTGCTGGTATTTGGGTCCTTGGTC ATGATTGTGGTCACCAAGCTTTCAGTGACCACCAATGGGT GGATGACACTGTTGGCTTTGTCCTCCACTCCGCTCTTCTC TTCCCATACTTCTCTTTTAAGTATAGTCATCGTCGCCATC ATTCAAACATCGGCTCCCTTGAACATGATCAATTGTTTGT
CSFAD2F
SAT3 _scaffold7 TCCAGTCCCCGAATCTCAAATCGCATGGCTCTACAAACAT (SEQ ID No deltal2 1 T AC T TGG AC AAT C C AC T AGG AAG AG C C C T AAAG C T T T C T A
1447 TTATAGTGTTCCTTGGTTCTCCTTTGTACTTAGGTTTCAA 44) TCTTACAGGCAAACAATATGATCGTTCTGCATGTCATTAT
GATCCTTACTCTCCACTCTACTCAAAAAGTGAAAGGCTTC ATATATTGATTTCAGATATCGGTGTTTTCATCACCACATT GGTGTTATACCAGCTTGGCTCGACTAAAGGGTTGAGTTGG CTTGTGTTCATGTATGGGGTGCCATTGTTTACAGGGAATA GCATCCTTGTGACAATCGCATACTTGAATCATACTCACTC TTCATTGCCTCATTATGACTCGTCAGAGTGGGATTGGTTG AAAGGAGCATTGTCAACAATTGATCGAAACTATGGATCAA T T C T C AAT AGGGT T TTCCATCACCT T AC AG ATG C T CAT AT GGCACACCATTTATTCGCAACAATACCTCATTACCATGCA AATG AAG C C AC C AG AG C T AT CAAAC CCATATTGGGA
ATGGGTGCCGGTGGTCGAATGAATGTTCCTCCAGGCTCAA AAAAATCAGAGGCCGAAAGCCTTAAACGAGTTCCACACAC AAAACCACCATTCACACTTGGCGAAATCAAGAAAGCCATT CCACCCCATTGTTTCCAGCGCTCTGTTGTTCGCTCATTCT CTTATGTCGTTTATGACCTTACCATTGCTGCCATCCTTTA CTATATTGCTACTCGTTACATCCCCCTCCTCCCACACCCT CTGTCTTACCTGGCTTGGCCCATTTATGGGTTCATCCAGG GTTGTGTCCTAACTGGTGTTTGGGTCATAGCCCACGAGTG TGGCCACCACGCCTTTAGTGACCACCAATGGCTTGACGAT ACCGTGGGCTTAGTCCTTCACTCTTTCCTTCTCGTCCCCT ACTTTTCATGGAAATACAGCCACCGTCGCCACCATTCCAA CACAGGCTCTCTTGACAAAGATGAAGTCTTTGTTCCCAAG AAAAAGTCTGCCATGAAATGGTACTCTAAATACCTCAACA
CSFAD2G ATCCCCCTGGCAGATTCCTCACTCTAACAATCACTCTCAC (SEQ ID No deltal2 FN1_ 14504247 1 TCTGGGCTGGCCTCTTTACTTGGCCTTCAATGTCTCGGGC
CGGCCCTATGACCGTTTTGCATGCCACTTCGATCCATACG
45) GCCCAATCTACTCGGACCGTGAGCGGGCCCAGATATACCT
ATCTGATGTGGGCATTCTCGCAATGTGTTTCGGCCTTTAC AAGCTGGCTATGGCAAATGGGCTTGCTTGGGTTTTATGCG TGTATGGAGTCCCATTGTTGGTGGTGAATGGGTTTTTGGT GCTGATCACTTTCTTGCAACACACTCACCCATCGTTGCCT CATTACGATACATCGGAGTGGGATTGGCTTAGGGGAGCTT TGGCTACAGTGGACAGAGATTACGGTTTGTTGAACAAGGT CTTCCATAACATCACAGACACCCATGTGGCTCACCACTTG TTCTCCACAATGCCTCATTATCATGCCATGGAGGCCACAA AAGCTATCAAGCCAATACTTGGAGAGTACTACCAATTTGA CGGAACACCAGTGTACAAAGCCATGTGGAGAGAGACTAAG GAATGTGTTTTTGTCGAAGCGGATGAAGGTGAAGGCAAAG GTGTCTTCTGGTACAACAAGCTTCGGGATTGA
ATGACAGAATCACATGCTTCGGAGGAAATGGCGAGAGAAG
AAAAAGGTGACT AC C C C ATT AAGGTGGC AAATGGGAT C CG AAACCAAAACGGCGATTTCGATCTGAGTGATCCTCCACCG T T T AAG AT AG C TG AG AT C CG AG CCGCCATTCC T AAG C AT T GTTGGGTTAAGAATCCATGGCGCTCACTCAGCTATGTTTT CAGAGATCTCTTTATCATTTTTGCATTGGCCTTTGCCGCT TTCTATTCCGATACTTGGGTCGTTTGGCCATTTTACTGGG CTGCTCAAGGAACCATGTTCTGGGCTCTCTTCGTTCTCGG CCACGATTGTGGCCATGGAAGCTTTTCAAACAGTCCTGAG CTGAATAGCGCTGTGGGTCATATTCTGCATTCTGCAATCC TTGTACCTTACAATGGATGGAGAATTAGCCATAGAACTCA T CAT C AAAAC C ATGGC C ATGTTGAGAATGACGAGT CATGG GTTCCGTTGACTGAGAAGATGTACAAACAGTTGGATGAGA
CSFAD3A AAAC AAAG AGG C TG AG AT T C AAAGT CCCATTTCCCTTATT
TGCATACCCTTTTTATCTGTGGAATAGAAGTCCAGGAAAA
(SEQ ID No delta 15 FN1_13245263 8 GAAGGCTCTCATTTCAATCCTTACAGCAAATTATTTACTC 46) CAAGTGAGAGAAACCAAATAATAACTTCAACGGTTTGCTG
GTCAACAATGGCTGCTTTGCTTGTCTGTTTGTCCTTCATA GTAGGTCCTGTTCAAGTTCTCATGCTATATGTTGTTCCTT ATTGGATATTTGTGATGTGGCTAGACATTGTCACTTACTT GCATCACCATGGTTATGAGCAAAAACTCCCTTGGTACCGG GGCAAGGAATGGAGTTACCTAAGGGGAGGGCTAACAACAG TAGAC CGTGACT ATGGAAT ATTT AAC AAT AT C C AC CATGA CATTGGAACTCATGTTATACACCATCTCTTCCCTCAAATC CCACACTACCATCTTGTGGAAGCTACCAAGGCAGCCAAGC CAGTGCTCGGAAAGTATTACAGGGAGCCTAGAAAGTCAGG GCCAATTCCAGTCCACTTGATCGAGAATCTAGTTAAGAGC ATCAGCCAGGACCACTATGTGAGTGACAATGGCGAAGTAG TATACTACCAGACAGACCCAGAACTTAATAATAATAATAA TAAAAAAATATCTGAGGCCAAGCAAATGTAG
ATGGCGAGTTGGGTTTTGTCAGAATGTGGATTAAAGCCAC TCCCTCAAAATTTTCCTCGACCCAGAACAGGGATTACCTC AACCAACCCAACAACAAAGACTCGGTTTTTGAGTTCTAAC AAG AG C T CGG CGG AT C T TAG AT T C C C AAAGGTG AAT T T C T CAACTGGGTTTTTGAAAAGGAGGAGTTTTGAGGTGAGAGT GAGCGCCCCATTGAAGGTTGCTTTTGTAGAAGAGGAAGAC AGAGGAGAGAGAGTAGAGGAAATCGTTAATGGAGTTGAAG AAG AAG AAG AAG AGGG AAT C AAAT T TG AT CCTGGCTCGGC TCCACCTTTCAAATTGGCTGATATTCGGGCTGCTATTCCA AAACATTGTTGGGTTAAGGATCCATGGAAGTCTATGAGCT ATGTGGTGAGAGATGTGGCTATCATATTTGGGTTGGCTGC GGCTGCTGCTTCTATTAACAACTGGGTTGTTTGGCCTTTG TACTGGGCTGCTCAGGGGACTATGTTTTGGGCTCTATTTG TTCTTGGTCATGACTGTGGCCATGGAAGCTTTTCAAACGA TCATAAGCTAAACAGTGTAGTTGGGCATCTCTTGCATTCC
CSFAD3B TCAATTCTTGTACCTTATCATGGATGGAAAACTAGCCATA
AAAC C CAT C AC C AAAAC C ATGGAC ATGTTGAGAATGATGA
(SEQ ID No delta 15 FN1_14584234 8 ATCATGGCATCCGTTACCTGAAAGAATTTACAGGAAACTG 47) GATAACATCACAAAAAGTTTGAGATTTACTCTACCATTTC
CAATGCTTGCTTATCCTTTCTACCTTTGGGGAAGAAGTCC AGGAAAGGCTGGTTCTCATTTTCATCCAAATAGTGACTTG TTTGTTCCAAGTGAGAAGAAAGATGTGATCACTTCCACTT TATGTTGGACAGCTATGGCTGCTATACTTGTTGGTTTGGG CTTTGTGATGGGTCCTATTCAATTGCTTAAGCTCTATGGC ATTCCTTATTGGGTTTTTGTCATGTGGCTGGATTTAGTGA CATACTTGCATCACCATGGCCATGAAGAAAAATTACCATG GTACCGCGGAAAGGAATGGAGTTACTTAAGAGGAGGGCTC ACGACACTTGATCGCGATTATGGAGTGATTAACAACATTC ATCATGATATTGGAACTCATGTAATCCACCATCTTTTCCC TCAAATTCCTCACTACCACTTGGTGGAAGCAACCGAGGCA GCTAAACCAGTGATAGGGAAATACTACAGAGAGCCGAAGA AATCGGGTCCTCTACCGTTTCACTTGATAGGTGCTTTGAT TAGAAGCTTGAAACAAGATCACTATGTTAGTGACACTGGT GATGTTGTGTACTACAAAACTGATCCTGATCTTAAGTGA
ATGGCGACTTGGGTCTTATCAGAATGTGGCGTAAAACCTC
TTCTTAGAGTCTACCCTCAACCCAGAACCGGAATGTTGTT GAAGCCTTCCATCCCGTCGAGTCTTAGGACATTGCCGGTC TGTAAGAGTAGCCAATTGGGTTTCTCATTGTCTTCCTCAA GTGGGTTTAGGGGGCAGAATTGGAAACTTAATGTGAGTGC TCCATTAAGAGTCTCTGATGTTGGTGAAGAAGATAATGAG AAGAGGGT AGTGGAAGATGAAAGTGGATT CGAC CCTGGTG CGCCGCCTCCATTTAAGTTGGCTGATATTAGAGCAGCCAT TCCTAAACACTGTTGGATTAAGGACCCATGGAGATCTATG AGCTATGTTTTGAGGGACGTTGTTGTCGTTTTTGGTATGG CGGCTGCGGCTGCTTATTTAAACAACTGGGCCGTTTGGCC TCTGTACTGGATTGCTCAAGGAACCATGTTCTGGGCTCTT TTTGTTCTTGGCCACGACTGTGGTCATGGAAGTTTTTCTA ATAACGCAAACCTTAATAGCGTGGTGGGTCATATTCTTCA TTCTTCAATCCTTGTCCCATACCATGGATGGAGAATAAGC
CSFAD3C C AC AGGACT C AT CAT C AGAAC C ATGGAC AC ATTGAAAACG
SAT3 _scaffoldl ATGAATCTTGGCATCCGCTATCTGAGAAAATCTACAATAG (SEQ ID No delta 15 8
4620 CTTGGATAAGGGTACCAAATTGCTGAGGTTTACCTTGCCT 48) TTCCCTATGCTTGCTTACCCTTTTTATCTGTGGAGTCGAA
GTCCCGGAAAGAAGGGTTCTCATTTTGATCCAAACAGTGA CTTGTTTGTTGAGAGTGAAAGGAAAGACATCATCACCTCC ACTGCATGTTGGACTGCCATGGTTGCTCTGCTCGGTGTGC TCTCCTTTGTAATGGGTCCTGTTCAACTCATTAAGCTCTA TATTGTTCCCTACTGGATTTTTGTCATGTGGTTGGACTTG GTCACTTACTTGCATCATCATGGCCACGAGGACAAACTTC CATGGTATCGTGGAAAGGAGTGGAGTTATCTAAGAGGTGG ACTAACTACTCTTGACCGTGATTATGGATGGATCAATAAC ATTCACCATGATATTGGAACTCATGTTATACATCATCTCT TCCCTCAAATCCCACATTATCACTTAGTGGAAGCAACAGA GGCAGCTAGACCTGTATTTGGTAAATACTATAAGGAGCCA AATAAATCTGGACCTTTACCATTTCACTTGCTTGGAAGTT TAATAAGAAGCATGAAAAAGGATCACTATGTTAGTGATAC AGGGGATGTTGTTTACTACCAAACTGATCCAAAGCTATAT GGGCCTTCTGAATCTGACTCTTCCACATGA
ATGGAAGCCGAGAAGAAGTACATTACCACTGAGGAACTGA AGGAGCACAACAAGGCAGGGGATCTGTGGATCTCTATTCA GGGTAAGGTTTATAATGTATCAGAATGGCTTAAGGATCAC CCTGGTGGGGATGCGCCTCTACTAAGTTTCGCTGGCAGAG ATGTTACTGATGCTTTTATTGCATACCATCCCGGTACTGC GTGGAAGCATCTTGATCAGTTTTTCACCGGTTATTATGTC AAAG AT TTCGTGGTCT C AG AG AT T T C C AAGG AT T AT AGG A GAATTTCAAACGAGTTTACCAAACTGGGGTTGTTTGAAAA GAAAGGTCATGGGATTTTCTACACTCTCACATGTGTTGCT ATAATGCTTTCCATGGTTGTTTATGGTGTTGTGAAATCTG AGAGCATTTTAGTCCATATGGGTTGTGCTGTCGTATTGGG GATGCTTTGGATTCAAAGCGCTTATGTTGGGCATGATTCT GGGCATTATCAGGTCATGTTAAGCCCTGGATATAACAAAT TTGCTCAGCTTTTGGCTGGGAATTGTCTTACTGGGATTAG CATTGCTTGGTGGAAATGGACTCATAATGCCCATCATATT
CSD8 GCTTGCAACAGCCTTGATTATGATCCAGATCTTCAACACA delta6/ TTCCCGTCTTTGCAGTGTCTTCTAAATTCTTCAAGTCCAT
(SEQ ID No FN1_ 14584615 1
8 TACTTCACGCTTTTATGGAAGGGAGTTGACATTCGATTCA 49) TTGTCTAGGTTCATGATCAGTTACCAACATTGGACATATT
ATCCAGTTATGTGTGTTGCCAGGGTTAACTTGTTTGTACA GACACTATTGTTGCTCTTGTCAAAAAGACCTATCCCAAAT AGAGCTTTGAACATAATGGGAACCCTTGTGTTCTGGACTT GGTTCCCTCTCCTTGTTTCATGTTTGCCCACCTGGACAGA GAGGACGATGTTTGTGCTCTTGAGCTTTGCAGTCACATCA GTTCAACATGTTCAATTCACTTTGAACCATTTCTCAGCAG ATGTTTATCTCGGTCACCCTGGTGGGAATGATTGGTTTGA GAAGCAGGCTGCTGGGACTATAGATATTTCATGCTCACCT TGGATGGATTGGTTCTATGGAGGGCTGCAGTTTCAGCTTG AGCATCATTTGTTCCCACGCATGCCTCGTTGCCAATTGAG GAACATTTCTCCTATTGTTGTTGACCTTTGCAAGAAGCAC AATTTGCCTTACAGGAGCTTATCATTCTGGGACGCCAATG TTTCCACCCTTAAAACTCTCAGGACTGCTGCCCTTCAAGC ACGAGATCTCACCAACCCTATCCCCAAGAACTTGGTCTGG GAAGCTGTTAATACTCATGGCTGA
ATGGCGGATTCAACAAAATACATTACCCAAGAAGAGCTTA
AAC AAC AC AAC AAAC ATGGAGAT CT ATGGAT CT C AAT C C A AGGCAAAATCTACAACGTCTCAGATTGGGCCAAAGACCAT CCCGGCGGCGAACACCCATTACTAAATCTCGCCGGTCAAG ACGTAACAGAAGCTTTCATAGCTTACCATCCAAGGTCGGC ATGGCAATACATGGACCAATTCTTTACTGGGTTTCATCTC AAAGATCACTCCTTTACCGAGGTTTCAAAGGATTACAGAA AACTCGTCAATGAATTTACCAAAATGGGTTTGTTTGAGAA GAAAGGACATGGGGTTTGCTTCTCATTCTTCTTCATTACA TTGTTTTTTATACTCAGTGTTTATGGTGTTATGTGTTCTG ATAGTATTTTGGTTCATTTCTGTTCTGGATGTTTATTAGG GTTTTTATGGATTCAAAGTGGTTGGTTAGGTCATGATTCA GGTCATTAT C AAAT C ATG AC T AAT C AAT T T TAT AAC AG AT TTGTTCAGATCTTAACTGGGAATTGTTTAGCTGGGATTAG TATTGCTTGGTGGAAATGGAATCACAATGCTCATCATTTA
CSD6 GCTTGTAATAGTCTTGAATTTGATCCTGATCTTCAACACA delta6/ TGCCATTCTTTGTTGTATCATCAAAATTCTTTGATTCACT
(SEQ ID No FN1 2469249
8 CACGTCACATTTCTATGGCAGAAAATTGAGTTTTGATTCA 50) ATCACAAGATCCTTAGTTAGTTACCAACATTGGACATTTT
ACCCTGTCATGTGTTTAGCTAGGCTTAATCTCTTCGCTCA ATCATTTGCTTTGTTATTATC T AAG AG AAAAGT T CAT AAT AGAGGTCAAGAGATTCTTGGGTTACTTGTGTTTTGGATTT GGTATCCACTTTTGGTTTCATATTTACCAAATTGGAGTGA AAGGGTTATGTTTGTCATGGCAAGTTTTTCAGTAACTGGT ATCCAACATGTTCAATTTTGTTTGAACCATTTCTCAGCTA ATGTTTATGTTGGTTTGCCAAGTAGTTATGATTGGTTTGA GAAGCAAACAAAAGGGACACTTAATATCCTTTGTCCTTCT TGGATGGATTGGTTTCATGGCGGTTTGCAGTTTCAGATTG AACACCATTTGTTTCCAAGATTGCCCAAATCACAACTGAG GAAAATTTCTCCCTTTGTTTATGAACTGTGTAAGAAGCAT AATTTGCCTTATAATTGTGCTTCGTTTTGGGAAGCTAATG TAATGACAGTGAATACTCTTAAGACCGCGGCTTTGCAGGC TCGCGATCTTACTAATCCTGTTCCGAAGAACTTGGTTTGG GAAGCTGTCAATACTCATGGATAG
Table 2. Fatty acid content in seed oil of hemp csfad2a and csfad3a mutants; Table 2. Fatty acid content in seed oil of hemp csfctd2ct and csfctd3ct mutants.
Mutant name molar percent of total hempseed fatty acids
16:0 18:0 18: 1Δ 8: 2Δ9,ΐ2 8: 3Δ6,9,ΐ2 18: 3Δ9,12,15 18; Δ6,9,12,15 8: 2Δ6,9 8; 2Δ9,ΐ5
Oleic LA GLA ALA SDA
csfad2a-l 4.34 1.84 77.35 1.96 0.56 3.39 1.13 5.33 2.17
(BC2F1) ± 0.24 ± 0.31 ± 1.25 ± 0.51 ± 0.11 ± 0.39 ± 0.12 ± 0.30 ± 0.24 csfad2a-l 5.87 2.12 15.04 48.46 5.46 18.79 2.26 0 0
(BC2F1], hetero ± 1.50 ± 0.44 ± 3.05 ± 3.02 ± 0.96 ± 2.72 ± 0.64
segregating WT 5.74 2.38 8.43 55.94 4.33 19.76 1.60 0 0
± 0.54 ± 0.47 ± 1.31 ± 2.93 ± 0.66 ± 2.70 ± 0.30 csfad2a-2 4.24 2.52 69.48 5.11 1.10 10.05 1.67 2.99 0.76
(BC1F1) ± 0.48 ± 0.48 ± 2.85 ± 1.08 ± 0.29 ± 0.88 ± 0.12 ± 0.68 ± 0.23 csfad2a-2 5.77 2.89 12.74 46.02 4.01 25.12 1.96 0 0
(BC1F1], hetero ± 0.43 ± 0.38 ± 0.93 ± 2.06 ± 0.39 ± 1.34 ± 0.19
segregating WT 5.91 2.95 7.29 49.49 4.36 26.45 2.11 0 0
± 0.34 ± 0.44 ± 0.55 ± 1.70 ± 0.23 ± 1.88 ± 0.28 csfad3a-l 6.37 1.50 7.73 75.20 7.20 0.69 0.00 0 0
4.99 1.69 8.60 69.32 3.75 9.94 0.54
1.03 ± 0.19 ± 1.32 ± 1.85 ± 0.40 ± 2.21 ± 0.11
Mutant name molar percent of total hempseed fatty acids
16:0 18:0 8- Δ9 8:2Δ9,ΐ2 8:3Δ6,9,ΐ2 18;3Δ9,12,15 18:4
Δ6'
9'
12·
15
8;2Δ9,ΐ5
Oleic LA GLA ALA SDA
segregating WT 4.95 1.63 6.94 55.35 3.73 24.74 1.63 0 0 csfad3a-l 6.04 2.44 7.69 74.71 6.99 0.66 0.08 0 0
(BC3F1) ±0.70 ±0.49 ±1.23 ±2.85 ±2.21 ±0.13 ±0.04
csfad3a-l 5.18 2.72 8.77 63.67 5.64 11.79 0.99 0 0
(BC3F1], hetero ± 0.51 ± 0.58 ± 1.20 ± 1.38 ± 1.60 ± 0.86 ± 0.27
segregating WT 4.93 2.50 8.45 53.70 4.98 22.23 1.92 0 0
±0.53 ±0.58 ±1.69 ±3.04 ±1.49 ±2.33 ±0.63 csfad3a-2 4.17 2.12 9.32 77.11 4.35 1.10 0.07 0 0
(BC2F1) ± 0.25 ± 0.33 ± 1.24 ± 1.26 ± 0.62 ± 0.21 ± 0.04
csfad3a-2 4.56 2.37 10.11 65.06 4.00 11.39 0.80 0 0
(BC2F1], hetero ± 0.43 ± 0.37 ± 1.27 ± 4.18 ± 0.68 ±4.32 ± 0.34
segregating WT 5.23 2.50 9.36 59.63 3.78 16.76 1.24 0 0
±0.57 ±0.87 ±0.07 ±5.19 ±0.48 ±4.33 ±0.53
6.88 2.10 8.14 51.42 8.95 16.95 2.58
0.92 ± 0.46 ± 1.87 ± 2.25 ± 1.04 ± 2.71 ± 0.54 segregating WT 6.78 2.38 8.28 54.65 5.39 17.90 1.66 0 0
± 0.85 ± 0.43 ± 1.67 ± 2.07 ± 0.78 ± 2.98 ± 0.26 csfad2a-3 x
csfad3a-l 6.67 2.85 8.74 66.88 11.33 0.48 0.10 0.24
(BC4F3), homo ± 1.76 ± 1.38 ± 0.16 ± 3.66 ± 0.38 ± 0.09 ± 0.01 ± 0.02 segregating
csfad2a-3 7.51 1.64 6.25 48.23 8.38 21.65 3.14 0.14
(BC4F3), homo ± 0.62 ± 0.47 ± 1.08 ± 2.22 ± 0.55 ± 2.49 ± 0.44 ± 0.03 segregating
csfad3a-l 6.48 1.84 7.93 74.39 5.79 0.63 0.06 0.13
(BC4F3), homo ± 0.95 ± 0.56 ± 2.52 ± 3.41 ± 2.53 ± 0.15 ± 0.04 ± 0.05
BC4F3 WT 7.20 1.37 6.63 52.24 3.25 24.75 1.39 0.09
± 0.61 ± 0.17 ± 1.24 ± 5.65 ± 1.87 ± 4.49 ± 0.88 ± 0.03
Table 3. Fatty acid content in leaves of hemp csfad2a and csfad3a desaturase mutants;
Table 3. Fatty acid content in leaves of hemp csfad2a and csfad3a desaturase mutants.
Mutant name
molar percent of total leaf fatty acids
16:0 18:0 18:1 Δ9 18:2 Δ9,12 18:3 Δ6,9,12 18:3 Δ9,12,15 18:4 Δ6,9,12,15 18:2 Δ6,9 18:2'
Table 4. Fatty acid composition of the yeast transformants fed with fatty acid substrates.
Table 4. Fatty acid composition of the yeast transformants fed with fatty acid substrates. As controls, yeast cells transformed with empty pESC-TRP vector were subjected to similar conditions. Percent conversion was calculated as product/(substrate + product) * 100 at the assay endpoint. Each value is the mean ± SD from three independent experiments.
Pesc
substrate product
substrate endpoint product endpoint % conversion
mol % total fatty acids mol % total fatty acids
16:1A9* 38.3 ±0.6
18:1A9* 45.4 ±0.6
18.2Δ9,12
10.9 ±0.3
18.3Δ6,9,12
14.1 ±0.5
20;1Δ11 0.8 ±0.05
pCSFAD2A
substrate product
substrate endpoint product endpoint % conversion
mol % total fatty acids mol % total fatty acids
Δ9*
16:1 19.3 ±0.7 16:2 Δ'9.12 14.8 ±0.6 43 ± 1.4
18:1' 12.3 ±0.5 18:2Δ 31.7 ±0.4 72 ±0.8
Δ11 Δ11.14
20:1 0.3 ±0.01 20:1 0.5 ±0.01 62 ±0.3
pCSFAD3A
substrate product
substrate endpoint product endpoint % conversion
mol % total fatty acids mol % total fatty acids
16: 38.2 ±0.3 16:2' 1.6 ±0.04 3.9 ±0.1
Δ9* Δ9.15
18:1 38.0 ±0.8 18:2' 3.6 ±0.1 8.7 ±0.4
18:2 Δ'9.12 4.2 ±0.2 18:3 Δ9.12.15 5.4 ±0.2 56.3 ±0.5
18:3Δ 10.7 ±0.8 18:4Δ 3.2 ±0.2 23.1 ±0.4
Δ11
20:1 0.59 ±0.6 * Endogenous substrate; no fatty acid added to medium
Material and Methods cDNA library construction and EST preparation from developing seeds of Cannabis sativa
Total fatty acid analysis revealed that during seed development the maximum fold increase in fatty acid content occurs at the Upturned (U) stage depicted in Figure 1A. Upturned stage tissue was ground to a fine powder in liquid nitrogen and RNA extracted using the RNAeasy kit (Qiagen, Hilden, Germany). Starting with 1.2 μg total RNA, first and second strand cDNA synthesis was carried out with the Creator™ SMART™ cDNA Library Construction Kit (Clontech, Mountain View, CA) according to the manufacturer's protocol. Twenty cycles of amplification were used for Long Distance-PCR during second strand synthesis. The resulting cDNA samples were treated with Proteinase K, digested with SfH and size- fractionated on Chroma-spin 400 columns (Clontech) according to the manufacturer's protocol. Pooled cDNA-containing fractions were ligated into pDNR-LIB vector (Clontech) and transformed into E. coli TOP10 cells (Invitrogen, Groningen, Netherlands) by electroporation. Transformants were recovered into 96-well plates and insert sizes determined by colony PCR using M13 primers.
From the cDNA library, a total of 1852 ESTs were generated through single-pass Sanger sequencing, yielding 1082 unigene sequences. Blast similarity search identified two unigene sequences with homology to FAD2 desaturases. These sequences were used to prepare primers for Random amplification of cDNA ends (RACE). RACE primers for CSFAD2A: 5'- AAAATGGGAGCCGGTGGCCGAAT-3' (SEQ ID No 5) and 5'- G GG CG G A ATTG CTTTCTTG ATTTC GC-3 (SEQ ID No 6)'; RACE primers for CSFAD2B: 5'- GCAGACGATATGACCGTTTCGCTTCTCA-3' (SEQ ID No 7) and 5'- GCGAGTTGGTACAACACGAATGTGGTGA-3' (SEQ ID No 8). In order to obtain FAD3 homologues that are expressed in developing hemp seeds, degenerate primers from a published source (Lee et al., 1998) were used to amplify a short section of the gene from hemp cDNA (sequences of degenerate primers for FAD3 homologues: 5'- AC N CAYCAYCA RAAYCAYGG-3' (SEQ ID No 9) and 5'- CAYTG YTTN CC N C KRTACCA-3' (S EQ ID No 10) and sequences for Arabidopsis FAD3: 5'- GGCGATTCCTAAGCACTGTTG-3' (SEQ ID No 11) and 5'- TCACCAGTGTCGCTGACGTAA-3' (SEQ ID No 12)). The RACE technique was again carried out to obtain full length CSFAD3 cDNA sequence. RACE primers for FAD3: 5'- CACGGCCATGTTGAGAATGACGAG-3' (SEQ ID No 13) and 5'- GGACAAACAGACAAGCAAAGCAGCCA-3' (SEQ ID No 14).
Deep Sequencing the developing hemp seed transcriptome
Embryos were dissected from developing seed at the Torpedo, Upturned and Filled Not Dessicated stages. Following grinding of the respective tissues in liquid nitrogen, 50 mg of finely ground material was homogenized in 1 ml_ Tri-reagent (Ambion®, Life Technologies, Carlsbad, CA) and RNA extracted according to the manufacturer's protocol. RNA samples were treated with TURBO™ DNase (Ambion®) prior to cDNA synthesis. cDNA was prepared with the SMART cDNA Library Construction Kit (Clontech) according to the manufacturer's instructions but using Superscript II Reverse Transcriptase (Invitrogen) for first strand synthesis. The CDSIII/3'PCR primer was modified to: 5'- ATTCTAGATCCRACATGTTTTTTTTTTTTTTTTTTTTVN-3' (SEQ ID No 15) where R = A or G, V = A, C or G; N = A/T or C/G. A total volume of 500 of each second strand reaction was concentrated on AMICON ULTRA 30K columns (Merck Millipore, Billerica, MA), digested with Mme\ (Fermentas/Thermo Fisher Scientific, Burlington, Canada ) and purified with the QIAquick PCR Purification kit (Qiagen, Hilden, Germany). Pyro-sequencing was carried out on three cDNA libraries prepared from dissected embryos at Torpedo, Upturned and Filled Not Desiccated stages at the GenePool genomics facility at the University of Edinburgh on the 454 GS-FLX sequencing platform (Roche Diagnostics, Branford, CT, USA). Raw sequence analysis, contiguous sequence assembly and annotation were performed as described previously (Graham et al., 2010). Abundance of membrane bound and soluble desaturase transcripts were analysed in silico by determining read counts in the three EST libraries. The raw reads were mapped to the reference sequence, which consisted of the open reading frames of the 17 desaturase genes (included in Table 1) with BWA mapping software (Li and Durbin, 2009). The raw read counts were retrieved from the resulting output file for each gene in the libraries and the counts were then normalized to an RPKM (reads per kilobase per million reads) value as an approximation of gene expression.
Quantitative real time PCR
Total RNA from leaves of two week old and four week old hemp plants was extracted with the TRI Reagent Solution (Ambion®). Single-strand cDNA was synthesised from Turbo DNA-free (Ambion®). DNase-treated RNA using Superscript II (Invitrogen) reverse transcriptase with oligo(dT)i6-i8 primer (Invitrogen). The completed first-strand cDNA was then diluted to the concentration of 50 ng μί"1. To estimate the accumulation of CSFAD2A, CSFAD2B and CSFAD3 transcripts, quantitative real time PCR was performed using an ABI Prism 7300 detection system (Applied Biosystems, California, USA) and SYBR Green PCR Master mix (Applied Biosystems) to monitor dsDNA synthesis. The following gene specific
primers were used: 5 TCGGACATAGGGATTTTCATTG3' (SEQ ID No 16) and 5 AACCCAACCTAACCCTTTGG3' (SEQ ID No 17) for CSFAD2A, 5TCAAATCCCACACTACCATCTTGT3' (SEQ ID No 18) and 5TTTCTAGGCTCCCTGTAATACTTTCC3' (SEQ ID No 19) for CSFAD3. All amplification plots were analysed with an Rn threshold of 0.2 to obtain CT (threshold cycle) values. The amount of each transcript was normalised to that of hemp actin-2 gene (hACT2) amplified with primers: 5'GGGTCACACTGTGCCAATCTAC3' (SEQ ID No 20) and 5 CCAGCAAGGTCAAGACGAA3' (SEQ ID No 21) and compared among samples.
PCR efficiency (E) was estimated by LinRegPCR software (Ramakers et al., 2003; Ruijter et al., 2009). Expression ratios of normalised sample A to normalised sample B were then obtained from the equation (1 +Ε)"ΔΔ Γ where ΔΔΟΤ represents ACTA minus ACTB, and E is the PCR reaction efficiency. Normalised transcript levels in young leaves sample (YL) were used as a calibrator for producing all expression ratios. Dissociation curves of the PCR products were analysed using ABI SDS 2.2 software. The experiment was performed in three biological replicates each of which consisted of three technical replicates.
Establishment and screening of an EMS-mutagenized population
Hemp seed (C. sativa L.) of the Finola variety were purchased from the Finola company (http://www.Finola.com), Finland, and grown in controlled glasshouse facilities at the University of York. The seed was treated with 300 mM EMS for 5 hours and then directly sown onto soil-based John Innes Compost No. 2. Mutagenised M1 female plants were out- crossed with male wild type Finola plants to produce a heterozygous M2 screening population. Typically, DNA from four siblings per M2 family was screened by TILLING. Genomic DNA was isolated from leaves of two week-old M2 individuals using the BioSprint 96 DNA Plant isolation kit (Qiagen) according to the manufacturer's protocol. After fluorometric quantification using Hoescht 33258 dye, DNA samples were normalised to 5 ng μί" and pooled four-fold for screening.
A 1 140 bp fragment of CSFAD2A was amplified in a two-step PCR amplification. The first step was carried out with unlabeled primers (5'CCCATTGCTTTAAACGCTCTCTA- ATTCGCT3' (left) (SEQ ID No 22) and 5 ACCCCTAACCACATTAAGCCATACCCCAT3' (right) (SEQ ID No 23) on 12.5 ng pooled gDNA in '\ 0 μ\- volumes. Labeling of the amplified gene fragment with infrared dyes occurred during the second PCR step, where a mixture of labeled and unlabeled primers was used for further amplification and simultaneous labeling using appropriately diluted product from the first PCR step as template (left primer labeled with IRDye 700, right primer labeled with IRDye 800 (MWG, Ebersberg, Germany) ratio
labeled:unlabeled = 3:2; right primer labeled with diluted IRDye 800, ratio labeled:unlabeled = 4: 1).
A 1500 bp fragment of CSFAD3A was also amplified in a two-step PCR reaction using non- labeled gene-specific primers: 5'cgccattcctaagcattgtt3' (left) and 5'atagtggtcctggctgatgc3' (right) in the first step. As for the A12-desaturase fragment, labeling with infrared dyes occurred during the second PCR but using 5'M13-tailed primers: 5TGTAAAACGACGGCCAGTgggctgctcaaggaaccatgttct3' (SEQ ID No 24) (left) and 5'AGGAAACAGCTATGACCATccttggtagcttccacaagatgg3' (right) (SEQ ID No 25) mixed with M13 primers labeled with IRDye 700 and IRDye 800. The ratios of labeled to unlabeled primers were as above for the CSFAD2A fragment. Heteroduplex formation was carried out as described by Till et al. (2006) followed by digestion with CEL I nuclease as described elsewhere (Till et al., 2006). CEL I digested products were purified by isopropanol precipitation and resuspended in formamide-containing buffer, loaded onto polyacrylamide gels and run on the LI-COR 4300 DNA sequencer platform (Till et al., 2004; Till et al., 2006). Cloning and expression of C. sativa CSFAD2A and CSFAD3A in Saccharomyces cerevisiae
Full length open reading frame (ORF) of CSFAD2A (1 154 bp) was amplified by PCR using Phusion Hot Start DNA polymerase (Finnzymes, Espoo, Finland) from Finola wild type genomic DNA with the following primers: 5'ATAGGATCCaaaatgggagccggt3' (SEQ ID No 26) and 5'GCCTCGAGCctaaaacttgtttttgtacc3' (SEQ ID No 27). The amplified product of CSFAD2A was ligated into pESC-TRP yeast expression vector (Stratagene, La Jolla, CA, USA) between Bam \ and Xho\ restriction sites (underlined in primer sequences) under the galactose-inducible GAL1 promoter and transformed to chemocompetent E. coli DH5a.
The coding sequence of CSFAD3A (1 191 bp long) was amplified in PCR using Phusion Hot Start DNA polymerase (Finnzymes) from the Upturned stage of hemp seed embryo cDNA of Finola wild type with the following primers: 5'GGGGAATTCataatgacagaatcacatgc3' (SEQ ID No 28) and 5TAGCGGCCGCATACTACATTTGCTTGGC3' (SEQ ID No 29). CSFAD3A PCR product was ligated into pESC-TRP vector between EcoRI and Notl restriction sites (underlined in primer sequences) under the alternative GAL10 galactose-inducible promoter and transformed to chemocompetent E. coli DH5a cells.
Plasmid constructs were extracted from the small scale bacterial liquid cultures with a NucleoSpin Plasmid mini kit (Macherey-Nagel, Duren, Germany) and the orientation and identity of inserts were confirmed by Sanger DNA sequencing. The S. cerevisiae strain G175 (ScanBi, Alnarp, Sweden) were transformed with pCSFAD2A and pCSFAD3A plasmid
constructs by the lithium acetate method (Gietz and Woods 2002) and selected on Synthetic Dextrose Minimal Medium lacking tryptophan.
For the functional expression of CSFAD2A and CSFAD3A, corresponding yeast transformants were cultivated at 28°C with shaking at 150 rpm in 50 ml_ volume of synthetic minimal medium containing 2% (w/v) raffinose and 1 % Tergitol NP-40 (Sigma, St. Louis, USA). Expression of the transgene was induced by addition of 2% (w/v) galactose to cultures upon reaching OD600 = 0.2-0.3 and further incubation was carried out at 25°C with shaking at 150 rpm for 28 hours. For the studies on CSFAD2A and CSFAD3A desaturase specificities the cultures at the time point of induction were supplemented with exogenous fatty acids (LA, GLA, or 20:1Δ11 eicosenoic acid,) up to 50 μΜ final concentration. Each experiment was performed with at least three replicates, with a pESC-TRP empty vector yeast transformants control cultivated simultaneously. For the FAMEs analyses, the yeast cells were harvested by centrifugation at 4500g for 5 min at 4°C and washed three times with deionized water. Obtained yeast pellets were either stored at -80°C for a short period of time or were directly analyzed for their fatty acids profiles. The corresponding open reading frames (ORF) of the hemp Δ12 desaturase CSFAD2A and Δ15 desaturase CSFAD3 were amplified by PCR using Phusion Hot Start DNA polymerase (Finnzymes) and the following pairs of specific primers: 5'ATAGGATCCaaaatgggagccggt3' (left) (SEQ ID No 30) and 5'GCCTCGAGCctaaaacttqtttttqtacc3' (right) (SEQ ID No 31) for CSFAD2A and 5'GGGGAATTCataatqacaqaatcacatqc3' (left) (SEQ ID No 32) and 5TAGCGGCCGCATACTACATTTGCTTGGC3' (right) (SEQ ID No 33) for CSFAD3. For ligation behind the constitutive GAL1 gene promoter of the yeast expression vector pESC- TRP (Stratagene), the primers for CSFAD2A were extended by a Bam \ and Xho\ restriction site (underlined) and for ligation behind the alternative constitutive GAL10 gene promoter the primers for CSFAD3 were extended by an EcoRI and Λ/ofl restriction site (underlined). The resulting PCR products and the pESC-TRP vector were digested with the corresponding restriction enzymes and ligated. The nucleotide sequence of corresponding inserts was confirmed by sequencing. The S. cerevisiae strain G175 was transformed with these plasmids by the lithium acetate method (Gietz and Woods 2002) and selected on minimal agar plates lacking tryptophan.
For functional expression, cultures were grown at 28°C in the presence of 2% (w/v) raffinose and 1 % (w/v) Tergitol NP-40 (Sigma). Expression of the transgene was induced when OD600 reached 0.2 - 0.3 by supplementing galactose to 2% (w/v). At that time the appropriate fatty acids were added to a final concentration of 50 μΜ. Incubation was carried out at 25°C for four generations (28 hours). For each experiment, an empty pESC-TRP vector-transformed control was cultivated simultaneously. Each experiment was performed with three replicates.
Yeast cells were harvested by centrifugation at 4500g for 5 min at 4°C, and washed three times with deionized water.
Fatty acid analysis
Fatty Acid Methyl Esters (FAMEs) were prepared by direct transmethylation of single seeds or -10 mg oil samples (Browse et al., 1986). FAME content was determined by gas- chromatography with flame ionization detection (GC Trace Ultra, Thermoquest Separation Products, Manchester, UK). A 1 μΙ_ aliquot of FAMEs in hexane was injected into a 3 mm internal diameter FocusLiner containing glass wool (SGE, Milton Keynes, UK) at 230 °C in programmed flow mode with H2 as carrier gas. The H2 flow program was as follows: initial hold 0.3 ml_ min"1 for 0.1 min, then ramped at 5 ml_ min"2 to 0.5 ml_ min"1 for the remainder of the run. The split ratio was maintained at 1 :250 and a gas saver slow of 20 ml_ min"1 was initiated at 1.5 min into the run. Separation was achieved using a narrow-bore cyanopropyl polysilphenylene-siloxane capillary column (SGE BPX70; 10m length x 0.1 mm internal diameter x 0.2 μηι film thickness). FAMEs were separated using the following temperature program: initial hold 150°C 0.1 min, then ramped at 16°C min"1 to 220°C, followed by cool- down to initial conditions at 120°C min"1. The FID was run at 300°C with air, H2 and make-up N2 gases flowing at 350, 35, and 30 ml_ min"1 , respectively. The signal was collected and peaks detected and integrated using ChromQuest 4.2 software (Thermo Electron Corporation, Manchester, UK). FAMEs were identified and quantified relative to the Supelco 37 component FAME mix (Sigma-Aldrich, Gillingham, UK).
Extracts containing FAMEs that did not coelute with standards or whose identity was unclear were concentrated and further derivatized to their 3-pyridylcarbinol esters (Dubois et al. 2006), chromatographed using a longer, thicker-film BPX70 column using He as carrier gas with an extended thermal gradient, and 70 eV electron impact mass spectra generated using a Leco Pegasus IV mass spectrometer running ChromaTof 4.5 software (Leco, Stockport, UK). Under these conditions, retention time order was preserved as per the GC-FID analyses. Mass spectra were interpreted to localise dienoic double bond positions as described by Christie et al. (1987).
Phenotyping for fatty acid content was carried out on single cotyledons dissected from two- days-old seedlings germinated on moist filter paper. The surviving seedlings were planted to soil, grown, genotyped and selected individuals were used for subsequent crosses.
Cold pressing of oil seeds.
Oil pressing was carried out using a small capacity Komet screw press (Model CA 59 G, IBG Monforts, Monchengladbach, Germany), with a 6 mm press nozzle die and a screw speed of 20 rpm. Running temperature was checked with a digital thermometer inserted into the restriction die, with screw-press barrel temperature not exceeding 60°C. After each sample, all press devices were cleaned and dried.
Assessment of oxidative stability of oils.
The oxidative stability of the pressed oils was determined using a Metrohm Rancimat model 743, according to AOCS Official Method Cd 12b-92. Briefly, the induction times (n=4) for portions of oil (3.0 g) were determined at 100, 110 and 120°C and 20 L h" air throughput. Projected shelf life stability was calculated by extrapolation of the relationship between the measured induction time and the temperature (Metrohm Application Bulletin No. 141/3e).
Example 1
Genome mining reveals multiple copies of soluble and membrane bound desaturases in C. sativa
In dicotyledonous oilseeds, storage oil in the form of triacylglycerol (TAG) is synthesized during embryo growth. We isolated mRNA from the Upturned (U) stage of embryo development of the Finola variety since this represents a stage of significant storage oil deposition in dicotyledonous oilseeds (Baud et al., 2002) and used this for cDNA library construction. We initially generated 1893 Expressed Sequence Tags (ESTs) from the upturned U stage cDNA library by conventional Sanger sequencing and a BLASTX similarity search revealed 11 ESTs with homology to desaturase genes. Two of the resulting unigenes contained an incomplete Open Reading Frame (ORF) giving a predicted amino acid sequence with homology to the A12-desaturases. Two homologous full-length cDNA sequences were obtained by RACE PCR, and the corresponding genes were named CSFAD2A and CSFAD2B (Figure 12;Table 1). We also cloned a FAD3 desaturase fragment by PCR amplification using degenerate primers (Lee et al., 1998) on the upturned U stage cDNA. RACE PCR produced a 1188 bp full-length cDNA sequence that we name CSFAD3A (Figure 14; Table 1). We used CSFAD2A, CSFAD2B, CSFAD3A and various other previously characterized plant membrane bound Δ12- (FAD2), Δ15- (FAD3) and A6-/A8-sphingo-lipid, as well as the soluble A9-Stearoyl-ACP-desaturases as queries to retrieve additional membrane bound and soluble desaturase sequences from the genome sequences of two different varieties of
C. sativa, Purple Kush (canSat3) and Finola (Finolal) (van Bakel et al., 2011). This resulted in the identification of putative sequences for seven FAD2 (designated CSFAD2A to CSFAD2G) three FAD3 (designated CSFAD3A, CSFAD3B and CSFAD3C), two genes with homology to both Δδ-sphingo-lipid desaturases and Δθ-fatty acid desaturases (designated CSD8 and CSD6) and five A9-Stearoyl-ACP-desaturases (designated CSACPD-A to CSACPD-E) in the more complete genome of the Purple Kush variety. For all but CSFAD2F and CSFAD3C orthologous sequences were also identified in the Finola genome (Table 1), which probably reflects the draft nature of this genome.
Example 2 Deep sequencing of the developing seed transcriptome identifies candidate desaturases involved in modifying fatty acid composition of seed oil
EST libraries were prepared by deep sequencing cDNA prepared from RNA isolated from Torpedo (T), Upturned (U) and Filled Not Desiccated (FND) stages of Finola embryo development as depicted in Figure 1A. Raw reads were mapped to the open reading frames of seventeen putative desaturase genes as detailed in Table 1. Three of the five plastidial stearoyl-ACP desaturases are expressed with CSSACPD-C transcripts being the most abundant, three of the seven CSFAD2 genes are expressed with CSFAD2A being the highest, all three of the CSFAD3 genes are expressed but of these only CSFAD3A increases during embryo development with CSFAD3B and CSFAD3C present at very low levels. CSD8 and CSD6 show similar low levels of expression up until the Upturned stage with transcripts of both genes being absent at the later FND stage (Fig. 1A). Based on homology and expression analysis lead candidates for each of the desaturation steps shown in Figure 1 B can be identified as CSACPD-C, CSFAD2A, CSFAD3A and CSD6. We focused our efforts on functionally characterizing CSFAD2A and CSFAD3A.
Example 3 Characterisation of the Cannabis sativa microsomal desaturase CSFAD2A
Quantitative RT-PCR analysis confirmed the high level expression of CSFAD2A during embryo development, peaking at the FND stage where it was more than 1000 times higher than in young leaves (Fig. 2A). A similar pattern of expression but at much lower levels was observed for the CSFAD2B gene with the difference in expression between leaves and embryo much less pronounced, being about 20 times higher at the FND stage (Fig. 2A). To confirm the functional identity of CSFAD2A we cloned the corresponding ORF into the expression vector pESC-TRP containing the galactose-inducible GAL1 promoter and heterologously expressed this in the yeast Saccharomyces cerevisiae. This yeast has been used successfully for functional expression of several plant microsomal desaturases, since it
acts as a convenient host with a simple fatty acid profile due to the presence of only a Δ9- desaturase producing palmitoleate and oleate, and the appropriate redox chain in a suitable membrane (Reed et al., 2000). Fatty acid analysis of the transformed yeast cells revealed the presence of two new fatty acids that were not present in either the wild-type yeast or in the empty vector control (Fig. 2B; Table 4). GC analysis of the fatty acid methyl esters (FAMEs) demonstrated that the major novel peak is linoleic acid. As shown in Table 4, 72% of the endogenous oleic acid (18: 1 A9) appears to have served as substrate for CSFAD2A and been converted into linoleic acid (18:2Δ9,12) confirming CSFAD2A to have Δ12- desaturase activity. We trans-esterified the FAME fraction to 3-pyridylcarbinol esters and used GCMS to identify the second novel peak as 16:2Δ9,12) (Fig. 6). We therefore conclude that CSFAD2A can also use palmitoleic acid (16: 1A9) as substrate, with a conversion efficiency to 16:2Δ9'12 of 43% (Fig. 2B; Table 4). Feeding eicosenoic acid (20: 1Δ11) to the CSFv4D2<4-transformed yeast cultures resulted in 62% conversion to 20:2Δ ,14 demonstrating that the enzyme can accept 16 - 20C fatty acids and that the specificity is most accurately described as Δχ+3 (Schwartzbeck et al., 2001).
Example 3
Identification and characterization of three CSFAD2A desaturase mutants
To establish the in-vivo role of CSFAD2A we screened an ethyl methane sulphonate (EMS) mutagenized M2 out-crossed population of Finola using the TILLING method (Till et al., 2006). We identified an allelic series of mutations among which csfad2a-1 carries a stop codon at amino acid position 167. We performed two rounds of backcrossing of csfad2a-1 to Finola and obtained homozygous csfad2a-1 individuals (BC2Fi) by crossing heterozygous male and female BC2 siblings. csfad2a-1 homozygotes displayed a dramatic increase in oleic acid content to 77 molar % in seed oil (Fig. 2C; Table 2). In parallel, the levels of LA and ALA were strongly decreased compared to the fatty acid profile of the segregating wild type seed oil from the same population suggesting that this decrease was at the expense of the increase in oleic acid (Fig. 2C; Table 2). Two novel fatty acids appeared in csfad2a- 1 at 5 and 2 molar percent (Table 2). GC retention times indicated these to be 18:2 fatty acids and GCMS following derivatization to 3-pyridylcarbinol esters revealed these to be 18:2Δ6,9 and 18:2Δ9,15 respectively (Table 2). These may arise through the action of other desaturases on the high percentage oleic acid present in the developing embryos of csfad2a-1. Summarising, the dramatic fatty acid level changes observed in csfad2a-1 seed confirmed that the predicted truncated CSFAD2A protein is non-functional. Interestingly, no major changes in seed fatty acid profile were observed if the mutation was present in the
heterozygous state indicating that only one copy of this highly expressed CSFAD2A gene is sufficient to maintain the near wild type level of fatty acids in hemp seed.
We also identified a second allele, csfad2a-2, which carries two point mutations giving rise to a proline to serine transition at positions 218 and 375 of the predicted amino acid sequence of CSFAD2A. Homozygous csfad2a-2
seed accumulate nearly 70 molar percent of oleic acid, low level accumulation of 18:2
Δ6,9 and -| 8:2
Δ9,15 and decreased levels of LA and ALA compared to heterozygous and segregating wild type seeds from the same population (Fig. 2D; Table 2). This seed oil phenotype is very similar to that of csfad2a- 1 (Fig. 2C) and is consistent with one or both of the P to L transitions disrupting protein function. This is expected given the importance of proline amino acids in determining protein structure. Interestingly, the levels of oleic acid, linoleic acid and a-linolenic acid remained unchanged in leaf tissues of both csfad2a-1 and csfad2a-2 compared to wild type plants (Table 3) which is consistent with the gene expression data showing CSFAD2A to be largely seed specific (Fig. 2A). We also identified a third mutant allele, csfad2a-3, which carries a point mutation giving rise to a proline to serine transition at position 341 of the predicted amino acid sequence of CSFAD2A. We performed four rounds of backcrossing of csfad2a-3 to Finola and obtained a segregating BC4F1 population by crossing heterozygous male and female BC
4 siblings. Sibling crosses were then set up between homozygous csfad2a-1 BC
4Fi plants and homozygous CSFAD2a-1 BC
4Fi wild-type plants, respectively, and fatty acid profiling carried out on resulting seed. Compared to CSFAD2a-1 wild type seed homozygous csfad2a-1 seed displayed an increase in gamma linolenic content of up to 9 molar % in their oil (Fig. 4A; Table 2). Likewise, stearidonic acid levels increased to 2.6 molar %. In parallel, the levels of LA and ALA slightly decreased compared to the fatty acid profile of the segregating wild type seed oil from the same population suggesting that this decrease was at the expense of the increase in gamma linolenic and stearidonic acid (Fig. 4A; Table 2). The increase of gamma linolenic and stearidonic acid in the seed oil of homozygous csfad2a-3 plants is unexpected and points towards increased access of the CSFAD2 product, LA, to A6-desaturase activity.
Example 4 Characterisation of the Cannabis sativa microsomal desaturase CSFAD3A
Quantitative RT-PCR confirmed expression of CSFAD3A in both leaves and embryos and showed it to be induced during seed development peaking at the FND stage where it is about 14 times higher than levels in young leaves (Fig. 3A). Heterologous expression of CSFAD3A in S. cerevisiae followed by fatty acid feeding resulted in desaturation of linoleic
(18:2Δ9'12) to α-linolenic acid (18:3Δ9'12 5) and γ-linolenic acid (18:3Δ6 9'12) to stearidonic acid (18:4Δ6'9'12·15) at a conversion efficiency of 56% and 23%, respectively (Fig. 3B; Table 3). The yeast CSFAD3A transformants also exhibited low level activity with endogenous 16: 1A9 and 18: 1Δ9 resulting in what we identified as 16:2Δ9'15 and 18:2Δ9'15 respectively (Table 3, Fig. 6 and 7). CSFAD3A transformants did not show any activity on exogenously supplied 20: 1Δ11 after 28 hrs incubation. Together these results confirm that CSFAD3A acts as a Δ15- desaturase when expressed in S. cerevisiae.
Example 5
Identification of mutations in CSFAD3 confirms A15-desaturase activity We screened our EMS mutagenized hemp population and identified an allelic series of mutations in CSFAD3 including one that results in a stop codon being introduced at codon position 255 that was designated csfad3a-1. We performed three rounds of backcrossing to Finola and obtained homozygous csfad3a-1 (BC3F1) seeds by crossing BC3 siblings. Seed oil of the homozygous csfad3a-1 contained near zero and zero levels of ALA and SDA, respectively, an elevation of LA from 55 to 75 molar percent and no significant effect on GLA compared to the segregating wild type and heterozygotes in the M5 generation (Fig. 3C, Table 2). A similar seed oil phenotype was seen in BC2F! material (Table 2). These dramatic changes in the homozygous csfad3a- 1 seed oil profile confirmed that CSFAD3A acts as a A15-desaturase in-vivo as well as in a heterologous host. Interestingly, when the mutation is in the heterozygous state, an intermediate phenotype is displayed in the seed oil with just half the levels of ALA and SDA compared to wild type. A second mutant, csfad3a-2, carried a point mutation resulting in conversion of proline to leucine at amino acid position 190 and this resulted in a similar seed oil phenotype to csfad3a-1 (Fig. 3D, Table 2). In contrast with seed oil, production of ALA in the leaf tissue of both csfad3a- 1 and csfad3a-2 is decreased by only 6 and 7% respectively compared to wild type (Table 3). This suggests the expression of other genes encoding A15-desaturase enzymes in leaf tissue with CSFAD3B and CSFAD3C being obvious candidates.
Example 6 High Oleic Hemp Oil Product Performance
We selected csfad2a-1 for further analysis and extended the backcrossing to generate BC4 material and csfad2a-1 seed was bulked up by crossing homozygous mutant siblings. This material, that we now refer to as 'High Oleic Hemp' was grown in a single block field trial in
Yorkshire, UK during the 201 1 growing season. Overall plant growth habit, flowering time and seed yield per plant were similar to the Finola wild type. Seed was cold pressed giving a percentage oil of approximately 36% in the wild type and csfad2a-1 material (Fig. 5A). Fatty acid composition analysis confirmed the high oleic status of the cold pressed field grown csfad2a-1 material confirming it to be on a par with a commercial High Oleic rapeseed material (Fig. 5B). Rancimat determination of oxidation stability of the pressed oil is an industry standard methodology that allows shelf life to be determined by extrapolation of oxidation at elevated temperatures. We found that our High Oleic Hemp csfad2a-1 oil had an increased shelf life from 1.5 to 10 months at 20°C, 4.1 to 28.6 months at 4°C and 5.3 to 37.1 at 0°C (Fig. 5C). Shelf Life of High Oleic Rapeseed Oil is also longer than standard rapeseed oil (Fig. 5C) but shelf life of the High Oleic Hemp exceeds that calculated for high oleic rapeseed oil despite them having equivalent amounts of oleic and polyunsaturated fatty acids (Fig. 5B and 5C). Plant seeds contain antioxidants such as tocopherols, which are thought to play a role in preventing oxidation of polyunsaturated fatty acids. We measured levels of tocopherols in our High Oleic Hemp Oil and found these to be significantly higher than that present in Finola hemp oil (Fig. 5D) and also significantly higher than in both standard rapeseed oil and High Oleic Rapeseed Oil (Fig. 9). Consistent with the increased stability of the High Oleic Hemp and Rapeseed oils we found that they produced decreased levels of volatile aldehydes as determined by head space analysis (Fig. 10). Not surprisingly, the High Oleic Hemp TAG composition consisted mainly of triolein which was completely absent from Finola hemp oil (Fig. 11).
Example 7
Breeding and seed oil characterisation of a csfad2a-3 x csfad3a-1 double mutant.
We obtained csfad2a-3 and csfad3a- 1 double mutants by crossing heterozygous BC4 csfad2a-3 plants with heterozygous csfad3a-1 BC4 plants followed by sibling crosses between plants heterozygous for both alleles to yield a segregating BC4F2 population. From this segregating population wild type (CSFAD2a-3 and CSFAD3a-1), csfad2a-3 (and null for csfad3a-1), csfad3a- 1 (null for csfad2a-3) and csfad2a-3 x csfad3a-1 double homozygous plants were selected and for each class sibling crosses set up. The resulting BC4F3 seed were used for fatty acid profiling (Fig. 4B, Table 2). The GLA content of seed of homozygous csfad2a-3 x csfad3a-1 double mutant plants reached up to 1 1.7 molar % and is thus higher than that found in the seed of homozygous csfad2a-3 and the csfad3a- 1 single mutant plants originating from the same population (Fig. 4B, Table 2). The ALA content was markedly reduced (0.5 molar %) in the double mutant and identical to that found in homozygous csfad3a-1 single mutants. Although LA content increased in the double mutant
(67 molar %) compared to wild type plants originating from the same population it was lower than the LA content found in csfad3a-1 single mutants (74 molar %, Fig. 4B, Table 2). This suggests that a proportion of the LA accumulating as the result of the csfad3a- 1 allele is partitioned towards increased GLA synthesis in the homozygous presence of the csfad2a-3 high GLA allele. Taken together the fatty acid profiling reveal that combining the single mutant allele phenotypes of csfad3a- 1 and the csfad2a-3 leads to a further increase in GLA seed oil content.
Example 8
GLA content in seed oil from heterozygous and homozygous csfad2a-3 and wild type (WT)
In order to generate seeds of the desired genotype, the WT and homozygous csfad2a-3 plants were grown in parallel and then the following crossing strategy was undertaken:
To obtain csfad2a-3 homozygous seeds: male (M) and female (F) homozygous csfad2a-3 parental material was crossed (in 3 independent crosses); To obtain heterozygous seeds for the csfad2a-3 allele: male homozygous csfad2a-3 parental lines (M) were crossed with female WT (F) and alternatively male WT (M) parental lines were crossed with female homozygous csfad2a-3 parental material (F). WT seed were produced at the same time by crossing wild type male and female parents. Replicates of all above crosses were performed. At least five, randomly selected, mature seeds descending from each cross were weighed and sampled individually for fatty acid composition analysis. This was performed by standard Gas Chromatography analysis of fatty acid methyl esters.
The results demonstrate that levels of GLA are significantly higher in the csfad2a-3 mutant seed material compared to wild type. Levels of GLA in heterozygous csfad2a-3 seed material is intermediate between that of the homozygous mutant and the wild type suggesting that the mutant allele is semi-dominant, consistent with a biochemical modification of the protein that results in an increased flux of fatty acids into GLA when the mutant form of the protein is present in the cell.
References
Baud S, Boutin JP, Miquel M, Lepiniec L, Rochat C (2002) An integrated overview of seed development in Arabidopsis thaliana ecotype WS. Plant Physiology and Biochemistry 40: 151-160 Browse J, McCourt PJ, Somerville CR (1986) Fatty acid composition of leaf lipids determined after combined digestion and fatty acid methyl ester formation from fresh tissue Anal Biochem 152: 141-145
Callaway J (2004) Hempseed as a nutritional resource: an overview. Euphytica 140: 65-72
Callaway JC, Laakkonen TT (1996) Cultivation of Cannabis oil seed varieties in Finland. Journal of the International Hemp Association 3: 32-34
Christie WW, Brechany EY, Holman RT (1987) Mass spectra of the picolinyl esters of isomeric mono- and dienoic fatty acids. Lipids 22: 224 -228. de Gyves EM, Sparks CA, Sayanova O, Lazzeri P, Napier JA, Jones HD (2004) Genetic manipulation of γ-linolenic acid (GLA) synthesis in a commercial variety of evening primrose (Oenothera sp.). Plant Biotechnology Journal 2: 351-357
Dubois N, Barthomeuf C, Berge J-P (2006) Convenient preparation of picolinyl derivatives from fatty acid esters. Eur J Lipid Sci Technol 108: 28-32
Felsenstein J (1989) PHYLIP - Phylogeny inference package (Version 3.2). Cladistics 5: 164-166 Gietz RD, Woods RA (2002) Transformation of yeast by lithium acetate/single-stranded carrier DNA/polyethylene glycol method. In G Christine, RF Gerald, eds, Methods in Enzymology, Vol 350. Academic Press, pp 87-96
Graham IA, Besser K, Blumer S, Branigan CA, Czechowski T, Elias L, Guterman I, Harvey D, Isaac PG, Khan AM, Larson TR, Li Y, Pawson T, Penfield T, Rae AM, Rathbone DA, Reid S, Ross J, Smallwood MF, Segura V, Townsend T, Vyas D, Wnzer T, Bowles D (2010) The Genetic Map of Artemisia annua L. Identifies Loci Affecting Yield of the Antimalarial Drug Artemisinin. Science 327: 328-331
Hitz WD, Carlson TJ, Booth JRJ, Kinney AJ, Stecca KL, Yadav NS (1994) Cloning of a higher-plant plastid ω-6 fatty acid desaturase cDNA and its expression in a cyanobacterium. Plant Physiol 105: 635-641
Hong H, Datla N, Reed DW, Covello PS, MacKenzie SL, Qiu X (2002) High-level production of γ-linolenic acid in Brassica juncea using a Δ6 desaturase from Pythium irregulare. Plant Physiol 129: 354-362
Kinney AJ (1998) Production of specialised oils for industry. In: Plant Lipid Biosynthesis: Fundamentals and Agricultural Applications. Ed. J. L. Harwood, Cambridge University Press, Cambridge (UK), pp 273-286
Lee M, Lenman M, Banas A, Bafor M, Singh S, Schweizer M, Nilsson R, Liljenberg C, Dahlqvist A, Gummeson P-O, Sjodahl S, Green A, Stymne S (1998) Identification of non- heme diiron proteins that catalyze triple bond and epoxy group formation. Science 280: 915- 918
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25: 1754-1760
Li H-L (1974) An archaeological and historical account of cannabis in China. Econ Bot 28: 437-448 Liu J-W, Huang Y-S, DeMichele S, Bergana M, Bobik E, Hastilow C, Chuang L-T, Mukerji P, Knutzon D (2001) Evaluation of the seed oils from a canola plant genetically transformed to produce high level of γ-linolenic acid. In Huang Y-S, Ziboh A eds, γ-Linolenic Acid: Recent Advances in Biotechnology and Clinical Applications, AOCS Press, Champaign, IL, pp 61- 71 Ohlrogge J, Browse J (1995) Lipid biosynthesis. Plant Cell 7: 957-970
Ramakers C, Ruijter JM, Deprez RHL, Moorman AFM (2003) Assumption-free analysis of quantitative real-time polymerase chain reaction (PCR) data. Neurosci Lett 339: 62-66
Reddy AS, Thomas TL (1996) Expression of a cyanobacterial A6-desaturase gene results in γ-linolenic acid production in transgenic plants. Nat Biotech 14: 639-642 Reed DW, Schafer UA, Covello PS (2000) Characterization of the Brassica napus extraplastidial linoleate desaturase by expression in Saccharomyces cerevisiae. Plant Physiol 122: 715-720
Ruijter JM, Ramakers C, Hoogaars WMH, Karlen Y, Bakker O, van den Hoff MJB, Moorman AFM (2009) Amplification efficiency: linking baseline and bias in the analysis of quantitative PCR data. Nucleic Acids Res 37: e45
Salamov AA, Solovyev VV (2000) Ab initio Gene Finding in Drosophila Genomic DNA. Genome Res 10: 516-522
Sayanova O, Davies GM, Smith MA, Griffiths G, Stobart AK, Shewry PR, Napier JA (1999a) Accumulation of Δθ-unsaturated fatty acids in transgenic tobacco plants expressing a Δ6- desaturase from Borago officinalis. J Exp Bot 50: 1647-1652
Sayanova O, Shewry PR, Napier JA (1999b) Histidine-41 of the cytochrome b5 domain of the borage Δ6 fatty acid desaturase is essential for enzyme activity. Plant Physiol 121 : 641- 646
Schwartzbeck JL, Jung S, Abbott AG, Mosley E, Lewis S, Pries GL, Powell GL (2001) Endoplasmic oleoyl-PC desaturase references the second double bond. Phytochemistry 57: 643-652
Shanklin J, Cahoon EB (1998) Desaturation and related modifications of fatty acids 1. Annu Rev Plant Physiol Plant Mol Bioly 49: 61 1-641
Song LY, Lu WX, Hu J, Zhang Y, Yin WB, Chen YH, Hao ST, Wang BL, Wang RR, Hu ZM (2010) Identification and functional analysis of the genes encoding A6-desaturase from Ribes nigrum. Journal Exp Bot 61 : 1827-1838
Sperling P, Heinz E (2003) Plant sphingolipids: structural diversity, biosynthesis, first genes and functions. Biochimica et Biophysica Acta (BBA) - Molecular and Cell Biology of Lipids 1632: 1-15 Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22: 4673-4680
Till BJ, Burtner C, Comai L, Henikoff S (2004) Mismatch cleavage by single-strand specific nucleases. Nucleic Acids Res 32: 2632-2641 Till BJ, Zerr T, Comai L, Henikoff S (2006) A protocol for TILLING and Ecotilling in plants and animals. Nature protocols 1 : 2465-2477 van Bakel H, Stout JM, Cote AG, Tallon CM, Sharpe AG, Hughes TR, Page JE (2011) The draft genome and transcriptome of Cannabis sativa. Genome Biology 12: R102
Vrinten P, Hu Z, Munchinsky M-A, Rowland G, Qiu X (2005) Two FAD3 desaturase genes control the level of linolenic acid in flax seed. Plant Physiol 139: 79-87
Zhang P, Burton JW, Upchurch RG, Whittle E, Shanklin J, Dewey RE (2008) Mutations in a A9-Stearoyl-ACP-desaturase gene are associated with enhanced stearic acid levels in soybean seeds. Crop Sci. 48: 2305-2313
Zhang D, Pirtle IL, Park SJ, Nampaisansuk M, Neogi P, Wanjie SW, Pirtle RM, Chapman KD (2009) Identification and expression of a new delta-12 fatty acid desaturase (FAD2-4) gene in upland cotton and its functional expression in yeast and Arabidopsis thaliana plants. Plant Physiol Biochem- (Paris) 47: 462-471